BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004349
(759 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255550407|ref|XP_002516254.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus
communis]
gi|223544740|gb|EEF46256.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus
communis]
Length = 755
Score = 1044 bits (2700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/759 (65%), Positives = 620/759 (81%), Gaps = 4/759 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA++F +VEDGLKLSKRLY+GKDR V PP+ M+KS + LP +PM+YAVI+DP
Sbjct: 1 MAEEFGNSVEDGLKLSKRLYYGKDRAVAPPRPVVHMEKSA--EAYLPTAPMIYAVISDPS 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPD+PSYQP+VHGRCDPP+LIPLQMN +EL+ D +LD+A VRV+G+WRVHCVMGSKS
Sbjct: 59 IVDNPDMPSYQPHVHGRCDPPALIPLQMNRIELEADSFLDSAIVRVNGSWRVHCVMGSKS 118
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCRIA+PMG+QGSILGVE E++ KSY+T+LIA+ + K E G FL P IFTLT
Sbjct: 119 CDCRIAIPMGEQGSILGVEVEVARKSYYTELIAMDDKKDTEKECRPENGGFLTPQIFTLT 178
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+P++DGGS +SI++ W QKL Y DGE S+ VPF FPE+VTPA+KK+PK+EKI LNVN+GT
Sbjct: 179 IPKVDGGSSISIKVSWIQKLLYHDGELSLTVPFSFPEFVTPAVKKLPKKEKILLNVNSGT 238
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
GTEV+C T+SH LKQL+R+ GKLG+SYESEVL W++IDF SY+VS SHIFGGV+LQSPS
Sbjct: 239 GTEVVCKTTSHPLKQLKREAGKLGFSYESEVLTWTDIDFAVSYSVSSSHIFGGVMLQSPS 298
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
HDVDQR+MF +YL PG + KVF+K+++FIVDISGSM+GKPLE KNA++ AL+KL+P
Sbjct: 299 AHDVDQRDMFYLYLFPGDQPNMKVFRKEIVFIVDISGSMEGKPLEGMKNAMSGALAKLNP 358
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSFNI+AFNGETYLFS+ MELAT++ VERA +W+ +NFIAGG TNI PL +A+EM++N
Sbjct: 359 KDSFNIIAFNGETYLFSSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSN 418
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
++GS+P+IFLVTDGAVEDER ICD+MK + G+ICPRIYTFGIG+YCNHYFLRMLA +
Sbjct: 419 TQGSLPVIFLVTDGAVEDERHICDSMKKYVRGKGAICPRIYTFGIGTYCNHYFLRMLATV 478
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG Y AAYD+DS++ +M+ F+RG S+VLAN+ IDTL DLD+ E+YPS IPDLSSES L
Sbjct: 479 CRGQYDAAYDVDSVQARMEIFFSRGLSAVLANVMIDTLDDLDDVEVYPSNIPDLSSESLL 538
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
I+SGRY G FP +KA+G LG+LSNFVV+LK+Q KD+P D+I AKQQIDLLTAQAW+SE
Sbjct: 539 IISGRYHGNFPGIVKAEGVLGNLSNFVVDLKIQKTKDVPFDKIFAKQQIDLLTAQAWYSE 598
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
+K+LEEKV+KMS+QTGV EYTR+ ++E + N A ESP K S K+D K+ D + +
Sbjct: 599 NKQLEEKVAKMSIQTGVASEYTRLTLLEMERGNQAIESPRAHKFSNKTDSLKV-DYKGRR 657
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
I+LQN G+GFG+L ATA+N PPG E KLPE AE+ +KAASNCC + +CCC+CCI+C
Sbjct: 658 RILLQNFGVGFGNLAATADNIPPGVEELKLPEAAELIMKAASNCCGRVCGQCCCMCCIQC 717
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDGQ 759
C+++NDQC I LTQL ALACFGC ECCS LCC G DG
Sbjct: 718 CSRMNDQCAIALTQLFAALACFGCVECCS-LCCSGGDGH 755
>gi|224142781|ref|XP_002324729.1| predicted protein [Populus trichocarpa]
gi|222866163|gb|EEF03294.1| predicted protein [Populus trichocarpa]
Length = 751
Score = 1042 bits (2694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/757 (67%), Positives = 622/757 (82%), Gaps = 6/757 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA++F ++V+DGLKLSKR+Y GKDR VTPPK PS MDKS LP +PMVYAVI++P
Sbjct: 1 MAEEFGRSVQDGLKLSKRIYLGKDRAVTPPKPPSHMDKSPVA--YLPTAPMVYAVISNPA 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDIPSYQP+VHGRCDPP+LIPLQM +EL+ D YLDTAFV++SG+WRVHCVMGS+S
Sbjct: 59 IVDNPDIPSYQPHVHGRCDPPALIPLQMTRIELEADSYLDTAFVKISGSWRVHCVMGSES 118
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCR+AVPMG+QGSILGVE E S K Y+T+L+A+ + K +E G FLKP+ FT+T
Sbjct: 119 CDCRLAVPMGEQGSILGVEIEASRKLYYTELVAIEDRKDLEKEVRIENGGFLKPHTFTIT 178
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+P++DGGS LSI++RW QKL Y +GEFS+ VPF FPEYVTP +KK+P++EKI LNV GT
Sbjct: 179 IPKVDGGSTLSIKVRWMQKLLYHNGEFSLIVPFSFPEYVTPHVKKLPRKEKIQLNVTTGT 238
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
GTE++C TSSH LK LRR+VGKLG+SYESEVL W+NIDF FSY VS SHIFGGVLLQSPS
Sbjct: 239 GTEIVCKTSSHPLKGLRREVGKLGFSYESEVLTWTNIDFTFSYAVSFSHIFGGVLLQSPS 298
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
LHDVDQR+MFC YL PG SRKVF+K+++F+VDISGSM+G PLE TK AL+ AL+ LD
Sbjct: 299 LHDVDQRDMFCAYLFPGGHHSRKVFRKEIVFVVDISGSMEGAPLEGTKIALSAALTNLDS 358
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSFNI+AFNGETYLFS+SMELA+++ VERA +W+ +N IAGG TNI PL +A EML+
Sbjct: 359 KDSFNIIAFNGETYLFSSSMELASEDTVERAVEWMSMNLIAGGDTNILVPLKQATEMLSK 418
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S GSIP IFLVTDGAVEDER ICD MKS +T GGSI PRI TFGIGSYCNH+FLRMLAMI
Sbjct: 419 SGGSIPFIFLVTDGAVEDERHICDIMKSHITGGGSIHPRICTFGIGSYCNHHFLRMLAMI 478
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
SRG Y AAYD+DS+E +MQKL +R S+++ANI I DLDE E+YPSRIPDLSS++PL
Sbjct: 479 SRGQYDAAYDIDSVESRMQKLLSRISSTIIANITIKAFDDLDEVEVYPSRIPDLSSDNPL 538
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
IVSGR+QG FPDT+ A GF GDLSNF ++LK+Q AKDIPL + AKQQIDLLTAQAWFSE
Sbjct: 539 IVSGRFQGNFPDTVVATGFFGDLSNFSLDLKVQKAKDIPLHSVSAKQQIDLLTAQAWFSE 598
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
+K+LEEKV+K+S+QTGV+ EYT M ++ETD N A+ESPG G K + K+ DS+ +
Sbjct: 599 NKQLEEKVAKLSIQTGVISEYTCMSLLETDRGNQAAESPG---GHKVCANLKV-DSQGRR 654
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
I L+NLG+GFG+LTATAEN PG+E KLPE AEI +KAASNCCS + +CCC+CC++C
Sbjct: 655 RIFLRNLGVGFGNLTATAENLRPGAEESKLPEAAEIIIKAASNCCSIMCKQCCCMCCVQC 714
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
C K+N+Q IVLTQLCTA+ACFGC ECCS +CC GQ+
Sbjct: 715 CFKINNQFAIVLTQLCTAVACFGCIECCSEICCGGQE 751
>gi|225444686|ref|XP_002277641.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Vitis
vinifera]
gi|297738541|emb|CBI27786.3| unnamed protein product [Vitis vinifera]
Length = 756
Score = 1028 bits (2658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 503/759 (66%), Positives = 623/759 (82%), Gaps = 5/759 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MADDF K+VEDGLKL+KR+YFGKDR VTPPK P SMDKS LP +PMVYAV+ DP
Sbjct: 1 MADDFTKSVEDGLKLAKRIYFGKDRSVTPPK-PVSMDKSS--KSYLPTAPMVYAVVFDPA 57
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDIPSYQP+V+GRCDP +LIPLQMN + L+VDCYLDTAFV ++G+WRVHCVMGS+S
Sbjct: 58 IVDNPDIPSYQPHVYGRCDPAALIPLQMNGIALEVDCYLDTAFVTLTGSWRVHCVMGSRS 117
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCR+AVPMG+QGSILGVE ++ KSY TQLI + EN K++ G FLKP+IFTLT
Sbjct: 118 CDCRLAVPMGEQGSILGVEVDVPRKSYCTQLITMEENKDMEKASRAGDGGFLKPHIFTLT 177
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+P++DGGS LSI++RWSQKL Y DG+ ++ VPF FPEYVTPA KK+ K+EKI LNVN GT
Sbjct: 178 IPEVDGGSNLSIKVRWSQKLLYHDGQLTLIVPFHFPEYVTPAGKKVSKKEKIQLNVNCGT 237
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
G EVLC T+SH LK++RR VGKL +SYE+EVL WS+ DF FS++V SHI G VLLQSPS
Sbjct: 238 GMEVLCKTTSHPLKEIRRQVGKLSFSYEAEVLTWSSADFTFSFSVPSSHIHGSVLLQSPS 297
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+HDVDQRE+FC+YL PG ++ KVF+K+V+F+VDISGSM+GK LEDTKNAL+ ALSKLD
Sbjct: 298 VHDVDQREIFCLYLFPGNQQTGKVFRKEVVFVVDISGSMRGKLLEDTKNALSAALSKLDS 357
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSF+I+AFNGE ++FS+S++LATKEA+E A QWI +NFIAGG TNI P+ KA+E+ ++
Sbjct: 358 KDSFSIIAFNGEIFIFSSSVQLATKEAIENAIQWISMNFIAGGDTNILLPMNKAMELFSH 417
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S GSIPIIFL+TDG+VEDER ICD M S LTN SI PRIYTFGIG YCNHYFL+MLAMI
Sbjct: 418 SPGSIPIIFLITDGSVEDERHICDVMTSYLTNEESIHPRIYTFGIGLYCNHYFLKMLAMI 477
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG+Y AAYD +SIE+++++LFTR S+VLANI ID L+DLD+FE+YPS +PDLSSES
Sbjct: 478 GRGHYDAAYDANSIELRVERLFTRASSTVLANITIDDLEDLDDFEVYPSHMPDLSSESVW 537
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
VSGRY+G FPDT++A+G DL+NFV +LK+Q AK+IPLDR+ AKQQI LTAQAWFSE
Sbjct: 538 TVSGRYKGNFPDTIQARGIFADLNNFVTDLKVQKAKEIPLDRVLAKQQIGWLTAQAWFSE 597
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQK-ILDSEVP 659
+K+LEEK+++MS+QTGV+ EYTRMI++ET SE ++ K ++Q+ ++DS+V
Sbjct: 598 NKQLEEKIAEMSIQTGVISEYTRMILLETQGGAQVSEPGRVQEPPKTMEYQRPVVDSKVQ 657
Query: 660 KTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIE 719
K I+LQ+LG+GFG++ ATAEN+PPGSE KLPE AEIFVKAASNCC+ + CCC+CCI
Sbjct: 658 KIILLQSLGVGFGNVNATAENYPPGSEEVKLPEAAEIFVKAASNCCAKMCGYCCCMCCIR 717
Query: 720 CCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
C ++NDQC IVLTQLC+ALA GCF C+ LCC GQ+G
Sbjct: 718 MCTRMNDQCAIVLTQLCSALAILGCFS-CAELCCSGQEG 755
>gi|30686226|ref|NP_173345.2| inter-alpha-trypsin inhibitor heavy chain-like protein [Arabidopsis
thaliana]
gi|22531102|gb|AAM97055.1| unknown protein [Arabidopsis thaliana]
gi|23197960|gb|AAN15507.1| unknown protein [Arabidopsis thaliana]
gi|332191682|gb|AEE29803.1| inter-alpha-trypsin inhibitor heavy chain-like protein [Arabidopsis
thaliana]
Length = 754
Score = 988 bits (2553), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/758 (63%), Positives = 596/758 (78%), Gaps = 4/758 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA+DF +AV+DGLKL+KR+YFGKDR V P+ P+ MD+S LP +PMVYAVI DP
Sbjct: 1 MAEDFARAVDDGLKLAKRIYFGKDRAVAAPRPPAPMDRSSTTQPYLPTAPMVYAVIPDPG 60
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPD+PSYQP+VHGRCDPP+LIPLQMN++ELDVDCYLDTA V V+G+WRVHCVMGSK
Sbjct: 61 IVDNPDLPSYQPHVHGRCDPPALIPLQMNSIELDVDCYLDTALVTVTGSWRVHCVMGSKR 120
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCRIA+PMG+QGSILGVE EI KSY TQLI + + K+A ETG FLKPNIFTLT
Sbjct: 121 CDCRIAIPMGEQGSILGVEVEIPRKSYTTQLITAEDGNEFEKTALPETGGFLKPNIFTLT 180
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+PQ+DGG+ LSI++ WSQKL+Y G+F +++PF FPEYVTPA+KKI KREKI+L+VNAGT
Sbjct: 181 IPQVDGGTNLSIKMTWSQKLTYNQGQFFLDIPFNFPEYVTPAVKKISKREKIYLSVNAGT 240
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
GTEVLC SH LK+ R GKL ++YE++VLKWSN DF FSYT S S+I GG+ LQS
Sbjct: 241 GTEVLCKGCSHQLKEKLRSAGKLRFAYEADVLKWSNTDFSFSYTASSSNIVGGLFLQSAP 300
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+HDVDQR++F YL PG + K FK++V+F+VDIS SM GKPLED KNA++ ALSKLDP
Sbjct: 301 VHDVDQRDIFSFYLFPGKQQKTKAFKREVVFVVDISKSMTGKPLEDVKNAISTALSKLDP 360
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
GDSFNI+ F+ +T LFSTSME T +AVER +W+ NF+ TN+ PL KAVEML+N
Sbjct: 361 GDSFNIITFSNDTALFSTSMESVTSDAVERGIEWMNKNFVVADGTNMLPPLEKAVEMLSN 420
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+RGSIP+IF VTDG+VEDER ICD MK L + GS+ PRI+TFG+G +CNHYFL+MLA I
Sbjct: 421 TRGSIPMIFFVTDGSVEDERHICDVMKKHLASAGSVFPRIHTFGLGVFCNHYFLQMLANI 480
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
S G + + Y+ D IE +M KLFT+ S++L NIAI+ L+ LDE E+YPS IPDL+S SPL
Sbjct: 481 SCGQHESVYNTDHIEERMDKLFTKALSTILVNIAIEPLQSLDEVEVYPSNIPDLTSASPL 540
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
++ GRY+GKFP+ + AKG LGDLS+F +L +Q AKD+PLD++ AK IDLLTA+AWFSE
Sbjct: 541 MIYGRYRGKFPENVIAKGLLGDLSSFSTDLTVQSAKDMPLDKVFAKNVIDLLTAEAWFSE 600
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
DK+L+EK++K+S+QTGVL EYTRMI +E E SE+ G KK + + QK+ + +
Sbjct: 601 DKQLKEKIAKLSIQTGVLSEYTRMIQLENTEELKPSETGGKKKTTSNGEKQKM----ISR 656
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
TI LQ+LGIGFGD TAT EN PPG K P+ AE FVKAAS+CC +L NKCCC+CC++C
Sbjct: 657 TIPLQSLGIGFGDKTATRENVPPGFGEQKAPDAAEKFVKAASSCCVSLCNKCCCMCCVQC 716
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
C+KLNDQCV+V TQL TA+AC CFECCS +CC G DG
Sbjct: 717 CSKLNDQCVLVFTQLFTAIACIACFECCSTVCCSGDDG 754
>gi|225442075|ref|XP_002271630.1| PREDICTED: uncharacterized protein LOC100244700 [Vitis vinifera]
gi|297742983|emb|CBI35850.3| unnamed protein product [Vitis vinifera]
Length = 747
Score = 974 bits (2517), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/756 (60%), Positives = 583/756 (77%), Gaps = 10/756 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA+DF AVE GL+LSKR+Y+GKDR + PK P M+KS + + LP +PMVYAVI +P
Sbjct: 1 MAEDFSTAVEYGLRLSKRIYYGKDRSASAPK-PQEMEKSESQESFLPTAPMVYAVIPEPS 59
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPD+PSYQPYVHGRCDPP+LIPL M + ++VDCYLDTAF+ VSGTWRVHCVMGS+
Sbjct: 60 IVDNPDVPSYQPYVHGRCDPPALIPLHMLGIAMEVDCYLDTAFISVSGTWRVHCVMGSRR 119
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCR+A+PMG+QGS+LGVE +++G+S +++LI + GK A E G FL+ I+TL
Sbjct: 120 CDCRVAIPMGEQGSVLGVEVDVTGRSCNSRLITEEDTKDNGKPAKTEDGRFLRWRIYTLK 179
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+P+++GG+ LSI+ WSQKL Y+DG+FS+ +PF FP VTPA KKIPK+EKI LNVN+GT
Sbjct: 180 VPKVEGGTNLSIKFSWSQKLLYQDGQFSLIIPFSFPTCVTPAGKKIPKKEKIRLNVNSGT 239
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
GTE+LC +SH LK+LR VGKLG+ YE+EV WS DF+FSYTVS ++IFGG L+QSPS
Sbjct: 240 GTEILCKATSHLLKELRHQVGKLGFLYEAEVQTWSRADFNFSYTVSSNNIFGGALMQSPS 299
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+HD DQREMFC YL PG+ S KVF+K+V+FIVDIS SM+G+PLE+TK+A+ AL L+
Sbjct: 300 VHDFDQREMFCFYLFPGSNISGKVFRKEVVFIVDISRSMRGRPLENTKDAVLAALLNLNT 359
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSFNI+AFNG+T LFS++M LATKE V A QWI NF A G TNI PL++A+EMLT
Sbjct: 360 QDSFNIIAFNGDTCLFSSTMVLATKEEVGNATQWINTNFNAEGGTNILNPLSQAIEMLTK 419
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ GS+P+IFL+TDGAVEDER IC+ ++ L N ICPRI TFGIGSYCNHYFL+MLA I
Sbjct: 420 TSGSMPLIFLITDGAVEDERHICNVVEGHLRNRSPICPRICTFGIGSYCNHYFLQMLAQI 479
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RGYY AAYD DSI MQ+LFT+ S +LANI I+TL+ LD E++P IPDLSS SPL
Sbjct: 480 GRGYYDAAYDADSIAFHMQRLFTKASSVILANITIETLEHLDSLELFPFHIPDLSSGSPL 539
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
I+SGRY G+FPD+LKA G L D+SNF ++LK+Q AK+IPLDR+ A++QI+LLT QAW SE
Sbjct: 540 IISGRYHGEFPDSLKASGSLADMSNFTIDLKIQKAKEIPLDRVFARRQINLLTVQAWLSE 599
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
K+LEE++++MS+QTG EYT MI++ TD+ ASES + +K++DS K
Sbjct: 600 SKQLEERIARMSMQTGFPSEYTCMILLVTDKGKQASESVMLQ--------EKMVDSIGQK 651
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
I+L LGIGFGDLTAT+EN P G E PK PE ++ KAA+NCC+ + ++ CC+C I+
Sbjct: 652 IILLGTLGIGFGDLTATSENLPSGVEEPKPPEGTDVIFKAATNCCAMVADRICCMCFIQA 711
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCC-CG 755
C+KLN+QC I TQLC+AL CFGC +CC LC CG
Sbjct: 712 CSKLNNQCAIAFTQLCSALTCFGCMDCCCELCVSCG 747
>gi|297844874|ref|XP_002890318.1| hypothetical protein ARALYDRAFT_472131 [Arabidopsis lyrata subsp.
lyrata]
gi|297336160|gb|EFH66577.1| hypothetical protein ARALYDRAFT_472131 [Arabidopsis lyrata subsp.
lyrata]
Length = 755
Score = 954 bits (2466), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/759 (61%), Positives = 587/759 (77%), Gaps = 5/759 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA+DF +AV+DGLKL+KR+YFGKDR V P+ P+ M++S LP +PMVYAVI DP
Sbjct: 1 MAEDFARAVDDGLKLAKRIYFGKDRAVAAPRPPAPMERSSTTQSYLPTAPMVYAVIPDPG 60
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPD+PSYQP+VHGRCDPP+LIPLQMN++ELDVDCYLDTA V V+G+WRVHCVMGSK
Sbjct: 61 IVDNPDLPSYQPHVHGRCDPPALIPLQMNSIELDVDCYLDTALVTVTGSWRVHCVMGSKR 120
Query: 121 CDCRIAVPMGDQG-SILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTL 179
CDCRIA+PMG+Q SILGVE EI KSY TQLI + + K+A ETG FLKPNIF L
Sbjct: 121 CDCRIAIPMGEQVCSILGVEVEIPRKSYTTQLITAEDGNEFEKTALPETGGFLKPNIFIL 180
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAG 239
T+PQ+DGG+ LSI++ WSQKL+Y +G+F +++PF FPEYVTPA+KKI KREKI+L+VNAG
Sbjct: 181 TIPQVDGGTNLSIKMTWSQKLTYNEGQFFLDIPFNFPEYVTPAVKKISKREKIYLSVNAG 240
Query: 240 TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSP 299
TGTEVLC SH LK+ R GKL + YE++VLKWSN DF FSYT S S+I GG+ LQS
Sbjct: 241 TGTEVLCKGCSHQLKEKLRSAGKLRFVYEADVLKWSNTDFSFSYTASSSNIVGGLFLQSA 300
Query: 300 SLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
+HD DQR++F YL PG + K FK++V+F+VDIS SM GKPLED NA++ ALSKL
Sbjct: 301 PVHDDDQRDVFSFYLFPGKQQKTKAFKREVVFVVDISKSMTGKPLEDVNNAISTALSKLS 360
Query: 360 PGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
PGDSFNI+ F+ +T LFSTSME T +AVER +W+ NF+ TN+ PL KAVEML+
Sbjct: 361 PGDSFNIITFSDDTSLFSTSMESVTSDAVERGIEWMNKNFVVADGTNMLLPLEKAVEMLS 420
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
N+RGSIP+IF VTDG+VEDER ICD MK L + GS+CPRI+TFG+G +CNHYFL++LA
Sbjct: 421 NTRGSIPMIFFVTDGSVEDERHICDVMKKHLASAGSVCPRIHTFGLGIFCNHYFLQILAN 480
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+S G + + Y+ D IE +M KLFT+ S++L NI I+ L+ LDE E+YPS IPDL+S SP
Sbjct: 481 LSCGQHESVYNTDHIEERMDKLFTKALSTILVNITIEPLQSLDEVEVYPSNIPDLTSASP 540
Query: 540 LIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFS 599
L++ GRY+GKFP+ + AKG LGDLS+F +L ++ AKD+PLD++ AK IDLLTA+ W S
Sbjct: 541 LMIYGRYRGKFPENVIAKGLLGDLSSFSTDLTVRSAKDMPLDKVFAKNVIDLLTAEVWLS 600
Query: 600 EDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVP 659
EDK+L+EK++K+S+QTGV EYTRMI +E E SE+ KK + + QK+ +
Sbjct: 601 EDKQLKEKIAKLSIQTGVPSEYTRMIQLENTEELKPSETGVKKKTTSNGEKQKM----IS 656
Query: 660 KTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIE 719
+TI LQ+LGIGFGD TAT EN PPG K P+ AE FVKAAS+CC +L NKCCC+CC++
Sbjct: 657 RTIPLQSLGIGFGDKTATRENVPPGFGEQKAPDAAEKFVKAASSCCVSLCNKCCCMCCVQ 716
Query: 720 CCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
CC+KLNDQCV+V TQL TA+AC CFECCS +CC G DG
Sbjct: 717 CCSKLNDQCVLVFTQLFTAIACIACFECCSTVCCAGDDG 755
>gi|356565764|ref|XP_003551107.1| PREDICTED: uncharacterized protein LOC100777542 [Glycine max]
Length = 754
Score = 952 bits (2460), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/758 (64%), Positives = 597/758 (78%), Gaps = 4/758 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA DF KAV+DGLKLSKR+YFGKDR V PPK P M +S LP +PMVYAVI+DP
Sbjct: 1 MAQDFSKAVDDGLKLSKRIYFGKDRAVAPPKPPPPMTRSATA--FLPSAPMVYAVISDPG 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDIPSYQP+V+GRCDPP+LIPLQMNA++++ DCY +TAF+ VSGTWR+HCVMGS+S
Sbjct: 59 IVDNPDIPSYQPHVYGRCDPPALIPLQMNAIQMEADCYHETAFITVSGTWRLHCVMGSRS 118
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCRIAVP+ QGSILGVE +S KSY TQL+ + +++G ++ + G FL P+IFTLT
Sbjct: 119 CDCRIAVPVSHQGSILGVEVSVSRKSYSTQLVVMEDDNGNQNASPPQNGGFLIPDIFTLT 178
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+PQIDGGS LSI++RWSQK+ + G+FS+NVPF FP++V PA KKI KREKI +NV+A T
Sbjct: 179 IPQIDGGSNLSIKVRWSQKIVFSKGQFSLNVPFTFPDFVNPAGKKISKREKIQINVDAVT 238
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
G E+LC T SH LK++RR VG +G+ Y+S+VL WS +DF FSY VS SHI GGVLL+S S
Sbjct: 239 GGELLCKTLSHPLKEVRRHVGSMGFLYDSDVLSWSKVDFSFSYAVSSSHITGGVLLESAS 298
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+HD DQREMF MYL G +S KVFKKD+IFI+DISGSM+GK +EDTKNAL ALSKL+
Sbjct: 299 VHDFDQREMFYMYLSTGDIQSDKVFKKDIIFIIDISGSMRGKLIEDTKNALLTALSKLNQ 358
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSFNI+AFNGETYLFS +MELA+ +AVERA +WI NF+AGG TNI PL A+EML+N
Sbjct: 359 ADSFNIIAFNGETYLFSKTMELASGDAVERATEWINTNFVAGGGTNISHPLNTAIEMLSN 418
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ S+PIIFLVTDG VEDERQIC +K+R+ NG SICPRIYTFGIGS+CNHYFLRMLAMI
Sbjct: 419 IQSSVPIIFLVTDGTVEDERQICAMVKNRMINGESICPRIYTFGIGSFCNHYFLRMLAMI 478
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG Y AA D+D IE +M LF + S +LANI +DTL DLD+ E+YP IPDLSSE PL
Sbjct: 479 GRGQYDAALDVDLIEPRMLTLFGKASSLILANIKMDTLDDLDDLEVYPPHIPDLSSEGPL 538
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
I+SGRY+G FP TLK +G L D SNFVV++K+Q AKDIP+ +I A+ QI+ LTAQAW E
Sbjct: 539 ILSGRYRGNFPKTLKIEGILADFSNFVVDMKIQNAKDIPVQKISARDQIEHLTAQAWLME 598
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
+K+LE+KV+K+S+QTG + EYTRMII+ETD ES GTK+ SKKS Q + +
Sbjct: 599 NKQLEQKVAKLSLQTGFMSEYTRMIILETDHLKKVKESAGTKEASKKSHPQYEAPVQGQR 658
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
I+L +LGIGFG+LTATAEN PPG E K PEV EIF KAA+NCC TL + CCC+CCI+C
Sbjct: 659 MILLPHLGIGFGNLTATAENTPPGFE-SKFPEVPEIF-KAATNCCETLCSYCCCMCCIQC 716
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
C ++N+QC LTQLC L CFGC CCS++CC G +G
Sbjct: 717 CTRINNQCATALTQLCIGLGCFGCITCCSDICCSGNEG 754
>gi|8778285|gb|AAF79294.1|AC068602_17 F14D16.26 [Arabidopsis thaliana]
Length = 736
Score = 942 bits (2434), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/758 (61%), Positives = 579/758 (76%), Gaps = 22/758 (2%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA+DF +AV+DGLKL+KR+YFGKDR V P+ P+ MD+S LP +PMVYAVI DP
Sbjct: 1 MAEDFARAVDDGLKLAKRIYFGKDRAVAAPRPPAPMDRSSTTQPYLPTAPMVYAVIPDPG 60
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPD+PSYQP+VHGRCDPP+LIPLQMN++ELDVDCYLDTA V V+G+WRVHCVMGSK
Sbjct: 61 IVDNPDLPSYQPHVHGRCDPPALIPLQMNSIELDVDCYLDTALVTVTGSWRVHCVMGSKR 120
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCRIA+PMG+QGSILGVE EI KSY TQLI + + K+A ETG+
Sbjct: 121 CDCRIAIPMGEQGSILGVEVEIPRKSYTTQLITAEDGNEFEKTALPETGT---------- 170
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
LSI++ WSQKL+Y G+F +++PF FPEYVTPA+KKI KREKI+L+VNAGT
Sbjct: 171 --------NLSIKMTWSQKLTYNQGQFFLDIPFNFPEYVTPAVKKISKREKIYLSVNAGT 222
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
GTEVLC SH LK+ R GKL ++YE++VLKWSN DF FSYT S S+I GG+ LQS
Sbjct: 223 GTEVLCKGCSHQLKEKLRSAGKLRFAYEADVLKWSNTDFSFSYTASSSNIVGGLFLQSAP 282
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+HDVDQR++F YL PG + K FK++V+F+VDIS SM GKPLED KNA++ ALSKLDP
Sbjct: 283 VHDVDQRDIFSFYLFPGKQQKTKAFKREVVFVVDISKSMTGKPLEDVKNAISTALSKLDP 342
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
GDSFNI+ F+ +T LFSTSME T +AVER +W+ NF+ TN+ PL KAVEML+N
Sbjct: 343 GDSFNIITFSNDTALFSTSMESVTSDAVERGIEWMNKNFVVADGTNMLPPLEKAVEMLSN 402
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+RGSIP+IF VTDG+VEDER ICD MK L + GS+ PRI+TFG+G +CNHYFL+MLA I
Sbjct: 403 TRGSIPMIFFVTDGSVEDERHICDVMKKHLASAGSVFPRIHTFGLGVFCNHYFLQMLANI 462
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
S G + + Y+ D IE +M KLFT+ S++L NIAI+ L+ LDE E+YPS IPDL+S SPL
Sbjct: 463 SCGQHESVYNTDHIEERMDKLFTKALSTILVNIAIEPLQSLDEVEVYPSNIPDLTSASPL 522
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
++ GRY+GKFP+ + AKG LGDLS+F +L +Q AKD+PLD++ AK IDLLTA+AWFSE
Sbjct: 523 MIYGRYRGKFPENVIAKGLLGDLSSFSTDLTVQSAKDMPLDKVFAKNVIDLLTAEAWFSE 582
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
DK+L+EK++K+S+QTGVL EYTRMI +E E SE+ G KK + + QK+ + +
Sbjct: 583 DKQLKEKIAKLSIQTGVLSEYTRMIQLENTEELKPSETGGKKKTTSNGEKQKM----ISR 638
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
TI LQ+LGIGFGD TAT EN PPG K P+ AE FVKAAS+CC +L NKCCC+CC++C
Sbjct: 639 TIPLQSLGIGFGDKTATRENVPPGFGEQKAPDAAEKFVKAASSCCVSLCNKCCCMCCVQC 698
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
C+KLNDQCV+V TQL TA+AC CFECCS +CC G DG
Sbjct: 699 CSKLNDQCVLVFTQLFTAIACIACFECCSTVCCSGDDG 736
>gi|357450103|ref|XP_003595328.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Medicago truncatula]
gi|355484376|gb|AES65579.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Medicago truncatula]
Length = 751
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 452/759 (59%), Positives = 582/759 (76%), Gaps = 10/759 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA++F K+VE+G+KLSKR+YFG DR VT PK P +M K+ LP SPMVYAVI DPK
Sbjct: 1 MAEEFSKSVEEGIKLSKRIYFGNDRAVTAPKPPPTMSKTNTA--FLPTSPMVYAVIHDPK 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQM-NAVELDVDCYLDTAFVRVSGTWRVHCVMGSK 119
IVDNPD+PSYQP+VHGRCDPP+LIPLQM N V+L VD +LDTAFV VSG WR+HCV GS+
Sbjct: 59 IVDNPDVPSYQPHVHGRCDPPALIPLQMINGVDLRVDAWLDTAFVEVSGCWRLHCVSGSR 118
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSAS-VETGSFLKPNIFT 178
SCDC + VPM QGSILGVE + KSY TQL+ + + G +A+ G FLK NIFT
Sbjct: 119 SCDCIVVVPMSSQGSILGVEVSVHRKSYSTQLVDMEDKTGGKDNATKTHDGGFLKSNIFT 178
Query: 179 LTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNA 238
LT+PQIDGGS LSI+L WSQK++ + S+NVPF FP++V PA K++ K+EKI +NVNA
Sbjct: 179 LTIPQIDGGSNLSIKLSWSQKITCCNDVSSLNVPFTFPDFVNPAGKRMGKKEKIQINVNA 238
Query: 239 GTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQS 298
TG++V C T+SH +K++RR G +G+S+++++L WS DF FSY+VS S I GGV+L+S
Sbjct: 239 VTGSDVKCKTASHPMKEVRRSAGSIGFSHDTDLLSWSKCDFSFSYSVSSSQINGGVILES 298
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
S+ D+DQREMFCMYL PG +SRKVF+KD++F++DISGSM+GK ++DTKNAL+ ALSKL
Sbjct: 299 ASVDDIDQREMFCMYLSPGNLQSRKVFRKDIVFVIDISGSMRGKLIDDTKNALSAALSKL 358
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML 418
DP DSF+I+AFNGE Y FSTSMELA+K+AVERA +WIGINF+AGG TNI PL A+EML
Sbjct: 359 DPDDSFSIIAFNGEIYQFSTSMELASKDAVERAVEWIGINFVAGGDTNILHPLNTALEML 418
Query: 419 TNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
++ R S+PIIFLVTDG VEDER+ CD + ++ G SI PRIYT GIGS+CN YFL+MLA
Sbjct: 419 SDGRRSVPIIFLVTDGTVEDERETCDMINKKI-RGESIFPRIYTLGIGSFCNQYFLKMLA 477
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
MISRG + AA D+D +E+QM KLF + S VLANI +D DLDEFE+YPS+IPDLSS
Sbjct: 478 MISRGQHVAALDVDLVELQMLKLFDKAASVVLANITMDVFDDLDEFEVYPSQIPDLSSNG 537
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF 598
PLI+ GR++G FP+ +K KG L D SNFV++LK+Q AKD+PL+R+ A++QI+ LTAQAW
Sbjct: 538 PLILPGRFRGNFPEVVKVKGVLPDFSNFVIDLKIQNAKDMPLERVFAREQIEYLTAQAWL 597
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEV 658
+ +K+LE K++K+S+QTG L EYTRM ++E D A +S GTK KK + +
Sbjct: 598 TNNKQLEHKIAKLSLQTGFLSEYTRMSVLEVDHLKTAKQSDGTKASKKKGEKD-----QG 652
Query: 659 PKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCI 718
+T++L LGIGFG+L ATAEN PPG + P L + AE+FVKAA++CC + N CCC+CCI
Sbjct: 653 QRTLLLPKLGIGFGNLAATAENTPPGGDEPSLNDSAEMFVKAATHCCGSCLNHCCCMCCI 712
Query: 719 ECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
+ C K+N+QC V +QLC L C+ C CCS +CC G +
Sbjct: 713 QVCTKMNNQCATVFSQLCIGLGCYSCLTCCSEMCCTGDE 751
>gi|356547946|ref|XP_003542365.1| PREDICTED: uncharacterized protein LOC100800834 [Glycine max]
Length = 754
Score = 912 bits (2356), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/758 (63%), Positives = 588/758 (77%), Gaps = 4/758 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA DF KAV+DGLKLSKR+YF KDR V+PPK P M + LP +PMVYAVI+DP
Sbjct: 1 MAVDFSKAVDDGLKLSKRIYFVKDRAVSPPKPPPPMARWATA--FLPSAPMVYAVISDPG 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDI SYQP+V+GRCDPP+LIPLQMNA+E++ DCY +TAFV VSGTWR+HCVMGS+S
Sbjct: 59 IVDNPDISSYQPHVYGRCDPPALIPLQMNAIEMEADCYHETAFVTVSGTWRLHCVMGSRS 118
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCRIAVP+ QGSILGVE +S KSY TQL+ +G+ +G SA + G FL PNIFTLT
Sbjct: 119 CDCRIAVPVSHQGSILGVEVSVSRKSYSTQLVVMGDENGNQNSAPPQNGGFLIPNIFTLT 178
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+PQ+DGGS LSI++RWSQK+ Y G+FS+NVPF FP++V PA K+I KREKI +NV+A T
Sbjct: 179 IPQVDGGSNLSIKVRWSQKIVYSKGQFSLNVPFTFPDFVNPAGKRISKREKIQINVDAVT 238
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
G E+LC T SH LK++RR G +G+ Y+S+VL WS +DF FSY VS SHI GGVLL+S S
Sbjct: 239 GVELLCKTISHPLKEVRRHAGSMGFLYDSDVLSWSKVDFSFSYAVSSSHINGGVLLESAS 298
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+ D D+REMF MYL PG +S KVFKKD+IFI+DISGSM+GK +EDTKNAL ALSKL+
Sbjct: 299 VQDFDEREMFYMYLSPGDIQSHKVFKKDIIFIIDISGSMRGKLIEDTKNALLTALSKLNH 358
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSFNI+AFNGETYLFS +M+LA+ +AVERA +WI NFIAG TNI PL A+EML+N
Sbjct: 359 DDSFNILAFNGETYLFSKAMDLASGDAVERATEWINTNFIAGSGTNISHPLNTAIEMLSN 418
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ S+PI+FLVTDG VEDERQIC +K+R+ NG SICPRIYTFGIGS+CNHYFLRMLAMI
Sbjct: 419 IQSSVPIVFLVTDGTVEDERQICAMVKNRMINGESICPRIYTFGIGSFCNHYFLRMLAMI 478
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG Y AA D+D IE +M LF + S +LANI +DTL DLD+ E+YP IPDLSSE PL
Sbjct: 479 GRGQYDAALDVDLIEPRMLTLFDKASSLILANIKMDTLDDLDDLEVYPPHIPDLSSEGPL 538
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
I+SGRY+G FP TLK KG L D SNFVV++K+Q AKDIP+ +I A+ QI+ LTAQAW E
Sbjct: 539 ILSGRYRGNFPKTLKVKGILADFSNFVVDMKIQNAKDIPVQKISARDQIEHLTAQAWLME 598
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
+K+LE+KV+K+S+QTG EYTRM+I ETD ES G K+ SKKS+ + +
Sbjct: 599 NKQLEQKVAKLSLQTGFTSEYTRMMIHETDHLKKVKESSGPKEASKKSNPLFEAPVQGQR 658
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
I+L +LGIGFG+LTATAEN PPG E KLPEV EIF KAA+NC TL + CCC+CCI+C
Sbjct: 659 MILLPHLGIGFGNLTATAENTPPGFE-SKLPEVPEIF-KAATNCFETLCSYCCCMCCIQC 716
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
C ++N QC L QLC L CFGC CCS++CC G +G
Sbjct: 717 CTRINSQCATALAQLCIGLGCFGCISCCSDICCSGNEG 754
>gi|255583954|ref|XP_002532724.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus
communis]
gi|223527532|gb|EEF29655.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus
communis]
Length = 752
Score = 894 bits (2311), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/755 (55%), Positives = 551/755 (72%), Gaps = 3/755 (0%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA++F V+ GL+LSKR+Y+GK+ PP M K+ + + LP + MVYAV+ +P+
Sbjct: 1 MAEEFGTCVDYGLRLSKRIYYGKE---LPPAPEPEMSKTVSWESYLPTAVMVYAVVPEPE 57
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
VDNPD+PSYQPYVHGRC PP+LIPLQM+ V +DVDC LD A V +G WRVHCV S+
Sbjct: 58 AVDNPDVPSYQPYVHGRCQPPALIPLQMHGVAMDVDCCLDYANVSFNGKWRVHCVKTSRK 117
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
CDCRIAVPMG+QGSILG+E +I+G SYH+QLI E GK++ G +LK +I+T
Sbjct: 118 CDCRIAVPMGEQGSILGLEVDITGGSYHSQLIIAKETIDKGKTSKGGDGRYLKGSIYTFK 177
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+PQ+ GG+ +S+++ WSQKL+Y +G+F +N+PF FP +V P KKI KREKI LNVN+G
Sbjct: 178 IPQVVGGTTISVKVTWSQKLTYNEGQFCLNIPFSFPAFVNPVAKKITKREKILLNVNSGV 237
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
E+L +SH LK+LRR+VGK+ + YE EV WS D +FSYTV +FG V LQSP
Sbjct: 238 SKEILFRCTSHALKELRREVGKMAFLYEEEVTTWSIADLNFSYTVVSEDLFGEVFLQSPL 297
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
L D+D+R+MFC YL PG +SRK F+KDVIFI+DISGSM+G PLE+ KNAL +LSKL+
Sbjct: 298 LRDIDERQMFCFYLFPGNKQSRKAFRKDVIFIIDISGSMKGGPLENAKNALMSSLSKLNS 357
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
DSFNI+AFN ETYLFS+ ME ATKEA+ +A W+ N AGG TNI PL +A+++L
Sbjct: 358 EDSFNIIAFNDETYLFSSLMEPATKEALSKASLWLNDNLTAGGGTNIMVPLKQAMKLLAQ 417
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ SIP+IFL+TDGAV+DER+IC+ +K LT+GG I PRI +FGIG+YCNHYFL+MLA I
Sbjct: 418 TTDSIPLIFLITDGAVQDEREICNFVKGSLTSGGPISPRICSFGIGAYCNHYFLQMLAQI 477
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RGY+ +AYD DS++ +MQ+LFT S +LAN+ +D L+ LD E+ P RIPDLS +PL
Sbjct: 478 GRGYFDSAYDADSVDFRMQRLFTTASSVILANVTVDALEHLDSLELLPFRIPDLSCGTPL 537
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE 600
+VSGRY GKFPD++K G L D+SNF +ELK Q AKD+ LD++ A++QID+LTA AW SE
Sbjct: 538 VVSGRYNGKFPDSVKISGILADMSNFTIELKTQKAKDVQLDKVIARRQIDVLTANAWMSE 597
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
K LE+KV+KMS+QT V EYT MI+ T + E+ ++ K + K + SE K
Sbjct: 598 SKDLEQKVAKMSIQTRVPSEYTHMILHRTYTGDKEPETILMQEVLNKINPLKQVKSERDK 657
Query: 661 TIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIEC 720
T+ML NLG+GFG+L ATA N PPG++ K P+ E VKAAS CCS L ++ CC+C I+
Sbjct: 658 TVMLGNLGVGFGNLKATAANIPPGTDAIKSPDATEKIVKAASGCCSRLLDRFCCMCFIQT 717
Query: 721 CNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCG 755
C+ +NDQC IVL+QLC ALACF C CC +C CG
Sbjct: 718 CSSMNDQCAIVLSQLCAALACFECVNCCFEICECG 752
>gi|449452582|ref|XP_004144038.1| PREDICTED: uncharacterized protein LOC101210279 [Cucumis sativus]
gi|449526722|ref|XP_004170362.1| PREDICTED: uncharacterized protein LOC101231061 [Cucumis sativus]
Length = 755
Score = 889 bits (2297), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/757 (59%), Positives = 564/757 (74%), Gaps = 12/757 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA+DF KAV+DGL+LSKRLYFGKDR V PP+ +MD+ LP +PMVYAVI DP
Sbjct: 1 MAEDFAKAVDDGLRLSKRLYFGKDRAVAPPRSFPTMDRMD--HSFLPTAPMVYAVIHDPG 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDIPSYQP+VHGRCDPP+LIPLQMNAVEL DCYLD A +R++G+WRVHCVMGS+S
Sbjct: 59 IVDNPDIPSYQPHVHGRCDPPALIPLQMNAVELQADCYLDKAIIRITGSWRVHCVMGSRS 118
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGE-NDGAGKSASVETGSFLKPNIFTL 179
CDCRIA+PMG+QGS+LG E + KSY T LIAL + + A + G FL PNIFTL
Sbjct: 119 CDCRIAIPMGEQGSVLGCEVDDPRKSYRTSLIALEDKSKNASEKPERVDGGFLTPNIFTL 178
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRD-GEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNA 238
T+PQ+DGG+ LSI + WSQKL + G+ ++VPF FP YV PA KK+ K+EKI LN+N
Sbjct: 179 TIPQVDGGTTLSITMTWSQKLLFNSSGDLCLDVPFTFPHYVIPAGKKMSKKEKIVLNINV 238
Query: 239 GTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQS 298
G+ EV C T+SH LK+ R GKL + YESEVL WS + FSY++S S I GG+LLQS
Sbjct: 239 GSAVEVSCKTTSHPLKESMRKPGKLSFVYESEVLVWSKSNLSFSYSISSSQISGGILLQS 298
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
P + D DQREMFCMYL PG K KVF+K ++F+VD+SGSMQGK L+D KN L+ ALSKL
Sbjct: 299 PPVDDADQREMFCMYLYPGKDKG-KVFRKKIVFVVDVSGSMQGKALDDVKNVLSTALSKL 357
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML 418
P D FNI+AFN +T FS SME+AT +AVERA QWI +NF+A G T+I PLTKA EML
Sbjct: 358 PPEDMFNIIAFNEDTRQFSESMEMATMDAVERALQWIKMNFVARGGTDILLPLTKATEML 417
Query: 419 TN--SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
+ + S+PIIFLVTDGAV +ER ICD M+ T SI PRIYTFGIG++CNHYFLRM
Sbjct: 418 NDGGNGDSVPIIFLVTDGAVHNERHICDVMQKNRTKKQSIHPRIYTFGIGTFCNHYFLRM 477
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
LAMI RG Y AAYDLD +E Q+Q L+ R S++ NIA+DT DLDE E+YPS IPDLSS
Sbjct: 478 LAMIGRGQYDAAYDLDLVEPQLQNLYKRAASTIFVNIAVDTFDDLDEVEVYPSSIPDLSS 537
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQA 596
ESP+ VSGRY+GKFP+ +KA+G L +L N V++L +Q AKDIP+D++ AK QI+ LTA+A
Sbjct: 538 ESPMTVSGRYRGKFPEVVKARGLLANLDNIVLDLNVQEAKDIPIDKLFAKDQIEQLTAEA 597
Query: 597 WFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDS 656
W SE+K+L E V KMS + GVL EYT+MII + ++ S K KK+ ++K++
Sbjct: 598 WCSENKQLVEMVKKMSTKMGVLSEYTQMIIFQNVDKVIES----IKVQQKKNAYEKMVAP 653
Query: 657 EVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLC 716
+ K ++L G+GFG+L AT++N P G+ K PE AEIF KAASNCC L + CCC C
Sbjct: 654 KGDKMLLLPLFGVGFGNLEATSDNTPLGNGERK-PEAAEIFAKAASNCCGKLCSFCCCPC 712
Query: 717 CIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCC 753
CIE C+++N+QC I+LTQLCTAL CFGCF+CC +CC
Sbjct: 713 CIEACSRMNNQCAILLTQLCTALTCFGCFDCCLEMCC 749
>gi|356532788|ref|XP_003534952.1| PREDICTED: uncharacterized protein LOC100820133 [Glycine max]
Length = 747
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/759 (58%), Positives = 566/759 (74%), Gaps = 15/759 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA++F K+VE+GL+LSKR+YFG DR V+PPK P SM KS LP +PM+YAVI DP
Sbjct: 1 MAEEFSKSVEEGLRLSKRIYFGNDRAVSPPKPPPSMSKSHTA--CLPTAPMLYAVIHDPA 58
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQM-NAVELDVDCYLD-TAFVRVSGTWRVHCVMGS 118
IVDNPD+PSYQPYVHG+CDPP+LIPL M V L CY++ AFV VSG+WR+HCV GS
Sbjct: 59 IVDNPDVPSYQPYVHGQCDPPALIPLHMIGNVHLQAHCYVNAVAFVTVSGSWRLHCVSGS 118
Query: 119 KSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFT 178
++CDC +A+P+ QGSILGVE + KSY TQL+ + +N+G + G F+ N+FT
Sbjct: 119 RTCDCIVALPVPPQGSILGVEVSVHRKSYSTQLVDMKDNNGKENMIRAQDGGFVGLNLFT 178
Query: 179 LTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNA 238
L +PQIDGGS LS+++ W QK+ Y + F +NVPF FP++V P KK+ K+EKI + +NA
Sbjct: 179 LRIPQIDGGSNLSVKISWYQKILYSNDVFCLNVPFNFPDFVNPVGKKMEKKEKIEIIMNA 238
Query: 239 GTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQS 298
TG+E+LC T SH LK++R + G +G+SYES+VL WS DF FSY S SHI GGVLL+S
Sbjct: 239 ITGSEILCKTMSHPLKKVRHNAGSMGFSYESDVLSWSKTDFSFSYADSSSHISGGVLLES 298
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
PS+ D DQREMFCMYL PG + RK+F+KD++F++DISGSM+GK ++ TKNAL+ ALSKL
Sbjct: 299 PSVDDFDQREMFCMYLSPGNLQGRKIFRKDIVFVIDISGSMRGKLIDHTKNALSAALSKL 358
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML 418
+P DSF+I+AFNGE Y FS SMELA+K+AVERA +WI +NFIAGG TNI PL A+EML
Sbjct: 359 NPHDSFSIIAFNGEIYQFSKSMELASKDAVERAIEWINMNFIAGGDTNILHPLNTAIEML 418
Query: 419 TNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
++++ S+PIIFLVTDG VEDERQICD MK+ +TNG SI PRIYTFGIGS+CNHYFLRML+
Sbjct: 419 SDAQSSVPIIFLVTDGTVEDERQICDMMKNHMTNGESISPRIYTFGIGSFCNHYFLRMLS 478
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
MI RG + AA D+D IE QM KLF + S VLANI +D D+DE E+ PS IPDLSS+
Sbjct: 479 MIGRGQHVAALDVDLIEPQMLKLFDKASSLVLANITMDIFNDVDEVEVCPSHIPDLSSDG 538
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF 598
PL++ GRY+G FP L+ KG L D SNFV++LK+Q AK IP+ RICA+ QI+ LTAQAW
Sbjct: 539 PLLLCGRYKGSFPKDLEIKGVLPDFSNFVIDLKIQEAKGIPVQRICARDQIEYLTAQAWL 598
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEV 658
S+D++LE+ V+K+S+QTG EYT M I+E D R A ES G K KK
Sbjct: 599 SKDEKLEQMVAKLSLQTGFTSEYTSMTILEIDLRKKAKESDGKKVCIKKQ---------- 648
Query: 659 PKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCI 718
+ I+L LGIGFG+LTATAEN PGSE K P+ AEIFVKAA+NCC T N CCC+CCI
Sbjct: 649 -RVILLPPLGIGFGNLTATAENIQPGSEETKGPDGAEIFVKAATNCCGTCCNYCCCMCCI 707
Query: 719 ECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
+ ++N+QC I TQLC L CF C CC+++CC G +
Sbjct: 708 QAFTRINNQCAIAFTQLCVGLGCFSCINCCADICCSGNE 746
>gi|224119512|ref|XP_002331179.1| predicted protein [Populus trichocarpa]
gi|222873300|gb|EEF10431.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 846 bits (2185), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/762 (54%), Positives = 538/762 (70%), Gaps = 27/762 (3%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGA------VDMLLPKSPMVYA 54
MA +F VE GL LSKR+Y+GK+ P V ++M +S + + LP + M YA
Sbjct: 1 MAREFATCVEYGLSLSKRIYYGKE---MTPAVSAAMTRSMSSKSSELAESYLPTAVMAYA 57
Query: 55 VITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMN-AVELDVDCYLDTAFVRVSGTWRVH 113
V+ +P++VDNPD+PSYQPYVHGRC+PP+LIPLQM+ AV +++DC D A V SG WRVH
Sbjct: 58 VVPEPELVDNPDVPSYQPYVHGRCEPPALIPLQMHGAVAMEIDCCFDHANVCFSGAWRVH 117
Query: 114 CVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLK 173
C+ S+ CD RIAVPMG+QGS+LGVE +++G+SYH+QLI + +G K + G +K
Sbjct: 118 CIKASRKCDVRIAVPMGEQGSLLGVEVDVTGRSYHSQLIQAEDANGNEKVSRGWNGRLIK 177
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIH 233
++++ +P++ GGS SI++ WSQKL Y +G+FS+NVPF FP +V P KKI KREKI
Sbjct: 178 GSMYSFEIPEVGGGSTFSIKVTWSQKLLYHEGQFSLNVPFSFPSFVNPIGKKISKREKIL 237
Query: 234 LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGG 293
LNVN+G G E+LC +SH LK+LRR+VGK+G+ Y++EVL WS+ DF FSY V +FGG
Sbjct: 238 LNVNSGVGKEILCRCASHALKELRREVGKMGFLYDAEVLTWSSADFSFSYNVYSKDLFGG 297
Query: 294 VLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
VLLQSP L D D R+MFC YL PG +S K F+K+VIF++DISGSM+G P E KN L
Sbjct: 298 VLLQSPFLRDFDDRQMFCCYLFPGDNQSMKAFRKEVIFLIDISGSMKGNPFESAKNGLLS 357
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
+L KL+P DSFNI+AFN ETYLFS+ ME ATKEA+ +A QW+ N A G TNI APL +
Sbjct: 358 SLQKLNPEDSFNIIAFNVETYLFSSLMEQATKEAILKATQWLNDNLTADGGTNILAPLEQ 417
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
A+++L + SIP+IFL+TDGAVEDER IC+ +K LT+GGSI RI TFGIG+YCNHYF
Sbjct: 418 ALKLLAETTDSIPLIFLITDGAVEDERDICNFVKGSLTSGGSISLRICTFGIGTYCNHYF 477
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
LRMLA I RG++ AYD DS++ +MQ+LF S +LANI +D L+ LD E+ P IPD
Sbjct: 478 LRMLAQIGRGHFDTAYDADSVDFRMQRLFATASSIILANITVDALESLDSLELLPFCIPD 537
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLT 593
LS PLIVSGRY G FPD++K G L D+ F +++K Q AKD+P+DR+ A++QIDLLT
Sbjct: 538 LSCGCPLIVSGRYSGNFPDSVKLSGILADMRKFTIDIKAQKAKDLPVDRVVARRQIDLLT 597
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKI 653
A AW S K LE+KV+KMS+Q+GV EYT M++ T ASE+ +
Sbjct: 598 ANAWLSGSKELEQKVAKMSIQSGVPSEYTLMVLHHTLREEKASETILIQN---------- 647
Query: 654 LDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCC 713
IML NL +GFG+L+ATAEN PPG+E K + E+F AASNCCS + +CC
Sbjct: 648 ------TKIMLGNLCVGFGNLSATAENIPPGTEETKSSDATEMF-NAASNCCSRVVERCC 700
Query: 714 CLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCG 755
C+C I+ C+ +N+QC IVL+QLC ALACF C CC LC CG
Sbjct: 701 CMCFIQTCSYMNNQCAIVLSQLCAALACFECMNCCIELCECG 742
>gi|224073632|ref|XP_002304123.1| predicted protein [Populus trichocarpa]
gi|222841555|gb|EEE79102.1| predicted protein [Populus trichocarpa]
Length = 757
Score = 843 bits (2177), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/761 (52%), Positives = 535/761 (70%), Gaps = 10/761 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPS-----SMDKSGAVDMLLPKSPMVYAV 55
MA +F VE GL LSKR+Y+GK+ +TP + S S + P + M YAV
Sbjct: 1 MAREFATCVEYGLSLSKRIYYGKE--MTPAATAAMTRSVSEKSSDVAESYFPTAVMAYAV 58
Query: 56 ITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAV-ELDVDCYLDTAFVRVSGTWRVHC 114
+++P++V+NPD+PSYQPYVHGRC+PP+LIPL M+ V +++DC D A V SG WRVHC
Sbjct: 59 VSEPELVENPDVPSYQPYVHGRCEPPALIPLHMHGVVTMEIDCCFDHANVCFSGAWRVHC 118
Query: 115 VMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKP 174
+ G+ CD RIAVPMG+QGS+LGVE ++ G+S H+QLI + + K + G ++
Sbjct: 119 IKGNSKCDVRIAVPMGEQGSLLGVEVDVPGRSCHSQLIEAEDANYKEKVSKGGNGRLMRG 178
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL 234
+++T +PQ++GG+ +SI++ WSQKL Y +G+FS+NVPF FP +V P KKI KREKI L
Sbjct: 179 SVYTFKIPQVEGGATVSIKVIWSQKLLYHEGQFSLNVPFSFPAFVNPVGKKISKREKILL 238
Query: 235 NVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGV 294
NV++G G E+LC ++SH LK+LRR+VGK+G+ Y++EVL WS+ DF FSY V +FGGV
Sbjct: 239 NVSSGVGKEILCKSTSHALKELRREVGKMGFLYDAEVLTWSSADFSFSYNVYSKDLFGGV 298
Query: 295 LLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVA 354
LLQSP L D+D R+MFC YL PG +S K F+K+VIFI+DISGSM+G P E KN L +
Sbjct: 299 LLQSPLLRDIDDRQMFCFYLFPGNNQSMKAFRKEVIFIIDISGSMKGGPFESAKNGLLSS 358
Query: 355 LSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L KL+P DSFNI+AF +TYLFS+ ME AT+EA+ A +W+ A G TNI PL +A
Sbjct: 359 LQKLNPEDSFNIIAFKMDTYLFSSVMEQATEEAIIEATRWLNDKLTADGGTNILGPLKQA 418
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+++L + SIP+IFL+TDGAVEDER IC+ +K L +GGSI RI TFGIG+YCNH+FL
Sbjct: 419 IKLLAETTNSIPVIFLITDGAVEDERDICNFVKGYLPSGGSISLRISTFGIGTYCNHHFL 478
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
RMLA I RG++ AYD DS++ +MQKLFT S +LA+I +D L+ LD E+ P IPDL
Sbjct: 479 RMLAQIGRGHFDTAYDADSVDFRMQKLFTTASSIILADITVDALEHLDSLELLPFCIPDL 538
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTA 594
S SPLIVSGRY G FPD +K G L D+ F ++LK Q KD+P DR+ ++QIDLLTA
Sbjct: 539 SRGSPLIVSGRYSGNFPDPVKISGTLADMRKFTIDLKAQKTKDLPFDRVVGRRQIDLLTA 598
Query: 595 QAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKIL 654
AW S K LE+KV+KMS+QTG EYT M++ +T + A E+ + K + K +
Sbjct: 599 NAWLSGSKELEQKVAKMSIQTGFPSEYTLMVLHQTLQDEKAPETILIQDVFNKINPLKKM 658
Query: 655 DSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCC 714
D + KTIML NL +GFG+LTATAEN PPG+E K E E+ KAASNCC + ++CCC
Sbjct: 659 D--LQKTIMLGNLCVGFGNLTATAENIPPGTEETKSSEATEMLFKAASNCCGRVVDRCCC 716
Query: 715 LCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCG 755
+C I+ C+ +N+QC IVL+Q+C ALAC C CC LC CG
Sbjct: 717 MCFIQTCSYMNNQCAIVLSQICAALACVECINCCIELCECG 757
>gi|449447914|ref|XP_004141711.1| PREDICTED: uncharacterized protein LOC101206287 [Cucumis sativus]
gi|449480498|ref|XP_004155911.1| PREDICTED: uncharacterized LOC101206287 [Cucumis sativus]
Length = 754
Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/757 (51%), Positives = 542/757 (71%), Gaps = 9/757 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA +F +VE GL LSKR+Y+GK TP + M + + LP++PMVYAVI +P
Sbjct: 1 MATEFSNSVEYGLHLSKRIYYGKGS--TPAALARQMSR--VSEDYLPRAPMVYAVIPEPT 56
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDIPSYQPYVHGRC PP+LIPL MN V ++++C DTAF+ V+GTWRVHCVM +S
Sbjct: 57 IVDNPDIPSYQPYVHGRCVPPALIPLHMNGVSMEINCCFDTAFICVNGTWRVHCVMAGRS 116
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLK-PNIFTL 179
C+C IAVPMG+QGS+LGVE +++G S+ T+L+++ + + K E G FLK I+TL
Sbjct: 117 CECLIAVPMGEQGSLLGVEVDVTGTSHRTELVSMEDAEAIEKLTKSEDGKFLKGRRIYTL 176
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAG 239
+P+++GG LS+R+ WSQ++ Y D F ++VPF FP Y+ P KKI +KI L++N+G
Sbjct: 177 KIPKVEGGCTLSVRINWSQRIPYLDDLFCLSVPFSFPAYLVPPGKKIKNSQKILLHINSG 236
Query: 240 TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSP 299
+EV+C +SH +K LRR+VG L +S ++EV WSN+DFD SY++SP+ +FGGVLLQSP
Sbjct: 237 VSSEVVCKHTSHPMKILRREVGNLSFSNDAEVSAWSNMDFDLSYSISPNDLFGGVLLQSP 296
Query: 300 SLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
SLHD DQREMFC+Y+ PG +R+VF+K+V+FI+DISGSM+ PLE TK A+ +LS+L+
Sbjct: 297 SLHDFDQREMFCLYIFPGQNHNRQVFRKEVVFIIDISGSMKDGPLESTKRAVLASLSQLN 356
Query: 360 PGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
P D+FNI+ FNG+T LFS +ME ATKEA+ RA WI N +A G TNI P+ +A++ML
Sbjct: 357 PEDAFNIIGFNGDTKLFSLTMEQATKEAITRATDWINANLVANGGTNILLPVEQAIKMLA 416
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+ SIP+IFL+TDG+V++ER+IC+ +K+ L +G I PR+ TFGIG++CNHYFL+ML+
Sbjct: 417 ETGNSIPLIFLITDGSVDNEREICNLVKASLKSGNKISPRLCTFGIGTFCNHYFLQMLSE 476
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
I RG Y AAYD+D I+ + Q LFT+ S LANI +D K LD FE++P++IPDL+ SP
Sbjct: 477 IGRGIYDAAYDVDLIDTRFQTLFTKASSLFLANITVDAFKHLDSFELFPTQIPDLACGSP 536
Query: 540 LIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFS 599
LI+SGRY G FP++ K G D+SN + L+ Q AK++ LDR+ A++QID++T+ AW
Sbjct: 537 LIISGRYNGCFPESFKVSGTSADMSNSTIHLQAQRAKELLLDRVLARRQIDIMTSHAWLL 596
Query: 600 EDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSD--HQKILDSE 657
E K L++K++K+S Q+G EYTR+I+V E A S ++K K+ D K ++ +
Sbjct: 597 ESKDLQDKIAKLSKQSGFSSEYTRLILVLAKEGKKAP-SIISQKMRKRFDLTKSKKVEWK 655
Query: 658 VPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCC 717
K I+L N G+GFG+LTATAEN PG EI K + ++ VKAA+NC L ++ CCLC
Sbjct: 656 GQKIILLGNQGVGFGNLTATAENLQPGKEI-KATQATDLLVKAATNCFGRLIDRFCCLCF 714
Query: 718 IECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCC 754
I+ C +ND+CV+ TQL ALAC F CC LC C
Sbjct: 715 IKSCMYMNDRCVVAFTQLTAALACCEIFNCCFELCEC 751
>gi|356519838|ref|XP_003528576.1| PREDICTED: uncharacterized protein LOC100807956 [Glycine max]
Length = 756
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/759 (53%), Positives = 546/759 (71%), Gaps = 12/759 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTP-PKVPSSMDKSGAVDMLLPKSPMVYAVITDP 59
MAD F +VE GL LSKR+Y+GK +P P + S+ KS + LP +PM YAVI+DP
Sbjct: 1 MADYFSGSVEFGLNLSKRIYYGKGSAPSPAPAMSRSLSKSP--EGYLPTAPMCYAVISDP 58
Query: 60 KIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSK 119
+ V+NPDI SYQPYV G+C+PP+LIPL+++ V ++V+C LDTAFV V+G WRVHCV S
Sbjct: 59 EKVENPDIRSYQPYVLGQCEPPALIPLELHGVAMEVECSLDTAFVTVTGKWRVHCVTASS 118
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTL 179
+CDC++A+P+G++GS+LG+E + SG+SYHT+LI+L + K A + G FLK +I+T+
Sbjct: 119 TCDCQVAIPIGEEGSLLGLEVDGSGRSYHTELISLKDEKDKNKVAKAKDGYFLKSHIYTV 178
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAG 239
+PQ GGS SI++RWSQK+ + DG+ S+ VPF FP YV PA + I K+EKI L VN+G
Sbjct: 179 KIPQFRGGSVFSIKIRWSQKILFHDGQLSLCVPFSFPSYVNPAGRNISKKEKIFLKVNSG 238
Query: 240 TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSP 299
TEVLC T+SH K+L R GKL SYE+EVL WS+ D FSY+VS + IFG VLLQSP
Sbjct: 239 ATTEVLCKTTSHPFKELLRQAGKLNLSYEAEVLTWSSTDLSFSYSVSSNDIFGAVLLQSP 298
Query: 300 SLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
L D D+RE+FC+YL PG + RKVFKKDV+F+VDIS SM+G PL++TKN+L +LS+L+
Sbjct: 299 FLRDFDEREIFCLYLYPGNSPDRKVFKKDVVFVVDISASMKGTPLDNTKNSLLTSLSQLN 358
Query: 360 PGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
D+FNI+AFNG YLFS+SME AT+EA+ A W+ NFIA G TNI PLT+A+++L
Sbjct: 359 AQDTFNIIAFNGAVYLFSSSMERATEEAILNATTWVDTNFIANGDTNIMLPLTQAMKLLE 418
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSI-CPRIYTFGIGSYCNHYFLRMLA 478
S S+P+IFLVTDGAVEDER IC+ +KS +++G S PRIYTFGIG YCNHYFL+MLA
Sbjct: 419 KSTDSVPLIFLVTDGAVEDERDICNFVKSYVSSGQSFRTPRIYTFGIGLYCNHYFLQMLA 478
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
I RG+Y +A+DLDSI+ +MQ+LF+ S ++A+I I +L+ LD E++P+ I D+S ES
Sbjct: 479 QIGRGHYDSAHDLDSIDFRMQRLFSSASSVMVADITIKSLEGLDSLELFPTHIQDISFES 538
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF 598
PLI+SGRY G FP+ +K G L D++NFVV+LK++ KD+ L + +K+ ID +TAQAW
Sbjct: 539 PLILSGRYSGTFPELVKVTGTLADMTNFVVDLKVKREKDMQLSNVLSKRHIDQVTAQAWL 598
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRM--IIVETDERNNASESPGTKKGSKKSDHQKI-LD 655
+ K LEEKV+KMS+Q V EYT M ++V++DE A E +K K QK+ L+
Sbjct: 599 LKSKELEEKVTKMSIQNKVPSEYTCMNIVLVQSDEGKKAPEQFLLQKAYNKLSFQKLELN 658
Query: 656 SEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCL 715
++ + L L +GFGDL ATAEN PP + K PE + KAASNCC L + CC +
Sbjct: 659 NQ---NLFLGGLNLGFGDLKATAENLPPAIKEAKPPE--GLLGKAASNCCGRLADTCCGM 713
Query: 716 CCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCC 754
C ++ C +NDQC IV TQ+C ALACF +CC LC C
Sbjct: 714 CLLQTCTFVNDQCTIVCTQICAALACFELIKCCVELCDC 752
>gi|357479409|ref|XP_003609990.1| von Willebrand factor A domain-containing protein [Medicago
truncatula]
gi|355511045|gb|AES92187.1| von Willebrand factor A domain-containing protein [Medicago
truncatula]
Length = 844
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/667 (59%), Positives = 500/667 (74%), Gaps = 28/667 (4%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
MA++F +AV+DG+KLSKR+YFGKDR V PPKVP+ M KS LLPK+PMVYAVI DP
Sbjct: 1 MAEEFCRAVDDGMKLSKRIYFGKDRAVAPPKVPT-MSKSST--SLLPKAPMVYAVIHDPG 57
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
IVDNPDI SYQP+V+GRCDPP+LIPLQMNA+E+++DCYLDTAFV VSGTWRVHCV GS +
Sbjct: 58 IVDNPDIASYQPHVYGRCDPPALIPLQMNAIEMEIDCYLDTAFVTVSGTWRVHCVTGSST 117
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLT 180
DCR+A+P+GD+GSILGVE KSY T+L+ + E + + G FLK NIFTLT
Sbjct: 118 TDCRLAIPIGDEGSILGVEVNALRKSYSTRLVVMDECKENQSATPAQNGGFLKSNIFTLT 177
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+PQIDGGS LSI++RWSQK+ +GE+S+NVPF FPE++ PA K++ KREKI LNV+
Sbjct: 178 IPQIDGGSNLSIKMRWSQKVVCSNGEYSLNVPFTFPEFINPAGKRMSKREKIRLNVDVVA 237
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
G+E+L T+SH L+Q+RR G +G+ ++EVL WS DF FSY VS SHI GGVLL+S S
Sbjct: 238 GSELLFKTTSHPLQQVRRHAGSMGFVCDNEVLSWSKDDFSFSYAVSSSHINGGVLLESAS 297
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
+HD DQREMF +YL PG + +KVFKKD+IFI+DISGSMQ K ++DTKNAL ALSKL+
Sbjct: 298 VHDFDQREMFYLYLSPGNIQKKKVFKKDIIFIIDISGSMQEKLMDDTKNALLSALSKLNS 357
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
D F I+AFNGE++L+S SMELA+ +AVERA +WI +NF+AGG TNI PL A+E+L+
Sbjct: 358 NDMFTIIAFNGESHLYSKSMELASNDAVERAREWINLNFVAGGGTNISHPLNTAIEILSG 417
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
++ S+PIIFLVTDG VEDERQIC +K+ + NG SICPRIYTFGIG +CNHYFLRMLA I
Sbjct: 418 AQSSVPIIFLVTDGTVEDERQICTMVKNHMINGESICPRIYTFGIGPFCNHYFLRMLATI 477
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFE--------------- 525
RG Y AA D+D IE +M LF + S +LANI +D L ++D+ E
Sbjct: 478 GRGQYDAALDVDMIESRMLTLFEKASSLILANIKMDMLDEIDQVEVHSCCWGICCYSLDH 537
Query: 526 ----------MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLA 575
+YPS IPDLSSE PLI+SGRY+G FP+TLK KG L D SNF ++LK+Q
Sbjct: 538 SVSFYTFFSVVYPSHIPDLSSEGPLILSGRYKGNFPETLKVKGILADFSNFEIDLKIQKD 597
Query: 576 KDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
K IP+ R+ A+ QI LTAQAW SE+K+LE+KV+K+S+QTG + EYT MII+E D
Sbjct: 598 KGIPVQRVSARDQIQHLTAQAWLSENKQLEQKVAKLSLQTGFISEYTCMIILENDHLKKI 657
Query: 636 SESPGTK 642
ESPG K
Sbjct: 658 KESPGEK 664
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/114 (50%), Positives = 76/114 (66%), Gaps = 3/114 (2%)
Query: 645 SKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNC 704
SKK + + + + I+L NLG+GFG+L ATAEN PPG E E+AEIF KAASNC
Sbjct: 734 SKKKHSRYEANVQGQRMILLPNLGVGFGNLKATAENTPPGYEAKI--EMAEIF-KAASNC 790
Query: 705 CSTLGNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQDG 758
T+ CCC+CCI+C +++N+QC+ L QLC L C C CCS++CC G DG
Sbjct: 791 FDTVCGYCCCMCCIQCFSRMNNQCLTALNQLCIGLGCLACLNCCSHICCSGNDG 844
>gi|357111882|ref|XP_003557739.1| PREDICTED: uncharacterized protein LOC100836064 [Brachypodium
distachyon]
Length = 754
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/759 (49%), Positives = 518/759 (68%), Gaps = 13/759 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDM--LLPKSPMVYAVITD 58
M +DF +AVEDGLKLSKRL P + P+ MD++ A D LLP +PM YAV+TD
Sbjct: 1 MDEDFARAVEDGLKLSKRLVLPGGAPPP--RPPAGMDRAAAWDAAPLLPTAPMAYAVVTD 58
Query: 59 PKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLD--TAFVRVSGTWRVHCVM 116
P VD+PD+PSYQP+V+GR +PP+LIPLQM ++L VDC TA V V W +HC+
Sbjct: 59 PGTVDSPDVPSYQPHVYGRLNPPALIPLQMREIDLRVDCVAGCATAEVTVRARWWLHCIT 118
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNI 176
S+ CDCR+ VPMG+QGS+LG E + +SY+T +I + E++ A A E+G+ LK
Sbjct: 119 RSRDCDCRVVVPMGEQGSLLGAEVTVGKRSYNTHVIDV-EDNSAVSIAKTESGALLKSQF 177
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNV 236
F+LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP+YV P K KREKI L V
Sbjct: 178 FSLTIPQVVGGEDIFATIRWSQKLLYDNGQFSVDIPFRFPQYVNPLPKLFTKREKIQLTV 237
Query: 237 NAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLL 296
N+G EVL +SH LK+ R KL + +E+ V WS DF F+Y+V + + GGVL+
Sbjct: 238 NSGVSKEVLLQGTSHPLKEKGRQGDKLSFLHEAIVESWSIKDFTFAYSVYSADLSGGVLV 297
Query: 297 QSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALS 356
Q +L D D R+MFC++LLPG ++RKVF+K V+FIVD SGSMQGKP+E+ K+A++ ALS
Sbjct: 298 QPSTLRDYDDRDMFCIFLLPGNNQNRKVFRKAVVFIVDTSGSMQGKPIENVKSAVSTALS 357
Query: 357 KLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE 416
+L GD FNIV FN E + FS+ +E + E A +W+ ++F+A G T+I PL++A+
Sbjct: 358 ELVQGDYFNIVTFNDELHSFSSCLEQVNDKTTENAIEWMNLHFVAQGGTDIMHPLSEALA 417
Query: 417 MLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
+L+NS ++P IFLV DG+VEDER IC +K++LTN G + PRI TFG+GSYCNHYFLRM
Sbjct: 418 LLSNSHDTLPQIFLVADGSVEDERNICRTVKTQLTNRGPMSPRISTFGLGSYCNHYFLRM 477
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
LA I +G+Y AA+D SIE +M + F + S+++ +I+ID KD+ EFE+Y IPD+S+
Sbjct: 478 LASIGKGHYDAAFDTGSIEGRMLQWFRKASSTIVTDISIDATKDIQEFEVYSEYIPDVSA 537
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQA 596
PL VSGRY+GK P+TL AKG+L D S +ELK+Q KDIPL ++ AKQQ+DLLT++A
Sbjct: 538 NYPLFVSGRYRGKLPETLSAKGYLADKSEISIELKVQHIKDIPLGKVLAKQQVDLLTSKA 597
Query: 597 WFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDS 656
W SE+K+LE+KV+K+S+Q + EYTR ++++T G +K K + S
Sbjct: 598 WLSENKQLEQKVAKLSIQNSIPSEYTRTVLLQTILEKVDPAQQGKQKPKKHGSADE--SS 655
Query: 657 EVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLC 716
P L L +GFG++ AT EN G K P+ E+F K+ CCS + + CCC+C
Sbjct: 656 ATP----LNGLTLGFGNVAATMENLTSGFGDTKAPDKFEMFGKSVGGCCSRVADCCCCMC 711
Query: 717 CIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCG 755
I C+K+NDQC IV+ QLC AL+ GCFECCS LCC G
Sbjct: 712 FINACSKMNDQCAIVMVQLCAALSFLGCFECCSELCCGG 750
>gi|218200012|gb|EEC82439.1| hypothetical protein OsI_26857 [Oryza sativa Indica Group]
Length = 863
Score = 766 bits (1979), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/762 (50%), Positives = 509/762 (66%), Gaps = 29/762 (3%)
Query: 3 DDFVKAVEDGLKLSKRLYF-----------GKDRPVTPPKVPSSMDKSGAVDMLLPKSPM 51
+ F +AVEDGLKLSKRL G DR V D S A +LLP +PM
Sbjct: 114 EGFARAVEDGLKLSKRLVLPGGLPPPRPLAGMDRGVG-----GGGDASVAA-LLLPSAPM 167
Query: 52 VYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWR 111
YAV+TDP VD PD+PSYQPYV+GR DPP+LIPLQM V+L VDC L A V V W
Sbjct: 168 AYAVVTDPGAVDTPDVPSYQPYVYGRLDPPALIPLQMKEVDLSVDCALGEAAVTVRARWW 227
Query: 112 VHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSF 171
+HC+ S+ CD RI VPMGDQGSILG E + +SY+TQ+I + E+ +A ++G
Sbjct: 228 LHCITRSRDCDVRIVVPMGDQGSILGAEVTVGRRSYNTQVIEV-EDHTTENAAKADSGGL 286
Query: 172 LKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREK 231
LKP +F LT+ QI+GG+ +S +RWSQKL Y +G FSV++PF+FP YV P K KREK
Sbjct: 287 LKPQLFYLTISQIEGGADISATIRWSQKLHYDNGRFSVDIPFRFPYYVNPLPKVFMKREK 346
Query: 232 IHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIF 291
I L VN+G EVL +SH LK+ R KL + +E+ V WS+ DF+FSY+V +
Sbjct: 347 IQLTVNSGFSKEVLLQGTSHPLKEKGRQGEKLSFRHEATVENWSSKDFNFSYSVYSGDLS 406
Query: 292 GGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNAL 351
GGVL+Q +L D D R+ FC++LLPG +RKVF+K V+F++D SGSMQG PLE+ KNA+
Sbjct: 407 GGVLVQPSTLRDYDDRDRFCIFLLPG-GGNRKVFRKAVVFVIDTSGSMQGHPLENVKNAM 465
Query: 352 AVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPL 411
+ ALS+L GD FNI+ FN E + FS+ +E ++++ A WI +NF+AGG T+I PL
Sbjct: 466 STALSELTEGDYFNIITFNDELHSFSSCLEKVNEKSIASALDWINLNFVAGGGTDIMHPL 525
Query: 412 TKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNH 471
+A+ L+++ +P IFL+TDG+V+DE IC +K+ L + GS PRI TFG+G YCNH
Sbjct: 526 NEAMASLSSAHDVLPQIFLMTDGSVDDEHNICQTVKTELISRGSKSPRISTFGLGLYCNH 585
Query: 472 YFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRI 531
YFLRMLA I RG+Y AA++ SIE Q+ + F + S+++ANI+ID LDEFE+ I
Sbjct: 586 YFLRMLASIGRGHYDAAFETGSIESQVLRWFRKASSTIVANISIDATAHLDEFEIDSEYI 645
Query: 532 PDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDL 591
PD+S++SPL +SG+YQGKFPD + AKG+L D+ +ELK+Q KDIPLD++ A QQI L
Sbjct: 646 PDISAKSPLCISGKYQGKFPDMVTAKGYLADMREISIELKVQHIKDIPLDKVLAAQQIGL 705
Query: 592 LTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTK-KGSKKSDH 650
LTA+AW S DK+LE KV K+S+Q + EYT M++++T E+ +A++ K KG K D
Sbjct: 706 LTAKAWLSSDKQLERKVVKLSIQNSIPSEYTSMVLLQTLEKVDAAQKVKQKLKGHKGPDE 765
Query: 651 QKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGN 710
P+ I LQ L +GFGD AT EN G K E EI KAA CCS L +
Sbjct: 766 --------PRRIPLQCLKLGFGDRAATRENLVTGFGDVKPLETFEILNKAA-GCCSRLAD 816
Query: 711 KCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLC 752
CC+CCI+ CNK+NDQC IV+TQ+C A AC GC+ECC+ LC
Sbjct: 817 CLCCMCCIKACNKMNDQCAIVMTQVCAAFACLGCYECCAELC 858
>gi|222637454|gb|EEE67586.1| hypothetical protein OsJ_25118 [Oryza sativa Japonica Group]
Length = 755
Score = 761 bits (1965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/760 (50%), Positives = 507/760 (66%), Gaps = 25/760 (3%)
Query: 5 FVKAVEDGLKLSKRLYF-----------GKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVY 53
F +AVEDGLKLSKRL G DR V D S A +LLP +PM Y
Sbjct: 4 FARAVEDGLKLSKRLVLPGGLPPPRPLAGMDRGVGVGGG-GGGDASVAA-LLLPSAPMAY 61
Query: 54 AVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVH 113
AV+TDP VD PD+PSYQPYV+GR DPP+LIPLQM V+L VDC L A V V W +H
Sbjct: 62 AVVTDPGAVDTPDVPSYQPYVYGRLDPPALIPLQMKEVDLSVDCALGEAAVTVRARWWLH 121
Query: 114 CVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLK 173
C+ S+ CD RI VPMGDQGSILG E + +SY+TQ+I + E+ + ++G LK
Sbjct: 122 CITRSRDCDVRIVVPMGDQGSILGAEVTVGRRSYNTQVIEV-EDHTTENAPKADSGGLLK 180
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIH 233
P +F LT+ QI+GG+ +S +RWSQKL Y +G FSV++PF+FP YV P K KREKI
Sbjct: 181 PQLFYLTISQIEGGADISATIRWSQKLHYDNGRFSVDIPFRFPYYVNPLPKVFMKREKIQ 240
Query: 234 LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGG 293
L VN+G EVL +SH LK+ R KL + +E+ V WS+ DF+FSY+V + GG
Sbjct: 241 LTVNSGFSKEVLLQGTSHPLKEKGRQGEKLSFRHEATVENWSSKDFNFSYSVYSGDLSGG 300
Query: 294 VLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
VL+Q +L D D R+ FC++LLPG +RKVF+K V+F++D SGSMQG PLE+ KNA++
Sbjct: 301 VLVQPSTLRDYDDRDRFCIFLLPGGG-NRKVFRKAVVFVIDTSGSMQGHPLENVKNAMST 359
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
ALS+L GD FNI+ FN E + FS+ +E ++++ A WI +NF+AGG T+I PL +
Sbjct: 360 ALSELTEGDYFNIITFNDELHSFSSCLEKVNEKSIASALDWINLNFVAGGGTDIMHPLNE 419
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
A+ L+++ +P IFL+TDG+V+DE IC +K+ L + GS PRI TFG+G YCNHYF
Sbjct: 420 AMASLSSAHDVLPQIFLMTDGSVDDEHNICQTVKTELISRGSKSPRISTFGLGLYCNHYF 479
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
LRMLA I RG+Y AA++ SIE Q+ + F + S+++ANI+ID LDEFE+ IPD
Sbjct: 480 LRMLASIGRGHYDAAFETGSIESQVLRWFRKASSTIVANISIDATAHLDEFEIDSEYIPD 539
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLT 593
+S++SPL +SG+YQGKFPD + AKG+L D+ +ELK+Q KDIPLD++ A QQI LLT
Sbjct: 540 ISAKSPLCISGKYQGKFPDMVTAKGYLADMREISIELKVQHIKDIPLDKVLAAQQIGLLT 599
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTK-KGSKKSDHQK 652
A+AW S DK+LE KV K+S+Q + EYT M++++T E+ +A++ K KG K D
Sbjct: 600 AKAWLSSDKQLERKVVKLSIQNSIPSEYTSMVLLQTLEKVDAAQKVKQKLKGHKGPDE-- 657
Query: 653 ILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKC 712
P+ I LQ L +GFGD AT EN G K E EI KAA CCS L +
Sbjct: 658 ------PRRIPLQCLKLGFGDRAATRENLVTGFGDVKPLETFEILNKAA-GCCSRLADCL 710
Query: 713 CCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLC 752
CC+CCI+ CNK+NDQC IV+TQ+C A AC GC+ECC+ LC
Sbjct: 711 CCMCCIKACNKMNDQCAIVMTQVCAAFACLGCYECCAELC 750
>gi|242046276|ref|XP_002461009.1| hypothetical protein SORBIDRAFT_02g039160 [Sorghum bicolor]
gi|241924386|gb|EER97530.1| hypothetical protein SORBIDRAFT_02g039160 [Sorghum bicolor]
Length = 751
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/766 (48%), Positives = 511/766 (66%), Gaps = 28/766 (3%)
Query: 3 DDFVKAVEDGLKLSKRLYF-----------GKDRPVTPPKVPSSMDKSGAVDMLLPKSPM 51
+DF +AVEDGLKLSKRL G +R V S+ SG LLP +PM
Sbjct: 2 EDFARAVEDGLKLSKRLVLPGGLPPPRPPAGMERTV------SAAAASGPDPRLLPTAPM 55
Query: 52 VYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWR 111
YAV+TDP VD PD+PSYQPYV+G DPP+LIPLQM V+L VDC +DTA V + W
Sbjct: 56 AYAVVTDPAAVDTPDVPSYQPYVYGNLDPPALIPLQMKEVDLTVDCAVDTAHVTLRARWW 115
Query: 112 VHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSF 171
+HC+ S+ CD R+ VP+ +QGSILG E I +SY+TQ+I + D A +E+G
Sbjct: 116 LHCITRSRECDVRLVVPLVEQGSILGAEVAIGRRSYNTQVIEV--EDTLESHAKIESGGL 173
Query: 172 LKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREK 231
LKP++F LT+PQ++GG+ + RWSQKL Y G FSV +PF+FP +V P K KREK
Sbjct: 174 LKPHLFFLTIPQVEGGADIYATFRWSQKLLYDSGHFSVEIPFRFPYFVNPLPKVFMKREK 233
Query: 232 IHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIF 291
I L VN+G EVL +SH LK+ R KL + +E+ V WS+ DF+FSY+V +
Sbjct: 234 IQLTVNSGFSKEVLLQGTSHPLKEKSRQGDKLFFHHEAIVENWSSKDFNFSYSVYSGDLS 293
Query: 292 GGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNAL 351
GGVL+Q +L D D+R+MFC++LLPG+ +RKVF+K V+FIVD SGSMQG+PLE+ K A+
Sbjct: 294 GGVLVQPATLRDYDERDMFCIFLLPGSG-NRKVFRKAVVFIVDASGSMQGRPLENVKRAV 352
Query: 352 AVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPL 411
+ ALS+L GD FNI+ FN E + F++ +E +A+ A W+ NF+A G T+I PL
Sbjct: 353 STALSELVEGDYFNIITFNDELHSFTSCLEQVNDKAIASATDWMNANFVAEGGTDIMHPL 412
Query: 412 TKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNH 471
++A+ +L++ ++P I+L+TDG+V+DE IC K+ LTN GS PRI TFG+GSYCNH
Sbjct: 413 SEAMALLSSVHDTLPQIYLITDGSVDDEHNICQTTKTELTNRGSKSPRISTFGLGSYCNH 472
Query: 472 YFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRI 531
YFLRMLA I +G+Y AA + SIE ++ K F R S+++ANI+ID + LDEFE+ I
Sbjct: 473 YFLRMLASIGKGHYDAALETASIENRILKWFRRASSTIVANISIDAMTHLDEFEVDSEYI 532
Query: 532 PDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDL 591
PDLS+ SPL VSG+YQGKFPDT+KAKG+L D+ +ELK+Q K+IPLD++ +QIDL
Sbjct: 533 PDLSANSPLCVSGKYQGKFPDTVKAKGYLADMKEISIELKVQQLKEIPLDKVLVTKQIDL 592
Query: 592 LTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQ 651
+TA+AW S DK+LE KV K+S+Q GV EYT M +++T N + GT+K +K Q
Sbjct: 593 MTAKAWLSADKQLERKVIKLSIQNGVPSEYTEMTLLQT----NLDKVDGTQKVKQKLKGQ 648
Query: 652 KILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNK 711
K D + I++ L +GFG+ AT EN G + P + + CCS++ +
Sbjct: 649 KGRDEQ---RILMHGLKLGFGNKDATRENLITGFG-DRNPAEKLVMLHKGKGCCSSVADC 704
Query: 712 CCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
CC+CCI+ CN++NDQC I++ Q+C ALAC GC+ECCS +CC G +
Sbjct: 705 LCCMCCIKACNRMNDQCAILMAQICAALACLGCYECCSEVCCGGSE 750
>gi|297841999|ref|XP_002888881.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp.
lyrata]
gi|297334722|gb|EFH65140.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp.
lyrata]
Length = 757
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/763 (48%), Positives = 510/763 (66%), Gaps = 18/763 (2%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
M+++F VE GLKLS+R+Y+GK +TPP VP D + + LP + YA ITDP
Sbjct: 1 MSEEFALRVEQGLKLSRRIYYGKG--ITPPVVP---DSPSSPENFLPTAITAYASITDPV 55
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
VDNPD+PSYQPYVH RCDP +L+PLQM +E+ VDC+LD AFV V+G WRVHCVM SK
Sbjct: 56 AVDNPDVPSYQPYVHARCDPSALVPLQMLGIEMRVDCWLDMAFVTVTGRWRVHCVMPSKR 115
Query: 121 CDCRIAVPMGDQGSILGVEAEI--SGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFT 178
DC + VPMG++G++LG E ++ + KSY T+L+ E + F K +I+T
Sbjct: 116 FDCCVGVPMGEKGTLLGAEIDVLSNEKSYKTKLVTEDETSDFENVHKDKDSRFFKSHIYT 175
Query: 179 LTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNA 238
+P + GGS S+ + WSQKL YRDG+F +NVPF+FP YV P K+I KREKI LN+N+
Sbjct: 176 FKIPHVVGGSIFSVNVNWSQKLIYRDGKFHLNVPFRFPAYVNPVGKEITKREKIVLNMNS 235
Query: 239 G-TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQ 297
+G EV + +SH LK + R+ GKL YE+EV WS +DF S+ VS + G VL++
Sbjct: 236 CVSGGEVASSYTSHPLKVIHREAGKLSCEYEAEVPSWSRVDFGVSFNVSSGDLSGNVLIK 295
Query: 298 SPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
SPS D D R +FC+YL PGT K +K+FK+ V+F++DIS SM+ KPLED K AL L+K
Sbjct: 296 SPSPWDSDDRGIFCLYLFPGTTKHKKLFKRRVVFVIDISASMKWKPLEDVKKALLECLAK 355
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
L D FNI+AFN E FSTSME AT E + +W+ N IA G TN+ PL +A+++
Sbjct: 356 LQAEDVFNIIAFNDEILEFSTSMEFATDETISAVTEWLDTNLIANGGTNMLLPLKQAIKL 415
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLT-NGGSICPRIYTFGIGSYCNHYFLRM 476
L S +P+++LVTDG+VE+ER+IC+AMK + NG SI PRI TFGIGS+CNHYFL+M
Sbjct: 416 LEGSNIGVPLVYLVTDGSVENEREICNAMKESCSRNGKSISPRISTFGIGSFCNHYFLQM 475
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
LA I GYY + DS E QM +LF S+++AN D LK L E++P IPD++
Sbjct: 476 LARIGNGYYDGTNNTDSFEHQMSRLFDIASSTIVANTTFDALKLLRSVELFPCHIPDITL 535
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQA 596
PLI+SGRY+G+FPD ++ +G L D+S F +EL +Q AKDIPLD++ A++QI+ LTA+A
Sbjct: 536 CDPLILSGRYKGEFPDEVELRGTLADMSCFTIELTVQKAKDIPLDKVLARRQINELTARA 595
Query: 597 WFSEDKRLEEKVSKMSVQTGVLCEYTRMII-VETDERNNASESPGTKKGSKKSD----HQ 651
WF + K L+EKV ++S+QTG EYT+M++ ++ DE P + K ++ H+
Sbjct: 596 WFEDKKELQEKVMRLSIQTGFPSEYTQMVLSLKHDEEEKTMARPVSIKEILRNPPYQIHK 655
Query: 652 KILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNK 711
++ S +T +L G GFG++TAT +N PP E PK E E+ ++AAS + ++
Sbjct: 656 QMQRSNSVRTSLLGKQGYGFGNVTATLKNVPPWIEEPKETEGTELLIRAASG----VVDR 711
Query: 712 CCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCC 754
CC+CC++C ++++DQC IV +QLC ALACF C CC +C C
Sbjct: 712 VCCMCCLQCMSRVSDQCTIVFSQLCAALACFQCIGCCFEVCGC 754
>gi|326500868|dbj|BAJ95100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 752
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/760 (49%), Positives = 513/760 (67%), Gaps = 17/760 (2%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDM---LLPKSPMVYAVIT 57
M +DF +AVEDGL+LSKRL PP+ ++MD+ GA + LLP +PM YAV+T
Sbjct: 1 MEEDFERAVEDGLRLSKRLVLPGG--AGPPRPHAAMDRYGAPEAATSLLPVAPMAYAVVT 58
Query: 58 DPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLD--TAFVRVSGTWRVHCV 115
DP VD+PD+PSY+P+V+GR PP+LIPLQM VEL VDC TA V + W +HC+
Sbjct: 59 DPAAVDSPDVPSYKPHVYGRLSPPALIPLQMREVELRVDCAAGCATAEVTLVARWWLHCL 118
Query: 116 MGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPN 175
S+ CDCR+ VPMG++GSILG E + KSY+T +I + N K E+G LK
Sbjct: 119 TRSRDCDCRLVVPMGEEGSILGAEVTVGKKSYNTHVIDVDGNSPV-KIVKTESGGILKRQ 177
Query: 176 IFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLN 235
F LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP YV P K K+EKI L
Sbjct: 178 FFQLTIPQVVGGEDIFATIRWSQKLLYDNGQFSVDIPFRFPHYVNPLPKIFTKKEKIQLT 237
Query: 236 VNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVL 295
VN+G EVL +SH LK+ R KL + +E+ V WS DF+FSY+V + GGVL
Sbjct: 238 VNSGVSKEVLLQGTSHPLKEKIRQAEKLFFLHEAVVENWSIKDFNFSYSVYSGDVSGGVL 297
Query: 296 LQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
+Q +L D D R++F ++LLPG ++RK+F+K V+FIVD SGSMQGKP+E+ KNA++ A+
Sbjct: 298 VQRSTLRDYDDRDIFSIFLLPGNNQNRKIFRKAVVFIVDTSGSMQGKPIENVKNAISTAV 357
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
S+L+ GD FNIV FN E + FS+ +E + E A W+ F+A G T+I PLT+A+
Sbjct: 358 SELEEGDYFNIVTFNDELHSFSSCLEKVNGKTTESAIDWMNSKFVAQGGTDIMHPLTEAL 417
Query: 416 EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
+L+NS G++P IFLVTDG+VEDER IC +K+ LTN GSI PRI TFG+GSYCNHYFLR
Sbjct: 418 ALLSNSHGALPQIFLVTDGSVEDERNICHTVKTELTNKGSISPRISTFGLGSYCNHYFLR 477
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
MLA I +G+Y AA+D SIE +M + F + S+++A+I+ID K EFE+ IPD+S
Sbjct: 478 MLASIGKGHYDAAFDTGSIEGRMLQWFQKASSTIVADISIDVTKHTQEFEVDCEYIPDVS 537
Query: 536 SESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
++ PL VSGR++G+ P+TL A+G+L D+S +ELK+Q KDIPLD++ AKQQ+DLLTA+
Sbjct: 538 AKYPLFVSGRFRGELPETLYAEGYLSDMSKISIELKVQQIKDIPLDKVLAKQQMDLLTAK 597
Query: 596 AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILD 655
AW SE+K+LE+ V+KMS+Q G+ EYTR ++++T G +K K
Sbjct: 598 AWLSENKQLEQMVAKMSIQNGIPSEYTRTVLLQTIMEKIDPAQQGKQKPKK--------- 648
Query: 656 SEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCL 715
E P + LQ L +GFG++ AT EN G + P+ ++F KA CC + + CCC+
Sbjct: 649 PEEPSAMPLQGLTLGFGNVPATTENLTSGFGDTRAPDKFDMFGKAVGGCCGRVADCCCCM 708
Query: 716 CCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCG 755
C I CC+K+NDQC IV+ Q+C AL+ GCFECCS LCC G
Sbjct: 709 CFINCCSKMNDQCAIVMVQMCAALSFLGCFECCSELCCGG 748
>gi|108708732|gb|ABF96527.1| von Willebrand factor type A domain containing protein, expressed
[Oryza sativa Japonica Group]
gi|218193020|gb|EEC75447.1| hypothetical protein OsI_11983 [Oryza sativa Indica Group]
Length = 751
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/765 (48%), Positives = 496/765 (64%), Gaps = 22/765 (2%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDK-----SGAVDMLLPKSPMVYAV 55
M +DF +AVEDGLKLSKRL P +P VP MD+ + A LLP +PM YAV
Sbjct: 1 MEEDFARAVEDGLKLSKRLVLPGGAP-SPRPVPG-MDRGLLPDAAAASSLLPSAPMAYAV 58
Query: 56 ITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAF---VRVSGTWRV 112
+ DP VD+PD+PSYQP+V+GR DPP+LIPL M V L VDC V + W V
Sbjct: 59 VVDPAAVDSPDVPSYQPHVYGRLDPPALIPLHMREVALAVDCAAAGCAAAEVTLRARWWV 118
Query: 113 HCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFL 172
HC+ S C CRI VPMG+QG+ILG E + +SY T +I + E+ GA K A E+G L
Sbjct: 119 HCLTRSCDCHCRIVVPMGEQGTILGAEVTVGKRSYKTHVIDV-EDQGAVKIAKTESGGLL 177
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
K F+LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP+YV P K K+EKI
Sbjct: 178 KRQFFSLTIPQVGGGEDIFATIRWSQKLLYDNGQFSVDIPFQFPQYVNPLPKVFMKKEKI 237
Query: 233 HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFG 292
L +N+G E++ SSH LK+ R KL + +E+ V WSN DF F+Y+V + G
Sbjct: 238 QLTLNSGVSNEIVLKGSSHPLKERSRQGEKLSFFHEAVVENWSNKDFTFAYSVYSGDLSG 297
Query: 293 GVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALA 352
G+L+Q + D D R+MFC++LLPG + RKVF+ +FI+D SGSMQGKPLE KNA+
Sbjct: 298 GMLVQPSTSDDYDDRDMFCIFLLPGNNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMY 357
Query: 353 VALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT 412
LS+L GD FNI+ FN E + FS+ +E ++ +E A +W+ NFIA G T+I PL+
Sbjct: 358 TTLSELVQGDYFNIITFNDELHSFSSCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLS 417
Query: 413 KAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHY 472
+A+ +L+NS ++P IFLVTDG+VEDER IC +K +L GS PRI TFG+GSYCNHY
Sbjct: 418 EAIALLSNSHNALPQIFLVTDGSVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHY 477
Query: 473 FLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIP 532
FLRMLA I +G+Y AA+D SIE +M + F + S+++ NI+ID K + EFE+ I
Sbjct: 478 FLRMLASIGKGHYDAAFDTGSIEGRMVQWFQKASSTIVINISIDATKYIREFEVDSEYIQ 537
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLL 592
D+S++ PL VSGRYQGK P+TL AKG+L D+S +EL++Q KDI LD++ AKQQ+DLL
Sbjct: 538 DISAKCPLCVSGRYQGKLPETLTAKGYLADMSEISIELRVQHVKDISLDKVLAKQQMDLL 597
Query: 593 TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQK 652
TA+AWF E+ +LE KV K+S+Q + EYTR I+++ KK S +++
Sbjct: 598 TAKAWFYENNQLEMKVVKLSIQNSIPSEYTRTILLQNFVEKIEQGKQKPKKNSTQNEQ-- 655
Query: 653 ILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKC 712
L L +GFGD AT EN G K PE E+F KA CCS L + C
Sbjct: 656 -------SATSLNGLTLGFGDTAATRENLSAGFGDTKPPERFEMFDKAV-GCCSRLTDCC 707
Query: 713 CCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
CC+C I C+K+ND+C IV+ QLC AL+C CFECCS LCC G D
Sbjct: 708 CCMCFINTCSKMNDRCAIVMVQLCGALSCLACFECCS-LCCGGSD 751
>gi|414887572|tpg|DAA63586.1| TPA: hypothetical protein ZEAMMB73_270893 [Zea mays]
Length = 750
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/760 (48%), Positives = 503/760 (66%), Gaps = 17/760 (2%)
Query: 3 DDFVKAVEDGLKLSKRLYFGKDRPVTP-----PKVPSSMDKSGAVDMLLPKSPMVYAVIT 57
+DF +AVEDGLKLSKRL P + S+ SG LLP +PM YA++T
Sbjct: 2 EDFARAVEDGLKLSKRLVLPGGVPPPRPPAGMERTVSAASASGPDPRLLPTAPMAYALVT 61
Query: 58 DPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMG 117
DP VD PD+PSYQPYV+G DPP+LIPLQM V+L VDC LD A V + W +HC+
Sbjct: 62 DPAAVDTPDVPSYQPYVYGDLDPPALIPLQMKEVDLAVDCALDAAHVTLRARWWLHCITR 121
Query: 118 SKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIF 177
S+ D R+ VP+G+QGSILG E + +SY+TQ+I + E+ K A +E+G L+P++F
Sbjct: 122 SRDSDVRLVVPLGEQGSILGAEVTVGRRSYNTQVIEV-EDHTLEKHAKIESGGLLQPHLF 180
Query: 178 TLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVN 237
LT+PQ++GG+ + +WSQKL Y G FSV++PF+FP +V P K K+EKI L V+
Sbjct: 181 FLTIPQVEGGADICATFQWSQKLLYDSGRFSVDIPFRFPYFVNPLPKVFMKKEKIQLTVH 240
Query: 238 AGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQ 297
+G E+L +SH LK+ R KL + +E+ V WS DF+FSY V + GGVL+Q
Sbjct: 241 SGFSKEILLQGTSHPLKEKSRQGDKLFFHHEAIVENWSIKDFNFSYNVYSGDLSGGVLVQ 300
Query: 298 SPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
+L D D R+MFC++LLPG+ +RKVF K V+FIVD SGSMQGKPLE+ K+A++ ALS+
Sbjct: 301 PATLCDYDDRDMFCIFLLPGSG-NRKVFGKAVVFIVDTSGSMQGKPLENVKHAVSTALSE 359
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
L GD FNI+ FN E + FS+ +E +A+ A +W+ NF+A G T+I PL++A+ +
Sbjct: 360 LVEGDYFNIITFNDELHSFSSCLEQVNDKAIASATEWMNANFVAEGGTDIMHPLSEAMAL 419
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
L++ ++P I+L+TDG+V+DE IC K+ L+N GS PRI TFG+GSYCNHYFLRML
Sbjct: 420 LSSVHDTLPQIYLITDGSVDDEHNICRTTKTDLSNRGSRSPRISTFGLGSYCNHYFLRML 479
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A I +G+Y AA SIE Q+ K F R S+++ANI+ID L LDEFE+ IPDLS+
Sbjct: 480 ASIGKGHYDAALGTASIENQILKWFRRASSTIVANISIDALTHLDEFEVDSEYIPDLSAN 539
Query: 538 SPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAW 597
SPL VSG+Y+GKFPDT+KAKG+L D+ VELK+Q K+IPLD++ QQID +T++AW
Sbjct: 540 SPLCVSGKYKGKFPDTIKAKGYLADMKEISVELKVQQLKEIPLDKVLVIQQIDFMTSKAW 599
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSE 657
S DK++E KV K+S Q GV EYT MI ++T N + GT+K +K K D +
Sbjct: 600 MSADKQVERKVIKLSTQNGVPSEYTDMIFLQT----NLDKVEGTQKVKQKLKGHKGRDEQ 655
Query: 658 VPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCC 717
+ L +GFG+ AT EN G P + ++ S CCS + + CC+CC
Sbjct: 656 -----QIYGLKLGFGNKDATRENLITGFGDMN-PAEKSVMLQKGSGCCSRVADCLCCMCC 709
Query: 718 IECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
I+ CN++NDQC +++ Q+C ALAC GC+ECC+ +CC G +
Sbjct: 710 IKACNRMNDQCAVLMAQICAALACLGCYECCAEVCCGGSE 749
>gi|242035517|ref|XP_002465153.1| hypothetical protein SORBIDRAFT_01g032840 [Sorghum bicolor]
gi|241919007|gb|EER92151.1| hypothetical protein SORBIDRAFT_01g032840 [Sorghum bicolor]
Length = 756
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/764 (48%), Positives = 505/764 (66%), Gaps = 15/764 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDK----SGAVDMLLPKSPMVYAVI 56
M + F +AVEDGLKL+KRL + PP+ +MD+ + A+ L+P SPM YAV+
Sbjct: 1 MDEAFARAVEDGLKLTKRLVLPNGGGLPPPRPHLAMDRDNDPAAALLQLMPASPMAYAVV 60
Query: 57 TDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLD---TAFVRVSGTWRVH 113
DP VD+PD+PSYQP+V+GR DPP+LIPLQM V+L VDC TA V + W VH
Sbjct: 61 VDPGAVDSPDVPSYQPHVYGRLDPPALIPLQMREVDLRVDCAAAGCATAEVALRARWWVH 120
Query: 114 CVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLK 173
C+ S++C CRI VPMG QGSILG E + +SY+T +I + E++ A K E+G LK
Sbjct: 121 CITRSRACHCRIVVPMGHQGSILGAEVTVGKRSYNTHVIDI-EDNCAVKIEIPESGGLLK 179
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIH 233
+F+LT+PQ+ GG + +RWSQKL Y +G+FSV VPF+FP+YV P K K+EKI
Sbjct: 180 QELFSLTIPQVGGGEDIFATIRWSQKLLYHNGQFSVEVPFRFPQYVNPLPKVFMKKEKIQ 239
Query: 234 LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGG 293
L VN+G EV+ +SH LK+ R KL + +E+ V WS DF F+YTV + GG
Sbjct: 240 LTVNSGASKEVILQGTSHPLKEKNRQGEKLSFLHEAVVENWSTKDFTFAYTVYSGDLSGG 299
Query: 294 VLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
VL+Q +L D D R+MFC++LLPG +RKVF+K V++I+D SGSMQGKPLE KNA++
Sbjct: 300 VLVQPSTLRDYDDRDMFCLFLLPGNNANRKVFRKAVVYIIDTSGSMQGKPLESVKNAMST 359
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
LS L GD FNI+ FN E + FS+ +E + + A +W+ +NF+A G T++ PL++
Sbjct: 360 TLSDLMQGDYFNIITFNDELHSFSSRLEQVNERTIGNAIEWMNLNFVAQGGTDMMHPLSE 419
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
A+ +L++S +P I+ VTDG+V+DER IC +K++L GS PRI TFG+GSYCNHYF
Sbjct: 420 AMALLSSSHDVLPQIYFVTDGSVDDERNICHTLKTQLIKSGSKSPRISTFGLGSYCNHYF 479
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
LRMLA I +G+Y AA D SIE +M + F + S++++N++ID +K + FE+ IPD
Sbjct: 480 LRMLASIGKGHYAAALDTGSIEGRMLQWFQKASSTIVSNVSIDGIKHIQGFEVDSEYIPD 539
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLT 593
+S++ PL VSGRY GK P+TL AKG L D S +ELK+Q KDIPLD++ AKQQ+DLLT
Sbjct: 540 ISAKYPLCVSGRYNGKLPETLIAKGHLADTSEISIELKVQHIKDIPLDKVLAKQQMDLLT 599
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKI 653
A++W E+K LE KV K+S+Q + EYTRM++++T K +K+S
Sbjct: 600 AKSWLMENKELERKVVKLSIQNSLPSEYTRMVLLQTSLDKIDPAQQAKNKPTKQSS---- 655
Query: 654 LDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCC 713
E+P + L L +GFGD+ AT EN G K PE IF KA CCS + + CC
Sbjct: 656 -PDELP-AMPLGGLALGFGDVAATRENLTTGFGDMKAPEKFVIFEKAV-GCCSRVADCCC 712
Query: 714 CLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
C+C I+ C+K+NDQC IV+ Q C ALAC GCFECCS LCC G +
Sbjct: 713 CMCFIKACSKMNDQCAIVMAQACAALACLGCFECCSELCCGGSN 756
>gi|334183862|ref|NP_177394.2| uncharacterized protein [Arabidopsis thaliana]
gi|332197210|gb|AEE35331.1| uncharacterized protein [Arabidopsis thaliana]
Length = 756
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/763 (46%), Positives = 501/763 (65%), Gaps = 18/763 (2%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
M+++F VE GL+L++R+Y+GK D + + LP + YA ITDP
Sbjct: 1 MSEEFALRVEQGLQLARRIYYGKGIAPP-----VVPDPPSSPENFLPTAITAYASITDPV 55
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
VDNPD+PSYQPYVH RCDP +L+PLQM +E+++DC+LDTAFV V+G WRVHCV SK
Sbjct: 56 AVDNPDVPSYQPYVHARCDPSALVPLQMLGIEMNIDCWLDTAFVTVTGRWRVHCVRPSKR 115
Query: 121 CDCRIAVPMGDQGSILGVEAEI--SGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFT 178
DC + VPMG++GS LG E ++ + KSY T+L+ E + F K +I+T
Sbjct: 116 FDCCVGVPMGEKGSFLGAEIDVLNNEKSYQTKLVTEDETSDFDNVHKDKDSRFFKSHIYT 175
Query: 179 LTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNA 238
+P + GGS S+ + WSQKL Y+DG+F +NVPF+FP YV P K+I KREKI LN+N+
Sbjct: 176 FKIPHVAGGSIFSVNVTWSQKLIYKDGKFHLNVPFRFPSYVNPIGKEIIKREKIVLNMNS 235
Query: 239 G-TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQ 297
+G E+ + +SH LK + R G+L YE+EV WS +DF S+TVS + G VL++
Sbjct: 236 CVSGGEIASSFTSHPLKIIHRVAGELSCEYEAEVPSWSRVDFGVSFTVSSGDLCGNVLVK 295
Query: 298 SPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
SPS D D R +FC+YL PGT K K+FK+ V+F++DIS SM+ KPLED K AL L+K
Sbjct: 296 SPSPWDSDDRGIFCLYLFPGTTKHTKLFKRRVVFVIDISASMKWKPLEDVKKALLECLAK 355
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
L D FNI+AFN E FSTSME AT E + +W+ N IA G TN+ PL +A+++
Sbjct: 356 LQAEDVFNIIAFNDEILEFSTSMEFATDETISAVTEWLDSNLIANGGTNMLLPLKQAMKL 415
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLT-NGGSICPRIYTFGIGSYCNHYFLRM 476
L S +P+++LVTDG+VE+ER+IC AMK + NG SI PRI TFGIGS+CNHYFL+M
Sbjct: 416 LEGSNIGVPLVYLVTDGSVENEREICHAMKESCSRNGKSISPRISTFGIGSFCNHYFLQM 475
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
LA I GYY + DS E QM +LF S+++AN D LK L E++P ++PD++
Sbjct: 476 LARIGNGYYDGTNNTDSFEHQMSRLFEIASSTIVANTTFDALKLLRSVELFPCQVPDITL 535
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQA 596
PLI+SGRY+G+FPD ++ +G L D+S F +EL +Q AKDIPLD++ A++QI+ LTA+A
Sbjct: 536 GDPLILSGRYKGEFPDEVELRGTLADMSCFTIELTVQKAKDIPLDKVLARRQINELTARA 595
Query: 597 WFSEDKRLEEKVSKMSVQTGVLCEYTRMII-VETDERNNASESPGTKKGSKKSD----HQ 651
WF + K L+EKV ++S+QTG EYT+M++ V+ DE P + K ++ H+
Sbjct: 596 WFEDKKELQEKVMRLSIQTGFPSEYTQMVLSVKHDEEEKTMARPVSIKEILRNPPYQIHK 655
Query: 652 KILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNK 711
++ S + +L G GFG++ AT +N PP E PK E E+ ++AAS + ++
Sbjct: 656 QMQRSNSMRRSLLGKQGYGFGNVAATLKNVPPWMEEPKEVEGTELLIRAASG----VVDR 711
Query: 712 CCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCC 754
CC+CC++C ++++DQC IV +QLC ALACF C CC +C C
Sbjct: 712 VCCMCCLQCMSRVSDQCTIVFSQLCAALACFQCIGCCFEVCGC 754
>gi|12325279|gb|AAG52586.1|AC016529_17 hypothetical protein; 14673-17893 [Arabidopsis thaliana]
Length = 758
Score = 719 bits (1856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/765 (46%), Positives = 501/765 (65%), Gaps = 20/765 (2%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPK 60
M+++F VE GL+L++R+Y+GK D + + LP + YA ITDP
Sbjct: 1 MSEEFALRVEQGLQLARRIYYGKGIAPP-----VVPDPPSSPENFLPTAITAYASITDPV 55
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
VDNPD+PSYQPYVH RCDP +L+PLQM +E+++DC+LDTAFV V+G WRVHCV SK
Sbjct: 56 AVDNPDVPSYQPYVHARCDPSALVPLQMLGIEMNIDCWLDTAFVTVTGRWRVHCVRPSKR 115
Query: 121 CDCRIAVPMGDQGSILGVEAEI--SGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFT 178
DC + VPMG++GS LG E ++ + KSY T+L+ E + F K +I+T
Sbjct: 116 FDCCVGVPMGEKGSFLGAEIDVLNNEKSYQTKLVTEDETSDFDNVHKDKDSRFFKSHIYT 175
Query: 179 LTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNA 238
+P + GGS S+ + WSQKL Y+DG+F +NVPF+FP YV P K+I KREKI LN+N+
Sbjct: 176 FKIPHVAGGSIFSVNVTWSQKLIYKDGKFHLNVPFRFPSYVNPIGKEIIKREKIVLNMNS 235
Query: 239 G-TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTV--SPSHIFGGVL 295
+G E+ + +SH LK + R G+L YE+EV WS +DF S+TV S + G VL
Sbjct: 236 CVSGGEIASSFTSHPLKIIHRVAGELSCEYEAEVPSWSRVDFGVSFTVRVSSGDLCGNVL 295
Query: 296 LQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
++SPS D D R +FC+YL PGT K K+FK+ V+F++DIS SM+ KPLED K AL L
Sbjct: 296 VKSPSPWDSDDRGIFCLYLFPGTTKHTKLFKRRVVFVIDISASMKWKPLEDVKKALLECL 355
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
+KL D FNI+AFN E FSTSME AT E + +W+ N IA G TN+ PL +A+
Sbjct: 356 AKLQAEDVFNIIAFNDEILEFSTSMEFATDETISAVTEWLDSNLIANGGTNMLLPLKQAM 415
Query: 416 EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLT-NGGSICPRIYTFGIGSYCNHYFL 474
++L S +P+++LVTDG+VE+ER+IC AMK + NG SI PRI TFGIGS+CNHYFL
Sbjct: 416 KLLEGSNIGVPLVYLVTDGSVENEREICHAMKESCSRNGKSISPRISTFGIGSFCNHYFL 475
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+MLA I GYY + DS E QM +LF S+++AN D LK L E++P ++PD+
Sbjct: 476 QMLARIGNGYYDGTNNTDSFEHQMSRLFEIASSTIVANTTFDALKLLRSVELFPCQVPDI 535
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTA 594
+ PLI+SGRY+G+FPD ++ +G L D+S F +EL +Q AKDIPLD++ A++QI+ LTA
Sbjct: 536 TLGDPLILSGRYKGEFPDEVELRGTLADMSCFTIELTVQKAKDIPLDKVLARRQINELTA 595
Query: 595 QAWFSEDKRLEEKVSKMSVQTGVLCEYTRMII-VETDERNNASESPGTKKGSKKSD---- 649
+AWF + K L+EKV ++S+QTG EYT+M++ V+ DE P + K ++
Sbjct: 596 RAWFEDKKELQEKVMRLSIQTGFPSEYTQMVLSVKHDEEEKTMARPVSIKEILRNPPYQI 655
Query: 650 HQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLG 709
H+++ S + +L G GFG++ AT +N PP E PK E E+ ++AAS +
Sbjct: 656 HKQMQRSNSMRRSLLGKQGYGFGNVAATLKNVPPWMEEPKEVEGTELLIRAASG----VV 711
Query: 710 NKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCC 754
++ CC+CC++C ++++DQC IV +QLC ALACF C CC +C C
Sbjct: 712 DRVCCMCCLQCMSRVSDQCTIVFSQLCAALACFQCIGCCFEVCGC 756
>gi|53370749|gb|AAU89244.1| von Willebrand factor type A domain containing protein [Oryza
sativa Japonica Group]
Length = 801
Score = 714 bits (1844), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/806 (46%), Positives = 501/806 (62%), Gaps = 54/806 (6%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDK-----SGAVDMLLPKSPMVYAV 55
M +DF +AVEDGLKLSKRL P +P VP MD+ + A LLP +PM YAV
Sbjct: 1 MEEDFARAVEDGLKLSKRLVLPGGAP-SPRPVPG-MDRGLLPDAAAASSLLPSAPMAYAV 58
Query: 56 ITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAF---VRVSGTWRV 112
+ DP VD+PD+PSYQP+V+GR DPP+LIPL M V L VDC V + W V
Sbjct: 59 VVDPAAVDSPDVPSYQPHVYGRLDPPALIPLHMREVALAVDCAAAGCAAAEVTLRARWWV 118
Query: 113 HCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFL 172
HC+ S C CRI VPMG+QG+ILG E + +SY T +I + E+ GA K A E+G L
Sbjct: 119 HCLTRSCDCHCRIVVPMGEQGTILGAEVTVGKRSYKTHVIDV-EDQGAVKIAKTESGGLL 177
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
K F+LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP+YV P K K+EKI
Sbjct: 178 KRQFFSLTIPQVGGGEDIFATIRWSQKLLYDNGQFSVDIPFQFPQYVNPLPKVFMKKEKI 237
Query: 233 HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFG 292
L +N+G E++ SSH LK+ R KL + +E+ V WSN DF F+Y+V + G
Sbjct: 238 QLTLNSGVSNEIVLKGSSHPLKERSRQGEKLSFFHEAVVENWSNKDFTFAYSVYSGDLSG 297
Query: 293 GVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALA 352
G+L+Q + D D R+MFC++LLPG + RKVF+ +FI+D SGSMQGKPLE KNA+
Sbjct: 298 GMLVQPSTSDDYDDRDMFCIFLLPGNNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMY 357
Query: 353 VALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT 412
LS+L GD FNI+ FN E + FS+ +E ++ +E A +W+ NFIA G T+I PL+
Sbjct: 358 TTLSELVQGDYFNIITFNDELHSFSSCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLS 417
Query: 413 KAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHY 472
+A+ +L+NS ++P IFLVTDG+VEDER IC +K +L GS PRI TFG+GSYCNHY
Sbjct: 418 EAIALLSNSHNALPQIFLVTDGSVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHY 477
Query: 473 FLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIP 532
FLRMLA I +G+Y AA+D SIE +M + F + S+++ NI+ID K + EFE+ I
Sbjct: 478 FLRMLASIGKGHYDAAFDTGSIEGRMVQWFQKASSTIVINISIDATKYIREFEVDSEYIQ 537
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLL 592
D+S++ PL VSGRYQGK P+TL AKG+L D+S +EL++Q KDI LD++ AKQQ+DLL
Sbjct: 538 DISAKCPLCVSGRYQGKLPETLTAKGYLADMSEISIELRVQHVKDISLDKVLAKQQMDLL 597
Query: 593 TAQAWFSEDKRLEEK-----------------------VSKMSVQTGVLCEYTRMII--- 626
TA+AWF E+ +LE K V K+S+Q + EYTR I+
Sbjct: 598 TAKAWFYENNQLEMKSGMRFTTVHCIMSYKVMLNLHYEVVKLSIQNSIPSEYTRTILLQN 657
Query: 627 -VETDERNNASE--------SPGTKKGSKKSDHQKILDSEVPKT------IMLQNLGIGF 671
VE E+ +E S G + + + + T L L +GF
Sbjct: 658 FVEKIEQVFVTEGGVLYMFFSYGNRTLDETLKLKGKQKPKKNSTQNEQSATSLNGLTLGF 717
Query: 672 GDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIV 731
GD AT EN G K PE E+F KA CCS L + CCC+C I C+K+ND+C IV
Sbjct: 718 GDTAATRENLSAGFGDTKPPERFEMFDKAV-GCCSRLTDCCCCMCFINTCSKMNDRCAIV 776
Query: 732 LTQLCTALACFGCFECCSNLCCCGQD 757
+ QLC AL+C CFECCS LCC G D
Sbjct: 777 MVQLCGALSCLACFECCS-LCCGGSD 801
>gi|357116396|ref|XP_003559967.1| PREDICTED: uncharacterized protein LOC100843408 [Brachypodium
distachyon]
Length = 751
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/756 (47%), Positives = 501/756 (66%), Gaps = 8/756 (1%)
Query: 3 DDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAV-DMLLPKSPMVYAVITDPKI 61
+DF +AVEDGLKLSKRL P P AV +LLP +PM YAV++DP
Sbjct: 2 EDFARAVEDGLKLSKRLVLPGGLPPPRPPTGMERGPDAAVSQLLLPTAPMAYAVVSDPGA 61
Query: 62 VDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSC 121
VD PD+PSYQPYV+GR DPP+LIPLQM ++L DC LD A V + W +HC+ S+ C
Sbjct: 62 VDTPDVPSYQPYVYGRLDPPALIPLQMKEIDLAADCTLDAAAVTLRARWWLHCITRSREC 121
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTL 181
D R+ VPMG+QGSILG E + +SY+TQ+I + E+ +TG FLKP++F LT+
Sbjct: 122 DVRLVVPMGEQGSILGAEVTVGRRSYNTQVIEV-EDHTMENIGKADTGGFLKPHMFCLTI 180
Query: 182 PQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTG 241
PQ++GG+ + +RWSQKL Y +G+ SV++PF+FP YV P K KREKI L VN+G
Sbjct: 181 PQVEGGADIVATIRWSQKLHYDNGQISVDIPFRFPYYVNPLPKVFMKREKIQLTVNSGFS 240
Query: 242 TEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSL 301
EVL +SH LK+ R KL + +E+ V WS+ DF FSY+V G +L+Q +
Sbjct: 241 KEVLLQGTSHPLKEKARQGEKLSFLHEAIVEFWSSKDFTFSYSVYSGDFSGSILVQPSTS 300
Query: 302 HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
D D R+ F ++LLPG+ +RKVFKK V+F+VD SGSM GKPLE+ KNA++ ALS+L G
Sbjct: 301 QDYDDRDRFSIFLLPGSG-NRKVFKKAVVFVVDTSGSMHGKPLENVKNAVSTALSELAQG 359
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS 421
D FNIV FN E + FS+ +E ++ A W+ NF+ G T+I PL++A+ +L++
Sbjct: 360 DYFNIVTFNDELHSFSSCLEKVNEKTTASAIDWMNGNFVGEGGTDIMHPLSEAMALLSSV 419
Query: 422 RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+P IFL+TDG+V+DE IC MK+ L + G PRI TFG+G +CNHYFLRMLA +
Sbjct: 420 HDVLPQIFLMTDGSVDDEHNICQTMKTELISRGPKSPRISTFGLGLHCNHYFLRMLASVG 479
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+G+Y +A + +S+E ++ K F R +++++NI+ID K LD+FE+ IPD+S++ PL
Sbjct: 480 KGHYDSALETESVESRILKWFRRASNTIVSNISIDATKHLDDFEVDSEYIPDISAQCPLC 539
Query: 542 VSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSED 601
VSG+Y+GKFP+T+ AKG+L D++ +ELK+Q KD PLD + A QQI LLTA+AWFS D
Sbjct: 540 VSGQYRGKFPETVVAKGYLADMTEISIELKVQHIKDTPLDEVLAAQQIALLTAKAWFSAD 599
Query: 602 KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKT 661
K+LE KV +S+Q VL EYTRM++++T+ +E ++G +K+ K S P
Sbjct: 600 KQLERKVMDLSIQNSVLSEYTRMVVLQTNLE--LTELRVLEQGKQKTKGHK--GSNEPLR 655
Query: 662 IMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECC 721
I L L +GFGD AT +N G K PE +IF K A CC L + CC+CCI+ C
Sbjct: 656 IPLHGLKLGFGDKAATRDNLLTGFGEEKPPETLKIF-KKAGGCCGRLADCLCCMCCIKAC 714
Query: 722 NKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
N++NDQC I++ Q+C AL+C GC+ECCS +CC G +
Sbjct: 715 NRMNDQCAIMMAQICAALSCLGCYECCSEVCCGGSE 750
>gi|222625100|gb|EEE59232.1| hypothetical protein OsJ_11214 [Oryza sativa Japonica Group]
Length = 718
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/762 (46%), Positives = 473/762 (62%), Gaps = 49/762 (6%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDK-----SGAVDMLLPKSPMVYAV 55
M +DF +AVEDGLKLSKRL P +P VP MD+ + A LLP +PM YAV
Sbjct: 1 MEEDFARAVEDGLKLSKRLVLPGGAP-SPRPVPG-MDRGLLPDAAAASSLLPSAPMAYAV 58
Query: 56 ITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCV 115
+ DP VD+PD+PSYQP+V+GR DPP+LIPL M V L VDC
Sbjct: 59 VVDPAAVDSPDVPSYQPHVYGRLDPPALIPLHMREVALAVDCAAAGCAAARG-------- 110
Query: 116 MGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPN 175
E + +SY T +I + E+ GA K A E+G LK
Sbjct: 111 ----------------------AEVTVGKRSYKTHVIDV-EDQGAVKIAKTESGGLLKRQ 147
Query: 176 IFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLN 235
F+LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP+YV P K K+EKI L
Sbjct: 148 FFSLTIPQVGGGEDIFATIRWSQKLLYDNGQFSVDIPFQFPQYVNPLPKVFMKKEKIQLT 207
Query: 236 VNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVL 295
+N+G E++ SSH LK+ R KL + +E+ V WSN DF F+Y+V + GG+L
Sbjct: 208 LNSGVSNEIVLKGSSHPLKERSRQGEKLSFFHEAVVENWSNKDFTFAYSVYSGDLSGGML 267
Query: 296 LQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
+Q + D D R+MFC++LLPG + RKVF+ +FI+D SGSMQGKPLE KNA+ L
Sbjct: 268 VQPSTSDDYDDRDMFCIFLLPGNNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMYTTL 327
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
S+L GD FNI+ FN E + FS+ +E ++ +E A +W+ NFIA G T+I PL++A+
Sbjct: 328 SELVQGDYFNIITFNDELHSFSSCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLSEAI 387
Query: 416 EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
+L+NS ++P IFLVTDG+VEDER IC +K +L GS PRI TFG+GSYCNHYFLR
Sbjct: 388 ALLSNSHNALPQIFLVTDGSVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHYFLR 447
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
MLA I +G+Y AA+D SIE +M + F + S+++ NI+ID K + EFE+ I D+S
Sbjct: 448 MLASIGKGHYDAAFDTGSIEGRMVQWFQKASSTIVINISIDATKYIREFEVDSEYIQDIS 507
Query: 536 SESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
++ PL VSGRYQGK P+TL AKG+L D+S +EL++Q KDI LD++ AKQQ+DLLTA+
Sbjct: 508 AKCPLCVSGRYQGKLPETLTAKGYLADMSEISIELRVQHVKDISLDKVLAKQQMDLLTAK 567
Query: 596 AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILD 655
AWF E+ +LE KV K+S+Q + EYTR I+++ KK S +++
Sbjct: 568 AWFYENNQLEMKVVKLSIQNSIPSEYTRTILLQNFVEKIEQGKQKPKKNSTQNEQ----- 622
Query: 656 SEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCL 715
L L +GFGD AT EN G K PE E+F KA CCS L + CCC+
Sbjct: 623 ----SATSLNGLTLGFGDTAATRENLSAGFGDTKPPERFEMFDKAV-GCCSRLTDCCCCM 677
Query: 716 CCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQD 757
C I C+K+ND+C IV+ QLC AL+C CFECCS LCC G D
Sbjct: 678 CFINTCSKMNDRCAIVMVQLCGALSCLACFECCS-LCCGGSD 718
>gi|357475499|ref|XP_003608035.1| von Willebrand factor A domain-containing protein 5B1 [Medicago
truncatula]
gi|355509090|gb|AES90232.1| von Willebrand factor A domain-containing protein 5B1 [Medicago
truncatula]
Length = 639
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/629 (55%), Positives = 444/629 (70%), Gaps = 40/629 (6%)
Query: 167 ETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKI 226
+ G FLK NIFTLT+PQIDGGS LSI++RWSQK+ +GE+S+NVPF FPE++ PA K++
Sbjct: 14 QNGGFLKSNIFTLTIPQIDGGSNLSIKMRWSQKVVCSNGEYSLNVPFTFPEFINPAGKRM 73
Query: 227 PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVS 286
KREKI LNV+ G+E+L T+SH L+Q+RR G +G+ ++EVL WS DF FSY VS
Sbjct: 74 SKREKIRLNVDVVAGSELLFKTTSHPLQQVRRHAGSMGFVCDNEVLSWSKDDFSFSYAVS 133
Query: 287 PSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLED 346
SHI GGVLL+S S+HD DQREMF +YL PG + +KVFKKD+IFI+DISGSMQ K ++D
Sbjct: 134 SSHINGGVLLESASVHDFDQREMFYLYLSPGNIQKKKVFKKDIIFIIDISGSMQEKLMDD 193
Query: 347 TKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTN 406
TKNAL ALSKL+ D F I+AFNGE++L+S SMELA+ +AVERA +WI +NF+AGG TN
Sbjct: 194 TKNALLSALSKLNSNDMFTIIAFNGESHLYSKSMELASNDAVERAREWINLNFVAGGGTN 253
Query: 407 ICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
I PL A+E+L+ ++ S+PIIFLVTDG VEDERQIC +K+ + NG SICPRIYTFGIG
Sbjct: 254 ISHPLNTAIEILSGAQSSVPIIFLVTDGTVEDERQICTMVKNHMINGESICPRIYTFGIG 313
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFE- 525
+CNHYFLRMLA I RG Y AA D+D IE +M LF + S +LANI +D L ++D+ E
Sbjct: 314 PFCNHYFLRMLATIGRGQYDAALDVDMIESRMLTLFEKASSLILANIKMDMLDEIDQVEF 373
Query: 526 ------------------------------------MYPSRIPDLSSESPLIVSGRYQGK 549
+YPS IPDLSSE PLI+SGRY+G
Sbjct: 374 THAVGVSAATVLIIQFLSIPSLALFEKECQIFCHDGVYPSHIPDLSSEGPLILSGRYKGN 433
Query: 550 FPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS 609
FP+TLK KG L D SNF ++LK+Q K IP+ R+ A+ QI LTAQAW SE+K+LE+KV+
Sbjct: 434 FPETLKVKGILADFSNFEIDLKIQKDKGIPVQRVSARDQIQHLTAQAWLSENKQLEQKVA 493
Query: 610 KMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGI 669
K+S+QTG + EYT MII+E D ESPG K SKK + + + + I+L NLG+
Sbjct: 494 KLSLQTGFISEYTCMIILENDHLKKIKESPGEKTVSKKKHSRYEANVQGQRMILLPNLGV 553
Query: 670 GFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCV 729
GFG+L ATAEN PPG E E+AEIF KAASNC T+ CCC+CCI+C +++N+QC+
Sbjct: 554 GFGNLKATAENTPPGYEAK--IEMAEIF-KAASNCFDTVCGYCCCMCCIQCFSRMNNQCL 610
Query: 730 IVLTQLCTALACFGCFECCSNLCCCGQDG 758
L QLC L C C CCS++CC G DG
Sbjct: 611 TALNQLCIGLGCLACLNCCSHICCSGNDG 639
>gi|414590863|tpg|DAA41434.1| TPA: hypothetical protein ZEAMMB73_769800 [Zea mays]
Length = 758
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/743 (45%), Positives = 467/743 (62%), Gaps = 51/743 (6%)
Query: 3 DDFVKAVEDGLKLSKRLYFGKDRPVTP-----PKVPSSMDKSGAVDMLLPKSPMVYAVIT 57
+DF +AVEDGLKLSKRL P + SS SG LLP +PM YAV+T
Sbjct: 2 EDFARAVEDGLKLSKRLVLPGGVPPPRPPAGMERTMSSASASGPDPRLLPTAPMAYAVVT 61
Query: 58 DPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMG 117
DP +D PD+PSYQPYV+G DPP+LIPLQM V+L VDC LD A V + W +HC+
Sbjct: 62 DPAAIDTPDVPSYQPYVYGHLDPPALIPLQMKEVDLAVDCALDAAHVTLRARWWLHCITR 121
Query: 118 SKSCDCRIAVPMGDQ--GSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPN 175
S+ CD R+ VP+G+Q GSILG E + +SY+TQ+I + E A +E+G LKP+
Sbjct: 122 SRDCDVRLVVPLGEQEQGSILGAEVTVGRRSYNTQVIEVEET--LENHAKIESGGLLKPH 179
Query: 176 IFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLN 235
+F LT+ Q++GG+ + RWSQKL Y G FSV++PF FP +V P K KREK+ L
Sbjct: 180 LFFLTISQVEGGADICATFRWSQKLLYDSGCFSVDIPFHFPYFVNPLPKVFMKREKVQLT 239
Query: 236 VNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVL 295
N G EV +SH LK+ R KL + +E+ V WS+ F+FSY+V + GGV
Sbjct: 240 ANTGFSKEVFLQGTSHPLKEKSRQGDKLFFLHEAIVENWSSKVFNFSYSVYSGDLSGGVF 299
Query: 296 LQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
+Q L D D R+ FC++LLPG+ +RKVF+K V+F+VD SGSMQGKPLE+ K+A++ AL
Sbjct: 300 VQPSILCDYDDRDSFCIFLLPGSG-NRKVFRKAVVFVVDTSGSMQGKPLENVKHAVSTAL 358
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
S+L GD FNI+ FN E + FS+ +E +A+ A W+ N +A G T+I PL++A+
Sbjct: 359 SELVQGDYFNIITFNDELHSFSSCLEQVNDKAIASATDWMNANLVAEGGTDIMHPLSEAM 418
Query: 416 EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
+L++ ++P I+L+TDG+V+DE IC K+ +TN GS PRI TFG+GSYCNHYFLR
Sbjct: 419 ALLSSVHDTLPQIYLITDGSVDDEHNICQTTKTEVTNRGSKSPRISTFGLGSYCNHYFLR 478
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
MLA + +G+Y AA + SIE Q+ K F R S+++ NI+ID + LDEFE+ IPDLS
Sbjct: 479 MLASVGKGHYNAALETASIENQIVKWFRRASSTIVENISIDAMTHLDEFEVDSEYIPDLS 538
Query: 536 SESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
+ SPL VSG+Y+GKFPDT+KAKG+L D+ ++ELK+Q K++ LD++ Q IDL+TA+
Sbjct: 539 ANSPLCVSGKYRGKFPDTVKAKGYLADMKEVLIELKVQQLKEVTLDKVLVTQHIDLMTAK 598
Query: 596 AWFSEDKRLEEKVS----KMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQ 651
AW S K+LE KV K+S+Q GV EYT M++++T+ N ++ T+
Sbjct: 599 AWLSAAKQLERKVGYLVIKLSIQNGVPSEYTDMVLLQTNLDNTGTQKDATR--------- 649
Query: 652 KILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNK 711
+NL GF D+ +AE + ++ CCS + +
Sbjct: 650 -------------ENLVTGFWDVN-SAEKI--------------VMLQKGPGCCSRVADC 681
Query: 712 CCCLCCIECCNKLNDQCVIVLTQ 734
CC+CCI CN++NDQC I++ Q
Sbjct: 682 LCCMCCIRACNRMNDQCAILMAQ 704
>gi|414590862|tpg|DAA41433.1| TPA: hypothetical protein ZEAMMB73_769800 [Zea mays]
Length = 1295
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/743 (45%), Positives = 467/743 (62%), Gaps = 51/743 (6%)
Query: 3 DDFVKAVEDGLKLSKRLYFGKDRPVTP-----PKVPSSMDKSGAVDMLLPKSPMVYAVIT 57
+DF +AVEDGLKLSKRL P + SS SG LLP +PM YAV+T
Sbjct: 2 EDFARAVEDGLKLSKRLVLPGGVPPPRPPAGMERTMSSASASGPDPRLLPTAPMAYAVVT 61
Query: 58 DPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMG 117
DP +D PD+PSYQPYV+G DPP+LIPLQM V+L VDC LD A V + W +HC+
Sbjct: 62 DPAAIDTPDVPSYQPYVYGHLDPPALIPLQMKEVDLAVDCALDAAHVTLRARWWLHCITR 121
Query: 118 SKSCDCRIAVPMGDQ--GSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPN 175
S+ CD R+ VP+G+Q GSILG E + +SY+TQ+I + E A +E+G LKP+
Sbjct: 122 SRDCDVRLVVPLGEQEQGSILGAEVTVGRRSYNTQVIEVEET--LENHAKIESGGLLKPH 179
Query: 176 IFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLN 235
+F LT+ Q++GG+ + RWSQKL Y G FSV++PF FP +V P K KREK+ L
Sbjct: 180 LFFLTISQVEGGADICATFRWSQKLLYDSGCFSVDIPFHFPYFVNPLPKVFMKREKVQLT 239
Query: 236 VNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVL 295
N G EV +SH LK+ R KL + +E+ V WS+ F+FSY+V + GGV
Sbjct: 240 ANTGFSKEVFLQGTSHPLKEKSRQGDKLFFLHEAIVENWSSKVFNFSYSVYSGDLSGGVF 299
Query: 296 LQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
+Q L D D R+ FC++LLPG+ +RKVF+K V+F+VD SGSMQGKPLE+ K+A++ AL
Sbjct: 300 VQPSILCDYDDRDSFCIFLLPGSG-NRKVFRKAVVFVVDTSGSMQGKPLENVKHAVSTAL 358
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
S+L GD FNI+ FN E + FS+ +E +A+ A W+ N +A G T+I PL++A+
Sbjct: 359 SELVQGDYFNIITFNDELHSFSSCLEQVNDKAIASATDWMNANLVAEGGTDIMHPLSEAM 418
Query: 416 EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
+L++ ++P I+L+TDG+V+DE IC K+ +TN GS PRI TFG+GSYCNHYFLR
Sbjct: 419 ALLSSVHDTLPQIYLITDGSVDDEHNICQTTKTEVTNRGSKSPRISTFGLGSYCNHYFLR 478
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
MLA + +G+Y AA + SIE Q+ K F R S+++ NI+ID + LDEFE+ IPDLS
Sbjct: 479 MLASVGKGHYNAALETASIENQIVKWFRRASSTIVENISIDAMTHLDEFEVDSEYIPDLS 538
Query: 536 SESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
+ SPL VSG+Y+GKFPDT+KAKG+L D+ ++ELK+Q K++ LD++ Q IDL+TA+
Sbjct: 539 ANSPLCVSGKYRGKFPDTVKAKGYLADMKEVLIELKVQQLKEVTLDKVLVTQHIDLMTAK 598
Query: 596 AWFSEDKRLEEKVS----KMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQ 651
AW S K+LE KV K+S+Q GV EYT M++++T+ N ++ T+
Sbjct: 599 AWLSAAKQLERKVGYLVIKLSIQNGVPSEYTDMVLLQTNLDNTGTQKDATR--------- 649
Query: 652 KILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNK 711
+NL GF D+ +AE + ++ CCS + +
Sbjct: 650 -------------ENLVTGFWDVN-SAEKI--------------VMLQKGPGCCSRVADC 681
Query: 712 CCCLCCIECCNKLNDQCVIVLTQ 734
CC+CCI CN++NDQC I++ Q
Sbjct: 682 LCCMCCIRACNRMNDQCAILMAQ 704
>gi|108708734|gb|ABF96529.1| von Willebrand factor type A domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 680
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/637 (50%), Positives = 430/637 (67%), Gaps = 11/637 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDK-----SGAVDMLLPKSPMVYAV 55
M +DF +AVEDGLKLSKRL P +P VP MD+ + A LLP +PM YAV
Sbjct: 1 MEEDFARAVEDGLKLSKRLVLPGGAP-SPRPVPG-MDRGLLPDAAAASSLLPSAPMAYAV 58
Query: 56 ITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAF---VRVSGTWRV 112
+ DP VD+PD+PSYQP+V+GR DPP+LIPL M V L VDC V + W V
Sbjct: 59 VVDPAAVDSPDVPSYQPHVYGRLDPPALIPLHMREVALAVDCAAAGCAAAEVTLRARWWV 118
Query: 113 HCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFL 172
HC+ S C CRI VPMG+QG+ILG E + +SY T +I + E+ GA K A E+G L
Sbjct: 119 HCLTRSCDCHCRIVVPMGEQGTILGAEVTVGKRSYKTHVIDV-EDQGAVKIAKTESGGLL 177
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
K F+LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP+YV P K K+EKI
Sbjct: 178 KRQFFSLTIPQVGGGEDIFATIRWSQKLLYDNGQFSVDIPFQFPQYVNPLPKVFMKKEKI 237
Query: 233 HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFG 292
L +N+G E++ SSH LK+ R KL + +E+ V WSN DF F+Y+V + G
Sbjct: 238 QLTLNSGVSNEIVLKGSSHPLKERSRQGEKLSFFHEAVVENWSNKDFTFAYSVYSGDLSG 297
Query: 293 GVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALA 352
G+L+Q + D D R+MFC++LLPG + RKVF+ +FI+D SGSMQGKPLE KNA+
Sbjct: 298 GMLVQPSTSDDYDDRDMFCIFLLPGNNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMY 357
Query: 353 VALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT 412
LS+L GD FNI+ FN E + FS+ +E ++ +E A +W+ NFIA G T+I PL+
Sbjct: 358 TTLSELVQGDYFNIITFNDELHSFSSCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLS 417
Query: 413 KAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHY 472
+A+ +L+NS ++P IFLVTDG+VEDER IC +K +L GS PRI TFG+GSYCNHY
Sbjct: 418 EAIALLSNSHNALPQIFLVTDGSVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHY 477
Query: 473 FLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIP 532
FLRMLA I +G+Y AA+D SIE +M + F + S+++ NI+ID K + EFE+ I
Sbjct: 478 FLRMLASIGKGHYDAAFDTGSIEGRMVQWFQKASSTIVINISIDATKYIREFEVDSEYIQ 537
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLL 592
D+S++ PL VSGRYQGK P+TL AKG+L D+S +EL++Q KDI LD++ AKQQ+DLL
Sbjct: 538 DISAKCPLCVSGRYQGKLPETLTAKGYLADMSEISIELRVQHVKDISLDKVLAKQQMDLL 597
Query: 593 TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
TA+AWF E+ +LE KV K+S+Q + EYTR I+++
Sbjct: 598 TAKAWFYENNQLEMKVVKLSIQNSIPSEYTRTILLQN 634
>gi|108708735|gb|ABF96530.1| von Willebrand factor type A domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 614
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 311/616 (50%), Positives = 416/616 (67%), Gaps = 11/616 (1%)
Query: 1 MADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDK-----SGAVDMLLPKSPMVYAV 55
M +DF +AVEDGLKLSKRL P +P VP MD+ + A LLP +PM YAV
Sbjct: 1 MEEDFARAVEDGLKLSKRLVLPGGAP-SPRPVPG-MDRGLLPDAAAASSLLPSAPMAYAV 58
Query: 56 ITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAF---VRVSGTWRV 112
+ DP VD+PD+PSYQP+V+GR DPP+LIPL M V L VDC V + W V
Sbjct: 59 VVDPAAVDSPDVPSYQPHVYGRLDPPALIPLHMREVALAVDCAAAGCAAAEVTLRARWWV 118
Query: 113 HCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFL 172
HC+ S C CRI VPMG+QG+ILG E + +SY T +I + E+ GA K A E+G L
Sbjct: 119 HCLTRSCDCHCRIVVPMGEQGTILGAEVTVGKRSYKTHVIDV-EDQGAVKIAKTESGGLL 177
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
K F+LT+PQ+ GG + +RWSQKL Y +G+FSV++PF+FP+YV P K K+EKI
Sbjct: 178 KRQFFSLTIPQVGGGEDIFATIRWSQKLLYDNGQFSVDIPFQFPQYVNPLPKVFMKKEKI 237
Query: 233 HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFG 292
L +N+G E++ SSH LK+ R KL + +E+ V WSN DF F+Y+V + G
Sbjct: 238 QLTLNSGVSNEIVLKGSSHPLKERSRQGEKLSFFHEAVVENWSNKDFTFAYSVYSGDLSG 297
Query: 293 GVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALA 352
G+L+Q + D D R+MFC++LLPG + RKVF+ +FI+D SGSMQGKPLE KNA+
Sbjct: 298 GMLVQPSTSDDYDDRDMFCIFLLPGNNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMY 357
Query: 353 VALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT 412
LS+L GD FNI+ FN E + FS+ +E ++ +E A +W+ NFIA G T+I PL+
Sbjct: 358 TTLSELVQGDYFNIITFNDELHSFSSCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLS 417
Query: 413 KAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHY 472
+A+ +L+NS ++P IFLVTDG+VEDER IC +K +L GS PRI TFG+GSYCNHY
Sbjct: 418 EAIALLSNSHNALPQIFLVTDGSVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHY 477
Query: 473 FLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIP 532
FLRMLA I +G+Y AA+D SIE +M + F + S+++ NI+ID K + EFE+ I
Sbjct: 478 FLRMLASIGKGHYDAAFDTGSIEGRMVQWFQKASSTIVINISIDATKYIREFEVDSEYIQ 537
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLL 592
D+S++ PL VSGRYQGK P+TL AKG+L D+S +EL++Q KDI LD++ AKQQ+DLL
Sbjct: 538 DISAKCPLCVSGRYQGKLPETLTAKGYLADMSEISIELRVQHVKDISLDKVLAKQQMDLL 597
Query: 593 TAQAWFSEDKRLEEKV 608
TA+AWF E+ +LE KV
Sbjct: 598 TAKAWFYENNQLEMKV 613
>gi|168032073|ref|XP_001768544.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680257|gb|EDQ66695.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 778
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 252/687 (36%), Positives = 376/687 (54%), Gaps = 24/687 (3%)
Query: 5 FVKAVEDGLKLSKRLYFG---KDRPVTPPK---VPSSMDKSGAVDMLLPKSPMVYAVITD 58
FV ++ G LS++L G + PPK +P+ +K LP PM+YA+ITD
Sbjct: 7 FVTSIAAGQILSQQLRHGGGVSAPSLGPPKHSTIPTMPEKKKMKG--LPTCPMIYAIITD 64
Query: 59 PKIVDNPDIPSYQPYVHG---RCDPP---SLIPLQMNAVELDVDCYLDTAFVRVSGTWRV 112
NP + + +H R D P +L+PL + +ELDV C++ T FV W +
Sbjct: 65 LNTFGNPKVDNAHGSLHPPTFRADAPPLPALVPLSLEMIELDVACHISTGFVTAKSRWNL 124
Query: 113 HCVMGSKSCDCRIAVPMGDQGSILGVEAEIS-GKSYHTQLIALGENDGAGK----SASVE 167
+CV SCDC +A+PM QG++ VE ++ G+ Y T ++ E G +A +
Sbjct: 125 NCVRSQASCDCLLALPMDHQGTVSSVEIDMGHGRFYTTVVVPSDEAASYGARGSPNAQIN 184
Query: 168 TGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP 227
P +F L LPQ++GG+ L +++ W Q +++ G +S+ V F FPE + P +
Sbjct: 185 DPGTYNPELFRLNLPQVEGGTKLELKITWFQAMTFESGLYSLRVSFVFPEDIVPLGTNLA 244
Query: 228 KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYES--EVLKWSNIDFDFSYTV 285
K+ +N GT V + + LK+ R+ G + + ES + N DF SY V
Sbjct: 245 SITKVKCMINTGTNDTVEVGSFGNQLKETFREPGIVAFENESNGDATDGKNQDFVASYQV 304
Query: 286 SPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLE 345
IF ++ Q P + D R FC+ + P VF++ V+F++D SGSM G PL
Sbjct: 305 WSDGIFPNLIFQDPEPGESDNRGSFCLSISPPDPSKITVFQRAVVFLLDRSGSMYGDPLN 364
Query: 346 DTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGST 405
D AL L L P DSFNI+AF+ ET LFS+ ME A ++ RA +W A G T
Sbjct: 365 DALQALYSGLESLKPEDSFNIIAFDHETALFSSQMERANSASILRAREWATEKCKARGGT 424
Query: 406 NICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGI 465
+I +PL +A +++ N G++P +FL+TDGAV++E+ IC M+SR+ G+ PRI TFGI
Sbjct: 425 DILSPLQQAFKLVENFPGAVPYVFLITDGAVDNEKNICLTMQSRIVELGARAPRISTFGI 484
Query: 466 GSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFE 525
G YCN+YFL+MLA+I RG A+ + QM+++ + VL NI + L D E
Sbjct: 485 GHYCNYYFLKMLAVIGRGLSDVAFASGKLRGQMERMLVAAATPVLTNIGLAGLP--DGCE 542
Query: 526 MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICA 585
+YP IPDL +PL+VSG++ G FPDT+ G + D S + E+ + +PL ++ A
Sbjct: 543 VYPFAIPDLFCGNPLVVSGKFHGSFPDTISFFGLMPDQSTWKTEVPSRKTSKVPLSKVFA 602
Query: 586 KQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGS 645
KQQ+DLLT QAW DKR +++ +S+ TG+ CEYTRMI ET E +K
Sbjct: 603 KQQLDLLTGQAWLYGDKRRQQEAVNLSLATGMPCEYTRMIGFETTPE-KYQEFQNERKKG 661
Query: 646 KKSDHQKILDSEVPKTIMLQNLGIGFG 672
KK + +K +V +++ L IGFG
Sbjct: 662 KKLNIKKFTAGKVAGVVLVGGLAIGFG 688
>gi|108708733|gb|ABF96528.1| von Willebrand factor type A domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 477
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 217/428 (50%), Positives = 288/428 (67%), Gaps = 11/428 (2%)
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVE 389
+FI+D SGSMQGKPLE KNA+ LS+L GD FNI+ FN E + FS+ +E ++ +E
Sbjct: 61 VFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDYFNIITFNDELHSFSSCLEQVNEKTIE 120
Query: 390 RAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSR 449
A +W+ NFIA G T+I PL++A+ +L+NS ++P IFLVTDG+VEDER IC +K +
Sbjct: 121 NAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNALPQIFLVTDGSVEDERNICRTVKEQ 180
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
L GS PRI TFG+GSYCNHYFLRMLA I +G+Y AA+D SIE +M + F + S++
Sbjct: 181 LATRGSKSPRISTFGLGSYCNHYFLRMLASIGKGHYDAAFDTGSIEGRMVQWFQKASSTI 240
Query: 510 LANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVE 569
+ NI+ID K + EFE+ I D+S++ PL VSGRYQGK P+TL AKG+L D+S +E
Sbjct: 241 VINISIDATKYIREFEVDSEYIQDISAKCPLCVSGRYQGKLPETLTAKGYLADMSEISIE 300
Query: 570 LKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
L++Q KDI LD++ AKQQ+DLLTA+AWF E+ +LE KV K+S+Q + EYTR I+++
Sbjct: 301 LRVQHVKDISLDKVLAKQQMDLLTAKAWFYENNQLEMKVVKLSIQNSIPSEYTRTILLQ- 359
Query: 630 DERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPK 689
N + K+ KK+ Q +E T L L +GFGD AT EN G K
Sbjct: 360 ---NFVEKIEQGKQKPKKNSTQ----NEQSAT-SLNGLTLGFGDTAATRENLSAGFGDTK 411
Query: 690 LPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCS 749
PE E+F KA CCS L + CCC+C I C+K+ND+C IV+ QLC AL+C CFECCS
Sbjct: 412 PPERFEMFDKAVG-CCSRLTDCCCCMCFINTCSKMNDRCAIVMVQLCGALSCLACFECCS 470
Query: 750 NLCCCGQD 757
LCC G D
Sbjct: 471 -LCCGGSD 477
>gi|168036243|ref|XP_001770617.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678138|gb|EDQ64600.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 634
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 233/636 (36%), Positives = 351/636 (55%), Gaps = 33/636 (5%)
Query: 2 ADDFVKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPKI 61
A+ F +V G L+++L +G P+ P + S MD + + L PMVYAVITD +
Sbjct: 3 ANSFATSVGAGRVLAQQLRYGG--PLDYPTM-SPMDGTATQEAFL-TCPMVYAVITDMNV 58
Query: 62 VDNPDIPSYQPY-VHGRCDP------PSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHC 114
D Y PY H + P P+ +PL + +++ V+C++ T FV + TW V+C
Sbjct: 59 FGKADA-EYDPYNQHPQTYPMNAPTLPAFVPLTLGKMKVKVECHISTGFVTMKSTWHVNC 117
Query: 115 VMGSKSCDCRIAVPMGDQGSILGVEAEISG-KSYHTQLIALGENDGAGKSASVETGSFLK 173
+ SCDC +A+PM +QG++ VE ++ + Y T ++ E G S
Sbjct: 118 IRSQASCDCLLALPMDNQGTVSSVEIDMGNDRLYTTTVVPTEEAASYGARGS-------- 169
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIH 233
P +GG+ L +++ W Q +++ +G +S+ VP FP+ + P ++ K+
Sbjct: 170 --------PDAEGGTKLEVKVTWFQSMTFDNGMYSLRVPLVFPQEILPLETQLVSIIKVK 221
Query: 234 LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGG 293
+N GT V+ + +++ R+ GK+ + + W N DF SY V IF
Sbjct: 222 CAINTGTNDYVVVGAFGNSMEEAEREPGKVKLKKDGD--DWKNQDFIASYKVWSDGIFPN 279
Query: 294 VLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
++ Q ++D R FC+ + P KVF++ V+F++D SGSM GKP+ED + AL
Sbjct: 280 LIYQDGEPEELDSRGSFCLSISPPDPNKIKVFQRAVVFLLDRSGSMYGKPIEDARQALFF 339
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
AL L P DSFNIVAF+ E LFS+ ME AT A+ A +W N A G T+I PL +
Sbjct: 340 ALDSLKPEDSFNIVAFDHELTLFSSQMERATPNAIGWAREWAMTNCTARGGTDILGPLQQ 399
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
A L N ++P +FL+TDGAV DE+ IC AM+SR+ G PRI TFGIG YCN YF
Sbjct: 400 AFNFLENFPWAVPYVFLITDGAVSDEQNICLAMQSRIAALGVRSPRISTFGIGFYCNFYF 459
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
L+MLA+I RG A+ D I QM+++ + VL N+ + L D E+YP I D
Sbjct: 460 LKMLAVIGRGMSDVAFTSDKIRKQMERMLVAAATPVLTNLGLAGLP--DGCEVYPFPIAD 517
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLT 593
L +PLIVSG++ G+FP ++ G L D S + +E+ + PL++I A+QQ+DLLT
Sbjct: 518 LFCGNPLIVSGKFHGEFPKSIVVTGLLPDQSTWQLEIPSRKDSKFPLNKIFARQQLDLLT 577
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
QAW D +++ +S+ TG+ C+YTR+I ET
Sbjct: 578 GQAWLYGDMNRQQEAVNLSLATGLPCQYTRVIGFET 613
>gi|168019616|ref|XP_001762340.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686418|gb|EDQ72807.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 621
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 235/630 (37%), Positives = 360/630 (57%), Gaps = 32/630 (5%)
Query: 6 VKAVEDGLKLSKRLYFGKDRPVTPPKVPSSMDKSGAVDMLLPKSPMVYAVITDPKIVDNP 65
+++V G L+++L++G +T P + MD + + LP PMVYA+ITD K+ NP
Sbjct: 1 MRSVAAGRMLAEQLHYGG--VLTRPTI-QDMDGT-RLQEDLPTCPMVYAIITDMKVFGNP 56
Query: 66 DIPS-----YQP-YVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSK 119
+ + Y P Y PP+L+PL + +L+V+C + T FV TW ++CV
Sbjct: 57 KVDNDHYEQYPPTYPLDAPPPPALVPLTLEKSKLNVECCISTGFVTSKCTWNLNCVRSQA 116
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTL 179
SCDC +AVP+ +QG++ VE ++ +T ++ D A +S+
Sbjct: 117 SCDCLLAVPIDNQGTVSSVEIDMGHDRLYTTIVV--PTDEAASYSSL------------- 161
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAG 239
Q++GG+ L +++ W Q + + +G +S+ VPF FPE++ P K+ K+ ++N G
Sbjct: 162 ---QVEGGAKLEVKVTWFQSMIFDNGMYSLRVPFVFPEHILPIATKLSSIIKVKCSINTG 218
Query: 240 TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSP 299
T V + ++++ R+ GK+ + W N DF SY IF ++ Q P
Sbjct: 219 TNGGVELGAFGNPMEEILREHGKVIFKKGGN--DWKNQDFIASYKTWSDGIFPNLIFQDP 276
Query: 300 SLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
+ D R FC+ + P KVF++ V+F++D SGSM+GKP+E + AL L L
Sbjct: 277 EPGESDNRGSFCLSISPPDPNKIKVFQRAVVFLLDRSGSMKGKPIEAARQALYFGLESLK 336
Query: 360 PGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
P DSFNI+AF+ + LFS ME +T ++ RA +W N A G T+I +PL +A ++L
Sbjct: 337 PEDSFNIIAFDHDLTLFSPQMERSTTTSIARACEWSMTNCTARGGTDILSPLQQAFQLLE 396
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
N G+IP +FL+TDGAV E+ IC M+SR+ G+ PRI TFGIG YCN YFL+MLA+
Sbjct: 397 NFPGAIPYVFLITDGAVSAEQNICLTMQSRIVALGARAPRISTFGIGHYCNFYFLKMLAV 456
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
I RG A+ D I QM+++ + VL NI + L D E+YP IPDL +P
Sbjct: 457 IGRGMNEVAFKSDKIRGQMERMLVATAAPVLTNIGLARLP--DNCEVYPFPIPDLFCGNP 514
Query: 540 LIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFS 599
LI+SG++ GKFPD+L G + D S + +E+ + + +PL+R+ AKQQ+DL T QAW
Sbjct: 515 LIISGKFYGKFPDSLIVMGLMPDQSTWQIEVPSRNSSKLPLNRVFAKQQLDLFTGQAWLC 574
Query: 600 EDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
+K E++ +S+ TG+ C+YTRMI ET
Sbjct: 575 GNKSREQEAVNLSLSTGLPCQYTRMIGFET 604
>gi|326515718|dbj|BAK07105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/421 (46%), Positives = 280/421 (66%), Gaps = 14/421 (3%)
Query: 339 MQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGIN 398
M+GKPLE+ KNA++ ALS+L GD FNI+ FN E + FS +E ++A+ A+ W+ N
Sbjct: 1 MKGKPLENVKNAVSTALSELVQGDYFNIITFNEELHSFSACLEKVNEKAIANANDWMNAN 60
Query: 399 FIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
F+A G T+I PL +A+ +L+++ ++P IFL+TDG+V+DE IC +K+ L + GS P
Sbjct: 61 FVAEGGTDIMHPLNEAMALLSSAHDAVPQIFLMTDGSVDDEHDICQTVKNELLSRGSKSP 120
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
RI TFG+G YCNHYFLRM+A I +G++ AA + SIE ++ K F + ++++ANI+ID
Sbjct: 121 RISTFGLGLYCNHYFLRMVASIGKGHFDAALETGSIESRILKWFRKASNTIVANISIDAT 180
Query: 519 KDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI 578
K LD+FE+ IPD+S++ PL VSG+YQGKFP+T+ A G+L D++ +ELK+Q KD+
Sbjct: 181 KQLDDFEVDSECIPDISAQCPLCVSGKYQGKFPETVVATGYLADMTEISIELKVQHIKDM 240
Query: 579 PLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET--DERNNAS 636
PLD + A QQI LLTA+AW S DK+LE KV K+S++ VL EYT M++++T D
Sbjct: 241 PLDNVFAPQQIALLTAKAWLSADKQLERKVIKLSIENSVLSEYTSMVVLQTNLDAAQKVK 300
Query: 637 ESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEI 696
+ P KG+ + P L L +GFGD AT EN G K E +I
Sbjct: 301 QKPKGHKGANE-----------PLRFQLHGLKLGFGDKAATKENLLTGFGEEKPLETLKI 349
Query: 697 FVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCCGQ 756
F K A CCS + + CC+CCI+ CN++NDQC I++ Q+C AL+C GC+ECC+ +CC G
Sbjct: 350 F-KKAGGCCSRVADCLCCMCCIKACNRMNDQCAIMMAQVCAALSCLGCYECCAEVCCGGS 408
Query: 757 D 757
+
Sbjct: 409 E 409
>gi|357514089|ref|XP_003627333.1| hypothetical protein MTR_8g021230, partial [Medicago truncatula]
gi|355521355|gb|AET01809.1| hypothetical protein MTR_8g021230, partial [Medicago truncatula]
Length = 311
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/308 (59%), Positives = 231/308 (75%)
Query: 41 AVDMLLPKSPMVYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLD 100
++D LP +PM YAVI DP IVDNPDI SYQPYV+GRCDPP+LIPLQ+ +E+DVDC LD
Sbjct: 4 SIDEHLPSAPMCYAVIPDPDIVDNPDIRSYQPYVYGRCDPPALIPLQLQGIEMDVDCCLD 63
Query: 101 TAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGA 160
TAFV V+G WRVHCV GS CDC++A+P+GDQGS+LGVE + G+SYHT+L++L +
Sbjct: 64 TAFVTVTGRWRVHCVRGSSLCDCQVAIPIGDQGSLLGVEVDDFGRSYHTELVSLKDEKEK 123
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
K A + G +LK I+ + +PQ GGS S+++RWSQK+ + DGEF +N+PF FP+YV
Sbjct: 124 QKMAKAKDGYYLKSQIYIVKIPQFKGGSIFSVKIRWSQKILFHDGEFCLNIPFCFPKYVN 183
Query: 221 PAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFD 280
P + I K+EKI L +N+GT EVLC +SH LK + GKL SYE+EV WS+ DF
Sbjct: 184 PVGRSISKKEKIFLKLNSGTANEVLCKATSHPLKGVMHQAGKLSLSYEAEVPAWSSTDFS 243
Query: 281 FSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQ 340
FSYTVS S IFGGVLLQSP L D D+RE+FC+YL PG RKVFKKDV+F+VDIS SM+
Sbjct: 244 FSYTVSSSDIFGGVLLQSPFLRDFDEREIFCLYLYPGDFLDRKVFKKDVVFVVDISASMK 303
Query: 341 GKPLEDTK 348
G PLE+ K
Sbjct: 304 GSPLENVK 311
>gi|297607612|ref|NP_001060262.2| Os07g0614300 [Oryza sativa Japonica Group]
gi|255677968|dbj|BAF22176.2| Os07g0614300, partial [Oryza sativa Japonica Group]
Length = 352
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 182/356 (51%), Positives = 243/356 (68%), Gaps = 10/356 (2%)
Query: 398 NFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSIC 457
NF+AGG T+I PL +A+ L+++ +P IFL+TDG+V+DE IC +K+ L + GS
Sbjct: 1 NFVAGGGTDIMHPLNEAMASLSSAHDVLPQIFLMTDGSVDDEHNICQTVKTELISRGSKS 60
Query: 458 PRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT 517
PRI TFG+G YCNHYFLRMLA I RG+Y AA++ SIE Q+ + F + S+++ANI+ID
Sbjct: 61 PRISTFGLGLYCNHYFLRMLASIGRGHYDAAFETGSIESQVLRWFRKASSTIVANISIDA 120
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD 577
LDEFE+ IPD+S++SPL +SG+YQGKFPD + AKG+L D+ +ELK+Q KD
Sbjct: 121 TAHLDEFEIDSEYIPDISAKSPLCISGKYQGKFPDMVTAKGYLADMREISIELKVQHIKD 180
Query: 578 IPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
IPLD++ A QQI LLTA+AW S DK+LE KV K+S+Q + EYT M++++T E+ +A++
Sbjct: 181 IPLDKVLAAQQIGLLTAKAWLSSDKQLERKVVKLSIQNSIPSEYTSMVLLQTLEKVDAAQ 240
Query: 638 SPGTK-KGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEI 696
K KG K D P+ I LQ L +GFGD AT EN G K E EI
Sbjct: 241 KVKQKLKGHKGPDE--------PRRIPLQCLKLGFGDRAATRENLVTGFGDVKPLETFEI 292
Query: 697 FVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLC 752
KAA CCS L + CC+CCI+ CNK+NDQC IV+TQ+C A AC GC+ECC+ LC
Sbjct: 293 LNKAA-GCCSRLADCLCCMCCIKACNKMNDQCAIVMTQVCAAFACLGCYECCAELC 347
>gi|297601071|ref|NP_001050335.2| Os03g0407000 [Oryza sativa Japonica Group]
gi|255674580|dbj|BAF12249.2| Os03g0407000, partial [Oryza sativa Japonica Group]
Length = 360
Score = 364 bits (935), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 187/370 (50%), Positives = 248/370 (67%), Gaps = 11/370 (2%)
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+E A +W+ NFIA G T+I PL++A+ +L+NS ++P IFLVTDG+VEDER IC +K
Sbjct: 2 IENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNALPQIFLVTDGSVEDERNICRTVK 61
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+L GS PRI TFG+GSYCNHYFLRMLA I +G+Y AA+D SIE +M + F + S
Sbjct: 62 EQLATRGSKSPRISTFGLGSYCNHYFLRMLASIGKGHYDAAFDTGSIEGRMVQWFQKASS 121
Query: 508 SVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFV 567
+++ NI+ID K + EFE+ I D+S++ PL VSGRYQGK P+TL AKG+L D+S
Sbjct: 122 TIVINISIDATKYIREFEVDSEYIQDISAKCPLCVSGRYQGKLPETLTAKGYLADMSEIS 181
Query: 568 VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV 627
+EL++Q KDI LD++ AKQQ+DLLTA+AWF E+ +LE KV K+S+Q + EYTR I++
Sbjct: 182 IELRVQHVKDISLDKVLAKQQMDLLTAKAWFYENNQLEMKVVKLSIQNSIPSEYTRTILL 241
Query: 628 ETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEI 687
+ N + K+ KK+ Q +E T L L +GFGD AT EN G
Sbjct: 242 Q----NFVEKIEQGKQKPKKNSTQ----NEQSAT-SLNGLTLGFGDTAATRENLSAGFGD 292
Query: 688 PKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFEC 747
K PE E+F KA CCS L + CCC+C I C+K+ND+C IV+ QLC AL+C CFEC
Sbjct: 293 TKPPERFEMFDKAVG-CCSRLTDCCCCMCFINTCSKMNDRCAIVMVQLCGALSCLACFEC 351
Query: 748 CSNLCCCGQD 757
CS LCC G D
Sbjct: 352 CS-LCCGGSD 360
>gi|226531948|ref|NP_001141509.1| uncharacterized protein LOC100273621 [Zea mays]
gi|194704870|gb|ACF86519.1| unknown [Zea mays]
gi|414867179|tpg|DAA45736.1| TPA: hypothetical protein ZEAMMB73_410449 [Zea mays]
Length = 357
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 176/363 (48%), Positives = 244/363 (67%), Gaps = 6/363 (1%)
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGG 454
+ +NF+A G TNI PL++A+ +L+ S ++P I+ VTDG+V+DER IC +K++L G
Sbjct: 1 MNLNFVAQGGTNIMHPLSEAMTLLSTSHDALPQIYFVTDGSVDDERNICHTLKTQLIKSG 60
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
S PRI TFG+GSYCNHYFLRMLA I +G+Y AA+D SIE +M + F + S++++N++
Sbjct: 61 SKSPRISTFGLGSYCNHYFLRMLASIGKGHYAAAFDTGSIEGRMLQWFQKASSTIVSNVS 120
Query: 515 IDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQL 574
ID +K + +FE+ IPD+S++ PL VSGRY GK P+TL AKG L D+S +ELK+Q
Sbjct: 121 IDAIKHIQDFEVDSVYIPDISAKYPLCVSGRYNGKLPETLIAKGHLADMSEISIELKVQH 180
Query: 575 AKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
KDIPLD++ AKQQ+DLLTA+AW E+K LE KV K+S+Q + EYTRM++++T
Sbjct: 181 IKDIPLDKVLAKQQMDLLTAKAWLMENKELERKVVKLSIQNSLPSEYTRMVLLQTSLDKI 240
Query: 635 ASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVA 694
K +K+S E+P + L L +GFGD+ AT EN G K E
Sbjct: 241 DPAQQAKNKPTKQSSS----PDELP-AMPLGGLAVGFGDVAATTENLTTGFGDMKAAEKF 295
Query: 695 EIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLCCC 754
+ KAA CCS + + CCC+C I+ C+K+NDQC IV+ Q C A+AC GCFECCS LCC
Sbjct: 296 VMLEKAA-GCCSRMADCCCCMCFIKACSKMNDQCAIVMAQACAAIACLGCFECCSELCCG 354
Query: 755 GQD 757
G +
Sbjct: 355 GSN 357
>gi|33147004|dbj|BAC80088.1| unknown protein [Oryza sativa Japonica Group]
Length = 342
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 175/344 (50%), Positives = 234/344 (68%), Gaps = 10/344 (2%)
Query: 410 PLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYC 469
PL +A+ L+++ +P IFL+TDG+V+DE IC +K+ L + GS PRI TFG+G YC
Sbjct: 3 PLNEAMASLSSAHDVLPQIFLMTDGSVDDEHNICQTVKTELISRGSKSPRISTFGLGLYC 62
Query: 470 NHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPS 529
NHYFLRMLA I RG+Y AA++ SIE Q+ + F + S+++ANI+ID LDEFE+
Sbjct: 63 NHYFLRMLASIGRGHYDAAFETGSIESQVLRWFRKASSTIVANISIDATAHLDEFEIDSE 122
Query: 530 RIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQI 589
IPD+S++SPL +SG+YQGKFPD + AKG+L D+ +ELK+Q KDIPLD++ A QQI
Sbjct: 123 YIPDISAKSPLCISGKYQGKFPDMVTAKGYLADMREISIELKVQHIKDIPLDKVLAAQQI 182
Query: 590 DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTK-KGSKKS 648
LLTA+AW S DK+LE KV K+S+Q + EYT M++++T E+ +A++ K KG K
Sbjct: 183 GLLTAKAWLSSDKQLERKVVKLSIQNSIPSEYTSMVLLQTLEKVDAAQKVKQKLKGHKGP 242
Query: 649 DHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTL 708
D P+ I LQ L +GFGD AT EN G K E EI KAA CCS L
Sbjct: 243 DE--------PRRIPLQCLKLGFGDRAATRENLVTGFGDVKPLETFEILNKAA-GCCSRL 293
Query: 709 GNKCCCLCCIECCNKLNDQCVIVLTQLCTALACFGCFECCSNLC 752
+ CC+CCI+ CNK+NDQC IV+TQ+C A AC GC+ECC+ LC
Sbjct: 294 ADCLCCMCCIKACNKMNDQCAIVMTQVCAAFACLGCYECCAELC 337
>gi|303272389|ref|XP_003055556.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463530|gb|EEH60808.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 802
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 218/730 (29%), Positives = 342/730 (46%), Gaps = 49/730 (6%)
Query: 30 PKVPSSMDKSGAVDMLLPKSPMVYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMN 89
P+ ++ A L P P +Y T + P Y Y CD
Sbjct: 8 PRAVEAVHPQAAEGALRPSCPGLY---TTTPCAEGPHAGEYF-YNAAVCD---------- 53
Query: 90 AVELDVDCYLDTAFVRVSGTWRVHCVM--GSKSCDCRIAVPMGDQGSILGVEAEISGKSY 147
++ DV+C+L T FV + W V + G S C A+P+ + ++ G+ A S K
Sbjct: 54 SIRFDVECHLSTGFVTCTSRWTVPKALLNGKASGSCLFAMPLANGLTVTGITARSSKKCV 113
Query: 148 HTQLIALGENDG-AGKSASVETGSFL--KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRD 204
+ ++ L + GKS T P++F +T+P + + + + + L + +
Sbjct: 114 TSAIVPLEDTTNFKGKSFGRNTTEHAPEHPDVFCMTVPNVAADEVVDVCVTYFHPLEFEN 173
Query: 205 GEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLG 264
G ++ P PE P K + + + +V ++SH L+ +++ G
Sbjct: 174 GAYAFTAPSTMPEASLPWDKDHVGVCSFTVAIRSAYEGDVAVASTSHPLQIIQQQSGFKK 233
Query: 265 YSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQ------REMFCMYLLPGT 318
+ + W N DF + HI + Q P+ H + Q R F + L P
Sbjct: 234 VRASAAMGGWKNADFKLTMPTWAEHIVAACVQQRPT-HILGQPPPRDPRSSFVVALAPPK 292
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN-GETYLFS 377
+ F + V+F++D SGSM G+P+E AL L L D FNI AF+ G+ Y +
Sbjct: 293 PERCLAFGRSVVFVIDRSGSMNGEPMEAANEALTTGLRSLTEHDYFNICAFDDGQEYFDA 352
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN--SRGSIPIIFLVTDGA 435
+M AT + VERA W+ + +A +T+I PL++A+++L G++P +FL+TDGA
Sbjct: 353 NAMTQATPKNVERAMAWMNEHCVARYTTDIYTPLSEALKLLAGCAGNGTVPFVFLITDGA 412
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V DE++IC + + G PR+ TFGIG YCNHYFL+MLA I RG + AA+ D I
Sbjct: 413 VSDEKEICKMLMAESQQKGEALPRVCTFGIGQYCNHYFLKMLANIGRGLFDAAFTNDKIA 472
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
QM K+ T S VL NI I + E E+YP +PDL +P++VSG+ G P T+
Sbjct: 473 TQMSKMLTAARSPVLTNIEIG-VGVGSEVELYPFPVPDLYLATPVMVSGKVMGVLPPTIS 531
Query: 556 AKGFLGDLSNFVVELKLQLAKD-------IPLDRICAKQQIDLLTAQAWFSEDKRLEEKV 608
+G D V E + + D +PL+++ KQ+IDLL A+AW +DK+LE+++
Sbjct: 532 IRGRTAD--GGVWEQTVAVQNDDAIDTLTVPLEKVFVKQRIDLLVAKAWLYDDKKLEKEI 589
Query: 609 SKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLG 668
+SV+ V C YT++ E N KG K + ++L G
Sbjct: 590 IDISVEHDVACPYTKLCAFEVTPANEEKMKKDKAKGKKSKIAMYAVGGAA-GILVLGAAG 648
Query: 669 --IGFGDLTATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLND 726
IGFG++ T N G+ + + E N S +G CC +C + C + D
Sbjct: 649 AMIGFGNIAGTLAN---GAGV--VGSAGEALAGVIGNAGSQIGEGCCTVCDV-CGDTCGD 702
Query: 727 QCVIVLTQLC 736
C T LC
Sbjct: 703 ACGFA-TGLC 711
>gi|255079988|ref|XP_002503574.1| predicted protein [Micromonas sp. RCC299]
gi|226518841|gb|ACO64832.1| predicted protein [Micromonas sp. RCC299]
Length = 753
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 192/623 (30%), Positives = 298/623 (47%), Gaps = 32/623 (5%)
Query: 85 PLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCD-----CRIAVPMGDQGSILGVE 139
P + N++E +V+C+L TAFV W V + S + C A+PM +G + +
Sbjct: 45 PAECNSLEFNVECHLSTAFVTCKTQWMVPSKLLKSSANQGKNSCTFALPMDQKGVVTSIS 104
Query: 140 AEISGKSYHTQLIALGENDGAGKSASVETGSF------LKPNIFTLTLPQIDGGSYLSIR 193
A++ K H Q + D GK + G + P +F + +P + + I
Sbjct: 105 AKM--KKMHIQSCVVPVAD-TGKFSGKRFGRNGPEPCPVHPEVFAIAIPNVPANELVDIE 161
Query: 194 LRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHL 253
+ + Q L + DG + P P+ P ++ I V V+ SH +
Sbjct: 162 VTYFQPLKFEDGAYIFEAPTTLPKGALPQGAQLKDIVHIMTTVRGARPEPVMVECKSHPV 221
Query: 254 KQLRRDVGKLGYSYESEVLKWSNIDFDFSYTV-SPSHIFGGVLLQSPSLHDVDQREMFCM 312
++ G + KW N ++ V SPS V SP D R F +
Sbjct: 222 TLVQAGPGVTQVMIDPRSNKWGNGNYLMRMPVWSPSITAAAVQQPSPKGPTADTRAPFII 281
Query: 313 YLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE 372
+ P KS F + V FIVD SGSM GKP+ AL LS L P D+FNI AF+
Sbjct: 282 SISPPDPKSCAPFARSVFFIVDRSGSMTGKPMAGANQALLAGLSSLGPQDTFNICAFDNL 341
Query: 373 TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIPIIFL 430
S M A+ E + A WI + A G+T+I +PL AV +L+ G++P+I++
Sbjct: 342 QEYLSEDMVPASPENINAAKGWIQGHCTARGTTDILSPLRAAVAILSKRPLLGAVPLIYV 401
Query: 431 VTDGAVEDERQICDAMKSRLT----NGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
VTDGAVE+ER+IC M+ ++ G PR+ TFGIG YCNHYFL+ML+ I +G
Sbjct: 402 VTDGAVENEREICRYMQEVMSAPPPEGLMTHPRVCTFGIGRYCNHYFLKMLSQIGKGLSD 461
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANI--AIDTLKDLDEFEMYPSRIPDLSSESPLIVSG 544
AAY + + QM L + VL ++ I + + E+YP +PDL +P++V+G
Sbjct: 462 AAYTDERVGSQMIALINASRTPVLTDVMLGIPGAGESSKVEVYPFPVPDLYIGAPVMVAG 521
Query: 545 RYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD-------IPLDRICAKQQIDLLTAQAW 597
+ QG P ++ KG L + E + +A D IPLD++ K++ID+LTA AW
Sbjct: 522 KIQGGLPPSMSIKGRLASGEEW--ESTIPVAPDLQNNVLSIPLDKVFIKERIDMLTANAW 579
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSE 657
+ +K E V+ +S++ GV C +T++ E D + +A KKG + +
Sbjct: 580 LTNNKSAEADVTALSLEYGVPCPHTKLCAFEVDPKKSAEVEAAKKKGGAMKIAKYAVGGA 639
Query: 658 VPKTIMLQNLGIGFGDLTATAEN 680
++ G FG++ A+ N
Sbjct: 640 AGVMVLGALAGADFGNVGASLAN 662
>gi|414867178|tpg|DAA45735.1| TPA: hypothetical protein ZEAMMB73_410449 [Zea mays]
Length = 230
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 97/189 (51%), Positives = 139/189 (73%)
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGG 454
+ +NF+A G TNI PL++A+ +L+ S ++P I+ VTDG+V+DER IC +K++L G
Sbjct: 1 MNLNFVAQGGTNIMHPLSEAMTLLSTSHDALPQIYFVTDGSVDDERNICHTLKTQLIKSG 60
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
S PRI TFG+GSYCNHYFLRMLA I +G+Y AA+D SIE +M + F + S++++N++
Sbjct: 61 SKSPRISTFGLGSYCNHYFLRMLASIGKGHYAAAFDTGSIEGRMLQWFQKASSTIVSNVS 120
Query: 515 IDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQL 574
ID +K + +FE+ IPD+S++ PL VSGRY GK P+TL AKG L D+S +ELK+Q
Sbjct: 121 IDAIKHIQDFEVDSVYIPDISAKYPLCVSGRYNGKLPETLIAKGHLADMSEISIELKVQH 180
Query: 575 AKDIPLDRI 583
KDIPLD++
Sbjct: 181 IKDIPLDKV 189
>gi|357514093|ref|XP_003627335.1| hypothetical protein MTR_8g021260 [Medicago truncatula]
gi|355521357|gb|AET01811.1| hypothetical protein MTR_8g021260 [Medicago truncatula]
Length = 260
Score = 209 bits (532), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 115/247 (46%), Positives = 158/247 (63%), Gaps = 6/247 (2%)
Query: 498 MQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
MQ LF+ S ++A+I I +L+ LD E +P+ IPDLSS SPLI+SGRY G FP+ +K
Sbjct: 1 MQSLFSTASSVIVADITIKSLEGLDSQEFFPTHIPDLSSGSPLIISGRYNGTFPELVKVT 60
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGV 617
G L D ++F V+LK++ KD+ L + +K+ IDL+TA+AW E + LEEKVSKMS+Q V
Sbjct: 61 GTLADRTSFAVDLKVKREKDMKLTYVFSKRHIDLVTARAWLLESQELEEKVSKMSIQNNV 120
Query: 618 LCEYT--RMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLT 675
EYT MI+V ++ E P + + + K L+ E+PK + L L GFGDL
Sbjct: 121 PSEYTFMEMILVNKNDVKKEPE-PFLLQKAYGTLSFKGLEVEIPK-LFLGGLSHGFGDLK 178
Query: 676 ATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQL 735
ATAEN PP ++ K E + KAAS CC + + CC +C ++ C+ +NDQC IV TQL
Sbjct: 179 ATAENTPPATKEAKPSE--GLLGKAASTCCGRVADTCCGMCLLKTCSFVNDQCTIVCTQL 236
Query: 736 CTALACF 742
C ALACF
Sbjct: 237 CAALACF 243
>gi|302836738|ref|XP_002949929.1| hypothetical protein VOLCADRAFT_104477 [Volvox carteri f.
nagariensis]
gi|300264838|gb|EFJ49032.1| hypothetical protein VOLCADRAFT_104477 [Volvox carteri f.
nagariensis]
Length = 725
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 162/589 (27%), Positives = 255/589 (43%), Gaps = 103/589 (17%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS----QKLSYRDGEFSVNVPFKF 215
+G+ + ET + P +F L L G + RLR + + L++ +G + + +P +
Sbjct: 155 SGQPQNSETAN--DPELFCLPL-GPPGACQPADRLRVTITTFEPLTFEEGHYVMRLPTEI 211
Query: 216 PEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNT-SSHHLKQLRRDVGKLGYSYESEVLKW 274
P + P + + + +N G T V S H + G + S +
Sbjct: 212 PYSMIPEGYSHTQLLDVIVTINTGNPTTVKYGVRSGHRAVPGVQSPGTVTLSLDKSP-DM 270
Query: 275 SNIDFDFSYTVSPSHIFGGVLL---QSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIF 331
N D D Y V S +F + + ++P D D R F + L P + F + +IF
Sbjct: 271 PNTDVDVRYLVWGSDMFLALNVTPPRAPGPADPDPRGAFVLTLAPPAPEHTTPFPRSIIF 330
Query: 332 IVDISGSMQGKPLEDTK-----NALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL--AT 384
I+D SGSM G+PL K +AL+ L L P D F +VAF+ E F+ +L T
Sbjct: 331 ILDRSGSMMGEPLSYAKWVREGSALSYGLRLLTPLDQFTVVAFDHEQLWFTPGGQLLPGT 390
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT------------------------- 419
+ V WI + A G T+I PL A+ +LT
Sbjct: 391 AQNVAACESWIHSSISARGLTDIRGPLQTAMGVLTAAAGAAAGGQYPQQQQGGEQHQQGA 450
Query: 420 ---NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
+ +P IFLVTDG V DE+ IC Y +
Sbjct: 451 AAVGTPRPLPFIFLVTDGCVTDEKDIC---------------------------RYVEQH 483
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
LA RG + A+ +I+ QMQ + T VL+++ + TL +++ E+YP +PDL
Sbjct: 484 LATYGRGSFDVAFRPHAIQAQMQNMLTAAQRPVLSDLTL-TLPGVEQCELYPFPLPDLFC 542
Query: 537 ESPLIVSGRYQGKFPD---------------TLKAKGFLGD----LSNFVVELKLQLAKD 577
PL+V+G+++G +P + G L + SN V K D
Sbjct: 543 GMPLLVAGKFRGAWPPLRPLDQGQQDQGQGGGVCVNGMLPNGTAWSSNPVYPGK---ESD 599
Query: 578 IPLDRICAKQQIDLLTAQAWFSED-KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS 636
+PLD+I K ++DLLTAQAW + +L + +S+ TGV C +TR + ET ++ A+
Sbjct: 600 LPLDKIFIKNRLDLLTAQAWLEGNPPQLVNNIVDLSIATGVPCAHTRTVTFETTRKDFAA 659
Query: 637 -ESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIG----FGDLTATAEN 680
+ G +K ++ K ++L LG+G FGD+ ATA N
Sbjct: 660 LQHAANTGGRRKINYAKYAIGGAAGLVVLAGLGVGAAFAFGDVGATAAN 708
>gi|159475862|ref|XP_001696033.1| flagellar associated protein [Chlamydomonas reinhardtii]
gi|158275204|gb|EDP00982.1| flagellar associated protein [Chlamydomonas reinhardtii]
Length = 1043
Score = 165 bits (417), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 142/538 (26%), Positives = 240/538 (44%), Gaps = 72/538 (13%)
Query: 164 ASVETGSFLKPNIFTLTLP-QIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA 222
A+V P +F L +P + G + L + L++ +G + + +P + P + P
Sbjct: 154 AAVPEAHLNDPELFVLPMPGRAAPGDQFRVSLTTFEPLAFEEGSYVLRLPTELPAAMIPQ 213
Query: 223 IKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFS 282
+ + + +N G+ T V ++ L DVG W N D +
Sbjct: 214 GYSHTQLLDVIVTINTGSPTPV-----AYALPNADVDVG---------YRVWGN-DMFLA 258
Query: 283 YTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGK 342
VS + P D D R F + + P + F + V+F++D SGSM G+
Sbjct: 259 LNVSNP--------RPPHPADPDPRGAFVLSVAPPAPEFTAPFPRSVVFLLDRSGSMSGE 310
Query: 343 PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAG 402
P+E K AL L L P D+F +VAF+ E F+ +L+ W+ + A
Sbjct: 311 PMEFAKAALCFGLRSLTPLDTFTVVAFDHEQLWFTPGGQLS----------WVRASVDAR 360
Query: 403 GSTNICAPLTKAVEMLT------------NSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
G T+I PL A+ +L+ NS I I + A +DE+ IC ++ +
Sbjct: 361 GLTDIMTPLQTAMRVLSGGGTRIASHSYLNSWHGINNIRSLQT-ARQDEKDICRFVEGHM 419
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVL 510
+ LA RG + A+ +I+ QMQ L T VL
Sbjct: 420 RGLLQHQ----------------QQQLATYGRGAFDVAFRPHAIQAQMQHLLTAAQRPVL 463
Query: 511 ANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP-DTLKAKGFLGDLSNFVVE 569
+++ + TL + E+YP +PDL PL+V+G+Y G +P + G L + + +
Sbjct: 464 SDLTL-TLPGVAGCELYPYPLPDLFCGLPLVVAGKYSGAWPAGGITLNGTLPGGTPWSSK 522
Query: 570 LKLQLAK-DIPLDRICAKQQIDLLTAQAWFSED-KRLEEKVSKMSVQTGVLCEYTRMIIV 627
+ ++PLD+I K ++DLLTAQAW + + ++ ++ +S+ TG+ C +TR +
Sbjct: 523 PVFPGKEGNLPLDKIFIKSRLDLLTAQAWLAGNPPQMVNQIVDLSIATGIPCAHTRTVGF 582
Query: 628 ETDERNNAS-ESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIG----FGDLTATAEN 680
ET ++ A+ +S +G +K ++ K ++L LG+G FGDL AT N
Sbjct: 583 ETTHKDWAAIQSTAQSQGKRKVNYAKYAIGGAAGLVVLAGLGVGAAFVFGDLGATVAN 640
>gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium
profundum 3TCK]
gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium
profundum 3TCK]
Length = 714
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 157/568 (27%), Positives = 253/568 (44%), Gaps = 93/568 (16%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K+ + + G K++ VE +PNIF+ + I +++ + +
Sbjct: 129 IEGDIQPKAIAKKQYEAAQQAGV-KASLVEQH---RPNIFSTQVANIAPDESVTVEIEYQ 184
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTP------AIKKIPKREKI-----------------HL 234
+ + YRDGEFS+ +FP V P + K+P +I ++
Sbjct: 185 EAVLYRDGEFSL----RFPTVVAPRYIPVVPLNKVPDVNEITPFLRDLQDDPTLPFSLNI 240
Query: 235 NVNAGTGTEVLCNTSSHHLKQ--LRRDVGKLGY-----SYESEVLKWSNIDFDF-SYTVS 286
++NAG V+ NT SH Q L D L + VL W D S +
Sbjct: 241 DLNAGLPIAVI-NTPSHAFTQQKLSEDHYILSLIQPDIADRDVVLSWRPKATDLPSTALF 299
Query: 287 PSHIFG---GVLLQSPSL--HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQG 341
H+ G G+LL P H V+ + S +F + V F++DISGSM G
Sbjct: 300 TQHVEGQGYGLLLTMPPQVNHQVN------------STTSSALFHQSVTFVLDISGSMYG 347
Query: 342 KPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWI-GINFI 400
+ +E K AL L +L P DSFNIV FN E L+S + T + RA +++ G++
Sbjct: 348 ESIEQAKQALRYGLQQLQPEDSFNIVTFNHEAMLYSEQLLPVTSSTITRALRFVDGLD-- 405
Query: 401 AGGSTNICAPLTKAVEMLT----NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSI 456
A G T + A L A + T NS + I +TDG+V +E + D ++ +L +
Sbjct: 406 ADGGTEMAAALKAAFSIKTHDQLNSTRWLNQIVFITDGSVGNESALFDLIEQQLVDR--- 462
Query: 457 CPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
R++T GIGS N YF+ AM +G Y D+ + +M+ LF++ V+ +I +
Sbjct: 463 --RLFTVGIGSAPNSYFMTRAAMKGKGTYTYIGDVKEVNTKMRLLFSKISQPVMRDIKL- 519
Query: 517 TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFV----VELKL 572
D + +P+ +PDL + PL VS K PD G + V++
Sbjct: 520 AWSDGRSVDYWPNPVPDLYQQEPLQVS----FKIPDNAANLIITGQQVDHEWRQDVDIHQ 575
Query: 573 QLAKD-------IPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMI 625
LA D I LD I A+ QI + S D++ ++ + + ++ ++ YT +I
Sbjct: 576 GLAIDEKQPQPRIGLDIIWARNQISSIQMNPAISMDEK-KKHIELLGMKHHIVTRYTSLI 634
Query: 626 IV-------ETDERNNASESPGTKKGSK 646
V E + N+A+ P KG K
Sbjct: 635 AVDKTPARAEGEVVNDANVKPHLPKGWK 662
>gi|407688603|ref|YP_006803776.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii str. 'Balearic Sea AD45']
gi|407291983|gb|AFT96295.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii str. 'Balearic Sea AD45']
Length = 640
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/414 (27%), Positives = 192/414 (46%), Gaps = 60/414 (14%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT + I G +++ L++ Q++ YRDG+F+ ++P P Y
Sbjct: 76 RPNLFTQQVANIAPGEEITVTLQYVQQVDYRDGKFTFHLPTTLTPRYSPGIPLNQFNENT 135
Query: 219 ---------------------VTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLR 257
+TP +++ + ++ N TG + TS +H
Sbjct: 136 VAEISGTGWGEPTDQVPDARAITPYMREGNEGPQLTFNATLNTGLTLNSVTSRNHRVNWS 195
Query: 258 RDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPG 317
G ++ +K D + SPS + DQ + + L+P
Sbjct: 196 ESTGNYLVTFNQSNIKMDR-DIWLEWQPSPSSAPQAAIFTESK----DQHDYALVMLMPP 250
Query: 318 TAKSRKV--FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TY 374
KS+ + F +D+ F++D SGSM G+P+ D K +L +A+ +L D FN+VAFN + T
Sbjct: 251 QVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTTR 310
Query: 375 LFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSRGSIPIIFL 430
LF TS+E T+ ++ NF+ AGG T + L A++ T + I +
Sbjct: 311 LFETSVEGTTR------NKQYARNFVKHLNAGGGTEMAPALNAALKP-TTAENFIKQVVF 363
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
+TDGAV +E + +K+ L + R++T GIGS N YF+ A RG Y +
Sbjct: 364 ITDGAVGNEAALFSQIKNELGDA-----RLFTVGIGSAPNSYFMTRAAQFGRGSYVFVSN 418
Query: 491 LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE-FEMYPSRIPDLSSESPLIVS 543
I+ QM L + S VL+N+++ + E+YPS+IPDL + PL+++
Sbjct: 419 TADIKQQMDSLLYKLESPVLSNLSLTLPAGYAQSAEIYPSKIPDLYAGVPLLLN 472
>gi|358451087|ref|ZP_09161524.1| vault protein inter-alpha-trypsin subunit [Marinobacter
manganoxydans MnI7-9]
gi|357224771|gb|EHJ03299.1| vault protein inter-alpha-trypsin subunit [Marinobacter
manganoxydans MnI7-9]
Length = 747
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 149/613 (24%), Positives = 263/613 (42%), Gaps = 94/613 (15%)
Query: 75 HGRCDPPSLI---PLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGD 131
G+ P+L+ + L D L +F S WR + P+ +
Sbjct: 108 QGQWQEPALVLDSEFDIQVSGLIADTRLLRSFQNTSDQWR----------EGVFVFPLPE 157
Query: 132 QGSILGV-----EAEISGK--SYHTQLIALGENDGAGK-SASVETGSFLKPNIFTLTLPQ 183
+ S+ G+ E I GK T L + AG+ +A+VE +PN+FT +
Sbjct: 158 KASVYGLTMTVGERTIEGKVQPRETALQTYEKAKQAGQHAANVEQ---QRPNLFTARVAN 214
Query: 184 IDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPA----------------IKKI 226
I G +S++L++ Q + Y+ G F + VP P Y+ A ++
Sbjct: 215 IPPGETVSVKLKYQQPVQYQAGVFELTVPTTLTPRYMPGAPVSTAPDQWQGGWALPTTEV 274
Query: 227 P-------------------KREKIHLNVNAGTGTEVLCNTSSHHLK--------QLRRD 259
P R I+L +++G E + ++ SH L ++R D
Sbjct: 275 PDAGAISPFTVTPDDVAADSHRAVINLVIDSGLPLESV-SSPSHRLATSQDGQTFRIRPD 333
Query: 260 VGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTA 319
G++ + + V++W + G+ + H Q E + M +L
Sbjct: 334 GGEILMNRDF-VVRWRPV--------------AGLEPSAAVFHQSWQGEDYLMAMLVPGE 378
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
+D++F++D SGSM G+ + ++AL L L P D FN++ FN +T+
Sbjct: 379 NGTMALPRDLVFVIDTSGSMAGESIRQARSALQAGLDTLTPRDRFNVIQFNSQTHSLFMQ 438
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDGAVE 437
E+AT + RA +++ A G T + L++A+E + G + + +TDGAV
Sbjct: 439 PEVATGNNLARARRYVD-RLRADGGTEMAPALSRALEGAGETEGGARVRQVIFITDGAVG 497
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
+E + ++ +L + R++T GIGS N +F+R A RG Y A + ++
Sbjct: 498 NEAALFRQIRQQLGSQ-----RLFTVGIGSAPNRHFMREAARWGRGTYTAIHSPSDVDGP 552
Query: 498 MQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
+Q LF+ S VL +I +D E +P R DL PL+ R + LK
Sbjct: 553 LQALFSAMESPVLTDIRVDWPGQETGKESFPRRSGDLFQGEPLVHVVRGVPAMGE-LKVS 611
Query: 558 GFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTG 616
G L ++ L LQ A L R A+++ID L +A + + + +++++V G
Sbjct: 612 GRLPGGKDWTRTLDLQQAAPATGLHRHWAREKIDSLEDEARVTGREPDKAGITELAVSHG 671
Query: 617 VLCEYTRMIIVET 629
++ YT + V++
Sbjct: 672 LMSAYTSFVAVDS 684
>gi|349859157|gb|AEQ20610.1| marine proteobacterial sortase target protein [uncultured bacterium
CSLF43]
Length = 767
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 128/495 (25%), Positives = 225/495 (45%), Gaps = 55/495 (11%)
Query: 167 ETGSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP-- 221
+T S L +PNIFT ++ I G + I + + + L Y D F F FP V P
Sbjct: 138 QTASLLDQERPNIFTQSIANIAPGQAVKIVIGYVEPLKYEDAAFE----FTFPMVVGPRY 193
Query: 222 -------AIKKIPKR--------EKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYS 266
A + +P+R + L V+ + S H+ + R +
Sbjct: 194 IPNGLPDADRVVPRRTPEGTRAGHDLSLQVSLDAAMPLSALESPTHVVDVARPDARRALV 253
Query: 267 YESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKV 324
+ N DF Y V+ + I ++ S D+R + M +L P ++
Sbjct: 254 KLKDQATIPNKDFVLRYAVASTQIQDAIVAHS------DKRGGYFMLVLQPPARIAPAEI 307
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+++F++D SGSM G P+E K A+ +A+ L+P D+FN++ F+G+ ++ AT
Sbjct: 308 TPKEIVFVLDTSGSMMGFPIEKAKEAMKLAMDDLNPRDTFNLITFSGDEHILFPKPVPAT 367
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
E V A +++ GG+ + A KA ++ + I + +TDG V ++ +I
Sbjct: 368 PENVREAQKFLMSREGRGGTEMMKA--IKAALDPSDDQKHIRVACFMTDGEVGNDFEILH 425
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
A++ R++ FGIGS NH+ L +A RG D ++ R
Sbjct: 426 AVQQHPNA------RVFAFGIGSSVNHFLLDNMARQGRGEVEYVGLNDDGSAAAKRFHER 479
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG-FLGDL 563
+ VL +++ID L E+YP RIPDL S P++V GRY G +++ +G G
Sbjct: 480 VRTPVLTDVSID-WGSLPVSEVYPKRIPDLFSAKPVVVLGRYSGSANGSIRLRGKVAGRD 538
Query: 564 SNFVVELKL---QLAKDI-PLDRICAKQQI------DLLTAQAWFSEDKRLEEKVSKMSV 613
+ V+ + L + A D+ P+ + A+ ++ DL AQ ++ + V+++ +
Sbjct: 539 FDRVITVSLPAGEGAHDVAPV--LWARSKVDDLMNRDLRGAQTG-QQEPEIRNAVTQLGL 595
Query: 614 QTGVLCEYTRMIIVE 628
++ +YT + VE
Sbjct: 596 DYAIMTQYTSFVAVE 610
>gi|239813041|ref|YP_002941951.1| Vault protein inter-alpha-trypsin domain-containing protein
[Variovorax paradoxus S110]
gi|239799618|gb|ACS16685.1| Vault protein inter-alpha-trypsin domain protein [Variovorax
paradoxus S110]
Length = 691
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 158/615 (25%), Positives = 274/615 (44%), Gaps = 64/615 (10%)
Query: 65 PDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCR 124
P + + PY + D P++ L + A E+ V A V V+ T+R G ++ + +
Sbjct: 41 PRLKTESPYFFVKSDDPAVDRLPLKATEVAVKISGVIADVTVTQTYRNE---GQRAIEAK 97
Query: 125 IAVPMGDQGSILGVEAEISGKSYHTQLI----ALGENDGAGKSASVETGSFLK---PNIF 177
P + ++ G+ ++ + Q+ A E D A K +T + L+ PN+F
Sbjct: 98 YVFPGSTKAAVSGLNVRLADRLITAQIREKQQAQIEYDSAKKEG--KTAALLEQHLPNVF 155
Query: 178 TLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVN 237
+ + I G + + LR+++ L + G ++ F FP V P P+ E
Sbjct: 156 QMNVANILPGDDVKVELRYTELLVPQSGNYA----FVFPTVVGPRYNS-PQSENAQAKWV 210
Query: 238 AGTG--TEVLCNTSSH---------HLKQLRR-----DVGKLGYSYESEVLKWS------ 275
A +V NTS LK++R +V K ++++ +
Sbjct: 211 AQPTLRADVAPNTSFRLKASIDTPMGLKEVRSSSHTIEVKKSDEDRHADIVLAADGRPAD 270
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF-CMYLLPGTAKSRKVFKKDVIFIVD 334
N DF Y ++ I G++L D F M P + + +D IF+VD
Sbjct: 271 NRDFVLDYRLAGERIESGLMLYKGQNTDGSAENFFLAMVEPPKAVAASAISPRDYIFVVD 330
Query: 335 ISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQW 394
ISGSM G PL+ K L + L P D+FN++ F+G + S AT+ +E+A
Sbjct: 331 ISGSMHGFPLDTAKTVLERLIGGLRPSDTFNVLLFSGSNKMLSPRSVPATRANIEQALAT 390
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGG 454
I N+ GST + L + + S ++ LVTDG V ER+ + ++ L+
Sbjct: 391 IQ-NYGGSGSTELIPALKRVYAEPKEEKVSRTVV-LVTDGYVSVEREAFELVRKNLSKAN 448
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL-DSIEIQMQKLFTRGF--SSVLA 511
++ FGIGS N ++ I+R G + + D I+ Q R S VL
Sbjct: 449 -----VFAFGIGSSVNR---SLMEGIARAGMGEPFIITDPIQAPEQAARFRRMVESPVLT 500
Query: 512 NIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL---GDLSN--F 566
N+ + T LD +++ P +PD+ E P+IV G+++ KAKG + G +N +
Sbjct: 501 NVKV-TFGGLDVYDVEPQALPDVLGERPVIVFGKWRADAEG--KAKGRVIVEGQGANGPY 557
Query: 567 VVELKL--QLAKDI-PLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTR 623
EL++ QL +D L + A+ +I L+ Q +E+++++ ++ +L +YT
Sbjct: 558 RQELRIDPQLRQDTAALRTLWARHRIQSLSDQETLEGGSAFKERITELGLKYSLLTQYTS 617
Query: 624 MIIVETDERNNASES 638
I V+ RN A ++
Sbjct: 618 FIAVDKVVRNPAPQN 632
>gi|407700949|ref|YP_006825736.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii str. 'Black Sea 11']
gi|407250096|gb|AFT79281.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii str. 'Black Sea 11']
Length = 712
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 113/411 (27%), Positives = 193/411 (46%), Gaps = 54/411 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT + I G +++ L++ Q++ YRDG+F+ ++P P Y
Sbjct: 151 RPNLFTQQVANIAPGEEITVTLQYVQQVDYRDGKFTFHLPTTLTPRYSPGIPLNQFNEKI 210
Query: 219 ---------------------VTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLR 257
+TP +++ + ++ N TG + TS +H
Sbjct: 211 EAEISGTGWGEPTDQVPDARAITPYMREGNEGPQLTFNATLNTGLTLNSVTSRNHRVNWS 270
Query: 258 RDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHI-FGGVLLQSPSLHDVDQREMFCMYLLP 316
G ++ +K D + SPS + +S HD + L+P
Sbjct: 271 ESTGNYLVTFNQSNIKMDR-DIWLEWQPSPSSAPQAAIFTESKGQHDYA-----LVMLMP 324
Query: 317 GTAKSRKV--FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-T 373
KS+ + F +D+ F++D SGSM G+P+ D K +L +A+ +L D FN+VAFN + T
Sbjct: 325 PQVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTT 384
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
LF TS+E T+ + A ++ + AGG T + L A++ T + I + +TD
Sbjct: 385 RLFETSVEGTTRNK-QYARDFVK-HLNAGGGTEMAPALNAALKR-TTAENYIKQVVFITD 441
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
GAV +E + +K+ L + R++T GIGS N YF+ A RG Y +
Sbjct: 442 GAVGNEAALFSQIKNELGDA-----RLFTVGIGSTPNSYFMTRAAQFGRGSYVFVSNTAD 496
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDE-FEMYPSRIPDLSSESPLIVS 543
I+ QM L + S VL+++++ + E+YPS+IPDL + PL+++
Sbjct: 497 IKQQMDSLLYKLESPVLSDLSLTLPAGYAQSAEIYPSKIPDLYAGVPLLLN 547
>gi|407684680|ref|YP_006799854.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii str. 'English Channel 673']
gi|407246291|gb|AFT75477.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii str. 'English Channel 673']
Length = 648
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 113/411 (27%), Positives = 194/411 (47%), Gaps = 54/411 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT + I G + + L++ Q++ YRDG+F+ ++P P Y
Sbjct: 76 RPNLFTQQVANIAPGEEIMVTLQYVQQVDYRDGKFTFHLPTTLTPRYSPGIPLNQFNENI 135
Query: 219 ---------------------VTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLR 257
+TP +++ + ++ N TG + TS +H
Sbjct: 136 EAEISGTGWGEPTDQVPDARAITPYMREGNEGPQLTFNATLNTGLTLNSVTSRNHRVNWS 195
Query: 258 RDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHI-FGGVLLQSPSLHDVDQREMFCMYLLP 316
G ++ +K D + SPS + +S HD + L+P
Sbjct: 196 ESTGNYLVTFNQSNIKMDR-DIWLEWQPSPSSAPQAAIFTESKGQHDYA-----LVMLMP 249
Query: 317 GTAKSRKV--FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-T 373
KS+ + F +D+ F++D SGSM G+P+ D K +L +A+ +L D FN+VAFN + T
Sbjct: 250 PQVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTT 309
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
LF TS+E T+ + A ++ + AGG T + L+ A++ T ++ I + +TD
Sbjct: 310 RLFETSVEGTTRNK-QYARDFVK-HLNAGGGTEMAPALSAALKR-TTTKDFIKQVVFITD 366
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
GAV +E + +K+ L + R++T GIGS N YF+ A RG Y +
Sbjct: 367 GAVGNEAALFSQIKNELGDA-----RLFTVGIGSAPNSYFMTRAAQFGRGSYVFVSNTAD 421
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEF-EMYPSRIPDLSSESPLIVS 543
I+ QM L + S VL+++++ + E+YPS+IPDL + PL+++
Sbjct: 422 IKQQMDSLLYKLESPVLSDLSLTLPAGYAQLAEIYPSKIPDLYAGVPLLLN 472
>gi|223939755|ref|ZP_03631626.1| Vault protein inter-alpha-trypsin domain protein [bacterium
Ellin514]
gi|223891531|gb|EEF58021.1| Vault protein inter-alpha-trypsin domain protein [bacterium
Ellin514]
Length = 806
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 217/475 (45%), Gaps = 29/475 (6%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL 234
++ L + I+ I L ++Q L G + +P +Y + IK + + +
Sbjct: 169 DVLKLKIFPIEPNGKKRITLSYTQVLKLDSGLLNYVLPMNAGKYSSKPIKSV----SVKV 224
Query: 235 NVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGV 294
NV + + + + SH ++ R + YE+ +K D D +P GV
Sbjct: 225 NVESKRPLKTIY-SPSHEVEVKRDGSNRATVGYEASEVK---PDADLQLYFAPEKDEIGV 280
Query: 295 LLQSPSLHDVDQREMFCMYLLPGT-AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
L + D D F + PG AK++++ KDV+F++D SGSM GK +E K AL
Sbjct: 281 NLMAYKTGDEDG--YFLLLASPGVDAKAKQIVSKDVVFVLDTSGSMSGKKMEQAKKALQF 338
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
+ L+ GD F I+ F+ E+ + +KE E+A +I N A G T I L K
Sbjct: 339 CVESLNDGDRFEIIRFSTESEPLFDKLAAVSKENREKAGDFIK-NLKAMGGTAIDEALKK 397
Query: 414 AVEMLTNSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYC 469
A+ + + ++FL TDG DE QI M+ R RI+ FGIG+
Sbjct: 398 ALSLESKEGRPFVVVFL-TDGLPTVGTTDEDQILKGMQER----NKEKRRIFCFGIGTDV 452
Query: 470 NHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPS 529
N + L +A +R + + +E+++ F++ VLAN + D+ +MYPS
Sbjct: 453 NTHLLDRIAEETRAFSQYVLPEEDLEVKVSSFFSKINEPVLANPKLKFTADIRTTKMYPS 512
Query: 530 RIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAKD----IPLDRIC 584
+PDL L++ GRY GK G + GD F +L D IP R+
Sbjct: 513 PLPDLFKGEQLVLVGRYSGKGSSAAVIDGDVNGDKKKFTYDLNFPEHADEHDFIP--RLW 570
Query: 585 AKQQID-LLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
A +++ LL E L+++V+ ++ + G++ YT +IVE + + ++S
Sbjct: 571 ATRRVGYLLDEIRLHGESSELKDEVTDLARKYGIVTPYTAYLIVEDESKRGVAQS 625
>gi|410666290|ref|YP_006918661.1| inter-alpha-trypsin inhibitor domain-containing protein [Simiduia
agarivorans SA1 = DSM 21679]
gi|409028647|gb|AFV00932.1| inter-alpha-trypsin inhibitor domain-containing protein [Simiduia
agarivorans SA1 = DSM 21679]
Length = 714
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 135/528 (25%), Positives = 232/528 (43%), Gaps = 76/528 (14%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP------- 212
AGK A++ +PN+FT + + G ++I+L +Q + YR G FS +P
Sbjct: 145 AGKRAALTEQQ--RPNLFTQKVANLAPGETVAIKLEITQPVDYRAGRFSWRLPTTLTPRY 202
Query: 213 ----------FKFPEYVTPAIKKIP--------------------------KREKIH--- 233
PE TPA +P +R H
Sbjct: 203 IPGQPLMGVSASLPESETPAEPTLPDAMTTDYFGWARPTTQVPDADKITPPQRHSAHSPG 262
Query: 234 ------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLG-YSYESEVLKWSNIDFDFSYTVS 286
L V G + +S+H D+ K G Y + + +D DF +
Sbjct: 263 LINPITLRVRLQAGLPLSNIAASYH----EIDLHKQGEYHHITTREPQVAMDRDFELHWT 318
Query: 287 PSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLED 346
P + G + V+ + LLPG + +DV FI+D SGSM G +E
Sbjct: 319 P--VVGAAPKAAFFTEMVEGENYGYLMLLPGQSPHTSNLPRDVTFIIDTSGSMSGPSIEQ 376
Query: 347 TKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELATKEAVERAHQWIGINFIAGGST 405
K++L +AL++L P D FN++ FN + +F S+ A+ + + RAH ++ N A G T
Sbjct: 377 AKSSLQLALARLQPSDRFNVIEFNSVYSQVFPASVP-ASSQNLARAHDFVR-NLQASGGT 434
Query: 406 NICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGI 465
+ L A+ + ++F +TDG+V +E+++ + +L G + R++T GI
Sbjct: 435 EMKPALEAALNQPASELWLQQLVF-ITDGSVGNEQELLSLIHHKL--GAT---RLFTVGI 488
Query: 466 GSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFE 525
GS N +F+R A RG + D++ + +M++LF + S+ L + ID + + E E
Sbjct: 489 GSAPNGFFMRKAAQFGRGDFVQIGDVNEVNQKMKQLFDKLESAALTQVKIDWPQAV-EVE 547
Query: 526 MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQL----AKDIPLD 581
+P+R+PDL + PL+V+ R +G F A ++ L L A +
Sbjct: 548 QWPARLPDLYAHQPLLVAARLRGPFVGQTIAVSGRSQAGDWRTTLTLTPNQPGAASKGVA 607
Query: 582 RICAKQQIDLLTAQAWFSEDK-RLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ A+ +ID L Q + + + +V +++ +L YT + VE
Sbjct: 608 TLWARAKIDALLDQLVTGKSREEVRNEVLPIALTHQLLSPYTSFVAVE 655
>gi|385333187|ref|YP_005887138.1| hypothetical protein HP15_3446 [Marinobacter adhaerens HP15]
gi|311696337|gb|ADP99210.1| protein containing a von Willebrand factor type A (vWA) domain
[Marinobacter adhaerens HP15]
Length = 704
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 135/505 (26%), Positives = 226/505 (44%), Gaps = 53/505 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT-PAIKKIPKRE 230
+PN+FT + I G +S+ L++ Q + Y+ G F + VP P Y+ A+ P +
Sbjct: 161 RPNLFTARVANIPPGETVSVELKYQQPVQYQAGVFELTVPTTLTPRYMPGKALSAAPDQW 220
Query: 231 KIHLNVNAGTGTEV---------------LCNTSSHHLKQLRRDVGKLGYSYESEVLKWS 275
+ V TEV + + S + L D G S S + +
Sbjct: 221 QGGWAVPT---TEVPDAVAISPFTVNPGDVADDSHRAVINLVIDSGLPLQSVSSPSHRLA 277
Query: 276 NIDFDFSYTVSPSHIFGGVLL--------------QSPS---LHDVDQREMFCMYLL-PG 317
+++V P GG +L Q PS H + E + M +L PG
Sbjct: 278 TSQDGQTFSVRPD---GGEILMDRDFVVRWRPVAGQEPSAAVFHQSWEGEDYLMAMLVPG 334
Query: 318 TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS 377
+ + +D++F++D SGSM G+ + ++AL L L P D FN++ FN +T+
Sbjct: 335 ESGA-MALPRDLVFVIDTSGSMAGESIRQARDALQAGLGTLTPRDRFNVIQFNSQTHSLF 393
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE--MLTNSRGSIPIIFLVTDGA 435
E+AT + RA Q++ A G T + L++A+E T + + +TDGA
Sbjct: 394 MQPEVATGNNLARARQYVD-RLRADGGTEMAPALSRALEGGGETEDGARVRQVIFITDGA 452
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V +E + ++ +L N R++T IGS N +F+R A RG Y A + ++
Sbjct: 453 VGNEAALFRQIRQQLGN-----QRLFTVAIGSAPNRHFMREAARWGRGTYTAIHSPSDVD 507
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q LF+ S VL +I ++ E +P R DL PLI R L+
Sbjct: 508 GPLQALFSAMESPVLTDIGVNWPGQKAGQESFPRRPGDLFQGEPLIHVVRGVPAM-GQLE 566
Query: 556 AKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQ 614
G L ++ L LQ A L R A+++ID L +A + + E +++++VQ
Sbjct: 567 VSGRLPGGRDWTRTLDLQQAAPATGLHRHWAREKIDSLEDEAKVTGREPDEAGLTELAVQ 626
Query: 615 TGVLCEYTRMIIVETDERNNASESP 639
G++ YT + V++ +SE+P
Sbjct: 627 HGLMSAYTSFVAVDSTP-ARSSEAP 650
>gi|427719440|ref|YP_007067434.1| Vault protein inter-alpha-trypsin domain-containing protein
[Calothrix sp. PCC 7507]
gi|427351876|gb|AFY34600.1| Vault protein inter-alpha-trypsin domain-containing protein
[Calothrix sp. PCC 7507]
Length = 815
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 142/556 (25%), Positives = 242/556 (43%), Gaps = 84/556 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+ NIFT +L I G + + +R+S +L + G N F FP V P + IP I
Sbjct: 111 RDNIFTQSLANIKPGEQIDVIIRYSDRLKFEGG----NYEFVFPMVVGP--RYIPG---I 161
Query: 233 HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGY---------SYESEVLKWSNIDFDFSY 283
+ +A G + +S+ L D +L S++ V N + +
Sbjct: 162 TIAEDAVRGGSAIAPMTSNQDTDLVPDASRLNAPILPSGMRSSHDINVTIEINAGVEIQH 221
Query: 284 TVSPSHIF------------------------------GGVLLQSPSLHDVDQRE-MFCM 312
SPSH G +Q+ +L D+R F +
Sbjct: 222 INSPSHQIQIIREGQLVNVKLGSGDTIPNKDLILRYQVAGNNIQTTTLTQADERGGHFAL 281
Query: 313 YLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG 371
YL+P + ++ KD++F++D SGS G+PL + + ++ L+P D+FNI+ F+
Sbjct: 282 YLIPALEYRPEEIVAKDMVFLMDSSGSQSGEPLMQCQELMRRFINGLNPDDTFNIIDFSD 341
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSRGSIPI 427
T S T + ++ + IN+I AGG T + + + + + G +
Sbjct: 342 TTQQLSPVPLANTSQ-----NRSLAINYINRLNAGGGTEMLGGIRTVLNLKATNPGRLQN 396
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
I L+TDG + +E QI +K RL G R+++FG GS N + L +A + RG
Sbjct: 397 IVLLTDGYIGNENQILAEVKQRLQPG----TRLHSFGAGSSVNRFLLNRIAELGRGIAHI 452
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR-- 545
+ ++ ++K F + + VLANI + D + MYPS PDL +E PL++ GR
Sbjct: 453 IRHDEPVDEVVEKFFCQINNPVLANIQLQWEGDGESPIMYPSTPPDLFAEQPLVLFGRKP 512
Query: 546 --YQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKR 603
+ GK T A G G L + + + ++ + +I L Q + K
Sbjct: 513 DAHSGKLHITGIAAG--GRRYQHSFHLNFPQSGNPAIAQLWGRSRIKDLMNQMVSGDTKA 570
Query: 604 LEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKI-LDSEVPKTI 662
E V+ ++ +L +YT + V D R N E D+ + + E+P+ I
Sbjct: 571 GVEAVTDTALAYQLLSQYTAFVAVSDDIRVNPHE-----------DYVSVQVPVEIPEAI 619
Query: 663 MLQNLGIGFGDLTATA 678
+ + + FG++TA+A
Sbjct: 620 IHEGI---FGNVTASA 632
>gi|406597659|ref|YP_006748789.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii ATCC 27126]
gi|406374980|gb|AFS38235.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas macleodii ATCC 27126]
Length = 640
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/411 (27%), Positives = 192/411 (46%), Gaps = 54/411 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT + I G + + L++ Q++ YRDG+F+ ++P P Y
Sbjct: 76 RPNLFTQQVANIAPGEEIMVTLQYVQQVDYRDGKFTFHLPTTLTPRYSPGIPLNQFNENI 135
Query: 219 ---------------------VTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLR 257
+TP +++ + ++ N TG + TS +H
Sbjct: 136 EAEISGTGWGEPTDQVPDARAITPFMREGNEGPQLTFNATLNTGLTLNSVTSRNHRVNWS 195
Query: 258 RDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHI-FGGVLLQSPSLHDVDQREMFCMYLLP 316
G ++ +K D + SPS + +S HD + L+P
Sbjct: 196 ESTGNYLVTFNQSNIKMDR-DIWLEWQPSPSSAPQAAIFTESKGQHDYA-----LVMLMP 249
Query: 317 GTAKSRKV--FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-T 373
KS+ + F +D+ F++D SGSM G+P+ D K +L +A+ +L D FN+VAFN + T
Sbjct: 250 PQVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTT 309
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
LF TS+E T+ + A ++ + AGG T + L A++ T ++ I + +TD
Sbjct: 310 RLFETSVEGTTRNK-QYARDFVK-HLNAGGGTEMAPALNAALKR-TTTKDFIKQVVFITD 366
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
GAV +E + +K+ L + R++T GIGS N YF+ A G Y +
Sbjct: 367 GAVGNEAALFSQIKNELGDA-----RLFTVGIGSAPNSYFMTRAAQFGLGSYVFVRNTAD 421
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDE-FEMYPSRIPDLSSESPLIVS 543
I+ QM L + S VL+++++ + E+YPS+IPDL + PL+++
Sbjct: 422 IKQQMDSLLYKLESPVLSDLSLTLPAGYAQSAEIYPSKIPDLYAGVPLLLN 472
>gi|428216851|ref|YP_007101316.1| NAD(+) ADP-ribosyltransferase [Pseudanabaena sp. PCC 7367]
gi|427988633|gb|AFY68888.1| NAD(+) ADP-ribosyltransferase [Pseudanabaena sp. PCC 7367]
Length = 902
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 138/587 (23%), Positives = 243/587 (41%), Gaps = 61/587 (10%)
Query: 94 DVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIA 153
DVD +D RV+ T + P+ D+ ++ +E +I + ++
Sbjct: 89 DVDAQVDGNIARVAVTQTFENPF-DMPLEAVYVFPLPDEAAVDDMEIKIGDRIIKGEIKR 147
Query: 154 LGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFS 208
E + A E T L + NIFT +L I G + + + ++ L + G +
Sbjct: 148 REEAQEIYEQARNEGRTAGLLEQERANIFTQSLANIRPGEQIDVTIYYTNSLEFEKGNYE 207
Query: 209 VNVPF-------------KFPEYVT-------PAIKKIPKREKI---------------- 232
P K P VT P ++P +I
Sbjct: 208 FVFPMVVGPRYIPGDPIGKSPAAVTDGVTQSLPDTTQVPDASRITPPVLKPGMRSGHDIN 267
Query: 233 -HLNVNAGTGTEVLCNTSSHHLKQLRRD-VGKLGYSYESEVLKWSNIDFDFSYTVSPSHI 290
L+++AG + + +TS H +K + V L S E + N D Y V+
Sbjct: 268 VKLSIDAGVAIQKVGSTS-HQIKMSQNGRVVNLSLSPEDTI---PNKDLIIRYQVANEQP 323
Query: 291 FGGVLLQSPSLHDVDQREMFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKN 349
VL +Q F YL+P KS ++ KDV+F++D SGS G+PL +K
Sbjct: 324 QTTVLTTK-----TEQGGHFATYLIPAIDYKSEEIVPKDVVFLMDTSGSQSGEPLIKSKE 378
Query: 350 ALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICA 409
+ +S L+P D+F I+ F T S+ T ++A +I A G T +
Sbjct: 379 LMRRFVSGLNPDDTFTIIDFANTTRQLSSIPLANTNSNRQKALNYIN-KLDANGGTELLN 437
Query: 410 PLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYC 469
+ + + G + + L+TDG + ++ +I A++ +L G R+Y+FG+GS
Sbjct: 438 GINAVLSFPSPEDGRVRSVVLITDGYIGNDTEILAAVQEKLKPGN----RLYSFGVGSSV 493
Query: 470 NHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPS 529
N Y L LA + RG ++ ++ ++K F + + VL NI ++ E+YP
Sbjct: 494 NRYLLDRLAEVGRGTTQVVRQDENTDLAVEKFFQQINNPVLTNIKVEWQGIGTAPEIYPQ 553
Query: 530 RIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNF--VVELKLQLAKDIPLDRICAKQ 587
PDL + PL++ GR + + LK G + + V+ + + + ++ +
Sbjct: 554 AAPDLFANQPLVLYGRKKDRISGNLKITGMMAGNKRYEKVIPVGFNNQEKGAIAQLWGRA 613
Query: 588 QIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
+I L Q + E L E + ++ +L +YT + V + R N
Sbjct: 614 RIKDLMNQMFSGETTTLVESATNTALSYNLLSKYTAFVAVSEEVRVN 660
>gi|319790717|ref|YP_004152357.1| nad(+) ADP-ribosyltransferase [Variovorax paradoxus EPS]
gi|315593180|gb|ADU34246.1| NAD(+) ADP-ribosyltransferase [Variovorax paradoxus EPS]
Length = 689
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 149/612 (24%), Positives = 264/612 (43%), Gaps = 64/612 (10%)
Query: 65 PDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCR 124
P + + PY + D PS+ L + E+ V A V V+ T+R G ++ + +
Sbjct: 42 PRLKTESPYFFVKSDDPSVDRLPLKGTEVAVKISGVIADVTVTQTYRNE---GQRAIEAK 98
Query: 125 IAVPMGDQGSILGVEAEISGKSYHTQLI----ALGENDGAGKSASVETGSFLK---PNIF 177
P + ++ G+ ++ + Q+ A E D A K +T + L+ PN+F
Sbjct: 99 YVFPGSTKAAVSGLNVRLADRLITAQIREKQQAQIEYDTAKKEG--KTAALLEQHLPNVF 156
Query: 178 TLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP---------AIKKIPK 228
+ + I G + + LR+++ L + G N F FP V P A K
Sbjct: 157 LMNVANILPGDDVKVELRYTELLVPQSG----NYEFVFPTVVGPRYNSPQSENAQAKWAA 212
Query: 229 REKIHLNVNAGTGTEVLCNTSS-HHLKQLRR-----DVGKLGYSYESEVLKWS------N 276
+ +H T ++ + LK++R DV K +++ + N
Sbjct: 213 QPTLHAGAAPSTSFKLKATIDTPMGLKEVRSTTHAIDVKKSDEDQHADIALAADGRPADN 272
Query: 277 IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDIS 336
DF Y ++ I G++L + + M P + + +D IF+VDIS
Sbjct: 273 RDFVLDYRLAGEKIESGLMLYKGQGENAENF-FLAMVEPPKAVAASAISPRDYIFVVDIS 331
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIG 396
GSM G PL+ K L + L P D+FN++ F+G + S AT+ +E+A I
Sbjct: 332 GSMHGFPLDTAKTVLERLIGGLRPSDTFNVLLFSGSNKMLSPKSVPATRANIEQALATIK 391
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSI 456
N+ GST + P K V + +VTDG V ER+ D ++S L+
Sbjct: 392 -NYSGSGSTELI-PALKRVYAEPKEENVSRTVVVVTDGYVTVEREAFDLVRSNLSKAN-- 447
Query: 457 CPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL-DSIEIQMQKL-FTRGFSSVLANIA 514
++ FGIGS N ++ I+R G + + D ++ Q F R S +
Sbjct: 448 ---VFAFGIGSSVNR---SLMEGIARAGMGEPFIITDPVQAPEQAARFRRMVESPVLTSV 501
Query: 515 IDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL---GDLSNFVVELK 571
T LD +++ P +PD+ E P+IV G+++G +AKG + G ++ +
Sbjct: 502 KATFGGLDVYDVEPQALPDVLGERPVIVFGKWRG------EAKGRVIIEGRSASGPYRQE 555
Query: 572 LQLAKDIPLD-----RICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMII 626
+++ LD + A+ +I L+ Q +E+++++ ++ +L +YT I
Sbjct: 556 VRIDDRTRLDTAALRTLWARHRIQSLSDQESLEGSAAFKERITELGLKYSLLTQYTSFIA 615
Query: 627 VETDERNNASES 638
V+ RN A ++
Sbjct: 616 VDKVVRNAAPQN 627
>gi|119493487|ref|ZP_01624154.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106]
gi|119452670|gb|EAW33850.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106]
Length = 843
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 130/558 (23%), Positives = 243/558 (43%), Gaps = 67/558 (12%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLP 182
P+ D+ ++ +E +I + + L E + A E T + L + NIFT +L
Sbjct: 119 PLPDEAAVDSLEIKIGDRIVKSILKKREEAQAIYEQAKTEGNTTALLEQERDNIFTQSLA 178
Query: 183 QIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAI---------KKIPKREKI 232
I G + +++R+++ L +++G + P P Y+ P I +++P +I
Sbjct: 179 NIKPGEQIEVKIRYTESLKFQEGNYEFVFPMVVGPRYI-PGILNNQQIRETQRVPDASRI 237
Query: 233 -----------------HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWS 275
L ++AG E + + S + Q R + ++ + ES+ +
Sbjct: 238 TPPVLPPGTRSGHDINVKLEIDAGILGEQITSPSHQIITQNRDNQIEVNLA-ESDTIP-- 294
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTA-KSRKVFKKDVIFIVD 334
N D Y VS LLQ+ D+ F YL+P K ++ KDV+F++D
Sbjct: 295 NKDLIIRYQVSDETTQLKTLLQAD-----DRGGHFATYLIPSLEYKQDEIVAKDVVFLID 349
Query: 335 ISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQW 394
SGS +G+PL +K + + L+P D+F+I+ F+ T S + T ++A +
Sbjct: 350 TSGSQRGEPLAKSKQLMRRFIQSLNPDDTFSIIDFSDTTTALSATPLTNTVTNQQKAIAY 409
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGG 454
I A G T + + ++ + + I L+TDG + +E ++ +K++L +G
Sbjct: 410 IN-QLEANGGTELLNGIQTVMDFPSPPVKRLRSIVLITDGYIGNENEVLSVVKNQLKSGN 468
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
R+Y+FG+GS N + L LA I RG + E Q++ + + VL N+
Sbjct: 469 ----RLYSFGVGSSVNRFLLNRLAEIGRGTTQITRPDEPTEDQVELFVNQINNPVLTNLQ 524
Query: 515 IDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL--GDLSNFVVELKL 572
+ + E+YP R+ DL PL++ GR + LK +G GD ++ +
Sbjct: 525 VGWEGGGEPVEIYPVRVADLFDHQPLVIFGRKTDRRNGKLKIRGITANGDRYEQIIPVNF 584
Query: 573 QLAKDIPLDRICAKQQIDL------------------LTAQAWFSEDKRLEEKVSKMSVQ 614
+ ++ + + QID L Q + E + + V++ ++
Sbjct: 585 TASTNVNIPESTSSTQIDTRFGNSAIAQLWGRAKIKHLMNQMFGGETRSGVDAVTQTALD 644
Query: 615 TGVLCEYTRMIIVETDER 632
+L +YT + V + R
Sbjct: 645 YQLLSQYTAFVAVSEEVR 662
>gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841]
gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 797
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 123/514 (23%), Positives = 232/514 (45%), Gaps = 59/514 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P + +
Sbjct: 172 RPNIFTNQVANIGPGETIVVQIEYQQTIHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 227
Query: 233 HLNVNAGTGTE-------------VLCNTSSHHLK------QLRR--------------D 259
N AG T VL + + LR D
Sbjct: 228 EFNNGAGFATPRDPVENRDKIEAPVLDPRENARINPVSLTVDLRAGFPLGDVKSSFHAVD 287
Query: 260 VGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
V + G + LK + DF+ ++ +P + L + +D + ++
Sbjct: 288 VSQDGDQARTISLKADTVPADKDFELTWKAAPGKMPSAGLFR----EVIDGKTYLLAFVT 343
Query: 316 PGTAKSRKV--FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
P TA K++V+F++D SGSM G +E + +LA+A+SKL+P D FN++ F+
Sbjct: 344 PPTAPDTAAPPAKREVVFVIDNSGSMSGPSIEQARQSLALAISKLNPDDRFNVIRFDDTM 403
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFLVT 432
+ + AT + E+A ++ A G T + L A+ + G++ + +T
Sbjct: 404 TDYFKGLVAATPDNREKAIAYVR-GLTADGGTEMLPALQAALRNQGPVATGALRQVVFLT 462
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DGA+ +ERQ+ + + ++ R++T GIGS N YF+ A I RG + A D
Sbjct: 463 DGAIGNERQLFQEITANRSDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTD 517
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP- 551
+ +M +LF + + + +I T + ++ + P+ +PDL S P++++ + P
Sbjct: 518 QVASRMGELFAKLQNPAMTDITA-TFEGIEAENITPNPMPDLYSGEPVVLTAELPEEKPA 576
Query: 552 DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLEEKVS 609
L+ G GD + VE+ + A D + ++ A+++ID A+A+ +D L++ +
Sbjct: 577 GKLQIIGKTGD-QPWRVEMDIANAADGSGISKLWARRKIDDFEARAYERQDPAALDKDIE 635
Query: 610 KMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+++ ++ T ++ V+ A + G+ K
Sbjct: 636 TVALAHHLVSRVTSLVAVDVTPSRPADQPLGSAK 669
>gi|85857862|ref|YP_460064.1| von Willebrand factor A [Syntrophus aciditrophicus SB]
gi|85720953|gb|ABC75896.1| von Willebrand factor type A like domain [Syntrophus aciditrophicus
SB]
Length = 785
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 143/594 (24%), Positives = 255/594 (42%), Gaps = 49/594 (8%)
Query: 72 PYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGD 131
PY + D PS+ L + A V+ A VRV+ +R G K + P
Sbjct: 38 PYFLVKSDDPSVDRLPLKATSAAVNISGVIADVRVTQRYRNE---GQKPLEAIYVFPAST 94
Query: 132 QGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDG 186
+ ++ G++ I + ++ E + A E + S L +PN+F + + I
Sbjct: 95 RAAVYGMKMTIGKRVIEAKIKKREEARRDYEQARDEGRSASLLEQQRPNVFQMNVANILP 154
Query: 187 GSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREKIHLN---------- 235
G + + L++++ L D + P P Y ++ P E+ N
Sbjct: 155 GDEIVVELKYTELLVPEDRVYEFVYPTVVGPRYSNTPAEQAPPSEQWSANPYLHSGESPS 214
Query: 236 ------VNAGTGTEVL-CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPS 288
VN TG V ++ SH +K + + + N D+ Y +
Sbjct: 215 YAFDIQVNLATGLPVRDLSSPSHKVKTSFQGPAQARVMLDPAEKSGGNRDYILRYRLDGD 274
Query: 289 HIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLED 346
I G+LL +E F + ++ P ++ ++ IFIVD+SGSM G PL+
Sbjct: 275 KIQSGLLLCE------GGKENFFLLMMQPPRRITQGEIPGREYIFIVDVSGSMHGFPLDI 328
Query: 347 TKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTN 406
+K LA + L P D FN++ F+G + L S AT E V +A I GG T
Sbjct: 329 SKKLLANLIGSLRPTDRFNVLLFSGGSSLMSEESLPATPENVRKAISVIEQQR-GGGGTE 387
Query: 407 ICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
I L +A+ + S ++ +VTDG V E ++ D ++ L N ++ FGIG
Sbjct: 388 ILPALKRALSLKKRENDSRTVV-IVTDGYVTVEEEVFDLIRKNLGNAN-----MFAFGIG 441
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEM 526
+ N + + +A + G ++ + + + + VL + +D K ++M
Sbjct: 442 TSVNRHLIEGMARVGMGEPLIIEKPEAADARADRFRRMIAAPVLTRVKVD-FKGFSAYDM 500
Query: 527 YPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP----LDR 582
P IPD+ +E P+I+ G+++GK T+ G G + L + A +P L
Sbjct: 501 EPLSIPDVLAERPVILFGKWRGKARGTVTLNGISGK-GPYSETLSVANASPLPGNAALRY 559
Query: 583 ICAKQQIDLLT-AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ A+ +I +L+ S+D+R++E V+ + + +L YT + V+ + R +
Sbjct: 560 LWARHRIAILSDYNRLHSDDQRIQE-VTNLGLTYNLLTAYTSFVAVDIEVRTDG 612
>gi|389776467|ref|ZP_10193956.1| hypothetical protein UU7_08538 [Rhodanobacter spathiphylli B39]
gi|388436617|gb|EIL93473.1| hypothetical protein UU7_08538 [Rhodanobacter spathiphylli B39]
Length = 653
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 133/516 (25%), Positives = 230/516 (44%), Gaps = 53/516 (10%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
V A+I K ++ N G ++A VE + N+F + + G ++I LR+
Sbjct: 108 VVADIQRKEAAKKVFEQARNSGK-RAALVEAD---EGNLFRTAVTNVAPGETVTIVLRYW 163
Query: 198 QKLSYRDGEFSVNVPFKF-PEYV-------------TPAI-----KKIPKREKIHLNVNA 238
Q++ Y G FS+ P F P Y TP + + P R I + ++A
Sbjct: 164 QRVDYGAGRFSLRFPLTFTPRYHMRDGARPGDAGIDTPQVFADDNAQAPLRTHIAVQLDA 223
Query: 239 GTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQS 298
G + ++SSH + R D ++ VL + DF S+ P
Sbjct: 224 GVPLASV-DSSSHAIISTRHDQRWDIRLRDASVLP--DRDFVLSWQPQPQ--------TQ 272
Query: 299 PSL----HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVA 354
P++ D+D + L+P ++++ +++I I+D SGSM G+ + + AL +A
Sbjct: 273 PNVASFRQDIDGAHYAMLMLMPPQQQAQR-LPRELILIIDTSGSMGGESIRQARAALDLA 331
Query: 355 LSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
LS+L PGD FN+ FN +S+ AT EA+++A W+ A G T + L A
Sbjct: 332 LSQLQPGDRFNVAEFNSTLNPWSSEAVAATPEALQKARAWVD-QLQANGGTEMAPALRFA 390
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ G + + TDGAV+D + + +L + R++ GIG+ N FL
Sbjct: 391 LAGHAPP-GYVRQVLFATDGAVDDPNGLMQLIDEQLGDS-----RLFPIGIGNAPNAGFL 444
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+ A RG D ++ M L R L ++ +D D +YP ++PDL
Sbjct: 445 QAAARHGRGSETLVTDTGNVGKAMVSLLARLDHPALRDLKVDWPAGSD---VYPRQLPDL 501
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLT 593
PL+V+ R D ++ G L + + + V L A+ LDR+ A+ +I L+
Sbjct: 502 YLGEPLLVTARLAAPV-DRVQVHGQLAERA-WTVPADLSAARPAAGLDRLWAQSRIADLS 559
Query: 594 AQ-AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
Q A + L E +++ ++ ++ YT ++ V+
Sbjct: 560 DQLARGGDAPELVEAITRTALDAHLVSAYTSLVAVD 595
>gi|307155059|ref|YP_003890443.1| Vault protein inter-alpha-trypsin domain-containing protein
[Cyanothece sp. PCC 7822]
gi|306985287|gb|ADN17168.1| Vault protein inter-alpha-trypsin domain protein [Cyanothece sp.
PCC 7822]
Length = 796
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 124/513 (24%), Positives = 213/513 (41%), Gaps = 65/513 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP------AIKKI 226
+ NIFT +L I G + + +R++ L++ G N F FP V P I++
Sbjct: 113 RDNIFTQSLANIKPGEQIDVIIRYTDSLNFEGG----NYEFVFPMVVGPRYIPGITIEES 168
Query: 227 P-----------------------------------KREKIHLNVNAGTGTEVLCNTSSH 251
P R I++ + G E+ +S
Sbjct: 169 PTGKGSAPSPMTQNQDTDLVPDASRLNAPIVPPGTRSRHDINVTLEIDAGVEIAAISSPS 228
Query: 252 HLKQL--RRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE- 308
H Q+ + + ++ + + N DF Y V+ VL Q D+R
Sbjct: 229 HQIQIESKERIKQVKLAGGDTI---PNKDFILRYQVAGDRTEAAVLTQG------DERGG 279
Query: 309 MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
F +YL+P + ++ KDV+F++D SGS G PL + + ++ L+P D+F+I+
Sbjct: 280 HFALYLIPAIKYQPSQIVPKDVVFLIDTSGSQMGAPLAQCQELMRHFINGLNPDDTFSII 339
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
F+ T S T + A +I AGG T + + + G +
Sbjct: 340 DFSDTTQQLSPVPLANTSQNRSLALNYIN-RLTAGGGTELMRGIRAVLNFPITDAGRLRS 398
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
I L+TDG + +E QI ++ L G R+Y+FG GS N + L +A I RG
Sbjct: 399 IVLLTDGYIGNENQILAEVQQHLKPGN----RLYSFGAGSSVNRFLLNRIAEIGRGLARI 454
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+ + +++ F + + VL NI L + + +YPS +PDL +E PL++ GR
Sbjct: 455 IRHDEPVNQVVEQFFRQINNPVLTNIQWQWLGEGESPVIYPSVLPDLFAEQPLVLFGRKS 514
Query: 548 GKFPDTLKAKGFLGDLSNF--VVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLE 605
+ L+ G + + EL Q + + ++ + +I L Q + K
Sbjct: 515 ARLAGKLQVSGIAAGGTGYQQTFELNFQETGNPAIAQLWGRSRIKALMNQMVQGDTKAGV 574
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
E V +++ +L +YT + V D R N E+
Sbjct: 575 EAVIDTALKYQLLSQYTAFVAVSDDVRVNPQEA 607
>gi|186686512|ref|YP_001869708.1| vault protein inter-alpha-trypsin subunit [Nostoc punctiforme PCC
73102]
gi|186468964|gb|ACC84765.1| Vault protein inter-alpha-trypsin domain protein [Nostoc
punctiforme PCC 73102]
Length = 818
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 133/516 (25%), Positives = 220/516 (42%), Gaps = 79/516 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+ NIFT +L I G + + +R+S L + G N F FP V P + IP
Sbjct: 112 RDNIFTQSLANIKPGEQIDVIIRYSDSLKFEGG----NYEFVFPMVVGP--RYIPGITIE 165
Query: 228 --------------------------------------KREKIHLNVNAGTGTEVL-CNT 248
R I++ + G E+ N+
Sbjct: 166 EDAVGGGSAIAPMTQNQDTDLVPDASRLNAPILPAGMRSRHDINVTIEINAGIEIQDINS 225
Query: 249 SSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE 308
SH ++ +R G+L N D Y V+ ++I Q+ +L D+R
Sbjct: 226 PSHQIQIIRE--GQLVNVKLGGGDTIPNKDLILRYQVASNNI------QTTTLTQADERG 277
Query: 309 -MFCMYLLPG-TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
F +YL+P + ++ KD++F++D SGS G+PL + + ++ L+P D+FNI
Sbjct: 278 GHFALYLIPAFEYRPDEIVAKDMVFLIDSSGSQSGEPLMQCQELMRRFINGLNPDDTFNI 337
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSR 422
+ F+ T S T A++ + IN+I AGG T + + + + +
Sbjct: 338 IDFSDTTQQLSPVPLPNTS-----ANRLLAINYINRLNAGGGTEMLGGIRTVLNLKATNP 392
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
G + I L+TDG + +E QI +K RL G R+++FG GS N + L +A + R
Sbjct: 393 GRLRNIVLLTDGYIGNENQILAEVKQRLQPGT----RLHSFGAGSSVNRFLLNRIAELGR 448
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G + ++ ++K F + + VLANI + D + MYPS PDL +E PL++
Sbjct: 449 GIARIIRHDEPVDEVVEKFFRQINNPVLANIQLQWEGDGESPIMYPSTPPDLFAEQPLVL 508
Query: 543 SGR----YQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF 598
GR + GK T A G G L + + ++ + +I L Q
Sbjct: 509 FGRKPDAHSGKLHVTGIAAG--GRRYQHSFHLNFPQTGNPAIAQLWGRSRIKDLMNQMVS 566
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
+ K E V+ ++ +L +YT + V D R N
Sbjct: 567 GDTKGGVEAVTDTALTYQLLSQYTAFVAVSDDIRVN 602
>gi|443477678|ref|ZP_21067505.1| NAD(+) ADP-ribosyltransferase [Pseudanabaena biceps PCC 7429]
gi|443017140|gb|ELS31651.1| NAD(+) ADP-ribosyltransferase [Pseudanabaena biceps PCC 7429]
Length = 859
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 130/557 (23%), Positives = 233/557 (41%), Gaps = 61/557 (10%)
Query: 119 KSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE--TGSFL---K 173
K + P+ DQ ++ +E +I ++ + E A E T L +
Sbjct: 126 KPLEAIYVFPLPDQAAVDDMEIKIGDRTIKGDIKKREEAKAIYDRARQEGRTAGLLEQER 185
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP------------ 221
NIFT +L I G + + +R+++ L + G++ F FP V P
Sbjct: 186 DNIFTQSLANIKPGEEIKVTIRYTESLKFERGDYE----FVFPLVVGPRYIPGQAIDDRG 241
Query: 222 AIKKIPKREKI-----------------HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLG 264
+ K+P I +L ++AG L +TS H L DV G
Sbjct: 242 STTKVPDAALITPPVVNPKTILAADVMVNLQIDAGVPIHNLYSTS-HRL-----DVKNNG 295
Query: 265 YSYESEVLKWSNI---DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTA-K 320
+ + ++ I D Y VS VL + ++ F YL+P A +
Sbjct: 296 ETVQIKLADGDTIPNKDLIVRYKVSGDRTAPAVLTTT-----TERGAHFATYLIPAIAYR 350
Query: 321 SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSM 380
++ KDV+F++D SGS G P+ ++ + ++ L+P D+F I+ F+ T S+
Sbjct: 351 PDQIVPKDVVFLMDTSGSQSGDPILKSRELMRRFINGLNPNDTFTIIDFSSTTRQLSSYP 410
Query: 381 ELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
T ++A +I A G T + + S G + + L+TDG + ++
Sbjct: 411 LKNTAANRQKAMNYID-QVDANGGTELMNGINAVTNFPAASDGRLRSVVLITDGYIGNDN 469
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
++ A++ L G R+Y+FG+GS N Y L +A + RG + + +K
Sbjct: 470 EVIAAVQKNLKPGN----RLYSFGVGSSVNRYLLERIAEMGRGTSRVVRQDEPTQEVAEK 525
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
F + + VL NI + + EMYPS PDL +E PL + G+ + L+ G
Sbjct: 526 FFRQINNPVLTNIQVQWEGEGTAPEMYPSNPPDLFAEQPLTIFGKKGDRANGNLRITGIA 585
Query: 561 GDLSNFVVELKLQLA---KDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGV 617
+ + + ++ + ++ + +I L Q + E K L + V++ ++ +
Sbjct: 586 AGGERYEQTINVNFGDRNSNLGIAQLWGRARIKDLMNQMFAGEVKSLVDAVTQTALDYRL 645
Query: 618 LCEYTRMIIVETDERNN 634
L +YT + V + R N
Sbjct: 646 LSQYTAFVAVSDEVRVN 662
>gi|163755380|ref|ZP_02162500.1| von Willebrand factor type A like domain [Kordia algicida OT-1]
gi|161324800|gb|EDP96129.1| von Willebrand factor type A like domain [Kordia algicida OT-1]
Length = 718
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 147/605 (24%), Positives = 271/605 (44%), Gaps = 69/605 (11%)
Query: 80 PPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVE 139
P ++IPL+ + + +V + A V ++ T++ G+ + + P+ Q ++ ++
Sbjct: 31 PNAVIPLK--STKANVKISGNIAHVHIAQTYQN---TGNIPIEAKYVFPLSTQAAVHKMQ 85
Query: 140 AEISGKSYHTQLIALGENDGAGKSASVETGSFLK-----PNIFTLTLPQIDGGSYLSIRL 194
I ++ + ++ E A E K PN+F + + I ++I +
Sbjct: 86 MTIGDRTINAKIFEKQEAQRVYDKALKEGKRAAKLDQHRPNVFQMNVGNIMKNDLVTIDI 145
Query: 195 RWSQKLSYRDGEFSVNVPFKFPEYVTP----------AIKKIPKREK-------IHLNVN 237
+++ L G N F FP V P + P EK +L+V
Sbjct: 146 YYTEMLVPLAG----NYEFVFPGVVGPRFTGENTSGETVFNQPYTEKGVGDTFNYNLDVQ 201
Query: 238 AGTGTEVL-CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLL 296
G + ++++H++K + K S SE SN DF Y++ + I G+LL
Sbjct: 202 INAGIPIADVSSNTHNIKVHYPNTRKAEISLTSENKNPSNRDFILKYSMRGNEIQSGLLL 261
Query: 297 QSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVA 354
+ E F Y++ P + + K+ K+ +F+VD+SGSM G P+E +K L
Sbjct: 262 YE------ENGEKFFAYMMEPPKASVNIKLTAKEYLFVVDVSGSMNGYPMEVSKKLLRNL 315
Query: 355 LSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAP 410
L L D +NI+ F G + + + TKE +++ GINF+ GG T +
Sbjct: 316 LVNLPETDHYNILLFAGGSSVLAPEPLACTKENIQK-----GINFLTNERGGGGTRLLNA 370
Query: 411 LTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCN 470
L A + + S + ++TDG V ER+ + ++ L ++TFGIGS N
Sbjct: 371 LKTAYALPRMDKTSARSMVVITDGYVSVERKAFEMIEQNLGQAN-----VFTFGIGSGVN 425
Query: 471 HYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR 530
Y L +A IS A +++ +K S +L I I + D +++ PS
Sbjct: 426 RYLLEGMAKISNSETFIATEMNEANDVAEKFRNYIKSPLLTQIRIKA-EGFDAYDVTPSS 484
Query: 531 IPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKL---QLAKDI-PLDRICAK 586
IPD+ + P++V G+Y+G+ TL G G+ F E ++ L+KD L + A+
Sbjct: 485 IPDVFASRPILVFGKYRGEAKGTLTITGQTGN-GVFQKEFRVTDGMLSKDNEALRYLWAR 543
Query: 587 QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSK 646
++I+ L D + + +V+++ ++ ++ +YT + V+ + N K G++
Sbjct: 544 KKIERLDDYKRRFGD-QTKHQVTQLGLKYNLVTQYTSFVAVDNEVVN--------KNGNR 594
Query: 647 KSDHQ 651
K+ Q
Sbjct: 595 KTVKQ 599
>gi|428215298|ref|YP_007088442.1| hypothetical protein Oscil6304_5018 [Oscillatoria acuminata PCC
6304]
gi|428003679|gb|AFY84522.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Oscillatoria acuminata PCC 6304]
Length = 863
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 141/572 (24%), Positives = 239/572 (41%), Gaps = 79/572 (13%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV--TPAIKK------ 225
NIFT +L I G + + +R+S L + G + + P P Y+ TP +
Sbjct: 115 NIFTQSLANIQPGEQIDVIIRYSASLQFEGGNYELVFPMVVGPRYIPGTPIGENAVGSGS 174
Query: 226 -------------IPKREK---------------IHLNVNAGTGTEVL-CNTSSHHLK-- 254
+P + I++ V G + + SHHL
Sbjct: 175 APGPMSQNQDTDLVPDAARLNAPILPAGTRSGHDINMTVEIDAGVNIQGVRSPSHHLHIS 234
Query: 255 --QLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCM 312
R VG +G S+ + N DF Y V VL Q DQ F +
Sbjct: 235 HDGQRVLVGLVG----SDTIP--NKDFILRYQVGSETTQSTVLSQQ-----NDQGGHFAL 283
Query: 313 YLLPGTAKS-RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG 371
YL+P + K+ KDV+F++D SGS G PL + + + L+P D+F+I+ F+
Sbjct: 284 YLIPALDYAPEKIVPKDVVFLIDTSGSQMGPPLRQCQELMRRFIQGLNPDDTFSIIDFSD 343
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLV 431
T S T + RA +I AGG T + + + G + I L+
Sbjct: 344 TTQQLSPVPLPNTPQNQSRAIAYIN-QLTAGGGTEMLRGIQAVLNFPVTDPGRLRSIVLL 402
Query: 432 TDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
TDG + +E QI ++ L +G R+Y+FG GS N + L +A + RG
Sbjct: 403 TDGYIGNENQILAEVQRHLKSGN----RLYSFGAGSSVNRFLLNRMAELGRGICRVIRHD 458
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP 551
++ + +++ F + + VLANI + + + +YP+ PDL +E PL++ GR
Sbjct: 459 EAPDETVERFFRQINNPVLANIQVTWEGEGESPVIYPAIAPDLFAEQPLVLFGRKADAVS 518
Query: 552 DTLKAKGFL--GDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS 609
TL+ G G + E+ Q + + ++ + +I L Q E K E V+
Sbjct: 519 GTLQVSGIAAGGVPYHQGFEMNFQSGGNPAVAQLWGRARIKALMNQMVSGERKSGVEAVT 578
Query: 610 KMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGI 669
+ ++ +LC+YT + V + R E+ + + + E+P+ + + +
Sbjct: 579 QTAIAYQLLCQYTAFVAVSEEVRVEHPEASVSVQ----------VPVEMPEGVNAEGV-- 626
Query: 670 GFGDLTATAENFPPGSEIPKLPEVAEIFVKAA 701
FG + A P P+LP+V ++ AA
Sbjct: 627 -FGSMGAVYAPSP-----PQLPQVLQVRRMAA 652
>gi|121602946|ref|YP_980275.1| vault protein inter-alpha-trypsin subunit [Polaromonas
naphthalenivorans CJ2]
gi|120591915|gb|ABM35354.1| Vault protein inter-alpha-trypsin domain protein [Polaromonas
naphthalenivorans CJ2]
Length = 701
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 134/602 (22%), Positives = 254/602 (42%), Gaps = 52/602 (8%)
Query: 65 PDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCR 124
P + + PY + D P + L + + +DV A V V+ ++ G ++ + R
Sbjct: 44 PRLKTESPYFFVKSDTPGVDALPLKSTSVDVQISGVIADVTVTQRYKNE---GQRAIEAR 100
Query: 125 IAVPMGDQGSILGVEAEISGK----SYHTQLIALGENDGA---GKSASVETGSFLKPNIF 177
P + ++ G+ ++ + + + A E D A GK+A++ PN+F
Sbjct: 101 YVFPASTRAAVYGMNVRLADRLITANIREKQQARIEYDAARNEGKTAALLEQHL--PNVF 158
Query: 178 TLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIH---- 233
+ + I G + + LR+++ L + G + F FP V P + +
Sbjct: 159 EMNVANILPGDDVKVELRYTELLVPQSGAYQ----FVFPTVVGPRYNSAQSSQALAQWVA 214
Query: 234 -------------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLK-----WS 275
++ T + S H + DV K G + L+ +
Sbjct: 215 QPFLPAGQASATAFDIKVKLATPIGIKEVSSHSHSI--DVTKDGDERAAVSLRSGDKPGN 272
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIV 333
N DF Y ++ I GV+L + + E F + ++ P ++ + +D IF+V
Sbjct: 273 NRDFILDYRLAGERIESGVMLYQGTPGNGASGENFFLAMIEPPKQVAAQAISPRDYIFVV 332
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
DISGSM G PL+ K + + KL P D+FN++ F+G S + AT+ +E+A +
Sbjct: 333 DISGSMHGFPLDTAKTLMRELIGKLRPSDTFNVLLFSGSNRFLSPASVPATQANIEQAVR 392
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
I+ + GG P K V + + +VTDG V ER+ + ++ L+
Sbjct: 393 --TIDEMGGGGGTELIPALKRVYAEPKAADVSRTVVVVTDGFVTVEREAFELVRRNLSQA 450
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI 513
+++FGIGS N + + LA G + Q ++ S VL ++
Sbjct: 451 N-----LFSFGIGSSVNRHLMEGLARAGMGEPFIITEPSQARAQAERFRRLIESPVLTSV 505
Query: 514 AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT--LKAKGFLGDLSNFVVELK 571
+ + LD +++ P+++PD+ E P++V G+++G + + G N V
Sbjct: 506 KL-RFEGLDVYDVEPAQLPDVLGERPVVVFGKWRGTPAGQVIIDGQSATGPYRNAVTVPA 564
Query: 572 LQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDE 631
L + A+ +I L+ Q ++++++ +Q +L +YT + V+
Sbjct: 565 QNGQNTAALRYLWARHRIASLSDQEALEGGDAHRQRITELGLQYNLLTQYTSFLAVDQVV 624
Query: 632 RN 633
RN
Sbjct: 625 RN 626
>gi|158337607|ref|YP_001518782.1| von Willebrand factor type A domain-containing protein
[Acaryochloris marina MBIC11017]
gi|158307848|gb|ABW29465.1| von Willebrand factor type A domain protein, putative
[Acaryochloris marina MBIC11017]
Length = 708
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 121/489 (24%), Positives = 208/489 (42%), Gaps = 43/489 (8%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+ N+FT +L I G + + +R++ L + G++ P P Y+
Sbjct: 169 RANLFTQSLANIVPGETIEVVIRYTNSLEFEGGDYEFVFPTVVGPRYIPGDQIDAAGNTT 228
Query: 220 --------TPAIKKIPKRE----KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSY 267
TP + +R I +N++AG L + S L + + ++ +
Sbjct: 229 RVADAAKITPPLLPPSQRSGNDISITVNLDAGVPIRNLRSPSHPILTSKKGEQTQVKLAN 288
Query: 268 ESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTA-KSRKVF 325
++ + N D Y V+ +L QS DQR F YL+P KS ++
Sbjct: 289 QTTI---PNKDLILRYQVASKQTQATLLTQS------DQRGGHFATYLIPALKYKSNQIV 339
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
KDV+F++D SGS G P+ ++ + L KL+P D+F+I+ F+ T S T
Sbjct: 340 PKDVVFLIDTSGSQSGPPIVQSRKLMTQFLDKLNPNDTFSIINFSNTTSKLSPKPLANTP 399
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
++A ++I A G T + + G + + L+TDG + D+ I A
Sbjct: 400 ANRKKALEYIK-KLDANGGTELMNGINTVAAFPPAPDGRLRSVVLLTDGLIGDDETIIAA 458
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
++ RL G RIY FG+G N + L LA + RG DS E K
Sbjct: 459 VRDRLKPGN----RIYPFGVGFSTNRFLLDRLAEVGRGTVEVVAPKDSAEKVAAKFVQTI 514
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSN 565
VL +I + + ++YP R+PDL + PL++ GR Q LK G + +
Sbjct: 515 NKPVLTDIEVSWVGPGKGPDIYPLRVPDLFANQPLVLHGRKQDSQSGKLKITGRIAGGKS 574
Query: 566 FVVEL--KLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTR 623
+ EL K + + + ++ + +I L Q + E ++V ++ +L +YT
Sbjct: 575 YEQELDVKFDASGNEAIAQLWGRNRIKSLMNQMYGRETDPAVKQVVDTALAYRLLSKYTA 634
Query: 624 MIIVETDER 632
+ V + R
Sbjct: 635 FVAVTEEIR 643
>gi|424882618|ref|ZP_18306250.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WU95]
gi|392518981|gb|EIW43713.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WU95]
Length = 795
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 123/503 (24%), Positives = 227/503 (45%), Gaps = 67/503 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 170 RPNIFTNQVANIGPGETIVVQIEYQQMIHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 225
Query: 228 ---------------------------KREKIHLN-----VNAGTGTEVLCNTSSHHLKQ 255
REK +N VN G + SS H
Sbjct: 226 EFNNGAGFATPRDPVENRDKIEAPVLDPREKARINPVALTVNLKAGFPLGDVKSSFHAVD 285
Query: 256 LRRDVGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
+ +D G + LK + DF+ ++ +P + L + +D +
Sbjct: 286 ISQD----GDQARTISLKADTVPADKDFELTWKAAPGKMPSAGLFR----EVIDGKTYLL 337
Query: 312 MYLLPGTAKSRKV--FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
++ P TA K++V+F++D SGSM G +E + +LA+A+SKL+P D FN++ F
Sbjct: 338 AFVTPPTAPDTAAPPAKREVVFVIDNSGSMSGPSIEQARQSLALAISKLNPDDRFNVIRF 397
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPII 428
+ + + AT + E+A ++ A G T + L A+ + G++ +
Sbjct: 398 DDTMTDYFKGLVTATPDNREKAIAYVR-GLTADGGTEMLPALQAALRNQGPVASGALRQV 456
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
+TDGA+ +E+Q+ + + N G R++T GIGS N YF+ A + RG + A
Sbjct: 457 VFLTDGAIGNEQQLFQEITA---NRGDA--RVFTVGIGSAPNTYFMTKAAEMGRGTFTAI 511
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
D + +M +LF + + + +I T + + + PS +PDL S P++++
Sbjct: 512 GSTDQVASRMGELFAKLQNPAMTDITA-TFEGIAAENITPSPMPDLYSGEPVVLTAELPE 570
Query: 549 KFP-DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLE 605
+ P L+ G GD + VE+ + A D + ++ A+++ID L A+A+ +D L+
Sbjct: 571 ERPAGKLQIVGKTGD-QPWRVEMDIANAADGSGISKLWARRKIDDLEARAYERQDPAALD 629
Query: 606 EKVSKMSVQTGVLCEYTRMIIVE 628
+ + +++ ++ T ++ V+
Sbjct: 630 KDIETVALAHHLVSRVTSLVAVD 652
>gi|434404048|ref|YP_007146933.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Cylindrospermum stagnale PCC 7417]
gi|428258303|gb|AFZ24253.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Cylindrospermum stagnale PCC 7417]
Length = 841
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 134/522 (25%), Positives = 218/522 (41%), Gaps = 85/522 (16%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+ NIFT +L I G + + +R+S L + G N F FP V P + IP
Sbjct: 112 RDNIFTQSLANIKPGEQIDVIIRYSDSLKFEGG----NYEFVFPMVVGP--RYIPGITIE 165
Query: 228 --------------------------------------KREKIHLNVNAGTGTEVLCNTS 249
R I++ + G E+ +S
Sbjct: 166 DNAVGGGSAIAPMTLNQDTDLVPDASRLNSPILPAGMRSRHDINVTLEINAGVEIQDISS 225
Query: 250 SHHLKQLRRDVG----KLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVD 305
H Q+ R+ KLG + N D Y V+ ++ LQ+ +L D
Sbjct: 226 PSHQIQIIREEQLVQVKLGGGH-----TIPNKDLILRYQVASNN------LQTTTLTQAD 274
Query: 306 QRE-MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
+R F +YL+P + +V KDV+F++D SGS G PL + + ++ L+P D+
Sbjct: 275 ERGGHFALYLIPALQYRPEQVVPKDVVFLIDTSGSQGGAPLMQCQELMRRFINGLNPQDT 334
Query: 364 FNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLT 419
F+I+ F T S T + ++ + IN+I AGG T + + +
Sbjct: 335 FSIIDFADTTQQLSPVPLPNTSQ-----NRALAINYINRLNAGGGTEMLGGIRTVLNFPV 389
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+ G + I L+TDG + +E QI +K RL G R+++FG GS N + L +A
Sbjct: 390 TNSGRLRSIVLLTDGYIGNENQILAEVKQRLQPGT----RLHSFGAGSSVNRFLLNRIAE 445
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+ RG + ++ ++K F + + V ANI + D + MYPS PDL +E P
Sbjct: 446 LGRGIARIIRHDEPVDEVVEKFFRQINNPVFANIQLQWEGDGESPIMYPSTPPDLFAEQP 505
Query: 540 LIVSG----RYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
L++ G R+ GK T A G G L Q + + ++ + +I L Q
Sbjct: 506 LVLFGRKPDRHSGKLHITGIAAG--GRRYQHSFNLDFQERGNPAIAQLWGRSRIKDLMNQ 563
Query: 596 AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
+ K V+ ++ +L +YT + V D R N S+
Sbjct: 564 MVSGDTKSGVAAVTDTALTYQLLSQYTAFVAVSDDVRVNPSQ 605
>gi|444913780|ref|ZP_21233927.1| hypothetical protein D187_06097 [Cystobacter fuscus DSM 2262]
gi|444715338|gb|ELW56207.1| hypothetical protein D187_06097 [Cystobacter fuscus DSM 2262]
Length = 850
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 129/511 (25%), Positives = 232/511 (45%), Gaps = 69/511 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEYV--TPA------- 222
+PN+FT + + G + + + Q L +G +P P Y+ TP
Sbjct: 105 RPNVFTAQVGNLLPGEETRVEVEFLQVLQVEEGSLCWVLPTLVAPRYIPGTPTGDRTSHG 164
Query: 223 ----IKKIPKREKIH-----------LNVNAGTGTEVLCNTSSHHLKQLRRDVG------ 261
++P ++I L + G EV+ + SH L+ R G
Sbjct: 165 VAEPTSRVPDADRITPPVGDAPYGLTLELLVSLGREVVVESPSHALQLTRTGSGTRVTLS 224
Query: 262 KLGYSYESE-VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLP---G 317
+ G + + VL + D D ++T +H G D F + ++P G
Sbjct: 225 QPGVVLDRDLVLNIRSPDTDAAFTPLVTHRQG------------DAPGTFALTVVPDLLG 272
Query: 318 TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS 377
A + + +++V+F+VD SGSM G+ L + AL + L L GD FNI+AF +LFS
Sbjct: 273 MAGAPR--RQEVVFVVDTSGSMDGESLPQAQGALRLCLRHLREGDRFNIIAFENAFHLFS 330
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE 437
T++ +E+A +W+ A G T + PL AVE + ++ L+TDG V
Sbjct: 331 PQPVPFTQKTLEQADRWVAA-LHAHGGTELLEPLRAAVEAMPEG-----VVVLLTDGQVG 384
Query: 438 DERQICDA-MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+E +I A + +R T R+Y+FGIG+ + L+ LA + G + + I+
Sbjct: 385 NESEILQAVLAARRTA------RVYSFGIGTNVSDALLKDLARQTGGAVEFIHPGERIDD 438
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
++ F+R + + ++ + + ++ E+ P+ +P L +P + GRY T+
Sbjct: 439 KVVAQFSRALAPRVTDVEV-RFEGVEATELAPAELPPLVDGTPWSLFGRYTTPGTGTVVL 497
Query: 557 KGFLGDLS-NFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSED-KRLEEKVSKMSV 613
KG G S + + + D P ++++ A ++I A A + L+E++ ++++
Sbjct: 498 KGRAGAESFSLAIAVNFPATSDRPAVEKLWAAERIRGWQAAALVGRRAEALKERILQLAL 557
Query: 614 QTGVLCEYTRMIIVE--TDERNNASESPGTK 642
G++ YT ++VE T ER AS P T+
Sbjct: 558 AHGLVTPYTSFVVVEERTGER-RASAQPETR 587
>gi|379728477|ref|YP_005320673.1| vault protein, inter-alpha-trypsin domain-containing protein
[Saprospira grandis str. Lewin]
gi|378574088|gb|AFC23089.1| vault protein, inter-alpha-trypsin domain-containing protein
[Saprospira grandis str. Lewin]
Length = 728
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 208/467 (44%), Gaps = 23/467 (4%)
Query: 170 SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKR 229
+ +F + + I+ S I L + Q L DG S +P ++ I+ I R
Sbjct: 121 EYYNQELFKVRIFPIEPRSEQKIELTYHQALKKDDGGLSYRLPMNSAKFSAKPIQNISLR 180
Query: 230 EKIHLNVNAGTGTEVLCN--TSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSP 287
K+ + + N + SH ++ +R+ +E + ++ + DF+ +
Sbjct: 181 LKLK-------NQQAIKNVYSPSHEIELIRKGEKAASVGFEQKNVR-PDRDFELYWRAED 232
Query: 288 SHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
I +L D F + L PG + V KDV+F+ D SGSM GK +E
Sbjct: 233 KAISSSLLQYQKGKEDG----YFFLNLSPGWTDPKAVMAKDVVFVFDKSGSMSGKKMEQA 288
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNI 407
+ AL + L P D F ++ ++ E ++ +K E A ++I A G TNI
Sbjct: 289 QKALKFCVDNLGPEDRFELIPYSTEAQSLFGQLKSNSKTNREEAKEYID-ELRAIGGTNI 347
Query: 408 CAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGS 467
L A++ + +TDG A+ +L RI+TFGIGS
Sbjct: 348 EEALQMALDRKEKKAKRPFFVIFMTDGKPTIGEIEPQALLDKLAGYQKDQVRIFTFGIGS 407
Query: 468 YCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMY 527
N L + +S+GY A + + +E+++ + + S VL+NI + K++ ++Y
Sbjct: 408 DINTKLLDQMTEMSKGYRDYALEDEDLELKLSNFYLKAASPVLSNIELVFDKNVKVEQLY 467
Query: 528 PSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAKD----IPLDR 582
P + DL L + GRY+G+ P L+ K + G+ F +L + ++ IP
Sbjct: 468 PRKTEDLFRGESLNIMGRYKGEGPAKLQLKALVNGEEKTFEFQLDFKKKQEEHSFIP--D 525
Query: 583 ICAKQQIDLLTAQAWFS-EDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ A + + L Q F+ E + L+++V +++ + G++ YT +IVE
Sbjct: 526 LWALRAVGYLLDQIRFNGEQEELKQEVIRLAKKHGIITPYTSYLIVE 572
>gi|88858354|ref|ZP_01132996.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2]
gi|88819971|gb|EAR29784.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2]
Length = 684
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 135/519 (26%), Positives = 232/519 (44%), Gaps = 76/519 (14%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
GK AS+ + +PN+F + I+ G + + L + Q L Y GEF+V +FP V
Sbjct: 132 GKKASLVSQK--RPNLFVSQVANIEAGQTVEVTLVYQQLLHYEQGEFTV----RFPMLVA 185
Query: 221 PAIKKIPKREKIH--LNVNAGTGTEVLCNTSSHHLKQLRRDVGK-LGYSYESEVLKWS-- 275
P + PKR L N + +E+ TS+ + +L K +G +K S
Sbjct: 186 PRYQ--PKRLVFDADLQGNWPSASEI---TSAPFIDELETQSAKAVGQGMSDAKIKQSVS 240
Query: 276 ---NIDFDFSYTVSPSH-----IFGG------------------VLLQSPSLHDVDQREM 309
N+ F+ +SP H + G VL P Q +
Sbjct: 241 IALNLGFELDTIMSPYHEINQQLIGNNHYQVSLKQGTTFANRDFVLRVKPKNQAAIQAAV 300
Query: 310 F----------CMYLLPGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
F + L+P + + + + ++VIF++D SGSM G+ LE K+AL AL+
Sbjct: 301 FKEHFENDDYALVMLMPPSDEFIAAQRLPREVIFVIDTSGSMHGESLEQAKSALFFALAN 360
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
LDP DSFNI+ FN + + A + RA ++ A G T I + ++
Sbjct: 361 LDPQDSFNIIEFNSKVNALNAQALPANDFNIRRARNFV-YGLKADGGTEIGLAFEQVLDN 419
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
++ I+FL TDG++ +E ++ +K L + RI+T GIGS N YF+
Sbjct: 420 SEHADYLRQIVFL-TDGSISNETEVFAQIKGSLGDS-----RIFTIGIGSAPNSYFMTRA 473
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPD 533
A + RG + D+ ++ M+ LF + LAN A+ L DE + +P I D
Sbjct: 474 ATLGRGTFTFIGDVTDVQRTMKNLFVQ-----LANAALKELIITDENGDALDFWPKPIAD 528
Query: 534 LSSESPLIVSGRYQ-GKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLL 592
L ++V+ + G+ ++ + G + + + Q +I R+ A+QQI L
Sbjct: 529 LYFNQAMMVAIKLNAGQNQINVRGQQAFGQFNAQFMLGQAQTGSNIA--RLWARQQIQSL 586
Query: 593 TAQAWFSEDKR---LEEKVSKMSVQTGVLCEYTRMIIVE 628
+ Q ++ +K + +K+ ++++ +L YT + V+
Sbjct: 587 SMQQVYTANKEHDPIADKILTLALKYQLLSPYTSFLAVD 625
>gi|359459134|ref|ZP_09247697.1| von Willebrand factor type A domain-containing protein
[Acaryochloris sp. CCMEE 5410]
Length = 708
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 122/489 (24%), Positives = 207/489 (42%), Gaps = 43/489 (8%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+ N+FT +L I G + + +R++ L + G++ P P Y+
Sbjct: 169 RANLFTQSLANIVPGETIEVVIRYTNSLEFEGGDYEFVFPTVVGPRYIPGDQIDAAGNTT 228
Query: 220 --------TPAIKKIPKRE----KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSY 267
TP + +R I +N++AG L + S L + ++ +
Sbjct: 229 RVTDAAKITPPLLPPSQRSGNDISITVNLDAGVPIRNLRSPSHPILTSKKGQQTQVKLAN 288
Query: 268 ESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTA-KSRKVF 325
+ + N D Y V+ +L QS DQR F YL+P KS ++
Sbjct: 289 QKTI---PNKDLILRYQVASKQTQATLLTQS------DQRGGHFATYLIPALKYKSNEIV 339
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
KDV+F++D SGS G P+ ++ +A L KL+P D+F+I+ F+ T S T
Sbjct: 340 PKDVVFLIDTSGSQSGPPIVQSRKLMAQFLDKLNPNDTFSIINFSNTTSKLSPKPLANTP 399
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
++A +I A G T + + G + + L+TDG + D+ I A
Sbjct: 400 ANRKKALDYIK-KLDANGGTELMNGINTVAAFPPAPDGRLRSVVLLTDGLIGDDETIIAA 458
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
++ RL G RIY FG+G N + L LA + RG DS E K
Sbjct: 459 VRDRLKPGN----RIYPFGVGFSTNRFLLDRLAEVGRGTVEVVAPKDSAEKVAAKFVKTI 514
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL--GDL 563
VL +I + + ++YP R+PDL + PL++ GR Q LK G + G
Sbjct: 515 NKPVLTDIEVSWVGPGKGPDIYPLRVPDLFANQPLVLHGRKQDGQSGKLKITGRMAGGKP 574
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTR 623
V+++K + + + ++ + +I L Q + E ++V ++ +L +YT
Sbjct: 575 YKQVLDVKFDASGNEAIAQLWGRNRIKSLMNQMYGRETDPAVKQVLDTALAYRLLSKYTA 634
Query: 624 MIIVETDER 632
+ V + R
Sbjct: 635 FVAVTEEIR 643
>gi|424840768|ref|ZP_18265393.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Saprospira grandis DSM 2844]
gi|395318966|gb|EJF51887.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Saprospira grandis DSM 2844]
Length = 728
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 206/467 (44%), Gaps = 23/467 (4%)
Query: 170 SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKR 229
+ +F + + I+ S I L + Q L DG S +P ++ I+ I R
Sbjct: 121 EYYNQELFKVRIFPIEPRSEQKIELTYHQALKKDDGSLSYRLPMNSAKFSAKPIQNISLR 180
Query: 230 EKIHLNVNAGTGTEVLCN--TSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSP 287
K++ + + N + SH ++ +R+ +E + ++ + DF+ +
Sbjct: 181 LKLN-------NKQAIKNVYSPSHEIELIRKGEKAASVGFEQKNVR-PDRDFELYWRAED 232
Query: 288 SHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
I +L D F + L PG + V KDV+F+ D SGSM GK +E
Sbjct: 233 KAISSSLLQYQKGKEDG----YFFLNLSPGWTDPKAVMAKDVVFVFDKSGSMSGKKMEQA 288
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNI 407
+ AL + L P D F ++ ++ E ++ +K E A ++I A G TNI
Sbjct: 289 QKALKFCVDNLGPEDRFELIPYSTEAQSLFGQLKSNSKTNREEAKEYID-ELRAIGGTNI 347
Query: 408 CAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGS 467
L A+ + +TDG A+ +L RI+TFGIGS
Sbjct: 348 EEALQMALNRKEKKAKRPFFVIFMTDGKPTIGEIEPQALLDKLAGYQKDQVRIFTFGIGS 407
Query: 468 YCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMY 527
N L + +S+GY A + + +E+++ + + S VL+NI + K + ++Y
Sbjct: 408 DINTKLLDQMTEMSKGYRDYALEDEDLELKLSNFYLKAASPVLSNIELVFDKKVKVEQLY 467
Query: 528 PSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAKD----IPLDR 582
P + DL L + GRY+G P L+ K + G+ F +L + ++ IP
Sbjct: 468 PRKTEDLFRGESLNIMGRYKGDGPTKLQLKALVNGEEKTFEFQLNFKKKQEEHSFIP--D 525
Query: 583 ICAKQQIDLLTAQAWFS-EDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ A + + L Q F+ E + L+++V +++ + G++ YT +IVE
Sbjct: 526 LWALRAVGYLLDQIRFNGEQEELKQEVIRLAKKHGIITPYTSYLIVE 572
>gi|120556589|ref|YP_960940.1| vault protein inter-alpha-trypsin subunit [Marinobacter aquaeolei
VT8]
gi|120326438|gb|ABM20753.1| Vault protein inter-alpha-trypsin domain protein [Marinobacter
aquaeolei VT8]
Length = 712
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 137/564 (24%), Positives = 235/564 (41%), Gaps = 100/564 (17%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGE--------NDGAGKSASVETGSFLKPNIFTL 179
P+ +Q S+ G+ + + QL + K+A+VE +PN+FT
Sbjct: 122 PLPEQTSVYGLTMTVGERRIVGQLQPRAQARQNYEKAKQAGQKAATVEQN---RPNLFTS 178
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------TPA---------- 222
+ I G +++ +++ Q ++YR GEF + +P P Y+ TPA
Sbjct: 179 RIANIAPGEEVTVEVQYQQPVNYRHGEFELRLPTTLTPRYMPGAPVATPASAWQSGWSLP 238
Query: 223 -------------------IKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKL 263
+ R I L++ AG + + TS H Q+ + +
Sbjct: 239 TTQVADADEISPFTVLPDDVNPGSHRATIQLDIEAGLPVDEV--TSPSHPLQVELEGSRA 296
Query: 264 GYSYES-EVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS---LHDVDQREMFCMYLLPGTA 319
S E ++L +D D P+ Q+P+ Q E F M ++ A
Sbjct: 297 TVSPEQGQIL----MDRDVIVRWRPADN------QAPTAALFRQQWQGEDFLMAMVMPPA 346
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
+ +V +++++F++D SGSM G+ + ++AL L L PGD FN++ FN + + T
Sbjct: 347 TTGQVLRRELLFVIDTSGSMAGESIRQARSALLRGLDTLRPGDRFNVIQFNSQAHALYTQ 406
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM-LTNSRGSIPIIFLVTDGAVED 438
A + RA ++ + A G T + L+ A+ M + S G + + +TDGAV +
Sbjct: 407 PVPANGHYLARARDYVQ-DLTADGGTEMAGALSLAMGMDGSESSGHVQQMVFMTDGAVGN 465
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
E + D +++ L N R++T IGS N +FLR A RG Y A + ++ +
Sbjct: 466 ESALFDQIRTGLGNR-----RLFTVAIGSAPNMHFLREAARWGRGQYTAVHSAAEVDKAL 520
Query: 499 QKLFTRGFSSVLANIAID---------TLKDLDEFEMYP--SRIPDLSSESPLIVSGRYQ 547
KLF + V+ ++ + K D F P + SE L VSGR
Sbjct: 521 GKLFAAMEAPVMTDVEVQWPGNAAQPVPAKPGDLFHGQPLLQVVRGAPSEGELTVSGRLP 580
Query: 548 GKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP---LDRICAKQQIDLLTAQAWFSEDKRL 604
G L LA P LDR A+ +ID + A + +
Sbjct: 581 G----------------GRSWRTSLDLASAAPGKGLDRQWARGRIDAVMDSARLAGTEPD 624
Query: 605 EEKVSKMSVQTGVLCEYTRMIIVE 628
E + ++S+ GV+ +T + +E
Sbjct: 625 EAAIVELSLSHGVMSPFTSFVAIE 648
>gi|373488618|ref|ZP_09579282.1| Vault protein inter-alpha-trypsin domain-containing protein
[Holophaga foetida DSM 6591]
gi|372005563|gb|EHP06199.1| Vault protein inter-alpha-trypsin domain-containing protein
[Holophaga foetida DSM 6591]
Length = 770
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 139/618 (22%), Positives = 264/618 (42%), Gaps = 63/618 (10%)
Query: 59 PKIVDNPDIPSYQPYVHGRCDPPSL--IPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVM 116
PKI + + P+ + + PS+ +PL+ +V+ D+ + A VRV+ +R
Sbjct: 40 PKIPEGKPEATLSPFFWVKSEDPSVDQLPLKSTSVKTDIAGVI--ADVRVTQVYRNE--- 94
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIA--LGENDGAGKSASVETGSFL-- 172
G + P + ++ G++ I ++ ++ D A + S L
Sbjct: 95 GRNPIEAVYIFPASTRAAVYGMKMTIGERTLVAKIREKEQARRDYEQARAEGRSASLLEQ 154
Query: 173 -KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKR-- 229
+PN+F + + I G + + L +++ L +G + F +P V P P
Sbjct: 155 QRPNVFQMNVANIMPGDEIRVELSYTELLVPTNGTYE----FVYPTVVGPRYSNTPSSRA 210
Query: 230 ---EKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVG-----------KLGYSYE------- 268
EK N G L S Q+R G K G +Y+
Sbjct: 211 DASEKWVANPYTNEGEAPL----SAFAIQVRLAAGLPIQHITCGTHKTGVTYDGPSDASV 266
Query: 269 ---SEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTA-KSRKV 324
+ N DF Y ++ I G+LL + + F + + P A K +
Sbjct: 267 KLDTSETHGGNRDFILKYQLTGDQIQSGLLLAKGA-----EENFFLLQVQPPKAVKPAMI 321
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS-TSMELA 383
++ IFI+D+SGS G P+E +K + +S L P D FN+ F + L+S A
Sbjct: 322 PPREFIFIMDVSGSQMGFPIEISKTLMEDLVSHLRPQDRFNVEVFESGSALWSPEGSRPA 381
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
+E ++ A +IG GG T I + L A+ L + G + TDG V E +
Sbjct: 382 NRENIQDALAFIGQQR-GGGGTEIISALRHAL-ALPRAEGMSRTFVISTDGYVSVEPGVL 439
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
D ++ L GG+ ++ FG+GS N + + +A G + Q ++
Sbjct: 440 DVIRENL--GGA---NMFAFGVGSSVNRHLIEGMAHAGMGEPFVLTNPAEAPAQAERFRN 494
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD- 562
S VL N+ + + +++ P ++PDL +E P+I G+++G +++ G G+
Sbjct: 495 YIASPVLTNVKLTFNGNFQAYDVEPLQVPDLLAERPVICFGKWKGGTTGSVEVTGISGNG 554
Query: 563 --LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCE 620
+F + + ++ L + A+ +I L+ A + + L++ ++++ ++ G+L
Sbjct: 555 PFRQSFEISKAKVMDQEASLRNLWARHRIQLMGDYAGLGDAEGLKKSITELGLKYGLLTN 614
Query: 621 YTRMIIVETDERNNASES 638
YT + V++ RN++ ++
Sbjct: 615 YTSFVAVDSQIRNHSGQA 632
>gi|315127492|ref|YP_004069495.1| inter-alpha-trypsin inhibitor domain-containing protein
[Pseudoalteromonas sp. SM9913]
gi|315016006|gb|ADT69344.1| inter-alpha-trypsin inhibitor domain-containing protein
[Pseudoalteromonas sp. SM9913]
Length = 666
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 134/563 (23%), Positives = 251/563 (44%), Gaps = 84/563 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDG-------AGKSASVETGSFL 172
+ + R P+ D+ ++ + I + H ++ + + AGK AS+
Sbjct: 79 AVNARYVFPLPDESAVQAMTMRIGERVIHGKIAKKHQAEKQYQTAKQAGKQASLVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREK 231
+ N+F + I G + I L + + + Y+ G F+++ P P Y K +++K
Sbjct: 137 RANMFITNVANIGPGEEIVITLEYQEIIDYQHGTFAIHFPTTITPRYHATTGKLNEEQDK 196
Query: 232 ---------------------------------IHLNVNAGTGTEVLCNTSSHHLKQLRR 258
+ L +N G E++ + +H +
Sbjct: 197 RTVKSDHTSSFTPQGWLSPIYTKQTNEAANASDLSLTINMDIGLELVDINAKYH----QV 252
Query: 259 DVGKLGYSYESEVLK-WSNIDFDFSYTVSP---SHIFGGVLLQSPSLHDVDQREMFCM-Y 313
++ + S VLK I+ DF P + + Q E F +
Sbjct: 253 NIENSAFGQYSIVLKPQQAINKDFVLEFKPLQKAQAQAALFTQ-----QAKNGEQFALAM 307
Query: 314 LLPGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG 371
L+P + + + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ FN
Sbjct: 308 LMPPSDQFTQSERLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGFNN 367
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLV 431
+ S + +A+ + RA ++I N A G T I L AV G + + +
Sbjct: 368 QVTAMSDTPLVASDFNLRRARRFI-YNLQADGGTEIQGAL-DAVLNGAQFEGFVRQVVFL 425
Query: 432 TDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY---GAA 488
TDG+V +E ++ ++ L + R++T GIGS N +F+R A I +G Y G+
Sbjct: 426 TDGSVSNEDELFKSIARTLGDS-----RLFTVGIGSAPNRFFMRRAADIGKGSYTFIGST 480
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
+D ++ +MQ+LF + + N+A+ T + D + +PS +PDL P++V+ +
Sbjct: 481 FD---VQPKMQQLFDKLAHPAMTNLAL-TNTNGDSLDFWPSPLPDLYFAEPIMVAIKL-- 534
Query: 549 KFPDTLKAKGFLGDLSN--FVVELKLQL-AKDIPLDRICAKQQIDLLTAQAWFSEDKRLE 605
D F G ++ F V+L Q +K +D++ A+Q+I L +++ ++
Sbjct: 535 ---DNTDTITFTGQTTHGEFKVKLNSQASSKATGIDKLWARQKIKSLL---LYNQQSEVK 588
Query: 606 EKVSKMSVQTGVLCEYTRMIIVE 628
++V +++++ +L +T I +E
Sbjct: 589 QQVQQLALEHQLLSPFTAFIAIE 611
>gi|86358602|ref|YP_470494.1| hypothetical protein RHE_CH03000 [Rhizobium etli CFN 42]
gi|86282704|gb|ABC91767.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 780
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 126/517 (24%), Positives = 238/517 (46%), Gaps = 66/517 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------- 221
+PNIFT + I G + +++ + Q + GEFS+ +FP V P
Sbjct: 156 RPNIFTNQVANIGPGEEIVVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 211
Query: 222 --------AIKKIP--KREKIH----------------LNVNAGTGTEVLCNTSSHHLKQ 255
AI + P REKI L VN G + SS H
Sbjct: 212 DFNNGAGFAIPRDPVQNREKIEAPVLDPRENARINPVSLTVNLKAGFPLGEVKSSFH--- 268
Query: 256 LRRDVGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
D+G+ G + LK + DF+ ++ +P L + +D +
Sbjct: 269 -DVDIGQDGDQARTISLKGDAVPADKDFELTWKAAPGKTPSAGLFR----EAIDGKTYLL 323
Query: 312 MYLLPGTAKSRKVF-KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
++ P TA K++V+F++D SGSM G+ +E + +LA+A+S+L+P D FN++ F+
Sbjct: 324 AFVTPPTAPDAAAPSKREVVFVIDNSGSMSGQSIEQARQSLALAISRLNPNDRFNVIRFD 383
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIF 429
+ + AT + E+A ++ + A G T + L A+ + G++ +
Sbjct: 384 DTMTDYFKGLVAATPDNREKAVAYVR-SLTADGGTEMLPALEDALRNQGPVASGALRQVV 442
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
+TDGA+ +E+Q+ + + + R++T GIGS N YF+ A + RG +
Sbjct: 443 FLTDGAIGNEQQLFQEITANRGDA-----RVFTVGIGSAPNTYFMTKAAEVGRGTFTQIG 497
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-QG 548
D + +M +LF + + + +IA + + ++ ++ P+ +PDL S P++++ Q
Sbjct: 498 STDQVASRMSELFAKLQNPTMTDIAA-SFEGIEAEDITPNPMPDLYSGEPVVLTAELPQD 556
Query: 549 KFPDTLKAKGFLGDLSNFVVELKL-QLAKDIPLDRICAKQQIDLLTAQAWFSED-KRLEE 606
K L+ G GD + VE+ + A + ++ A+++ID L A+A+ +D L++
Sbjct: 557 KAAGKLQIVGKTGD-QPWRVEMDIANAAVGSGISKLWARRKIDDLEARAYERQDPAALDK 615
Query: 607 KVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ +++ ++ T ++ V+ A+E G+ K
Sbjct: 616 DIEAVALAHHLVSRVTSLVAVDVTPSRPANEPLGSAK 652
>gi|424871695|ref|ZP_18295357.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. viciae WSM1455]
gi|393167396|gb|EJC67443.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. viciae WSM1455]
Length = 795
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/514 (23%), Positives = 231/514 (44%), Gaps = 59/514 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P + +
Sbjct: 170 RPNIFTNQVANIGPGETIVVQIEYQQTIHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 225
Query: 233 HLNVNAGTGTE-------------VLCNTSSHHLK------QLRR--------------D 259
N AG T VL + + LR D
Sbjct: 226 EFNNGAGFATPRDPVENRDKIEAPVLDPRENARINPVSLTVDLRAGFPLGDVKSSFHAVD 285
Query: 260 VGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
+ + G ++ LK + DF+ ++ +P + L + +D + ++
Sbjct: 286 ISQDGDQSKTISLKADTVPADKDFELTWKAAPGKMPSAGLFR----EVIDGKTYLLAFVT 341
Query: 316 PGTAKSRKVF--KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
P T K++V+F++D SGSM G +E + +LA+A+SKL+P D FN++ F+
Sbjct: 342 PPTTPDTAAPPPKREVVFVIDNSGSMSGPSIEQARQSLALAISKLNPDDRFNVIRFDDTM 401
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFLVT 432
+ + AT + E+A ++ A G T + L A+ + G++ + +T
Sbjct: 402 TDYFKGLVAATPDNREKAIAYVR-GLTADGGTEMLPALQAALRNQGPVATGALRQVVFLT 460
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DGA+ +ERQ+ + + + R++T GIGS N YF+ A I RG + A D
Sbjct: 461 DGAIGNERQLFQEIAANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFSAIGSTD 515
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP- 551
+ +M +LF + + + +I T + ++ + P+ +PDL S P++++ + P
Sbjct: 516 QVASRMGELFAKLQNPAMTDITA-TFEGIEAENITPNPMPDLYSGEPVVLTAELPEEKPA 574
Query: 552 DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLEEKVS 609
L+ G G+ + VE+ + A D + ++ A++++D A+A+ +D L++ +
Sbjct: 575 GKLQIIGKTGN-QPWRVEMDIANAADGSGISKLWARRKMDDFEARAYERQDPAALDKDIE 633
Query: 610 KMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+++ ++ T ++ V+ A + G+ K
Sbjct: 634 TVALAHHLVSRVTSLVAVDVTPSRPADQPLGSAK 667
>gi|410648493|ref|ZP_11358901.1| uncharacterized protein GAGA_4475 [Glaciecola agarilytica NO2]
gi|410131951|dbj|GAC07300.1| uncharacterized protein GAGA_4475 [Glaciecola agarilytica NO2]
Length = 777
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 133/587 (22%), Positives = 254/587 (43%), Gaps = 105/587 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PN+FT + I +S+ + + Q + + FSV P +P +IP
Sbjct: 197 RPNLFTNAIANIGPNETISVTIEYQQVIRFDQQTFSVRFPMTITPRYSPRASQIPLINQT 256
Query: 228 -----------------KRE-----------KIHLNVNAGTGTEVLCNTSSHHLKQLRRD 259
R+ +I + +N+G S HH +
Sbjct: 257 AVAKVNSNGWGESVSVLSRQIKTPDEPANPIRISVELNSGFSISPEDIVSEHHPIDITPK 316
Query: 260 VGKLGYSYESEVLKWSNIDFDFSYTVSPS----------------HIFGGVLLQSPSLHD 303
+ G+ Y+ E+ K + DFS P+ + + V+L P D
Sbjct: 317 DQQAGH-YQIELTKEHIANQDFSLRWKPTVGDIPSAAHFSETIGNYRYAMVMLTPPRQDD 375
Query: 304 VDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
D + + V ++V+F++D SGSM G+ + K A+ AL++L P DS
Sbjct: 376 ADDK-----------STKTPVSAREVVFLLDTSGSMAGESIVQAKRAVDFALTQLHPEDS 424
Query: 364 FNIVAFN-GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV------E 416
N++ FN L++T+M AT ++RA W+ + A G T + L+ A+ E
Sbjct: 425 VNVIEFNDAPQALWNTAMP-ATANNIQRARNWVA-SLSANGGTEMAPALSMALHKTNLEE 482
Query: 417 MLTNSRGSIPI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
TN + + + +TDG+V +E + ++++L + R++T GIGS N YF+
Sbjct: 483 QNTNEGSPVQLRQVVFITDGSVSNEDALMSLIENQLADS-----RLFTIGIGSAPNSYFM 537
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
A RG + D++ ++ +M +LF + V+ +I I+ +D E YPS IPDL
Sbjct: 538 TQAAQAGRGTFTYIGDINQVQQKMTELFNKLTRPVMQDIHIEFARDT---EFYPSVIPDL 594
Query: 535 SSESPLIVSGR---------YQGKFPDT-LKAKGFLGDLSN----FVVELKLQ-LAKDIP 579
+ P+++ R Q + PD +K G+ ++ + + + L AK
Sbjct: 595 YQDQPVVIHYRTPVAHLENGMQLQQPDNAVKVTGWQTTNASPSKPWSIRMPLSPTAKSAG 654
Query: 580 LDRICAKQQIDLLTAQAWFSEDKRL----EEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ A+++I+ L+ D + + +++++++ ++ ++T +I V+ ++ N+
Sbjct: 655 MGVAWAREKIEQLSHTLRRERDNTMRQTWQHSITEIALEHHMVSQFTSLIAVD-EQVTNS 713
Query: 636 SESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFP 682
++ + SK K ++ P + + + I G L TA N P
Sbjct: 714 KDAKPEHENSKN----KPINHNAPAGLAMAGVQIR-GQLPQTATNTP 755
>gi|428212115|ref|YP_007085259.1| hypothetical protein Oscil6304_1643 [Oscillatoria acuminata PCC
6304]
gi|428000496|gb|AFY81339.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Oscillatoria acuminata PCC 6304]
Length = 812
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 134/578 (23%), Positives = 242/578 (41%), Gaps = 58/578 (10%)
Query: 94 DVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIA 153
DV +D +V+ T R ++ + P+ D+ ++ +E +I + +
Sbjct: 73 DVKARIDGNVSQVAVTQRFQNPF-NEPLEAVYVFPLPDEAAVYDMEIKIGDRIIKGDIKQ 131
Query: 154 LGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFS 208
E A + T + L +PNIFT +L I G + + +R+S L + G +
Sbjct: 132 REEAQRIYDEARQQGRTAALLEQERPNIFTQSLANIKPGEQIEVTIRYSDSLPFEGGNYE 191
Query: 209 VNVPFKF-PEYV---------------------TPAIKKIPKRE----KIHLNVNAGTGT 242
P P Y+ TP + + R + + +NAG
Sbjct: 192 FVFPMVVGPRYIPGTPTTPETPNTNQVPDASRITPPVLRPGSRSGHDISVAVEINAGV-- 249
Query: 243 EVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLH 302
+ N S + + G+L ++ N D Y VS VL QS
Sbjct: 250 -PISNIQSPSHQIVTSQSGELVRIDLAKSDSIPNKDLILRYRVSGDRTKATVLTQSD--- 305
Query: 303 DVDQREMFCMYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
D+ F +YL+P +R ++ KDVIF++D SGS G P+ ++ + ++ L+P
Sbjct: 306 --DRGGHFAIYLIPALDYNRNEIVPKDVIFLMDTSGSQSGDPIVKSQELMRRFINGLNPN 363
Query: 362 DSFNIVAFNGETYLFSTSMELATKE-AVERAHQWIGINFI----AGGSTNICAPLTKAVE 416
D+F I+ F +T+ L+ + A A++ + IN+I A G T + + +
Sbjct: 364 DTFTIIDF------ANTATALSPRPLANTPANRNMAINYINQLDANGGTELLNGIQTVLN 417
Query: 417 MLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
+ G + I L+TDG + ++ +I ++ +L G R+Y+FG+GS N + L
Sbjct: 418 YPPSENGRLRSIVLLTDGYIGNDMEIIAEVQQKLQAGN----RLYSFGVGSSVNRFVLDR 473
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
LA + RG + E ++ F + + VL NI + + E+YP PDL +
Sbjct: 474 LAEVGRGTSQVIRQDEPTEAVAERFFRQINNPVLTNIEVSWEGGGEPAEVYPGLPPDLFA 533
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFL--GDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTA 594
E PL++ GR + L+ KG GD + + + + + ++ + +I L
Sbjct: 534 EQPLVLFGRKGDRARGQLRVKGMTANGDRYEELFRMAFEEEGNPAIAQLWGRHRIKDLMT 593
Query: 595 QAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDER 632
Q + E V+ ++ +L YT + V + R
Sbjct: 594 QMYGGETTSGVTAVTNTALAYRLLSAYTAFVAVSEEVR 631
>gi|254416017|ref|ZP_05029773.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
gi|196177192|gb|EDX72200.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
Length = 744
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 124/498 (24%), Positives = 211/498 (42%), Gaps = 55/498 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAI-KKIPKRE- 230
+ NIFT +L I G + + +R++ L + GE F FP V P IP E
Sbjct: 83 RDNIFTQSLANIKPGERIDVTIRYTDTLKFSSGECE----FVFPMVVGPRFCPGIPIDET 138
Query: 231 ---------------------------KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKL 263
+I + ++AG + N+ SH ++ + + G
Sbjct: 139 GDSDRVADASRIMPTIIPPSTRSGHDIQITVEIDAGVPISNI-NSPSHQIRVEQEETGVR 197
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTAKS- 321
E + + N D Y V+ VL QS D R F +YL+P S
Sbjct: 198 VLLGEGDTI--PNKDLILRYQVAGDRTQATVLTQS------DTRGGHFALYLIPAVDYSD 249
Query: 322 RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSME 381
++ KDV+F++D SGS +G P ++ + + L+P D+F I+ F+ T S
Sbjct: 250 HEIVPKDVVFLIDTSGSQKGDPFRKSQELMRRFIQGLNPQDTFTILDFSDITTQLSAKPL 309
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
T + +A +I A G T + + + G + I L+TDG + +E +
Sbjct: 310 ANTPQNRIKALTYIN-QLKANGGTYLLNGIRAVLNFPAAPEGRLRSIVLITDGYIGNESE 368
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
I +K L +G R+Y+FG+GS N + L +A + RG +S + +K
Sbjct: 369 ILAEVKQYLKSGN----RLYSFGVGSSPNRFLLNRMAELGRGTSRIVRQDESTQEVTEKF 424
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT---LKAKG 558
+ VL NI ++ + L E +YP+ PDL ++ PL++ GR +P+ LK G
Sbjct: 425 LRHINNPVLTNINVE-WQGLGEAMIYPTTPPDLFTQQPLVLFGRISPIYPEQTGILKITG 483
Query: 559 FLGDLSNFVVELKLQLAKD--IPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTG 616
+ + E L+L + + ++ + +I L Q W E K E + + ++
Sbjct: 484 TVTGGKRYEQEFHLRLDSEGNCAIAQLWGRARIKDLMNQMWHCEKKAQVEAIIETALTYQ 543
Query: 617 VLCEYTRMIIVETDERNN 634
+L YT + V R +
Sbjct: 544 ILAPYTAFVAVSEQIRTD 561
>gi|392553877|ref|ZP_10301014.1| inter-alpha-trypsin inhibitor domain-containing protein
[Pseudoalteromonas undina NCIMB 2128]
Length = 666
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 136/559 (24%), Positives = 248/559 (44%), Gaps = 76/559 (13%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDG-------AGKSASVETGSFL 172
+ + R P+ D+ ++ + I + ++ + + AGK AS+
Sbjct: 79 AVNARYVFPLPDESAVQAMTMRIGERVIKGKIAKKRQAEKQYQAAKKAGKQASLLRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------- 221
+ N+F + + G + I L + + + Y+ G F+++ FP +TP
Sbjct: 137 RANMFITNVANVGPGEEIVITLEYQEIIDYQHGTFAIH----FPTTITPRYHAITGEVND 192
Query: 222 -----AIK----------------------KIPKREKIHLNVNAGTGTEVLCNTSSHHLK 254
AIK + + L +N G E++ + +H
Sbjct: 193 EQDTHAIKNANAPSFTPQGWLSPVYTKQTNEATSSSDLSLTINMDIGLELVDINAKYH-- 250
Query: 255 QLRRDVGKLGYSYESEVLK-WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
+ ++ + S VLK I+ DF P I ++ M
Sbjct: 251 --QVNIENPAFGQYSIVLKPQQAINRDFVLEFKPLQIAQAQAALFTQQAQNGEQYALAML 308
Query: 314 LLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE 372
+ P + ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNIV+FN E
Sbjct: 309 MPPSDSFTQSQRLPREMVFVVDTSGSMHGQSIEQAKKALFYALSLLDSDDSFNIVSFNNE 368
Query: 373 TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVT 432
+ S + +A+ + RA ++I N A G T I L AV T G + + +T
Sbjct: 369 VAVMSETPLIASDFNLRRAERFI-YNLQADGGTEIKGAL-DAVLDGTQFEGFVRQVVFLT 426
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DG+V +E Q+ + L + R++T GIGS N YF+R A I +G Y
Sbjct: 427 DGSVSNEDQLFKHIAHALGDS-----RLFTVGIGSAPNRYFMRRAADIGKGTYTFISSAH 481
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
++ +MQ+LF + + N+A+ T + D + +PS +PDL P++V+ + D
Sbjct: 482 DVQPKMQQLFDKLAHPAMTNLAL-TNTNGDSLDFWPSPLPDLYFAEPIMVAIKL-----D 535
Query: 553 TLKAKGFLGDLSN--FVVELKLQL-AKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS 609
T F G ++ F V+L Q +K +D++ A+Q+I L +++ ++++V
Sbjct: 536 TTDTITFTGQTAHGEFKVKLNPQASSKATGIDKLWARQKIKSLL---LYNQQSVVKQQVQ 592
Query: 610 KMSVQTGVLCEYTRMIIVE 628
+++++ +L +T I +E
Sbjct: 593 QLALEHQLLSPFTAFIAIE 611
>gi|222056730|ref|YP_002539092.1| Vault protein inter-alpha-trypsin domain-containing protein
[Geobacter daltonii FRC-32]
gi|221566019|gb|ACM21991.1| Vault protein inter-alpha-trypsin domain protein [Geobacter
daltonii FRC-32]
Length = 665
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 143/612 (23%), Positives = 257/612 (41%), Gaps = 70/612 (11%)
Query: 94 DVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIA 153
DV ++ V+ T + + G++ + R P + S+ G++ + GK T I
Sbjct: 55 DVKVTINGVIADVTVTQK-YVNNGTRPINARYCFPASTRASVHGMKMTV-GKRVVTARIK 112
Query: 154 LGENDGA--------GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDG 205
E A GKSAS+ +PN+F+++L I + I L +++ L DG
Sbjct: 113 EKEAAKAEFVKAKAEGKSASLLEEQ--RPNVFSMSLANIMPRDEVQIELHYAELLVPTDG 170
Query: 206 EFSVNVPFKF-PEYVTPAIKKIPKREK----------------IHLNVNAGTGTEVLCNT 248
+ P P Y P+R+K ++NV T +
Sbjct: 171 TYEFVYPTVVGPRYSNQKEADAPERDKWVKSPYLKEGKTPATAFNINVFLSTSIPIQEVV 230
Query: 249 SSHHLKQLRRDVGKLGYSYESEVLKWS-----NIDFDFSYTVSPSHIFGGVLLQSPSLHD 303
S H + D+ L S + L S N DF Y ++ I G++L
Sbjct: 231 SPSH----KVDLSWLDKSMVTAALVKSGGFAANRDFILKYRLAGKEIQSGLMLYQ----- 281
Query: 304 VDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
++E F + ++ P ++ ++ ++ IF++D+SGSM G PL+ K + + L P
Sbjct: 282 -GEKENFFLLMVQPPEKVQAAEILPREYIFVLDVSGSMHGFPLDTAKTLIRDLIGNLRPT 340
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS 421
D+FN+V F G + + S AT E + +A + I GG T + A L KA+ L
Sbjct: 341 DTFNLVLFAGGSQVMDPSSIPATSENITKAIRLIDSQ-QGGGGTELAAALNKALS-LPRE 398
Query: 422 RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+G ++TDG + ER+ + L +++FGIGS N Y + +A
Sbjct: 399 KGKARTAVIITDGFISAERESFKLISENLD-----TTNVFSFGIGSSINRYLVDGIAQAG 453
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+G + + + S +L + + K+ + + + P+ IPDL ++ PL+
Sbjct: 454 QGESFVVTKPEEAKEASDRFRKYVASPLLTGVHVK-FKEFEAYNVEPATIPDLFAQRPLV 512
Query: 542 VSGRYQGKFPDTLKAKGFLGD---LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF 598
+ G+++GK ++ G F V+ L+ PL + A+ +I L+
Sbjct: 513 LYGKWRGKPIGEIELTGKTAAGKYTRTFYVKETKPLSDHAPLKYLWARSRIARLSNYNVQ 572
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEV 658
E+ +V+ + + +L YT I V + RN K+G+ + +D +
Sbjct: 573 GENSDTRREVTNLGLDYNLLTPYTSFIAVLEEVRN--------KEGAADN-----VDQPL 619
Query: 659 PKTIMLQNLGIG 670
P + NL +G
Sbjct: 620 PLPAGVSNLAVG 631
>gi|153006973|ref|YP_001381298.1| vault protein inter-alpha-trypsin subunit [Anaeromyxobacter sp.
Fw109-5]
gi|152030546|gb|ABS28314.1| Vault protein inter-alpha-trypsin domain protein [Anaeromyxobacter
sp. Fw109-5]
Length = 1362
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 120/510 (23%), Positives = 229/510 (44%), Gaps = 71/510 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+PN+FT ++ + G +++ LR+ ++ + G ++ + P P YV
Sbjct: 195 RPNLFTQSVANVPPGETVAVVLRYVHEVPFDGGRYAFHFPTTVGPRYVPGAALAPDAPGG 254
Query: 220 ------TPAIKKIPKREKIHLNVN-----AGTGTEVLCN-----------TSSHHLKQLR 257
+P ++P ++ V+ +G ++L T SH +
Sbjct: 255 ARGPGTSPDTGRVPDASRVTPPVSPPGTRSGHDVDLLVRLVPGGAFDEVETRSHRVVTGL 314
Query: 258 RDVG-KLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREM---FCMY 313
G +L E + + + DF T P+ + P H + QRE F M
Sbjct: 315 EPAGARLVALAEDDRIP----NKDFVLTWRPAGVV-------PGAHALVQREKGEDFLML 363
Query: 314 LL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG 371
+ P V K+++F+VD SGSM G P + + +A AL + P D+F +VAF+G
Sbjct: 364 FVQPPAGVAPALVRPKELVFLVDKSGSMMGAPFDRVRALVARALDAMGPDDTFQVVAFDG 423
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLV 431
S + AT A+ RA +W+ + GG T + + A+ + R + ++
Sbjct: 424 SAQAMSEAPLPATPSAIARAKEWLA-SLEGGGGTEMLEGVRAALSPPEDPR-RLRMVVFC 481
Query: 432 TDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
TDG + +E +I +A+++ R++ FGIGS N Y + + RG
Sbjct: 482 TDGFIGNEPEIIEAVEALRGRA-----RVFGFGIGSSVNRYLVEGVGRAGRGASEVVSLD 536
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP 551
+ + + +LF R +L ++ + + + ++ P+R+PDL + PL+V+ R +G P
Sbjct: 537 EPPDAAVARLFARIDRPLLTDLEVG-FEGAEVTDLLPARLPDLFAGQPLVVAARVRGGRP 595
Query: 552 DTLKAKGFLGDLSNFVVELKLQLAKDIPLD--------RICAKQQI-DLLTAQAWFSEDK 602
+ +G LG+ + +V +++ A+ D + A+++I DLL A+ E
Sbjct: 596 SHVILRGRLGE-APWVSRVEVATARAAVGDGSEQPVVGTLWARRRIDDLLGARPTAPERS 654
Query: 603 RLEEKVSKMSVQTGVLCEYTRMIIVETDER 632
+EE V+ ++++ ++ YT + VE + R
Sbjct: 655 AIEETVA-LALRFKLVTPYTSFVAVERELR 683
>gi|223938464|ref|ZP_03630357.1| Vault protein inter-alpha-trypsin domain protein [bacterium
Ellin514]
gi|223892883|gb|EEF59351.1| Vault protein inter-alpha-trypsin domain protein [bacterium
Ellin514]
Length = 723
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/494 (23%), Positives = 219/494 (44%), Gaps = 54/494 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-----PEYVTPAIKKIP 227
+PNIF ++ I+ G + + +R+ Q L+Y DG + P P ++T + +
Sbjct: 222 RPNIFRQSVANIEPGKQIDVNIRYFQTLAYVDGWYEFVFPMVVGPRYNPSHITNGVGAVA 281
Query: 228 KRE-----------------------KIHLNVNAGTGTEVL-CNTSSHHLKQLRRDVGKL 263
+ + +HL+V+AG E C T H + + +L
Sbjct: 282 RNQGGTSGQSTEVQYLKPSERSGHEIALHLDVDAGVSIEEFSCVT--HKISKTSTTPEQL 339
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSR- 322
SE + N DF Y ++ I ++ H ++ F M L P +
Sbjct: 340 SVDL-SEGDRIPNKDFVLRYRIAGERIKSNFMV-----HRDERGGYFTMMLYPPKELGQL 393
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
+++F++D SGSM G+P+ K A+ AL +L PGDSF I+ F+
Sbjct: 394 GRAPMEMVFVLDCSGSMSGEPIAQAKAAIRHALKQLQPGDSFQIINFSEHASQLGAKPLE 453
Query: 383 ATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
AT E + + ++ +N + G T + + A++ + + + +TDG + +E +
Sbjct: 454 ATPENIRKGLAYVEALN--SDGPTEMIEGIKAALDFPHDPE-RLRFVCFLTDGFIGNEAE 510
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
I A+ R+ G S RI++FG+GS CN Y L LA + G D+ M
Sbjct: 511 ILAAVHERI--GAS---RIFSFGVGS-CNRYLLDHLAKMGGGAVAHLGLHDNGAKVMDDF 564
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG 561
F R + +I +D +L E+YP ++PDL P+I++GR+ G ++ G G
Sbjct: 565 FERVSHPAMTDIKVD-WGNLQVSEVYPQQMPDLFVGRPVILTGRFSGANTANIRVTGKAG 623
Query: 562 DLS---NFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVL 618
NF V L+ +P + A+ +I L + + + L+ ++ ++++ G++
Sbjct: 624 VQPIELNFPVTLEDSAGNALP--SVWARAKIAELADRTTYQANSDLQGEIKQVALDYGLI 681
Query: 619 CEYTRMIIVETDER 632
+T + V++ ++
Sbjct: 682 SAFTAFLAVDSSQK 695
>gi|428316181|ref|YP_007114063.1| Vault protein inter-alpha-trypsin domain-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428239861|gb|AFZ05647.1| Vault protein inter-alpha-trypsin domain-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 831
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 116/495 (23%), Positives = 207/495 (41%), Gaps = 51/495 (10%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT------------- 220
NIFT +L I G + + +R+++ L + G++ P P Y++
Sbjct: 177 NIFTQSLANIKPGEKIEVTIRYTESLKFAGGDYEFVFPMVVGPRYISGNSTGADRINPPV 236
Query: 221 -PAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDF 279
P + + + ++AG + +TS ++ ++ + + N D
Sbjct: 237 LPPGTRSGHNIAVSVEIDAGVAIGDVRSTSHQITTHRSGNIVQVQLANSDTI---PNKDL 293
Query: 280 DFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPG-TAKSRKVFKKDVIFIVDISG 337
Y V+ + +L QS D+R F YL+P K+ ++ KDV+F++D SG
Sbjct: 294 ILRYRVAGENTRATILTQS------DKRGGHFATYLIPALNYKNNEIVPKDVVFLMDTSG 347
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGI 397
S QG+PL +K + + L+P D+F I+ F ST+ T + A +I
Sbjct: 348 SQQGEPLAKSKELMRRFIQGLNPNDTFTIIDFANTAKALSTNPLANTAANRQSAINYIQ- 406
Query: 398 NFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSIC 457
N A G T + + + + G + I L+TDG + +E ++ ++ L G
Sbjct: 407 NLQANGGTELLNGIQAVMNFPAAATGRLRSIVLITDGYIGNENEVLARVQGSLKPGN--- 463
Query: 458 PRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT 517
R+Y+FG+G N + L LA + RG + +K F S VL NI I
Sbjct: 464 -RLYSFGVGGSVNRFLLNRLAEVGRGTSQVIRQDEPSAEAAEKFFREINSPVLTNIQISW 522
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAK- 576
++ E+YP PDL + PL++ G+ + L+ +G L + L + A+
Sbjct: 523 EGMGEKPEIYPIAPPDLFATQPLVLFGKKTDRANGQLRIRGTLAGGKAYEQVLPVNFARS 582
Query: 577 -------------------DIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGV 617
+ + ++ + +I L Q + E K L E V+ ++ +
Sbjct: 583 GSSRQRESTSVAATATDFGNPAVAQLWGRSRIKDLMNQMFGGETKSLVEAVTNTALTYRL 642
Query: 618 LCEYTRMIIVETDER 632
L EYT + V + R
Sbjct: 643 LSEYTAFVAVSEEVR 657
>gi|407693945|ref|YP_006818733.1| Vault protein inter-alpha-trypsin [Alcanivorax dieselolei B5]
gi|407251283|gb|AFT68390.1| Vault protein inter-alpha-trypsin [Alcanivorax dieselolei B5]
Length = 667
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 131/542 (24%), Positives = 249/542 (45%), Gaps = 57/542 (10%)
Query: 117 GSKSCDCRIAVPMGDQGSILG---------VEAEISGKSYHTQLIALGENDGAGKSASVE 167
G+ + R P+ +Q ++ G + EI K + + DG ++ VE
Sbjct: 90 GALFSEGRYVFPLPEQAAVTGMTLTVGERRIVGEIQPKQEAKRRYQQAKRDGH-RAGLVE 148
Query: 168 TGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-----PE----- 217
N+FT ++ I G + + L +SQ + + +F + VP PE
Sbjct: 149 Q---RHSNLFTTSVANIAPGERVEVTLHYSQTVDVQGNDFRLRVPMTLTPRYQPEGDENL 205
Query: 218 ---YVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYS-YESEVLK 273
+ +P P ++ + ++AG +L S H L D + Y + S+ L
Sbjct: 206 SGPFHSPDSSAAPNMARLRVELDAGQS--ILPPDSGSH--DLDVDGARGLYRIHPSDGLV 261
Query: 274 WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIF 331
+ DF + ++ + G L + +V+ E + + +L P + + ++ +F
Sbjct: 262 PMDRDFLLHWRIANAEQSRGSLF----VENVNG-EHYALVMLTPPSHPAATRALPRETLF 316
Query: 332 IVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY-LFSTSMELATKEAVER 390
I+D SGSM G + + +L AL+ L PGD FN++ FN + LF+ M A + +++
Sbjct: 317 IIDTSGSMSGASIRQARASLLEALAHLRPGDRFNVIEFNSVHHALFAAPMP-ADEAHLQQ 375
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A +++ A G T + L A+ M ++ ++F +TDGAV +E I + + RL
Sbjct: 376 ARRFV-TGLKASGGTEMLPALHTALTMPADAEYLRQVMF-ITDGAVTNEDGIFNLIHRRL 433
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVL 510
+ R++T GIGS N +F+R A RG + +L + M LF + + V+
Sbjct: 434 GSA-----RLFTVGIGSSPNSHFMRKAAQFGRGTFTYIGNLQEVSKAMNDLFVKLENPVI 488
Query: 511 ANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVEL 570
++ + TL E E++P RIPDL + PL+++ + + P+ L G L+ E
Sbjct: 489 RDVRL-TLPPGVEAEVWPRRIPDLYASEPLLLTLKLN-RVPEYLTLTG----LTPAPWEQ 542
Query: 571 KLQL--AKDIP-LDRICAKQQIDLLTAQAWFSE-DKRLEEKVSKMSVQTGVLCEYTRMII 626
LQL + P + R+ A+ +I+ L + E ++ + ++ ++++ ++ YT +
Sbjct: 543 TLQLPEGRQHPGIARLWARNKIEALNDRLALGEAEQVIRPLLTDVALEHSLMSRYTSFVA 602
Query: 627 VE 628
+E
Sbjct: 603 IE 604
>gi|119493558|ref|ZP_01624223.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106]
gi|119452612|gb|EAW33794.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106]
Length = 757
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 123/492 (25%), Positives = 214/492 (43%), Gaps = 50/492 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA---------- 222
+ NIFT +L I G + + +R+S L + G++ F FP V P
Sbjct: 112 RDNIFTQSLANIKPGEEIEVIIRYSDSLKFEQGDYE----FVFPMVVGPRYIPGENIDQS 167
Query: 223 --IKKIPKREKIHLNV---NAGTGTEVLCNTSSHH---LKQLRRDVGKLGYSYESEVLKW 274
++P KI + +G ++ H + Q+R ++ + E++
Sbjct: 168 GDTDQVPDASKITPPIVPEGTRSGQDIGVTVEIHAGLPISQVRSTSHQINITENGEIVTV 227
Query: 275 S--------NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTA-KSRKV 324
N D Y VS + VL QS D+R F +Y +P K+ ++
Sbjct: 228 KLDNEDTIPNKDLILRYQVSGDNTQTTVLTQS------DERGGHFALYFIPAIEYKTDEI 281
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
KDV+F++D SGS QG PL + + ++ L+P D+FNI+ F T S + +
Sbjct: 282 VAKDVLFLMDTSGSQQGDPLFKCQELMRRFINGLNPNDTFNIMDFAHTTCTLSETPLANS 341
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEM--LTNSRGSIPIIFLVTDGAVEDERQI 442
E A +I A G T + + + ++ LT G + I L+TDG + +E I
Sbjct: 342 PENRSLAIHYIN-QLRANGGTELLNGIREVLKFPELT---GRLRSIVLLTDGYIGNENAI 397
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ L G R+++FG+GS N + + +A I RG +S + +K F
Sbjct: 398 LSEVQDNLKPGN----RLHSFGVGSSVNRFLINRIAEIGRGISRVVRQNESTQKVAEKFF 453
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD 562
+ + VL +I I + + E+YP PDL +E PL++ GR LK G + +
Sbjct: 454 RQINNPVLTDINIHWEGNGETPEIYPEIAPDLFAEQPLVLFGRKVDSAAGNLKVTGKIAN 513
Query: 563 LSNFVVELKLQLAK--DIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCE 620
+ + + + ++ + ++ +Q+I LT Q E E V+ ++ +L +
Sbjct: 514 GNPYEQTFSIDFSNTGNVAIAQLWGRQRIKHLTNQMLGYETTSDVEAVTDTALTYQLLSQ 573
Query: 621 YTRMIIVETDER 632
YT + V D R
Sbjct: 574 YTAFVAVSDDVR 585
>gi|149377596|ref|ZP_01895335.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893]
gi|149358138|gb|EDM46621.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893]
Length = 718
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 150/591 (25%), Positives = 249/591 (42%), Gaps = 94/591 (15%)
Query: 93 LDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILG---------VEAEIS 143
L D L +F S WR + P+ D S+ G + EI
Sbjct: 97 LIADTRLRQSFRNTSQQWR----------EGVFVFPLPDNASVYGMTMTAGERVIVGEIH 146
Query: 144 GKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYR 203
K + A + +G K A V+ +PN+FT + I G +++ L + Q + Y+
Sbjct: 147 EKQDAKRQYAKAKEEGR-KVARVDQ---QRPNLFTTRMANIPPGETITVELNYQQAVRYQ 202
Query: 204 DGEFSVNVPFKF-PEYV-------TPA---------IKKIP------------------- 227
GEF + +P P Y+ PA ++P
Sbjct: 203 SGEFELRLPTTLTPRYMPGQALDNEPAQWSGGWATPTTEVPDADQISPFTVRPEDVGPDS 262
Query: 228 KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSP 287
R I L +NAG + + S H + + + ES + +D D +P
Sbjct: 263 HRATIALEINAGLPVSRVSSPS--HRVTSTWNGNTVEVAPESRAVL---MDRDLIVRWAP 317
Query: 288 SHIFGGVLLQSPSLHDVDQREMFCMYLL-PGTAKSRKVFKKDVIFIVDISGSMQGKPLED 346
+ G+ + + Q + + + L+ PG +R + ++++F++D SGSM G+ +
Sbjct: 318 AR---GMEPAAAVFQEHWQGKDYLLALVVPGLNGNRSL-PRELVFVIDTSGSMAGESIRQ 373
Query: 347 TKNALAVALSKLDPGDSFNIVAFNGETY-LFSTSMELATKEAVERAHQWI-GINFIAGGS 404
+ AL L LD D FN++ FN +T+ LF S+ A+ + RA +++ G+N A G
Sbjct: 374 ARQALLRGLGTLDADDRFNVIQFNSQTHSLFMESVP-ASGNNIARARRYVKGLN--ADGG 430
Query: 405 TNICAPLTKAVEMLTN------SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
T + L A+E TN SR + + +TDGAV +E + ++ L +
Sbjct: 431 TEMAPALDAALE--TNGDGGEASRARVRQVVFITDGAVGNESALFGKIRDGLGSS----- 483
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
R++T GIGS N +F+R A RG Y A DL + + +LF + + VL +I +
Sbjct: 484 RLFTVGIGSAPNMHFMREAARYGRGTYTAISDLSDVARPLDELFGKMEAPVLTDIEVSWP 543
Query: 519 KDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP--DTLKAKGFLGDLSNFVVELKL-QLA 575
E +P RI DL P++ R P L+ G L D S + L L Q A
Sbjct: 544 GQAAATEAFPGRIGDLFKGEPMVQVVR---DLPAEGALEVSGRLPDGSQWQQSLSLEQAA 600
Query: 576 KDIPLDRICAKQQIDLLTAQAWFSE-DKRLEEKVSKMSVQTGVLCEYTRMI 625
L R AKQ+ID L + D + V ++ ++ ++ +YT +
Sbjct: 601 NGKGLHRHWAKQKIDSLMDSGLTGQIDDDARQHVIRLGLEHQLMTKYTSFL 651
>gi|407014943|gb|EKE28894.1| hypothetical protein ACD_2C00267G0006 [uncultured bacterium (gcode
4)]
Length = 853
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 122/496 (24%), Positives = 228/496 (45%), Gaps = 48/496 (9%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFK-FPEYVTPAIK------- 224
+PNIFT + I G + I++ + + L Y + +++ P +P Y ++K
Sbjct: 265 RPNIFTQKVANIMPGDNIKIKISYFEVLRYENWKYNYIFPMVVWPRYNPASVKDASNIES 324
Query: 225 ---KIPKRE----KIHLNVNAGTGTEVL-CNTSSHHLKQLRRDVGKLGYSYESEVLKWSN 276
K RE + LN+ A L N+ +++++ ++ SE+ N
Sbjct: 325 PTIKAGYREWHTIDLKLNILAWVNIRWLKSNSHDVNVRKINDSQAEITLKNASEI---PN 381
Query: 277 IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRK---VFKKDVIFIV 333
DF F Y V+ G L+ H V + + L+ KS K + K+++F++
Sbjct: 382 KDFSFIYDVASESPQIGFLI-----HKVASEKDWYFALIAEPQKSPKQSEIRDKEIVFVL 436
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
D SGSM +P+E K A+ A+S L P DSFN+ FN + + + +LA A + +
Sbjct: 437 DTSGSMTWRPIETVKKAMRKAISNLGPNDSFNVYNFNTQVFTLFPASKLADWSAKDEWLK 496
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
++ N A G T + P +A++ +S + II + D + +E +I +K+ L
Sbjct: 497 YVE-NLEAWGGTMMDQPFMEALKNTWDSWDRMRIILWMMDWDIWNESEILSIIKTWLGQN 555
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL--DSIEIQMQKLFTRGFSSVLA 511
R++ + + N Y + +A + A Y L D+I+ ++ + + S VL
Sbjct: 556 -----RLFMLWVDAAANRYLIDKMA--ESWNWKATYVLGEDNIDEKVDEFYGNFASPVLT 608
Query: 512 NIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ-GKFPDTL--------KAKGFLGD 562
+I +D L ++ P R DL + PL V G+Y G F D+L AK +
Sbjct: 609 DIKVD-WDWLSVTDVMPGRFADLYAGQPLHVYGKYSAGAFLDSLDKERNIRITAKRWNEA 667
Query: 563 LSNFV-VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEY 621
S + + K + ++ + A+Q+ID + + F + +LEE+V+ + ++ ++ E+
Sbjct: 668 YSQTIKINFKKEEKENSSIAAFWARQKIDDIYRENSFMPNPKLEEEVTALGLKFSIMSEF 727
Query: 622 TRMIIVETDERNNASE 637
T I ++ RN + +
Sbjct: 728 TSFIAIDDTVRNGSGQ 743
>gi|387815976|ref|YP_005431471.1| Von Willebrand factor A [Marinobacter hydrocarbonoclasticus ATCC
49840]
gi|381341001|emb|CCG97048.1| Von Willebrand factor, type A:Vault protein inter-alpha-trypsin
precursor [Marinobacter hydrocarbonoclasticus ATCC
49840]
Length = 712
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 133/555 (23%), Positives = 236/555 (42%), Gaps = 82/555 (14%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGE--------NDGAGKSASVETGSFLKPNIFTL 179
P+ +Q S+ G+ + + QL + K+A+VE +PN+FT
Sbjct: 122 PLPEQASVYGLTMTVGERRIVGQLQPRAQARKNYEKAKQAGQKAATVEQN---RPNLFTS 178
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIP----------- 227
+ I G +++ +++ Q ++YR GEF + +P P Y+ A P
Sbjct: 179 RIANIAPGEEVTVEVQYQQPVNYRHGEFELRLPTTLTPRYMPGAPVSAPASAWQSGWSLP 238
Query: 228 ------------------------KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKL 263
R I L + AG + + TS H Q+ + +
Sbjct: 239 TTQVADADEISPFTVLPDDVNAGSHRATIALEIEAGLPVDEV--TSPSHPLQVELEGSRA 296
Query: 264 GYSYES-EVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS---LHDVDQREMFCMYLLPGTA 319
+ E VL +D D P+ Q+P+ H + E F M ++ A
Sbjct: 297 KVAPEQGPVL----MDRDVIVRWRPAGS------QAPTAALFHQQWRGEDFLMAMVMPPA 346
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
+ +V +++++F++D SGSM G+ + ++AL L L PGD FN++ FN + +
Sbjct: 347 TTGQVLRRELLFVIDTSGSMAGESIRQARSALLRGLDTLRPGDRFNVIQFNSQAHALYPQ 406
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM-LTNSRGSIPIIFLVTDGAVED 438
A + RA +++ + A G T + L+ A+ M + S G + + +TDGAV +
Sbjct: 407 PVPANGHYLARARRYVQ-DLDADGGTEMAGALSLAMGMDGSESSGHVQQMVFMTDGAVGN 465
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
E + D +++ L N R++T IGS N +FLR A RG Y A + ++ +
Sbjct: 466 ESALFDQIRTGLGNR-----RLFTVAIGSAPNMHFLREAARWGRGQYTAVHSAAEVDKAL 520
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL--IVSGRYQGKFPDTLKA 556
KLF + V+ ++ + + + P++ DL PL +V G L
Sbjct: 521 GKLFAAMEAPVMTDVEVQWPG--NAAQPVPAKPGDLFHGQPLLQVVRG---APAEGELTV 575
Query: 557 KGFLGDLSNFVVELKLQLAKDIP---LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSV 613
G L ++ L LA P LDR A+ +ID + A + + E + ++S+
Sbjct: 576 SGRLPGGRSW--RTSLDLASAAPGKGLDRQWARGRIDAVMDSARLAGTEPDEAAIVELSL 633
Query: 614 QTGVLCEYTRMIIVE 628
GV+ +T + +E
Sbjct: 634 SHGVMSPFTSFVAIE 648
>gi|359437306|ref|ZP_09227374.1| uncharacterized protein [Pseudoalteromonas sp. BSi20311]
gi|359444826|ref|ZP_09234593.1| uncharacterized protein [Pseudoalteromonas sp. BSi20439]
gi|358027972|dbj|GAA63623.1| uncharacterized protein [Pseudoalteromonas sp. BSi20311]
gi|358041395|dbj|GAA70842.1| uncharacterized protein [Pseudoalteromonas sp. BSi20439]
Length = 666
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 133/559 (23%), Positives = 247/559 (44%), Gaps = 76/559 (13%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDG-------AGKSASVETGSFL 172
+ + R P+ D+ ++ + I + ++ + + AGK AS+
Sbjct: 79 AVNARYVFPLPDESAVQAMTMRIGERVIKGKIAKKRQAEKQYQAAKKAGKQASLLRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------- 221
+ N+F + + G + I L + + + Y+ G F+++ FP +TP
Sbjct: 137 RANMFITNVANVGPGEEIVITLEYQEIIDYQHGTFAIH----FPTTITPRYHAITGEVND 192
Query: 222 -----AIK----------------------KIPKREKIHLNVNAGTGTEVLCNTSSHHLK 254
AIK + + L +N G E++ + +H
Sbjct: 193 DQDKHAIKNANAPSFTPQGWLSPVYTKQTNEAANASDLSLTINMDIGLELVDINAKYH-- 250
Query: 255 QLRRDVGKLGYSYESEVLK-WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
+ ++ + S VLK I+ DF P I ++ M
Sbjct: 251 --QVNIENPAFGQYSIVLKPQQAINRDFVLEFKPLQIAQAQAALFTQQAQNGEQYALAML 308
Query: 314 LLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE 372
+ P + ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNIV+FN E
Sbjct: 309 MPPSDSFTQSQRLPREMVFVVDTSGSMHGQSIEQAKKALFYALSLLDSDDSFNIVSFNNE 368
Query: 373 TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVT 432
S + +A+ + RA ++I N A G T I L ++ + G + + +T
Sbjct: 369 VAAMSETPLIASDFNLRRAERFI-YNLQADGGTEIKGALDAVLDGM-QFEGFVRQVVFLT 426
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DG+V +E Q+ + L + R++T GIGS N YF+R A I +G Y
Sbjct: 427 DGSVSNEDQLFKHIAHTLGDS-----RLFTVGIGSAPNRYFMRRAADIGKGTYTFISSAH 481
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
++ +MQ+LF + + N+A+ T + D + +PS +PDL P++V+ + D
Sbjct: 482 DVQPKMQQLFDKLAHPAMTNLAL-TNTNGDSLDFWPSPLPDLYFAEPIMVAIKL-----D 535
Query: 553 TLKAKGFLGDLSN--FVVELKLQL-AKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS 609
A F G ++ F V+L Q +K +D++ A+Q+I L +++ ++++V
Sbjct: 536 NTDAITFTGQTAHGEFKVKLNPQASSKATGIDKLWARQKIKSLL---LYNQQSVVKQQVQ 592
Query: 610 KMSVQTGVLCEYTRMIIVE 628
+++++ +L +T I +E
Sbjct: 593 QLALEHQLLSLFTAFIAIE 611
>gi|294140611|ref|YP_003556589.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella
violacea DSS12]
gi|293327080|dbj|BAJ01811.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea
DSS12]
Length = 747
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/520 (24%), Positives = 231/520 (44%), Gaps = 84/520 (16%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIK-------- 224
KPNIFT ++ + G L +++ + + + Y GEFS+ +FP +V P +
Sbjct: 153 KPNIFTTSVANLGPGETLVVQISYQELVKYDKGEFSL----RFPMFVKPRYEPKNLEGKD 208
Query: 225 -KIPKRE---------------------KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGK 262
P E K+++ ++ G + TS +H D+ K
Sbjct: 209 DSYPSTETILAQFVDLSYGYGEHRAPGSKVNIEIDLDAGVALSEITSVYH------DIDK 262
Query: 263 LGYSYESEVLKW-----SNIDFDFSYT-VSPSHIFGGVLLQSPSLH-DVDQREMFCMY-- 313
++ LK +N DF +T VS S + Q H + D + Y
Sbjct: 263 HLVTHNRYDLKLVSDTPANRDFVLRWTPVSGSEPVAAIYTQQGQTHKEADNSKEASDYAL 322
Query: 314 --LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG 371
L+P + V +D+I ++D SGSM G+ + K A+ AL+ L DSFN++AFN
Sbjct: 323 LMLIPPEIGASSVIARDLILVIDTSGSMSGEAIVQAKKAMGYALAGLGARDSFNVIAFNS 382
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS------- 424
+ + S AT + + RA+Q+I A G T + LT+A++ +S
Sbjct: 383 DVHALSAQSLAATAKNIGRANQFIR-TLKADGGTEMGPALTRALDNGNHSTSHQDEEDFD 441
Query: 425 -----IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+ + +TDGAV +ER + + ++ ++ + R++T GIG+ N +F+ A
Sbjct: 442 SDGVRLKQVLFMTDGAVANERSLFNLIEDKIGHS-----RLFTIGIGAAPNSHFMERAAE 496
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLS 535
+G + L ++ +++ L + + +I + T+ D +P IPDL
Sbjct: 497 FGKGTFTYIGKLGEVQQKIESLLYKIEHPQVTDIELHYGDGTIPD-----HWPISIPDLY 551
Query: 536 SESPLIVSGRYQGKF----PDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQID 590
+ PL+V+ + + K L G +GD ++ L L K LD + A++QI
Sbjct: 552 ANEPLLVAIKMRPKIYQASASELIVSGMIGD-QDWHSSLPLNERKSAAGLDLVWARKQIA 610
Query: 591 LLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD 630
L R++++V+ + ++ ++ ++T ++ VET+
Sbjct: 611 ALEMSKDGVNQARVKKQVTALGLKYHLVSQHTSLVAVETN 650
>gi|300868930|ref|ZP_07113536.1| Vault protein inter-alpha-trypsin [Oscillatoria sp. PCC 6506]
gi|300333147|emb|CBN58728.1| Vault protein inter-alpha-trypsin [Oscillatoria sp. PCC 6506]
Length = 849
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 123/523 (23%), Positives = 221/523 (42%), Gaps = 74/523 (14%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL 234
NIFT +L I G + + +R++ L + G++ F FP V P + IP
Sbjct: 191 NIFTQSLANIKPGEQIEVTIRYTDSLKFEGGDYE----FVFPMVVGP--RYIPGTPIPPK 244
Query: 235 NVNAGTGTEVLC------------NTSSHH-------------LKQLRRDVGKLGYSYES 269
+ N+G T+ + NT S H + +R K+ +
Sbjct: 245 SPNSGVNTDRVPDASRINPPILPPNTRSGHDIGVSVEIDAGLAISNVRSTSHKIETAASG 304
Query: 270 EVLKWS--------NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTA- 319
+++ N D Y V+ VL Q+ D+R F +YL+P
Sbjct: 305 NIVRVQLGNLDTIPNKDLILRYRVAGDRTQATVLTQA------DKRGGHFALYLIPALQY 358
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
KS ++ KDV+F++D SGS QG+PL +K + ++ L+P D+F I+ F S +
Sbjct: 359 KSNEIVPKDVVFLMDTSGSQQGEPLVKSKELMRRFINGLNPNDTFTIIDFANTAKALSVA 418
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDE 439
T E ++A +I A G T + + + G + + L+TDG + +E
Sbjct: 419 PLANTAENRQKALAYIE-QLQANGGTELLNGVQAVMNFPAAESGRLRSVVLITDGYIGNE 477
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ ++ L G R+Y+FG+GS N + L LA + RG + + ++
Sbjct: 478 NEVISQVQRSLKPGN----RLYSFGVGSSVNRFLLNRLAEVGRGTSVVIRQDEPTQDLVE 533
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGF 559
K T + VL NI + ++ E+YP PDL + PL++ GR + ++ G
Sbjct: 534 KFLTEINNPVLTNIEMVWEGGGEKPEIYPLVAPDLFANQPLVLFGRKSDRANGQIRITGT 593
Query: 560 LGDLS--------NFVVELKLQLAKDIP-----------LDRICAKQQIDLLTAQAWFSE 600
+ NF+ ++L+ + P + ++ + +I L +Q + E
Sbjct: 594 VAGGKRYEKSVPVNFMAGVELRQRESNPPSAIADFGNPAIAQLWGRYRIKDLMSQMFGGE 653
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
L + V+ ++ +L +YT + V + R E GT++
Sbjct: 654 TTSLVDAVTNTALTYRLLSQYTAFVAVSEEVR---VEPDGTRR 693
>gi|117920853|ref|YP_870045.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. ANA-3]
gi|117613185|gb|ABK48639.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp.
ANA-3]
Length = 751
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 132/553 (23%), Positives = 243/553 (43%), Gaps = 78/553 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K Q+ +++G K AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKQQAKQIFEQAKSEG--KRASLVSQE--RPNMFTTEVANLAPDEELVVEISYQ 197
Query: 198 QKLSYRDGEFSVNVPFKF-PEYVTPAI-------KKI-------------PKRE------ 230
+ + Y DG FS+ P P Y+ P + +++ P R+
Sbjct: 198 ETIHYEDGLFSLRFPLVVAPRYI-PGLTLGGNNGERVTSSQVFDADRIIAPIRDANSEAD 256
Query: 231 ---KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSP 287
K + V G G + S +H + G+L + + V +N DF + +
Sbjct: 257 PVLKADIKVKLGEGVDKSAVVSPYHPITIDEKQGQLTVALANRVP--ANRDFVLQWRLKQ 314
Query: 288 -SHIFGGVLLQSPSLHDVDQRE------------------MFCMYLLPGTAKSRKV-FKK 327
+ G V Q+ H V Q + M L P S ++ +
Sbjct: 315 GTSPVGWVFNQAGKTH-VSQDDNASADTGPTGKSSNTDNYSLVMVLPPKVEASEQLNLPR 373
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
++I ++D SGSM G + KNAL AL L P DSFNI+ FN + L S + AT
Sbjct: 374 ELILVIDTSGSMAGDSIIQAKNALRYALRGLRPQDSFNIIEFNSDVSLLSPTPLPATASN 433
Query: 388 VERAHQWIGINFIAGGSTNICAPLT-----KAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
+ A Q++ A G T + L +A + S+ + +TDG+V +E +
Sbjct: 434 LAMARQFVN-RLQADGGTEMAQALNAALPRQAFNAASAEDKSLRQVIFMTDGSVGNESAL 492
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+ +++++ + R++T GIGS N +F++ A + RG + D+D +E ++ +L
Sbjct: 493 FELIRNQIGDN-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVEQKISQLL 547
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD 562
+ VL ++ + D + +P+ IPDL P+++S + + P L G G
Sbjct: 548 AKIQYPVLTDLQV-RFDDGSVPDYWPAPIPDLYRGEPVLISLKRHPREPQELVISGRQGH 606
Query: 563 LSNFVVELKLQ--LAKDIP-----LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQT 615
N+ L LQ +A D+ LD + A++QI L + D +++++V+ +S+
Sbjct: 607 -KNWQQSLSLQANVATDVAQSSAGLDLLWARKQIAALELSKNGANDDKVKQQVTALSLNY 665
Query: 616 GVLCEYTRMIIVE 628
++ YT ++ V+
Sbjct: 666 HLVSPYTSLVAVD 678
>gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c]
gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c]
Length = 701
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 129/532 (24%), Positives = 237/532 (44%), Gaps = 95/532 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT T+ I +SI + + Q + + + FS+ P P Y
Sbjct: 114 RPNLFTNTIANIGPNESVSITIEYQQVVGFDEQTFSLRFPMTITPRYSPNNATDKSTVTT 173
Query: 219 ---------VTPAIKKIPKREK----IHLNVNAGTGTEVLCN--TSSHHLKQLRRDVGKL 263
VT ++I ++ I L+V +G + + TS HH + + G+
Sbjct: 174 VNTQGWGQSVTAISQQIKTADEPANPIRLSVELDSGFALTADDITSEHHPINISQQ-GEK 232
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPS----------------HIFGGVLLQSPSLHDVDQR 307
Y E+ + + DF+ T P+ + +G V+L +P + D
Sbjct: 233 NSGYHIELAQEHIANQDFALTWQPALSDAPSAAHFSETQGKYRYGLVML-TPPVQDA--- 288
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
Y G A ++++ ++V+F++D SGSM G+ + K A+ AL++L P D+ NI+
Sbjct: 289 -----YHSTGGAVAQQMPSREVVFLLDTSGSMAGESIVQAKRAVDFALTQLRPEDNVNII 343
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR----- 422
FN AT + ++RA W+ + A G T + LT A+ + R
Sbjct: 344 QFNDAPQALWKRAMPATAKHIQRARNWVA-SLHADGGTEMAPALTLALNKPSLHRDDSDL 402
Query: 423 -GSIPI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
GS + + +TDG+V +E + ++S+L + R++T GIGS N YF+ A
Sbjct: 403 LGSHKLRQVVFITDGSVSNEDALMSLIESKLADN-----RLFTIGIGSAPNSYFMTQAAQ 457
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
RG + D+ ++ +M LF + V+ +I I+ + E E YPS IPDL + P
Sbjct: 458 AGRGTFTYIGDIQQVQHKMTALFNKLTRPVMQDIHIEFAR---ETEFYPSVIPDLYAAQP 514
Query: 540 LIVSGR---------YQGKFPDT-LKAKGFLG--DLSNFVVELKLQLA---KDIPLDRIC 584
L++ R Q + PD LK G+ +S+ +++ L+ K L
Sbjct: 515 LVIHYRVPVTHLENGMQLQQPDNALKVTGWQSARAVSSKPWSIRMPLSPTTKRAGLGVAW 574
Query: 585 AKQQIDLLTAQAWFSED--------KRLEEKVSKMSVQTGVLCEYTRMIIVE 628
A+++I L+ D K+ ++ ++ ++++ ++ ++T +I V+
Sbjct: 575 AREKITQLSHTLRRERDSEMRPMMRKKWQDGITNIALEHHIVSQFTSLIAVD 626
>gi|254413248|ref|ZP_05027019.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
gi|196179868|gb|EDX74861.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
Length = 1037
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 120/522 (22%), Positives = 213/522 (40%), Gaps = 60/522 (11%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV 219
AGK+A + + N+FT +L I G + + +R++ L + G++ F FP V
Sbjct: 458 AGKTAGLLEQE--RANVFTQSLANIKPGESIQVTIRYTDSLKFEGGDYE----FAFPMVV 511
Query: 220 TP--------AIKKIPKREKI---HLNVNAGTG--TEVLCNTSSHHLKQLRRDVGKLGYS 266
P K P + H + ++ T T+ + ++ + G S
Sbjct: 512 APRYTAGNSVGSAKAPTTNSVGSKHFSASSAKAPTTNKTLMTNVAYAAEVNPPIAPPGRS 571
Query: 267 -YESEVLKWSNIDFDFSYTVSPSH------------------------------IFGGVL 295
++ +V + S SPSH G
Sbjct: 572 GHDIDVTVEIDAGVPISSVRSPSHPVTTQQTSSTVRVELADQETIPNKDLILRYQVAGAD 631
Query: 296 LQSPSLHDVDQRE-MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
Q+ L D+R F YL+P + ++ KDV+F+VD SGS G P+ +K +
Sbjct: 632 TQATVLTQADERGGHFATYLIPAIEYQQNEIVPKDVVFLVDTSGSQSGSPIVQSKELMRQ 691
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
+ L+P D+F I+ F T S T + ++A +I A G T + +
Sbjct: 692 FIQGLNPQDTFTIIDFANSTTQLSDKPLANTPQNRKKALNYIN-RLDANGGTELMNGIDT 750
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
+ G + + L+TDG + D+ QI ++ RL G R+Y+FG+GS N +
Sbjct: 751 VLNFPAAPAGRLRSVVLLTDGLIGDDEQIIAEIRDRLKPGN----RLYSFGVGSSTNRFL 806
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
+ LA + RG +S E+ Q+ F + VL NI + + E YP ++ D
Sbjct: 807 IERLAELGRGTAEVVPPNESAEVVAQEFFQEINNPVLTNIQVSWEGTGNAPEFYPQKVRD 866
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKL---QLAKDIPLDRICAKQQID 590
L + PL+V GR + LK G + + L + ++ + + ++ + +I
Sbjct: 867 LFANQPLVVFGRKGDRTNGKLKISGTVAGGQPYETSLDVNFDEVRGNGAIAQLWGRARIK 926
Query: 591 LLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDER 632
L Q + E + E V+ ++ ++ +YT + V + R
Sbjct: 927 ALMNQMYGRETPEVVEAVTDTALNYRLMSKYTSFVAVTEEIR 968
>gi|383452422|ref|YP_005366411.1| hypothetical protein COCOR_00403 [Corallococcus coralloides DSM
2259]
gi|380727461|gb|AFE03463.1| hypothetical protein COCOR_00403 [Corallococcus coralloides DSM
2259]
Length = 839
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 124/505 (24%), Positives = 223/505 (44%), Gaps = 64/505 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA-IKKIPKREK 231
+PNIFT ++ I G + +R+ + ++L+Y G + F FP V P I P
Sbjct: 152 RPNIFTQSVANILPGETIRVRIHYVERLTYDAGTYR----FSFPMVVAPRYIGGTP---- 203
Query: 232 IHLNVNAGTGTE---------------VLCNTSSHHLKQL------------------RR 258
L G G E VL +T S H QL R
Sbjct: 204 --LPTRQGEGVEPDTTSVLDASRITPPVLTDTRSGHDIQLTVRLDAGLPVHSLRSTTHRV 261
Query: 259 DVGKLGYSYESEVL----KWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
DV + G S + L + N DF Y V+ + I VL+ D F + L
Sbjct: 262 DVKRDGQSRATVSLGRDDRIPNKDFILEYVVADALIRPAVLMHRDPGAD---HGYFLVML 318
Query: 315 LPGTAKS-RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
P + + R++ +++ ++D S S G +E +K L+ L P D+F ++ F+ +
Sbjct: 319 NPQLSPTQREIVPRELYMVLDTSCSQSGLAIEKSKAITKEVLTHLMPEDTFQVLNFDTQV 378
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
F+ + AT E ++ A ++ NF GG T++ +A+ + N + ++ +TD
Sbjct: 379 TKFAPTAVPATPENIQNALPYVA-NFWGGGGTDVRIAAQEAM-VPPNDPARLRMVLFMTD 436
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
G + + Q+ ++ L RI++ G+GS N Y + + + RG +L+
Sbjct: 437 GLIGGDEQVLSTLQQHLREET----RIFSAGVGSSTNRYLITKMGELGRG-ASTLVNLNR 491
Query: 494 IEIQMQKLF-TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
E + + F R VL ++ +DT L ++YP +PDL + PL + G++ G
Sbjct: 492 PEEDVAREFEQRMRGPVLTSVVVDT-DGLPVSDVYPKSVPDLFAGQPLFLVGKFTGTGDG 550
Query: 553 TLKAKGFL-GDLSNFVVELKL-QLAKDI-PLDRICAKQQIDLLTAQAWFSEDKRLEEKVS 609
L+ G + G + +F V + ++A + L + A+Q+I+ LT + + E + + ++
Sbjct: 551 VLRISGRVRGQVRHFDVPVHFPEVAPEHDSLKSLWARQRIEELTVEGYRGETPEVVQGIT 610
Query: 610 KMSVQTGVLCEYTRMIIVETDERNN 634
++Q ++ YT + VE R N
Sbjct: 611 DTALQYHLMSRYTSFVAVEQVARTN 635
>gi|410638873|ref|ZP_11349426.1| hypothetical protein GLIP_4020 [Glaciecola lipolytica E3]
gi|410141401|dbj|GAC16631.1| hypothetical protein GLIP_4020 [Glaciecola lipolytica E3]
Length = 688
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 122/517 (23%), Positives = 230/517 (44%), Gaps = 81/517 (15%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
GK AS+ +PN+F ++ I G + + + + Q ++Y + FSV +FP T
Sbjct: 144 GKKASLLVQQ--RPNLFKNSVANIGPGESIRVTIEYQQAVAYDEDTFSV----RFPTTTT 197
Query: 221 PAIKKIPK--------------------------------------REKIHLNVNAGTGT 242
P + +PK + K+ +N+ TG
Sbjct: 198 P--RYLPKVGLESDPQEIADSGWGMTLPIYKADQALNTQAATEPEPKHKVAINIRLNTGF 255
Query: 243 EVL-CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSL 301
EV N+ H + QL + G S + +++ +N DF S+ + Q +
Sbjct: 256 EVQNINSEFHPIHQLEKSPGAYEISLQQDMI--ANQDFVLSWQPESNQ-------QPKAA 306
Query: 302 HDVDQRE--MFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
H V++ ++ M +L P T ++V+F++D SGSM G ++ K+AL A+ +
Sbjct: 307 HFVERTRDNVYGMVMLMPPTTETESLSLPREVVFVIDTSGSMAGDSMQQAKDALNYAIQQ 366
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
L D+FN+V F+ + + A + A +++ N A G T + + L A+
Sbjct: 367 LPVEDTFNLVEFDSTARKMWPTAQAANAQNKSMAMEYV-TNLSADGGTEMLSALQLALGQ 425
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
I + +TDG+V +E + + ++ + R++T GIGS N +F+
Sbjct: 426 QAQLNERIRQVIFITDGSVGNEVTLFEYIQQNVQKS-----RLFTVGIGSAPNSFFMTEA 480
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A + +G + +D+++ +MQ LFT+ +L+++ ++ D+ E YP PDL
Sbjct: 481 ARMGKGTFTYIGSIDTVQERMQALFTKLTHPLLSDLLLNF---SDDVEYYPRNFPDLYKG 537
Query: 538 SPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQL-----AKDIPLDRICAKQQIDLL 592
PL+VS R + D L G+L N + L L AK LD + A+++I L
Sbjct: 538 EPLMVSYRTKN-LIDNLTVT---GNLKNQYWQKSLSLLNAGNAKG--LDVLWARRKITQL 591
Query: 593 TAQAWFSED-KRLEEKVSKMSVQTGVLCEYTRMIIVE 628
F D + +++ ++++ ++ + T ++ V+
Sbjct: 592 DRDKRFGHDYDEINKQIEQVAMTHHLVSDMTSLVAVD 628
>gi|254410148|ref|ZP_05023928.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
gi|196183184|gb|EDX78168.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
Length = 615
Score = 129 bits (325), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 135/587 (22%), Positives = 250/587 (42%), Gaps = 60/587 (10%)
Query: 83 LIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEI 142
+ PL+ VE + L RV T R ++ + P+ D+ ++ +E +I
Sbjct: 25 VFPLKHTEVEAKIAGNLS----RVEVTQRFENPF-TQPLEAVYVFPLPDEAAVDDMEIKI 79
Query: 143 SGKSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDGGSYLSIRLRWS 197
+ + E + A E T L + NIFT +L I G + + +R++
Sbjct: 80 GDRIIKGTIKKREEAQQIYEQAKQEGRTAGLLEQERDNIFTQSLANIKPGEEIDVTIRYT 139
Query: 198 QKLSYRDGEFSVNVPFKF-PEYV--------------------TPAIKKIPKREK----- 231
L++ G++ P P Y+ TP I +P+ +
Sbjct: 140 DSLNFEQGDYEFVFPMVVGPRYIPGTTIDESGDTDEVPDASRITPPI--VPEGTRSAHDI 197
Query: 232 -IHLNVNAGTGTEVLCNTSSHHLK-QLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSH 289
+ + ++AG + + ++ SH L+ + ++ ++ + + N D Y V+ +
Sbjct: 198 GVTVEIDAGVPIQNV-HSPSHQLQIETHGEIVRVQLGGDDTI---PNKDLILRYQVAGDN 253
Query: 290 IFGGVLLQSPSLHDVDQRE-MFCMYLLPG-TAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
VL QS DQR F +Y +P + ++F KDV+F++D SGS QG PL
Sbjct: 254 TQATVLTQS------DQRGGHFALYFIPALDYRPDQIFPKDVVFLMDTSGSQQGAPLHKC 307
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNI 407
+ + + L+P D+F I+ F+ S + T E E A +I A G+T +
Sbjct: 308 QELMRRFIQGLNPDDTFTILDFSNAVRHLSQTPLANTPENRELAINYIN-QLHASGATEM 366
Query: 408 CAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGS 467
+ + A+ + G + + L++DG + +E +I ++ L G R+Y+FG GS
Sbjct: 367 LSGIRAAINVPA-PEGRLRSVVLLSDGYIGNENRILAEVQQELKPGN----RLYSFGAGS 421
Query: 468 YCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMY 527
N + L +A I RG + E ++ F + VL I ++ + E+Y
Sbjct: 422 SVNRFLLNRIAEIGRGICRIIRHDEPTEKVVENFFRSINNPVLTTIELNWNGSGETPEIY 481
Query: 528 PSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAK--DIPLDRICA 585
P+ PDL ++ PL++ GR + + L+ G + L ++ + + ++
Sbjct: 482 PAIAPDLFAQQPLVLFGRKRDRMAGDLQINGTTAKGERYEKTFTLNFSETGNPAIAQLWG 541
Query: 586 KQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDER 632
+ +I L Q E K E V+ ++ +LC+YT + V D R
Sbjct: 542 RARIKDLMNQMVTYETKSGVEAVTDTALTYQLLCQYTAFVAVSDDVR 588
>gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor family protein [Shewanella oneidensis
MR-1]
gi|24348130|gb|AAN55237.1| inter-alpha-trypsin inhibitor family protein [Shewanella oneidensis
MR-1]
Length = 760
Score = 129 bits (325), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 135/557 (24%), Positives = 237/557 (42%), Gaps = 79/557 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K Q+ + GK AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKQAAQQIFE--QAKAEGKRASLVSQE--RPNMFTTEVANLAPNEKLVVEISYQ 197
Query: 198 QKLSYRDGEFSVNVPFKF-PEYV--------------------------TPAIKKIPKRE 230
+ + Y DG FS+ P P Y+ I+ P E
Sbjct: 198 ETIKYDDGVFSLRFPLVVAPRYIPGLTLGGESGQTGRVTSSQVFDADRIIAPIRGAPGEE 257
Query: 231 ----KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSN--IDFDFSYT 284
K + VN G G + S +H L G L S + V + + +
Sbjct: 258 DPVLKADIKVNLGLGVDKAAILSPYHAITLDEQQGGLTASLANRVRAERDFVLQWRLEQG 317
Query: 285 VSP-SHIFG--------------GVLLQ---SPSLHDVDQREMFCMYLLPGTAKSRKV-F 325
VSP + +F G ++ S S ++D + M L P S ++
Sbjct: 318 VSPVAWVFNQNGKTHQTQAASDDGSMVNTAASTSASNIDNYSL-VMVLPPKVEASGQLNL 376
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+++I ++D SGSM G + KNAL AL L DSFNI+ FN + L S AT
Sbjct: 377 PRELILVIDTSGSMAGDSIIQAKNALRYALRGLKAQDSFNIIEFNSDVSLLSPVPLPATA 436
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS-----IPIIFLVTDGAVEDER 440
E + A Q++ A G T + L A+ SR + + + +TDG+V +E
Sbjct: 437 ENLAIARQFVN-RLQADGGTEMSLALEAALPKQRPSRAASENNVLQQVIFMTDGSVGNEE 495
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+ + ++ ++ + R++T GIGS N +F++ A + RG + D+D +E ++ +
Sbjct: 496 ALFELIRHQIGDN-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVEQKINQ 550
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
L T+ VL ++ + D + +PS IPDL P+++S ++Q + P L G
Sbjct: 551 LLTKIQYPVLTDLQV-RFDDGTVPDYWPSPIPDLYRGEPVLISIKHQPREPKELVISGRQ 609
Query: 561 G--DLSNFVVELKLQLAKDIP--------LDRICAKQQIDLLTAQAWFSEDKRLEEKVSK 610
G + FV +L LD + A++QI L + D +++++V+
Sbjct: 610 GHKNWQQFVSLESTELNNAAAAVEQQVAGLDLLWARKQIAALELSKNGANDDKVKQQVTA 669
Query: 611 MSVQTGVLCEYTRMIIV 627
+S+ ++ YT ++ V
Sbjct: 670 LSMNYHLVSPYTSLVAV 686
>gi|427722039|ref|YP_007069316.1| serine/threonine protein kinase [Leptolyngbya sp. PCC 7376]
gi|427353759|gb|AFY36482.1| serine/threonine protein kinase [Leptolyngbya sp. PCC 7376]
Length = 1034
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 136/603 (22%), Positives = 249/603 (41%), Gaps = 53/603 (8%)
Query: 61 IVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
+V NPD+P + ++ D L + E+D + A V V T+ +
Sbjct: 339 VVQNPDLPPFPAGLYATIDGNEQ-NLTLKNTEVDAKVSGNIAEVEVIQTFANPY---DQP 394
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYH-------TQLIALGENDGAGKSASVETGSFLK 173
+ P+ D ++ +E +I ++ E GK+A++ +
Sbjct: 395 LEAIYKFPLPDDAAVDDMEIKIGNQTIRGVIKRSEEAEAIYEEAKEDGKTAALLEQE--R 452
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP------------ 221
NIFT +L I G + + +R+S L + ++ F FP V P
Sbjct: 453 DNIFTQSLANILPGEEIEVTIRYSNSLKFEGEDYE----FVFPMVVAPRYGDGNIQASSI 508
Query: 222 ----AIKKIPKR---EKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKW 274
+ K +PK + I +N+ G + TS H + G G S + +
Sbjct: 509 GGATSAKALPKNRSGQDISVNLEIDAGVPIQNLTSPTHEIATQSLSGSTGISL-ANADEI 567
Query: 275 SNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTAKSR-KVFKKDVIFI 332
N D Y V+ G Q+ L DQR F YL+P + ++ KDV+F+
Sbjct: 568 PNKDLIVRYQVT------GKETQATMLAQADQRGGHFATYLIPAVDYQKDEIVPKDVVFL 621
Query: 333 VDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAH 392
+D SGS +G + +K + + L+P D+F I+ F+ T S T+ ++A
Sbjct: 622 MDTSGSQRGAAIRQSKELMRQFIQGLNPQDTFTIMDFSNTTKKLSRVPLDNTETNRQKAL 681
Query: 393 QWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTN 452
+++ A G T + + + + G + I L+TDG + D+R I M+ +L
Sbjct: 682 KYVN-KISANGGTELMNGINEVMTFPEAEDGRLRTIVLLTDGLIGDDRTIIGMMEKQLKP 740
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLAN 512
G RI+TFG+G NH+ + LA + RG + + + + + F + VL N
Sbjct: 741 GN----RIFTFGVGYSTNHFLINRLAEVGRGESEILPESEDPKGVVAEFFEEINNPVLTN 796
Query: 513 IAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL--GDLSNFVVEL 570
+ + + D E+YP +PDL PL++ G+ + LK G + G V ++
Sbjct: 797 VEVSWVGGGDRPEIYPEHMPDLYDSQPLVLHGKKGDRQNGQLKVTGTIAGGKPYEKVFDV 856
Query: 571 KL-QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
Q+ + + ++ + +I ++ E V+ ++ ++ +YT + V
Sbjct: 857 NFDQVQGNGAIAQLWGRAKIKEYMNDMYWMETTEGVNTVTNTALDYRLVSDYTSFVAVSD 916
Query: 630 DER 632
+ R
Sbjct: 917 EVR 919
>gi|421591173|ref|ZP_16036073.1| hypothetical protein RCCGEPOP_19158 [Rhizobium sp. Pop5]
gi|403703432|gb|EJZ19664.1| hypothetical protein RCCGEPOP_19158 [Rhizobium sp. Pop5]
Length = 688
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 123/518 (23%), Positives = 231/518 (44%), Gaps = 68/518 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 70 RPNIFTNQVANIGPGEEIVVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 125
Query: 228 ----------------KREKIH----------------LNVNAGTGTEVLCNTSSHHLKQ 255
R+KI L VN G + SS H
Sbjct: 126 EFNNGAGFAAPSDPVENRDKIEAPVLDPRENARINPVALTVNLKAGFPLGDVKSSFHEVD 185
Query: 256 LRRDVGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
+R+D G + LK + DF+ ++ +P L + +V + +
Sbjct: 186 IRQD----GDQTRTISLKGDAVPADKDFELTWKAAPGKTPSAGLFR-----EVTDGKTYL 236
Query: 312 MYLLPGTAKSRKVF--KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
+ + K++V+F++D SGSM G+ +E + +LA+A+S+L+P D FN++ F
Sbjct: 237 LAFVTPPTAPDAAAPPKREVVFVIDNSGSMSGQSIEQARQSLALAISRLNPDDRFNVIRF 296
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPII 428
+ + + AT + E+A ++ A G T + L A+ + G++ +
Sbjct: 297 DNTMTDYFKGLVAATPDNREKAIAYVN-GLTADGGTEMLPALEDALRNQGPVAAGALRQV 355
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
+TDGA+ +E+Q+ + + N G R++T GIGS N YF+ A I RG +
Sbjct: 356 VFLTDGAIGNEQQLFQEIAA---NRGDA--RVFTVGIGSAPNTYFMTKAAEIGRGTFTEI 410
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-Q 547
D + +M +LF + + + +IA + + + ++ P+ +PDL S P++++ Q
Sbjct: 411 GSTDQVASRMGELFAKLQNPAMTDIAA-SFEGITAEDITPNPMPDLYSGEPVVLTAELPQ 469
Query: 548 GKFPDTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAW-FSEDKRLE 605
K L+ G GD + VE+ + A D + ++ A+++ID A+A+ E L+
Sbjct: 470 VKPAGKLQIVGKTGD-QPWRVEMDIANAADGTGISKLWARRKIDDFEARAYERQEPAALD 528
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ + +++ ++ T ++ V+ A+E G+ K
Sbjct: 529 KDIETVALAYHLVSRVTSLVAVDVTPSRPANEPLGSAK 566
>gi|209550318|ref|YP_002282235.1| Vault protein inter-alpha-trypsin domain-containing protein
[Rhizobium leguminosarum bv. trifolii WSM2304]
gi|209536074|gb|ACI56009.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium
leguminosarum bv. trifolii WSM2304]
Length = 794
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 121/516 (23%), Positives = 229/516 (44%), Gaps = 63/516 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P + +
Sbjct: 169 RPNIFTNQVANIGPGEEIVVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 224
Query: 233 HLNVNAGTGT--EVLCN-----------------------------------TSSHHLKQ 255
N AG T + L N SS H
Sbjct: 225 EFNNGAGFATPRDPLDNREKIEAPALDPRENAKINPVSLTVDLKAGFPLGEVKSSFHEVD 284
Query: 256 LRRDVGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
+R+D G + LK + DF+ ++ +P + L + + D +
Sbjct: 285 IRQD----GDQERTISLKGDAVPADKDFELTWQAAPGKLPSAGLFRE--VKDGKTCLLAF 338
Query: 312 MYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG 371
+ + K++V+F++D SGSM G +E K +LA+A+S+L P D FN++ F+
Sbjct: 339 VTPPTAPDAAAPPAKREVVFVIDNSGSMSGPSIEQAKQSLALAISRLTPNDRFNVIRFDD 398
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFL 430
+ + AT + E+A ++ A G T + L A+ + G++ +
Sbjct: 399 TMTDYFKGLVAATPDNREKAIAYVR-GLPADGGTEMLPALEDALRNQGPVATGALRQVVF 457
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
+TDGA+ +E+Q+ + + + R++T GIGS N YF+ A I RG +
Sbjct: 458 LTDGAIGNEQQLFQEITANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTQIGS 512
Query: 491 LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF 550
D + +M +LF + + + +IA + + + ++ P+ +PDL S P++++ G
Sbjct: 513 TDQVASRMGELFAKLQNPAMTDIAAN-FEGIAAEDITPNPMPDLYSGEPVVLTAELPGDK 571
Query: 551 P-DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDKR-LEEK 607
P L+ G GD + V++ + A D + ++ A+++ID A+A+ +D L++
Sbjct: 572 PAGKLEIIGKTGD-QPWRVQMDIANAADGNGISKLWARRKIDDFEARAYERQDPAGLDKD 630
Query: 608 VSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ +++ ++ T ++ V+ ASE G+ K
Sbjct: 631 IETVALAHHLVSRLTSLVAVDVTPSRPASEPFGSAK 666
>gi|440224863|ref|YP_007338259.1| von Willebrand factor type A domain-containing protein [Rhizobium
tropici CIAT 899]
gi|440043735|gb|AGB75713.1| von Willebrand factor type A domain-containing protein [Rhizobium
tropici CIAT 899]
Length = 752
Score = 129 bits (323), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 118/468 (25%), Positives = 213/468 (45%), Gaps = 57/468 (12%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF----- 215
GK AS+ +PNIFT T+ I + +++ + Q + DG FS+ VP
Sbjct: 157 GKKASLVEQQ--RPNIFTNTVANIGPHEKIVVQIEYQQSVRLADGRFSLRVPLVVAPRHN 214
Query: 216 PEYVTPAIKK----------------------------IPKREK---IHLNVNAGTGTEV 244
P+ +P ++ +P ++ + L VN G +
Sbjct: 215 PQDASPITQQADLKAGWGHFKQSSSREVSNQPISTPLMVPGGDRANPVTLKVNLNAGFPL 274
Query: 245 LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDV 304
S +H ++ + V G + ++ DF ++ S++ L + V
Sbjct: 275 DDVKSLYHAVKIDK-VNDAGRRITLDGDATADRDFVLEWSAIASNMPSVGLFR----EHV 329
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
+ + Y++P + + ++V+F++D SGSM G +E + +L ALS+L+P D F
Sbjct: 330 GKDDYMLAYVMPPAVATPQKAGREVVFVIDNSGSMGGTSIEQARASLDYALSRLEPNDRF 389
Query: 365 NIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS 424
N++ F+ F +AT E + A +++ A G T + PL A++ + G
Sbjct: 390 NVIRFDNTMTKFFPDSVMATAENIASARRFV-TGLEAAGGTEMLPPLQAALDDSHQANGL 448
Query: 425 IPIIFLVTDGAVEDERQICDAM-KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
++FL TDG V +E+Q+ DA+ KSR G S RI+ GIGS N Y + A ++RG
Sbjct: 449 RQVVFL-TDGEVSNEQQLLDAIAKSR---GRS---RIFMVGIGSAPNTYLMSRAAELARG 501
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
+ + + +M+ LF + + + ++A + + P+ +PD+ PL+++
Sbjct: 502 SFTHIGSVAEVNERMRALFDKLENPAVTDVAAAFSE--KNVSLSPNLLPDIYHGEPLVLA 559
Query: 544 GRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQID 590
R GK TL G +GD + + L L A + + +I A++QID
Sbjct: 560 ARM-GKAAGTLTVSGKIGD-RPWTISLPLDQASNAKGISKIWARRQID 605
>gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain-containing protein [Glaciecola
sp. 4H-3-7+YE-5]
gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 777
Score = 129 bits (323), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 132/590 (22%), Positives = 251/590 (42%), Gaps = 111/590 (18%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PN+FT + I +S+ + + Q + + FSV P +P +IP
Sbjct: 197 RPNLFTNAIANIGPNETISVTIEYQQVIRFDQQTFSVRFPMTITPRYSPRASQIPLINQT 256
Query: 228 -----------------KRE-----------KIHLNVNAGTGTEVLCNTSSHHLKQLRRD 259
R+ +I + +N+G S HH +
Sbjct: 257 AVAKVNSNGWGESVSVLSRQIKTPDEPANPIRISVELNSGFSISPEDIVSEHHPIDITPK 316
Query: 260 VGKLGYSYESEVLKWSNIDFDFSYTVSPS----------------HIFGGVLLQSPSLHD 303
+ G+ Y+ E+ K + DFS P+ + + V+L P D
Sbjct: 317 DQQAGH-YQIELTKEHIANQDFSLRWKPTVGDIPSAAHFSETIGNYRYAMVMLTPPRQDD 375
Query: 304 VDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
D + + V ++V+F++D SGSM G+ + K A+ AL++L P DS
Sbjct: 376 ADDK-----------STKTPVSAREVVFLLDTSGSMAGESIVQAKRAVDFALTQLHPEDS 424
Query: 364 FNIVAFN-GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
N++ FN L++ +M AT ++RA W+ + A G T + L+ A+ +
Sbjct: 425 VNVIEFNDAPQALWNLAMP-ATANNIQRARNWVA-SLSANGGTEMAPALSMALHKTNLEQ 482
Query: 423 GSI----PI----IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+I P+ + +TDG+V +E + ++++L + R++T GIGS N YF+
Sbjct: 483 QNINEGSPVQLRQVVFITDGSVSNEDALMSLIENQLADS-----RLFTIGIGSAPNSYFM 537
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
A RG + D++ ++ +M +LF + V+ +I I+ +D E YPS IPDL
Sbjct: 538 TQAAQAGRGTFTYIGDINQVQQKMTELFNKLTRPVMQDIHIEFARDT---EFYPSVIPDL 594
Query: 535 SSESPLIVSGR---------YQGKFPDT-LKAKGFLGDLSN----FVVELKLQ-LAKDIP 579
+ P+++ R Q + PD +K G+ ++ + + + L AK
Sbjct: 595 YQDQPVVIHYRTPVAHLENGMQLQQPDNAVKVTGWQTTNTSPSKPWSIRMPLSPTAKSAG 654
Query: 580 LDRICAKQQIDLLTAQAWFSED----KRLEEKVSKMSVQTGVLCEYTRMIIVE---TDER 632
+ A+++I+ L+ D + + +++++++ ++ ++T +I V+ T+ +
Sbjct: 655 MGVAWAREKIEQLSHTLRRERDITMRQTWQRSITEIALEHHMVSQFTSLIAVDEQVTNLK 714
Query: 633 NNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFP 682
N E +K K ++ P + + + I G L TA N P
Sbjct: 715 NTKPEHENSKN--------KPINHNAPAGLAMAGVQIR-GQLPQTATNTP 755
>gi|254414399|ref|ZP_05028165.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
gi|196178629|gb|EDX73627.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
Length = 801
Score = 129 bits (323), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 152/667 (22%), Positives = 286/667 (42%), Gaps = 89/667 (13%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLP 182
P+ D+ ++ +E +I ++ ++ E + A E T L + NIFT +L
Sbjct: 103 PLPDEAAVYDMEIKIGDRTIKGEIKKSEEAQAIYEQARQEGRTAGLLEQERANIFTQSLA 162
Query: 183 QIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV--------------------TP 221
I G + + +R++ L ++ G++ P P Y+ TP
Sbjct: 163 NIKPGEQIDVTIRYTDSLKFQGGDYEFVFPMVVGPRYIPGTPMDNSGNTNQVPDASRITP 222
Query: 222 AIKKIPKREKIHLNV----NAGTGTEVLCNTSSHHLKQLRR-DVGKLGYSYESEVLKWSN 276
+ K R +NV NAG + + + SH ++ RR + ++ E + N
Sbjct: 223 PVLKPGTRSGHDINVTVEINAGVPIQDV-RSPSHQIRTQRRGETVQVQLGGEDTI---PN 278
Query: 277 IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPG-TAKSRKVFKKDVIFIVD 334
D Y V+ G Q+ L D+R F +YL+P + ++ KDV+F++D
Sbjct: 279 KDLILRYQVA------GDRTQATILSQTDERGGHFALYLIPALDYRPDEIVPKDVVFLMD 332
Query: 335 ISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERA-HQ 393
SGS +G+P ++ + ++ L+P D+F I+ F +T+ +L+ + A ++
Sbjct: 333 TSGSQRGEPFLKSQELMRRFINGLNPDDTFTIIDF------ANTTTQLSPQPLANTAPNR 386
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP----IIFLVTDGAVEDERQICDAMKSR 449
+N+I G N L + + N S P + L+TDG + +E Q+ ++ +
Sbjct: 387 TQALNYINGLQANGGTELMNGIRAVLNFPPSAPNRLRSVVLLTDGYIGNESQVIAEVQRQ 446
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS- 508
L G R+Y+FG+GS N + L LA + RG D ++ + F R ++
Sbjct: 447 LKPGN----RLYSFGVGSSVNRFLLNRLAEVGRG-TAKIIRQDEPTQEVAETFARQINNP 501
Query: 509 VLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVV 568
VL +I + + ++YP PDL + PL++ GR + T+K G +
Sbjct: 502 VLTDIRVSWQGEGTSAQIYPLAAPDLFANQPLVLLGRKPDRRNGTIKITGMTAGSQRYEK 561
Query: 569 ELKLQL--AKDIP-LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMI 625
L + + D P + ++ + +I L Q + E E V++ ++ +L +YT +
Sbjct: 562 TLSVNFDQSSDNPAIAQLWGRARIKDLMNQMFGGETTSGVEAVTQTALNYNLLSQYTAFV 621
Query: 626 IVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFG-----DLTATAEN 680
V + R E GT++ + + E+P+ + + + FG +++ TA
Sbjct: 622 AVSEEVR---VEPDGTRRRVQ-------VPVELPEGVSYEGI---FGTDDVMNMSGTANF 668
Query: 681 FP-PGSEIPKLPEV------AEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIV-L 732
P P IP + EI +++ S ++ G+ + IE ++ I L
Sbjct: 669 APAPSGMIPLSRQAGGTRGGGEILLESDSVDVTSQGSPQLTVVKIEGLEPEQEENAIASL 728
Query: 733 TQLCTAL 739
TQ +L
Sbjct: 729 TQHLQSL 735
>gi|410640327|ref|ZP_11350863.1| hypothetical protein GCHA_1088 [Glaciecola chathamensis S18K6]
gi|410140152|dbj|GAC09050.1| hypothetical protein GCHA_1088 [Glaciecola chathamensis S18K6]
Length = 777
Score = 128 bits (322), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 129/587 (21%), Positives = 249/587 (42%), Gaps = 105/587 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PN+FT + I +S+ + + Q + + FSV P +P +IP
Sbjct: 197 RPNLFTNAIANIGPNETISVTIEYQQVIRFDQQTFSVRFPMTITPRYSPRASQIPLINQT 256
Query: 228 -----------------KRE-----------KIHLNVNAGTGTEVLCNTSSHHLKQLRRD 259
R+ +I + +N+G S HH +
Sbjct: 257 AVAKVNSNGWGESVSVLSRQIKTPDEPANPIRISVELNSGFSISPEDIVSEHHPIDITPK 316
Query: 260 VGKLGYSYESEVLKWSNIDFDFSYTVSPS----------------HIFGGVLLQSPSLHD 303
+ G+ Y+ E+ K + DFS P+ + + V+L P D
Sbjct: 317 DQQAGH-YQIELTKEHIANQDFSLRWKPTVGDIPSAAHFSETIGNYRYAMVMLTPPRQDD 375
Query: 304 VDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
D + + V ++V+F++D SGSM G+ + K A+ AL++L P DS
Sbjct: 376 ADDK-----------STKTPVSAREVVFLLDTSGSMAGESIVQAKRAVDFALTQLHPEDS 424
Query: 364 FNIVAFN-GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
N++ FN L++ +M AT ++RA W+ + A G T + L+ A+ +
Sbjct: 425 VNVIEFNDAPQALWNMAMP-ATANNIQRARNWVA-SLSANGGTEMAPALSMALHKTNLEQ 482
Query: 423 GSI----PI----IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
G+ P+ + +TDG+V +E + ++++L + R++T GIGS N YF+
Sbjct: 483 GNTNEGSPVQLRQVVFITDGSVSNEDALMGLIENQLADS-----RLFTIGIGSAPNSYFM 537
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
A RG + D++ ++ +M +LF + V+ +I I+ ++ E YPS IPDL
Sbjct: 538 TQAAQAGRGTFTYIGDINQVQQKMTELFNKLTRPVMQDIHIEFARNT---EFYPSVIPDL 594
Query: 535 SSESPLIVSGR---------YQGKFPDT-LKAKGFLGDLSN----FVVELKLQ-LAKDIP 579
+ P+++ R Q + PD +K G+ ++ + + + L AK
Sbjct: 595 YQDQPVVIHYRAPVAHLENGMQLQQPDNAVKVTGWQTTNASPSKPWSIRMPLSPTAKSAG 654
Query: 580 LDRICAKQQIDLLTAQAWFSED----KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ A+++I+ L+ D + + +++++++ ++ ++T +I V+ N+
Sbjct: 655 MGVAWAREKIEQLSHTLRRERDITMRQTWQRSITEIALEHHMVSQFTSLIAVDEQVTNSK 714
Query: 636 SESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFP 682
P ++ K ++ P + + + I G L TA N P
Sbjct: 715 DAKP-----EHENSKNKPINHNAPAGLAMAGVQIR-GQLPQTATNTP 755
>gi|417109673|ref|ZP_11963334.1| putative vault protein inter-alpha-trypsin domain-containing
protein [Rhizobium etli CNPAF512]
gi|327188854|gb|EGE56047.1| putative vault protein inter-alpha-trypsin domain-containing
protein [Rhizobium etli CNPAF512]
Length = 794
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 119/513 (23%), Positives = 228/513 (44%), Gaps = 57/513 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 169 RPNIFTNQVANIGPGEEIIVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIIQTV 224
Query: 228 ----------------KREKIH----------------LNVNAGTGTEVLCNTSSHHLKQ 255
REKI L VN G + SS H
Sbjct: 225 EFNNGAGFATPRDPVENREKIEASVLDPSENAKINPVSLTVNLKAGFPLGDVNSSFHAVD 284
Query: 256 LRRDVGKL-GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
+R+D + S + + + ++ DF+ ++ +P L + + D + +
Sbjct: 285 IRQDSDQARTISLKGDAVP-ADKDFELTWKAAPGKTPSAGLFRE--VKDGKTYLLAFVTP 341
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY 374
+ K++V+F++D SGSM G+ +E + +LA+A+S+L D FN++ F+
Sbjct: 342 PTAPDAAAAPTKREVVFVIDNSGSMSGQSIEQARQSLALAISRLSKDDRFNVIRFDDTMT 401
Query: 375 LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFLVTD 433
+ + A+ + E+A ++ A G T + L A+ + G++ + +TD
Sbjct: 402 DYFNGLVAASPDNREKAITYV-RGLTADGGTEMLPALEDALRNQGPVASGALRQVVFLTD 460
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
GA+ +E+Q+ + + + R++T GIGS N YF+ A I RG + A D
Sbjct: 461 GAIGNEQQLFQEISANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQ 515
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP-D 552
+ +M +LF + + + +IA T + ++ P+ +PDL S P++++ P
Sbjct: 516 VASRMGELFAKLQNPAMTDIAA-TFEGTQAEDITPNPMPDLYSGEPVVLTAELPRDKPAG 574
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLEEKVSK 610
L+ G GD + VE+ + A D + ++ A+++ID A+A+ +D L++ +
Sbjct: 575 KLQIVGMTGD-QPWRVEMDIGHAADGNGISKLWARRKIDDFEARAYQRQDPAALDKDIET 633
Query: 611 MSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+++ ++ T ++ V+ A+E G K
Sbjct: 634 VALAHHLVSRVTSLVAVDVTPSRPANEPLGAAK 666
>gi|398810082|ref|ZP_10568913.1| von Willebrand factor type A (vWA) domain protein containing
protein [Variovorax sp. CF313]
gi|398083998|gb|EJL74699.1| von Willebrand factor type A (vWA) domain protein containing
protein [Variovorax sp. CF313]
Length = 690
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 148/609 (24%), Positives = 264/609 (43%), Gaps = 68/609 (11%)
Query: 72 PYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGD 131
PY + D PS+ L + E+ V A V V+ T+R G ++ + + P
Sbjct: 49 PYFFVKSDDPSVDRLPLKGTEVSVKISGVIADVTVTQTYRNE---GQRAIEAKYVFPGST 105
Query: 132 QGSILGVEAEISGKSYHTQLI----ALGENDGAGKSASVETGSFLK---PNIFTLTLPQI 184
+ ++ G+ ++ + Q+ A E D A K +T + L+ PN+F + + I
Sbjct: 106 KAAVSGLNVRLADRLITAQIREKQQAQIEYDTAKKEG--KTAALLEQHLPNVFQMNVANI 163
Query: 185 DGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLN--VNAGTGT 242
G + + LR+++ L + G + F FP V P P+ E
Sbjct: 164 LPGDDVKVELRYTELLVPQSGNYQ----FVFPTVVGPRYNS-PQSENAQAKWVAQPTLAA 218
Query: 243 EVLCNTS---------SHHLKQLRR-----DVGKLGYSYESEVLKWS------NIDFDFS 282
V NTS LK++R DV K ++++ + N DF
Sbjct: 219 GVAPNTSFKLKASIDTPMGLKEIRSATHAIDVKKSDEDQHADIVLTADGRPADNRDFVLD 278
Query: 283 YTVSPSHIFGGVLLQSPSLHDVD-QREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSM 339
Y ++ I G++L + D E F + ++ P + + +D IF+VDISGSM
Sbjct: 279 YRLAGEKIESGLMLYKGQGQNSDGSAENFFLAMVEPPKAVAANAISPRDYIFVVDISGSM 338
Query: 340 QGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINF 399
G PL+ K L + L P D+FN++ F+G + S AT+ +E+A I N+
Sbjct: 339 HGFPLDTAKTVLERLIGGLRPSDTFNVLLFSGSNRMLSPKSVPATRANIEQALATIQ-NY 397
Query: 400 IAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPR 459
GST + P K V + +VTDG V ER+ + +++ L+
Sbjct: 398 SGSGSTELI-PALKRVYAEPKEDSVSRTVVVVTDGYVTVEREAFELVRNNLSKAN----- 451
Query: 460 IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL-DSIEIQMQKL-FTRGFSSVLANIAIDT 517
++ FGIGS N ++ I+R G + + D ++ Q F R + + T
Sbjct: 452 VFAFGIGSSVNR---SLMEGIARAGMGEPFIITDPVQAPEQAARFRRMVEAPVLTSVKAT 508
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL---GDLSNFVVELKLQL 574
LD +++ P +PD+ E P+IV G+++G +AKG + G +N ++++
Sbjct: 509 FGGLDVYDVEPQALPDVLGERPVIVFGKWRG------EAKGRVIIEGRSANGPYRQEVRI 562
Query: 575 AKDI-----PLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
+ L + A+ +I L+ Q +E+++++ ++ +L +YT I V+
Sbjct: 563 DERTRHDTAALRTLWARHRIQSLSDQETLEGGVAFKERITELGLKYSLLTQYTSFIAVDK 622
Query: 630 DERNNASES 638
RN A ++
Sbjct: 623 VVRNAAPQN 631
>gi|212557353|gb|ACJ29807.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3]
Length = 710
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 140/560 (25%), Positives = 240/560 (42%), Gaps = 70/560 (12%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E EI K+ L E GK AS+ +PNIF+ + + G L + L +
Sbjct: 130 IEGEIKPKAKANAL--FNEAKKQGKKASLLQQK--RPNIFSAEVANLAPGETLIVELNYQ 185
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL-NVNAGTGTEV------LCNTSS 250
+ + Y +GEFS +FP V P K P+ EK L N+ A + V L + S
Sbjct: 186 ELVHYDNGEFS----LRFPMVVAPRYK--PRGEKSALSNMAAEVNSYVASTLGDLQGSES 239
Query: 251 HHLKQLRRDV--------GKLGYSYES-EV-----------LKWSNIDFDFSYTVSP--- 287
+ + +V G++ Y EV L + + DF P
Sbjct: 240 ERVNLVDIEVTLDAGMPIGEINSPYHQIEVSSNGDSQAQIQLTAAKANSDFVLNWRPIVG 299
Query: 288 SHIFGGVLLQSPSLHDVD--------QREMFCMYLLPGTAKSR--KVFKKDVIFIVDISG 337
S + Q H D Q + + LLP K R + +++I ++D SG
Sbjct: 300 SAPKAAIFSQQGKTHVSDLESKATAAQPQYSLVMLLPPQDKMRLSALAPRELILVIDTSG 359
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGI 397
SM G+ +E K ++ AL+ L DSFNI+ FN Y S + A+ + + RA ++
Sbjct: 360 SMSGEAIEQAKASIIYALAGLSAQDSFNILQFNSNVYALSDTPLNASAKNIGRAQAYVQ- 418
Query: 398 NFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSIC 457
A G T + L KA+ +R + + +TDGAV +E Q+ ++++L
Sbjct: 419 RLQANGGTEMSLALDKALSQQDANRERLRQVLFITDGAVGNEPQLFTQIRNQLQQS---- 474
Query: 458 PRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT 517
R++T GIG N +F++ A + RG Y ++ +M + + + ++ +
Sbjct: 475 -RLFTIGIGDAPNAHFMQRAAELGRGTYTYIGKQSEVKSKMVAMLDKLEKPTVTDVEVH- 532
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD---LSNFVVELKLQL 574
D + +P+ IPDL + P++V+ + L G L + V+E
Sbjct: 533 FADGSVPDYWPASIPDLYAHEPIMVAMKLPSFSDKELVVSGQLAGQYWQQSLVIE---NS 589
Query: 575 AKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET----- 629
A+ LD I A++QI L + R+E++V+ +S+ V+ YT ++ ++
Sbjct: 590 AEAKGLDLIWARKQIAALELSKEPANRDRIEKQVTAISLNYHVMSAYTSLVAIDKTPVRP 649
Query: 630 --DERNNASESPGTKKGSKK 647
D+ N P T KG ++
Sbjct: 650 IGDKLVNGRVLPHTPKGWQR 669
>gi|120599090|ref|YP_963664.1| cell wall anchor domain-containing protein [Shewanella sp. W3-18-1]
gi|120559183|gb|ABM25110.1| LPXTG-motif cell wall anchor domain [Shewanella sp. W3-18-1]
Length = 757
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 133/555 (23%), Positives = 238/555 (42%), Gaps = 80/555 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K+ Q+ + GK AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKAEAKQIFQ--QAKAEGKRASLVSQE--RPNMFTTDVANLAPNEQLVVEISYQ 197
Query: 198 QKLSYRDGEFSVNVP--------------------------------FKFPEYVTPAIKK 225
+ + Y DG FS+ P F V P +
Sbjct: 198 ENIKYEDGLFSLRFPLVVAPRYIPGFIHDKASANDNAANSRVTSSEVFDAERIVAPLRRA 257
Query: 226 IPKREKI---HLNVNAGTGTEVLCNTSSHHLKQLRRDV---GKLGYSYESEVLKWSNIDF 279
K++ + ++ V G + S +H Q++ D G + S + V+ +N DF
Sbjct: 258 NSKQDPVLNANIEVRLAKGVDKSTIASPYH--QIKLDEPHHGIINVSLTNSVV--ANRDF 313
Query: 280 DFSYT----VSP----------SHIFGGVLLQSPSLHDVDQREMFCMYLLPGT-AKSRKV 324
+ +SP +H G D M L P T +
Sbjct: 314 VLQWRAKQGMSPMALVFNQQGKTHGDGASEDNVSENRQSDDHYSLVMVLPPKTDEHALST 373
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
+++I ++D SGSM G + K+AL AL+ L DSFNI+ FN E S + A
Sbjct: 374 LPRELILVIDTSGSMAGDSIVQAKSALLYALNGLKAEDSFNIIEFNSELTQLSPTSLPAN 433
Query: 385 KEAVERAHQWIGINFIAGGSTNIC----APLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
+ + RA Q+I A G T + A L + + L+ S S+ + +TDG+V +E+
Sbjct: 434 QTHLARARQFIH-RLQADGGTEMALALNAALPRGINRLSESSQSLRQVIFMTDGSVGNEQ 492
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+ D ++ ++ R++T GIGS N +F++ A + RG + ++D +E ++ K
Sbjct: 493 ALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGNVDEVEQKISK 547
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
L ++ VL +I + D + +PS IPDL P++VS + + P L G
Sbjct: 548 LLSKIQYPVLTDINV-RFDDGGVPDYWPSPIPDLYRGEPVVVSLKRSEREPQELVISGRQ 606
Query: 561 GDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSV 613
G N+ L L+ +D LD + A++QI L + D +++++V+ +S+
Sbjct: 607 GH-KNWQQSLSLKDSSDGLITNQDAGLDLLWARKQIAALELSKNGANDDKVKQQVTALSM 665
Query: 614 QTGVLCEYTRMIIVE 628
++ YT ++ V+
Sbjct: 666 NYHLVSPYTSLVAVD 680
>gi|146292839|ref|YP_001183263.1| cell wall anchor domain-containing protein [Shewanella putrefaciens
CN-32]
gi|145564529|gb|ABP75464.1| LPXTG-motif cell wall anchor domain [Shewanella putrefaciens CN-32]
Length = 757
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 133/554 (24%), Positives = 238/554 (42%), Gaps = 78/554 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K+ Q+ + GK AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKAEAKQIFQ--QAKAEGKRASLVSQE--RPNMFTTDVANLAPNEQLVVEISYQ 197
Query: 198 QKLSYRDGEFSVNVP--------------------------------FKFPEYVTPAIKK 225
+ + Y DG FS+ P F V P +
Sbjct: 198 ENIKYEDGLFSLRFPLVVAPRYIPGFIHDKASANDNAANSRVTSSEVFDAERIVAPLRRA 257
Query: 226 IPKREKI---HLNVNAGTGTEVLCNTSSHHLKQLRRDV---GKLGYSYESEVLKWSNIDF 279
K++ + ++ V G + S +H Q++ D G + S + V+ +N DF
Sbjct: 258 NSKQDPVLNANIEVRLAKGVDKSTIASPYH--QIKLDEPHHGIINVSLTNSVV--ANRDF 313
Query: 280 DFSYT----VSP----------SHIFGGVLLQSPSLHDVDQREMFCMYLLPGT-AKSRKV 324
+ +SP +H G D M L P T +
Sbjct: 314 VLQWRAKQGMSPMALVFNQQGKTHGDGASEDNVSENRQSDDHYSLVMVLPPKTDEHALST 373
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
+++I ++D SGSM G + K+AL AL+ L DSFNI+ FN E S + A
Sbjct: 374 LPRELILVIDTSGSMAGDSIVQAKSALLYALNGLKAEDSFNIIEFNSELTQLSPTSLPAN 433
Query: 385 KEAVERAHQWIGINFIAGG---STNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
+ + RA Q+I GG S + A L + + L+ S S+ + +TDG+V +E+
Sbjct: 434 QTHLARARQFIHRLQADGGTEMSLALNAALPRGINRLSESSQSLRQVIFMTDGSVGNEQA 493
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ D ++ ++ R++T GIGS N +F++ A + RG + ++D +E ++ KL
Sbjct: 494 LFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGNVDEVEQKISKL 548
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG 561
++ VL +I + D + +PS IPDL P++VS + + P L G G
Sbjct: 549 LSKIQYPVLTDINV-RFDDGGVPDYWPSPIPDLYRGEPVVVSLKRSEREPQELVISGRQG 607
Query: 562 DLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQ 614
N+ L L+ +D LD + A++QI L + D +++++V+ +S+
Sbjct: 608 H-KNWQQSLSLKDSSDGLITNQDAGLDLLWARKQIAALELSKNGANDDKVKQQVTALSMN 666
Query: 615 TGVLCEYTRMIIVE 628
++ YT ++ V+
Sbjct: 667 YHLVSPYTSLVAVD 680
>gi|114047772|ref|YP_738322.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-7]
gi|113889214|gb|ABI43265.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp.
MR-7]
Length = 755
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 130/556 (23%), Positives = 240/556 (43%), Gaps = 80/556 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K Q+ + +G K AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKQQAKQIFEQAKAEG--KRASLVSQE--RPNMFTTEVANLAPDEELVVEISYQ 197
Query: 198 QKLSYRDGEFSVNVPFKF-PEYVTPAI-------KKI-------------PKRE------ 230
+ + Y DG FS+ P P Y+ P + +++ P R+
Sbjct: 198 ETIHYEDGLFSLRFPLVVAPRYI-PGLTLGGNNDERVTSSQVFDADRIIAPIRDASSETD 256
Query: 231 ---KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSP 287
K + V G G + S +H ++ G+L + + V +N DF + +
Sbjct: 257 PVLKADIKVKLGEGVDKSAVVSPYHPIRIDEKQGQLTAALANRVP--ANRDFVLQWRLKQ 314
Query: 288 -SHIFGGVLLQSPSLHDVDQREM-----------------FCMYLLPGTAKSRKV-FKKD 328
+ V Q+ H M L P S ++ ++
Sbjct: 315 GTSPVAWVFNQTGKTHTTQDDNASADTGPTGNSSNTDNYSLVMVLPPKVEASEQLNLPRE 374
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
+I ++D SGSM G + KNAL AL L P DSFNI+ FN + L S + AT +
Sbjct: 375 LILVIDTSGSMAGDSIIQAKNALRYALRGLRPQDSFNIIEFNSDVSLLSPTPLPATATNL 434
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAV--EMLTNSRG---SIPIIFLVTDGAVEDERQIC 443
A Q++ A G T + L A+ + + G S+ + +TDG+V +E +
Sbjct: 435 AMARQFVN-RLQADGGTEMAQALNAALPRQAFNTASGEDKSLRQVIFMTDGSVGNESALF 493
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+ +++++ + R++T GIGS N +F++ A + RG + D+D +E ++ +L
Sbjct: 494 ELIRNQIGDN-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVEQKISQLLA 548
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
+ VL ++ + D + +P+ IPDL P+++S + + P L G G
Sbjct: 549 KIQYPVLTDLQV-RFDDGSVPDYWPAPIPDLYRGEPVLISLKRHPREPQELVISGRQGH- 606
Query: 564 SNFVVELKLQL------AKDIP-----LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMS 612
N+ L LQ A D+ LD + A++QI L + D +++++V+ +S
Sbjct: 607 KNWQQSLSLQANDASNPATDVSQPVAGLDLLWARKQIAALELSKNGANDDKIKQQVTALS 666
Query: 613 VQTGVLCEYTRMIIVE 628
+ ++ YT ++ V+
Sbjct: 667 MNYHLVSPYTSLVAVD 682
>gi|424918736|ref|ZP_18342100.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WSM597]
gi|392854912|gb|EJB07433.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WSM597]
Length = 794
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/512 (22%), Positives = 231/512 (45%), Gaps = 55/512 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P + +
Sbjct: 169 RPNIFTNQVANIGPGEEIVVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 224
Query: 233 HLNVNAGTGT--EVLCNTSSHHLKQL-RRDVGKL-----------GYS--------YESE 270
N AG T + + N L RR+ K+ G+ +E +
Sbjct: 225 EFNNGAGFATPRDPVENREKIEAPVLDRRENAKINPVSLTVDLKAGFPLGEVKSCFHEVD 284
Query: 271 VLKW---------------SNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
+ + ++ DF+ ++ +P + L + + D + +
Sbjct: 285 IRQHGDQEGTISLKGDAVPADKDFELTWQAAPGKLPSAGLFRE--VKDGKTYLLAFVTPP 342
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
+ K++V+F++D SGSM G +E K +LA+A+S+L P D FN++ F+
Sbjct: 343 TAPDAAAPPAKREVVFVIDNSGSMSGPSIEQAKQSLALAISRLTPNDRFNVIRFDDTMTD 402
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFLVTDG 434
+ + AT + E+A ++ A G T + L A+ + G++ + +TDG
Sbjct: 403 YFKGLVAATPDNREKAIAYVR-GLTADGGTEMLPALEDALRNQGPVATGALRQVVFLTDG 461
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
A+ +E+Q+ + + + R++T GIGS N YF+ A I RG + D +
Sbjct: 462 AIGNEQQLFQEITANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTQIGSTDQV 516
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP-DT 553
+M +LF + + + +IA + + + ++ P+ +PDL S P++++ G P
Sbjct: 517 ASRMGELFAKLQNPAMTDIAAN-FEGIAAEDITPNPMPDLYSGEPVVLTAELPGDKPAGK 575
Query: 554 LKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDKR-LEEKVSKM 611
L+ G GD + V + + A D + ++ A+++ID A+A+ +D L++ + +
Sbjct: 576 LEIIGKTGD-QPWRVRMDIANAADGNGISKLWARRKIDDFEARAYERQDPAGLDKDIETV 634
Query: 612 SVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
++ ++ T ++ V+ A+E G+ K
Sbjct: 635 ALAHHLVSRLTSLVAVDVTPSRPANEPLGSAK 666
>gi|124265333|ref|YP_001019337.1| hypothetical protein Mpe_A0140 [Methylibium petroleiphilum PM1]
gi|124258108|gb|ABM93102.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 711
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 155/669 (23%), Positives = 270/669 (40%), Gaps = 99/669 (14%)
Query: 72 PYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGD 131
PY + D PS+ L + A E+D A V V+ +R G + + R P
Sbjct: 56 PYFFVKSDDPSVDRLPLKATEVDARIAGPIADVTVTQRYRNE---GQRPIEARYVFPGST 112
Query: 132 QGSILGVEAEISGK----SYHTQLIALGENDGAGKSASVETGSFL---KPNIFTLTLPQI 184
Q ++ + + + + A E + A + +T + L +PN+F++ + I
Sbjct: 113 QAAVHAMTVRVGHRVIVADIREKQRARIEFEAAKREG--KTAALLEQERPNVFSMNVANI 170
Query: 185 DGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------------------- 221
G +++ LR+++ L +G + F FP V P
Sbjct: 171 LPGDEVAVELRYTELLPPTEGRYQ----FVFPTVVGPRYRSPANKVATTEEQANGTAAPS 226
Query: 222 ----AIKKIPKRE----KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGK-----LGYSYE 268
A+ +P E + L V + V SS H ++ + LG E
Sbjct: 227 GSFPAVPYLPAGEASDTRFDLRVAFASPLPVSGLRSSSHQIEVEGEGSNGARVALGGDAE 286
Query: 269 SEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFK 326
S N DF Y ++ I G+ L E F + ++ P + ++
Sbjct: 287 SSRHN-GNRDFILDYRLAGDGIASGLTL----FPGAPGEENFFLAMVEPPRAIATTQINP 341
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+D +F+VDISGSM G PL+ K L + L P D+FN++ F+G + + + AT+
Sbjct: 342 RDYVFVVDISGSMHGYPLDTAKTLLRHLIGGLRPSDTFNVLLFSGSNRMLNETSVPATQA 401
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
V +A + I GST I P K V L S + +VTDG V E ++ +
Sbjct: 402 NVAQALRTIA-QMGGSGSTEIV-PALKRVAALPKSPDVSRSVIVVTDGYVTVESEVFQLI 459
Query: 447 KSRL--TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ L TN ++ FGIGS N + + +A +G + Q ++L
Sbjct: 460 RRNLGQTN-------VFAFGIGSSVNRHLIEGIARAGQGEPFIVTRPEQAAAQAERLRRM 512
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLS 564
+ VL + + LD +++ P R+PD+ P++V G+++G+ L +G +
Sbjct: 513 IDAPVLTQVKA-RFEGLDTYDVEPERLPDVLGGRPVLVFGKWRGEPRGQLVVEGQAAHGA 571
Query: 565 -NFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTR 623
++ + A+ + L + A+ +I L+ Q E ++ + ++ +L +YT
Sbjct: 572 WQAMLPVATPDAQAVALRHLWARHRIQSLSDQEALQGGDTQREAITALGLRYSLLTQYTS 631
Query: 624 MIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPP 683
I V+ RN A GS D S +P+ + NL I
Sbjct: 632 FIAVDRVVRNPAG-------GSTPVDQ----PSPLPQGV--SNLAI-------------- 664
Query: 684 GSEIPKLPE 692
G+E+P PE
Sbjct: 665 GAEVPSTPE 673
>gi|424896206|ref|ZP_18319780.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WSM2297]
gi|393180433|gb|EJC80472.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WSM2297]
Length = 794
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 121/514 (23%), Positives = 230/514 (44%), Gaps = 59/514 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 169 RPNIFTNQVANIGPGEEIVVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 224
Query: 228 ----------------KREKIH----------------LNVNAGTGTEVLCNTSSHHLKQ 255
REKI L VN G + SS H
Sbjct: 225 EFNNGAGFATPRDPVENREKIEAPVLDPRDNAKINPVSLTVNLKAGFPLGEVKSSFHEVD 284
Query: 256 LRRDVGKL-GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
+R+D + S + E + ++ DF+F++ +P + L + + D + +
Sbjct: 285 IRQDGDQARTVSLKGEAVP-ADKDFEFTWKPAPGKLPSAGLFRE--VKDGKTYLLAFVTP 341
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY 374
+ K++V+F++D SGSM G +E + +LA+A+S+L+P D FN++ F+
Sbjct: 342 PTAPDAAAPPAKREVVFVIDNSGSMSGPSIEQARQSLALAISRLNPDDRFNVIRFDDTMT 401
Query: 375 LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV--EMLTNSRGSIPIIFLVT 432
+ + T + E+A ++ A G T + L A+ + +R ++FL T
Sbjct: 402 DYFKGLVAGTPDNREKAIAYVR-GLTADGGTEMLPALEDALRNQGPVATRALRQVVFL-T 459
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DGA+ +E+Q+ + + + R++T GIGS N YF+ A I RG + D
Sbjct: 460 DGAIGNEQQLFQEITANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTQIGSTD 514
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+ +M +LF + + + +IA + + ++ P+ +PDL S P++++ P
Sbjct: 515 QVASRMGELFAKLQNPAMTDIAA-KFEGIAAEDITPNPMPDLYSGEPVVLTAELPEVKPS 573
Query: 553 -TLKAKGFLGDLSNFVVELKL-QLAKDIPLDRICAKQQIDLLTAQAWFSEDKR-LEEKVS 609
L+ G GD + VE+ + A+ + ++ A+++ID A+A+ +D L++ +
Sbjct: 574 GKLEIIGKTGD-QPWRVEMDIANAAEGSGISKLWARRKIDDFEARAYERQDPAGLDKDIE 632
Query: 610 KMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+++ ++ T ++ V+ A E G+ K
Sbjct: 633 TVALAHHLVSRLTSLVAVDVTPSRPAGEPLGSAK 666
>gi|392408932|ref|YP_006445539.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Desulfomonile tiedjei DSM 6799]
gi|390622068|gb|AFM23275.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Desulfomonile tiedjei DSM 6799]
Length = 820
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 133/588 (22%), Positives = 245/588 (41%), Gaps = 64/588 (10%)
Query: 85 PLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISG 144
PL+ +V++DV +L VR ++ + + P+ +G++ + +
Sbjct: 49 PLEHTSVKVDVSGFLARVTVR-----QIFRNPQREKIEAVYTFPLSSEGAVNSMVMRMGE 103
Query: 145 KSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQK 199
+ ++ E ++A S L +PNIFT ++ I G + I +++S+
Sbjct: 104 RVIKGEVKRREEARAMYEAAKSRGNVASLLDQERPNIFTQSVANIMPGERVEIEIQYSEV 163
Query: 200 LSYRDGEFSVNVPF------------------KFPE--------YVTPAIKKIPKR--EK 231
LSY DG F P + P+ +TP + R
Sbjct: 164 LSYDDGAFKFVFPMVVGPRFIPGARTGKQGTGRSPDTTQVRDASLITPPVAPAGTRAGHD 223
Query: 232 IHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIF 291
+ L + G + S H L+RD + N DF SYTV+ +
Sbjct: 224 VDLTLTLKAGIPIHSIKSELHEIHLQRDGTDRATVSLKNKDEIPNRDFVLSYTVAGDDVR 283
Query: 292 GGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNAL 351
G+L S + + + + P K+ K++IF++D SGS +GKPLE K+ +
Sbjct: 284 SGIL----SHKEGKNGYVTIVMVPPKKIAPDKISPKEMIFVIDCSGSQEGKPLEKAKDTM 339
Query: 352 AVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPL 411
+ +++P D+FNIV FN + T++ +A ++ + A G T + +
Sbjct: 340 RYIIDRMNPDDTFNIVDFNSGARMLFDEARKNTEQNRAKALTYLS-SLEARGGTWMGPAV 398
Query: 412 TKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSIC---PRIYTFGIGSY 468
K M S + I+ +TDG V ++ +I +K C R + FG G+
Sbjct: 399 EKICSMPPQS-NRLRIVTFMTDGYVGNDFEIISLVKK--------CRDKSRWFPFGTGNS 449
Query: 469 CNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP 528
N + L +A + G E +K + R + VL +I + + + +++P
Sbjct: 450 VNRFLLDRMAGVGGGEPEYILLNSPGEEIAKKFYQRIAAPVLTDITLQQ-EGVALEDVFP 508
Query: 529 SRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD---LSNFVVELKLQLAKDIPLDRICA 585
+ DL S PLI RY+ + KGF V+L + + L + A
Sbjct: 509 KSLIDLWSHKPLIFKARYKSAGRGAITIKGFAAGKPYTQTLHVDLPEKETANPSLGSLWA 568
Query: 586 KQQIDLLTAQAWFS-----EDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ ++D L ++ WFS DK L+E++ +++++ ++ +YT + VE
Sbjct: 569 RAKVDDLMSRDWFSVQQGTPDKELKEEIVRVALEHQIMTQYTSFVAVE 616
>gi|424885400|ref|ZP_18309011.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WSM2012]
gi|393177162|gb|EJC77203.1| marine proteobacterial sortase target protein [Rhizobium
leguminosarum bv. trifolii WSM2012]
Length = 794
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 120/511 (23%), Positives = 228/511 (44%), Gaps = 65/511 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P + +
Sbjct: 169 RPNIFTNQVANIGPGEEIVVQVEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 224
Query: 233 HLNVNAGTGTE------------------------------------VLCNT-SSHHLKQ 255
N AG T +L + SS H
Sbjct: 225 EFNNGAGFATPRDPVENRDKIEAPVLDPRENAKINPVSLTVDLKAGFLLGDVKSSFHEVD 284
Query: 256 LRRDVGKLGYSYESEVLKWSNI--DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
+R+D G + LK + D DF T P+ G L + +V + + +
Sbjct: 285 IRQD----GNQARTISLKGEAVPADKDFELTWKPAP---GKLPSAGLFREVKDGKTYLLA 337
Query: 314 LLPGTAKSRK---VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
+ + K++V+F++D SGSM G +E + +LA+A+S+L+P D FN++ F+
Sbjct: 338 FVTPPTAPDAAAPLAKREVVFVIDNSGSMSGPSIEQARQSLALAISRLNPDDRFNVIRFD 397
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIF 429
+ + AT + E+A ++ A G T + L A+ + + G++ I
Sbjct: 398 DTMTDYFKGLVPATPDNREKAIAYVR-GLTADGGTEMLPALEDALRNQGSVASGALRQIV 456
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
+TDGA+ +E+Q+ + + + R++T GIGS N YF+ A I RG +
Sbjct: 457 FLTDGAIGNEQQLFQEITANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTQIG 511
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-QG 548
D + +M +LF + + + +IA + + + ++ P+ +PDL S P++++ +
Sbjct: 512 STDQVASRMGELFAKLQNPAMTDIAAN-FEGIAAEDITPNPMPDLYSGEPVVLTAELPED 570
Query: 549 KFPDTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDKR-LEE 606
K L+ G GD + VE+ + A D + ++ A+++ID A+A+ +D L++
Sbjct: 571 KPAGKLEIIGKTGD-QPWRVEMNIANAADGSGISKLWARRKIDDFEARAYERQDPAGLDK 629
Query: 607 KVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
+ +++ ++ T ++ V+ A E
Sbjct: 630 DIETVALAHHLVSRLTSLVAVDVTPSRPAGE 660
>gi|359450117|ref|ZP_09239585.1| uncharacterized protein [Pseudoalteromonas sp. BSi20480]
gi|358044106|dbj|GAA75834.1| uncharacterized protein [Pseudoalteromonas sp. BSi20480]
Length = 664
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 133/557 (23%), Positives = 238/557 (42%), Gaps = 77/557 (13%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + + + AGK A++
Sbjct: 79 AVNARYVFPLPDESAVHAMTMRIGERVIKGHIDKKIEAEKKYAQAQKAGKQAALVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKI------ 226
+ N+F + I G ++I L + + ++YR G F++ +FP +TP I
Sbjct: 137 RANMFVTNVANIGPGEQVAIELEYQEIINYRSGTFTI----RFPTTITPRYHAINGTLEQ 192
Query: 227 ----------------------------PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRR 258
P E LN++ G E++ TS HH R
Sbjct: 193 PSSSLMDQNLAPSGWLRPVYNVQKNNDAPSSE-FTLNIDIDVGLELVDITSKHH----RI 247
Query: 259 DVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGG-VLLQSPSLHDVDQREMFCMYLLP 316
+V + L +N + DF P + + D R M + P
Sbjct: 248 NVNNPAFGQYKIALNDTNGANRDFVLAFKPLQKENAQAAFFTQQFENAD-RYGLAMLMPP 306
Query: 317 GTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
G ++ + ++++F+VD SGSM G+ +E K AL ALS L+ DSFNI+ F+
Sbjct: 307 GDHFTQTQRLPREMVFVVDTSGSMHGQSIEQAKKALFYALSLLESDDSFNIIGFDNNVTA 366
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
S +A+ + RA ++I + A G T I L ++ T G + + +TDG+
Sbjct: 367 MSDRPLIASDFNLRRAERFI-YSLEADGGTEIQGALDAVLDGSTFD-GFVRQVVFLTDGS 424
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V +E + ++++L + R++T GIGS N +F+R A I +G + ++
Sbjct: 425 VSNEATLFKNIQAKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGTFTFIGSTQEVQ 479
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRYQGKFP 551
+MQ+LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 480 PKMQQLFDK-----LAHPAITDLALSDENGNSLDFWPSPLPDLYFGEPVMVAIKLNDAKS 534
Query: 552 DTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKM 611
L + G LS + AK I ++ A+Q+I L ++E ++++V K+
Sbjct: 535 VILTGQTAQGPLSIKLSTQNSSQAKGIA--KLWARQKIKSLL---LYNEQSSVKDEVQKL 589
Query: 612 SVQTGVLCEYTRMIIVE 628
++ +L +T I +E
Sbjct: 590 ALTHQLLSPFTAFIAIE 606
>gi|254428069|ref|ZP_05041776.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881]
gi|196194238|gb|EDX89197.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881]
Length = 657
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 121/499 (24%), Positives = 216/499 (43%), Gaps = 39/499 (7%)
Query: 159 GAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEY 218
+GK AS+ S +PN+FT + I G + + L + Q L+ F + +P
Sbjct: 121 ASGKKASLV--SQQRPNLFTTAVANIAPGETVQVELHYQQTLNVDGHRFQLRLPLTLTPR 178
Query: 219 VTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNID 278
TP + + + N A G T+S H+ D G + S
Sbjct: 179 FTPPTEAPHTLDSLLRNTVAAPGGTADAGTASVHID---LDAGARLATLGSPSHAIHYQR 235
Query: 279 FDFSYTVSPSHIFGGV-----LLQSPSLHD------------VDQREMFCMYLLPGTAKS 321
YT++P G + LL + L D +D + ++P
Sbjct: 236 HGRRYTITPK--AGAIAMDRDLLLNWELEDTGEPLVTRFHEEIDGEHYALLMVVPPKTGQ 293
Query: 322 RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSM 380
++ +FI+D SGSM G P+ K +L +AL +L PGD FNI F+ + T LF T +
Sbjct: 294 VTALPRETLFIIDSSGSMGGAPMRQAKASLHLALQRLKPGDRFNITDFDSQHTLLFETPV 353
Query: 381 ELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
+ + + ++A ++ A G T++ L+ + S G + + +TDGAV +E
Sbjct: 354 TV-SDNSRQQAQDFVD-GLQASGGTHMLPALSATLSQ-PASDGYLRQVIFITDGAVGNES 410
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
I A+ +L R++T GIGS N +F+ A RG + D + ++ M
Sbjct: 411 GIFRALHQQLGEA-----RLFTVGIGSAPNSHFMTRAAQFGRGSFTYINDQNQVQQGMDT 465
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
LF R S ++ N+ + L D E +P ++PDL + PL+V+ + P + G+
Sbjct: 466 LFRRLESPLMRNLQVQ-LPDGIVAERWPQKLPDLYAGEPLLVAMKLSAP-PQQITVSGY- 522
Query: 561 GDLSNFVVELKLQLAKDIP-LDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTGVL 618
++ + L + P + A+++I DL+ + + + ++ ++++ ++
Sbjct: 523 -STQHWQQPVTLPTNNNHPGTASLWARRKIADLMDRITLGAAETDIAPLITDVALRHQLV 581
Query: 619 CEYTRMIIVETDERNNASE 637
YT I VE A+E
Sbjct: 582 SRYTSFIAVEETVSRPANE 600
>gi|337277664|ref|YP_004617135.1| hypothetical protein Rta_00560 [Ramlibacter tataouinensis TTB310]
gi|334728740|gb|AEG91116.1| Hypothetical protein Rta_00560 [Ramlibacter tataouinensis TTB310]
Length = 700
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 134/589 (22%), Positives = 235/589 (39%), Gaps = 65/589 (11%)
Query: 81 PSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILG--- 137
PSL L + + ++V A V V+ +R G+++ + R P + ++ G
Sbjct: 59 PSLDALPLKSTRVEVHIAGVIADVTVTQHYRNE---GTRAIEARYLFPGSTRAAVHGLNV 115
Query: 138 ------VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLK---PNIFTLTLPQIDGGS 188
+ A+I K Q A +G +T + L+ PN+F + + I G
Sbjct: 116 RLADRLITADIREKRQARQEYAAARQEG-------KTAALLEQQLPNVFQMNVANILPGD 168
Query: 189 YLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREKIHLNVNAGTGTEVLCN 247
+ + LR+++ L G + P P Y +P + + G + +
Sbjct: 169 EVKVELRYTELLVPEAGRYQFVFPTVVGPRYNSPQSSQARTAWVAQPTLREGQASPAAFD 228
Query: 248 TSSH-----HLKQLRRDVGKLGYSYESEVLKW-----------SNIDFDFSYTVSPSHIF 291
H ++ + ++G E E N DF Y ++ I
Sbjct: 229 LQLHIASPLAIQGVESPSHRIGVELEGERQAHVALQGEPGRPAGNRDFILHYRLAGEGIA 288
Query: 292 GGVLL-QSPSLHDVDQREMF-CMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKN 349
GVLL Q P Q F M P +R + +D +F+VDISGSM G PL+ K
Sbjct: 289 SGVLLYQGP------QENFFLAMVQPPRQVDARAIPPRDYVFVVDISGSMHGFPLDTAKA 342
Query: 350 ALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICA 409
L + L P D+FN++ F+G S AT+ +E+A + I + GG +
Sbjct: 343 VLRELIGGLRPSDTFNVLLFSGSNRFLSPQPVPATRANLEQAIRT--IEQMGGGGSTELI 400
Query: 410 PLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYC 469
P K V + + +VTDG V ER+ + ++ L+ ++ FGIGS
Sbjct: 401 PALKRVYAQPKAADVSRTVVVVTDGYVTVEREAFELVRRHLSQAN-----VFAFGIGSSV 455
Query: 470 NHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPS 529
N + + LA +G Q + F R S + + L+ +++ P
Sbjct: 456 NRHLIEGLARAGQGEPFVITQASQARDQATR-FRRMIESPVLTSVTARFEGLEAYDVEPP 514
Query: 530 RIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD--LSNFVVELKLQLAKDIP-------- 579
++PD+ +E P++V G+++G L +G D V + A P
Sbjct: 515 QLPDVLAERPVVVFGKWRGNPGGQLVVEGRAADGPYRQVVPIATVAGAGGAPQEGGDTRA 574
Query: 580 LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
L + A+ +I L+ Q E+++ + +Q +L +YT + V+
Sbjct: 575 LRHLWARHRIAALSDQEALEGGDGQRERITALGLQYRLLTQYTSFLAVD 623
>gi|221134029|ref|ZP_03560334.1| vault protein inter-alpha-trypsin [Glaciecola sp. HTCC2999]
Length = 757
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 129/565 (22%), Positives = 252/565 (44%), Gaps = 89/565 (15%)
Query: 159 GAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF--- 215
G K++ VE +PN+FT T+ I G ++SI + + Y DGE+ V +P
Sbjct: 206 GGKKASLVEQK---RPNLFTNTIANIGPGEHISITISYQNIAQYTDGEYLVRLPMTLSQR 262
Query: 216 -------------------PEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQL 256
P+Y A K+ + LN + + +++HH++
Sbjct: 263 YAGELNSVDGSGESLDPVDPDY---AHKQYAMDVVLTLNTQSA---DTRITSTTHHIESE 316
Query: 257 RRDVGKLGYSYESEVLKWSNIDFDFSYTVSPS----------HIFG---GVLLQSPSLHD 303
D G E + +N DF S+++ + +I G G++ P +
Sbjct: 317 AIDKGIYQVRLAEEDV--ANRDFVVSWSLGNTDDAVVGHYQEYIDGTTYGLIHVVPPVIS 374
Query: 304 VDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
D M + P +++ IF++D SGSM G L +A+ +S L D+
Sbjct: 375 HDPSSMAST-VTPS-------IQQNTIFVLDSSGSMHGTALTQAIDAIREGVSYLTEHDT 426
Query: 364 FNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG 423
FNIV F+ E + A + + A +++ + + G TN+ L ++ L +S
Sbjct: 427 FNIVDFDSEARALWRQSQFADEVSKAEAMRFLR-HVDSDGGTNMQDALALSLTQLLDSST 485
Query: 424 SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ + VTDG++ +ER++ + +L + R++T GIG+ N +F+ AM+ +G
Sbjct: 486 GLTQVIFVTDGSINNERELLKQIAEQLGD-----KRLFTVGIGAAPNSHFMEYAAMLGKG 540
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
Y DL I+ +M LF++ S ++ +I L +E +YP +PD+ + P+++S
Sbjct: 541 TYTYIDDLTEIQPKMAYLFSQLRSPMITDIQ---LTPSEELSLYPQVLPDIYLDQPVVLS 597
Query: 544 GRYQG--KFPDTLKAKGFLGDLSNFVVELKLQLAK-----DIPLDRICAKQQIDLLTAQA 596
RY G P+ L KG LG + ++ Q + L+++ A+ +I + ++
Sbjct: 598 YRYDGVADQPEPLTIKGRLGSDETGLEQIWSQALPVSSHLTMGLNKLWARAKIKNI--ES 655
Query: 597 WFSEDKR---------LEEKVSKMSVQTGVLCEYTRMIIVE-TDERNNASESPGTKKGSK 646
W+ + ++ K+++ ++ ++ +T ++ V+ T + +S +
Sbjct: 656 WWHTNTEYLPEELPDIMQAKITETALSAKIISRFTSLVAVDVTPSKPMGKQSKAVDVLNA 715
Query: 647 KSDHQKILDSEVPKTIMLQNLGIGF 671
K+ H L ++P+T +LG+G
Sbjct: 716 KAQH---LVGQLPQT----SLGLGI 733
>gi|239627294|ref|ZP_04670325.1| von Willebrand factor [Clostridiales bacterium 1_7_47_FAA]
gi|239517440|gb|EEQ57306.1| von Willebrand factor [Clostridiales bacterium 1_7_47FAA]
Length = 681
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 139/598 (23%), Positives = 259/598 (43%), Gaps = 79/598 (13%)
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLI----ALGENDGA---GKSASVETG 169
GS + P + S+ G++ ++ K ++ A E D A GKSAS+
Sbjct: 84 GSVPVNAGYVFPASTRVSVHGMKMQVGDKVVTAKIKEKEEAKQEFDAAKSEGKSASLLEQ 143
Query: 170 SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKK-IPK 228
+PN+FT+ + I G ++I L +++ ++ +G + F FP V P P
Sbjct: 144 Q--RPNVFTMNVANIMPGDTVNIELHYTEMIALSEGSYE----FVFPAVVGPRYSSPSPD 197
Query: 229 REK--------------------IHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYE 268
RE+ + V+ TG + SS H + + +
Sbjct: 198 REEDGNQWVASPYQEGGAVPKGTYDIAVSLSTGVPITGIVSSSHKINIEQSADSSAHITL 257
Query: 269 SEVLKWS-NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVF 325
+ + N DF Y ++ + G++L + ++E F + ++ P + +
Sbjct: 258 KDPADYGGNRDFILRYQLAGQTVNSGLMLNT------GEKENFFLLMVQPPERVPAEAIP 311
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
++ IF++D+SGSM G PL+ K + +S L D+FN++ F+ + S AT
Sbjct: 312 PREYIFVLDVSGSMFGYPLDTAKELIRNMVSNLRETDTFNLILFSNDAIRMSARSLPATD 371
Query: 386 EAVERAHQWIGINFI----AGGSTNICAPLTKAVEM-LTNSRGSIP-IIFLVTDGAVEDE 439
E VERA IN I GG T + L KAV + + + GS+ + ++TDG + DE
Sbjct: 372 ENVERA-----INLINRQKGGGGTELAPALEKAVGIPMDSGAGSVSRSVVVITDGYMSDE 426
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+ I D + L ++FGIG+ N Y + +A G D S
Sbjct: 427 QAIFDIVAGNLD-----TTSFFSFGIGTSVNRYLIEGIARTGGGESFVVTD-SSESADTA 480
Query: 500 KLF-TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK--A 556
+LF T S VL ++ +D D +++ P+ IP L ++ P+++ G+++G +
Sbjct: 481 RLFDTYIQSPVLTDVQVD-YDGFDAYDVEPTAIPTLFAQKPIVLFGKWRGNPAGAIHITG 539
Query: 557 KGFLGDLSNFV-VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKR-LEEKVSKMSVQ 614
K GD S + V A++ + + A+ +++ L + D+ + ++V+ + +
Sbjct: 540 KSGTGDYSQTIQVSEASSSAENSAIPYLWARTRVENLMDYGFSGGDEEAVRKEVTALGLD 599
Query: 615 TGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFG 672
++ YT + V + RN ES + +D +P ++ + +L +G G
Sbjct: 600 YSMMTSYTSFVAVIDEVRNTTGES-------------RDVDQPLPLSLQVSDLAVGGG 644
>gi|440684401|ref|YP_007159196.1| Vault protein inter-alpha-trypsin domain-containing protein
[Anabaena cylindrica PCC 7122]
gi|428681520|gb|AFZ60286.1| Vault protein inter-alpha-trypsin domain-containing protein
[Anabaena cylindrica PCC 7122]
Length = 843
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 131/510 (25%), Positives = 211/510 (41%), Gaps = 75/510 (14%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP------- 227
NIFT +L I G + + +R+++ L + G + F FP V P + IP
Sbjct: 115 NIFTQSLANIQPGEQIDVIIRYTESLKFEGGNYE----FVFPMVVGP--RYIPGISIEDN 168
Query: 228 ------------------------------------KREKIHLNVNAGTGTEV-LCNTSS 250
I++ V G E+ N+ S
Sbjct: 169 TAGRGSAPAPMSLNQDTDLVPDASRLNAPILPAGTRSSHDINVTVEINAGVEIGHINSPS 228
Query: 251 HHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-M 309
HH+ Q+ RD G L N D Y V+ + VL Q+ D+R
Sbjct: 229 HHI-QITRD-GNLAQIQLGGGDTIPNKDLILRYQVAGENTQTSVLTQA------DERGGH 280
Query: 310 FCMYLLPGTAKSRKVF-KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
F +YL+P F KD++F++D SGS G PL + + ++ L+P D+F+I+
Sbjct: 281 FAVYLIPALQYQPDQFVPKDMVFLIDRSGSQSGAPLMQCQELMRRFIAGLNPHDTFSIID 340
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI- 427
F T S + T E RA +I AGG T + + ++ + N + P+
Sbjct: 341 FCDTTQQLSPVPLVNTPENRLRAINYIN-RLNAGGGTQML----RGIQTVLNFPVTDPVR 395
Query: 428 ---IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
I L+TDG + +E QI ++ L G R+++FG GS N + L +A I RG
Sbjct: 396 LRNIVLLTDGYIGNENQIIAEVQRCLQPGT----RLHSFGAGSSVNRFLLNRVAEIGRGI 451
Query: 485 YGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSG 544
+ + ++K F + V ANI + D + MYPS PDL +E PL++ G
Sbjct: 452 ARIIRHDEPMNEVVEKFFRQINHPVFANIQLHWEGDGEFPIMYPSTPPDLFAEQPLVLFG 511
Query: 545 RYQGKFPDTLKAKGFLGD--LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDK 602
R L G GD S +V + Q+ + + ++ + +I L Q + K
Sbjct: 512 RKPDGCAGKLHVTGMSGDGRRSQYVFNINFQVMGNPAIAQLWGRSRIKDLMNQMVSGDTK 571
Query: 603 RLEEKVSKMSVQTGVLCEYTRMIIVETDER 632
E V+ ++ +L +YT + V + R
Sbjct: 572 VGVEAVTNTALTYQLLSQYTAFVAVSDEVR 601
>gi|334121292|ref|ZP_08495364.1| Vault protein inter-alpha-trypsin domain-containing protein
[Microcoleus vaginatus FGP-2]
gi|333455227|gb|EGK83882.1| Vault protein inter-alpha-trypsin domain-containing protein
[Microcoleus vaginatus FGP-2]
Length = 837
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 124/524 (23%), Positives = 221/524 (42%), Gaps = 77/524 (14%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV----TPAIKK---- 225
NIFT +L I G + + +R+++ L + G++ P P Y+ TP + +
Sbjct: 177 NIFTQSLANIKPGEKIEVTIRYTESLKFAGGDYEFVFPMVVGPRYISRNSTPPLVRGGQG 236
Query: 226 -------------IPKREK------IHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYS 266
+P + + + ++AG + +TS ++ ++ +
Sbjct: 237 GVNTDADRINPPVLPPGTRSGHNIAVSVEIDAGVAIGDVRSTSHQITTDRSGNIVQVQLA 296
Query: 267 YESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPG-TAKSRKV 324
+ N D Y V+ + +L QS D+R F YL+P K+ ++
Sbjct: 297 NSDTI---PNKDLILRYRVAGENTRATILTQS------DKRGGHFATYLIPALNYKTNEI 347
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
KDV+F++D SGS QG+PL +K + + L+P D+F I+ F ST+ T
Sbjct: 348 VPKDVVFLMDTSGSQQGEPLAKSKELMRRFIQGLNPTDTFTIIDFANTAKALSTTPLANT 407
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN----SRGSIPIIFLVTDGAVEDER 440
E ++A IN+I N + L ++ + N + G + I L+TDG + +E
Sbjct: 408 PENRQKA-----INYIEKLQANGGSELLNGIQAVMNFPAAATGRLRSIVLITDGYIGNEN 462
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA-AYDLDSIEIQMQ 499
++ ++ L G R+Y+FG+G N + + LA + RG +D S E
Sbjct: 463 EVLARVQGSLKPGN----RLYSFGVGGSVNRFLINRLAEVGRGTSQVIRHDEPSAEA-AD 517
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGF 559
K F VL NI I ++ E+YP PDL + PL++ G+ + L+ +G
Sbjct: 518 KFFREINRPVLTNIQISWEGMGEKPEIYPIAPPDLFASQPLVLFGKKSDRTNGQLRIRGT 577
Query: 560 LGDLSNFVVELKLQLAK--------------------DIPLDRICAKQQIDLLTAQAWFS 599
L + L + A+ + + ++ + +I L Q +
Sbjct: 578 LAGGKAYEQVLPVNFARSAGARQRESTSVAAGATDFGNPAVAQLWGRSRIKDLMNQMFGG 637
Query: 600 EDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
E K L E V+ ++ +L EYT + V + R E GT++
Sbjct: 638 ETKSLVEAVTNTALTYRLLSEYTAFVAVSEEVR---VEPDGTRR 678
>gi|383764369|ref|YP_005443351.1| hypothetical protein CLDAP_34140 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384637|dbj|BAM01454.1| hypothetical protein CLDAP_34140 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 747
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 126/471 (26%), Positives = 210/471 (44%), Gaps = 22/471 (4%)
Query: 170 SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKR 229
+L +F I G ++RL ++Q L +DG + N P + Y A P+
Sbjct: 137 EYLGRGLFQTRAFPIPPGETRTVRLTYTQLLERKDGLYRFNYPLRASHYSADA----PEE 192
Query: 230 EKIHLNVNAGTGTEVLCNTSSH-HLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPS 288
+H+ + A G + + + +++L D + Y DFD +
Sbjct: 193 IALHIELRALPGLRTIYSPNYPITIERLADDHAVVSYHATG---AEPTSDFDLYFGTEKQ 249
Query: 289 HIFGGVLLQSPSLHDVDQREMFCMYLLPG--TAKSRKVFKKDVIFIVDISGSMQGKPLED 346
I +L P+ D F + PG A +V +D+I ++D+SGSMQGK ++
Sbjct: 250 PIGIDLLSYKPAGED----GYFLLLAAPGFEAAAEEEVVARDLIVVIDVSGSMQGKKIQQ 305
Query: 347 TKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTN 406
K AL + L+P D FN++AF+ L+ +++ + A +WI A GST+
Sbjct: 306 AKEALHYIVQNLNPQDRFNLIAFSTGVSLWEKTLQPVEARTQQAAIRWID-KLRASGSTD 364
Query: 407 ICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICDAMKSRLTNG--GSICPRIYTF 463
I L +A+ L + P I +TDG DA+ + N G R++TF
Sbjct: 365 INRALLEALAQLDDEDVDRPAYILFLTDGQPTQGEMNVDAIVRNVLNNLPGRRSLRLFTF 424
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE 523
G+G N L +LA RG ++I+ + + + + VLA+I ID +
Sbjct: 425 GVGYDVNTDLLDVLADELRGTSHYVKPDEAIDEAVSDFYAQISTPVLADIQIDFGESTFI 484
Query: 524 FEMYPSRIPDLSSESPLIVSGRYQ--GKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-L 580
++YP +PDL + L+V GRY+ G+ TL + + + E L P +
Sbjct: 485 EDVYPYPLPDLFAGRQLVVVGRYREGGEVEVTLSGRVNNQEKTYIYSERMLTEKGGEPFV 544
Query: 581 DRICAKQQIDLLTAQAWFS-EDKRLEEKVSKMSVQTGVLCEYTRMIIVETD 630
++ A ++I L AQ S D L + V +S+Q GVL YT +++E D
Sbjct: 545 AQLWATRKIGHLLAQIRRSGPDPELVQAVIDLSLQYGVLTPYTAYLVLEPD 595
>gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4]
gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp.
MR-4]
Length = 759
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 135/560 (24%), Positives = 243/560 (43%), Gaps = 84/560 (15%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K Q+ + +G K AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKQQAKQIFEQAKAEG--KRASLVSQE--RPNMFTTEVANLAPDEELVVEISYQ 197
Query: 198 QKLSYRDGEFSVNVPFKF-PEYVTPAI-------KKI-------------PKRE------ 230
+ + Y DG FS+ P P Y+ P + +++ P R+
Sbjct: 198 ETIHYEDGLFSLRFPLVVAPRYI-PGLTLGGNNHERVTSSQVFDADRIIAPIRDASSETD 256
Query: 231 ---KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTV-- 285
K + V G G + S +H + G+L + + V +N DF + +
Sbjct: 257 PVLKADIKVKLGEGVDKSAVVSPYHPITIDEKQGQLTAALANRVP--ANRDFVLQWRLKQ 314
Query: 286 --SP-SHIFG-----------GVLLQSPSLHDVDQREM------FCMYLLPGTAKSRKV- 324
SP + +F S S VD M L P S +
Sbjct: 315 GTSPVAWVFNQAGKTHTTQDDNASADSGSTASVDGNSNSNNNYSLVMVLPPKVEASEQPN 374
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
+++I ++D SGSM G + KNAL AL L P DSFNI+ FN + L S++ AT
Sbjct: 375 LPRELILVIDTSGSMAGDSIIQAKNALRYALRGLRPQDSFNIIEFNSDVSLLSSTPLPAT 434
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAV--EMLTNSRG---SIPIIFLVTDGAVEDE 439
+ A Q++ A G T + L A+ + + G S+ + +TDG+V +E
Sbjct: 435 ATNLAMARQFVN-RLQADGGTEMAQALNSALPRQAFNTASGEDKSLRQVIFMTDGSVGNE 493
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+ + +++++ + R++T GIGS N +F++ A + RG + D+D +E ++
Sbjct: 494 SALFELIRNQIGDN-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVEQKIS 548
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGF 559
KL + VL ++ + D + +P+ IPDL P+++S + + P L G
Sbjct: 549 KLLAKIQYPVLTDLQV-RFDDGSVPDYWPAPIPDLYRGEPVLISLKRHPREPQELVISGR 607
Query: 560 LGDLSNFVVELKLQL------AKDIP-----LDRICAKQQIDLLTAQAWFSEDKRLEEKV 608
G N+ L LQ A D+ LD + A++QI L + D +++++V
Sbjct: 608 QGH-KNWQQSLSLQANDASHSAIDVAQPTAGLDLLWARKQIAALELSKNGANDDKVKQQV 666
Query: 609 SKMSVQTGVLCEYTRMIIVE 628
+ +S+ ++ YT ++ V+
Sbjct: 667 TALSLNYHLVSPYTSLVAVD 686
>gi|392536808|ref|ZP_10283945.1| inter-alpha-trypsin inhibitor [Pseudoalteromonas marina mano4]
Length = 664
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 128/510 (25%), Positives = 225/510 (44%), Gaps = 70/510 (13%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV 219
AGK A++ + N+F + I G ++I L + + ++YR+G F++ +FP +
Sbjct: 126 AGKQAALVRQQ--RANMFVTNVANIGPGEQVAIELEYQEIINYRNGTFTI----RFPTTI 179
Query: 220 TPAIKKI----------------------------------PKREKIHLNVNAGTGTEVL 245
TP I P E LN++ G E++
Sbjct: 180 TPRYHAINGTLEQPSSSLMDQNVAPNGWLSPVYNVQKNNDAPISE-FTLNIDIDVGLELV 238
Query: 246 CNTSSHHLKQLRRDVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGG-VLLQSPSLHD 303
TS HH R +V + L +N + DF P + +
Sbjct: 239 DITSKHH----RINVNNPAFGQYKIALNDTNGANRDFVLAFKPLQKENAQAAFFTQQFEN 294
Query: 304 VDQREMFCMYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD 362
D R M + PG ++ + ++++F+VD SGSM G+ +E K AL ALS L+ D
Sbjct: 295 AD-RYGLAMLMPPGDHFTQTQRLPREMVFVVDTSGSMHGQSIEQAKKALFYALSLLESDD 353
Query: 363 SFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
SFNI+ F+ S +A+ + RA ++I + A G T I L ++ T
Sbjct: 354 SFNIIGFDNNVTAMSDRPLIASDFNLRRAERFI-YSLEADGGTEIQGALDAVLDGSTFD- 411
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
G + + +TDG+V +E + ++++L + R++T GIGS N +F+R A I +
Sbjct: 412 GFVRQVVFLTDGSVSNEATLFKNIQAKLGDS-----RLFTVGIGSAPNSFFMRRAADIGK 466
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSES 538
G + ++ +M++LF + LA+ AI L DE + +PS +PDL
Sbjct: 467 GTFTFIGSTQEVQPKMEQLFDK-----LAHPAITDLALSDENGNSLDFWPSPLPDLYFGE 521
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF 598
P++V+ + L + G LS + AK I ++ A+Q+I L +
Sbjct: 522 PVMVAIKLNDAKSVILTGQTAQGPLSIKLSTQNSSQAKGIA--KLWARQKIKSLL---LY 576
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+E ++++V K+++ +L +T I +E
Sbjct: 577 NEQSSVKDEVQKLALTHQLLSPFTAFIAIE 606
>gi|409197615|ref|ZP_11226278.1| von Willebrand factor A [Marinilabilia salmonicolor JCM 21150]
Length = 813
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 124/491 (25%), Positives = 229/491 (46%), Gaps = 54/491 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT----------- 220
+PN+F + + I G+ + +++ +++ L D + P P YV+
Sbjct: 137 RPNVFKMNVANIVPGATVEVKMSYTELLVPSDKIYEFVYPTVVGPRYVSKGEIAEASTET 196
Query: 221 ----PAIKKIPK-REKIHLNVNAGTG---TEVLCNTSSHHLKQLRRDVGKLGYSYESEVL 272
P +K K + +++ TG ++ C T S+ + + L +
Sbjct: 197 WTGNPYLKAGEKPASTLDIDIELNTGLPIQDIRCETHSNKIDYQAKSSASLSM----DEP 252
Query: 273 KWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVI 330
N DF Y ++ I GVLL +D E F + ++ P + + ++ +
Sbjct: 253 YGGNRDFVMQYRLAGDQIETGVLL-----YDSPDGEKFFLAMMQPPERVQPEDIPAREYV 307
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
FIVDISGSM G PL+ +K + LS L D FNIV F+G +++++ AT+ ++
Sbjct: 308 FIVDISGSMSGFPLDISKKLMKSLLSGLRGKDLFNIVFFSGGSFIYNEESLPATEANIDD 367
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLV-TDGAVEDERQICDAMKSR 449
A Q++ N GG T + L A M N R + F++ TDG V E + D +++
Sbjct: 368 AIQFMD-NRRGGGGTELLNALKSA--MGLNGRENYARSFVILTDGYVSVEDETFDYIRNN 424
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
L N ++FGIG N + + +A + GY A +++ E Q + S
Sbjct: 425 LGNA-----NFFSFGIGRGVNRHLIEGMAHV--GYGEAFVAMNNGEAVKQAGRFEKYISK 477
Query: 510 LANIAIDT-LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG--DLSNF 566
ID + + +++ P IPDL +E P+I+SG+Y+G+ LK G G ++S
Sbjct: 478 PVLTGIDYFFEGFEAYDVLPENIPDLFAERPIIISGKYRGEPKGVLKVTGTSGRSEISK- 536
Query: 567 VVELKLQLAKDIPLDRICAKQQIDLLTAQA------WFSEDKR--LEEKVSKMSVQTGVL 618
+ +K +++ L + A+++I LL A + + DK+ L+ +++++ ++ +L
Sbjct: 537 TLNIKADGQENMALKYLWAREKIRLLGDYASLDSYDYGNSDKKEALKREITQLGLKYNLL 596
Query: 619 CEYTRMIIVET 629
+YT I V++
Sbjct: 597 TQYTSFIAVDS 607
>gi|119468931|ref|ZP_01611956.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7]
gi|119447583|gb|EAW28850.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7]
Length = 664
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 132/557 (23%), Positives = 239/557 (42%), Gaps = 77/557 (13%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + + + AGK A++
Sbjct: 79 AVNARYVFPLPDESAVHAMTMRIGERVIKGHIDKKIEAEKKYAQAQKAGKQAALVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKI------ 226
+ N+F + I G ++I L + + ++YR G F++ +FP +TP I
Sbjct: 137 RANMFVTNVANIGPGEQVAIELEYQEIINYRSGTFTI----RFPTTITPRYHAINGTLEQ 192
Query: 227 ----------------------------PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRR 258
P E LN++ G E++ TS HH R
Sbjct: 193 PSSSLMDQNVAPSGWLSPVYNVQKNNDAPISE-FTLNIDIDVGLELVDITSKHH----RI 247
Query: 259 DVGKLGY-SYESEVLKWSNIDFDFSYTVSPSHIFGG-VLLQSPSLHDVDQREMFCMYLLP 316
+V + Y+ + S + DF P + + D R M + P
Sbjct: 248 NVNNPAFGQYKIALNDTSGANRDFVLAFKPLQKENAQAAFFTQQFENAD-RYGLAMLMPP 306
Query: 317 GTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
G ++ + ++++F+VD SGSM G+ +E K AL ALS L+ DSFNI+ F+
Sbjct: 307 GDHFTQTQRLPREMVFVVDTSGSMHGQSIEQAKKALFYALSLLESDDSFNIIGFDNNVTA 366
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
S +A+ + RA ++I + A G T I L ++ T G + + +TDG+
Sbjct: 367 MSDRPLIASDFNLRRAERFI-YSLEADGGTEIQGALDAVLDGSTFD-GFVRQVVFLTDGS 424
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V +E + ++++L + R++T GIGS N +F+R A I +G + ++
Sbjct: 425 VSNEATLFKNIQAKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGTFTFIGSTQEVQ 479
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRYQGKFP 551
+M++LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 480 PKMEQLFDK-----LAHPAITDLALSDENGNSLDFWPSPLPDLYFGEPVMVAIKLNDAKS 534
Query: 552 DTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKM 611
L + G LS + AK I ++ A+Q+I L ++E ++++V K+
Sbjct: 535 VILTGQTAQGPLSIKLSTQNSSQAKGIA--KLWARQKIKSLL---LYNEQSSVKDEVQKL 589
Query: 612 SVQTGVLCEYTRMIIVE 628
++ +L +T I +E
Sbjct: 590 ALTHQLLSPFTAFIAIE 606
>gi|403388904|ref|ZP_10930961.1| hypothetical protein CJC12_13934 [Clostridium sp. JC122]
Length = 748
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 135/571 (23%), Positives = 252/571 (44%), Gaps = 46/571 (8%)
Query: 81 PSLIPLQMNAV---ELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILG 137
P+LI N V ++DV Y+ FV S ++ ++ G + P+ + + G
Sbjct: 4 PNLIYDTDNNVILDKVDVTGYVCGEFVEFSTSY-IYENRGGNDVEAVFTFPIPETAVLSG 62
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFL----KPNIFTLTLPQIDGGSYLSIR 193
EA I G++ T + + E + E+ +FL N F +++ +I G + ++
Sbjct: 63 FEANIGGRTIKTTIESKEEAEKICDKLKEESSNFLLEEINANDFKISIGKIISGETIKLK 122
Query: 194 LRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKRE---KIHLNVNAGTGTEVLCNTSS 250
+ + ++L Y + + +P + E + K E K +LN+ EV +S+
Sbjct: 123 ISYIEELEYEEKSLKLTIP-QIVEPINIENSKNLMNENDYKCYLNLLVEPFNEVEFESST 181
Query: 251 HHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHI--FGGVLLQSPSLHDVDQRE 308
H + + D GY Y+ K +D D ++ + G++ ++P D
Sbjct: 182 HDIAVEKGD----GYIYKIYFNKTEYLDRDLVIYITEKELEETTGMIYENPK----DNSN 233
Query: 309 MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
+ + +P + + + F+VDIS SM+G+ L KNAL + L L+ D+FNIVA
Sbjct: 234 ILYLRFMPDIEEEEENLVHEYEFLVDISSSMEGEKLLQAKNALQICLRSLNEKDTFNIVA 293
Query: 369 FNGETYLFSTSME-LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
+ + F+ + L +E + A ++I N I + ++ + +G
Sbjct: 294 MGDKLHYFNEGGKVLYNEENLILASKFID-NLQCEDDALIFEGIKHGLK--DDHKGKENT 350
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
I L TD V+DE++I D ++ N RI+ FGI + N YF+ LA I+ G
Sbjct: 351 ILLFTDDIVDDEKEILDYVEKVCENS-----RIFPFGIDASVNTYFINKLARITYGKAEY 405
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
D + IE + F R + +I +D D++ + YP I + E P + + +
Sbjct: 406 INDNNKIEDTILNQFNRIRGLQVTDIKLD-FGDMNVEKTYPRTIEYMYDEEPFSIFAKVR 464
Query: 548 GKFPDTLKAKGFLGD--------LSNFVVELKLQLAKDIPLDRIC--AKQQIDLLTAQAW 597
GK + G +G+ L+N +E + + I + C +Q+I ++ +
Sbjct: 465 GKAEGVVTLTGMVGERRVQRRVSLNNLELEENANVIEKIWYKKRCDSIEQRIRFERSEVY 524
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
S ++ K+ ++S + GVL T I++E
Sbjct: 525 NS----MKNKLIELSKKIGVLTSKTSFILIE 551
>gi|399071467|ref|ZP_10749999.1| marine proteobacterial sortase target protein [Caulobacter sp.
AP07]
gi|398043381|gb|EJL36291.1| marine proteobacterial sortase target protein [Caulobacter sp.
AP07]
Length = 718
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/399 (26%), Positives = 182/399 (45%), Gaps = 49/399 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKRE- 230
+PN+FT + + G + +++ + ++ GE+S+ VP P Y +P P
Sbjct: 168 RPNMFTNAVANVGPGETVLVQIEYQAPVAVSAGEYSLRVPLVVAPRYTSPGAAPTPDANA 227
Query: 231 -----KIHLNVNAGT-------GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNID 278
++H VN T G + TS+ H +R G+ G + + D
Sbjct: 228 AIVDPRVHATVNPVTITVHLRAGFPLASVTSATHAVDVR---GERGGKVVTLAQGKTPAD 284
Query: 279 FDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL-----PGTAKSRKVFKKDVIFIV 333
DF T P+ L++PS+ ++ Y+L P ++++ +D++F++
Sbjct: 285 RDFVLTWRPAP------LKAPSVGLFHEKVAGGDYVLAQVTPPISSRTGPPAPRDIVFVI 338
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAH 392
D SGSM G+ + K +LA L+ L GD FN+V F+ T LF ++E A V +A
Sbjct: 339 DNSGSMGGESMRQAKASLAYGLAHLKAGDRFNVVRFDDTLTVLFPDTVE-ADAAHVAKAQ 397
Query: 393 QWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLT 451
+++ A G T + +T A+ + G + + +TDGA+ DE+ + DA+ +
Sbjct: 398 KFVA-GLDASGGTEMIPAMTAALKDARAADTGRVRQVVFLTDGAIGDEQGLFDAISA--- 453
Query: 452 NGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLA 511
G R++ GIGS N + + A + RG + D +E M++LF + S V
Sbjct: 454 --GRGRSRVFMVGIGSAPNTFLMSRAAELGRGTFTHIGSTDGVEEAMRQLFDKLESPVAT 511
Query: 512 NIAIDTLKDLDEFE-----MYPSRIPDLSSESPLIVSGR 545
N+ + FE M P+ +PDL P+ V+ R
Sbjct: 512 NL-------VASFEGATADMAPAVLPDLYRGEPVTVAAR 543
>gi|241205700|ref|YP_002976796.1| Vault protein inter-alpha-trypsin domain-containing protein
[Rhizobium leguminosarum bv. trifolii WSM1325]
gi|240859590|gb|ACS57257.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium
leguminosarum bv. trifolii WSM1325]
Length = 791
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 119/517 (23%), Positives = 229/517 (44%), Gaps = 66/517 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 169 RPNIFTNQVANIGPGETIVVQIEYQQTIHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 224
Query: 228 ----------------KREKI----------------HLNVNAGTGTEVLCNTSSHHLKQ 255
R+KI L V+ G + SS H
Sbjct: 225 EFNNGAGFATPRDPVENRDKIAAPVLDPRENARINPVSLTVDLRAGFPLGDVKSSFHAVD 284
Query: 256 LRRDVGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
+ +D G + LK + DF+ ++ + + L + +D +
Sbjct: 285 INQD----GDQARTISLKADTVPADKDFELTWKAAAGKMPSAGLFR----EVIDGKTYLL 336
Query: 312 MYLLPGTAKSRKVF--KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
++ P A K++V+F++D SGSM G +E + +LA+A+SKL+P D FN++ F
Sbjct: 337 AFVTPPAAPDTAAPPAKREVVFVIDNSGSMSGPSIEQARQSLALAISKLNPDDRFNVIRF 396
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPII 428
+ + + AT + E+A ++ A G T + L A+ + G++ +
Sbjct: 397 DDTMTDYFKGLVTATPDNREKAIGYVR-GLTADGGTEMLPALQAALRNQGPVASGALRQV 455
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
+TDGA+ +E+Q+ + + + R++T GIGS N YF+ A + RG + A
Sbjct: 456 VFLTDGAIGNEQQLFQEITANRGDA-----RVFTVGIGSAPNTYFMTKAAEMGRGTFTAI 510
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
D + +M +LF + + + +IA T + + ++ P+ +PDL S P++++ +
Sbjct: 511 GSTDQVASRMGELFAKLQNPAMTDIAA-TFEGIKAEDITPNPMPDLYSGEPVVLTAQLPE 569
Query: 549 KFPDTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLEE 606
L+ G GD + VE+ + A D + ++ A+++ID A+A+ +D L++
Sbjct: 570 NNAGKLQIIGKTGD-QPWRVEMDIANAADGSGISKLWARRKIDDFEARAYERQDPAALDK 628
Query: 607 KVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ +++ ++ T ++ V+ A + G+ K
Sbjct: 629 DIETVALAHHLVSRVTSLVAVDVTPSRPADQPLGSAK 665
>gi|336424687|ref|ZP_08604722.1| hypothetical protein HMPREF0994_00728 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336014220|gb|EGN44077.1| hypothetical protein HMPREF0994_00728 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 682
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 136/594 (22%), Positives = 247/594 (41%), Gaps = 60/594 (10%)
Query: 85 PLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISG 144
PL+ V ++++ FV + + G + P + +I G++ EI G
Sbjct: 54 PLKETNVSVNINGIFAETFVT-----QTYSNEGQDPINATYVFPASSRVTIHGMKMEI-G 107
Query: 145 KSYHTQLIALGEN--------DGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRW 196
T I E GKSAS+ +PN+FT+ + + G + I L +
Sbjct: 108 DEIITAKIKEKEEARHDYEQAKSEGKSASLLEQQ--RPNVFTMDVANVMPGDIIRIELHY 165
Query: 197 SQKLSYRDGEFSVNVP-FKFPEYVTPAIKKIPK----------------REKIHLNVNAG 239
++ ++ DG + P P Y +P++ K K REK ++NVN
Sbjct: 166 TEMITPTDGIYQFVFPTVAGPRYTSPSVPKSLKAETWIASPFLRLGDTPREKYNINVNLS 225
Query: 240 TG---TEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLL 296
G T++ C + + + ++ S E N DF Y ++ + I G++L
Sbjct: 226 AGVPITDLQCGSHKIDVAWDNQTSARISLSNPEEFA--GNRDFILDYKLTGTEISSGLML 283
Query: 297 QSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVA 354
+ + E F + ++ P + ++ IFI+D+SGSM G PL+ +K +
Sbjct: 284 GT------GESENFFLLMVQPPERYTPETIPPREYIFILDVSGSMYGFPLDTSKELIRNL 337
Query: 355 LSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L D FN++ F+ S AT E V++A I N GG T + L A
Sbjct: 338 AGSLRETDRFNVILFSDSLIQMSPESVPATAENVQKAFALID-NEEGGGGTELAPALETA 396
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + S G+ + +TDG + ER+I D + + ++FGIGS N Y +
Sbjct: 397 L-AIPASPGAARSVIAITDGYISGEREIFDIIGRNIG-----TTNFFSFGIGSSVNRYLI 450
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+A G D + T S VL ++ + T D +++ P + L
Sbjct: 451 DGIAKTGLGESFVVTDPSEAAATADRFRTYIQSPVLTDVQV-TYDGFDVYDIEPPTLSTL 509
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGD---LSNFVVELKLQLAKDIPLDRICAKQQIDL 591
+ P+++ G+++G+ T+ G G + V + L + + + A+ +++
Sbjct: 510 FASRPIVLFGKWKGEPSGTIHITGKTGGRDYAEDIPVSAVIPLEANNIIQYLWARTRVER 569
Query: 592 LTAQAWFSEDKRLEEK--VSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
LT +ED + K V+++ ++ ++ YT I V RN +S K+
Sbjct: 570 LTDYG-ITEDLQANAKKEVTELGLRYSMMTPYTSFIAVTEKVRNTEGKSTDVKQ 622
>gi|190892740|ref|YP_001979282.1| vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652]
gi|190698019|gb|ACE92104.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli
CIAT 652]
Length = 794
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 129/525 (24%), Positives = 232/525 (44%), Gaps = 81/525 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+PNIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 169 RPNIFTNQVANIGPGEEIIVQIEYQQTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 224
Query: 228 ----------------KREKIH----------------LNVNAGTGTEVLCNTSSHHLKQ 255
REKI L VN G + SS H
Sbjct: 225 EFNNGAGFATPRDPVENREKIEASVLDPSENAKINPVSLTVNLKAGFPLGDVNSSFHAVD 284
Query: 256 LRRDVGKLGYSYESEVLKWSNI--DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
+R+D + + LK + D DF T + L ++PS + + Y
Sbjct: 285 IRQDSDQ----ARTMSLKGDAVPADKDFELT------WKAALGKTPSAGLFREVKDGKTY 334
Query: 314 LLP------GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
LL + K++V+F++D SGSM G+ +E + +LA+A+S+L D FN++
Sbjct: 335 LLAFVTPPTAPDAAAAPTKREVVFVIDNSGSMSGQSIEQARQSLALAISRLSKDDRFNVI 394
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP- 426
F+ + + A+ + E+A I ++ G S + + A+E ++G +
Sbjct: 395 RFDDTMTDYFNGLVAASPDNREKA-----ITYVRGLSADGGTEMLPALEDALRNQGPVAS 449
Query: 427 -----IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
++FL TDGA+ +E+Q+ + + N G R++T GIGS N YF+ A I
Sbjct: 450 GALRQVVFL-TDGAIGNEQQLFQEISA---NRGDA--RVFTVGIGSAPNTYFMTKAAEIG 503
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
RG + A D + +M +LF + + + +IA T + ++ P+ +PDL S P++
Sbjct: 504 RGTFTAIGSTDQVASRMGELFAKLQNPAMTDIAA-TFEGTQAEDITPNPMPDLYSGEPVV 562
Query: 542 VSGRYQGKFP-DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFS 599
++ P L+ G GD + VE+ + A D + ++ A+++ID A+A+
Sbjct: 563 LTAELPRDKPAGKLQIVGKTGD-QPWRVEMDIAHAADGNGISKLWARRKIDDFEARAYQR 621
Query: 600 ED-KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+D L++ + +++ ++ T ++ V+ A+E G K
Sbjct: 622 QDPAALDKDIETVALAHHLVSRVTSLVAVDVTPSRPANEPLGAAK 666
>gi|430744235|ref|YP_007203364.1| hypothetical protein Sinac_3408 [Singulisphaera acidiphila DSM
18658]
gi|430015955|gb|AGA27669.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Singulisphaera acidiphila DSM 18658]
Length = 819
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 144/607 (23%), Positives = 243/607 (40%), Gaps = 77/607 (12%)
Query: 83 LIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEI 142
L+PL+ V +D L +R + + + + P+ D+ ++ E+
Sbjct: 34 LLPLKALEVRGRIDGLLAQVSIR-----QTFVNVLDEPLEATYIFPLPDRAAVTRFRMEV 88
Query: 143 SGKSYHTQLIALGENDG-------AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLR 195
+G+ L G+ AG AS+ +P +FTL + + G + L
Sbjct: 89 AGRVIEGVLEERGQARKEYQQAIEAGHRASIAEEE--RPGVFTLRVGNLPPGEEAVVELS 146
Query: 196 WSQKLSYRDGEFSVNVPFKFPEYVTPA-------------------IKKIPKREKIH--- 233
+ L + DGE V F+FP V P +P +I
Sbjct: 147 LTGPLPFADGE----VTFRFPLVVAPRYIPGIPLPGPSTGKGVAHDTDAVPDASRISPPV 202
Query: 234 --------------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYES--EVLKWSNI 277
++V+ + +S H + + G GY + + N
Sbjct: 203 LLPGFPNPVQLTLTVDVHESIASVGALRSSLHAVLE----TGDQGYRRVTLLNTGERLNR 258
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTAKSRKVFKKDVIFIVDIS 336
DF + + I + + + D E F + ++P TA S +D+IF++D S
Sbjct: 259 DFILRFKLGAESIETSLSVHPDANTDRGGNEGTFALTIVPPTASSEVTRPRDLIFVLDRS 318
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL---FS-TSMELATKEAVERAH 392
GSM+G + + ALA + L D FN++AF+ FS + AT RA
Sbjct: 319 GSMEGWKIVAARRALARMVDTLGAADRFNVLAFDDRIEYPPGFSDQGLVTATDRQRFRAV 378
Query: 393 QWIGINFIAGGSTNICAPLTKAVEMLTNSR-GSIPIIFLVTDGAVEDERQICDAMKSRLT 451
+++ A G T + PL + VE+L + G I+ LVTDG V +E QI + RL
Sbjct: 379 EFLA-TLTARGGTEMAGPLDQGVEILARAETGRDRILVLVTDGQVGNEDQILKGLGRRLQ 437
Query: 452 NGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLA 511
RI+T GI N FL LA + G D ++ M + R VL
Sbjct: 438 G-----IRIFTLGIDRAVNEGFLHRLAELGGGTSEIVESEDRLDEVMDAVHRRIGIPVLT 492
Query: 512 NIAI--DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVE 569
+ + D L + + + P RIPDL +P+++ GRY+G FP++L G +
Sbjct: 493 GLRLEPDKLPVVPD-SLVPGRIPDLFPSAPVLILGRYRGAFPESLTLSGRDAAGQPWSQA 551
Query: 570 LKLQLAKDIPLDRICAKQQIDLLTAQAWFSED--KRLEEKVSKMSVQTGVLCEYTRMIIV 627
++ + + A+ Q+ L + + LE+ + S+Q GVLC +T + +
Sbjct: 552 AVARVRDNPAIAAAWARGQLRKLEDRYTVEQTGCAELEKTIVATSLQFGVLCRFTAYVAI 611
Query: 628 ETDERNN 634
+ E N
Sbjct: 612 DLAEVVN 618
>gi|77359583|ref|YP_339158.1| inter-alpha-trypsin inhibitor [Pseudoalteromonas haloplanktis
TAC125]
gi|76874494|emb|CAI85715.1| conserved protein of unknown function ; putative
Inter-alpha-trypsin inhibitor domain protein
[Pseudoalteromonas haloplanktis TAC125]
Length = 664
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 245/563 (43%), Gaps = 71/563 (12%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL---IA----LGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ +A E AGK A++
Sbjct: 83 AVNARYVFPLPDESAVHAMTMRIGERVIKGQIDKKVAAEKKYAEAKQAGKQAALVRQQ-- 140
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA---------I 223
+ N+F + I G + I L + + + Y G F+V +FP +TP I
Sbjct: 141 RANMFITNVANIAPGEQVIIELEYQEIIDYSSGTFTV----RFPGTITPRYHVTQGEIDI 196
Query: 224 KKIPKR------------------------EKIHLNVNAGTGTEVL-CNTSSHHLKQLRR 258
K ++ + +LN++ G E++ N+ H++
Sbjct: 197 NKESQKPTNSLPHGWLSPVYSTQKNDDKPSSQFNLNLDIDVGLELVDINSKFHNVNIQNT 256
Query: 259 DVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPG- 317
G+ Y E+ + + ++ DF P + +R M + P
Sbjct: 257 AFGQ----YSIELNEQNALNRDFVLEFKPLQKEQAQAAFFTEQFENGERYGLAMLMPPAD 312
Query: 318 --TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
A R ++ +F+VD SGSM G+ +E KNAL ALS LD DSFNI+ F+ L
Sbjct: 313 NFIATQR--LARETVFVVDTSGSMHGQSMEQAKNALFYALSLLDSNDSFNIIGFDNVVTL 370
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
S +A+ + RA ++I A G T I L ++ + G + + +TDG+
Sbjct: 371 MSDKPLVASGFNLRRAERFI-YGLQADGGTEIQGALDAVLDG-SQFDGFVRQVIFLTDGS 428
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V +E + +++++L + R++T GIGS N +F+R A + +G + ++
Sbjct: 429 VSNEDALFKSIQAKLGDS-----RLFTVGIGSAPNSFFMRRAADVGKGSFTFIGSTSEVQ 483
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+MQ+LF + + N+A+ ++ + + +PS +PDL P++V+ + L
Sbjct: 484 PKMQQLFDKLAHPAITNLALSD-ENGNSLDFWPSPLPDLYFNEPIMVAIKLNNASNVILN 542
Query: 556 AKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQ 614
+ G +S + L Q + + ++ A+Q+I L ++ ++++V ++++
Sbjct: 543 GQTAQGPIS---INLNTQAGSNAKGIAKLWARQKIKSLL---LYNSQNAVKDEVQQLALT 596
Query: 615 TGVLCEYTRMIIVETDERNNASE 637
+L +T I +E + N +E
Sbjct: 597 HQLLSPFTAFIAIEQTQINPIAE 619
>gi|336311710|ref|ZP_08566671.1| hypothetical protein SOHN41_02154 [Shewanella sp. HN-41]
gi|335864824|gb|EGM69893.1| hypothetical protein SOHN41_02154 [Shewanella sp. HN-41]
Length = 751
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 131/523 (25%), Positives = 234/523 (44%), Gaps = 67/523 (12%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY- 218
GK AS+ + +PN+FT + + L + + + + + Y DG FS+ P P Y
Sbjct: 164 GKRASLVSQE--RPNMFTTEVANLAPDEQLVVEISYQEAIKYEDGLFSLRFPLVVAPRYI 221
Query: 219 -------------VTPA----IKKI--PKRE---------KIHLNVNAGTGTEVLCNTSS 250
VT A ++I P R K+ + V G G + S
Sbjct: 222 PGNTADNGANNDRVTSAQVFDAERIVAPIRGTDEGDSPELKVDIWVELGEGVDTRAVESP 281
Query: 251 HHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTV----SP----------SHIFGGVLL 296
+H +L G S + V +N DF + + SP +H +L
Sbjct: 282 YHQIKLSEKRGSTMVSLANSVP--ANRDFVLQWRLKQGNSPLAWVFNQRGKTHQNESDVL 339
Query: 297 QSPSLHDVDQREMFCMYLLPGTAK-SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
+ + +D M L P K S+ +++I ++D SGSM G P+ KNAL AL
Sbjct: 340 SQEAGNAIDDSYSLVMVLPPKADKHSQMRLPRELILVIDTSGSMAGDPIVQAKNALLYAL 399
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
+ L DSFNI+ FN + S + T + RA Q++ A G T + L A
Sbjct: 400 NGLTAQDSFNIIEFNSQMSQLSPASLPVTSSNLSRARQFVN-RLQADGGTEMALALNAAF 458
Query: 416 ----EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNH 471
+ +T S+ + +TDG+V +E+ + D ++ ++ + R++T GIGS N
Sbjct: 459 AANYQNVTQETQSLRQVIFMTDGSVGNEQALFDLIRYQIGDS-----RLFTVGIGSAPNS 513
Query: 472 YFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRI 531
+F++ A + RG + ++ ++ ++ +L + VL +I + D + + +PS I
Sbjct: 514 HFMQRAAELGRGTFTYIGKVEEVDEKISELLNKIQYPVLTDIHV-RFDDGNIPDYWPSPI 572
Query: 532 PDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKL--QLA----KDIPLDRICA 585
PDL P++VS + + L G G N+ L L LA +++ LD + A
Sbjct: 573 PDLYQGEPVLVSLKRSERESKGLVITGRQGH-KNWQQSLSLMDNLAAIPEQNVGLDLLWA 631
Query: 586 KQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
++QI L + D+R++++++ +S+ ++ YT ++ V+
Sbjct: 632 RKQIAALELSKNGANDERVKQQITALSLNYHLVSAYTSLVAVD 674
>gi|320160484|ref|YP_004173708.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1]
gi|319994337|dbj|BAJ63108.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1]
Length = 802
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 183/398 (45%), Gaps = 32/398 (8%)
Query: 248 TSSHHLKQLRRDVGKLGYSYESEVLKWSNI--DFDFSYTVSPSHIFGGVLLQSPSLHDV- 304
+ +H ++ R ++ YE + NI D DF + G LL DV
Sbjct: 220 SPTHSIRSERFGENEVAIRYEEK-----NILPDRDFEMMILLGEGEGLHLLSYRDPGDVQ 274
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
D F M L P KDVIF++D SGSM+G + K AL LS+L+P D F
Sbjct: 275 DSDGFFLMLLAPRIQAPETAIPKDVIFVLDRSGSMEGVKFQQAKQALEYVLSRLNPQDRF 334
Query: 365 NIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS 424
N+++F+ + +F+ ME EA+ +A +W+ A G TNI L A++ + + R +
Sbjct: 335 NLLSFSNQVEVFAPEME--GVEAIPQAQKWVA-GLSAAGGTNIHRALLDAIQFVRSQRPT 391
Query: 425 IPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+IFL TDG + D QI D + G R++ FG+G + + L LA+
Sbjct: 392 Y-LIFL-TDGLPTVGITDREQILDDFARQAPRGL----RLFVFGVGYDVDTFLLDELALA 445
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
G + + + F + + VL ++ + T++ + +++YP +PDL S +
Sbjct: 446 HHGLSLYVRPEEDLNQAVAGFFEKISTPVLTDLTL-TVEGTEVYDVYPQPLPDLFVGSQV 504
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD---------IPLDRICAKQQIDL 591
+++GRY+ T+ G + + L L +D + L R+ A ++I
Sbjct: 505 VITGRYRQPGKVTVVLSGQISGETRTYRFPSLSLKEDSRFEAGGSEVGLPRLWAMRKIGY 564
Query: 592 LTAQAWFS-EDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
L Q + D ++ ++S++ G++ YT ++ E
Sbjct: 565 LLEQIRLNGADPETIAQIVRLSIRYGIITPYTSYLVTE 602
>gi|386313526|ref|YP_006009691.1| Vault protein inter-alpha-trypsin domain-containing protein
[Shewanella putrefaciens 200]
gi|319426151|gb|ADV54225.1| Vault protein inter-alpha-trypsin domain protein [Shewanella
putrefaciens 200]
Length = 757
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 133/555 (23%), Positives = 238/555 (42%), Gaps = 80/555 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K+ Q+ + GK AS+ + +PN+FT + + L + + +
Sbjct: 142 IEGQIHPKAEAKQIFQ--QAKAEGKRASLVSQE--RPNMFTTDVANLAPNEQLVVEISYQ 197
Query: 198 QKLSYRDGEFS-----VNVPFKFP--------------------------EYVTPAIKKI 226
+ + Y DG FS V P P E + +++
Sbjct: 198 ENIKYEDGLFSLRFPLVVAPRYIPGFIHDKASANDNAANSRVTSSEVFDAERIVAPLRRA 257
Query: 227 PKREKIHLNVNA----GTGTEVLCNTSSHHLKQLRRDV---GKLGYSYESEVLKWSNIDF 279
++ LN N G + S +H Q++ D G + S + V+ +N DF
Sbjct: 258 NSKQDPVLNANIEVRLAKGVDKSTIASPYH--QIKLDEPHHGIINVSLTNSVV--ANRDF 313
Query: 280 DFSYT----VSP----------SHIFGGVLLQSPSLHDVDQREMFCMYLLPGT-AKSRKV 324
+ +SP +H G D M L P T +
Sbjct: 314 VLQWRAKQGMSPMALVFNQQGKTHGDGASEDNVSENRQSDYHYSLVMVLPPKTDEHALST 373
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
+++I ++D SGSM G + K+AL AL+ L DSFNI+ FN E S + A
Sbjct: 374 LPRELILVIDTSGSMAGDSIVQAKSALLYALNGLKAEDSFNIIEFNSELTQLSPTSLPAN 433
Query: 385 KEAVERAHQWIGINFIAGGSTNIC----APLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
+ + RA Q+I A G T + A L + + L+ S S+ + +TDG+V +E+
Sbjct: 434 QTHLARARQFIH-RLQADGGTEMALALNAALPRGINRLSESSQSLRQVIFMTDGSVGNEQ 492
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+ D ++ ++ R++T GIGS N +F++ A + RG + ++D +E ++ +
Sbjct: 493 ALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGNVDEVEQKISQ 547
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
L ++ VL +I + D + +PS IPDL P++VS + + P L G
Sbjct: 548 LLSKIQYPVLTDINV-RFDDGGVPDYWPSPIPDLYRGEPVVVSLKRSEREPQELVISGRQ 606
Query: 561 GDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSV 613
G N+ L L+ +D LD + A++QI L + D +++++V+ +S+
Sbjct: 607 GH-KNWQQSLSLKDSSDGLITNQDAGLDLLWARKQIAALELSKNGANDDKVKQQVTALSM 665
Query: 614 QTGVLCEYTRMIIVE 628
++ YT ++ V+
Sbjct: 666 NYHLVSPYTSLVAVD 680
>gi|332706718|ref|ZP_08426779.1| hypothetical protein LYNGBM3L_23300 [Moorea producens 3L]
gi|332354602|gb|EGJ34081.1| hypothetical protein LYNGBM3L_23300 [Moorea producens 3L]
Length = 972
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 105/409 (25%), Positives = 180/409 (44%), Gaps = 55/409 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA-IKKIPKREK 231
+ NIFT +L I G + + LR++ +L ++ G N F FP V P I P E
Sbjct: 118 RDNIFTQSLAHIKPGEQIEVTLRYTDQLQFQGG----NYEFVFPMVVGPRFIPGTPIDES 173
Query: 232 ----------------------------IHLNVNAGTG-TEVLCNTSSHHLKQLRRD--- 259
+ + ++AG TE+ + +++QL +
Sbjct: 174 GDTDFVPDASRITPPLIPPDIRSGHDIGVTVEIDAGVAITEIYSTSHQINIEQLNIEQQG 233
Query: 260 -VGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR-EMFCMYLLPG 317
+ ++ S E + N D Y ++ +L S DQR F +YL+P
Sbjct: 234 AILRVKLSQEDTI---PNKDLILRYGIAKEQTQTTLLTHS------DQRGNHFAVYLIPA 284
Query: 318 TAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF 376
S ++ KDV+F++D SGS G PL ++ + ++ L+P D+F I+ +
Sbjct: 285 LDYSTDEIVPKDVVFVIDTSGSQMGDPLLKSQELMRRFINGLNPKDTFTIIDVSDRATQL 344
Query: 377 STSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAV 436
ST + + +A +I A G T + + + + G + I L++DG +
Sbjct: 345 STKPLSNSPQNCRKAINYIN-QLKANGGTYLLKGIRHLLNLPAAPEGRLRSIVLLSDGYI 403
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+E Q+ ++ +L G RIY+FG+GS N + L LA I RG + E
Sbjct: 404 SNENQVLAEVQQQLKPGN----RIYSFGVGSSPNRFLLNRLAEIGRGIARIVRQDEPTEA 459
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
++ F + + VL NI + T + L E +YP+ PDL ++ PL++ GR
Sbjct: 460 VAEQFFRQINNPVLTNIHV-TWEGLGEAIIYPNPTPDLFTQQPLVICGR 507
>gi|124003329|ref|ZP_01688179.1| von Willebrand factor, type A [Microscilla marina ATCC 23134]
gi|123991427|gb|EAY30858.1| von Willebrand factor, type A [Microscilla marina ATCC 23134]
Length = 827
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 141/593 (23%), Positives = 257/593 (43%), Gaps = 65/593 (10%)
Query: 87 QMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKS 146
QM E VD + V+ T +V+ G + + P Q ++ + I +
Sbjct: 52 QMPLKETKVDVNVAGVIADVTVT-QVYKNEGKNTLEAIYVFPGSTQAAVYAMTMTIGER- 109
Query: 147 YHTQLIALGENDGAGKS------ASVETGSFL---KPNIFTLTLPQIDGGSYLSIRLRWS 197
+LIA E G ++ +T S L +PN+F + + I + + L ++
Sbjct: 110 ---KLIAKIEEKGKARAQYEAAKKQGKTASLLEQHRPNVFQMNVANILPKDLIKVELHYT 166
Query: 198 QKLSYRDGEFSVNVPFKF-PEYV-TPAIK--------KIPKREK---------IHLNVNA 238
+ L DG + + P P Y TPA K K P ++ I+ +NA
Sbjct: 167 ELLVPTDGVYEFSYPTVVGPRYSDTPAAKATAGEKWVKNPYLKEGSKPNYTFDINTTINA 226
Query: 239 GTGTEVL-CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLL- 296
G + + C + ++ + G + +SE N D+ Y ++ S I G+LL
Sbjct: 227 GMPIQQMACTSHKVNVNYQDKSTGVIKLK-KSEKF-GGNRDYIVRYRLAGSKIQSGLLLY 284
Query: 297 ------QSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTK 348
S D + E F + ++ P K+ ++ ++ +FIVD+SGSM G PL +K
Sbjct: 285 EGENEVASGKEEDNENAEKFFLMMMQPPKAPKNSQIPPREYVFIVDVSGSMHGFPLSVSK 344
Query: 349 NALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNIC 408
L + KL P D FN++ F + S AT+ +++A I GG T +
Sbjct: 345 RLLKNLIGKLRPKDKFNVMLFESSNQMMSPESMEATQANIQKAFGVIDQQR-GGGGTRLL 403
Query: 409 APLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSY 468
L KA+ S + +VTDG V E++ D +++ L ++ FGIGS
Sbjct: 404 PALKKALAFKQTKDYSRSFV-VVTDGYVTVEKEAFDLIRNNLNRAN-----LFAFGIGSS 457
Query: 469 CNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMY- 527
N + + +A G +++ +K + VL NI I D F++Y
Sbjct: 458 VNRFLIEGMARAGMGEPFIVTHGTEADVKAEKFRNYIQNPVLTNIKI----KYDGFQVYD 513
Query: 528 --PSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLA---KDIPLDR 582
P +PD+ +E P+IV G+Y+GK + G G+ + +K+ A + +
Sbjct: 514 TEPWAVPDVFAERPIIVYGKYKGKPTGKITVTGLSGN-KTYSKTIKVSSATQENNQAIRY 572
Query: 583 ICAKQQIDLL-TAQAWFSEDKRLE-EKVSKMSVQTGVLCEYTRMIIVETDERN 633
+ A+++I L + ++ +++ +K++++ ++ +L +YT I + + RN
Sbjct: 573 LWARERIKLHDDYRKYYGANRQKHIKKITELGLKYNLLTQYTSFIAISPEVRN 625
>gi|170726477|ref|YP_001760503.1| cell wall anchor domain-containing protein [Shewanella woodyi ATCC
51908]
gi|169811824|gb|ACA86408.1| LPXTG-motif cell wall anchor domain protein [Shewanella woodyi ATCC
51908]
Length = 739
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 131/578 (22%), Positives = 237/578 (41%), Gaps = 84/578 (14%)
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLI-------ALGENDGAGKSASVETGSFLK 173
+ R P+ D+ ++ + E+ K Q+ AGK AS+ + S
Sbjct: 102 LNGRYQFPLPDEAAVDHMRLEVGSKVIEGQIKEKLKAKKQFEMAKKAGKRASLVSQS--T 159
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREKI 232
PNIFT ++ + G L + + + + +SY+ GEFS+ P P Y +P + K +
Sbjct: 160 PNIFTTSVANLGPGEALVVEISYQELVSYKKGEFSLRFPMVVNPRYFSPLSMQEQKSRSV 219
Query: 233 -----------------HLNVNAGT----------GTEVLCNTSSHHLKQLRRDVGKLGY 265
H+ ++A G E+ +S +H + +L
Sbjct: 220 SFANTLATIENGTAFNDHIGIDAAAKVSIEVELDAGVELGLISSPYH----KVSHTQLSG 275
Query: 266 SYESEVLKWSNIDFDFSYTVSP---SHIFGGVLLQSPSLHDVDQREM------------- 309
S+ L D DF P + V Q H + +
Sbjct: 276 SHYRVSLTGVKADRDFVLNWRPQLDTKPVAAVFSQQGKTHSLSSKAQVEPTDSNASTKAD 335
Query: 310 ----------FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
M L P K +++I ++D SGSM G + K AL AL+ L
Sbjct: 336 SNKAVEDDYALLMLLPPSDQKQDVSISRELILVIDTSGSMSGASIAQAKRALNYALAGLK 395
Query: 360 PGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
D+FN++ FN S AT + + A+Q++ + A G T + L A++ T
Sbjct: 396 AKDTFNVIEFNSNVGSLSPYSLPATAKNIGLANQYVR-SLKANGGTEMQLALNAALDKGT 454
Query: 420 NSRG----SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
+ + + +TDG+V DE+ + +K ++ R++T GIGS N +F+R
Sbjct: 455 ETEALGSERLRQVLFMTDGSVGDEQSLFHLIKQKIGES-----RLFTLGIGSAPNSHFMR 509
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
A RG + LD ++ +++ L + L +I + D + +P+ IPDL
Sbjct: 510 RAAEFGRGTFTYIGKLDEVQSKIESLLYQIERPQLTDIKL-RYADNRVPDYWPAMIPDLY 568
Query: 536 SESPLIVSGRYQG----KFPDTLKAKGFL-GDLSNFVVELKLQLAKDIPLDRICAKQQID 590
+E PL+V+ + P L G + G V LK + ++ LD I A++QI
Sbjct: 569 AEEPLLVAIKMNSTQHVSSPTELIVSGTIGGQYWQESVSLK-ERKPELGLDLIWARKQIA 627
Query: 591 LLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
L ++R++++++ +++ ++ +T ++ VE
Sbjct: 628 ALELSKNGVNEQRVKQQITALALDYHLVSAHTSLVAVE 665
>gi|428320903|ref|YP_007118785.1| Vault protein inter-alpha-trypsin domain-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428244583|gb|AFZ10369.1| Vault protein inter-alpha-trypsin domain-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 786
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 122/513 (23%), Positives = 221/513 (43%), Gaps = 54/513 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV--TP-------- 221
+ NIFT +L I G + + +R++ L + G++ P P Y+ TP
Sbjct: 115 RDNIFTQSLANIKPGESIDVTIRYTDSLKFLAGDYEFIFPMVVGPRYIPGTPLDETAASG 174
Query: 222 ----------AIKKIPKREKIH-----------------LNVNAGTGTEVLCNTSSHHLK 254
+ +P +++ L + AG + N+ SH L+
Sbjct: 175 SAPAPMTVNSSTDIVPDASRLNAPILPPGMRSGHDIGVTLEIEAGVPIRNV-NSPSHQLQ 233
Query: 255 QLRRDVGKLGYSYESE-----VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE- 308
D + + E + ++ N D + + + G QS L D R
Sbjct: 234 IEYFDSPQTSLASEHKNGLRVRVQLGNQDTIPNKDLIVRYQIAGRETQSTVLTQADDRGG 293
Query: 309 MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
F +YL+P K+ ++ KDV+F+VD SGS G PL+ + L ++ L+P D+F+I+
Sbjct: 294 HFAVYLIPAIEYKTDEIVPKDVVFLVDTSGSQSGAPLQQCQALLRQFINGLNPADTFSIL 353
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
F+ + S T E ++A ++I A G T + + A++ S G +
Sbjct: 354 DFSDKVRRLSQQPLPNTAENRQQAIEYIN-KLRASGGTEMLGGIRAAIDFPAPS-GRLRT 411
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
+ ++DG + +E Q+ ++ L +G R+Y+FG GS N + L +A I RG
Sbjct: 412 VVFLSDGYIGNENQVLAEVQQHLKSGN----RLYSFGAGSSVNRFLLDRIAEIGRGISRI 467
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+ + ++K F + + VL N+ I + + +YPS PDL +E PL++ G+
Sbjct: 468 IRHDEPVNEFVEKFFRQINNPVLVNLEITWQGEGENPVIYPSVAPDLFAEQPLVLFGKKG 527
Query: 548 GKFPDTLKAKGFLGDLSNFVVELKLQLAK--DIPLDRICAKQQIDLLTAQAWFSEDKRLE 605
+ TL KG +V +L + + ++ + +I L Q + K
Sbjct: 528 DRQAGTLHIKGVSAGGHQYVQTYQLDFTDPGNPAVAQLWGRHRIKDLMNQMVQYDTKVGV 587
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
E V++ ++ +L +YT + V D R +ES
Sbjct: 588 ETVTETALTYQLLSQYTAFVAVSDDVRVEPTES 620
>gi|410627309|ref|ZP_11338051.1| hypothetical protein GMES_2525 [Glaciecola mesophila KMM 241]
gi|410153159|dbj|GAC24820.1| hypothetical protein GMES_2525 [Glaciecola mesophila KMM 241]
Length = 797
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 136/597 (22%), Positives = 252/597 (42%), Gaps = 118/597 (19%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT T+ I +SI + + Q + + + F + P P Y
Sbjct: 210 RPNLFTNTIANIGPNESISITIEYQQVVGFDEQTFRLRFPMTITPRYSPNDATDKSTVAT 269
Query: 219 ---------VTPAIKKIPKREK----IHLNVNAGTGTEVLCN--TSSHHLKQLRRDVGKL 263
VT ++I ++ + ++V +G + + S HH + + K
Sbjct: 270 VNTQGWGQSVTAMSQQIKTADEPANPVRISVELDSGFALTADDIISEHHPINISQQGEKN 329
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPS----------------HIFGGVLLQSP---SLHDV 304
G Y E+ + + DF+ T P+ + +G V+L P + H
Sbjct: 330 G-GYHIELAQEHIANHDFALTWQPAPGDAPSAAHFSETQGKYRYGLVMLTPPVQDASHST 388
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
D A ++++ ++V+F++D SGSM G+ + K A+ AL++L P D+
Sbjct: 389 D------------GAVAKQMTPREVVFLLDTSGSMAGESIVQAKRAVDFALTQLRPEDNV 436
Query: 365 NIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML------ 418
NI+ FN AT + ++RA W+ + A G T + LT A+
Sbjct: 437 NIIQFNDSPQALWKRAMPATAKNIQRARNWVA-SLHADGGTEMAPALTLALNKPSLHRDD 495
Query: 419 TNSRGSIPI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
++S GS + + +TDG+V +E + ++++L + R++T GIGS N YF+
Sbjct: 496 SDSLGSHKLRQVVFITDGSVSNEGALMSLIENKLADS-----RLFTIGIGSAPNSYFMTQ 550
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
A RG + D+ ++ +M LF + V+ +I I+ + E E YPS PDL +
Sbjct: 551 AAQAGRGTFTYIGDIQQVQHKMTALFNKLTRPVMQDIHIEFAR---ETEFYPSVTPDLYA 607
Query: 537 ESPLIVSGR---------YQGKFPDT-LKAKGFLGDLSNFVVELKLQL-----AKDIPLD 581
PL++ R Q + PD LK G+ + +++ AK L
Sbjct: 608 AQPLVIHYRVPVTHLENGMQVQHPDNALKVTGWQSARAASSKPWSIRMPLSPTAKRAGLG 667
Query: 582 RICAKQQIDLLTAQAWFSED--------KRLEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
A+++I L+ + K+ ++ ++ ++++ ++ ++T +I V+
Sbjct: 668 VAWAREKIAQLSHTLRRERNSDIRPMMRKKWQDSITNIALEHHIVSQFTSLIAVD----- 722
Query: 634 NASESPGTKKGSKKSDHQKILDSEV----PKTIMLQNLGIG----FGDLTATAENFP 682
E P T + K +H+ D+ + P + + L I G L TA N P
Sbjct: 723 ---EKPPTPQ-DVKPEHENSKDTPINHNAPAGLAMAGLQITGLQMVGQLPQTATNTP 775
>gi|218510669|ref|ZP_03508547.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli
Brasil 5]
Length = 784
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 133/560 (23%), Positives = 245/560 (43%), Gaps = 85/560 (15%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
VE +I + ++ + +G K+A +E +PNIFT + I G + +++ +
Sbjct: 138 VEGQIKPRQEAREIYEQAKAEGK-KTALLEQ---QRPNIFTSQVANIGPGEEIIVQIEYQ 193
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTPAIKKIP---------------------KREKIH--- 233
Q + GEFS+ +FP V P P REKI
Sbjct: 194 QTVHQSGGEFSL----RFPMVVAPRYNPAPIVQTVEFNNGAGFATPRDPVENREKIEASV 249
Query: 234 -------------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI--D 278
L VN G + SS H +R+D + + LK + D
Sbjct: 250 LDPNENAKINPVSLTVNLKAGFPLGDVNSSFHAVDIRQDSDQ----ARTISLKGGAVPAD 305
Query: 279 FDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLP------GTAKSRKVFKKDVIFI 332
DF T + L ++PS + + YLL + K++V+F+
Sbjct: 306 KDFELT------WKAALGKTPSAGLFREVKDGKTYLLAFVTPPTAPDAAAAPTKREVVFV 359
Query: 333 VDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAH 392
+D SGSM G+ +E + +LA+A+S+L D FN++ F+ + + A+ + E+A
Sbjct: 360 IDNSGSMSGQSIEQARQSLALAISRLSKDDRFNVIRFDDTMTDYFNGLVAASPDNREKA- 418
Query: 393 QWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP------IIFLVTDGAVEDERQICDAM 446
I ++ G S + + A+E ++G + ++FL TDGA+ +E+Q+ +
Sbjct: 419 ----ITYVRGLSADGGTEMLPALEDALRNQGPVASGALRQVVFL-TDGAIGNEQQLFQEI 473
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
+ + R++T GIGS N YF+ A I RG + A D + +M +LF +
Sbjct: 474 SANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQ 528
Query: 507 SSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP-DTLKAKGFLGDLSN 565
+ + +IA T + ++ P+ +PDL S P++++ P L+ G GD
Sbjct: 529 NPAMTDIAA-TFEGTQAEDITPNPMPDLYSGEPVVLTAELPRDKPAGKLQIVGKTGD-QP 586
Query: 566 FVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLEEKVSKMSVQTGVLCEYTR 623
+ VE+ + A D + ++ A+++ID A+A+ +D L++ + +++ ++ T
Sbjct: 587 WRVEMDIAHAADGNGISKLWARRKIDDFEARAYQRQDPAALDKDIETVALAHHLVSRVTS 646
Query: 624 MIIVETDERNNASESPGTKK 643
++ V+ +E G K
Sbjct: 647 LVAVDVTPSRPENEPLGAAK 666
>gi|90019771|ref|YP_525598.1| inter-alpha-trypsin inhibitor domain-containing protein
[Saccharophagus degradans 2-40]
gi|89949371|gb|ABD79386.1| von Willebrand factor, type A [Saccharophagus degradans 2-40]
Length = 763
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 163/342 (47%), Gaps = 39/342 (11%)
Query: 308 EMFCMYLLP--GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
+ + LLP G + + +D++F+VD SGSMQG ++ K +L AL L+P D+FN
Sbjct: 365 DYLLLMLLPPQGQQQHTQSLSRDIVFVVDTSGSMQGTSIQQAKRSLQFALRGLNPSDTFN 424
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT------ 419
I+ F+ F + AT V+ A W+ N A T + A L +A + L
Sbjct: 425 IIEFDTSFSRFRSRPVSATASNVQAAVSWVN-NLNADNGTEMYAALEEAFDQLASINPNG 483
Query: 420 --NSRGS--IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
NS+ S + + +TDGAV +E+ + + RL N R++T IGS N YF+R
Sbjct: 484 TENSKSSNNLQQVVFITDGAVGNEQALLSLIHRRLNNA-----RLFTVAIGSAPNSYFMR 538
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
A +G D + +M L ++ ++++++I + + + E+YP RIPDL
Sbjct: 539 KAAQFGKGANVFIGDTAEVTHKMNALLSKLKTTLVSDINVQWPQ---QSEVYPQRIPDLY 595
Query: 536 SESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQIDLLTA 594
+ PL+++ + G T+ G L + +L + + + ++ AK +ID L
Sbjct: 596 AGEPLLLAAKTSGAM-GTIDISGNTA-LQPWQSQLTINPYHNNSGVAQVWAKSKIDAL-- 651
Query: 595 QAWFSEDKRLE--------EKVSKMSVQTGVLCEYTRMIIVE 628
ED + E ++V +++ ++ YT + VE
Sbjct: 652 -----EDSKTEGANPQDVRKQVVDVALTHALITPYTSFVAVE 688
>gi|444308483|ref|ZP_21144128.1| cell wall anchor domain-containing protein [Ochrobactrum
intermedium M86]
gi|443488066|gb|ELT50823.1| cell wall anchor domain-containing protein [Ochrobactrum
intermedium M86]
Length = 720
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 213/460 (46%), Gaps = 63/460 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-----PEYVTPAIKKIP 227
+PN+FT + I + I++ + Q + D FS+ VP P +P ++K+
Sbjct: 137 RPNVFTNAVANIGPHEKVVIQIEYQQAVRLADERFSLRVPLVVAPRYNPNIASPVVQKVE 196
Query: 228 KREKIHLNVNAGTG--------------TEVLCNTSSHHLKQLRRD--VGKLGYSYESEV 271
+ + +AG E+ N S + +L+ +GK+ Y
Sbjct: 197 MQNGWGKSSDAGKPDPYNAPIVTPLTPPAELRTNPVSISV-ELKPGFPLGKVESLYHKVR 255
Query: 272 LKWSN---IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF----------CMYLLPGT 318
++ +N + T + F VL S +D Q +F Y+ P
Sbjct: 256 IETTNDATREITLDGTAAADRDF--VLEWSAVANDAPQVGLFREHVGKDDYVLAYVTPPA 313
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFS 377
S K +++V+F++D SGSM G +E K +L ALS L PGD FN++ F+ T F
Sbjct: 314 VASAKKAQREVVFVIDNSGSMGGTSIEQAKASLDYALSHLQPGDRFNVIRFDDTLTRFFE 373
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE 437
S+E A+++ + A ++ ++ A G T + L A++ G I+FL TDG +
Sbjct: 374 VSVE-ASQQNIASARHFV-MSLEAQGGTAMLPALHAALDDSHQGNGLRQIVFL-TDGEIS 430
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY---GAAYDLDSI 494
+E+Q+ DA+ +R G S RI+ GIG+ N Y + A + RG + G+A ++D
Sbjct: 431 NEQQLLDAIAAR--RGRS---RIFMVGIGTAPNSYLMNHAAELGRGTFTHIGSAAEVDE- 484
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLK-DLDE--FEMYPSRIPDLSSESPLIVSGRYQGKFP 551
+M+ LF + L N A+ LK + E M PS +PDL PL+++ R GK
Sbjct: 485 --RMRALFDK-----LENPAVTDLKANFSEKNVSMTPSILPDLYRGEPLVIAARM-GKAA 536
Query: 552 DTLKAKGFLGDLSNFVVELKL-QLAKDIPLDRICAKQQID 590
L +G + D + V L L Q K + ++ A+++ID
Sbjct: 537 GNLVIEGQI-DGRPWTVNLPLDQAMKAEGISKLWARRKID 575
>gi|254447511|ref|ZP_05060977.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium
HTCC5015]
gi|198262854|gb|EDY87133.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium
HTCC5015]
Length = 670
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 127/542 (23%), Positives = 222/542 (40%), Gaps = 101/542 (18%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREK- 231
+PN+FT + I G + + LR+++ L+ EFS+ +P P I E+
Sbjct: 152 RPNLFTSKVANIAPGETIHVELRYTEALAIDGREFSLRLPTTMTSRFHPQESSIKPVEQG 211
Query: 232 ----------------IHLNVNAGTGTEVLCNTSSHHLKQLRRDVGK-----LGYSYESE 270
I ++++ G + + + SH + R +G+ +G S+ +
Sbjct: 212 PIVPSSAVGQSSHLADITVDIDGGWPIQNI-ESPSHPF--VERSLGRGYRVHMGSSFSDK 268
Query: 271 VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVI 330
V + D + + P G + S + M P S ++V+
Sbjct: 269 VAM--DQDVVLRWQLDPVASASGAVF---SEEYKGEHYALVMLRTPDEMTSGPRMPREVV 323
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
F++D SGSM G+ + K AL+ A+ +L P D FN+V FN + +SM A+ V++
Sbjct: 324 FVIDTSGSMAGQRMYHAKQALSQAVERLSPDDRFNVVEFNNQHSRLFSSMRSASAINVKQ 383
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A W+G GG T + + A+ + ++ + + L+TD +V +E +I ++ R
Sbjct: 384 ALNWVG-RLQGGGGTMMLPAVEDALSVRSDP-AYLRQVILITDASVGNEAEILRVVE-RQ 440
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVL 510
G R++T GIG N Y LR A + +G Y ++ +MQ+LF + + VL
Sbjct: 441 RKGA----RLFTVGIGVSPNSYLLRKAAQVGQGDYVYIASGQEVKARMQRLFAKLENPVL 496
Query: 511 ANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVEL 570
+ ID L + E E++P+ +PDL Y G+ L
Sbjct: 497 KQLNID-LPEGAEAEVWPNPLPDL-----------YHGR-------------------PL 525
Query: 571 KLQLAKDIPLDRICAKQQIDLLTAQA---------------WFSE--------------D 601
L + D P+D + K Q D L Q W E
Sbjct: 526 YLAMKLDKPIDHLVLKAQTDRLWQQRIALPEPTEATGIHTLWAREKIAAQLDGLRQGKTQ 585
Query: 602 KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKT 661
+ + + V ++++ V+ YT + VE A E KK S +++ +D PKT
Sbjct: 586 EEVRQAVLAVALEHAVMSPYTSFVAVEETPVRPAGED--LKKAS--VPNRRPVDQAYPKT 641
Query: 662 IM 663
+
Sbjct: 642 AL 643
>gi|310824705|ref|YP_003957063.1| von willebrand factor type a domain-containing protein [Stigmatella
aurantiaca DW4/3-1]
gi|309397777|gb|ADO75236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca
DW4/3-1]
Length = 881
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/446 (26%), Positives = 218/446 (48%), Gaps = 54/446 (12%)
Query: 217 EYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSY-ESEVLKWS 275
+ +TP I + KI L V+ G EV+ + SH L L ++ ++ S+ +EV
Sbjct: 181 DRITPPIGAVGYGLKIDLRVD--LGREVVVESPSHALT-LNKEGHQVRVSFARNEV---- 233
Query: 276 NIDFDFSYTV-SPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLP---GTAKSRKVFKKDVI 330
+D DF TV SP L H + ++ F + ++P G A K +++V+
Sbjct: 234 ALDRDFVLTVRSPDT---SASLTPLVTHRLGEKPGTFALTVVPDLLGLATGPK--RQEVV 288
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
F+VD SGSM+G+ L + AL + L L GD FNI+AF+ F+ + T++ +E+
Sbjct: 289 FVVDTSGSMEGESLPQAQGALRLCLRHLREGDRFNIIAFDTSFQSFAPQPAVFTQKTLEQ 348
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA-MKSR 449
A +W+ A G T + P+ AV+ ++ L+TDG V +E +I A +++R
Sbjct: 349 ADRWVAA-LRANGGTELLQPMLAAVQAAPEG-----VVVLLTDGQVGNEAEILQAVLRAR 402
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
T RIY+FGIG+ + L+ +A + G + + I+ ++ F+R +
Sbjct: 403 KTA------RIYSFGIGTNVSDALLKDMARQTDGAVEFIHPGERIDDKVVAQFSRALAPR 456
Query: 510 LANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ--GKFPDTLKAKGFLGDLSNFV 567
+ + + ++ E+ P+ +P L P + GRY G+ TLK + FV
Sbjct: 457 ITELQAH-FEGVEGTELAPAELPALVDAVPWTLLGRYTTPGRGKVTLKGR---SGAEPFV 512
Query: 568 VELKLQL--AKDIP-LDRICAKQQIDLLTAQAWFSED------KRLEEKVSKMSVQTGVL 618
+ + + D P ++++ A ++I + W S + +R++E++ +++V+ ++
Sbjct: 513 LSVPVDFPAVSDRPAVEKLWAAERI-----RGWESAELDGRRAQRMKERIVQLAVEHQLV 567
Query: 619 CEYTRMIIVE--TDERNNASESPGTK 642
+T ++VE T ER AS P T+
Sbjct: 568 TRHTSFVVVEERTGER-RASGQPETR 592
>gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301]
gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301]
Length = 777
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 213/460 (46%), Gaps = 63/460 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-----PEYVTPAIKKIP 227
+PN+FT + I + I++ + Q + D FS+ VP P +P ++K+
Sbjct: 194 RPNVFTNAVANIGPHEKVVIQIEYQQAVRLADERFSLRVPLVVAPRYNPNIASPVVQKVE 253
Query: 228 KREKIHLNVNAGTG--------------TEVLCNTSSHHLKQLRRD--VGKLGYSYESEV 271
+ + +AG E+ N S + +L+ +GK+ Y
Sbjct: 254 MQNGWGKSSDAGKPDPYNAPIVTPLTPPAELRTNPVSISV-ELKPGFPLGKVESLYHKVR 312
Query: 272 LKWSN---IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF----------CMYLLPGT 318
++ +N + T + F VL S +D Q +F Y+ P
Sbjct: 313 IETTNDATREITLDGTAAADRDF--VLEWSAVANDAPQVGLFREHVGKDDYVLAYVTPPA 370
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFS 377
S K +++V+F++D SGSM G +E K +L ALS L PGD FN++ F+ T F
Sbjct: 371 VASAKKAQREVVFVIDNSGSMGGTSIEQAKASLDYALSHLQPGDRFNVIRFDDTLTRFFE 430
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE 437
S+E A+++ + A ++ ++ A G T + L A++ G I+FL TDG +
Sbjct: 431 VSVE-ASQQNIASARHFV-MSLEAQGGTAMLPALHAALDDSHQGNGLRQIVFL-TDGEIS 487
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY---GAAYDLDSI 494
+E+Q+ DA+ +R G S RI+ GIG+ N Y + A + RG + G+A ++D
Sbjct: 488 NEQQLLDAIAAR--RGRS---RIFMVGIGTAPNSYLMNHAAELGRGTFTHIGSAAEVDE- 541
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLK-DLDE--FEMYPSRIPDLSSESPLIVSGRYQGKFP 551
+M+ LF + L N A+ LK + E M PS +PDL PL+++ R GK
Sbjct: 542 --RMRALFDK-----LENPAVTDLKANFSEKNVSMTPSILPDLYRGEPLVIAARM-GKAA 593
Query: 552 DTLKAKGFLGDLSNFVVELKL-QLAKDIPLDRICAKQQID 590
L +G + D + V L L Q K + ++ A+++ID
Sbjct: 594 GNLVIEGQI-DGRPWTVNLPLDQAMKAEGISKLWARRKID 632
>gi|193215485|ref|YP_001996684.1| Vault protein inter-alpha-trypsin domain-containing protein
[Chloroherpeton thalassium ATCC 35110]
gi|193088962|gb|ACF14237.1| Vault protein inter-alpha-trypsin domain protein [Chloroherpeton
thalassium ATCC 35110]
Length = 837
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 134/636 (21%), Positives = 270/636 (42%), Gaps = 79/636 (12%)
Query: 59 PKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGS 118
P +VD + + PY + D P L + + DV A V V+ +V+ G
Sbjct: 53 PALVDENE-RTLSPYFFIKSDDPEAEQLPLLSTSADVKIAGVIADVAVT---QVYKNSGK 108
Query: 119 KSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE--TGSFL---K 173
K + P + ++ G++ I ++ ++ + + A E T S L +
Sbjct: 109 KPIEAIYTFPASTRAAVYGMKMTIGERTIVAKIKERQKAREEYEKAKSEGKTASLLEEQR 168
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV-------------- 219
PN+F + + I G + + L++++ L D + F +P V
Sbjct: 169 PNVFQMNVANILPGDTIRVELKYTELLVPTDAVYE----FVYPSVVGPRYSNRKASDAGE 224
Query: 220 ------TPAIKK--IPKREKIHLNVNAGTGT---EVLCNTSSHHLKQLRRDVGKLGYSYE 268
TP +K+ P E + V TG ++ C + ++ ++ + ++
Sbjct: 225 RDKWVKTPYLKEGEAPTSE-FDIAVEVSTGVPIDDIACVSHKTAVRLEKKSLAEVSLDKS 283
Query: 269 SEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF-KK 327
+ N D+ Y ++ + I G+LL + ++ F + P + K+ +
Sbjct: 284 EKF--GGNRDYILRYRLAGNQIQSGLLL-----FEGEKENFFLATVQPPKRVTEKMIPNR 336
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ I+IVD+SGSM G+P+ +K + L +L P ++FN++ F+G + L S AT +
Sbjct: 337 EYIYIVDVSGSMFGQPIAISKELMKKLLGRLRPTETFNLLLFSGGSKLLSEKSLPATDKN 396
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+E+A + N GG T + L +A+ + GS + ++TDG V E + + ++
Sbjct: 397 IEKAFYALE-NEHGGGGTELLRALNRALGLPKKEAGSRTFV-VITDGYVSFEVETFETIR 454
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR---------GYYGAAYDL------- 491
L ++ GIG+ N + + +A G+ G + +
Sbjct: 455 KNLNKAN-----LFAVGIGNGVNRFLIEGMARAGSGEPTVLELPGHGGFSLRMTDENQKP 509
Query: 492 --DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMY---PSRIPDLSSESPLIVSGRY 546
DS +I++++ + + + D+FE Y P +PD+ +E P+I+ G++
Sbjct: 510 KEDSSDIKLEQKVDKFLEYIEFPVLTQLQYKFDDFETYDVEPVSLPDVMAERPVILFGKW 569
Query: 547 QGKFPDTLKAKGFLGDLSNFVVELKLQL----AKDIPLDRICAKQQIDLLTAQAWFSEDK 602
+GK + GF GD + L ++ K+ L + A+ +I L SE
Sbjct: 570 RGKAEGKITLSGFTGDGEKYEHSLNVKKFKPSEKNAALRYLWARDRIATLGDYNALSESD 629
Query: 603 RLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
++++++ + +L +YT + ++++ RN ES
Sbjct: 630 ERTKEITELGLTYNLLTKYTSFVAIDSEVRNKTGES 665
>gi|115372024|ref|ZP_01459336.1| inter-alpha-trypsin inhibitor family heavy chain-related
protein-hypothetical secreted or membrane-associated
protein containing vWFA domain [Stigmatella aurantiaca
DW4/3-1]
gi|115370989|gb|EAU69912.1| inter-alpha-trypsin inhibitor family heavy chain-related
protein-hypothetical secreted or membrane-associated
protein containing vWFA domain [Stigmatella aurantiaca
DW4/3-1]
Length = 843
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/446 (26%), Positives = 218/446 (48%), Gaps = 54/446 (12%)
Query: 217 EYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSY-ESEVLKWS 275
+ +TP I + KI L V+ G EV+ + SH L L ++ ++ S+ +EV
Sbjct: 143 DRITPPIGAVGYGLKIDLRVD--LGREVVVESPSHALT-LNKEGHQVRVSFARNEV---- 195
Query: 276 NIDFDFSYTV-SPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLP---GTAKSRKVFKKDVI 330
+D DF TV SP L H + ++ F + ++P G A K +++V+
Sbjct: 196 ALDRDFVLTVRSPDT---SASLTPLVTHRLGEKPGTFALTVVPDLLGLATGPK--RQEVV 250
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
F+VD SGSM+G+ L + AL + L L GD FNI+AF+ F+ + T++ +E+
Sbjct: 251 FVVDTSGSMEGESLPQAQGALRLCLRHLREGDRFNIIAFDTSFQSFAPQPAVFTQKTLEQ 310
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA-MKSR 449
A +W+ A G T + P+ AV+ ++ L+TDG V +E +I A +++R
Sbjct: 311 ADRWVAA-LRANGGTELLQPMLAAVQAAPEG-----VVVLLTDGQVGNEAEILQAVLRAR 364
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
T RIY+FGIG+ + L+ +A + G + + I+ ++ F+R +
Sbjct: 365 KTA------RIYSFGIGTNVSDALLKDMARQTDGAVEFIHPGERIDDKVVAQFSRALAPR 418
Query: 510 LANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ--GKFPDTLKAKGFLGDLSNFV 567
+ + + ++ E+ P+ +P L P + GRY G+ TLK + FV
Sbjct: 419 ITELQAH-FEGVEGTELAPAELPALVDAVPWTLLGRYTTPGRGKVTLKGR---SGAEPFV 474
Query: 568 VELKLQL--AKDIP-LDRICAKQQIDLLTAQAWFSED------KRLEEKVSKMSVQTGVL 618
+ + + D P ++++ A ++I + W S + +R++E++ +++V+ ++
Sbjct: 475 LSVPVDFPAVSDRPAVEKLWAAERI-----RGWESAELDGRRAQRMKERIVQLAVEHQLV 529
Query: 619 CEYTRMIIVE--TDERNNASESPGTK 642
+T ++VE T ER AS P T+
Sbjct: 530 TRHTSFVVVEERTGER-RASGQPETR 554
>gi|149371058|ref|ZP_01890653.1| von Willebrand factor type A like domain [unidentified eubacterium
SCB49]
gi|149355844|gb|EDM44402.1| von Willebrand factor type A like domain [unidentified eubacterium
SCB49]
Length = 733
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 138/597 (23%), Positives = 267/597 (44%), Gaps = 57/597 (9%)
Query: 82 SLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAE 141
++IPL+ + E+++ + A V+V+ +V+ G+ + + + P+ Q ++ ++ +
Sbjct: 38 AVIPLKESNTEVNITGTI--AHVKVT---QVYHNEGTTAIEAKYVFPLSTQAAVHDMKMK 92
Query: 142 ISGKSYHTQLIALGENDGAGKSASVETGSFLK-----PNIFTLTLPQIDGGSYLSIRLRW 196
+ + + ++ + + + A E K N+F + + + G +SI + +
Sbjct: 93 VGERVTNAKIFEKQKAEEVYQEAVSEGKRAAKLDQARANVFQMKVGNVMPGDEISIEVYY 152
Query: 197 SQKLSYRDGEFSVNVPF----KFP--------EYVTPAIKK-IPKREKIHLNVNAGTGTE 243
++ LS +G++ P +F + P KK I ++ V G
Sbjct: 153 TEMLSPTNGQYQFVAPSVVGPRFTGEQTNAETSFNNPTTKKGIADTFTYNMQVEIQAGMM 212
Query: 244 VL-CNTSSHHLKQL--RRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
+ +++H + L RD + S +E +N DF +Y++ I G+LL
Sbjct: 213 IQNIESNTHKVNVLYPSRDSASIFLSKSNE--NPANRDFVLNYSLRGEEIASGLLLYE-- 268
Query: 301 LHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
H+ E F Y++ P T K + V ++ +FIVD+SGSM G PLE +K+ + L L
Sbjct: 269 -HN---NEKFFAYMMEPPATVKPKNVTAREYLFIVDVSGSMNGYPLEVSKDLMRNLLCNL 324
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML 418
+ D+FN+ F + +F+ + AT E V A +++ GG+ + A L A E+
Sbjct: 325 NADDTFNVQLFASSSTIFNPTPVEATDENVTNAIKFLTSGQGGGGTQLLSA-LNVAYELP 383
Query: 419 TNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
+ GS + ++TDG V ER+ ++ L ++TFGIGS N Y + +A
Sbjct: 384 RSQEGSSRSMVIITDGYVSVEREAFTKIEENLDQAS-----VFTFGIGSSVNRYLIEGMA 438
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+S+ A + + S V+ + +T K D +++ PS +PD+ +
Sbjct: 439 AVSKSESFIATSREEASKVAEDFKKYIDSPVMTQVKFET-KGFDVYDVEPSAVPDIFAAR 497
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGD---LSNFVVELKLQLAKDIPLDRICAKQQIDLL-TA 594
P+++ G+Y+G+ + G+ G F V + L + A+++I+ L
Sbjct: 498 PVVIFGKYKGEANGEIIMTGYQGRKKIKQRFKVSEGSLTPANKALRYLWARKKIEQLDDY 557
Query: 595 QAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQ 651
F++D + E V + +Q + +T + V+ E K GS KS Q
Sbjct: 558 NTRFNQDTKAE--VIALGLQYNLATRFTSFVAVD--------EQVVNKDGSLKSVKQ 604
>gi|399036855|ref|ZP_10733819.1| marine proteobacterial sortase target protein [Rhizobium sp. CF122]
gi|398065682|gb|EJL57303.1| marine proteobacterial sortase target protein [Rhizobium sp. CF122]
Length = 777
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 124/538 (23%), Positives = 236/538 (43%), Gaps = 71/538 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP----- 227
+ NIFT + I G + +++ + Q + GEFS+ +FP V P P
Sbjct: 170 RSNIFTNQVANIGPGETIVVQIEYQQAVHQSGGEFSL----RFPMVVAPRYNPAPIVQTV 225
Query: 228 ----------------KREKIH----------------LNVNAGTGTEVLCNTSSHHLKQ 255
REKI L VN G + SS H
Sbjct: 226 EFNNGAGFAVPTDPVENREKIEAPVLDPRENAKINPVALTVNLKAGFPLGDVKSSFHDVD 285
Query: 256 LRRDVGK-LGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
+++D + S E + ++ DF+ ++ +P L + ++ + ++
Sbjct: 286 IKQDGDQSRTISITGESVP-ADKDFELTWKAAPGKTPSAGLFR----EVINGKTYLLAFV 340
Query: 315 LPGTA--KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE 372
P S K++V+F++D SGSM G + K +LA+A+SKL+P D FN++ F+
Sbjct: 341 TPPATPDTSSPTAKREVVFVIDNSGSMSGPSIAQAKESLALAISKLNPDDRFNVIRFDDT 400
Query: 373 TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN----SRGSIPII 428
+ + A+ + E+A ++ A G T + L A L N + G++ +
Sbjct: 401 MTDYFNGLVAASPDNREKAIAYVR-GLTADGGTEMLPALQDA---LRNQGPVANGALRQV 456
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
+TDGA+ +E+Q+ + N G R++T GIGS N YF+ A + RG +
Sbjct: 457 VFLTDGAIGNEQQLFGEISQ---NRGEA--RVFTVGIGSAPNSYFMTKAAEMGRGTFTQI 511
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
+ + +M +LF + + + +I T + + ++ P+ +PDL + P++++
Sbjct: 512 GSAEQVATRMGELFAKLQNPAMTDITA-TFEGIKAKDITPNPMPDLYTGEPVVLTAELPE 570
Query: 549 KFP-DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDK-RLE 605
+ P L+ G G + + V++ + A D + ++ A+++ID L A+A+ ED+ L+
Sbjct: 571 EKPAGKLQIIGKTG-IQPWRVDVDIANAADGQGISKLWARRKIDDLEARAYEREDRASLD 629
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK----GSKKSDHQKILDSEVP 659
+ +++ ++ T ++ V+ A E G+ K D +K+ E P
Sbjct: 630 RDIETVALAHHLVSRVTSLVAVDVTPSRPAGEPVGSAKLPLNLPDGWDFEKVFGEEAP 687
>gi|359454985|ref|ZP_09244238.1| uncharacterized protein [Pseudoalteromonas sp. BSi20495]
gi|358047951|dbj|GAA80487.1| uncharacterized protein [Pseudoalteromonas sp. BSi20495]
Length = 672
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 130/569 (22%), Positives = 241/569 (42%), Gaps = 80/569 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ E AGK A++
Sbjct: 79 AVNARYVFPLPDESAVHAMTMRIGERVIKGQIDKKVEAEKKYEEAKKAGKQAALVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+ N+F + I G + I L + + + Y G F++ +FP +TP I ++
Sbjct: 137 RANMFITNVANIGPGEQVIIELEYQEIIDYSSGTFAI----RFPTTITPRYHAISGEVEV 192
Query: 233 H------------------------------------LNVNAGTGTEVL-CNTSSHHLKQ 255
+ LN+N G E++ N+ H++
Sbjct: 193 NTQNQAHVNSLPTGWLSPVYSIKSSTQNDNVPSSQFSLNINIDVGLELVDINSKFHNV-- 250
Query: 256 LRRDVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
DV + L +N ++ DF P + R M +
Sbjct: 251 ---DVQNTAFGQYKIALNETNAVNRDFVLEFKPLQKEQAQAAFFTQQFENGDRYGLAMLM 307
Query: 315 LPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
PG ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ F+
Sbjct: 308 PPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGFDNIV 367
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
S +A+ + RA ++I + A G T I L ++ + G + + +TD
Sbjct: 368 TPMSDKPLIASDFNLRRAERFI-YSLEADGGTEIQGALNAVLDG-SQFDGFVRQVVFLTD 425
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
G+V +E + ++S+L + R++T GIGS N +F+R A I +G +
Sbjct: 426 GSVSNEDALFKNIQSKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGSFTFIGSTSE 480
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRYQGK 549
++ +MQ+LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 481 VQPKMQQLFDK-----LAHPAITNLALSDESGNSLDFWPSPLPDLYFGEPIMVAIKLNNA 535
Query: 550 FPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLEEKV 608
L + G LS ++L Q + + ++ A+Q+I L ++E ++++V
Sbjct: 536 KNVVLAGQTAQGPLS---IKLSTQNSSSAQGIAKLWARQKIKSLL---LYNEQSTVKDEV 589
Query: 609 SKMSVQTGVLCEYTRMIIVETDERNNASE 637
++++ +L +T I +E + +E
Sbjct: 590 QELALTHQLLSPFTAFIAIEQTQIKEVAE 618
>gi|218662246|ref|ZP_03518176.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli
IE4771]
Length = 487
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 85/322 (26%), Positives = 168/322 (52%), Gaps = 12/322 (3%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K++V+F++D SGSM G +E + +LA+A+S+L+P D FN++ F+ + + AT
Sbjct: 46 KREVVFVIDNSGSMSGPSIEQARQSLALAISRLNPDDRFNVIRFDDTMTDYFKGLVAATP 105
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFLVTDGAVEDERQICD 444
+ E+A ++ A G T + L A+ + G++ + +TDGA+ +E+Q+
Sbjct: 106 DNREKAIAYV-RGLTADGGTEMLPALEDALRNQGPVASGALRQVVFLTDGAIGNEQQLFQ 164
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ + + R++T GIGS N YF+ A I RG + A D + +M +LF +
Sbjct: 165 EITANRGDA-----RVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAK 219
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP-DTLKAKGFLGDL 563
+ + +IA T + ++ ++ P+ +PDL S P++++ Q P L+ G GD
Sbjct: 220 LQNPAMTDIAA-TFEGVEAEDITPNPMPDLYSGEPVVLTAELQQDKPAGKLQIVGKTGD- 277
Query: 564 SNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSED-KRLEEKVSKMSVQTGVLCEY 621
+ VE+ + A D + ++ A+++ID A+A+ +D L++ + +++ ++
Sbjct: 278 QPWRVEMDIAHAADGNGISKLWARRKIDDFEARAYQRQDPAALDKDIETVALAHHLVSRV 337
Query: 622 TRMIIVETDERNNASESPGTKK 643
T ++ V+ A+ G+ K
Sbjct: 338 TSLVAVDVKPSRPATGPLGSAK 359
>gi|406935836|gb|EKD69701.1| hypothetical protein ACD_47C00030G0005, partial [uncultured
bacterium]
Length = 500
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 154/305 (50%), Gaps = 22/305 (7%)
Query: 250 SHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREM 309
SH + ++ ++ S+E K + DF YTVS + ++ P+ D
Sbjct: 211 SHKIDVQKKTDNEVKVSFEESNSK-PDKDFTLYYTVSDEDLGISMISHKPAKED----GY 265
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P +KV KD+ F+VD SGSM G + ++AL ++ L+ D+FNI++F
Sbjct: 266 FMLLASPKVETEKKVIPKDICFVVDTSGSMSGDKHKQVQSALKFCVNNLESDDTFNIISF 325
Query: 370 NGETYLFSTSMELATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIP-- 426
+ E L+ ++ +A K+ VE A ++I G+ I G TNI L KA++M R +P
Sbjct: 326 STEPKLYEKNLVVAKKDNVEEALKYIDGMRAIGG--TNINEALYKALKM--EFRKDVPAF 381
Query: 427 IIFLVTDGA----VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
I+FL TDG V D +I +K NG + +I+ FG+G+ N L LA+ +
Sbjct: 382 IVFL-TDGMPTVDVTDPTEILKNVKKE--NG--VKAKIFCFGVGTDLNTTLLDKLAVENN 436
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G + + IE+++ +T+ S VL ++A+D + YP IPD S L++
Sbjct: 437 GVIEYVTENEDIELKISNFYTKIASPVLTDVAVD-FGSIKTDNAYPKEIPDFFKGSQLVL 495
Query: 543 SGRYQ 547
GRY+
Sbjct: 496 IGRYR 500
>gi|359435837|ref|ZP_09225979.1| uncharacterized protein [Pseudoalteromonas sp. BSi20652]
gi|357917515|dbj|GAA62228.1| uncharacterized protein [Pseudoalteromonas sp. BSi20652]
Length = 676
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 130/572 (22%), Positives = 244/572 (42%), Gaps = 82/572 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ E AGK A++
Sbjct: 79 AVNARYVFPLPDESAVHAMTMHIGERVIKGQIDKKVEAEKKYEEAKKAGKQAALVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------- 221
+PN+F + I G + I L + + + Y G F++ +FP +TP
Sbjct: 137 RPNMFITNVANIGPGEQVIIELEYQEIIDYSSGTFAI----RFPTTITPRYHAISGEVEV 192
Query: 222 --------------------AIKKIPKRE---------KIHLNVNAGTGTEVLCNTSSHH 252
+I+ I + + LN++ G E++ +S H
Sbjct: 193 NTQNQAHVNPLPTGWLSPVYSIQNITQNSTQNDTVPSSQFSLNIDIDVGLELVDISSKFH 252
Query: 253 LKQLRRDVGKLGY-SYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
DV + Y+ + + + ++ DF P + R
Sbjct: 253 ----NVDVQNTAFGQYKIALNEMNAVNRDFVLEFKPLQKEQAQAAFFTQQFENGDRYGLA 308
Query: 312 MYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
M + PG ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ F+
Sbjct: 309 MLIPPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGFD 368
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFL 430
S +A+ + RA ++I + A G T I L ++ + G + +
Sbjct: 369 SIVTPMSDKPLIASDFNLRRAERFI-YSLEADGGTEIQGALNAVLDG-SEFDGFVRQVVF 426
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
+TDG+V +E + ++S+L + R++T GIGS N +F+R A I +G +
Sbjct: 427 LTDGSVSNEDVLFKNIQSKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGSFTFIGS 481
Query: 491 LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRY 546
++ +MQ+LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 482 TSEVQPKMQQLFDK-----LAHPAITNLALSDESGNSLDFWPSPLPDLYFGEPIMVAIKL 536
Query: 547 QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLE 605
L + G LS ++L Q + + ++ A+Q+I L ++E ++
Sbjct: 537 NDAKSVVLAGQTAQGPLS---IKLSTQNSSSAQGIAKLWARQKIKSLL---LYNEQSTVK 590
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
++V ++++ +L +T I +E + +E
Sbjct: 591 DEVQELALTHQLLSPFTAFIAIEQTQIKEVAE 622
>gi|217973614|ref|YP_002358365.1| LPXTG-motif cell wall anchor domain-containing protein [Shewanella
baltica OS223]
gi|217498749|gb|ACK46942.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica
OS223]
Length = 772
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 161/335 (48%), Gaps = 27/335 (8%)
Query: 310 FCMYLLPGTAKSRK-VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
M L P KS + +++I ++D SGSM G + KNAL AL L P DSFNI+
Sbjct: 374 LVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIE 433
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI--- 425
FN FS + AT + RA Q++ A G T + L A L S GS+
Sbjct: 434 FNSSLSQFSATSLPATSSNLSRARQFVS-RLQADGGTEMALALDAA---LPKSLGSVSPD 489
Query: 426 ---PI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
P+ + +TDG+V +E+ + D ++ ++ R++T GIGS N +F++ A +
Sbjct: 490 AVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAEL 544
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + +D ++ ++ L ++ VL +I + D + +PS I DL P+
Sbjct: 545 GRGTFTYIGKVDEVDAKISALLSKIQYPVLTDIQV-RYDDGSVPDYWPSPIADLYRGEPV 603
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLT 593
+VS + + P L G G N+ L LQ + LD + A++QI L
Sbjct: 604 LVSLKRSAREPQELVISGRQGH-KNWQQSLSLQDNSAGLITNQGAGLDLLWARKQIGALE 662
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ D +++++V+ +S+ ++ YT ++ V+
Sbjct: 663 LSKNGANDDKVKQQVTALSMNYHLVSAYTSLVAVD 697
>gi|153000354|ref|YP_001366035.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica
OS185]
gi|151364972|gb|ABS07972.1| Vault protein inter-alpha-trypsin domain protein [Shewanella
baltica OS185]
Length = 772
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 161/335 (48%), Gaps = 27/335 (8%)
Query: 310 FCMYLLPGTAKSRK-VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
M L P KS + +++I ++D SGSM G + KNAL AL L P DSFNI+
Sbjct: 374 LVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIE 433
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI--- 425
FN L S + AT + RA Q++ A G T + L A L S GS+
Sbjct: 434 FNSSLSLLSATPLPATSSNLSRARQFVS-RLQADGGTEMALALDAA---LPKSLGSVSPD 489
Query: 426 ---PI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
P+ + +TDG+V +E+ + D ++ ++ R++T GIGS N +F++ A +
Sbjct: 490 AVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAEL 544
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + +D ++ ++ L ++ VL +I + D + +PS I DL P+
Sbjct: 545 GRGTFTYIGKVDEVDAKISALLSKIQYPVLTDIQV-RYDDGSVPDYWPSPIADLYRGEPV 603
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLT 593
+VS + + P L G G N+ L LQ + LD + A++QI L
Sbjct: 604 LVSLKRSAREPQELVISGRQGH-KNWQQSLSLQDNSAGLITNQGAGLDLLWARKQIGALE 662
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ D +++++V+ +S+ ++ YT ++ V+
Sbjct: 663 LSKNGANDDKVKQQVTALSMNYHLVSAYTSLVAVD 697
>gi|410631577|ref|ZP_11342252.1| hypothetical protein GARC_2152 [Glaciecola arctica BSs20135]
gi|410149023|dbj|GAC19119.1| hypothetical protein GARC_2152 [Glaciecola arctica BSs20135]
Length = 729
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 114/448 (25%), Positives = 196/448 (43%), Gaps = 39/448 (8%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I KS ++ + G S + +PN+F+ + I G + + + +
Sbjct: 144 IEGQIKEKSEAKKIYLQAKEQGKKASLVAQQ----RPNMFSNAIANIGPGETIEVTIEYQ 199
Query: 198 QKLSYRDGEFSVNVPFKF-PEY------VTPAIKK--IPKR--EKIHLNVNAGTGTEVLC 246
QKL + ++S+ P P Y V P I+ P + I L VN G EV
Sbjct: 200 QKLHFDQQQYSLRFPMTITPRYMPSNTTVEPLIQAEFSPSQTNSNIELQVNLHAGFEVQD 259
Query: 247 NTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDF---DFSYTVSPSHIFGGVLLQSPSLHD 303
S H ++++ S S+ ++ S D DF P G +
Sbjct: 260 IKSEFHAVKVKQ------LSDGSQHIQLSKGDVANQDFVLNWRPE--LGAAPQSAHFTQQ 311
Query: 304 VDQREMFCMYLLP--GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
V+ E + LLP T ++ ++VIF++D SGSM+G ++ + AL +A+ +L
Sbjct: 312 VNGDEYGLVMLLPPIDTEQATPTQPREVIFVLDTSGSMEGDSIKQAQKALLLAIEQLTTT 371
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM-LTN 420
D FNI+ FN S + A E A ++ + A G T + A ++
Sbjct: 372 DQFNIIEFNSYAQNLWKSPKTADAINKEDAKHFVN-SLSANGGTEMETAFKLAFSQPKSD 430
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ ++ I +TDG+V +E + + ++L N R++T GIGS N YF+ A +
Sbjct: 431 TSTTLRQIVFITDGSVGNEESLMQLISNKLENS-----RLFTVGIGSAPNSYFMSEAAKM 485
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSS 536
RG + +D ++ +M KL T+ + +I + TL+ + E YP+ I DL
Sbjct: 486 GRGTFTYIGSVDQVQEKMLKLLTKLQHPAITDIELQLNASTLQSNRQLEFYPNVISDLYQ 545
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFLGDLS 564
PL++S R D+ A +L L+
Sbjct: 546 GEPLVLSYRLTDASIDSGSANSYLSKLN 573
>gi|373854950|ref|ZP_09597747.1| PEP motif putative anchor domain protein [Opitutaceae bacterium
TAV5]
gi|372471732|gb|EHP31745.1| PEP motif putative anchor domain protein [Opitutaceae bacterium
TAV5]
Length = 732
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 153/660 (23%), Positives = 262/660 (39%), Gaps = 100/660 (15%)
Query: 58 DPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMG 117
DP D + PY R P + L + + +D A VRV+ +V+ G
Sbjct: 39 DPGSGTGSDNKTLSPYFRVRGARPGVEALPLRSTRVDATVAGVIADVRVT---QVYTNTG 95
Query: 118 SKSCDCRIAVPMGDQGSILGV---------EAEISGKSYHTQLIALGENDGAGKSASVET 168
+ + P + ++ G+ EAEI + Q+ ++G +T
Sbjct: 96 TTPLEAIYVFPGSTRAAVHGLTMTLGDRRLEAEIRERKQARQIYETARSEG-------KT 148
Query: 169 GSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP---- 221
S L +PN+F + + I G +++ LR+++ LS DG +S F +P V P
Sbjct: 149 ASLLEQQRPNVFQMNVANILPGDEITVELRYTELLSPEDGVYS----FVYPGVVGPRYTG 204
Query: 222 -----------AIKKIPK----REKIHLNVNAGTGT---------------------EVL 245
A IPK R + +AGT E +
Sbjct: 205 ASGAGPEDQWVANPYIPKTADDRRAGLTDPDAGTDADGNPVSTFDIAVTLAAGVPVQEAV 264
Query: 246 CNTSSHHLKQLRRDVGKLGYSYESEV-LKWSNIDFDFSYTVSPSHIFGGVLLQ--SPSLH 302
C SH K D + + N DF Y ++ + G+L+ +P
Sbjct: 265 CR--SHRTKITYDDPSTARIALDPAARADAGNRDFILRYRLAGEALKTGLLIAEGTPGAG 322
Query: 303 DVDQREMF-CMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
D F M P + + +D +FI+D+SGSM+G PL+ TK L + P
Sbjct: 323 GADGENWFVAMIQPPARPVAASLPPRDYLFIIDVSGSMRGFPLQTTKTLLRGLFPTMRPQ 382
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS 421
D+FNIV F+G + + + AT + A++ + + G+ + P + L
Sbjct: 383 DTFNIVLFSGHSTSLAEAPVPATPANIRAANELLDRQYGYDGTELL--PALRHAFSLPRP 440
Query: 422 RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+ I +VTDG V E + + ++ L GG ++ FGIGS N + + LA
Sbjct: 441 EATSRTIAIVTDGYVTVEHEAFELVRQHL--GGENGANVFAFGIGSSVNRHLIEGLARAG 498
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+G D E +L S VL NI + + D +++ P+ +PD+ +E P+I
Sbjct: 499 QGEPFIVTDPAEAEAVAARLGKTIASPVLTNINV-SFDGFDVYDVEPATLPDVFAERPVI 557
Query: 542 VSGRYQGKFPDTLKAKGFLG-------------DLSNFVVEL----KLQLAKDIPLDRIC 584
+ G+++G + G G D F L + L + L R+
Sbjct: 558 LFGKWRGPRVGKITLTGLAGGGAGEEGEGGNAHDSRRFTAALFPGATVPLPRADGLSRLW 617
Query: 585 AKQQI----DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPG 640
A+ +I D + A A E++ E ++ + + +L +YT + V+ R + +PG
Sbjct: 618 ARHRIAQLGDDVEAGAADREERIAE--ITTLGLTWRLLTKYTSFVAVDHVVRRDPGTTPG 675
>gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related
protein-hypothetical secreted or membrane-associated
[Blastopirellula marina DSM 3645]
gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related
protein-hypothetical secreted or membrane-associated
[Blastopirellula marina DSM 3645]
Length = 788
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 144/588 (24%), Positives = 268/588 (45%), Gaps = 53/588 (9%)
Query: 87 QMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKS 146
++ ++E+D + A V+VS T+ C S+ P+ G+I + + GK
Sbjct: 60 KIKSLEVDANIEDQIAKVQVSQTFENTC---SRQLQVSFLFPLPYDGAIDSLTLLVDGKE 116
Query: 147 YHTQLIALGE-----NDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLS 201
Y +L+ + + K+ ++ +F ++ + G+ ++ + +SQ L
Sbjct: 117 YPAKLLPKEKAREIYEEIVRKNQDPALLEWIGTGMFQTSVFPVPAGASRTVTITYSQLLR 176
Query: 202 YRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTS-SHHLKQLRRDV 260
+D + F FP T P EK+ L V+ T ++ S +H ++ R+
Sbjct: 177 -KDNRLT---DFLFP-LSTARFTDKP-LEKLRLRVSVETKEKLKSVYSPTHEVEVKRKGK 230
Query: 261 GKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAK 320
+ + E + +N DF + + + + VL P D + F + P +
Sbjct: 231 NRAVVTIEQKDCVPTN-DFRLFFDTAKTDLSASVLTYRP---DKSEDGYFLLLASPPVEE 286
Query: 321 SRKV-FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
V KK VIF+VD SGSM G+ +E K A L+ L+ GD FNI+A++ + F
Sbjct: 287 VGDVKTKKTVIFVVDRSGSMSGEKIEQAKEAAKFVLNNLNEGDLFNIIAYDSDVESFEPE 346
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDG----A 435
++ + E+A ++ N AGGSTNI L KA+ ML + + ++FL TDG
Sbjct: 347 LQKLDDKTREKALGFVD-NLYAGGSTNIDGALAKAMGMLKDDKRPSYMLFL-TDGLPTHG 404
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL---D 492
++E +I D K + + R+ +FG+G N R+L +SR +G + + +
Sbjct: 405 EQNEAKIVDNAKQK----NDVRARVISFGVGYDVNS---RLLDRLSRECFGQSEYVRPNE 457
Query: 493 SIEIQMQKLFTRGFSSVLANIAI--DTLKDLDEF--EMYPSRIPDLSSESPLIVSGRYQG 548
IE + KL+ + + V+ N+AI D K D F + P + DL + LI++GRY+
Sbjct: 458 DIETHVAKLYNKISAPVMTNVAIKYDLEKGGDNFVNRLQPKQSHDLFAGEQLIIAGRYRK 517
Query: 549 KFPDTLKAKGFLGDL-------SNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSE 600
+ G +GD + FV E Q + +++ A ++I +++ +
Sbjct: 518 HGDAKITIVGTVGDKKQKVTFPAEFVKESDDQTYAFV--EKLWAMRRIGEIIDDIDLNGK 575
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKS 648
+ L +++ +S + G++ YT + DE ++ S+ ++G +S
Sbjct: 576 NDELVKELVSLSTKHGIITPYTSFL---ADENSSVSDLADARRGGTRS 620
>gi|414070757|ref|ZP_11406737.1| Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas sp.
Bsw20308]
gi|410806774|gb|EKS12760.1| Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas sp.
Bsw20308]
Length = 672
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 131/569 (23%), Positives = 239/569 (42%), Gaps = 80/569 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ E AGK A++
Sbjct: 79 AVNARYVFPLPDESAVHAMTMRIGERVIKGQIDKKVEAEKKYEEAKKAGKQAALVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
+ N+F + I G + I L + + + Y G F++ +FP +TP I ++
Sbjct: 137 RANMFITNVANIGPGEQVIIELEYQEIIDYSSGTFAI----RFPTTITPRYHAISGEVEV 192
Query: 233 H------------------------------------LNVNAGTGTEVL-CNTSSHHLKQ 255
+ LN+N G E++ N+ H++
Sbjct: 193 NTQNQAHVNSLLTGWLSPVYSIKSSTQNDNVPSSQFSLNINIDVGLELVDINSKFHNV-- 250
Query: 256 LRRDVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
DV + L +N ++ DF P + R M +
Sbjct: 251 ---DVQNTAFGQYKIALNETNAVNRDFVLEFKPLQKEQAQAAFFTQQFENGDRYGLAMLM 307
Query: 315 LPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
PG ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ F+
Sbjct: 308 PPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGFDNIV 367
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
S +A+ + RA ++I + A G T I L ++ ++FL TD
Sbjct: 368 TPMSDKPLIASDFNLRRAERFI-YSLEADGGTEIQGALNAVLDGSEFDSFVRQVVFL-TD 425
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
G+V +E + ++S+L + R++T GIGS N +F+R A I +G +
Sbjct: 426 GSVSNEDALFKNIQSKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGSFTFIGSTSE 480
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRYQGK 549
++ +MQ+LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 481 VQPKMQQLFDK-----LAHPAITNLALSDESGNNLDFWPSPLPDLYFGEPIMVAIKLNNA 535
Query: 550 FPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLEEKV 608
L + G LS ++L Q + + ++ A+Q+I L ++E ++++V
Sbjct: 536 KNVVLAGQAAQGPLS---IKLSTQNSSSAQGIAKLWARQKIKSLL---LYNEQSTVKDEV 589
Query: 609 SKMSVQTGVLCEYTRMIIVETDERNNASE 637
+++ +L +T I +E + +E
Sbjct: 590 QDLALTHQLLSPFTAFIAIEQTQIKEVAE 618
>gi|372266961|ref|ZP_09503009.1| inter-alpha-trypsin inhibitor domain-containing protein
[Alteromonas sp. S89]
Length = 811
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 133/560 (23%), Positives = 246/560 (43%), Gaps = 84/560 (15%)
Query: 88 MNAVELDVDCYLDTAFVRVSGTWR--VHCVMGSKSCDCR-IAVPMGDQGSILGVEAEISG 144
M L + ++ ++ S WR V+ + ++ R + + +G++ + V
Sbjct: 99 MRVRGLVAEVHIAQTYINTSNHWREAVYVLPLPENAAVRGMEIVIGERRIVGKVREREEA 158
Query: 145 KSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRD 204
K +T E AGK A++ +PN+FT + I G +++ +R+ Q L +
Sbjct: 159 KKVYT------EARAAGKRAALLEQQ--RPNMFTSRVANIAPGEKIAVEVRYLQTLKFDR 210
Query: 205 GEFSVNVPFKF-PEYVTPAIKKIPKR------EKIHLNVNAGT----------------- 240
+FS+ +P P Y+ ++ ++ +++ LN + +
Sbjct: 211 DQFSLRLPSTLTPRYIPGVSGELSRKSAQESAQELALNSHGWSLPTDQVPDAQHISPFMV 270
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFS----YTVSPSHIFGG--- 293
L N SH + + D+G +G + +DF Y V + GG
Sbjct: 271 PMAELANRGSHQMS-IHIDLG-MGLPLADIYSPYHEVDFRRQDGNRYQV---QLKGGSAP 325
Query: 294 -----VLLQSPSLHDVDQREMFCMYLLPGTAK-------------SRKVFKKDVIFIVDI 335
VL P + + +F G+A+ + + ++V+++VD
Sbjct: 326 MDRDFVLHWRPQMSSMPSAAVFAEQSQDGSAQYLQLLLLPPREPGATRKLPREVVYVVDT 385
Query: 336 SGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWI 395
SGSM G + K +L +ALS+L D FNI+ FN F A++E + RA W+
Sbjct: 386 SGSMGGNSIRQAKESLLLALSRLAAQDRFNIIEFNSHHRSFYPRPVSASQENIHRARDWV 445
Query: 396 GINFIAGGSTNICAPLTKAV-EMLTNSRGS-IPIIFLVTDGAVEDERQICDAMKSRLTNG 453
+ A G T + L +A+ + L G + + +TDGAV +ER + + ++ RL
Sbjct: 446 E-SLSATGGTEMAPALKEALSQQLDEQAGELVRQVVFITDGAVGNERALFEIIQQRL--- 501
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI 513
G + R++T GIGS N +F+ A I RG DL ++ QM +LF + + ++ ++
Sbjct: 502 GQV--RLFTVGIGSASNSHFMTKAAQIGRGSAVHIGDLGEVQAQMGRLFEKLENPLVTDL 559
Query: 514 AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKL- 572
+D + + + E YP ++PDL P+ + R +G+ P + + G L VV +L
Sbjct: 560 RVDFPRGI-KLEAYPKQLPDLYLGEPVRLVARIEGQ-PLGMPSSGEL------VVSGRLA 611
Query: 573 --QLAKDIPLDRICAKQQID 590
+ ++ I L ++ QQ D
Sbjct: 612 GKRFSRTISLAQLVVDQQGD 631
>gi|348028163|ref|YP_004870849.1| vault protein inter-alpha-trypsin [Glaciecola nitratireducens
FR1064]
gi|347945506|gb|AEP28856.1| vault protein inter-alpha-trypsin [Glaciecola nitratireducens
FR1064]
Length = 839
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 116/451 (25%), Positives = 190/451 (42%), Gaps = 65/451 (14%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K QL + AGK AS+ +PN+FT + I G + I L +
Sbjct: 171 IEGQIKEKQEAKQLFEQAK--LAGKKASLIEQK--RPNLFTNNIANIGPGETIIISLEYQ 226
Query: 198 QKLSYRDGEFSVNVPFKF-PEY--------VTPAIKKIPK-------------------- 228
Q L Y +G +S+ P P+Y +T A ++ +
Sbjct: 227 QTLQYSNGSYSLRFPLGITPKYMPTNTDAAITSANEEQAELALLSSSGNTQMSGLAISNN 286
Query: 229 ----------REKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNID 278
E I+++V TGT++ S HH + + LG + L S +
Sbjct: 287 MQDLASAPAYSENINISVVLNTGTQLQGIESEHH--SIYTSI--LGNNKYLVTLDGSQVK 342
Query: 279 -FDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
+DF P G + V+ E + L P ++VIF++D SG
Sbjct: 343 MYDFVLNWQPE--LGSKPTSAHFTQFVNGNEYGLIVLYPPLPDEPLYLDREVIFVLDTSG 400
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGI 397
SM GK +E K ALA A+ L D FNI+ FN + + A +E ++ +
Sbjct: 401 SMSGKAIEQAKQALAYAIDDLSTRDKFNIIEFNSNAEILWRQAKFADQE-----NKAVAF 455
Query: 398 NFIAGGSTNICAPLTKAVEMLTNSRGSIPI---IFLVTDGAVEDERQICDAMKSRLTNGG 454
FI+ S N + +A+ M S + I +TDG+V +E + ++ L
Sbjct: 456 EFISELSANGGTEMRQALTMALRSEADPELFKQILFITDGSVSNESDLMTLIEENLGPA- 514
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
R++T GIGS N YF+ A +G + D +E +M +L + + L +I
Sbjct: 515 ----RLFTIGIGSAPNSYFMTEAAKSGKGTFTFIGDTRLVEEKMAQLLGKINAPALTDIK 570
Query: 515 IDT--LKDLDEFEMYPSRIPDLSSESPLIVS 543
++ ++ +FEMYP+ I DL S PL+++
Sbjct: 571 LNLKGIQQYQQFEMYPNVISDLYSSEPLVIT 601
>gi|71280467|ref|YP_270055.1| von Willebrand factor type A domain-containing protein [Colwellia
psychrerythraea 34H]
gi|71146207|gb|AAZ26680.1| von Willebrand factor type A domain protein [Colwellia
psychrerythraea 34H]
Length = 786
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 131/530 (24%), Positives = 226/530 (42%), Gaps = 93/530 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF----------------- 215
+PN+FT + I S + + L++ +S+ G+F++ +P
Sbjct: 183 RPNLFTNKIANIPAQSTVVVTLKFIMPVSFSQGKFNLRLPLALTDRYQPRSTSNSFNESS 242
Query: 216 ---PEYVTP-----------------AIKKIPK------------------REKIHLNVN 237
PEY + + K +P+ R + +N+
Sbjct: 243 GHSPEYSSERSPSNFTHDLTNVSANISTKSLPEPFKISTTRSSTTHVRSVARSQSSINIV 302
Query: 238 AGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKW------SNIDFDFSYTVSPSHIF 291
+G + S H Q R KL + + SN FD ++ + S+
Sbjct: 303 LNSGIPITSIVSDSHKIQSRDLSSKLNSEQNAYFITLDKTQVISNKTFDLTWQLIASN-- 360
Query: 292 GGVLLQSPSLHDVDQR---EMFCMY-LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
P + Q E + + P +V +D+IFI+D SGSMQ +E
Sbjct: 361 ------QPQVSSFTQEISGEHYTLLTFFPPEKAVAQVIARDIIFIIDTSGSMQAGSMEQA 414
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWI-GINFIAGGSTN 406
K++L +AL +L+ DSFNI+AF+ +T L +A+ + +A Q+I G++ A G T
Sbjct: 415 KSSLQLALLQLNNKDSFNIIAFDNDTELLFPVTHMASAHNISKAQQFIDGLS--ANGGTE 472
Query: 407 ICAPLTKAVEM---LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF 463
+ PL+ A+ M T S +I I +TDGAV +E ++ +L N R+YT
Sbjct: 473 MYRPLSNALMMKKDKTQSSKAIRQIVFITDGAVANEFELM-----QLLNTAQGDFRLYTV 527
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE 523
GIG+ N YF++ A RG Y + ++ +M T+ L NIA+ + +
Sbjct: 528 GIGAAPNGYFMKKAAQFGRGSYVFIQNKSEVQRKMSHFMTKISQPALTNIALTLDNQIHQ 587
Query: 524 F-EMYPSRIPDLSSESPLIVSGRYQGKFPDT---LKAKGFLGDLSNFVVELKLQLAKDIP 579
E+YP +IPDL PL ++ + Q FP + L A+ ++ Q +K I
Sbjct: 588 HVEVYPKKIPDLYFGEPLQIALKSQ--FPISSVQLTAETVSTPFYQQLIIDDRQPSKGI- 644
Query: 580 LDRICAKQQIDLLTAQAWFSEDK-RLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ A+++I+ L +K +++ +V S+ ++ YT I VE
Sbjct: 645 -SSLWARRKIESLVDSLIVGANKDKVKSQVIATSLNHQIISPYTSFIAVE 693
>gi|75907575|ref|YP_321871.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413]
gi|75701300|gb|ABA20976.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413]
Length = 819
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 117/511 (22%), Positives = 206/511 (40%), Gaps = 73/511 (14%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA-IKKIP---- 227
+ NIFT +L I G + + +R+S+ L + G + F FP V P + IP
Sbjct: 113 RDNIFTQSLANIQPGEQIDVIIRYSESLKFTAGNYE----FVFPMVVAPRYVPGIPIEGN 168
Query: 228 ------------------------------------KREKIHLNVNAGTGTEVLCNTSSH 251
I++ + G EV S
Sbjct: 169 AVGVGSATAPMTQNQDTDIVPDGSRLNAPILPSGMRSPHDINVTIEIDAGVEVQNIQSPS 228
Query: 252 HLKQLRRDVGKLGYSYESEVLKWS------NIDFDFSYTVSPSHIFGGVLLQSPSLHDVD 305
H Q+ Y+ + ++K + N D Y V+ G Q+ L D
Sbjct: 229 HQVQI-------SYAEKRVLVKLAGGDTIPNKDLILRYQVA------GESTQATVLSQAD 275
Query: 306 QRE-MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
+R F +YL+P + ++ KDV+F++D SGS G PL + + ++ L+P D+
Sbjct: 276 ERGGHFALYLIPALQYRQNQIVPKDVVFLIDTSGSQMGAPLMQCQELMRRFINGLNPDDT 335
Query: 364 FNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG 423
F+I+ F+ T S + RA +I A G T + + + G
Sbjct: 336 FSIIDFSDTTRQLSPVPLANNSQNRTRAINYIN-RLTANGGTEMLRGIRAVLNFPVTDSG 394
Query: 424 SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ I L+TDG + +E QI ++ L G R+Y+FG GS N + L +A + RG
Sbjct: 395 RLRSIVLLTDGYIGNENQILAEVQQHLQAGN----RLYSFGAGSSVNRFLLNRIAELGRG 450
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
+ + + K + + + VLANI + D + +YP+ PDL +E PL++
Sbjct: 451 IAQIIRHDEPTDEVVDKFYRQINNPVLANINLQWEGDGNAPIIYPATPPDLFAEQPLVLF 510
Query: 544 GRYQGKFPDTLKAKGFLGDLSNF--VVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSED 601
G+ L G + + + ++L + + + ++ + +I L + +
Sbjct: 511 GKKPDARGGKLHITGIVAGGTRYQNTIKLDFEETGNPAIAQLWGRSRIKELMNKMVSGDT 570
Query: 602 KRLEEKVSKMSVQTGVLCEYTRMIIVETDER 632
K E V+ ++ +L +YT + V D R
Sbjct: 571 KLGVEGVTDTALTYQLLSQYTAFVAVSDDVR 601
>gi|373949210|ref|ZP_09609171.1| sortase target protein [Shewanella baltica OS183]
gi|386324950|ref|YP_006021067.1| sortase target protein [Shewanella baltica BA175]
gi|333819095|gb|AEG11761.1| sortase target protein [Shewanella baltica BA175]
gi|373885810|gb|EHQ14702.1| sortase target protein [Shewanella baltica OS183]
Length = 771
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 160/335 (47%), Gaps = 27/335 (8%)
Query: 310 FCMYLLPGTAKSRK-VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
M L P KS + +++I ++D SGSM G + KNAL AL L P DSFNI+
Sbjct: 373 LVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIE 432
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI--- 425
FN FS + AT + RA Q++ A G T + L A L S GS+
Sbjct: 433 FNSSLSQFSATPLPATSSNLSRARQFVS-RLQADGGTEMALALDAA---LPKSLGSVSPD 488
Query: 426 ---PI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
P+ + +TDG+V +E+ + D ++ ++ R++T GIGS N +F++ A +
Sbjct: 489 AVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAEL 543
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + +D ++ ++ L ++ VL +I + D + +PS I DL P+
Sbjct: 544 GRGTFTYIGKVDEVDAKISALLSKIQYPVLTDIQV-RFDDGSVPDYWPSPIADLYRGEPV 602
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLT 593
+VS + + P L G G N+ L L + LD + A++QI L
Sbjct: 603 LVSLKRSAREPQELVISGRQGH-KNWQQSLSLHDNSTGLITNQGAGLDLLWARKQIGALE 661
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ D +++++V+ +S+ ++ YT ++ V+
Sbjct: 662 LSKNGANDDKVKQQVTALSMNYHLVSAYTSLVAVD 696
>gi|171911941|ref|ZP_02927411.1| Vault protein inter-alpha-trypsin domain protein [Verrucomicrobium
spinosum DSM 4136]
Length = 679
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 134/584 (22%), Positives = 255/584 (43%), Gaps = 63/584 (10%)
Query: 83 LIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEI 142
++PL+ V+ + + V +V+ G + + P + ++ GVE +
Sbjct: 55 VLPLRSTQVQTKISGVIADVTVE-----QVYQNTGDGPIEAKYVFPASTRAAVHGVEMRL 109
Query: 143 SGK------SYHTQLIALGEN-DGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLR 195
+ TQ A E KSAS+ +PN+F +++ I G + + L
Sbjct: 110 GQRVIKSTIQEKTQAKATYEKAKSQNKSASLLEEH--RPNVFEMSVANILPGEEVKVSLH 167
Query: 196 WSQKLSYRDGEFSVNVPFKFPEYVTP------------AIKKIP---------KREKIHL 234
+S+KL + + F FP V P A + P ++L
Sbjct: 168 YSEKLLPSNRVYE----FVFPTVVGPRYQSGSGNGTGEAWMENPYVATGSVGAATFALNL 223
Query: 235 NVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYE-SEVLKWSNIDFDFSYTVSPSHIFGG 293
+ AG + + + S L Q GK S+ S +N DF Y +S + G
Sbjct: 224 ELQAGMPLQSITSPSHQGLMQFS---GKSSASFTLSPSADHANHDFVLRYQLSGREVATG 280
Query: 294 VLL-QSPSLHDVDQREMFCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNAL 351
+LL Q+P+ + F + + P ++ + +D +F++D+SGSM G P+E +K +
Sbjct: 281 LLLHQAPAGSSPEAESFFLLNVQPPAKWEAGQTPPRDYLFVLDVSGSMNGFPIETSKRLM 340
Query: 352 AVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPL 411
+ L L+PGD+FNI+ F ++ + S AT E + A + + + GG T + L
Sbjct: 341 SDLLKGLNPGDTFNILHFASDSAVLSPKPLAATPENIHLATKDLSRHRGNGG-TELLPAL 399
Query: 412 TKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNH 471
+A+ S I+ L TDG V E++ ++ L N ++TFGIG+ N
Sbjct: 400 QRALATPREVGVSRSIVIL-TDGYVTIEKEAFRLVRKELQNA-----NVFTFGIGTAVNR 453
Query: 472 YFLRMLAMISRG---YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP 528
+ + LA +G + D + + ++ +R VL ++ + T + D +E P
Sbjct: 454 WLIEGLAHAGQGDPFVVLSEKDAAAAAERFREYISR---PVLTDVQV-TYEGFDAYETEP 509
Query: 529 SRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG----DLSNFVVELKLQLAKDIPLDRIC 584
+ IPD+ ++ P+ + G+++G+ + +G G + S V + ++ + L +
Sbjct: 510 ASIPDVFADRPIELIGKWRGQPQGRIIIRGKTGGSPYEASFNVGQEAIKELSNPALRPLW 569
Query: 585 AKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
AK ++ L + + D + + V+ + V+ +L YT + V+
Sbjct: 570 AKDRVRTLNYELQITPDAKTIKTVTDLGVRYALLTPYTSFVGVD 613
>gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi
ATCC 49188]
gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi
ATCC 49188]
Length = 750
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 121/457 (26%), Positives = 209/457 (45%), Gaps = 57/457 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-----PEYVTPAIKKI- 226
+PN+FT + I + I++ + Q + D FS+ VP P+ +P ++++
Sbjct: 167 RPNVFTNAVANIGPHEKVVIQIEYQQAVRLSDERFSLRVPLVVAPRYNPDNASPVVQEVE 226
Query: 227 -------------PKREKIHLNVNAGTGTEVLCNTSSHHLK-QLRRDVGKLGYSYESEVL 272
P L T + N + +K + +GK+ + +
Sbjct: 227 IKNGWGKSRDTGKPDTYNTPLVTPLAPPTALRTNPVTISVKLKAGFPLGKVESLFHKVRI 286
Query: 273 KWSN-----IDFDFSYTVSPSHI--FGGVLLQSPSL----HDVDQREMFCMYLLPGTAKS 321
+N I D + + + V +P + + + + Y+ P S
Sbjct: 287 DTTNDATREITLDGAAAADRDFVLEWSAVASDAPQVGLFREHIGKDDYVLAYVTPPALAS 346
Query: 322 RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSM 380
K +++VIF++D SGSM G +E K +L ALS+L PGD FN++ F+ T F S+
Sbjct: 347 PKKVQREVIFVIDNSGSMGGTSIEQAKASLDYALSQLQPGDRFNVIRFDDTLTKFFEDSV 406
Query: 381 ELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
+ A +E + A +++ + A G T + L A++ G I+FL TDG + +E+
Sbjct: 407 D-ANQENIASARRFV-TSLEAQGGTEMLPALHAALDDSNQGNGLRQIVFL-TDGEISNEQ 463
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY---GAAYDLDSIEIQ 497
Q+ DA+ +R G S RI+ GIGS N Y + A + RG + G+A ++D +
Sbjct: 464 QLLDAVAAR--RGRS---RIFMVGIGSAPNSYLMNRAAELGRGTFTHIGSAAEVDE---R 515
Query: 498 MQKLFTRGFSSVLANIAIDTLK-DLDE--FEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
M+ LF + L N A+ LK + E M PS +PDL PL+++ R GK L
Sbjct: 516 MRALFDK-----LENPAVTDLKANFSEKNVSMTPSLLPDLYRGEPLVIAARM-GKAIGNL 569
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQID 590
+G + D + V L L A + + ++ A+++ID
Sbjct: 570 AIEGQI-DGRPWTVNLPLDQAMNAEGISKLWARRKID 605
>gi|126174069|ref|YP_001050218.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica
OS155]
gi|386340829|ref|YP_006037195.1| Vault protein inter-alpha-trypsin domain-containing protein
[Shewanella baltica OS117]
gi|125997274|gb|ABN61349.1| Vault protein inter-alpha-trypsin domain protein [Shewanella
baltica OS155]
gi|334863230|gb|AEH13701.1| Vault protein inter-alpha-trypsin domain-containing protein
[Shewanella baltica OS117]
Length = 771
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 160/336 (47%), Gaps = 29/336 (8%)
Query: 310 FCMYLLPGTAKSRK-VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
M L P KS + +++I ++D SGSM G + KNAL AL L P DSFNI+
Sbjct: 373 LVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIE 432
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-- 426
FN FS + AT + RA Q++ A G T + L A L S GS P
Sbjct: 433 FNSSLSQFSATPLPATSSNLSRARQFVS-RLQADGGTEMALALDAA---LPKSLGSAPSD 488
Query: 427 -------IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+IF+ TDG+V +E+ + D ++ ++ R++T GIGS N +F++ A
Sbjct: 489 AVQPLRQVIFM-TDGSVGNEQALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAE 542
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+ RG + +D + ++ L ++ +L +I + D + +PS I DL P
Sbjct: 543 LGRGTFTYIGKVDEVGEKISALLSKIQYPLLTDIQV-RFDDGSVPDYWPSPIADLYRGEP 601
Query: 540 LIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLL 592
++VS + + P L G G N+ L LQ + LD + A++QI L
Sbjct: 602 VLVSLKRSAREPQELVILGRQGH-KNWQQSLSLQDNSAGLITNQGAGLDLLWARKQIGAL 660
Query: 593 TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ D +++++V+ +S+ ++ YT ++ V+
Sbjct: 661 ELSKNGANDDKVKQQVTALSMNYHLVSAYTSLVAVD 696
>gi|114567581|ref|YP_754735.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114338516|gb|ABI69364.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 776
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/400 (24%), Positives = 186/400 (46%), Gaps = 47/400 (11%)
Query: 172 LKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPF------------------ 213
++PN+F ++L QID + I + + Q++ D E +++P
Sbjct: 102 VRPNVFQVSLGQIDADEEVEIAISYFQEIKNIDTEMRISIPMLLAPRFIPGKPLGKKIGP 161
Query: 214 -------KFPE--YVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLG 264
+ P+ +++P I + R + L+V+ T + + SH ++ R D
Sbjct: 162 GRAEPTDRVPDADFISPPIGETGYRATLSLHVHNNTPISSI-KSPSHKIRIDRMDEYSAT 220
Query: 265 YSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF-CMYLLPGTAKSRK 323
+ + + N DF + + + + ++P + E F C+ P +
Sbjct: 221 ITLQENNTRM-NRDFVLNLKLDGETVPRIIYWKNP------KDEYFACITYTPELPIIEQ 273
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
K+ IF++DIS SM+GK +E +A+ + L LD GDSFN++AF E + F+
Sbjct: 274 RQPKEYIFLIDISRSMEGKKIEHAADAIQICLRNLDEGDSFNLLAFESENHAFAPKSLPY 333
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
+E +++A W+ N A G TNI + A++ + + ++ L TDG V +E +I
Sbjct: 334 NQENLDKASAWVK-NLHAMGGTNILPAVQLALKEAGDQQK---VVILATDGQVGNENEII 389
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+ ++ R N +C +++ GI + N YF+ +A G +Y +S+E +M + F
Sbjct: 390 NYVRKRNQN---LC--LFSLGIDTAVNSYFINQIAEAGNGCAEFSYPGESLEEKMLRHFA 444
Query: 504 RGFSSVLANI--AIDTLKDLDEFEMYPSRIPDLSSESPLI 541
R ++ + N+ ++ + D E PSR+ D+ + LI
Sbjct: 445 RINATSMDNVTFSLPNISAYDWAETPPSRLYDMEPYTHLI 484
>gi|392533188|ref|ZP_10280325.1| inter-alpha-trypsin inhibitor [Pseudoalteromonas arctica A 37-1-2]
Length = 673
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 132/572 (23%), Positives = 240/572 (41%), Gaps = 82/572 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ E AGK A++
Sbjct: 81 AVNARYVFPLPDESAVHAMTMRIGERVIKGQIDKKVEAEKKYEEAKKAGKQAALVRQQ-- 138
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP---AIK---KI 226
+ N+F + I G + I L + + + Y G F++ +FP +TP AI ++
Sbjct: 139 RANMFITNVANIGPGEQVIIELEYQEIIDYSSGTFAI----RFPTTITPRYHAISGEVEV 194
Query: 227 PKREKIH----------------------------------LNVNAGTGTEVLCNTSSHH 252
+ K H LN++ G E++ S H
Sbjct: 195 STQNKAHVNPLPTGWLSPVYSIQNITQNSTQTDNVPSSQFSLNIDIDVGLELVDINSKFH 254
Query: 253 LKQLRRDVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
DV + L +N ++ DF P + R
Sbjct: 255 ----NVDVQNTAFGQYKIALNETNAVNRDFVLEFKPLQKEQAQAAFFTQQFENGDRYGLA 310
Query: 312 MYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
M + PG ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ F+
Sbjct: 311 MLMPPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGFD 370
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFL 430
S +A+ + RA ++I + A G T I L ++ + G + +
Sbjct: 371 NIVTPMSDKPLIASDFNLRRAERFI-YSLEADGGTEIQGALNAVLDG-SEFDGFVRQVVF 428
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
+TDG+V +E + ++S+L + R++T GIGS N +F+R A I +G +
Sbjct: 429 LTDGSVSNEDVLFKNIQSKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGSFTFIGS 483
Query: 491 LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRY 546
++ +MQ+LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 484 TSEVQPKMQQLFDK-----LAHPAITNLALSDESGNSLDFWPSPLPDLYFGEPIMVAIKL 538
Query: 547 QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLE 605
L + G LS ++L Q + + ++ A+Q+I L ++E ++
Sbjct: 539 NNAKSVVLAGQTAQGPLS---IKLSTQNSSSAQGIAKLWARQKIKSLL---LYNEQNTVK 592
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
++V ++++ +L +T I +E +E
Sbjct: 593 DEVQELALTHQLLSPFTAFIAIEQTHIKEVAE 624
>gi|17231904|ref|NP_488452.1| hypothetical protein alr4412 [Nostoc sp. PCC 7120]
gi|17133548|dbj|BAB76111.1| alr4412 [Nostoc sp. PCC 7120]
Length = 820
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 116/509 (22%), Positives = 208/509 (40%), Gaps = 69/509 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT----------- 220
+ NIFT +L I G + + +R+S+ L + G + P P Y+
Sbjct: 113 RDNIFTQSLANIQPGEQIDVIIRYSESLKFTAGNYEFVFPMVVAPRYIPGIPIEGNASGV 172
Query: 221 ---------------------------PAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHL 253
P+ + P + + ++AG + + + SH +
Sbjct: 173 GSATAPMTQNQDTDIVPDGSRLNAPILPSGMRSPHDINVTIEIDAGVKVQNI-QSPSHQV 231
Query: 254 KQLRRDVGKLGYSYESEVLKWS------NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR 307
+ + Y+ + ++K + N D Y V+ G Q+ L D+R
Sbjct: 232 Q--------ISYAEKQVLVKLAGGDTIPNKDLILRYQVA------GESTQATVLSQADER 277
Query: 308 E-MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
F +YL+P + +V KDV+F++D SGS G PL + + ++ L+P D+F+
Sbjct: 278 GGHFALYLIPAIQYRQDQVVPKDVVFLIDTSGSQMGAPLMQCQELMRRFINGLNPDDTFS 337
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
IV F+ T S + RA +I A G T + + + G +
Sbjct: 338 IVDFSDTTRQLSPVPLANNAQNRTRAINYIN-QLSANGGTEMLRGIRAVLNFPVTDPGRL 396
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
I L+TDG + +E QI ++ L +G R+Y+FG GS N + L +A + RG
Sbjct: 397 RSIVLLTDGYIGNENQILAEVQQHLKSGN----RLYSFGAGSSVNRFLLNRIAELGRGIA 452
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ + + K + + + VLANI + D D +YP PDL +E PL++ G+
Sbjct: 453 QIIRHDEPTDEIVDKFYRQINNPVLANINLQWEGDGDSPIIYPCNPPDLFAEQPLVLFGK 512
Query: 546 YQGKFPDTLKAKGFLGDLSNF--VVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKR 603
L G + +++ +L + + + ++ + +I L E K
Sbjct: 513 KPDARGGKLHITGIVAGGTHYQHTFDLNFEETGNPAIAQLWGRYRIKELMNLMVSGETKL 572
Query: 604 LEEKVSKMSVQTGVLCEYTRMIIVETDER 632
E V+ ++ +L +YT + V D R
Sbjct: 573 GVEAVTDTALTYQMLSQYTAFVAVSDDVR 601
>gi|160874992|ref|YP_001554308.1| cell wall anchor domain-containing protein [Shewanella baltica
OS195]
gi|378708235|ref|YP_005273129.1| Vault protein inter-alpha-trypsin domain-containing protein
[Shewanella baltica OS678]
gi|418024880|ref|ZP_12663861.1| sortase target protein [Shewanella baltica OS625]
gi|160860514|gb|ABX49048.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica
OS195]
gi|315267224|gb|ADT94077.1| Vault protein inter-alpha-trypsin domain-containing protein
[Shewanella baltica OS678]
gi|353535735|gb|EHC05296.1| sortase target protein [Shewanella baltica OS625]
Length = 771
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 160/335 (47%), Gaps = 27/335 (8%)
Query: 310 FCMYLLPGTAKSRK-VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
M L P KS + +++I ++D SGSM G + KNAL AL L P DSFNI+
Sbjct: 373 LVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIE 432
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI--- 425
FN FS + AT + RA Q++ A G T + L A L S GS
Sbjct: 433 FNSSLSQFSATSLPATSSNLSRARQFVS-RLQADGGTEMALALDAA---LPKSLGSASPD 488
Query: 426 ---PI--IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
P+ + +TDG+V +E+ + D ++ ++ R++T GIGS N +F++ A +
Sbjct: 489 AVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES-----RLFTVGIGSAPNSHFMQRAAEL 543
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + +D ++ ++ L ++ VL +I + D + +PS I DL P+
Sbjct: 544 GRGTFTYIGKVDEVDEKISALLSKIQYPVLTDIQV-RFDDGSVPDYWPSPIADLYRGEPV 602
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-------LAKDIPLDRICAKQQIDLLT 593
+VS + + P L G G N+ L LQ + LD + +++QI L
Sbjct: 603 LVSLKRSAREPQELVISGRQGH-KNWQQSLSLQDNSAGLITNQGAGLDLLWSRKQIGALE 661
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ D +++++V+ +S+ ++ YT ++ V+
Sbjct: 662 LSKNGANDDKVKQQVTALSMNYHLVSAYTSLVAVD 696
>gi|410622940|ref|ZP_11333760.1| hypothetical protein GPAL_2273 [Glaciecola pallidula DSM 14239 =
ACAM 615]
gi|410157402|dbj|GAC29134.1| hypothetical protein GPAL_2273 [Glaciecola pallidula DSM 14239 =
ACAM 615]
Length = 821
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 140/580 (24%), Positives = 238/580 (41%), Gaps = 95/580 (16%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I K ++ + +G K++ VE +PN+FT + I G + I + +
Sbjct: 178 IEGQIKQKQEAKRIFEQAKREGK-KASLVE---HRRPNLFTNNIANIGPGETVVINIEYQ 233
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKRE--------------------------- 230
Q L Y +S+ PF P + E
Sbjct: 234 QTLQYLHDGYSLRFPFTITPRYAPVNGNTEQAETALLTTLENAQVTTQAWVTDTDKDNHN 293
Query: 231 ----------KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSY-----ESEVLKWS 275
I+++V TGT++ S HH + G+ G SY ES+V +
Sbjct: 294 TVASQSVFSDNINISVRLNTGTQLQNIESEHHA--IITSTGE-GNSYLVTLAESDVKRQ- 349
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDI 335
DF ++ + G + V+ E + L P + V ++VIF++D
Sbjct: 350 --DFVLNWQLE----LGSKPTSAHFTQFVNGSEYGMIVLYPPLPNEQLVLDREVIFVLDT 403
Query: 336 SGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWI 395
SGSM G+ + K ALA A+ L D FNI+ FN + + A + ++
Sbjct: 404 SGSMSGEAIVQAKQALAYAIDDLSTKDKFNIIEFNSNADILWRKAKFADPQ-----NKIS 458
Query: 396 GINFIAGGSTNICAPLTKAVEMLTNSRGSIPI---IFLVTDGAVEDERQICDAMKSRLTN 452
+FIA N + +A+ M S + I +TDG+V +E + ++ L
Sbjct: 459 AFDFIAKLRANGGTEMHQALSMALLSEADPALFKQILFITDGSVSNEANLMRLIEENLKQ 518
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLAN 512
R++T GIG+ N YF+ A +G + D ++ +M +L ++ S L +
Sbjct: 519 A-----RLFTIGIGAAPNSYFMTEAAKSGKGTFTFIGDTKQVKRKMAELMSKINSPALTD 573
Query: 513 IAIDT--LKDLDEFEMYPSRIPDLSSESPLIVSGR----YQGKFPDTLKAKG--FLGDLS 564
I ++ ++ +FEMYP+ I DL S PL+++ + FP + K +G +G+ +
Sbjct: 574 IKLNLKGIQQYQQFEMYPNVIGDLYSSDPLVLTYKQPMGMDASFPFS-KTEGPMLMGNYN 632
Query: 565 NFVVELKLQLAKDIPLDR------ICAKQQIDLLTAQAWFSE-DKRLEEKVSKMSVQTGV 617
N Q P+DR I AK ++ + T A E +L +V K + V
Sbjct: 633 NNAWHFTPQK----PVDRGIIQADILAK-RLHIETHNARHKEPQNKLTFEVEKKAKGINV 687
Query: 618 LCEYTRMIIVETDERNNASESPGTK-----KGSKKSDHQK 652
L ++ + D+RN A + P + SK SD QK
Sbjct: 688 LWASEKIAQLSRDKRNIAFDGPDAQFESPSSHSKTSDQQK 727
>gi|116748764|ref|YP_845451.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter
fumaroxidans MPOB]
gi|116697828|gb|ABK17016.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter
fumaroxidans MPOB]
Length = 812
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 140/599 (23%), Positives = 245/599 (40%), Gaps = 59/599 (9%)
Query: 72 PYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGD 131
PY + + + L M + +V+ A V+V T+R G + + P
Sbjct: 61 PYFFVKSEDSDVDRLPMKSTSAEVEIAGVIANVKVVQTYRNE---GKRPIEAIYVFPAST 117
Query: 132 QGSILGVEAEISGKSYHTQLIALGEN--------DGAGKSASVETGSFLKPNIFTLTLPQ 183
+ ++ ++ I G+ T +I E GKSAS+ +PN+F + +
Sbjct: 118 RAAVSAMKMTI-GERTVTAVIRKREEARRDYEQAKSQGKSASLLEQQ--RPNVFQMNVAN 174
Query: 184 IDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREK----------- 231
I G + L +++ LS +G + P P Y P EK
Sbjct: 175 IMPGDEIKTELTYNELLSPTEGVYEFVYPTVVGPRYSNQPAAGAPASEKWVQNPYLHEKE 234
Query: 232 -------IHLNVNAGTG-TEVLC---NTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFD 280
I +NAG E+ C T+ + Q R V LG + + N DF
Sbjct: 235 PPTYSFNITARLNAGLPIREITCPSHETAIRYEGQTRASV-DLGANEKF----GGNRDFV 289
Query: 281 FSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQ 340
Y +S I G+LL D+ P + ++ IFIVD+SGSM
Sbjct: 290 LKYRLSGESIDSGLLL----YRGKDENFFLLTVQPPKRVVEAAIPAREYIFIVDVSGSMH 345
Query: 341 GKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFI 400
G PLE +K L + L P D FN++ F+G++ + + A+ + V RA + IG
Sbjct: 346 GFPLEISKRLLTDLIGGLKPTDCFNVMLFSGDSTVMAERSVPASADNVRRAVEMIG-RRQ 404
Query: 401 AGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRI 460
GG T + L KA+ L G + + TDG V E + + ++S + +
Sbjct: 405 GGGGTELLPALKKALS-LPRKEGVSRSMVIATDGFVTVEEEAFELIRSHIGDAN-----F 458
Query: 461 YTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKD 520
+ FGIG+ N + +A G D +K S +L N+ D
Sbjct: 459 FPFGIGTSVNRMLIEGMARAGAGEPFVITRPDEAPAGAEKFRRYIQSPLLTNVKAD-FGA 517
Query: 521 LDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAK---- 576
+++ P+ IPD+ +E P+++ G+++G+ + KG G F + + ++
Sbjct: 518 FGVYDVEPAGIPDVLAERPVVIFGKWRGEPSGKITVKGSTG-AGEFAKTVDVGGSRPEEG 576
Query: 577 DIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ L + A+ +I LT + D +++++ ++ G+L YT + ++T RN +
Sbjct: 577 NSVLKYLWARARITALTDRNQLRSDATRVGQITELGLKYGLLTSYTSFVAIDTLVRNQS 635
>gi|402822555|ref|ZP_10872033.1| cell wall anchor domain-containing protein [Sphingomonas sp. LH128]
gi|402263905|gb|EJU13790.1| cell wall anchor domain-containing protein [Sphingomonas sp. LH128]
Length = 715
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 125/517 (24%), Positives = 221/517 (42%), Gaps = 81/517 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTP---------- 221
+PN+FT + + G + I + + ++ DG FS+ +P P Y P
Sbjct: 156 RPNMFTTMVANVGPGETVLIAIEYQAPVAQADGTFSLRLPLVVGPRYTPPHTLGTVDAVA 215
Query: 222 ---AIKKIP--------KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVG----KLGYS 266
A+ P + + G EV S +H + +G KLG S
Sbjct: 216 DANAVTAAPVLTPGLGETVNPTSIAIRLAPGFEVANLISRYHRVNITGPLGDRTIKLGGS 275
Query: 267 YESE---VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRK 323
++ VL+W + D + G+ Q H + P S
Sbjct: 276 VPADRDFVLEWRSASADPTL---------GLFAQ----HTAKGDYVMASVTPPADTSSLP 322
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMEL 382
++++F++D SGSM G +E+ K +L AL L P D FN++ F+ T LF S+
Sbjct: 323 TPPREMVFVIDNSGSMGGDSMEEAKASLLHALGTLRPQDHFNVIRFDDTMTRLFDHSVA- 381
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-----IIFLVTDGAVE 437
AT + V A ++ G + A G T + L A+ GS P I+FL TDG +
Sbjct: 382 ATPDQVALARRFAG-SLEAQGGTEMLPALKAALADAATGGGSDPSILRQIVFL-TDGEIS 439
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
+ER++ A+ + +GG ++ GIGS N Y + ++ I G Y + +
Sbjct: 440 NEREMVAAIGA---DGGR--SHVFMVGIGSAPNDYLMERMSTIGGGLYTHIGASGEVTAK 494
Query: 498 MQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
M L VL++ A+ L + E+ PSR+PD+ + PL++ GR TL
Sbjct: 495 MMPLL-----DVLSHPAVRDLSVRVEGGSLELTPSRLPDVYAGRPLVLVGRTD-HLSGTL 548
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIP---LDRICAKQQIDLLTAQAWFS--EDKRLEEKVS 609
G +G E ++ LA +P ++++ A+ +I + A ED++ + +V
Sbjct: 549 TVSGTIG---GKAWERRVDLASALPSPAIEKVWARSRISDIEADRALGKLEDEQADAEVE 605
Query: 610 KMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSK 646
++ +++ ++ T ++ V+ E+P G++
Sbjct: 606 RIGMESTIVTSRTSLVAVD--------ETPSRPAGAR 634
>gi|254414923|ref|ZP_05028687.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
gi|196178412|gb|EDX73412.1| Vault protein inter-alpha-trypsin [Coleofasciculus chthonoplastes
PCC 7420]
Length = 928
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 152/659 (23%), Positives = 263/659 (39%), Gaps = 90/659 (13%)
Query: 73 YVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQ 132
YV P + PL+ V+ V + + V V+ T+ S + P+ ++
Sbjct: 140 YVQSETGNPLVFPLKHTDVQAKVTG--NVSRVEVTQTFENPF---STPLEATYIFPLPNE 194
Query: 133 GSILGVEAEISGKSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDGG 187
++ +E I ++ + E + A + T L + NIFT +L I G
Sbjct: 195 AAVDDMEIRIGDRTNKGNIKKRQEAVAIYEQAKKQGRTAGLLEQERANIFTQSLANIQPG 254
Query: 188 SYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV-------------------TPAIKKIPK 228
+ + +R++ L + G + F FP V PA + K
Sbjct: 255 EQIDVIIRYTDSLKFTGGSYE----FVFPMVVGARYIPGTTIDENTLGGGSAPAPMTLNK 310
Query: 229 R----------------------EKIHLNVNAGTGTEVL-CNTSSHHLKQLRRDVG-KLG 264
I++ V+ G E+ ++ SH ++ R+D G ++
Sbjct: 311 DTDLVPDASRLNAPILPPGTRSGHNINVTVDIEAGVEIKEVHSPSHQIQIERQDQGMRVT 370
Query: 265 YSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTA-KSR 322
S + N D Y V+ G Q+ L D R F +YL+P
Sbjct: 371 LSRRDTI---PNKDLILRYQVA------GDRTQTTVLSQADTRGGHFAVYLIPAIEYNPH 421
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
++ KDV+F++D SGS G+PL + + ++ L+P D+F I+ F+ T S
Sbjct: 422 QLVPKDVVFLIDTSGSQSGEPLNKCQELMRRFINGLNPHDTFTIIDFSDTTRQLSPVPLA 481
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
T + A +I A G T + + + G + I L+TDG + +E QI
Sbjct: 482 NTVQNRNSAMNYIN-QLNASGGTQLRRGIQAVLNFPEVDPGRLRSIVLLTDGYIGNENQI 540
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG-AAYDLDSIEIQMQKL 501
++ L G R+++FG GS N + L +A I RG YD + E+ ++
Sbjct: 541 LAEVQRHLKLGN----RLHSFGAGSSVNRFLLNRIAEIGRGISRIVRYDEPTEEV-AEQF 595
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL- 560
F + + VL NI + + + +YP+ PDL +E PL++ G + P L+ G +
Sbjct: 596 FGQINNPVLTNIQLYWQGEGEPPIIYPATPPDLFAEQPLVLFGCKKDALPGRLEVTGTIA 655
Query: 561 -GDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLC 619
G EL + + + ++ + +I L Q E K E V+ ++ +L
Sbjct: 656 GGQQYQQSFELNFEETGNPAIAQLWGRNRIKDLMNQMVQGETKVGVETVTDTALTYQLLS 715
Query: 620 EYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATA 678
+YT I V + R N P S + + E+P+ I Q + FGD+ A
Sbjct: 716 QYTAFIAVSDEVRVN----PNNHSVSVQ------VPVEMPEGISYQGI---FGDVEVEA 761
>gi|359440721|ref|ZP_09230634.1| uncharacterized protein [Pseudoalteromonas sp. BSi20429]
gi|358037427|dbj|GAA66883.1| uncharacterized protein [Pseudoalteromonas sp. BSi20429]
Length = 673
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 128/573 (22%), Positives = 242/573 (42%), Gaps = 84/573 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ E AGK A++
Sbjct: 81 AVNARYVFPLPDESAVHAMTMRIGERVIKGQIDKKVEAEKKYEEAKKAGKQAALVRQQ-- 138
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------- 221
+ N+F + I G + I L + + + Y G F++ +FP +TP
Sbjct: 139 RANMFITNVANIGPGEQVIIELEYQEIIDYSSGTFAI----RFPTTITPRYHAISGEVEV 194
Query: 222 --------------------AIKKIPKRE---------KIHLNVNAGTGTEVL-CNTSSH 251
+I+ I + + LN++ G E++ N+ H
Sbjct: 195 NTQNQAHVNPLPTGWLSPVYSIQNITQNSTQIDNVPSSQFSLNIDIDVGLELVDINSKFH 254
Query: 252 HLKQLRRDVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF 310
++ DV + L +N ++ DF P + R
Sbjct: 255 NV-----DVQNTAFGQYKIALNETNAVNRDFVLEFKPLQKEQAQAAFFTQQFENGDRYGL 309
Query: 311 CMYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
M + PG ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ F
Sbjct: 310 AMLMPPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGF 369
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
+ S +A+ + RA ++I + A G T I L ++ + G + +
Sbjct: 370 DNIVTPMSDKPLIASDFNLRRAERFI-YSLEADGGTEIQGALNAVLDG-SEFDGFVRQVV 427
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
+TDG+V +E + ++S+L + R++T GIG N +F+R A + +G +
Sbjct: 428 FLTDGSVSNEDALFKNIQSKLGDS-----RLFTVGIGGAPNSFFMRRAADVGKGSFTFIG 482
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGR 545
+ ++ +MQ+LF + LA+ AI L DE + +PS +PDL P++V+ +
Sbjct: 483 STNEVQPKMQQLFDK-----LAHPAITNLALSDESGNSLDFWPSPLPDLYFGEPIMVAIK 537
Query: 546 YQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRL 604
L + G LS ++L Q + + ++ A+Q+I L ++E +
Sbjct: 538 LNNAKSVVLAGQTAQGPLS---IKLSTQNSSSAQGIAKLWARQKIKSLL---LYNEQNTV 591
Query: 605 EEKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
+++V +++ +L +T I +E + +E
Sbjct: 592 KDEVQELAFTHQLLSPFTAFIAIEQTQIKEVAE 624
>gi|332534874|ref|ZP_08410696.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas
haloplanktis ANT/505]
gi|332035673|gb|EGI72162.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas
haloplanktis ANT/505]
Length = 676
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 130/572 (22%), Positives = 240/572 (41%), Gaps = 82/572 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGENDGAGKSASVETGSFL 172
+ + R P+ D+ ++ + I + Q+ E AGK A++
Sbjct: 79 AVNARYVFPLPDESAVHAMTMRIGERVIKGQIDKKVEAEKKYEEAKKAGKQAALVRQQ-- 136
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP----------- 221
+ N+F + I G + I L + + + Y G F++ +FP +TP
Sbjct: 137 RANMFITNVANIGPGEQVIIELEYQEIIDYSSGTFAI----RFPTTITPRYHAISGEVEV 192
Query: 222 --------------------AIKKIPKRE---------KIHLNVNAGTGTEVLCNTSSHH 252
+I+ I + + LN++ G E++ S +H
Sbjct: 193 NTQNQAHTNPLPTGWLSPVYSIQNITQNSTQTDNVPSSQFSLNIDIDVGLELVDINSKYH 252
Query: 253 LKQLRRDVGKLGYSYESEVLKWSN-IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFC 311
DV + L +N ++ DF P + R
Sbjct: 253 ----NVDVQNTAFGQYKIALNETNTVNRDFLLEFKPLQKEQAQAAFFTQQFENGDRYGLA 308
Query: 312 MYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
M + PG ++ + ++++F+VD SGSM G+ +E K AL ALS LD DSFNI+ F+
Sbjct: 309 MLMPPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQAKKALFYALSLLDSDDSFNIIGFD 368
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFL 430
S +A+ + RA ++I + A G T I L ++ + G + +
Sbjct: 369 NIVTPMSDKPLIASDFNLRRAERFI-YSLEADGGTEIQGALNAVLDG-SEFDGFVRQVVF 426
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
+TDG+V +E + ++S+L + R++T GIGS N +F+R A I +G +
Sbjct: 427 LTDGSVSNEDALFKNIQSKLGDS-----RLFTVGIGSAPNSFFMRRAADIGKGSFTFIGS 481
Query: 491 LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE----FEMYPSRIPDLSSESPLIVSGRY 546
++ +MQ+LF + LA+ AI L DE + PS +PDL P++V+ +
Sbjct: 482 TSEVQPKMQQLFDK-----LAHPAITNLALSDESGNSLDFSPSPLPDLYFGEPIMVAIKL 536
Query: 547 QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLE 605
L + G LS ++L Q + + ++ A+Q+I L ++E ++
Sbjct: 537 NNAKSVVLAGQTAQGPLS---IKLSTQNSSSAQGIAKLWARQKIKSLL---LYNEQSVVK 590
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
++V ++++ +L +T I +E + +E
Sbjct: 591 DEVQELALTHQLLSPFTAFIAIEQTQIKEVAE 622
>gi|256823631|ref|YP_003147594.1| Vault protein inter-alpha-trypsin domain-containing protein
[Kangiella koreensis DSM 16069]
gi|256797170|gb|ACV27826.1| Vault protein inter-alpha-trypsin domain protein [Kangiella
koreensis DSM 16069]
Length = 689
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 133/539 (24%), Positives = 243/539 (45%), Gaps = 69/539 (12%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E E+ K + + +G +++ VE ++ N+FT L I G ++I + +
Sbjct: 103 IEGEVQEKQQAKRTYEKAKAEGK-RTSLVEQ---IRDNLFTTKLANIAPGESITIHIEF- 157
Query: 198 QKLSYRDG-EFSVNVPFKFPEYVTPAIKKIPKREKI--------HLNVNAGTGTEVLCNT 248
Q+L + DG FS+ +P +TP K + + + ++ V +T
Sbjct: 158 QQLVHNDGTHFSMRMPLG----ITPRYKPVDATHEFSNNDNNDDYNYFSSDDSETVYLDT 213
Query: 249 SSHHLKQLRRDVGKLGYSYESE----VLKWSNIDFDFSYTVSPSHIF-----GG------ 293
+HH +L + + + V N FD S SP H GG
Sbjct: 214 PAHHSAELSPAFTQFNTGSQPDRPVSVTVHLNPGFDLSLLESPYHKMNTQQTGGQYTIQL 273
Query: 294 ----------VLLQSPSL----------HDVDQREMFCMYLLPGTAK--SRKVFKKDVIF 331
VL P L D + +LP T +K +++IF
Sbjct: 274 ENPAQAERDFVLNWQPQLGQQPKVALFSESYDDHNYHVLMMLPPTHDLVQQKTQPREMIF 333
Query: 332 IVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVER 390
++D SGSM G+ ++ K L ALS+L D+FNI+ F N LF ++ AT +E
Sbjct: 334 VIDSSGSMSGESMQQAKQGLYYALSQLSINDTFNIIDFDNDANKLFDEAVP-ATLSNLEM 392
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A ++ A G T I + A++ +S + + +TDG++ +ERQI ++++L
Sbjct: 393 AKYFVA-TLEADGGTEIAKAINLALDKPDSSL--LRQVVFLTDGSIGNERQIFQMIENQL 449
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVL 510
N R++T GIG+ N YF+ A RG + ++ ++++LF + L
Sbjct: 450 GNN-----RLFTIGIGAAPNSYFMSKAANYGRGTFTYIGKASEVQTKLEQLFKKLRYPAL 504
Query: 511 ANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVEL 570
N+++++ K D+ E+YP R+ DL PL VS R ++++ KG D ++ +L
Sbjct: 505 ENLSLES-KYSDQLELYPGRLRDLYLGEPLFVSYRIPKGVTNSVQVKG-QADAYDWSFKL 562
Query: 571 -KLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ KD + R+ A+ +ID + + S ++ E++ K +++ ++ +YT ++ VE
Sbjct: 563 PPVTNGKDKGIARLWARMKIDAIKSDFNLSHEQS-REQILKTALEYHLMSDYTSLVAVE 620
>gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata
obscuriglobus UQM 2246]
Length = 821
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 179/389 (46%), Gaps = 20/389 (5%)
Query: 250 SHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFS--YTVSPSHIFGGVLLQSPSLHDVDQR 307
+H + +R+ ++ ++E K + +D DF Y I L+ P +
Sbjct: 198 THAVNTVRKSDKEVSVTFER---KQALLDKDFQLFYGHGDKDIGLSPLVYKP----IQTE 250
Query: 308 EMFCMYLL-PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
+ + M+L+ P +K +D++ ++D S SM ++ K A+ LS+L P D F +
Sbjct: 251 DGYFMFLISPQVEAEKKRVARDLVLVLDTSSSMSDIKMQQAKKAVKFCLSQLQPEDRFGV 310
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
V F+ F + + A + ++ A +WI G T I L A+ M ++
Sbjct: 311 VRFSTTVTKFRSELVAANTDYLDLATKWID-GLKTSGGTAIWPALNDALAMRSSDPSRPF 369
Query: 427 IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
+ TDG + D + + + RI+TFG+G N L LA +R
Sbjct: 370 TMVFFTDGQPTVDETNADKIVKNVLAKNTGNTRIFTFGVGDDVNAAMLDQLADSTRAVST 429
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY 546
+ + IE+++ L+ + + VL ++ + T +++ E+YP ++PDL + L+V GRY
Sbjct: 430 YVREAEDIEVKVSGLYAKISNPVLTDVQLATSENVQLHEIYPPKLPDLFQGTQLVVIGRY 489
Query: 547 QGKFPDTLKAKGFLGDLS-------NFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFS 599
G+ P ++ G +G NF + + + KD ++ + A++++ + Q +
Sbjct: 490 TGEGPSVIRLTGLVGKERQELVYEFNFPGKTESDVGKDF-VEPLWARRKVGYILDQIRVN 548
Query: 600 -EDKRLEEKVSKMSVQTGVLCEYTRMIIV 627
E K L ++V ++ + G+ YT ++V
Sbjct: 549 GEKKELVDEVVMLAKRYGIATPYTSHLVV 577
>gi|383452677|ref|YP_005366666.1| von Willebrand factor type A domain-containing protein
[Corallococcus coralloides DSM 2259]
gi|380734892|gb|AFE10894.1| von Willebrand factor type A domain-containing protein
[Corallococcus coralloides DSM 2259]
Length = 900
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 161/326 (49%), Gaps = 26/326 (7%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+++V+F+VD SGSM G L + AL + L L GD FN++AF FS T+
Sbjct: 289 RQEVVFVVDTSGSMSGDSLPQAQAALRLCLRHLREGDRFNVIAFENSFRSFSAQTVPFTQ 348
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+E+A W+ A G T + P+ A++ + ++ L+TDG V +ER+I DA
Sbjct: 349 RTLEQADAWVA-GLKAYGGTELLEPMVTAMKAAPDG-----VVVLLTDGQVGNEREILDA 402
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+ + G R+++FGIG+ + LR LA + G + + I+ ++ F+R
Sbjct: 403 VLAARGTG-----RVFSFGIGTNVSDALLRDLARRTDGAVEFIHPGERIDEKVVAQFSRA 457
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG-DLS 564
+ + ++ + ++ E+ P+ +P L +P + GRY ++ KG G +
Sbjct: 458 LAPRVTDLEV-RFDGVEASELAPATLPPLVDGTPWTLFGRYAQAGTGSVTLKGKSGREPF 516
Query: 565 NFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWF------SEDKRLEEKVSKMSVQTGV 617
+ V L L + P ++++ A ++I + W + ++E++ +++++ +
Sbjct: 517 SLTVRLDLPAQSERPVVEKLWAAERI-----RGWMDAGLVGRRAEAMKERIVRLAIEHQL 571
Query: 618 LCEYTRMIIVETDERN-NASESPGTK 642
+YT ++VE + + AS +P T+
Sbjct: 572 ATQYTSFVVVEERQGDRRASGTPDTR 597
>gi|391230041|ref|ZP_10266247.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Opitutaceae bacterium TAV1]
gi|391219702|gb|EIP98122.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Opitutaceae bacterium TAV1]
Length = 732
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 150/660 (22%), Positives = 259/660 (39%), Gaps = 100/660 (15%)
Query: 58 DPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMG 117
DP D + PY R P + L + + +D A VRV+ +V+ G
Sbjct: 39 DPGSGTGSDNKTLSPYFRVRGARPGVEALPLRSTRVDATVAGVIADVRVT---QVYTNTG 95
Query: 118 SKSCDCRIAVPMGDQGSILGV---------EAEISGKSYHTQLIALGENDGAGKSASVET 168
+ + P + ++ G+ EAEI + Q+ ++G +T
Sbjct: 96 TTPLEAIYVFPGSTRAAVHGLTMTLGDHRLEAEIRERKQARQIYETARSEG-------KT 148
Query: 169 GSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP---- 221
S L +PN+F + + I G +++ LR+++ LS DG +S F +P V P
Sbjct: 149 ASLLEQQRPNVFQMNVANILPGDEITVELRYTELLSPEDGVYS----FVYPGVVGPRYTG 204
Query: 222 -----------AIKKIPK----REKIHLNVNAGTGT---------------------EVL 245
A IPK R + +AGT E +
Sbjct: 205 ASGAGPEDRWVANPYIPKTADDRRAGLTDPDAGTDADGNPVSTFDIAVTLAAGVPVQEAV 264
Query: 246 CNTSSHHLKQLRRDVGKLGYSYESEV-LKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDV 304
C SH K D + + N DF Y ++ + G+L+
Sbjct: 265 CR--SHRTKITYDDPSTARIALDPAARADAGNRDFILRYRLAGEALKTGLLIAEGPPGAG 322
Query: 305 DQRE---MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
M P + + +D +FI+D+SGSM+G PL+ TK L + P
Sbjct: 323 GAAGENWFVAMIQPPARPVAASLPPRDYLFIIDVSGSMRGFPLQTTKTLLRGLFPTMRPQ 382
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS 421
D+FNIV F+G + + + AT + A++ + + G+ + P + L
Sbjct: 383 DTFNIVLFSGHSTSLAEAPVPATPANIRAANELLDRQYGYDGTELL--PALRHAFSLPRP 440
Query: 422 RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+ I +VTDG V E + + ++ L GG ++ FGIGS N + + LA
Sbjct: 441 EATSRTIAIVTDGYVTVEHEAFELVRQHL--GGEHGANVFAFGIGSSVNRHLIEGLARAG 498
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+G D E +L S VL NI + + D +++ P+ +PD+ +E P+I
Sbjct: 499 QGEPFIVTDPAEAEAVAARLGKTIASPVLTNINV-SFDGFDVYDVEPATLPDVFAERPVI 557
Query: 542 VSGRYQGKFPDTLKAKGFLG-------------DLSNFVVEL----KLQLAKDIPLDRIC 584
+ G+++G + G G D F L + L + L R+
Sbjct: 558 LFGKWRGPRVGKITLTGLAGGGAGEEGEGGNAHDSRRFTAALFPGATVPLPRADGLSRLW 617
Query: 585 AKQQI----DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPG 640
A+ +I D + A A E++ E ++++ + +L +YT + V+ R + +PG
Sbjct: 618 ARHRIAQLGDDVEAGAADREERIAE--ITRLGLTWRLLTKYTSFVAVDHVVRRDPGTTPG 675
>gi|418064110|ref|ZP_12701674.1| sortase target protein, partial [Methylobacterium extorquens DSM
13060]
gi|373553036|gb|EHP79636.1| sortase target protein, partial [Methylobacterium extorquens DSM
13060]
Length = 676
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 122/513 (23%), Positives = 224/513 (43%), Gaps = 81/513 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEY--VTPAIKKI--- 226
+PN+FT + I G + +++ + Q + G +++ +P P Y PA+K +
Sbjct: 169 RPNLFTNAVANIGPGETVLVQIAYQQPVRSSAGTYALRLPTVAAPRYSPAPPAVKAVVER 228
Query: 227 ------PKREKIH----------------LNVNAGTG-TEVLCNTSSHHLKQLRRDVGKL 263
P REKI L ++ G T +++H ++ R +
Sbjct: 229 GAADPVPDREKIAAPVLDPARHAPIKPLTLTIDLKAGFTLGQVRSATHAVRIEERSASER 288
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR-----EMFCMYLLPGT 318
+ ++ ++ DF+ ++ +P ++PS+ +R + + P T
Sbjct: 289 RITL-ADGATAADRDFELTWNAAPG--------EAPSIGLFRERVAGAEAVLAVVTPPET 339
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFST 378
A +DV+F++D SGSM G + K +L + L +L GD FN++ F+ + F T
Sbjct: 340 ASPAASVPRDVVFVIDNSGSMGGASMRQAKASLLIGLDRLGAGDRFNVIRFD---HSFDT 396
Query: 379 SM-ELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLT-NSRGSIPIIFLVT 432
+L +A H +F+AG G T + APL A+ T G + + +T
Sbjct: 397 LFPDLVPADA---GHLMRAKSFVAGLQASGGTEMLAPLQAALRDATPEETGRLRQVVFLT 453
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DGA+ +E QI A+ + G S R++ GIGS N Y +R A + RG + D
Sbjct: 454 DGAIGNEAQIFSAIATE--RGRS---RLFMVGIGSAPNGYLMRHAAELGRGSFTQIDTPD 508
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+ +M+ L + S + ++ T + ++ P+R+PDL PL +S R G+
Sbjct: 509 QVTERMRALLVKLESPAVTDLTA-TFSE-PGIDVTPARLPDLYRGEPLTLSARM-GQARG 565
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDL---------LTAQAWFSEDK 602
TL G +G + +L L A+D + ++ A+ +I LTA+A
Sbjct: 566 TLTLTGRIGG-QPWQTQLHLDAAQDGTGIGKLWARAKIAEAETARLTGGLTAEA------ 618
Query: 603 RLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ + ++++ G+ T ++ V+ R A
Sbjct: 619 -ADAAILRLALDYGLTTRLTSLVAVDATPRRPA 650
>gi|410617334|ref|ZP_11328305.1| hypothetical protein GPLA_1529 [Glaciecola polaris LMG 21857]
gi|410163171|dbj|GAC32443.1| hypothetical protein GPLA_1529 [Glaciecola polaris LMG 21857]
Length = 751
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 123/552 (22%), Positives = 232/552 (42%), Gaps = 102/552 (18%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV 219
G+ AS+ T +PN+FT + I LSI + + Q +++ + F++ P P Y
Sbjct: 183 GRKASLITQH--RPNLFTNAIANIGPQESLSITMEYQQVIAFDEQTFALRFPMTITPRYS 240
Query: 220 ------------------------TPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHH--- 252
T + + I + +++G S HH
Sbjct: 241 PTNTIVAEVNHDGWASSTAILSAQTNPVDQPANPVSISVELDSGFALNAEDIVSEHHKII 300
Query: 253 -LKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPS----------------HIFGGVL 295
+ G+ + + + DFS T P+ + +G V+
Sbjct: 301 VTPKHPAQNGEANKRFHIALNNKQPANRDFSLTWRPATSDLPSAAHFTETQGEYRYGLVM 360
Query: 296 LQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
L P + V++R + ++VIF++D SGSM G + K A+ AL
Sbjct: 361 LTPPIQNSVNERHV----------------PREVIFLLDTSGSMAGGSIVQAKQAVGFAL 404
Query: 356 SKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
++L D NI+ FN + T L+ +M A+ ++RA W+ +A G T + LT A
Sbjct: 405 TQLHVDDKLNIIEFNDQPTSLWHKAMP-ASAANIQRAKNWLA-GLMADGGTEMAPALTVA 462
Query: 415 V--------EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
+ E +T ++ + +TDG+V +E + +++ L N R++T GIG
Sbjct: 463 LHNNKATDDESITPINQTLRQVVFITDGSVSNEDALMGLIETELNNS-----RLFTVGIG 517
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEM 526
S N YF+ A RG + D ++ +M +LF + V+ +I ID ++ E
Sbjct: 518 SAPNSYFMTQAAQAGRGTFTYIGDTSQVQQKMGELFNKLAMPVMQDIHIDFARNT---EF 574
Query: 527 YPSRIPDLSSESPLIVSGR---------YQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD 577
YP+ IPDL + P+++ R + PD A G S+ +++ L
Sbjct: 575 YPNIIPDLYAAQPVVIHYRVPVLLTESAMANQQPD--NAVKVTGRQSSLPWSIRMPLTPT 632
Query: 578 IP---LDRICAKQQIDLLTAQAWFSEDKRLEEK----VSKMSVQTGVLCEYTRMIIVETD 630
P L A+++I L+ ++ + ++ ++ ++++ ++ ++T +I V D
Sbjct: 633 APHAGLGVAWAREKITQLSNTLRREQNSEMRQQWQTNITAIALEHHIVSQFTSLIAV--D 690
Query: 631 ERNNASESPGTK 642
E+ + S+ +K
Sbjct: 691 EQVSKSQHETSK 702
>gi|167623667|ref|YP_001673961.1| cell wall anchor domain-containing protein [Shewanella halifaxensis
HAW-EB4]
gi|167353689|gb|ABZ76302.1| LPXTG-motif cell wall anchor domain [Shewanella halifaxensis
HAW-EB4]
Length = 850
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 164/353 (46%), Gaps = 44/353 (12%)
Query: 311 CMYLLP--GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
+ L+P G+ ++++ ++D SGSM G + K+AL AL+ L P DSFN++
Sbjct: 437 LVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALKYALAGLRPQDSFNVLQ 496
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG--SIP 426
FN +S + AT + RA +I A G T + L A+ L N RG S P
Sbjct: 497 FNSTVERWSRHVMPATAINLGRAQNYIN-GLQADGGTEMSLALDAALTKLDNDRGHNSKP 555
Query: 427 I-----------------------IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF 463
+ + +TDGAV +E ++ + +K++L R++T
Sbjct: 556 VHDDDRYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKNQLGES-----RLFTI 610
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE 523
GIGS N +F++ A + RG Y LD + ++ L + + ++ + D
Sbjct: 611 GIGSAPNAHFMQRAAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVTDVELH-FSDGSV 669
Query: 524 FEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD--------LSNFVVELKLQLA 575
+ +P RIPDL + P++V+ R D L +G L L++ L+ A
Sbjct: 670 PDYWPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSAAQVNDLEQA 729
Query: 576 KDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
K LD I A++QI L + +R+E++++ ++++ ++ YT ++ V+
Sbjct: 730 KG--LDLIWARKQIAALELSKQTANKERIEKQITAIAMKFHIMSAYTSLVAVD 780
>gi|367474886|ref|ZP_09474377.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 285]
gi|365272815|emb|CCD86845.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 285]
Length = 755
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 119/478 (24%), Positives = 213/478 (44%), Gaps = 76/478 (15%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV 219
GK+A++ +PNIFT +L I G + +++ + Q ++ GEFS+ VP P Y
Sbjct: 160 GKTAALTEQE--RPNIFTNSLANIGPGETVLVQIEYQQPVAQVAGEFSLRVPLVVAPRYN 217
Query: 220 TPAIKK------------------IPKREKI----------------HLNVNAGTGTEVL 245
I + +P R++I + V G +
Sbjct: 218 PRPIVQSVDLRPDSNGWGAASNDTVPDRDRISPEVLDPAKNDPVNPTRITVRLQAGFALG 277
Query: 246 CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVD 305
S HH + K +E + ++ DF+ ++ + S V L + D D
Sbjct: 278 EVKSHHHQVTVDSPDAKTRVVTLAEGVVPADRDFELTWKPA-SDKLPSVGLFHEQVGDAD 336
Query: 306 QREMFCMYLLPGT--AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
++ P A +++ +DVIF++D SGSM G + K +L AL +L PGD
Sbjct: 337 ---YLLAFVTPPAVAATTQRPQPRDVIFVIDNSGSMGGTSIRQAKASLLYALGRLQPGDR 393
Query: 364 FNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
FN++ F+ T LF +S+ A E V A +++G GG+ + A + ++S
Sbjct: 394 FNVIRFDDTMTVLFPSSVP-ADAEHVGNATRFVGALEARGGTEMVPAMRAALTDDGSDSD 452
Query: 423 GSIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
++FL TDGA+ +E+Q+ + AM+ R RI+ GIGS N Y + A
Sbjct: 453 RVRQVVFL-TDGAIGNEQQLFETITAMRGR--------SRIFMVGIGSAPNTYLMSRAAE 503
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTR-------GFSSVLANIAIDTLKDLDEFEMYPSRIP 532
+ RG + ++ +E +M+ LF + G S+ + + D + P+ +P
Sbjct: 504 LGRGAFTHIGSVEQVEERMRDLFAKLENPVVTGLSASFSEASAD---------LTPAVLP 554
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
D+ PL+++ + + +L+ KG +GD + + L L A+ L ++ A+++I
Sbjct: 555 DVYRNEPLVLAAKID-RLAGSLQLKGRIGD-QPWTITLPLSGAAEGKGLSKLWARRKI 610
>gi|240137956|ref|YP_002962428.1| hypothetical protein MexAM1_META1p1287 [Methylobacterium extorquens
AM1]
gi|240007925|gb|ACS39151.1| conserved hypothetical protein [Methylobacterium extorquens AM1]
Length = 735
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 122/513 (23%), Positives = 224/513 (43%), Gaps = 81/513 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEY--VTPAIKKI--- 226
+PN+FT + I G + +++ + Q + G +++ +P P Y PA+K +
Sbjct: 169 RPNLFTNAVANIGPGETVLVQIAYQQPVRSSAGTYALRLPTVAAPRYSPAPPAVKAVVER 228
Query: 227 ------PKREKIH----------------LNVNAGTG-TEVLCNTSSHHLKQLRRDVGKL 263
P REKI L ++ G T +++H ++ R +
Sbjct: 229 GAADPVPDREKIAAPVLDPARHAPIKPLTLTIDLKAGFTLGQVRSATHAVRIEERSASER 288
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR-----EMFCMYLLPGT 318
+ ++ ++ DF+ ++ +P ++PS+ +R + + P T
Sbjct: 289 RITL-ADGATAADRDFELTWNAAPG--------EAPSIGLFRERVAGAEAVLAVVTPPET 339
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFST 378
A +DV+F++D SGSM G + K +L + L +L GD FN++ F+ + F T
Sbjct: 340 ASPAASVPRDVVFVIDNSGSMGGASMRQAKASLLIGLDRLGAGDRFNVIRFD---HSFDT 396
Query: 379 SM-ELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLT-NSRGSIPIIFLVT 432
+L +A H +F+AG G T + APL A+ T G + + +T
Sbjct: 397 LFPDLVPADA---GHLMRAKSFVAGLQASGGTEMLAPLQAALRDATPEETGRLRQVVFLT 453
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DGA+ +E QI A+ + G S R++ GIGS N Y +R A + RG + D
Sbjct: 454 DGAIGNEAQIFSAIATE--RGRS---RLFMVGIGSAPNGYLMRHAAELGRGSFTQIDTPD 508
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+ +M+ L + S + ++ T + ++ P+R+PDL PL +S R G+
Sbjct: 509 QVTERMRALLVKLESPAVTDLTA-TFSE-PGIDVTPARLPDLYRGEPLTLSARM-GQARG 565
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDL---------LTAQAWFSEDK 602
TL G +G + +L L A+D + ++ A+ +I LTA+A
Sbjct: 566 TLTLTGRIGG-QPWQTQLHLDAAQDGTGIGKLWARAKIAEAETARLTGGLTAEA------ 618
Query: 603 RLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ + ++++ G+ T ++ V+ R A
Sbjct: 619 -ADAAILRLALDYGLTTRLTSLVAVDATPRRPA 650
>gi|338530650|ref|YP_004663984.1| von Willebrand factor type A domain-containing protein [Myxococcus
fulvus HW-1]
gi|337256746|gb|AEI62906.1| von Willebrand factor type A domain-containing protein [Myxococcus
fulvus HW-1]
Length = 856
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 157/319 (49%), Gaps = 19/319 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K++V+F+VD+SGSM G L + AL + L L GD FN++AF F T+
Sbjct: 281 KQEVVFVVDVSGSMAGGSLPQAQAALRLCLRHLREGDRFNVIAFENRFQTFQPQPVPFTQ 340
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+E A +W+ A G T + AP+ AV+ + +I L+TDG V +E +I A
Sbjct: 341 RTLEEADRWVAA-LNANGGTELLAPMRAAVQAAPDG-----VIVLLTDGQVGNESEILRA 394
Query: 446 -MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+++R T R+Y+FGIG+ + LR LA + G + + I+ ++ F+R
Sbjct: 395 VLEARKTA------RVYSFGIGTNVSDVLLRDLAKQTGGDVEFIHPGERIDDKVVAQFSR 448
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG-DL 563
+ + + ++ E+ P+ +P L P + GRY T+ +G G +
Sbjct: 449 ALAPRVTELEAH-FDGVECVELAPAELPPLVDGVPWTLLGRYASPGTGTVTLRGRSGREP 507
Query: 564 SNFVVELKLQLAKDIP-LDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEY 621
+ V + L A D P ++++ A ++I AQ + +++++ +++V ++ Y
Sbjct: 508 FSLTVRVDLPAASDRPAVEKLWAAERIRGWEAAQLTGRRAESMKQRIVELAVAHQLVTRY 567
Query: 622 TRMIIVE--TDERNNASES 638
T ++VE ER N+ ++
Sbjct: 568 TSFVVVEERKGERRNSGQA 586
>gi|355674054|ref|ZP_09059406.1| hypothetical protein HMPREF9469_02443 [Clostridium citroniae
WAL-17108]
gi|354814177|gb|EHE98778.1| hypothetical protein HMPREF9469_02443 [Clostridium citroniae
WAL-17108]
Length = 677
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 132/609 (21%), Positives = 263/609 (43%), Gaps = 66/609 (10%)
Query: 63 DNPDIPSYQPY--VHGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS 120
D+PD + PY V G P PL+ V ++ + +V + + G
Sbjct: 31 DDPD-KTLAPYFFVEGADPDPDSFPLKETKVSSQINGVIADTYVT-----QTYTNEGKTP 84
Query: 121 CDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGEND-------GAGKSASVETGSFLK 173
+ + P + S+ G+ ++ + ++ E GKSAS+ +
Sbjct: 85 INAKYIFPASTKASVHGMTMQVGNRLITARIKEKEEAREEFEEAKSQGKSASLLEQQ--R 142
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEYV------------- 219
PN+F++ + + + I+L +++ + +G + P P Y
Sbjct: 143 PNVFSMDVANVMPNDTVVIQLHYTEMVVSTEGSYQFVFPTVTGPRYTSSPEDPGKDDNQW 202
Query: 220 --TPAIK--KIPK-REKIHLNVNAGTG-TEVLCNTSSHHLKQLRRDVGKLGYSYESEVLK 273
TP +K K P R I ++++ G T++ C SH +K + + E
Sbjct: 203 VSTPYLKEGKTPSGRYDIAVSLSTGVPVTDITCR--SHSIKIDKSSASAARITLEDPEDF 260
Query: 274 WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIF 331
+ DF Y ++ + G+LL D + E F M ++ P + ++ IF
Sbjct: 261 AGDRDFILDYKLTGQDVNCGLLL------DQGEEENFFMLMIQPPKRYDPEDIPPREYIF 314
Query: 332 IVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERA 391
++D+SGSM G PL+ K + ++ L D+FN++ F+G + S AT E ++RA
Sbjct: 315 LMDVSGSMYGFPLDTAKGLIKDLVTSLRDTDTFNLILFSGASSRMSPVSVPATAENIQRA 374
Query: 392 HQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLT 451
+ I GG T I L A+ + + G+ I +TDG + E++I + L
Sbjct: 375 VELIDRQE-GGGGTEIAPALRDAL-AIPKTEGTSRSIITITDGYISGEKEIFGIISQNLA 432
Query: 452 NGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLA 511
+ ++FGIG N Y + +A + G D D + K F + + A
Sbjct: 433 D-----TDFFSFGIGDSVNRYLIDGIAKVGLGESFVVTD-DKDALDTAKRFC---TYIQA 483
Query: 512 NIAIDTLKDLDEFEMY---PSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVV 568
+ D + FE+Y P+ +P L ++ P+++ G+++G+ +++ G G+ ++V
Sbjct: 484 PLLTDIQVSYNGFEVYDVEPAALPTLFAQRPIVLFGKWRGEPGGSIQVTGRSGN-RDYVQ 542
Query: 569 ELKLQLA---KDIPLDRICAKQQIDLLT-AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRM 624
++++ A + + + A+ ++ LT ED ++++V+++ ++ ++ YT
Sbjct: 543 DIQVSQASIGSNPGISYLWARTMVENLTDYNTGLEEDDSIKKEVTQLGLKYSMMTPYTSF 602
Query: 625 IIVETDERN 633
+ V + RN
Sbjct: 603 VAVMEEVRN 611
>gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium
HTCC2148]
gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium
HTCC2148]
Length = 686
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 172/354 (48%), Gaps = 28/354 (7%)
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
DQ M + P + ++ + ++++F+VD SGSM G ++ K +L AL L P D F
Sbjct: 294 DQYYGLLMLVPPASQRAAETVPREIVFVVDTSGSMGGVSIKQAKGSLTRALRHLGPNDRF 353
Query: 365 NIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM-----LT 419
N++ FN A+ ++ A +++ + A G T + L A+++
Sbjct: 354 NVIEFNSSHRALFQHAVPASHHNLQLASEYVR-HLEASGGTEMMPALQLALKLPGAQDEL 412
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
++ + +TDGAV +E + + + L GGS R++T GIGS N +F+R A
Sbjct: 413 RPEPALRQVIFITDGAVGNESALFEHIVDSL--GGS---RLFTVGIGSAPNAWFMRKAAE 467
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
RG + D+ + +M LF V ++ +D +D ++P+R PDL + P
Sbjct: 468 YGRGTFTYIGDVAEVGEKMDALFLNLTRPVATHLNVDWPDGVD---VWPARTPDLYAGEP 524
Query: 540 LIVSGRYQGKFP-DTLKAKGFLGDLSNFVVELKLQLAKDIPLD--------RICAKQQID 590
L+++ + + P D LK +G LG+ + + +L+ A + PL + A+++I+
Sbjct: 525 LLIAVKLGDELPIDGLKIQGLLGE-TEWSTKLQFPAALN-PLSMNNSEGVATLWARKKIE 582
Query: 591 LLTAQAWF--SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTK 642
L + + SED + E+V ++++ +L YT + VE SE+ +K
Sbjct: 583 GLIDEKYRGKSEDW-VRERVLPLALKHSLLSPYTSFVAVEEVVSRPESEAAKSK 635
>gi|374851734|dbj|BAL54685.1| vault protein inter-alpha-trypsin domain protein [uncultured
candidate division OP1 bacterium]
gi|374855473|dbj|BAL58329.1| Vault protein inter-alpha-trypsin domain protein [uncultured
candidate division OP1 bacterium]
Length = 763
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 188/383 (49%), Gaps = 17/383 (4%)
Query: 248 TSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR 307
+ +H + + + S+E+ ++ S DF YTVS + GV + + + D D
Sbjct: 200 SPTHKISVQKTSESEATVSFEATQVQ-SAQDFRLFYTVSEKEL--GVDVITYRVGDEDG- 255
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
F + + P + V KD++F+VD+SGSM G+ L+ K +L A+ KL P D +V
Sbjct: 256 -YFLLLVSPAQGEIAPV-PKDIVFVVDVSGSMDGEKLQQAKQSLKYAIGKLSPQDRVAVV 313
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFI-AGGSTNICAPLTKAVEMLTNSRGS-- 424
AF+ FS +L + + ++R+ ++ + A G TNI L + +++L R +
Sbjct: 314 AFSDTVTEFSA--QLVSGDGLDRSQLEAFVDKLEATGGTNINDALLRGLDLLAADRSAER 371
Query: 425 IPIIFLVTDGAVED-ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ ++ +TDG E +K+ + RI+ FG+G N L L+ + G
Sbjct: 372 LKMLVFLTDGLPRTGETDTTQIVKNVTEKNAQLNARIFCFGVGYDVNTILLDTLSEQNGG 431
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
+ ++IE ++ + + + + +L ++ I + DL +++YP ++PDL S + +
Sbjct: 432 FSTYVEPGENIETEIAQFYEKVGAPLLTDLQI-SFGDLKVYDLYPPKLPDLFKGSQIQIV 490
Query: 544 GRYQ--GKFPDTLKAKGFLGDL-SNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF-S 599
GRY+ GK L K D S F +EL + A+ L R+ A ++I L +
Sbjct: 491 GRYRAPGKGTVALTGKRQGADWRSAFEIELPGESARYSFLPRMWAARKIGYLLDEIRLKG 550
Query: 600 EDKRLEEKVSKMSVQTGVLCEYT 622
E++ L + V K++ + G++ YT
Sbjct: 551 ENQELVDAVKKLAEKFGIVTPYT 573
>gi|154252742|ref|YP_001413566.1| vault protein inter-alpha-trypsin subunit [Parvibaculum
lavamentivorans DS-1]
gi|154156692|gb|ABS63909.1| Vault protein inter-alpha-trypsin domain protein [Parvibaculum
lavamentivorans DS-1]
Length = 755
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 120/516 (23%), Positives = 225/516 (43%), Gaps = 68/516 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGE-FSVNVP------------------- 212
+PN+FT ++ I G + +++ + Q + RDG+ FS+ P
Sbjct: 163 RPNVFTNSVANIGPGETIIVQIEYQQTVR-RDGDRFSLRFPMVVAPRYTPKTADPQLVDF 221
Query: 213 -----------------FKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQ 255
+ P + PA +I + L ++ G + +S+HH
Sbjct: 222 APGGGWGEVRQSEPENDLEQPPVLHPAQGQI---NPVSLALSLDAGFALGDISSTHHKIA 278
Query: 256 LRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
L RD + +E L +N DF+ + + + L + V + + L
Sbjct: 279 LNRDGKQKATLKLAEELTPANKDFELVWKPAAAKAPAAALFR----ERVGNEDYLLVMLT 334
Query: 316 PGTAKSRKVFK-KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-T 373
P + + K ++ IF++D SGSM G + K +L AL +L PGD+FN++ F+ T
Sbjct: 335 PPSGSVQPEAKPREAIFVIDNSGSMSGPSMVQAKESLLWALDRLKPGDTFNVIRFDDTLT 394
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
LF ++ A E + A +++ GG+ + A ++ N + I +TD
Sbjct: 395 VLFPDAVP-AHGENLAVAKKFVKSLEANGGTEMLPALRASLIDRNVNDGTRLRQIVFLTD 453
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
GA+ +E ++ + S L R++T GIGS N YF+ + RG +
Sbjct: 454 GAISNEAELFHEITSNLGRS-----RLFTVGIGSAPNSYFMTRASEAGRGTFTHIGKETE 508
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT 553
+ +M +LF + + V+ NI T D E +P+ +PDL P+++S R
Sbjct: 509 VTERMAELFEKLQNPVMTNITA-TWPDGRTTESWPNPVPDLYKGEPVVLSARMP------ 561
Query: 554 LKAKGFL---GDLSNFVVELKLQL--AKDIP-LDRICAKQQIDLLTAQAWFSED-KRLEE 606
KA G L GD++ E+++QL + P + ++ A+ +I L A A + D ++ +
Sbjct: 562 -KATGTLTLKGDVAGEPWEVRMQLNAGETRPGIGKLWARNKIGQLEADAQIAGDWEKHDA 620
Query: 607 KVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTK 642
++ ++++ ++ T ++ V+ A E +K
Sbjct: 621 EILRVALDHNLVSRRTSLVAVDVTPSRPAGEKLASK 656
>gi|255671683|gb|ACU26442.1| uncharacterized protein [uncultured bacterium HF186_25m_18N5]
gi|255671756|gb|ACU26513.1| uncharacterized protein [uncultured bacterium HF186_25m_27D22]
Length = 836
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 130/543 (23%), Positives = 228/543 (41%), Gaps = 68/543 (12%)
Query: 133 GSILG---VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSY 189
G I+G +EAEI K +L + +G ++A VE+ +PN+F + + G
Sbjct: 103 GLIVGGRLIEAEIRAKEKARELYEAAKAEGR-RAALVESK---RPNVFEQAVANVLPGEL 158
Query: 190 LSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKI-------------PK-------R 229
+ + LR+ + Y D E+ F FP V P + P+ R
Sbjct: 159 VHVTLRYVAPVHYEDAEYV----FTFPGVVGPRYASVKGSEAMTTQAAVNPRYIADKETR 214
Query: 230 EKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSH 289
++ L ++ V S H ++ K Y ++ N + Y ++ +
Sbjct: 215 HRVSLELHLNESATVFQAHSPSHALEIDAGPDKSTYVSLADPDALPNRTIEVRYMLASTQ 274
Query: 290 IFGGVLLQSPSLH-DVDQREMFCMYLLPG-TAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
V S H Q F + L P +V K++ F+VD SGSM G+PL+
Sbjct: 275 PVARV-----SAHRKPGQTGTFALTLEPPLKVDPDQVTPKELFFVVDTSGSMMGEPLDKA 329
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAG----G 403
+ A+ AL ++ P D+F I+ F + T E + + G+ FI G
Sbjct: 330 RAAMRYALERMGPDDTFQIIDFASGVASLAPRPLPNTPENLRK-----GLAFIEAMTSQG 384
Query: 404 STNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF 463
T + A + A++ T G + I+ +TDG + ++ I D + + R+++F
Sbjct: 385 GTEMLAGIRAALDGPTPP-GRLRIVAFMTDGYIGNDGDILDYIDQSVGQA-----RLFSF 438
Query: 464 GIGSYCNHYFLRMLAMISRGYY------GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT 517
G+G N Y L +A RG AA +D + ++ + R +L +++ID
Sbjct: 439 GVGEDVNRYLLEEMATRGRGTVQYVRLGDAAAPVDEV---VETFYARMGQPLLTDVSID- 494
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRY--QGKFPDTLK-AKGFLGDLSNFVVELKLQL 574
L + P IPDL PL + GRY G+ T++ ++G +V+L
Sbjct: 495 WGALKVESLSPRLIPDLFVGLPLRIFGRYTRAGRAEVTIRGSRGGERVAIPVIVDLPEHA 554
Query: 575 AKDIPLDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
+ ++ + AK I +L AW + + + V+K+ ++ +L YT + VE D
Sbjct: 555 PEHSAIELMWAKDTIARMLHPNAWEDTQETI-DAVTKLGLRHNILTRYTSFVAVERDLVA 613
Query: 634 NAS 636
NA+
Sbjct: 614 NAT 616
>gi|405372957|ref|ZP_11027852.1| von Willebrand factor type A domain protein [Chondromyces
apiculatus DSM 436]
gi|397087996|gb|EJJ19007.1| von Willebrand factor type A domain protein [Myxococcus sp.
(contaminant ex DSM 436)]
Length = 872
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/438 (24%), Positives = 204/438 (46%), Gaps = 43/438 (9%)
Query: 217 EYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKL--GYSY------E 268
+ +TP + + ++ L V+ G EV+ + SH + R + ++ G+S
Sbjct: 176 DRITPPVGAVSYGLRMDLLVD--LGREVVVESPSHAITVTREEGSRVRVGFSRGEVSLDR 233
Query: 269 SEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLP---GTAKSRKVF 325
VL + D +T +H GG D F + ++P G A +
Sbjct: 234 DLVLSLRSPDNSAVFTPLVTHRKGG-----------DGPGTFALTVVPDLLGMASAPP-- 280
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K++V+F VD+SGSM G+ L + AL + L L GD FNI+AF F ++
Sbjct: 281 KQEVVFAVDVSGSMDGESLPQAQAALRLCLRHLREGDRFNIIAFESGFTTFQPQPVPFSQ 340
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+E+A QW+ GG T + AP+ AV+ + + ++ L+TDG V +E +I A
Sbjct: 341 RTLEQADQWVAALRPLGG-TELLAPMKAAVQAVPDG-----VVVLLTDGQVGNESEILQA 394
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+ + + R+Y+FGIG+ + LR +A + G + + I+ ++ F+R
Sbjct: 395 VLADRKSA-----RVYSFGIGTNVSDVLLRDMAKQTGGAVEFIHPGERIDDKVVAQFSRA 449
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG-DLS 564
+ ++ + + ++ E+ P+ +P L P + GRY T+ +G G +
Sbjct: 450 LAPRVSELEV-RFDGVECSELAPAELPPLIDGVPWTLLGRYAAPGTGTVTLRGRSGREPF 508
Query: 565 NFVVELKLQLAKDIP-LDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYT 622
+ V + L D P ++++ A ++I +AQ + +++++ ++V ++ +YT
Sbjct: 509 SLTVRVDLPSVSDRPAVEKLWAAERIRSWESAQLTGRRAESMKQRIIDLAVAHQIVTQYT 568
Query: 623 RMIIVE--TDERNNASES 638
++VE T ER + ++
Sbjct: 569 SFVVVEERTGERRTSEQA 586
>gi|146343645|ref|YP_001208693.1| hypothetical protein BRADO6887 [Bradyrhizobium sp. ORS 278]
gi|146196451|emb|CAL80478.1| conserved hypothetical protein; protein containing a von Willebrand
factor type A (VWA) domain; putative signal peptide
[Bradyrhizobium sp. ORS 278]
Length = 755
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 129/527 (24%), Positives = 229/527 (43%), Gaps = 76/527 (14%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF----- 215
GK+A++ +PNIFT +L I G + +++ + Q ++ GEFS+ VP
Sbjct: 160 GKTAALTEQE--RPNIFTNSLANIGPGETILVQIEYQQPVAQVAGEFSLRVPLVVAPRYN 217
Query: 216 --------------------------------PEYVTPAIKKIPKREKIHLNVNAGTGTE 243
PE + PA KI + + AG
Sbjct: 218 PKPIVQSVELRPASNGWGAASNDPVPDRDRISPEVLDPAKNDPVNPTKITVRLQAGFA-- 275
Query: 244 VLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSN----IDFDFSYTVSP-SHIFGGVLLQS 298
L SHH + V E+ V+ ++ D DF T P S V L
Sbjct: 276 -LGEVKSHHHQ-----VTVESTDAETRVITLADGVVPADRDFELTWKPASENMPSVGLFH 329
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
+ D D F TA R +DVIF++D SGSM G + K +L AL +L
Sbjct: 330 EQVGDADYLLAFVTPPAVATATQRPQ-PRDVIFVIDNSGSMGGTSIRQAKASLLYALGRL 388
Query: 359 DPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
P D FN++ F+ T LF +S+ A E V A +++ GG+ + A +
Sbjct: 389 QPNDRFNVIRFDDTMTVLFPSSVP-ADAEHVGNATRFVSSLDARGGTEMVPAMRAALTDD 447
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
++S ++FL TDGA+ +++Q+ + AM+ R RI+ GIGS N Y +
Sbjct: 448 GSDSDRMRQVVFL-TDGAIGNDQQLFETITAMRGR--------SRIFMVGIGSAPNTYLM 498
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
A + RG + ++ +E +M+ LF + + V+ + T + ++ P+ +PD+
Sbjct: 499 SRAAELGRGAFTHIGSVEQVEERMRDLFAKLENPVVTGLTA-TFSEASA-DLTPAVLPDV 556
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI---D 590
PL+++ + + +L+ KG +GD + + L L A+ L ++ A+++I +
Sbjct: 557 YRNEPLVLAAKID-RLAGSLQLKGRIGD-QPWTITLPLSGAAEGKGLSKLWARRKIGDAE 614
Query: 591 LLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
+ + D+ + + K+++Q ++ T ++ V+ R + E
Sbjct: 615 VAKTMRQMTPDE-ADGAILKLALQHQLVTRLTSLVAVDKTPRRSDGE 660
>gi|218529650|ref|YP_002420466.1| Vault protein inter-alpha-trypsin domain-containing protein
[Methylobacterium extorquens CM4]
gi|218521953|gb|ACK82538.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium
extorquens CM4]
Length = 738
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 122/514 (23%), Positives = 226/514 (43%), Gaps = 82/514 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEY------VTPAIKK 225
+PN+FT + I G + +++ + Q + G +++ +P P Y VT +++
Sbjct: 169 RPNLFTNAVANIGPGETVLVQIEYQQPVRSSAGTYALRLPTVAAPRYSPAPPAVTAVVER 228
Query: 226 ------IPKREKI---------HLNVNAGTGTEVL--------CNTSSHHLKQLRRDVGK 262
+P REKI H +N T T L +++H ++ R +
Sbjct: 229 GAAADPVPDREKIAAPVLDPARHAPINPLTLTIDLMAGFPLGQVRSATHAVRIEERSASE 288
Query: 263 LGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSL-----HDVDQREMFCMYLLPG 317
+ ++ ++ DF+ ++T +P ++PS+ + + P
Sbjct: 289 RRITL-ADGATAADRDFELTWTAAPG--------EAPSIGLFREQVAGAEAVLAVVTPPE 339
Query: 318 TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS 377
+A +DV+F++D SGSM G + K +L + L +L GD FN++ F+ + F
Sbjct: 340 SASPASPVPRDVVFVIDNSGSMGGASMRQAKASLLIGLDRLGAGDRFNVIRFD---HSFD 396
Query: 378 TSM-ELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLT-NSRGSIPIIFLV 431
T +L +A H +F+AG G T + APL A+ T G + + +
Sbjct: 397 TLFPDLVPADA---GHLMRAKSFVAGLQASGGTEMLAPLQAALRDATPEETGRLRQVVFL 453
Query: 432 TDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
TDGA+ +E QI A+ + G S R++ GIGS N Y +R A + +G +
Sbjct: 454 TDGAIGNEAQIFSAIATE--RGRS---RLFMVGIGSAPNGYLMRHAAEVGQGSFTQIDTP 508
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP 551
D + +M+ L + S + ++ T + ++ P+R+PDL PL +S R G+
Sbjct: 509 DQVTERMRALLVKLESPAVTDLTA-TFSE-PGIDVTPARLPDLYRGEPLTLSARM-GQAR 565
Query: 552 DTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDKRL------ 604
TL G +G + +L L A+D + ++ A+ +I A+A E RL
Sbjct: 566 GTLTLTGRIGG-QPWQTQLHLDAAQDGTGIGKLWARAKI----AEA---ETARLTGGLTT 617
Query: 605 ---EEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ + ++++ G+ T ++ V+ R A
Sbjct: 618 EAADAAILRLALDHGLTTRLTSLVAVDATPRRPA 651
>gi|428310089|ref|YP_007121066.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
gi|428251701|gb|AFZ17660.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
Length = 1016
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 110/492 (22%), Positives = 208/492 (42%), Gaps = 48/492 (9%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT------------- 220
NIFT +L I G + + +R++ L + G + P P Y+
Sbjct: 478 NIFTQSLANIKPGEKIDVTIRYTNSLKFEKGNYEFVFPMVVGPRYIPGTKSDSQGNTNPV 537
Query: 221 --------PAIKKIPKREKIHLNVNAGTGTEVL-CNTSSHHLKQLRRDVGKLGYSYESEV 271
P I + I + V G + + SH + L+ G + ++
Sbjct: 538 KDASKITPPTIPTGRSGQDIGVTVEIEGGVPISNVRSPSHQISTLQD-----GRAVRIQL 592
Query: 272 LKWSNI---DFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTA-KSRKVFK 326
K + I D Y V+ G QS L D+R F YL+P +S ++
Sbjct: 593 DKQNTIPNKDLIVRYQVA------GNETQSTVLTQADERGGHFATYLIPAVQYQSNEIVP 646
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
KDV+F++D SGS G + +K + ++ L+ D+F I+ F + S+ T +
Sbjct: 647 KDVVFLMDTSGSQAGSAIAQSKELMRRFINGLNSNDTFTIIDFASTSQQLSSKPLQNTPQ 706
Query: 387 AVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
++A +I GI+ A G + + + K + G + + L+TDG + ++ ++
Sbjct: 707 NRQKALDYINGID--ANGGSELMNGIEKVLSFPAAPEGRLRSVVLITDGLIGNDNEVIAE 764
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
++ L G R+Y+FG+GS N + + +A RG + + +K F R
Sbjct: 765 VQKNLKPGN----RLYSFGVGSSVNRFLIDRIAEEGRGIAEVLPPDEPAQKVAEKFFRRI 820
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL--GDL 563
+ VL NI + + ++YP + DL + PL++ GR + LK G G
Sbjct: 821 NNPVLTNIEVSWEGSGKQPDIYPLKPADLFANQPLVLFGRKGDRSKGNLKIMGMAAGGKR 880
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTR 623
V ++ + + + ++ + +I L +Q + +E + V+ +++ +L +YT
Sbjct: 881 YEKTVPIEFEGGGNPAIAQLWGRARIKELMSQMFDAETASGVKAVTDTALKYRLLSQYTA 940
Query: 624 MIIVETDERNNA 635
+ V + R +A
Sbjct: 941 FVAVTEEVRVDA 952
>gi|149918791|ref|ZP_01907278.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1]
gi|149820392|gb|EDM79808.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1]
Length = 877
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 120/533 (22%), Positives = 220/533 (41%), Gaps = 64/533 (12%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY 218
AGK+A + +PNIFT + I G + + + Q L DG + + +P P +
Sbjct: 173 AGKAAGLLEQE--RPNIFTQRVANIPPGQTIEVSMHVVQPLEQEDGRYELVLPTVVGPRF 230
Query: 219 V-------------------TPAIKKIPKREKI---------------HLNVNAGTGTEV 244
+ P ++P +I +V TG
Sbjct: 231 IPGTPLAQHRQPAPGENTGIAPNTDEVPDASRITAPVVPEGFTTCAHVEASVVIDTGLRP 290
Query: 245 LCNTSSHHLKQLRR--DVGKLGYSYESE-VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSL 301
S H + R DV + +S+ +N DF S+ + ++ Q P+
Sbjct: 291 RRIQSKFHGIDIMRSGDVAAIELDADSDGAPVVANRDFVVSWDLGRDQPKAAIVAQPPTS 350
Query: 302 HDVDQREMFCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
D F + + P + ++++F+VD SGSM G P++ K + AL + P
Sbjct: 351 EGGDG--YFTLTVQPPEQVADEQAVARELVFVVDNSGSMGGLPMDTAKGLMRKALKDIRP 408
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
D+F ++ F+ S + AT++ +E ++ GG+ KA + +
Sbjct: 409 DDTFTVLRFSESASGLSNKLLPATQDNIEAGVDYVDAMQGMGGTQMTEG--IKAALRVPH 466
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ ++ +TDG + +E+ I + + + + R+++ G+G N Y L +A +
Sbjct: 467 DPDRLRVVMFLTDGYIGNEQAIFELIDDNIGDA-----RLFSLGVGGAPNRYLLDGMASV 521
Query: 481 SRGYYG-AAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
RG A YD + + +++ + R + VL ++ ID + L E+YP +IPDL + P
Sbjct: 522 GRGAVTYAGYD-EPADPVIERFYERVATPVLTDVEID-WQGLAVEEVYPGKIPDLFAGQP 579
Query: 540 LIVSGRYQGKFPD--TLKAKGFLGDLSNFV---VELKLQLAKDIP-LDRICAKQQIDLLT 593
+ V GRY G +KAK + + V + A D+ + + A+ +ID L
Sbjct: 580 ITVFGRYAGAPTGEIVIKAKARTAEGVETIELPVHFDVAKADDVEGVGSVWARTKIDALM 639
Query: 594 A-----QAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGT 641
W K ++ V +++Q ++ EYT + V+ A SP T
Sbjct: 640 GYPMRPDPWSPLGKETKQAVIDVALQHRIMTEYTAFVAVDERVVAGADGSPKT 692
>gi|404319573|ref|ZP_10967506.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi
CTS-325]
Length = 720
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 120/469 (25%), Positives = 201/469 (42%), Gaps = 81/469 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+PN+FT + I + I++ + Q + D FS+ VP P Y
Sbjct: 137 RPNVFTNAVANIGPHEKVVIQIEYQQAVRLSDERFSLRVPLVVAPRYNPDNAYPVVQEVE 196
Query: 219 ---------------------VTP-----AIKKIPKREKIHLNVNAGTGT------EVLC 246
VTP A++ P + L G +V
Sbjct: 197 MKNGWGKSSDTGKPDTYNAPLVTPLAPPAALRTNPVSISVKLKAGFPLGKVESLFHKVRI 256
Query: 247 NTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQ 306
+T++ +++ D + + VL+WS + D G+ + + +
Sbjct: 257 DTTNDATREITLD--GMAAADRDFVLEWSAVASDAPQV--------GLFREH-----IGK 301
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
+ Y+ P S K +++VIF++D SGSM G +E K +L ALS+L PGD FN+
Sbjct: 302 DDYVLAYVTPPALASPKKVQREVIFVIDNSGSMGGTSIEQAKASLDYALSQLQPGDRFNV 361
Query: 367 VAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
+ F+ T F S++ A +E + A +++ + A G T + L A++ G
Sbjct: 362 IRFDDTLTKFFEDSVD-ANQENITSARRFV-TSLEAQGGTEMLPALHAALDDSNQGNGLR 419
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
I+FL TDG + +E+Q+ DA+ +R G S RI+ GIGS N Y + A + RG +
Sbjct: 420 QIVFL-TDGEISNEQQLLDAVAAR--RGRS---RIFMVGIGSAPNSYLMNRAAELGRGTF 473
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIV 542
+ +M+ LF + L A+ LK M PS +PDL PL++
Sbjct: 474 THIGSAAEVNERMRALFDK-----LEKPAVTDLKTSFSEKNVSMTPSLLPDLYRGEPLVI 528
Query: 543 SGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQID 590
+ R GK L +G + D + V L L A + + ++ A+++ID
Sbjct: 529 AARM-GKAIGNLAIEGQI-DGRPWTVNLPLDQAINAEGISKLWARRKID 575
>gi|220925364|ref|YP_002500666.1| LPXTG-motif cell wall anchor domain-containing protein
[Methylobacterium nodulans ORS 2060]
gi|219949971|gb|ACL60363.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium
nodulans ORS 2060]
Length = 725
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 109/436 (25%), Positives = 188/436 (43%), Gaps = 76/436 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT----------- 220
+PN+FT ++ I G + +++ + Q + G ++ +P P Y
Sbjct: 158 RPNLFTNSVANIGPGETVLVQIAYQQPVRLSGGTHALRIPLVVAPRYNPAPAPVTPAAEG 217
Query: 221 PAIKKIPKREKI---------HLNVNAGTGTEVL--------CNTSSHHLKQLRRDVGKL 263
+P R +I H N T T L +++H ++ V +
Sbjct: 218 AGADPVPDRARITPPVLDPAEHAPANPVTLTVTLQAGFPLGSVQSATHPIR-----VEET 272
Query: 264 GYSYESEVLK----WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL---- 315
G L ++ D + ++T +P+ ++P++ +R YLL
Sbjct: 273 GPESRRVTLADGPVPADRDIELTWTAAPA--------RAPAIGLFRERVGTDEYLLAVVT 324
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
P ++ +DV F++D SGSM G + K +L +AL +L P D FN++ F+
Sbjct: 325 PPEGQNLARRPRDVTFVIDNSGSMAGASMRQAKASLLMALDRLAPADRFNVIRFD----- 379
Query: 376 FSTSMELATKEAV--ERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRG-SIPII 428
+M+ EAV + H + +F+A G T + APLT A+ T R + I
Sbjct: 380 --NTMDQLFPEAVPADERHLAVARSFVAALEARGGTEMLAPLTAALADPTPERTDRVRQI 437
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
+TDGA+ +E QI A+ + G R++ GIGS N + + A + RG Y A
Sbjct: 438 VFLTDGAIGNEEQIFSAIAA-----GRGRSRLFMIGIGSAPNAHLMTYAAELGRGSYTAI 492
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDE--FEMYPSRIPDLSSESPLIVSGRY 546
+D + +M++L T+ S V+ D E E P +PDL P++++ R
Sbjct: 493 GTIDQVAERMRELLTKLESPVV----TDLTASFSEPGVEATPRLLPDLYRGEPVVLAARL 548
Query: 547 QGKFPDTLKAKGFLGD 562
G+ TL +G +GD
Sbjct: 549 -GRSAGTLTLRGRIGD 563
>gi|333985506|ref|YP_004514716.1| Vault protein inter-alpha-trypsin domain-containing protein
[Methylomonas methanica MC09]
gi|333809547|gb|AEG02217.1| Vault protein inter-alpha-trypsin domain-containing protein
[Methylomonas methanica MC09]
Length = 668
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 121/518 (23%), Positives = 227/518 (43%), Gaps = 65/518 (12%)
Query: 68 PSYQPYVHGRCDPPSL--IPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRI 125
PS P + P+L +PL+ + E+ ++ + A V V+ +R G++ + R
Sbjct: 31 PSLAPRLWIPDGDPALDQLPLKDSRAEVRINGPI--AQVTVTQHYRN---AGTRPINARY 85
Query: 126 AVPMGDQGSILG---------VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNI 176
P + ++ G VEA+I K+ Q+ + AGK A++ +PN+
Sbjct: 86 VFPGSTRAAVHGLTMKIGERLVEAKIKEKAEAKQIFEQAKT--AGKRAALLEQQ--RPNV 141
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP--------- 227
F + + G + + L +S+ L G + F +P V P P
Sbjct: 142 FMMDTANLMPGDDIELVLTYSELLVPEQGIYE----FVYPTVVGPRYGSDPFQVAADAAW 197
Query: 228 -----------------KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESE 270
+ I +++++ + L +T H ++ D S +
Sbjct: 198 FANPYAANRQDGGNPAATQTSIQVSLDSPIPVQDLISTQ-HKIQTDWHDAQSADISLDPA 256
Query: 271 VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVI 330
N DF + + I G+L S D+ M P ++ ++ +++ +
Sbjct: 257 ETLAGNRDFILRFRLQDQQILSGLLTYSQD----DENYFLTMAQPPARVQAGQIMRREYL 312
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
FI+D+SGSM G PL+ K+ + L+ L P +SFNI+ F+G + + S + AT V++
Sbjct: 313 FILDVSGSMHGFPLDTAKSLMQQLLNDLKPHESFNILFFSGGSNILSPTPLPATPANVQQ 372
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A + N+ GG T + A L A M ++ + I+ +VTDG V ER+ D + L
Sbjct: 373 ALATMQ-NYRGGGGTELIAALESAYSMPRDADTARSIV-VVTDGYVGVEREAYDLIGKNL 430
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS-V 509
N ++ FGIGS N Y + +A G + + + ++ F R + +
Sbjct: 431 HNTN-----LFAFGIGSSVNRYLIESMAKAGAGEPFVVTEQNQVA-EVGNRFRRYVEAPL 484
Query: 510 LANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
L+NI + + ++ +++ P+ IP + +E P++V G+Y+
Sbjct: 485 LSNIRLQG-EGVELYDLEPADIPLMLAERPIVVFGKYR 521
>gi|427416208|ref|ZP_18906391.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Leptolyngbya sp. PCC 7375]
gi|425758921|gb|EKU99773.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Leptolyngbya sp. PCC 7375]
Length = 831
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 157/361 (43%), Gaps = 15/361 (4%)
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTAKSRKVF-KKDVIFIV 333
N D Y V+ +L QS D+R F +Y++P A + KD++F++
Sbjct: 254 NKDLIVRYQVATETTQTSLLTQS------DERGGHFAVYMIPALAYDTDAYIPKDMVFLI 307
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
D SGS G PL + + + L P D+FNIV F T S T ++A
Sbjct: 308 DTSGSQSGAPLAQCQALMRRFIEGLHPQDTFNIVNFANTTQQLSQVPLGNTSRNRQQALN 367
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
++ AGG T + L + + + I I L+TDG + +E QI ++ L
Sbjct: 368 YVD-RLRAGGGTEMLRGLKTVLNLPQPNSERIRNIVLLTDGYIGNETQIFAEVQQSL--- 423
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI 513
+ R+++FG GS N + L +A + RG + I+ + K F + + VL N+
Sbjct: 424 -GLATRLHSFGAGSSLNRFLLNRVAEMGRGISQVVRHDEGIDRIVDKFFDQINNPVLGNL 482
Query: 514 AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ 573
+ + +YP +PD+ +E PLI+ GR K P TL G F L +
Sbjct: 483 SAHWEGAGETPSLYPMAVPDVFAEHPLILFGRKTDKQPGTLHLSGIAAGGKTFKQSLAVA 542
Query: 574 L--AKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDE 631
+ + ++ + +I L Q E + V++ ++ +L +YT + V +
Sbjct: 543 FDSTGNPAIAQLWGRSRIKELMNQMVSGETTHGVQAVTETALAYQLLSQYTAFVAVSEES 602
Query: 632 R 632
R
Sbjct: 603 R 603
>gi|108758937|ref|YP_629042.1| von Willebrand factor A [Myxococcus xanthus DK 1622]
gi|108462817|gb|ABF88002.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK
1622]
Length = 860
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 155/316 (49%), Gaps = 20/316 (6%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K++V+F+VD+SGSM G+ L + AL + L L GD FN++AF F T+
Sbjct: 281 KQEVVFVVDVSGSMAGESLPQAQAALRLCLRHLREGDRFNVIAFENRFQSFQPEPVPFTQ 340
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+E A +W+ A G T + AP+ AV+ + +I L+TDG V +E +I A
Sbjct: 341 RTLEEADRWVAA-LNADGGTELLAPMRAAVQAAPDG-----VIVLLTDGQVGNEAEILRA 394
Query: 446 -MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+++R T R+Y+FGIG+ + LR +A + G + + I+ ++ F+R
Sbjct: 395 VLEARKTA------RVYSFGIGTNVSDVLLRDMAKQTGGDVEFIHPGERIDDKVVAQFSR 448
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLS 564
+ + + + ++ E+ P+ +P L P + GRY T+ +G G
Sbjct: 449 ALAPRVTELEVH-FDGVECAELAPAELPPLVDGVPWTLLGRYASPGTGTVTLRGRSGQEP 507
Query: 565 -NFVVELKLQLAKDIP-LDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEY 621
+ V + L D P ++++ A ++I AQ + +++++ +++V ++ Y
Sbjct: 508 FSLTVRVDLPAVSDRPAVEKLWAAERIRGWEAAQLVGRRAESMKQRIVELAVAHQIVTRY 567
Query: 622 TRMIIVET---DERNN 634
T ++VE D RN+
Sbjct: 568 TSFVVVEERKGDRRNS 583
>gi|83643000|ref|YP_431435.1| von Willebrand factor type A (vWA) domain-containing protein
[Hahella chejuensis KCTC 2396]
gi|83631043|gb|ABC27010.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Hahella chejuensis KCTC 2396]
Length = 733
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 125/537 (23%), Positives = 221/537 (41%), Gaps = 89/537 (16%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV 219
GK A+V +PN+FT + I G +++ LR+ Q +SYR G FS+ +P P Y+
Sbjct: 143 GKRAAVVHQQ--RPNLFTHKVANIGPGEMITLTLRYQQNISYRSGAFSLALPLTVTPRYI 200
Query: 220 -----TPAIKKIPK---------------------------------------------- 228
TPA + K
Sbjct: 201 PGSSATPAWSEEDKTRLRNELRSHERTLINDSGWAPATNQVADAPEITPPTVAKEDLLTP 260
Query: 229 ----REKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYT 284
R ++HLN G V S H Q + GY E K +D DF+ T
Sbjct: 261 SHQARIRVHLN----PGLPVESIESVTHRIQWTQQTN--GYEVSLESNKDVPMDKDFTLT 314
Query: 285 VSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGK 342
+ G ++ ++ +++ LL P + +++I++VD SGSM+G
Sbjct: 315 W---RVRQGSEPEAALFKEIVGDDVYAQLLLMPPQFSDEGLSLPRELIWVVDTSGSMEGV 371
Query: 343 PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAG 402
++ ++A+ AL L P D FN++ FN A + A+++A +++ A
Sbjct: 372 SIQQARDAVLQALDTLTPRDRFNVIEFNSHARKLFPQAVPAQERALQQARRFVR-GLKAD 430
Query: 403 GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYT 462
G T I L +A+ G + + +TDG+V +E + + +L + R++T
Sbjct: 431 GGTEIAEALDRALSDAA-PEGYVRQVVFLTDGSVGNELALFKQIDQQLGDS-----RLFT 484
Query: 463 FGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLD 522
GIG N +F+R A RG Y D + ++ +L L ++ +D L+
Sbjct: 485 VGIGPSPNRFFMRKAAQFGRGAYSHINDTAEVSDKIAELTAALRQPALRDVRLDVQSALN 544
Query: 523 EFEMYPSRIPDLSSESPL-----IVSGRYQGKFPDTLKAKGFLGDLSN----FVVELKL- 572
E+YP IPDL P+ + G + P +++ G +G L + ++ L L
Sbjct: 545 A-EVYPVAIPDLYRGEPVQLLFKVEDGAAASELPASIQGYG-VGLLQDEQPLWMRSLDLQ 602
Query: 573 QLAKDIPLDRICAKQQID-LLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
Q A + R A+ +ID L ++ ++ + + V +++ ++ YT + VE
Sbjct: 603 QAAPSQGVARAWARHKIDHWLDRKSMGEPEENVRDVVLPLALDFALMSPYTSFVAVE 659
>gi|296185113|ref|ZP_06853523.1| von Willebrand factor type A domain protein [Clostridium
carboxidivorans P7]
gi|296049947|gb|EFG89371.1| von Willebrand factor type A domain protein [Clostridium
carboxidivorans P7]
Length = 815
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 133/535 (24%), Positives = 224/535 (41%), Gaps = 61/535 (11%)
Query: 128 PMGDQGSILGVEAEISGKSYHT------QLIALGEN--DGAGKSASVETGSFLKPNIFTL 179
P+ I G EAEI GK+ T + I + EN D + +E LKP+IF +
Sbjct: 54 PIPKTAIISGFEAEIGGKTLKTIVEEKEKAIKIYENAKDRGDNTVFLEE---LKPHIFKI 110
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPF-----------KFPEYVTPAIKKIPK 228
+ +I G + I+L + +L Y+D +F + +P K + KI K
Sbjct: 111 AVGKIVSGETVKIKLSYIDELQYKDKKFKLTIPAIYEPTKMDENSKLNNLKNTLVNKIVK 170
Query: 229 R--EKIHLNVNAGTGTEVLCN---TSSHHLKQLRRD---VGKLGYSYESEVLKWSNIDFD 280
+ E E LC S +H ++ R+ V K+ + E + D D
Sbjct: 171 KKSEDSEFQFKTNIIVESLCKLDFESPYHKLKVEREGDTVAKINLQDDYEAM-----DKD 225
Query: 281 FSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQ 340
F + + + + R + + ++P K+ +F++DIS +M+
Sbjct: 226 FILIMKEKDPLEADGMIYEYKENGEDRRVVYLRMIPKLDDYEMEDKESYVFLIDISDTMK 285
Query: 341 GKPLEDTKNALAVALSKLDPGDSFNIVAFNGET--YLFSTSMELATKEAVERAHQWIGIN 398
G LE KNAL + + L+ GD+F+I+A GET Y + M E +++A QWI N
Sbjct: 286 GDKLEQAKNALQMCIRNLEEGDTFDIIAM-GETLKYFWDEGMAEFNSETLKKASQWIE-N 343
Query: 399 FIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
+I + ++E + G I + TD VE+E +I D ++ L +
Sbjct: 344 LTTEDDADIFGAIKYSLE----NEGGHNTILIFTDDEVEEEDEILDYVRENLGDN----- 394
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
RI+TFG S N+YFL LA S G IE + + F R + + +I ID
Sbjct: 395 RIFTFGFDSETNNYFLNKLAHESFGKAEFINKGRRIEYVVLRQFKRIQNPEVDDIQID-W 453
Query: 519 KDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI 578
++ YP I + + G+ D + G +GD +++ + +
Sbjct: 454 GNMKVQSTYPRTIDYMYDREAFSIFAYVIGEVSDEIVISGKVGD-KDYI--------RKV 504
Query: 579 PLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
LD + +LL + W KR++ M Q G + E R ++E ++N
Sbjct: 505 DLDNFNTEDNANLL-KKMWAR--KRIKSIEMNMKSQRGDVKEAMRQKVIELSKKN 556
>gi|427708648|ref|YP_007051025.1| Vault protein inter-alpha-trypsin domain-containing protein [Nostoc
sp. PCC 7107]
gi|427361153|gb|AFY43875.1| Vault protein inter-alpha-trypsin domain-containing protein [Nostoc
sp. PCC 7107]
Length = 821
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 122/511 (23%), Positives = 202/511 (39%), Gaps = 69/511 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA-IKKIPKR-- 229
+ NIFT +L I G + + +R+S+ L + G N F FP V P + IP
Sbjct: 113 RDNIFTQSLANIQPGEQIDVIIRYSESLKFTAG----NYEFVFPMVVAPRYVPGIPIEGN 168
Query: 230 --------------------------------------EKIHLNVNAGTGTEVL-CNTSS 250
I++ + G E+ + S
Sbjct: 169 TGGVGSATAPMMQNQDTDIVPDGSQLNAPILPSGMRSPHDINVTIEIDAGVEIQDVQSPS 228
Query: 251 HHLK---QLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR 307
H L+ Q +R + KL K N D Y V+ VL Q+ D+R
Sbjct: 229 HQLQIAYQGKRVLVKLAGGD-----KIPNKDLILRYQVAGDSTQATVLTQA------DER 277
Query: 308 E-MFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
F +YL+P + +V KDV+F++D SGS G PL + + ++ L+ D+F+
Sbjct: 278 GGHFALYLIPALQYRQDQVVPKDVVFLIDTSGSQMGAPLMQCQELMRRFINGLNLDDTFS 337
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
I+ F+ T S + RA +I A G T + + + G +
Sbjct: 338 IIDFSDTTRQLSPVPLANNSQNRTRAINYIN-QLSANGGTEMLRGIRAVLNFPVTDPGRL 396
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
I L+TDG + +E QI ++ L G R+Y+FG GS N + L +A + RG
Sbjct: 397 RSIVLLTDGYIGNENQILSEVQQHLQPGN----RLYSFGAGSSVNRFLLNRIAELGRGVS 452
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ + + K + + + VLANI + D D +YP PDL +E PL++ G+
Sbjct: 453 RIIRHDEPTDEVVDKFYRQINNPVLANINLQWEGDGDSPIIYPCTPPDLFAEQPLVLFGK 512
Query: 546 YQGKFPDTLKAKGFL--GDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKR 603
L G + G + + + + ++ + +I L + K
Sbjct: 513 KPDGRGGKLHITGIVAGGTRYQHTFNMNFEETGNSGIAQLWGRSRIKELMNLMVSGDTKL 572
Query: 604 LEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
E V+ ++ +L +YT + V D R N
Sbjct: 573 GVEAVTDTALTYQLLSQYTAFVAVSDDVRVN 603
>gi|365898821|ref|ZP_09436757.1| conserved hypothetical protein [Bradyrhizobium sp. STM 3843]
gi|365420423|emb|CCE09299.1| conserved hypothetical protein [Bradyrhizobium sp. STM 3843]
Length = 754
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 126/530 (23%), Positives = 228/530 (43%), Gaps = 82/530 (15%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV 219
GK+A++ +PNIFT ++ I G + +++ + + ++ EFS+ VP P Y
Sbjct: 158 GKTAALTEQE--RPNIFTNSVANIGPGETVLVQIEYQEPVAQNGNEFSLRVPLVVGPRYN 215
Query: 220 TPAIKK------------------IPKREKI----------------HLNVNAGTGTEVL 245
I + +P R++I + V G VL
Sbjct: 216 PAPIVQSVELRPGGNGWGAASNDPVPDRDRISPPALDPAENAPVNPTRITVRLQAGF-VL 274
Query: 246 CNTSSHH--LKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSL-- 301
SHH +K D + V+ D DF T PS +PS+
Sbjct: 275 GEVKSHHHQVKIESPDATTRVVTLADGVVP---ADRDFELTWKPSD------ETTPSVGL 325
Query: 302 ---HDVDQREMFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
H D + P A +++ +DVIF++D SGSM G ++ K +L AL +
Sbjct: 326 FREHVGDADYLLAFVTPPAVAPAAQQPLPRDVIFVIDNSGSMGGTSIQQAKASLLYALGR 385
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
L P D FN++ F+ + S A E V A ++ A G T + + A+
Sbjct: 386 LQPTDRFNVIRFDDTMTVLFPSAVPADAEHVGSATGFVS-GLEARGGTEMVPAMRAALTD 444
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
++ + + +TDGA+ +E+Q+ + AM+ R RI+ GIGS N Y +
Sbjct: 445 ESSEADHVRQVVFLTDGAIGNEQQLFETITAMRGR--------SRIFMVGIGSAPNTYLM 496
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRI 531
A + RG + ++ +E +M+ LF + L N A+ L ++ P+ I
Sbjct: 497 TRAAELGRGAFTHIGSVEQVEERMRGLFAK-----LENPAVTGLTASFSDASADVTPAII 551
Query: 532 PDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIPLDRICAKQQI- 589
PD+ + PL+++ + K +L+ KG +GD + V L L A+ L ++ A+++I
Sbjct: 552 PDVYRDEPLVLAAKLD-KLEGSLRIKGRIGD-RPWTVTLPLANAAEGKGLSKLWARRKIS 609
Query: 590 --DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
++ + D+ ++ + K++++ ++ T ++ V+ R A E
Sbjct: 610 DAEVARTMRQITPDE-ADKAILKLALEHQLVTRLTSLVAVDQTPRRPAGE 658
>gi|148252253|ref|YP_001236838.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1]
gi|146404426|gb|ABQ32932.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1]
Length = 755
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 113/442 (25%), Positives = 191/442 (43%), Gaps = 58/442 (13%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
GK+A++ +PNIFT +L I G + +++ + Q ++ DG FS+ VP
Sbjct: 160 GKTAALTEQE--RPNIFTNSLANIGPGETVLVQIEYQQPVAQVDGAFSLRVPLVVAPRYN 217
Query: 221 PAI-------------------KKIPKREKIHLNVNAGTGTE---------------VLC 246
PA +P R++I V T+ L
Sbjct: 218 PAPIVQSVDLRPGSDGWSAASNDPVPDRDRISPEVLDPAKTDPVNPTKITVRLQAGFALG 277
Query: 247 NTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHI-FGGVLLQSPSLHDVD 305
SHH Q+ D + + D DF T P+ + V L + D D
Sbjct: 278 EVKSHH-HQVTIDSPDAKTRIVTLAEGVAPADRDFELTWKPASVAMPSVGLFHEQVGDAD 336
Query: 306 QREMFCMYLLPGTAKS-RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
+ P A S ++ +DVIF++D SGSM G + K +L AL +L P D F
Sbjct: 337 Y--LLAFITPPAVAASAQRPQPRDVIFVIDNSGSMGGTSIRQAKASLLYALGRLQPNDRF 394
Query: 365 NIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG 423
N++ F+ T LF +S+ A E V A ++ GG+ + A + ++S
Sbjct: 395 NVIRFDDTMTVLFPSSVP-ADAEHVGSATSFVSALEARGGTEMVPAMRAALTDDGSDSDR 453
Query: 424 SIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
++FL TDGA+ +E+Q+ + AM+ R RI+ GIGS N Y + A +
Sbjct: 454 VRQVVFL-TDGAIGNEQQLFETITAMRGR--------SRIFMVGIGSAPNTYLMTRAAEL 504
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + ++ +E +M+ LF + + V+ + T + ++ P+ +PD+ PL
Sbjct: 505 GRGAFTPIGSVEQVEERMRDLFAKLENPVVTGLTA-TFSEASA-DLTPAVLPDVYRNEPL 562
Query: 541 IVSGRYQGKFPDTLKAKGFLGD 562
+++ R + +L+ K +GD
Sbjct: 563 VLAARID-RLAGSLQLKARIGD 583
>gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4]
gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella
loihica PV-4]
Length = 776
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 124/548 (22%), Positives = 230/548 (41%), Gaps = 95/548 (17%)
Query: 159 GAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEY 218
+GK AS+ + +PNIFT + + G L + + + Q++ Y+DGEFS +FP
Sbjct: 178 ASGKKASLVSQQ--RPNIFTSEVANLGPGEALVVEIAYQQQVRYQDGEFS----LRFPTA 231
Query: 219 VTPAIKKIPK---------------------------REKIHLNVNAGTGTEV-LCNTSS 250
+TP + PK +K++L+V G + T
Sbjct: 232 ITP--RYFPKGQVPDLEQASVNDIQGLNVLNESTQSDEQKLYLDVVLDAGMAISRLETPY 289
Query: 251 HHLKQLRRDVGKLGYSYESEV-------LKWSNI----------------------DFDF 281
H ++Q + K+G + + + LKW + +F
Sbjct: 290 HQMRQTQLGGTKIGLNLTANLRPDRDFLLKWRPLLAEQPSAVMFAQLGKTHEFKTHEFKN 349
Query: 282 SYTVSPSHIFGGVLLQS---PSLHDVDQREMFCMY----LLPGTAKSRKVFKKDVIFIVD 334
+++ S ++ P+ +E Y L+P K+R +++ ++D
Sbjct: 350 EESLASSQAHNDQVVSEANHPAEAQASDKEAKDSYALVMLMPPQDKARVRLPRELTLVID 409
Query: 335 ISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQW 394
SGSM G + K+A+ AL+ L D+FN++AF+ S AT + +A+ +
Sbjct: 410 TSGSMTGDSIAQAKSAILNALAGLGSQDTFNVIAFDSSVRSLSPVALSATAANLGKANLF 469
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI---------IFLVTDGAVEDERQICDA 445
+ + A G T + L +A+ + SI + +TDGAV +E +
Sbjct: 470 VQ-SLEADGGTEMAPALLRALSQPESGVSSISSAVKPERLKQVVFITDGAVGNEASLFAL 528
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+ + + R++T GIG+ N YF+ A RG Y + ++ ++ +L +
Sbjct: 529 IAANIGRQ-----RLFTVGIGAAPNGYFMERAARAGRGTYTYVGKISEVDAKIGELLEKI 583
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY---QGKFPDTLKAKGFLGD 562
S ++++ + TL D + +P +I DL + P++V+ R Q D L G L D
Sbjct: 584 ESPQISDVTL-TLDDGSIPDYWPVQIGDLYAHEPIMVALRLTPAQRSRSDALIISGDL-D 641
Query: 563 LSNFVVELKLQLAKDIP--LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCE 620
N+ L L D P +D I A+ QI L ++E V+++++ ++
Sbjct: 642 GKNWQRRLAL-TGGDKPRGIDLIWARNQIASLQLSKNAKNQAAIKESVTRLAMDYHLVSP 700
Query: 621 YTRMIIVE 628
YT ++ V+
Sbjct: 701 YTSLVAVD 708
>gi|163850829|ref|YP_001638872.1| vault protein inter-alpha-trypsin subunit [Methylobacterium
extorquens PA1]
gi|163662434|gb|ABY29801.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium
extorquens PA1]
Length = 732
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 121/513 (23%), Positives = 223/513 (43%), Gaps = 81/513 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEY--VTPAI------ 223
+PN+FT + I G + +++ + Q + G +++ +P P Y PA+
Sbjct: 164 RPNLFTNAVANIGPGETVLVQIEYQQPVRSSAGTYALRLPTVAAPRYSPAPPAVMAVVER 223
Query: 224 ---KKIPKREKI---------HLNVNAGTGTEVL--------CNTSSHHLKQLRRDVGKL 263
+P RE I H +N T T L +++H ++ R +
Sbjct: 224 GAADPVPDRETIAAPVLDPARHAPINPLTLTIDLKAGFTLGQVRSATHAVRIEERSASER 283
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR-----EMFCMYLLPGT 318
+ ++ ++ DF+ ++ +P ++PS+ +R + + P +
Sbjct: 284 RITL-ADGATAADRDFELTWNAAPG--------EAPSIGLFRERVAGAEAVLAVVTPPES 334
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFST 378
A S +DV+F++D SGSM G + K +L + L +L D FN++ F+ + F T
Sbjct: 335 ASSAASVPRDVVFVIDNSGSMGGASMRQAKASLLIGLDRLGAHDRFNVIRFD---HSFDT 391
Query: 379 SM-ELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLT-NSRGSIPIIFLVT 432
+L +A H +F+AG G T + APL A+ T G + + +T
Sbjct: 392 LFPDLVPADA---GHLMRAKSFVAGLQASGGTEMLAPLQAALRGATPEETGRLRQVVFLT 448
Query: 433 DGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
DGA+ +E QI A+ + G S R++ GIGS N Y +R A + RG + D
Sbjct: 449 DGAIGNEAQIFSAIATE--RGRS---RLFMVGIGSAPNGYLMRHAAELGRGSFTQIDTPD 503
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+ +M+ L + S + ++ T + ++ P+R+PDL PL +S R G+
Sbjct: 504 QVTERMRALLVKLESPAVTDLTA-TFSE-PGIDVTPARLPDLYRGEPLTLSARM-GQARG 560
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDL---------LTAQAWFSEDK 602
TL G +G + +L L A+D + ++ A+ +I LTA+A
Sbjct: 561 TLTLTGRIGG-QPWQTQLHLDAAQDGTGIGKLWARAKIAEAETARLTGGLTAEA------ 613
Query: 603 RLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ + ++++ G+ T ++ V+ R A
Sbjct: 614 -ADAAILRLALDHGLTTRLTSLVAVDATPRRPA 645
>gi|365884973|ref|ZP_09423998.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 375]
gi|365286415|emb|CCD96529.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 375]
Length = 755
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 114/471 (24%), Positives = 211/471 (44%), Gaps = 62/471 (13%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV 219
GK+A++ +PNIFT +L I G + +++ + Q ++ GEFS+ VP P Y
Sbjct: 160 GKTAALTEQE--RPNIFTNSLANIGPGETVLVQIEYQQPVAQVAGEFSLRVPLVVAPRYN 217
Query: 220 TPAIKK------------------IPKREKI----------------HLNVNAGTGTEVL 245
I + +P R++I + V G +
Sbjct: 218 PKPIVQSVDLRPDSNGWGAASNDPVPDRDRISPEVLDPAKNDPVNPTRITVRLQAGFALG 277
Query: 246 CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVD 305
S HH + + +E + ++ DF+ ++ + S V L + D D
Sbjct: 278 EVKSHHHQVTIDSPDARTRVVTLAEGVVPADRDFELTWKPA-SEKMPSVGLFHEQVGDAD 336
Query: 306 QREMFCMYLLPGT--AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
++ P A +++ +DVIF++D SGSM G + K +L AL +L PGD
Sbjct: 337 ---YLLAFVTPPAVAATTQRPQPRDVIFVIDNSGSMGGTSIRQAKASLLYALGRLQPGDR 393
Query: 364 FNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
FN++ F+ T LF +S+ A E V A ++ + A G T + + A+ +
Sbjct: 394 FNVIRFDDTMTVLFPSSVP-ADAEHVGSATSFVS-SLDARGGTEMVPAMRAALTDDGSDS 451
Query: 423 GSIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+ + +TDGA+ +E+Q+ + AM+ R RI+ GIGS N Y + A
Sbjct: 452 DRVRQVVFLTDGAIGNEQQLFETITAMRGR--------SRIFMVGIGSAPNTYLMSRAAE 503
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+ RG + ++ +E +M+ LF + + V+ + T + ++ P+ +PD+ P
Sbjct: 504 LGRGAFTHIGSVEQVEERMRDLFAKLENPVVTGLTA-TFSEASA-DLTPAVLPDVYRNEP 561
Query: 540 LIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQI 589
L+++ + + +L+ KG +GD + + L L A + L ++ A+++I
Sbjct: 562 LVLAAKID-RLAGSLQLKGRIGD-QPWTITLPLSSAAEGKGLSKLWARRKI 610
>gi|291299883|ref|YP_003511161.1| Vault protein inter-alpha-trypsin domain-containing protein
[Stackebrandtia nassauensis DSM 44728]
gi|290569103|gb|ADD42068.1| Vault protein inter-alpha-trypsin domain protein [Stackebrandtia
nassauensis DSM 44728]
Length = 831
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 145/606 (23%), Positives = 244/606 (40%), Gaps = 102/606 (16%)
Query: 93 LDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILG---------VEAEIS 143
LD++C++ VR T R H G + + P+ ++ ++ V AE+
Sbjct: 39 LDINCHVTGLGVRTVVTQRFHNPHG-EPIEATYIFPLPERAAVTDMTMTVAERTVTAELH 97
Query: 144 GKSYHTQL--IALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLS 201
++ QL A+ E GK AS+ + ++FT+ + + G + L L+
Sbjct: 98 ERAKARQLYDTAISE----GKRASIAEAE--RADVFTMRVGNLGAGEEAVVTLTLVGPLA 151
Query: 202 YRDGEFSVNVPFKF-PEYV-------TPA-------IKKIPKREKIHLNV-------NAG 239
+ D E ++ +P P Y+ P +P +I V
Sbjct: 152 FEDNEATLRLPLVVAPRYIPGQPTGAAPVGEGYAEDTDAVPDASRITPPVLLPGFPNPVR 211
Query: 240 TGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWS-------NIDFDFSYTVSPSHIFG 292
EV + + L+QLR + + EV + N DF + S
Sbjct: 212 LSIEVTIDPAGLPLRQLRSSLHAVTVDETGEVTRVRIEPGERVNRDFILRFDYGESGDVA 271
Query: 293 GVLLQSPSLHDVDQRE----MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTK 348
G LL +P D+ E F + +P + R +DV+ ++D SGSM G + +
Sbjct: 272 GSLLTAP-----DENEPTSGTFQLTAIPPSDLPR-ARPRDVVVLLDRSGSMGGWKMVAAR 325
Query: 349 NALAVALSKLDPGDSFNIVAFNGETYLFS-TSMELATKEAVERAHQWIGINFIAG----G 403
A A + L D F + F+ T + S ++ A +++ + +AG G
Sbjct: 326 RAAARIVDTLSSADRFAVRCFD--TAMTSPEGLDPNGLSAGTDRNRFRAVEHLAGTETRG 383
Query: 404 STNICAPLTKAVEMLT-NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYT 462
T+I PL+ AV++LT +G +I LVTDG V +E QI + RL+ R++
Sbjct: 384 GTDILKPLSTAVDLLTAGEKGRDRVIILVTDGQVGNEDQILRELTGRLSGM-----RVHV 438
Query: 463 FGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK-DL 521
GI N FL LA++ RG D ++ + R + V+ ++ + DL
Sbjct: 439 VGIDKAVNAGFLHRLALVGRGRCELVESEDRLDEATAHIHRRIVAPVVTDLTVTGEGLDL 498
Query: 522 DEFEMYPSRIPDLSSESPLIVSGRYQGK-FPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+ + P RIPDL + +PLI+SGRY G LK G D + + EL
Sbjct: 499 EPETLAPHRIPDLFTGAPLIISGRYHGAGTTPRLKLTGTSQDGTPWTSEL---------- 548
Query: 581 DRICAKQQIDLLTAQAWFSEDKR-----------------LEEKVSKMSVQTGVLCEYTR 623
A+ LT AW R LE+++ +S++ VL +T
Sbjct: 549 ---AARTDDTALTCPAWARAHLRDLEDRYASAPGSPDLGDLEKRIVDVSLRHRVLSRFTS 605
Query: 624 MIIVET 629
+ V++
Sbjct: 606 FVAVDS 611
>gi|386847940|ref|YP_006265953.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Actinoplanes sp.
SE50/110]
gi|359835444|gb|AEV83885.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Actinoplanes sp.
SE50/110]
Length = 887
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 134/550 (24%), Positives = 233/550 (42%), Gaps = 71/550 (12%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQL----IALGEND---GAGKSASVETGSFLKPNIFTLT 180
P+ D+ +++G+ G++ +L A + D AG+ AS+ +P++FT+
Sbjct: 73 PLPDRAAVVGMTMTADGRTVTAELQERAAAREQYDQAVAAGQRASIAEEE--RPDVFTMR 130
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV--------------TPAIKK 225
I G + +RL + + DG + P P Y+ T
Sbjct: 131 AGNILAGERIVVRLSLVGPMPFEDGAATYRFPLVVAPRYIPGAPLPGPYAGDGQTADTDA 190
Query: 226 IPKREKIH-----------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLK- 273
+P +I L ++ G V + + L ++R + + S E L+
Sbjct: 191 VPDASRISPPVLLPGFPNPLRLSLG----VTVDAAGLELGEVRSSLHAI--SEEGGTLRI 244
Query: 274 ----WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTAKSRKVFKKD 328
++ DF +P G + P D D E + + +LP A KD
Sbjct: 245 APGERADRDFVLRLAYAPG---GNTAVAVP---DNDGEEGTYQIVILP-PAGDVPPRPKD 297
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
V+ ++D SGSM G + + A A + L D F ++ F+ + +E EA
Sbjct: 298 VVLLLDRSGSMGGWKMVAARRAAARVIDTLTGADRFAVLTFDHQVER-PDGVEPGLSEAT 356
Query: 389 ERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+R +++ I +A G T + PLT + +L +S G ++ LVTDG V +E QI
Sbjct: 357 DR-NRFRAIEHLARADARGGTELLTPLTTGLGLLADSTGRDRVLVLVTDGQVGNEDQIVH 415
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ + GS R++T GI N FL LA + G D ++ M+ + R
Sbjct: 416 EVTPLI---GST--RLHTIGIDRAVNAGFLGRLAALGAGRAELVESEDRLDEAMEHIHRR 470
Query: 505 GFSSVLANIAIDTLKDLDEFE--MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD 562
+ V+ +++ T D+ E P R+P L PL+V+GRY G T G D
Sbjct: 471 IGAPVVTGLSL-TADDVTALEGTRSPQRLPGLYPGVPLVVTGRYSGSTAGTFTVTGTTRD 529
Query: 563 LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYT 622
F + L+ + + + A+ ++ L A+ + D LE+++ + S++ GVLC +T
Sbjct: 530 EQAFQATIPLRERAEPAVTKQWARARLRDLE-DAFAAGDHTLEQQIVETSLRFGVLCRFT 588
Query: 623 RMIIVETDER 632
+ V DER
Sbjct: 589 AYVAV--DER 596
>gi|365886018|ref|ZP_09424993.1| conserved hypothetical protein [Bradyrhizobium sp. STM 3809]
gi|365338520|emb|CCD97524.1| conserved hypothetical protein [Bradyrhizobium sp. STM 3809]
Length = 755
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 109/443 (24%), Positives = 196/443 (44%), Gaps = 60/443 (13%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV 219
GK+A++ +PNIFT +L I G + +++ + Q ++ GEFS+ VP P Y
Sbjct: 160 GKTAALTEQE--RPNIFTNSLANIGPGETVLVQIEYQQPVAQVAGEFSLRVPLVVAPRYN 217
Query: 220 TPAIKK------------------IPKREKIHLNVNAGTGTEVLCNT------------- 248
I + +P R++I V T+ + T
Sbjct: 218 PKPIVQSVDLRPYSDGWGAASNDPVPDRDRISPEVLDPAKTDPVNPTRITVRLQAGFALG 277
Query: 249 ---SSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVD 305
S HH + K +E + ++ DF+ S+ + S V L + D D
Sbjct: 278 EVKSHHHQVTIESPDAKTRVVTLAEGVVPADRDFELSWKPA-SEKMPSVGLFHEQVGDAD 336
Query: 306 QREMFCMYLLPGT--AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
++ P A +++ +DVIF++D SGSM G + K +L AL +L P D
Sbjct: 337 ---YLLAFVTPPAVAAATQRPQPRDVIFVIDNSGSMGGTSIRQAKASLLYALGRLQPNDR 393
Query: 364 FNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
FN++ F+ T LF S+ A E V A ++ A G T + + A+ +
Sbjct: 394 FNVIRFDDTMTVLFPASVP-ADAEHVGDATSFVS-GLDARGGTEMVPAMRAALTDDGSDA 451
Query: 423 GSIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+ + +TDGA+ +E+Q+ + AM+ R RI+ GIGS N Y + A
Sbjct: 452 DRVRQVVFLTDGAIGNEQQLFETITAMRGR--------SRIFMVGIGSAPNTYLMSRAAE 503
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+ RG + ++ +E +M+ LF + + V+ ++ + + ++ P+ +PD+ P
Sbjct: 504 LGRGAFTHIGSVEQVEERMRDLFAKLENPVVTGLSANFSE--ASADLTPAVLPDVYRNEP 561
Query: 540 LIVSGRYQGKFPDTLKAKGFLGD 562
L+++ + + +L+ KG +GD
Sbjct: 562 LVLAAKID-RLAGSLQLKGRIGD 583
>gi|307943680|ref|ZP_07659024.1| von Willebrand factor type A domain-containing protein [Roseibium
sp. TrichSKD4]
gi|307773310|gb|EFO32527.1| von Willebrand factor type A domain-containing protein [Roseibium
sp. TrichSKD4]
Length = 746
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 112/424 (26%), Positives = 180/424 (42%), Gaps = 49/424 (11%)
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAG 239
TLP DG +L RL E+ + P+ + P ++ LN
Sbjct: 216 TLPVQDGSGWLVERL----------PEYPPVIGLTAPDKIDP--------RRVELNAALR 257
Query: 240 TGTEVLCNTSSHHLKQLRRDVGKLGYSYE-SEVLKWSNIDFDFSYTVSPSHIFGGVLLQS 298
+ T L SS H ++ G S+ +E + N DF Y+++ L
Sbjct: 258 SATRFLSLASSTH--EIETKTNDSGTSFHFAEGREIDNRDFVLRYSMAAEKRMATGALT- 314
Query: 299 PSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALS 356
H + R F + P A V ++++F++D SGSM G+P++ +K+ + AL
Sbjct: 315 ---HFDETRGGFLSLHIEPPKLAPEDLVTPRELVFVLDTSGSMGGQPMDASKSFMHAALD 371
Query: 357 KLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE 416
L D F I+ F T F+ S AT RA+ G NF+ G S + A+
Sbjct: 372 GLRENDQFRILRFANNTSAFAKSAMPAT-----RANIKAGKNFVTGLSARGGTEMNNAIN 426
Query: 417 M---LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
L G++ I+ +TDG + +R++ + R+ N RI+ FGIG N Y
Sbjct: 427 AAFDLPPVPGTMRIVVFLTDGYIGGDREVIQTVYDRIGNA-----RIHAFGIGKAINRYL 481
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
L LA RG ++ + L + + +L +I ID LD + +R+PD
Sbjct: 482 LEGLAREGRGRVRYIEPGETGQEAAAALASSIDAPLLTDIEID-WNGLDVSDQTTARLPD 540
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAK-------DIPLDRICA 585
L + + + V RY+G+ + KG + G + V L L+ K D PL I A
Sbjct: 541 LFAGNAIDVMARYKGEGAHQITVKGLVNGRRAEIPVSLTLKDTKSDTEPKADSPLPLIWA 600
Query: 586 KQQI 589
++QI
Sbjct: 601 REQI 604
>gi|393767973|ref|ZP_10356516.1| vault protein inter-alpha-trypsin subunit [Methylobacterium sp.
GXF4]
gi|392726579|gb|EIZ83901.1| vault protein inter-alpha-trypsin subunit [Methylobacterium sp.
GXF4]
Length = 720
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 102/409 (24%), Positives = 178/409 (43%), Gaps = 55/409 (13%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY-----------VT 220
+PN+FT T+ I G + +++ + Q + D ++ +P P Y
Sbjct: 150 RPNVFTNTVANIAPGETVLVQITYQQTIPVADATRALRLPLVVAPRYNPGSQALASVATG 209
Query: 221 PAIKKIPKREKIH----------------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLG 264
PA +P R +I + V G + S+ H + + G G
Sbjct: 210 PAADPVPDRARISPPVLDPSQTPPVNPVTVTVRLEAGFALGAVRSATHAIHVA-ETGPQG 268
Query: 265 YSYE-SEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSL----HDVDQREMFCMYLLPGTA 319
+E + ++ DF+ ++ +PS ++P L +V R + P A
Sbjct: 269 RRITLAEGVVPADRDFELTWEPAPS--------RAPELGLFREEVAGRTYLLATVNPPEA 320
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFS 377
++ +DV F++D SGSM G + K L L++L P D N++ F+ T+ LF
Sbjct: 321 EAAPRPARDVTFVIDNSGSMAGASMRQAKAGLLAGLARLTPRDRVNVIRFD-HTWDALFP 379
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAV 436
++ AT EA+ +A ++ + A G T + APL A+ + + + +TDGAV
Sbjct: 380 EAVP-ATPEALRKAEIFVA-DLEARGGTEMLAPLKAALADPHPEETARVRQVVFLTDGAV 437
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
DE +I A+ L R++ GIGS N + +R A I RG + DL +
Sbjct: 438 GDEERIFAAIHEGLGRS-----RLFMVGIGSAPNGHLMRHAAEIGRGSFTEIRDLGQVGE 492
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ + L+ + S + ++ T+ D E+ P R+PDL PL+ + R
Sbjct: 493 RTRSLYAKLESPAVTDLTA-TVSAPDA-EVTPRRLPDLYRGEPLVFAAR 539
>gi|356960241|ref|ZP_09063223.1| hypothetical protein gproSA_00907 [gamma proteobacterium SCGC
AAA001-B15]
Length = 746
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 141/544 (25%), Positives = 230/544 (42%), Gaps = 95/544 (17%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
GKSAS+ +PNIFT ++ I G+ + I +++ Q + + +FS+ P +
Sbjct: 137 GKSASLVEQQ--RPNIFTTSVANIAPGATIGIAIQYQQAVLIDNDKFSIRFPMVVGDRYI 194
Query: 221 P---------AIKKIPKREK------------------IHLNVNAGTGTEVLCNTSSHHL 253
P A+ +P + I +N+N G EV SS+H
Sbjct: 195 PGTMVATPNNALGVVPNTHQVKDASKITPPSDRRADLPITININLKAGFEVASLDSSYH- 253
Query: 254 KQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
K + +L + ++ DF+ +++ S SP L QR+ Y
Sbjct: 254 KIVVNQSDELTKQISLDQNYQADRDFELTWSADKS--------LSPELALFTQRKDDHYY 305
Query: 314 L-LPGTAKSRKVFKK-----DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
L L T VFK+ +VIFI+D SGSM G + KNAL A+++L P D FNI+
Sbjct: 306 LMLMATPPKDDVFKRTNTPREVIFIIDSSGSMAGGAMTQAKNALNRAIARLKPTDRFNII 365
Query: 368 AFN-GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
F+ G LF + + E ++ ++ + IA G T A+E SR S
Sbjct: 366 DFDSGFRPLFKGA--VPANETNKKNGKYFVNSRIADGGTEAL----DAIEYGLGSRDSKS 419
Query: 427 ------IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+IFL TDG V DE +I +K ++ N R ++ GIGS N Y L LA +
Sbjct: 420 ENYLRQVIFL-TDGQVGDEDEIFRTVKWKIEND-----RFFSIGIGSAPNSYLLTKLAEL 473
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + + +M LF + S L +I +D +P P++++E L
Sbjct: 474 GRGAFTYIGSTQEVGQKMHDLFEKLESPALTDIEVD----------FP---PEVNAELAL 520
Query: 541 -IVSGRYQGK----------FPDTL--KAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQ 587
+S Y G+ PD L G G S + K LD + ++
Sbjct: 521 GTISDLYAGETITAVYKVNVLPDNLIISGNGINGVFSKNITINSSNNTKG--LDVLWGRR 578
Query: 588 QIDLL--TAQAWFSEDKR--LEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ID L F E R +++ + +++Q ++ ++T ++ V+ SE ++
Sbjct: 579 KIDRLKDIYNNQFKEKSRDLVKQDIVTLALQHHLVSDFTSLVAVDVTPTRPESEQLNSQS 638
Query: 644 GSKK 647
KK
Sbjct: 639 IVKK 642
>gi|456352158|dbj|BAM86603.1| conserved hypothetical protein [Agromonas oligotrophica S58]
Length = 755
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/465 (23%), Positives = 202/465 (43%), Gaps = 72/465 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAI--------- 223
+PNIFT +L I G + +++ + Q ++ GEFS+ VP PA
Sbjct: 170 RPNIFTNSLANIGPGETVVVQIEYQQPVAQAAGEFSLRVPLVVAPRYNPAPLVQSVDLRP 229
Query: 224 ----------KKIPKREKIHLNVNAGTGTEVLCNT----------------SSHHLKQLR 257
+P R++I V T+ + T S HH +
Sbjct: 230 DSNGWGAASNDPVPDRDRISPEVLDPAKTDPINPTRITVRLQAGFALGEVKSHHHQVTID 289
Query: 258 RDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPG 317
K +E + ++ DF+ ++ + + V L + D D + P
Sbjct: 290 SPDAKTRVVTLAEGVVPADRDFELTWKPA-AAAMPSVGLFHEQVGDADY--LLAFVTPPA 346
Query: 318 TA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYL 375
A +++ +DVIF++D SGSM G + K +L AL +L P D FN++ F+ T L
Sbjct: 347 VAPTAQRPQPRDVIFVIDNSGSMGGTSIRQAKASLLYALGRLQPNDRFNVIRFDDTMTVL 406
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
F +S+ A E V A ++ + A G T + + A+ + + + +TDGA
Sbjct: 407 FPSSVP-ADVEHVGSATSFVS-SLEARGGTEMVPAMRAALTDDGSDSDRVRQVVFLTDGA 464
Query: 436 VEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
+ +E+Q+ + AM+ R RI+ GIGS N Y + + + RG + ++
Sbjct: 465 IGNEQQLFETITAMRGR--------SRIFMVGIGSAPNTYLMTRASELGRGAFTHIGSVE 516
Query: 493 SIEIQMQKLFTR-------GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+E +M+ LF + G S+ + ++D + P+ +PD+ PL+++ +
Sbjct: 517 QVEERMRDLFAKLENPVVTGLSASFSEASVD---------LTPAVLPDVYRNEPLVLAAK 567
Query: 546 YQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
+ +L+ KG +GD + + L L A+ L ++ A+++I
Sbjct: 568 LD-RLAGSLQLKGRIGD-QPWTITLPLAGAAEGKGLSKLWARRKI 610
>gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068]
gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068]
Length = 786
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/456 (25%), Positives = 196/456 (42%), Gaps = 49/456 (10%)
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETG------- 169
GS+ + + P+ G+I + + GK Y +L+ A K+ + G
Sbjct: 100 GSQQMEVSLVFPLPYDGAIDRMTFLVDGKEYDAKLLE------AKKAREIYEGYVRRSQD 153
Query: 170 ----SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKK 225
++ +F ++ + G+ + +R+SQ L + +P Y I+K
Sbjct: 154 PALLEWVGRGMFQTSVFPVPPGAERKVTIRFSQLLRQEHRLTDLLIPMATARYTNTPIEK 213
Query: 226 IPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTV 285
+ I ++ + + +H + R D +E+ DF Y V
Sbjct: 214 VSLEATIESSIAIKS-----VYSPTHAVDVKRPDEKHATVKFEASNY-LPTTDFRLLYDV 267
Query: 286 SPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPL 344
+ + VL P D F M P ++ + KK VIF+VD SGSMQGK +
Sbjct: 268 GDAPLAASVLSYRP---DNSDEGFFLMLASPNHSQGEVDLTKKTVIFVVDRSGSMQGKKI 324
Query: 345 EDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGS 404
E + A+ L+ L GD+FNIVA++ F ++ + A ++ AGGS
Sbjct: 325 EQAREAMRYVLNNLHEGDTFNIVAYDSTVESFKPELQKFDDATRKSALAYVD-GLYAGGS 383
Query: 405 TNICAPLTKAVEMLTNSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRI 460
TNI L A MLT S I+FL TDG +E +I + K + + R+
Sbjct: 384 TNISGALDSAFAMLTGSDRPNYILFL-TDGLPTAGETNEGKIVELAKQKNVHRA----RM 438
Query: 461 YTFGIGSYCNHYFLRMLAMISRGYYGAAYDL---DSIEIQMQKLFTRGFSSVLAN--IAI 515
FG+G N R+L +SR +G + + +++E + +L+++ S VL + ++I
Sbjct: 439 INFGVGYDVNS---RLLDRMSRENFGQSQYVRPDENLEASVSRLYSKMSSPVLTDVKVSI 495
Query: 516 DTLKDLDEF----EMYPSRIPDLSSESPLIVSGRYQ 547
D D MYP ++ D+ S L+++GRY+
Sbjct: 496 DIEGAGDSSSAVNRMYPKQVMDIFSGEQLVIAGRYK 531
>gi|330508298|ref|YP_004384726.1| hypothetical protein MCON_2454 [Methanosaeta concilii GP6]
gi|328929106|gb|AEB68908.1| conserved hypothetical protein, extracellular or membrane bound
[Methanosaeta concilii GP6]
Length = 726
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 115/453 (25%), Positives = 200/453 (44%), Gaps = 29/453 (6%)
Query: 187 GSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLC 246
G I L +SQ L +G P + + ++ + R +I + +
Sbjct: 145 GEERKIELEYSQILPVENGLVHYIYPLSTERFSSRPLEDLVVRAQIE------SREPLKA 198
Query: 247 NTSSHHLKQLRR--DVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDV 304
SS H + R D L +S+VL + DF+ YT+S I +L S +
Sbjct: 199 VYSSRHEVSIDREDDYHALLGLEQSDVL--PDRDFELFYTISSEKIGLNLL----SFKEE 252
Query: 305 DQREMFCMYLLPGT-AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
Q F + P ++ KD+I ++D SGSMQG+ ++ K A L L+P D
Sbjct: 253 GQDGFFLLLAAPDVKVNEEEIVVKDIILVLDTSGSMQGEKMDQAKEAARYVLDHLNPLDR 312
Query: 364 FNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG 423
F IV+F T FS S+E A + ++ ++ A GST+I + +AV + R
Sbjct: 313 FAIVSFATTTRSFSPSLEPAAQ--ADKGKDFLD-RLEAMGSTDINRAMIEAVGLAEEVRP 369
Query: 424 SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ +IFL TDG + + A+ + RI++FG+G + L ++M + G
Sbjct: 370 TT-LIFL-TDGLPTEGVTVTGAILDNVAREAPDNVRIFSFGVGDDVDTDLLDQISMDNGG 427
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
+ I+ ++ + + VL+++++D D+ ++YP RIPDL + S LI+
Sbjct: 428 ASTYVRPGEEIDEEVSAFYRKVKMPVLSDLSLD-WGDIIVDQVYPQRIPDLFAGSQLIML 486
Query: 544 GRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD-----IPLDRICAKQQID-LLTAQAW 597
GRY+ P + KG + L K+ IP R+ A + + LT
Sbjct: 487 GRYREGGPAKITLKGMVNQEERSYTYEDLSFRKEGGDDFIP--RLWATRAVGYYLTQIRL 544
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD 630
+ E + + + +S + G++ YT ++ E D
Sbjct: 545 YGEKQEWIDSIVSLSTRYGIITPYTSFLVQEKD 577
>gi|163797398|ref|ZP_02191350.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199]
gi|159177317|gb|EDP61874.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199]
Length = 683
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 151/307 (49%), Gaps = 20/307 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
++VIF++D+SGSM+G+PL K +L + L D+FN+VAFN + F + A+ +
Sbjct: 328 REVIFVIDVSGSMKGEPLRAAKASLTSGIEGLGRNDTFNVVAFNNKAAAFYDAPVRASGK 387
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
RA + AGG T + A A++M + ++F +TDGAV +E + + +
Sbjct: 388 -FHRAALKVIDGLKAGGGTEMAAAFELALQMPGDPDRLQQVVF-ITDGAVSNEAALFNQI 445
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
K L R++T GIGS N +F+ A RG Y D S E M+ LFT+
Sbjct: 446 KGELG-----ARRLFTVGIGSAPNTFFMEEAARFGRGTYTYIGDTSSAERVMRDLFTKIS 500
Query: 507 SSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-QGKFPDTLKAKGFLGDL 563
L NI + + ++D+ P IPDL + PL ++ + QG + +G LGD
Sbjct: 501 FPALTNIEVRGEGVEDIT-----PGTIPDLYAGEPLSIAMKLRQGN--KAITVQGRLGD- 552
Query: 564 SNFVVELKLQLAKDIPLDRI-CAKQQIDLLTAQAWFSEDK-RLEEKVSKMSVQTGVLCEY 621
+ + + L A + R+ A+++I + DK R+ +V +++ ++ Y
Sbjct: 553 TEWKRTVTLSPAGNQSGIRVDWARKRIADWQRAGYLGLDKDRVRSEVLSLALTHHLVSRY 612
Query: 622 TRMIIVE 628
T + V+
Sbjct: 613 TSFVAVD 619
>gi|149187053|ref|ZP_01865360.1| hypothetical protein ED21_31369 [Erythrobacter sp. SD-21]
gi|148829342|gb|EDL47786.1| hypothetical protein ED21_31369 [Erythrobacter sp. SD-21]
Length = 697
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 165/348 (47%), Gaps = 29/348 (8%)
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
P AK K+ +++IF++D SGSM G+ + +L ALS L P D FNI+ F+ +
Sbjct: 305 PAAAKVEKLPPRELIFVIDNSGSMSGESMRAASKSLVYALSTLRPEDRFNIIRFDHSMTM 364
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
A + + +A ++ + G T++ L A+ + I +TDG
Sbjct: 365 LHPDAVAADRTNLAKARRY-AESLRGQGGTDMLPALRAALRDRDPDGKRLRQIIFLTDGN 423
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG-AAYDLDSI 494
+ +ER++ + L R++ GIGS N + +R +A RG + D +++
Sbjct: 424 LSNEREMMSEISIALGRS-----RVFMVGIGSAPNSHLMRRMAEAGRGTFTHVGQDAEAV 478
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
+M+++ R V+ +++ + + E+ P+ +PDL + PL++ GR + +
Sbjct: 479 S-EMRRMLNRLAKPVVTGLSVRV--EGSDLELTPAVLPDLYAGEPLVLRGRTE-SLKGEI 534
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAWFSEDKRLE--EKVSKM 611
+ G +G + + L L AKD P + R+ A+++I + AQ W + +E ++V +
Sbjct: 535 EVSGLIGG-KRWRIALDLDSAKDSPSIARLWARRKISDIEAQRWSYQIDGVEADKQVEAL 593
Query: 612 SVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVP 659
+ ++ + T ++ V+ E+P +G Q++ E+P
Sbjct: 594 GMAFHIVTDLTSLVAVD--------ETPSRPEG------QRLTREELP 627
>gi|433605438|ref|YP_007037807.1| hypothetical protein BN6_36390 [Saccharothrix espanaensis DSM
44229]
gi|407883291|emb|CCH30934.1| hypothetical protein BN6_36390 [Saccharothrix espanaensis DSM
44229]
Length = 841
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 118/504 (23%), Positives = 214/504 (42%), Gaps = 60/504 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+P +FT + + G +++RL + L+Y D + +P P Y+
Sbjct: 122 RPGVFTTRVGNVLPGERVTVRLTLAGALAYEDDAATYRLPLVVAPRYIPGAPLAGGQVGD 181
Query: 220 --------TPAIKKI-----------PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDV 260
P +I P R + + V+ G ++S H R
Sbjct: 182 GVAADTDAVPDASRISPPVLLPGFPNPVRLSVVVEVDT-AGLPAATPSASLHATVTERT- 239
Query: 261 GKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAK 320
G+ E + DF + + + L P + F + +LP A+
Sbjct: 240 -SRGFRVTVEPGARVDRDFLLRLAFADTGVVRTALAVRPD--EGGATGTFALTVLPPRAE 296
Query: 321 SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSM 380
+ +DV+ ++D SGSM G + + A + L GD F ++AF+ T++
Sbjct: 297 T-AARGRDVVLVLDRSGSMHGWKMVAARRAAGRIVDSLTTGDRFAVLAFD-NVVDTPTAL 354
Query: 381 ELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAV 436
EA +R H++ + F++G G T + APL +A E+L + ++ LVTDG V
Sbjct: 355 PAGLVEATDR-HRYRAVEFLSGLDARGGTEMLAPLRRAGELLGHDGDRERVLVLVTDGQV 413
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
DE ++ + + L R++T GI N FL+ LA + G + D ++
Sbjct: 414 GDEDRLLRELATPLAG-----VRVHTVGIDRAVNEAFLQRLAGTA-GRFELVESEDRLDD 467
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
++ + R S VL +A+ + LD + P+ +PDL +P++V+GRY+G+ D +
Sbjct: 468 ALRGIHRRIGSPVLTGLAV-LAEGLDVESLAPAPLPDLFHGAPVVVTGRYRGRL-DGVDL 525
Query: 557 KGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQ--AWFSEDKRLEEKVSKMSV 613
G G VE ++ D P L + A+ ++ L + + LE ++ S+
Sbjct: 526 SGDGGWRQR--VE---AVSSDNPGLPALWARARVRDLEDRYVVGVGDLAELERRIVATSL 580
Query: 614 QTGVLCEYTRMIIVETDERNNASE 637
+ GVL +T + V++ N + E
Sbjct: 581 EFGVLSRFTAFVAVDSRVVNESGE 604
>gi|254560515|ref|YP_003067610.1| hypothetical protein METDI2058 [Methylobacterium extorquens DM4]
gi|254267793|emb|CAX23640.1| conserved hypothetical protein [Methylobacterium extorquens DM4]
Length = 733
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 118/509 (23%), Positives = 217/509 (42%), Gaps = 72/509 (14%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEY--VTPAI------ 223
+PN+FT + I G + +++ + Q + G +++ +P P Y PA+
Sbjct: 164 RPNLFTNAVANIGPGETVLVQIEYQQPVRSSAGTYALRLPTVAAPRYSPAPPAVMAVVER 223
Query: 224 ----KKIPKREKI---------HLNVNAGTGTEVL--------CNTSSHHLKQLRRDVGK 262
+P RE I H N T T L +++H ++ R +
Sbjct: 224 GAAADPVPDRETIAAPVLDPARHAPSNPLTLTIDLKAGFTLGRVRSATHAVRIEERSASE 283
Query: 263 LGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR-----EMFCMYLLPG 317
+ ++ ++ DF+ ++ +P ++PS+ +R + + P
Sbjct: 284 RRITL-ADGATAADRDFELTWNAAPG--------EAPSIGLFRERVAGAEAVLAVVTPPE 334
Query: 318 TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS 377
TA +DV+F++D SGSM G + K +L + L +L D FN++ F+
Sbjct: 335 TASPAASVPRDVVFVIDNSGSMGGASMRQAKASLLIGLDRLGAHDRFNVIRFDHSFDTLF 394
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-NSRGSIPIIFLVTDGAV 436
+ A + RA ++ A G T + APL A+ T G + + +TDGA+
Sbjct: 395 PDLVPADAHHLMRAKSFVA-GLQASGGTEMLAPLQAALRDATPEETGRLRQVVFLTDGAI 453
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+E QI A+ + G S R++ GIGS N Y +R A + RG + D +
Sbjct: 454 GNEAQIFSAIATE--RGRS---RLFMVGIGSAPNGYLMRHTAELGRGSFTQIDTPDQVTE 508
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
+M+ L + S + ++ T + ++ P+R+PDL PL +S R G+ TL
Sbjct: 509 RMRALLVKLESPAVTDLTA-TFSE-PGIDVTPTRLPDLYRGEPLTLSARM-GQARGTLTL 565
Query: 557 KGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDKRL---------EE 606
G +G + +L L A+D + ++ A+ +I A+A E RL +
Sbjct: 566 SGRIGG-QPWQTQLHLDAAQDGTGIGKLWARAKI----AEA---ETARLTGGLTAEAADA 617
Query: 607 KVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+ ++++ G+ T ++ V+ R A
Sbjct: 618 AILRLALDHGLTTRLTSLVAVDATPRRPA 646
>gi|261856580|ref|YP_003263863.1| Vault protein inter-alpha-trypsin domain-containing protein
[Halothiobacillus neapolitanus c2]
gi|261837049|gb|ACX96816.1| Vault protein inter-alpha-trypsin domain protein [Halothiobacillus
neapolitanus c2]
Length = 671
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 129/581 (22%), Positives = 239/581 (41%), Gaps = 78/581 (13%)
Query: 96 DCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALG 155
D +D RV+ T R G++ + R P ++ + I + ++
Sbjct: 59 DIVIDGPIARVTITQRYRN-EGTRPINARYVFPGSTHAAVQSLTMNIGDRIIKAKIKEKE 117
Query: 156 EN-------DGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFS 208
E AGK A++ +PN+F + + I G + + L++S+ L DG +
Sbjct: 118 EAKKIYEVAKQAGKHAALLEQK--RPNVFMMNVANIMPGDTVELVLQYSELLIPDDGVYQ 175
Query: 209 VNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDV-------- 260
+ +P V P P R H A + NT + Q++ D+
Sbjct: 176 L----VYPTVVGPRYGGDPIRATPHNRWIANPYAK--DNTDGSNPAQIKTDIHVRIASPI 229
Query: 261 --GKLGYSYESEVLKW-----------------SNIDFDFSYTVSPSHIFGGVLLQSPSL 301
L + V W N DF S+ + + I G++
Sbjct: 230 PISDLRSAQHKIVTHWLNDKSAEISLDPSETHTGNRDFILSFRLQGAKINSGLMT----- 284
Query: 302 HDVDQREMFCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
++ + F M P +V K++ +F+VD+SGSM G PL + + LS L P
Sbjct: 285 YEWNGEHYFLMMAQPPKRVAPTEVMKREYLFVVDVSGSMYGFPLNTASDLMRELLSSLKP 344
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
++FNI+ F+G + + S + AT E ++RA + I GG P K +
Sbjct: 345 QETFNILFFSGGSRVLSPTPLQATPENLQRAMTM--MRSIQGGGGTELLPALKTAFAMPR 402
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ + I ++TDG V+ ERQ D +K L + ++ FGIGS N Y + +A
Sbjct: 403 TEDTARSIVVITDGYVDVERQAYDLIKQNLNSTN-----LFAFGIGSSVNRYLMESMAHA 457
Query: 481 SRG---YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
+G D+ + + ++ + VL++I I ++ ++ PS IP + +E
Sbjct: 458 GQGEPFIITGPNDVPGVGARFRRYVD---APVLSHIKIRG-NGVELYDTEPSEIPVMLAE 513
Query: 538 SPLIVSGRYQGKFPD-TLKAKGFLGDLSNFVVELKL---------QLAKDIPLDRICAKQ 587
P+++ G+Y+ P TL+ G + L L A+ +P+ + A+Q
Sbjct: 514 RPIVIFGKYRNAQPGATLELTGTRA-TGEYRATLSLDDSNGQADKNQAELLPV--LWARQ 570
Query: 588 QIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
++ L+ +D +++ ++ ++ +L YT + V+
Sbjct: 571 RLMYLSDLQ--GDDDAHRDEIIRLGLRYSLLTRYTSFVAVD 609
>gi|300855507|ref|YP_003780491.1| hypothetical protein CLJU_c23310 [Clostridium ljungdahlii DSM
13528]
gi|300435622|gb|ADK15389.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528]
Length = 805
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 122/539 (22%), Positives = 225/539 (41%), Gaps = 55/539 (10%)
Query: 128 PMGDQGSILGVEAEISGKSYHT------QLIALGENDGAGKSASVETGSFLKPNIFTLTL 181
P+ I G EAEI G++ + + + EN ++ F P++F +++
Sbjct: 54 PLPAAAIISGFEAEIGGRALKAVVEEKDKALQIYENAKLRGDSTFSLEEF-SPHLFKISI 112
Query: 182 PQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPF-----------KFPEYVTPAIKKI---- 226
+I G + I+L + ++L Y+D F + +P K E + K+
Sbjct: 113 GKIISGETVKIKLSYIEELDYKDSTFKLTIPAISQPKIIRNRSKIEELKDNLVNKLGIRK 172
Query: 227 PKREKIHLNVN--AGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYT 284
K E VN + ++V H++K R S E + D DF
Sbjct: 173 TKEENFEFKVNIIVESLSKVDFRCPYHNIKVEREGDTVAKISLEE---NYYLTDKDFILF 229
Query: 285 VSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPL 344
+ + + +++ + M ++P + + +F++DIS +M+G+ L
Sbjct: 230 IRERETLEADGMIYEYKENDEEKGIVYMRMIPKLDPYEEDITEHYVFLIDISDTMKGEKL 289
Query: 345 EDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA-TKEAVERAHQWIGINFIAGG 403
E KNAL + + L GD+F+++A + FS + + +++ A +WI G
Sbjct: 290 EQAKNALQLCIRNLSKGDTFDMIALGNKLISFSQNGPIEFNADSLREASKWID-----GL 344
Query: 404 STNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF 463
T A + ++ + GS I L TD +VEDE +I + + + RI+TF
Sbjct: 345 KTEKDADIFGGIKYSLENEGSENTILLFTDDSVEDEEKILSYVNENIGDN-----RIFTF 399
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLD---SIEIQMQKLFTRGFSSVLANIAIDTLKD 520
GI S N+YFL LA S YG A +D IE + + F R S + I I+ +
Sbjct: 400 GIDSCANNYFLNKLAHES---YGKAEFIDIGQRIEDVILRQFNRIQSPQVDGIDINWGR- 455
Query: 521 LDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG--------DLSNFVVELKL 572
L YP I + P + G+ + KG++ D+ NF E
Sbjct: 456 LQVNSTYPRTIEYMYDREPFSIFASVTGEVEGRITLKGYVDGKEYLRRIDIDNFSTEENA 515
Query: 573 QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDE 631
L + + + ++++ + E + ++ K+ ++S ++G + T I++E E
Sbjct: 516 SLLRKVWARKKMKSLEMNMKAQRG--QEKEEMKRKLIELSKKSGFISPETTFILMELRE 572
>gi|406831790|ref|ZP_11091384.1| von Willebrand factor type A [Schlesneria paludicola DSM 18645]
Length = 774
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 140/585 (23%), Positives = 244/585 (41%), Gaps = 55/585 (9%)
Query: 75 HGRCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGS 134
H PP ++ + +D A V+VS T+ G+ + + P+ + +
Sbjct: 43 HAPFRPPVSGSYKIKQLRVDARIVDQVARVQVSQTF---ANTGTSPMEVQFLFPLPYESA 99
Query: 135 ILGVEAEISGKSYHTQLIALGENDGAGKSASVETG-----SFLKPNIFTLTLPQIDGGSY 189
I + + GK + +L+ E +S ++ +F + + G+
Sbjct: 100 IDQLTLLVDGKEWSGRLLPATEARSIYESIVRSNRDPALLEWIGSGLFQTRVFPVPVGTE 159
Query: 190 LSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTS 249
+ L ++Q L G P +Y + AI+ + R I V+ + T
Sbjct: 160 RVVTLNYNQLLKKDHGLTDFAFPLSTAKYTSRAIENVEVRIAIESTVDL---KNIYSPTQ 216
Query: 250 SHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREM 309
+ +K+ + YS ++V DF + V + V+ P ++ M
Sbjct: 217 TVDIKRTDARHAVVSYSRSNDV---PVTDFRLLFDVGQGAVGANVISYKPKANEDGYFLM 273
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
+ T +R +K +IF+VD SGSM GK E AL L+ L GD FNI+A+
Sbjct: 274 LASPEVKSTDTTR--IRKTLIFVVDRSGSMSGKKFEQACAALRFVLNNLHEGDLFNIIAY 331
Query: 370 NGETYLFSTSMELATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIP-I 427
+G F +E E A ++ G++ AGG TNI LT A+ L ++ S+P
Sbjct: 332 DGTVESFRPELEKYNDETRRAALGFVQGLH--AGGGTNIQGALTTALSQLKDT--SVPNY 387
Query: 428 IFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ +TDG V +E Q+C + K+ + RI TFG+G N R+L I+R
Sbjct: 388 VLFLTDGLPTVGVVNEAQLCQSTKA----ANQVHARILTFGVGYDVNA---RLLDRIARD 440
Query: 484 YYGAAYDL---DSIEIQMQKLFTRGFSS-VLANIAIDTLKDLDEFE---------MYPSR 530
+G + + + IE + ++ RG SS VL N + LD F+ YP
Sbjct: 441 GFGLSESVRPDEDIEAHVSTVY-RGISSPVLTNATVQF--QLDGFKPENGPAVNRTYPHG 497
Query: 531 IPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAKDIP----LDRICA 585
DL L++ GRYQ ++ G + G +F L D ++++
Sbjct: 498 TFDLFEGQQLVIVGRYQNPGHARVQLNGQVNGQPQSFEFSAMLNPVSDDQNFAFIEKLWG 557
Query: 586 KQQIDLLTAQAWFS-EDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
++I + Q S ++ L +++ ++S + G+L YT + ET
Sbjct: 558 LRRIGEIIDQIDLSGKNDELVKELVELSTRHGILTPYTSFLADET 602
>gi|271964249|ref|YP_003338445.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
Length = 774
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 124/503 (24%), Positives = 217/503 (43%), Gaps = 59/503 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+P++FT+ + I G +++ L +Q L Y DG + P P Y+
Sbjct: 120 RPDVFTIRVGNIVPGERVTVHLTLNQPLPYEDGAATFRFPLVVAPRYIPGTLLDGEQAGD 179
Query: 220 --TPAIKKIPKREKIHLNV-------NAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESE 270
P + +P +I V V + + L+++R + + ++
Sbjct: 180 GWAPDTEAVPDASRITPPVLLPGFPDPVRLSLAVAVDPAGLELREVRSSLHVVREEGDTV 239
Query: 271 VLK-WSNIDFDF----SYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF 325
L+ +D DF S+ S S L P D D+ F + ++P A +
Sbjct: 240 RLQPGERLDRDFILRLSFDASTS------LALVPDADDGDE-GTFALTVVPDPAPA-TAN 291
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG---ETYLFSTSMEL 382
KDV+ ++D SGSM G + + A A + L GD F +++F+G + +
Sbjct: 292 AKDVVLVLDRSGSMTGWKMVAARRAAARIVDTLTGGDRFAVLSFDGVVEQPEGLGEGLSE 351
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
A+ RA + + A G T + APL +AV +L+ S G ++ LVTDG V +E QI
Sbjct: 352 ASDRNRYRAVEHLA-RLEARGGTEMLAPLEQAVALLSES-GRDRVLVLVTDGQVGNEDQI 409
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+ + +RL R++T GI N FL LA + G D ++ M+ +
Sbjct: 410 LERIGARLAG-----VRVHTVGIDRAVNAGFLGRLAGLGSGRCELVESEDRLDEAMEHIH 464
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS---ESPLIVSGRYQGKFPDTLKAKGF 559
R + ++ ++++ D ++ P + L S PL V GRY+G L G
Sbjct: 465 RRIGAPLVTDLSLKA----DGLDLVPGTVSHLGSIYAGVPLTVLGRYRGASGGALAVHGL 520
Query: 560 LGDLSNFVVELKLQLAK--DIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGV 617
D + E ++ A+ + I A+ ++ L + + + D LEE++ +S++ GV
Sbjct: 521 --DSTGQPWERRVAGARVDGRAMRPIWARARLRALEDR-YAAGDHSLEEEIVGVSLRYGV 577
Query: 618 LCEYTRMIIVETDERNNASESPG 640
LC +T + + D R A PG
Sbjct: 578 LCRFTAFVAI--DSRVVAEGGPG 598
>gi|188580652|ref|YP_001924097.1| LPXTG-motif cell wall anchor domain-containing protein
[Methylobacterium populi BJ001]
gi|179344150|gb|ACB79562.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium
populi BJ001]
Length = 723
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/413 (24%), Positives = 176/413 (42%), Gaps = 53/413 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKFPEY--VTPAIKK---- 225
+PN+FT + I G + ++L + Q + G S+ +P P Y PA+
Sbjct: 152 RPNLFTNAVANIGPGETVLVQLEYQQPVRQSGGTASLRLPTVAAPRYSPAPPAVTSAAAG 211
Query: 226 ------IPKREKIH----------------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKL 263
+P REKI L+++ G + S+ H ++
Sbjct: 212 AGAADPVPDREKIEAPVLDPTRQAPVNPLSLSIDLKAGFALGAVRSATHAIRIEEVSESE 271
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQR-----EMFCMYLLPGT 318
++ ++ DF+ +++ +P G P++ +R + + P
Sbjct: 272 RRITLADGAPAADRDFELTWSAAP----GTDPSAGPAVGLFRERVVGAETVLALVTPPEG 327
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFST 378
A +DV+F++D SGSM G + K +L + L +L PGD FN++ F+
Sbjct: 328 AAPAVALPRDVVFVIDNSGSMGGASIRQAKASLLIGLDRLRPGDRFNVIRFDHSFDTLFP 387
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-NSRGSIPIIFLVTDGAVE 437
+ A + + RA +++ A G T + APL A+ T + I +TDGA+
Sbjct: 388 DVVPADESHLARAKRFVS-GLEASGGTEMLAPLRAALADATPEDTARLRQIVFLTDGAIG 446
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
+E QI A+ + G S R++ GIGS N Y + A + RG + D + +
Sbjct: 447 NEAQIFSAIAAE--RGRS---RLFMVGIGSAPNGYLMSHAAELGRGSFTQIDTPDQVSER 501
Query: 498 MQKLFTRGFSSVLANIAIDTLKDL---DEFEMYPSRIPDLSSESPLIVSGRYQ 547
M+ L T+ L + A+ L + D ++ P+R+PDL PL +S R +
Sbjct: 502 MRALLTK-----LESPAVTELTAVFSEDGVDVTPTRLPDLYRGEPLTLSARMR 549
>gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana
ATCC 700345]
gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC
700345]
Length = 789
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/342 (26%), Positives = 159/342 (46%), Gaps = 39/342 (11%)
Query: 311 CMYLLP--GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
+ L+P G + +++I ++D SGSM G + K AL AL+ L P D FNIV
Sbjct: 381 LVMLMPPQGAEQQPSSIHRELILVIDTSGSMSGDAIIQAKTALKYALAGLRPTDKFNIVQ 440
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV------------E 416
FN + +S AT + +A +I A G T + + A+ E
Sbjct: 441 FNSDVDKWSGMAMSATPYNLAQAQNYIN-RLEANGGTEMSIAINAALNIETVTDKETGTE 499
Query: 417 MLTNSRGS--IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ N GS + + +TDGAV +E + + ++++L + R++T GIGS N +F+
Sbjct: 500 LDNNDLGSNLLRQVLFITDGAVSNESMLFELIEAQLGDS-----RLFTIGIGSAPNAHFM 554
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLD-EF------EMY 527
+ A + RG Y LD + QK+ S+L I + D+D F + +
Sbjct: 555 QRAAQLGRGTYTYIGKLDEVN---QKVV-----SLLKKIEKPQVTDVDLRFSDGSVPDYW 606
Query: 528 PSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAK 586
P RIPDL + PL+V+ + L +G L + L + + LD + A+
Sbjct: 607 PVRIPDLYAHEPLLVAVKLPSYASSDLLVQGLLAG-QFWQRRLSVSATEQAKGLDLVWAR 665
Query: 587 QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+QI L + +R+E++++ +++ ++ YT ++ V+
Sbjct: 666 KQIAALELSKQAANRERIEKQITAIAMNFHIMSAYTSLVAVD 707
>gi|255525871|ref|ZP_05392799.1| Vault protein inter-alpha-trypsin domain protein [Clostridium
carboxidivorans P7]
gi|255510435|gb|EET86747.1| Vault protein inter-alpha-trypsin domain protein [Clostridium
carboxidivorans P7]
Length = 531
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 125/486 (25%), Positives = 201/486 (41%), Gaps = 57/486 (11%)
Query: 128 PMGDQGSILGVEAEISGKSYHT------QLIALGEN--DGAGKSASVETGSFLKPNIFTL 179
P+ I G EAEI GK+ T + I + EN D + +E LKP+IF +
Sbjct: 54 PIPKTAIISGFEAEIGGKTLKTIVEEKEKAIKIYENAKDRGDNTVFLEE---LKPHIFKI 110
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPF-----------KFPEYVTPAIKKIPK 228
+ +I G + I+L + +L Y+D +F + +P K + KI K
Sbjct: 111 AVGKIVSGETVKIKLSYIDELQYKDKKFKLTIPAIYEPTKMDENSKLNNLKNTLVNKIVK 170
Query: 229 R--EKIHLNVNAGTGTEVLCN---TSSHHLKQLRRD---VGKLGYSYESEVLKWSNIDFD 280
+ E E LC S +H ++ R+ V K+ + E + D D
Sbjct: 171 KKSEDSEFQFKTNIIVESLCKLDFESPYHKLKVEREGDTVAKINLQDDYEAM-----DKD 225
Query: 281 FSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQ 340
F + + + + R + + ++P K+ +F++DIS +M+
Sbjct: 226 FILIMKEKDPLEADGMIYEYKENGEDRRVVYLRMIPKLDDYEMEDKESYVFLIDISDTMK 285
Query: 341 GKPLEDTKNALAVALSKLDPGDSFNIVAFNGET--YLFSTSMELATKEAVERAHQWIGIN 398
G LE KNAL + + L+ GD+F+I+A GET Y + M E +++A QWI N
Sbjct: 286 GDKLEQAKNALQMCIRNLEEGDTFDIIAM-GETLKYFWDEGMAEFNSETLKKASQWIE-N 343
Query: 399 FIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
+I + ++E + G I + TD VE+E +I D ++ L +
Sbjct: 344 LTTEDDADIFGAIKYSLE----NEGGHNTILIFTDDEVEEEDEILDYVRENLGDN----- 394
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
RI+TFG S N+YFL LA S G IE + + F R + + +I ID
Sbjct: 395 RIFTFGFDSETNNYFLNKLAHESFGKAEFINKGRRIEYVVLRQFKRIQNPEVDDIQID-W 453
Query: 519 KDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG--------DLSNFVVEL 570
++ YP I + + G+ D + G +G DL NF E
Sbjct: 454 GNMKVQSTYPRTIDYMYDREAFSIFAYVIGEVSDEIVISGKVGDKDYIRKVDLDNFNTED 513
Query: 571 KLQLAK 576
L K
Sbjct: 514 NANLLK 519
>gi|383780107|ref|YP_005464673.1| hypothetical protein AMIS_49370 [Actinoplanes missouriensis 431]
gi|381373339|dbj|BAL90157.1| hypothetical protein AMIS_49370 [Actinoplanes missouriensis 431]
Length = 873
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 128/546 (23%), Positives = 229/546 (41%), Gaps = 69/546 (12%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGEND-------GAGKSASVETGSFLKPNIFTLT 180
P+ D+ ++ G+ + ++ QL E AG+ AS+ +P++FT+
Sbjct: 73 PLPDRAAVTGMTMTAADRTVVAQLRERAEAREAYDQAIAAGQRASIAEEE--RPDVFTMR 130
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT--------------PAIKK 225
I G +++RLR L Y DG + P P YV P
Sbjct: 131 AGNIMPGERVTVRLRLVGPLPYEDGAATFRFPLVVAPRYVPGTPLPDSYAGDGQLPDTDA 190
Query: 226 IPKREKIH-----------LNVNAG-----TGTEVLCNTSSHHLKQLRRDVGKLGYSYES 269
+P +I L ++ G G ++ SS H + D G L
Sbjct: 191 VPDASRISPPVLLPGFPNPLRLSIGVDIDPAGLQLGDVRSSLH--TVLDDEGTL------ 242
Query: 270 EVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLPGTAKSRKVFKKD 328
+ D DF ++ + GG + ++ D + E + + +LP A + KD
Sbjct: 243 RIAPGERADRDFVLRLA--YAPGGE--SAVAVPDEEGAEGTYQLVVLPPEAAT-APRPKD 297
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
V+ ++D SGSM G + + A A + L D F ++ F+ ++ EA
Sbjct: 298 VVLLLDRSGSMGGWKMVAARRAAARVVDTLTGADRFTVLTFDHHVE-RPAGLDHGLCEAS 356
Query: 389 ERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+R +++ + +A G T + PL + ML S G ++ L+TDG V +E QI
Sbjct: 357 DR-NRFRAVEHLARADARGGTELLTPLISGLTMLAGSSGRDRVLVLITDGQVGNEDQILQ 415
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++ + R++T GI N FL LA I G D ++ M+ + R
Sbjct: 416 EIRPLIG-----TTRVHTVGIDRAVNAGFLGRLAAIGAGRAELVESEDRLDEAMEHIHRR 470
Query: 505 GFSSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD 562
+ V+ + + D + LD+ P+R+P L PL+++GRY+G+ L G D
Sbjct: 471 IGAPVVTGLGVTGDGITLLDD-SRSPARLPGLYPGVPLVITGRYRGEPAGRLTVTGRTRD 529
Query: 563 LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYT 622
+F + +Q + + + A+ ++ L A+ + D E+++ S++ GVLC +T
Sbjct: 530 DQDFRTGVAVQRRSEPAVTSLWARARVRDLE-DAYAAGDHHREQEIVSTSLRFGVLCRFT 588
Query: 623 RMIIVE 628
+ V+
Sbjct: 589 AYVAVD 594
>gi|383775049|ref|YP_005454118.1| hypothetical protein S23_68300 [Bradyrhizobium sp. S23321]
gi|381363176|dbj|BAL80006.1| hypothetical protein S23_68300 [Bradyrhizobium sp. S23321]
Length = 741
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 117/465 (25%), Positives = 209/465 (44%), Gaps = 72/465 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKK------ 225
+PNIFT ++ I G + +++ + + + E+S+ +P P Y I +
Sbjct: 155 RPNIFTNSIANIGPGETVLVQIEYQEPVQQSGNEYSLRLPLVVGPRYNPAPIVQSVDFRQ 214
Query: 226 ------------IPKREKIH----------------LNVNAGTGTEVLCNTSSHHLKQLR 257
+P R++I + V+ G + S HH ++
Sbjct: 215 DGSGWGATNSDPVPDRDRISPPVLDPAKNAPVNPTSITVHLKAGFALGEVKSHHHNVKVE 274
Query: 258 R--DVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
D ++ + V D DF T P+ ++PS+ +R YLL
Sbjct: 275 SPDDTTRIVTLADGAV----PADRDFELTWKPAAA------KAPSVGLFRERVGDADYLL 324
Query: 316 ----PGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
P +A+ ++K ++V+F++D SGSM G + K +L ALS+L P D FN++ F
Sbjct: 325 AFVTPPSAEQATQKPLPREVVFVIDNSGSMGGTSIVQAKASLTYALSRLQPNDRFNVIRF 384
Query: 370 NGET-YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPII 428
+ LF TS+ A V A ++ GG+ + A + L ++ ++
Sbjct: 385 DDTMDMLFPTSVP-ADAAHVGEATAFVSALQARGGTEMVPAMRAALTDKLGDTNTVRQVV 443
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
FL TDGA+ +E+Q+ +A+ + G S R++ GIGS N Y + + + RG +
Sbjct: 444 FL-TDGAIGNEQQLFEAITA--MRGRS---RVFMVGIGSAPNTYLMTRASELGRGAFTHI 497
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDL---DEFEMYPSRIPDLSSESPLIVSGR 545
++ +E +M+ LF + L N A+ L + ++ P IPD+ + PL+++ R
Sbjct: 498 GSVEQVEERMRGLFAK-----LENPAVTGLSAKFSDAKADVTPGIIPDVYRDEPLVLAAR 552
Query: 546 YQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
K +L+ KG +GD + V L LQ A+ L ++ AK++I
Sbjct: 553 LD-KLAGSLEIKGRVGD-RPWSVTLPLQNAAEGKGLSKLWAKRKI 595
>gi|398831333|ref|ZP_10589512.1| marine proteobacterial sortase target protein [Phyllobacterium sp.
YR531]
gi|398212901|gb|EJM99503.1| marine proteobacterial sortase target protein [Phyllobacterium sp.
YR531]
Length = 753
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 141/293 (48%), Gaps = 21/293 (7%)
Query: 275 SNIDFDFSYTVSPSHIFGGVLLQSPSL----HDVDQREMFCMYLLPGTAKSRKVFK-KDV 329
++ D DF T +P ++P++ V + + Y+ P + K + +++
Sbjct: 305 AHADRDFELTWTPKDS------KAPAVGLFREHVGKDDFALAYITPPVIEQEKDIRPREI 358
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVE 389
+F++D SGSM G ++ K +L ALS+L D FN++ F+ L A E
Sbjct: 359 VFVIDNSGSMGGSSMDQAKASLDYALSRLKSADRFNVIRFDDTMDLLFNDTVSANSENTS 418
Query: 390 RAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS-IPIIFLVTDGAVEDERQICDAMKS 448
+A ++ N A G T + PL A+ + S I I +TDGA+ +E+Q+ DA+ +
Sbjct: 419 KARAFVN-NLQASGGTEMLEPLRAALNDPRPAEQSFIRQIVFLTDGAIGNEQQLLDAIAA 477
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS 508
+ G S R++ GIGS N+Y + A + RG + + + + +M+ LF + +
Sbjct: 478 K--RGRS---RVFMVGIGSAPNNYLMNRAAELGRGSFTSIGSGEEVNDRMRDLFAKLENP 532
Query: 509 VLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG 561
V+ N+ +M P+ +PDL PL+++ + G TL KG +G
Sbjct: 533 VVTNLVAKFSS--GNADMTPATLPDLYRGEPLVIAAKL-GAPSGTLTVKGMIG 582
>gi|157375629|ref|YP_001474229.1| vault protein inter-alpha-trypsin subunit [Shewanella sediminis
HAW-EB3]
gi|157318003|gb|ABV37101.1| vault protein inter-alpha-trypsin domain protein [Shewanella
sediminis HAW-EB3]
Length = 770
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 157/340 (46%), Gaps = 30/340 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
M L P KSR +++I ++D SGSM G +E K A+ AL+ L D+FN++ F
Sbjct: 358 LVMLLPPSLEKSRNRVSRELILVIDTSGSMSGSAMEQAKKAMKYALAGLGSDDTFNVIEF 417
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT---------- 419
N + S A+ + +E A++++ + + G T + L A+ +
Sbjct: 418 NSKVSSLSKGPIPASTKNIEMANRFVH-SLTSDGGTEMALALEHALGQESGGSSWQETGL 476
Query: 420 -----NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
S + + +TDGAV +E ++ +K R+ R++T GIGS N +F+
Sbjct: 477 QGKDEESTSRLRQVLFMTDGAVGNEAELFKLIKYRIGKS-----RLFTLGIGSAPNSHFM 531
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID-TLKDLDEFEMYPSRIPD 533
+ A RG + DLD ++ ++Q L + + +I + T + +F +P+ IPD
Sbjct: 532 QRAAEFGRGTFTYIGDLDEVQEKIQGLLYKIEHPQITDIELHYTDGTIPDF--WPATIPD 589
Query: 534 LSSESPLIV-----SGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQ 588
L +E PL+V S +Y + + G V L + LD I A +Q
Sbjct: 590 LYAEEPLLVAIKMPSDKYAASQDKLVISGSIAGRYWQDAVSLDNKKTA-AGLDLIWANKQ 648
Query: 589 IDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
I L + R++++++ +++ ++ +T ++ V+
Sbjct: 649 IAALDLNKDGVNNSRVKQQITALALNYHLVSRHTSLVAVD 688
>gi|302036308|ref|YP_003796630.1| hypothetical protein NIDE0941 [Candidatus Nitrospira defluvii]
gi|300604372|emb|CBK40704.1| conserved exported protein of unknown function, contains von
Willebrand factor, type A and vault protein
inter-alpha-trypsin domain [Candidatus Nitrospira
defluvii]
Length = 712
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 97/409 (23%), Positives = 179/409 (43%), Gaps = 53/409 (12%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+PN+FT ++ I G +++I L + + + Y + F + P P Y+
Sbjct: 170 RPNLFTTSVANIGPGEHVTIELEYQETVRYENDRFQLRFPMAIGPRYIPGVPVIVEGQDP 229
Query: 220 -----TPAIKKIPKREKIHLNVNA---GTGTEV-----------LCNTSS--HHLKQLRR 258
+P ++P +I ++ GT + L S H + L+
Sbjct: 230 QGSGTSPGTDRVPDASRITPPIHQPEDGTINPLSLSLSLHPGFPLARVESPFHPIISLQD 289
Query: 259 DVGKLGYSYESEVLKWSNIDFDFSYTVSP-SHIFGGVLLQSPSLHDVDQREMFCMYLLPG 317
G S + + ++ DF + +P + V + D + L+P
Sbjct: 290 QDGGYQISLREDTVP-ADRDFQLIWHPAPRTEPMATVFTEQ-----KDGTSYAMLMLVPP 343
Query: 318 TAKSRKVFK--KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY- 374
T + +D+ FI+D SGSM G +E K +L ALS+L D FNI+ FN
Sbjct: 344 TQHRETTARVPRDITFIIDRSGSMAGASIEQAKGSLTAALSRLTTQDRFNIIQFNHTVRS 403
Query: 375 LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDG 434
LF + TK ++++A ++ + A G T I L +A++ +S + I L+TDG
Sbjct: 404 LFPIPQPVTTK-SMQQAIRYTE-HLAADGGTEILPALRQALKSPQDS-ARLQQIILITDG 460
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
V +E ++ + + R+ + R++T GIGS N + +R A RG + +++ +
Sbjct: 461 QVGNEEELFELLHQRVGSR-----RLFTIGIGSTPNSHLMRKAAETGRGTFTYIGNVNEV 515
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
+ ++ LF + VL +I +D E +P+ I DL P++++
Sbjct: 516 KDKLDGLFRKLEHPVLNDITLDA-TGWSGLEQFPATITDLYEGEPIVLA 563
>gi|118593261|ref|ZP_01550646.1| von Willebrand factor type A like domain [Stappia aggregata IAM
12614]
gi|118434152|gb|EAV40808.1| von Willebrand factor type A like domain [Stappia aggregata IAM
12614]
Length = 772
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 173/374 (46%), Gaps = 29/374 (7%)
Query: 276 NIDFDFSYTVSP-SHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF-KKDVIFIV 333
N DF Y ++ S + GV S ++ D F + + P + + +++++F++
Sbjct: 315 NRDFVLRYELAAQSDVAAGV----SSRYEADGGGYFSLLIEPPKLPAEDMIGQRELVFVL 370
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
D SGSM G+P+E +K + A+ L P D F I+ F+ +T F+ LAT+ ++A +
Sbjct: 371 DTSGSMSGQPIEASKTFMTAAIKALRPDDYFRILHFSNDTSQFAGQAVLATERNKQKALK 430
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
++ + AGG T I + A + + ++FL TDG + DE + ++ +R+
Sbjct: 431 FVA-DLSAGGGTEINQAVNAAFDQAQPDNTTRIVVFL-TDGYIGDEATVIKSIANRIGKA 488
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI 513
RIY FG+G+ N + L +A RGY + + L + +L +I
Sbjct: 489 -----RIYAFGVGNSVNRFLLDAMATEGRGYARYVALGEDAGEAAEGLAANLKTPLLTDI 543
Query: 514 AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKL 572
+ID L P+RIPDL + V GRY T+ G + G + +E+ L
Sbjct: 544 SID-WNGLKVEGQSPARIPDLFEGGSVRVMGRYTAGGRHTIFINGLVNGHAARLPLEIDL 602
Query: 573 QLAKD--------IPLDRICAKQQIDLLTAQAWF---SEDKRLEEKVSKMSVQTGVLCEY 621
+ + D +PL I A++QI +A+ S D +L++++ + + + +
Sbjct: 603 KSSVDGSAVSSRALPL--IWAREQI-FDKERAYTIGGSTDTQLKQEIVDLGLTYSLQTRF 659
Query: 622 TRMIIVETDERNNA 635
T + V N A
Sbjct: 660 TSFVAVSEKVVNGA 673
>gi|160940244|ref|ZP_02087589.1| hypothetical protein CLOBOL_05133 [Clostridium bolteae ATCC
BAA-613]
gi|158436824|gb|EDP14591.1| hypothetical protein CLOBOL_05133 [Clostridium bolteae ATCC
BAA-613]
Length = 683
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 130/642 (20%), Positives = 267/642 (41%), Gaps = 72/642 (11%)
Query: 44 MLLPKSPMVYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDCYLDTAF 103
ML VY D D P + +V PL+ +V ++ + +
Sbjct: 16 MLALTPAAVYGEEPDGDTADKTLAPYF--FVQDAAQAADQFPLKDTSVSSTINGVIAETY 73
Query: 104 VRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQL-------IALGE 156
V + + G+ + P + ++ G++ EI + ++ E
Sbjct: 74 VT-----QTYANEGTIPINASYVFPASARVTVHGMKMEIGNEVITAKIKEREEAKTEFEE 128
Query: 157 NDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP-FKF 215
GKSAS+ +PN+FT+ + + G + I L +++ +S RDG + P
Sbjct: 129 AKSEGKSASLLEQQ--RPNVFTMNVANVMPGDTVRIELHYTELISPRDGVYQFVFPTVTG 186
Query: 216 PEYV--------------TPAIKK-IPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDV 260
P Y +P +++ + + V+ G + TSS H ++ ++
Sbjct: 187 PRYAGSAPETGGGDEWTASPYLEEGTAPQGTYEIAVSLSAGVPISSLTSSSHKIKVEKES 246
Query: 261 GKLGYSYESEVLKWS-NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PG 317
+ + S+ +++ + DF Y ++ I G++L + ++E F M ++ P
Sbjct: 247 ESIAHVTLSDSGEFAGDRDFILDYRLTGQEINCGLML------NTGEKENFFMLMVQPPE 300
Query: 318 TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS 377
++ ++ ++ IF++D+SGSM G PL+ K + + L D FN++ F+ +
Sbjct: 301 RVRAEEIPPREYIFVLDVSGSMFGYPLDTAKELIGNLVGNLRDSDQFNLILFSDTAVSMA 360
Query: 378 TSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
AT E + +A I+ I GG T + L +AV + + R + I+ +TD
Sbjct: 361 PKSVPATAENIRQA-----IDLIERQDGGGGTELAPALEQAVSLPRDPRMARSIV-TITD 414
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
G + DE I + L ++FGIG+ N Y + +A G +
Sbjct: 415 GYMSDESSIFSLINRNLKTAD-----FFSFGIGTSVNRYLIDGIAKAGSGEAFVVTEPSQ 469
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT 553
T S V+ I + + D +++ P +P L ++ P+++ G+++G+ T
Sbjct: 470 ASDTACDFSTYIQSPVMTGINV-SFDGFDVYDVEPDILPTLFAQRPIVLFGKWRGQPSGT 528
Query: 554 LKAKGFLGDLSNFVVELKLQLAKDIPLDR-----ICAKQQIDLLTAQAWFSEDKR----- 603
++ G G+ +++ E+ + A + P D + A+ +++ LT ED +
Sbjct: 529 IRITGKTGN-QDYMQEIPVGRA-EAPADNTAIPYLWARTKVERLTDYGT-REDVQDAVRQ 585
Query: 604 --LEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+++ V+++ + ++ YT I V R+ + K+
Sbjct: 586 AMIKKTVTQLGLDYSMMTPYTSFIAVTETVRSQDGSAADVKQ 627
>gi|374578603|ref|ZP_09651699.1| marine proteobacterial sortase target protein [Bradyrhizobium sp.
WSM471]
gi|374426924|gb|EHR06457.1| marine proteobacterial sortase target protein [Bradyrhizobium sp.
WSM471]
Length = 753
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 114/467 (24%), Positives = 206/467 (44%), Gaps = 76/467 (16%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKK------ 225
+PNIFT ++ I G + +++ + + + E+S+ +P P Y I +
Sbjct: 168 RPNIFTNSVANIGPGEAVLVQIEYQEPVHQSGNEYSLRLPLVVGPRYNPAPIVQSVDFRN 227
Query: 226 ------------IPKREKIH----------------LNVNAGTGTEVLCNTSSHHLKQLR 257
+P R++I + V G + SSHH ++
Sbjct: 228 DGSGWGATNSDPVPDRDRISPPVLDPARNAPVNPTSITVRLNAGFALGEVKSSHHTVKVE 287
Query: 258 R--DVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
+ ++ + V D DF T P+ + ++PS+ +R YLL
Sbjct: 288 SPDNTTRIVTLADGAV----PADRDFELTWKPAAV------KAPSVGLFRERVGDADYLL 337
Query: 316 ----PGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
P A+ ++K ++VIF++D SGSM G + K +L ALS+L P D FN++ F
Sbjct: 338 AFVTPPAAEQATQKPRPREVIFVIDNSGSMGGTSIVQAKASLTYALSRLQPTDRFNVIRF 397
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
+ + + A E V A ++ GG+ + A + L ++ ++F
Sbjct: 398 DDTMDVLFPASVAADAERVGEATSFVSALQARGGTEMVPAMRAALTDKLGDASMVRQVVF 457
Query: 430 LVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
L TDGA+ +E+Q+ + AM+ R R++ GIGS N Y + A + RG +
Sbjct: 458 L-TDGAIGNEQQLFETITAMRGR--------SRVFMVGIGSAPNTYLMTRAAELGRGAFT 508
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVS 543
++ +E +M+ LF + L N A+ L + ++ P+ IPDL + PL+++
Sbjct: 509 HIGSVEQVEERMRGLFAK-----LENPAVTGLSAKFSEAKADVTPTIIPDLYRDEPLVLA 563
Query: 544 GRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
+ +++ KG +GD + V L LQ A+ L ++ AK++I
Sbjct: 564 AKLD-TLAGSVEIKGRVGD-RPWSVTLPLQNAAEGKGLSKLWAKRKI 608
>gi|149919202|ref|ZP_01907685.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1]
gi|149819916|gb|EDM79338.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1]
Length = 877
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 145/587 (24%), Positives = 244/587 (41%), Gaps = 69/587 (11%)
Query: 94 DVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGS------ILG---VEAEISG 144
DVD ++ +V R H + + R P+ + + I+G +E+E+
Sbjct: 119 DVDAHVHGHVAQVKLRQRFHNSF-KQPIEVRYVFPLPENAAVDDMRMIIGERTIESEVET 177
Query: 145 KSYHTQLIALGENDGAGKSASVETGSFL---KPNIFTLTLPQIDGGSYLSIRLRWSQKLS 201
K+ + A G T + L +PN+FT ++ + G + + +++ Q L+
Sbjct: 178 KAKAVERFADAREAG-------HTAALLEQERPNVFTQSVTNVAPGESVEVEVQYVQTLT 230
Query: 202 YRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVN---AGTGTEV-----LCNTSSHHL 253
G N F FP V P + H V+ G GT L T +
Sbjct: 231 QDGG----NYEFVFPMVVGPRFSPPGTSAEAHAAVSPPIVGEGTRTGHDVSLSMTVAAGG 286
Query: 254 KQLRRD------VGKL---GYSYE-SEVLKWSNIDFDFSYTVSPSHI-----FGGVLLQS 298
K R D VG G++ ++ N DF + + + FG L QS
Sbjct: 287 KVQRWDSPTHTVVGSETSDGFALRLADQKTLPNRDFVVRWQSTAAQAKATAYFGPQLSQS 346
Query: 299 PSLHDVDQREMFCMYLLPGTAK-SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
+ Q F + + P + V ++++IF++D SGSM G PL K L ALS
Sbjct: 347 VAGA---QPGHFTLVVEPPQSDLDSLVGQREMIFVIDRSGSMSGVPLALAKQTLREALSH 403
Query: 358 LDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
L P D+FN+++F T + + A ++ + A ++I AGG T + + A+
Sbjct: 404 LRPVDTFNVISFESSTAMLYEAAVPANEQNLVHAERFID-GLQAGGGTMMSGAVDAALSP 462
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKS--RLTNGGSICPRIYTFGIGSYCNHYFLR 475
G +F VTDG + +E +I + R + R++ GIGS N L
Sbjct: 463 EIG-LGRHRYVFFVTDGFISNEDEIARQASALVRAADKAGQRARVFGMGIGSSPNRELLA 521
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPS--RIPD 533
L+ +G Y A + + ++ S+VL +I ID L ++PS +PD
Sbjct: 522 SLSKAGKGRYLAVGNREHPREAVEAYTRMVDSAVLTDIHID-WDGLPVQAVFPSADTLPD 580
Query: 534 LSSESPLIVSGRYQGKF-PDTL-KAKGFLGDL---SNFVVELKLQLAKDIPLDR----IC 584
L + + GRY P L +A+ L + VEL + +A DR +
Sbjct: 581 LYASHSTVWVGRYDDSLDPAQLAQAQPVLRATVAGTQTTVELPITVAASPEDDRTLATLW 640
Query: 585 AKQQIDLLTAQAW--FSEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
A++QI LTA+ + + LE V+ + + ++ YT +I V+T
Sbjct: 641 AREQIGDLTARHYDGVLDHAELERHVTDLGLSYHLVTAYTSLIAVDT 687
>gi|442317766|ref|YP_007357787.1| von Willebrand factor type A domain-containing protein [Myxococcus
stipitatus DSM 14675]
gi|441485408|gb|AGC42103.1| von Willebrand factor type A domain-containing protein [Myxococcus
stipitatus DSM 14675]
Length = 1024
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 147/311 (47%), Gaps = 21/311 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K++VIF+VD+SGSM G L K AL + L L D FNI+AF + F
Sbjct: 439 VPKQEVIFLVDVSGSMDGDSLPQAKAALRLCLRHLRETDRFNIIAFESNYHSFRPQPVPF 498
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
T+ +E A +W+ A G T + PLT A + ++ L+TDG V +E QI
Sbjct: 499 TQRTLEDADRWVDA-LQANGGTELLKPLTFAARTAPDG-----VLVLLTDGQVGNEEQIL 552
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
A+ + R+Y+FGIG+ + LR LA +RG + + I+ ++ F+
Sbjct: 553 RAVLAERKTA-----RVYSFGIGTNVSDALLRDLARQTRGDVEFIHPGERIDEKVVAQFS 607
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ----GKFPDTLKAKGF 559
R + + + + ++ E+ P+ +P + P + GRY GK TL+ +
Sbjct: 608 RALAPRVTELQV-YFDGIEGVELAPAELPVMVDGIPWTLMGRYPRPGFGKV--TLRGRSG 664
Query: 560 LGDLSNFVVELKLQLAKDIP-LDRICAKQQID-LLTAQAWFSEDKRLEEKVSKMSVQTGV 617
+ + V + D P ++++ A ++I L A +++++ +++V +
Sbjct: 665 VEPFA-LSVNVHFPHESDRPAVEKLWAAERIKGWLDASLTGRRADAMKKRIVELAVSHQL 723
Query: 618 LCEYTRMIIVE 628
+ YT ++VE
Sbjct: 724 VTRYTSFVVVE 734
>gi|153953715|ref|YP_001394480.1| hypothetical protein CKL_1090 [Clostridium kluyveri DSM 555]
gi|219854335|ref|YP_002471457.1| hypothetical protein CKR_0992 [Clostridium kluyveri NBRC 12016]
gi|146346596|gb|EDK33132.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
gi|219568059|dbj|BAH06043.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
Length = 805
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 124/539 (23%), Positives = 225/539 (41%), Gaps = 55/539 (10%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGENDGA---GKSASVETGSFLK-----PNIFTL 179
P+ I G EAEI G++ + E D A ++A + GS P+ F +
Sbjct: 54 PIPVTAVISGFEAEIGGRTLKA---IVEEKDKALQVYENARIRGGSIFSLEEFSPHFFKI 110
Query: 180 TLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPF-----------KFPEYVTPAIKKIPK 228
++ +I G + I+L + +L Y+D F + +P + E + + ++
Sbjct: 111 SIGKIISGEIVKIKLSYIDELEYKDSTFKLTIPAISEPKIIRSKSRIEELRSNLVNRLGI 170
Query: 229 RE------KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFS 282
++ + +N+ + ++V H +K R + E + + +D +F
Sbjct: 171 KKGRDEDFEFKVNIIVESLSKVNFRCPYHKIKVEREGDTVAKITLEED---YYLMDKEFI 227
Query: 283 YTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGK 342
+ + + Q+ + + ++P + K++ IF++DIS +M+G+
Sbjct: 228 LFIKERETLEADGMIYEYKENDRQKGIVYIRMIPKLDPYEEEIKENYIFLIDISDTMKGE 287
Query: 343 PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS-MELATKEAVERAHQWIGINFIA 401
LE KNAL + + L GD+F+I+A FS M ++++ +A +WI N
Sbjct: 288 KLEQAKNALQLCIRNLSKGDTFDIIAMGVNLIDFSKDGMIEFDQDSLRKASKWID-NLDT 346
Query: 402 GGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIY 461
+I + ++E G+ I L TD V+DE I +K + + RI+
Sbjct: 347 EEDADIFGAIRYSLE----KEGNKNTILLFTDDLVDDEENILAYVKENIGDN-----RIF 397
Query: 462 TFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS-IEIQMQKLFTRGFSSVLANIAIDTLKD 520
TFGI S N+YFL LA S G D+D IE + + F R + + NI ID +
Sbjct: 398 TFGIDSSANNYFLNKLAHESCG-KAEFIDIDERIEDIVLRQFNRIQNPQVHNIEID-WGE 455
Query: 521 LDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG--------DLSNFVVELKL 572
L YP I + P + G+ + KG + D+ NF E
Sbjct: 456 LKVKNSYPRTIEYMYDREPFSIFANVLGEVGGQIVLKGNVDGKEYIQKVDIDNFNTEENA 515
Query: 573 QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDE 631
L K I + +I++ + E R +K+ ++S ++ T I++E E
Sbjct: 516 NLLKKIWAKKRIKSLEINMRAERGTIRESMR--KKIIELSKDNKLISSETTFILMELRE 572
>gi|430742747|ref|YP_007201876.1| hypothetical protein Sinac_1816 [Singulisphaera acidiphila DSM
18658]
gi|430014467|gb|AGA26181.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Singulisphaera acidiphila DSM 18658]
Length = 780
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 142/578 (24%), Positives = 250/578 (43%), Gaps = 53/578 (9%)
Query: 77 RCDPPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSIL 136
R + P ++ V +D A V+VS T+ H GS + P+ ++G+I
Sbjct: 33 RPNHPIARSYEIRDVTVDAKVRDQVAEVQVSQTF--HNP-GSTVLESEYYFPLPEEGAIQ 89
Query: 137 GVEAEISGKSYHTQLIALGENDGAGKSASVETG------SFLKPNIFTLTLPQIDGGSYL 190
+ GK +L+ E + V T ++ +F ++ I G+
Sbjct: 90 NFVLMVDGKELPGRLLPKDEARRIYEEI-VRTKRDPALLEYMGRGLFRTSVFPIPPGADR 148
Query: 191 SIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSS 250
+ +R++Q L RD + V F +P K +R + L + + + L + +S
Sbjct: 149 KVTMRYTQ-LCKRDRDL---VEFAYPFNTQKFTAKPIQRLTLSLQIQSRDAIKSLYSPTS 204
Query: 251 HHLKQLRRDVGKLGYSYESEVLKWSNI----DFDFSYTVSPSHIFGGVLLQSPSLHDVDQ 306
+RRD G +L+ ++ DF Y+++ + V+ PS +
Sbjct: 205 D--TTIRRD----GDHDARVLLEQRDVVPKNDFRLVYSLAEGALGATVMSYKPS---GSE 255
Query: 307 REMFCMYLLPGTAKSR-KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
F M P + K K VIF++D SGSM GK +E + AL L+ L D FN
Sbjct: 256 DGYFLMLASPQVKPTDVKPVPKTVIFVLDRSGSMAGKKIEQARKALQFVLNNLRDDDLFN 315
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
IV ++ F ++ T + A +++ N GG+TNI A L A+ M+ +
Sbjct: 316 IVVYDDRVETFQPELQRYTSKTRAEADRFVD-NIREGGATNIDAALKSALGMIHDDSRPS 374
Query: 426 PIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
++FL TDG + ++ A +R N RI+ FG+G N R+L +S G
Sbjct: 375 YVLFL-TDGLPTAGETGEMAIAEGARKAN--REHARIFAFGVGYDVNS---RLLDRLSGG 428
Query: 484 YYGAAYDL---DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
G + + D IE + + + + S VL +I I+ L D D YP +PDL L
Sbjct: 429 NSGTSEYVKPDDDIESHVSRFYAKMTSPVLTDIRIE-LTDTDVNRTYPRDLPDLFEGGQL 487
Query: 541 IVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAK-------DIPLDRICAKQQIDLL 592
+ GRY+ T++ G G+ +F E +LA D ++R+ A +++ +
Sbjct: 488 VWVGRYRQAGRSTIRLSGRAGGERRSF--EFPAELASTDRGTAFDF-VERVWAMRRVGHI 544
Query: 593 TAQAWF-SEDKRLEEKVSKMSVQTGVLCEYTRMIIVET 629
+ ++K L +++ +S + G++ YT + E+
Sbjct: 545 IDEIDLRGQNKELIDELVSLSTKYGIMTPYTSFLADES 582
>gi|27376088|ref|NP_767617.1| hypothetical protein bll0977 [Bradyrhizobium japonicum USDA 110]
gi|27349227|dbj|BAC46242.1| bll0977 [Bradyrhizobium japonicum USDA 110]
Length = 754
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 112/473 (23%), Positives = 200/473 (42%), Gaps = 88/473 (18%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA---------- 222
+PNIFT ++ I G + +++ + + + E+S+ VP PA
Sbjct: 168 RPNIFTNSVANIGPGETVLVQIEYQEPVHQSGNEYSLRVPLVVAPRYNPAPIVQSVDFRN 227
Query: 223 ---------IKKIPKREKIHLNV-----NAGTGTE----------VLCNTSSHHLKQLRR 258
+P RE+I V NA L SHH
Sbjct: 228 DGSGWGAATSDPVPDRERISPTVLDPARNAPANPTSITVRLKAGFALGEVKSHH-----H 282
Query: 259 DVGKLGYSYESEVLKWSN----IDFDFSYTVSPSHIFGGVLLQSPSL----HDVDQREMF 310
V + ++ ++ D DF T P+ ++PS+ V +
Sbjct: 283 SVKIESPDNATRIVTLADGAVPADRDFELTWKPAA------QKAPSVGLFREHVGDADYL 336
Query: 311 CMYLLPGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
++ P +A+ ++K ++V+F++D SGSM G + K +L AL +L P D FN++
Sbjct: 337 LAFVTPPSAEQATQKPLPREVVFVIDNSGSMGGTSIVQAKASLLYALGRLQPADRFNVIR 396
Query: 369 FNGET-YLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSRG 423
F+ LF S+ + AH +F+ A G T + + A+ G
Sbjct: 397 FDDTMDVLFPASV------PADAAHVGEATSFVSALQARGGTEMVPAMRAALTDKIGDTG 450
Query: 424 SIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ + +TDGA+ +E+Q+ + AM+ R R++ GIGS N Y + + +
Sbjct: 451 MVRQVVFLTDGAIGNEQQLFETITAMRGR--------SRVFMVGIGSAPNTYLMTRASEL 502
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSE 537
RG + ++ +E +M+ LF + L N A+ L ++ P+ IPD+ +
Sbjct: 503 GRGAFTHIGSVEQVEERMRGLFAK-----LENPAVTGLSAKFSEATADVTPAIIPDVYRD 557
Query: 538 SPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
PL+++ R K +L+ KG +GD + V L LQ A+ L ++ A+++I
Sbjct: 558 EPLVLAARLD-KLAGSLEIKGRVGD-RPWSVTLPLQNAAEGKGLSKLWARRKI 608
>gi|310824614|ref|YP_003956972.1| vault protein, inter-alpha-trypsin domain-containing protein
[Stigmatella aurantiaca DW4/3-1]
gi|309397686|gb|ADO75145.1| Vault protein, inter-alpha-trypsin domain protein [Stigmatella
aurantiaca DW4/3-1]
Length = 749
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 121/569 (21%), Positives = 236/569 (41%), Gaps = 35/569 (6%)
Query: 80 PPSLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVE 139
PP+ PL + + + VD TA RV T+ G + P+ ++
Sbjct: 29 PPNARPLAIKSQRVKVDIQDGTAVTRVEQTF---LNEGPTQLEAHYIFPLPKGAALSEFY 85
Query: 140 AEISGKSYHTQLIALGENDGAGKSASVETG-----------SFLKPNIFTLTLPQIDGGS 188
I+G+ +L+ E D K+ + G ++ ++F + + +
Sbjct: 86 LWINGQKTKGELL---ERD---KATHIYEGIVRRLADPGLLEYVDSDVFRVRVFPVPARG 139
Query: 189 YLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA-IKKIPKREKIHLNVNAGTGTEVLCN 247
+ L +SQ L+Y G + + P PA + + K +A ++V
Sbjct: 140 EQKVELTFSQVLNYSAGLYHYHYPLGATAKGQPADWRLVGGSAKNDFTFSAKVSSKVPLK 199
Query: 248 T---SSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDV 304
+ +H + RR + E + D D ++VS + G LL +
Sbjct: 200 SIYSPTHPMDVSRRGEAEAVVGLEQVNGADLSKDLDLYFSVSDKAV-GLSLL---TYKQA 255
Query: 305 DQREMFCMYLLPGTAKS-RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
D+ F + P T S ++ K V F++D SGSMQG ++ K+AL +++L+P D+
Sbjct: 256 DEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQGSRMQIAKDALKYCVTRLNPQDT 315
Query: 364 FNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG 423
FN+V F+ + +++ A E +++A ++ A G T I L + ++
Sbjct: 316 FNVVRFSTDVEALFPALKSAQPENIQKAVAFVE-QLEAIGGTAIDEALVRGLQDNDGKSS 374
Query: 424 SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ ++ +TDG A+ +G R++TFG+G N L L+ G
Sbjct: 375 APHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAG 434
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS 543
D E ++ + + S+ + + L +D +++YP ++PDL + L+V
Sbjct: 435 TSDFVRDGKEFETKISSFYDK-VSNPVLSDLSLDLSSIDAYDLYPRKLPDLFKGTQLVVM 493
Query: 544 GRYQGKFPDTLKAKGFL-GDLSNF--VVELKLQLAKDIPLDRICAKQQIDLLTAQAWF-S 599
GRY+ G++ G+ F + +D + R+ A +++ L +
Sbjct: 494 GRYRKSGDAKAVLTGYVNGEKRTFDYGTTAPKEATRDGFIPRLWAIRKVGFLLEEIRLRG 553
Query: 600 EDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
E L ++V ++ + G++ YT ++VE
Sbjct: 554 ERPELRDEVISLAKKFGIVTPYTSYLVVE 582
>gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina
NCIMB 400]
gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella
frigidimarina NCIMB 400]
Length = 722
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 110/437 (25%), Positives = 190/437 (43%), Gaps = 70/437 (16%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
GK AS+ + N+FT + + L + + + QK+ YRDG FS+ P
Sbjct: 141 GKQASLLQQQ--RRNLFTSDVANLGPHEQLVVEISYQQKVEYRDGLFSLRFPLAITPRYN 198
Query: 221 PAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLK-------QLRRDVGKLGYSYES--EV 271
P + ++ + + +A T T S+ H++ Q+ D G S +S
Sbjct: 199 PQADRTTEQPLLAMPSSANTAT------SAKHVRPALDVKMQVNIDAGFELTSLDSLYHP 252
Query: 272 LKWSNIDFDFSYTVSPSHI------------FGGV-----LLQSPSLHDVD--------- 305
+K SN+ +S + I G V Q+ H D
Sbjct: 253 IKQSNVGNHYSVNFAGKQIADRDFVLQWQANVGAVPKAATFYQTGKTHLADNSDERSETA 312
Query: 306 QRE-------MFCMYLL--PGTAKSRK-VFKKDVIFIVDISGSMQGKPLEDTKNALAVAL 355
QR+ M+ + +L P S + + +++I ++D SGSM G+ + K AL AL
Sbjct: 313 QRQPNPVDNNMYSLVMLMPPSVEVSEQHLIARELILVIDTSGSMSGQSITQAKQALQFAL 372
Query: 356 SKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
+ L DSFNI+ FN + + S + A + +A+++I + A G T + + L A+
Sbjct: 373 AGLRDIDSFNIIEFNSDVTMLSATPLSANSRNIGKANRFIQ-SLDADGGTEMRSALQTAL 431
Query: 416 ---------EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
+ +S +IF+ TDGAV +E ++ + +L + R++T GIG
Sbjct: 432 VDSVQQDSDQTDAHSEMLRQVIFM-TDGAVGNEHELYQLINDQLGDS-----RLFTVGIG 485
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEM 526
S N F+R A + RG + + ++ ++++L + VL NI + L D +
Sbjct: 486 SAPNSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKIEQPVLTNIGLYYL-DGSVPDY 544
Query: 527 YPSRIPDLSSESPLIVS 543
+P+ I DL PL VS
Sbjct: 545 WPTTISDLYQNEPLWVS 561
>gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365]
gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365]
Length = 775
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 122/531 (22%), Positives = 223/531 (41%), Gaps = 64/531 (12%)
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNV 236
F+ L I+ + ++W+Q L S VP + V ++ E+ V
Sbjct: 134 FSFRLFPIEASERKPVEVQWTQWLRREGQRVSYRVPVGHAQSV---VQLAIDDERRITEV 190
Query: 237 NAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLL 296
+ T T L + H++ G G + Y V GG
Sbjct: 191 KSPTHTLALEGVGTEHVRVRATGRGSAGA-------------LELHYRVD-----GGDWA 232
Query: 297 QSPSLHDVDQREMFCMYLL---PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
S +H + + + L PG A + ++ KDV ++D SGSM G PL K+A
Sbjct: 233 LSAFVHRDAGHDAYLIANLATPPGLAAT-EIADKDVTLVLDRSGSMSGAPLARAKDAAKA 291
Query: 354 ALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT 412
+++L GD N++AF +G LF + + + E +A ++I GG T++ L
Sbjct: 292 VVARLGDGDRVNVMAFDDGVDALFLRPVPI-SAERRSQAVEYID-RLSDGGGTDLAGALA 349
Query: 413 KAVEMLTNSR-----GSIP-IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
+A++ S GS P +I +TDG + + + ++ G + R++T G+G
Sbjct: 350 EALDAQHPSESEADTGSRPHVILFLTDGQSDSQATL------QVARGDAGDARVFTIGVG 403
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL---KDLDE 523
L LA RG + IE ++ +L++ + VL ++A++ D+
Sbjct: 404 DGVEKPLLARLASEKRGRFTFIASPSEIERKVSRLYSEIAAPVLVDLAVEVTGGDADVRL 463
Query: 524 FEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLS---NFVVELKLQLAKDIPL 580
YP +PDL L+++GR +G P TL +G G V+++ Q+ + +
Sbjct: 464 SRRYPRSVPDLYRGDELVITGRVRGDGPLTLSLRGERGGEPVAFTRVIDVPAQITRPW-V 522
Query: 581 DRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESP 639
R+ A+ ++ DLL A F E L ++V ++V + +YT + + E +++
Sbjct: 523 GRMWAQARVDDLLEEIALFGETDELRDEVISLAVAYNFVTQYTSFLAIPESELTDSARD- 581
Query: 640 GTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTATAENFPPGSEIPKL 690
+ ++ Q+IL + + ++ N PPG I K+
Sbjct: 582 --QLADARAQKQQILAAHKDAVALSRS-------------NMPPGDPILKM 617
>gi|410951423|ref|XP_003982396.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Felis
catus]
Length = 856
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/484 (23%), Positives = 205/484 (42%), Gaps = 74/484 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM G+ ++ T+ AL L L P D FN+++F
Sbjct: 258 FVHYFAP---EGLPTIPKNVIFVIDKSGSMLGRKIQQTREALIKILDDLSPRDQFNLISF 314
Query: 370 NGETYLFSTSMELATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+G+ + + A+ E V +A +++ GI A G TNI + AV++L+N+
Sbjct: 315 SGDATQWKPLLVPASAEDVNQARRYVAGIQ--AQGGTNINDAMLMAVQLLSNANRQELLP 372
Query: 423 -GSIPIIFLVTDG--------AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
GS+ +I L+TDG ++ + +A+ + + ++ G G ++ F
Sbjct: 373 AGSVSLIILLTDGDPTVGETNPARIQKNVQEAIAGQYS--------LFCLGFGFDVSYAF 424
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
L +A+ + G Y+ +Q+Q + + +L + + + E E+
Sbjct: 425 LEKMALDNGGLARRIYEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVSQDNFRL 483
Query: 534 LSSESPLIVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP------------ 579
L S ++V+G+ Q + PD L AK G L + N + ++++A+
Sbjct: 484 LFKGSEIVVAGKLQDQSPDVLSAKVRGHL-HMQNITFQTEIRVAEKEEEFQSPKYIFHSF 542
Query: 580 LDRICAKQQIDLLTAQAWFSED---KRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNN 634
++R+ A I L + + D + LE + +S+Q + T M++ E +ER+
Sbjct: 543 MERLWAYLTIQQLLEKMILASDAEKQALETRALNLSLQYSFVTPLTSMVVTKPEGEERSE 602
Query: 635 ASESPGTKKGSKKSDHQKILDSEVPKTIMLQ--NLGIGFGDLTATAENFPPGSEIPKLPE 692
+E P G +S P ++ Q +G LT A+ P P +P
Sbjct: 603 VAEKPVETGGC---------ESRTPASLAAQFHTMGDRRSRLTEPAKTDSPPGTTPAVPA 653
Query: 693 VAEIFVKAASNCCSTLGNKCCCLCCIEC------CNKLN--DQCVIVLTQLCTALACFGC 744
+ A TL + C++ N L+ DQ V V Q T A F
Sbjct: 654 PIQ-----APPVILTLPGQSVNPLCVDIKHSQGPMNLLSDPDQGVEVTGQYETEKAQFSW 708
Query: 745 FECC 748
E
Sbjct: 709 IEVT 712
>gi|392552419|ref|ZP_10299556.1| inter-alpha-trypsin inhibitor [Pseudoalteromonas spongiae
UST010723-006]
Length = 686
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 143/619 (23%), Positives = 246/619 (39%), Gaps = 107/619 (17%)
Query: 88 MNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVE-------- 139
+ + LD+ AFV + ++ + ++ + P+ + ++ +E
Sbjct: 66 LKSTHLDIQVKGMLAFVELKQVYQNPAL---ETLNGSYQFPLPENSAVFAMEMMLDNRKI 122
Query: 140 -AEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQ 198
EI K + + G K+A V+ S N+F + I+ G ++++L +
Sbjct: 123 IGEIKEKQQAEVIYKKAQQQG-HKAALVKNQS---ANVFKTKVANIEPGQEVAVKLSFQF 178
Query: 199 KLSYRDGEFSVNVPFKF-PEYVT-------PAIKKIPKREKIHLNVNAGTGTE------- 243
L+Y+D FS+ +P P Y I R + N+ E
Sbjct: 179 PLTYQDEVFSLRLPTVVAPRYFNNLDLDENDPISHFQTRSQSSFQNNSLMAEEAPTLDFL 238
Query: 244 ---VLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGV------ 294
VL + ++++ + DV N+ F+ SPSH+ V
Sbjct: 239 QNTVLTSFNANNKITINFDV---------------NLGFELDSITSPSHMLSVVQTGNTQ 283
Query: 295 --LLQSPS------------------------LHDVDQREMF--CMYLLPGTAKSRKVFK 326
L+ +P H+ E F M + P +
Sbjct: 284 YQLIPAPQSLVANRDFDIKFVPLLNEQITTQVFHENINGESFYKVMMMPPKSDFVSHSLP 343
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
++++F++D SGSM G +E K AL +AL KLD D FNIVAF+ E LF AT
Sbjct: 344 RELVFVIDTSGSMAGTSMEQAKQALILALEKLDTDDLFNIVAFDHEVELFQRGSVAATYS 403
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVE-MLTNSRGSIPIIFLVTDGAVEDERQICDA 445
++ A +++ A G T I + L + + +R + + L+TDGAV D +
Sbjct: 404 QLQAAKRFVA-GLEADGGTEINSALAASFDGQFDETR--VRQVVLLTDGAVADAESVKGL 460
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+K+R + +++T IGS N + +R+LA +G +L++I ++ L +
Sbjct: 461 VKNRRADS-----KLFTIAIGSAPNSHLMRLLAKHGKGETLFINNLENIASEVSLLSDKL 515
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-QGKFPDTLKAKGFLGDLS 564
+ L NI + L E P+ DL SPL+ + Q + G G
Sbjct: 516 SNPALQNIRVLNDMGLPLAESLPT--ADLYFSSPLVAYFKLTQAAQAVQIAGDGLHGKFL 573
Query: 565 NFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRM 624
V AK I R AK+ I+ L +DK +++K+ ++ G+L YT
Sbjct: 574 KRVSLENAIAAKGIT--RAYAKEHINTLD----LVQDK---AEITKLGIRHGLLTPYTAF 624
Query: 625 IIVET---DERNNASESPG 640
I VE D N + E+P
Sbjct: 625 IAVEEYFGDVANKSFEAPN 643
>gi|386399120|ref|ZP_10083898.1| marine proteobacterial sortase target protein [Bradyrhizobium sp.
WSM1253]
gi|385739746|gb|EIG59942.1| marine proteobacterial sortase target protein [Bradyrhizobium sp.
WSM1253]
Length = 753
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 116/473 (24%), Positives = 206/473 (43%), Gaps = 86/473 (18%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKK------ 225
+PNIFT ++ I G + +++ + + + E+S+ +P P Y I +
Sbjct: 168 RPNIFTNSVANIGPGETVLVQIEYQEPVHQSGNEYSLRLPLVVGPRYNPAPIVQSVDFRN 227
Query: 226 ------------IPKREKIH------------------LNVNAGTGT-EVLCNTSSHHLK 254
+P R++I + +NAG EV S HH
Sbjct: 228 DGSGWGATNSDPVPDRDRISPPVLDPARNAPVNPTSITIRLNAGFALGEV---KSPHH-- 282
Query: 255 QLRRDVGKLGYSYESEVLKWSN----IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF 310
+V + ++ ++ D DF T P+ ++PS+ +R
Sbjct: 283 ----NVKVESPDNTTRIVTLADGAVPADRDFELTWKPAAA------KAPSVGLFRERVGD 332
Query: 311 CMYLL----PGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
YLL P A+ ++K ++VIF++D SGSM G + K +L ALS+L P D F
Sbjct: 333 TDYLLAFVTPPAAEQATQKPRPREVIFVIDNSGSMGGTSIVQAKASLTYALSRLQPTDRF 392
Query: 365 NIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS 424
N++ F+ + + A E V A ++ A G T + + A+ G
Sbjct: 393 NVIRFDDTMDVLFPASVAADAERVGEATSFVSA-LQARGGTEMVPAMRAALTDKLGDAGM 451
Query: 425 IPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+ + +TDGA+ +E+Q+ + AM+ R R++ GIGS N Y + A +
Sbjct: 452 VRQVVFLTDGAIGNEQQLFETITAMRGR--------SRVFMVGIGSAPNTYLMTRAAELG 503
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSES 538
RG + ++ IE +M+ LF + L N A+ L + ++ P+ IPD+ +
Sbjct: 504 RGAFTHIGSVEQIEERMRGLFAK-----LENPAVTGLSAKFSEAKADVTPTIIPDVYRDE 558
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQID 590
PL+++ + +++ KG +GD + V L LQ A+ L ++ AK++ID
Sbjct: 559 PLVLAAKLD-TLAGSVEIKGRVGD-RPWSVTLPLQNAAEGKGLSKLWAKRKID 609
>gi|444377375|ref|ZP_21176606.1| Inter-alpha-trypsin inhibitor domain protein [Enterovibrio sp.
AK16]
gi|443678456|gb|ELT85125.1| Inter-alpha-trypsin inhibitor domain protein [Enterovibrio sp.
AK16]
Length = 707
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 126/594 (21%), Positives = 246/594 (41%), Gaps = 91/594 (15%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E EI K + G K++ VE + N+F+ + + G + + +++
Sbjct: 114 IEGEIQKKQQARETFNRAAEQGK-KASLVEQH---RANLFSTQIANVAPGETIVVNIQYQ 169
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTPAI-------------KKIPKREKIHLN--------V 236
+ + Y DGEFS+ P F P + +P+ + H+ +
Sbjct: 170 ETVHYEDGEFSLRFPTTFTHRYIPGLPDSTALQARGDTEAPVPEEGEQHVEEVVTPLQEM 229
Query: 237 NAGTGTEVLCNTSSHHLKQLRRD---VGKLGYSYESEVLKWSNIDF-------------- 279
G ++ + RD ++ ++ E+++ IDF
Sbjct: 230 QNGWAVATRQVPDANEITPEYRDPLLEDEIAFTLEADINMGLPIDFVESTTASINVQKVS 289
Query: 280 DFSYTVSPSHIFGG----VLLQSPSLHDVDQREMFC----------MYLLPGTAKSRKVF 325
D Y+V+ S + VL P+ + +F + +P +K
Sbjct: 290 DSRYSVTLSEMEIANQDFVLKWRPAAASMPVAALFVQEKEGEKYGLLMAMPPQSKQTNSI 349
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+++ ++DISGSM G+ +E K A+ AL L+ D FN++ FN E +S+ AT
Sbjct: 350 PQNITLVLDISGSMYGESIEQAKEAVIYALHSLEADDYFNLIIFNHEARRYSSVPVKATL 409
Query: 386 EAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ--- 441
+ + + GI G L A E + G + I +TDGA+ +E +
Sbjct: 410 QNIGLVSSIVRGIEADGGTEMATALELAYASEPMA---GYLNQIVFMTDGAIGNEDELYG 466
Query: 442 -ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
I +K R R++T GIGS N F++ A+ +G + +L+ + ++
Sbjct: 467 LISKGLKDR---------RLFTVGIGSAPNSAFMQRAAVSGKGSFTHISNLNEVSTSLKP 517
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
LF + S V+ +I + + + + +P+ +PDL PL ++ + TL+ G
Sbjct: 518 LFEKLSSPVMKDITVK-WDNGEPVDSWPNPVPDLYKTQPLDLAFTIP-ENASTLQLTGIQ 575
Query: 561 GDLS-NFVVELK-LQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVL 618
D ++L+ +Q +D + A++QI+ +T S D++ E +++++ + ++
Sbjct: 576 SDAPWQLDIDLEAIQAGAATGIDVLWARKQIESITLNQSLSADEK-ENRITELGLNHHIV 634
Query: 619 CEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKT--IMLQNLGIG 670
++T + V+ + P K HQ ++ PK MLQ+ G+G
Sbjct: 635 TKHTSLAAVDNN--------PSRPLDEKAKPHQ--INPHAPKNNVAMLQS-GLG 677
>gi|421596069|ref|ZP_16039972.1| hypothetical protein BCCGELA001_03005, partial [Bradyrhizobium sp.
CCGE-LA001]
gi|404271818|gb|EJZ35595.1| hypothetical protein BCCGELA001_03005, partial [Bradyrhizobium sp.
CCGE-LA001]
Length = 669
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/470 (24%), Positives = 205/470 (43%), Gaps = 82/470 (17%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA-------IKK 225
+PNIFT +L I G + +++ + + + E+S+ +P PA +K
Sbjct: 168 RPNIFTNSLANIGPGETVLVQIEYQEPVHQSGNEYSLRLPLVVGPRYNPAPIVQSVDFRK 227
Query: 226 ------------IPKREKI---------HLNVNAGTGTE------VLCNTSSHHLKQLRR 258
+P R+ I H VN + T L SHH
Sbjct: 228 DGSGWGAINPDPVPDRDHISPPVLDPAKHAPVNPTSITVRLKAGFALGEIKSHH-----H 282
Query: 259 DVGKLGYSYESEVLKWSN----IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
+V + V+ ++ D DF T P V ++PS+ + YL
Sbjct: 283 NVKIESPDATTRVVTLADGTVAADRDFELTWKP------VAEKAPSVGLFREHVGGADYL 336
Query: 315 L-----PGTAK-SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
L P + ++K ++V+F++D SGSM G + K +LA ALS+L P D FN+V
Sbjct: 337 LALVTPPSVDQATQKPLPREVVFVIDNSGSMGGTSIVQAKASLAYALSRLQPTDRFNVVR 396
Query: 369 FNGET-YLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSRG 423
F+ LF S+ + AH +F+ A G T + + A+ G
Sbjct: 397 FDDTMDMLFPASV------PADAAHVGEATSFVSALQARGGTEMVPAMRAALTDKLGETG 450
Query: 424 SIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ + +TDGA+ +E+Q+ + AM+ R R++ GIGS N Y + A +
Sbjct: 451 LVRQVVFLTDGAIGNEQQLFETITAMRGR--------SRVFMVGIGSAPNTYLMTRAAEL 502
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
RG + ++ +E +M+ LF + + V+ + + + ++ P+ IPD+ + PL
Sbjct: 503 GRGAFTHIGSVEQVEERMRGLFAKLENPVVTGLTAKFSE--AKADVTPAIIPDVYRDEPL 560
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
+++ + K +++ KG +GD + V L L+ A+ L ++ AK++I
Sbjct: 561 VLATKLD-KLAGSVEIKGRVGD-RPWSVTLPLENAAEGKGLSKLWAKRKI 608
>gi|384214716|ref|YP_005605880.1| hypothetical protein BJ6T_10000 [Bradyrhizobium japonicum USDA 6]
gi|354953613|dbj|BAL06292.1| hypothetical protein BJ6T_10000 [Bradyrhizobium japonicum USDA 6]
Length = 754
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 118/472 (25%), Positives = 209/472 (44%), Gaps = 86/472 (18%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA---------- 222
+PNIFT ++ I G + +++ + + + E+S+ +P PA
Sbjct: 168 RPNIFTNSVANIGPGETVLVQIEYQEPVHQSGNEYSLRLPLVVGPRYNPAPIVQSVDFRK 227
Query: 223 ---------IKKIPKREKI---------HLNVNAGTGTE------VLCNTSSHHLKQLRR 258
+ +P R++I + VN + T L SHH
Sbjct: 228 DGSGWGASNSEPVPDRDRISPPVLDPARNAPVNPTSITMRLKAGFALGEVRSHH-----H 282
Query: 259 DVGKLGYSYESEVLKWSN----IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL 314
+V + ++ ++ D DF T P+ ++PS+ +R YL
Sbjct: 283 NVKVESPDNATRIVTLADGAVAADRDFELTWKPAAD------KAPSVGLFRERVGDADYL 336
Query: 315 L----PGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
L P +A+ ++K ++V+F++D SGSM G + K +L ALS+L P D FN++
Sbjct: 337 LAFVTPPSAEQTTQKPLPREVVFVIDNSGSMGGTSIVQAKASLTYALSRLQPNDRFNVIR 396
Query: 369 FNGET-YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
F+ LF S+ A V A ++ GG+ + P +A LT+ G +
Sbjct: 397 FDDTMDVLFPASVP-ADATHVGEATSFVSALQARGGTEMV--PAMRAA--LTDKAGDTSM 451
Query: 428 ---IFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+ +TDGAV +E+Q+ + AM+ R R++ GIGS N Y + A +
Sbjct: 452 VRQVVFLTDGAVGNEQQLFETITAMRGR--------SRVFMVGIGSAPNTYLMTRAAELG 503
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSES 538
RG + ++ +E +M+ LF + L N A+ L + ++ P+ IPD+ +
Sbjct: 504 RGAFTHIGSVEQVEERMRGLFAK-----LENPAVTGLSAKFSEAKADVTPAIIPDVYRDE 558
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDRICAKQQI 589
PL+++ + K +L+ KG +GD + V L LQ A+ L ++ AK++I
Sbjct: 559 PLVLAAKLD-KLAGSLEIKGRVGD-RPWSVTLPLQNAAEGKGLSKLWAKRKI 608
>gi|440803676|gb|ELR24559.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 621
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 133/634 (20%), Positives = 259/634 (40%), Gaps = 96/634 (15%)
Query: 51 MVYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDC------------Y 98
M+Y + P I +YQPY PP PL + AV ++ +
Sbjct: 1 MLYRRVPHPPI-------TYQPY-----SPPQTFPLPIKAVSIEAKVADLCASVTIRQKF 48
Query: 99 LDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGV--EAEISGKSYHTQLIALGE 156
++T + ++ + D + D I G E E + +Y A+
Sbjct: 49 VNTEETPIEAIYQFQLEDKATVSDFTAFI---DGNKIKGTIQEKEEARNTYDD---AIAS 102
Query: 157 NDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPF--K 214
GA + E +PN+FT+ + + G + + + + +L + +G+ +P +
Sbjct: 103 GHGAYRMEENEE----EPNLFTVNVGNLPPGKEVEVVITYVTELEFEEGQLKFRIPSNNQ 158
Query: 215 FPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKW 274
P Y + ++H ++ + ++KQL + + + E +
Sbjct: 159 NPSYTAAGNSPPQLKLQVHFDMTS-------------NIKQLSSPSHPISFEFGDEPTQA 205
Query: 275 SNIDFDFSYTVSPSHIFGGVLLQSPSLHD------VDQR--EMFCMYLLPGTAKSR-KVF 325
+ D + T + + VL + H+ VD++ + + L P A+ +
Sbjct: 206 TVTLGDTTKTQAQDLV---VLTKLAKPHEPCGRVEVDEKGTKTVMVSLYPKLAQDDDEDI 262
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLFSTSMELAT 384
++IFIVD SGSM G ++ K+ L + L L G FNI+ F T +LF +
Sbjct: 263 YTEMIFIVDRSGSMAGSRMQQVKDTLHIFLRSLGEGTQFNIIGFGSRTQHLFPSGSVEYN 322
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+ +E A + + F G T+I PLT+ ++ G +F++TDG V + ++
Sbjct: 323 DKNLELATKHVDQMFANLGGTDILKPLTEVLQQ-PPKEGYPRQLFILTDGEVNNTQECIQ 381
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++ + R++TFG+G+ + ++ LA G++ + +E ++ + R
Sbjct: 382 FVRKH-----ADTTRVFTFGVGNEASQDLVKGLAKSGEGFFEFVRPGEGMEEKVMRQLRR 436
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT-------LKAK 557
L ++ + T K + E P R + L+V G + D+ ++AK
Sbjct: 437 AMQPALTDVTV-TWKGATQVEPAPFRFAPVFCGGQLVVYGIVKKGQVDSDNNVEVVVQAK 495
Query: 558 GFLGDLSNFVVELKLQLAKDIP---LDRICAKQQIDLLTAQAWFSEDKR----------L 604
L F +++ L+K P L ++ AK + + + D L
Sbjct: 496 TV---LKPFEAAIQVDLSKAKPGNLLHKLAAKTLLKSIDDGRSYLHDAAGNVKSGGQAGL 552
Query: 605 EEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
+ ++S+ +GVLC++T + VE ER +A+ES
Sbjct: 553 VAEAVRISLASGVLCKHTAFVAVE--EREDATES 584
>gi|395517401|ref|XP_003762865.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like,
partial [Sarcophilus harrisii]
Length = 902
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 160/344 (46%), Gaps = 49/344 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM GK ++ T+ AL L L P D FN+V F+G + ++ A E
Sbjct: 270 KNVVFVIDKSGSMSGKKMKQTREALVKILGDLKPEDQFNLVIFDGFVTQWRPTLLQALPE 329
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG-AVED 438
VE A ++ +A G+TNI + AV ML S GS+ ++ L+TDG E
Sbjct: 330 NVEEAKKFAS-RILAMGATNINDAVLVAVNMLDESNRKEQLPAGSVSMVILLTDGDPTEG 388
Query: 439 ERQICDAMKSRLTNGGSICPRIYTF--GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
ER D K + +I R Y + G G N+ FL LA+ + G Y+ ++
Sbjct: 389 ER---DPQKIQENVKAAIGGRYYLYCLGFGFDVNYAFLEKLALENGGVARRIYEDSDADL 445
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---------SPLIVSGRYQ 547
Q+Q + +AN + TLK F+ YP +L ++ S ++V+G+ Q
Sbjct: 446 QLQDFYHE-----VANPLLTTLK----FQ-YPQNSVELLTQDNFRVFFKGSEIVVAGKLQ 495
Query: 548 GKFPDTLKAK-GFLGDLSNFVVELKLQLAKDIP------------LDRICAKQQIDLLTA 594
+ D+L A+ NF + ++ +A ++R+ A I L
Sbjct: 496 SEVQDSLSAQVKAQSHTQNFTFQTEVNVADKEKVFQSPKYIFHNFMERLWAYLTIQQLLE 555
Query: 595 QAWFS---EDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+A + E K LE + +S++ + T M++ + ++++ A
Sbjct: 556 KAVSASGDEQKNLEARALNLSLKFNFVTPLTSMVVTKPEDQDKA 599
>gi|410900340|ref|XP_003963654.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6-like
[Takifugu rubripes]
Length = 1046
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 132/285 (46%), Gaps = 27/285 (9%)
Query: 273 KWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFI 332
K + DF Y V+ + G V +HD F Y P K V KDVIF+
Sbjct: 254 KGVHADFMIQYDVALRDLMGEV-----QVHD----GYFVHYFAP---KGLPVVPKDVIFV 301
Query: 333 VDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERA 391
+D+SGSM G ++ TK A++ L+ L GD FNI+ F+ + + + + AT++ V A
Sbjct: 302 IDVSGSMIGTKIKQTKQAMSTILADLREGDHFNIITFSDQVHTWKRGRTVRATRQNVRDA 361
Query: 392 HQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS-------IPIIFLVTDGAVEDERQICD 444
+++ IA G TNI A L A +++ S +P++ +TDG D
Sbjct: 362 KEFVR-RIIAEGWTNINAALLSAAQLINPPSSSRHLASHRVPLVIFLTDGEATIGETAGD 420
Query: 445 AMKSRLTNGGSICPRIYTFGI--GSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+ LTN FG+ G + L+ LA+ +RG Y+ +Q++ +
Sbjct: 421 TI---LTNARKALGSASLFGLAFGDDADFLLLKRLALDNRGVARMVYEDADAALQLKGFY 477
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
S +L+++ + L D F++ S P+ S L+V+G+ +
Sbjct: 478 DEVASPLLSDVQLSYLDD-QAFDVTRSLFPNYFQGSELVVAGKVK 521
>gi|296269297|ref|YP_003651929.1| vault protein inter-alpha-trypsin domain-containing protein
[Thermobispora bispora DSM 43833]
gi|296092084|gb|ADG88036.1| Vault protein inter-alpha-trypsin domain protein [Thermobispora
bispora DSM 43833]
Length = 796
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 135/545 (24%), Positives = 220/545 (40%), Gaps = 64/545 (11%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEY------------- 218
+P++FT+ + I G +++RL SQ L Y DG P P Y
Sbjct: 121 RPDVFTIRVGNIMPGERVTVRLELSQPLPYEDGAAEFRFPLVVAPRYIPGSPLGAPAAGS 180
Query: 219 -VTPAIKKIPKREKIHLNV-------NAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESE 270
V P +P +I V V + + L+++R + + ++
Sbjct: 181 GVVPDTDAVPDASRITPPVLLPGFPAPVRLSFTVTLDPAGLELREIRSSLHDVAVEGDTI 240
Query: 271 VLK-WSNIDFD----FSYTVSPSHIFGGVLLQSPSLHD---VDQREMFCMYLLPGTAKSR 322
L+ +D D F+Y S GG+LL D D + +LP +
Sbjct: 241 RLRPGERLDRDVILRFAYRAS-----GGLLLVPDEPGDGAAEDGEGTLMLTVLPPDDREA 295
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSME 381
+DV+ ++D SGSM G + + A A + L D ++AF N F +
Sbjct: 296 ARRPRDVVLVLDRSGSMAGWKMVAARRAAARIVDTLTERDRLAVLAFDNVIERAFPDGLT 355
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIPIIFLVTDGAVEDE 439
AT A RA +++ A G T + APL +A+ L + G ++ LVTDG V DE
Sbjct: 356 AATDRARYRAVEFLA-RLEARGGTEMLAPLEEALTALAAAAEGGRDAVLVLVTDGQVGDE 414
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+I + M SR+ GG R++ GI N FL LA++ G D ++ M+
Sbjct: 415 DRILERMASRI--GGV---RVHAVGIDRAVNAAFLGRLAVLGAGRCELVESEDRLDEAME 469
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS---ESPLIVSGRYQGKFPDTLKA 556
+ R + A + D D E+ P + L S PL + GRY+G+ L
Sbjct: 470 HIHRR----IGAPLVTDITARPDGLEVIPGTLTHLGSLYPGVPLTLLGRYRGRPTGALTL 525
Query: 557 KGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTG 616
G + + + A + I + + L + + + D LEE++ S++ G
Sbjct: 526 AGRTASGEPWETRVTGRAAANPAARAIWVRAHLRELEDR-YAAGDTSLEERIVATSLRFG 584
Query: 617 VLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPKTIMLQNLGIGFGDLTA 676
VLC +T ++T R A +P +H+ I E P + L G + A
Sbjct: 585 VLCRFTAFAAIDT--RVTADGAP---------EHRVIQPVEPPSGWEMTELATG-APMMA 632
Query: 677 TAENF 681
T +F
Sbjct: 633 TLASF 637
>gi|395833008|ref|XP_003789539.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Otolemur
garnettii]
Length = 834
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 100/399 (25%), Positives = 174/399 (43%), Gaps = 61/399 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 260 FVHYFAP---EGLSTIPKNVVFVIDTSGSMSGRKIQQTREALIKILDDLSPKDQFNLIVF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
NG+ + S+ A+ V A + GI A G TNI + AVE+L NS
Sbjct: 317 NGKAKQWKPSLVPASVNNVNEARSFAAGIR--ARGGTNINDAMLMAVELLDNSNKEELLP 374
Query: 422 RGSIPIIFLVTDG-AVEDERQICDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
GSI +I L+TDG E E K+ R GG ++ G G ++ FL LA+
Sbjct: 375 EGSISLIILLTDGDPTEGETNPTQIQKNVREAIGGQY--NLFCLGFGFEVSYAFLEKLAL 432
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE- 537
+ G Y+ +Q+Q + + +++ +A + YPS + +++ E
Sbjct: 433 DNGGLARRIYEDSDSALQLQDFYQEVANPLMSAVAFE----------YPSNAVEEVTQEN 482
Query: 538 -------SPLIVSGRYQGKFPDTLKAK-GFLGDLSNFVVELKLQLAKDIP---------- 579
S ++V+G+ + + PD L AK N +++ +A+
Sbjct: 483 FQLFFKGSEMVVAGKLKDQSPDVLSAKVSGQTHTQNITFQMESNVAEQEEEFRGPKYIFH 542
Query: 580 --LDRICAKQQIDLLTAQAWFSEDKR---LEEKVSKMSVQTGVLCEYTRMIIV--ETDER 632
++R+ A I L Q+ + D + LE + +S+ + T M++ E E+
Sbjct: 543 NFMERLWAYLTIQQLLEQSVSAPDTKRQALETRALNLSLNYNFVTPLTSMVVTKPEGQEQ 602
Query: 633 NNASESPGTKKGSKKSDHQKILDS-------EVPKTIML 664
+ +E P + ++ H + + +VP I+L
Sbjct: 603 SQVAEKPVEDENRARNVHSETTMTTPAPAPIQVPSAILL 641
>gi|118595076|ref|ZP_01552423.1| hypothetical protein MB2181_05370 [Methylophilales bacterium
HTCC2181]
gi|118440854|gb|EAV47481.1| hypothetical protein MB2181_05370 [Methylophilales bacterium
HTCC2181]
Length = 700
Score = 99.0 bits (245), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 136/539 (25%), Positives = 236/539 (43%), Gaps = 79/539 (14%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV 219
AGK AS+ +PNIFT ++ I G ++I + + Q + + FS+ P +
Sbjct: 149 AGKKASLVEQK--RPNIFTASVANIPPGESITIAIEYQQSVLVDNDTFSIRFPMAVKDRY 206
Query: 220 TPAIKKIPKREKI------HLNVNAGTGT---EVLCNTSSHHLKQLRR--DVGKLGYSY- 267
P K ++ + H +A T T + L + H L+ + K+ SY
Sbjct: 207 VPGEKIATPKDALGTSPNTHRVTDASTVTPPSDPLVDRPIHMTIHLKPGFNTAKVASSYH 266
Query: 268 ------ESEVLKWSNI--------DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
E + ++ N+ DF+ +T P+ +P L Q++ Y
Sbjct: 267 NIHVTNEGQQNQYINLTNSAQLERDFELVWTAKPN--------LTPDLAVFTQKKGGHHY 318
Query: 314 LLPGTAKSRKV------FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
L+ A K ++VIFI+D SGSM G ++ K ALA A+ +LD GD FNI+
Sbjct: 319 LMVMAAPVNKTPSHYPKTPREVIFIIDSSGSMHGSSMDQAKQALAEAIMRLDKGDRFNII 378
Query: 368 AFNGE-TYLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSR 422
F+ + T LF+ +M ++ G+ F A G T+ PL +A++ S+
Sbjct: 379 DFDTQFTPLFTEAMPAIN------INKKSGLQFTKSLEADGGTD---PL-EAIKFAFTSK 428
Query: 423 GSIP-------IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
IP IIFL TDG V +E +I D ++ + + +T GIGS N Y +
Sbjct: 429 -KIPSQPLLRQIIFL-TDGQVSNEHEIIDTVRQYIDQD-----KFFTIGIGSAPNSYLMT 481
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
LA RG + ++ + +M +LF++ S L +I + ++ + E IPDL
Sbjct: 482 KLADYGRGAFTYIGEVKEVRTKMTELFSKLESPALTDIQLSLPSNV-QAEQANDVIPDLY 540
Query: 536 SESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLT- 593
+ L + P +L+ G D + L L+ A D +D + A+++I+ L
Sbjct: 541 AGETLTAVFKL-NTLPSSLEITGKTMD-GQYTKTLHLRDASDATGIDVLWARRKIERLMD 598
Query: 594 --AQAWFSEDKRLEE-KVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSD 649
W E+K + ++K+++ ++ ++T ++ + E TK + K D
Sbjct: 599 AYNTLWGEEEKAYAKIPITKLAIDYHLVSKFTSLVATDITPSRPGLEHLATKAVTNKMD 657
>gi|329894134|ref|ZP_08270119.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium
IMCC3088]
gi|328923306|gb|EGG30626.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium
IMCC3088]
Length = 460
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 129/269 (47%), Gaps = 14/269 (5%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMF-CMYLLPGTAKSRKVFKKDVIFIVDIS 336
D DF + P+ G L Q+ + + F + +LP + + ++V+F++D S
Sbjct: 57 DRDFLLSWRPAE---GRLPQASFFSESSELGGFGLLSVLPPSQQDWAAPPREVVFVIDTS 113
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIG 396
GSM G+ + + AL +L L P D+FNIV FN E A A+ A ++I
Sbjct: 114 GSMAGQSIVAARRALVESLKSLRPEDAFNIVEFNHEASALFAQPYPAENYALAHAIRFIR 173
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSI 456
+ A G T I A + + T+++ IIF +TDG+V +E ++ + L +
Sbjct: 174 -SLEADGGTEIEAAFDLTLALPTDAQKLRQIIF-ITDGSVSNESELLAKINRELEDR--- 228
Query: 457 CPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
R++T GIGS N YF+ A RG + ++ +E ++ +L + L +I I+
Sbjct: 229 --RLFTVGIGSSPNRYFMEEAARAGRGTFSYIANVSDVEDEIGRLLGKLSRPALTDITIE 286
Query: 517 TLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
D + PSRIPDL PL V+ R
Sbjct: 287 GEGFTD---LTPSRIPDLYFGEPLSVALR 312
>gi|393773528|ref|ZP_10361925.1| hypothetical protein WSK_2922 [Novosphingobium sp. Rr 2-17]
gi|392721010|gb|EIZ78478.1| hypothetical protein WSK_2922 [Novosphingobium sp. Rr 2-17]
Length = 710
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 117/501 (23%), Positives = 217/501 (43%), Gaps = 77/501 (15%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTP---------- 221
+PN+F+ + + G + I + + ++ +G FS+ +P P YV P
Sbjct: 151 RPNMFSTLVANVGPGETVLISIEYQAPVTQANGTFSLRLPLVVGPRYVPPHTLTGQSAVV 210
Query: 222 ---AIKKIP--------KREKIHLNVNAGTGTEVLCNTSSHHLKQLR-----RDVGKLGY 265
A+ P K + V G +V S +H + R V +L
Sbjct: 211 DAQAVTSAPVLDPKLGVKLNPTSIAVRLAPGFDVANLISRYHKVNVSGPLDDRTV-RLAE 269
Query: 266 SYESE---VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSR 322
S ++ VL+W + D + + H D+ + + P SR
Sbjct: 270 SVPADRDFVLEWRSASADPTLGLFAQH--------------TDKGDYVMASVTPPADLSR 315
Query: 323 -KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSM 380
++++F++D SGSM G +++ K +L AL L P D FN++ F+ T LF S+
Sbjct: 316 LPTPPREMVFVIDNSGSMGGASMQEAKASLIHALGTLRPQDHFNVIRFDDTMTRLFDHSV 375
Query: 381 ELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-----IIFLVTDGA 435
AT + V+ A ++ G + A G T + L A+ G+ P IIFL TDG
Sbjct: 376 A-ATPDQVDLARRFAG-SLEAQGGTEMLPALKAALADAATGGGNDPETLRQIIFL-TDGE 432
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ +ER++ A+ +GG ++ GIGS N Y + ++ I G Y +
Sbjct: 433 ISNEREMMAAIGD---DGGR--SHVFMVGIGSAPNDYLMERMSTIGGGVYTHVGAPGEVA 487
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDE---FEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+M L VL++ A+ L E ++ SR+PD+ + PL++ GR
Sbjct: 488 AKMMPLL-----DVLSHPAVRNLSVRVEGGSLDLTTSRLPDVYAGRPLVLVGRTD-HLAG 541
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKDIP---LDRICAKQQIDLLTAQAWFS--EDKRLEEK 607
TL G +G + + ++ L + P ++++ A ++I + A +D + + +
Sbjct: 542 TLTVSGTIG---GKLWQKRVDLTRAQPSPAVEKLWAHRRITDVEADRALGKLDDAKADAE 598
Query: 608 VSKMSVQTGVLCEYTRMIIVE 628
V ++ +++G++ T ++ V+
Sbjct: 599 VERIGMESGIVTSRTSLVAVD 619
>gi|170743237|ref|YP_001771892.1| cell wall anchor domain-containing protein [Methylobacterium sp.
4-46]
gi|168197511|gb|ACA19458.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium sp.
4-46]
Length = 761
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/429 (23%), Positives = 178/429 (41%), Gaps = 62/429 (14%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT----------- 220
+PN+FT ++ I G + +++ + Q + G ++ +P P Y
Sbjct: 195 RPNLFTTSVANIGPGETVLVQIAFQQPVRLSGGTHALRLPLVVAPRYSPAPGLLQPAAEG 254
Query: 221 PAIKKIPKREKI---------HLNVNAGTGTEVL--------CNTSSHHLKQLRRDVGKL 263
PA +P R +I H VN T T L +++H ++ V +
Sbjct: 255 PARDPVPDRARIAPPVLDPAVHGPVNPVTLTVTLRAGFPLGTVESATHAIR-----VEET 309
Query: 264 GYSYESEVLK----WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTA 319
G L ++ DF ++ +PS L + V + E + P
Sbjct: 310 GPDSRRVTLADGPVPADRDFALTWRAAPSAAPAVGLFR----ERVGEDEYLLAVVTPPEG 365
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLFST 378
++ ++V F++D SGSM G + K +L VAL +L P D FN++ F+ LF
Sbjct: 366 RAPARRPREVTFVIDNSGSMAGASMRQAKASLLVALDRLGPADRFNVIRFDDTMDLLFPA 425
Query: 379 SMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRG-SIPIIFLVTD 433
+ + AH+ F+A G T + PL A+ G + I +TD
Sbjct: 426 PVP------ADEAHRDAARRFVAALEARGGTEMLPPLRAALADPHPEEGDRVRQIVFLTD 479
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
GA+ +E QI A+ + G R++ GIGS N + + A + G Y A +D
Sbjct: 480 GAIGNEEQIFSAI-----SAGRGRSRLFMIGIGSAPNGHLMTHAAELGGGSYTAIGTIDQ 534
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT 553
+ + +L + S V+ ++A + E P +PDL P++++ R + + T
Sbjct: 535 VAERTAELLAKLESPVVTDLAAAFSE--PGVEATPRLLPDLYRGEPVVLAARLR-EATGT 591
Query: 554 LKAKGFLGD 562
L +G +G+
Sbjct: 592 LTLRGRIGE 600
>gi|123444851|ref|XP_001311192.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121892992|gb|EAX98262.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 698
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 146/301 (48%), Gaps = 17/301 (5%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA--TKEAV 388
F+VD SGSM GK +E+ + + + L G F I+ F + F T +E T E V
Sbjct: 248 FVVDCSGSMAGKRIENAVKCMRLFIQSLPVGCRFAILKFGSQ---FQTVLEPCDYTDENV 304
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC-DAMK 447
RA + + G T+I +PL E L G + IF +TDG V + IC A K
Sbjct: 305 ARAMKLLDTIKADMGGTDILSPLQHVSE-LKAKEGFVKQIFFLTDGEVHNPDMICATAQK 363
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+R N RI++ G+GS + ++ +A S G Y D D++ ++ ++ + S
Sbjct: 364 NRSGN------RIFSIGLGSGADPGLIKGMARKSGGNYSIIGDDDNMNEKVIEMLSSAIS 417
Query: 508 SVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFV 567
L +I I + + + EM+PS P L +++P + F D + G L + +
Sbjct: 418 PALKDITIQS--ESTQTEMWPSPCPPLFAKNPQSFIAK--SNFTDNILISGSLINEQIDI 473
Query: 568 VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV 627
V +L + L ++ ++ ID +Q ED+ +++K +S+ +GVLC+YT I V
Sbjct: 474 VIPVSRLDDNHGLRQLFSRYIIDDYESQLLIKEDEEIKKKCIDLSLSSGVLCKYTSYIGV 533
Query: 628 E 628
+
Sbjct: 534 D 534
>gi|115372062|ref|ZP_01459374.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella
aurantiaca DW4/3-1]
gi|115371027|gb|EAU69950.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella
aurantiaca DW4/3-1]
Length = 540
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 165/357 (46%), Gaps = 13/357 (3%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKS-RKVFKKDVIFIVDIS 336
D D ++VS + G LL + D+ F + P T S ++ K V F++D S
Sbjct: 24 DLDLYFSVSDKAV-GLSLL---TYKQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTS 79
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIG 396
GSMQG ++ K+AL +++L+P D+FN+V F+ + +++ A E +++A ++
Sbjct: 80 GSMQGSRMQIAKDALKYCVTRLNPQDTFNVVRFSTDVEALFPALKSAQPENIQKAVAFVE 139
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICDAMKSRLTNGGS 455
A G T I L + ++ + + S P ++ +TDG A+ +G
Sbjct: 140 -QLEAIGGTAIDEALVRGLQD-NDGKSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRK 197
Query: 456 ICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
R++TFG+G N L L+ G D E ++ + + S+ + +
Sbjct: 198 AKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDK-VSNPVLSDLS 256
Query: 516 DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNF--VVELKL 572
L +D +++YP ++PDL + L+V GRY+ G++ G+ F
Sbjct: 257 LDLSSIDAYDLYPRKLPDLFKGTQLVVMGRYRKSGDAKAVLTGYVNGEKRTFDYGTTAPK 316
Query: 573 QLAKDIPLDRICAKQQIDLLTAQAWF-SEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ +D + R+ A +++ L + E L ++V ++ + G++ YT ++VE
Sbjct: 317 EATRDGFIPRLWAIRKVGFLLEEIRLRGERPELRDEVISLAKKFGIVTPYTSYLVVE 373
>gi|398824220|ref|ZP_10582561.1| marine proteobacterial sortase target protein [Bradyrhizobium sp.
YR681]
gi|398225142|gb|EJN11423.1| marine proteobacterial sortase target protein [Bradyrhizobium sp.
YR681]
Length = 753
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 116/487 (23%), Positives = 207/487 (42%), Gaps = 85/487 (17%)
Query: 156 ENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF 215
E AG+ A++ +PNIFT ++ I G + +++ + + + + E+S+ VP
Sbjct: 153 EARRAGRKAALTEQE--RPNIFTNSVANIGPGETVLVQIEYQEPVQSGN-EYSLRVPLAV 209
Query: 216 PEYVTPA-------------------IKKIPKREKI----------------HLNVNAGT 240
PA +P R++I + V
Sbjct: 210 APRYNPAPIVQSVDFRQDGSGWGATNSNPVPDRDRIAPQVLDPAKSAPVNPTSITVRLKA 269
Query: 241 GTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
G L SHH ++ D + D DF T P+ ++PS
Sbjct: 270 GF-ALGEVKSHH-HNVKTDSPDNTTRVITLAAGAVPADRDFELTWKPAA------EKAPS 321
Query: 301 LHDVDQREMFCMYLL----PGTAK--SRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVA 354
+ +R YLL P +A+ ++K ++V+F++D SGSM G + K +L A
Sbjct: 322 VGLFRERVGDADYLLAFVTPPSAEQATQKPLPREVVFVIDNSGSMGGTSIVQAKASLTYA 381
Query: 355 LSKLDPGDSFNIVAFNGET-YLFSTSMELATKEAVERAHQWIGINFI----AGGSTNICA 409
L +L P D FN++ F+ LF S+ + AH +F+ A G T +
Sbjct: 382 LGRLQPNDRFNVIRFDDTMDVLFPASVP------ADAAHVGEATSFVSALQARGGTEMVP 435
Query: 410 PLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD---AMKSRLTNGGSICPRIYTFGIG 466
+ A+ + + +TDGA+ +E+Q+ + AM+ R R++ GIG
Sbjct: 436 AMRAALTDRIGDTNMVRQVVFLTDGAIGNEQQLFETITAMRGR--------SRVFMVGIG 487
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDL---DE 523
S N Y + A + RG + ++ +E +M+ LF + L N A+ L +
Sbjct: 488 SAPNTYLMTRAAELGRGAFTHIGSVEQVEERMRGLFAK-----LENPAVTGLSAKFSDAK 542
Query: 524 FEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-LAKDIPLDR 582
++ P+ IPD+ + PL+++ + K +L+ KG +GD + V L LQ A+ L +
Sbjct: 543 ADVTPAIIPDVYRDEPLVLAAKLD-KLAGSLEIKGRIGD-RPWSVTLPLQNAAEGKGLSK 600
Query: 583 ICAKQQI 589
+ A+++I
Sbjct: 601 LWARRKI 607
>gi|354465723|ref|XP_003495326.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Cricetulus griseus]
Length = 888
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/345 (23%), Positives = 159/345 (46%), Gaps = 32/345 (9%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++DISGSM G+ ++ T++AL L + GD N + F+ + + S+ A
Sbjct: 280 VVPKNIVFVIDISGSMAGRKIQQTRDALLKILDDMKQGDYMNFIMFSTDVTTWKDSLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAV 436
T +E A ++ + +GG TNI L + + MLT++R S II ++TDG
Sbjct: 340 TPANLEEARTFVR-SIRSGGGTNINDGLLRGIRMLTDAREQHLVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N +Y G G+ N+ FL +A+ + G Y+ +
Sbjct: 399 NTGESRPEKIQENVRNAIEGKFPLYNLGFGNNLNYNFLETMALENHGLARRIYEDSDANL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----QGKFPD 552
Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR F
Sbjct: 459 QLQGFYEEVANPLLTNVEVEYPENAI-LDLTKNNYPHFYDGSEIVVAGRLVDSDMNNFKA 517
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKDIPL-----------DRICAKQQIDLLTAQ---AWF 598
+K G L DL+ F E+ ++ D L +R+ A I+ L + A
Sbjct: 518 DVKGHGALNDLT-FTEEVDMK-EMDAALKEQGYIFGNYIERLWAYLTIEQLLEKRKNAHG 575
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
E + L + ++S++ + T M++ + +++ + ++ PG
Sbjct: 576 EEKENLTAQALELSLKYHFVTPLTSMVVTKPEDNEDQTSIADKPG 620
>gi|351710076|gb|EHB12995.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Heterocephalus
glaber]
Length = 920
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 170/374 (45%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL LS L+P D FN++ F
Sbjct: 260 FVHYFAP---EGLPTMPKNVVFVIDKSGSMNGRKIQQTREALIKILSDLNPKDQFNLIVF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+GE + S+ A++E V +A + + A G TNI + AV++L +S
Sbjct: 317 SGEATQWEPSLVPASEENVNKARSY-AASIQARGGTNINDAMLGAVQLLESSNRAELLSA 375
Query: 423 GSIPIIFLVTDG--------AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + ++ + +A+ S+ + ++ G G ++ FL
Sbjct: 376 GSVSLIILLTDGDPTVGETSPTQIQKNVQEAIASQYS--------LFCLGFGFDVSYAFL 427
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G Y+ +Q+Q + + +L +A + + E E+ +
Sbjct: 428 EKLALDNGGLARRIYEDSDSALQLQDFYQEVANPLLTEVAFEYPSNSVE-EVTQNNFQLF 486
Query: 535 SSESPLIVSGRYQGKFPDTL--KAKG--------FLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ + + PD L K +G F + S E Q K I ++
Sbjct: 487 FRGSEMVVAGKLRDQSPDVLLGKVRGQTHMENITFQAESSTAEQEGLFQSPKYIFHNFME 546
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNAS 636
R+ A QQ+ T A E K L+ + +S+ + T M++ E E++ +
Sbjct: 547 RLWAYLTIQQLLEKTVSASDDESKALKTRALNLSLSYNFVTPLTSMVVTKPEGQEQSQVA 606
Query: 637 ESPGTKKGSKKSDH 650
P + +++ H
Sbjct: 607 NKPVEEDKRRENIH 620
>gi|95930867|ref|ZP_01313598.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684]
gi|95133109|gb|EAT14777.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684]
Length = 698
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 94/313 (30%), Positives = 142/313 (45%), Gaps = 23/313 (7%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D F++D+SGSM G + + ++ L KL+ D F I+ FN + AT EA
Sbjct: 309 DWTFVLDVSGSMDGHKIATLADGVSQTLGKLNSNDRFRIITFNQSAADLTRGFVTATPEA 368
Query: 388 VERAHQWIGI--NFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
V QWI AGGSTN+ A L A L + R + I LVTDG R
Sbjct: 369 V---GQWINRVKTIAAGGSTNLFAGLETACRRLDDDRTT--SIVLVTDGVANVGRTEQRE 423
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE---IQMQ-KL 501
LT R++TF IG+ N L LA S G+ D+D I+ IQ + K+
Sbjct: 424 FLQLLT---EYDVRLFTFVIGNSANRPLLDRLAKDSGGFAMQISDVDDIQGRLIQAKAKV 480
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL- 560
V I + + DL P+ I +L L++ G Y G T+K K +
Sbjct: 481 LYEKMHDVRIEIDGEKVDDL-----TPATIGNLYLGQQLVLFGHYHGSGEVTIKMKARIS 535
Query: 561 GDLSNFVVELKL-QLAKDIP-LDRICAKQQIDLLTAQAW-FSEDKRLEEKVSKMSVQTGV 617
G+ + L ++A D P L+R+ A +ID L A+ E ++L ++ + V +
Sbjct: 536 GEEKEWRCTAMLPEVATDNPELERLWALSRIDELMAEVRELGETEQLTSQIVDLGVHYSL 595
Query: 618 LCEYTRMIIVETD 630
+ +YT M+++ D
Sbjct: 596 VTDYTSMLVLNDD 608
>gi|183983524|ref|YP_001851815.1| hypothetical protein MMAR_3543 [Mycobacterium marinum M]
gi|183176850|gb|ACC41960.1| conserved membrane protein [Mycobacterium marinum M]
Length = 983
Score = 95.9 bits (237), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 125/561 (22%), Positives = 233/561 (41%), Gaps = 89/561 (15%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLI-------ALGENDGAGKSASVETGSFLKPNIFTLT 180
P+ D+ ++ + + G+ +L A E +G+SA + +P++FT+
Sbjct: 74 PLPDRAAVTRMRVSVDGRDVEAELREREAARRAYDEAIASGRSARILEED--RPDVFTIR 131
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+ I G +++ L L+Y DGE + F+FP V P + IP + V G
Sbjct: 132 VGNIVPGQRVTVALTIINPLTYEDGEAT----FRFPLVVAP--RYIPGEPLADIAVGDGY 185
Query: 241 G----------------------------TEVLCNTSSHHLKQLRRDVGKL-GYSYESEV 271
+V + + +L Q+R ++ + G +V
Sbjct: 186 ADDTDAVPDASRITPPMLVAGFAHSVPVTIDVGIDPAGFNLSQVRSNMPAVTGADGRIQV 245
Query: 272 LKWS--NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLP--GTAKSRKVFK 326
+ N DF + ++L + D D E + + +LP G A R
Sbjct: 246 TPGAPANRDFVLRLDYDAQELASSLVL----VPDADGDEGTYQLTVLPPAGVAAPRP--- 298
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+ ++ ++D S SM G + + A + + L D F ++ F+ + + + E
Sbjct: 299 RHLVLVLDRSRSMAGWKMTAARRAASRIVDALTSDDRFAVLTFD-DGIEYPVGLPAGLTE 357
Query: 387 AVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRGSI---PIIFLVTDGAVEDE 439
A +R H++ + +A G T + APL +A+ +L + + ++ L++DG V +E
Sbjct: 358 ASDR-HRYRAVEHLARVEARGDTEMLAPLRRALALLGREQVADTDDAVLILISDGQVGNE 416
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
Q+ + L G + R++T G+ N FLR LA + G + D ++ +
Sbjct: 417 DQLLQELSGDL---GRV--RLHTIGVDEAVNAGFLRRLAGVGGGRCVLVDNEDRLDEALP 471
Query: 500 KLFTRGFSSVLAN---------IAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF 550
+ R + LA IAI E + P+R+PDL PL+VSGRY+G
Sbjct: 472 GVIQRCARTPLATGLEVRAAGFIAI-------EDTVSPARLPDLLPGVPLVVSGRYRGSA 524
Query: 551 PDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSEDKR--LEEK 607
P +L +G +++ + + + + A+ + DL A S D R + +
Sbjct: 525 PGSLTVRGATEAGADWSITVAAERCAVPAVTAQWARAHLRDLEDGYAAASADTRDDVATR 584
Query: 608 VSKMSVQTGVLCEYTRMIIVE 628
+ S++ GVLC +T + E
Sbjct: 585 IVDTSLRFGVLCRFTAYVAGE 605
>gi|426249407|ref|XP_004018441.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Ovis
aries]
Length = 891
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 158/346 (45%), Gaps = 45/346 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + S+ A
Sbjct: 280 VVPKSVVFVIDVSGSMHGRKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A +++ ++ G TNI L + + ML +R S II ++TDG
Sbjct: 340 TPENIQEARKFV-MDIHDRGMTNINDALLRGISMLNKAREEHTVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGLARRIYEDSDAS 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EAEYPQNAILDLTQNSYQHFYDGSEIVVAGRLA 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G + DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKAAVKGHGAINDLT-FTEEVDMKEMEKALQEQDYIFGDYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS 636
+ AW E + L + ++S++ + T M++ + ++ N +
Sbjct: 568 EKRKNAWGEEKENLTAQALELSLKYHFVTPLTSMVVTKPEDNENQT 613
>gi|85707636|ref|ZP_01038702.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1]
gi|85689170|gb|EAQ29173.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1]
Length = 740
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 162/363 (44%), Gaps = 30/363 (8%)
Query: 275 SNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL-----PGTAKSRKVFKKDV 329
+N DF+ ++ S +P L QR Y++ P + + +++
Sbjct: 296 ANRDFELRWSASG---------DAPMLGLFKQRHGELEYVMATITPPALERVGEAPPREM 346
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAV 388
IF++D SGSM G+ + + +L AL L P D FN++ F+ T LF+++++ +
Sbjct: 347 IFVIDNSGSMAGESMPAARRSLLYALETLRPQDRFNVIRFDDTMTELFASAVQASDSNIA 406
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A + N +A G T + L A+ +IFL TDGA+ +E D M+
Sbjct: 407 --AAKTFTHNLMANGGTEMLPALRAALRDRAPDERVRQVIFL-TDGALSNE---ADMMEE 460
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS 508
N R++ GIGS N Y +R +A RG + + E QMQ+L R
Sbjct: 461 --INRNRKDSRVFMVGIGSAPNTYLMRRMAEAGRGTFTHVGMGEEAEDQMQRLLDRLSLP 518
Query: 509 VLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFV 567
V + + + + P +PDL + PL++ GR + TL G + G
Sbjct: 519 VATGLTANV--EGGNIDFAPRDLPDLYAGEPLVLLGRTR-HLEGTLTVSGMIEGVRWTRS 575
Query: 568 VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMI 625
++LK D+ + ++ A ++I + A+ W E +R + + ++ + ++ T ++
Sbjct: 576 IDLKDASDSDV-VAKLWASRRIAEVEAERWSGETSHERADTSIEELGMAFHLVTSRTSLV 634
Query: 626 IVE 628
V+
Sbjct: 635 AVD 637
>gi|330791224|ref|XP_003283694.1| hypothetical protein DICPUDRAFT_147402 [Dictyostelium purpureum]
gi|325086437|gb|EGC39827.1| hypothetical protein DICPUDRAFT_147402 [Dictyostelium purpureum]
Length = 885
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 150/324 (46%), Gaps = 22/324 (6%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K + IF+VD SGSM G P+ K AL + L L+ FNI F + +S + +L T
Sbjct: 311 KAEYIFVVDCSGSMSGTPISKAKRALEICLKSLNEKSKFNIYRFGSDFQTYSRTSQLYTD 370
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICD 444
E++E+A ++I G T + P+ ++ T P +F++TDG V + ++ +
Sbjct: 371 ESLEQAMEYIRQTDANLGGTELLPPIKDILK--TEYDPEYPRQVFILTDGEVSERDELIE 428
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ S RI+TFGIGS + + L+ +GYY D +E ++ KL +
Sbjct: 429 YVAKE-----SNTTRIFTFGIGSGVDKELVIGLSKACKGYYELIDDNKDMEEKVMKLVSI 483
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP-DTLKA------K 557
F L+NI +D +L + P I L + +++ G + P DT A
Sbjct: 484 SFLPTLSNIKVD-WGNLS-VDQAPLVIRPLFTNERMMIYGTLNKELPKDTTTANIRIYGN 541
Query: 558 GFLGDLSNFVVELKLQLAKDIP--LDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQ 614
G G++ +F EL + + A +QI DL + S+D + +K+ ++ +
Sbjct: 542 GPSGEIVSFTAELDFSKVDSTSRYIHTLSAFKQIQDLEEGERKQSKDNK--DKIVELGKK 599
Query: 615 TGVLCEYTRMIIVETDERNNASES 638
G++ ++T I+ E ES
Sbjct: 600 YGLVSKHTSYIVTAEGEGEVTEES 623
>gi|126723120|ref|NP_001075478.1| inter-alpha-trypsin inhibitor heavy chain4 precursor [Oryctolagus
cuniculus]
gi|11041722|dbj|BAB17303.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus]
Length = 951
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 159/355 (44%), Gaps = 45/355 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K VIFIVD SGSM G+ ++ T+ AL L L+P D FN++ F+ + TS+ A+ E
Sbjct: 274 KTVIFIVDQSGSMLGRKIQQTREALLKILDDLNPRDRFNLILFSSSATPWKTSLVQASLE 333
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-----GSIPIIFLVTDGA------ 435
V A + G AGG T+I L AV +L + + GS+ ++ L+TDG
Sbjct: 334 TVSEARSYAGAIQAAGG-TDINEALLLAVSLLDHEQEELRAGSVSLLILLTDGEPTQGKT 392
Query: 436 --VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
E +R + +A+ R + ++ G G N+ FL LA+ + G Y+
Sbjct: 393 NPTEIQRNVREAIGGRYS--------LFCLGFGFNVNYPFLEKLALDNGGLARRVYEDSD 444
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKD-LDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+Q+Q + L ++ + + ++E R+ L SE ++V+G+ + PD
Sbjct: 445 AALQLQDFYQEVAHPQLTSVTFEYSSNAVEEVTRDSFRMHFLGSE--MVVAGKLRADSPD 502
Query: 553 TLKAKGFLGDLS--NFVVELKLQLAKDIP------------LDRICAKQQIDLLTAQAWF 598
L AK G + N ++ +AK ++R+ A I L Q +
Sbjct: 503 VLAAK-VRGRVHTGNVTFYMESSVAKQEEAFRSRQYVFGSFMERLWAYLTIQQLLEQMVY 561
Query: 599 ---SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA--SESPGTKKGSKKS 648
++ + LE + +S+ + T M+I + + + A +E P K++
Sbjct: 562 APKAQQRALESRARNLSLDYSFVTPLTSMVITKPEGQGQAQVAEKPSQDDKHKQA 616
>gi|395516899|ref|XP_003762621.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4
[Sarcophilus harrisii]
Length = 606
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 76/248 (30%), Positives = 125/248 (50%), Gaps = 29/248 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ ++ T+ AL L L+P D FN+VAF+G + + A+KE
Sbjct: 266 KNVVFVIDKSGSMAGRKIKKTRAALVKILDDLNPEDHFNLVAFSGRVIQWKPMLLQASKE 325
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG-AVED 438
+E A ++ N A G+TNI + AV ML S GS+ ++ L+TDG E
Sbjct: 326 HLESAKTFLS-NTPALGATNINEAILIAVRMLDESDRKQQLPAGSVSMVILLTDGDPTEG 384
Query: 439 E---RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
E ++I + +K+ L NG Y G G N+ FL LA+ + G YD +
Sbjct: 385 ETKLQKIQENVKA-LMNGRY---HFYCLGFGFNINYAFLEKLALDNGGVARRIYDDSDSD 440
Query: 496 IQMQKLFTRGFSSVLANIAI----DTLKDL--DEFEMYPSRIPDLSSESPLIVSGRYQGK 549
+Q+Q + + +L + D ++ L D F ++ +S L+V+G+ + +
Sbjct: 441 LQLQDFYQEVANPLLTKVEFHYPDDAVEQLTQDSFRVF-------FKDSELVVAGKLRSQ 493
Query: 550 FPDTLKAK 557
D L A+
Sbjct: 494 VLDLLSAQ 501
>gi|148692823|gb|EDL24770.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_c [Mus
musculus]
Length = 927
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 275 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 334
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 335 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 393
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 394 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 450
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 451 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 509
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 510 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 567
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 568 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 610
>gi|338714655|ref|XP_001915876.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Equus
caballus]
Length = 824
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 162/373 (43%), Gaps = 42/373 (11%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM G+ ++ T+ AL + L P D FN+V F
Sbjct: 247 FVHYFAP---EGLPTIPKNVIFVIDQSGSMAGRKIQQTREALIKIVDDLGPKDQFNLVCF 303
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSR--- 422
N E + S+ A+ E ++ A NF AG G TNI + AV++L +
Sbjct: 304 NEEATQWKPSLVPASAENMKEAR-----NFAAGIMARGGTNINDAVLLAVQLLERANKQE 358
Query: 423 ----GSIPIIFLVTDG-AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG E + K+ C ++ G G + ++ FL L
Sbjct: 359 LLPAGSVSLIILLTDGDPTVGETNRANIQKNVQEAISGQC-SLFCLGFGFHVSYAFLEKL 417
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G Y+ +Q+Q + + +L +A + + E E+ L
Sbjct: 418 ALDNGGLARRIYEDSDSALQLQDFYQEVANPLLMKVAFEYPSNAVE-EVTQDNFRLLFKG 476
Query: 538 SPLIVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP------------LDRI 583
S ++V+G+ + K PD L AK G L L N + + +A+ ++R+
Sbjct: 477 SEMVVAGKLRDKGPDVLSAKVRGQL-HLQNITFQTEASVAEKEEEFRSPKYIFHSFMERL 535
Query: 584 CAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASES 638
A QQ+ A SE + LE + +S+ + T M++ E E + +E
Sbjct: 536 WAYLTIQQLLERAVSASDSEKQALETRALSLSLNYSFVTPLTSMVVTKPEGQEESRVAEK 595
Query: 639 PGTKKGSKKSDHQ 651
P + ++ H
Sbjct: 596 PMENENRHRNIHH 608
>gi|226531069|ref|NP_001152771.1| inter alpha-trypsin inhibitor, heavy chain 4 isoform 2 precursor
[Mus musculus]
Length = 941
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 507
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 508 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 565
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 566 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 608
>gi|148692824|gb|EDL24771.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_d [Mus
musculus]
Length = 943
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 275 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 334
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 335 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 393
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 394 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 450
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 451 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 509
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 510 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 567
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 568 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 610
>gi|226531047|ref|NP_061216.2| inter alpha-trypsin inhibitor, heavy chain 4 isoform 1 precursor
[Mus musculus]
gi|443286784|sp|A6X935.2|ITIH4_MOUSE RecName: Full=Inter alpha-trypsin inhibitor, heavy chain 4;
Short=ITI heavy chain H4; Short=ITI-HC4;
Short=Inter-alpha-inhibitor heavy chain 4; Flags:
Precursor
gi|26340986|dbj|BAC34155.1| unnamed protein product [Mus musculus]
gi|148692825|gb|EDL24772.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_e [Mus
musculus]
Length = 942
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 507
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 508 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 565
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 566 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 608
>gi|2739028|gb|AAC25786.1| PK-120 precursor [Mus musculus]
Length = 942
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 507
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 508 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 565
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 566 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 608
>gi|16741341|gb|AAH16500.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus]
gi|62204734|gb|AAH92258.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus]
Length = 941
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 507
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 508 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 565
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 566 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 608
>gi|63100270|gb|AAH94457.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus]
Length = 942
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 507
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 508 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 565
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 566 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 608
>gi|344276617|ref|XP_003410104.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Loxodonta
africana]
Length = 891
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 159/350 (45%), Gaps = 48/350 (13%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM G+ ++ T+ AL L + D N + F+ + + S+ AT E
Sbjct: 283 KNVVFVIDISGSMTGRKIQQTREALLRILGDVKENDYLNFILFSSDVTTWKDSLVQATPE 342
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
++ A ++ N +GG TNI L +A+ ML +R S I+ ++TDG
Sbjct: 343 NLQEARAFVR-NIHSGGMTNINDGLLRAISMLNKAREEHKIPERSTSIVIMLTDGDANVG 401
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+ ++ + N S +Y G G+ N+ FL +A+ + G Y+ ++Q+Q
Sbjct: 402 ESRPEKIQENVRNAISGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDADLQLQ 461
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRYQGK- 549
+ + +L N+ +D YP I DL+ S ++V+GR +
Sbjct: 462 GFYEEVANPLLTNVEVD----------YPENAILDLTQNSYQHFYDGSEIVVAGRLLDED 511
Query: 550 ---FPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL---T 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 512 LNSFKADVKGHGATNDLT-FTEEVDVKEMEKALKEQEYIFGNYIERLWAYLTIEQLLDKR 570
Query: 594 AQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
A+ E + L + +S++ + T M++ +++++ ++ PG
Sbjct: 571 KNAYGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDSEDQTAIADKPG 620
>gi|405972771|gb|EKC37520.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Crassostrea gigas]
Length = 612
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 165/381 (43%), Gaps = 50/381 (13%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF +Y V + G VL+Q+ + F Y P ++ K++IF++DISG
Sbjct: 184 DFVVAYDVDHQNDGGIVLVQN---------DYFVHYFSP---SGLQILPKNIIFVIDISG 231
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGI 397
SM G ++ + + V L KL D FNI+ F+ L+ S E AT +E A ++
Sbjct: 232 SMGGFKIQKVREVMHVILGKLRDYDYFNILLFDDRITLWKKSPERATASNIEVARRYADE 291
Query: 398 NFIAGGSTNICAPLTKAVEML------TNSRGSIPIIFLVTDG----AVEDERQICDAMK 447
+A GSTNI L AV++L + RG + I+FL TDG + D QI +K
Sbjct: 292 TLVARGSTNINTALIDAVDLLRGLSKREDGRGRV-IVFL-TDGHPTAGIVDPTQIRRNVK 349
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+R N G I+ G G + FL LA + G+ Y+ Q++ + +
Sbjct: 350 TR--NEG--LASIFALGFGYNVDMAFLEALAYENGGFARRIYEESDASDQLETFYAEIST 405
Query: 508 SVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD----TLKAKGFLGD 562
+L ++ + T + ++ + P + L+V+GR + P T+ A+G G
Sbjct: 406 PLLVDVNVKYTPTVIVSDQVTRTVFPQYFQGTELVVAGRLRNNAPSDWQATVTAEGDTGS 465
Query: 563 LSNFVVELKLQLAKDIPLD----------RICAKQQIDLLTAQAWFSEDKRLEEKVSKMS 612
S L + PLD R A +I L E++ + +M+
Sbjct: 466 TSG---RLDFTATPEYPLDGPGVTSDFAERYWAFSRIKELLKMELVEENREQKSIYREMA 522
Query: 613 VQTGVLCEY----TRMIIVET 629
+ + ++ T M++ E+
Sbjct: 523 LNMSLAYQFVTPLTSMVVTES 543
>gi|334338566|ref|XP_003341805.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Monodelphis domestica]
Length = 824
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 157/344 (45%), Gaps = 49/344 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ + T+ AL L L P D FN+V F+G + S+ A+ E
Sbjct: 247 KNVVFVIDKSGSMFGRKITQTREALVKILDDLTPKDQFNLVIFSGSVTQWKPSLLQASAE 306
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEML--TNSRGSIP-----IIFLVTDGAVEDE 439
VE+A ++ N A G+TNI + AV+ML +N R +P ++ L+TDG
Sbjct: 307 HVEKAKKF-ARNIQANGNTNINEAVLVAVKMLDESNRREQLPEESVSLVILLTDGDPTTG 365
Query: 440 RQICDAMKSRLTNGGSICPRIYTF--GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
Q D K + G+I R Y + G G N+ FL LA+ + G Y+ ++Q
Sbjct: 366 EQ--DPKKIQKNVRGAIGGRYYLYCLGFGFDVNYAFLEKLALENGGVARRIYEDSDADLQ 423
Query: 498 MQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---------SPLIVSGRYQG 548
+Q + + +L + EF+ YP +L ++ S L+V+G+ +
Sbjct: 424 LQDFYQEVANPLLTKV---------EFQ-YPDNSVELLTQDNFLVFFRGSELVVAGKLKP 473
Query: 549 KFPDTL--KAKGFLGDLSNFVVELKLQLAKDIP------------LDRICAKQQIDLLTA 594
+ D L K KG NF + ++ + ++R+ A I L
Sbjct: 474 QVEDLLSVKVKG-QSSTQNFTFQTEVNVEDKEKVFQSPKYIFHNFMERLWAYLTIQQLLE 532
Query: 595 QAWFS---EDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA 635
+A + E K LE + +S+ + T M++ + + ++ A
Sbjct: 533 KAVSASAEEKKSLEARALGLSLNFSFVTPLTSMVVTKPEGQDQA 576
>gi|148692822|gb|EDL24769.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Mus
musculus]
Length = 904
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 275 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 334
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 335 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 393
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 394 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 450
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 451 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 509
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 510 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 567
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 568 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 610
>gi|12836422|dbj|BAB23649.1| unnamed protein product [Mus musculus]
Length = 902
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 158/343 (46%), Gaps = 37/343 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L+++A + D E E+ + S ++V+G+ Q + PD L A
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVE-EVTRYKFQHHFKGSEMVVAGKLQDQGPDVLLA 507
Query: 557 K--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQIDLLTAQAWF 598
K G + + N + + +A+ ++R+ A +QQ++ + A
Sbjct: 508 KVSGQM-HMQNITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 565
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 566 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 608
>gi|355696814|gb|AES00466.1| inter-alpha inhibitor H4 [Mustela putorius furo]
Length = 597
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 128/269 (47%), Gaps = 40/269 (14%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM+GK + T+ AL L L P D FN+V+F
Sbjct: 244 FVHYFAP---EGLPTIPKNVIFVIDKSGSMRGKKIHQTREALIKILDDLSPNDQFNLVSF 300
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+G+ + S+ A+ E V +A + GI A G TNI + AV++L ++
Sbjct: 301 SGDVTQWRPSLVPASAENVNQARSYATGIQ--AQGGTNINDAMLMAVQLLDSANQQELLP 358
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG D +I +K + GG +++ G G ++ FL L
Sbjct: 359 EGSVSLIILLTDGDPTMGETDPARIQKNVKEAV--GGQYS--VFSLGFGFDVSYAFLEKL 414
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSS 536
A+ + G Y+ +Q+Q + + +L + + YPS + D+S
Sbjct: 415 ALDNGGLARRIYEDSDSALQLQDFYQEVANPLLTAVTFE----------YPSNAVEDVSQ 464
Query: 537 E--------SPLIVSGRYQGKFPDTLKAK 557
+ S ++V+G+ + + PD L AK
Sbjct: 465 DNFRLLFKGSEIVVAGKIKDQSPDVLSAK 493
>gi|334338564|ref|XP_001380249.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Monodelphis domestica]
Length = 921
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 128/251 (50%), Gaps = 35/251 (13%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ + T+ A+ L L P D FN+V F+G + + ++ A+ +
Sbjct: 270 KNVVFVIDKSGSMAGRKMRQTREAMVQILGDLRPEDQFNLVIFDGHVFQWMPALLQASSQ 329
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVED- 438
VE+A ++ + A G+TNI + AV+ML +S GS+ ++ L+TDG D
Sbjct: 330 NVEQAKKFTSL-ISAMGATNINDAVLLAVKMLDDSNRKEKLPPGSVSMVILLTDGDATDG 388
Query: 439 ---ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
++I + +K+ + GGS +Y G G N+ FL LA+ + G Y+ +
Sbjct: 389 ETNPKKIQENVKAAI--GGSY--HLYCLGFGFDVNYAFLEKLALENGGVARRIYEDSDSD 444
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---------SPLIVSGRY 546
+Q+Q + + +L + EF+ YP +L ++ S L+V+G+
Sbjct: 445 LQLQDFYQEVANPLLTKV---------EFQ-YPDNAVELLTQDNFRVFFKGSELVVAGKL 494
Query: 547 QGKFPDTLKAK 557
+ + ++L A+
Sbjct: 495 KSQLENSLLAE 505
>gi|281338026|gb|EFB13610.1| hypothetical protein PANDA_007565 [Ailuropoda melanoleuca]
Length = 854
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 158/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V+F++D+SGSM G+ +E TK+AL L + D NI+ F+G+ ++ S+ A
Sbjct: 245 VVPKSVVFVIDVSGSMHGRKMEQTKDALLKILDDMKEEDYLNIILFSGDVTIWRDSLVQA 304
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ N G TNI L +A+ ML +R S I+ ++TDG
Sbjct: 305 TPENIQEARTFVK-NIHDQGMTNINDGLMRAINMLNKAREEHRVPERSTSIVIMLTDGDA 363
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 364 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESMALENHGLARRIYEDSDAN 422
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 423 LQLQGFYEEVANPLLTGVEVE----------YPENAIQDLTQNAYQHFYDGSEIVVAGRL 472
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 473 LDEDMNNFKADVKGHGATSDLT-FTEEVDMKEMEKALKEQDYIFGNYIERLWAYLTIEQL 531
Query: 593 TAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
+ A E + L + +S++ + T M++ + +++ ++ PG
Sbjct: 532 LEKRKNAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPG 585
>gi|327265811|ref|XP_003217701.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Anolis carolinensis]
Length = 914
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 160/360 (44%), Gaps = 30/360 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K++IF++D SGSM GK ++ T AL L L+P D FN+V F
Sbjct: 254 FVHYFAPDQMPT---LPKNIIFVIDKSGSMIGKKIQQTIEALQKILEDLNPEDHFNLVVF 310
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------R 422
+GE + +S+ AT+E VE A Q++ +A G T+I L A+ L +
Sbjct: 311 SGEISEWQSSLLKATEENVELAKQYVR-TIMAQGGTDINGALLTAINSLDRATSAELLPE 369
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
SI +I L+TDG +++++ + +Y G G ++ FL LA+ +R
Sbjct: 370 QSISMIVLLTDGQPTVGETNVNSIQTNIKKANDGNYFLYCLGFGFDVSYTFLEKLALENR 429
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G Y+ +Q+Q + + +L I + D E+ + L S ++V
Sbjct: 430 GIARRIYEDSDAALQLQDFYQEVATPILKEIVMK-YPDNAVLEITQNNFKVLFDGSEIVV 488
Query: 543 SGRYQGK---FPDTLKAKGF-----LGDLSNFV-VELKLQLAKDI---PLDRICAKQQID 590
+G+ + P +KA+ L + +N E Q K I ++R+ A I
Sbjct: 489 AGKLSNEVDVLPVEIKAQALSSYLTLKEDANVTEKEQVFQHRKYIFGDFIERLWAYLTIK 548
Query: 591 LLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDE-RNNASESPGTKKG 644
L + A E K LE K +S++ + T M++ E +E N +E+ K G
Sbjct: 549 QLLEKFVSAKEQEQKTLETKALNLSLKFNFVTPLTSMVVTKPEAEEVANKPTEADNEKLG 608
>gi|395516011|ref|XP_003762190.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Sarcophilus harrisii]
Length = 463
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 121/251 (48%), Gaps = 35/251 (13%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ ++ T+ AL LS L P D FN+V F+ E L+ ++ A+ E
Sbjct: 204 KNVVFVIDKSGSMAGRKIKQTREALIKILSDLKPEDQFNLVIFSREVILWKPTLLQASTE 263
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG----A 435
VE A + + A GSTNI AV ML S GS+ ++ L+TDG
Sbjct: 264 HVEEA-KLFASSINADGSTNINEAALVAVRMLDESNRKEQLPAGSVSMVILLTDGDPTTG 322
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++I + +K+ + GG+ +Y G G ++ FL LA+ + G Y+ +
Sbjct: 323 ERNPQKIQENVKAAI--GGNYY--LYCLGFGFDVHYAFLEKLALENGGVARRIYEDSDAD 378
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---------SPLIVSGRY 546
+Q+Q + + +L + YP +L ++ S L+VSG+
Sbjct: 379 LQLQDFYQEVANPLLTMVKF----------QYPQNAVELLTQDNFRVFFQGSELVVSGKL 428
Query: 547 QGKFPDTLKAK 557
Q + PD L +
Sbjct: 429 QPQAPDLLSVQ 439
>gi|331007462|ref|ZP_08330636.1| Uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [gamma proteobacterium IMCC1989]
gi|330418739|gb|EGG93231.1| Uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [gamma proteobacterium IMCC1989]
Length = 699
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 165/371 (44%), Gaps = 27/371 (7%)
Query: 303 DVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD 362
+ +QR F + L PG + D IF++DISGSM GK + L KL D
Sbjct: 272 EANQRGTFMLTLTPGDDLGKVTTGSDWIFVLDISGSMSGK-YATLVEGVRQGLEKLRQQD 330
Query: 363 SFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
F +V FN + ++ ++ V Q + N+ GG TN+ A L K + + R
Sbjct: 331 RFKVVLFNNGSVDLTSGFLTVSQANVANVLQQLD-NYKVGGGTNLYAGLQKGLTGIDADR 389
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
+ I LVTDG V + Q +L N + R++TF +G+ N L+ +A +S
Sbjct: 390 PA--GIILVTDG-VANVGQTEKKTFLKLLNDHDV--RLFTFIMGNSANRPLLKEMADVSN 444
Query: 483 GYYGAAYDLDS----IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
G+ + + D I + KL + F + ++ ID +K + + P +I L
Sbjct: 445 GFAMSVSNADDIVGHIMLATSKLTHQAFRDI--DLDIDGVKVKN---ITPEKISSLYRGE 499
Query: 539 PLIVSGRYQ--GKFPDTLKAK-GFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
L V G Y G L AK G V ++ ++R+ A I+ L +Q
Sbjct: 500 QLTVFGHYWKPGTANVVLTAKIGGNEKTYRTTVNFPDVSHRNPEIERLWAFASIEHLQSQ 559
Query: 596 A-WFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD-------ERNNASESPGTKKGSKK 647
++ DK E+ ++ ++V+ G++ +YT MI+VE +RNN +K ++
Sbjct: 560 LDYYGADKDTEQAIADIAVEYGLVTDYTSMIVVEEAVFQSLNIDRNNQRRVETEQKAREE 619
Query: 648 SDHQKILDSEV 658
Q + ++ V
Sbjct: 620 RSAQPVTNNRV 630
>gi|444513525|gb|ELV10371.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Tupaia chinensis]
Length = 902
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 100/401 (24%), Positives = 174/401 (43%), Gaps = 60/401 (14%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 298 FVHYFAP---EGLPAMPKNVIFVIDTSGSMSGRKIQQTREALIKILDDLKPKDQFNLITF 354
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------R 422
+ E + + A+ + VE+A + + A G TNI + AV++L +S R
Sbjct: 355 SREVTQWEPLLVPASAQNVEKAKSY-AKDIRAIGGTNINDAVLAAVDLLKSSNQKELLPR 413
Query: 423 GSIPIIFLVTDGA-VEDERQICDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
GS+ +I L+TDG E E K+ R G ++ G G + FL LA+
Sbjct: 414 GSVSLIILLTDGEPTEGETNPVKIQKNVREAIGDQF--NLFCLGFGFDVIYPFLEKLALD 471
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--- 537
+ G Y+ +Q+Q + + ++ + + YPS + + ++
Sbjct: 472 NGGLARRIYEDSDSALQLQDFYQEVANPLMTAVVFE----------YPSNVVEAVTQDNF 521
Query: 538 ------SPLIVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP---------- 579
S ++V+G+ Q + PD L AK G L L N + + +A+
Sbjct: 522 RLLFKGSEIVVAGKLQEQNPDVLSAKVNGQL-HLQNITFQAEASVAEQEEAFQGPKYIFH 580
Query: 580 --LDRICA----KQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDE 631
++R+ A +QQ++ L A +E K LE + MS++ + T M++ E +
Sbjct: 581 NFMERLWAYLTIQQQLERLIL-ASDAERKALETQALNMSLKYSFVTPLTSMVVTKPEGQD 639
Query: 632 RNNASESPGTKKGSKKSDHQKILDS----EVPKTIMLQNLG 668
++ E P + + H +L PK + + N G
Sbjct: 640 QSQVLEKPVEDESKHRPSHPAMLAPVPTLSAPKVLTVPNPG 680
>gi|444513527|gb|ELV10373.1| Inter-alpha-trypsin inhibitor heavy chain H1 [Tupaia chinensis]
Length = 911
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 122/251 (48%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DIS SM+G+ ++ TK AL L + PGD F++V F + + S+ A++
Sbjct: 290 KNVVFVIDISSSMEGQKVKQTKEALLKILDDMQPGDYFDLVLFGSQVQSWRGSLVQASQA 349
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ +F GSTN+ L + +E+L ++GS+P I+ ++TDG
Sbjct: 350 NLQAARSFVK-SFSLAGSTNLNGGLLRGIEILNRAQGSLPELSNHASILIMLTDGEPTEG 408
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI DA++ R +Y G G + FL +++M + G Y+
Sbjct: 409 VTDRSQILTNIRDAIRGRFP--------LYNLGFGHNLDFNFLEVMSMENNGRAQRIYED 460
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ---- 547
Q+Q +++ +L ++ + +D + R S ++V+GR
Sbjct: 461 HDATQQLQGFYSQVAKPLLVDVELQYPQDAVS-ALTQHRHKQYYEGSEIVVAGRIADHKL 519
Query: 548 GKFPDTLKAKG 558
G F ++A+G
Sbjct: 520 GSFKADVQARG 530
>gi|301767170|ref|XP_002919035.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Ailuropoda melanoleuca]
Length = 891
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 158/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V+F++D+SGSM G+ +E TK+AL L + D NI+ F+G+ ++ S+ A
Sbjct: 280 VVPKSVVFVIDVSGSMHGRKMEQTKDALLKILDDMKEEDYLNIILFSGDVTIWRDSLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ N G TNI L +A+ ML +R S I+ ++TDG
Sbjct: 340 TPENIQEARTFVK-NIHDQGMTNINDGLMRAINMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVE----------YPENAIQDLTQNAYQHFYDGSEIVVAGRL 507
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 508 LDEDMNNFKADVKGHGATSDLT-FTEEVDMKEMEKALKEQDYIFGNYIERLWAYLTIEQL 566
Query: 593 TAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
+ A E + L + +S++ + T M++ + +++ ++ PG
Sbjct: 567 LEKRKNAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPG 620
>gi|395516901|ref|XP_003762622.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3
[Sarcophilus harrisii]
Length = 887
Score = 92.4 bits (228), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 164/370 (44%), Gaps = 53/370 (14%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P ++ + K+V+F++DISGSM G+ L TK+AL L+ + D N + F
Sbjct: 268 FVHFFAP---QNLPIVPKNVVFVIDISGSMSGRKLAQTKDALLHILNDVKEDDFLNFILF 324
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ + ++ +M AT E ++ A ++ N G TNI L K +EM+ +R
Sbjct: 325 SSDVTVWKENMVQATPENLKEARNFVE-NMQTAGLTNINDGLLKGIEMVNRAREMGILLD 383
Query: 424 -SIPIIFLVTDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S II ++TDG D ++ + N G P +Y G G N+ FL +A+ +
Sbjct: 384 RSTSIIIMLTDGEANTGESRVDKIQENVKNAIGGKFP-LYNLGFGHDVNYNFLEKMALEN 442
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLAN-IAIDTLKDLDEFEMYPSR-IPDLSSE-- 537
G Y+ +QMQ GF +AN + ID DL YP I D++
Sbjct: 443 HGVARRIYEDSDANLQMQ-----GFYDEVANPLLIDV--DL----QYPQNAISDVTQNNY 491
Query: 538 ------SPLIVSGRY----QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI--------- 578
S ++V+G+ F +K G L DL+ F E+ ++ ++
Sbjct: 492 QHFYDGSEIVVAGKVVDLESNSFVADVKGHGALNDLT-FTEEVDMKEMEEALKEQEYIFG 550
Query: 579 -PLDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
++R+ A I+ L + A E + L + +S++ + T M++ + E N
Sbjct: 551 DYIERLWAYLTIEQLLEKRKNAEGEEKENLTAQALDLSLKYHFVTPLTSMVVTKP-EDNE 609
Query: 635 ASESPGTKKG 644
++S K G
Sbjct: 610 ETQSIADKPG 619
>gi|130491439|ref|NP_001076115.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Oryctolagus
cuniculus]
gi|6579183|dbj|BAA88322.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus
cuniculus]
Length = 906
Score = 92.0 bits (227), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 128/264 (48%), Gaps = 24/264 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K ++F++DISGSM+G+ ++ TK AL L + P D F++V F
Sbjct: 272 FAHFFAPQNLKN---MSKSLVFVIDISGSMEGQKVKQTKEALLKILGDIRPEDYFDLVLF 328
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ S+ A++ ++ A ++ F G+TN+ L + +E+L N++G++P
Sbjct: 329 GSRVQSWRGSLVPASEANLQAARDFVQ-RFSLAGATNLNGGLLRGIEILNNAQGNLPAVS 387
Query: 427 ----IIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
I+ ++TDG V D QI ++S + GG +Y+ G G + FL+ L+
Sbjct: 388 KHAAILIMLTDGEPTEGVTDRPQILKNIRSAI--GGRFP--LYSLGFGHDLDFNFLKSLS 443
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
M + G+ Y+ Q+Q + + + +L ++ + +D + R S
Sbjct: 444 MENNGWAQRIYEDHDAAQQLQGFYNQVANPLLVDVELLYPQDA-VVALTQHRHKQYYDGS 502
Query: 539 PLIVSGRYQ----GKFPDTLKAKG 558
++V+GR G F ++A+G
Sbjct: 503 EIVVAGRIADHKLGSFKADVRARG 526
>gi|348503069|ref|XP_003439089.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
protein-like [Oreochromis niloticus]
Length = 1165
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 124/275 (45%), Gaps = 34/275 (12%)
Query: 302 HDVDQREM----------FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNAL 351
+DVD +++ F Y P + V KDVIF++D+S SM G ++ TK A+
Sbjct: 252 YDVDLKDLIGDVQVFDGYFVHYFAP---RGLPVVPKDVIFVIDVSSSMIGNKIKQTKQAM 308
Query: 352 AVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAP 410
+ L L GD FNI+ F+ + + + + AT++ V A +++ IA G TNI A
Sbjct: 309 STILGDLREGDHFNIITFSDKVHTWKKGKTVRATRQNVWEAKEFVK-RIIAEGWTNINAA 367
Query: 411 LTKAVEMLT---------NSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSIC 457
L A +++ S +P++ +TDG V I K L +
Sbjct: 368 LLSAAQLVNPPSSGPSRRTSSHRVPLVIFLTDGEATIGVTSSETILSNAKKALGSAS--- 424
Query: 458 PRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT 517
++ G + L+ LA+ +RG Y+ +Q++ + S +L++I +
Sbjct: 425 --LFGLAFGDDADFLLLKRLALENRGVARMVYEDADAALQLKGFYDEVASPLLSDIQLSY 482
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
L D F++ S P+ S L+V+GR + D
Sbjct: 483 LDD-QAFDVTRSLFPNYFQGSELVVTGRVKTGVKD 516
>gi|123428998|ref|XP_001307617.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121889256|gb|EAX94687.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 642
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 142/301 (47%), Gaps = 17/301 (5%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
FIVD SGSM G + + + + L G F+I+ F + S + K
Sbjct: 218 FIVDCSGSMTGSRILKAIECMRMFIQSLPVGCRFSIIKFGSNFHTILKSCDYTDKNLANA 277
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD-AMKSR 449
IN GG T+I +PL K V + +G + IFL+TDG V+D +IC A K+R
Sbjct: 278 MQLLYTINSDMGG-TDIYSPL-KYVSDIKPKKGFVKQIFLLTDGEVQDSDEICAMAYKNR 335
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
N RI++ G+GS + ++ +A S G Y D D++ ++ ++ S
Sbjct: 336 SNN------RIFSIGLGSGADPGLIKGIARKSGGNYTIIGDNDNLNQKVIEMLCSAISPA 389
Query: 510 LANIAIDTLKDLDEFEMYPSRIPDLSSESP--LIVSGRYQGKFPDTLKAKGFLGDLSNFV 567
L NI I + + EM+PS P L ++P IV + + L + + + + V
Sbjct: 390 LCNITIQSKS--KQTEMWPSPCPPLFPKNPQSFIVKSSFT---ENILISGSLIKEQVDIV 444
Query: 568 VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV 627
V + L ++ L ++ ++ ID Q D ++++K +S+ +GVLC++T I V
Sbjct: 445 VPVS-PLVDNLGLRQLFSRYIIDDYETQLILKNDDKIKKKCIDLSLSSGVLCKFTSYIGV 503
Query: 628 E 628
+
Sbjct: 504 D 504
>gi|26340740|dbj|BAC34032.1| unnamed protein product [Mus musculus]
Length = 941
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 158/343 (46%), Gaps = 38/343 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM GK ++ T+ AL L L P D FN++ F+GE + S+ AT+E
Sbjct: 273 KNVIFVIDKSGSMSGKKIQQTREALVKILKDLSPQDQFNLIEFSGEANQWKQSLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ +A + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNKAVNYAS-RIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVSLIILLTDGDPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E I NG ++ G G N+ FL +A+ + G Y+ +
Sbjct: 392 ETNPTIIQNNVREAINGQY---SLFCLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKD-LDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
Q+Q + + +L+++A + D ++E Y + SE +V+G+ Q + PD L
Sbjct: 449 QLQDFYHEVANPLLSSVAFEYPSDAVEEVTRYKFQHHFKGSE---VVAGKLQDQGPDVLL 505
Query: 556 AK----------GFLGDLSNFVVELKLQLAKDI---PLDRICA----KQQIDLLTAQAWF 598
AK F + S E + + K I ++R+ A +QQ++ + A
Sbjct: 506 AKVSGQMHMQHITFQTEASVAQQEKEFKSPKYIFHNFMERLWALLTIQQQLEQRIS-ASG 564
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 565 AELEALEAQVLNLSLKYNFVTPLTHMVVTKPEGQEQFQVAEKP 607
>gi|348588787|ref|XP_003480146.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Cavia
porcellus]
Length = 882
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 162/360 (45%), Gaps = 40/360 (11%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL LS L+P D FNI+ F
Sbjct: 260 FVHYFAP---EGLPTMPKNVVFVIDKSGSMSGRKIQQTREALIKILSDLNPRDLFNIIVF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+ E + ++ A++E V A + + +GG TNI + AV++L +S
Sbjct: 317 SEEATQWKPALVPASEENVSMAKSYAATIYASGG-TNINDAMLMAVQLLESSNRAELLSA 375
Query: 423 GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
GS+ +I L+TDG + ++I ++ + N S ++ G G ++ FL LA
Sbjct: 376 GSVSLIILLTDGDPTVGETNPKKIQKNVQEAIGNQYS----LFCLGFGFDVSYAFLEKLA 431
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ + G Y+ +Q+Q + + +L N+A + E E+ L S
Sbjct: 432 LDNGGLARRIYEDSDSALQLQDFYQEVANPLLTNVAFQYPSNAVE-EVTQDNFRLLFKGS 490
Query: 539 PLIVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIPL------------DRIC 584
++V+G+ + + D AK G + D+ N E + A+ L +R+
Sbjct: 491 EMVVAGKLRDQRSDVFSAKVSGQM-DMQNVTFETESSTAEQEKLFQSPKYIFHNFMERLW 549
Query: 585 AKQQIDLL---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L + A +E K L+ + +S+ + T M++ E E++ + P
Sbjct: 550 AYLTIQQLLENSVSASDAESKALKSRALNLSLNYNFVTPLTSMVVTKPEGQEQSQVANKP 609
>gi|114570700|ref|YP_757380.1| vault protein inter-alpha-trypsin subunit [Maricaulis maris MCS10]
gi|114341162|gb|ABI66442.1| Vault protein inter-alpha-trypsin domain protein [Maricaulis maris
MCS10]
Length = 740
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/428 (22%), Positives = 178/428 (41%), Gaps = 81/428 (18%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV------------ 219
+PN+FT ++ I G + ++ + + DG F + P P Y+
Sbjct: 163 RPNMFTTSVANIGPGETIIVQFEYQDVARFVDGRFQLTQPLGLTPRYIPDGGDFQMVSTD 222
Query: 220 ----------TPAI--KKIPKREKIHL------NVNAGTGTEVLCNTSSHHLKQLRRDVG 261
TP + + R+++ L +++AG + + L + R D G
Sbjct: 223 SSSVPDASRITPPVMPASLEPRDQLRLPVTITADLDAGYALGEIASLYHATLVERRSD-G 281
Query: 262 KLGYSYESEVLKWSNIDFDFSYTVS-PSHIFGGVLLQSPSLHDVDQREMFCM-YLLP--- 316
S + +N DF ++ + PS + ++ Q E + + +LP
Sbjct: 282 TARISLADGPIP-ANRDFVLTWRAADPSEASAALFIEE------WQGETYLLAQILPPAE 334
Query: 317 -GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
G R+ ++ IF++D SGSM G + + AL AL +L+PGD FN++ F+
Sbjct: 335 LGADTPRRA--RETIFVIDNSGSMGGASMRQARAALITALQRLEPGDRFNVIRFD----- 387
Query: 376 FSTSMELATKEAVERAHQWIGINFI------AGGSTNICAPLTKAVEMLTNSRGS-IPII 428
+ME +AV+ + + A G T + L A+ + S + I
Sbjct: 388 --NTMEQVFPQAVDASPDNVATALTFARRLEAQGGTVMLPALNAALRDTSPDDDSRVRQI 445
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
+TDGA+ +E ++ A+++ L R++ GIGS N YF+ A + RG
Sbjct: 446 VFLTDGAIGNEAELFAAIEAGLGRS-----RLFPVGIGSAPNGYFMSRAARLGRGTSTQI 500
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEF-------EMYPSRIPDLSSESPLI 541
+ +E +M++LFT + + DLD E++P+ +PDL P+
Sbjct: 501 GQVSEVEARMEELFT--------ALERPVMTDLDALFPEGALSEIWPAPLPDLYYGEPVT 552
Query: 542 VSGRYQGK 549
++ R +
Sbjct: 553 LTARLASR 560
>gi|440718499|ref|ZP_20898947.1| von Willebrand factor type A [Rhodopirellula baltica SWK14]
gi|436436151|gb|ELP29919.1| von Willebrand factor type A [Rhodopirellula baltica SWK14]
Length = 764
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 141/315 (44%), Gaps = 17/315 (5%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
++ ++VI ++D SGSM G + + L L+P D F ++AF+ T F +
Sbjct: 339 EITPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTTAFQPNAVS 398
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
AT ++ A Q++ + A G TN+ L A+ + + L+TD V ++ I
Sbjct: 399 ATDANIQSAKQFVRV-LRASGGTNLLPALKLALGGEADESARPRYMVLMTDALVGNDHSI 457
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ R++ G+ N Y + A + RG+ + D+ ++
Sbjct: 458 LRYLRQPEFQDA----RVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFH 513
Query: 503 TRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
+ ++ ID +KD PSR+PDL + PLIV GRY T+ KG
Sbjct: 514 ELTSQPYMTDLQIDWGGLVVKD-----QVPSRLPDLYAGKPLIVLGRYDTPATGTITLKG 568
Query: 559 FL-GDLSNFVVELKL--QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQT 615
+ G +EL+L Q A+ + + A+Q+I + + +E +++++ ++
Sbjct: 569 NVSGQPVQMGMELELPEQEAEHDSIGPVWARQRIRQIWNRDVGAETPAGRAEITELGLKH 628
Query: 616 GVLCEYTRMIIVETD 630
++ +YT I VE +
Sbjct: 629 QLMTQYTSFIAVEKE 643
>gi|338714629|ref|XP_001492399.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Equus
caballus]
Length = 891
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 163/352 (46%), Gaps = 46/352 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ ++ TK+AL L + D N + F+G+ + ++ A
Sbjct: 280 VVPKNVVFVIDVSGSMYGRKIQQTKDALLKILEDVKEDDYLNFILFSGDVTTWKDNLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIP-----IIFLVTDGAV 436
T E +++A +++ +N + G TNI L + + ML +R ++P II ++TDG
Sbjct: 340 TPENIQQAREFV-MNIHSQGMTNINDALLRGISMLNKAREENAVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N +Y G G+ N+ FL +A+ + G Y+ +
Sbjct: 399 NVGESKPEKIQENVHNAIRGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRYQG 548
Q+Q GF +AN + ++ E E + I DL+ S ++V+GR
Sbjct: 459 QLQ-----GFYEEVANPLLTSV----EVEYPENAILDLTQNTYQHFYDGSEIVVAGRLVD 509
Query: 549 K----FPDTLKAKGFLGDLSNFVVELKLQLAKDI----------PLDRICAKQQIDLLTA 594
K F +K G + DL+ F E+ ++ ++ ++R+ A I+ L
Sbjct: 510 KDLNNFKADVKGHGAINDLT-FTEEVDMKEMEEALRERDYIFGNYIERLWAYLTIEQLLE 568
Query: 595 Q---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
+ A E + L + +S++ + T M++ + +++ ++ PG
Sbjct: 569 KRKNAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPG 620
>gi|3024046|sp|P97278.1|ITIH1_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1;
Short=ITI heavy chain H1; Short=ITI-HC1;
Short=Inter-alpha-inhibitor heavy chain 1; Flags:
Precursor
gi|1694688|dbj|BAA13938.1| inter-alpha-trypsin inhibitor heavy chain 1 [Mesocricetus auratus]
Length = 914
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 134/298 (44%), Gaps = 38/298 (12%)
Query: 274 WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIV 333
W N DF +Y V+ + ++ + F + P K+ K+++F++
Sbjct: 254 WLNGDFKVTYDVNRDKLCDLLVANN----------YFAHFFAP---KNLTNMSKNLVFVI 300
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
DISGSM+G+ ++ TK AL L + PGDSF++V F + S+ AT+ ++ A
Sbjct: 301 DISGSMEGQKVKQTKEALLKILGDVKPGDSFDLVLFGSRVQSWKGSLVPATQANLQAAQD 360
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----AVEDERQI 442
++ F G+TN+ L + +E+L ++GS P I+ ++TDG D QI
Sbjct: 361 FVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELSSPASILIMLTDGEPTEGETDRSQI 419
Query: 443 C----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
+A++ R +Y G G + FL +++M + G+ Y+ Q+
Sbjct: 420 LKNVRNAIRGRFP--------LYNLGFGHDLDFNFLEVMSMENSGWAQRIYEDHDATQQL 471
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q + + + +L ++ + +D + R S ++V+GR T KA
Sbjct: 472 QGFYNQVANPLLTDVELQYPQD-SVLSLTQHRHKQYYDGSEIVVAGRIADHKLSTFKA 528
>gi|32476015|ref|NP_869009.1| inter-alpha-trypsin inhibitor domain-containing protein
[Rhodopirellula baltica SH 1]
gi|32446559|emb|CAD76394.1| inter-alpha-trypsin inhibitor family heavy chain-related
protein-hypothetical secreted or membrane-associated
protein containing vWFA domain [Rhodopirellula baltica
SH 1]
Length = 764
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 139/311 (44%), Gaps = 9/311 (2%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
++ ++VI ++D SGSM G + + L L+P D F ++AF+ T F +
Sbjct: 339 EITPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTTAFQPNAVS 398
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
AT ++ A Q++ A G TN+ L A+ + + L+TD V ++ I
Sbjct: 399 ATDANIQSAKQFVR-GLRASGGTNLLPALKLALGGEADESARPRYMILMTDALVGNDHSI 457
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ R++ G+ N Y + A + RG+ + D+ ++
Sbjct: 458 LRYLRQPEFQDA----RVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFH 513
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-G 561
+ ++ ID L + PSR+PDL + PLIV GRY T+ KG + G
Sbjct: 514 ELTSQPYMTDLQID-WGGLGVKDQVPSRLPDLYAGKPLIVLGRYDTPATGTITLKGNVSG 572
Query: 562 DLSNFVVELKL--QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLC 619
+EL+L Q A+ + + A+Q+I + + +E +++++ ++ ++
Sbjct: 573 QPVQMGMELELPEQEAEHDSIGPVWARQRIRQIWNRDVGAETPAGRAEITELGLKHQLMT 632
Query: 620 EYTRMIIVETD 630
+YT I VE +
Sbjct: 633 QYTSFIAVEKE 643
>gi|410951369|ref|XP_003982370.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1
[Felis catus]
Length = 880
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 159/349 (45%), Gaps = 40/349 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + ++ A
Sbjct: 280 VVPKNVVFVIDVSGSMHGRKMEQTKDALLKILEDVKEDDYLNFILFSGDVTTWKDTLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIP-----IIFLVTDGAV 436
T E +E A ++ N G TNI L + + ML +R S+P II ++TDG
Sbjct: 340 TPENIEEARTFVK-NIDDQGMTNINDGLLRGISMLNKAREEHSVPERSTSIIIMLTDGDP 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
+Q+Q + + +L + ++ ++DL + S S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVEYPENAIQDLTQ-----SAYQHFYDGSEIVVAGRLLDEDM 512
Query: 548 GKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLTAQ-- 595
F +K G + DL+ F E+ + ++ K + ++R+ A I+ L +
Sbjct: 513 NNFKADVKGHGAINDLT-FTEEVDMKEMEKALKEQDYIFGNYIERLWAYLTIEQLLEKRK 571
Query: 596 -AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
A E + L + +S++ + T M++ + +++ ++ PG
Sbjct: 572 NAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPG 620
>gi|440790243|gb|ELR11526.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 984
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 152/333 (45%), Gaps = 41/333 (12%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLFSTSMELATKE 386
+++F+VD S SM G+ + K LA+ L L PG FNI+ F T +LF
Sbjct: 345 EMVFLVDRSYSMAGQRIAQVKETLAIFLRSLGPGTFFNIIGFGSRTQHLFKNGSVEYNDT 404
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICDA 445
+E A + + + T+I PL + ++ T + P +F++TDG V + + +
Sbjct: 405 NLEIAAKHVALMNADLDGTDILKPLKEVLQ--TKPKDGYPRQLFILTDGQVANTIECVEF 462
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
++ + R++TFGIGS + ++ LA G+Y + DS+E ++ + R
Sbjct: 463 VRKH-----ADTTRVFTFGIGSEASTELVKGLARAGEGFYEFVHGNDSMEEKVMRQLNRA 517
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR------------YQGKFPDT 553
L N+A+ T K + P R+ SPL GR G+ T
Sbjct: 518 MQPALTNLAL-TWKGF-KVRQSPHRL------SPLFAGGRIVVYAFPQDGEPQTGQIEVT 569
Query: 554 LKAK-GFLGDLSNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSED-------KRL 604
L+AK G +SN ++L D+ + ++ AK + DL +++F K +
Sbjct: 570 LRAKIGNKPFISNTKIDLNDARPGDL-IHKLAAKSTLRDLAECRSYFHNPKGEVVPGKNV 628
Query: 605 EEKVSKMSVQTGVLCEYTRMIIVETDERNNASE 637
++ ++SV+ GV+ + T + VE ER +E
Sbjct: 629 TQEAVRLSVEHGVMSKLTAFVAVE--EREEITE 659
>gi|123446482|ref|XP_001311991.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121893822|gb|EAX99061.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 722
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 148/303 (48%), Gaps = 21/303 (6%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA--TKEAV 388
FIVD SGSM + + + + + L G F+I+ F F T + T E V
Sbjct: 248 FIVDCSGSMSCSRINNAIKCMRLFIQSLPVGCRFSILRFGSH---FETVLPPCDYTDENV 304
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC-DAMK 447
A + G TNI APL + V L S G + IF +TDG V++ IC A+K
Sbjct: 305 ANAMNLLDNISANMGGTNILAPL-QHVSDLQASEGFVKQIFFLTDGEVDNSDIICATALK 363
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+R TN RI++ G+GS + ++ +A S G Y D D++ ++ ++ + S
Sbjct: 364 NRSTN------RIFSIGLGSGADPGLIKGMARKSGGNYAIIGDNDNMNEKVIEMLSSAIS 417
Query: 508 SVLANIAIDTLKDLDEFEMYPSRIPDLSSESP--LIVSGRYQGKFPDTLKAKGFLGDLSN 565
L +I+I + + + EM+PS P L +++P IV + + L + L D +
Sbjct: 418 PALRDISIQS--ESTQTEMWPSPCPPLFAKNPQSFIVKSEFT---ENILLSGTLLHDQVD 472
Query: 566 FVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMI 625
V+ + +L + L ++ ++ I+ +Q D+ +++K +S+ +GVLC+YT +
Sbjct: 473 IVIPVS-RLDNNKGLRQLFSRYIIEDYESQLLLKNDESIKKKCIDLSLASGVLCKYTSYV 531
Query: 626 IVE 628
V+
Sbjct: 532 GVD 534
>gi|66801705|ref|XP_629777.1| hypothetical protein DDB_G0292028 [Dictyostelium discoideum AX4]
gi|74851198|sp|Q54DU5.1|Y2028_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0292028
gi|60463177|gb|EAL61370.1| hypothetical protein DDB_G0292028 [Dictyostelium discoideum AX4]
Length = 932
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 149/319 (46%), Gaps = 25/319 (7%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELAT 384
K + IF++D SGSM GKP+E +K AL + + L+ FNIV F LF TS
Sbjct: 340 KSEFIFVLDCSGSMSGKPIEKSKMALEICMRSLNENSKFNIVCFGSNFNKLFETSKHY-N 398
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP----IIFLVTDGAVEDER 440
E +++A ++I G T + P+ V++L S+ S P +F++TDG + +
Sbjct: 399 DETLQKASEYINRIDANLGGTELLEPI---VDIL--SKESDPEFPRQVFILTDGEISNRD 453
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
++ D + RI+T+GIGSY + + ++ +GYY D +E ++ K
Sbjct: 454 KLIDYVGKEANTT-----RIFTYGIGSYVDKELIVGVSKACKGYYEMIVDNSDMEEKVMK 508
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-----QGKFPDTLK 555
L + L+NI +D L P++I L ++ +++ + + L
Sbjct: 509 LISIAMQPTLSNIKVD-WGVLSNVVQSPAQIRPLFNQERMMIYATLDKEPTEKQVTVILS 567
Query: 556 AKGFLGDLSNFVVELKLQLAKDIP--LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSV 613
G L + +F V+L A + + A + I L ++ E K ++K+ K+
Sbjct: 568 GNGPLSERISFPVDLDFSKANQSTSHIHTLSAFKHIQDL-EESERKEKKDNKDKIVKLGK 626
Query: 614 QTGVLCEYTRMIIVETDER 632
+ G++ ++T I+ ++
Sbjct: 627 RFGLVSKHTSYIVTADSDK 645
>gi|410951371|ref|XP_003982371.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 2
[Felis catus]
Length = 899
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 159/349 (45%), Gaps = 40/349 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + ++ A
Sbjct: 280 VVPKNVVFVIDVSGSMHGRKMEQTKDALLKILEDVKEDDYLNFILFSGDVTTWKDTLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIP-----IIFLVTDGAV 436
T E +E A ++ N G TNI L + + ML +R S+P II ++TDG
Sbjct: 340 TPENIEEARTFVK-NIDDQGMTNINDGLLRGISMLNKAREEHSVPERSTSIIIMLTDGDP 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
+Q+Q + + +L + ++ ++DL + S S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVEYPENAIQDLTQ-----SAYQHFYDGSEIVVAGRLLDEDM 512
Query: 548 GKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLTAQ-- 595
F +K G + DL+ F E+ + ++ K + ++R+ A I+ L +
Sbjct: 513 NNFKADVKGHGAINDLT-FTEEVDMKEMEKALKEQDYIFGNYIERLWAYLTIEQLLEKRK 571
Query: 596 -AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
A E + L + +S++ + T M++ + +++ ++ PG
Sbjct: 572 NAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPG 620
>gi|162450402|ref|YP_001612769.1| hypothetical protein sce2130 [Sorangium cellulosum So ce56]
gi|161160984|emb|CAN92289.1| hypothetical protein sce2130 [Sorangium cellulosum So ce56]
Length = 865
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 139/298 (46%), Gaps = 13/298 (4%)
Query: 336 SGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWI 395
SGSMQG+ L+ K AL + L L GD F ++AF+ + S+ T+ ++ A ++
Sbjct: 300 SGSMQGESLDQAKRALRLCLRHLAEGDRFGVIAFSSDFRALEPSLAPFTQATLKAADAFV 359
Query: 396 GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGS 455
A G T + PL AV ML ++ ++ L+TDG V +E QI D + R G
Sbjct: 360 D-GLRADGGTEMLNPLLAAVGMLGDAE-RDRVVVLLTDGQVGNEAQIVDRVVQR---GKG 414
Query: 456 ICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
+ RIYTFGIG+ + + LA + G + + I+ ++ F R + + +
Sbjct: 415 V--RIYTFGIGTNVSDVLVNDLARRTEGAAEFIHPGERIDEKVTAQFARATAVRVTGLEA 472
Query: 516 DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQL 574
+ +D E+ P+ P L P +V GRY L +G L G+ + V ++L
Sbjct: 473 -RFEGVDVGELAPAERPALVDGEPWVVYGRYGQPGIGRLSLRGTLRGERFHLDVPVELAA 531
Query: 575 AKDIP-LDRICAKQQIDLLTAQAWFSEDKRLE---EKVSKMSVQTGVLCEYTRMIIVE 628
P L+ + A ++ L KR E +++ ++V+ V ++ +++E
Sbjct: 532 EASRPGLEALWAGARVRELEESERGLSGKRAEAMRKRIVDLAVRHRVASKHASFVVIE 589
>gi|345806499|ref|XP_848765.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Canis
lupus familiaris]
Length = 912
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 165/379 (43%), Gaps = 60/379 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM G+ ++ T+ AL L L P D FN+++F
Sbjct: 254 FVHYFAP---EGLPTIPKNVIFVIDKSGSMSGRKIQQTREALIKILDDLKPNDQFNLISF 310
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+G+ + + A+ E V++A ++ N A G TNI + AV +L + S
Sbjct: 311 SGDVTHWKPLLVPASPENVDQAKRY-AANIEAHGGTNINDAMLTAVRLLQSANQKELLSD 369
Query: 423 GSIPIIFLVTDG--------AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG ++ + A+ + + ++ G G ++ FL
Sbjct: 370 GSVSLIILLTDGDPTVGETSPARIQKNVQKAIDGQYS--------LFCLGFGFDVSYVFL 421
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI-------AIDTLKDLDEFEMY 527
LA+ + G Y+ +Q+Q + + +L + A+D + D F +
Sbjct: 422 EKLALDNGGLARRIYEDSDSALQLQDFYQEVANPLLTAVTFEYPDNAVDKVSQ-DNFRL- 479
Query: 528 PSRIPDLSSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKD 577
L S ++V G+ + + PD L AK F + S E + Q K
Sbjct: 480 ------LFKGSEIVVVGKLRDQSPDVLSAKVRGQLHMQNITFQTESSVAEQEKEFQSPKY 533
Query: 578 I---PLDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ET 629
I ++R+ A I L Q A +E + LE + +S+ + T M++ E
Sbjct: 534 IFHSFMERLWAYLTIQQLLEQMVSASGAEKQALETRALNLSLSYSFVTPLTSMVVTKPEG 593
Query: 630 DERNNASESPGTKKGSKKS 648
++R+ +E P + K+
Sbjct: 594 EQRSEVAEKPVETRNKHKN 612
>gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis]
Length = 849
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 126/268 (47%), Gaps = 39/268 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D+SGSM G ++ T+ A+ L LDP D F I+ F+ ++TS+ ATKE
Sbjct: 269 KNVVFVIDMSGSMSGTKMQQTREAMLKILEDLDPEDHFGIILFDHRIQFWNTSLSKATKE 328
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
++ A ++ + G T+I AP+ KAV+ML R SI +I L+TDG
Sbjct: 329 NIDEAMVYVKA-IQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSIDMIILLTDGDPNSG 387
Query: 440 RQICDAMKSRLTN--GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
++ + GG + +++ G G+ + FL +++ + G Y+ +Q
Sbjct: 388 ESRIPVIQENVKAAIGGQMS--LFSLGFGNDVKYPFLDVMSRENNGLARRIYEGSDAALQ 445
Query: 498 MQKLFTRGFSSVLANI-------AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF 550
+Q + S +L ++ A+D+L ++ L + S ++V+GR +
Sbjct: 446 LQGFYDEVSSPLLLDVDLRYPDNAVDSLT--------TNQFSQLFNGSEIVVAGRLKDN- 496
Query: 551 PDTLKAKGFLGDLSNFVVELKLQLAKDI 578
D+ NF VE+ Q D
Sbjct: 497 -----------DIDNFPVEVSGQGLNDF 513
>gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1039
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 126/268 (47%), Gaps = 39/268 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D+SGSM G ++ T+ A+ L LDP D F I+ F+ ++TS+ ATKE
Sbjct: 433 KNVVFVIDMSGSMSGTKMQQTREAMLKILEDLDPEDHFGIILFDHRIQFWNTSLSKATKE 492
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
++ A ++ GG T+I AP+ KAV+ML R SI +I L+TDG
Sbjct: 493 NIDEAMVYVKAIQSYGG-TDINAPVLKAVDMLKEDRKAKRLPEKSIDMIILLTDGDPNSG 551
Query: 440 RQICDAMKSRLTN--GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
++ + GG + +++ G G+ + FL +++ + G Y+ +Q
Sbjct: 552 ESRIPVIQENVKAAIGGQMS--LFSLGFGNDVKYPFLDVMSRENNGLARRIYEGSDAALQ 609
Query: 498 MQKLFTRGFSSVLANI-------AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF 550
+Q + S +L ++ A+D+L ++ L + S ++V+GR +
Sbjct: 610 LQGFYDEVSSPLLLDVDLRYPDNAVDSLT--------TNQFSQLFNGSEIVVAGRLKDN- 660
Query: 551 PDTLKAKGFLGDLSNFVVELKLQLAKDI 578
D+ NF VE+ Q D
Sbjct: 661 -----------DIDNFPVEVSGQGLNDF 677
>gi|440905494|gb|ELR55870.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Bos grunniens mutus]
Length = 891
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 155/347 (44%), Gaps = 47/347 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + S+ A
Sbjct: 280 VVPKSVVFVIDVSGSMHGRKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A +++ ++ G TNI L + + ML +R S II ++TDG
Sbjct: 340 TPENIQEASKFV-MDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVE----------YPQNAILDLTQNSYQHFYDGSEIVVAGRL 507
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G + DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 508 ADEDMNSFKAAVKGHGAINDLT-FTEEVDMKEMEKALQERDYIFGDYIERLWAYLTIEQL 566
Query: 593 ---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS 636
A E + L + ++S++ + T M++ + ++ N +
Sbjct: 567 LDKRKNAQGEEKENLTAQALELSLKYHFVTPLTSMVVTKPEDNENQT 613
>gi|421612223|ref|ZP_16053337.1| von Willebrand factor type A [Rhodopirellula baltica SH28]
gi|408497000|gb|EKK01545.1| von Willebrand factor type A [Rhodopirellula baltica SH28]
Length = 764
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 140/315 (44%), Gaps = 17/315 (5%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
+V ++VI ++D SGSM G + + L L+P D F ++AF+ T F +
Sbjct: 339 EVTPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTTAFQPNAVS 398
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
AT ++ A Q++ A G TN+ L A+ + + L+TD V ++ I
Sbjct: 399 ATDANIQSAKQFVR-GLRASGGTNLSPALKLALGGEADESARPRYMILMTDALVGNDHSI 457
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ R++ G+ N Y + A + RG+ + D+ ++
Sbjct: 458 LRYLRQPEFQDA----RVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFH 513
Query: 503 TRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
+ ++ ID +KD PSR+PDL + PLIV GRY T+ KG
Sbjct: 514 ELTSQPYMTDLQIDWGGLVVKD-----QVPSRLPDLYAGKPLIVLGRYDTPATGTITLKG 568
Query: 559 FL-GDLSNFVVELKL--QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQT 615
+ G +EL+L Q ++ + + A+Q+I + + +E +++++ ++
Sbjct: 569 NVSGQPVQMGMELELPEQESEHDSIGPVWARQRIRQIWNRNVGAETPAGRAEITELGLKH 628
Query: 616 GVLCEYTRMIIVETD 630
++ +YT I VE +
Sbjct: 629 QLMTQYTSFIAVEKE 643
>gi|118618215|ref|YP_906547.1| hypothetical protein MUL_2774 [Mycobacterium ulcerans Agy99]
gi|118570325|gb|ABL05076.1| conserved membrane protein [Mycobacterium ulcerans Agy99]
Length = 981
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 119/548 (21%), Positives = 225/548 (41%), Gaps = 75/548 (13%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLI-------ALGENDGAGKSASVETGSFLKPNIFTLT 180
P+ D+ ++ + + G+ +L A E +G+SA + +P++FT+
Sbjct: 72 PLPDRAAVTRMRVSVDGRDVEAELREREAARRAYDEAIASGRSARILEED--RPDVFTIR 129
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGT 240
+ I G +++ L L+Y DGE + F+FP V P + IP + V G
Sbjct: 130 VGNIVPGQRVTVALTIINPLTYEDGEAT----FRFPLVVAP--RYIPGEPLADIAVGDGY 183
Query: 241 G----------------------------TEVLCNTSSHHLKQLRRDVGKL-GYSYESEV 271
+V + + +L Q+R ++ + G +V
Sbjct: 184 ADDTDAVPDASRITPPMLVAGFAHSVPVTIDVGIDPAGFNLSQVRSNMPAVTGADGRIQV 243
Query: 272 LKWS--NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE-MFCMYLLP--GTAKSRKVFK 326
+ N DF + ++L + D + E + + +LP G A R
Sbjct: 244 TPGAPANRDFVLRLDCDAQELASSLVL----VPDAEGDEGTYELTVLPPAGAATPRP--- 296
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+ ++ ++D S SM G + + A + + L D F ++ F+ + + + E
Sbjct: 297 RHLVLVLDRSRSMAGWKMTAARRAASRIVDALTSDDRFAVLTFD-DGIEYPVGLPAGLTE 355
Query: 387 AVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSR---GSIPIIFLVTDGAVEDE 439
A +R H++ + +A G T + APL +A+ +L + ++ L++DG V +E
Sbjct: 356 ASDR-HRYRAVEHLARVEARGDTEMLAPLRRALALLGREQVADTDDAVLILISDGQVGNE 414
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
Q+ + L G + R++T G+ N FLR LA + G + D ++ +
Sbjct: 415 DQLLQELSGDL---GRV--RLHTIGVDEAVNAGFLRRLAGVGGGRCVLVDNEDRLDEALP 469
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFE--MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
+ R + LA E M P+R+PDL PL+VSGRY+G P +L +
Sbjct: 470 GVIQRCARTPLATGLEVRAAGFTAIEDTMSPARLPDLLPGVPLVVSGRYRGSAPGSLTVR 529
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSEDKR--LEEKVSKMSVQ 614
G +++ + + + + + + DL A D R + ++ S++
Sbjct: 530 GATEAGADWSITVAAERCAVPAVTAQWVRAHLRDLEDGYAAAPADTRDDVATRIVDTSLR 589
Query: 615 TGVLCEYT 622
GVLC +T
Sbjct: 590 FGVLCRFT 597
>gi|334338570|ref|XP_001380264.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Monodelphis domestica]
Length = 919
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 167/375 (44%), Gaps = 53/375 (14%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P ++ V K+V+F++D+SGSM G+ L TK AL LS + D N + F
Sbjct: 283 FVHFFAP---QNLPVVPKNVVFVIDVSGSMSGRKLVQTKEALLKILSDVKKDDFLNFILF 339
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ + + ++ AT E ++ A +++ A G TNI L + +EM+ +R
Sbjct: 340 SSDVRTWKENLVPATPENLKAAEEFVH-QIQATGGTNINDGLLRGIEMVNKAREMGTVLD 398
Query: 424 -SIPIIFLVTDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S II ++TDG + ++ + N G P +Y G G N+ FL +A+ +
Sbjct: 399 RSTSIIIMLTDGEANVGESRVEKIQENVRNAIGGKYP-LYNLGFGYDVNYNFLERMALEN 457
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLAN-IAIDTLKDLDEFEMYPSR-IPDLSSE-- 537
G Y+ +QMQ GF +AN + ID DL YP I D++
Sbjct: 458 NGVARRIYEDSDANLQMQ-----GFYDEVANPLLIDV--DL----QYPQNAISDITQNNY 506
Query: 538 ------SPLIVSGRY----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP------- 579
S ++V+G+ F +K G L DL+ F E+ L ++ K +
Sbjct: 507 RHFYEGSEIVVAGKLVDFDTNSFIADVKGHGALNDLT-FTEEVDLKEMEKALEEQEYIFG 565
Query: 580 --LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
++R+ A I+ L + A E + L + +S++ + T M++ + E N
Sbjct: 566 DYIERLWAYLTIEQLLEKRKNAQGEEKENLTAQALDLSLKYHFVTPLTSMVVTKP-EDNE 624
Query: 635 ASESPGTKKGSKKSD 649
+E+ K G K+D
Sbjct: 625 EAEAIADKPGEVKAD 639
>gi|417305158|ref|ZP_12092139.1| von Willebrand factor type A domain-containing protein
[Rhodopirellula baltica WH47]
gi|327538509|gb|EGF25172.1| von Willebrand factor type A domain-containing protein
[Rhodopirellula baltica WH47]
Length = 764
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 139/315 (44%), Gaps = 17/315 (5%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
++ ++VI ++D SGSM G + + L L+P D F ++AF+ T F
Sbjct: 339 EITPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTTAFQPDAIA 398
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
AT ++ A Q++ A G TN+ L A+ + + L+TD V ++ I
Sbjct: 399 ATDANIQSAKQFVR-GLRASGGTNLLPALKLALGGEADESARPRYMVLMTDALVGNDHSI 457
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ R++ G+ N Y + A + RG+ + D+ ++
Sbjct: 458 LRYLRQPEFQDA----RVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFH 513
Query: 503 TRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
+ ++ ID +KD PSR+PDL + PLIV GRY T+ KG
Sbjct: 514 ELTSQPYMTDLQIDWGGLVVKD-----QVPSRLPDLYAGKPLIVLGRYDTPATGTITLKG 568
Query: 559 FL-GDLSNFVVELKL--QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQT 615
+ G +EL+L Q A+ + + A+Q+I + + +E +++++ ++
Sbjct: 569 NVSGQPVQMGMELELPEQEAEHDSIGPVWARQRIRQIWNRDVGAETPAGRAEITELGLKH 628
Query: 616 GVLCEYTRMIIVETD 630
++ +YT I VE +
Sbjct: 629 QLMTQYTSFIAVEKE 643
>gi|410951373|ref|XP_003982372.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 3
[Felis catus]
Length = 699
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 120/257 (46%), Gaps = 23/257 (8%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + ++ A
Sbjct: 280 VVPKNVVFVIDVSGSMHGRKMEQTKDALLKILEDVKEDDYLNFILFSGDVTTWKDTLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIP-----IIFLVTDGAV 436
T E +E A ++ N G TNI L + + ML +R S+P II ++TDG
Sbjct: 340 TPENIEEARTFVK-NIDDQGMTNINDGLLRGISMLNKAREEHSVPERSTSIIIMLTDGDP 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
+Q+Q + + +L + ++ ++DL + S S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVEYPENAIQDLTQ-----SAYQHFYDGSEIVVAGRLLDEDM 512
Query: 548 GKFPDTLKAKGFLGDLS 564
F +K G + DL+
Sbjct: 513 NNFKADVKGHGAINDLT 529
>gi|329744607|ref|NP_001193278.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Sus scrofa]
Length = 889
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/352 (23%), Positives = 161/352 (45%), Gaps = 46/352 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ +E T++AL L + D N V F+G+ + S+ A
Sbjct: 280 VVPKNVVFVIDVSGSMYGRKMEQTRDALLKILDDIKEDDYLNFVLFSGDVTTWKDSLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDG-- 434
T E +++A +++ N G TNI L + ML +R S II ++TDG
Sbjct: 340 TPENIQKAREFVR-NIRDQGMTNINDGLLTGISMLNKAREEHKVPERSTSIIIMLTDGDA 398
Query: 435 --AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
V +I + +++ + GG +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NMGVSKPEKIQENVRNAI--GGKFP--LYNLGFGNNLNYNFLESMALENHGLARRIYEDS 454
Query: 493 SIEIQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGRY-- 546
+Q+Q + + +L ++ + + ++DL + + S ++V+GR
Sbjct: 455 DANLQLQGFYEEVANPLLTSVEVGYPENAIQDLTQ-----NTYQHFYDGSEIVVAGRLAD 509
Query: 547 --QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI----------PLDRICAKQQIDLLTA 594
F +K G + DL+ F E+ ++ ++ ++R+ A I+ L
Sbjct: 510 EDMNSFRADVKGHGAINDLT-FTEEVDMKEMEEALKERDYIFGNYIERLWAYLTIEQLLE 568
Query: 595 Q---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS---ESPG 640
+ A E + L + +S++ + T M++ + ++ N + + PG
Sbjct: 569 KRKNAQGKEKETLTAQALDLSLKYHFVTPLTSMVVTKPEDNENQTAIADKPG 620
>gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens]
Length = 637
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 167/371 (45%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEERLP 373
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ G S ++ G G ++ FL L
Sbjct: 374 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGGYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 MEGESRNRNVH 619
>gi|269468735|gb|EEZ80354.1| hypothetical protein Sup05_1046 [uncultured SUP05 cluster
bacterium]
Length = 682
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 97/397 (24%), Positives = 177/397 (44%), Gaps = 64/397 (16%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV 219
AGK S+ +PNIFT + I G ++I + + Q + + +FS+ P E
Sbjct: 144 AGKKVSLVEQQ--RPNIFTTKVANIGPGETITIAIEYQQAVRIDNDQFSIRFPMVVGERY 201
Query: 220 TPAIKKIPKREK------------------------------IHLNVNAGTGTEVLCNTS 249
P KKI + I +N+ AG T+ + +
Sbjct: 202 IPG-KKINTQPNALGNKANTHRVKDASKIAPPTDTNADRPVAISINLKAGFDTDSII-SP 259
Query: 250 SHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREM 309
H + + D S ++ +N DF+ ++ + +P+L Q++
Sbjct: 260 YHQISIVETDKLTKHISLKN---TQANRDFELTWQAHKT--------LTPTLALFTQQKG 308
Query: 310 FCMYL-LPGTAKSRKVFK-----KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
YL L T + +VFK ++VIFI+D SGSM G +E NAL A+++L P D
Sbjct: 309 DDHYLMLMATPPADEVFKQSHTPREVIFIIDSSGSMMGSSMEQATNALIQAINRLKPTDR 368
Query: 364 FNIVAFNGE-TYLFST---SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
FNI+ F+ + LF T ++++ + + A + +GG+ + A +
Sbjct: 369 FNIIDFDSDFEVLFDTAIPAIDMNKRHGIRFAKHLVA----SGGTEPLEAIKFALLSKDE 424
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+S + + +TDG V +E+++ A++ + + R +T GIGS N Y + +A
Sbjct: 425 DSDKYLRQVIFLTDGQVGNEKELFRAVQQNIDDD-----RFFTIGIGSAPNDYLMTKMAE 479
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+G + D+D +E++M +LF++ S + +I I+
Sbjct: 480 YGKGAFTYIGDIDEVEVKMGELFSKLESPAMTDININ 516
>gi|354465847|ref|XP_003495388.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Cricetulus griseus]
Length = 886
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 167/359 (46%), Gaps = 64/359 (17%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P +S K+VIF++D SGSM G+ ++ T+ AL L+ L P D FN++ F
Sbjct: 259 FVHHFAP---ESLPTMPKNVIFVIDKSGSMSGRKIQQTREALIKILNDLSPKDHFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------R 422
+ + ++ S+ A++E ++RA ++ A G TNI + AV+ML +S +
Sbjct: 316 DWKATQWTQSLMEASEENLKRATEYAS-KIHAHGGTNINDAMLLAVKMLDSSNQAELLPK 374
Query: 423 GSIPIIFLVTDGAVED--------ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + +R + +A+ ++ + ++ G G ++ FL
Sbjct: 375 GSVSLIILLTDGEPTEGETNPANIQRNVQEAINNQYS--------LFCLGFGFDVHYSFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA + G Y+ +Q+Q + + +L ++A + YPS ++
Sbjct: 427 EKLAQDNGGLARRIYEDSDSALQLQDFYHEVANPLLLSVAFE----------YPSDAVEV 476
Query: 535 SSE---------SPLIVSGRYQGKFPDTLKAKGFLGDL--SNFVVELKLQLA-KDIP--- 579
++ S ++V+G+ Q + D L AK G + N + + +A KD
Sbjct: 477 VTQDNFRHHFKGSEMVVAGKLQDQGLDVLSAK-ISGQMHKQNITFQTEASIAQKDFQSPK 535
Query: 580 ------LDRICA----KQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
++R+ A QQ++ + + A E K LE + +S++ + T M++ +
Sbjct: 536 YIFHNFMERLWALLTIHQQLEQMVS-ASGDELKALESQALNLSLKYNFVTPLTHMVVTK 593
>gi|431899874|gb|ELK07821.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Pteropus alecto]
Length = 858
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 157/354 (44%), Gaps = 48/354 (13%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM G+ +E TK+AL L + D N + F+G+ + ++ AT E
Sbjct: 276 KNVVFVIDISGSMHGRKMEQTKDALLKILDDVKEEDYLNFILFSGDVITWKDNLVQATPE 335
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
++ A ++ N G TNI L + + ML +R S II ++TDG
Sbjct: 336 NIQEARTFVR-NIQDQGMTNINDGLLRGISMLNKAREEHRVPERSTSIIIMLTDGDANTG 394
Query: 440 RQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
++ + N G P +Y G G N FL +A+ + G Y+ +Q+
Sbjct: 395 ESRPPKIQENVRNAIGGKFP-LYNLGFGDNLNFNFLESMALENHGIARRIYEDSDANLQL 453
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRYQGK- 549
Q GF +AN + ++ E E + I DL+ S ++V+GR K
Sbjct: 454 Q-----GFYEEVANPLLTSV----EVEYPENAILDLTKNTYQHFYDGSEIVVAGRLVDKD 504
Query: 550 ---FPDTLKAKGFLGDLSNFVVELK-LQLAKDIP---------LDRICAKQQIDLLTAQ- 595
F +K G + DL+ F E+ +++ K + ++R+ A I+ L +
Sbjct: 505 MNNFKADVKGHGAINDLT-FTEEVDMMEMEKALQKQDYIFGNYIERLWAYLTIEQLLEKR 563
Query: 596 --AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNA---SESPGTKKG 644
A E + L + +S++ + T M++ + ++ N ++ PG G
Sbjct: 564 KNALGDEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNENQMAIADKPGEVDG 617
>gi|351710074|gb|EHB12993.1| Inter-alpha-trypsin inhibitor heavy chain H1 [Heterocephalus
glaber]
Length = 912
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 122/252 (48%), Gaps = 29/252 (11%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+K+++F++DISGSM+G+ ++ TK AL L + P D F++V F + S+ +K
Sbjct: 289 RKNLVFVIDISGSMEGQKVKQTKEALLKILGDMQPEDYFDLVLFGSHVQSWRGSLVQVSK 348
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG---- 434
++ A ++ F G+TN+ L + +E+LT ++ S+P I+ ++TDG
Sbjct: 349 ANLQAAQDFVR-RFSLNGATNLNGGLLRGIEILTKAQESLPELSNHASILIMLTDGEPTE 407
Query: 435 AVED----ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
V D +R I +A++ R +Y G G + FL +++M + G+ Y+
Sbjct: 408 GVTDRAQIQRNIRNAIRGRFP--------LYNLGFGHDLDFNFLEVMSMENNGWAQRIYE 459
Query: 491 LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ--- 547
Q+Q + + + +L ++ + +D + R S ++V+GR
Sbjct: 460 DHDATQQLQGFYKQVANPLLVDVDLQYPQDA-VLALTQHRHKQYYDGSEIVVAGRIADHK 518
Query: 548 -GKFPDTLKAKG 558
G F +KA+G
Sbjct: 519 LGSFKADVKARG 530
>gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3
[Pan troglodytes]
Length = 900
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 167/371 (45%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + +GI + G TNI + AV++L +S
Sbjct: 316 STEASQWRPSLVPASAENVNKARSFAVGIQALGG--TNINDAMLMAVQLLDSSNQEEQLP 373
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL L
Sbjct: 374 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFRNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 MEGESRNRNVH 619
>gi|344243840|gb|EGV99943.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Cricetulus griseus]
Length = 799
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 84/342 (24%), Positives = 162/342 (47%), Gaps = 61/342 (17%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIF++D SGSM G+ ++ T+ AL L+ L P D FN++ F+ + ++ S+ A++E
Sbjct: 273 KNVIFVIDKSGSMSGRKIQQTREALIKILNDLSPKDHFNLIVFDWKATQWTQSLMEASEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDGAVED- 438
++RA ++ A G TNI + AV+ML +S +GS+ +I L+TDG +
Sbjct: 333 NLKRATEYAS-KIHAHGGTNINDAMLLAVKMLDSSNQAELLPKGSVSLIILLTDGEPTEG 391
Query: 439 -------ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
+R + +A+ ++ + ++ G G ++ FL LA + G Y+
Sbjct: 392 ETNPANIQRNVQEAINNQYS--------LFCLGFGFDVHYSFLEKLAQDNGGLARRIYED 443
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---------SPLIV 542
+Q+Q + + +L ++A + YPS ++ ++ S ++V
Sbjct: 444 SDSALQLQDFYHEVANPLLLSVAFE----------YPSDAVEVVTQDNFRHHFKGSEMVV 493
Query: 543 SGRYQGKFPDTLKAKGFLGDL--SNFVVELKLQLA-KDIP---------LDRICA----K 586
+G+ Q + D L AK G + N + + +A KD ++R+ A
Sbjct: 494 AGKLQDQGLDVLSAK-ISGQMHKQNITFQTEASIAQKDFQSPKYIFHNFMERLWALLTIH 552
Query: 587 QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
QQ++ + + A E K LE + +S++ + T M++ +
Sbjct: 553 QQLEQMVS-ASGDELKALESQALNLSLKYNFVTPLTHMVVTK 593
>gi|313243983|emb|CBY14858.1| unnamed protein product [Oikopleura dioica]
gi|313245509|emb|CBY40220.1| unnamed protein product [Oikopleura dioica]
Length = 1393
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 97/196 (49%), Gaps = 10/196 (5%)
Query: 314 LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
+L A+S +V+F++D SGSM G L+ TK+A +S LD F+IV FN T
Sbjct: 774 MLKSNAQSSSSAPYNVVFVMDKSGSMIGTKLDQTKDAFRSMISSLDRNAKFSIVGFNYAT 833
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML-TNSRGSIP-IIFLV 431
+ + AT VE A +I AGG TN+ A L A+E+ + S ++P +I +
Sbjct: 834 TAWRNKLVRATNYNVEEARSFIS-RISAGGGTNMHAALLDAIELCNSESSSTVPCMIMFM 892
Query: 432 TDGA-----VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
TDG E+ R + D KSR +I + FG G ++ FL L++++ G
Sbjct: 893 TDGTATVGVTEESRILADVTKSRQQGKANIALNVIGFGAG--ISYSFLSRLSVLNSGIAR 950
Query: 487 AAYDLDSIEIQMQKLF 502
++ + QMQ F
Sbjct: 951 QIFEDTNAAFQMQGFF 966
>gi|358456027|ref|ZP_09166252.1| Vault protein inter-alpha-trypsin domain-containing protein
[Frankia sp. CN3]
gi|357080678|gb|EHI90112.1| Vault protein inter-alpha-trypsin domain-containing protein
[Frankia sp. CN3]
Length = 907
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 130/581 (22%), Positives = 236/581 (40%), Gaps = 100/581 (17%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGEN----DGA---GKSASVETGSFLKPNIFTLT 180
P+ + ++ + E G+ +L G+ D A G+ AS+ +P +FT+
Sbjct: 85 PLPPRAAVTALRMEADGRVVEAELKERGQARADYDTAITQGRRASIAEEE--RPGVFTMR 142
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV-------------------- 219
+ I G +++RL + +LSY DG + P P Y+
Sbjct: 143 VGNILPGERVTVRLTLAGELSYEDGAATFRFPLVVAPRYIPGAPLAGEQVGDGVTADTDA 202
Query: 220 TPAIKKI-----------PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVG-----KL 263
P +I P+R + ++++ + S H + RR G +
Sbjct: 203 VPDASRISPPVLLPGFPNPERLSVAVDIDPAGLPLASVDASLHTVITERRTDGLRVRLEP 262
Query: 264 GYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQRE--------------- 308
G + + L D D + + S S VL P+ +
Sbjct: 263 GDRADRDFLLRLTFDADEAVSTSLS-----VLPDEPAADGATTADGETATVGAGGAAGGG 317
Query: 309 MFCMYLLP--GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
F + +LP GT ++R +DV+ ++D SGSM G + + A + L GD F +
Sbjct: 318 TFALTILPPAGTTRARD---RDVVIVLDRSGSMGGWKMVAARRAAGRIVDTLRGGDRFAV 374
Query: 367 VAFNGET---YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG 423
+AF+ + + A+ A RA +++ A G T + PL +A +L R
Sbjct: 375 LAFDNRVDTPPALPSGLAPASDRARFRAVEFLA-TLEARGGTEMLEPLRQAAALL-RPRS 432
Query: 424 SIP----IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
P ++ LVTDG V +E Q+ + + LT R++T GI N FLR L+
Sbjct: 433 ETPERDRVVVLVTDGQVGNEDQLLRELATGLTGV-----RVHTVGIDRAVNEAFLRRLSG 487
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI------DTLKDLDEFEMYPSRIPD 533
+ G D ++ M+ + R + V+ + + D D D + P+ +PD
Sbjct: 488 PA-GRCELVESEDRLDDAMRHIHRRIDAPVVTGLRLLPAGPADLAVDAD--SLSPAPLPD 544
Query: 534 LSSESPLIVSGRYQGKFPDTLKAKGFLGD-LSNFVVELKLQLAKDIPLDRICAKQQI-DL 591
L + +P++++GR+ G T+ G D S + + + L + A+ +I DL
Sbjct: 545 LFTGAPVVITGRFTGPPVGTVTVAGQAADGGSPWQTAAAAAASGNPALTALWARARIRDL 604
Query: 592 LTAQAWFSEDKR----LEEKVSKMSVQTGVLCEYTRMIIVE 628
A ++R LE+++ + S++ VL +T + V+
Sbjct: 605 EDRYAADQAERRGHEALEKRIVETSLRYRVLSRFTAFVAVD 645
>gi|110331845|gb|ABG67028.1| inter-alpha (globulin) inhibitor H3 [Bos taurus]
Length = 889
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 155/347 (44%), Gaps = 47/347 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + S+ A
Sbjct: 278 VVPKSVVFVIDVSGSMHGRKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPA 337
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A +++ ++ G TNI L + + ML +R S II ++TDG
Sbjct: 338 TPENIQEASKFV-MDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTSIIIMLTDGDA 396
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 397 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGLARRIYEDSDAN 455
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 456 LQLQGFYEEVANPLLTGVEVE----------YPQNAILDLTQNSYQHFYDGSEIVVAGRL 505
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G + DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 506 ADEDMNSFKAAVKGHGAINDLT-FTKEVDMKEMEKALQERDYIFGDYIERLWAYLTIEQL 564
Query: 593 ---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS 636
A E + L + ++S++ + T M++ + ++ N +
Sbjct: 565 LDKRKNAQGEEKEILTAQALELSLKYHFVTPLTSMVVTKPEDNENQT 611
>gi|410928947|ref|XP_003977861.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1-like [Takifugu rubripes]
Length = 1216
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 109/211 (51%), Gaps = 11/211 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L ++ RKV + IF++D SGSM G + K+A+ V L L PG FNIV F
Sbjct: 325 FCPDLKSVSSDLRKVHG-EFIFLIDRSGSMSGVNINRVKDAMVVMLKSLMPGCFFNIVGF 383
Query: 370 NGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
G TY LFSTS +EA+ A +++ G TNI PL+ + RG +
Sbjct: 384 -GSTYKSLFSTSQSY-DEEALTLACEYVRKVRADMGGTNILDPLSWILRQ-PMIRGHPRL 440
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
+FL+TDGA+ + ++ +L G + R +TF IG + LA +S+G
Sbjct: 441 LFLLTDGAISNTGRVI-----QLVRGHARYIRCFTFSIGQTACRRLVEALATVSKGTAEF 495
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
D + ++ +M K + S+VL++I+I+ L
Sbjct: 496 LADGERLQPKMLKSLKKTMSAVLSDISIEWL 526
>gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4
[Pan troglodytes]
Length = 930
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 167/371 (45%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + +GI + G TNI + AV++L +S
Sbjct: 316 STEASQWRPSLVPASAENVNKARSFAVGIQALGG--TNINDAMLMAVQLLDSSNQEEQLP 373
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL L
Sbjct: 374 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFRNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 MEGESRNRNVH 619
>gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca]
Length = 901
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 165/365 (45%), Gaps = 50/365 (13%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM G+ ++ T+ AL L L P D FN+++F
Sbjct: 228 FVHYFAP---EGLPTIPKNVIFVIDKSGSMSGRKMQQTREALIKILDDLSPKDQFNLISF 284
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+G+ + + A+ E V +A + GI A G T+I + AV++L +++
Sbjct: 285 SGDAAQWKPLLVPASAENVNQARSYAAGIQ--AHGGTDINEAVLMAVQLLNSAKQKELMP 342
Query: 423 -GSIPIIFLVTDG--------AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
G++ +I L+TDG +R + +A+ + + ++ G G ++ F
Sbjct: 343 EGTVSLIILLTDGDPTMGETNPARIQRNVKEAIDGQYS--------LFCLGFGFDVSYAF 394
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
L LA+ + G Y+ +Q+Q + + +L + + ++ E E+
Sbjct: 395 LEKLALDNGGLARRIYEDSDSALQLQDFYEEVANPLLTAVTFEYPSNVVE-EVSQDNFRL 453
Query: 534 LSSESPLIVSGRYQGKFPDTLKAK--GFLGDLSNFVV---------ELKLQLAKDI---P 579
L S ++V+G+ + + PD L AK G L + N E + Q K I
Sbjct: 454 LFKGSEIVVAGKLRDQSPDVLSAKVRGQL-HMQNITFQTESRVTEQEKEFQSPKYIFHSF 512
Query: 580 LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNN 634
++R+ A I L Q A +E + LE + +S+ + T M++ + D ER+
Sbjct: 513 MERLWAYLTIQQLLEQMVSASATEKQALETRALNLSLHYCFVTPLTSMVVTKPDGRERSE 572
Query: 635 ASESP 639
+E P
Sbjct: 573 VAEKP 577
>gi|156120445|ref|NP_001095368.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus]
gi|160332333|sp|P56652.2|ITIH3_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; Flags:
Precursor
gi|154425846|gb|AAI51420.1| ITIH3 protein [Bos taurus]
gi|296474789|tpg|DAA16904.1| TPA: inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos
taurus]
Length = 891
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 155/347 (44%), Gaps = 47/347 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + S+ A
Sbjct: 280 VVPKSVVFVIDVSGSMHGRKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A +++ ++ G TNI L + + ML +R S II ++TDG
Sbjct: 340 TPENIQEASKFV-MDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVE----------YPQNAILDLTQNSYQHFYDGSEIVVAGRL 507
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G + DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 508 ADEDMNSFKAAVKGHGAINDLT-FTEEVDMKEMEKALQERDYIFGDYIERLWAYLTIEQL 566
Query: 593 ---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS 636
A E + L + ++S++ + T M++ + ++ N +
Sbjct: 567 LDKRKNAQGEEKEILTAQALELSLKYHFVTPLTSMVVTKPEDNENQT 613
>gi|449473807|ref|XP_002191398.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4
[Taeniopygia guttata]
Length = 867
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 133/607 (21%), Positives = 250/607 (41%), Gaps = 69/607 (11%)
Query: 87 QMNAVE---LDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEIS 143
Q +AVE L VDC + + F T R+ ++S + V + I I
Sbjct: 15 QKHAVEIYSLHVDCKVTSRFAHTVITSRI-VNRANESREATFEVELPKTAFITNFSMSID 73
Query: 144 GKSY----HTQLIALGENDGA---GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRW 196
G+ Y + A E D A G+SA + + K F +++ I S ++ +L +
Sbjct: 74 GEVYPGIIKEKAAAQDEYDSAVSQGQSAGLVKITDRKLEQFHVSVS-IAAASKVTFQLSY 132
Query: 197 SQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQL 256
+ L + G++ + + V P K++ K +I +++ G L S+ +L
Sbjct: 133 EELLKRQLGKYELLIK------VRP--KQLVKHFQIDVHIFEPQGIHFLETDSTFMTNEL 184
Query: 257 RRDVGKLGYSYESEVL----------------KWSNIDFDFSYTVSPSHIFGGVLLQSPS 300
+ K+ ++ +L N DF Y V G + + +
Sbjct: 185 TEALTKVQNETKAHILFKPTLDQQKKNSELDETLLNGDFVVRYDVKREATAGDIQIVNG- 243
Query: 301 LHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDP 360
F Y P + VF K+VIF++D SGSM G+ +E T++AL L L
Sbjct: 244 --------YFVHYFAP---QEMPVFPKNVIFVIDRSGSMTGRKIEQTRDALLKILQDLRQ 292
Query: 361 GDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN 420
D F+ + FN + + +S+ AT+E V A + A G T+I L AV +L
Sbjct: 293 EDHFSFITFNNKVVEWKSSLLPATEENVASAAALVQ-TLTARGGTDISGALLAAVGVLEK 351
Query: 421 SRG----SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
+ G SI +I L+TDG + + ++ ++ + ++ G G ++ FL
Sbjct: 352 AEGLPERSISMIILLTDGQPTSGEKNVEVIQEKVQEAINGKYALFCLGFGFDVSYKFLEK 411
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
+A+ + G Y+ +Q+Q + + +L I + ++ E + +
Sbjct: 412 MALSNGGIARRIYENADAALQLQGFYQEVATPILMQIEMQYPENSVE-GLTKNNFKLFFE 470
Query: 537 ESPLIVSGRYQGK---FPDTLKAKGFLGDLS---NFVVELKLQLAKDIP------LDRIC 584
S +IVSG+ + P +KA+ +L+ V+ K Q+ ++ ++R+
Sbjct: 471 GSEIIVSGKISNELDLLPVEIKAQSHTSNLTLKEEANVKEKEQVFQNQKYIFGNFIERLW 530
Query: 585 AKQQIDLLTAQAWFSED---KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGT 641
A I L +A +++ K LE + +S++ + T M++ + ++ + P
Sbjct: 531 AYLSIQQLLEEAISAQEEDQKALEAQALDLSLRYSFVTPLTSMVVTKPQQQEELANKPTE 590
Query: 642 KKGSKKS 648
K S
Sbjct: 591 ADNEKPS 597
>gi|119585666|gb|EAW65262.1| inter-alpha (globulin) inhibitor H3, isoform CRA_a [Homo sapiens]
Length = 670
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 165/375 (44%), Gaps = 50/375 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPGTKKGSK 646
+ A E + L + +S++ + T M++ + DER A + PG S
Sbjct: 568 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPGEGGDSH 626
Query: 647 KSDHQKILDSEVPKT 661
+ + +L ++P +
Sbjct: 627 TGEPRHVLPDQLPAS 641
>gi|426249405|ref|XP_004018440.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Ovis
aries]
Length = 904
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 163/370 (44%), Gaps = 60/370 (16%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P S K+VIF++D SGSM G+ ++ T+ AL L L P D FN+++F
Sbjct: 260 FVHYFAP---DSLSTMPKNVIFVIDKSGSMMGRKIKQTQEALIKILDDLSPHDQFNLISF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + + A+ E V A + GI A G TNI + AV++L +
Sbjct: 317 SAEATKWKPLLVPASTENVNEAKSYATGIR--AQGGTNINDAMLMAVQLLEKANREELLP 374
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GSI +I L+TDG D +I + ++ + S ++ G G ++ FL +
Sbjct: 375 AGSITLIILLTDGDPTAGETDPLKIQENVRKAVNGQHS----LFCLGFGFDVSYAFLEKM 430
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G Y+ +Q+Q + + ++ ++A + YPS + ++
Sbjct: 431 ALENGGLARRIYEDSDSALQLQDFYQEVANPLMMSVAFE----------YPSNAVESVTQ 480
Query: 538 ---------SPLIVSGRYQGKFPDTLKAKGFLGDLS----NFVVELKLQLAKDI---P-- 579
S L+V+G+ + PD L A+ G L FV+E + +++ P
Sbjct: 481 DAFRVFFKGSELVVAGKLRDHSPDVLSAQ-VRGQLHRENITFVMESHVAEQEEMFRGPKY 539
Query: 580 -----LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ET 629
++R+ A I L Q A +E + LE + +S+ + T M+I E
Sbjct: 540 IFHSFMERLWAYLTIQQLLEQMVSALDAEKQALEARALNLSLSYSFVTPLTSMVITKPEG 599
Query: 630 DERNNASESP 639
E++ +E P
Sbjct: 600 QEQSQVAEKP 609
>gi|1915956|emb|CAA72309.1| inter-alpha-inhibitor heavy-chain 1 [Sus scrofa]
Length = 779
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 102/206 (49%), Gaps = 24/206 (11%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DIS SM+G+ ++ TK AL LS L PGD F++V F + S+ A+
Sbjct: 158 LNKNVVFVIDISSSMEGQKVKQTKEALLKILSDLKPGDYFDLVLFGSAVQSWRGSLVQAS 217
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
++ A ++ F GSTN+ L + +E+L ++GS+P I+ ++TDG
Sbjct: 218 TANLDAARSYVR-QFSLAGSTNLNGGLLRGIEILNKAQGSLPEFSNRASILIMLTDGEPT 276
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI DA++ R +Y G G + FL ++A+ + G Y
Sbjct: 277 EGVTDRSQILKNVRDAIRGRFP--------LYNLGFGHDVDWNFLEVMALENNGRAQRIY 328
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAI 515
+ Q+Q + + + +L ++ +
Sbjct: 329 EDHDAAQQLQGFYDQVANPLLKDVEL 354
>gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive
glycoprotein), isoform CRA_b [Homo sapiens]
Length = 914
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|345806502|ref|XP_548489.3| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H3 [Canis lupus familiaris]
Length = 891
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVVFVIDVSGSMHGRKMEQTKDALLKILGDMKGEDYLNFILFSGDVITWKDDLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E +E A ++ N G TNI L + + ML +R S II ++TDG
Sbjct: 340 TPENIEEARIFVK-NIHDRGLTNINDGLLRGISMLNRAREEHRVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESMALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVE----------YPENAIQDLTQNAYQHFYDGSEIVVAGRL 507
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 508 LDEDMNNFKADVKGHGATNDLT-FTEEVDMKEMEKALKEQDYIFGNYIERLWAYLTIEQL 566
Query: 593 TAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
+ A E ++L + +S++ + T M++ + +++ ++ PG
Sbjct: 567 LEKRKNAHGEEKEKLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPG 620
>gi|118617118|ref|YP_905450.1| hypothetical protein MUL_1447 [Mycobacterium ulcerans Agy99]
gi|118569228|gb|ABL03979.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length = 733
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 142/319 (44%), Gaps = 20/319 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+DV+ ++D SGSM G + + A + LD GD F ++AF+ + +M
Sbjct: 269 RDVVVVLDRSGSMGGWKMVAARRAAGRIVDMLDAGDRFCVLAFD-DRIETPPAMPDGLVP 327
Query: 387 AVER----AHQWIGINFIAGGSTNICAPLTKAVEMLTNS-RGSIPIIFLVTDGAVEDERQ 441
A +R A W+G + + G T + PLT AVEML +S + LV DG + E
Sbjct: 328 ASDRNRFAASSWLG-SLRSRGGTVMAQPLTNAVEMLADSGEDRQASVVLVADGQISGEDH 386
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ ++ + RIY G+ N FL LA + G D ++ M +L
Sbjct: 387 LLRSLAPAVGRT-----RIYCVGVDRAVNAGFLERLAGLGSGRAELVESEDRLDEVMARL 441
Query: 502 FTRGFSSVLANIAIDTLK-DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT---LKAK 557
L +I + + E + P R+PD + P ++SGRY+G D ++A
Sbjct: 442 ARTIGRPALTSIRVRAEGFEAIEDTVTPDRVPDAFAGVPCVISGRYRGPAADAALHVEAD 501
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQTG 616
G G + V + A+ + I A+ + DL A A LE+++ S++ G
Sbjct: 502 GATGRYTATVTARPVTDARAV--QTIWARSVVRDLEDAYACGRGTDELEQRLVAHSIRFG 559
Query: 617 VLCEYTRMIIVETDERNNA 635
VL +T + ++ ER+ A
Sbjct: 560 VLSRFTAFVAIDP-ERSEA 577
>gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Ailuropoda melanoleuca]
Length = 849
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 166/374 (44%), Gaps = 68/374 (18%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM G+ ++ T+ AL L L P D FN+++F
Sbjct: 261 FVHYFAP---EGLPTIPKNVIFVIDKSGSMSGRKMQQTREALIKILDDLSPKDQFNLISF 317
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+G+ + + A+ E V +A + GI A G T+I + AV++L +++
Sbjct: 318 SGDAAQWKPLLVPASAENVNQARSYAAGIQ--AHGGTDINEAVLMAVQLLNSAKQKELMP 375
Query: 423 -GSIPIIFLVTDG--------AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
G++ +I L+TDG +R + +A+ + + ++ G G ++ F
Sbjct: 376 EGTVSLIILLTDGDPTMGETNPARIQRNVKEAIDGQYS--------LFCLGFGFDVSYAF 427
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
L LA+ + G Y+ +Q+Q + + +L + + YPS + +
Sbjct: 428 LEKLALDNGGLARRIYEDSDSALQLQDFYEEVANPLLTAVTFE----------YPSNVVE 477
Query: 534 LSSE---------SPLIVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP--- 579
S+ S ++V+G+ + + PD L AK G L + N + + ++ +
Sbjct: 478 EVSQDNFRLLFKGSEIVVAGKLRDQSPDVLSAKVRGQL-HMQNITFQTESRVTEQEKEFQ 536
Query: 580 ---------LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV 627
++R+ A I L Q A +E + LE + +S+ + T M++
Sbjct: 537 SPKYIFHSFMERLWAYLTIQQLLEQMVSASATEKQALETRALNLSLHYCFVTPLTSMVVT 596
Query: 628 ETD--ERNNASESP 639
+ D ER+ +E P
Sbjct: 597 KPDGRERSEVAEKP 610
>gi|348588538|ref|XP_003480023.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Cavia
porcellus]
Length = 889
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 116/260 (44%), Gaps = 31/260 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ +E TK+AL L + D N + F+G + + A
Sbjct: 280 VVPKNVVFVIDVSGSMSGRKIEQTKDALLRILEDVRKEDYLNFILFSGSVTTWKDQLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ N G TNI L +A+ ML ++ S II ++TDG
Sbjct: 340 TPENLQEARTFVK-NIYDQGLTNINDALLRAIHMLNKAQEEHLVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
D ++ + N +Y G G ++ FL +A+ ++G Y+ ++
Sbjct: 399 NTGESRPDKIRENVRNAIQGKFPLYNLGFGKNLDYSFLESMALENQGLARRIYEDSDADL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY- 546
Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 459 QLQGFYEEVANPLLTGVQVE----------YPDNAIQDLTQNSYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDL 563
F +K +G L DL
Sbjct: 509 DEDMNSFTADVKGQGALNDL 528
>gi|397495909|ref|XP_003818786.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1
[Pan paniscus]
Length = 900
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 166/371 (44%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEEQLP 373
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL L
Sbjct: 374 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 MEGESRNRNVH 619
>gi|397495911|ref|XP_003818787.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2
[Pan paniscus]
Length = 935
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 QLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens]
Length = 888
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 247 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 303
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 304 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 358
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 359 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 414
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 415 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 473
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 474 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 533
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 534 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 593
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 594 EKPMEGESRNRNVH 607
>gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens]
Length = 644
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 162/357 (45%), Gaps = 43/357 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F+ E + S+ A+ E
Sbjct: 3 KNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPASAE 62
Query: 387 AVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG- 434
V +A +F AG G TNI + AV++L +S GS+ +I L+TDG
Sbjct: 63 NVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLTDGD 117
Query: 435 ---AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
+ R I + ++ ++ S ++ G G ++ FL LA+ + G ++
Sbjct: 118 PTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKLALDNGGLARRIHED 173
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP 551
+Q+Q + + +L + + + E E+ + L S ++V+G+ Q + P
Sbjct: 174 SDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKGSEMVVAGKLQDRGP 232
Query: 552 DTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRICAK---QQIDLLTAQ 595
D L A F + S E + Q K I ++R+ A QQ+ T
Sbjct: 233 DVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLWAYLTIQQLLEQTVS 292
Query: 596 AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESPGTKKGSKKSDH 650
A ++ + L + +S+ + T M++ + D E++ +E P + ++ H
Sbjct: 293 ASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKPMEGESRNRNVH 349
>gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M]
gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M]
Length = 772
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 143/320 (44%), Gaps = 22/320 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+DV+ ++D SGSM G + + A + LD GD F ++AF+ + +M
Sbjct: 308 RDVVVVLDRSGSMGGWKMVAARRAAGRIVDMLDAGDRFCVLAFD-DRIETPPAMPDGLVP 366
Query: 387 AVER----AHQWIGINFIAGGSTNICAPLTKAVEMLTNS-RGSIPIIFLVTDGAVEDERQ 441
A +R A W+G + + G T + PLT AVEML +S + LVTDG + E
Sbjct: 367 ASDRNRFAASSWLG-SLRSRGGTVMAQPLTNAVEMLADSGEDRQASVVLVTDGQISGEDH 425
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ ++ + RIY G+ N FL LA + G D ++ M +L
Sbjct: 426 LLRSLAPVVGRT-----RIYCVGVDRAVNAGFLERLAGLGSGRAELVESEDRLDEVMARL 480
Query: 502 FTRGFSSVLANIAIDTLKDLDEFE--MYPSRIPDLSSESPLIVSGRYQGKFPDT---LKA 556
L +I + + + E + P R+PD + P ++SGRY+G D ++A
Sbjct: 481 ARTIGRPALTSIRVRA-EGFEAIEDTVTPDRVPDAFAGVPCVISGRYRGPAADAALHVEA 539
Query: 557 KGFLGDLSNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSEDKRLEEKVSKMSVQT 615
G G + V + A+ + I A+ + DL A A LE+++ S++
Sbjct: 540 DGATGRYTATVPARPVTDARAV--QTIWARSVVRDLEDAYACGRGTDDLEQRLVAHSIRF 597
Query: 616 GVLCEYTRMIIVETDERNNA 635
GVL +T + ++ ER A
Sbjct: 598 GVLSRFTAFVAIDP-ERTEA 616
>gi|301060207|ref|ZP_07201074.1| von Willebrand factor type A domain protein [delta proteobacterium
NaphS2]
gi|300445719|gb|EFK09617.1| von Willebrand factor type A domain protein [delta proteobacterium
NaphS2]
Length = 768
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 151/352 (42%), Gaps = 36/352 (10%)
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVT----------- 220
+PNIFTL+ ++ G+ I + + + + +P P Y+
Sbjct: 102 RPNIFTLSAGNLNAGTEAVINIEYVTLIDGDEHTLRFLLPTTITPRYIPDHQAEDEDIPL 161
Query: 221 ------PAIKKIPKREKIHLNVNAGTGTEVLCNTSSH--HLKQLRRDVGKLGYSYESEVL 272
P + +P ++HL + A + + SH ++QL + +S ES ++
Sbjct: 162 DHRIHPPYARSVPYGMEMHLTLAAADKFSAI-ESPSHPIRIEQLSGRQTVISFSQESVLM 220
Query: 273 KWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFI 332
+ DF + T S + L ++ V Q + + + + K +D++F+
Sbjct: 221 ---DRDFILTLTFSEPGVSQVFLCRNNGATYV-QMDFYMEH-----EEQNKSGNEDLLFV 271
Query: 333 VDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAH 392
VD SGSMQG + + + AL V L L+ G FNI+ F + +++ +ERA
Sbjct: 272 VDCSGSMQGDSIYEARQALDVCLKALEEGRRFNIIRFGSRFESLFSEPRAYSEKTLERAL 331
Query: 393 QWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTN 452
W G T I PL ++ +S I L+TDGAV +E I +++R
Sbjct: 332 SWSRNMQADLGGTEILQPLQHIYKVQGDSDSRYGSILLLTDGAVGNEDDIFYLVRNR--- 388
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
PR++ GIG+ CN F++ LA +G Y + + ++ LF R
Sbjct: 389 ---SGPRVFPVGIGAGCNEAFIKGLARAGKGDSAFIYPGERLGPKILSLFGR 437
>gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens]
Length = 935
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor
[Homo sapiens]
gi|229463048|sp|Q14624.4|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4;
Short=ITI heavy chain H4; Short=ITI-HC4;
Short=Inter-alpha-inhibitor heavy chain 4; AltName:
Full=Inter-alpha-trypsin inhibitor family heavy
chain-related protein; Short=IHRP; AltName: Full=Plasma
kallikrein sensitive glycoprotein 120; Short=Gp120;
Short=PK-120; Contains: RecName: Full=70 kDa
inter-alpha-trypsin inhibitor heavy chain H4; Contains:
RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy
chain H4; Flags: Precursor
gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens]
Length = 930
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor
[Homo sapiens]
Length = 900
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein
(IHRP) [Homo sapiens]
Length = 930
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive
glycoprotein), isoform CRA_a [Homo sapiens]
Length = 930
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein
[Homo sapiens]
Length = 930
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens]
Length = 900
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEG 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|2292988|emb|CAA72155.1| Inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus]
Length = 932
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 158/352 (44%), Gaps = 55/352 (15%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM GK ++ T+ AL L L D FNI+ F+GE + + AT+E
Sbjct: 272 KNVLFVIDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATEE 331
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAV--- 436
+ RA + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 332 NLNRAVDYAS-KIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVSLIILLTDGEPTVG 390
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E +I NG ++ G G N+ FL LA+ + G Y+ +
Sbjct: 391 ETNPKIIQKNTQEAINGRY---SLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSAL 447
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLS--------SESPLIVSGRYQ 547
Q+Q + + +L+++ + YPS + D++ S ++V+G+ +
Sbjct: 448 QLQDFYQEVANPLLSSVTFE----------YPSNAVEDVTRYNFQHHFKGSEMVVAGKLR 497
Query: 548 GKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQI 589
+ PD L AK G + L N + + +A+ ++R+ A +QQ+
Sbjct: 498 DQGPDVLLAKVSGQM-HLQNITFQTEASIAQQEKEFQGPKYIFHNFMERLWALLTIQQQL 556
Query: 590 DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+ + A +E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 557 EQRIS-ASGAELEALEAQVLNLSLKYNFVTPLTHMVVTKPEDQEQFQVAEKP 607
>gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive
glycoprotein) [Homo sapiens]
Length = 930
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPMEGESRNRNVH 619
>gi|395833006|ref|XP_003789538.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Otolemur
garnettii]
Length = 877
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/262 (26%), Positives = 120/262 (45%), Gaps = 33/262 (12%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++DISGSM+G+ L+ TK+AL L + D N + F+G + S+ A
Sbjct: 280 VVPKNVVFVIDISGSMEGRKLKQTKDALLKILDDMKEEDYLNFILFSGHVDTWKESLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ N + G TNI L + ML +R S I+ ++TDG
Sbjct: 340 TAENLQEARAFVR-NMRSTGLTNINDGLLTGISMLNQAREAHVVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G Y+ +
Sbjct: 399 NVGVSRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLEAMALENHGVARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGR- 545
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVE----------YPENAILDLTQNSYHHFYDGSEIVVAGRL 507
Query: 546 ---YQGKFPDTLKAKGFLGDLS 564
+ F +K G + DL+
Sbjct: 508 VDEHMNSFKADVKGHGAINDLT 529
>gi|170751541|ref|YP_001757801.1| vault protein inter-alpha-trypsin subunit [Methylobacterium
radiotolerans JCM 2831]
gi|170658063|gb|ACB27118.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium
radiotolerans JCM 2831]
Length = 729
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 99/220 (45%), Gaps = 8/220 (3%)
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
V F++D SGSM G + K L L +L P D FN++ F+ AT+ A+
Sbjct: 334 VTFVIDNSGSMSGASMRQAKAGLLAGLGRLSPRDRFNVIRFDDTWDALHPEPVPATRAAL 393
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++ A G T + APL A+ G + + +TDGAV DE +I A+ +
Sbjct: 394 AEAEAFVA-ALEARGGTEMLAPLKAALADPHPEDGRVRQVVFLTDGAVGDEERIFAAIHA 452
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS 508
L R++ GIGS N + +R A I RG + DL + + + L+ + S
Sbjct: 453 DLGRT-----RLFMVGIGSAPNGHLMRHAAEIGRGSFTEIRDLAQVGERTRALYEKLESP 507
Query: 509 VLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
+ ++ + + P +PDL PL+ + + G
Sbjct: 508 AVTDLT--ATFSVPGVAVTPGALPDLYRGEPLVFAAQLPG 545
>gi|126722991|ref|NP_062242.2| inter-alpha-inhibitor H4 heavy chain precursor [Rattus norvegicus]
gi|59808174|gb|AAH89806.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Rattus norvegicus]
gi|149034208|gb|EDL88978.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_a [Rattus
norvegicus]
Length = 933
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 158/352 (44%), Gaps = 55/352 (15%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM GK ++ T+ AL L L D FNI+ F+GE + + AT+E
Sbjct: 273 KNVLFVIDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ RA + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNRAVDYAS-KIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVSLIILLTDGEPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E +I NG ++ G G N+ FL LA+ + G Y+ +
Sbjct: 392 ETNPKIIQKNTQEAINGRY---SLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLS--------SESPLIVSGRYQ 547
Q+Q + + +L+++ + YPS + D++ S ++V+G+ +
Sbjct: 449 QLQDFYQEVANPLLSSVTFE----------YPSNAVEDVTRYNFQHHFKGSEMVVAGKLR 498
Query: 548 GKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQI 589
+ PD L AK G + L N + + +A+ ++R+ A +QQ+
Sbjct: 499 DQGPDVLLAKVSGQM-HLQNITFQTEASIAQQEKEFQGPKYIFHNFMERLWALLTIQQQL 557
Query: 590 DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+ + A +E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 558 EQRIS-ASGAELEALEAQVLNLSLKYNFVTPLTHMVVTKPEDQEQFQVAEKP 608
>gi|410951365|ref|XP_003982368.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 1
[Felis catus]
Length = 910
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 117/247 (47%), Gaps = 25/247 (10%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+++F++DIS SM+G+ ++ TK AL L + PGD F++V F E + S+ A+
Sbjct: 288 LNKNLVFVIDISTSMEGQKVKQTKEALLKILGDMRPGDYFDLVLFGSEVQSWRGSLVPAS 347
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
V A ++ F G+TN+ L + +E+L + GS+P ++ ++TDG
Sbjct: 348 AANVRAAQDFVR-RFHLAGATNLNGGLLRGIEILNQAHGSVPELSNHASVLIMLTDGEPT 406
Query: 435 -AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI +++ + GG +Y G G + FL +++M + G Y+
Sbjct: 407 EGVTDRSQILKNVRNAI--GGRFP--LYNLGFGHNVDWNFLEVMSMENNGRAQRIYEDHD 462
Query: 494 IEIQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGK 549
Q+Q + + + +L ++ + DT+ L + R S ++V+GR
Sbjct: 463 ATQQLQGFYDQVANPLLVDVELLYPQDTVSALTQ-----HRHKQYYEGSEIMVAGRIADH 517
Query: 550 FPDTLKA 556
+ KA
Sbjct: 518 KSSSFKA 524
>gi|354465725|ref|XP_003495327.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like
[Cricetulus griseus]
Length = 910
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 115/234 (49%), Gaps = 25/234 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++DISGSM+G+ ++ TK AL L + PGD+F++V F + + S+ AT+
Sbjct: 290 KNLVFVIDISGSMEGQKVKQTKEALLKILGDVKPGDNFDLVLFGSQVQSWKGSLVPATQA 349
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F G+TN+ L + +E+L ++GS P I+ ++TDG
Sbjct: 350 NLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELSSPASILIMLTDGEPTEG 408
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
D QI +A++ R +Y G G + FL +++M + G+ Y+
Sbjct: 409 ETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFNFLEVMSMENSGWAQRIYED 460
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
Q+Q + + + +L ++ + +D + R S ++V+GR
Sbjct: 461 HDATQQLQGFYNQVANPLLTDVELQYPQD-SVLTLTQHRHKQYYDGSEIVVAGR 513
>gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive
glycoprotein) variant [Homo sapiens]
Length = 699
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 166/371 (44%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 244 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 300
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 301 STEATQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEERLP 358
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL L
Sbjct: 359 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 414
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 415 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 473
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 474 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLW 533
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 534 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 593
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 594 MEGESRNRNVH 604
>gi|410951367|ref|XP_003982369.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 2
[Felis catus]
Length = 660
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 117/247 (47%), Gaps = 25/247 (10%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+++F++DIS SM+G+ ++ TK AL L + PGD F++V F E + S+ A+
Sbjct: 38 LNKNLVFVIDISTSMEGQKVKQTKEALLKILGDMRPGDYFDLVLFGSEVQSWRGSLVPAS 97
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
V A ++ F G+TN+ L + +E+L + GS+P ++ ++TDG
Sbjct: 98 AANVRAAQDFVR-RFHLAGATNLNGGLLRGIEILNQAHGSVPELSNHASVLIMLTDGEPT 156
Query: 435 -AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI +++ + GG +Y G G + FL +++M + G Y+
Sbjct: 157 EGVTDRSQILKNVRNAI--GGRFP--LYNLGFGHNVDWNFLEVMSMENNGRAQRIYEDHD 212
Query: 494 IEIQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGK 549
Q+Q + + + +L ++ + DT+ L + R S ++V+GR
Sbjct: 213 ATQQLQGFYDQVANPLLVDVELLYPQDTVSALTQ-----HRHKQYYEGSEIMVAGRIADH 267
Query: 550 FPDTLKA 556
+ KA
Sbjct: 268 KSSSFKA 274
>gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
protein-like [Xenopus (Silurana) tropicalis]
Length = 1179
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/398 (23%), Positives = 172/398 (43%), Gaps = 66/398 (16%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y VS + G V + + F Y P + +KDVIF++D+SG
Sbjct: 256 DFVLQYDVSLKDLAGDVQIYN---------GYFVHYFAP---RGLPPIQKDVIFVIDVSG 303
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G ++ TK+A+ V L+ L DSFNI+ F+ +++ + AT + + A ++
Sbjct: 304 SMFGTKIKQTKSAMHVILNDLHRDDSFNIITFSDVVHVWRPGQSIPATAQNKKSAKDYVN 363
Query: 397 INFIAGGSTNICAPLTKAVEMLTN---------SRGSIPIIFLVTD-----GAVEDERQI 442
A G T+I A L A + S IP+I +TD G + R +
Sbjct: 364 -KIEADGWTDINAALMAAASIFNQTSHKPEKETSTKKIPLIIFLTDGEATSGVLATSRIL 422
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+A K+ GG+I ++ G ++ +R L++ +RG Y+ +Q++ +
Sbjct: 423 SNAQKAM---GGTIS--LFCLAFGEDADYNLMRRLSLENRGIARRIYEYSDATLQLKGFY 477
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD 562
S +L +I ++ L++ + + + P+ S L+V+G+ + D LK +
Sbjct: 478 DEIASPLLFDIELNYLEESAQ-NVTQTLFPNYFEGSELVVTGKLKPGTKD-LKVR----- 530
Query: 563 LSNFVVELKLQLAKDIPLDRICAK---------QQIDLLTAQAW--FSEDKRLE------ 605
L+ + KL + DI ++ + +QI + W F+ LE
Sbjct: 531 LTAHNQKQKLSMENDISMENNSTEPNFGCSTNVEQIQWFVQRLWAYFTVKDLLEARIKAN 590
Query: 606 ---------EKVSKMSVQTGVLCEYTRMIIVETDERNN 634
EK + +S++ + T +IIV+ ++ N
Sbjct: 591 DTVARRLLTEKATNLSLKYNFVTPVTSLIIVKPEDEEN 628
>gi|281343039|gb|EFB18623.1| hypothetical protein PANDA_003717 [Ailuropoda melanoleuca]
Length = 1230
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G + K+A+ VAL L P FN++ F S + ++E+
Sbjct: 361 EFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEES 420
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
V A I G TNI +PL + + RG ++FL+TDGAV + ++ + ++
Sbjct: 421 VAMACDNIQRMRADMGGTNILSPLKWIIRQPVH-RGHPRLLFLITDGAVNNTGKVLELLR 479
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+ + R Y+FGIG H +R LA +S+G + + ++ +M K + +
Sbjct: 480 NH-----AFSTRCYSFGIGPNVCHRLVRGLATVSKGSAEFLVEGERLQPKMIKSLKKAMA 534
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 535 PVLSDVTVE 543
>gi|301759707|ref|XP_002915679.1| PREDICTED: von Willebrand factor A domain-containing protein
5B1-like [Ailuropoda melanoleuca]
Length = 1224
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G + K+A+ VAL L P FN++ F S + ++E+
Sbjct: 361 EFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEES 420
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
V A I G TNI +PL + + RG ++FL+TDGAV + ++ + ++
Sbjct: 421 VAMACDNIQRMRADMGGTNILSPLKWIIRQPVH-RGHPRLLFLITDGAVNNTGKVLELLR 479
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+ + R Y+FGIG H +R LA +S+G + + ++ +M K + +
Sbjct: 480 NH-----AFSTRCYSFGIGPNVCHRLVRGLATVSKGSAEFLVEGERLQPKMIKSLKKAMA 534
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 535 PVLSDVTVE 543
>gi|389844883|ref|YP_006346963.1| hypothetical protein Theba_2086 [Mesotoga prima MesG1.Ag.4.2]
gi|387859629|gb|AFK07720.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Mesotoga prima MesG1.Ag.4.2]
Length = 704
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 159/336 (47%), Gaps = 33/336 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + L+P K V KDV+F++DISGSM G+ +E K AL L L PGD F IV F
Sbjct: 250 FLLTLIP-RIKEEIVIPKDVVFVLDISGSMYGEKIEQAKRALEQVLQMLRPGDRFAIVTF 308
Query: 370 NGETYLFSTSMELATKEAVERAHQWI-GINFI-AGGSTNICAPLTKAVEMLTN-SRGSIP 426
+G + + S+ A+++A +WI + I A G TNI L +++M + G
Sbjct: 309 DGRVHNLTGSLLDASEKA-----EWIEKVRRIQADGMTNIYGALQTSIDMFSKYDTGRFK 363
Query: 427 IIFLVTDGA-----VEDERQICDAM-KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
+ +TDG + R I DA ++R N +++FG+G+ L L
Sbjct: 364 ALLFLTDGEPTEGITDIGRIISDATPEARARN-----VHLFSFGVGTGVVAELLDRLVQE 418
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + +SIE ++ L+ + L N+ + ++++LD + P L S L
Sbjct: 419 NAGRVSYIIEGESIEAKVTDLYRSIETPALENVTV-SIENLDVKKTLPEGPYSLFSGQAL 477
Query: 541 IVSGRY--QGKFPDTLKA-KGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQA 596
+SG Y +G T++ +G G+ + K D P ++RI A+++I L
Sbjct: 478 RISGIYFDEGDMRVTVEGTRG--GEKYRYEYLFKTSKRSDSPFIERIWAQKRIAHLANLY 535
Query: 597 WF------SEDKRLEEKVSKMSVQTGVLCEYTRMII 626
+ +E L+ ++ +S + ++ E+T +I
Sbjct: 536 RYDPSLTEAEKDELKLEIIALSKKFNIINEFTSYLI 571
>gi|189536038|ref|XP_693697.3| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Danio rerio]
Length = 1157
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 162/360 (45%), Gaps = 48/360 (13%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + V KDVIF++DISGSM G ++ TK A+ LS L GD FN++ F
Sbjct: 278 FVHYFAP---RGLPVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILSDLREGDYFNLITF 334
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICA----------PLTKAVEML 418
+ + + + + AT++ V A +++ IA G TNI A P T++
Sbjct: 335 SDDVHTWKKDRTVRATRQNVRDAKEFVR-KIIAAGWTNINAALLSAAKLLNPSTRSSSST 393
Query: 419 TNSRGS--IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGI--GSYCNHYFL 474
+ S +P+I +TDG D + L N + FG+ G + L
Sbjct: 394 GRAPSSQRVPMIIFLTDGEATIGETETDVI---LHNAQKSLGLVSLFGLAFGDDADFPML 450
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
R LA+ +RG Y+ D IQ++ + + +L++I + L D +++ S P+
Sbjct: 451 RRLALENRGVARMVYEDDDAAIQLKGFYDEVATPLLSDIQLSYLDD-QVYDITRSLFPNY 509
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFV-VELKLQLAK----------------- 576
S L+V+GR + D LK D V VE +L LAK
Sbjct: 510 FQGSELVVTGRLKPGVRD-LKITLTASDSKQKVKVENELPLAKALINGTDPALVCGQALD 568
Query: 577 DIP--LDRICAKQQI-DLLTAQAWFSE---DKRLEEKVSKMSVQTGVLCEYTRMIIVETD 630
IP + R+ A I +LL A+ S+ + L EK + +S++ + T +++++ D
Sbjct: 569 GIPSFVHRLWAYFTIKELLLAKMNTSDVTIQRLLVEKATNLSLENNFVTPVTSLVVIKPD 628
>gi|126723497|ref|NP_001075477.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Oryctolagus
cuniculus]
gi|75056157|sp|Q9GLY5.1|ITIH3_RABIT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; Flags:
Precursor
gi|11041718|dbj|BAB17302.1| inter-alpha-trypsin inhibitor heavy chain3 [Oryctolagus cuniculus]
Length = 903
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 159/361 (44%), Gaps = 48/361 (13%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ LE TK+AL L + D N + F+ + + + A
Sbjct: 280 VVPKNVVFVIDVSGSMYGRKLEQTKDALLKILEDMREEDHLNFILFSSDVTTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + GSTN+ L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARAFVK-SIQDQGSTNLNDGLLRGISMLNTAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL LA+ + G+ Y+
Sbjct: 399 NSGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLESLALENDGFARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+ GF +AN + ++ E E + I DL+ S ++V+GR
Sbjct: 458 LQLH-----GFYEEVANPLLTSV----EMEYPKNAILDLTQNSYQHFYDGSEIVVAGRLA 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI----------PLDRICAKQQIDLLT 593
F +K G L DL+ F E+ L+ + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGALNDLT-FTEEVDLKETEAALKEREYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPGTKKGSKK 647
+ A E + L + +S++ + T M++ + +++ ++ PG ++
Sbjct: 568 EKRKNARGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDQTAIADKPGEEQAQAA 627
Query: 648 S 648
S
Sbjct: 628 S 628
>gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Pongo
abelii]
gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii]
Length = 896
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 164/371 (44%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM GK ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGKKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQDQGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRTQALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|256393600|ref|YP_003115164.1| Vault protein inter-alpha-trypsin domain-containing protein
[Catenulispora acidiphila DSM 44928]
gi|256359826|gb|ACU73323.1| Vault protein inter-alpha-trypsin domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1033
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 108/450 (24%), Positives = 188/450 (41%), Gaps = 80/450 (17%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV 219
AG+ AS+ +P +FTL + I G + IR S +L Y DG+ + F+FP V
Sbjct: 112 AGQRASIAEEE--RPGVFTLRVGNIMPGERVVIRTSLSGRLPYEDGQAT----FRFPLVV 165
Query: 220 TPAIKKIPKREKIHLNVNAGTGT----------------------------EVLCN---- 247
P + IP + V +GT + EV +
Sbjct: 166 AP--RYIPGADLPGEQVGSGTASDTDQVPDASRISPPILLPGFPNPVRLSIEVAVDPVGL 223
Query: 248 -----TSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDF----SYTVSPSHIFGGVLLQS 298
TSS H + G Y + + D DF Y S + V S
Sbjct: 224 PLAGLTSSLHGVSVEESEGS---RYLVRLNPGARADRDFVLRLGYGGSGAATSLAVAWDS 280
Query: 299 PSLHDVDQRE---------MFCMYLLPG--TAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
S ++V + F + +LP T +R +DV I+D SGSM G +
Sbjct: 281 ESANEVAKESAKATPTDIGTFLLTVLPPEPTGATRP---RDVALILDRSGSMGGWKMTAA 337
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAG----G 403
+ A A + L D F ++ F+ + + EA +R H++ + +A G
Sbjct: 338 RRAAARIVDTLTAEDRFAVLTFDDQMET-PDGLPTGLSEATDR-HRFRAVQHLATVDARG 395
Query: 404 STNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYT 462
T + PL +A +L++ ++ L+TDG V +E ++ + +LT+ R++T
Sbjct: 396 GTEMEPPLRRAATLLSDDNPDRDRVLILITDGQVGNEDRLLTTLSPKLTH-----IRVHT 450
Query: 463 FGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK--D 520
GI + N FL+ L+ + G+ D ++ M + R + ++ + + + +
Sbjct: 451 VGIDTAVNAAFLQRLSTLGGGHCELVESEDRLDDAMDAIHHRIATPLVTGLHLTGVGGLE 510
Query: 521 LDEFEMYPSRIPDLSSESPLIVSGRYQGKF 550
L++ + P+R+PDL + +PL+V+GR GK
Sbjct: 511 LEQNSVTPTRLPDLFAGAPLVVAGRLGGKM 540
>gi|403291065|ref|XP_003936620.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Saimiri
boliviensis boliviensis]
Length = 906
Score = 86.7 bits (213), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/370 (24%), Positives = 152/370 (41%), Gaps = 59/370 (15%)
Query: 206 EFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGY 265
+ V PF E IKK +K H+ G + C T S L
Sbjct: 204 KLDVQAPFLPKELAAQTIKKSFSGKKGHVLFRPTVGQQQACPTCSTSLL----------- 252
Query: 266 SYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF 325
N DF +Y VS I ++ + F + P +
Sbjct: 253 ----------NGDFKVTYDVSRDKICDLLVANN----------HFAHFFAPQNLTN---M 289
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 290 NKNVVFVIDISGSMRGQKVKQTKEALLQILGDMQPGDYFDLVLFGSRVQSWKGSLVQASE 349
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG---- 434
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 350 ANLQAARDFVR-GFSLDEATNLNGGLLRGIEILNKVQESLPELSNHASILIMLTDGDPTE 408
Query: 435 AVEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
V D QI ++S +I R +Y G G + FL +++M + G Y+
Sbjct: 409 GVTDRSQILKNVRS------AIQGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDR 462
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----QG 548
Q+Q +++ +L ++ + +D + R S ++V+GR QG
Sbjct: 463 DAAQQLQGFYSQVAKPLLVDVELQYPQDAVS-ALTQHRHKQYYEGSEIVVAGRIADNKQG 521
Query: 549 KFPDTLKAKG 558
F +KA G
Sbjct: 522 SFKADVKAHG 531
>gi|390355328|ref|XP_789748.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Strongylocentrotus purpuratus]
Length = 1060
Score = 86.7 bits (213), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 116/242 (47%), Gaps = 24/242 (9%)
Query: 290 IFGGVLLQSPSLHDVDQREM------FCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGK 342
I G +++ +HD + F Y P G K+RK +V+F++D+SGSM+G+
Sbjct: 297 IMGDFIIKYDVMHDAKAGHLQIVNGYFVHYFSPVGLPKTRK----NVVFVIDVSGSMRGR 352
Query: 343 PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF-STSMELATKEAVERAHQWIGINFIA 401
++ TK A L + P D NIV F ++ S M AT + + A + + A
Sbjct: 353 KMDQTKRAFTTILDDVRPIDRINIVLFESNVRVWRSNQMVEATGDNIAAAKNHVN-DISA 411
Query: 402 GGSTNICAPLTKAVEMLTNSRG--SIPIIFLVTD-----GAVEDERQICDAMKSRLTNGG 454
GG TN+ LT AV++L ++P+I ++TD G+V +I R+TN
Sbjct: 412 GGGTNLYDGLTNAVDLLMEHGNGEAMPLIIMLTDGQPTSGSVTSTSEII----KRITNLI 467
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
+++ G G+ + FL L++ ++ Y+ S +QM+ + + +L NI
Sbjct: 468 DGRLSLFSVGFGNGVDFSFLEKLSLSNQALARKVYEDSSASLQMKGFYDEVANPLLFNIN 527
Query: 515 ID 516
I+
Sbjct: 528 IE 529
>gi|149034209|gb|EDL88979.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Rattus
norvegicus]
Length = 706
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 158/352 (44%), Gaps = 55/352 (15%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM GK ++ T+ AL L L D FNI+ F+GE + + AT+E
Sbjct: 273 KNVLFVIDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATEE 332
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG---AV 436
+ RA + A G TNI + AVE+L S S+ +I L+TDG
Sbjct: 333 NLNRAVDYAS-KIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVSLIILLTDGEPTVG 391
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E +I NG ++ G G N+ FL LA+ + G Y+ +
Sbjct: 392 ETNPKIIQKNTQEAINGRY---SLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSAL 448
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRYQ 547
Q+Q + + +L+++ + YPS + D++ S ++V+G+ +
Sbjct: 449 QLQDFYQEVANPLLSSVTFE----------YPSNAVEDVTRYNFQHHFKGSEMVVAGKLR 498
Query: 548 GKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP------------LDRICA----KQQI 589
+ PD L AK G + L N + + +A+ ++R+ A +QQ+
Sbjct: 499 DQGPDVLLAKVSGQM-HLQNITFQTEASIAQQEKEFQGPKYIFHNFMERLWALLTIQQQL 557
Query: 590 DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
+ + A +E + LE +V +S++ + T M++ E E+ +E P
Sbjct: 558 EQRIS-ASGAELEALEAQVLNLSLKYNFVTPLTHMVVTKPEDQEQFQVAEKP 608
>gi|348503123|ref|XP_003439116.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Oreochromis niloticus]
Length = 1302
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 111/211 (52%), Gaps = 11/211 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L + RKV + IF++D SGSM G + K+A+ V L L PG FN+V F
Sbjct: 333 FCPDLKSICSDLRKVHG-EFIFLIDRSGSMSGVNISRVKDAMVVILKSLVPGCVFNVVGF 391
Query: 370 NGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
G T+ LF+TS ++A+ A +++ G TNI APLT + S G +
Sbjct: 392 -GSTFKSLFTTSQNYE-EDALAIACEYVRKIRADMGGTNILAPLTWILRQPLIS-GHPRL 448
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
+FL+TDGAV + +I D ++S + R +TFGIG ++ LA +S+G
Sbjct: 449 LFLLTDGAVSNTGKIIDLVRSH-----ARYIRCFTFGIGQNACRRLVQGLATVSKGTAEF 503
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
D + ++ +M K + + VL++I+I+ L
Sbjct: 504 LADGERLQPKMIKSLKKTMTPVLSDISIEWL 534
>gi|355559572|gb|EHH16300.1| hypothetical protein EGK_11564 [Macaca mulatta]
Length = 931
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 161/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINEAMLVAVQLLDSSNQEERLP 373
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 EGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSMAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AKQQIDLLTAQAWFSED---KRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L Q + D + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQIVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens]
Length = 648
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 166/371 (44%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEERLP 373
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL L
Sbjct: 374 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 MEGESRNRNVH 619
>gi|345794066|ref|XP_535380.3| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Canis lupus familiaris]
Length = 1118
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G + K+A+ VAL L P FN++ F S + ++E+
Sbjct: 361 EFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEES 420
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
V A I G TNI +PL + + RG ++FL+TDGAV + ++ + ++
Sbjct: 421 VAMACDNIQRMRADMGGTNILSPLKWIIRQPVH-RGHPRLLFLITDGAVNNTGKVLELVR 479
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+ + R Y+FGIG H +R LA +S+G + + ++ +M K + +
Sbjct: 480 NH-----AFSTRCYSFGIGPNVCHRLVRGLATVSKGSAEFLVEGERLQPKMIKSLKKAMA 534
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 535 PVLSDVTVE 543
>gi|301767172|ref|XP_002919036.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like
[Ailuropoda melanoleuca]
Length = 910
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 116/240 (48%), Gaps = 33/240 (13%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DIS SM+G+ ++ TK AL L + PGD F++V F E + S+ A+
Sbjct: 288 LNKNVVFVIDISTSMEGQKVKQTKEALLKILGDIRPGDYFDLVLFGSEVQSWRGSLVQAS 347
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ A ++ F+A G+TN+ L + +E+L ++GS+P I+ ++TDG
Sbjct: 348 PANLRAAQDFVRRFFLA-GATNLNGGLLRGIEILNQAQGSLPELSNHASILIMLTDGEPT 406
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 407 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHDVDLNFLEVMSMENNGRAQRIY 458
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ Q+Q + + + +L ++ + DT+ L + R S ++V+GR
Sbjct: 459 EDHDATQQLQGFYDQVANPLLVDVELLYPPDTVSALTQ-----HRHKQYYEGSEIMVAGR 513
>gi|327261941|ref|XP_003215785.1| PREDICTED: hypothetical protein LOC100567114 [Anolis carolinensis]
Length = 1225
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 167/383 (43%), Gaps = 54/383 (14%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +KDV+F++DISGSM G ++ TK A+ V LS L D FNIV F
Sbjct: 262 FVHYFAP---RGLPPVQKDVVFVIDISGSMYGTKMKQTKKAMHVILSDLHQDDFFNIVTF 318
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ S + AT + +++A ++ A G T+I A L A + +S
Sbjct: 319 SDTVNVWKPSQSIQATPQNIKKAKDYVS-KMEADGWTDINAALLAAASVFNHSSPMAGKI 377
Query: 423 ---GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
IP+I +TDG V +I + L G+I ++ G ++ LR
Sbjct: 378 MRDQRIPLIIFLTDGEPTSGVTTGSRILSNAQQALK--GTIS--LFGLAFGDDADYGLLR 433
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
L++ +RG Y+ +Q++ + S +L ++ + L + + ++ + P+
Sbjct: 434 RLSLENRGVARRIYEDADATLQLKGFYDEIASPLLYDVELAYLDGVAQ-DLTQNLFPNYF 492
Query: 536 SESPLIVSGRYQGKFPD---TLKAKGFLGDLSNFVVELKLQLAKDIP------------- 579
S L+V+G+ + + G G LS ++ + +P
Sbjct: 493 QGSELVVAGKVKPGVSELRVRTTGHGQEGPLS-LENDISANTTEAVPFGCSGDANKIGHF 551
Query: 580 LDRICAKQQI-DLLTAQAWFSED-----KRLEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
+ R+ A I DLL QA F + + L EK + +S++ + T +I+V D
Sbjct: 552 VQRVWAYFTIQDLL--QARFRANDTATRRLLTEKATNLSLKYNFVTPVTSLIVVRPDAHK 609
Query: 634 NASE------SPGTKKGSKKSDH 650
+ ++ +PG KG K + H
Sbjct: 610 DKAQPAKVTIAPGVTKGPKATPH 632
>gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens]
Length = 560
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 166/371 (44%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 171 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 227
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 228 STEATQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEERLP 285
Query: 422 RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL L
Sbjct: 286 EGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 341
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 342 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 400
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 401 SEMVVAGKLQDRGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLW 460
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ + D E++ +E P
Sbjct: 461 AYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKP 520
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 521 MEGESRNRNVH 531
>gi|426340861|ref|XP_004034345.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1
[Gorilla gorilla gorilla]
Length = 930
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 165/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQDE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDWGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPVEGESRNRNVH 619
>gi|403291058|ref|XP_003936617.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Saimiri
boliviensis boliviensis]
Length = 898
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 160/360 (44%), Gaps = 40/360 (11%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDQSGSMSGRKIQQTQEALIKILDDLSPRDQFNLITF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E +S S+ A+ E V +A + GI+ + G TNI + AV++L S
Sbjct: 316 SSEATQWSPSLVPASAENVNKARSFAAGIHALGG--TNINDAVLMAVQLLDRSNREERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
S+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 TRSVSLIILLTDGDPTVGETNPRNIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G Y+ +Q+Q + + +L + + + E E+ L
Sbjct: 430 ALDTGGLARRIYEDSDSALQLQDFYQEVANPLLTTVTFEYPSNAVE-EVTQDNFRFLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ + + PD L AK F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLRDQGPDVLTAKVSGHLTTQNVTFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AKQQIDLL---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L + A +E + L+ + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEGSVSASDAERQVLQTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
>gi|426340863|ref|XP_004034346.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2
[Gorilla gorilla gorilla]
Length = 900
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 165/374 (44%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNS---- 421
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQDE 370
Query: 422 ---RGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I + ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q PD L A F + S E + Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDWGPDVLTATVSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD--ERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ + D E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPVEGESRNRNVH 619
>gi|148692826|gb|EDL24773.1| inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus]
Length = 886
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 121/262 (46%), Gaps = 16/262 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++D+SGSM G+ ++ T+ AL L + D N + F+ + + + A
Sbjct: 278 VVPKNIVFVIDVSGSMSGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQA 337
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T ++ A ++ N TNI L K +EML +R S II ++TDG
Sbjct: 338 TPANLKEAKTFVK-NIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTSIIIMLTDGDA 396
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL LA+ + G Y+
Sbjct: 397 NTGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLETLALENHGLARRIYEDSDAN 455
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR + D K
Sbjct: 456 LQLQGFYEEVANPLLTNVEVEYPENAI-LDLTRNSYPHFYDGSEIVVAGRLVDRNMDNFK 514
Query: 556 A----KGFLGDLSNFVVELKLQ 573
A G L DL+ F E+ ++
Sbjct: 515 ADVKGHGALNDLT-FTEEVDME 535
>gi|281338027|gb|EFB13611.1| hypothetical protein PANDA_007566 [Ailuropoda melanoleuca]
Length = 868
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 116/240 (48%), Gaps = 33/240 (13%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DIS SM+G+ ++ TK AL L + PGD F++V F E + S+ A+
Sbjct: 243 LNKNVVFVIDISTSMEGQKVKQTKEALLKILGDIRPGDYFDLVLFGSEVQSWRGSLVQAS 302
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ A ++ F+A G+TN+ L + +E+L ++GS+P I+ ++TDG
Sbjct: 303 PANLRAAQDFVRRFFLA-GATNLNGGLLRGIEILNQAQGSLPELSNHASILIMLTDGEPT 361
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 362 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHDVDLNFLEVMSMENNGRAQRIY 413
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ Q+Q + + + +L ++ + DT+ L + R S ++V+GR
Sbjct: 414 EDHDATQQLQGFYDQVANPLLVDVELLYPPDTVSALTQ-----HRHKQYYEGSEIMVAGR 468
>gi|15929704|gb|AAH15276.1| Inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus]
Length = 886
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 121/262 (46%), Gaps = 16/262 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++D+SGSM G+ ++ T+ AL L + D N + F+ + + + A
Sbjct: 277 VVPKNIVFVIDVSGSMSGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQA 336
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T ++ A ++ N TNI L K +EML +R S II ++TDG
Sbjct: 337 TPANLKEAKTFVK-NIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTSIIIMLTDGDA 395
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL LA+ + G Y+
Sbjct: 396 NTGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLETLALENHGLARRIYEDSDAN 454
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR + D K
Sbjct: 455 LQLQGFYEEVANPLLTNVEVEYPENAI-LDLTRNSYPHFYDGSEIVVAGRLVDRNMDNFK 513
Query: 556 A----KGFLGDLSNFVVELKLQ 573
A G L DL+ F E+ ++
Sbjct: 514 ADVKGHGALNDLT-FTEEVDME 534
>gi|326679269|ref|XP_692918.4| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Danio rerio]
Length = 1268
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 109/212 (51%), Gaps = 13/212 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L T+ R + + + +F++D SGSM G + K+A+ V L L P FNIV F
Sbjct: 340 FCPDLRSTTSDLRSI-QGEFVFLIDRSGSMSGVNINRVKDAMVVILKSLFPACLFNIVGF 398
Query: 370 NGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPII 428
+ LFSTS +E++ A +++ G TNI APL + + RG ++
Sbjct: 399 GSKFKTLFSTSQSY-DEESLALACEYVKKIRADMGGTNILAPLNWILRQPMH-RGHPRLL 456
Query: 429 FLVTDGAVEDERQICDAMKS--RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
FL+TDGAV + ++ + ++S R T R +TFGIG + L+ +SRG
Sbjct: 457 FLLTDGAVSNTGKVIELLRSHARFT-------RCFTFGIGQAACRRLVSGLSAVSRGTAE 509
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
+ + ++ +M K + +SVL +I+I+ L
Sbjct: 510 FLAEGERLQPKMIKSLKKCMTSVLTDISIEWL 541
>gi|47522638|ref|NP_999089.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Sus scrofa]
gi|3024032|sp|Q29052.1|ITIH1_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1;
Short=ITI heavy chain H1; Short=ITI-HC1;
Short=Inter-alpha-inhibitor heavy chain 1; Flags:
Precursor
gi|565283|dbj|BAA07632.1| inter-alpha-trypsin inhibitor heavy-chain H1 [Sus scrofa]
Length = 902
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 103/207 (49%), Gaps = 26/207 (12%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DIS SM+G+ ++ TK AL LS L PGD F++V F + S+ A+
Sbjct: 281 LNKNVVFVIDISSSMEGQKVKQTKEALLKILSDLKPGDYFDLVLFGSAVQSWRGSLVQAS 340
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
++ A ++ F GSTN+ L + +E+L ++GS+P I+ ++TDG
Sbjct: 341 TANLDAARSYVR-QFSLAGSTNLNGGLLRGIEILNKAQGSLPEFSNRASILIMLTDGEPT 399
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI DA++ R +Y G G FL + A+ + G Y
Sbjct: 400 EGVTDRSQILKNVRDAIRGRFP--------LYNLGFGHDVEWNFLEVRALENNGRAQRIY 451
Query: 490 -DLDSIEIQMQKLFTRGFSSVLANIAI 515
D DS + Q+Q + + + +L ++ +
Sbjct: 452 EDHDSAQ-QLQGFYDQVANPLLKDVEL 477
>gi|351710075|gb|EHB12994.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Heterocephalus
glaber]
Length = 885
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 117/260 (45%), Gaps = 31/260 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V+F++D+SGSM G+ ++ T++AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVVFVIDVSGSMSGRKIQQTRDALLKILEDVKKDDYLNFILFSGDVTTWKDQLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ N G+TNI L +A+ ML ++ S II ++TDG
Sbjct: 340 TPENLQEARTFVK-NIRDQGATNINDALLRAIHMLNKAQEEHVVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N +Y G G ++ FL +A+ + G Y+ ++
Sbjct: 399 NTGESRPEKIQENVRNAIEGKFPLYNLGFGRNMDYSFLERMALENHGLARRIYEDSDADL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY- 546
Q+Q + + +L ++ ++ YP I DL+ S ++V+GR
Sbjct: 459 QLQGFYEEVANPLLTSVQVE----------YPDNAIQDLTQNNYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDL 563
F +K G L DL
Sbjct: 509 DEDMNNFKADVKGHGALSDL 528
>gi|159110717|ref|NP_032433.2| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Mus
musculus]
gi|341941049|sp|Q61704.3|ITIH3_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; Flags:
Precursor
Length = 889
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 121/262 (46%), Gaps = 16/262 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++D+SGSM G+ ++ T+ AL L + D N + F+ + + + A
Sbjct: 280 VVPKNIVFVIDVSGSMSGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T ++ A ++ N TNI L K +EML +R S II ++TDG
Sbjct: 340 TPANLKEAKTFVK-NIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL LA+ + G Y+
Sbjct: 399 NTGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLETLALENHGLARRIYEDSDAN 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR + D K
Sbjct: 458 LQLQGFYEEVANPLLTNVEVEYPENAI-LDLTRNSYPHFYDGSEIVVAGRLVDRNMDNFK 516
Query: 556 A----KGFLGDLSNFVVELKLQ 573
A G L DL+ F E+ ++
Sbjct: 517 ADVKGHGALNDLT-FTEEVDME 537
>gi|431899873|gb|ELK07820.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Pteropus alecto]
Length = 749
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 98/398 (24%), Positives = 168/398 (42%), Gaps = 82/398 (20%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLED-----------------TKNALA 352
F Y P + K+VIF++D SGSM+GK ++ T+ AL
Sbjct: 324 FVHYFAP---EGLPTIPKNVIFVIDKSGSMRGKKIQQADPPSGIFLGRGLPGFKTREALI 380
Query: 353 VALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPL 411
L L P D FN+V F+ E + SM A+ E V+ A + GI A G TNI + +
Sbjct: 381 KILDDLSPKDQFNLVPFSEEATQWKPSMVPASAENVKEARTYAAGIQ--AQGGTNINSAM 438
Query: 412 TKAVEMLTNS-------RGSIPIIFLVTDG--------AVEDERQICDAMKSRLTNGGSI 456
AV++L + GS+ +I L+TDG + +R + DA+ GG
Sbjct: 439 LMAVQLLERATREELLPSGSVSLIILLTDGDPTVGETSPEKIQRNVQDAI------GGQY 492
Query: 457 CPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
++ G G ++ FL LA+ + G Y+ +Q+Q + + +L +A +
Sbjct: 493 S--LFCLGFGFDVSYAFLERLALDNGGLARRIYEDSDSALQLQDFYQEVANPLLTAVAFE 550
Query: 517 TLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRYQGKFPDTLKAK---GFLGDLS 564
YPS + D++ + S ++V+G+ + + PD L AK L
Sbjct: 551 ----------YPSNAVEDVTQDNFRLLFKGSEMVVAGKLRDQSPDVLSAKISGQLLMQNI 600
Query: 565 NFVVELKLQLAKDIP---------LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMS 612
F E + ++ P ++R+ A I L Q A +E + L+ + +S
Sbjct: 601 TFQTESGMAEQEEFPSPKYIFYNFMERLWAFLTIQQLLEQIVSASDTEKQALKTRALGLS 660
Query: 613 VQTGVLCEYTRMIIV--ETDERNNASESPGTKKGSKKS 648
+ + T M++ E E++ +E P K ++
Sbjct: 661 LNYSFVTPLTSMVVTKPEGQEQSQVAEKPVENKNRHRN 698
>gi|695636|emb|CAA49843.1| inter-alpha-inhibitor H3 chain [Mus musculus]
Length = 886
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 121/262 (46%), Gaps = 16/262 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++D+SGSM G+ ++ T+ AL L + D N + F+ + + + A
Sbjct: 277 VVPKNIVFVIDVSGSMSGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQA 336
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T ++ A ++ N TNI L K +EML +R S II ++TDG
Sbjct: 337 TPANLKEAKTFVK-NIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTSIIIMLTDGDA 395
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL LA+ + G Y+
Sbjct: 396 NTGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLETLALENHGLARRIYEDSDAN 454
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR + D K
Sbjct: 455 LQLQGFYEEVANPLLTNVEVEYPENA-ILDLTRNSYPHFYDGSEIVVAGRLVDRNMDNFK 513
Query: 556 A----KGFLGDLSNFVVELKLQ 573
A G L DL+ F E+ ++
Sbjct: 514 ADVKGHGALNDLT-FTEEVDME 534
>gi|402859845|ref|XP_003894347.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1
[Papio anubis]
Length = 935
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 163/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINDAMLVAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|355746639|gb|EHH51253.1| hypothetical protein EGM_10593 [Macaca fascicularis]
Length = 931
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 161/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINEAMLVAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AKQQIDLLTAQAWFSED---KRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L Q + D + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQIVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 MEGESRNRNVH 619
>gi|402859847|ref|XP_003894348.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2
[Papio anubis]
Length = 896
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 163/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINDAMLVAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|449136098|ref|ZP_21771502.1| von Willebrand factor type A (vWA) domain-containing protein
[Rhodopirellula europaea 6C]
gi|448885261|gb|EMB15716.1| von Willebrand factor type A (vWA) domain-containing protein
[Rhodopirellula europaea 6C]
Length = 752
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 140/315 (44%), Gaps = 17/315 (5%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL 382
++ +++I ++D SGSM G + + L L+P D F ++AF+ + F
Sbjct: 327 EITPRELILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRSTAFQPDAVS 386
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
AT ++ A Q++ + A G TN+ L A+ + + L+TD V ++ I
Sbjct: 387 ATDANIQSAKQFVR-SLRASGGTNLLPALKLALGGEADESARPRYMVLMTDALVGNDHSI 445
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ R++ G+ N Y + A + RG+ + D+ ++
Sbjct: 446 LRYLRQPEFQDA----RVFPVAFGAAPNDYLISRAAELGRGFSMQVTNQDNSPEIARRFH 501
Query: 503 TRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
+ ++ ID +KD PSR+PDL + PLIV GRY T+ KG
Sbjct: 502 ELPSQPYMTDLQIDWGGLVVKD-----QVPSRLPDLYAGKPLIVLGRYDTPAIGTITLKG 556
Query: 559 FL-GDLSNFVVELKL--QLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQT 615
+ G +EL+L Q A+ + + A+Q+I + + +E +++++ ++
Sbjct: 557 NVSGQPVQMGMELELPEQEAEHDSIGPVWARQRIRQIWNRDVGAETPAGRAEITELGLKH 616
Query: 616 GVLCEYTRMIIVETD 630
++ +YT I VE +
Sbjct: 617 QLMTQYTSFIAVEKE 631
>gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3
[Macaca mulatta]
Length = 888
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 162/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 247 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 303
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 304 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINEAMLVAVQLLDSSNQEERLP 361
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 362 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 417
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 418 ALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 476
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 477 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSVAEQEAEFQSPKYIFHNFMERLW 536
Query: 585 AKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L Q A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 537 AYLTIQQLLEQIVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 596
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 597 VEGESRNRNVH 607
>gi|440798115|gb|ELR19183.1| ubiquitin domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 2088
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 150/322 (46%), Gaps = 31/322 (9%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD-SFNIVAFNGETY--LFSTSMELAT 384
+ IF+VD S SM+G+P +D K L L +L D +FNI++F G TY LF S A
Sbjct: 668 EFIFLVDRSSSMKGEPFDDMKRTLLACLERLANVDATFNIISF-GSTYERLFVDSQSTAD 726
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
A+ A ++ G T + PL +++ ++ + + +F+ +DG ++ + +
Sbjct: 727 PSALATAKNYVDKMRPNFGGTELWQPL-RSLFLMAHEERTPRNVFIFSDGHPTNQESLVE 785
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLF 502
+K + R+++FG GS C+ + +R LA + G + A L + ++++ F
Sbjct: 786 LIKQNAAHT-----RVFSFGFGSNCSRHMVRSLARVGGGVAEFMPANKLPT-RSKIERQF 839
Query: 503 TRGFSSVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG 561
R L+ I +D + D P+++ L IV G TLKAK
Sbjct: 840 RRALQPALSEINVDWHISDRGSVSAAPAQLTSLFYGERQIVYGFVDYCTAATLKAKNGDM 899
Query: 562 DLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAW----FSED--------KRLEEKVS 609
++SN V +L + + + + A+ I + W + +D ++ +E++
Sbjct: 900 EVSNLVSTHELSITRGSIVHTLTARAMI-----RDWEDGNYDKDALIHEIIKRKRKEEII 954
Query: 610 KMSVQTGVLCEYTRMIIVETDE 631
K+S Q ++ ++T + +E E
Sbjct: 955 KLSTQYSLVTQFTSFVAIEKRE 976
>gi|410966308|ref|XP_003989675.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
isoform 2 [Felis catus]
Length = 1228
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 94/189 (49%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G + K+A+ VAL L P FN++ F S + ++E+
Sbjct: 361 EFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEES 420
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
V A I G TNI +PL + + RG I+FL+TDGA+ + ++ +
Sbjct: 421 VAMACDNIQRMRADMGGTNILSPLKWIIRQPVH-RGHPRILFLITDGAINNTGKVLE--- 476
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
L + R Y+FGIG H ++ LA +S+G + + ++ +M K + +
Sbjct: 477 --LVRNHAFSTRCYSFGIGPNVCHRLVKGLATVSKGSAEFLMEGERLQPKMIKSLKKAMA 534
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 535 PVLSDVTVE 543
>gi|410966306|ref|XP_003989674.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
isoform 1 [Felis catus]
Length = 1222
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 94/189 (49%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G + K+A+ VAL L P FN++ F S + ++E+
Sbjct: 361 EFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEES 420
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
V A I G TNI +PL + + RG I+FL+TDGA+ + ++ +
Sbjct: 421 VAMACDNIQRMRADMGGTNILSPLKWIIRQPVH-RGHPRILFLITDGAINNTGKVLE--- 476
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
L + R Y+FGIG H ++ LA +S+G + + ++ +M K + +
Sbjct: 477 --LVRNHAFSTRCYSFGIGPNVCHRLVKGLATVSKGSAEFLMEGERLQPKMIKSLKKAMA 534
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 535 PVLSDVTVE 543
>gi|37676927|ref|NP_937323.1| hypothetical protein VVA1267 [Vibrio vulnificus YJ016]
gi|37201471|dbj|BAC97293.1| uncharacterized protein [Vibrio vulnificus YJ016]
Length = 688
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 169/387 (43%), Gaps = 43/387 (11%)
Query: 296 LQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
L++ S D Q E + L PG S +D +F++D SGSM GK T+ +
Sbjct: 276 LEAVSYRDPQQSERGTIKLTFTPGDDLSAIQQGRDWVFVLDKSGSMSGKHATLTE-GVKR 334
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIA-GGSTNICAPLT 412
L KL GD F I+ F+ + + V +A + IN IA GG TN+ L
Sbjct: 335 GLGKLPSGDRFRILMFDNRVQEITNGFIAVNQNNVTQAIE--TINQIATGGGTNLYDALE 392
Query: 413 KAVEMLTNSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSY 468
+AV L + R + I LVTDG V +++Q M+ R+YTF +G+
Sbjct: 393 RAVSGLDSDRTTG--IILVTDGVANVGVTEKKQFLKLMQR-------YDVRLYTFIMGNS 443
Query: 469 CNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT----LKDLDEF 524
N L + +S G+ + + D I + + ++ NI ID L DL
Sbjct: 444 ANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQAYRNIRIDVDGVKLNDLT-- 501
Query: 525 EMYPSRIPDLSSESPLIVSGRYQ--GKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIPL 580
P I L L V G Y GK TL AK G ++L Q ++ +
Sbjct: 502 ---PENIHSLYRGEQLTVFGHYNKAGKSTITLSAKVDGVTRKYQT-TIQLAEQQLENPEI 557
Query: 581 DRICAKQQIDLLTAQAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
+R+ A I + ++ + E D E+ + ++++ G+L +YT +++VE ++
Sbjct: 558 ERLWAFSAIRDIESKMQYLEQNDNESEQAIESIALEYGLLTDYTSLLVVE--------DT 609
Query: 639 PGTKKGSKKSDHQKILDSEVPKTIMLQ 665
+ G +K++ Q++ + + + Q
Sbjct: 610 VFEQNGIRKTNQQRVEKEQAARELRQQ 636
>gi|27367212|ref|NP_762739.1| hypothetical protein VV2_0803 [Vibrio vulnificus CMCP6]
gi|27358780|gb|AAO07729.1| Uncharacterized protein [Vibrio vulnificus CMCP6]
Length = 688
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 169/387 (43%), Gaps = 43/387 (11%)
Query: 296 LQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
L++ S D Q E + L PG S +D +F++D SGSM GK T+ +
Sbjct: 276 LEAVSYRDPQQSERGTIKLTFTPGDDLSAIQQGRDWVFVLDKSGSMSGKHATLTE-GVKR 334
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIA-GGSTNICAPLT 412
L KL GD F I+ F+ + + V +A + IN IA GG TN+ L
Sbjct: 335 GLGKLPSGDRFRILMFDNRVQEITNGFIAVNQNNVTQAIE--TINQIATGGGTNLYDALE 392
Query: 413 KAVEMLTNSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSY 468
+AV L + R + I LVTDG V +++Q M+ R+YTF +G+
Sbjct: 393 RAVSGLDSDRTTG--IILVTDGVANVGVTEKKQFLKLMQR-------YDVRLYTFIMGNS 443
Query: 469 CNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT----LKDLDEF 524
N L + +S G+ + + D I + + ++ NI ID L DL
Sbjct: 444 ANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQAYRNIRIDVDGVKLNDLT-- 501
Query: 525 EMYPSRIPDLSSESPLIVSGRYQ--GKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIPL 580
P I L L V G Y GK TL AK G ++L Q ++ +
Sbjct: 502 ---PENIHSLYRGEQLTVFGHYNKAGKSTITLSAKVDGVTRKYQT-TIQLAEQQLENPEI 557
Query: 581 DRICAKQQIDLLTAQAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
+R+ A I + ++ + E D E+ + ++++ G+L +YT +++VE ++
Sbjct: 558 ERLWAFSAIRDIESKMQYLEQNDNESEQAIESIALEYGLLTDYTSLLVVE--------DT 609
Query: 639 PGTKKGSKKSDHQKILDSEVPKTIMLQ 665
+ G +K++ Q++ + + + Q
Sbjct: 610 VFEQNGIRKTNQQRVEKEQAARELRQQ 636
>gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2
[Macaca mulatta]
Length = 900
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 162/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINEAMLVAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L Q A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQIVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1
[Macaca mulatta]
Length = 935
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 162/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V A + GI + G TNI + AV++L +S
Sbjct: 316 STEATQWRPSLVPASAENVNEARSFAAGIQALGG--TNINEAMLVAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EITQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E + Q K I ++R+
Sbjct: 489 SEMVVAGKLQAQGPDVLTATVSGKLPTQDITFQTESSVAEQEAEFQSPKYIFHNFMERLW 548
Query: 585 AKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A I L Q A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQIVSASDADQQALRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|296090633|emb|CBI41017.3| unnamed protein product [Vitis vinifera]
Length = 88
Score = 85.1 bits (209), Expect = 1e-13, Method: Composition-based stats.
Identities = 36/63 (57%), Positives = 47/63 (74%)
Query: 425 IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
+ +IF++ D VED R IC+A++ +L N G ICPRI TF +GSYC+H FL+MLA I RGY
Sbjct: 1 MSLIFVIIDKVVEDGRHICNALQGQLRNRGPICPRICTFDMGSYCSHNFLKMLAQIGRGY 60
Query: 485 YGA 487
Y A
Sbjct: 61 YDA 63
>gi|149728587|ref|XP_001492576.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like [Equus
caballus]
Length = 908
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 111/232 (47%), Gaps = 17/232 (7%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+++F++DIS SMQG+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 288 LNKNLVFVIDISTSMQGQKVQQTKEALLKILGDMRPGDYFDLVLFGSGVQSWKGSLVPAS 347
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
++ A ++ F GSTN+ L + +E+L ++GS+P I+ ++TDG
Sbjct: 348 AANLQAARDFVQ-RFTLEGSTNLNGGLLQGIEILNKAQGSLPEVSNHASILIMLTDGEPT 406
Query: 435 -AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI +++ + +Y G G + FL +++M + G Y+
Sbjct: 407 EGVTDRSQILKNVRNAIRGKFP----LYNLGFGQNVDFNFLEVMSMENNGRAQRIYEDHD 462
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
Q+Q + + + +L ++ + +D + R S ++V+GR
Sbjct: 463 ATQQLQGFYDQVATPLLVDVELQYPQDAVS-ALTQHRHKQYYEGSEIVVAGR 513
>gi|403362158|gb|EJY80795.1| von Willebrand domain-containing protein [Oxytricha trifallax]
Length = 1102
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 102/206 (49%), Gaps = 11/206 (5%)
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
KS K+ IFIVD SGSM+GK ++ TK AL + ++ L G F I++F G+ S
Sbjct: 480 KSIDTSKQAFIFIVDRSGSMRGKKMDMTKEALKLFIASLPSGSLFEIISFGGKFNPTSFR 539
Query: 380 ME--LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI---IFLVTDG 434
E T + V+ Q I G TNI PL A+ + S+ P IFL+TDG
Sbjct: 540 GEGFNNTDKEVKNVKQKIDFFLADMGGTNIYNPLDYALNKIL-SQPQNPFQKKIFLLTDG 598
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
++D + + +R N S+ ++TFGIG+ C+ Y ++ LA RG Y + D++
Sbjct: 599 EIKDPDNVINL--ARQANSRSV---VHTFGIGNGCSKYLVKGLAKAGRGSYSFVEENDNL 653
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKD 520
+ ++ K + L + KD
Sbjct: 654 KAKVIKALKKAVEPSLQGCRFEFPKD 679
>gi|73985485|ref|XP_533794.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Canis
lupus familiaris]
Length = 910
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 116/234 (49%), Gaps = 25/234 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DIS SM+G+ ++ TK AL L + PGD F++V F E + S+ A+
Sbjct: 290 KNVVFVIDISTSMEGQKVKQTKEALLKILGDMRPGDYFDLVLFGSEVQSWKGSLVQASPA 349
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
+ A ++ F+A G+TN+ L + +E+L ++ ++P ++ ++TDG
Sbjct: 350 NLRAAQDFVKHFFLA-GATNLNGGLLRGIEILNQAQKNLPKLSNHASVLIMLTDGEPTEG 408
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V D QI +++ + P +Y G G + FL +++M + G Y+
Sbjct: 409 VTDRSQILKNVRNAIRGK---FP-LYNLGFGDNVDFNFLDVMSMENNGRAQRIYEDHDAA 464
Query: 496 IQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
Q+Q + + + +L ++ + DT+ DL + R S ++V+GR
Sbjct: 465 QQLQGFYDQVANPLLVDVELLYPQDTISDLTQ-----HRHKQYYEGSEIMVAGR 513
>gi|48374067|ref|NP_001001537.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Sus scrofa]
gi|3024051|sp|P79263.1|ITIH4_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4;
Short=ITI heavy chain H4; Short=ITI-HC4;
Short=Inter-alpha-inhibitor heavy chain 4; AltName:
Full=Inter-alpha-trypsin inhibitor family heavy
chain-related protein; Short=IHRP; AltName: Full=Major
acute phase protein; Short=MAP; Flags: Precursor
gi|1836014|gb|AAB46821.1| IHRP [Sus scrofa]
gi|4096979|gb|AAD00024.1| inter-alpha-trypsin inhibitor family heavy chain-related protein
[Sus scrofa]
gi|1588326|prf||2208343A inter-alpha-trypsin inhibitor
Length = 921
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 160/370 (43%), Gaps = 37/370 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+VIF++D SGSM+G+ ++ T+ AL L L D FN+V+F
Sbjct: 257 FVHYFAPEVWSA---IPKNVIFVIDTSGSMRGRKIQQTREALIKILGDLGSRDQFNLVSF 313
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+GE A+ E VE A + A G TNI + AV++L +
Sbjct: 314 SGEAP--RRRAVAASAENVEEAKSY-AAEIHAQGGTNINDAMLMAVQLLERANREELLPA 370
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
S+ I L+TDG ++ + ++ G G + FL +A+ +
Sbjct: 371 RSVTFIILLTDGDPTVGETNPSKIQKNVREAIDGQHSLFCLGFGFDVPYAFLEKMALENG 430
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G Y+ +Q++ + + +L +A + + E E+ S L+V
Sbjct: 431 GLARRIYEDSDSALQLEDFYQEVANPLLRLVAFEYPSNAVE-EVTQDNFRLFFKGSELVV 489
Query: 543 SGRYQGKFPDTLKAKGFLGDLS----NFVVELKL--QLAKDIP--------LDRICAKQQ 588
+G+ + + PD L AK G L FV+E ++ Q A+ + ++R+ A
Sbjct: 490 AGKLRDQSPDVLSAK-VRGQLHMENVTFVMESRVAEQEAEFLSPKYIFHSFMERLWAYLT 548
Query: 589 IDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP---G 640
I L AQ A +E K LE + +S+ + T M+I E E++ +E P G
Sbjct: 549 IQQLLAQTVSASDAEKKALEARALSLSLNYSFVTPLTSMVITKPEGQEQSQVAEKPVENG 608
Query: 641 TKKGSKKSDH 650
++G+ S H
Sbjct: 609 NRQGNTHSGH 618
>gi|431899875|gb|ELK07822.1| Inter-alpha-trypsin inhibitor heavy chain H1 [Pteropus alecto]
Length = 935
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 95/186 (51%), Gaps = 16/186 (8%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DIS SM+G+ ++ TK AL L L PGD F++V F + + S+ A+
Sbjct: 280 LNKNVVFVIDISASMEGQKVKQTKEALLKILGDLRPGDYFDLVLFGSDVQSWKGSLVQAS 339
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI-------PIIFLVTDG--- 434
V+ A ++ +F G+TN+ L + +E+L ++ SI PI+ ++TDG
Sbjct: 340 AANVQSAKNFVR-HFSLAGATNLNGGLLRGIEILNKAQRSIPGLTDHAPILIMLTDGEPT 398
Query: 435 -AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI +++ + GG +Y G G + FL +++M + G Y+
Sbjct: 399 EGVTDRSQILKNIRNAI--GGRFP--LYNLGFGHNVDLNFLEVMSMENNGRVQRIYEDHD 454
Query: 494 IEIQMQ 499
Q+Q
Sbjct: 455 AAQQLQ 460
>gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis]
Length = 707
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 141/315 (44%), Gaps = 18/315 (5%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+FI+D SGSM G+ +E T+ AL L+ L D F ++ F+ + + + ATK
Sbjct: 267 KNVVFIIDQSGSMHGRKIEQTRTALIHILNDLAEDDFFGLLTFDSNIFQWKRELVQATKA 326
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDERQICD 444
+E A + N A G+TNI A + K ML GS I+ L+TDG +
Sbjct: 327 NLESAKTF-ARNIRANGATNINAAVLKGSSMLNAHPREGSASILILLTDGDPTTGETNPE 385
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
A++S + N + +Y G G N FL +++ + G Y+ ++Q++ +
Sbjct: 386 AIQSNVRNAIAEKFPLYCLGFGFDVNFEFLEKMSLQNNGVARRIYEDSDADLQLKGFYEE 445
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL--------KA 556
+ +L N+ + + + + + S ++V+G +T K+
Sbjct: 446 VATPLLTNVMMIYVGGTN---LTKTSFSHHYKGSEIVVAGVISDNDVETFAPQVVAISKS 502
Query: 557 KGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWF---SEDKRLEEKVSKMSV 613
+ + D N E + +A D + RI A + L + SE + + + K+S+
Sbjct: 503 RRLVFDDKNATTETSV-MASDSQMQRIWAHLTVKQLLDKELHLSGSEKEDAKNEALKLSL 561
Query: 614 QTGVLCEYTRMIIVE 628
+ G + T M++ +
Sbjct: 562 RYGFVTPLTSMVVTK 576
>gi|390355326|ref|XP_785426.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Strongylocentrotus purpuratus]
Length = 1016
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 118/241 (48%), Gaps = 14/241 (5%)
Query: 310 FCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
F Y P G +RK +VIF++D+SGSM G+ TK A L + P D NI+
Sbjct: 314 FVHYFSPDGLPNTRK----NVIFVIDVSGSMYGQKTRQTKRAFTTILDDVRPIDRINIIL 369
Query: 369 FNGETYLF-STSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML--TNSRGSI 425
F+ +++ M AT + + A + + GG TNI L KAVE+L ++ ++
Sbjct: 370 FSSYAHVWREDQMVEATSDNIAAAKRHVN-GLSVGGGTNIYDSLMKAVEILLEHDTGDAM 428
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
P+I ++TDG V + I + S + GG + +++ G G+ + FL L++ ++
Sbjct: 429 PLIIMLTDGQVGNAAAIVRDVTSVI--GGRLS--LFSIGFGNGVDFPFLEKLSLSNQALA 484
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDL-DEFEMYPSRIPDLSSESPLIVSG 544
Y+ S +QM+ + + +L NI ++ L D+ + SR + + V+G
Sbjct: 485 RKVYEDSSASLQMKGFYDEVANPLLFNINMEYNNRLVDQNSLTKSRFISFFDGTEITVAG 544
Query: 545 R 545
+
Sbjct: 545 K 545
>gi|443685342|gb|ELT88977.1| hypothetical protein CAPTEDRAFT_95231 [Capitella teleta]
Length = 616
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/396 (23%), Positives = 180/396 (45%), Gaps = 54/396 (13%)
Query: 267 YESEVLKWSNIDFDFS--------YTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGT 318
Y+ +V + N D + Y S+I G +L ++ ++F + P
Sbjct: 189 YQPDVTQQQNADAELGISGSLNVLYDTEHSNIAGDILTEN---------DIFAHFFSPN- 238
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN-------G 371
+ + K V+F+VD+SGSM G+ +E+TK AL + ++ P D F + F+ G
Sbjct: 239 ENALQPLDKRVVFVVDMSGSMLGERIENTKRALEFIVRQMRPSDLFMFLLFSDSLVRWPG 298
Query: 372 ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA-VEMLTNSRGSIPIIFL 430
Y A+ +V A +++ N IA G+T+I LT+A + + ++G +I
Sbjct: 299 RGYPMQ-----ASPPSVIAAMEFLQQNLIANGATDINMALTQAGLGFNSQAKGVYDMIVF 353
Query: 431 VTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
+TDG V + I + + S T G I ++T G G+ N FL+ L+ RG
Sbjct: 354 LTDGEPTSGVTNHNAIINNVVS--TCGDKIS--LFTVGFGASLNENFLKTLSGRVRGSVS 409
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEF-EMYPSRIPDLSSESPLIVSGR 545
YD + I +++ F R S ++ ++ +DL E M + S +IV+G+
Sbjct: 410 FVYDTEDIYAELKDFFLRIDSPLITDVKFSYNEDLIEASSMTSMEFSEYFEGSQIIVAGQ 469
Query: 546 YQ---GKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP------LDRICAKQQIDLLTAQA 596
+ F + G GD +++ +Q D+ L+R+ A +I+ L +
Sbjct: 470 MKSIPSNFQALVSGTG-RGDSRITLLQPTIQNVADLSSNVDGFLERLYAHLKIENLFLDS 528
Query: 597 WFSEDK----RLEEKVSKMSVQTGVLCEYTRMIIVE 628
+++ +EE+ +++++ ++ + T +I+ +
Sbjct: 529 LLEKNQSRIEEIEEEALQLALKHNLVTKLTSLIVFQ 564
>gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1
[Nomascus leucogenys]
Length = 930
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 163/374 (43%), Gaps = 46/374 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTAMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSR--- 422
+ E + S+ A+ E V +A +F AG G TNI + AV++L +S
Sbjct: 316 STEAIQWRPSLVPASAENVNKAR-----SFAAGIQALGGTNINDAMLMAVQLLDSSNQEE 370
Query: 423 ----GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL
Sbjct: 371 RLPDGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFL 426
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
LA+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 427 EKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLL 485
Query: 535 SSESPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLD 581
S ++V+G+ Q + PD L A F + S E Q K I ++
Sbjct: 486 FKGSEMVVAGKLQDQGPDVLTATVSGKLPTQNITFQTESSVAEQEADFQSPKYIFHNFME 545
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNAS 636
R+ A QQ+ T A ++ + L + +S+ + T M++ E E++ +
Sbjct: 546 RLWAYLTIQQLLEQTVSASDADQQALRTQALNLSLAYSFVTPLTSMVVTKPEDQEQSQVA 605
Query: 637 ESPGTKKGSKKSDH 650
E P + ++ H
Sbjct: 606 EKPVEGESRNRNVH 619
>gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2
[Nomascus leucogenys]
Length = 900
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 163/371 (43%), Gaps = 40/371 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 259 FVHYFAP---EGLTAMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ E + S+ A+ E V +A + GI A G TNI + AV++L +S
Sbjct: 316 STEAIQWRPSLVPASAENVNKARSFAAGIQ--ALGGTNINDAMLMAVQLLDSSNQEERLP 373
Query: 423 -GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
GS+ +I L+TDG + R I ++ ++ S ++ G G ++ FL L
Sbjct: 374 DGSVSLIILLTDGDPTVGETNPRSIQKNVREAVSGRYS----LFCLGFGFDVSYAFLEKL 429
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G ++ +Q+Q + + +L + + + E E+ + L
Sbjct: 430 ALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE-EVTQNNFRLLFKG 488
Query: 538 SPLIVSGRYQGKFPDTLKAK----------GFLGDLSNFVVELKLQLAKDI---PLDRIC 584
S ++V+G+ Q + PD L A F + S E Q K I ++R+
Sbjct: 489 SEMVVAGKLQDQGPDVLTATVSGKLPTQNITFQTESSVAEQEADFQSPKYIFHNFMERLW 548
Query: 585 AK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
A QQ+ T A ++ + L + +S+ + T M++ E E++ +E P
Sbjct: 549 AYLTIQQLLEQTVSASDADQQALRTQALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 608
Query: 640 GTKKGSKKSDH 650
+ ++ H
Sbjct: 609 VEGESRNRNVH 619
>gi|355559573|gb|EHH16301.1| hypothetical protein EGK_11565 [Macaca mulatta]
Length = 891
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 127/284 (44%), Gaps = 35/284 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K V K+V F++DISGSM G+ LE TK AL L + D N + F
Sbjct: 283 FVHFFAP---KGLPVVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILF 339
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+G+ + + AT E ++ A ++ + G TNI L + + ML +R
Sbjct: 340 SGDVSTWKEHLVQATPENLQEAKTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPE 398
Query: 424 -SIPIIFLVTDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S I+ ++TDG + ++ + N G P +Y G G+ N+ FL +A+ +
Sbjct: 399 RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALEN 457
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---- 537
G+ Y+ ++Q+Q GF +AN + + E E + I L+
Sbjct: 458 HGFARRIYEDSDADLQLQ-----GFYEEVANPLLTGV----EMEYPENAILGLTQNTYQH 508
Query: 538 ----SPLIVSGRYQGK----FPDTLKAKGFLGDLSNFVVELKLQ 573
S ++V+GR G+ F +K G DL+ F E+ ++
Sbjct: 509 FYDGSEIVVAGRLVGEDVNSFKADVKGHGAANDLT-FTEEVDMK 551
>gi|39652254|emb|CAC79611.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens]
Length = 886
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 159/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 276 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQA 335
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 336 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 394
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 395 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 453
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRYQ 547
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 454 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 504
Query: 548 GKFPDTLKA----KGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
+ ++LKA G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 505 DEDMNSLKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 563
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 564 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 616
>gi|133925809|ref|NP_002208.3| inter-alpha-trypsin inhibitor heavy chain H3 preproprotein [Homo
sapiens]
gi|166203665|sp|Q06033.2|ITIH3_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; AltName:
Full=Serum-derived hyaluronan-associated protein;
Short=SHAP; Flags: Precursor
gi|77748471|gb|AAI07605.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens]
gi|77748473|gb|AAI07606.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens]
gi|78070482|gb|AAI07815.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens]
Length = 890
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 568 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|62897073|dbj|BAD96477.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens]
gi|62898698|dbj|BAD97203.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens]
Length = 890
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 568 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|288563|emb|CAA47439.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens]
Length = 885
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 276 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 335
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 336 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 394
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 395 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 453
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 454 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 504
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 505 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 563
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 564 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 616
>gi|207029558|ref|NP_001125590.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Pongo
abelii]
Length = 879
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 155/355 (43%), Gaps = 52/355 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKKEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVE----------YPENAILDLTQNTYQHFYDGSEIVVAGRL 507
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 508 VDEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQL 566
Query: 593 TAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 567 LEKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|332216203|ref|XP_003257234.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Nomascus
leucogenys]
Length = 890
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + + DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAVLDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 568 EKRKNAHSEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|119585667|gb|EAW65263.1| inter-alpha (globulin) inhibitor H3, isoform CRA_b [Homo sapiens]
Length = 890
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 568 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|115496418|ref|NP_001068821.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus]
gi|122142424|sp|Q0VCM5.1|ITIH1_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1;
Short=ITI heavy chain H1; Short=ITI-HC1;
Short=Inter-alpha-inhibitor heavy chain 1; Flags:
Precursor
gi|111308523|gb|AAI20097.1| Inter-alpha (globulin) inhibitor H1 [Bos taurus]
Length = 906
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 121/260 (46%), Gaps = 46/260 (17%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P ++ K K+V+F++DIS SM+G+ L+ TK AL L + PGD F++V F
Sbjct: 272 FAHFFAP---QNLKKLNKNVVFVIDISSSMEGQKLKQTKEALHKILGDMRPGDYFDLVLF 328
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ S+ A+ +E A ++ F G+TN+ L + +E+L ++ S+P
Sbjct: 329 GSAVQSWKGSLVQASPANLEAARNFVQ-QFSLAGATNLNGGLLRGIEILNKAQQSLPELS 387
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG V D QI D +K R +Y G G + FL
Sbjct: 388 NHASILIMLTDGEPTEGVMDRTQILKNVRDGIKGRFP--------LYNLGFGHDVDLNFL 439
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPD 533
++++ + G Y+ Q+Q GF +AN L+D++ +YP + D
Sbjct: 440 EVMSLENNGRVQRIYEDHDATQQLQ-----GFYEQVAN---PLLRDVE--LLYPREAVSD 489
Query: 534 LSSE--------SPLIVSGR 545
L+ S ++V+GR
Sbjct: 490 LTQHRHKQYYEGSEIMVAGR 509
>gi|89094360|ref|ZP_01167301.1| Uncharacterized protein containing a von Willebrand factor type
A(vWA) domain [Neptuniibacter caesariensis]
gi|89081419|gb|EAR60650.1| Uncharacterized protein containing a von Willebrand factor type
A(vWA) domain [Oceanospirillum sp. MED92]
Length = 707
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 145/337 (43%), Gaps = 20/337 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
+ + L PGT +D +F++DISGSM+GK + LS L P D F IV F
Sbjct: 295 YKLTLTPGTDLPAFNQGRDWVFVLDISGSMKGK-FAALVEGVREGLSNLSPNDRFRIVLF 353
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
N + F+ A K VE + G TN+ A L + L + R + I
Sbjct: 354 NNQARSFTQGYLPADKTTVENTLNQLD-QIQPGQGTNLYAGLQTGINQLDSDRST--AIV 410
Query: 430 LVTDGAVEDERQICDAMKSRLTN-GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
LVTDG + KS+ N R++TF +G+ N L + +S G+ +
Sbjct: 411 LVTDGVA----NVGTTHKSKFLNLLEQKDVRLFTFIMGNSANRPLLEEMTRVSNGFAMSI 466
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
+ D I Q+ + + + +I ++ + + + P L L + G Y
Sbjct: 467 SNSDDIVGQLMLAQGKMTHAAMRDIEVEVVGVRSKDMVKPRLANTLYHGQQLTLFGHYYK 526
Query: 549 KFPDTLKAKGFL-GDLSNFVVELKL-QLAKDIP-LDRICAKQQIDLLTAQA-WFSEDKRL 604
++ G + G+ + +L + D P L+R+ A I+ L A+ + DK
Sbjct: 527 PGEAEIRLTGSINGEKKTYTTRFQLPDTSTDHPELERLWAFAAIEELQAEMDYLGSDKDT 586
Query: 605 EEKVSKMSVQTGVLCEYTRMIIVETD-------ERNN 634
E+ + +V+ G++ +YT M++V + ERNN
Sbjct: 587 EQAIVDTAVEYGLVTDYTSMVVVRDEVFKSLNIERNN 623
>gi|296474790|tpg|DAA16905.1| TPA: inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos
taurus]
Length = 906
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 121/260 (46%), Gaps = 46/260 (17%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P ++ K K+V+F++DIS SM+G+ L+ TK AL L + PGD F++V F
Sbjct: 272 FAHFFAP---QNLKKLNKNVVFVIDISSSMEGQKLKQTKEALHKILGDMRPGDYFDLVLF 328
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ S+ A+ +E A ++ F G+TN+ L + +E+L ++ S+P
Sbjct: 329 GSAVQSWKGSLVQASPANLEAARNFVQ-QFSLAGATNLNGGLLRGIEILNKAQQSLPELS 387
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG V D QI D +K R +Y G G + FL
Sbjct: 388 NHASILIMLTDGEPTEGVMDRTQILKNVRDGIKGRFP--------LYNLGFGHDVDLNFL 439
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPD 533
++++ + G Y+ Q+Q GF +AN L+D++ +YP + D
Sbjct: 440 EVMSLENNGRVQRIYEDHDATQQLQ-----GFYEQVAN---PLLRDVE--LLYPREAVSD 489
Query: 534 LSSE--------SPLIVSGR 545
L+ S ++V+GR
Sbjct: 490 LTQHRHKQYYEGSEIMVAGR 509
>gi|75070765|sp|Q5RB37.1|ITIH3_PONAB RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; Flags:
Precursor
gi|55728562|emb|CAH91023.1| hypothetical protein [Pongo abelii]
Length = 876
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 155/355 (43%), Gaps = 52/355 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 277 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKKEDYLNFILFSGDVSTWKEHLVQA 336
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 337 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 395
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 396 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 454
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 455 LQLQGFYEEVANPLLTGVEVE----------YPENAILDLTQNTYQHFYDGSEIVVAGRL 504
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 505 VDEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQL 563
Query: 593 TAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 564 LEKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 617
>gi|109039132|ref|XP_001085463.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Macaca
mulatta]
Length = 891
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 159/360 (44%), Gaps = 37/360 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K V K+V F++DISGSM G+ LE TK AL L + D N + F
Sbjct: 269 FVHFFAP---KGLPVVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILF 325
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+G+ + + AT E ++ A ++ + G TNI L + + ML +R
Sbjct: 326 SGDVSTWKEHLVQATPENLQEAKTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPE 384
Query: 424 -SIPIIFLVTDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S I+ ++TDG + ++ + N G P +Y G G+ N+ FL +A+ +
Sbjct: 385 RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALEN 443
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
G+ Y+ ++Q+Q + + +L + ++ ++ + + S ++
Sbjct: 444 HGFARRIYEDSDADLQLQGFYEEVANPLLTGVEMEYPENAI-LGLTQNTYQHFYDGSEIV 502
Query: 542 VSGRYQGK----FPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQ 587
V+GR G+ F +K G DL+ F E+ + ++ K + ++R+ A
Sbjct: 503 VAGRLVGEDVNSFKADVKGHGAANDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYL 561
Query: 588 QIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
I+ L + A E + L + +S++ + T M++ + DER A + PG
Sbjct: 562 TIEQLLEKRKNAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|320159019|ref|YP_004191397.1| hypothetical protein VVMO6_04172 [Vibrio vulnificus MO6-24/O]
gi|319934331|gb|ADV89194.1| uncharacterized protein [Vibrio vulnificus MO6-24/O]
Length = 688
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 169/387 (43%), Gaps = 43/387 (11%)
Query: 296 LQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
L++ S D Q E + L PG S +D +F++D SGSM GK T+ +
Sbjct: 276 LEAVSYRDPQQSERGTIKLTFTPGDDLSAIQQGRDWVFVLDKSGSMSGKHATLTE-GVKR 334
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIA-GGSTNICAPLT 412
L KL GD F I+ F+ + + V +A + IN IA GG TN+ L
Sbjct: 335 GLGKLPSGDRFRILMFDNRVQEITNGFIAVNQNNVTQAIE--TINQIATGGGTNLYDALE 392
Query: 413 KAVEMLTNSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSY 468
+AV L + R + I LVTDG V +++Q M+ R+YTF +G+
Sbjct: 393 RAVSGLDSDRTTG--IILVTDGVANVGVTEKKQFLKLMQR-------YDVRLYTFIMGNS 443
Query: 469 CNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDT----LKDLDEF 524
N L + +S G+ + + D I + + ++ NI +D L DL
Sbjct: 444 ANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQAYRNIRLDVDGVKLNDLT-- 501
Query: 525 EMYPSRIPDLSSESPLIVSGRYQ--GKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIPL 580
P I L L V G Y GK TL AK G ++L Q ++ +
Sbjct: 502 ---PENIHSLYRGDQLTVFGHYNKAGKSTITLSAKVDGVTRKYQT-TIQLAEQQLENPEI 557
Query: 581 DRICAKQQIDLLTAQAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
+R+ A I + ++ + E D E+ + ++++ G+L +YT +++VE ++
Sbjct: 558 ERLWAFSAIRDIESKMQYLEQNDNESEQAIESIALEYGLLTDYTSLLVVE--------DA 609
Query: 639 PGTKKGSKKSDHQKILDSEVPKTIMLQ 665
+ G +K++ Q++ + + + Q
Sbjct: 610 VFEQNGIRKTNQQRVEKEQAARELRQQ 636
>gi|432106633|gb|ELK32299.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Myotis davidii]
Length = 652
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/284 (27%), Positives = 129/284 (45%), Gaps = 35/284 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P + V K+V+F++DISGSM G+ ++ TK+AL L D N + F
Sbjct: 79 FVHFFAP---RRLPVVPKNVVFVIDISGSMYGRKIQQTKDALLKILEDKKEEDYLNFILF 135
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+ + + ++ AT E ++ A +++ N G TNI L + ML ++
Sbjct: 136 DSDVTTWKDTLVQATPENIQEAREFVK-NIRDRGMTNINDGLLTGISMLNKAQEEHTVPD 194
Query: 423 GSIPIIFLVTDGAVED-ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S I+ ++TDG E + ++ L+ G P +YT G G+ N+ FL LA+ +
Sbjct: 195 RSTSIVIMLTDGHPNTGESRPAKIQENVLSAIGGRFP-LYTLGFGNNLNYNFLESLALEN 253
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE---- 537
G+ Y+ +Q+Q GF +AN + + E E + I DL+
Sbjct: 254 DGFARRIYEDSDANLQLQ-----GFYEEVANPLLTAV----EVEYPENAILDLTRNTYQH 304
Query: 538 ----SPLIVSGRY----QGKFPDTLKAKGFLGDLSNFVVELKLQ 573
S ++V+GR F +K G L DL+ F E+ L+
Sbjct: 305 FYDGSEIVVAGRLVDEEMNNFKADVKGHGALTDLT-FTEEVDLK 347
>gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus]
gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus]
Length = 919
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 128/281 (45%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+ K+V+F++DIS SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 246 FVHYFAP---KNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQDRFNIIGF 302
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + ++ + G T+I L A+++L N
Sbjct: 303 SNRIKMWKDHLLPVTPDNIRNGKIYM-YHLSPTGGTDINGALQTAIKLLNNYVAQNDIED 361
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ GG IC I+T GIG+ + L L++
Sbjct: 362 RSVSLIIFLTDGKPTFGETNTLRILSNTKEATGGQIC--IFTVGIGNDVDFRLLEKLSLE 419
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID D E + + P+ + S +
Sbjct: 420 NCGLTRRVHEEEKAGAQLIGFYDEIRTPLLSDIRIDYPPDAVE-RVTKTLFPNYFNGSEI 478
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ K D L + + FV+ L +DIP++
Sbjct: 479 VIAGKMADKKFDQLHVEVTASNSKKFVI-----LKRDIPVE 514
>gi|114587336|ref|XP_001172570.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1
[Pan troglodytes]
Length = 890
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 122/270 (45%), Gaps = 32/270 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKLQ 573
F +K G DL+ F E+ ++
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMK 537
>gi|293342867|ref|XP_001069890.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
[Rattus norvegicus]
gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus]
Length = 953
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 128/281 (45%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+ K+V+F++DIS SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 280 FVHYFAP---KNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQDRFNIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + ++ + G T+I L A+++L N
Sbjct: 337 SNRIKMWKDHLLPVTPDNIRNGKIYM-YHLSPTGGTDINGALQTAIKLLNNYVAQNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ GG IC I+T GIG+ + L L++
Sbjct: 396 RSVSLIIFLTDGKPTFGETNTLRILSNTKEATGGQIC--IFTVGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID D E + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEKAGAQLIGFYDEIRTPLLSDIRIDYPPDAVE-RVTKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ K D L + + FV+ L +DIP++
Sbjct: 513 VIAGKMADKKFDQLHVEVTASNSKKFVI-----LKRDIPVE 548
>gi|397495907|ref|XP_003818785.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Pan
paniscus]
Length = 890
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 122/270 (45%), Gaps = 32/270 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKLQ 573
F +K G DL+ F E+ ++
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMK 537
>gi|119775138|ref|YP_927878.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella
amazonensis SB2B]
gi|119767638|gb|ABM00209.1| inter-alpha-trypsin inhibitor domain protein [Shewanella
amazonensis SB2B]
Length = 753
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 140/311 (45%), Gaps = 20/311 (6%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
++++ ++D SGSM G + ++AL AL L P DSFNI+AF+ + + AT
Sbjct: 395 LARELVLVIDTSGSMAGDSMVQARSALIHALGGLGPQDSFNIIAFSSDARPLWPDAKPAT 454
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVE---MLTNSRGSIPIIFLVTDGAVEDERQ 441
+ A Q++ + A G T + + L A++ ++ + + +TDGAV E
Sbjct: 455 AFNLGAAQQFV-RSLEADGGTEMASALELALKTPSVVDEDTKRLRQVLFITDGAVNGEDA 513
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ + ++ RL R++ IG+ N YF+ A RG + + +M +L
Sbjct: 514 LFNLIERRLGTS-----RLFPVAIGAAPNGYFMSRAAAAGRGSFTFIGHGGEVAEKMNQL 568
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG 561
+R V++++++ T D + P +PDL + L +S R PD L G
Sbjct: 569 LSRIEHPVVSDLSV-TWADGSPVDAMPGVLPDLYAGEALNLSLR---TVPDALMPIIVKG 624
Query: 562 DLSNFVVELKLQLAKDIP----LDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGV 617
+ + E KL + +P LD K ++D L Q +R + ++ +
Sbjct: 625 NTDGQIWERKL-TPRAVPGGSGLDLQYGKARVDDLVRQTLTPSQRR--AATVALGLEYHL 681
Query: 618 LCEYTRMIIVE 628
+ +T ++ V+
Sbjct: 682 VTAHTSLVAVD 692
>gi|405961810|gb|EKC27555.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Crassostrea gigas]
Length = 632
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 104/225 (46%), Gaps = 5/225 (2%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
KDV+ I+D+SGSM G L+ K+A+ L+ L GD FNI+ F+ + S +A K+
Sbjct: 21 KDVLLILDVSGSMYGTKLKQMKDAVLGILNDLHEGDRFNILKFSSSLSFYKESSVIANKK 80
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT---NSRGSIPIIFLVTDGAVEDERQIC 443
V A ++ N +A G TNI + L L N R +IF +TDG
Sbjct: 81 TVREAKIYVK-NMVAAGGTNINSALLDGSNFLNKAENFRERAQVIFFLTDGQPSSGETNL 139
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+ + + S IY+ G ++ F++ +A+ + G Y+ +Q++ +
Sbjct: 140 NKILENVRIANSNAFPIYSLAFGQGADYNFIKKVAIQNNGLSRKIYEDSDATLQIKGFYD 199
Query: 504 RGFSSVLANIAIDTLKDLDEFE-MYPSRIPDLSSESPLIVSGRYQ 547
++VL +++ L + + + P S L+V+G+ +
Sbjct: 200 EVSAAVLKDVSFKYLNNASSVQNLTKISFPSYLLGSELVVAGKIE 244
>gi|440905495|gb|ELR55871.1| Inter-alpha-trypsin inhibitor heavy chain H1, partial [Bos
grunniens mutus]
Length = 908
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 121/260 (46%), Gaps = 46/260 (17%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P ++ K K+V+F++DIS SM+G+ ++ TK AL L + PGD F++V F
Sbjct: 274 FAHFFAP---QNLKKLNKNVVFVIDISSSMEGQKVKQTKEALHKILGDMQPGDYFDLVLF 330
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ S+ A+ +E A ++ F G+TN+ L + +E+L ++ S+P
Sbjct: 331 GSAVQSWKGSLVQASPANLEAARNFVQ-QFSLAGATNLNGGLLRGIEILNKAQQSLPELS 389
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG V D QI D +K R +Y G G + FL
Sbjct: 390 NHASILIMLTDGEPTEGVMDRTQILKNVRDGIKGRFP--------LYNLGFGHDVDLNFL 441
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPD 533
++++ + G Y+ Q+Q GF +AN L+D++ +YP + D
Sbjct: 442 EVMSLENNGRVQRIYEDHDATQQLQ-----GFYEQVAN---PLLRDVE--LLYPREAVSD 491
Query: 534 LSSE--------SPLIVSGR 545
L+ S ++V+GR
Sbjct: 492 LTQHRHKQYYEGSEIMVAGR 511
>gi|405957680|gb|EKC23874.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Crassostrea gigas]
Length = 882
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/355 (22%), Positives = 153/355 (43%), Gaps = 45/355 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K KDV+ I+D SGSM G+ + K A+ L L GD FNI+ F
Sbjct: 290 FVHFFAPSDMKE---IPKDVLLILDCSGSMGGRKIGQMKAAVIGVLKDLHEGDRFNILKF 346
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI---- 425
+ + SM A K++++ A +++ + G TNI A + + V+ L S G I
Sbjct: 347 SEIVEYYKDSMVYANKKSIDDAKEYVE-SLSPTGGTNINAGMLQGVDFLKTS-GDINERS 404
Query: 426 PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
++F +TDG + D +I + ++ N S +Y+ G ++ F++ +A+ +
Sbjct: 405 QVVFFLTDGEATSGITDNSRILENIR----NANSKAFPVYSLAFGKGADYEFVKKVAVQN 460
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFE-MYPSRIPDLSSESPL 540
G Y+ +Q++ + S++L N+ L + + + P + S +
Sbjct: 461 NGLSRRIYEDSDATLQIKGFYDEVSSAILKNVTFKYLNNASSVQNLTTVNFPSYFTGSEI 520
Query: 541 IVSGRYQGK----FPDTLKAKGFLGDLSNFVVELKLQL-AKDIPLDRICAKQQIDLLTAQ 595
+V+G+ + F +++ G G VEL L K P + + +T +
Sbjct: 521 VVAGKIENNAVRLFDLSVQGLGSTGS-----VELALSADVKSKPFPELTKSGDYEQITEK 575
Query: 596 AWF-----------------SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
W + + L+ K ++S+Q + T M++ + +E++
Sbjct: 576 IWAYLTIKQLLEKSIGETDSTTKEELKNKALQLSLQYKFVTPLTSMVVTKPEEKS 630
>gi|355746640|gb|EHH51254.1| hypothetical protein EGM_10594 [Macaca fascicularis]
Length = 891
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 121/262 (46%), Gaps = 16/262 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 294 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 353
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 354 TPENLQEAKTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 412
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 413 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 471
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGK----FP 551
+Q+Q + + +L + ++ ++ + + S ++V+GR G+ F
Sbjct: 472 LQLQGFYEEVANPLLTGVEMEYPENAI-LGLTQNTYQHFYDGSEIVVAGRLVGEDVNSFK 530
Query: 552 DTLKAKGFLGDLSNFVVELKLQ 573
+K G DL+ F E+ ++
Sbjct: 531 ADVKGHGAANDLT-FTEEVDMK 551
>gi|348588789|ref|XP_003480147.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like [Cavia
porcellus]
Length = 910
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 121/250 (48%), Gaps = 25/250 (10%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+K+++F++DISGSM+G+ ++ TK AL L + P D F++V F + S+ A+K
Sbjct: 289 RKNLVFVIDISGSMEGQKVKQTKEALLKILQDMRPEDYFDLVLFGSNVQSWKGSLVQASK 348
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG---- 434
++ A ++ F G+TN+ L + +E+L ++ +P I+ ++TDG
Sbjct: 349 ANLQAAGDFVR-RFSLHGATNLNGGLLRGIEILNKAQERLPELNKHASIVIMLTDGEPTE 407
Query: 435 AVEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
V D QI +++ +I R +Y G G+ + FL +++M + G+ Y+
Sbjct: 408 GVTDRAQIQKNIRN------AIQGRFPLYNLGFGNDVDFTFLEVMSMENNGWAQRIYEDH 461
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ----G 548
Q+Q + + S +L ++ + D + R S ++V+GR G
Sbjct: 462 DAAQQLQGFYNQVSSPLLVDVDMQYPPDAVS-ALTQHRHKQYYDGSEIVVAGRIADNKLG 520
Query: 549 KFPDTLKAKG 558
F +KA+G
Sbjct: 521 VFKADVKAQG 530
>gi|344276189|ref|XP_003409891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Loxodonta africana]
Length = 826
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 159/367 (43%), Gaps = 54/367 (14%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN+V F
Sbjct: 259 FVHYFAP---EGLPTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLRPNDQFNLVVF 315
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+ E + S+ A+ + V+ A + + A G TNI + AV++L +S
Sbjct: 316 SKEATQWKPSLVPASTQNVKEAKSY-AASIQALGGTNINDAMMMAVKLLDSSNKEELLPA 374
Query: 423 GSIPIIFLVTDG-AVEDERQICDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
GS+ +I L+TDG E E K+ R G ++ G G ++ FL +A+
Sbjct: 375 GSVSLIILLTDGDPTEGETNPAKIQKNVREAIYGQYS--LFCLGFGFDVSYAFLEKMALD 432
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE-- 537
+ G Y+ +Q+Q + + +L +A + YPS + ++ E
Sbjct: 433 NGGLARRIYEDSDSALQLQDFYQEVANPLLTGVAFE----------YPSNAVEKVTQENF 482
Query: 538 ------SPLIVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP---------- 579
S ++V+G+ + + D L AK G + NF + + +A+
Sbjct: 483 RLFFKGSEMVVAGKLRDQSSDVLSAKVSGRVHK-QNFTFQTESNVAEQEEEFRSPKYIFH 541
Query: 580 --LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDER 632
++R+ A I L Q A +E LE K +S+ + T M++ E E+
Sbjct: 542 NFMERLWAYLTIQQLLEQIISASDAERPALENKALSLSLNYSFVTPLTSMVVTKPEGQEQ 601
Query: 633 NNASESP 639
+ +E P
Sbjct: 602 SQVAEKP 608
>gi|449276862|gb|EMC85224.1| Inter-alpha-trypsin inhibitor heavy chain H4, partial [Columba
livia]
Length = 576
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 115/502 (22%), Positives = 201/502 (40%), Gaps = 54/502 (10%)
Query: 93 LDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSY----H 148
L VDC + + F T ++ S+S + V + I I GK Y
Sbjct: 6 LHVDCKVTSRFAHTVITSKI-ANRASESREATFEVELPKTAFITNFSMSIDGKIYPGIIK 64
Query: 149 TQLIALGENDGA---GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDG 205
+ A E D A G+SA + + K F +++ I S ++ L + + L + G
Sbjct: 65 EKASAQKEYDTAISRGQSAGLVKITGRKLEQFHVSVS-IAATSKVTFELMYEELLKRQLG 123
Query: 206 EFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGY 265
+F + + V P K++ K +I +++ G L S+ +L + K+
Sbjct: 124 KFELLIK------VQP--KQLVKHFQIDVHIFEPQGIRFLETDSTFMTNELTEALTKVQN 175
Query: 266 SYESEVL----------------KWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREM 309
++ +L N DF Y V G + + +
Sbjct: 176 ETKAHILFKPTVEQQKINPEVDETLLNGDFVVRYDVKRVATAGDIQIVNG---------Y 226
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + F K+VIF++D SGSM G+ +E T+ AL L L P D F+ + F
Sbjct: 227 FVHYFAPHEMPA---FPKNVIFVIDRSGSMAGRKIEQTRAALLKILQDLHPEDHFSFITF 283
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG----SI 425
N + + +S+ AT E V A ++ A G T+I L AV +L + S+
Sbjct: 284 NSKVVEWKSSLLQATAENVASATGFMR-TLSASGGTDINRALLTAVSLLDKAEKLPERSV 342
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+I L+TDG + ++ + N + ++ G G ++ FL +A+ + G
Sbjct: 343 SMIILLTDGQPTSGENNVEVIQENIQNAINGKYALFCLGFGFDVSYKFLEKMALSNGGIA 402
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
Y+ +Q+Q + + L I + ++ E + + S LIVSG+
Sbjct: 403 RRIYENSDAALQLQGFYQEVATPTLMKIEMQYPENAVE-GLTKNNFKLFFEGSELIVSGK 461
Query: 546 YQGK---FPDTLKAKGFLGDLS 564
+ P +KA+ DL+
Sbjct: 462 ISNELDLLPVEIKAQSHASDLT 483
>gi|426249409|ref|XP_004018442.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Ovis
aries]
Length = 906
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 116/245 (47%), Gaps = 43/245 (17%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DIS SM+G+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 284 LNKNVVFVIDISSSMEGQKVKQTKEALRKILGDMRPGDYFDLVLFGSAVQSWKGSLVQAS 343
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+E A ++ F G+TN+ L + +E+L ++ S+P I+ ++TDG
Sbjct: 344 PANLEAARNFVQ-QFSLAGATNLNGGLLRGIEILNKAQQSLPELSNHASILIMLTDGEPT 402
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI DA+K R +Y G G + FL ++++ + G Y
Sbjct: 403 EGVTDRTQILKNVRDAIKGRFP--------LYNLGFGHDVDLNFLEVMSLENNGRVQRIY 454
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPL 540
+ Q+Q GF +AN L+D++ +YP + DL+ S +
Sbjct: 455 EDHDATQQLQ-----GFYEQVAN---PLLRDVE--LLYPQEAVSDLTQHRHKQYYEGSEI 504
Query: 541 IVSGR 545
+V+GR
Sbjct: 505 MVAGR 509
>gi|332216197|ref|XP_003257231.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 2
[Nomascus leucogenys]
Length = 769
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 118/249 (47%), Gaps = 25/249 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+ E
Sbjct: 149 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMPPGDYFDLVLFGTRVQSWKGSLVQAS-E 207
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
A RA Q F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 208 ANLRAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAAILIMLTDGDPTEG 267
Query: 436 VEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI ++S +I R +Y G G + FL +++M + G Y+
Sbjct: 268 VTDRSQILKNIRS------AIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRD 321
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----QGK 549
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 322 ATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQSS 380
Query: 550 FPDTLKAKG 558
F ++A G
Sbjct: 381 FKADVQAHG 389
>gi|332216195|ref|XP_003257230.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 1
[Nomascus leucogenys]
Length = 911
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 118/249 (47%), Gaps = 25/249 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+ E
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMPPGDYFDLVLFGTRVQSWKGSLVQAS-E 349
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
A RA Q F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 350 ANLRAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAAILIMLTDGDPTEG 409
Query: 436 VEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI ++S +I R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNIRS------AIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRD 463
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----QGK 549
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 464 ATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQSS 522
Query: 550 FPDTLKAKG 558
F ++A G
Sbjct: 523 FKADVQAHG 531
>gi|332216199|ref|XP_003257232.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 3
[Nomascus leucogenys]
gi|332216201|ref|XP_003257233.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 4
[Nomascus leucogenys]
Length = 623
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 118/251 (47%), Gaps = 25/251 (9%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 1 MNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMPPGDYFDLVLFGTRVQSWKGSLVQAS 60
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
EA RA Q F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 61 -EANLRAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAAILIMLTDGDPT 119
Query: 435 -AVEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI ++S +I R +Y G G + FL +++M + G Y+
Sbjct: 120 EGVTDRSQILKNIRS------AIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 173
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 174 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQ 232
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 233 SSFKADVQAHG 243
>gi|345326273|ref|XP_001510478.2| PREDICTED: inter-alpha (globulin) inhibitor H5-like
[Ornithorhynchus anatinus]
Length = 1613
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 175/403 (43%), Gaps = 50/403 (12%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y VS I G V + + F Y P + +K+V+F++D+SG
Sbjct: 415 DFVVQYDVSMKDIIGDVQIYN---------GYFVHYFAP---RGLPPVQKNVVFVIDVSG 462
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G ++ TK A+ V L+ L D FNIV F+ ++ S + AT ++ A ++
Sbjct: 463 SMFGTKMKQTKKAMHVILNDLHHDDYFNIVTFSDAVSVWKASGSIQATPPNIKSAKVYVN 522
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSRGS---------IPIIFLVTDG----AVEDERQIC 443
A G T+I A L A + S G IP+I +TDG V +I
Sbjct: 523 -KMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIPLIIFLTDGEATAGVTVASRIL 581
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
K L G+I ++ G +++ +R L++ +RG Y+ +Q++ +
Sbjct: 582 SNAKQSLK--GNIS--LFGLAFGDDADYHLMRRLSLENRGVARRIYEDADATLQLKGFYD 637
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD--------TLK 555
S +L +I + L D ++ + P+ S L+V+G+ + D K
Sbjct: 638 EIASPLLYDIELTYL-DSSVQDVTQNLFPNYFGGSELVVAGKVKPGVRDLHVQMTAQNYK 696
Query: 556 AKGFLG-DLSNFVVELKLQLAKDIP-----LDRICAKQQI-DLLTA--QAWFSEDKR-LE 605
+ +G D++ E A+D+ + R+ A I DLL A +A + ++R L
Sbjct: 697 EQVVVGNDITTNSTEAAFGCAEDLSQIERFVQRLWAYFTIQDLLQARFKANDTANRRLLS 756
Query: 606 EKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKS 648
EK + +S++ + T +++V +E ++ G+ S
Sbjct: 757 EKATNLSLKYNFVTPVTSLVVVRPEEDEGKPQATPASGGTTTS 799
>gi|334338568|ref|XP_001380258.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Monodelphis domestica]
Length = 780
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 151/335 (45%), Gaps = 49/335 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D SGSM G+ ++ TK AL L L P D FN++ F+G + + LA E
Sbjct: 261 KNIVFLIDKSGSMAGRKIKKTKAALIKILDDLKPEDHFNMITFSGHVTRWKPELVLALDE 320
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG-AVED 438
++ A ++ N A G TN+ + AV ML S GS+ +I L+TDG + E
Sbjct: 321 HLKEAKTFLS-NTPALGVTNVNGAVLAAVSMLDESNKKKELPEGSVSMIILLTDGDSTEG 379
Query: 439 E---RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
E ++I + +K+ + ++ G G N+ FL LA+ + G ++ E
Sbjct: 380 ETKLQKIHENVKAAIRGQY----HLFCLGFGFDINYVFLERLALDNGGMARHIFEGLDAE 435
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q + + +L + EF+ + L+ + S L+V+G+
Sbjct: 436 LQLQDFYQEVANPLLTQV---------EFQYSDHAVELLTKDSFGIFFRGSELVVAGKLK 486
Query: 547 ---QGKFPDTLKAKG------FLGDLSNFVVELKLQLAKDIP---LDRICAK---QQIDL 591
Q F ++ K F ++ E Q K+I ++R+ A QQ+
Sbjct: 487 PQAQELFSAHVRGKSNNQTFTFQTQVNTTSKEKVFQSGKNISHSFMERLWAYLTIQQLQE 546
Query: 592 LTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMII 626
A +E +RLE + +S+ L +T +I+
Sbjct: 547 KVVLASGTELQRLEARAVNLSLNFNFLTPFTNLIV 581
>gi|443700702|gb|ELT99546.1| hypothetical protein CAPTEDRAFT_222769 [Capitella teleta]
Length = 828
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 159/356 (44%), Gaps = 50/356 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+FI+D+SGSM G + TK+A+ LS L GD FNI+ F+ + L E
Sbjct: 243 KNVLFILDVSGSMAGAKISQTKDAMKKILSDLRSGDMFNIITFSTSLKFWHGEGLLPANE 302
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-------NSRGSIPIIFLV----TDGA 435
+R + A G TNI L + V+ L+ NS + ++FL T G
Sbjct: 303 QNKRKAKEFIKGMKAEGGTNINDALLRGVDFLSQQESLDPNSERTSMLMFLTDGEPTAGI 362
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
R + + R +NG +++ G G+ + FL+ +++ +RG Y +
Sbjct: 363 TSTSRIMANL---RASNGNRFT--LFSLGFGTDVDFTFLQKVSLQNRGLARKIYTGADAD 417
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRI--PDLSS---ESPLIVSGRYQGKF 550
+Q++ ++ + V+++I I L+D + P + D S+ S ++V+G+ +
Sbjct: 418 LQLEGFYSEVSTPVMSDIKIRYLED----SVSPDSVTTTDFSAFFRGSEIVVAGKLTDRL 473
Query: 551 PD-TLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICAKQQIDLLTAQAW-FSEDKRLEEK 607
+ LK G G + + L ++ +P ++R + T + W + K+L++
Sbjct: 474 SNLNLKVSG--GSARGY---MSLDMSVKVPKVERESTRTAFSNFTEKLWAYMTIKKLDKS 528
Query: 608 VSKMS---VQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKILDSEVPK 660
++ S Q + T M+IV+ DE P G ++LD E P+
Sbjct: 529 LNSSSFLDFQYNFVTPLTSMVIVKPDE-------PTVNSG-------ELLDKEAPR 570
>gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive
glycoprotein) [Bos taurus]
Length = 916
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 159/369 (43%), Gaps = 58/369 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P S K+VIF++D SGSM G+ ++ T+ AL L L P D F++++F
Sbjct: 260 FVHYFAP---DSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFDLISF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + + A+ E V A + GI A G TNI + AV++L +
Sbjct: 317 SSEATTWKPLLVPASTENVNEAKSYATGIQ--AQGGTNINDAMLMAVQLLEKANQEELLP 374
Query: 422 RGSIPIIFLVTDG-AVEDERQICDAMKS--RLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
GSI +I L+TDG E + K+ + NG ++ G G ++ FL +A
Sbjct: 375 EGSITLIILLTDGDPTVGETNPSNIQKNVRKAINGQH---SLFCLGFGFDVSYAFLEKMA 431
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE- 537
+ + G Y+ +Q+Q + + ++ ++A + YPS + ++
Sbjct: 432 LENGGLARRIYEDSDSALQLQDFYQEVANPLMTSVAFE----------YPSNAVESVTQD 481
Query: 538 --------SPLIVSGRYQGKFPDTLKAKGFLGDL--SNFVVELKLQLAKDIPL------- 580
S L+V+G+ + + PD L A+ G L N + +A+ +
Sbjct: 482 TFRVFFKGSELVVAGKLREQSPDVLLAQ-IRGQLHRENITYMMMSHVAEQEEMFRSPKYI 540
Query: 581 -----DRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETD 630
+R+ A I L Q A +E + LE + +S+ + T M+I E
Sbjct: 541 FHSFIERLWAYLTIQQLLEQMVSALDAEKQALEARALSLSLSYSFVTPLTSMVITKPEGQ 600
Query: 631 ERNNASESP 639
E++ +E P
Sbjct: 601 EQSQVAEKP 609
>gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus]
gi|122140331|sp|Q3T052.1|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4;
Short=ITI heavy chain H4; Short=ITI-HC4;
Short=Inter-alpha-inhibitor heavy chain 4; Flags:
Precursor
gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive
glycoprotein) [Bos taurus]
Length = 916
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 159/369 (43%), Gaps = 58/369 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P S K+VIF++D SGSM G+ ++ T+ AL L L P D F++++F
Sbjct: 260 FVHYFAP---DSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFDLISF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + + A+ E V A + GI A G TNI + AV++L +
Sbjct: 317 SSEATTWKPLLVPASTENVNEAKSYATGIQ--AQGGTNINDAMLMAVQLLEKANQEELLP 374
Query: 422 RGSIPIIFLVTDG-AVEDERQICDAMKS--RLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
GSI +I L+TDG E + K+ + NG ++ G G ++ FL +A
Sbjct: 375 EGSITLIILLTDGDPTVGETNPLNIQKNVRKAINGQH---SLFCLGFGFDVSYAFLEKMA 431
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE- 537
+ + G Y+ +Q+Q + + ++ ++A + YPS + ++
Sbjct: 432 LENGGLARRIYEDSDSALQLQDFYQEVANPLMTSVAFE----------YPSNAVESVTQD 481
Query: 538 --------SPLIVSGRYQGKFPDTLKAKGFLGDL--SNFVVELKLQLAKDIPL------- 580
S L+V+G+ + + PD L A+ G L N + +A+ +
Sbjct: 482 TFRVFFKGSELVVAGKLREQSPDVLLAQ-IRGQLHRENITYMMMSHVAEQEEMFRSPKYI 540
Query: 581 -----DRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETD 630
+R+ A I L Q A +E + LE + +S+ + T M+I E
Sbjct: 541 FHSFIERLWAYLTIQQLLEQMVSALDAEKQALEARALSLSLSYSFVTPLTSMVITKPEGQ 600
Query: 631 ERNNASESP 639
E++ +E P
Sbjct: 601 EQSQVAEKP 609
>gi|431891316|gb|ELK02193.1| von Willebrand factor A domain-containing protein 5B1 [Pteropus
alecto]
Length = 1185
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 99/191 (51%), Gaps = 10/191 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D SGSM G + K+A+ VAL L P FN++ F G T+ LF +S + +
Sbjct: 360 EFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPSCLFNVIGF-GSTFKALFPSS-QTYNE 417
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
E++ A I G TNI +PL + + RG ++FL+TDGAV + ++ +
Sbjct: 418 ESLAIACDSIQRMRADMGGTNILSPLKWIIRQPVH-RGHPRLLFLITDGAVNNTGKVLE- 475
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
L + R Y+FGIG H ++ LA +S+G + + ++ +M K +
Sbjct: 476 ----LVRNHAFSTRCYSFGIGPNVCHRLVKGLATVSKGSAEFLVEGERLQPKMVKSLKKA 531
Query: 506 FSSVLANIAID 516
+ VL+++ ++
Sbjct: 532 MAPVLSDVTVE 542
>gi|426222816|ref|XP_004005578.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1 [Ovis aries]
Length = 1240
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 102/207 (49%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + + IF+VD SGSM G ++ K+A+ VAL L P FNI+ F
Sbjct: 359 FCPDLQSVQPNLRKT-RGEFIFLVDRSGSMNGTNIQCVKDAMLVALKSLMPSCLFNIIGF 417
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S + ++E + A I G TNI +PL K V RG ++F
Sbjct: 418 GSTFKTLFPSSQTYSEENLAMACDSIQKMRADMGGTNILSPL-KWVIRQPVLRGCPRLLF 476
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDGAV + ++ + +++ + R Y+FGIG H ++ LA +S+G
Sbjct: 477 LITDGAVNNTGKVLELVRNH-----AFSTRCYSFGIGPNVCHRLVKGLATVSKGSAEFLE 531
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 532 EGERLQPKMVKSLKKAMAPVLSDVTVE 558
>gi|355559574|gb|EHH16302.1| hypothetical protein EGK_11566 [Macaca mulatta]
Length = 911
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 118/251 (47%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGSRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAKDFVR-RFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVRAHG 531
>gi|296474801|tpg|DAA16916.1| TPA: inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos
taurus]
Length = 916
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 159/369 (43%), Gaps = 58/369 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P S K+VIF++D SGSM G+ ++ T+ AL L L P D F++++F
Sbjct: 260 FVHYFAP---DSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFDLISF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + + A+ E V A + GI A G TNI + AV++L +
Sbjct: 317 SSEATTWKPLLVPASTENVNEAKSYATGIQ--AQGGTNINDAMLMAVQLLEKANQEELLP 374
Query: 422 RGSIPIIFLVTDG-AVEDERQICDAMKS--RLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
GSI +I L+TDG E + K+ + NG ++ G G ++ FL +A
Sbjct: 375 EGSITLIILLTDGDPTVGETNPLNIQKNVRKAINGQH---SLFCLGFGFDVSYAFLEKMA 431
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE- 537
+ + G Y+ +Q+Q + + ++ ++A + YPS + ++
Sbjct: 432 LENGGLARRIYEDSDSALQLQDFYQEVANPLMTSVAFE----------YPSNAVESVTQD 481
Query: 538 --------SPLIVSGRYQGKFPDTLKAKGFLGDL--SNFVVELKLQLAKDIPL------- 580
S L+V+G+ + + PD L A+ G L N + +A+ +
Sbjct: 482 TFRVFFKGSELVVAGKLREQSPDVLLAQ-IRGQLHRENITYMMMSHVAEQEEMFRSPKYI 540
Query: 581 -----DRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETD 630
+R+ A I L Q A +E + LE + +S+ + T M+I E
Sbjct: 541 FHSFIERLWAYLTIQQLLEQMVSALDAEKQALEARALSLSLSYSFVTPLTSMVITKPEGQ 600
Query: 631 ERNNASESP 639
E++ +E P
Sbjct: 601 EQSQVAEKP 609
>gi|355746641|gb|EHH51255.1| hypothetical protein EGM_10595 [Macaca fascicularis]
Length = 911
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 118/251 (47%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGSRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAKDFVR-RFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVRAHG 531
>gi|194391096|dbj|BAG60666.1| unnamed protein product [Homo sapiens]
Length = 698
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 116/261 (44%), Gaps = 31/261 (11%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLS 564
F +K G DL+
Sbjct: 509 DEDMNSFKADVKGHGATNDLT 529
>gi|426340857|ref|XP_004034343.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1
[Gorilla gorilla gorilla]
gi|426340859|ref|XP_004034344.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 2
[Gorilla gorilla gorilla]
Length = 890
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 155/354 (43%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 MPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRY- 546
+Q+Q GF +AN + + E E + I DL+ S ++V+GR
Sbjct: 458 LQLQ-----GFYEEVANPLLTGV----EMEYPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLLT 593
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLL 567
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+ A E + L + +S++ + T M++ + DER A + PG
Sbjct: 568 EKRKNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|402859853|ref|XP_003894351.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Papio
anubis]
Length = 911
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 118/251 (47%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGSRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAKDFVR-RFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVRAHG 531
>gi|440905493|gb|ELR55869.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Bos grunniens mutus]
Length = 927
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 159/369 (43%), Gaps = 58/369 (15%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P S K+VIF++D SGSM G+ ++ T+ AL L L P D F++++F
Sbjct: 258 FVHYFAP---DSLSTIPKNVIFVIDKSGSMMGRKIKQTREALIKILDDLSPHDQFDLISF 314
Query: 370 NGETYLFSTSMELATKEAVERAHQW-IGINFIAGGSTNICAPLTKAVEMLTNS------- 421
+ E + + A+ E V A + GI A G TNI + AV++L +
Sbjct: 315 SSEATTWKPLLVPASTENVNEAKSYATGIQ--AQGGTNINDAMLMAVQLLEKANQEELLP 372
Query: 422 RGSIPIIFLVTDG-AVEDERQICDAMKS--RLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
GSI +I L+TDG E + K+ + NG ++ G G ++ FL +A
Sbjct: 373 EGSITLIILLTDGDPTVGETNPSNIQKNVRKAINGQH---SLFCLGFGFDVSYAFLEKMA 429
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE- 537
+ + G Y+ +Q+Q + + ++ ++A + YPS + ++
Sbjct: 430 LENGGLARRIYEDSDSALQLQDFYQEVANPLMTSVAFE----------YPSNAVESVTQD 479
Query: 538 --------SPLIVSGRYQGKFPDTLKAKGFLGDL--SNFVVELKLQLAKDIP-------- 579
S L+V+G+ + + PD L A+ G L N + +A+
Sbjct: 480 TFRVFFKGSELVVAGKLREQSPDVLLAQ-IRGQLHRENITFMMMSHVAEQEEMFRSPKYI 538
Query: 580 ----LDRICAKQQIDLLTAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETD 630
++R+ A I L Q A +E + LE + +S+ + T M+I E
Sbjct: 539 FHSFMERLWAYLTIQQLLEQMVSALDAEKQALEARALSLSLSYSFVTPLTSMVITKPEGQ 598
Query: 631 ERNNASESP 639
E++ +E P
Sbjct: 599 EQSQVAEKP 607
>gi|149034207|gb|EDL88977.1| inter-alpha trypsin inhibitor, heavy chain 3 [Rattus norvegicus]
Length = 886
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 118/261 (45%), Gaps = 14/261 (5%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K++ F++D+SGSM G+ ++ T+ AL L + D N + F+ + + A
Sbjct: 280 VVPKNIAFVIDVSGSMSGRKIQQTREALLKILDDMKEEDYLNFILFSTGVTTWKDHLVKA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAV 436
T +E A ++ N TNI L + +EML +R S I+ ++TDG
Sbjct: 340 TPANLEEARAFVK-NIRDRSMTNINDGLLRGIEMLNKAREDHLVPERSTSILVMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N +Y G G+ N+ FL LA+ + G+ Y+ +
Sbjct: 399 NTGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFARRIYEDSDASL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR + D KA
Sbjct: 459 QLQGFYEEVANPLLTNVELEYPENAI-LDLTRNSYPHFYDGSEIVVAGRLVDRNVDNFKA 517
Query: 557 ----KGFLGDLSNFVVELKLQ 573
G L DL+ F E+ ++
Sbjct: 518 DVKGHGALNDLT-FTEEVDMK 537
>gi|8393899|ref|NP_059047.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Rattus
norvegicus]
gi|3024056|sp|Q63416.1|ITIH3_RAT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; Flags:
Precursor
gi|602886|emb|CAA58233.1| pre-alpha-inhibitor, heavy chain 3 [Rattus norvegicus]
Length = 887
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 118/261 (45%), Gaps = 14/261 (5%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K++ F++D+SGSM G+ ++ T+ AL L + D N + F+ + + A
Sbjct: 280 VVPKNIAFVIDVSGSMSGRKIQQTREALLKILDDMKEEDYLNFILFSTGVTTWKDHLVKA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAV 436
T +E A ++ N TNI L + +EML +R S I+ ++TDG
Sbjct: 340 TPANLEEARAFVK-NIRDRSMTNINDGLLRGIEMLNKAREDHLVPERSTSILVMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N +Y G G+ N+ FL LA+ + G+ Y+ +
Sbjct: 399 NTGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFARRIYEDSDASL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKA 556
Q+Q + + +L N+ ++ ++ ++ + P S ++V+GR + D KA
Sbjct: 459 QLQGFYEEVANPLLTNVELEYPENAI-LDLTRNSYPHFYDGSEIVVAGRLVDRNVDNFKA 517
Query: 557 ----KGFLGDLSNFVVELKLQ 573
G L DL+ F E+ ++
Sbjct: 518 DVKGHGALNDLT-FTEEVDMK 537
>gi|148692830|gb|EDL24777.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_d [Mus
musculus]
Length = 651
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 118/255 (46%), Gaps = 14/255 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+++F++DISGSM+G+ + TK AL L + P D+F++V F
Sbjct: 274 FTHFFAP---KNLTNMSKNLVFVIDISGSMEGQKVRQTKEALLKILEDMRPVDNFDLVLF 330
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ + S+ A+ ++ A ++ F G+TN+ L + +E+L ++GS P
Sbjct: 331 GSKVQSWKGSLVPASNANLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELS 389
Query: 427 ----IIFLVTDGA-VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
I+ ++TDG E E +K+ P +Y G G + FL +++ +
Sbjct: 390 SPASILIMLTDGEPTEGETDRSQILKNVRNAIRGRFP-LYNLGFGHDLDFSFLEVMSTEN 448
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
G+ Y+ Q+Q + + + +L ++ + +D + R S ++
Sbjct: 449 NGWAQRIYEDHDATQQLQGFYNQVANPLLTDVELQYPQDA-VLALTQHRHKQYYDGSEIV 507
Query: 542 VSGRYQGKFPDTLKA 556
V+GR +T KA
Sbjct: 508 VAGRIANHKLNTFKA 522
>gi|344276191|ref|XP_003409892.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Loxodonta
africana]
Length = 915
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 120/262 (45%), Gaps = 20/262 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F +L P + K+++F++DISGSMQG+ ++ TK AL L + PGD F++V F
Sbjct: 285 FAHFLAPQNLTN---LNKNLVFVIDISGSMQGQKVKQTKEALLKILGDMRPGDHFDLVLF 341
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI---- 425
+ + S+ A+ ++ A ++ F G+TN+ L + +E+L ++ S+
Sbjct: 342 GAQVQSWRGSLVQASAANLQAARNFVQ-QFSVAGATNMNGGLLRGIEILDRAQESLSELS 400
Query: 426 ---PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMI 480
I+ ++TDG E D K R +I R +Y G G+ + FL ++M
Sbjct: 401 NSASILIMLTDG--EPTEGETDRFKIRENVRKAIRGRFPLYNLGFGNSVDFNFLDAMSME 458
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G Y+ Q++ + + + +L ++ + +D + S +
Sbjct: 459 NNGRAQRIYEDHDAAQQLEGFYNQVANPLLVDVEMQYPQDT-VVALTQHHHKHYYEGSEI 517
Query: 541 IVSGRYQ----GKFPDTLKAKG 558
+V+GR G F ++A G
Sbjct: 518 VVAGRIAEHKLGDFKANIRAHG 539
>gi|148692829|gb|EDL24776.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_c [Mus
musculus]
Length = 658
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 118/255 (46%), Gaps = 14/255 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+++F++DISGSM+G+ + TK AL L + P D+F++V F
Sbjct: 281 FTHFFAP---KNLTNMSKNLVFVIDISGSMEGQKVRQTKEALLKILEDMRPVDNFDLVLF 337
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ + S+ A+ ++ A ++ F G+TN+ L + +E+L ++GS P
Sbjct: 338 GSKVQSWKGSLVPASNANLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELS 396
Query: 427 ----IIFLVTDGA-VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
I+ ++TDG E E +K+ P +Y G G + FL +++ +
Sbjct: 397 SPASILIMLTDGEPTEGETDRSQILKNVRNAIRGRFP-LYNLGFGHDLDFSFLEVMSTEN 455
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
G+ Y+ Q+Q + + + +L ++ + +D + R S ++
Sbjct: 456 NGWAQRIYEDHDATQQLQGFYNQVANPLLTDVELQYPQDA-VLALTQHRHKQYYDGSEIV 514
Query: 542 VSGRYQGKFPDTLKA 556
V+GR +T KA
Sbjct: 515 VAGRIANHKLNTFKA 529
>gi|124249351|ref|NP_032432.2| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Mus
musculus]
gi|341940849|sp|Q61702.2|ITIH1_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1;
Short=ITI heavy chain H1; Short=ITI-HC1;
Short=Inter-alpha-inhibitor heavy chain 1; Flags:
Precursor
gi|74146297|dbj|BAE28922.1| unnamed protein product [Mus musculus]
gi|148692827|gb|EDL24774.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_a [Mus
musculus]
Length = 907
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 120/262 (45%), Gaps = 28/262 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+++F++DISGSM+G+ + TK AL L + P D+F++V F
Sbjct: 274 FTHFFAP---KNLTNMSKNLVFVIDISGSMEGQKVRQTKEALLKILEDMRPVDNFDLVLF 330
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ + S+ A+ ++ A ++ F G+TN+ L + +E+L ++GS P
Sbjct: 331 GSKVQSWKGSLVPASNANLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELS 389
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG D QI +A++ R +Y G G + FL
Sbjct: 390 SPASILIMLTDGEPTEGETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFSFL 441
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+++ + G+ Y+ Q+Q + + + +L ++ + +D + R
Sbjct: 442 EVMSTENNGWAQRIYEDHDATQQLQGFYNQVANPLLTDVELQYPQDA-VLALTQHRHKQY 500
Query: 535 SSESPLIVSGRYQGKFPDTLKA 556
S ++V+GR +T KA
Sbjct: 501 YDGSEIVVAGRIANHKLNTFKA 522
>gi|410920147|ref|XP_003973545.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Takifugu rubripes]
Length = 724
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 115/246 (46%), Gaps = 17/246 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G ++ + A+ L L P D F I+ F+ + S+ LAT+E
Sbjct: 209 KNVVFVIDRSGSMSGTKMQQIQEAMIKILEDLHPEDHFGIIQFDSSVDSWRNSLSLATEE 268
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
+ A ++ STNI A + KAV+ML R SI +I L+TDG
Sbjct: 269 NISEAMAYVK-KIAHRESTNINAAVLKAVDMLVTDRKAKRLPEKSIDMIILLTDGDPTTG 327
Query: 440 RQICDAMKSRLTN--GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
++ + N GG++ +Y G G+ ++ FL +++ ++G Y +Q
Sbjct: 328 ETRIPLIQENVRNAIGGNMS--LYGLGFGNDVDYAFLDVMSRENKGLARRIYTGADAALQ 385
Query: 498 MQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ----GKFPDT 553
+Q + S +L ++ + + E + + L + S +IV+GR FP
Sbjct: 386 LQGFYDEVSSPLLLDVDLRYPDNAVE-SLTTNHFSQLFNGSEIIVAGRLTDNDIDNFPVE 444
Query: 554 LKAKGF 559
+ +GF
Sbjct: 445 VYGQGF 450
>gi|197101601|ref|NP_001126282.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Pongo
abelii]
gi|55730948|emb|CAH92192.1| hypothetical protein [Pongo abelii]
Length = 911
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 118/251 (47%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + +R S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNRHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|332245261|ref|XP_003271779.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1 [Nomascus leucogenys]
Length = 1085
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 109/222 (49%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 328 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 387
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + ++E++ A I G TNI +PL
Sbjct: 388 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEESLAMACDDIQRMKADMGGTNILSPLKWV 445
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++FL+TDGAV + ++ + L + R Y+FGIG H +
Sbjct: 446 IRQPVH-RGHPRLLFLITDGAVNNTGKVLE-----LVRNHAFSTRCYSFGIGPNVCHRLV 499
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S+G + + ++ +M K + + VL+++ ++
Sbjct: 500 KGLASVSKGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 541
>gi|296225412|ref|XP_002758286.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Callithrix
jacchus]
Length = 860
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 153/354 (43%), Gaps = 50/354 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V F++D+SGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKSVAFVIDVSGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEAKMFVK-SIDDRGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N +Y G G+ N+ FL +A+ ++G+ Y+ ++
Sbjct: 399 NVGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLENMALENQGFARRIYEDSDADL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY- 546
Q+Q + + +L + ++ YP I DL+ S ++V+GR
Sbjct: 459 QLQGFYEEVANPLLTGVEVE----------YPENAILDLTQNTYQHFYDGSEIVVAGRLV 508
Query: 547 ---QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL- 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 509 DEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGDYIERLWAYLTIEQLL 567
Query: 593 --TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+A E + L + +S++ + T M++ + DER A + PG
Sbjct: 568 EKRKKAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|110832907|ref|YP_691766.1| von Willebrand factor A [Alcanivorax borkumensis SK2]
gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain
[Alcanivorax borkumensis SK2]
Length = 698
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 160/380 (42%), Gaps = 40/380 (10%)
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
R F + + PG D +F++DISGSM K L + + AL KL D F I
Sbjct: 298 RGTFMLSITPGDDLPPITTGSDWVFVLDISGSMNAK-LATLGDGVRQALGKLRGNDRFRI 356
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
V F+ ++ AT + + Q I + + G TN+ L+ A+ L R +
Sbjct: 357 VLFDDRAEELTSGFVDATPNNIRQYTQKI-MQLQSRGGTNLFGGLSLALTPLDADRPTG- 414
Query: 427 IIFLVTDGAVEDERQICDAMKSRLTNGGSICP----RIYTFGIGSYCNHYFLRMLAMISR 482
I LVTDG + + K+R + + R++TF +G+ N L + S
Sbjct: 415 -IVLVTDG-------VANVGKTRQKDFIDLLENHDVRLFTFVMGNSANRPMLTAMTDASN 466
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G+ + + D I Q+ ++ + ++ ID + + ++ P RI + +++
Sbjct: 467 GFAISVSNSDDIAGQILNATSKVTHQAMNDVEID-IDGIRTADLQPQRIGSIYHGRQIVL 525
Query: 543 SGRYQGKFPD--TLKAKGFLGDLSNFVVELKL--QLAKDIPLDRICAKQQI-DLLTAQAW 597
G Y G P TL+A+ G + Q + L+R+ A + D+ +
Sbjct: 526 LGHYWGDGPAKITLRAE-VAGQEKTYKTRFDFPAQGGNNPELERLWAYATVQDMQQEIEY 584
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIV------------------ETDERNNASESP 639
F E+K L++ ++ + ++ G++ YT M+++ E ++R A
Sbjct: 585 FGEEKDLQQGITDLGIEYGLVTHYTSMVVLREEVFKEYGIDRRNTKRREIEKRAKAQRQA 644
Query: 640 GTKKGSKKSDHQKILDSEVP 659
G++ + Q + +S P
Sbjct: 645 SAPSGNRVDNSQPMYNSNRP 664
>gi|395832774|ref|XP_003789430.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Otolemur
garnettii]
Length = 910
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 111/234 (47%), Gaps = 25/234 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++DISGSMQG+ ++ T+ AL L + P D F++V F + S+ A++
Sbjct: 290 KNIVFVIDISGSMQGQKVKQTREALLKILGDMQPDDYFDLVLFGTHVRSWKGSLVQASEA 349
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI-------PIIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L ++ S+ PI+ ++TDG
Sbjct: 350 NLQAARDFVQ-RFSLDEATNLNGGLLRGIEILNKAQESLPELSNHAPILIMLTDGDPTEG 408
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 409 VTDRSQILKNIREAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 460
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
Q+Q + + +L ++ + KD ++ R S +IV+GR
Sbjct: 461 RDATQQLQGFYKQVARPLLVDVELQYPKDA-VLDLTQHRHKQYYEGSEIIVAGR 513
>gi|402853237|ref|XP_003891304.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Papio anubis]
Length = 1215
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 111/222 (50%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVITLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + ++E++ A I G TNI +PL
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEESLAMACDDIQRMRADMGGTNILSPLKWI 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++FL+TDGAV + ++ + +++ + R Y+FGIG H +
Sbjct: 448 IRQPVH-RGHPRLLFLITDGAVNNTGKVLELLRNH-----AFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S+G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSKGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|402859851|ref|XP_003894350.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Papio
anubis]
Length = 891
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 96/201 (47%), Gaps = 10/201 (4%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+V F++DISGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 TPENLQEARTFVK-SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ + G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAID 516
+Q+Q + + +L + ++
Sbjct: 458 LQLQGFYEEVANPLLTGVEME 478
>gi|149034205|gb|EDL88975.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 899
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 115/245 (46%), Gaps = 25/245 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++DISGSM+G+ ++ TK AL L + P D+F++V F + + S+ A+
Sbjct: 285 KNLVFVIDISGSMEGQKVKQTKEALLKILGDMKPVDNFDLVLFGSQVQSWKGSLVPASHA 344
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F G+TN+ L + +E+L ++GS P I+ ++TDG
Sbjct: 345 NLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNRAQGSHPELSSPASILIMLTDGEPTEG 403
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
D QI +A++ R +Y G G + FL ++++ + G+ Y+
Sbjct: 404 ETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFSFLEVMSIENNGWAQRIYED 455
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP 551
Q+Q + + +L ++ + +D + R S ++V+GR
Sbjct: 456 HDATQQLQGFYNQVAHPLLTDVELQYPQDA-VLALTQHRHKQYYDGSEIVVAGRIADHKL 514
Query: 552 DTLKA 556
T KA
Sbjct: 515 STFKA 519
>gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus]
Length = 918
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 126/281 (44%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+ K+V+F++DIS SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 246 FVHYFAP---KNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQDRFNIIGF 302
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + ++ + G T+I L A+++L N
Sbjct: 303 SNRIKMWKDHLLPVTPDNIRNGKIYM-YHLSPTGGTDINGALQAAIKLLNNYVAQNDIED 361
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ G IC I+T GIG + L L++
Sbjct: 362 RSVSLIIFLTDGKPTFGETNTLKILSNTKEATRGQIC--IFTVGIGDDVDFKLLEKLSLE 419
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ D Q+ + + +L++I ID D+ E + P+ + S +
Sbjct: 420 NCGLTRRVHEEDKAGAQLIGFYDEIRTPLLSDIRIDYPPDVVE-HATKTLFPNYFNGSEI 478
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ K D L + + FV+ L +DIP++
Sbjct: 479 VIAGKMVDKKFDQLHVEVTASNSKKFVI-----LKRDIPVE 514
>gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus
musculus]
gi|81873944|sp|Q8BJD1.1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5;
Short=ITI heavy chain H5; Short=ITI-HC5;
Short=Inter-alpha-inhibitor heavy chain 5; Flags:
Precursor
gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus]
gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus]
gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus]
gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus]
gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus]
Length = 952
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 126/281 (44%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+ K+V+F++DIS SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 280 FVHYFAP---KNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQDRFNIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + ++ + G T+I L A+++L N
Sbjct: 337 SNRIKMWKDHLLPVTPDNIRNGKIYM-YHLSPTGGTDINGALQAAIKLLNNYVAQNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ G IC I+T GIG + L L++
Sbjct: 396 RSVSLIIFLTDGKPTFGETNTLKILSNTKEATRGQIC--IFTVGIGDDVDFKLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ D Q+ + + +L++I ID D+ E + P+ + S +
Sbjct: 454 NCGLTRRVHEEDKAGAQLIGFYDEIRTPLLSDIRIDYPPDVVE-HATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ K D L + + FV+ L +DIP++
Sbjct: 513 VIAGKMVDKKFDQLHVEVTASNSKKFVI-----LKRDIPVE 548
>gi|440795664|gb|ELR16781.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 904
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 152/347 (43%), Gaps = 47/347 (13%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLFS--------T 378
++IFIVD SGSM G + K+ L + L L G FNI+ F T +LF
Sbjct: 285 EMIFIVDRSGSMSGSRMNQVKDTLQIFLRSLGEGTMFNIIGFGTSTQHLFREGSVEYNDK 344
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
++E+ATK E + N G TNI PL + + T G +F++TDG V +
Sbjct: 345 NLEIATKHVKE-----MSANL---GGTNILRPLQEVLRAQTK-EGYPRQLFILTDGEVGN 395
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
++ D ++ + R++TFG+G+ + ++ LA G++ +++E ++
Sbjct: 396 TQECVDFVRKH-----AETTRVFTFGVGNEASQDLVKGLAKAGEGFFEFVRSGEAMEEKV 450
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPS--RIPDLSSESPLIVSGRYQGKFPDTLKA 556
+ R L +I + T K + P+ R+P L L+V G +
Sbjct: 451 MRQLHRAMQPALTDITV-TWKGAAARHVQPAPFRLPPLFCGGRLVVYGIIESAAAADEAG 509
Query: 557 K------GFLGDLSNFVVELKLQLAKDIP---LDRICAKQQIDLLTAQAWFSEDKRLEEK 607
G + F +K+ L+K L+++ A+ + L + D+ K
Sbjct: 510 GEVEVVIGAKTAVKPFEAVVKVDLSKASKGTLLNKLAARTLLKDLEEGRSYVHDREGVVK 569
Query: 608 VSK--------MSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSK 646
K +S+ +GVL ++T + VE ER+ A+E GT K K
Sbjct: 570 AGKDVVAEMVRVSLASGVLSKHTAFVAVE--ERDEATE--GTMKVRK 612
>gi|338721864|ref|XP_003364438.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Equus caballus]
Length = 1114
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G + K+A+ VAL L P FN++ F S + ++E+
Sbjct: 361 EFIFLIDRSGSMSGTNILLVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEES 420
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+ A I G TNI +PL K + RG ++FL+TDGAV + ++ + ++
Sbjct: 421 LAVACDNIQRMRADMGGTNILSPL-KWITRQPVHRGHPRLLFLITDGAVNNTGKVLELVR 479
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+ + R Y+FGIG H ++ LA +S+G + + ++ +M K + +
Sbjct: 480 NH-----AFSTRCYSFGIGPTVCHRLVKGLATVSKGSAEFLAEGERLQPKMVKSLKKAMA 534
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 535 PVLSDVTVE 543
>gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus]
Length = 952
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 126/281 (44%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+ K+V+F++DIS SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 280 FVHYFAP---KNLPPLPKNVVFVLDISASMVGAKLQQTREALVTILNDLRPQDRFNIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + ++ + G T+I L A+++L N
Sbjct: 337 SNRIKMWKDHLLPVTPDNIRNGKIYM-YHLSPTGGTDINGALQAAIKLLNNYVAQNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ G IC I+T GIG + L L++
Sbjct: 396 RSVSLIIFLTDGKPTFGETNTLKILSNTKEATRGQIC--IFTVGIGDDVDFKLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ D Q+ + + +L++I ID D+ E + P+ + S +
Sbjct: 454 NCGLTRRVHEEDKAGAQLIGFYDEIRTPLLSDIRIDYPPDVVE-HATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ K D L + + FV+ L +DIP++
Sbjct: 513 VIAGKMVDKKFDQLHVEVTASNSKKFVI-----LKRDIPVE 548
>gi|410918621|ref|XP_003972783.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H2-like [Takifugu rubripes]
Length = 928
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 131/291 (45%), Gaps = 31/291 (10%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
+K+++F++D+SGSM G ++ T A+ L L D F+I+ FN +S + +
Sbjct: 259 LRKNIVFVIDVSGSMWGVKMKQTVEAMQAILDDLTIDDYFSIIDFNHNVRCWSEELVPGS 318
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDG--A 435
V+ A ++I N G TNI L +AV+ML + S+ +I LV+DG
Sbjct: 319 SIQVKDAKKYIQ-NIRPNGGTNINEALMRAVQMLVRASNQGLIDPRSVSMIILVSDGDPT 377
Query: 436 VED------ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V + ++ + AM+ + +++ GIG ++ FL +A +RG Y
Sbjct: 378 VGEIKLSTIQKNVKRAMREEFS--------LFSLGIGFDVDYDFLERIATENRGIAQRIY 429
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGK 549
+ Q+Q + R S +L IAI D E+ +R S S L+V+G+
Sbjct: 430 ANHNAADQLQSFYRRVSSPLLRRIAIQ-FPDDSVSEVTQNRFDKYFSGSELVVAGKVLPS 488
Query: 550 FPDTLKAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQIDL--LTAQAW 597
+TL + L S +++ LQ D LD AKQ+ Q W
Sbjct: 489 ESNTLTS---LITASAARLDITLQTDVDTTELDAELAKQEHSFTGFARQMW 536
>gi|313151177|ref|NP_001100761.2| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Rattus
norvegicus]
gi|149034204|gb|EDL88974.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform
CRA_a [Rattus norvegicus]
gi|187469475|gb|AAI66831.1| Itih1 protein [Rattus norvegicus]
Length = 904
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 115/245 (46%), Gaps = 25/245 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++DISGSM+G+ ++ TK AL L + P D+F++V F + + S+ A+
Sbjct: 285 KNLVFVIDISGSMEGQKVKQTKEALLKILGDMKPVDNFDLVLFGSQVQSWKGSLVPASHA 344
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F G+TN+ L + +E+L ++GS P I+ ++TDG
Sbjct: 345 NLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNRAQGSHPELSSPASILIMLTDGEPTEG 403
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
D QI +A++ R +Y G G + FL ++++ + G+ Y+
Sbjct: 404 ETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFSFLEVMSIENNGWAQRIYED 455
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFP 551
Q+Q + + +L ++ + +D + R S ++V+GR
Sbjct: 456 HDATQQLQGFYNQVAHPLLTDVELQYPQDA-VLALTQHRHKQYYDGSEIVVAGRIADHKL 514
Query: 552 DTLKA 556
T KA
Sbjct: 515 STFKA 519
>gi|358411195|ref|XP_003581958.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1-like, partial [Bos taurus]
Length = 708
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 99/201 (49%), Gaps = 10/201 (4%)
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
P K+R F IF+VD SGSM G ++ K+A+ VAL L P FN++ F
Sbjct: 350 PNLRKTRGEF----IFLVDRSGSMNGTNIQCVKDAMLVALKSLVPTCLFNVIGFGSTFKT 405
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
S + +E + A I G TNI +PL K V RG ++FL+TDGA
Sbjct: 406 LFPSSQTYNEENLAMACDSIQKMRADMGGTNILSPL-KWVIRQPVLRGCPRLLFLITDGA 464
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V + ++ + +++ + R Y+FGIG H ++ LA +S+G + + ++
Sbjct: 465 VNNTGKVLELVRNH-----AFSTRCYSFGIGPNVCHRLVKGLATVSKGSAEFLEEGERLQ 519
Query: 496 IQMQKLFTRGFSSVLANIAID 516
+M K + + VL+++ ++
Sbjct: 520 PKMVKSLKKAMAPVLSDVTVE 540
>gi|359063543|ref|XP_003585859.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1-like [Bos taurus]
Length = 1173
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 99/201 (49%), Gaps = 10/201 (4%)
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
P K+R F IF+VD SGSM G ++ K+A+ VAL L P FN++ F
Sbjct: 350 PNLRKTRGEF----IFLVDRSGSMNGTNIQCVKDAMLVALKSLVPTCLFNVIGFGSTFKT 405
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
S + +E + A I G TNI +PL K V RG ++FL+TDGA
Sbjct: 406 LFPSSQTYNEENLAMACDSIQKMRADMGGTNILSPL-KWVIRQPVLRGCPRLLFLITDGA 464
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V + ++ + +++ + R Y+FGIG H ++ LA +S+G + + ++
Sbjct: 465 VNNTGKVLELVRNH-----AFSTRCYSFGIGPNVCHRLVKGLATVSKGSAEFLEEGERLQ 519
Query: 496 IQMQKLFTRGFSSVLANIAID 516
+M K + + VL+++ ++
Sbjct: 520 PKMVKSLKKAMAPVLSDVTVE 540
>gi|355744984|gb|EHH49609.1| hypothetical protein EGM_00299 [Macaca fascicularis]
Length = 1190
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 111/222 (50%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + ++E++ A I G TNI +PL K
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEESLAMACDDIQRMRADMGGTNILSPL-KW 446
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ RG ++FL+TDGAV + ++ + +++ + R Y+FGIG H +
Sbjct: 447 IFRQPVHRGHPRLLFLITDGAVNNTGKVLELLRNH-----AFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S+G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSKGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|291407509|ref|XP_002720066.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Oryctolagus
cuniculus]
Length = 1320
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 118/252 (46%), Gaps = 23/252 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G ++ TK A+ V LS L D FNI++F
Sbjct: 268 FIHYFAP---RGLPPVEKNVVFVIDVSGSMFGTKMQQTKKAMNVILSDLQANDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ + A G T+I A L +A +L +S
Sbjct: 325 SDTVSVWRAGGSIQATSQNVHSAKNYLD-HMEAAGWTDINAALLEAASVLNHSNQEPGRS 383
Query: 423 ---GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
G P+I +TDG V I ++ L + S +++ G + LR
Sbjct: 384 PGVGRTPLIIFLTDGEPTAGVTTPSVILSNVRQALGHRVS----LFSLAFGDDADFPLLR 439
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
L++ +RG Y+ +Q++ L+ +LA++ +D L L + + P+
Sbjct: 440 RLSLENRGVARRIYEDTDAALQLEGLYEEISMPLLADVRLDYLGGLVGASPW-ALFPNYF 498
Query: 536 SESPLIVSGRYQ 547
S L+V+G+ Q
Sbjct: 499 GGSELVVAGQVQ 510
>gi|47229095|emb|CAG03847.1| unnamed protein product [Tetraodon nigroviridis]
Length = 608
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 130/314 (41%), Gaps = 56/314 (17%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V+ + G V +HD F Y P K V KDVIF++D+SG
Sbjct: 211 DFILQYDVALRDLLGEV-----QVHD----GYFVHYFAP---KGLPVVPKDVIFVIDVSG 258
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G ++ TK A++ L+ L GD FNI+ F+ + + + AT++ V A +++
Sbjct: 259 SMIGTKIQQTKQAMSTILADLREGDHFNIITFSDQVRTWKRGRTVRATRQNVRDAKEFVR 318
Query: 397 INFIAGG-----------------------------STNICAPLTKAVEMLT-------N 420
IA G TNI A L A +++
Sbjct: 319 -RIIAEGCESEATEHHLTASLCLFLLLYEFSFSFPSGTNINAALLSAAQLINPPSSSRHL 377
Query: 421 SRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGI--GSYCNHYFLRMLA 478
S +P++ +TDG D + LTN FG+ G + L+ LA
Sbjct: 378 SSHRVPLVIFLTDGEATIGVTAGDTI---LTNAKKALGSASLFGLAFGDDADFLLLKRLA 434
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ +RG Y+ +Q++ + S +L+++ + L D F++ S P+ S
Sbjct: 435 LDNRGVARMVYEDADAALQLKGFYDEVASPLLSDVQLSYLDD-QAFDITRSLFPNYFQGS 493
Query: 539 PLIVSGRYQGKFPD 552
L+V+G+ + D
Sbjct: 494 ELVVAGKVKPGIKD 507
>gi|297282389|ref|XP_001108622.2| PREDICTED: von Willebrand factor A domain-containing protein
5B1-like [Macaca mulatta]
Length = 1215
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 101/191 (52%), Gaps = 10/191 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D S SM G + K+A+ VAL L P FNI+ F G T+ LF +S + ++
Sbjct: 361 EFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGF-GSTFKSLFPSS-QTYSE 418
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
E++ A I G TNI +PL K + RG ++FL+TDGAV + ++ +
Sbjct: 419 ESLAMACDDIQRMRADMGGTNILSPL-KWIFRQPVHRGHPRLLFLITDGAVNNTGKVLEL 477
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+++ + R Y+FGIG H ++ LA +S+G + + ++ +M K +
Sbjct: 478 LRNH-----AFSTRCYSFGIGPNVCHRLVKGLASVSKGSAEFLMEGERLQPKMVKSLKKA 532
Query: 506 FSSVLANIAID 516
+ VL+++ ++
Sbjct: 533 MAPVLSDVTVE 543
>gi|390465468|ref|XP_002807018.2| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1 [Callithrix jacchus]
Length = 1101
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 111/222 (50%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPSLRKAQGEFIFLIDRSSSMSGISVHRVKDAILVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + +E++ A I G TNI +PL
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYNEESLAMACDNIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + +G ++FL+TDGAV + ++ + +++ ++ R Y+FGIG H +
Sbjct: 448 IRQPVH-QGHPRLLFLITDGAVSNTGKVLELVRNHASST-----RCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S+G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSKGSAEXLMEGERLQTKMVKSLKKAMAPVLSDVTVE 543
>gi|261878614|ref|NP_001159906.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform b [Homo
sapiens]
Length = 769
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 149 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 208
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 209 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 267
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 268 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 319
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 320 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 378
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 379 SSFKADVQAHG 389
>gi|221042234|dbj|BAH12794.1| unnamed protein product [Homo sapiens]
Length = 769
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 149 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 208
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 209 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 267
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 268 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 319
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 320 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 378
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 379 SSFKADVQAHG 389
>gi|355557627|gb|EHH14407.1| hypothetical protein EGK_00328 [Macaca mulatta]
Length = 1220
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 101/191 (52%), Gaps = 10/191 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D S SM G + K+A+ VAL L P FNI+ F G T+ LF +S + ++
Sbjct: 361 EFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGF-GSTFKSLFPSS-QTYSE 418
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
E++ A I G TNI +PL K + RG ++FL+TDGAV + ++ +
Sbjct: 419 ESLAMACDDIQRMRADMGGTNILSPL-KWIFRQPVHRGHPRLLFLITDGAVNNTGKVLEL 477
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+++ + R Y+FGIG H ++ LA +S+G + + ++ +M K +
Sbjct: 478 LRNH-----AFSTRCYSFGIGPNVCHRLVKGLASVSKGSAEFLMEGERLQPKMVKSLKKA 532
Query: 506 FSSVLANIAID 516
+ VL+++ ++
Sbjct: 533 MAPVLSDVTVE 543
>gi|327265809|ref|XP_003217700.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Anolis carolinensis]
Length = 885
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/270 (24%), Positives = 117/270 (43%), Gaps = 24/270 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+V F++D+SGSM G + K A+ + L D FNI+ F
Sbjct: 268 FVHFFAP---KNLAHLPKNVAFVIDVSGSMWGSKIRQAKEAMIKIVEDLKEDDHFNIILF 324
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
E + + AT E V+ A +IG N G TN L +EML N+
Sbjct: 325 ESEVRKWKDGIIKATPENVQEAKYFIG-NITESGLTNFNGGLMAGIEMLNNAHKLKIVPE 383
Query: 424 -SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPR----IYTFGIGSYCNHYFLRMLA 478
S + +++DG E + + + R+ + +Y+ G G ++ FL L+
Sbjct: 384 RSASLTIMLSDG----EANVGETDQFRIQENAKNASQGKYPLYSLGFGYNLDYGFLERLS 439
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
++ G YD +Q+Q + + +L ++A++ +++ ++ + S
Sbjct: 440 KVNNGVARRIYDDSDAALQLQGFYDEVANPLLTDVALEYPENVIS-DVTENNFKHYYDGS 498
Query: 539 PLIVSGRYQ----GKFPDTLKAKGFLGDLS 564
L+V+GR +KA G L DL+
Sbjct: 499 ELVVAGRITDNDLNSITAEVKAHGALKDLT 528
>gi|449473497|ref|XP_004176342.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 2
[Taeniopygia guttata]
Length = 891
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 156/344 (45%), Gaps = 51/344 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K++IF++D SGSM G+ +E TK AL L + D FNI+ F+ E + ++ AT E
Sbjct: 282 KNIIFVLDTSGSMSGREIEQTKEALLKILDDIKEDDFFNIILFDSEISTWKETLIKATPE 341
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG----A 435
++ A +++ + A G TN+ L + +++L + + S II ++TDG
Sbjct: 342 NLDEARKFVQ-HISAQGLTNLHGGLMRGIDILNAAHEENLVPKRSASIIIMLTDGQPNVG 400
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ + +I +A+K + + +Y G GS ++ FL +A+ ++G Y
Sbjct: 401 LSNTHEIENAVKKAIDGRYT----LYNLGFGSGVDYGFLERMALENKGLARRIYPDSDAA 456
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP-SRIPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L ++ ++ YP + I DL++ S ++V+GR+
Sbjct: 457 LQLQGFYDEVSNPMLIDVELN----------YPENEISDLTTNSFKHFYDGSEIVVAGRF 506
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP---------LDRICAKQQIDLLT 593
Q ++ +G LS + Q A+ ++R+ A I+ L
Sbjct: 507 VDSNQNHLSVDVRGEGAHNALSYTTEQGAEQTAQAFQEQQYIFGEYIERLWAYLTIEQLL 566
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
+ A E + L + +S+ + T M++ + +E +N
Sbjct: 567 EKRITATGEEKENLTAQALALSLLYKFVTPLTSMVVTKPEENDN 610
>gi|119585664|gb|EAW65260.1| inter-alpha (globulin) inhibitor H1, isoform CRA_b [Homo sapiens]
Length = 893
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|119585665|gb|EAW65261.1| inter-alpha (globulin) inhibitor H1, isoform CRA_c [Homo sapiens]
Length = 911
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|3024063|sp|P97280.1|ITIH3_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3;
Short=ITI heavy chain H3; Short=ITI-HC3;
Short=Inter-alpha-inhibitor heavy chain 3; Flags:
Precursor
gi|1694692|dbj|BAA13940.1| inter-alpha-trypsin inhibitor heavy chain 3 [Mesocricetus auratus]
Length = 886
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 77/345 (22%), Positives = 151/345 (43%), Gaps = 32/345 (9%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++DISGSM G+ ++ T+ AL L + D N + F+ + S+ A
Sbjct: 277 VVPKNIVFVIDISGSMAGRKIQQTRVALLKILDDMKQDDYLNFILFSTGVTTWKDSLVQA 336
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
T +E A ++ + G TNI L + + MLT++R S II ++TDG
Sbjct: 337 TPANLEEARTFVR-SISDQGMTNINDGLLRGIRMLTDAREQHTVPERSTSIIIMLTDGDA 395
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + +Y G G+ N+ FL +A+ + G Y+ +
Sbjct: 396 NTGESRPEKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIYEDSDANL 455
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----QGKFPD 552
Q+Q + + +L N+ ++ ++ ++ + P S V+GR F
Sbjct: 456 QLQGFYEEVANPLLTNVEVEYPENAI-LDLTKNSYPHFYDGSETAVAGRLADSDMNNFKA 514
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKDIPL-----------DRICAKQQIDLLTAQ---AWF 598
+K G L DL+ F E+ ++ D L +R+ A I+ L + A
Sbjct: 515 DVKGHGALNDLT-FTEEVDMK-EMDAALKEQGYIFGNYIERLWAYLTIEQLLEKRKNAHG 572
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIV---ETDERNNASESPG 640
E + L + ++S++ + T M++ + +++ + ++ PG
Sbjct: 573 EEKENLTAQALELSLKYHFVTPLTPMVVTKPEDNEDQTSIADKPG 617
>gi|449473501|ref|XP_002191423.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1
[Taeniopygia guttata]
Length = 839
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 156/344 (45%), Gaps = 51/344 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K++IF++D SGSM G+ +E TK AL L + D FNI+ F+ E + ++ AT E
Sbjct: 240 KNIIFVLDTSGSMSGREIEQTKEALLKILDDIKEDDFFNIILFDSEISTWKETLIKATPE 299
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG----A 435
++ A +++ + A G TN+ L + +++L + + S II ++TDG
Sbjct: 300 NLDEARKFVQ-HISAQGLTNLHGGLMRGIDILNAAHEENLVPKRSASIIIMLTDGQPNVG 358
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ + +I +A+K + + +Y G GS ++ FL +A+ ++G Y
Sbjct: 359 LSNTHEIENAVKKAIDGRYT----LYNLGFGSGVDYGFLERMALENKGLARRIYPDSDAA 414
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP-SRIPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L ++ ++ YP + I DL++ S ++V+GR+
Sbjct: 415 LQLQGFYDEVSNPMLIDVELN----------YPENEISDLTTNSFKHFYDGSEIVVAGRF 464
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP---------LDRICAKQQIDLLT 593
Q ++ +G LS + Q A+ ++R+ A I+ L
Sbjct: 465 VDSNQNHLSVDVRGEGAHNALSYTTEQGAEQTAQAFQEQQYIFGEYIERLWAYLTIEQLL 524
Query: 594 AQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNN 634
+ A E + L + +S+ + T M++ + +E +N
Sbjct: 525 EKRITATGEEKENLTAQALALSLLYKFVTPLTSMVVTKPEENDN 568
>gi|325955198|ref|YP_004238858.1| OmpA/MotB domain-containing protein [Weeksella virosa DSM 16922]
gi|323437816|gb|ADX68280.1| OmpA/MotB domain protein [Weeksella virosa DSM 16922]
Length = 753
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 165/392 (42%), Gaps = 61/392 (15%)
Query: 176 IFTLTLPQIDGGSYLSIRLRWSQKLSY---------------RDGEFSVNVPFKF--PEY 218
+ T+ L + G L IR++W++ + + R F+V F+F EY
Sbjct: 124 LMTVDLSDLPAGKKLKIRIKWTKIVDHNKKDYNFLLSSPIVKRTKNFTVTRRFQFDKKEY 183
Query: 219 VTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNID 278
PK +N+ G L + H Q+ +V +L ++ES V ID
Sbjct: 184 T-------PKNFLYKINILGENGQ--LSRSLPAHF-QMINNVNRLIKTFESSVFFGHTID 233
Query: 279 FDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL-----PGTAKSRKVFKKDVIFIV 333
+ Y ++ +Q+ L D C Y L P + K+ ++ IF++
Sbjct: 234 LYYHYFMTE--------MQTNLLRFDDGN---CAYFLGNIVPPVKIEQEKLKPREYIFLL 282
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
D SGSM+G LE TK + + KL + FNI FN + +T ATK+ +E+A +
Sbjct: 283 DTSGSMRGFALEGTKKMMLSVIEKLKESEKFNIYYFNKHSNQLATHSVFATKQNIEKAKE 342
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
+ +N G+ + L K ++ + + ++ + +DG ++ + I +KS L N
Sbjct: 343 ML-LNLEGSGNMKLNEALKKVHQIPVDYSFN-RLVVIASDGRIDPNQNIHLDIKSNLKNA 400
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL-----DSIEIQMQKLFTRGFSS 508
+ + GIG + L L GY L ++I+ +++ + +
Sbjct: 401 -----QYFVLGIGKVVDKNNLNYL-----GYTTGISPLIIESSENIDTKLKAFESMILTP 450
Query: 509 VLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+L NI + + K ++ E +P S+ P+
Sbjct: 451 LLRNIEVQS-KTINLEETFPKNFNGFLSDRPI 481
>gi|269128710|ref|YP_003302080.1| Vault protein inter-alpha-trypsin domain-containing protein
[Thermomonospora curvata DSM 43183]
gi|268313668|gb|ACZ00043.1| Vault protein inter-alpha-trypsin domain protein [Thermomonospora
curvata DSM 43183]
Length = 795
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 115/558 (20%), Positives = 226/558 (40%), Gaps = 68/558 (12%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALG-------ENDGAGKSASVETGSFLKPNIFTLT 180
P+ D+ ++ + E + + L G E AG A++ +P++FT+
Sbjct: 69 PLPDRAAVTRMRMETADGTVEAALKERGRARADYQEAVAAGHRAALAEED--RPDVFTMQ 126
Query: 181 LPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYV-------------------- 219
+ I G +++RL Q L Y DG + P P Y+
Sbjct: 127 VGNIPPGERVTVRLTLDQPLPYEDGAATFRFPLVVAPRYIPGTALPDERAGDGIAADTDA 186
Query: 220 TPAIKKI-----------PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYE 268
P +I P R + +++ +S H + R G+ +
Sbjct: 187 VPDASRISPPVLLPGFPDPVRLSLEADIDPAGFPLGEIRSSLHVVAADTRGSGRTTVRLQ 246
Query: 269 SEVLKWSNIDFDFSYTVS---PSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF 325
+D DF ++ P V L +P + F + +LP + +
Sbjct: 247 ----PGERLDRDFVLRLAYGRPEQAAASVTL-TPDAEG--ESGTFTLTVLPPSERC-APR 298
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+DV+ ++D SGSM G + + A A + L D F +++F+ + ++
Sbjct: 299 PRDVVILLDRSGSMHGWKMVAARRAAARIVDTLTGRDRFAVLSFD-DMVERPAGLDGGLS 357
Query: 386 EAVERAHQWIGINFIAG----GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
A +R +++ + +AG G T + APL + +L ++ G ++ L+TDG V +E Q
Sbjct: 358 PATDR-NRFRAVEHLAGLQARGGTELAAPLREGAALLDDA-GRDRVLVLITDGQVGNEDQ 415
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ A+ NG RI+ GI N FL LA +G D ++ M+ +
Sbjct: 416 LL-ALIDPFLNGL----RIHAVGIDQAVNAGFLGRLATAGQGRLELVESEDRLDEAMEHI 470
Query: 502 FTRGFSSVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD-TLKAKGF 559
R + +L ++++ + ++ + P R + PL+V+GRY D +L +
Sbjct: 471 HHRINAPLLTGLSLEWSGVRVEPGTLTPPRPGAVFPGVPLVVAGRYLRAADDCSLTVRAV 530
Query: 560 LGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLC 619
G+ + + + + + A+ + L + + + LE+++ + S++ GVLC
Sbjct: 531 TGEGAPWQCRAVVSRTEAGAATSVWARAHLRDLEDRYAVTGGRDLEQQIVETSLRFGVLC 590
Query: 620 EYTRMIIVETDERNNASE 637
+T + + D+R A +
Sbjct: 591 RFTAFVAI--DDRTVAGD 606
>gi|33989|emb|CAA45188.1| inter-alpha-trypsin inhibitor heavy chain ITIH1 [Homo sapiens]
Length = 911
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|390342800|ref|XP_787298.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like,
partial [Strongylocentrotus purpuratus]
Length = 511
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 109/225 (48%), Gaps = 24/225 (10%)
Query: 290 IFGGVLLQSPSLHDVDQREM------FCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGK 342
I G +++ +HDV + F Y P G K+RK +V+F++D+SGSM+G+
Sbjct: 296 IMGDFIIKYDVMHDVKAGHLQIVNGYFVHYFSPVGLPKTRK----NVVFVIDVSGSMRGR 351
Query: 343 PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF-STSMELATKEAVERAHQWIGINFIA 401
++ TK A L + P D NIV F + ++ S M AT + + A + + A
Sbjct: 352 KIDQTKRAFTTILDDVRPIDRINIVLFESDVRVWRSNQMVEATSDNIAAAKRHVN-RIRA 410
Query: 402 GGSTNICAPLTKAVEMLTNSRG--SIPIIFLVTD-----GAVEDERQICDAMKSRLTNGG 454
GG TN+ L AV++L ++P+I ++TD G+V+ +I R+TN
Sbjct: 411 GGGTNLYDGLRNAVDLLMEHGNGEAMPLIIMLTDGQPTSGSVKSTSEII----QRITNLI 466
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+++ G+ + FL L++ ++ Y+ S +QM+
Sbjct: 467 DGRLSLFSVSFGNGVDFSFLEKLSLSNQALARKVYEDSSASLQMK 511
>gi|261878616|ref|NP_001159907.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo
sapiens]
gi|261878618|ref|NP_001159908.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo
sapiens]
Length = 623
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 117/253 (46%), Gaps = 29/253 (11%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 1 MNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQAS 60
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ ++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 61 EANLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPT 119
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 120 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIY 171
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY--- 546
+ Q+Q +++ +L ++ + +D + + S ++V+GR
Sbjct: 172 EDHDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADN 230
Query: 547 -QGKFPDTLKAKG 558
Q F ++A G
Sbjct: 231 KQSSFKADVQAHG 243
>gi|432099375|gb|ELK28615.1| Inter-alpha-trypsin inhibitor heavy chain H5-like protein [Myotis
davidii]
Length = 1085
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 163/383 (42%), Gaps = 51/383 (13%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G ++ TK A+ V L L D FNI++F
Sbjct: 33 FVHYFAP---RGLPPVEKNVVFVIDVSGSMFGTKMKQTKKAMNVILGDLRANDYFNIISF 89
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-----RG 423
+ ++ + AT + V A ++G + A G T+I L A +L +S RG
Sbjct: 90 SDTVSVWKAGGSIQATIQNVHSAKDYLG-HMEADGWTDINTALLAAASVLNHSNQEPGRG 148
Query: 424 S----IPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
S IP+I +TDG V I ++ L N + +++ G + LR
Sbjct: 149 SSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQALDNRVA----LFSLAFGDDADFPLLR 204
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
L++ +RG Y+ +Q+Q L+ +LA++ +D L L + + P+
Sbjct: 205 RLSLENRGAARRIYEDTDAALQLQGLYEEISMPLLADVRLDYLGGLVGASPW-ALFPNYF 263
Query: 536 SESPLIVSGRYQG---KFPDTLKAKGFLGDL----SNFVVELKLQLAKDIP--------- 579
S L+V+G Q + L A+G G L + V Q A P
Sbjct: 264 GGSELVVAGLVQPGEQELGIHLAARGPKGQLLVARHSEVATNSSQKAFGCPGEPAPNVAH 323
Query: 580 -LDRICAKQQIDLLTAQAWFSEDKR----LEEKVSKMSVQTGVLCEYTRMIIVETDE--- 631
+ R+ A I L + + D L KV +S++ + T +++V+ E
Sbjct: 324 FIRRLWAYVTIGELLEARFRARDTTTRHLLAAKVLNLSLEYNFVTPLTSLVMVQPKEASE 383
Query: 632 ----RNNASESPGTKKGSKKSDH 650
+ + + PGT S S H
Sbjct: 384 KARRQTSTTTGPGTIMSSSTSRH 406
>gi|20381126|gb|AAH28814.1| Itih1 protein [Mus musculus]
Length = 911
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 119/262 (45%), Gaps = 28/262 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+++F++DISGSM+G+ + TK AL L + P D+F++V F
Sbjct: 278 FTHFFAP---KNLTNMSKNLVFVIDISGSMEGQKVRQTKEALLKILEDMRPVDNFDLVLF 334
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ + S+ + ++ A ++ F G+TN+ L + +E+L ++GS P
Sbjct: 335 GSKVQSWKGSLVPVSNANLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELS 393
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG D QI +A++ R +Y G G + FL
Sbjct: 394 SPASILIMLTDGEPTEGETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFSFL 445
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+++ + G+ Y+ Q+Q + + + +L ++ + +D + R
Sbjct: 446 EVMSTENNGWAQRIYEDHDATQQLQGFYNQVANPLLTDVELQYPQDA-VLALTQHRHKQY 504
Query: 535 SSESPLIVSGRYQGKFPDTLKA 556
S ++V+GR +T KA
Sbjct: 505 YDGSEIVVAGRIANHKLNTFKA 526
>gi|156119625|ref|NP_002206.2| inter-alpha-trypsin inhibitor heavy chain H1 isoform a precursor
[Homo sapiens]
gi|2851501|sp|P19827.3|ITIH1_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1;
Short=ITI heavy chain H1; Short=ITI-HC1;
Short=Inter-alpha-inhibitor heavy chain 1; AltName:
Full=Inter-alpha-trypsin inhibitor complex component
III; AltName: Full=Serum-derived hyaluronan-associated
protein; Short=SHAP; Flags: Precursor
gi|825630|emb|CAA49279.1| inter-alpha-trypsin inhibitor heavy chain H1 [Homo sapiens]
gi|62550746|gb|AAH69464.1| Inter-alpha (globulin) inhibitor H1 [Homo sapiens]
gi|119585663|gb|EAW65259.1| inter-alpha (globulin) inhibitor H1, isoform CRA_a [Homo sapiens]
gi|158258937|dbj|BAF85439.1| unnamed protein product [Homo sapiens]
Length = 911
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|189054356|dbj|BAG36876.1| unnamed protein product [Homo sapiens]
Length = 911
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae]
gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae]
Length = 863
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 128/288 (44%), Gaps = 26/288 (9%)
Query: 285 VSPSHIFGGVLLQSPSLHDVDQREM------FCMYLLPGTAKSRKVFKKDVIFIVDISGS 338
+SPS I G L++ D+ ++ F Y P V K+++FI+D SGS
Sbjct: 190 MSPSGISGDFLVRYDVKRDLSVGDIQIVNGYFVHYFAPS---GLPVVPKNIVFIIDKSGS 246
Query: 339 MQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS-TSMELATKEAVERAHQWIGI 397
M G + TK A+ L L D FN++ F+ + ++ M LAT+E +E A ++
Sbjct: 247 MGGTKMRQTKQAMNTILKDLRDHDRFNVMPFSYSSTMWRPNEMVLATRENIESARTYVRR 306
Query: 398 NFIAGGSTNICAPLTKAVEMLT-------NSRGSIPIIFLVTDG----AVEDERQICDAM 446
+ AGG TNI + A ++L NS S +I +TDG R I +
Sbjct: 307 SINAGGGTNINQAIIDAADLLRRVTDDQPNSPRSASLIIFLTDGLPSVGESKPRNIMVNV 366
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
K+ + S ++ G G + FL +A+ +RG Y+ +Q++ +
Sbjct: 367 KNAIREQVS----LFCLGFGKDVDFPFLEKMALENRGLARRIYEDSDAALQLKGFYDEVA 422
Query: 507 SSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
+ +L ++ + ++L ++ P S L+V+GR + TL
Sbjct: 423 TPLLFDVQMRYPENLVT-DLTPVDFNTYFDGSELVVAGRLSNEVGRTL 469
>gi|825681|emb|CAA34346.1| inter-alpha-trypsin inhibitor C-terminal [Homo sapiens]
Length = 837
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 217 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 276
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 277 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 335
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 336 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 387
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 388 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 446
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 447 SSFKADVQAHG 457
>gi|221044458|dbj|BAH13906.1| unnamed protein product [Homo sapiens]
Length = 623
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 117/253 (46%), Gaps = 29/253 (11%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 1 MNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQAS 60
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ ++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 61 EANLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPT 119
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 120 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIY 171
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY--- 546
+ Q+Q +++ +L ++ + +D + + S ++V+GR
Sbjct: 172 EDHDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADN 230
Query: 547 -QGKFPDTLKAKG 558
Q F ++A G
Sbjct: 231 KQSSFKADVQAHG 243
>gi|15488640|gb|AAH13465.1| inter-alpha trypsin inhibitor, heavy chain 1, partial [Mus
musculus]
Length = 909
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 119/262 (45%), Gaps = 28/262 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+++F++DISGSM+G+ + TK AL L + P D+F++V F
Sbjct: 276 FTHFFAP---KNLTNMSKNLVFVIDISGSMEGQKVRQTKEALLKILEDMRPVDNFDLVLF 332
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ + S+ + ++ A ++ F G+TN+ L + +E+L ++GS P
Sbjct: 333 GSKVQSWKGSLVPVSNANLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELS 391
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG D QI +A++ R +Y G G + FL
Sbjct: 392 SPASILIMLTDGEPTEGETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFSFL 443
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+++ + G+ Y+ Q+Q + + + +L ++ + +D + R
Sbjct: 444 EVMSTENNGWAQRIYEDHDATQQLQGFYNQVANPLLTDVELQYPQDA-VLALTQHRHKQY 502
Query: 535 SSESPLIVSGRYQGKFPDTLKA 556
S ++V+GR +T KA
Sbjct: 503 YDGSEIVVAGRIANHKLNTFKA 524
>gi|7441761|pir||JC5576 inter-alpha-trypsin inhibitor heavy chain 3 - golden hamster
Length = 889
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 117/261 (44%), Gaps = 14/261 (5%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K+++F++DISGSM G+ ++ T+ AL L + D N + F+ + S+ A
Sbjct: 280 VVPKNIVFVIDISGSMAGRKIQQTRVALLKILDDMKQDDYLNFILFSTGVTTWKDSLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAV 436
T +E A ++ + G TNI L + + MLT++R S II ++TDG
Sbjct: 340 TPANLEEARTFVR-SISDQGMTNINDGLLRGIRMLTDAREQHTVPERSTSIIIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + +Y G G+ N+ FL +A+ + G Y+ +
Sbjct: 399 NTGESRPEKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIYEDSDANL 458
Query: 497 QMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----QGKFPD 552
Q+Q + + +L N+ ++ ++ ++ + P S V+GR F
Sbjct: 459 QLQGFYEEVANPLLTNVEVEYPENAI-LDLTKNSYPHFYDGSETAVAGRLADSDMNNFKA 517
Query: 553 TLKAKGFLGDLSNFVVELKLQ 573
+K G L DL+ F E+ ++
Sbjct: 518 DVKGHGALNDLT-FTEEVDMK 537
>gi|397495923|ref|XP_003818793.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 2
[Pan paniscus]
Length = 769
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 149 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 208
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 209 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 267
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 268 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 319
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 320 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 378
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 379 SSFKADVQAHG 389
>gi|695632|emb|CAA49841.1| inter-alpha-inhibitor H1 chain [Mus musculus]
Length = 907
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 119/262 (45%), Gaps = 28/262 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ K+++F++DISGSM+G+ + TK AL L + P D+F++V F
Sbjct: 274 FTHFFAP---KNLTNMSKNLVFVIDISGSMEGQKVRQTKEALLKILEDMRPVDNFDLVLF 330
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP--- 426
+ + S+ + ++ A ++ F G+TN+ L + +E+L ++GS P
Sbjct: 331 GSKVQSWKGSLVPVSNANLQAAQDFVR-RFSLAGATNLNGGLLRGIEILNKAQGSHPELS 389
Query: 427 ----IIFLVTDG----AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
I+ ++TDG D QI +A++ R +Y G G + FL
Sbjct: 390 SPASILIMLTDGEPTEGETDRSQILKNVRNAIRGRFP--------LYNLGFGHDLDFSFL 441
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDL 534
+++ + G+ Y+ Q+Q + + + +L ++ + +D + R
Sbjct: 442 EVMSTENNGWAQRIYEDHDATQQLQGFYNQVANPLLTDVELQYPQDA-VLALTQHRHKQY 500
Query: 535 SSESPLIVSGRYQGKFPDTLKA 556
S ++V+GR +T KA
Sbjct: 501 YDGSEIVVAGRIANHKLNTFKA 522
>gi|221042196|dbj|BAH12775.1| unnamed protein product [Homo sapiens]
Length = 645
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|221042216|dbj|BAH12785.1| unnamed protein product [Homo sapiens]
Length = 677
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 HDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|444728070|gb|ELW68534.1| von Willebrand factor A domain-containing protein 5B1 [Tupaia
chinensis]
Length = 1267
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 96/191 (50%), Gaps = 10/191 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D S SM G ++ K+A+ VAL L P FN++ F G T+ LF +S + +
Sbjct: 372 EFIFLIDRSSSMNGANMDRVKDAMLVALKSLMPTCLFNVIGF-GSTFKTLFPSSQTYSEE 430
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
G+ GG TNI APL + + +G ++FL+TDG V + ++ +
Sbjct: 431 SLAMACDHIQGMQADMGG-TNILAPLKWIIRQPVH-QGHPRLLFLITDGTVSNTGKVLE- 487
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
L + R Y+FGIG H ++ LA +S+G + + ++ +M K +
Sbjct: 488 ----LVRNHAFSTRCYSFGIGPNVCHRLVKGLASVSKGSAEFLVEGERLQPKMVKSLKKA 543
Query: 506 FSSVLANIAID 516
+ VL+++ ++
Sbjct: 544 MAPVLSDVTVE 554
>gi|332817002|ref|XP_003309880.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan
troglodytes]
Length = 769
Score = 80.1 bits (196), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 149 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 208
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 209 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 267
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 268 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 319
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 320 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 378
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 379 SSFKADVQAHG 389
>gi|403287469|ref|XP_003934967.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Saimiri boliviensis boliviensis]
Length = 1228
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 110/222 (49%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P ++ +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSAQPSLRKAHGEFIFLIDRSSSMSGISVHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + ++E++ A I G TNI +PL
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEESLAMACDNIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + +G ++FL+TDGAV + ++ + L + R Y+FGIG H +
Sbjct: 448 IRQPVH-QGHPRLLFLITDGAVNNTGKVLE-----LVRNHAFSTRCYSFGIGPDVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S+G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSKGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|397495921|ref|XP_003818792.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 1
[Pan paniscus]
Length = 911
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|332817004|ref|XP_003339088.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan
troglodytes]
gi|332817006|ref|XP_516520.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 6
[Pan troglodytes]
Length = 623
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 117/253 (46%), Gaps = 29/253 (11%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 1 MNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQAS 60
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ ++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 61 EANLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPT 119
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 120 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIY 171
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY--- 546
+ Q+Q +++ +L ++ + +D + + S ++V+GR
Sbjct: 172 EDRDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADN 230
Query: 547 -QGKFPDTLKAKG 558
Q F ++A G
Sbjct: 231 KQSSFKADVQAHG 243
>gi|114587324|ref|XP_001172464.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 5
[Pan troglodytes]
Length = 911
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|344282809|ref|XP_003413165.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Loxodonta africana]
Length = 1230
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 101/191 (52%), Gaps = 10/191 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D SGSM G + K+A+ VAL L P FNI+ F G T+ LF S + +
Sbjct: 361 EFIFLIDRSGSMSGTNILRVKDAMLVALKSLLPACLFNIIGF-GSTFKTLFPAS-QTYNE 418
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
E++ A I G TNI +PL + + +G ++FL+TDGAV + ++ +
Sbjct: 419 ESLAMACDNIQRMRADMGGTNILSPLKWIIRQPVH-QGHPRLLFLITDGAVSNTGKVLEL 477
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+++ ++ R Y+FGIG H ++ LA +S+G + + ++ +M K +
Sbjct: 478 VRNHASST-----RCYSFGIGPNVCHRLVQRLASVSKGSAEFLVEGERLQPKMVKSLKKA 532
Query: 506 FSSVLANIAID 516
+ VL+++ ++
Sbjct: 533 MAPVLSDVTVE 543
>gi|338721494|ref|XP_001916970.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Equus
caballus]
Length = 899
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 122/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FN++ F
Sbjct: 237 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNVIGF 293
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T ++V +I + G T+I L +A+ +L N
Sbjct: 294 SNRIKVWKDHLVSVTPDSVRDGKVYIH-HMSPTGGTDINGALQRAIRLLNNYVAHNDIED 352
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + ++ G IC I+T GIG + L L++
Sbjct: 353 RSVSLIIFLTDGKPTVGETNTLKILNNTKEATRGQIC--IFTIGIGDDVDFKLLEKLSLE 410
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G +D D Q+ + + +L++I ID L E + P+ + S +
Sbjct: 411 NCGLTRRVHDEDDAGSQLIGFYDEIRTPLLSDIRIDYPPSLVEHATR-TLFPNYFNGSEI 469
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + FV+ L D+P+
Sbjct: 470 IIAGKLVDRKMDQLHVEVTASNSKKFVI-----LKTDVPV 504
>gi|66808461|ref|XP_637953.1| type A von Willebrand factor domain-containing protein
[Dictyostelium discoideum AX4]
gi|74853613|sp|Q54MG1.1|Y5981_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0285981
gi|60466393|gb|EAL64449.1| type A von Willebrand factor domain-containing protein
[Dictyostelium discoideum AX4]
Length = 831
Score = 79.7 bits (195), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 142/325 (43%), Gaps = 34/325 (10%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELAT 384
K + IF++D SGSM G+P++ K AL + + L+ FNI F T F S ++
Sbjct: 310 KSEFIFLIDCSGSMSGEPIKKAKRALEIIIRSLNENCKFNIYCFGSRFTKAFDNS-KMYN 368
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQIC 443
E + + ++ G T + P+ + T S P +F++TDG V + +
Sbjct: 369 DETLAQISGYVEKIDADLGGTELLPPIRDILS--TESDFEYPRQLFILTDGEVSERDSLI 426
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+ + + N RI+T+GIG+ + + L+ +GYY D + E Q+ KL +
Sbjct: 427 NYVATESNN-----TRIFTYGIGNSVDTELVIGLSKACKGYYEMIKDNSNFEEQVMKLVS 481
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ-GKFPDTLKAKGFLGD 562
F L+NI +D +L + E P++I L S LIV + K P + +GD
Sbjct: 482 IAFEPTLSNIKVDWGTEL-QIEQGPTKIRPLYSGETLIVYALLKDNKIPQSTVQVSLIGD 540
Query: 563 ---LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEE------------K 607
S + L +K I + + + A F+ K LEE +
Sbjct: 541 GPTGSKLEFPITLDFSKTIDYEN-------NSVHTLAAFNIIKDLEEVERKGNHSNNRDR 593
Query: 608 VSKMSVQTGVLCEYTRMIIVETDER 632
+ ++ G++ +YT I+ E+
Sbjct: 594 IEELGKSYGLISKYTSYIVTAASEQ 618
>gi|426340867|ref|XP_004034348.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 2
[Gorilla gorilla gorilla]
Length = 769
Score = 79.7 bits (195), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 149 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 208
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 209 NLQAAQDFVR-GFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPTEG 267
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 268 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 319
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 320 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 378
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 379 SSFKADVQAHG 389
>gi|426340865|ref|XP_004034347.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 1
[Gorilla gorilla gorilla]
Length = 911
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 117/251 (46%), Gaps = 29/251 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQASEA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 351 NLQAAQDFVR-GFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
V D QI +A++ R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 461
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY----Q 547
Q+Q +++ +L ++ + +D + + S ++V+GR Q
Sbjct: 462 RDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADNKQ 520
Query: 548 GKFPDTLKAKG 558
F ++A G
Sbjct: 521 SSFKADVQAHG 531
>gi|426340869|ref|XP_004034349.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 3
[Gorilla gorilla gorilla]
gi|426340871|ref|XP_004034350.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 4
[Gorilla gorilla gorilla]
Length = 623
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 117/253 (46%), Gaps = 29/253 (11%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A+
Sbjct: 1 MNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQAS 60
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ ++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 61 EANLQAAQDFVR-GFSLDEATNLNGGLLQGIEILNQVQESLPELSNHASILIMLTDGDPT 119
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 120 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIY 171
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY--- 546
+ Q+Q +++ +L ++ + +D + + S ++V+GR
Sbjct: 172 EDRDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADN 230
Query: 547 -QGKFPDTLKAKG 558
Q F ++A G
Sbjct: 231 KQSSFKADVQAHG 243
>gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881]
gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881]
Length = 684
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 144/330 (43%), Gaps = 22/330 (6%)
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
R F + + PG D +F++DISGSM K L + ++ AL KL GD F I
Sbjct: 284 RGTFMLSITPGDDLPPITTGSDWVFVLDISGSMSAK-LATLGDGVSQALGKLRGGDRFRI 342
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
V F+ ++ AT + + + + + + G TN+ L+ A+ L R +
Sbjct: 343 VLFDDRAEELTSGFVDATPNNIRQYTKKV-MQLQSRGGTNLFGGLSLALNPLDADRPTG- 400
Query: 427 IIFLVTDGAVE----DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
I LVTDG ++ D +++ R++TF +G+ N L + S
Sbjct: 401 -IVLVTDGVANVGKTQQKDFIDLLENHDV-------RLFTFVMGNSANRPMLTAMTNASN 452
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G+ + + D I Q+ + + ++ ID + + ++ P RI + +++
Sbjct: 453 GFAISVSNSDDIAGQILNATAKLTHQAMNDVEID-IDGVRTADLQPQRIGSIYHGRQIVL 511
Query: 543 SGRYQGKFPD--TLKAKGFLGDLSNFVVELKL--QLAKDIPLDRICAKQQI-DLLTAQAW 597
G Y G P TL+A G + Q + L+R+ A I D+ +
Sbjct: 512 LGHYWGDGPAKVTLRAN-VAGQNKQYKTRFNFPAQGGDNPELERLWAYATIQDMQQEINY 570
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIV 627
F E+K L++ + + ++ G++ YT M+++
Sbjct: 571 FGEEKDLQQGIVDLGLEYGLVTNYTSMVVL 600
>gi|440795667|gb|ELR16784.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 946
Score = 79.3 bits (194), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 89/351 (25%), Positives = 152/351 (43%), Gaps = 51/351 (14%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLF--------ST 378
++IFIVD SGSM G + K+ L + L L G FNI+ F T +LF
Sbjct: 285 EMIFIVDRSGSMSGSRMNQVKDTLQIFLRSLGEGTMFNIIGFGTSTQHLFREGSVEYNDK 344
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
++E+ATK E + N G TNI PL + + T G +F++TDG V +
Sbjct: 345 NLEIATKHVKE-----MSANL---GGTNILRPLQEVLRAQTK-EGYPRQLFILTDGEVGN 395
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
++ D ++ + R++TFG+G+ + ++ LA G++ +++E ++
Sbjct: 396 TQECVDFVRKH-----AETTRVFTFGVGNEASQDLVKGLAKAGEGFFEFVRSGEAMEEKV 450
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPS--RIPDLSSESPLIVSGRYQGKFPDTLKA 556
+ R L +I + T K + P+ R+P L L+V G
Sbjct: 451 MRQLHRAMQPALTDITV-TWKGAAASHVQPAPFRLPPLFCGGRLVVYGIIDDSAAAAGGN 509
Query: 557 K----------GFLGDLSNFVVELKLQL---AKDIPLDRICAKQQIDLLTAQAWFSEDKR 603
+ G + F +K+ L +K L+++ A+ + L + D+
Sbjct: 510 EAGGGEVEVVIGAKTAVKPFEAVVKVDLSKASKGTLLNKLAARTLLKDLEEGRSYVHDRE 569
Query: 604 LEEKVSK--------MSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSK 646
K K +S+ +GVL ++T + VE ER+ A+E GT K K
Sbjct: 570 GAVKAGKDVVAEMVRVSLASGVLSKHTAFVAVE--ERDEATE--GTMKVRK 616
>gi|444513526|gb|ELV10372.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Tupaia chinensis]
Length = 888
Score = 79.3 bits (194), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 104/467 (22%), Positives = 192/467 (41%), Gaps = 85/467 (18%)
Query: 139 EAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQ 198
E E++ K Y + GK+A + S K FT+++ + GS ++ L + +
Sbjct: 107 EKEVAKKQYEKAV-------SQGKTAGLVKASGRKLEKFTVSV-NVAAGSKVTFELTYEE 158
Query: 199 KLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQL-- 256
L R G K+ Y+ K++ + +I +++ G +L +S L
Sbjct: 159 LLKRRQG--------KYEMYLKVQPKQLVRDFEIEVDIFEPQGLSMLDAQASFITNDLLG 210
Query: 257 ----RRDVGKLGY-SYESEVLKWS----------NIDFDFSYTVSPSHIFGGVLLQSPSL 301
+ GK G+ S++ + + N DF +Y V+ +SP+
Sbjct: 211 SALTKSFSGKKGHVSFKPSLDQQRSCPTCTDSLLNGDFTITYDVN---------RESPAN 261
Query: 302 HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
+ F + P + V K+V+F++DISGSM G+ ++ T+ AL L +
Sbjct: 262 VQI-VNGYFVHFFAP---QGLPVVPKNVVFVIDISGSMSGRKIQQTREALLKILDDVKED 317
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQW---IGINFIAGGSTNICAPLTKAVEML 418
D N + F+G + + AT E ++ A + IG++ G TNI L + +L
Sbjct: 318 DYLNFILFSGGVTTWKDHLVQATPENLQEARAFVRKIGVD----GMTNINDGLLTGITVL 373
Query: 419 TNSRG-------SIPIIFLVTDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCN 470
+R S I+ ++TDG + ++ + N G P +Y G G+ N
Sbjct: 374 NEARKEHRVPERSTSIVIMLTDGDANTGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLN 432
Query: 471 HYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR 530
+ FL +A+ + G Y+ +Q+Q + + +L ++ + YP
Sbjct: 433 YNFLETMALQNHGLARRIYEDSDASLQLQGFYEEVANPLLMDVEV----------QYPEN 482
Query: 531 -IPDLSSE--------SPLIVSGRY----QGKFPDTLKAKGFLGDLS 564
I DL+ S ++V+GR F +K +G + DL+
Sbjct: 483 AILDLTQNTYQHFYDGSEIVVAGRLLDEEMNSFKADVKGQGAINDLT 529
>gi|444724394|gb|ELW64999.1| Inter-alpha-trypsin inhibitor heavy chain H5 [Tupaia chinensis]
Length = 915
Score = 79.3 bits (194), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 146/344 (42%), Gaps = 41/344 (11%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + +I + G T+I L +A+ +L N
Sbjct: 337 SNRIKVWKDHLISVTPDNIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNNYVAHNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + + ++ G IC I+T GIG+ + L L++
Sbjct: 396 RSVSLIIFLTDGKPTVGETQTLKILNNTKEAARGQIC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID L E + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPSLVE-HVTKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICA-------KQQIDLLT 593
I++G+ K D L + + FVV L D+P++ + A I+ L
Sbjct: 513 IIAGKLVDKQLDQLHVEVTASNSKKFVV-----LKTDVPVEPLKAGDDSEEDPNHIERLW 567
Query: 594 AQ--------AWFSED-----KRLEEKVSKMSVQTGVLCEYTRM 624
+ +W D ++L +K +SV L +T M
Sbjct: 568 SHLTMKELMSSWLQSDNEQEKEQLRQKAQALSVSYHFLTPFTSM 611
>gi|355757394|gb|EHH60919.1| Inter-alpha-trypsin inhibitor heavy chain H5-like protein [Macaca
fascicularis]
Length = 1313
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 113/248 (45%), Gaps = 15/248 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G +E TK A+ V LS L D FNI++F
Sbjct: 268 FIHYFAP---RGLPPMEKNVVFVIDVSGSMFGTKMEQTKKAMNVILSDLRANDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 325 SDTINVWKAGGSIQATIQNVHSAKDYLH-RMEADGWTDINSALLAAASVLNHSNQEPGRG 383
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP+I +TDG + S + ++T G + LR L++
Sbjct: 384 PSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGDDADFTLLRRLSL 443
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+RG Y+ +Q++ L+ +LA++ ++ L L + + P+ S
Sbjct: 444 ENRGIARRIYEDTDAALQLEGLYEEISMPLLADVHLNYLGGLVGASPW-AVFPNYFGGSE 502
Query: 540 LIVSGRYQ 547
L+V+G+ Q
Sbjct: 503 LVVAGQVQ 510
>gi|340375276|ref|XP_003386162.1| PREDICTED: poly [ADP-ribose] polymerase 4 [Amphimedon queenslandica]
Length = 1263
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 125/571 (21%), Positives = 242/571 (42%), Gaps = 76/571 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D ++ G EA I+GK ++ + K A E G++L P++
Sbjct: 685 EAKYVFPLDDMAAVCGFEAFINGKHIIGEVKEKEQAHREYKQAISEGHGAYLMDEETPDV 744
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNV 236
FT+++ + G+ + I++ + +L +GE NV F P V P ++ E ++V
Sbjct: 745 FTVSVGNLPPGASVLIKITYVAELQ-AEGE---NVVFSLPGSVAPWKQEAALDETTQVDV 800
Query: 237 NAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEV--LKWSNIDFDFSYTVSPSHIFGG- 293
+V + S + Q+ D+ + ES +K D + + P G
Sbjct: 801 EK---VKVRSDADSLSV-QISIDMPFTIRTIESPAHKIKMKKTDTKATVELCPGEKLGTG 856
Query: 294 --VLLQSPSLH-------DVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPL 344
+L+ +H + D+ CM ++ + + ++IF++D+S SM+G L
Sbjct: 857 FQLLIGLAEIHVPRMWVEENDKGHHACMLTFYPEFEAESIVENEIIFLLDVSNSMKGDAL 916
Query: 345 EDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGG 403
++ K L + L L P FN+ F +F +S+++ E + A + + G
Sbjct: 917 DNAKKVLLLLLHHLPPKTYFNVFTFGAMFESIFPSSVQI-NNETLSFATKQVQSCQAVMG 975
Query: 404 STNICAPL----------TKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICDAMKSRLTN 452
+T+I PL + + LTNS P +F+++DG + +E A+K
Sbjct: 976 NTDIWRPLHSLYLLSEAASSSTGGLTNSSTLPPCSVFVISDGHMTEEAPSLSAIKD---- 1031
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRG--YYGAAYDLDSIEIQMQKLFTRGFSSVL 510
G+ R++TFG+ S N +FLR +A + G + + E ++++ ++ F L
Sbjct: 1032 -GAQNYRVFTFGVSSTANRHFLRSMARVGGGCSEFFDSQKKSRWERKVKEQLSKAFQPAL 1090
Query: 511 ANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ---GKFPDTLKAKGFLGDLSNFV 567
+ + + DE P + P + + SG Q G P +A L F+
Sbjct: 1091 TGVDVQ-WQQHDENAPPPIQAPKSITS---LFSGSRQVVYGFVPHCKQA-----SLKAFI 1141
Query: 568 VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSE-------DKRLEEKVSKMSVQT----- 615
++Q + Q + LTAQA + D R E ++ K ++T
Sbjct: 1142 GRKEIQTMVSTSDLATTSGQTLHQLTAQALIRDWSEGSLNDDRTEHEIMKRDLKTFIINT 1201
Query: 616 ----GVLCEYTRMIIV---ETDERNNASESP 639
++ ++T + + E DE+ +A++ P
Sbjct: 1202 SKEYSIVSQFTSFVAIEKREKDEKFDATKGP 1232
>gi|66808455|ref|XP_637950.1| hypothetical protein DDB_G0285975 [Dictyostelium discoideum AX4]
gi|74853616|sp|Q54MG4.1|Y5975_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0285975
gi|60466390|gb|EAL64446.1| hypothetical protein DDB_G0285975 [Dictyostelium discoideum AX4]
Length = 917
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 142/325 (43%), Gaps = 34/325 (10%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELAT 384
K + IF++D SGSM G+P++ K AL + + L+ FNI F T F S ++
Sbjct: 337 KSEFIFLIDCSGSMSGEPIKKAKRALEIIIRSLNENCKFNIYCFGSRFTKAFDNS-KMYN 395
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQIC 443
E ++ ++ G T + P+ + T S P +F++TDG V + +
Sbjct: 396 DETLKEISGYVEKIDADLGGTELLPPIRDILS--TESDFEYPRQLFILTDGEVSERDSLI 453
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+ + + N RI+T+GIG+ + + L+ +GYY D + E Q+ KL +
Sbjct: 454 NYVATESNN-----TRIFTYGIGNSVDTELVIGLSKACKGYYEMIKDNSNFEEQVMKLVS 508
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ-GKFPDTLKAKGFLGD 562
F L+NI +D +L + E P++I L S LIV + K P + +GD
Sbjct: 509 IAFEPTLSNIKVDWGTEL-QIEQGPTKIRPLYSGETLIVYALLKDNKIPQSTVQVSLIGD 567
Query: 563 ---LSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEE------------K 607
S + L +K I + + + A F+ K LEE +
Sbjct: 568 GPTGSKLEFPITLDFSKTIDYEN-------NSVHTLAAFNIIKDLEEVERKGNHSNNRDR 620
Query: 608 VSKMSVQTGVLCEYTRMIIVETDER 632
+ ++ G++ +YT I+ E+
Sbjct: 621 IEELGKNYGLISKYTSYIVTAASEQ 645
>gi|354465012|ref|XP_003494974.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5-like [Cricetulus griseus]
Length = 953
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 125/281 (44%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 281 FVHYFAP---KDLPPLPKNVVFVLDTSASMVGMKLQQTREALVTILNDLRPQDHFNIIGF 337
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + +I + G T+I L A+++L N
Sbjct: 338 SNRIKVWKDHLLPVTPDNIRNGKLYI-YHLSPTGGTDINGALQMAIKLLNNYVAQNDIED 396
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ GG IC I+T GIG+ + L L++
Sbjct: 397 RSVSLIVFLTDGKPTFGETNTLKILSNTKEATGGQIC--IFTIGIGNDVDFKLLEKLSLE 454
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID D+ E + P+ + S +
Sbjct: 455 NCGLTRHVHEEEKAGEQLIGFYDEIRTPLLSDIRIDYPPDVVE-HATKTLFPNYFNGSEI 513
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + D + + FV+ L KDIP++
Sbjct: 514 VIAGKMVDRKFDQFHVEVTASNSKKFVI-----LKKDIPVE 549
>gi|344238973|gb|EGV95076.1| Inter-alpha-trypsin inhibitor heavy chain H5 [Cricetulus griseus]
Length = 913
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 125/281 (44%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L+ T+ AL L+ L P D FNI+ F
Sbjct: 241 FVHYFAP---KDLPPLPKNVVFVLDTSASMVGMKLQQTREALVTILNDLRPQDHFNIIGF 297
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + +I + G T+I L A+++L N
Sbjct: 298 SNRIKVWKDHLLPVTPDNIRNGKLYI-YHLSPTGGTDINGALQMAIKLLNNYVAQNDIED 356
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG + + ++ GG IC I+T GIG+ + L L++
Sbjct: 357 RSVSLIVFLTDGKPTFGETNTLKILSNTKEATGGQIC--IFTIGIGNDVDFKLLEKLSLE 414
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID D+ E + P+ + S +
Sbjct: 415 NCGLTRHVHEEEKAGEQLIGFYDEIRTPLLSDIRIDYPPDVVE-HATKTLFPNYFNGSEI 473
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + D + + FV+ L KDIP++
Sbjct: 474 VIAGKMVDRKFDQFHVEVTASNSKKFVI-----LKKDIPVE 509
>gi|296083444|emb|CBI23397.3| unnamed protein product [Vitis vinifera]
Length = 88
Score = 79.3 bits (194), Expect = 8e-12, Method: Composition-based stats.
Identities = 34/64 (53%), Positives = 47/64 (73%)
Query: 425 IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
+P+IF++ D VED R IC++++ +L N G ICP I TFG+GSYC++ L+MLA I RGY
Sbjct: 1 MPLIFVIADRVVEDGRHICNSLEGQLRNRGPICPCICTFGMGSYCSYNSLQMLAHIGRGY 60
Query: 485 YGAA 488
Y A
Sbjct: 61 YDAG 64
>gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5-like protein-like [Macaca mulatta]
Length = 1313
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 113/248 (45%), Gaps = 15/248 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G +E TK A+ V LS L D FNI++F
Sbjct: 268 FIHYFAP---RGLPPMEKNVVFVIDVSGSMFGTKMEQTKKAMNVILSDLRANDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 325 SDTINVWKAGGSIQATIQNVHSAKDYLH-RMEADGWTDINSALLAAASVLNHSNQEPGRG 383
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP+I +TDG + S + ++T G + LR L++
Sbjct: 384 PSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGDDADFTLLRRLSL 443
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+RG Y+ +Q++ L+ +LA++ ++ L L + + P+ S
Sbjct: 444 ENRGIARRIYEDTDAALQLEGLYEEISMPLLADVHLNYLGGLVGASPW-AVFPNYFGGSE 502
Query: 540 LIVSGRYQ 547
L+V+G+ Q
Sbjct: 503 LVVAGQVQ 510
>gi|355704836|gb|EHH30761.1| Inter-alpha-trypsin inhibitor heavy chain H5-like protein [Macaca
mulatta]
Length = 1313
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 113/248 (45%), Gaps = 15/248 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G +E TK A+ V LS L D FNI++F
Sbjct: 268 FIHYFAP---RGLPPMEKNVVFVIDVSGSMFGTKMEQTKKAMNVILSDLRANDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 325 SDTINVWKAGGSIQATIQNVHSAKDYLH-RMEADGWTDINSALLAAASVLNHSNQEPGRG 383
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP+I +TDG + S + ++T G + LR L++
Sbjct: 384 PSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGDDADFTLLRRLSL 443
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+RG Y+ +Q++ L+ +LA++ ++ L L + + P+ S
Sbjct: 444 ENRGIARRIYEDTDAALQLEGLYEEISMPLLADVHLNYLGGLVGASPW-AVFPNYFGGSE 502
Query: 540 LIVSGRYQ 547
L+V+G+ Q
Sbjct: 503 LVVAGQVQ 510
>gi|402910281|ref|XP_003917814.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6 [Papio
anubis]
Length = 1313
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 113/248 (45%), Gaps = 15/248 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G +E TK A+ V LS L D FNI++F
Sbjct: 268 FIHYFAP---RGLPPMEKNVVFVIDVSGSMFGTKMEQTKKAMNVILSDLRANDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 325 SDTINVWKAGGSIQATIQNVHSAKDYLH-RMEADGWTDINSALLAAASVLNHSNQEPGRG 383
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP+I +TDG + S + ++T G + LR L++
Sbjct: 384 PSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGDDADFTLLRRLSL 443
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+RG Y+ +Q++ L+ +LA++ ++ L L + + P+ S
Sbjct: 444 ENRGIARRIYEDTDAALQLEGLYEEISMPLLADVRLNYLGGLVGASPW-AVFPNYFGGSE 502
Query: 540 LIVSGRYQ 547
L+V+G+ Q
Sbjct: 503 LVVAGQVQ 510
>gi|345310282|ref|XP_001521494.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like,
partial [Ornithorhynchus anatinus]
Length = 389
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 93/198 (46%), Gaps = 13/198 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K V K+V+F++D+SGSM G+ L TK AL L + D N + F
Sbjct: 197 FVHFFAP---KDLPVVPKNVVFVIDVSGSMYGRKLVQTKEALLKILEDMKEEDYLNFILF 253
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+ E + ++ AT E +++A +++ N G TNI L + ++ML +R
Sbjct: 254 SSEITTWKDTLIKATPENLKKAKEFVK-NIKDEGLTNINDGLMRGIKMLNEARETNVVPK 312
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S +I ++TDG D ++ + N G P +Y G G N+ L +A+ +
Sbjct: 313 RSTSLIIMLTDGEANVGEIRADKIQENVRNAIGGKFP-LYNLGFGYDLNYNLLEKMALEN 371
Query: 482 RGYYGAAYDLDSIEIQMQ 499
G Y+ ++QMQ
Sbjct: 372 HGLARRIYEDSDADLQMQ 389
>gi|403306470|ref|XP_003943756.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6 [Saimiri
boliviensis boliviensis]
Length = 1313
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 117/252 (46%), Gaps = 23/252 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G ++ TK A+ V LS L D FNI++F
Sbjct: 268 FVHYFAP---RGLPPIEKNVVFVIDVSGSMFGTKMKQTKKAMNVILSDLQVNDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 325 SDTVSVWKAGGSIQATTQNVHSAKDYLH-RMEADGWTDINSALLAAASVLNHSNQEPGRV 383
Query: 423 ---GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
G IP+I +TDG V I ++ L + S +++ G + LR
Sbjct: 384 PSVGRIPLIIFLTDGEPTTGVTTPSVILSNVRQALGHRVS----LFSLAFGDDADFTLLR 439
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
L++ +RG Y+ +Q++ L+ +LA++ +D L L + + P+
Sbjct: 440 RLSLENRGIARRIYESTDAALQLEGLYEEISMPLLADVRLDYLGGLVGASPW-AVFPNYF 498
Query: 536 SESPLIVSGRYQ 547
S L+V+G+ Q
Sbjct: 499 GGSELVVAGQVQ 510
>gi|347755380|ref|YP_004862944.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347587898|gb|AEP12428.1| Uncharacterized protein containing a von Willebrand factor type A
(vWA) domain protein [Candidatus Chloracidobacterium
thermophilum B]
Length = 806
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 133/312 (42%), Gaps = 19/312 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+ V+ ++D S SM + L+ A ++ L L P D FN+V FN + FS + + T E
Sbjct: 317 RSVVVMLDTSLSMNFEKLDRAYEACSLFLKSLRPEDRFNLVLFNDDVQAFSETPQTGTAE 376
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDG-----AVEDERQ 441
RA +I ++++GG T+ L + + + G + L+TDG +
Sbjct: 377 NTARALDFIRRSYLSGG-TDFAKALQRGLALAKALPGDERTLILITDGNPTLTTTRGNKL 435
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
I D ++ G + R++ FG+G+ N FL LA +SRG + D + ++
Sbjct: 436 IRDFAEANARGGAPV--RVFAFGVGADANREFLSELARVSRGVSEFGRETDDLTFRLDAF 493
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD-------TL 554
F R + + + + + +YP+ S + GRY+ T
Sbjct: 494 FARVGQRPVEGLKLRLSDPDNAYAVYPAEETTGYDGSRVSFIGRYRRPVAQVECTVSGTQ 553
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-LLTAQAWFSEDKRLEEKVSKMSV 613
+ + + E + A L R+ A+++I LL A ED+ L +V +S
Sbjct: 554 AGRSVTATATGALPE---RDATHSHLPRLWAQERISALLRRMALDGEDEALIAEVIALSK 610
Query: 614 QTGVLCEYTRMI 625
+ ++ YT +
Sbjct: 611 KYNIVTPYTAFL 622
>gi|390479818|ref|XP_003735788.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H6, partial [Callithrix jacchus]
Length = 583
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 117/252 (46%), Gaps = 23/252 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G ++ TK A+ V LS L D FNI++F
Sbjct: 107 FVHYFAP---RGLPPMEKNVVFVIDVSGSMFGTKMKQTKKAMNVILSDLRANDYFNIISF 163
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 164 SDTVSVWKAGGSIQATTQNVHSAKDYLH-RMEADGWTDINSALLAAASVLNHSNQEPGRV 222
Query: 423 ---GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
G IP+I +TDG V I ++ L + S +++ G + LR
Sbjct: 223 PNVGRIPLIIFLTDGEPTTGVTTPSMILSNVRQALGHRVS----LFSLAFGDDADFTLLR 278
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLS 535
L++ +RG Y+ +Q++ L+ +LA++ +D L L + + P+
Sbjct: 279 RLSLENRGIARRIYEGTDAALQLKGLYEEISMPLLADVRLDYLGGLVGASPW-AVFPNYF 337
Query: 536 SESPLIVSGRYQ 547
S L+V+G+ Q
Sbjct: 338 GGSELVVAGQVQ 349
>gi|332807855|ref|XP_003307894.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Pan troglodytes]
Length = 1104
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 108/222 (48%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + +++++ A I G TNI +PL
Sbjct: 390 SLLPACLFNIIGF-GSTFKSLFPSS-QTYSEDSLAMACDDIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++FL+TDGAV + ++ + L + R Y+FGIG H +
Sbjct: 448 IRQPVH-RGHPRLLFLITDGAVNNTGKVLE-----LVRNHAFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSEGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|296225410|ref|XP_002758285.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Callithrix
jacchus]
Length = 844
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 93/187 (49%), Gaps = 20/187 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++DISGSM+G+ ++ TK AL L + PGD F++V F + S+ A++
Sbjct: 291 KNVVFVIDISGSMKGQKVKQTKEALLQILGDMRPGDYFDLVLFGSRVQSWRGSLVQASQA 350
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG----A 435
++ A ++ F +TN+ L + +E L R S+P I+ ++TDG
Sbjct: 351 NLQAARDFVR-GFSLDEATNLNGGLLRGIETLNKVRESLPELSNHASILIMLTDGDPTEG 409
Query: 436 VEDERQICDAMKSRLTNGGSICPR--IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
V D QI ++S +I R +Y G G + FL +++M + G Y+
Sbjct: 410 VTDRSQILKNVRS------AIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRD 463
Query: 494 IEIQMQK 500
Q+Q+
Sbjct: 464 AAQQLQE 470
>gi|198433657|ref|XP_002122417.1| PREDICTED: similar to PK-120 [Ciona intestinalis]
Length = 864
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 117/530 (22%), Positives = 223/530 (42%), Gaps = 79/530 (14%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQ----KLS------YRD-----G 205
G+SA + T + F +++ + GS ++ L + + KLS Y D G
Sbjct: 133 GESAGLVTAKIREAQTFDVSV-NVAAGSSVTFTLEYQELLKRKLSRYHHVIYADPGQVVG 191
Query: 206 EFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGY 265
+F + V P +T + P K+ +N T +T + ++RR +
Sbjct: 192 DFKIEVEITEPTGLT-TVTATPPSPKLRRTINPDT-----FDTGN---VEIRRSETQAYV 242
Query: 266 SY------ESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTA 319
SY + + + S++ F +Y V+ + G +L++ F + P
Sbjct: 243 SYRPTREQQRNIRRRSDLSFLVNYDVTREELGGEILIKD---------GYFVHFFAP--- 290
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS-T 378
+ V K V+F++D+SGSM G + TK AL L L+ D FNI+ F+ T ++
Sbjct: 291 TNLPVIPKKVVFVIDVSGSMSGHKIVQTKEALRTILDDLNEIDQFNIITFSSTTNVWHPN 350
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML-------TNSRGSIPIIFLV 431
M + A + + + GG TN A +++L TN+ ++ L+
Sbjct: 351 EMVDVNPTNIRNAKKHVRSMYARGG-TNFNAAALDGIQLLETISSNRTNTLEEASMMILL 409
Query: 432 TDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
TDG V I ++ R+ S I+ G G + +H FL +A ++G
Sbjct: 410 TDGQPTVGVTGNEAIRRNIRERVNGRYS----IFCLGFGQHLDHEFLDQIASENKGLSRK 465
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSGRY 546
Y+ +Q++ + S +LA++ + T DLD + + + S ++V+GR
Sbjct: 466 IYNDADAALQLKDFYDEVASPLLAHVIMRYTGPDLD--GLTKTTFDNFFDGSEILVTGRI 523
Query: 547 Q-GKFPDTL-------KAKGFLGDLSNFVVELKLQLAKD-IPLDRICAKQQI-DLLTAQA 596
Q + P L A+G + +S + + D +++I A I ++L ++
Sbjct: 524 QSSQVPQPLICIIGGSTAEGLVSIVSQG-KHFEESFSHDWFNMEKIWAYLTIKEMLRNKS 582
Query: 597 WFSED-----KRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGT 641
+D + ++ ++ ++S+ + T M++ + DE AS P T
Sbjct: 583 HVPDDDTTTHQNIKNEIVRLSLLYNFVTPMTSMVVTKPDEDVLASTVPPT 632
>gi|407698137|ref|YP_006822925.1| protein containing a von Willebrand factor type A domain
[Alcanivorax dieselolei B5]
gi|407255475|gb|AFT72582.1| Protein containing a von Willebrand factor type A domain
[Alcanivorax dieselolei B5]
Length = 689
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 151/347 (43%), Gaps = 33/347 (9%)
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
R F + L PG D +F++DISGSM GK L ++ AL KL PGD F I
Sbjct: 291 RGTFMLSLTPGDDLQPITTGSDWVFVLDISGSMHGK-LATLAEGVSQALGKLRPGDRFRI 349
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
V F+ + + E V + + GG TN+ L A+ L R +
Sbjct: 350 VLFDDHASELTNGYVVVNPENVRHYVKRVQALGPQGG-TNLFDGLKLALRPLDADRPT-- 406
Query: 427 IIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
I LVTDG V D+++ D +++ R++TF +G+ N L L S+
Sbjct: 407 GIVLVTDGVANVGVTDQKKFLDLLETHDV-------RLFTFVMGNSANRPMLTALTDASK 459
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA--IDTLKDLDEFEMYPSRIPDLSSESPL 540
G+ + D I Q+ ++ + N+ ID +K D M P R + +
Sbjct: 460 GFAVNVSNSDDIVGQILLATSKVTHQAMNNVKVEIDGVKTAD---MIPERFGTVYRGRQM 516
Query: 541 IVSGRYQGKFPD--TLKAKGFLGDLSNFVVELKLQLA-KDIP-LDRICAKQQIDLLTAQ- 595
++ G Y G+ P TL A+ G+ + + + D P L+R+ A I+ + +
Sbjct: 517 VLMGHYWGQGPADVTLTAE-VGGEKRRYKTRFEFPASGGDNPELERLWAYATIEGMQREI 575
Query: 596 AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD-------ERNNA 635
F ++ L + + ++ G++ YT M+++ + +R+NA
Sbjct: 576 NNFGDNADLRNGIVDLGLEYGLVTNYTSMVVLREEMFQQYGIDRDNA 622
>gi|189524901|ref|XP_693183.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Danio
rerio]
Length = 963
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 112/242 (46%), Gaps = 12/242 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ ++ T+ AL LS+L D F +V F+ + S+ AT E
Sbjct: 335 KNVVFVIDRSGSMMGEKMKQTQEALTTILSELHEDDYFGLVTFDDVIESWRPSLSKATPE 394
Query: 387 AVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGS----IPIIFLVTDGAVEDERQ 441
V A +++ IN A T+I + AV+MLT+ + + + +I L+TDG Q
Sbjct: 395 NVTEAKEYVQTIN--ARSMTDINKGILYAVDMLTSEKSASFPNMSMIILLTDGQPSSGEQ 452
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
++ + N + ++ G G ++ L LA + G Y+ +Q+Q
Sbjct: 453 DLSKIQENVRNAINGSMSLFCLGFGYDLDYILLDTLAKQNDGLARRVYEASDAALQLQGF 512
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ----GKFPDTLKAK 557
+ + +L + ++ D ++ S S ++V+GR Q FP + A
Sbjct: 513 YEEVATPLLIEVNLN-YPDNAVTDVTQSHFRHFFKGSEIVVAGRLQELEINAFPTEVSAN 571
Query: 558 GF 559
G
Sbjct: 572 GL 573
>gi|395827363|ref|XP_003786874.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 isoform 1
[Otolemur garnettii]
Length = 945
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 163/379 (43%), Gaps = 49/379 (12%)
Query: 301 LHDVDQR------EMFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
L+DVD+ E+F Y + A +S K+++F++D+SGSM G ++ T A+
Sbjct: 276 LYDVDREKKAGELEVFNGYFVHFFAPESLDPIPKNILFVIDVSGSMWGVKMKQTVEAMKT 335
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
L L D F+++ FN + + ATK V A ++I G TNI L +
Sbjct: 336 ILDDLRAEDQFSVIDFNHNVRTWRNDLVSATKTQVADAKRYIE-KIQPSGGTNINEALLR 394
Query: 414 AVEMLTNSR-------GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
A+ +L + S+ +I LV+DG ++ + +++ G+G
Sbjct: 395 AIFILNEANNLGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQDNISLFSLGMG 454
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLD 522
++ FL+ L+ +RG Y Q++K + + + +L N+ + ++ D+
Sbjct: 455 FDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPHTSVTDVT 514
Query: 523 EFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ--------- 573
+ + P+ S ++V+G++ PD L + ++ EL L+
Sbjct: 515 Q-----NSFPNYFGGSEIVVAGKFD---PDKLDEIQSIITATSANTELVLETLAQMDDLE 566
Query: 574 --LAKDIPLDRICAKQQIDLLTAQAWFSE---------DKRLEEKVSKMSVQTGVLCEYT 622
L+KD D ++ LT +E +++ +K+ +MS++ V+ T
Sbjct: 567 DFLSKDKHADPDFTRKLWAYLTINQLLAERSLAPTAATKRKITKKILQMSLEHHVVTPLT 626
Query: 623 RMIIVET--DERNNASESP 639
M+I DER A P
Sbjct: 627 AMVIENEAGDERMLADSPP 645
>gi|397486682|ref|XP_003814454.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Pan paniscus]
Length = 1215
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 110/222 (49%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + +++++ A I G TNI +PL
Sbjct: 390 SLLPACLFNIIGF-GSTFKSLFPSS-QTYSEDSLAMACDDIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++FL+TDGAV + ++ + +++ + R Y+FGIG H +
Sbjct: 448 IRQPVH-RGHPRLLFLITDGAVNNTGKVLELVRNH-----AFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSEGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|395827365|ref|XP_003786875.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 isoform 2
[Otolemur garnettii]
Length = 934
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 163/379 (43%), Gaps = 49/379 (12%)
Query: 301 LHDVDQR------EMFCMYLLPGTA-KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAV 353
L+DVD+ E+F Y + A +S K+++F++D+SGSM G ++ T A+
Sbjct: 265 LYDVDREKKAGELEVFNGYFVHFFAPESLDPIPKNILFVIDVSGSMWGVKMKQTVEAMKT 324
Query: 354 ALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
L L D F+++ FN + + ATK V A ++I G TNI L +
Sbjct: 325 ILDDLRAEDQFSVIDFNHNVRTWRNDLVSATKTQVADAKRYIE-KIQPSGGTNINEALLR 383
Query: 414 AVEMLTNSR-------GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
A+ +L + S+ +I LV+DG ++ + +++ G+G
Sbjct: 384 AIFILNEANNLGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQDNISLFSLGMG 443
Query: 467 SYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLD 522
++ FL+ L+ +RG Y Q++K + + + +L N+ + ++ D+
Sbjct: 444 FDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPHTSVTDVT 503
Query: 523 EFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ--------- 573
+ + P+ S ++V+G++ PD L + ++ EL L+
Sbjct: 504 Q-----NSFPNYFGGSEIVVAGKFD---PDKLDEIQSIITATSANTELVLETLAQMDDLE 555
Query: 574 --LAKDIPLDRICAKQQIDLLTAQAWFSE---------DKRLEEKVSKMSVQTGVLCEYT 622
L+KD D ++ LT +E +++ +K+ +MS++ V+ T
Sbjct: 556 DFLSKDKHADPDFTRKLWAYLTINQLLAERSLAPTAATKRKITKKILQMSLEHHVVTPLT 615
Query: 623 RMIIVET--DERNNASESP 639
M+I DER A P
Sbjct: 616 AMVIENEAGDERMLADSPP 634
>gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus]
Length = 1029
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 114/248 (45%), Gaps = 15/248 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + + +K+V+F++D+SGSM G L+ TK A+ LS L DSFNI+ F
Sbjct: 232 FIHYFAP---RGLQPVEKNVVFVIDVSGSMFGTKLQQTKKAMDKILSDLQTSDSFNIITF 288
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + + A ++ A G T+I A L A +L +S
Sbjct: 289 SDTVNIWKAEGSIQATVQNIHNAKNYVS-RMEANGWTDINAALLAAASVLNHSNQEPGKG 347
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP+I +TDG + S + + +++ G + LR L++
Sbjct: 348 RGVGQIPLIMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAFGDDADFSLLRRLSL 407
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
++G Y+ +Q++ L+ +LA++ +D L L +R P+ S
Sbjct: 408 ENQGEARRIYEDADAALQLEGLYAEISRPLLADVHLDYLGGLVGASSR-ARFPNYFCGSE 466
Query: 540 LIVSGRYQ 547
L+V+G+ Q
Sbjct: 467 LVVAGKVQ 474
>gi|403291063|ref|XP_003936619.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Saimiri
boliviensis boliviensis]
Length = 891
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 153/355 (43%), Gaps = 52/355 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
V K V F++D+SGSM G+ LE TK AL L + D N + F+G+ + + A
Sbjct: 280 VVPKSVAFVIDVSGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQA 339
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAV 436
E ++ A ++ + G TNI L + + ML +R S I+ ++TDG
Sbjct: 340 IPENLQEAKTFVK-SIDDRGMTNINDGLLRGIGMLNKAREEHRVPERSTSIVIMLTDGDA 398
Query: 437 EDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ ++ + N G P +Y G G+ N+ FL +A+ ++G+ Y+ +
Sbjct: 399 NVGESRPEKIQENVRNAIGGKFP-LYNLGFGNNLNYNFLENMALENQGFARRIYEDSDAD 457
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSR-IPDLSSE--------SPLIVSGRY 546
+Q+Q + + +L + + YP I DL+ S ++V+GR
Sbjct: 458 LQLQGFYEEVANPLLTGVEVG----------YPENAILDLTQNTYQHFYDGSEIVVAGRL 507
Query: 547 ----QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP---------LDRICAKQQIDLL 592
F +K G DL+ F E+ + ++ K + ++R+ A I+ L
Sbjct: 508 VDEDMNSFKADVKGHGATNDLT-FTEEVDMKEMEKALQERDYIFGDYIERLWAYLTIEQL 566
Query: 593 ---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE----TDERNNASESPG 640
+A E + L + +S++ + T M++ + DER A + PG
Sbjct: 567 LEKRKKAHGEEKENLTAQALDLSLKYHFVTPLTSMVVTKPEDNEDERAIA-DKPG 620
>gi|194381326|dbj|BAG58617.1| unnamed protein product [Homo sapiens]
Length = 1107
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 99/193 (51%), Gaps = 10/193 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
+ + IF++D S SM G + K+A+ VAL L P FNI+ F G T+ LF +S +
Sbjct: 254 RGEFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGF-GSTFKSLFPSS-QTY 311
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
+++++ A I G TNI +PL + + RG ++F++TDGAV + ++
Sbjct: 312 SEDSLAMACDDIQRMKADMGGTNILSPLKWVIRQPVH-RGHPRLLFVITDGAVNNTGKVL 370
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+ L + R Y+FGIG H ++ LA +S G + + ++ +M K
Sbjct: 371 E-----LVRNHAFSTRCYSFGIGPNVCHRLVKGLASVSEGSAELLMEGERLQPKMVKSLK 425
Query: 504 RGFSSVLANIAID 516
+ + VL+++ ++
Sbjct: 426 KAMAPVLSDVTVE 438
>gi|221044450|dbj|BAH13902.1| unnamed protein product [Homo sapiens]
Length = 623
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 116/253 (45%), Gaps = 29/253 (11%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+V+F++DISGSM+G+ ++ K AL L + PGD F++V F + S+ A+
Sbjct: 1 MNKNVVFVIDISGSMRGQKVKQIKEALLKILGDMQPGDYFDLVLFGTRVQSWKGSLVQAS 60
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDG--- 434
+ ++ A ++ F +TN+ L + +E+L + S+P I+ ++TDG
Sbjct: 61 EANLQAAQDFVR-GFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPT 119
Query: 435 -AVEDERQIC----DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
V D QI +A++ R +Y G G + FL +++M + G Y
Sbjct: 120 EGVTDRSQILKNVRNAIRGRFP--------LYNLGFGHNVDFNFLEVMSMENNGRAQRIY 171
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY--- 546
+ Q+Q +++ +L ++ + +D + + S ++V+GR
Sbjct: 172 EDHDATQQLQGFYSQVAKPLLVDVDLQYPQDA-VLALTQNHHKQYYEGSEIVVAGRIADN 230
Query: 547 -QGKFPDTLKAKG 558
Q F ++A G
Sbjct: 231 KQSSFKADVQAHG 243
>gi|392343110|ref|XP_003754799.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H6-like [Rattus norvegicus]
Length = 1320
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 114/248 (45%), Gaps = 15/248 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + + +K+V+F++D+SGSM G L+ TK A+ LS L DSFNI+ F
Sbjct: 268 FIHYFAP---RGLQPVEKNVVFVIDVSGSMFGTKLQQTKKAMDKILSDLQTSDSFNIITF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + + A ++ A G T+I A L A +L +S
Sbjct: 325 SDTVNIWKAEGSIQATVQNIHNAKNYVS-RMEANGWTDINAALLAAASVLNHSNQEPGKG 383
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP+I +TDG + S + + +++ G + LR L++
Sbjct: 384 RGVGQIPLIMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAFGDDADFSLLRRLSL 443
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
++G Y+ +Q++ L+ +LA++ +D L L +R P+ S
Sbjct: 444 ENQGEARRIYEDADAALQLEGLYAEISRPLLADVHLDYLGGLVGASSR-ARFPNYFCGSE 502
Query: 540 LIVSGRYQ 547
L+V+G+ Q
Sbjct: 503 LVVAGKVQ 510
>gi|449275888|gb|EMC84624.1| von Willebrand factor A domain-containing protein 5B1, partial
[Columba livia]
Length = 467
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 129/279 (46%), Gaps = 14/279 (5%)
Query: 243 EVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLH 302
E+L + S H+ + + G + + + LK N DF PS +++
Sbjct: 39 EILIHPSEPHMPHVLMEEGDMTPAEYEQHLKGKN-DFIKGTKKDPSAEKKTEIIRKRLNK 97
Query: 303 DVDQREMFCMYLLPGTAKSR---KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
D+ + + P + + +++ IF++D S SM G + K AL V L L
Sbjct: 98 DIPHHPVVMLNFCPDLRTVQPDLQQAQREFIFLIDRSRSMSGVNITRVKEALLVILKSLM 157
Query: 360 PGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
P FNI+ F G T+ LF S + +E++ A + I G TNI +PL +
Sbjct: 158 PACLFNIIGF-GSTFKTLFPAS-QTYCEESLAIACESIKNIRADMGGTNILSPLKWVIRQ 215
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
+ RG ++FL+TDGA+ + ++ + +++ S R Y+FGIG + L
Sbjct: 216 PIH-RGHPRLLFLLTDGAISNTGKVLELLRNH-----SCSTRCYSFGIGPNACRRLVGGL 269
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
A +SRG D + ++ +M K + + VL++++++
Sbjct: 270 AAVSRGIAEFLVDSERLQPKMIKSLKKAMAPVLSDVSVE 308
>gi|34532077|dbj|BAC86312.1| unnamed protein product [Homo sapiens]
Length = 447
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 98/191 (51%), Gaps = 10/191 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D S SM G + K+A+ VAL L P FNI+ F G T+ LF +S + ++
Sbjct: 78 EFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGF-GSTFKSLFPSS-QTYSE 135
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+++ A I G TNI +PL + + RG ++F++TDGAV + ++ +
Sbjct: 136 DSLAMACDDIQRMKADMGGTNILSPLKWVIRQPVH-RGHPRLLFVITDGAVNNTGKVLE- 193
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
L + R Y+FGIG H ++ LA +S G + + ++ +M K +
Sbjct: 194 ----LVRNHAFSTRCYSFGIGPNVCHRLVKGLASVSEGSAELLMEGERLQPKMVKSLKKA 249
Query: 506 FSSVLANIAID 516
+ VL+++ ++
Sbjct: 250 MAPVLSDVTVE 260
>gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus
scrofa]
Length = 944
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D S SM G L TK+AL L L P D FNI+ F
Sbjct: 280 FVHYFAP---RDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTILHDLRPQDHFNIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ ++ + T V +I + G T+I L + +L +S
Sbjct: 337 SNRIKVWKDHLVSVTPNNVRDGKVYIH-HMSPSGGTDINGALQTGIALLHDSVARHDLED 395
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + +R G +C I+T GIG + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTPKILNNTREAARGRVC--IFTVGIGDDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G+ ++ + Q+ + + +L++I +D D E ++ + P+ + S +
Sbjct: 454 NCGFTRHVHEDEDAGAQLIGFYDEIRTPLLSDIRVDYPPDAVE-QVTRTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
IV+G+ + D L + + FVV L KD+P++
Sbjct: 513 IVAGKLADRAKDRLHVEVTASNSKKFVV-----LKKDVPVE 548
>gi|172045919|sp|Q5TIE3.2|VW5B1_HUMAN RecName: Full=von Willebrand factor A domain-containing protein
5B1; Flags: Precursor
Length = 1220
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 108/222 (48%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + +++++ A I G TNI +PL
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEDSLAMACDDIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++F++TDGAV + ++ + L + R Y+FGIG H +
Sbjct: 448 IRQPVH-RGHPRLLFVITDGAVNNTGKVLE-----LVRNHAFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSEGSAELLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus]
Length = 1031
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 121/280 (43%), Gaps = 24/280 (8%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y VS I G V + F Y P + + +K+V+F++D+SG
Sbjct: 211 DFTIHYDVSMEDIIGDVQVYG---------GYFIHYFAP---RGLQPLEKNVVFVIDVSG 258
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G L+ TK A+ LS L DSFNI+ F+ ++ + AT + + A ++
Sbjct: 259 SMFGTKLQQTKKAMDTILSDLQASDSFNIITFSDTVNIWKAEGSIQATVQNIHSAKNYVS 318
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSR---------GSIPIIFLVTDGAVEDERQICDAMK 447
A G T+I A L A +L +S G IP+I +TDG +
Sbjct: 319 -RMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIPLIMFLTDGEPTAGETTPSVIL 377
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
S + + +++ G + LR L++ ++G Y+ +Q++ L+
Sbjct: 378 SNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAEISR 437
Query: 508 SVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+LA++ +D L + P+ S L+V+G+ Q
Sbjct: 438 PLLADVHLDYLGGWVGASSR-AHFPNYFHGSELVVAGKVQ 476
>gi|363738482|ref|XP_414253.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Gallus
gallus]
Length = 881
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 114/232 (49%), Gaps = 17/232 (7%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIFI+DISGSM G+ +E T+ AL L + D FN + F + +++ ++ AT E
Sbjct: 282 KNVIFIIDISGSMSGREIEQTREALLKILDDIKEDDHFNFILFGSDVHIWKETLIKATPE 341
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG----A 435
++ A +++ + G TN+ + K ++ML + + S II ++TDG
Sbjct: 342 NLDEARKFVR-SIDTEGMTNLYGGIMKGIDMLNAAHEGNLVPKRSASIIIMLTDGQPNVG 400
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ + + I +K + + +Y G G ++ FL +A+ ++G Y
Sbjct: 401 ISNTQDIQTHVKKAIEGKYT----LYNLGFGYGVDYNFLEKMALENKGLARRIYPDSDSA 456
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+Q+Q + + +L ++ ++ ++ E ++ + S ++V+GR++
Sbjct: 457 LQLQGFYDEVSNPMLLDVELNYPENEIE-DVTKNSFKHFYDGSEIVVAGRFR 507
>gi|210147462|ref|NP_001034589.2| von Willebrand factor A domain-containing protein 5B1 precursor
[Homo sapiens]
gi|119615327|gb|EAW94921.1| hCG1812043 [Homo sapiens]
Length = 1215
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 108/222 (48%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + +++++ A I G TNI +PL
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEDSLAMACDDIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++F++TDGAV + ++ + L + R Y+FGIG H +
Sbjct: 448 IRQPVH-RGHPRLLFVITDGAVNNTGKVLE-----LVRNHAFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSEGSAELLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|426328180|ref|XP_004024879.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Gorilla gorilla gorilla]
Length = 1144
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 107/222 (48%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P + +K + IF++D S SM G + K+A+ VAL
Sbjct: 330 LHKDIPHHSVIMLNFCPDLQSVQPCLRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALK 389
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + +++++ A I G TNI +PL
Sbjct: 390 SLMPACLFNIIGF-GSTFKSLFPSS-QTYSEDSLAMACDDIQRMKADMGGTNILSPLKWV 447
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ + RG ++FL+TDG V + ++ + L + R Y+FGIG H +
Sbjct: 448 IRQPVH-RGHPRLLFLITDGTVNNTGKVLE-----LVRNHAFSTRCYSFGIGPNVCHRLV 501
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +S G + + ++ +M K + + VL+++ ++
Sbjct: 502 KGLASVSEGSAEFLMEGERLQPKMVKSLKKAMAPVLSDVTVE 543
>gi|417515979|gb|JAA53791.1| inter-alpha-trypsin inhibitor heavy chain H5 [Sus scrofa]
Length = 944
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D S SM G L TK+AL L L P D FNI+ F
Sbjct: 280 FVHYFAP---RDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTILHDLRPQDHFNIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ ++ + T V +I + G T+I L + +L +S
Sbjct: 337 SNRIKVWKDHLVSVTPNNVRDGKVYIH-HMSPSGGTDINGALQTGIALLHDSVARHDLED 395
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + +R G +C I+T GIG + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTPKILNNTREAARGRVC--IFTVGIGDDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G+ ++ + Q+ + + +L++I +D D E ++ + P+ + S +
Sbjct: 454 NCGFTRRVHEDEDAGAQLIGFYDEIRTPLLSDIRVDYPPDAVE-QVTRTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
IV+G+ + D L + + FVV L KD+P++
Sbjct: 513 IVAGKLADRAKDRLHVEVTASNSKKFVV-----LKKDVPVE 548
>gi|440906273|gb|ELR56555.1| Inter-alpha-trypsin inhibitor heavy chain H5, partial [Bos
grunniens mutus]
Length = 942
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNIV F
Sbjct: 283 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIVGF 339
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T ++ +I + G T+I L + +++L +
Sbjct: 340 SNRIKVWKDHLVSVTPNSIRDGKVYIH-HMSPSGGTDINGALQRGIQLLNDYVAHNDIED 398
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ ++ +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 399 RSVSLVVFLTDGKPTVGETHTLKILNNTREAARGRVC--IFTVGIGADVDFKLLEKLSLE 456
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ Q+ + + +L++I +D L E + P+ + S +
Sbjct: 457 NCGLTRRVHEDHDARAQLIGFYDEIRTPLLSDIRVDYPPSLVEHATR-TLFPNYFNGSEI 515
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + D L + + FVV L KD+P++
Sbjct: 516 VIAGKLVDRAMDRLHVEVTASNSKKFVV-----LKKDVPVE 551
>gi|395516877|ref|XP_003762610.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1
[Sarcophilus harrisii]
Length = 880
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 107/243 (44%), Gaps = 12/243 (4%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K+ + K+V+F++DISGSM G+ ++ T+ AL L + P D FN + F
Sbjct: 260 FAQFFAP---KNLRNMVKNVVFVIDISGSMAGQKIKQTRQALLKILDDIKPEDYFNFILF 316
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------R 422
G + S+ AT + ++ A ++ F G+TN+ L + + +L + R
Sbjct: 317 GGHVRAWKNSLIQATADNLQEARVFVQ-TFSLAGATNLNGGLLEGINILNKALEVDPRLR 375
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
+ PI+ ++TDG ++ + N +Y G G + FL +++ +
Sbjct: 376 NNAPIVIMLTDGEPTVGETNLAQIRKNVKNAIQGKFPLYNLGFGENVDFNFLEAMSLEND 435
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G Y + Q++ + + +L N+ K + + R+ S ++V
Sbjct: 436 GTARRIYRDNDAAEQLKGFYEEVANPLLTNVEFQYPKASIQ-ALTEHRVKQFYDGSEIVV 494
Query: 543 SGR 545
+G
Sbjct: 495 AGH 497
>gi|395821385|ref|XP_003784022.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Otolemur garnettii]
Length = 1247
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 103/209 (49%), Gaps = 11/209 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + IF+VD S SM G + K+A+ VAL L P FN++ F
Sbjct: 344 FCPDLQSVQPNVRKTLG-EFIFLVDRSSSMSGTSINLVKDAMLVALKSLMPACLFNVIGF 402
Query: 370 NGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI 427
G T+ LF +S + ++E + A I G TNI +P + + +G +
Sbjct: 403 -GSTFKSLFPSS-QAYSEENLALACDNIQKMRADMGGTNILSPFKWIIRQPVH-QGHPRL 459
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
+FL+TDGAV + ++ + L + R Y+FGIG+ H ++ LA +S+G
Sbjct: 460 VFLITDGAVSNTGKVLE-----LVRNHAFSTRCYSFGIGANVCHRLVKGLASVSKGSAEF 514
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 515 LVEGERLQPKMIKSLKKAMAPVLSDVTVE 543
>gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5
[Oryctolagus cuniculus]
Length = 940
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 125/280 (44%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNI+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTILHDLRPQDRFNIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ ++ T ++ +I + G T+I L A+ +L N
Sbjct: 337 SNRIKVWKDNLISVTPNSIRDGKIYIH-HMSPTGGTDINGALQTAIRLLNNYVAHNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + ++ G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTKEAAQGRVC--IFTIGIGNDVDFKLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ ++ Q+ + + +L++I +D L E + + P+ + S +
Sbjct: 454 NCGLTRRVHEEENAGAQLIGFYDEIRTPLLSDIRMDYPPGLVE-QATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ K D L + + FV+ L KD+P+
Sbjct: 513 IIAGKLTNKKLDQLHVEVTASNSKKFVI-----LKKDVPV 547
>gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H6 [Mus musculus]
gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H6 [Mus musculus]
Length = 1321
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 121/280 (43%), Gaps = 24/280 (8%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y VS I G V + F Y P + + +K+V+F++D+SG
Sbjct: 245 DFTIHYDVSMEDIIGDVQVYG---------GYFIHYFAP---RGLQPLEKNVVFVIDVSG 292
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G L+ TK A+ LS L DSFNI+ F+ ++ + AT + + A ++
Sbjct: 293 SMFGTKLQQTKKAMDTILSDLQASDSFNIITFSDTVNIWKAEGSIQATVQNIHSAKNYVS 352
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSR---------GSIPIIFLVTDGAVEDERQICDAMK 447
A G T+I A L A +L +S G IP+I +TDG +
Sbjct: 353 -RMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIPLIMFLTDGEPTAGETTPSVIL 411
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
S + + +++ G + LR L++ ++G Y+ +Q++ L+
Sbjct: 412 SNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAEISR 471
Query: 508 SVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+LA++ +D L + P+ S L+V+G+ Q
Sbjct: 472 PLLADVHLDYLGGWVGASSR-AHFPNYFHGSELVVAGKVQ 510
>gi|390475180|ref|XP_002758468.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix
jacchus]
Length = 1023
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 153/358 (42%), Gaps = 39/358 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D SGSM G+ ++ T+ AL L L P D FN++ F
Sbjct: 389 FVHYFAP---EGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLITF 445
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
+ E +S S+ A+ E V +A + GG TNI + AV++L S
Sbjct: 446 SSEATQWSPSLVPASAENVNKARSFAAAIHALGG-TNINDAVLMAVQLLDRSNREERLPT 504
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF--GIGSYCNHYFLRMLAMI 480
S+ +I L+TDG + + + S N Y+ G+G + Y LA+
Sbjct: 505 RSVSLIILLTDG----DPTVGEGPASXXKNVREAVSGQYSSLPGLG-FDVSYAPEKLALD 559
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G Y+ +Q+Q + + +L + + + E E+ L S +
Sbjct: 560 TGGLARRIYEDSDSAMQLQDFYQEVANPLLTAVTFEYPNNAVE-EVTQDNFRFLFKGSEM 618
Query: 541 IVSGRYQGKFPDTLKAK--GFLGDLSNFVVELKLQLAKDIP------------LDRICAK 586
+V+G+ + + PD L AK G L + N + + +A+ ++R+ A
Sbjct: 619 VVAGKLRDQGPDVLTAKVSGHL-TMQNLTFQTESSVAEQEAEFQSPKYIFRNFMERLWAY 677
Query: 587 QQIDLL---TAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIV--ETDERNNASESP 639
I L T A +E + L + +S+ + T M++ E E++ +E P
Sbjct: 678 LTIQQLLEGTVSASDAEQQVLRTRALNLSLAYSFVTPLTSMVVTKPEDQEQSQVAEKP 735
>gi|392408705|ref|YP_006445312.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Desulfomonile tiedjei DSM 6799]
gi|390621841|gb|AFM23048.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Desulfomonile tiedjei DSM 6799]
Length = 718
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 123/266 (46%), Gaps = 17/266 (6%)
Query: 302 HDVDQREMFCMYLLP---GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
H + ++ F M L+P +K R+ +KDV+F++D S SM L K + + +L
Sbjct: 287 HRDNGKKGFFMALIPPPIAPSKDRQP-EKDVVFLIDASRSMSPADLAAAKRTVISGVERL 345
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML 418
PGD FN++ + T + + ++ RA ++ GG+T++ + A+E
Sbjct: 346 RPGDRFNVIVVDTRTRMLFEKLVPLSEHNAARAVGFVN-ALQRGGATDLYNSVIGALEQF 404
Query: 419 TNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
NSR + +G + + + R++T G+G + + FL +A
Sbjct: 405 -NSRKRPGFVVFAGNGHGTVGITNPETILEHVQKSNRFRARVFTLGLGMHKDIAFLERVA 463
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSV----LANIAIDTLKDLDEFEMYPSRIPDL 534
++ GA+++ D + + L R FS+V ++ I ++ D+ E+ P IPDL
Sbjct: 464 AFTK---GASFN-DQGKEDFESLLNRLFSTVSPPRMSEIQLE-FPDISPEEVDPDPIPDL 518
Query: 535 SSESPLIVSGRY--QGKFPDTLKAKG 558
E +++ GRY +G P +K K
Sbjct: 519 IGEENIVLLGRYSTKGDVPSKIKLKA 544
>gi|74315933|ref|NP_001028276.1| inter-alpha (globulin) inhibitor H3 precursor [Danio rerio]
gi|72679321|gb|AAI00122.1| Inter-alpha (globulin) inhibitor H3 [Danio rerio]
Length = 892
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 98/213 (46%), Gaps = 14/213 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+FI+D SGSMQG +E T+ A+ LS L D F ++ F
Sbjct: 253 FVHYFAPTDVQR---IPKNVVFIIDQSGSMQGNKIEQTRMAMLRILSDLAKDDYFGLITF 309
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN--SRGSIPI 427
+ + + AT E VE A ++ +GG+T+I + AV M+ GS I
Sbjct: 310 SSHIQAWKPELLKATAENVEEAKTFVK-QIRSGGATDINGAVLNAVNMINQYTQEGSASI 368
Query: 428 IFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ L+TDG V + I +K+ + G P +Y G G FL +++ + G
Sbjct: 369 LILLTDGDPTSGVTNPVTIQQNVKTAI---GGKYP-LYCLGFGFNVRFEFLEKMSLENNG 424
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
Y+ ++Q+Q + +L N+ ++
Sbjct: 425 AARRIYEDSDADLQLQGFYEEVAIPLLTNVQLN 457
>gi|395538994|ref|XP_003771459.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5
[Sarcophilus harrisii]
Length = 949
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 122/282 (43%), Gaps = 21/282 (7%)
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
+ F Y P K K+V+F++D S SM G L TKNAL L L P D FNIV
Sbjct: 285 DYFVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKNALFTILHDLRPQDHFNIV 341
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-------- 419
F+ ++ + T + + +I + G T+I L K +E+L
Sbjct: 342 GFSNRIKVWQNHLVPVTPNNIRDSKVYIH-HMTPTGGTDINGALQKGIELLNDHIADNDI 400
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAM-KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
+SR I+FL A E Q + ++ G +C I+T GIG+ + L L+
Sbjct: 401 DSRSVSLIVFLTDGKATVGETQSSRILSNTKEAARGQVC--IFTIGIGNDVDFKLLEKLS 458
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ + G + D+ + Q+ + + +L++I ID E + + P+ + S
Sbjct: 459 LENCGMTRRVQEEDNAKEQLIGFYDEIRTPLLSDIRIDYPPGSVE-RVTQTLFPNYFNGS 517
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+I++G+ D L + + FVV L D+P+
Sbjct: 518 EIIIAGKLIDSQLDNLHVEVTASNSKKFVV-----LKTDVPV 554
>gi|116751108|ref|YP_847795.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter
fumaroxidans MPOB]
gi|116700172|gb|ABK19360.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter
fumaroxidans MPOB]
Length = 680
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 150/351 (42%), Gaps = 32/351 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + + P + R D F++DISGSM G+ + ++ L K+ D F IV F
Sbjct: 286 FMVVVTPAASLKRIAEGVDWTFVLDISGSMTGRKITTLIEGVSRVLGKMSANDRFRIVTF 345
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
N F+ A+ E V+ Q + AGGST + L A +L R + I
Sbjct: 346 NTTAADFTGGYVPASPENVQTWMQRVK-QIQAGGSTALFDGLDLAYRLLDGERTT--GIV 402
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICP----RIYTFGIGSYCNHYFLRMLAMISRGYY 485
LVTDG +C+ +R + R++TF IG+ N + LA S G+
Sbjct: 403 LVTDG-------VCNVGPTRHDEFLGLLKQHDVRLFTFVIGNSANQPLMDRLAKESGGFA 455
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ D I ++ + + F L + + + P+R L L+ GR
Sbjct: 456 MNVSESDDIAGRLIQAKAKVFHECLHGVVL-KFHGEKVSRLTPARPGSLHLGRQLVAFGR 514
Query: 546 YQ--GKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP-LDRICA----KQQIDLLTAQAWF 598
Y+ G+ LKAK + S + + ++ P L+R+ A ++ ++++ +
Sbjct: 515 YEGHGEVEVELKAKVSGQERSWRCKAVLPETDEENPELERLWALSSIEEHMEMIRERG-- 572
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDE-------RNNASESPGTK 642
E + L EKV ++ + ++ +YT M+++ D RNNA G +
Sbjct: 573 -ESESLREKVVRLGKEYSLVTDYTSMVVIPQDAMEAEGIGRNNADRVAGER 622
>gi|345306041|ref|XP_001508722.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Ornithorhynchus anatinus]
Length = 1104
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 110/238 (46%), Gaps = 25/238 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K V+F++D SGSM G+ + T+ AL L L+P D FN+V FN + S+ AT+E
Sbjct: 348 KSVVFVIDKSGSMDGRKIVQTREALLKVLGDLNPEDQFNLVVFNSMISQWQPSLLKATQE 407
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEML--TNSR-----GSIPIIFLVTDGAVEDE 439
V A +++ ++ A G TNI + AV +L +N R S+ +I L+TDG
Sbjct: 408 NVGSAKKFV-LDIRASGGTNINEAVLAAVHLLDESNQRELLPENSVSMIILLTDGEPTVG 466
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+ ++ + ++ G G ++ FL +A+ + G Y+ +Q+Q
Sbjct: 467 ETNPENIQQNIQRSLDGKYALFCLGFGFDVSYSFLEKMALDNSGLARRIYEDSDAALQLQ 526
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE--------SPLIVSGRYQGK 549
+ + +L ++ EF+ + + +LS S L+V+G+ K
Sbjct: 527 DFYQEVATPLLKHV---------EFKFPENTVLELSQNNFNLFFKGSELVVAGKLHSK 575
>gi|405951692|gb|EKC19584.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Crassostrea gigas]
Length = 864
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 103/221 (46%), Gaps = 18/221 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+ V+FI+D SGSM G + K A++ LS L GD FNI+ F
Sbjct: 193 FVHYFAP---QGLPRMKRKVLFILDYSGSMLGTKITQLKTAMSEILSNLAKGDRFNIIRF 249
Query: 370 NGE-TYLFSTSMELATKE-AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP- 426
NG+ T ++ S + E +V A +I + A G TNI LT+ +++L P
Sbjct: 250 NGDVTKWWNHSALVEVNEYSVNAAKNFIN-DIEADGWTNINDALTEGMDLLQEDDEKDPT 308
Query: 427 ----IIFLVTDGAVE---DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+IFL A E D +I D + SR P I++ G ++ FL+ L+
Sbjct: 309 YASLMIFLTDGYATEGEMDNSKILDTINSR---NKDRVP-IFSLSFGDEADYGFLKKLSA 364
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKD 520
+ G+ Y+ +Q+ + S++L+N+ L D
Sbjct: 365 QNNGFVRKIYEESDASLQLTGFYGEVASTLLSNLTFRYLDD 405
>gi|351713656|gb|EHB16575.1| von Willebrand factor A domain-containing protein 5B1
[Heterocephalus glaber]
Length = 1221
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 127/278 (45%), Gaps = 13/278 (4%)
Query: 243 EVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLH 302
E+L + S H+ + + G + + + LK DF T S + + LH
Sbjct: 279 EILIHPSEPHMPHVLMEKGDMTLAEFDQYLK-RRTDF-IKGTKKDSSVERKTEIIRKRLH 336
Query: 303 -DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
D+ + + P + +K + IF++D S SM G + K+A+ VAL L
Sbjct: 337 KDIPHHSVIMLNFCPDLQSVQPNLRKTHGEFIFLIDRSSSMSGTSIHHVKDAMLVALKSL 396
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML 418
P FN++ F G T+ S + +E++ A I G TNI +PL V
Sbjct: 397 MPTCLFNVIGF-GSTFKTLPSSQTYNEESLAMACDNIQRIRADMGGTNIFSPLKWIVRQP 455
Query: 419 TNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
+ +G ++FL+T GA+ + ++ + +++ + R Y+FGIG H ++ LA
Sbjct: 456 VH-QGHPRLLFLITAGAISNTGKVLELLRNH-----AFSTRCYSFGIGPSVCHRLVKGLA 509
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+S+G + + ++ +M K + + VL+++ ++
Sbjct: 510 SVSKGSAEFLVEGERLQPKMVKSLKKAMAPVLSDVTVE 547
>gi|332254494|ref|XP_003276364.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H6 [Nomascus leucogenys]
Length = 1313
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 127/288 (44%), Gaps = 40/288 (13%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V I G V ++D + F Y P + +K+V+F++D+SG
Sbjct: 245 DFLVQYDVVMEDIIGDV-----QIYD----DYFIHYFAP---RGLPPMEKNVVFVIDVSG 292
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G +E TK A+ V LS L D FNI++F+ ++ + AT + V A ++
Sbjct: 293 SMFGTKIEQTKKAMNVILSDLKANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLH 352
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSR---------GSIPIIFLVTDGAVEDE-------- 439
A G T+I + L A +L +S G IP+I + DG
Sbjct: 353 C-MEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLMDGEPTAGVTTPSVIL 411
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+C A+ R++ +++ G + LR L++ +RG Y+ +Q++
Sbjct: 412 SNVCQALGHRVS--------LFSLAFGDDADFTLLRRLSLENRGIAQRIYEDTDAALQLE 463
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
L+ +LA++ ++ L L + + P+ S L+V+G+ Q
Sbjct: 464 GLYEEISMPLLADVRLNYLGGLVGASPW-AVFPNYFGGSELVVAGQVQ 510
>gi|344340674|ref|ZP_08771598.1| Vault protein inter-alpha-trypsin domain-containing protein
[Thiocapsa marina 5811]
gi|343799355|gb|EGV17305.1| Vault protein inter-alpha-trypsin domain-containing protein
[Thiocapsa marina 5811]
Length = 703
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 151/363 (41%), Gaps = 33/363 (9%)
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
R F + L PG D +F++D+SGSMQGK + AL KL P D F I
Sbjct: 295 RGTFLLTLTPGDDLQPISQGTDWVFVLDLSGSMQGK-YGTLAEGIRQALGKLRPEDRFRI 353
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
V FN + AT E V + GG T++ + L ++ + R S
Sbjct: 354 VTFNNSARELTRGYVPATAEGVLEWADRVAAEQPNGG-TDLYSGLQLGLDRVDADRTS-- 410
Query: 427 IIFLVTDGAVE----DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
I LVTDG ++R RL + R++TF +G+ N L L S
Sbjct: 411 GIILVTDGVANVGETEQRSFL-----RLIEQKDV--RLFTFIMGNSANRPLLEALTQASD 463
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G+ + + D I ++ ++ + L I ++ + + ++ PS+ L L++
Sbjct: 464 GFARSVSNADDIVGEILAASSKLTHAALHGIRVE-IDGVRTTDLTPSKPGSLYRGQQLML 522
Query: 543 SGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLA------KDIPLDRICAKQQIDLLTAQ- 595
G Y G TL G LS + + + A + L+R+ A +I + +
Sbjct: 523 LGHYWGDGEATLT---LSGRLSGAETQYRGRFAFPARSTANPELERLWAYDRIREMEHEI 579
Query: 596 AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETD-------ERNNASESPGTKKGSKKS 648
F D +E+ V + V G++ + T M+++ + ER NA+ S + ++
Sbjct: 580 ETFGPDADMEDAVRDLGVAYGLVTDRTSMVVLREEVFEQLGIERLNATRSDLERAAREQR 639
Query: 649 DHQ 651
HQ
Sbjct: 640 THQ 642
>gi|47522678|ref|NP_999068.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Sus scrofa]
gi|3024050|sp|O02668.1|ITIH2_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2;
Short=ITI heavy chain H2; Short=ITI-HC2;
Short=Inter-alpha-inhibitor heavy chain 2; Flags:
Precursor
gi|1915954|emb|CAA72308.1| inter-alpha-inhibitor heavy-chain H2 [Sus scrofa]
Length = 935
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/364 (24%), Positives = 157/364 (43%), Gaps = 47/364 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P +S K+++F++D+SGSM G ++ T A+ L L D F++V F
Sbjct: 284 FVHFFAP---ESMDPIPKNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSLVDF 340
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------- 422
N + + ATK V A +I G TNI L +A+ +L +
Sbjct: 341 NHNIRTWRNDLVSATKTQVADAKTYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDP 399
Query: 423 GSIPIIFLVTDG-AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S+ +I LV+DG E Q+ K+ N +++ GIG ++ FL+ L+ +
Sbjct: 400 NSVSLIILVSDGDPTVGELQLSKIQKNVKQNIQDNV-SLFSLGIGFDVDYDFLKRLSNDN 458
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSE 537
RG Y Q++K + + + +L N+ + ++ D+ + + P+
Sbjct: 459 RGMAQRIYGNQDTASQLKKFYNQVSTPLLRNVQFNYPQASVTDVTQ-----NSFPNYFGG 513
Query: 538 SPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ-----------LAKDIPLDRICAK 586
S ++V+G++ P+ L+ + ++ VEL L+ LAKD D K
Sbjct: 514 SEIVVAGKFN---PEKLEQLQGIITATSANVELVLETLAEMDGLEAFLAKDRHADPDFTK 570
Query: 587 QQIDLLTAQAWFSEDKR-----LEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNA 635
+ LT E R +++K++K MS+ ++ T M++ DER A
Sbjct: 571 KLWAYLTINQLLDERSRAPSAAVKKKITKSILQMSLDHHIVTPLTAMVVENEAGDERMLA 630
Query: 636 SESP 639
P
Sbjct: 631 DAPP 634
>gi|34783791|gb|AAH56811.1| Zgc:112265 protein, partial [Danio rerio]
Length = 927
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 8/209 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+V+FI+D SGSM G+ + T++AL L LD D F ++ F
Sbjct: 261 FVHYFAPSDVPH---IPKNVVFIIDRSGSMHGRKIRQTRSALLTILKDLDEDDHFGLITF 317
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPI 427
+ E + + ATK E A ++ G+TNI + V+M+ + +G+ I
Sbjct: 318 DAEIDFWRRELLQATKANRENAESFVK-RIQDRGATNINDAVLAGVDMINRNPRKGTASI 376
Query: 428 IFLVTDG-AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
+ L+TDG E I M + GS P +Y G G N FL +++ +
Sbjct: 377 LILLTDGDPTAGETNIEKIMANVKEAIGSKFP-LYCLGFGYDVNFDFLTKMSLENNAVAR 435
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
Y+ +IQ+Q + +L +I +
Sbjct: 436 RIYEDSDADIQLQGFYDEVAVPLLTDIQL 464
>gi|296481522|tpg|DAA23637.1| TPA: inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos
taurus]
Length = 940
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNIV F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIVGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T ++ +I + G T+I L + +++L +
Sbjct: 337 SNRIKVWKDHLVSVTPNSIRDGKVYIH-HMSPSGGTDINGALQRGIQLLNDYVAHNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ ++ +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLVVFLTDGKPTVGETHTLKILNNTREAARGRVC--IFTVGIGADVDFKLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ Q+ + + +L++I +D L E + P+ + S +
Sbjct: 454 NCGLTRRVHEDHDARAQLIGFYDEIRTPLLSDIRVDYPPSLVE-RATRTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + D L + + FVV L KD+P++
Sbjct: 513 VIAGKLVDRAMDRLHVEVTASNSKKFVV-----LKKDVPVE 548
>gi|408491107|ref|YP_006867476.1| vWFA superfamily protein [Psychroflexus torquis ATCC 700755]
gi|408468382|gb|AFU68726.1| vWFA superfamily protein [Psychroflexus torquis ATCC 700755]
Length = 689
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 116/247 (46%), Gaps = 13/247 (5%)
Query: 309 MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQG-KPLEDTKNALAVALSKLDPGDSFNIV 367
F + + P + + +V +K+ + I+D SGSM+G + K A ++ L+ GD+FN++
Sbjct: 258 FFGLVVEPESNANTEVIEKNFVLIIDSSGSMRGGNKMAQAKEASEFIVNNLNIGDNFNVI 317
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV---EMLTNSRGS 424
F+ LF + + A +I N +A G+TNI L A+ E + +
Sbjct: 318 DFDNNIVLFQPELVEYNIQNSNAALDFIE-NIVALGATNISESLVTAINQFEAGAEDKAN 376
Query: 425 IPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRI--YTFGIGSYCNHYFLRMLAMISR 482
I I+F GA E E + ++ I I +TFGIG L +LA+ +
Sbjct: 377 I-IVFFTDGGATEGETNTQNILQLAEDTVNQIETEIFLFTFGIGEDVTTDLLTLLAVQNN 435
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLAN--IAIDTLKDLDEFEMYPSRIPDLSSESPL 540
G+ D + ++I + + + VL N I +D + ++ +YP +P+L L
Sbjct: 436 GFVTFLGDNEIVDI-ISNFYLTIRNPVLLNPVITVDPVGAIN--NVYPDPLPNLYKGQQL 492
Query: 541 IVSGRYQ 547
+ +GRY+
Sbjct: 493 VFTGRYE 499
>gi|68448495|ref|NP_001020335.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Danio
rerio]
gi|67677852|gb|AAH96879.1| Zgc:112265 [Danio rerio]
Length = 915
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 8/209 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K+V+FI+D SGSM G+ + T++AL L LD D F ++ F
Sbjct: 255 FVHYFAPSDVPR---IPKNVVFIIDRSGSMHGRRIRQTRSALLTILKDLDEDDHFGLITF 311
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPI 427
+ E + + ATK E A ++ G+TNI + V+M+ + +G+ I
Sbjct: 312 DAEIDFWKRELLQATKANRENAESFVK-RIQDRGATNINDAVLAGVDMINRNPRKGTASI 370
Query: 428 IFLVTDG-AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
+ L+TDG E I M + GS P +Y G G N FL +++ +
Sbjct: 371 LILLTDGDPTAGETNIEKIMANVKEAIGSKFP-LYCLGFGYDVNFDFLTKMSLENNAVAR 429
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
Y+ +IQ+Q + +L +I +
Sbjct: 430 RIYEDSDADIQLQGFYDEVAVPLLTDIQL 458
>gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Meleagris gallopavo]
Length = 881
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/347 (23%), Positives = 153/347 (44%), Gaps = 53/347 (15%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+VIFI+DISGSM G+ ++ T+ AL L + D FN + F + + + ++ AT E
Sbjct: 282 KNVIFIIDISGSMSGREIQQTREALLKILDDIKEDDHFNFILFGSDVHTWKETLIKATPE 341
Query: 387 AVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG---- 434
++ A +++ GI+ G TN+ + K ++ML + + S II ++TDG
Sbjct: 342 NLDEARKFVRGID--TKGLTNLYGGMMKGIDMLNAAHEGNLVPKRSASIIIMLTDGQPNV 399
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
+ + + I +K + + +Y G G ++ FL +A+ ++G Y
Sbjct: 400 GISNTQDIQTHVKKAIEGKYT----LYNLGFGYGVDYNFLEKMALENKGLARRIYPDSDS 455
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP-SRIPDLSSE--------SPLIVSGR 545
+Q+Q + + +L ++ ++ YP + I D++ S ++V+GR
Sbjct: 456 ALQLQGFYDEVSNPMLRDVELN----------YPENEIADVTKNSFKHFYDGSEIVVAGR 505
Query: 546 Y----QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIP---------LDRICAKQQIDLL 592
Q +K +G LS + K Q AK ++R+ A I+ L
Sbjct: 506 LRDINQNSLTVDVKGEGANDALSFTSQQDKEQTAKAFQEQEYIFGDFIERLWAYLTIEQL 565
Query: 593 TAQ---AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNAS 636
+ A E L + +S++ + T M++ + +E N +
Sbjct: 566 LEKRISATGEEKDNLTAEALDLSLKYKFVTPLTSMVVTKPEEDENQA 612
>gi|432090835|gb|ELK24134.1| Inter-alpha-trypsin inhibitor heavy chain H1 [Myotis davidii]
Length = 686
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 114/241 (47%), Gaps = 13/241 (5%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
K+++F++DIS SM+G+ ++ TK AL L + P D F++V F + + S+ A+
Sbjct: 147 LNKNLVFVIDISNSMEGQKVKQTKEALLKILEDVRPEDYFDLVLFGSQVQSWRGSLVPAS 206
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGS-------IPIIFLVTDGA-V 436
++ A ++ +F G+TN+ L +++L + S PI+ ++TDG
Sbjct: 207 AANLQAAQDFVR-SFSLAGATNLNGGLLLGIDILNKGQRSNPELSNHAPILIMLTDGEPT 265
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
E E +K+ + G P +Y G G + FL +++M + G Y+
Sbjct: 266 EGETDRTQILKNVRSAIGGKFP-LYNLGFGHNVDFSFLEVMSMENNGRAQRIYEDHDATQ 324
Query: 497 QMQKLFTRGFSSVLANIAIDTLKD-LDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
Q+Q + + + +L ++ + KD + E + + S ++V+GR + + K
Sbjct: 325 QLQGFYNQVATPLLVDVELQYPKDTVSELTQHSHK--QYYEGSEIVVAGRIADQKLSSFK 382
Query: 556 A 556
A
Sbjct: 383 A 383
>gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus]
gi|187609595|sp|A2VE29.1|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5;
Short=ITI heavy chain H5; Short=ITI-HC5;
Short=Inter-alpha-inhibitor heavy chain 5; Flags:
Precursor
gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus]
Length = 940
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 122/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNIV F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIVGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T ++ +I + G T+I L + +++L +
Sbjct: 337 SNRIKVWKDHLVSVTPNSIRDGKVYIH-HMSPSGGTDINGALQRGIQLLNDYVAHNDIED 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ ++ +TDG V + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLVVFLTDGKPTVGETHTFKILNNTREATRGRVC--IFTVGIGADVDFKLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ Q+ + + +L++I +D L E + P+ + S +
Sbjct: 454 NCGLTRRVHEDHDARAQLIGFYDEIRTPLLSDIRVDYPPSLVE-RATRTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + D L + + FVV L KD+P++
Sbjct: 513 VIAGKLVDRAMDRLHVEVTASNSKKFVV-----LKKDVPVE 548
>gi|334348381|ref|XP_003342055.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5-like [Monodelphis domestica]
Length = 1031
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 146/359 (40%), Gaps = 50/359 (13%)
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
+ F Y P K K+V+F++D S SM G L TK+AL L L P D FNIV
Sbjct: 367 DYFVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIV 423
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG---- 423
F+ ++ + T + + +I + G T+I L K +E+L
Sbjct: 424 GFSNRIKVWQDRLVSVTPNNIRDSKVYIH-HMTPTGGTDINGALQKGIELLNGHIADNDI 482
Query: 424 ---SIPIIFLVTDG-AVEDERQICDAM-KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
S+ +I +TDG A E Q + ++ G +C I+T GIG+ + L L+
Sbjct: 483 DARSVSLIVFLTDGKATVGETQSSKILSNTKEAAQGRVC--IFTIGIGNDVDFKLLEKLS 540
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ + G + D+ + Q+ + + +L++I ID E + + P+ + S
Sbjct: 541 LENCGMTRRVQEEDNAKEQLIGFYDEIRTPLLSDIRIDYPPGTVE-HVTKTLFPNYFNGS 599
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL------------------ 580
+I++G+ D L + + FVV L D+P+
Sbjct: 600 EIIIAGKLVDSHLDNLHVEVTASNSKKFVV-----LKTDVPVGSQSTGNDVTGVSTPKGQ 654
Query: 581 --DRICAKQQIDLLTAQ----AWF-----SEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
DR ++ LT + +W E K L EK ++V L +T + + E
Sbjct: 655 DADRNYIERLWSYLTIKELLTSWLKSNDEQEKKHLMEKARALAVNYQFLTPFTSLEVKE 713
>gi|328867084|gb|EGG15467.1| type A von Willebrand factor domain-containing protein
[Dictyostelium fasciculatum]
Length = 854
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 94/200 (47%), Gaps = 11/200 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
K + IF++D SGSM G +E K L + + L FNI F G TY LF TS+ L
Sbjct: 294 KGEFIFLIDCSGSMAGFTIEKAKKTLQILMRSLPTNSYFNIYCFGG-TYKSLFQTSV-LY 351
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
+ + A ++I G T + +PL + + T IF++TDG + + +
Sbjct: 352 DDDNLAAASEYISSIDANMGGTQLISPLKRIYDFETFPNHPRQ-IFIITDGEIPNRDYVI 410
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
D + S+ RI+T G+GS + + L+ + GYY + ++E Q+ L
Sbjct: 411 DFVASKRD-----VSRIFTVGLGSDVDKALVTGLSNVCGGYYELVTNNATMETQVLSLMA 465
Query: 504 RGFSSVLANIAIDTLKDLDE 523
R L NI ID KDL +
Sbjct: 466 RSVEPALTNIKID-WKDLQQ 484
>gi|344246215|gb|EGW02319.1| Inter-alpha-trypsin inhibitor heavy chain H5-like protein
[Cricetulus griseus]
Length = 983
Score = 75.9 bits (185), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 115/249 (46%), Gaps = 17/249 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G L+ TK A+ LS + D FNI++F
Sbjct: 268 FIHYFAP---RGLPPVEKNVVFVIDVSGSMFGTKLQQTKKAMNTILSDMQASDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ + ++ + AT + + A ++ + AGG T+I A L A +L +S
Sbjct: 325 SDKVNIWKAEGSIQATVQNIHSAKNYLSL-MEAGGWTDINAALLAAASVLNHSNQEPGKG 383
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G IP++ +TDG + S + +++ G + LR L++
Sbjct: 384 RGVGQIPLVVFLTDGEPTAGVTTPSVILSNIRQALGHRVSLFSLAFGDDADFSLLRRLSL 443
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP-SRIPDLSSES 538
+RG Y+ +Q++ L+ +LA++ +D L L P + P+ S
Sbjct: 444 ENRGEARRIYEDTDAALQLEGLYEEISKPLLADVHLDYLGGL--VGASPGAYFPNYFCGS 501
Query: 539 PLIVSGRYQ 547
L+V+G+ Q
Sbjct: 502 ELVVAGKVQ 510
>gi|405963485|gb|EKC29054.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Crassostrea gigas]
Length = 343
Score = 75.9 bits (185), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 107/227 (47%), Gaps = 9/227 (3%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
KDV+ I+D+SGSM G L+ K+A+ LS L GD FNI+ F+ + S +A K+
Sbjct: 6 KDVLLILDVSGSMYGTKLKQMKDAVLGILSDLHEGDRFNILKFSSSLSFYKKSSVIANKK 65
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG---SIPIIFLVTDGA-VEDERQI 442
V A ++ N +A G T+I + L L + +IF +TDG E +
Sbjct: 66 TVREAKIYVK-NMVAAGGTDINSALMDGSFFLNKAENFTERAQVIFFLTDGQPTSGETNL 124
Query: 443 CDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+++ R N + IY+ G ++ F++ +A+ + G Y+ +Q++
Sbjct: 125 NKILENVRFANWHAFP--IYSLAFGQGADYNFIKKVAIQNNGLSRKIYEDSDATLQIKGF 182
Query: 502 FTRGFSSVLANIAIDTLKDLDEFE-MYPSRIPDLSSESPLIVSGRYQ 547
+ ++VL +++ L + + + P S ++V+G+ +
Sbjct: 183 YDEVSAAVLKDVSFKYLNNASSVQNLTKISFPSYLLGSEIVVAGKIE 229
>gi|291227856|ref|XP_002733898.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like,
partial [Saccoglossus kowalevskii]
Length = 627
Score = 75.9 bits (185), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 153/353 (43%), Gaps = 57/353 (16%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSME-LAT 384
+K+V+F++D+SGSM G + TK AL V L + D FNI+ F+ E + +M LAT
Sbjct: 299 QKNVLFVIDVSGSMDGAKMGQTKEALRVILDDMRSFDRFNILTFSYEVSFWKENMMILAT 358
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEML-------TNSRGSIPIIFLVTDGA-V 436
+E + A ++ N A G TN L + VEML N+ S ++ ++TDG
Sbjct: 359 QENILEAKNFVN-NLRASGGTNFNGGLVEGVEMLRRVTDDAENTERSAFLVIMLTDGQPT 417
Query: 437 EDERQICDAMKSRLTN-GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
E Q+ ++ T G ++ G G N FL+ +++ ++G Y+
Sbjct: 418 SGETQLTKIQENAKTYIDGQYS--LFCLGFGGDVNFKFLQKISLENQGIARRIYEDADGY 475
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q+ +GF +A M S P+ + S L+V+GR D L+
Sbjct: 476 LQL-----KGFYDEVAT-------------MTQSTFPNYFNGSELVVAGRLVDN--DVLQ 515
Query: 556 AKGFLGDLSNFVVELKLQL----------AKDIP---LDRICAKQQIDLLTAQAWFS--- 599
+ + ++ +EL ++ K IP ++R+ A I L + +
Sbjct: 516 TR-ITANTADTQLELDAEIDPTVMPSVLVNKHIPEDFIERLWAYLTIKELLKKRVLTVDP 574
Query: 600 -EDKRLEEKVSKMSVQTGVLCEYTRMIIVETDE------RNNASESPGTKKGS 645
E+ L K +S++ + T M++++ +E R A ES S
Sbjct: 575 EENASLSAKALDLSLKYNFVTPLTSMVVIKPEEPTTNAARGEAEESEDNTANS 627
>gi|123482632|ref|XP_001323846.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121906718|gb|EAY11623.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 720
Score = 75.9 bits (185), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 106/215 (49%), Gaps = 11/215 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K + FI+D SGSM G +E+ K L + + L G F+I+ F G +Y S+ +
Sbjct: 240 KSEFYFIIDCSGSMSGSRIENAKFCLNILIHSLPIGCRFSIIQF-GNSYKEVVSICDYSN 298
Query: 386 EAVERAHQWIG-INFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+ V+ A I IN GG T+I +PL + +G I IFL+TDG V + IC
Sbjct: 299 KNVKYAMSAIARINADMGG-TDILSPLEYVFKKKL-GKGFIRKIFLLTDGEVHNSDMICS 356
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++ N RI+ G+GS + ++ ++ S G Y D D++ + ++
Sbjct: 357 RVQKERENN-----RIFAIGLGSGADPGLIKNISAKSGGNYVLIADDDNMNNMIVEIMKS 411
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
S L+NI+I + D+ EM+P+ P L + +P
Sbjct: 412 ALSPSLSNISIQG--ESDQTEMWPTPYPPLIARNP 444
>gi|354476049|ref|XP_003500237.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5-like protein-like [Cricetulus griseus]
Length = 1319
Score = 75.9 bits (185), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 107/226 (47%), Gaps = 22/226 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +K+V+F++D+SGSM G L+ TK A+ LS + D FNI++F
Sbjct: 268 FIHYFAP---RGLPPVEKNVVFVIDVSGSMFGTKLQQTKKAMNTILSDMQASDYFNIISF 324
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ + ++ + AT + + A ++ + AGG T+I A L A +L +S
Sbjct: 325 SDKVNIWKAEGSIQATVQNIHSAKNYLSL-MEAGGWTDINAALLAAASVLNHSNQEPGKG 383
Query: 423 ---GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
G IP++ +TDG V I ++ L + S +++ G + LR
Sbjct: 384 RGVGQIPLVVFLTDGEPTAGVTTPSVILSNIRQALGHRVS----LFSLAFGDDADFSLLR 439
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDL 521
L++ +RG Y+ +Q++ L+ +LA++ +D L L
Sbjct: 440 RLSLENRGEARRIYEDTDAALQLEGLYEEISKPLLADVHLDYLGGL 485
>gi|410963185|ref|XP_003988146.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Felis
catus]
Length = 1149
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNI+ F
Sbjct: 295 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIIGF 351
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + V +I + G T+I L +A+ +LT+
Sbjct: 352 SNRIKVWKDHLVSVTPDNVRDGKVYIH-HMSPSGGTDINGALQRAIRLLTDYVAHDDVGD 410
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ ++ +TDG V + + +R G IC I+T GIG+ + L L++
Sbjct: 411 RSVSLVIFLTDGKPTVGETHTLRILNNTREAARGQIC--IFTIGIGNDVDFKLLEKLSLE 468
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + D Q+ + + +L++I ID E + + P+ + S +
Sbjct: 469 NCGLTRHVLEEDDAGSQLVGFYDEIRTPLLSDIRIDYPPAAVE-RVTRTLFPNYFNGSEI 527
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + + L + + FVV L D+P++
Sbjct: 528 VIAGKLAHRTVERLHVEVTASNSKKFVV-----LKADVPVE 563
>gi|313228337|emb|CBY23487.1| unnamed protein product [Oikopleura dioica]
Length = 620
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 128/259 (49%), Gaps = 35/259 (13%)
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
+ F++D SGSM G+ L+ + + L AL+ L P D+FNIV+ + + +FS AT ++
Sbjct: 49 ITFVIDTSGSMFGEKLKASVDGLLAALAGLSPADNFNIVSSSRKEEIFSQEALPATAHSI 108
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDG---AVEDERQIC 443
RA +W+ + GG ++I + A+ +T S + IP+I +++D A E+ QI
Sbjct: 109 ARAKEWL-MTLRHGGVSSIMTGVQSAIWHITASDDKKRIPLIVILSDSVSDATEENFQI- 166
Query: 444 DAMKSRLTNGGSICPRIYTFGIG--SYCNHYFLRMLAMISRGYYGAAYDLDS-IEIQMQK 500
A+ S ++N + I +G + FL+ L+ +RG +DL +EI +K
Sbjct: 167 -ALNS-ISNSAAERISISALAVGYTDTSSWQFLQRLSTRNRGLAFRLFDLTKFVEIIPEK 224
Query: 501 L-----FT---RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+ F+ RG + V + +++ DL + + ++ L+V G++ GK P
Sbjct: 225 VDGIKKFSRSLRGLTDVHFSFNSESVADLTD-----TSFSHFTAGEDLVVLGKFVGKAP- 278
Query: 553 TLKAKGFLGDLSNFVVELK 571
FL NF +E +
Sbjct: 279 -----AFL----NFTIEYR 288
>gi|130498817|ref|NP_001076116.1| inter-alpha-trypsin inhibitor heavy chain2 precursor [Oryctolagus
cuniculus]
gi|11041696|dbj|BAB17301.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus]
Length = 946
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/346 (23%), Positives = 153/346 (44%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVVDFNHNIRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRG-----SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L N+ G S+ +I LV+DG
Sbjct: 369 QIADAKRYIE-KIQPNGGTNINEALLRAIFILNEANNMGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIQDNVSLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + + S ++V+G++ PD LK
Sbjct: 488 KFYNQVSTPLLRNVQFNYPHTSVTDVTQNSFH-----NYFGGSEIVVAGKFD---PDKLK 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWF- 598
+ ++ EL L+ L+KD D ++ A I+ L A+
Sbjct: 540 QIQSIITATSANTELVLETLAQMDDLEEFLSKDKHADPDFTEKLWAYLTINQLLAERSLA 599
Query: 599 ---SEDKRLEEKVSKMSVQTGVLCEYTRMIIVET--DERNNASESP 639
+ +R+ + + +MS++ ++ T M+I DER A P
Sbjct: 600 PTAATKRRITKTILQMSLEHHIVTPLTSMVIENDAGDERMLADSPP 645
>gi|145503451|ref|XP_001437701.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404855|emb|CAK70304.1| unnamed protein product [Paramecium tetraurelia]
Length = 588
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 84/181 (46%), Gaps = 13/181 (7%)
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFST 378
A ++K+ + + +FI+D SGSM G + K AL + L L FNI++F Y
Sbjct: 332 ADNQKIIRGNYLFIIDRSGSMSGSRISKAKEALILFLKSLPQDSEFNIISFGSNFYSLWN 391
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-----SRGSIPIIFLVTD 433
++ ++ ++E+A + G T I PL EM+ N S + +FL+TD
Sbjct: 392 ESKMYSQNSLEQAINHVQSMDANLGGTRIIVPLK---EMVYNKYYGASNKTTLNVFLLTD 448
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
G ED D + + RIYT GIG C+ Y +R +A + G + D +
Sbjct: 449 G--ED---FADPIIDLVQKNNRAQTRIYTLGIGEGCSQYLIRRVAEVGNGKHQIVSDKED 503
Query: 494 I 494
I
Sbjct: 504 I 504
>gi|313246079|emb|CBY35036.1| unnamed protein product [Oikopleura dioica]
Length = 620
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 128/259 (49%), Gaps = 35/259 (13%)
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
+ F++D SGSM G+ L+ + + L AL+ L P D+FNIV+ + + +FS AT ++
Sbjct: 49 ITFVIDTSGSMFGEKLKASVDGLLAALAGLSPADNFNIVSSSRKEEIFSQEALPATAHSI 108
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDG---AVEDERQIC 443
RA +W+ + GG ++I + A+ +T S + IP+I +++D A E+ QI
Sbjct: 109 ARAKEWL-MTLRHGGVSSIMTGVQSAIWHITASDDKKRIPLIVILSDSVSDATEENFQI- 166
Query: 444 DAMKSRLTNGGSICPRIYTFGIG--SYCNHYFLRMLAMISRGYYGAAYDLDS-IEIQMQK 500
A+ S ++N + I +G + FL+ L+ +RG +DL +EI +K
Sbjct: 167 -ALNS-ISNSAAERISISALAVGYTDTSSWQFLQRLSTRNRGLAFRLFDLTKFVEIIPEK 224
Query: 501 L-----FT---RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD 552
+ F+ RG + V + +++ DL + + ++ L+V G++ GK P
Sbjct: 225 VDGIKKFSRSLRGLTDVHFSFNSESVADLTD-----TSFSHFTAGEDLVVLGKFVGKAP- 278
Query: 553 TLKAKGFLGDLSNFVVELK 571
FL NF +E +
Sbjct: 279 -----AFL----NFTIEYR 288
>gi|410919731|ref|XP_003973337.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Takifugu rubripes]
Length = 1254
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 91/191 (47%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+FI+D SGSM G+ +E T+ AL L+ L D F I+ F+ + + + A +
Sbjct: 267 KNVVFIIDQSGSMHGRKIEQTRTALVRILNDLAEDDFFGIITFDSRIFQWKRELVQANQA 326
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDERQICD 444
+E A N G+T+I A + K +ML +GS I+ L+TDG +
Sbjct: 327 NLESART-FARNIKDNGATDINAAVLKGADMLNAHPRQGSASILILLTDGDPTTGETNLE 385
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G + FL +++ + G Y+ ++Q++ +
Sbjct: 386 RIQSNVRKAIADKFPLYCLGFGFDVDFEFLEKMSLQNNGVARRIYEDSDADLQLKDFYNE 445
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 446 VATPLLTDVTM 456
>gi|432936008|ref|XP_004082075.1| PREDICTED: poly [ADP-ribose] polymerase 4-like [Oryzias latipes]
Length = 1013
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/432 (21%), Positives = 186/432 (43%), Gaps = 58/432 (13%)
Query: 86 LQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGK 145
LQ+ + V C L +V ++ + + + + + P+G+ ++ G EA I+GK
Sbjct: 387 LQLPLQSIHVKCKLIDLLSQVI-VFQKYTNLSTVPIEAKYVFPLGNSATVCGFEAFINGK 445
Query: 146 SYHTQLIALGENDGAGKS--ASVETG--SFL----KPNIFTLTLPQIDGGSYLSIRLRWS 197
Q+ E + A K +VE G ++L P++FT+++ + G+ + I++ +
Sbjct: 446 HVVGQV---KEKETARKEYRQAVEKGHGAYLMDQDAPDVFTISVGNLPPGATVLIKVTYV 502
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTP-------------AIKKI------PKREKIHLNVNA 238
+L +G + F P V P +++K+ + L+++
Sbjct: 503 SELVVSEG----RIHFSLPGSVAPWQESDGLNQTTQVSVEKVCVTDKSQSSREFSLDMSV 558
Query: 239 GTGTEVL-CNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQ 297
E++ +++H +K R D + + L D D + S H+ +
Sbjct: 559 EMPNEIVYLKSNTHQIKTKRTDCKAVISVVPGQPLHPYGFDLDIGF--SDVHLPRMWVEN 616
Query: 298 SPSLHDVDQREMFCMYLL-PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALS 356
P D+ CM + P S ++VI ++D S SM+G+ L + L
Sbjct: 617 HP-----DKDSQACMLVFYPDFEVSSGSAAEEVILLLDTSESMRGESLRSAQRIALQLLH 671
Query: 357 KLDPGDSFNIVAF---NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
+L+ N++ F + E +L + + E ++ A +I + GG T + PL +
Sbjct: 672 RLNRDFRLNVILFGSDHKEAFLKAQPV----SEVLQDAESFIKSGSLVGGGTELWRPL-R 726
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
A+ +L SRG + + L++DG +++ + +L G+ R++T G+ S N +
Sbjct: 727 ALGLLPPSRG-VRNLLLLSDGHIQNPE-----LTLQLLRDGAAHSRLFTCGLSSTANRHM 780
Query: 474 LRMLAMISRGYY 485
LR LA G Y
Sbjct: 781 LRALAQAGGGAY 792
>gi|281212631|gb|EFA86791.1| hypothetical protein PPL_00596 [Polysphondylium pallidum PN500]
Length = 749
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 103/218 (47%), Gaps = 10/218 (4%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKE 386
+ IF++D SGSM G ++ K AL + + L FNI F + LF TS ++ +
Sbjct: 298 EFIFLIDCSGSMSGSQIQKAKLALEILMRSLTEKSKFNIWLFGSSHKSLFPTS-QIYSDA 356
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
+E A +I G T + APL KA+ +F++TDG + D + D
Sbjct: 357 TLESASAYISKIDANLGGTELYAPL-KAIFAQAYDPQYPRQLFILTDGEISDRDRTID-- 413
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
L S+ RI+T GIGS + + L+ +GYY +E ++ KL +
Sbjct: 414 ---LAGKDSLTTRIFTLGIGSGVDRNLVVGLSKSCKGYYDFIDSNTEMENRVMKLMSIAM 470
Query: 507 SSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSG 544
+++NI +D DLD + P + + S+ +I+SG
Sbjct: 471 EPIISNIKVD-WADLDVIQA-PKVVRPIYSKERMIISG 506
>gi|154411703|ref|XP_001578886.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121913087|gb|EAY17900.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 710
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 160/367 (43%), Gaps = 48/367 (13%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
F++D SGSM G +E+ K L + + L F+I+ F G +Y + T + V+
Sbjct: 245 FLIDCSGSMYGSRIENAKFCLNLLIHSLPIDSRFSIIKF-GTSYEEIFPICDYTNKNVKI 303
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A + I T+I +PL + V T G IFL+TDG V + IC + +
Sbjct: 304 AMRQIKDLDADMDGTDILSPL-EYVYTQTTKNGYHRKIFLLTDGQVHNSDVICSLAQEKR 362
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVL 510
N RIY G+GS + ++ +++ S G Y D D++ ++ +L S L
Sbjct: 363 DNN-----RIYAIGLGSGADPGLIKNVSLKSWGNYVLIADKDNMNEKVIELMESSISPYL 417
Query: 511 ANIAIDTLKDLDEFEMYPSRIPDLSSESP------------LIVSGRYQGK-----FPDT 553
NI+I + + EM+PS P + S +P +++SG + + P +
Sbjct: 418 TNISIQSENTVT--EMWPSPCPPIYSRNPQNFFIKSPFVENILISGFVEKEEICLTIPVS 475
Query: 554 LKAKGFLGDLSNFVVELKLQLAKDIPLDRICAK-QQIDLLTAQAWFSE--DKRLEEKVSK 610
K LG L L + LDR+ + L +++ SE D L+ + K
Sbjct: 476 -KCNDNLG-LKQLFNRYILDDYESKILDRVINRIPYFYLRDIRSYISERVDSVLKNECVK 533
Query: 611 MSVQTGVLCEYTRMIIV----ETDERNN-------------ASESPGTKKGSKKSDHQKI 653
+S+++GVLC +T I V + D R N A E P +G KS +K
Sbjct: 534 LSIESGVLCYFTSYIGVGYQSDIDMRINYDAYIRTHCLCSIAPEPPNLNRGCVKSGSRKQ 593
Query: 654 LDSEVPK 660
+ ++ K
Sbjct: 594 FEFDIKK 600
>gi|405962206|gb|EKC27907.1| Poly [ADP-ribose] polymerase 4 [Crassostrea gigas]
Length = 1559
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/314 (28%), Positives = 136/314 (43%), Gaps = 33/314 (10%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKE 386
+VI ++D S SM+ D K A + L +DP FN+V+F T LF +S + A K
Sbjct: 915 EVILMIDSSNSMKDSSHRDAKKAALLVLHLMDPTWRFNVVSFGTAFTELFPSS-QPANKF 973
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
V+ A Q I G+T + PL L S IFL +DG + ++ +
Sbjct: 974 NVQTAKQHIQKLQANQGNTELYRPLHSY--YLLKPESSTRNIFLFSDGYINNDEDTLAKI 1031
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG---YYGAAYDL---DSIEIQMQK 500
+ R++T GI S N + L+ +A + G +Y + + D + Q+QK
Sbjct: 1032 SQNTQH-----TRVFTMGISSVANRHLLKAIARVGAGSHEFYDSKFKSKWEDKVRSQLQK 1086
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFE---MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
VL ++++D D D+ P +I L S S +V G + TLKAK
Sbjct: 1087 ----AAQPVLTSVSVDWRHD-DQHPAPIQAPQQITALFSGSRQVVYGFVDNCYMATLKAK 1141
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAW----FSEDKRLEEKVSKMSV 613
++S V L + K L R+ AK I + W S D R +V KM++
Sbjct: 1142 IGGQEISTVVSTSDLSVTKGKMLHRLTAKGVI-----RDWEDGVLSPD-RTSHEVKKMNL 1195
Query: 614 QTGVLCEYTRMIIV 627
+ V+ R IV
Sbjct: 1196 KQYVIELSKRYSIV 1209
>gi|119613593|gb|EAW93187.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_b [Homo
sapiens]
Length = 523
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 142/329 (43%), Gaps = 31/329 (9%)
Query: 230 EKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYE-SEVLKWSNIDFDFSYTVSPS 288
EK ++ + GTEV + + L Q + +L S +++K +I+ S S
Sbjct: 92 EKFRISTSLAAGTEVTFSLAYEELLQRHQGQYQLVVSLRPGQLVKRLSIEVTVSERTGIS 151
Query: 289 HIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTK 348
+I+ + F Y P + +K+V+F++D+S SM G +E TK
Sbjct: 152 YIYD---------------DYFIHYFAP---RGLPPMEKNVVFVIDVSSSMFGTKMEQTK 193
Query: 349 NALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNI 407
A+ V LS L D FNI++F+ ++ + AT + V A ++ A G T++
Sbjct: 194 TAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHC-MEADGWTDV 252
Query: 408 CAPLTKAVEMLTNSR---------GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
+ L A +L +S G IP+I +TDG + S +
Sbjct: 253 NSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQALGHRV 312
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTL 518
+++ G + LR L++ +RG Y+ +Q++ L+ +LA++ ++ L
Sbjct: 313 SLFSLAFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLADVRLNYL 372
Query: 519 KDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
L + + P+ S L+V+G+ Q
Sbjct: 373 GGLVGASPW-AVFPNYFGGSELVVAGQVQ 400
>gi|449276861|gb|EMC85223.1| Inter-alpha-trypsin inhibitor heavy chain H3, partial [Columba
livia]
Length = 866
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 113/241 (46%), Gaps = 37/241 (15%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K++IF++DISGSM G+ +E T+ AL L + D FN + F E + + ++ AT +
Sbjct: 243 KNIIFVIDISGSMAGREIEQTREALLKILDDIKEDDFFNFILFGSEVHTWKETLIKATPK 302
Query: 387 AVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG---- 434
++ A ++ GI G TN+ + + + ML ++ + S II ++TDG
Sbjct: 303 NLDEARAFVRGIG--TSGMTNLHGGIMRGINMLNDAHEQNVVPKRSASIIIMLTDGQPNV 360
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
V + + I +K + P +Y G G ++ FL +A+ ++G Y
Sbjct: 361 GVSNTQDIQTDVKKAIEGK---YP-LYNLGFGYGVDYNFLEKMALENKGLARRIYPDSDS 416
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP-SRIPDLSSE--------SPLIVSGR 545
+Q+Q + + +L ++ ++ YP + I DL+ S ++V+GR
Sbjct: 417 ALQLQGFYDEVSNPMLTDVELN----------YPENEISDLTKNTFKHFYDGSEIVVAGR 466
Query: 546 Y 546
+
Sbjct: 467 F 467
>gi|38348336|ref|NP_940912.1| inter-alpha-trypsin inhibitor heavy chain H6 precursor [Homo
sapiens]
gi|74762375|sp|Q6UXX5.1|ITIH6_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H6;
AltName: Full=Inter-alpha-trypsin inhibitor heavy chain
H5-like protein; Short=Inter-alpha inhibitor H5-like
protein; Flags: Precursor
gi|37181448|gb|AAQ88537.1| ITI-like protein [Homo sapiens]
gi|119613592|gb|EAW93186.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_a [Homo
sapiens]
gi|162318154|gb|AAI57043.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct]
gi|162318494|gb|AAI56206.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct]
Length = 1313
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 127/284 (44%), Gaps = 32/284 (11%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V I G V ++D + F Y P + +K+V+F++D+S
Sbjct: 245 DFLVQYDVVMEDIIGDV-----QIYD----DYFIHYFAP---RGLPPMEKNVVFVIDVSS 292
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G +E TK A+ V LS L D FNI++F+ ++ + AT + V A ++
Sbjct: 293 SMFGTKMEQTKTAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLH 352
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSR---------GSIPIIFLVTDG----AVEDERQIC 443
A G T++ + L A +L +S G IP+I +TDG V I
Sbjct: 353 C-MEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTAGVTTPSVIL 411
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
++ L + S +++ G + LR L++ +RG Y+ +Q++ L+
Sbjct: 412 SNVRQALGHRVS----LFSLAFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYE 467
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+LA++ ++ L L + + P+ S L+V+G+ Q
Sbjct: 468 EISMPLLADVRLNYLGGLVGASPW-AVFPNYFGGSELVVAGQVQ 510
>gi|426240689|ref|XP_004014226.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5 [Ovis aries]
Length = 936
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/279 (24%), Positives = 121/279 (43%), Gaps = 21/279 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNIV F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIVGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T ++ +I + G T+I L + +++L +
Sbjct: 337 SNRIKVWKDHLVSVTPNSIRDGKVYIH-HMSPSGGTDINGALQRGIQLLNDYVAHNDIED 395
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
S+ ++ +TDG + +K G IC I+T GIG+ + L L++ +
Sbjct: 396 RSVSLVVFLTDG--KPTVGETHTLKILXATRGRIC--IFTVGIGADVDFKLLEKLSLENC 451
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
G ++ Q+ + + +L++I +D L E + P+ + S +++
Sbjct: 452 GLTRRVHEEHDARAQLIGFYDEIRTPLLSDIRVDYPPGLVE-RATRTLFPNYFNGSEIVI 510
Query: 543 SGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+G+ + D L + + FVV L KD+P++
Sbjct: 511 AGKLVDRAMDRLHVEVTASNSKKFVV-----LKKDVPVE 544
>gi|332860822|ref|XP_001152090.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6 [Pan
troglodytes]
Length = 1312
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 127/284 (44%), Gaps = 32/284 (11%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V I G V ++D + F Y P + +K+V+F++D+S
Sbjct: 245 DFLVQYDVVMEDIIGDV-----QIYD----DYFIHYFAP---RGLPPMEKNVVFVIDVSS 292
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G +E TK A+ V LS L D FNI++F+ ++ + AT + V A ++
Sbjct: 293 SMFGTKMEQTKMAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLH 352
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSR---------GSIPIIFLVTDG----AVEDERQIC 443
A G T+I + L A +L +S G IP+I +TDG V I
Sbjct: 353 C-MEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTAGVTTPSVIL 411
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
++ L + S +++ G + LR L++ +RG Y+ +Q++ L+
Sbjct: 412 SNVRQALGHRVS----LFSLAFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYE 467
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+LA++ ++ L L + + P+ S L+V+G+ Q
Sbjct: 468 EISMPLLADVRLNYLGGLVGASPW-AVFPNYFGGSELVVAGQVQ 510
>gi|123468942|ref|XP_001317686.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121900426|gb|EAY05463.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 688
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 146/304 (48%), Gaps = 25/304 (8%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL--ATKEAV 388
F+VD SGSM G + + + + + L G F+I+ F F T ++ + E V
Sbjct: 248 FVVDCSGSMSGARIINAVKCMRLFIQSLPLGCRFSIIKFGTS---FETVLQPCDYSDENV 304
Query: 389 ERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC-DAM 446
++A + +N GG T+I +PL + + L G + IFL+TDG V + C A+
Sbjct: 305 DKALNLLKSVNAKMGG-TDILSPL-QHIAGLKPQPGFVKQIFLLTDGEVNNPDITCATAL 362
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
K+R N RI++ G+GS + ++ LA S G Y D D++ ++ L +
Sbjct: 363 KNRNEN------RIFSIGLGSGADPGLIKGLAKKSGGNYIMIADEDNMNEKVITLLSSAI 416
Query: 507 SSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNF 566
+ NI+I T D E++PS P + S +P + K P + G S+
Sbjct: 417 APAATNISIQT--DKPATEVWPSPCPSVYSNNP----QSFLIKAPHSENVL-ISGTCSDE 469
Query: 567 VVELKLQLAK---DIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTR 623
V++ + ++K ++ + ++ A+ I+ Q + L +K ++S+++GVL ++T
Sbjct: 470 TVDIVIPVSKCDDNLGMKQLFARYIIEDYETQILLQRNDELRQKCIQLSIESGVLSKFTA 529
Query: 624 MIIV 627
+ V
Sbjct: 530 YVGV 533
>gi|321479342|gb|EFX90298.1| hypothetical protein DAPPUDRAFT_309818 [Daphnia pulex]
Length = 828
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 108/447 (24%), Positives = 181/447 (40%), Gaps = 50/447 (11%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETG--SFLK----PNIFTLTL 181
P+ + + AE+ G+S ++ E ++++G +FL P++F +
Sbjct: 57 PVDEGAGVCHFTAEVDGRSIEG-VVKETEEARNDYEQAIQSGHSAFLVEEKLPDVFKAKV 115
Query: 182 PQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKRE---------- 230
+ GS IRL + +L GE +P P YV P E
Sbjct: 116 GNLAAGSGAKIRLTYVTELKVEGGEIRFYLPTTIAPRYVPPTDLSSATAELASINYTQTN 175
Query: 231 --KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVL-KWSNIDFDFSYTVSP 287
++ +NV T + + S H + +VG + ++ K S +D DF V
Sbjct: 176 QYQVEINVTVQTASIIQKIYSPTHKIDVNHNVGDNPMKAKLQLFEKVSCMDRDFVIYVEV 235
Query: 288 SHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDT 347
+ L+ S + + L+P K + IF+VD SGSMQG + +
Sbjct: 236 AEPHQPRLIHEKSKNGTS---ALMLSLVPSFKLDE--MKIEAIFVVDRSGSMQGPGIVEA 290
Query: 348 KNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA------TKEAVERAHQWIGINF 399
K AL + L L NIV F G TY LFS+S + TKE Q++ +
Sbjct: 291 KRALKLFLHSLPIDCCVNIVGF-GSTYKPLFSSSRKYGHECLALTKECA----QFLEADL 345
Query: 400 IAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPR 459
G T I PL E T + G +F++TDG V + ++ ++ + N R
Sbjct: 346 ---GGTEIYDPLAWIFEQPTIN-GYSRQVFVLTDGEVSNVNKVISLVRCQRGNA-----R 396
Query: 460 IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK 519
++ G+GS +HY + +A G A + +E ++ + L +I I+ +
Sbjct: 397 VFALGLGSSASHYLVEGIARAGNGTALFASLDERLEKKVMQQLQDALQPALTDIKIN-WQ 455
Query: 520 DLDEFEMYPSRI-PDLSSESPLIVSGR 545
DE E S P + +E L+ G+
Sbjct: 456 GYDETEQLSSPFNPPIQTEKTLMGYGK 482
>gi|310800880|gb|EFQ35773.1| vault protein inter-alpha-trypsin [Glomerella graminicola M1.001]
Length = 1046
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 160/380 (42%), Gaps = 38/380 (10%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP------- 227
++FT T+ + G+++S+ + + +L + D E + F P + P P
Sbjct: 129 DVFTTTVGNVPAGAHISVEIVYLGELKH-DAEVD-GIRFTIPTSIAPRFGSYPGTLLDKP 186
Query: 228 ----KREKIHLNVNAG--TGTEVLCNTSSHH--------LKQLRRDVG-KLGYSYESEVL 272
+ I + V+A G+ + S H +R D L ++ + L
Sbjct: 187 TSVSTQGGISIVVDAEVPAGSVIKSIQSPSHPIAVSVGSTSTMRADATPSLQFASATLSL 246
Query: 273 KWSNIDFDFSYTVSPSHIFG--GVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVI 330
+ +D DF V ++ VL P+ + ++ L+P + K +++
Sbjct: 247 GTAQLDDDFIIQVIATNTSSPVAVLETHPT---ISHQQALMATLVPKFKLAPT--KPEIV 301
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVE 389
FI D SGSMQG ++D KNAL V L L G FNI +F ++LF SM K +E
Sbjct: 302 FICDRSGSMQGNKIDDLKNALKVFLKSLPVGAMFNICSFGSSYSFLFDKSMTY-DKNTLE 360
Query: 390 RAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSR 449
A + + G T P+ K E+ +F++TDG + +E ++ +
Sbjct: 361 TAMKHVDTFDANYGGTQFHGPIQKTFELRHTDLDL--EVFVLTDGEIWNEERLFTMVNDH 418
Query: 450 LTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS 508
+ G+I R++T G+G +H + LA G+ A + + + ++ ++ +
Sbjct: 419 VQKSQGAI--RLFTLGVGRDVSHSLIEGLARAGNGFSQAVSENEKMSGKVVRMLKGALTP 476
Query: 509 VLANIAIDTLKDLDEFEMYP 528
+ + ++ D E E P
Sbjct: 477 HVNDYTLEVKYDDAESETEP 496
>gi|348575580|ref|XP_003473566.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Cavia
porcellus]
Length = 949
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 154/346 (44%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 310 KNILFVIDVSGSMWGIKMKQTVEAMKTILEDLRAEDQFSVVDFNHNVRTWRNDLVSATKT 369
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 370 QITDAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 428
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 429 ELKLSKIQKNVKQNIPDNVSLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSSQLK 488
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ ++ D+ + + P+ S ++V+G++ P+ L+
Sbjct: 489 KFYNQVSTPLLRNVQFSYTHPSVTDVTQ-----NSFPNYFGGSEIMVAGKFD---PNKLQ 540
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQ---A 596
+ ++ +L L+ L+KD D ++ A I+ L A+ A
Sbjct: 541 QIQSIITATSANTDLVLETLAQMDDLEDFLSKDKHADPNFTRKLWAYLTINQLLAERSLA 600
Query: 597 WFSEDKR-LEEKVSKMSVQTGVLCEYTRMIIVET--DERNNASESP 639
+ DKR + + + +MS++ V+ T M+I T DER A P
Sbjct: 601 PTAADKRKITKTILQMSLEHHVVTPLTSMLIENTVGDERMLADAPP 646
>gi|82617834|gb|ABB84829.1| VIT-vWFA-RpoN multidomain protein [uncultured delta proteobacterium
DeepAnt-1F12]
Length = 1156
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 101/216 (46%), Gaps = 14/216 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+D+I ++D SGSM G+PL + +A + +L D ++ FN + F + AT
Sbjct: 300 RDLICLIDTSGSMSGRPLAQAQRVVAALVDRLGDDDRLELIEFNHQVRRFRSEPVPATTL 359
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
A W+G + AGG+T + + A+ L SR +I L+TDG + E+QI +
Sbjct: 360 GKAAAMTWLG-SLTAGGATEMHTAVLAALRPL-RSRAQRQVI-LITDGHIGFEQQIVKEL 416
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
L R++T G+G+ N R A G + +E Q+L R
Sbjct: 417 IENLPE----TSRLHTVGVGTSVNRTLTRGAARAGGGAEIIVGLEEDVERVAQRLIVRTS 472
Query: 507 SSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPL 540
+ ++ N+ + D L+ + P+R+PDL + P+
Sbjct: 473 APLVTNLVVEGDALRGV-----APARVPDLFAGCPV 503
>gi|330793834|ref|XP_003284987.1| hypothetical protein DICPUDRAFT_148826 [Dictyostelium purpureum]
gi|325085108|gb|EGC38522.1| hypothetical protein DICPUDRAFT_148826 [Dictyostelium purpureum]
Length = 849
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 91/193 (47%), Gaps = 10/193 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELAT 384
K + IF+VD SGSM G P+ K AL + + L + FNI F +G +F S+ L
Sbjct: 295 KSEYIFVVDCSGSMSGTPITKAKRALEICVRSLSEQNKFNIYCFGSGFNKVFPESL-LYN 353
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQIC 443
E++ A +I G T + P+ + +S P +FL+TDGAV Q+
Sbjct: 354 DESLAMASAYIDNISANLGGTELLPPIKDI--LGKDSDAEYPRQVFLLTDGAVSARDQLI 411
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
D + R++TFGIG+ + + L+ +G+Y D +E ++ KL +
Sbjct: 412 DYVGKEAN-----TTRMFTFGIGTSVDKELVIGLSKACKGFYEFILDNGDMEDKVMKLLS 466
Query: 504 RGFSSVLANIAID 516
LANI I+
Sbjct: 467 IANEPTLANIKIE 479
>gi|123490500|ref|XP_001325627.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121908529|gb|EAY13404.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 688
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 143/311 (45%), Gaps = 25/311 (8%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
FI+D SGSM G +++ K L + + L G F+I+ F G Y + T E V
Sbjct: 238 FIIDCSGSMSGSCIQNAKLCLNIFMHSLPIGCRFSIIKF-GSDYEVALHPCDYTDENVSE 296
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A + + G T+I +PL +E LT +G I +FL+TDG + ++C +
Sbjct: 297 AMKQLNNIDAEMGGTDILSPLKYVME-LTPKQGFIKQVFLLTDGQDSNTNELCALAQENR 355
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ--KLFTRGFSS 508
TN RI++ GIGS + + ++ S G Y D +S ++ + +L S
Sbjct: 356 TNN-----RIFSIGIGSGADKDLIINVSQKSGGNYVFVDDDESEKLNEKVIELLNSAISY 410
Query: 509 VLANIAIDTLKDLDEFEMYPSRIPDL---SSESPLIVSGRYQGKFPDTLKAKGFLGDLSN 565
L ++ + + EM+PS P L +S++ +I S + D + G + N
Sbjct: 411 ALMHVCFQ--GERNRAEMWPSPCPPLFSKNSQNFIIKSDK-----TDNVVISGI---IDN 460
Query: 566 FVVELKLQLAK---DIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYT 622
V + + ++K + + ++ ++ ID + D+ LE+K ++S+ +GV C+ T
Sbjct: 461 EEVSISIPVSKCDESLGMKQLYSRNLIDDYESMLSHKPDRNLEQKCIELSLTSGVPCKCT 520
Query: 623 RMIIVETDERN 633
V + N
Sbjct: 521 SFTGVPCNSDN 531
>gi|432858531|ref|XP_004068892.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Oryzias latipes]
Length = 910
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 152/345 (44%), Gaps = 44/345 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ + T+ AL L + D F +V F+ + ++ AT+E
Sbjct: 296 KNVVFVIDRSGSMSGRKMSQTREALLAILDDIHEEDYFALVQFDHTIDFWKETLTKATEE 355
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDG----A 435
V A ++ GG T+I L K + ML R S +I L+TDG
Sbjct: 356 NVAEAMAYVKTIQDYGG-TDIHGGLMKGINMLIQDRQANRLPERSTDMIILLTDGMPNSG 414
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V D I + ++S + GG+I ++ G G ++ FL +L+ ++G ++
Sbjct: 415 VSDLPSIQNDVRSAI--GGNIS--LFCLGFGKDVDYSFLDVLSKQNQGLARRIFEASDAA 470
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
+Q+Q + + +L + + ++ +F + S L + S ++V+GR
Sbjct: 471 LQLQGFYEEVSTPMLLEVDLLYPDNVVDF-VTTSHFRQLFNGSEIVVAGRLSDN-----N 524
Query: 556 AKGFLGDLSNFVVELKLQL---AKDIPLDRICAKQQIDL--LTAQAW-FSEDKRLEEKVS 609
FL ++S +E K Q+ A + D +C ++ T + W + ++L EK +
Sbjct: 525 INNFLVEVSGQGLEEKFQVEGQASTLDWDVLCPDEEYVFGDFTERLWAYLTIQQLLEKST 584
Query: 610 K---------------MSVQTGVLCEYTRMIIVETDERNNASESP 639
MS+Q + T M++++ E + SP
Sbjct: 585 TAGPDEKINATAKALDMSLQYSFVTPLTSMVVIKP-EVEEGTSSP 628
>gi|239616825|ref|YP_002940147.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1]
gi|239505656|gb|ACR79143.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1]
Length = 730
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 153/345 (44%), Gaps = 32/345 (9%)
Query: 303 DVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD 362
+ D+R F + L+P R + KD++FI+DISGSM G+ +E K AL L L GD
Sbjct: 251 EADRRGYFLLTLVPPREPER-IIPKDIVFILDISGSMSGQKIEKAKLALLQVLQMLHEGD 309
Query: 363 SFNIVAFNGETYLFSTSMELATKEAVERAHQWIGI-NFIAGGSTNICAPLTKAVEML--T 419
F+I+ FN E + + + +R + + +AGG TNI L + +E+L
Sbjct: 310 RFSIITFNNEVNNLTERLLPFS----DRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQ 365
Query: 420 NSRGSIPIIFLVTDGA-VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
++ ++ +TDGA E I ++ ++ FG+G N L LA
Sbjct: 366 STDDRYKVVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGYDVNAELLDELA 425
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
G + + I+ ++ +L+ + V++N+ ++ + D + P L S +
Sbjct: 426 EKGGGKVKYIVENEEIDEKVLELYRMIETPVMSNVHLE-INGTDISYVLPKGPYTLFSGT 484
Query: 539 PLIVSGRY--QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQA 596
L +SG Y +G TL + L+ +V+ K++ D L R + I L AQ
Sbjct: 485 ALRISGVYYHEGVVNVTLSWE---EKLNGVIVQNKIKYKFD--LTRNSSFPFIATLWAQK 539
Query: 597 WFSE---------------DKRLEEKVSKMSVQTGVLCEYTRMII 626
SE + + +++ +S + ++ E+T +I
Sbjct: 540 RISELANIYRYDETLTPAQREEIRQEIINLSKKYNIINEFTSYLI 584
>gi|119776240|ref|YP_928980.1| von Willebrand factor type A (vWA) domain-containing protein
[Shewanella amazonensis SB2B]
gi|119768740|gb|ABM01311.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Shewanella amazonensis SB2B]
Length = 713
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/339 (24%), Positives = 152/339 (44%), Gaps = 31/339 (9%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+D +F++D SGSM GK + L KL D F I+ F+ T FS
Sbjct: 331 RDWVFVLDKSGSMNGK-YATLVEGVRQGLGKLPAQDRFRIILFDESTQEFSKGFVPVDSN 389
Query: 387 AVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDG----AVEDERQ 441
+ +A W+ GI+ G T++ L +A+ L R + + L+TDG V ++R+
Sbjct: 390 NINQALAWVEGIS--PGNGTDLYQGLKRALTPLDADRSTG--VVLITDGVANVGVTEKRR 445
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ M+ + R++TF +G+ N L + +S G + + D I + +
Sbjct: 446 FLELMQQQDV-------RLFTFIMGNSANTPLLVPMTRLSNGVATSVSNADDIVGHLMNI 498
Query: 502 FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLG 561
++ NI +D + + ++ P I L L++ G Y L +G
Sbjct: 499 TSKLTHQAYRNIRLD-MDGVKINDLTPQEIASLYRGEQLVLFGHYFAAGEAKLTLTMDIG 557
Query: 562 DLSN-FVVELKL-QLAKDIP-LDRICAKQQIDLLTAQAWFSE--DKRLEEKVSKMSVQTG 616
S + ++ L + A D P L+R+ A I+ L Q + E D E+ + ++++ G
Sbjct: 558 AESREYSTKVMLPEAALDYPELERMWAFSAINGLQEQMDYLEQKDSDKEKAIEDIAIEYG 617
Query: 617 VLCEYTRMIIVETD-------ERNNASESPGTKKGSKKS 648
+L +YT +++VE D ER+N T++ ++K+
Sbjct: 618 LLTDYTSLLVVEEDVFRQLGIERDNRRRV-STEQAARKT 655
>gi|73949158|ref|XP_535195.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Canis
lupus familiaris]
Length = 946
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 152/346 (43%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + AT+
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSVIDFNHNVRTWRNDLVSATRT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDG--AVE 437
V A ++I G TNI L +A+ +L + S+ +I LV+DG V
Sbjct: 369 QVTDAKKYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPESVSLIILVSDGDPTVG 427
Query: 438 DER--QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ + +I +K + + S +++ GIG ++ FL+ L+ +RG Y
Sbjct: 428 ELKLSKIQKNVKQHIRDNIS----LFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTS 483
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
Q++K + + + +L N+ + + ++ + + S ++V+G++ PD L+
Sbjct: 484 SQLKKFYNQVSTPLLRNVQFN-YPHMSVSDVTQNSFHNYFGGSEIVVAGKFN---PDKLE 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLDRICAKQQIDLLTAQAWFSE---- 600
+ ++ EL L+ L+KD D K+ LT +E
Sbjct: 540 QLQSIITATSANTELVLETLAQMDDLEDFLSKDKHADPNFTKKLWAYLTINQLLAERSLA 599
Query: 601 -----DKRLEEKVSKMSVQTGVLCEYTRMIIVET--DERNNASESP 639
+++ + + +MS+ +L T M+I DER A P
Sbjct: 600 PTAASKRKITKTILQMSLDHHILTPLTTMVIENEAGDERMLADSPP 645
>gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium
DG1235]
gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium
DG1235]
Length = 808
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 168/396 (42%), Gaps = 58/396 (14%)
Query: 259 DVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGT 318
+ G+L Y +ES+ + +D DF + G L + D+ F M + PG
Sbjct: 217 ETGELLYRFESQ---GAILDEDFVFYYMLEENLPGRLEVLTYRENEDKPGTFMMVMTPGV 273
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS- 377
D +F +D+SGSMQGK L + + A+ +L P D F +VAFN + +
Sbjct: 274 DLHPLEGGADFVFALDVSGSMQGK-LHTLASGVKKAIGQLKPEDRFRVVAFNNTAFDLNR 332
Query: 378 ---TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDG 434
++ E +E R Q + G TN+ A + A+E L R + + LVTDG
Sbjct: 333 GWVSATEANLRETFARLDQ-----LNSNGGTNVYAGVHLALERLDADR--VATLILVTDG 385
Query: 435 AVEDERQICDAMKSRLTNGGSICP------------RIYTFGIGSYCNHYFLRMLAMISR 482
+TN G + P R Y F +G+ N ++++ S
Sbjct: 386 ---------------VTNQGIVDPKAFYKLMHKQDLRFYGFLLGNSSNWPLMQLMCDASG 430
Query: 483 GYYGAAYDLDSI--EIQMQKLFTRGFSSVLANIAIDTLK--DLDEFEMYPSRIPDLSSES 538
G Y A + D I E+ + K S A I+I+ + D+ +F +I +
Sbjct: 431 GSYRAVSNSDDIIGEVMIAKNKIVYESMRHAEISIEGVNTHDVGDF-----KIGKIHYGD 485
Query: 539 PLIVSGRYQ--GKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP-LDRICAKQQIDLLTA 594
L++ GRY+ G+ +LKA+ G F E+ +LA P L+R+ A Q+ +
Sbjct: 486 QLVLFGRYEQGGRARVSLKAR-INGQDEVFSTEIDFPELATAHPELERMWALDQVRKIQV 544
Query: 595 Q--AWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
A F + + + + V ++ + T MI ++
Sbjct: 545 NEMAGFVDAGEAKVAIRDIGVAYQIVTDETSMIALD 580
>gi|426240691|ref|XP_004014227.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Ovis
aries]
Length = 946
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/346 (22%), Positives = 150/346 (43%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTEDHFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A +I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVVDAKNYIE-KIQPSGGTNINEALLRAIFILNEANNLGMLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y +Q++
Sbjct: 428 ELKLSKIQKNVRQNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLK 487
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + P+ S ++V+G+ P+ L
Sbjct: 488 KFYNQVSTPLLRNVQFNYPQTSVTDVTQ-----NSFPNYFGGSEIVVAGKVD---PEKLG 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWF- 598
+ ++ EL L+ L+KD D ++ A I+ L A+
Sbjct: 540 QLQSIITATSANAELVLETLAEMDGLEDFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 599
Query: 599 ---SEDKRLEEKVSKMSVQTGVLCEYTRMIIVET--DERNNASESP 639
+E +++ + + +MS+ ++ T M+I DER A P
Sbjct: 600 PTAAEKRKITKTILQMSLDHHIVTPLTAMVIENEAGDERMLADSPP 645
>gi|395547385|ref|XP_003775165.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6
[Sarcophilus harrisii]
Length = 1202
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 129/295 (43%), Gaps = 31/295 (10%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V+ + G V + F Y P + +K++IF++D+SG
Sbjct: 332 DFVIQYDVAMKDVIGDVQISD---------GYFVHYFAP---RGLPRMQKNIIFVIDVSG 379
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G L+ TK A+ V L L P D FNIV F+ +++ T+ + AT ++RA IG
Sbjct: 380 SMTGTKLKQTKKAMHVILGDLCPEDYFNIVTFSDTVHIWKTAGSVQATSPNIQRAKAHIG 439
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSRGSIP-----IIFLVTDG----AVEDERQICDAMK 447
AG T++ A L A +L + G+ P ++ +TDG V I +
Sbjct: 440 -RMEAGRWTDMNAALLAAASILNQTGGAGPSGGARLVIFLTDGEPTAGVTSPTHILANAQ 498
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
L + ++ +G + LR L++ +RG + +Q++ + R S
Sbjct: 499 RALAGQAA----LFGLALGDDADLPLLRRLSLENRGTAHRIREDQDAAVQLKGFYDRISS 554
Query: 508 SVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD---TLKAKG 558
+L++I + + + S P+ S L+V G+ P+ L+A+G
Sbjct: 555 PLLSDIHLHYQARQAGRADPAGSHFPNYFGGSELVVVGQLGHGEPELCVRLEAQG 609
>gi|323498500|ref|ZP_08103493.1| von Willebrand factor type A (vWA) domain-containing protein
[Vibrio sinaloensis DSM 21326]
gi|323316389|gb|EGA69407.1| von Willebrand factor type A (vWA) domain-containing protein
[Vibrio sinaloensis DSM 21326]
Length = 697
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 156/375 (41%), Gaps = 36/375 (9%)
Query: 306 QREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
+R + PG +R +D +F++D SGSM GK + L KL D F
Sbjct: 291 KRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMSGK-YSTLVEGVRQGLGKLPSEDRFR 349
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
+V F+ TY + V +A Q + + G TN+ + A L + R +
Sbjct: 350 VVMFDSNTYDLTGGFVAVNAANVTKALQSVEQVEPSNG-TNLYEGMAAATRKLDDDRPT- 407
Query: 426 PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
I LVTDG V ++R+ D M+ R++TF +G+ N L + +S
Sbjct: 408 -GIVLVTDGVANVGVTEKRRFFDLMEKHDV-------RLFTFIMGNSANTPLLVPMTKLS 459
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSE 537
G + + D I + + ++ NI +D +KDL P +I L
Sbjct: 460 NGIATSVSNADDIIGHLMSITSKLTYQAYRNIQLDIDGVKIKDLT-----PQQISSLYRG 514
Query: 538 SPLIVSGRY-QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP-LDRICAKQQIDLLTA 594
L V G Y +G D G G + ++ L A + P L+R+ A I L A
Sbjct: 515 EQLTVFGHYFKGGEADIKLTMGIGGQSREYRTKVTLPDTATEHPELERMWAFSAIRDLQA 574
Query: 595 QAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMIIVETD-------ERNNASESPGTKKGS 645
Q + E D E+ + ++++ G+L +YT +++VE + ER N + +
Sbjct: 575 QMDYLEQKDHDKEQAIEDIALEYGLLTDYTSLLVVEEEVFQQLGIERKNQTRVLKEQSAR 634
Query: 646 KKSDHQKILDSEVPK 660
+ +Q + + V K
Sbjct: 635 EIKQNQPVKPTRVDK 649
>gi|123414681|ref|XP_001304538.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121885997|gb|EAX91608.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 709
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 105/215 (48%), Gaps = 11/215 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K + FI+D SGSM G + + K L + + L G F+I+ F G++Y + S+ +
Sbjct: 240 KSEFYFIIDCSGSMSGSRIGNAKFCLNILIHSLPIGCRFSIIQF-GDSYKETVSICDYSN 298
Query: 386 EAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
V A I GIN GG T+I +PL + +G I IFL+TDG V + +IC
Sbjct: 299 RNVRNAMSAIAGINADMGG-TDILSPLEYVFKKKL-EKGFIRKIFLLTDGEVNNSDEICS 356
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ N RI+ G+GS + ++ +++ + G Y D ++ + +L
Sbjct: 357 KAQLERENN-----RIFAIGLGSGADPGLIKNVSIKNGGNYVLIADEYNMNNMIVELMKS 411
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
S L NI+I + D+ EM+P+ P L + +P
Sbjct: 412 AISPSLTNISIQG--ESDQTEMWPTPCPPLIARNP 444
>gi|410933173|ref|XP_003979966.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like,
partial [Takifugu rubripes]
Length = 590
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 92/191 (48%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+FI+D SGSM G+ +E T+ AL L+ L D F I+ F+ + + + A +
Sbjct: 267 KNVVFIIDQSGSMHGRKIEQTRTALVRILNDLAEDDFFGIITFDSRIFQWKRELVQANQA 326
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDERQICD 444
+E A + N G+T+I A + K +ML +GS I+ L+TDG +
Sbjct: 327 NLENARTF-ARNIKDNGATDINAAVLKGADMLNAHPRQGSASILILLTDGDPTTGETNLE 385
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G + FL +++ + G Y+ ++Q++ +
Sbjct: 386 RIQSNVRKAIADKFPLYCLGFGFDVDFEFLEKMSLQNNGVARRIYEDSDADLQLKDFYNE 445
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 446 VATPLLTDVTM 456
>gi|326914347|ref|XP_003203487.1| PREDICTED: poly [ADP-ribose] polymerase 4-like [Meleagris gallopavo]
Length = 1617
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 123/579 (21%), Positives = 240/579 (41%), Gaps = 73/579 (12%)
Query: 103 FVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK 162
FV ++ + S + + P+ D+ ++ G EA ++G Q+ + +
Sbjct: 651 FVAQVVMFQTYTNQNSSPIEAKYVFPLDDKAAVCGFEAFVNGTHIIGQVKEKKQAHREYR 710
Query: 163 SASVE-TGSFL----KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPE 217
A E G++L P+IF +T+ I S + I++ + +LS ++G + F P
Sbjct: 711 KAISEGDGAYLMDQDAPDIFVITVGNICPNSTVLIKVTYITELSCQNG----CITFHMPA 766
Query: 218 YVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVG-KLGYSYESEVLKW-- 274
V+P + E N + +C LK+ D+ ++ YS ES + W
Sbjct: 767 SVSPWQQDKALNENTE---NTQDTVKKICIKQVETLKKFSLDMSIEMPYSIES-LHSWTH 822
Query: 275 ------------------SNID---FDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMY 313
S++D F +S +++ + + P ++ CM
Sbjct: 823 KLKIKKTECKAVIKTVDNSSLDSSGFGLDIWISHAYVPRMWVEKHP-----NKNSEACML 877
Query: 314 LLP---GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
+ A + ++I ++D S SM G L K AL +L+ + N++ F
Sbjct: 878 VFQPEFEAAFDEEQLSSEIIILLDCSNSMAGSALLQAKQIALHALKQLNSRQNVNVIKFG 937
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFL 430
FS+ + A K+ ++I G+T++ L + + +L S+G I+ L
Sbjct: 938 TNFSEFSSFSKDALKDLASLT-EFITSATATMGNTDLWKSL-RYLSLLFPSQGHRNIL-L 994
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY----- 485
++DG +++E +K + + R++T G+GS N + LR L+ G +
Sbjct: 995 ISDGHIQNESVTFQLVKDNVHH-----TRLFTCGVGSAANRHMLRSLSQYGAGAFEYFDS 1049
Query: 486 GAAYDLDS-IEIQMQKLFTRGFSSV---LANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+ Y+ ++ I+ Q+ ++F+ G SSV I+ K + P++I L + L+
Sbjct: 1050 KSKYNWEAKIQSQVSRIFSPGCSSVSIKWQQFDINAPKPMQA----PAQIQSLFTNERLL 1105
Query: 542 VSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQA 596
V G TL+A +L V +LQ L ++ A+ I +L
Sbjct: 1106 VYGFVPHCTQATLRAIINDQELQTVVSTTELQKTTGTVLHKLTARAFIRDYEDGILHENE 1165
Query: 597 WFSEDKR--LEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
E K+ L+ + +S++ ++ ++T I VE + N
Sbjct: 1166 TEHEMKKQILKNMIIHLSLENSIITQFTSFIAVEKRDTN 1204
>gi|254508395|ref|ZP_05120516.1| protein contAining a von Willebrand factor type A domain [Vibrio
parahaemolyticus 16]
gi|219548708|gb|EED25712.1| protein contAining a von Willebrand factor type A domain [Vibrio
parahaemolyticus 16]
Length = 696
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 146/338 (43%), Gaps = 29/338 (8%)
Query: 306 QREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
+R + PG +R +D +F++D SGSM GK + L KL D F
Sbjct: 291 KRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMSGK-YSTLVEGVRQGLGKLPSEDRFR 349
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
+V F+ TY + + + +A Q + + G TN+ + A+ L + R +
Sbjct: 350 VVMFDSNTYDLTGGFVAVNQSNISKALQAVEQVEPSNG-TNLYEGMAAAIRKLDDDRPT- 407
Query: 426 PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
I LVTDG V ++R+ + M+ R++TF +G+ N L + +S
Sbjct: 408 -GIVLVTDGVANVGVTEKRRFFELMEKHDV-------RLFTFIMGNSANTPLLVPMTKLS 459
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSE 537
G + + D I + + ++ NI +D +KDL P +I L
Sbjct: 460 NGVATSVSNADDIIGHLMSMTSKLTHQAYRNIQLDIDGVKIKDLT-----PEQISSLYRG 514
Query: 538 SPLIVSGRY-QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP-LDRICAKQQIDLLTA 594
L V G Y QG D G+ + ++ L A + P L+R+ A I L
Sbjct: 515 EQLTVFGHYFQGGEADIKLTMDIGGESREYRTKVMLPDTATEHPELERMWALSAIRDLQE 574
Query: 595 QAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMIIVETD 630
Q + E D+ E+ + ++++ G+L +YT +++VE +
Sbjct: 575 QMDYLEQKDRDKEQAIEDIALEYGLLTDYTSLLVVEEE 612
>gi|194227183|ref|XP_001916967.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Equus
caballus]
Length = 946
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 147/346 (42%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + AT
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTEDQFSVVDFNHNVRTWRNDLVSATTT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKKYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAHRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L NI + ++ D+ + + P+ S ++V+G++ P+ L+
Sbjct: 488 KFYNQVSTPLLRNIQFNYPHTSVTDVTQ-----NSFPNYFGGSEIVVAGKFH---PEKLE 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLDRICAKQQIDLLTAQAWFSE---- 600
+ ++ EL L+ L+KD D K+ LT +E
Sbjct: 540 QLQGIITATSANTELVLETLAEMDDLEDFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 599
Query: 601 -----DKRLEEKVSKMSVQTGVLCEYTRMIIVET--DERNNASESP 639
+++ + + +MS+ ++ T M+I DER A P
Sbjct: 600 PTAAAKRKITKTILQMSLDHHIVTPLTSMVIENEAGDERMLADSPP 645
>gi|395521727|ref|XP_003764967.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Sarcophilus harrisii]
Length = 1198
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P SR +K + IF++D SGSM G + K AL + L
Sbjct: 339 LHKDIPHHSVVMLNFCPDLQLSRATLRKTHGEFIFLIDRSGSMSGININRVKEALILMLK 398
Query: 357 KLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE 416
L P FNI+ F S ++ +E + A + I G TNI +PL
Sbjct: 399 SLMPTCLFNIIGFGSSFKTLFPSSQIYNEENLATACENIQHIRADMGGTNILSPLKWIAR 458
Query: 417 MLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRM 476
T++ G ++FL+TDG+V + ++ + ++ + R Y+FGIG ++
Sbjct: 459 QPTHA-GHPRLLFLLTDGSVNNTGKVLELVRIH-----AFTTRCYSFGIGPKACPRLVQG 512
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
LA +S+G + ++ +M K + + V++++ ++
Sbjct: 513 LAAVSKGSAEFLMQGERLQPKMIKSLKKAMAPVVSDVTVE 552
>gi|118385358|ref|XP_001025812.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89307579|gb|EAS05567.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 856
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 55/173 (31%), Positives = 87/173 (50%), Gaps = 15/173 (8%)
Query: 314 LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
L+ + SR F IF++D SGSM G+P++ AL + L L P FN+ +F G
Sbjct: 310 LIDHSESSRSEF----IFLLDRSGSMNGRPIKKATEALNLFLKSLPPNSYFNVYSF-GTR 364
Query: 374 Y--LFSTSMELATKEAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFL 430
Y +F S++ K ++E A + + NF A G T+I +PLT E+ G IFL
Sbjct: 365 YVPMFPNSVQYTGK-SLEIALKKVK-NFKANLGRTDILSPLTNIFEVQEKINGYNKQIFL 422
Query: 431 VTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+TDG V++ ++ RL + RI + G GS + + + A+ +G
Sbjct: 423 LTDGGVKNRDKVI-----RLIKKNNKNSRINSIGFGSGADQHLINTSAIAGKG 470
>gi|344278003|ref|XP_003410786.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Loxodonta
africana]
Length = 905
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 243 FVHYFAP---KDLPPLPKNVVFLLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 299
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + + +I AGG T+I L + + +L N
Sbjct: 300 SNRIKVWKDHLVSVTPDNIRDGKVFIHHMSPAGG-TDINGALQRGIRLLNNYVAHNDIED 358
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + ++ G IC I+T GIG+ + L L++
Sbjct: 359 RSVSLIIFLTDGKPTVGETHTLKILNNTKEAARGQIC--IFTIGIGNDVDFRLLEKLSLE 416
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q + + +L++I ID L E + + P+ + S +
Sbjct: 417 NCGLTRRVHEEEDAGSQFIGFYDEIRTPLLSDIRIDYPPSLVE-HVTKTLFPNYFNGSEI 475
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
I++G+ + D L + + FV+ L D+P++
Sbjct: 476 IIAGKLVDRKMDQLHVEVTASNSKKFVI-----LKTDVPVE 511
>gi|348521290|ref|XP_003448159.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Oreochromis niloticus]
Length = 939
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 124/280 (44%), Gaps = 13/280 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D+F+I+ FN +S + +
Sbjct: 302 KNIVFVIDVSGSMWGVKMKQTVQAMKAILDDLTIDDNFSIIDFNHNVRCWSEELVPGSSI 361
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I N G TNI L +AV+ML + S+ +I LV+DG
Sbjct: 362 QIADAKRYIE-NIKPNGGTNINEALLRAVQMLVKASNQKVIDPRSVSMIILVSDGDPTVG 420
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
A++ + +++ GIG ++ FL+ +A +RG Y Q++
Sbjct: 421 EIKLSAIQKNVKRVMREEFSLFSLGIGFDVDYDFLQRIATENRGMAQRIYANHDAAEQLR 480
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGF 559
+++ S +L I + +D E+ + S S L+V+G+ +TL +
Sbjct: 481 AFYSQVSSPLLRKITVQFQED-SVAEVTQNHFDKFFSGSELVVAGKVLPSESNTLTSFT- 538
Query: 560 LGDLSNFVVELKLQLAKDIPLDRICAKQQIDL--LTAQAW 597
+G + + L+ A LD AKQQ Q W
Sbjct: 539 MGSAARLDIILETD-ADTTELDAELAKQQHSFTGFARQMW 577
>gi|449487232|ref|XP_002190370.2| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Taeniopygia guttata]
Length = 1326
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 128/279 (45%), Gaps = 14/279 (5%)
Query: 243 EVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLH 302
E+L + S H+ + + G + + LK N DF PS +++
Sbjct: 263 EILIHPSEPHMPHVLMEEGDMTPAEYERHLKGKN-DFIKGTKKDPSAKKKMEIIRKRLNK 321
Query: 303 DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
D+ + + P + +K + IF+VD S SM G + K+AL V L L
Sbjct: 322 DIPHHPVIMLNFCPDLRAVQPGLRKAQGEFIFLVDRSRSMSGVNISHAKDALLVILKSLM 381
Query: 360 PGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
P FN++ F G T+ LF S + +E++ A Q I GS N+ +PL K V
Sbjct: 382 PACLFNVIGF-GSTFKTLFPVS-QAYCEESLLMACQSIRRIRADMGSINLLSPL-KWVTR 438
Query: 418 LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
RG ++FL+T GA+ + ++ + +++ + R Y+FGIG +R L
Sbjct: 439 QPLHRGHPRLLFLLTGGAISNTGKVLELLRTTPCST-----RCYSFGIGPNACRRLVRGL 493
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
A +SRG + + ++ +M + + + VL++++++
Sbjct: 494 AAVSRGVSEFLAEGERLQPKMIRSLKKAMAPVLSDVSVE 532
>gi|283837909|ref|NP_001164635.1| poly [ADP-ribose] polymerase 4 [Gallus gallus]
Length = 1615
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 120/574 (20%), Positives = 234/574 (40%), Gaps = 66/574 (11%)
Query: 103 FVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK 162
FV ++ + S + + P+ D+ ++ G EA ++G+ Q+ + +
Sbjct: 651 FVAQVVMFQTYTNQNSSPIEAKYVFPLDDKAAVCGFEAFVNGRHIIGQVKEKKQAHREYR 710
Query: 163 SASVE-TGSFL----KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPE 217
A E G++L P+IF +T+ I S + I++ + +LS +G + F P
Sbjct: 711 KAISEGDGAYLMDQDAPDIFVITVGNICPNSTVLIKVTYITELSCLNG----CITFHMPA 766
Query: 218 YVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVG-KLGYSYESEVLKWS- 275
V+P ++ LN N + +C LK+ D+ ++ YS ES + W+
Sbjct: 767 SVSPW------QQDKALNENTQDTIKKICVKQVETLKKFSLDMSIEMPYSIES-IHSWTH 819
Query: 276 -----------------NIDFD---FSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL 315
N D F + SH + + + + M
Sbjct: 820 KLKIKKTECKAVIKTVENSSLDSSGFGLDIWISHAYLPRMWVEKHPNKNSEACMLVFQPE 879
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
A + ++I ++D S SM G L K AL + + N++ F G +
Sbjct: 880 FEAAFDEEQLSSEIIILLDCSNSMAGSALLQAKQIALHALKQFSSRQNVNLIKF-GTNFS 938
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
+S T + + ++I G+T++ L + + +L S+G I+ L++DG
Sbjct: 939 EFSSFSKNTSKDLASLTEFITSATATMGNTDLWKTL-RYLSLLFPSQGHRNIL-LISDGH 996
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY-----GAAYD 490
+++E +K + + R++T G+GS N + LR L+ G + + Y+
Sbjct: 997 IQNESVTFQLVKDNVHH-----TRLFTCGVGSTANRHMLRSLSQYGAGAFEYFDSKSKYN 1051
Query: 491 LDS-IEIQMQKLFTRGFSSV---LANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRY 546
++ I+ Q+ ++F+ G SSV I+ K + P++I L + L+V G
Sbjct: 1052 WEAKIQNQVSRIFSPGCSSVSIKWQQFDINAPKPMQA----PAQIQSLFTNERLLVYGFV 1107
Query: 547 QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSED 601
TL+A +L V +LQ L ++ A+ I +L E
Sbjct: 1108 PHCTQATLRAIINDQELQTVVSTTELQKTTGTVLHKLTARAFIRDYEDGILHENETEHEM 1167
Query: 602 KR--LEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
K+ L+ + +S++ ++ ++T I +E + N
Sbjct: 1168 KKQILKNMIIHLSMENSIITQFTSFIAIEKRDAN 1201
>gi|348571245|ref|XP_003471406.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A
domain-containing protein 5B1-like [Cavia porcellus]
Length = 1216
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 119/277 (42%), Gaps = 10/277 (3%)
Query: 243 EVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLH 302
E+L + S HL + + G + S + LK DF S +++
Sbjct: 274 EILIHPSEPHLPHVLMEKGDMTLSEFDQYLK-GRTDFIKGTKKDSSAERKTEIIRKRLHK 332
Query: 303 DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALSKLD 359
D+ + + P + +K + IF++D S SM K+A+ VAL L
Sbjct: 333 DIFHHSVIMLNFCPDLQSVQPNLRKTHGEFIFLIDRSNSMSETNFHRVKDAMLVALKSLM 392
Query: 360 PGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT 419
P FN++ F S + ++E++ A I G TNI +PL V
Sbjct: 393 PNCLFNVIGFGSMFKTVFPSSQTYSEESLAMACDNIQSMRADMGGTNIFSPLKWIVRQPV 452
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
+ +G ++FL+T GA+ + ++ + L + R Y+FGIG H ++ LA
Sbjct: 453 H-QGHPRLLFLITGGAISNTGKVLE-----LVRNHAFSTRCYSFGIGPSVCHRLVKGLAS 506
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+S+G + + ++ +M K R + VL+++ ++
Sbjct: 507 VSKGSAEFLAEGERLQPKMVKSLKRAMAPVLSDVTVE 543
>gi|354477377|ref|XP_003500897.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
isoform 2 [Cricetulus griseus]
Length = 1219
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + + IF++D S SM ++ K A+ VAL L P FN++ F
Sbjct: 341 FCPDLQSVQPNLRKAYG-EFIFLIDRSNSMSRTSIQRVKEAMLVALKSLRPACLFNVIGF 399
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S ++ +E + A I G+TNI +PL K V G +F
Sbjct: 400 GSTFKTVFASSQIYNEENLATACDCIQRTRADMGNTNILSPL-KWVLRKPVHHGHPRTLF 458
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDGAV + ++ + +++ ++ R Y+FGIG ++ LA +++G
Sbjct: 459 LITDGAVNNTGKVLELVRNHASST-----RCYSFGIGPSVCFRLVKGLASVTKGTAEFLE 513
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 514 EGERLQPKMIKSLKKAMAPVLSDVTVE 540
>gi|354477375|ref|XP_003500896.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
isoform 1 [Cricetulus griseus]
Length = 1219
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + + IF++D S SM ++ K A+ VAL L P FN++ F
Sbjct: 341 FCPDLQSVQPNLRKAYG-EFIFLIDRSNSMSRTSIQRVKEAMLVALKSLRPACLFNVIGF 399
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S ++ +E + A I G+TNI +PL K V G +F
Sbjct: 400 GSTFKTVFASSQIYNEENLATACDCIQRTRADMGNTNILSPL-KWVLRKPVHHGHPRTLF 458
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDGAV + ++ + +++ ++ R Y+FGIG ++ LA +++G
Sbjct: 459 LITDGAVNNTGKVLELVRNHASST-----RCYSFGIGPSVCFRLVKGLASVTKGTAEFLE 513
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 514 EGERLQPKMIKSLKKAMAPVLSDVTVE 540
>gi|354477379|ref|XP_003500898.1| PREDICTED: von Willebrand factor A domain-containing protein 5B1
isoform 3 [Cricetulus griseus]
Length = 1209
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + + IF++D S SM ++ K A+ VAL L P FN++ F
Sbjct: 341 FCPDLQSVQPNLRKAYG-EFIFLIDRSNSMSRTSIQRVKEAMLVALKSLRPACLFNVIGF 399
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S ++ +E + A I G+TNI +PL K V G +F
Sbjct: 400 GSTFKTVFASSQIYNEENLATACDCIQRTRADMGNTNILSPL-KWVLRKPVHHGHPRTLF 458
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDGAV + ++ + +++ ++ R Y+FGIG ++ LA +++G
Sbjct: 459 LITDGAVNNTGKVLELVRNHASST-----RCYSFGIGPSVCFRLVKGLASVTKGTAEFLE 513
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 514 EGERLQPKMIKSLKKAMAPVLSDVTVE 540
>gi|444378679|ref|ZP_21177873.1| hypothetical protein D515_2688 [Enterovibrio sp. AK16]
gi|443677225|gb|ELT83912.1| hypothetical protein D515_2688 [Enterovibrio sp. AK16]
Length = 695
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 145/338 (42%), Gaps = 29/338 (8%)
Query: 306 QREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFN 365
+R + PG +R +D +F++D SGSM GK + L KL D F
Sbjct: 291 KRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMSGK-YSTLVEGVRQGLGKLPSEDRFR 349
Query: 366 IVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
+V F+ TY + A V +A Q + + G TN+ ++ A+ L + R +
Sbjct: 350 VVMFDSNTYDLTGGFVAANPANVSKALQSVEQVEPSNG-TNLYEGMSAAIRKLDDDRPT- 407
Query: 426 PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
I LVTDG V ++R+ D M+ R++TF +G+ N L + +S
Sbjct: 408 -GIVLVTDGVANVGVTEKRRFFDLMEKHDV-------RLFTFIMGNSANTPLLVPMTKLS 459
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSE 537
G + + D I + + ++ NI +D +KDL P +I L
Sbjct: 460 NGVATSVSNADDIIGHLMNITSKLTHQAYRNIQLDIDGVKIKDLT-----PEQISSLYRG 514
Query: 538 SPLIVSGRY-QGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI--PLDRICAKQQIDLLTA 594
L V G Y +G D G+ + ++ L + + L+R+ A I L
Sbjct: 515 EQLTVFGHYFKGGEADIKLTMDIAGESREYRTKVMLPDSATVHPELERMWAFSAIRDLQE 574
Query: 595 QAWFSEDK--RLEEKVSKMSVQTGVLCEYTRMIIVETD 630
Q + E K E+ + ++++ G+L +YT +++VE +
Sbjct: 575 QMDYLEQKDGDKEQAIEDIALEYGLLTDYTSLLVVEEE 612
>gi|344238868|gb|EGV94971.1| von Willebrand factor A domain-containing protein 5B1 [Cricetulus
griseus]
Length = 1215
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + + IF++D S SM ++ K A+ VAL L P FN++ F
Sbjct: 341 FCPDLQSVQPNLRKAYG-EFIFLIDRSNSMSRTSIQRVKEAMLVALKSLRPACLFNVIGF 399
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S ++ +E + A I G+TNI +PL K V G +F
Sbjct: 400 GSTFKTVFASSQIYNEENLATACDCIQRTRADMGNTNILSPL-KWVLRKPVHHGHPRTLF 458
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDGAV + ++ + +++ ++ R Y+FGIG ++ LA +++G
Sbjct: 459 LITDGAVNNTGKVLELVRNHASST-----RCYSFGIGPSVCFRLVKGLASVTKGTAEFLE 513
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 514 EGERLQPKMIKSLKKAMAPVLSDVTVE 540
>gi|281342668|gb|EFB18252.1| hypothetical protein PANDA_020017 [Ailuropoda melanoleuca]
Length = 885
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 80/346 (23%), Positives = 152/346 (43%), Gaps = 40/346 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 246 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSVIDFNHNVRTWRNDLVSATKT 305
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG +
Sbjct: 306 QIVDAKKYIE-KIQPSGGTNINEALLRAIFILNEANNLGMLDPESVSLIILVSDG--DPT 362
Query: 440 RQICDAMKSRLTNGGSICPR----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
C+ S++ R +++ GIG ++ FL+ L+ +RG Y
Sbjct: 363 VGNCELKLSKIQKNVKQNIRDNIALFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTS 422
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
Q++K + + + +L N+ + L ++ + + S ++V+G+Y P+ L+
Sbjct: 423 SQLKKFYNQVSTPLLRNVQFN-YPHLSVSDVTQNSFHNYFGGSEIVVAGKYT---PERLE 478
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFS 599
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 479 QLQSIVTATSANTELVLETLAEMDDLEDFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 538
Query: 600 EDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
+ K++K MS++ ++ T M+I DER A P
Sbjct: 539 STAAAKRKITKTILQMSLEHHIVTPLTAMVIENDAGDERMLADSPP 584
>gi|301788660|ref|XP_002929747.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Ailuropoda melanoleuca]
Length = 946
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 77/342 (22%), Positives = 148/342 (43%), Gaps = 34/342 (9%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSVIDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QIVDAKKYIE-KIQPSGGTNINEALLRAIFILNEANNLGMLDPESVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIRDNIALFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGF 559
K + + + +L N+ + L ++ + + S ++V+G+Y P+ L+
Sbjct: 488 KFYNQVSTPLLRNVQFN-YPHLSVSDVTQNSFHNYFGGSEIVVAGKYT---PERLEQLQS 543
Query: 560 LGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFSEDKR 603
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 544 IVTATSANTELVLETLAEMDDLEDFLSKDKHADPDFTKKLWAYLTINQLLAERSLASTAA 603
Query: 604 LEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
+ K++K MS++ ++ T M+I DER A P
Sbjct: 604 AKRKITKTILQMSLEHHIVTPLTAMVIENDAGDERMLADSPP 645
>gi|62531155|gb|AAH92555.1| LOC594926 protein, partial [Xenopus (Silurana) tropicalis]
Length = 895
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 110/251 (43%), Gaps = 28/251 (11%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + P K K++IFI+D S SM G ++ TK AL L + D FN V F
Sbjct: 261 FVHFFAPSKLKE---VPKNIIFIIDRSISMIGLKMQQTKEALLKILDDVKEHDHFNFVIF 317
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG--SIP- 426
+ ++ S+ AT E + RA ++ N G TNI L A+ +L + S+P
Sbjct: 318 DWGVEIWEQSLVKATPENLNRAKAYVR-NLYPKGWTNINDALLSAISLLDQAHDARSVPK 376
Query: 427 ----IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
+I +TDG + D ++ N +Y+ G G ++ FL L++ +
Sbjct: 377 RSASLIIFMTDGQPSTGERNLDKIQENARNAIRGKYSLYSLGFGVGVDYPFLEKLSLENS 436
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE----- 537
G Y+ +QM+ GF +AN TL D+ E + + I D++
Sbjct: 437 GVARRIYEESDAALQME-----GFYDEVAN---PTLMDI-ELQYPENAISDVTQNKFKHY 487
Query: 538 ---SPLIVSGR 545
S ++V+GR
Sbjct: 488 FDGSEIVVAGR 498
>gi|390342131|ref|XP_798930.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
[Strongylocentrotus purpuratus]
Length = 988
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 145/338 (42%), Gaps = 49/338 (14%)
Query: 172 LKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAI-------- 223
L+ N+ + +P ++ G RL + +KL R G + + FPE V +
Sbjct: 222 LENNLLVIAVP-VEPGERAYFRLTYERKLPRRLGMYEQKIGI-FPEQVVEVMNLDTVIIE 279
Query: 224 -KKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYES-EVLKWSNIDFDF 281
+ + + + L+ + + L H +++ + ++ YS + E L S +
Sbjct: 280 PQGLSRLDSFLLDQSDPSNPISLA----HRVQRRSDNRWEVHYSPRAREQLGVSPMGIMA 335
Query: 282 SYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQG 341
YTV + G L+D F Y P V +K++IF++DISGSM G
Sbjct: 336 DYTVRYDVVHGNDAGDIQVLND-----YFVQYFSP---SGLSVLRKNIIFVIDISGSMSG 387
Query: 342 KPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLFSTSMELATKEAVERAHQWIGINFI 400
L K+AL+ L + D FNI+ F+ + +L ST M +TKE V RA +++ +
Sbjct: 388 TKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGMLYSTKENVRRAKRFV-MGLQ 446
Query: 401 AGGSTNICAPLTKAVEMLTNSRGSIP-------IIFLVTDGAVED--------ERQICDA 445
+TN+ + V ML P ++ ++TDG ER + +A
Sbjct: 447 EMDNTNLHKAIISGVNMLRAESEQDPQEEEIVSMLIVLTDGNPNHGEIDKTIIERNVHEA 506
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ + ++ G G+ ++ FLR L++ + G
Sbjct: 507 INGDFS--------LFCIGFGADADYPFLRRLSLQNHG 536
>gi|196015342|ref|XP_002117528.1| hypothetical protein TRIADDRAFT_61586 [Trichoplax adhaerens]
gi|190579850|gb|EDV19938.1| hypothetical protein TRIADDRAFT_61586 [Trichoplax adhaerens]
Length = 1262
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 93/393 (23%), Positives = 174/393 (44%), Gaps = 41/393 (10%)
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK----SASVETGSFL 172
S+ + + P+ D+ ++ G EA I+ K Q+ E + A K + + + G++L
Sbjct: 674 SSQPIEAKYVFPLDDRAAVCGFEAFINNKHIIGQV---KEKEKAHKEYEAAVARKDGAYL 730
Query: 173 ----KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEF-----SVNVPFKFPEYVTPAI 223
P+IFT+++ + S + I++ + +LS F S P+K E ++
Sbjct: 731 MDEESPDIFTVSVGNLPPNSKVLIKITYVTELSMEGDCFDFFLPSNLAPWKKQEALSNVT 790
Query: 224 KKIPKREKIH--LNVNAGTGTEV-------LCNTSSHHLKQLRRDVGKLGYSYESEVLKW 274
+ KI+ +VNA V ++ +H ++Q + ++ K+
Sbjct: 791 QDAVSTVKINEVKSVNATVQVRVEMPFEIRSISSPTHQIRQ--KTTATKAVVETGQMTKF 848
Query: 275 SNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVD 334
N F ++ H+ + + P D D + CM + + ++I +D
Sbjct: 849 DN-GFRLLIRLAEIHVPRMWVEKYP---DTDSQA--CMLTFYPEFDVTENPRPEIIIALD 902
Query: 335 ISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVERAHQ 393
+S SM+ + L DT+ A+ L+ L P FNIV F + LF E++ KE+V A +
Sbjct: 903 MSNSMK-QCLMDTQKIAALILTNLPPECRFNIVVFGSAHNELFPMYQEVS-KESVNMAIK 960
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRG-SIPIIFLVTDGAVEDERQICDAMKSRLTN 452
+IG + G++N + ++ + S+ +FL++DG DE I ++ +
Sbjct: 961 FIGSLSASWGNSNFYHVIDNFTHIVKELKANSVSNVFLISDGHFGDENSITAILRRDKID 1020
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
R++TF G N YF+R LA I GY+
Sbjct: 1021 N----LRLFTFSTGDTSNRYFMRTLAKIGAGYH 1049
>gi|336273928|ref|XP_003351718.1| hypothetical protein SMAC_00261 [Sordaria macrospora k-hell]
Length = 914
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 96/199 (48%), Gaps = 6/199 (3%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELAT 384
+ +++F+ D SGSM G +E K+AL + L + G FNI +F +LF
Sbjct: 296 RPEIVFVCDRSGSMGGTRIEGLKSALRIFLKSIPVGAKFNICSFGSRHEFLFPEGSRSYD 355
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+E ++RA ++ + G T + PL A E + +FL+TDG + D+ Q+
Sbjct: 356 QETLQRAMEYTNLMRADFGGTEMYRPLEAAFE--KRYKDMDLEVFLLTDGEIWDQGQLFT 413
Query: 445 AMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+++ G+I R++T GIG+ +H + +A G+ + D + + ++ ++
Sbjct: 414 MTNEKVSESKGAI--RLFTLGIGNDVSHALIEGVARAGNGFAQSVTDGEKMNAKVVRMLK 471
Query: 504 RGFSSVLANIAIDTLKDLD 522
G + + + ++ D D
Sbjct: 472 AGLTPHIKDYTLEIKYDKD 490
>gi|242073572|ref|XP_002446722.1| hypothetical protein SORBIDRAFT_06g021180 [Sorghum bicolor]
gi|241937905|gb|EES11050.1| hypothetical protein SORBIDRAFT_06g021180 [Sorghum bicolor]
Length = 103
Score = 72.4 bits (176), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 51 MVYAVITDPKIVDNPDIPSYQPYVHGRCDPPSLIPLQMNAVELDVDC 97
M YAV+ D VD+PD+PSYQP+++GR DPP+LIPL+M VEL VDC
Sbjct: 1 MAYAVVVDSGAVDSPDVPSYQPHMYGRLDPPALIPLKMQEVELRVDC 47
>gi|432916772|ref|XP_004079376.1| PREDICTED: von Willebrand factor A domain-containing protein
5B2-like [Oryzias latipes]
Length = 1426
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 62/233 (26%), Positives = 107/233 (45%), Gaps = 8/233 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC LL + K + +++F+VD SGSM G ++ K A+ +AL L G NIV F
Sbjct: 397 FCPDLLSEPLELHKACR-ELLFLVDRSGSMSGTSIQRVKEAMVLALKSLPAGTMLNIVGF 455
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
TS +L T + +A+++I TN+ LT + R +F
Sbjct: 456 GTTVKPLFTSSKLCTDVTLIQAYEYIQRMRADMRGTNLLGALTWLYQQPMK-RMYPRQVF 514
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
++TDG+V + ++ + ++ G R + G+G L+ +A ++ G
Sbjct: 515 IITDGSVSNVAKVLELVRRNTCAG-----RYFGLGLGPRACRRLLQGIAKLTGGLTEFLD 569
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDT-LKDLDEFEMYPSRIPDLSSESPLI 541
D + ++ ++ K + F VL ++ ID L + E + P+ IP L S LI
Sbjct: 570 DDERLQPKLIKSLKKAFEPVLTDVRIDWYLPENTEALLSPNEIPPLYSGDCLI 622
>gi|242091441|ref|XP_002441553.1| hypothetical protein SORBIDRAFT_09g029215 [Sorghum bicolor]
gi|241946838|gb|EES19983.1| hypothetical protein SORBIDRAFT_09g029215 [Sorghum bicolor]
Length = 105
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 55/84 (65%), Gaps = 5/84 (5%)
Query: 281 FSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL--PGTAKSRKVFKKDVIFIVDISGS 338
FS V + GVL+Q +LHD D+REM ++L+ P S F+K V+FIVD SGS
Sbjct: 20 FSNDVYSGNFCHGVLVQPATLHDCDEREMSYIFLVLHPTYLNS---FRKAVVFIVDTSGS 76
Query: 339 MQGKPLEDTKNALAVALSKLDPGD 362
MQGKPLE+ K+A++ ALS+ GD
Sbjct: 77 MQGKPLENVKHAVSTALSEFVQGD 100
>gi|397471231|ref|XP_003807201.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6 [Pan
paniscus]
Length = 1312
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 127/284 (44%), Gaps = 32/284 (11%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V I G V ++D + F Y P + +K+V+F++D+S
Sbjct: 245 DFLVQYDVVMEDIIGDV-----QIYD----DYFIHYFAP---RGLPPMEKNVVFVIDVSS 292
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWIG 396
SM G ++ TK A+ V LS L D FNI++F+ ++ + AT + V A ++
Sbjct: 293 SMFGTKMKQTKMAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLH 352
Query: 397 INFIAGGSTNICAPLTKAVEMLTNSR---------GSIPIIFLVTDG----AVEDERQIC 443
A G T+I + L A +L +S G IP+I +TDG V I
Sbjct: 353 C-MEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTAGVTTPSVIL 411
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
++ L + S +++ G + LR L++ +RG Y+ +Q++ L+
Sbjct: 412 SNVRQALGHRVS----LFSLAFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYE 467
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+LA++ ++ L L + + P+ S L+V+G+ Q
Sbjct: 468 EISMPLLADVRLNYLGGLVGASPW-AVFPNYFGGSELVVAGQVQ 510
>gi|380095997|emb|CCC06044.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1052
Score = 72.4 bits (176), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 96/199 (48%), Gaps = 6/199 (3%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELAT 384
+ +++F+ D SGSM G +E K+AL + L + G FNI +F +LF
Sbjct: 296 RPEIVFVCDRSGSMGGTRIEGLKSALRIFLKSIPVGAKFNICSFGSRHEFLFPEGSRSYD 355
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+E ++RA ++ + G T + PL A E + +FL+TDG + D+ Q+
Sbjct: 356 QETLQRAMEYTNLMRADFGGTEMYRPLEAAFE--KRYKDMDLEVFLLTDGEIWDQGQLFT 413
Query: 445 AMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
+++ G+I R++T GIG+ +H + +A G+ + D + + ++ ++
Sbjct: 414 MTNEKVSESKGAI--RLFTLGIGNDVSHALIEGVARAGNGFAQSVTDGEKMNAKVVRMLK 471
Query: 504 RGFSSVLANIAIDTLKDLD 522
G + + + ++ D D
Sbjct: 472 AGLTPHIKDYTLEIKYDKD 490
>gi|373849281|ref|ZP_09592082.1| Vault protein inter-alpha-trypsin domain-containing protein
[Opitutaceae bacterium TAV5]
gi|372475446|gb|EHP35455.1| Vault protein inter-alpha-trypsin domain-containing protein
[Opitutaceae bacterium TAV5]
Length = 693
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/340 (26%), Positives = 148/340 (43%), Gaps = 36/340 (10%)
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
D+ F M + PG D +++D+SGSMQGK + N ++ AL ++ D F
Sbjct: 282 DKPGTFMMVVTPGIDLQPITRGADYTYVLDVSGSMQGK-IATLANGVSRALGQMRETDRF 340
Query: 365 NIVAFN--GETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
IV FN G L T AT+ VER + + + G T+I A L A+ L R
Sbjct: 341 RIVTFNNSGREVLGWTP---ATRANVERGVELVK-SLTPNGGTDIHAGLKLALRDLDADR 396
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
S I LVTDG A + L R + F +G+ N +R +A +
Sbjct: 397 AS--SIVLVTDGVANQGVVDPKAFHALLKK---YDIRFFGFLMGNSANWPLMRTMADATG 451
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP--- 539
G+Y + D I Q+ L + + IA ++L D F +R+ + + + P
Sbjct: 452 GFYAGVSNDDDIVGQI--LLAK------SKIAFESLHDA-SFRFKGARVTETTGDYPQKI 502
Query: 540 -----LIVSGRYQ--GKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP-LDRICAKQQID 590
L++ GRY G+ LKAK G+ + + D P ++R+ A QI+
Sbjct: 503 YRGQQLVIFGRYDSAGEATVQLKAK-LTGEDKTYTTTFNFPETDTDNPEIERLWALAQIE 561
Query: 591 LLTAQAWFSEDKRLEEK--VSKMSVQTGVLCEYTRMIIVE 628
L + E K ++ + V+ ++ ++T M++++
Sbjct: 562 QLELLESIGQLPPAESKDAITHLGVEYQLVTDHTAMVVLD 601
>gi|373458928|ref|ZP_09550695.1| Vault protein inter-alpha-trypsin domain-containing protein
[Caldithrix abyssi DSM 13497]
gi|371720592|gb|EHO42363.1| Vault protein inter-alpha-trypsin domain-containing protein
[Caldithrix abyssi DSM 13497]
Length = 822
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/412 (21%), Positives = 162/412 (39%), Gaps = 57/412 (13%)
Query: 142 ISGKSYHTQLIALGENDGAGKSASVE--TGSFL---KPNIFTLTLPQIDGGSYLSIRLRW 196
++GKS H+Q++ L E A +E + +++ + NIFTL + + L I+++
Sbjct: 73 LNGKSIHSQILPLDEAQKRYDDAVIEGDSATYIQQHRSNIFTLNIGNLAPKDDLIIQIKL 132
Query: 197 SQKLSYRDGEFSVNVP------------------FKFPE---------YVTPAIK--KIP 227
Q LS + +P F + E ++TP + +P
Sbjct: 133 LQLLSIEGKTIRLTLPTVVGPRYIPGHPIGERSGFGWAEPTDRVPDADWITPPVSFDGVP 192
Query: 228 KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI--DFDFSYTV 285
++V+ E + SH QLR G+ S N+ D DF +
Sbjct: 193 YLVDFQIDVSQNLKVES-VESPSH---QLRFFPSAEGFKITS----LGNVAPDRDFVLNL 244
Query: 286 SPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLE 345
+ + +L ++ D + +F +L + + D F++D SGSM LE
Sbjct: 245 KLAELPTNLLWKT----KFDSKNIFLSWLNVPEVEEAEHLPTDYFFLIDRSGSMASTKLE 300
Query: 346 DTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWI-GINFIAGGS 404
K A+ + K P D F + AF+ + + + + + A W+ IN A G
Sbjct: 301 VVKKAVRLCFRKFMPEDRFALAAFDHNFIFWKNDWQQVSNKNITEAEWWLKKIN--ANGG 358
Query: 405 TNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFG 464
T + L K M N + ++ L+TDG V DE +I D N ++ FG
Sbjct: 359 TELLPALQKFFSMNFNPERRV-VLILLTDGEVGDEAEIADLFDQAPENL-----KVLLFG 412
Query: 465 IGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
I + N + + G Y + +E ++ F R ++ N+ ++
Sbjct: 413 IDTAVNQELFEAIIEKTPGLVEYIYPGEPMEQKINLQFERIEFPLIKNVMVN 464
>gi|410924377|ref|XP_003975658.1| PREDICTED: von Willebrand factor A domain-containing protein
5B2-like [Takifugu rubripes]
Length = 1415
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 110/235 (46%), Gaps = 12/235 (5%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC LL ++ K ++++F+VD SGSM G ++ K A+ +AL L PG NIV F
Sbjct: 377 FCPNLLCEPSELHKA-TRELLFLVDRSGSMSGTKIQSVKEAMVIALKSLPPGTKLNIVGF 435
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE--MLTNSRGSIPI 427
TS +++T + +A +++ TN+ L+ + ML R
Sbjct: 436 GTTIKPLFTSSKMSTDVTILQACEYLQRMRADMRGTNLLGALSWVYQQPML---RSYPRQ 492
Query: 428 IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA 487
+F++TDG++ + ++ + ++ G R + G+G L+ +A ++ G
Sbjct: 493 VFIITDGSISNVAKVLELVRRNAWAG-----RCFGLGLGPRSCRRLLQGVANLTGGTTEY 547
Query: 488 AYDLDSIEIQMQKLFTRGFSSVLANIAIDT-LKDLDEFEMYPSRIPDLSSESPLI 541
D + + ++ K + F VL ++ ID L + E + P+ IP L + LI
Sbjct: 548 LDDEERLRPKLIKSLKKAFEPVLTDVRIDWYLPENMEALLSPNEIPPLYPGNRLI 602
>gi|296481520|tpg|DAA23635.1| TPA: inter-alpha globulin inhibitor H2 polypeptide [Bos taurus]
Length = 946
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTEDHFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A +I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKNYIE-KIQPSGGTNINEALLRAIFILNEANNLGMLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y +Q++
Sbjct: 428 ELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|149390871|gb|ABR25453.1| von willebrand factor type a domain containing protein [Oryza
sativa Indica Group]
Length = 82
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 53/82 (64%), Gaps = 2/82 (2%)
Query: 676 ATAENFPPGSEIPKLPEVAEIFVKAASNCCSTLGNKCCCLCCIECCNKLNDQCVIVLTQL 735
AT EN G K PE E+F KA CCS L + CCC+C I C+K+ND+C IV+ QL
Sbjct: 3 ATRENLSAGFGDTKPPERFEMFDKAV-GCCSRLTDCCCCMCFINTCSKMNDRCAIVMVQL 61
Query: 736 CTALACFGCFECCSNLCCCGQD 757
C AL+C CFECCS LCC G D
Sbjct: 62 CGALSCLACFECCS-LCCGGSD 82
>gi|148238273|ref|NP_001091485.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Bos taurus]
gi|146186952|gb|AAI40657.1| ITIH2 protein [Bos taurus]
Length = 946
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTEDHFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A +I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKNYIE-KIQPSGGTNINEALLRAIFILNEANNLGMLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y +Q++
Sbjct: 428 ELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644]
gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644]
Length = 764
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/366 (21%), Positives = 156/366 (42%), Gaps = 20/366 (5%)
Query: 187 GSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLC 246
G+ +I L++S L G PF + + I K+ R ++ L T
Sbjct: 151 GADRTITLKYSTLLPRTFGAVDFTFPFGGRGFTSKPIGKL--RIEVDLRTTNDLKT---I 205
Query: 247 NTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQ 306
++ SH + R+ +++ E + DF + + G ++L P + D
Sbjct: 206 HSPSHDVAIDRKGNRDAMVTFQRESF-LPDQDFRLLFNEGEGAL-GAMVLSHPPTANEDG 263
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
+ T ++ K V+ I+D SGSM GK +E + A+ + L+ D FN+
Sbjct: 264 TLLLLASPTIETTPNKTPPAKTVVLILDRSGSMSGKKIEQARAAMKFVVENLNQDDLFNL 323
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
+ ++ +F + E A ++I G+ GGST+I L ++++ +
Sbjct: 324 ILYDDTVEMFKPELLRCNAENRAEALRFIEGVR--PGGSTDIDQGLRAGLKLIADESRPN 381
Query: 426 PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+IFL TDG +E +I +A ++ + +++ FG+G N L L+ +
Sbjct: 382 YVIFL-TDGLPTSGETNELKIAEAARA----ANPLKAKLFVFGVGYDVNARLLDRLSGEN 436
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
G D++E+ + + + R + L +I + T + ++ P ++PDL L+
Sbjct: 437 GGVSFYVKPDDNLEVAVSRFYERISTPALTDITV-TFEGVEINRAMPRQLPDLFRGGQLV 495
Query: 542 VSGRYQ 547
RY+
Sbjct: 496 YVARYR 501
>gi|156399867|ref|XP_001638722.1| predicted protein [Nematostella vectensis]
gi|156225845|gb|EDO46659.1| predicted protein [Nematostella vectensis]
Length = 930
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/401 (22%), Positives = 162/401 (40%), Gaps = 55/401 (13%)
Query: 111 RVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK----SASV 166
+ +C + + P+ D ++ G EA I+GK + + E + A K + S
Sbjct: 426 QAYCNNSDVPIEAKYVFPLDDMAAVCGFEAFINGKHI---IGEVKEKEQARKEYREAISA 482
Query: 167 ETGSFL----KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA 222
G++L P++FT+ + + + + I++ + +L+ +GE + F+ P V P
Sbjct: 483 GHGAYLMDEETPDVFTVNVGNLPPAASVLIKITYVAELTV-EGEL---ICFRLPGSVAPW 538
Query: 223 IKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFS 282
+R + H +V T + +C + + ++ + + + ++ +
Sbjct: 539 -----QRTQAHDDVTQ-TDVDTICVDEENQSNTSVQVSVEMPFEIRAILCPGHDVKVKRT 592
Query: 283 YT-----VSPSHIFG-GVLLQ-------SPSL----HDVDQREMFCMYLLPGTAKSRKVF 325
T ++P G G LQ P + HD + CM +S
Sbjct: 593 STKAVVEMAPGGTLGSGFQLQIQLAEIHVPRMWVERHDTEADSQACMLAFYPEFESGPPH 652
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+VIF++D S SM+GK L++ K + LS ++ +FNIV F T + K
Sbjct: 653 HVEVIFVLDASCSMKGKALQEAKKLTLMCLSLMEEEWAFNIVVFGSNYSELFTQSQKKDK 712
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVEDERQICD 444
E V RA +++ GST++ L + NS P +FL TDG V +E
Sbjct: 713 ETVARAAKFVKSVKAVKGSTDLWRVLRSLYLLRCNSTADYPSNVFLFTDGHVTEESTTL- 771
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
Y I CN +FLR +A + G Y
Sbjct: 772 ---------------AYIRDIRPTCNRHFLRSMARVGAGAY 797
>gi|426396070|ref|XP_004064278.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6 [Gorilla
gorilla gorilla]
Length = 1169
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 116/254 (45%), Gaps = 23/254 (9%)
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
+ F Y P + KK+V+F++D+S SM G +E TK A+ V LS L D FNI+
Sbjct: 122 DYFIHYFAP---RGLPPMKKNVVFVIDVSSSMFGTKMEQTKMAMNVILSDLQANDYFNII 178
Query: 368 AFNGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR---- 422
+F+ ++ + AT + V A ++ A G T+I + L A +L +S
Sbjct: 179 SFSDTVNVWKAGGSIQATIQNVHSAKDYLHC-MEADGWTDINSALLAAASVLNHSNQEPG 237
Query: 423 -----GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
G IP+I +TDG V I ++ L + S +++ G +
Sbjct: 238 RGPSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQALGHRVS----LFSLAFGDDADFTL 293
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
L L++ +RG Y+ +Q++ L+ +LA++ ++ L L + + P+
Sbjct: 294 LLCLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLADVHLNYLGGLVGASPW-AVFPN 352
Query: 534 LSSESPLIVSGRYQ 547
S L+V+G+ Q
Sbjct: 353 YFGGSELVVAGQVQ 366
>gi|145516504|ref|XP_001444144.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411548|emb|CAK76747.1| unnamed protein product [Paramecium tetraurelia]
Length = 829
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 110/246 (44%), Gaps = 24/246 (9%)
Query: 299 PSLHDVDQREMFCMYLLPG--TAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALS 356
P ++V + + YL G A ++K+ + + +FI+D SGSM G ++ K AL + L
Sbjct: 296 PKFNEVSLDDAYSQYL-DGLIIADNQKIKRGNYLFIIDRSGSMSGSRIKKAKEALILFLK 354
Query: 357 KLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
L +NI++F T L++ S + + +E A + + G T I APL + +
Sbjct: 355 SLPQDSEYNIISFGTNFTKLWNVSQNYS-QNTLETAIKHVEEMDADMGGTCIIAPLKQMI 413
Query: 416 --EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
+ S+ + +FL+TDG + D + + RIYT GIG C+ Y
Sbjct: 414 YHKNYGASKNTTLNVFLLTDG-----QDTADPIIDLVQKNNLAQTRIYTLGIGRECSQYL 468
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPD 533
+R +A + G Y D + I ++ L + L ++ S IP+
Sbjct: 469 IRRVAEVGNGKYQIVSDKEDINEKVIDLLEDSLTPYLEAFTLE------------SNIPN 516
Query: 534 LSSESP 539
++S P
Sbjct: 517 ITSIIP 522
>gi|348544635|ref|XP_003459786.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Oreochromis niloticus]
Length = 806
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 91/191 (47%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ +E T+ AL L+ L D F ++ F+G + + + A +
Sbjct: 267 KNVVFVIDQSGSMHGRKIEQTRTALVHILNDLAEDDYFGLITFDGNIFHWKRELVQANSK 326
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRGSIPIIFLVTDGAVEDERQICD 444
+E A + N GST+I + + ML N GS I+ L+TDG +
Sbjct: 327 NLESAKNF-ARNIQDRGSTDINEAVLQGARMLNAHNREGSASILILLTDGDPTSGVTNLE 385
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G + FL +++ + G Y+ ++Q++ +
Sbjct: 386 KIQSNVRQAIAGKFPLYCLGFGFDVDFNFLEKMSLQNNGVARRIYEDSDADLQLKGFYEE 445
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 446 VATPLLTDVTM 456
>gi|440906272|gb|ELR56554.1| Inter-alpha-trypsin inhibitor heavy chain H2, partial [Bos
grunniens mutus]
Length = 951
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 313 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTEDHFSVVDFNHNVRTWRNDLVSATKT 372
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A +I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 373 QVADAKNYIE-KIQPSGGTNINEALLRAIFILNEANNLGMLDPNSVSLIILVSDGDPTVG 431
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y +Q++
Sbjct: 432 ELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLK 491
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 492 KFYNQVSTPLLRNVQFN 508
>gi|47228304|emb|CAG07699.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1434
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 106/233 (45%), Gaps = 8/233 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC LL G ++++F+VD SGSM G ++ K A+ +AL L PG NIV F
Sbjct: 399 FCPDLL-GEPLELHRATRELLFLVDRSGSMSGTKIQSVKEAMVIALKSLPPGTKLNIVGF 457
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
TS +L+T + +A +++ TN+ L+ + R +F
Sbjct: 458 GTTIKPLFTSSKLSTDVTILQACEYLQRMRADMKGTNLLGALSWVYQQPMQ-RSYPRQVF 516
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
++TDG V + ++ + ++ G R + G+G L+ +A ++ G
Sbjct: 517 IITDGCVSNVAKVLELVRRNACAG-----RCFGLGLGPKACRRLLQGVAKLTGGTAEYLD 571
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDT-LKDLDEFEMYPSRIPDLSSESPLI 541
D + ++ ++ K + F VL ++ +D L + E + P+ IP L + LI
Sbjct: 572 DEERLQPKVIKSLKKAFEPVLTDVRVDWYLPENMEALLSPNDIPPLYPGNRLI 624
>gi|12855146|dbj|BAB30227.1| unnamed protein product [Mus musculus]
Length = 1209
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + IF++D S SM ++ K A+ VAL L P FNI+ F
Sbjct: 336 FCPDLQSVQPNPRKAHG-EFIFLIDRSNSMSKTNIQCIKEAMLVALKSLMPACFFNIIGF 394
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S + +E + A I G TN+ +PL K V RG ++F
Sbjct: 395 GSTFKAVFASSRIYNEENLTMACDCIQRMQADMGGTNMLSPL-KWVLRQPLRRGHPRLLF 453
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDG+V + ++ + +++ ++ R Y+FGIG + ++ LA +S+G
Sbjct: 454 LITDGSVNNTGKVLELVRNHASST-----RCYSFGIGPTVCYRLVKGLASVSKGSAEFLM 508
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 509 EGERLQPKMVKSLKKAMAPVLSDVTVE 535
>gi|198278486|ref|NP_083677.1| von Willebrand factor A domain-containing protein 5B1 precursor
[Mus musculus]
gi|172048428|sp|A9Z1V5.1|VW5B1_MOUSE RecName: Full=von Willebrand factor A domain-containing protein
5B1; Flags: Precursor
gi|130267304|gb|AAI32478.2| Vwa5b1 protein [Mus musculus]
gi|148681323|gb|EDL13270.1| mCG14536 [Mus musculus]
gi|187950813|gb|AAI37852.1| Von Willebrand factor A domain containing 5B1 [Mus musculus]
Length = 1215
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + IF++D S SM ++ K A+ VAL L P FNI+ F
Sbjct: 336 FCPDLQSVQPNPRKAHG-EFIFLIDRSNSMSKTNIQCIKEAMLVALKSLMPACFFNIIGF 394
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S + +E + A I G TN+ +PL K V RG ++F
Sbjct: 395 GSTFKAVFASSRIYNEENLTMACDCIQRMQADMGGTNMLSPL-KWVLRQPLRRGHPRLLF 453
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDG+V + ++ + +++ ++ R Y+FGIG + ++ LA +S+G
Sbjct: 454 LITDGSVNNTGKVLELVRNHASST-----RCYSFGIGPTVCYRLVKGLASVSKGSAEFLM 508
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 509 EGERLQPKMVKSLKKAMAPVLSDVTVE 535
>gi|351696804|gb|EHA99722.1| Inter-alpha-trypsin inhibitor heavy chain H5 [Heterocephalus
glaber]
Length = 948
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 125/285 (43%), Gaps = 29/285 (10%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L+ T++AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLQQTRDALTTILHDLRPQDHFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T V +I + +GG T+I L A+ +L N
Sbjct: 337 SNRIKVWKDHLVPVTPSNVIDGKFYIHLMSPSGG-TDINGALQAAIRLLNNYVSHNDIED 395
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNG------GSICPRIYTFGIGSYCNHYFLRM 476
S+ ++ +TDG + + + ++ N G IC I+T GIG+ + L
Sbjct: 396 RSVSLVIFLTDG----KPTVGETHGPKILNNTKEAAQGQIC--IFTVGIGNDVDFKLLEK 449
Query: 477 LAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
L++ + G ++ + Q+ + + +L++I ID L E + P+ +
Sbjct: 450 LSLDNCGLTRRVHEEEDAGAQLIGFYDEIRTPLLSDIRIDYPSGLVE-HATKTVFPNYFN 508
Query: 537 ESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
S ++++G+ K D L + + FV+ L D+P++
Sbjct: 509 GSEIVIAGKLVDKKTDQLHVEVTASNSKKFVI-----LKTDVPVE 548
>gi|432089723|gb|ELK23540.1| Inter-alpha-trypsin inhibitor heavy chain H5 [Myotis davidii]
Length = 958
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 125/282 (44%), Gaps = 22/282 (7%)
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
E F Y P K K+V+F++D S SM G L TK+AL L L P D FNI+
Sbjct: 296 EYFVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNII 352
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML-----TNSR 422
F+ ++ + T + +I + G T+I L + + +L N
Sbjct: 353 GFSNRIKVWKDHLVSVTPNNIRDGKVYIH-HMSPTGGTDINGALQRGIRLLNDYVANNDI 411
Query: 423 G--SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
G S+ +I +TDG V + + ++ G IC I+T GIG+ + L L+
Sbjct: 412 GDRSVSLIIFLTDGKPTVGETHTLKILNNTKEAARGQIC--IFTIGIGNDVDFKLLEKLS 469
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ + G ++ + Q+ + + +L++I ID + E ++ + P+ + S
Sbjct: 470 LENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIHIDYPPSVVE-QVTRTLFPNYFNGS 528
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+I++G+ K D L + + S FV+ L D+P+
Sbjct: 529 EIIIAGKLVDKKIDRLHVEVRASN-SKFVI-----LKTDVPV 564
>gi|348539812|ref|XP_003457383.1| PREDICTED: poly [ADP-ribose] polymerase 4 [Oreochromis niloticus]
Length = 1608
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 138/611 (22%), Positives = 241/611 (39%), Gaps = 114/611 (18%)
Query: 116 MGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKS--ASVETG--SF 171
+ S + + P+ D ++ G EA I+GK Q+ E + A K ++E G ++
Sbjct: 608 LSSVPIEAKYVFPLDDSAAVCGFEAFINGKHVVGQV---KEKEKARKEYKQAIEKGHGAY 664
Query: 172 L----KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP 227
L P++FT+++ + G+ + I++ + +L RDG + F P V P
Sbjct: 665 LMDQDAPDVFTISVGNLPPGATVLIKVTFVSELIVRDG----TILFSLPGSVAPW----- 715
Query: 228 KREKIHLNVNAGTGTEVLCNT-SSHHLKQLRRDV-GKLGYSYES-----EVLKWSNIDFD 280
+E LN E +C T + +++ D+ ++ Y S +K D
Sbjct: 716 -QESAALNQTTQVTVEKVCVTDEAASVREFTLDLSAEMPYEISSLQCITHQIKMKKTDCK 774
Query: 281 FSYTVSPSHIFGGVLLQ-SPSLHDVDQREMF-----------CMYLL-PGTAKSRKVFKK 327
+V P I G Q S L DV M+ CM + P S
Sbjct: 775 AVVSVLPGQIMGPDGFQLSIMLSDVHLPRMWVEKHPDKDSQACMLVFYPDFEVSSSSDAD 834
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKE 386
+VI ++D S SM+G+ L + L LD N+V F + T F T M L E
Sbjct: 835 EVILLLDTSESMKGESLRSARRIALQILKSLDRSLRLNVVLFGTDHTEAFLTVMPLG--E 892
Query: 387 AVERAHQWI------------------------------GINFIA------------GGS 404
E A ++I G ++ GG
Sbjct: 893 VYEEAQRFIKPDNPHALPPALILKAFQTLHGLLKLGLESGNTWLICSRDIFQHSAPLGGG 952
Query: 405 TNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFG 464
T + PL +A+ +L SRG I + L++DG +++ + +L + R++T G
Sbjct: 953 TEMWRPL-RALSLLPPSRG-IRNLLLLSDGHIQNTE-----LTLKLLRDNAQHSRLFTCG 1005
Query: 465 IGSYCNHYFLRMLAMISRGYYGAAYDL-------DSIEIQMQKLFTRGFSSVLANIAIDT 517
+ N + LR LA G Y +D + + Q++++ + G SSV ++
Sbjct: 1006 LSPMANRHMLRALAQAGGGAY-EFFDTKTKHNWAEKVACQLKRMASSGCSSV--SVKWQQ 1062
Query: 518 LKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD 577
P ++ L ++ +V G TL +L V +LQ K
Sbjct: 1063 FSPTSPPLQAPKQLHALFNDCHTLVYGFVPHCTQATLLGNLRGQELKTMVSTSELQKTKG 1122
Query: 578 IPLDRICAKQQI------DLLTAQAWFSEDKRLEEK--VSKMSVQTGVLCEYTRMIIVET 629
L ++ A+ I +L T++A E K+ E K + ++S + +L ++T + +E
Sbjct: 1123 TFLHKLTARALIRDYEDGNLDTSEAE-HEGKKAELKSFIIELSKEFSILSQFTSFVAIE- 1180
Query: 630 DERNNASESPG 640
ER++ PG
Sbjct: 1181 -ERDSEQPQPG 1190
>gi|168008926|ref|XP_001757157.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691655|gb|EDQ78016.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1068
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 101/449 (22%), Positives = 177/449 (39%), Gaps = 84/449 (18%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE------TGSFL- 172
+ + P+ + +++ EAE+ G+ ++I A K E T S L
Sbjct: 53 AVEAAYTFPLYEGAAVVKFEAEVDGR----KIIGKVREKSAAKKEYQEAVSAGKTASLLD 108
Query: 173 --KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKI---- 226
KP++F ++ I G +SIR+ ++ GE V F P V P I
Sbjct: 109 QEKPDVFQTSIGNIPPGKTISIRITLVSEIKQDAGENQVR--FVLPTTVAPRYGAIGFFG 166
Query: 227 -------PKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDF 279
K+ +N++ + S H ++ LG +
Sbjct: 167 NPITPPPDSSAKLSINMSCAMSKPITSVQSPSHTIEVH-----LGTTD------------ 209
Query: 280 DFSYTVSPSHIF------GGVLLQSPSLHDVD------------QREMFCMYLLPGTAKS 321
TV P+HI V L SL D D R + + GT
Sbjct: 210 ----TVDPAHISTFNPNQARVTLTVDSLLDCDLVIVINSLGLDQPRALVERHPSHGTHAI 265
Query: 322 RKVFK----------KDVIFIVDISGSMQGKPLEDTKNALAVALSKLD-PGDSFNIVAF- 369
F+ ++IF+VD SGSM G ++ AL + L + FNIV F
Sbjct: 266 ALTFQPRFALQPLRTSEMIFLVDRSGSMMGTQIKQAGEALELFLRSIPFENHYFNIVGFG 325
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-I 428
+ +LF TS+E T++++++A + G T I E+ R ++P I
Sbjct: 326 SNHNFLFPTSVEY-TEDSLKKAVHYAQTIQANMGGTEIANAF---FEVFQRRRRNVPTQI 381
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGG-SICP-RIYTFGIGSYCNHYFLRMLAMISRGYYG 486
FL+TDG V D Q+ ++ + +G + P R++T G+G+ +H+ + +A GY
Sbjct: 382 FLLTDGMVWDAEQLTKSIIEAVDDGARNNSPVRVFTLGVGNAVSHHLIESVARAGGGYAQ 441
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
+ + +E ++ + G + + N ++
Sbjct: 442 LVLENERMEKKVLNMLKAGLTPSVTNASV 470
>gi|260827623|ref|XP_002608764.1| hypothetical protein BRAFLDRAFT_73983 [Branchiostoma floridae]
gi|229294116|gb|EEN64774.1| hypothetical protein BRAFLDRAFT_73983 [Branchiostoma floridae]
Length = 2524
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 226/563 (40%), Gaps = 81/563 (14%)
Query: 120 SCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KP 174
+ + + P+ D +++G EA I+GK ++ E + A E G++L P
Sbjct: 579 AIEAKYVFPLDDMAAVVGFEAFINGKHVVGEVKEKEEAHREYRQAISEGHGAYLMDEETP 638
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL 234
++FT+++ + + + I++ + +L+ +GE N+ F+ P V P +K EKI
Sbjct: 639 DVFTVSVGNLPPRASVLIKITYVAELAV-EGE---NICFRLPGSVAP-WQKDSLSEKIQK 693
Query: 235 NVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSH----- 289
++ H K+ V + FD SPSH
Sbjct: 694 DLET-------VKVEKHAGKEFSLQVAM-------------EMPFDIRTIQSPSHKIRVK 733
Query: 290 ---------------IFGGVLLQS--PSLH--------DVDQREMFCMYLLPGTAKSRKV 324
+ G LLQ +H D+ CM ++ +
Sbjct: 734 RTASKAVVELEKNCMLGAGFLLQVGLAEIHVPRMWAERHPDKDSQACMLTFYPEFQAEVM 793
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELA 383
+VI ++D S SM+G LE K + L L SFN+VAF G LF+ S +
Sbjct: 794 QGHEVILLLDGSNSMRGSALEAAKKVALLCLCHLPKECSFNVVAFGTGYEELFAVS-QSR 852
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
TK V +A +I + G+T+ PL + +L S G +FL++DG V + Q
Sbjct: 853 TKSNVSKAETFIQNLKASKGNTDAWRPL-RGFFLLPPS-GKTRNVFLISDGHVNNPDQTL 910
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD--LDSIEIQMQKL 501
+ R+++ G+GS N + LR LA + G + D E +++
Sbjct: 911 RDIHQHYQE-----TRVFSCGVGSTSNKHLLRALARVGGGAFEYFDDKTKSKWEKKVKSQ 965
Query: 502 FTRGFSSVLANIAIDTLKDLDEFE---MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
+ L ++++D + D+ P++I L + S +V G TLKA
Sbjct: 966 LWKAAQPGLTSVSVDWQQWDDDAPPPVQAPNQIVSLFNGSRQVVYGYVPHCTQATLKAVI 1025
Query: 559 FLGDLSNFVVELKLQLAKDIPLDRICAKQQI-----DLLTAQAWFSEDKRLEEK--VSKM 611
++S V +L + L ++ A+ I L E K++ K + +
Sbjct: 1026 NNREISTMVSTTELSITTGKILHQLTARAVIRDWEDGTLHTDRMHHETKKMGLKPYIIDL 1085
Query: 612 SVQTGVLCEYTRMIIVETDERNN 634
S + ++ ++T + VE E++
Sbjct: 1086 SKEYSIVTQFTSFVAVEKREKDE 1108
>gi|34980984|gb|AAH57376.1| Vwa5b1 protein [Mus musculus]
Length = 1202
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 98/207 (47%), Gaps = 7/207 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC L RK + IF++D S SM ++ K A+ VAL L P FNI+ F
Sbjct: 330 FCPDLQSVQPNPRKAHG-EFIFLIDRSNSMSKTNIQCIKEAMLVALKSLMPACFFNIIGF 388
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
S + +E + A I G TN+ +PL K V RG ++F
Sbjct: 389 GSTFKAVFASSRIYNEENLTMACDCIQRMQADMGGTNMLSPL-KWVLRQPLRRGHPRLLF 447
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
L+TDG+V + ++ + +++ ++ R Y+FGIG + ++ LA +S+G
Sbjct: 448 LITDGSVNNTGKVLELVRNHASST-----RCYSFGIGPTVCYRLVKGLASVSKGSAEFLM 502
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAID 516
+ + ++ +M K + + VL+++ ++
Sbjct: 503 EGERLQPKMVKSLKKAMAPVLSDVTVE 529
>gi|392590138|gb|EIW79467.1| hypothetical protein CONPUDRAFT_154882 [Coniophora puteana
RWD-64-598 SS2]
Length = 1026
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 156/378 (41%), Gaps = 45/378 (11%)
Query: 171 FLKPNIFTLTLPQIDGGSYLSIRLRWSQKL---SYRDGEFSVNVPFKFPEYVTPAIKKI- 226
++ ++FT+++ + G + RL + L RD + +P PA I
Sbjct: 120 WVTDDVFTISVGSVPSGMSVVTRLVFVMDLLDEGIRD-HIRLQLPTAVAPRYGPAPAAIL 178
Query: 227 --------PKREKIHLNVNAGTGTEVLC-NTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
P ++ + V+ T E+L + +H + LR S W++
Sbjct: 179 DASSASASPGETRLTIRVDVQTSEEILALRSPTHPITLLRYKTRSGRRSTRRMSASWASP 238
Query: 278 DF---DFSYTVSPSHIFGGVLLQSP----SLHDVDQREMFCMYL-----LPGTAKSRKVF 325
+F DF ++ H G L +P S H D M+L G+A R V
Sbjct: 239 EFLAGDFVLSI---HARG---LDAPRCFASRHPTDAHGAVAMHLTLVPTFSGSAP-RAVR 291
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS-FNIVAFNGETYLFSTSMELAT 384
++ +F+VD SGSM G P+E K LAV L L ++ FN+V+F +
Sbjct: 292 AQEYVFVVDRSGSMGGAPMETAKRTLAVLLRALPARETRFNVVSFGSHVDGLWPQSVVYA 351
Query: 385 KEAVER--AH-QWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
+E + R AH + +G ++ G T + L A++ SR + +F +TDG V D +
Sbjct: 352 EETLGRGIAHVESMGADY---GGTEMANALRFALDRRDTSRPT--AVFFLTDGGVTDVQG 406
Query: 442 ICDAMKSRLTNGGSI--CP-RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
D +K+ + S P R++ GIG + +A G A + I +
Sbjct: 407 PADVVKAAVQASASTPAAPLRVFVLGIGPEVSSDVCERIARAGEGECLFALHAEEITGRC 466
Query: 499 QKLFTRGFSSVLANIAID 516
+L G + + ++ +D
Sbjct: 467 ARLLNAGRTRTVESVEVD 484
>gi|281210541|gb|EFA84707.1| type A von Willebrand factor domain-containing protein
[Polysphondylium pallidum PN500]
Length = 880
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 86/189 (45%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D SGSM G +E K L + + L FNI F T +L +
Sbjct: 301 EFIFLIDCSGSMSGSTIEKAKRCLEILMRSLTEKSKFNIWLFGSSFKSQFTESKLYNDQT 360
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+E+A +I G T + P+ + +S+ +F++TDG V ++ D +
Sbjct: 361 LEQASVFIKSIDADLGGTELLPPIQAIISQPVDSQYPRQ-VFILTDGEVSQRDELVDYVA 419
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
G+ RI+T GIGS + + L+ RGY+ D +E Q+ KL +
Sbjct: 420 KE---AGTT--RIFTLGIGSGVDKLLVEGLSKACRGYFEFIDDNSMMETQVMKLMSIAME 474
Query: 508 SVLANIAID 516
++ANI ++
Sbjct: 475 PIIANIQVN 483
>gi|440802448|gb|ELR23377.1| Poly(ADPribose) polymerase, putative [Acanthamoeba castellanii str.
Neff]
Length = 2693
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 144/340 (42%), Gaps = 69/340 (20%)
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
K+ + +++ I +VD S SM+G LED K A+ +L L G + N FN Y+F +
Sbjct: 950 KASRDEEREFILVVDCSSSMKGDTLEDMKRAVIASLDLLPKGTAHNY-RFN--IYVFGSG 1006
Query: 380 MELATKEAVERAHQWIGINFIAG--GSTNICAPLTKAVEMLTNSRGSIPI-------IFL 430
E + A G T++ APL +A+ +L+ + S +FL
Sbjct: 1007 FE----------------SLFAADLGGTDLFAPL-RALHLLSGAGASATAHTKPPRNVFL 1049
Query: 431 VTDGAVEDERQICDAM---KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY-- 485
+DG + DE + A+ + RL R++TFG G+ + + LR LA + G +
Sbjct: 1050 FSDGQINDEEHVAHAIAQHRQRL--------RLFTFGFGANASRHTLRTLARVGAGAHEF 1101
Query: 486 ---GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIV 542
GA + IE Q F R L+NI++ + + E RI E+ +
Sbjct: 1102 MEAGALPNRSKIERQ----FKRALQPSLSNISVKWITEHAE------RIQQAPKETSSLF 1151
Query: 543 SGRYQG--KFPDT-----LKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQ 595
+G + F D L A ++ V +L K L + AK I+
Sbjct: 1152 NGERKMVYSFIDNCTQAFLHAHNGEQEIETRVSTSELCFTKGHLLHTLVAKALINQFKDG 1211
Query: 596 AWFS-------EDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
+ + E +R+++ + +S + ++ E+T + VE
Sbjct: 1212 NYDANKTKHNHERRRIQQDIIDLSKRYSIITEFTSFVAVE 1251
>gi|432867207|ref|XP_004071078.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H6-like
[Oryzias latipes]
Length = 1123
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 94/401 (23%), Positives = 161/401 (40%), Gaps = 58/401 (14%)
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDI 335
N DF Y V + G + ++D F Y P + V KDVIF++DI
Sbjct: 214 NADFIIQYDVDLKDLMGDI-----QVYD----GYFVHYFAP---RGLPVVPKDVIFVIDI 261
Query: 336 SGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQW 394
SGSM G ++ TK A+ L L GD FNI+ F+ + + + AT++ V A ++
Sbjct: 262 SGSMIGNKIKQTKQAMTTILGDLREGDHFNIITFSDKVQTWKKGRTVRATRQNVREAQEF 321
Query: 395 IGINFIAGGSTNI-------CAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+ +A G TNI L + +P++ +TDG D +
Sbjct: 322 VR-RIMAEGWTNINAALLSAAQLLNPSSSSGRLLSSRVPLVIFLTDGEATIGVTSGDII- 379
Query: 448 SRLTNGGSICPRIYTFGI--GSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
L N FG+ G + L+ LA+ + G Y+ +Q++ +
Sbjct: 380 --LNNARKALGSASLFGLAFGDDADLLLLKRLALENGGVARMVYEDADAALQLKGFYDEV 437
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSN 565
S +L++I + L D F++ S P+ S L+V+GR + D LK DL
Sbjct: 438 ASPLLSDIQLSYLDD-QAFDVTRSMFPNYFQGSELVVAGRVRPGVQD-LKVSMSATDLKE 495
Query: 566 FVVELKLQLAKDI----------PLDRICAKQQIDLLTAQAWF----------------- 598
E+K++ D+ LD + ++I + W
Sbjct: 496 ---EVKMENDVDVSHEKGSGSEGSLDCLEDLERISSFVYRLWAYFTIRELSVAKLNTTDP 552
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESP 639
S + L EK + +S++ + +T +++V+ D + P
Sbjct: 553 SAQRLLTEKATNLSLKYNFVTPFTSLVVVKPDAEEPTTTPP 593
>gi|157818937|ref|NP_001101458.1| von Willebrand factor A domain-containing protein 5B1 [Rattus
norvegicus]
gi|149024385|gb|EDL80882.1| similar to hypothetical protein FLJ32784 (predicted) [Rattus
norvegicus]
Length = 1216
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+ IF++D S SM ++ K A+ VAL L P FN++ F S ++ +E
Sbjct: 353 EFIFLIDRSSSMSKTNIQCIKEAMLVALKSLMPACLFNVIGFGSTFKTVFASSQIYNEEN 412
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+ A + I G TNI +PL K V +G ++FL+TDG+V + ++ + ++
Sbjct: 413 LALACECIQRMRADMGGTNIFSPL-KWVLRQPLRQGHPRLLFLITDGSVNNTGKVLELVR 471
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFS 507
+ + SI R Y+FGIG + ++ LA +S+G + + ++ +M K + +
Sbjct: 472 N---HASSI--RCYSFGIGPTVCYRLVKGLASVSKGSAEFLMEGERLQPKMVKSLKKAMA 526
Query: 508 SVLANIAID 516
VL+++ ++
Sbjct: 527 PVLSDVTVE 535
>gi|432089722|gb|ELK23539.1| Inter-alpha-trypsin inhibitor heavy chain H2 [Myotis davidii]
Length = 924
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 96/223 (43%), Gaps = 34/223 (15%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + AT
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATTT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG--AVE 437
V A ++I G TNI L +A+ +L + S+ +I LV+DG VE
Sbjct: 369 QVADAKKYIE-KIQPNGGTNINEALLRAIFILNEANNMGMLDPNSVSLIILVSDGDPTVE 427
Query: 438 DERQICDAMKSRLTNGGSICPR------------------------IYTFGIGSYCNHYF 473
+ + S G + R +++ GIG ++ F
Sbjct: 428 EAPATTTVLASHEPGSGLLAERHSHCELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDF 487
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
L+ L++ +RG Y +Q++K + + + +L N+ +
Sbjct: 488 LKRLSIDNRGIAQRIYGNQDTSLQLKKFYNQVSTPMLRNVQFN 530
>gi|66801115|ref|XP_629483.1| type A von Willebrand factor domain-containing protein
[Dictyostelium discoideum AX4]
gi|74850896|sp|Q54CQ8.1|Y2740_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0292740
gi|60462845|gb|EAL61044.1| type A von Willebrand factor domain-containing protein
[Dictyostelium discoideum AX4]
Length = 910
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 93/198 (46%), Gaps = 7/198 (3%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELAT 384
K + IF++D SGSM G P++ + AL + + L+ FNI F +G F
Sbjct: 344 KGEFIFLIDCSGSMSGNPIDSARRALEIIIRSLNEQCKFNIYCFGSGFNKAFQEGSRKYD 403
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+++ ++++ G T + P+ + + IF++TDGAV D ++ +
Sbjct: 404 DDSLAVVNRYVSNISANLGGTELLQPIKDILSKEIDPEYPRQ-IFILTDGAVSDRSKLIE 462
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ S RI+T+GIGS + + L+ +GYY + +E ++ KL +
Sbjct: 463 -----FVSKESKTTRIFTYGIGSSVDVELVVGLSKACKGYYTLIRNSSDMETEVMKLLSI 517
Query: 505 GFSSVLANIAIDTLKDLD 522
F L+N++ D + LD
Sbjct: 518 AFEPTLSNVSFDWSQLLD 535
>gi|328870635|gb|EGG19008.1| type A von Willebrand factor domain-containing protein [Dictyostelium
fasciculatum]
Length = 2504
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 124/568 (21%), Positives = 246/568 (43%), Gaps = 75/568 (13%)
Query: 118 SKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL---- 172
SK + + P+ + G++ G EA I+GK ++ + + A E G++L
Sbjct: 751 SKPIEAKYVFPLDEMGAVCGFEAYINGKHIIGEVKEKEKAHKEYRQAIAEGHGAYLMDEE 810
Query: 173 KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKRE-- 230
KP++FT+++ + + + I++ + +LS DG +++ F P + PA +++ +
Sbjct: 811 KPDVFTVSVGNLPANAEVLIKITYVTELSV-DG---LDISFVLPASIAPAQRQLSGAQFT 866
Query: 231 -----------KIHLNVNAGTGTEVLCN----TSSHHLKQLRRDVGK--LGYSYESEVLK 273
N++ G E+ N S H + ++ K + + +S+
Sbjct: 867 QATTSTVQVSDDSQANLSVQVGLEMPYNIVKIVSPSHPIRFKKTHTKATVELTRDSKATD 926
Query: 274 WSNIDFDFSYTVSPSHIFGGVLLQSPSLH-DVDQR----EMFCMY----LLPGTAKSRKV 324
++ +F + ++ SP + +VD + M Y L G A +
Sbjct: 927 TLSLGKNFQLLIGLEDVY------SPRMWVEVDSKGHHASMLAFYPKLDLDTGIADADST 980
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD-SFNIVAFNGETYLFSTSMELA 383
D+ F+VD+S SM+G P +D L +A+ L + FNIV F G+ Y + + +A
Sbjct: 981 PPTDITFLVDLSASMRGDPFQDMLRTLRMAVQNLRGMNVRFNIVQF-GDIYDWLFAENVA 1039
Query: 384 TKEAVERAHQWIGINFI---AGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDE 439
+A + W IN + G T + PL + + +N+ + P I L++DG V +E
Sbjct: 1040 PTDANLQI-AWNHINTKLQPSYGGTQLLLPLESLILLSSNATPTRPNNIILLSDGNVSNE 1098
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
I +K S C RI+ FGIG + +F++ +A + G + + + +
Sbjct: 1099 STIQQLVK----RASSFC-RIFCFGIGENVSRHFIKSIARLGAGQ--SEFITPNKRPSAK 1151
Query: 500 KL---FTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQG---KFPDT 553
K+ R ++N+ + D+ E PS I + + ++ + G + T
Sbjct: 1152 KIIDQLQRCVQPAMSNVCV-KFDSTDKIEQSPSVITSIFRQQRQVIYA-FSGICTRATLT 1209
Query: 554 LKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS---- 609
+A G G L+N V ++ + + ++ A+ I +SE+K EK+
Sbjct: 1210 CQAPGG-GLLTNTVHTPEIGFIRGSLIHKLAARSMIREYN-DGSYSENKIQHEKIKLAKK 1267
Query: 610 ----KMSVQTGVLCEYTRMIIVETDERN 633
+S++ ++ ++T + VE E+N
Sbjct: 1268 SDVIDLSIRYSIVTQFTSFVAVEKREKN 1295
>gi|354465010|ref|XP_003494973.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Cricetulus griseus]
Length = 946
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/346 (22%), Positives = 148/346 (42%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILEDLRTEDQFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QIADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNLGMLNPDSVSLIVLVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNRDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + + S ++V+G+Y P L
Sbjct: 488 KFYNQVSTPLLRNVQFNYPQASVTDVTQNSFH-----NYFGGSEIVVAGKYD---PSKLA 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFS 599
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 540 EVQSIITATSANTELVLETLSQMDDLEDFLSKDKHADPNFTKKLWAYLTINQLLAERSLA 599
Query: 600 EDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
++ K++K MS+ ++ T M+I DER A P
Sbjct: 600 PTAAIKRKITKTILQMSLDHHIVTPLTAMVIENEAGDERMLADSPP 645
>gi|357624329|gb|EHJ75148.1| hypothetical protein KGM_21424 [Danaus plexippus]
Length = 1057
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 89/403 (22%), Positives = 168/403 (41%), Gaps = 87/403 (21%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F +L P S K V+F++D S SM G+ +E A+ LS L+P D F+IV F
Sbjct: 440 FVHFLAPS---SLPPLNKYVVFVLDTSSSMIGRKVEQLIAAMDAILSDLNPSDYFSIVEF 496
Query: 370 NGE-------------------TYLFSTSME--------LATKEAVERAHQWIGINFIAG 402
N + ++ STS LA+ E + +A I A
Sbjct: 497 NSDYSVHELKEADEPQPEPQKFSWYGSTSSSNKELVSPSLASPENIAKAKVIIS-RLRAN 555
Query: 403 GSTNICAPLTKAVEML-------------TNSRGSI--------------------PIIF 429
G TNI + L+ A++++ +NS + PII
Sbjct: 556 GGTNIHSALSVAMDLIHKFSGKHDISSEKSNSSDAANEKAIANANDLKTKPVHELEPIII 615
Query: 430 LVTDGAVEDERQICDAMKSRL------TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+TDG + + + SR+ N G + +++ G + FLR L++ + G
Sbjct: 616 FLTDG----DPTVGETSTSRIISHVTEKNSGEMRASLFSLAFGEDADRNFLRKLSLRNEG 671
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKD-LDEFEMYPSRIPDLSSESPLIV 542
+ Y+ +Q++ + + S +LA++ ++ + E + ++ + + S ++V
Sbjct: 672 FMRHIYEAADAALQLRDFYKQVSSPLLAHVKFTYPREQIKEGSVSKNKFRTVYAGSEVVV 731
Query: 543 SGRYQGKFPDTLK-AKGFLGDLSNFVV-------ELKLQLAKD-IPLDRICAKQQIDLLT 593
+G D GF G+ + ++ ++K+ K+ +PL+R+ A I L
Sbjct: 732 AGELSDDDVDLRPVVSGFCGNQNGKLIPYENDQSKIKVTRVKEFLPLERLWAYLSIHQLL 791
Query: 594 AQAWFSED---KRLEEKVSKMSVQTGVLCEYTRMIIVETDERN 633
Q SED K E+K ++++ + T +++V+ +E N
Sbjct: 792 DQRDASEDTAAKEHEKKALNLALKYSFVTPLTSLVVVKPNETN 834
>gi|348534300|ref|XP_003454640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Oreochromis niloticus]
Length = 640
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 91/191 (47%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ + T+ AL L+ L D F ++ F+G + + + A +
Sbjct: 211 KNVVFVIDQSGSMHGRKILQTRTALIHILNDLAEDDHFGLITFDGNVFHWKRELVQANSK 270
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDERQICD 444
+E A + N A GST+I + + ML + GS I+ L+TDG +
Sbjct: 271 NLESAKNFAS-NIQARGSTDINEAVLQGARMLNDHSREGSASILILLTDGDPTSGVTNLE 329
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G + FL +++ + G YD ++Q++ +
Sbjct: 330 KIQSNVRQAIAGKFPLYCLGFGFDVDFNFLEKMSLQNNGVARRIYDDSDADLQLKGFYEE 389
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 390 VATPLLTDVTM 400
>gi|3024062|sp|P97279.1|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2;
Short=ITI heavy chain H2; Short=ITI-HC2;
Short=Inter-alpha-inhibitor heavy chain 2; Flags:
Precursor
gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus]
Length = 946
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/351 (22%), Positives = 148/351 (42%), Gaps = 52/351 (14%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTEDQFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QITDAKRYIE-KIQPSGGTNINEALLRAIFILNEASNLGMLNPDSVSLIVLVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNRDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYP-SRIPDLSSE--------SPLIVSGRYQGKF 550
K + + + +L N+ + YP + + D++ S ++V+G+Y
Sbjct: 488 KFYNQVSTPLLRNVQFN----------YPQASVTDVTQNSFHNYFGGSEIVVAGKYD--- 534
Query: 551 PDTLKAKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTA 594
P L + ++ EL L+ L+KD D ++ A I+ L A
Sbjct: 535 PSKLAEVQSIITATSTNTELVLETLSQMDDLEDFLSKDKHADPNFTKKLWAYLTINQLLA 594
Query: 595 QAWFSEDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
+ + ++ K++K MS+ ++ T M+I DER A P
Sbjct: 595 ERSLAPTAAIKRKITKTILQMSLDHHIVTPLTAMVIENEAGDERMLADSPP 645
>gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
[Ailuropoda melanoleuca]
Length = 898
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 237 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 293
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + V +I + G T+I L +A+++L +
Sbjct: 294 SNRIKVWKDHLVSVTPDNVRDGKVYIH-HMSPTGGTDINGALQRAIKLLNDYVAHNDIED 352
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + ++ G IC I+T GIG+ + L L++
Sbjct: 353 RSVSLIIFLTDGKPTVGETHTLKILNNTKEAARGQIC--IFTIGIGNDVDFMLLEKLSLE 410
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + D Q+ + + +L++I +D L E + + P+ + S +
Sbjct: 411 NCGLTRRVLEEDDAGAQLIGFYDEIRTPLLSDIRVDYPPALVE-QATRTLFPNYFNGSEI 469
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + L + + FV+ L D+P++
Sbjct: 470 VIAGKLADRKVGQLHVEVTASNSKKFVI-----LKTDVPVE 505
>gi|344278001|ref|XP_003410785.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Loxodonta
africana]
Length = 945
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVTDAKKYIE-KIQPNGGTNINEALLRAIFILNEADNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSSQLR 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L NI +
Sbjct: 488 KFYNQVSTPLLRNIQFN 504
>gi|313221459|emb|CBY32209.1| unnamed protein product [Oikopleura dioica]
Length = 1138
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 5/125 (4%)
Query: 314 LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
L P AK R F + IF++D SGSM G+ +E T+ A + L PGD+FNIV+F
Sbjct: 569 LFP--AKLRSSFGVNAIFLIDSSGSMMGERMEQTREAFKFMIEGLKPGDTFNIVSFESVN 626
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-NSRGSIP--IIFL 430
+FS + + + A AGG+T++ A L KA +L+ N R S II+
Sbjct: 627 KVFSDNRMVPVNDRSMFAALKFMDQIQAGGATDVYAALVKASLLLSQNKRTSNQENIIYF 686
Query: 431 VTDGA 435
+TDGA
Sbjct: 687 LTDGA 691
>gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca]
Length = 895
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 123/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 235 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 291
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + V +I + G T+I L +A+++L +
Sbjct: 292 SNRIKVWKDHLVSVTPDNVRDGKVYIH-HMSPTGGTDINGALQRAIKLLNDYVAHNDIED 350
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + ++ G IC I+T GIG+ + L L++
Sbjct: 351 RSVSLIIFLTDGKPTVGETHTLKILNNTKEAARGQIC--IFTIGIGNDVDFMLLEKLSLE 408
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + D Q+ + + +L++I +D L E + + P+ + S +
Sbjct: 409 NCGLTRRVLEEDDAGAQLIGFYDEIRTPLLSDIRVDYPPALVE-QATRTLFPNYFNGSEI 467
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + L + + FV+ L D+P++
Sbjct: 468 VIAGKLADRKVGQLHVEVTASNSKKFVI-----LKTDVPVE 503
>gi|156400040|ref|XP_001638808.1| predicted protein [Nematostella vectensis]
gi|156225932|gb|EDO46745.1| predicted protein [Nematostella vectensis]
Length = 798
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 121/536 (22%), Positives = 216/536 (40%), Gaps = 96/536 (17%)
Query: 84 IPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKS-----CDCRIAVPMGDQGSILGV 138
IPLQ V ++ Y T +V V+G K+ + P+ +Q ++ G
Sbjct: 14 IPLQSIKVNAEITGY----------TAQVAAVLGYKNEGSNPIEAVYIFPLDEQAAVCGF 63
Query: 139 EAEISGKSYHTQLI----ALGENDGA---GKSASVETGSFLKPNIFTLTLPQIDGGSYLS 191
+A I G++ ++ A G+ D A G SA + S +IF + + + +
Sbjct: 64 QATIDGRTIVAEIKEKEEARGDYDDAISSGHSAFLLEESDESSDIFQMNVGNLPPNESAT 123
Query: 192 IRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKI-----------------HL 234
++L++ +L E + V F P + P + ++ I +
Sbjct: 124 LQLKFVTELEV--DEENGTVQFVLPTVLNPRYTPVDQQPSISTEIPASSVAKPYSFEFQM 181
Query: 235 NVNAGTG-TEVLCNTSSHHL-------------------KQLRRDVGKLGYSYESEVLKW 274
NV +G+ TE+ +++SH L RRDV ++ E +
Sbjct: 182 NVKSGSAITEI--SSTSHKLCFQPDPSDNCHASVTLAESHTFRRDV-EIQIKSEDPFVAH 238
Query: 275 SNIDFDFSY-TVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIV 333
+ ++ + +P G+ + + L Q+ + + +P K+ + + IF+V
Sbjct: 239 ALVEPGLPRPSDAPEDKTRGLAISTEFL----QKPVAMVNFVPAF-KADDLTCGEFIFVV 293
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTS-------MELAT 384
D SGSM G ++D L + L L G FNIV F G +Y LFS S ++ AT
Sbjct: 294 DRSGSMSGSRIKDAARTLQLFLKSLPDGCYFNIVGF-GSSYKTLFSKSKTYNDETLKTAT 352
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
A A G T I PL + V + G+ +FL+TDG V + Q+
Sbjct: 353 NHAAHLAADL--------GGTEILEPL-RWVYSQSLIEGAPRQLFLLTDGEVGNTAQVIS 403
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
L + R+++FGIG + ++ +A G D +++++ K R
Sbjct: 404 -----LVAENASTARVFSFGIGDGASTELIKGVARAGHGSAEFVRGQDKLQVKVIKTLKR 458
Query: 505 GFSSVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGF 559
L ++A+ L D + + S++P + LI G ++ D+ AKG
Sbjct: 459 ALQPALTDVALSWALPDGWKLQEVQSKLPPVFKGERLIAYGVFKNLNNDS-NAKGI 513
>gi|74227570|dbj|BAE21837.1| unnamed protein product [Mus musculus]
Length = 950
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 148/346 (42%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 313 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKT 372
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 373 QIADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNMGLLNPDSVSLIILVSDGDPTVG 431
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 432 ELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 491
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + + S ++V+G++ P L
Sbjct: 492 KFYNQVSTPLLRNVQFNYPQASVTDVTQNNFH-----NYFGGSEIVVAGKFD---PSKLT 543
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFS 599
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 544 EVQSIITATSANTELVLETLSQMDDLEEFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 603
Query: 600 EDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
++ K++K MS+ ++ T M+I DER A P
Sbjct: 604 PTAAIKRKITKTILQMSLDHHIVTPLTAMVIENDAGDERMLADSPP 649
>gi|3024068|sp|Q61703.1|ITIH2_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2;
Short=ITI heavy chain H2; Short=ITI-HC2;
Short=Inter-alpha-inhibitor heavy chain 2; Flags:
Precursor
gi|695634|emb|CAA49842.1| inter-alpha-inhibitor H2 chain [Mus musculus]
Length = 946
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 148/346 (42%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QIADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNMGLLNPDSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + + S ++V+G++ P L
Sbjct: 488 KFYNQVSTPLLRNVQFNYPQASVTDVTQNNFH-----NYFGGSEIVVAGKFD---PSKLT 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFS 599
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 540 EVQSIITATSANTELVLETLSQMDDLEEFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 599
Query: 600 EDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
++ K++K MS+ ++ T M+I DER A P
Sbjct: 600 PTAAIKRKITKTILQMSLDHHIVTPLTAMVIENDAGDERMLADSPP 645
>gi|226874935|ref|NP_034712.2| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Mus
musculus]
gi|148676057|gb|EDL08004.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus]
Length = 950
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 148/346 (42%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 313 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKT 372
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 373 QIADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNMGLLNPDSVSLIILVSDGDPTVG 431
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 432 ELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 491
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + + S ++V+G++ P L
Sbjct: 492 KFYNQVSTPLLRNVQFNYPQASVTDVTQNNFH-----NYFGGSEIVVAGKFD---PSKLT 543
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFS 599
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 544 EVQSIITATSANTELVLETLSQMDDLEEFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 603
Query: 600 EDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
++ K++K MS+ ++ T M+I DER A P
Sbjct: 604 PTAAIKRKITKTILQMSLDHHIVTPLTAMVIENDAGDERMLADSPP 649
>gi|21707832|gb|AAH34341.1| Inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus]
Length = 946
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 148/346 (42%), Gaps = 42/346 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QIADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNMGLLNPDSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
K + + + +L N+ + ++ D+ + + + S ++V+G++ P L
Sbjct: 488 KFYNQVSTPLLRNVQFNYPQASVTDVTQNNFH-----NYFGGSEIVVAGKFD---PSKLT 539
Query: 556 AKGFLGDLSNFVVELKLQ-----------LAKDIPLD-----RICAKQQIDLLTAQAWFS 599
+ ++ EL L+ L+KD D ++ A I+ L A+ +
Sbjct: 540 EVQSIITATSANTELVLETLSQMDDLEEFLSKDKHADPDFTKKLWAYLTINQLLAERSLA 599
Query: 600 EDKRLEEKVSK----MSVQTGVLCEYTRMIIVET--DERNNASESP 639
++ K++K MS+ ++ T M+I DER A P
Sbjct: 600 PTAAIKRKITKTILQMSLDHHIVTPLTAMVIENDAGDERMLADSPP 645
>gi|323492793|ref|ZP_08097935.1| von Willebrand factor type A (vWA) domain-containing protein
[Vibrio brasiliensis LMG 20546]
gi|323312864|gb|EGA65986.1| von Willebrand factor type A (vWA) domain-containing protein
[Vibrio brasiliensis LMG 20546]
Length = 694
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 149/354 (42%), Gaps = 36/354 (10%)
Query: 303 DVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD 362
+ +R + PG +R +D +F++D SGSM GK + L KL D
Sbjct: 288 ETSKRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMSGK-YSTLVEGVRQGLGKLPAED 346
Query: 363 SFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR 422
F +V F+ TY + V +A Q + + G TN+ ++ A+ L + R
Sbjct: 347 RFRVVMFDNNTYDLTGGFVAVNPTNVSKALQSVEQVEPSNG-TNLYEGMSAAIRRLDDDR 405
Query: 423 GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
+ I LVTDG V ++R+ + M+ R++TF +G+ N L +
Sbjct: 406 PT--GIVLVTDGVANVGVTEKRRFFELMEKHDV-------RLFTFIMGNSANTPLLVPMT 456
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID----TLKDLDEFEMYPSRIPDL 534
+S G + + D I + + ++ NI +D +KDL P +I L
Sbjct: 457 KLSNGVATSVSNADDIIGHLMNITSKLTYQAYRNIQLDIDGVKIKDLT-----PEQISSL 511
Query: 535 SSESPLIVSGRY-QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP-LDRICAKQQIDL 591
L V G Y +G D G + ++ L A + P L+R+ A I
Sbjct: 512 YRGEQLTVFGHYFKGGEADIKLTMDIGGTSQQYRTKVMLPDSAIEHPELERMWAFSAIRD 571
Query: 592 LTAQAWFSE--DKRLEEKVSKMSVQTGVLCEYTRMIIVETD-------ERNNAS 636
L Q + E D E+ + ++++ G+L +YT +++VE D ER N S
Sbjct: 572 LQEQMDYLEQKDSDKEQAIEDIALEYGLLTDYTSLLVVEEDVFQQLGIERKNQS 625
>gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein
[Hahella chejuensis KCTC 2396]
gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Hahella chejuensis KCTC 2396]
Length = 687
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 77/330 (23%), Positives = 147/330 (44%), Gaps = 27/330 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F + + PG +D ++D SGSM GK L +K + D ++ F
Sbjct: 291 FMLTVTPGDDLPAITEGRDWTLVLDRSGSMSGK-FSTLLEGLRKGFAKFNRNDRVRVIMF 349
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAG--GSTNICAPLTKAVEMLTNSRGSIPI 427
N + AT E ++ Q +G AG G TN+ + + A+ L R +
Sbjct: 350 NDNATEVTNGWVQATPENLQ---QVVGAVENAGPSGGTNLMSAIQSALTGLDADRTNA-- 404
Query: 428 IFLVTDGAVE----DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
I+LVTDG ++ + ++ + R++TF +G+ N L + S G
Sbjct: 405 IWLVTDGEANVGETKQKAFIELLEKKDI-------RLFTFIMGNSANRPLLEAITKHSNG 457
Query: 484 YYGAAYDLDSIEIQMQKLFTRGFSSVL--ANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+ + + D I Q+ ++ + L AN+ I +K D ++P +I + L+
Sbjct: 458 FAISVSNSDDIIGQLMLAASKVDHAALHGANLKISGIKTTD---VFPKQIGSVYRGQQLV 514
Query: 542 VSGRYQGKFPDTLKAKGFL-GDLSNFVVELKLQLAKDIP-LDRICAKQQI-DLLTAQAWF 598
+ G Y G ++ G + G + + +++ P L+R+ A QI DL+ Q +
Sbjct: 515 MFGHYYGSGQAKVELTGKVSGSPIRYQTQFEVKDGTLHPELERLWAFAQIEDLMDQQQDY 574
Query: 599 SEDKRLEEKVSKMSVQTGVLCEYTRMIIVE 628
ED ++ V+ ++V+ G++ +YT M++++
Sbjct: 575 GEDADRKQAVTDLAVEYGLVTDYTSMLVLD 604
>gi|402879592|ref|XP_003903418.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5 [Papio anubis]
Length = 956
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 123/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-----NSRG- 423
+ ++ + T +++ +I + G T+I L +A+ +L N G
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHNDIGD 395
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIIFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + FV+ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFVI-----LKTDVPV 547
>gi|348575582|ref|XP_003473567.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Cavia
porcellus]
Length = 1153
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 121/281 (43%), Gaps = 21/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F+ D S SM G L+ T++AL L L P D F+I+ F
Sbjct: 481 FVHYFAP---KDLPPLPKNVVFVFDTSASMVGTKLQQTRDALTTILLDLRPQDHFSIIGF 537
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + +I + +GG T+I L A+ +L N
Sbjct: 538 SNRIKVWKDHLVPVTPNNIIDGKFYIHLMSPSGG-TDINGALQAAIRLLNNYVSHNDIEE 596
Query: 423 GSIPIIFLVTDGA-VEDERQICDAM-KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ ++ +TDG E + + ++ G IC I+T GIG+ + L L++
Sbjct: 597 RSVSLVIFLTDGKPTVGETHVPKILNNTKEAAQGRIC--IFTVGIGNDVDFRLLEKLSLD 654
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + + Q+ + + +L NI ID L E + P+ + S +
Sbjct: 655 NCGLTRRVLEEEDAGAQLIGFYDEIRTPLLFNIRIDYPSGLVEHATK-TVFPNYFNGSEI 713
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
+++G+ + D L + + F++ L D+P++
Sbjct: 714 VIAGKLVDRKMDQLHVEVTASNSKKFII-----LKTDVPVE 749
>gi|323650180|gb|ADX97176.1| inter-alpha (globulin) inhibitor h4 isoform 1 [Perca flavescens]
Length = 354
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 103/221 (46%), Gaps = 5/221 (2%)
Query: 297 QSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALS 356
+ SL D+ + ++ ++ SR K+V+F++D SGSM G+ ++ T+ AL L+
Sbjct: 52 RDTSLGDIKTSTGYFVHHFAPSSLSR--IPKNVVFVIDQSGSMSGRKMQQTRIALIHILN 109
Query: 357 KLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE 416
L D F ++ F+ + + + A K+ +E A + N G+T+I A + +
Sbjct: 110 DLAEDDHFGLITFDSSIFHWKRELVQANKKNLESAKTF-ARNIQDRGTTDINAAVLEGAR 168
Query: 417 MLTNS--RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
ML GS II L+TDG + ++S + + +Y G G N FL
Sbjct: 169 MLNAHPREGSASIIILLTDGDPTSGETNLERIQSNVRRDIADKFPLYCLGFGHDVNFEFL 228
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
+++ + G Y+ ++Q++ + + +L ++ +
Sbjct: 229 EKMSLQNNGVARRIYEDSDADLQLKGFYDEVATPLLTDVTM 269
>gi|384949568|gb|AFI38389.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor
[Macaca mulatta]
Length = 942
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 123/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-----NSRG- 423
+ ++ + T +++ +I + G T+I L +A+ +L N G
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHNDIGD 395
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIIFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + FV+ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFVI-----LKTDVPV 547
>gi|334348383|ref|XP_001365240.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2
[Monodelphis domestica]
Length = 953
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 97/201 (48%), Gaps = 16/201 (7%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + LA+K
Sbjct: 312 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSVVDFNHNVRNWRDDLVLASKA 371
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG--AVE 437
+ A ++I G TNI L +A+ +L + S+ +I LV+DG V
Sbjct: 372 QITDAKKYIE-KIQPNGGTNINEALLRAIFILNEASNLGMLDPNSVSLIILVSDGDPTVG 430
Query: 438 DER--QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+ + QI +K + + S +++ GIG ++ FL L+ + G Y
Sbjct: 431 ELKLSQIQKNVKQSMQDNIS----LFSLGIGFDVDYDFLERLSQENHGVAQRIYGNQDTS 486
Query: 496 IQMQKLFTRGFSSVLANIAID 516
+Q+++ + + + +L N+ +
Sbjct: 487 LQLKQFYNQVSTPLLRNVQFN 507
>gi|449480691|ref|XP_004176568.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5
[Taeniopygia guttata]
Length = 961
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 121/265 (45%), Gaps = 19/265 (7%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATK 385
K+V+F++D S SM G L+ TK AL L L P D FNI+ F+ ++ + T
Sbjct: 316 KNVVFVLDSSASMVGTKLKQTKEALFTILQDLRPEDHFNIIGFSNRIKVWQQDRLVPVTP 375
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SRGSIPIIFLVTDGAVED 438
+ A ++I N G TNI L ++L + S+ +I +TDG
Sbjct: 376 NNIRDAKKYIH-NMSPTGGTNINGALQTGAKLLNDYIAQNDIDARSVSLIIFLTDG---- 430
Query: 439 ERQICDAMKSRLTNGGSICPR----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
+ + S++ + R ++T GIG+ +H L +A+ + G + +
Sbjct: 431 RPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDHKLLERMALENCGMTRLFQEDEDA 490
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
++ + + +L++I +D +D D ++ + P+ + S +I++G+ + D L
Sbjct: 491 ASHLKGFYDEIGTPLLSDIRVDYPED-DVEQVTQNFFPNYFNGSEIIIAGKLINRTSDKL 549
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIP 579
+ + +++ LK +A D+P
Sbjct: 550 HVEVTASNSKKYIL-LKTDVAIDLP 573
>gi|320169902|gb|EFW46801.1| von Willebrand domain-containing protein [Capsaspora owczarzaki
ATCC 30864]
Length = 966
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 94/195 (48%), Gaps = 12/195 (6%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMEL 382
K +VIF+VD SGSM G + +NAL + L L FNIV F G++Y LF ++
Sbjct: 289 IKCEVIFVVDRSGSMGGSQILQARNALTLFLKSLPTNSRFNIVGF-GDSYQTLFPEAVGY 347
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQ 441
++ + A + + G T + APL +A+E + +R P +F++TDG V + Q
Sbjct: 348 -SQANLNTALKHVSTMDANLGGTELLAPL-RAIERM-KTRADFPRQVFVLTDGQVSNTDQ 404
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ + S N R + GIGS +H+ + +A +G D + +E ++ +
Sbjct: 405 VIAQVNSDSKNA-----RYFALGIGSGVSHHLVEGIARAGKGNAQFVVDGERMETKVMRQ 459
Query: 502 FTRGFSSVLANIAID 516
L ++ ++
Sbjct: 460 LKDAIQPSLTDVRVN 474
>gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens]
Length = 748
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 86 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 142
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 143 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 201
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 202 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 259
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 260 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 318
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 319 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 353
>gi|348544633|ref|XP_003459785.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like,
partial [Oreochromis niloticus]
Length = 655
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 90/191 (47%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ +E T+ AL L+ L D F ++ F+ + + A ++
Sbjct: 245 KNVVFVIDQSGSMHGRKIEQTRIALIHILNDLAEDDHFGLITFDDRISYWKRELVQANRK 304
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRGSIPIIFLVTDGAVEDERQICD 444
+E A + N GST+I + + ML N GS I+ L+TDG +
Sbjct: 305 NLESAKNF-ARNIEERGSTDIHEAVLQGARMLNAHNREGSASILILLTDGDPTSGVTNLE 363
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G N FL +++ + G Y+ ++Q++ +
Sbjct: 364 KIQSNVRQAIAGKFPLYCLGFGFDVNFNFLEKMSLQNNGVARRIYEDSDADLQLKGFYEE 423
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 424 VATPLLTDVTM 434
>gi|348500867|ref|XP_003437993.1| PREDICTED: von Willebrand factor A domain-containing protein
5B2-like [Oreochromis niloticus]
Length = 1350
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 106/233 (45%), Gaps = 8/233 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
FC LL + K ++++F++D SGSM G ++ K A+ VAL L G NIV F
Sbjct: 351 FCPDLLREPLEVHKA-TRELLFLIDRSGSMSGANIQRVKEAMVVALKSLPSGTMLNIVGF 409
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
TS +L T + +A++++ TN+ L + R +F
Sbjct: 410 GTTIKPLFTSSKLCTDVTLTQAYEYVQRMRADMRGTNLLGALFWVYQQPMQ-RSYPRQVF 468
Query: 430 LVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY 489
++TDG++ + ++ + ++ G R + G+G L+ +A ++ G
Sbjct: 469 IITDGSISNVARMLELVRRNACAG-----RCFGLGLGPRACRRLLQGVAKLTGGTTEFLD 523
Query: 490 DLDSIEIQMQKLFTRGFSSVLANIAIDT-LKDLDEFEMYPSRIPDLSSESPLI 541
D + ++ ++ K + F VL ++ ID L + E + P+ IP L + LI
Sbjct: 524 DEERLQPKLIKSLKKAFEPVLTDVRIDWYLPENMEALLTPNEIPPLYPGNRLI 576
>gi|313225210|emb|CBY21004.1| unnamed protein product [Oikopleura dioica]
Length = 1138
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 66/125 (52%), Gaps = 5/125 (4%)
Query: 314 LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
L P AK R F + IF++D SGSM G+ +E T+ A + L PGD+FNIV+F
Sbjct: 569 LFP--AKLRSSFGVNAIFLIDSSGSMMGERMEQTREAFKFMIEGLKPGDTFNIVSFESVN 626
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-NSRGSIP--IIFL 430
+FS + + + A AGG+T+ A L KA +L+ N R S II+
Sbjct: 627 KVFSDNRMVPVSDRSTFAALKFMDQIQAGGATDAYAALVKASLLLSQNKRTSSQENIIYF 686
Query: 431 VTDGA 435
+TDGA
Sbjct: 687 LTDGA 691
>gi|334328266|ref|XP_001377644.2| PREDICTED: von Willebrand factor A domain-containing protein 5B1
[Monodelphis domestica]
Length = 1188
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 104/222 (46%), Gaps = 14/222 (6%)
Query: 301 LH-DVDQREMFCMYLLPGTAKSRKVFKK---DVIFIVDISGSMQGKPLEDTKNALAVALS 356
LH D+ + + P ++ +K + IF++D SGSM G + K+A+ + L
Sbjct: 317 LHKDIPHHSVLMLNFCPDLQQTHYSLRKTHGEFIFLIDRSGSMSGVNMHLVKDAMILILK 376
Query: 357 KLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKA 414
L P FNI+ F G T+ LF +S + V + GG TNI +PL K
Sbjct: 377 SLMPTCLFNIIGF-GSTFKTLFPSSQVYSEDNLVSACKNIQHLRADMGG-TNILSPL-KW 433
Query: 415 VEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
+ G ++FL+ DG+V + ++ + ++ N S R Y+FGIG +
Sbjct: 434 ITRQPIHEGHPRLLFLLIDGSVNNTGKVIELLR----NNASTT-RCYSFGIGPKACPRLV 488
Query: 475 RMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ LA +SRG + ++ +M K + + VL+++ ++
Sbjct: 489 QGLAAVSRGSAEFLRQGERLQPKMIKSLKKAMAPVLSDVTVE 530
>gi|410963163|ref|XP_003988135.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 isoform 2
[Felis catus]
Length = 935
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 298 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDKFSVVDFNHNVRTWRHDLVSATKS 357
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 358 QTADAKKYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPESVSLIILVSDGDPTVG 416
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 417 ELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSAQLK 476
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 477 KFYNQVSTPLLRNVQFN 493
>gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii]
Length = 694
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 32 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 88
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 89 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 147
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 148 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 205
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 206 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 264
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + FV+ L D+P+
Sbjct: 265 IIAGKLVDRKLDHLHVEVTASNSKKFVI-----LKTDVPV 299
>gi|169604987|ref|XP_001795914.1| hypothetical protein SNOG_05509 [Phaeosphaeria nodorum SN15]
gi|160706682|gb|EAT86573.2| hypothetical protein SNOG_05509 [Phaeosphaeria nodorum SN15]
Length = 1180
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 92/185 (49%), Gaps = 20/185 (10%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF-----STSM 380
K +++F+ D SGSM G +E K AL V L L G FNI +F G +Y F ++
Sbjct: 296 KPELVFVCDRSGSMNGTSMELAKQALKVFLKSLPVGVKFNICSF-GSSYSFLWKKSASYN 354
Query: 381 ELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVEDE 439
+ EAV A Q+ N+ T + APL ++ +P+ I L+TDG + D+
Sbjct: 355 QENLDEAVRHAEQF-SANY---NGTEMLAPLKATIDQRYK---DMPLDIILLTDGQIWDQ 407
Query: 440 RQICDAMKSRLTNGGSICP-RIYTFGIGSYCNHYFLRMLAMISRGY---YGAAYDLDSIE 495
++ + +T GS P R++T GIG+ +H + +A G+ GA +D+
Sbjct: 408 ERLFSYLNEAIT--GSKQPVRVFTLGIGNGVSHALIEGVAKAGNGFSQAVGAGEKMDAKV 465
Query: 496 IQMQK 500
++M K
Sbjct: 466 VRMVK 470
>gi|410963161|ref|XP_003988134.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 isoform 1
[Felis catus]
Length = 946
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDKFSVVDFNHNVRTWRHDLVSATKS 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDGAVEDE 439
A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QTADAKKYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPESVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSAQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|403296362|ref|XP_003939080.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Saimiri
boliviensis boliviensis]
Length = 946
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 54/233 (23%), Positives = 100/233 (42%), Gaps = 21/233 (9%)
Query: 301 LHDVDQRE----------MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNA 350
L+DVD+ E F + P K+++F++D+SGSM G ++ T A
Sbjct: 276 LYDVDREEKAGELEVFNGYFVHFFAPDNLDP---IPKNILFVIDVSGSMWGVKMKQTVEA 332
Query: 351 LAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAP 410
+ L L D F+++ FN + + ATK V A ++I G TNI
Sbjct: 333 MKTILDDLRAEDHFSVIDFNHNVRTWRNDLISATKTQVADAKRYIE-KIQPSGGTNINEA 391
Query: 411 LTKAVEMLTNSR-------GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF 463
L +A+ +L + S+ +I LV+DG ++ + +++
Sbjct: 392 LLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSL 451
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
G+G ++ FL+ L+ +RG Y Q++K + + + +L N+ +
Sbjct: 452 GMGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLRKFYNQVSTPLLRNVQFN 504
>gi|403296364|ref|XP_003939081.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Saimiri
boliviensis boliviensis]
Length = 728
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 125/288 (43%), Gaps = 37/288 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 66 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 122
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT----NSRG-- 423
+ ++ + T +++ +I + G T+I L A+ +L +S G
Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINEALQTAIRLLNKYMAHSDGGD 181
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 182 RSVSLIIFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGTDVDFRLLEKLSLE 239
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRI--------P 532
+ G ++ + Q+ + + +L++I ID +PS + P
Sbjct: 240 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIHID---------YHPSSVVQATKTLFP 290
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+ + S +I++G+ + D L + + F++ L D+P+
Sbjct: 291 NYFNGSEIIIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 333
>gi|116192639|ref|XP_001222132.1| hypothetical protein CHGG_06037 [Chaetomium globosum CBS 148.51]
gi|88181950|gb|EAQ89418.1| hypothetical protein CHGG_06037 [Chaetomium globosum CBS 148.51]
Length = 952
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 104/215 (48%), Gaps = 19/215 (8%)
Query: 294 VLLQSPSL--HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNAL 351
VL Q P+L H + + LP T K +++F+ D SGSM+G + K AL
Sbjct: 260 VLEQHPTLDNHRALMATLVPKFNLPST-------KPEIVFLCDRSGSMEGPNVRHLKTAL 312
Query: 352 AVALSKLDPGDSFNIVAFNG-ETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAP 410
+ L L G FNI +F +LF + + ++E+A +++ G T+I P
Sbjct: 313 HLFLKSLPVGVKFNICSFGSRHEFLFGNKSQTYDQSSLEKASRYVDGFSADFGGTDIYHP 372
Query: 411 LTKAVEMLTNSR-GSIPI-IFLVTDGAVEDERQICDAMKSRLT-NGGSICPRIYTFGIGS 467
+ E + N R + + +FL+TDG + D+ + + + ++ + G I R++T GIGS
Sbjct: 373 M----EDVINRRYADMDLEVFLLTDGEIWDQSALFELLNKKVAESNGKI--RVFTLGIGS 426
Query: 468 YCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+H + +A G+ + D + + ++ ++
Sbjct: 427 GASHALVEGVAAAGNGFSQSVGDNEKMNTKVVRML 461
>gi|327265885|ref|XP_003217738.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Anolis carolinensis]
Length = 636
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 143/345 (41%), Gaps = 69/345 (20%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++FI+D+S SM G+ L+ T+ AL L + D N V F + + + ++ AT E
Sbjct: 293 KNIVFIIDVSISMSGRKLQQTREALLKILEDIKEDDYLNFVLFGDDVHKWKDTLIKATPE 352
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG--SIP-----IIFLVTDGAVEDE 439
++ A +++ IAG TN+ L +EML + S+P +I ++TDG
Sbjct: 353 NLDEASRYVQQIDIAGW-TNLNGGLMAGIEMLNEAHKNRSLPERSASLIIMLTDGRPTKG 411
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+ + S + N +Y G G ++ L +A + G Y+ +Q+Q
Sbjct: 412 ERDTQVILSNVRNAIQGKYPLYNLGFGYDLDYGSLEKMAAENNGLARRIYEDSDSALQLQ 471
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEM-YPSR-IPDLSSE--------SPLIVSGRYQGK 549
GF +AN L E E+ YP I DL+ S L+V+GR
Sbjct: 472 -----GFYDEVANPL------LTEVELKYPENVISDLTQNHFKHYYDGSELVVAGRITD- 519
Query: 550 FPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS 609
DL++ E+K A+D DL F+E +EE
Sbjct: 520 -----------NDLNSITAEVKAHGAED------------DLT-----FTEQSDMEETAK 551
Query: 610 KMSVQTGVLCEYTR-----MIIVETDERNNASESPGTKKGSKKSD 649
+ Q + +Y + I + E+ NA+ KG +K++
Sbjct: 552 ALEEQQHIFGDYMERLWAYLTIQQLLEKRNAA------KGEEKAN 590
>gi|260812906|ref|XP_002601161.1| hypothetical protein BRAFLDRAFT_214487 [Branchiostoma floridae]
gi|229286452|gb|EEN57173.1| hypothetical protein BRAFLDRAFT_214487 [Branchiostoma floridae]
Length = 495
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 100/224 (44%), Gaps = 11/224 (4%)
Query: 314 LLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
+L K+ + + + I ++D SGSM G + + L + L L G +FNIV F G T
Sbjct: 276 ILASHPKTSEGIQGEYIAVIDRSGSMSGAFIATARETLLLFLKSLPAGSAFNIVGF-GST 334
Query: 374 Y--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLV 431
+ LF S+ + +E V A W+ G TN+ PL + + G +F++
Sbjct: 335 FKPLFDASVPV-NQENVGTASAWVCKMRADLGGTNLLGPL-EWIFSAPRPAGRPREVFIL 392
Query: 432 TDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
TDGAV + ++ D L S R + GIG + +R +A RG ++
Sbjct: 393 TDGAVSNTSRVID-----LVRANSSHTRCWAVGIGEGASRVLIRGIAEAGRGRAEFVTEV 447
Query: 492 DSIEIQMQKLFTRGFSSVLANIAID-TLKDLDEFEMYPSRIPDL 534
D ++ ++ R + ++ I+ + D + P +P+L
Sbjct: 448 DRMQAKLLLCLKRSLQPAICDVNIEWKIPDGVQLVQTPCHLPNL 491
>gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Pongo
abelii]
Length = 942
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + FV+ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFVI-----LKTDVPV 547
>gi|348534302|ref|XP_003454641.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Oreochromis niloticus]
Length = 663
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 90/191 (47%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ +E T+ AL L+ L D F ++ F+ + + A ++
Sbjct: 251 KNVVFVIDQSGSMHGRKIEQTRIALLHILNDLAEDDHFGLITFDDRISYWKRELVQANRK 310
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRGSIPIIFLVTDGAVEDERQICD 444
+E A + N GST+I + + ML N GS I+ L+TDG +
Sbjct: 311 NLESAKNF-ARNIQDRGSTDINEAVLQGARMLNAHNREGSASILILLTDGDPTSGVTNLE 369
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G N FL +++ + G Y+ ++Q++ +
Sbjct: 370 KIQSNVRQAIAGKFPLYCLGFGFDVNFNFLEKMSLQNNGVARRIYEDSDADLQLKGFYEE 429
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 430 VATPLLTDVTM 440
>gi|380485022|emb|CCF39629.1| von Willebrand domain-containing protein [Colletotrichum
higginsianum]
Length = 523
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 82/166 (49%), Gaps = 17/166 (10%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLF--STSMEL 382
K +V+FI D SGSMQG + D ++AL V L L G FNI +F E ++LF S + +
Sbjct: 330 KPEVVFICDRSGSMQGARMTDLRDALRVFLKSLPVGAMFNICSFGSEHSFLFDKSVTYDQ 389
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI---IFLVTDGAVEDE 439
T EA R NF G T I P+ + T R + + +FL+TDG V E
Sbjct: 390 NTLEAAMRHVDTFAANF---GGTEIRQPIEQ-----TFRRRHVDLDLEVFLLTDGEVWGE 441
Query: 440 RQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
+ + + + G+I R++T G+G +H + LA G+
Sbjct: 442 QGLFGLVGDHVRESKGAI--RLFTLGVGRDVSHSLIEXLARAGNGF 485
>gi|348545854|ref|XP_003460394.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like,
partial [Oreochromis niloticus]
Length = 519
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 90/191 (47%), Gaps = 3/191 (1%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G+ +E T+ AL L+ L D F ++ F+ + + A ++
Sbjct: 196 KNVVFVIDQSGSMYGRKIEQTRIALVHILNDLAEDDHFGLITFDDRISYWKRELVQANRK 255
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRGSIPIIFLVTDGAVEDERQICD 444
+E A + N GST+I + + ML N GS I+ L+TDG +
Sbjct: 256 NLESAKNF-ARNIQDRGSTDINEAVLQGARMLNAHNREGSASILILLTDGDPTSGVTNLE 314
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++S + + +Y G G N FL +++ + G Y+ ++Q++ +
Sbjct: 315 NIQSNVRQAIAGKFPLYCLGFGFDVNFNFLEKMSLQNNGVARRIYEDSDADLQLKGFYEE 374
Query: 505 GFSSVLANIAI 515
+ +L ++ +
Sbjct: 375 VATPLLTDVTM 385
>gi|145530760|ref|XP_001451152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418796|emb|CAK83755.1| unnamed protein product [Paramecium tetraurelia]
Length = 919
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 98/203 (48%), Gaps = 11/203 (5%)
Query: 319 AKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LF 376
A+++ + + + +F +D SGSM G + K +L + L L +FNI++F G T+ L+
Sbjct: 340 AQNQVINRGNYLFFIDRSGSMTGARINKAKQSLLLFLKSLPEDCNFNIISF-GSTFRSLW 398
Query: 377 STSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV--EMLTNSRGSIPIIFLVTDG 434
S S + + + +E A + + T I PL++ V + S+ + +FL+TDG
Sbjct: 399 SESKQYS-QNTLEDAIKHVNNMEANMNGTEILKPLSQVVYSKYYGKSKSTTLNVFLLTDG 457
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
VE + I D +K R+YT GIG C+ + ++ LA + G + D + I
Sbjct: 458 EVEAQP-IIDLVKK----NNQAETRVYTLGIGEGCSQFLIKRLAEVGNGKFQFVSDNEDI 512
Query: 495 EIQMQKLFTRGFSSVLANIAIDT 517
++ L + L ++T
Sbjct: 513 NAKVIDLLEDSLTPYLKEFNLET 535
>gi|145514483|ref|XP_001443148.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410521|emb|CAK75751.1| unnamed protein product [Paramecium tetraurelia]
Length = 625
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 89/200 (44%), Gaps = 21/200 (10%)
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKK-DVIFIVDISGSMQGKPLEDTKNALAVALSK 357
P ++V + + YL + +V + + +FI+D SGSM G +E K AL + L
Sbjct: 370 PKFNEVSLDDAYTQYLDGLSIADNQVINRGNYLFIIDRSGSMSGSRIEKAKQALILFLKS 429
Query: 358 LDPGDSFNIVAFN-GETYLFSTSMELATKEAVERAHQWIGI----------NFIAG-GST 405
L FNI++F + +LF + V Q++GI A G T
Sbjct: 430 LPQDSEFNIISFGIADIFLFFNHQSVPLNN-VSSQQQFLGIVQNEAIQHVEEMAANMGGT 488
Query: 406 NICAPLTKAVEMLT--NSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTF 463
I PL + V + S+ + +F++TDG D QI ++S RIYT
Sbjct: 489 EILTPLQQMVYNASYGTSKNTTLNVFMLTDGET-DADQIIQLVQS----NNQAQTRIYTL 543
Query: 464 GIGSYCNHYFLRMLAMISRG 483
GIG C+ Y ++ +A + G
Sbjct: 544 GIGQGCSQYLIQRVAEVGNG 563
>gi|328873465|gb|EGG21832.1| hypothetical protein DFA_01718 [Dictyostelium fasciculatum]
Length = 837
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 100/214 (46%), Gaps = 17/214 (7%)
Query: 309 MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
+ Y+L T + K + IFIVD SGSM G ++ +K+AL + + L+ FNIV+
Sbjct: 316 FYPSYILDKTPEDELNQKMECIFIVDRSGSMSGDRIQSSKSALQIIMRSLNENTKFNIVS 375
Query: 369 FNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT---KAVEMLTNSRG 423
F G ++ LF S + E + G++ GG T + P+ KA + R
Sbjct: 376 F-GTSFEKLFPESRDYDDTSLTEASKLIEGMSANMGG-TELFEPIKDVLKAPALPDYPRQ 433
Query: 424 SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+F++TDGAV +Q+ + S RI+TFGIG + + L+ +G
Sbjct: 434 ----VFILTDGAVSQRQQLVKFVASEAN-----TTRIFTFGIGGGVDKELVIGLSKACKG 484
Query: 484 YYGAAYD-LDSIEIQMQKLFTRGFSSVLANIAID 516
Y D + E ++ L + +L+N+ ID
Sbjct: 485 SYELISDSANDFEERVLSLLSIAMQPMLSNVKID 518
>gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
[Meleagris gallopavo]
Length = 951
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 122/265 (46%), Gaps = 19/265 (7%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATK 385
K+V+F++D S SM G L TK AL L L P D FNI+ F+ ++ + T
Sbjct: 306 KNVVFVLDSSASMVGTKLRQTKEALFTILQDLRPEDHFNIIGFSNRIKVWQQDRLVPVTP 365
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SRGSIPIIFLVTDGAVED 438
+ A ++I N G TNI + L ++L + S+ +I +TDG
Sbjct: 366 NNIRDAKKYIH-NMSPTGGTNINSALQTGAKLLNDYIAQNNIDARSVSLIIFLTDG---- 420
Query: 439 ERQICDAMKSRLTNGGSICPR----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
+ + S++ + R ++T GIG+ ++ L +A+ + G + +
Sbjct: 421 RPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGMVRHFQEDEDA 480
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
++ + + +L++I ID +D E ++ + P+ + S +I++G+ + D+L
Sbjct: 481 ASHLKGFYDEIGTPLLSDIRIDYPEDNVE-QVTQNFFPNYFNGSEIIIAGKLINRTSDSL 539
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIP 579
+ + +++ LK +A D+P
Sbjct: 540 HVEVTASNSKKYIL-LKTDVAIDLP 563
>gi|224092755|ref|XP_002190101.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2
[Taeniopygia guttata]
Length = 948
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 96/205 (46%), Gaps = 24/205 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ LS+L D F+++ FN + ++ AT
Sbjct: 311 KNILFVIDVSGSMWGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRCWRDNLVSATPS 370
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG----- 434
VE A ++I GG TNI L +A +L ++ S+ +I LV+DG
Sbjct: 371 QVEDAKKYIQTIHPNGG-TNINEALLRATFILNEAKSLGMLDPNSVSMIVLVSDGDPTVG 429
Query: 435 ---AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
++ + ++K + ++ GIG ++ FL+ +A +RG +
Sbjct: 430 ELKLTTIQKNVKQSIKDEFS--------LFCLGIGFDVDYDFLQRIATDNRGMAHRIFGN 481
Query: 492 DSIEIQMQKLFTRGFSSVLANIAID 516
+QM+ + + + +L I +
Sbjct: 482 QETSLQMKNFYNQVSTPLLKKIQFN 506
>gi|109088171|ref|XP_001107718.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Macaca mulatta]
Length = 946
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVSDAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|426363947|ref|XP_004049089.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Gorilla gorilla gorilla]
Length = 712
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L +R S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEARNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|440790517|gb|ELR11799.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 502
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 82/179 (45%), Gaps = 6/179 (3%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+++F+VD SGSM G ++ KNALA+ L G FN++ F S + E
Sbjct: 222 ELLFVVDRSGSMSGGRIDQAKNALALFFHSLPVGTRFNVIGFGSHYVKLFPSSRVYDDET 281
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+E+A+Q + G T + PL + ++ + +F++TDG V + ++ + ++
Sbjct: 282 LEQANQHVAAIRADLGGTQLLEPLRDVLSSPSDPKYPRQ-VFVLTDGEVGNTNEVIECVR 340
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
+ R++ GIGS + + LA RGY + +E ++ + R
Sbjct: 341 KHARD-----TRVFALGIGSNVSMELVNGLAKAGRGYAEYVVSGERLEAKVLRQLKRAL 394
>gi|397515207|ref|XP_003827848.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan
paniscus]
Length = 728
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 66 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 122
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 181
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 182 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 239
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 240 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 298
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 299 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 333
>gi|432944124|ref|XP_004083334.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Oryzias latipes]
Length = 946
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 92/201 (45%), Gaps = 8/201 (3%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+I+ FN + + A+
Sbjct: 301 KNIVFVIDVSGSMWGVKMKQTAEAMQAILDDLTMEDQFSIIDFNHNVRCWREDLVPASPI 360
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDGAVEDE 439
E A +I N G TNI L +AV+ML + + S+ +I V+DG
Sbjct: 361 QTEDAKNYIQ-NIRPAGGTNINEALLRAVQMLVRTSNHDLLEQRSVSMIIFVSDGDPTVG 419
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ L +++ GIG ++ FL ++M +RG + + Q++
Sbjct: 420 EIKLGTIQKNLKRVMREEFSLFSLGIGYDVDYDFLERISMENRGAAQRIFSNHNAAEQLR 479
Query: 500 KLFTRGFSSVLANIAIDTLKD 520
+ + S +L I+I +D
Sbjct: 480 TFYRQVSSPLLRQISIRFSED 500
>gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo
sapiens]
Length = 728
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 66 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 122
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 181
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 182 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 239
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 240 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 298
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 299 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 333
>gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens]
Length = 824
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 162 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 218
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 219 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 277
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 278 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 335
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 336 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 394
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 395 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 429
>gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan
troglodytes]
Length = 728
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 66 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 122
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 181
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 182 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 239
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 240 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 298
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 299 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 333
>gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct]
Length = 956
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|426363945|ref|XP_004049088.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Gorilla
gorilla gorilla]
Length = 728
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 66 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 122
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 181
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 182 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 239
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 240 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 298
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 299 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 333
>gi|395538999|ref|XP_003771461.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2
[Sarcophilus harrisii]
Length = 1125
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 98/199 (49%), Gaps = 12/199 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 487 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRAEDQFSVVDFNHNVRSWRDELVPATKA 546
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEML---TNSR----GSIPIIFLVTDG-AVED 438
+ A ++I G TNI L +A+ +L +N R S+ +I LV+DG
Sbjct: 547 QITDAKKYIE-KIQPNGGTNINEALLRAIFILNEASNLRMLDPNSVSLIILVSDGDPTVG 605
Query: 439 ERQICDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
E ++ + K+ + + +I +++ GIG ++ FL L+ + G Y IQ
Sbjct: 606 ELKLSNIQKNVKASMQDNIS--LFSLGIGFDVDYDFLERLSQENHGVAQRIYGNQDTSIQ 663
Query: 498 MQKLFTRGFSSVLANIAID 516
+++ + + + +L N+ +
Sbjct: 664 LKQFYNQVSTPLLRNVQFN 682
>gi|187609608|sp|Q86UX2.2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5;
Short=ITI heavy chain H5; Short=ITI-HC5;
Short=Inter-alpha-inhibitor heavy chain 5; Flags:
Precursor
Length = 942
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|390465052|ref|XP_003733333.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 isoform 2
[Callithrix jacchus]
Length = 935
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 92/197 (46%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 298 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVVDFNHNIRTWRNDLISATKT 357
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRG-----SIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L N+ G S+ +I LV+DG
Sbjct: 358 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNMGLLDPNSVSLIILVSDGDPTVG 416
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ +RG Y Q++
Sbjct: 417 ELKLSKIQKNVKENIRDNISLFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLR 476
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 477 KFYNQVSTPLLRNVQFN 493
>gi|395827495|ref|XP_003786937.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Otolemur
garnettii]
Length = 940
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/280 (23%), Positives = 120/280 (42%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNI+ F
Sbjct: 278 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFNIIGF 334
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ ++ + T + + +I + G T+I L +A+ +L +
Sbjct: 335 SSRIRVWKDHVIPVTPDHIRDGKVYIH-HMSPTGGTDINGALQRAIGLLNDYMAHGDVDD 393
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G IC I+T GIG+ + L L++
Sbjct: 394 RSVSLIIFLTDGKPTVGETHTLKILNNTREAARGRIC--IFTIGIGNDVDFRLLEKLSLE 451
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + + Q+ + + +L++I +D + + P+ + S +
Sbjct: 452 NCGLTRRVLEEEDAGAQLIGFYDEIRTPLLSDIRVDYPPNAVA-HATKTLFPNYFNGSEI 510
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+++G+ + D L + + FVV L D+P+
Sbjct: 511 VIAGKLVDRQLDRLHVEVTASNSKKFVV-----LKTDVPV 545
>gi|268637758|ref|XP_002649126.1| type A von Willebrand factor domain-containing protein
[Dictyostelium discoideum AX4]
gi|269933417|sp|C7G046.1|Y6969_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0286969
gi|256012883|gb|EEU04074.1| type A von Willebrand factor domain-containing protein
[Dictyostelium discoideum AX4]
Length = 2079
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 93/196 (47%), Gaps = 13/196 (6%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMEL 382
V + ++IF+VD+S SM+G ++ K AL L L FNI++F + LF+ S++
Sbjct: 350 VVESELIFLVDVSESMEGYNMKHAKKALHRFLHSLSKDTYFNIISFASSHRKLFAQSVKY 409
Query: 383 ATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEM-LTNSRGSIPIIFLVTDGAVEDER 440
E ++ A ++ + I+ G TN+ PL + T R IFL+TDG V +
Sbjct: 410 -NDENLKAATAYVESLKAISHGETNLLEPLKDIYSVDATCPRK----IFLLTDGRVNNIG 464
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
I D ++ N ++ G+G + + + +A G + ++IE ++ +
Sbjct: 465 PIVDLVRQNAHNTS-----VFPIGMGEFVSRQLVEYIANAGSGVAELVIENETIESKVMR 519
Query: 501 LFTRGFSSVLANIAID 516
R ++NI +D
Sbjct: 520 QLKRALQPAMSNIKVD 535
>gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo
sapiens]
Length = 942
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 isoform 1
[Callithrix jacchus]
Length = 946
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 92/197 (46%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVVDFNHNIRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRG-----SIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L N+ G S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNMGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ +RG Y Q++
Sbjct: 428 ELKLSKIQKNVKENIRDNISLFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLR 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|405971557|gb|EKC36390.1| Inter-alpha-trypsin inhibitor heavy chain H4 [Crassostrea gigas]
Length = 1267
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/325 (20%), Positives = 150/325 (46%), Gaps = 32/325 (9%)
Query: 247 NTSSHHLKQLRRDV---GKLGYSYESEVLKWSNIDFDFSYTVSPSH--------IFGGVL 295
N ++ ++ LR D+ + G + ++++++ S + SY+++ S + G +
Sbjct: 670 NITTVNVPPLRNDILTENREGENRDADIVRLSETEVRVSYSLTASQQRALSEQGVSGQFV 729
Query: 296 LQSPSLHDVDQREMFCM------YLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKN 349
+Q + D E+ M +L P + D++F++D SGSM G ++
Sbjct: 730 VQYDIEREKDSGEVLVMDGYFVHFLAP---EGLDPMPMDIVFVLDRSGSMSGNKMKQLHE 786
Query: 350 ALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICA 409
++ L ++ D F +++F+ + + + AT A+ A ++I + A GST+I +
Sbjct: 787 SMLKILDEISEQDRFMLISFDNQLNFWRNELVQATSSAIADARRFI-TSISARGSTDINS 845
Query: 410 PLTKAVEMLTNSRGS--IPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTF 463
+T+ +++L + R + PI+ +TDG V + +I + +K L N I I++
Sbjct: 846 AMTEGLKVLASERNNSRAPILVFLTDGQPTSGVTNTVKILENIK--LFNEVQIP--IFSL 901
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK-DLD 522
G ++ F + +A + G Y+ +Q+ + + ++ NI+ + L+
Sbjct: 902 AFGKGADYDFTKKVAAQNNGLGKRIYEDSDAALQIAGFYQEISTVLMRNISFKYVDGSLN 961
Query: 523 EFEMYPSRIPDLSSESPLIVSGRYQ 547
+ + + S LIV+G+ Q
Sbjct: 962 KTTLTRNNFNTYFKGSELIVAGQIQ 986
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 47/221 (21%), Positives = 94/221 (42%), Gaps = 4/221 (1%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D+I + D SGSM G+ + + AL L D +++F+ + ++ + A +
Sbjct: 35 DIILVFDKSGSMLGRKMSQLQAALIKILDDTKDIDRIMLLSFSHTVHSWNRGLVAADRNN 94
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIP-IIFLVTDGAVEDERQICDA 445
E A Q+I GG TNI +T+ + E+ T + G P ++ +TDG +
Sbjct: 95 KEAAKQYIREQMAFGG-TNINLGITEGLRELKTYNHGDRPALMIFLTDGKATSGVTDTER 153
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
+ + N + I+ G ++ + +A + G+ ++ +Q+ +
Sbjct: 154 ILQNIINANTYNIPIFALAFGREADYTISKKIAAKNFGFARKIFEDADATLQITGFYDEI 213
Query: 506 FSSVLANIAIDTLKD-LDEFEMYPSRIPDLSSESPLIVSGR 545
S +L ++ L L + + R + S L+V+GR
Sbjct: 214 SSLLLEDVNFKYLDGALYDSTVTDRRFNNYFKGSELVVAGR 254
>gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens]
gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1
[synthetic construct]
Length = 942
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor
[Homo sapiens]
Length = 942
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|363727396|ref|XP_417299.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Gallus
gallus]
Length = 954
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 121/265 (45%), Gaps = 19/265 (7%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATK 385
K+V+F++D S SM G L TK AL L L P D FNI+ F+ ++ + T
Sbjct: 309 KNVVFVLDSSASMVGTKLRQTKEALFTILQDLRPEDHFNIIGFSNRIKVWQQDRLVPVTP 368
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SRGSIPIIFLVTDGAVED 438
+ A ++I N G TNI + L ++L + S+ +I +TDG
Sbjct: 369 NNIRDAKKYIH-NMSPTGGTNINSALQTGAKLLNDYIAQNNIDARSVSLIIFLTDG---- 423
Query: 439 ERQICDAMKSRLTNGGSICPR----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
+ + S++ + R ++T GIG+ ++ L +A+ + G + +
Sbjct: 424 RPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGMVRHFQEDEDA 483
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
++ + + +L++I ID +D E ++ + P+ + S +I++G+ D+L
Sbjct: 484 ASHLKGFYDEIGTPLLSDIRIDYSEDNVE-QVTQNFFPNYFNGSEIIIAGKLINHTSDSL 542
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIP 579
+ + +++ LK +A D+P
Sbjct: 543 HVEVTASNSKKYIL-LKTDVAIDLP 566
>gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2
[Acyrthosiphon pisum]
Length = 919
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/372 (21%), Positives = 156/372 (41%), Gaps = 59/372 (15%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN--GETYLFSTSM 380
K + VIF++D+SGSM G+ L K A+ LS++ D F ++ F+ + + + +
Sbjct: 351 KPLRTHVIFVLDVSGSMVGQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWTINATQ 410
Query: 381 EL------------------------------ATKEAVERAHQWIGINFIAGGSTNICAP 410
E AT++ V+ A ++I + + STN+
Sbjct: 411 ETSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQ-DLQSESSTNMEDA 469
Query: 411 LTKA--VEMLTNSR------GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYT 462
L KA + L +R PII +TDG + ++N IY+
Sbjct: 470 LNKAHLIAKLGETRFKDGANTPKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYS 529
Query: 463 FGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKD-L 521
G G + FL+ L++ + G+ Y+ +Q++ + S VL+N+ + D +
Sbjct: 530 LGFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEISSPVLSNVTFQYIDDQV 589
Query: 522 DEFEMYPSRIPDLSSESPLIVSGRYQ-GKFPD---TLKAKGFLGDLSNFVV--------- 568
D + L + L+V+G+ + GKFPD TL A GD V+
Sbjct: 590 DTVTVTKKSFNLLFKGTELVVAGKLKDGKFPDFNSTLNADSTEGDFKGPVIITCFDFPIV 649
Query: 569 --ELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS--KMSVQTGVLCEYTRM 624
E+ Q + L+++ A I L Q ++++ EK +++++ + T +
Sbjct: 650 PPEVVPQKRRIGHLEKLWAYLNIQQLMDQYELNKNENSTEKAKALELALKYSFVTPLTSL 709
Query: 625 IIVETDERNNAS 636
++V+ ++ ++
Sbjct: 710 VVVKPNDTTTSA 721
>gi|66801689|ref|XP_629769.1| hypothetical protein DDB_G0292016 [Dictyostelium discoideum AX4]
gi|74851205|sp|Q54DV3.1|Y2016_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0292016
gi|60463171|gb|EAL61364.1| hypothetical protein DDB_G0292016 [Dictyostelium discoideum AX4]
Length = 918
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 88/193 (45%), Gaps = 10/193 (5%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K + IF++D SGSM G+ + + A+ + + L+ NI F +
Sbjct: 297 KSEFIFLIDCSGSMSGQSINKARRAMEIIIRSLNEQHKVNIYCFGSSFNKVFDKSRVYND 356
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT--NSRGSIPIIFLVTDGAVEDERQIC 443
E +E A ++ G T + P+ V++L+ N +F++TDG + + ++
Sbjct: 357 ETLEIAGSFVEKISANLGGTELLPPM---VDILSSPNDPEYPRQVFILTDGEISERDKLI 413
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
D + RI+T+GIG+ + + L+ +GYY + ++E Q+ KL
Sbjct: 414 DYVAKEAN-----TTRIFTYGIGASVDQELVIGLSKACKGYYEMIKETTNMEKQVMKLLN 468
Query: 504 RGFSSVLANIAID 516
F +L+NI +D
Sbjct: 469 VAFEPMLSNIKLD 481
>gi|410222448|gb|JAA08443.1| inter-alpha (globulin) inhibitor H5 [Pan troglodytes]
gi|410301624|gb|JAA29412.1| inter-alpha (globulin) inhibitor H5 [Pan troglodytes]
Length = 942
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|355696808|gb|AES00464.1| inter-alpha inhibitor H3 [Mustela putorius furo]
Length = 131
Score = 67.8 bits (164), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/113 (33%), Positives = 62/113 (54%), Gaps = 8/113 (7%)
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
V+F++D+SGSM G+ +E TK+AL L + D N + F+G+ + ++ AT E +
Sbjct: 1 VVFVIDVSGSMYGRKMEQTKDALLKILEDVKQEDYLNFILFSGDVTTWKDNLVQATPENI 60
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDG 434
E+A ++ N G TNI L +A+ ML +R S I+ ++TDG
Sbjct: 61 EQARTFVK-NIREQGMTNINDALLRAISMLNKAREEHRVPERSTSIVIMLTDG 112
>gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1
[Acyrthosiphon pisum]
Length = 884
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/372 (21%), Positives = 156/372 (41%), Gaps = 59/372 (15%)
Query: 323 KVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN--GETYLFSTSM 380
K + VIF++D+SGSM G+ L K A+ LS++ D F ++ F+ + + + +
Sbjct: 351 KPLRTHVIFVLDVSGSMVGQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWTINATQ 410
Query: 381 EL------------------------------ATKEAVERAHQWIGINFIAGGSTNICAP 410
E AT++ V+ A ++I + + STN+
Sbjct: 411 ETSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQ-DLQSESSTNMEDA 469
Query: 411 LTKA--VEMLTNSR------GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYT 462
L KA + L +R PII +TDG + ++N IY+
Sbjct: 470 LNKAHLIAKLGETRFKDGANTPKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYS 529
Query: 463 FGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKD-L 521
G G + FL+ L++ + G+ Y+ +Q++ + S VL+N+ + D +
Sbjct: 530 LGFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEISSPVLSNVTFQYIDDQV 589
Query: 522 DEFEMYPSRIPDLSSESPLIVSGRYQ-GKFPD---TLKAKGFLGDLSNFVV--------- 568
D + L + L+V+G+ + GKFPD TL A GD V+
Sbjct: 590 DTVTVTKKSFNLLFKGTELVVAGKLKDGKFPDFNSTLNADSTEGDFKGPVIITCFDFPIV 649
Query: 569 --ELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVS--KMSVQTGVLCEYTRM 624
E+ Q + L+++ A I L Q ++++ EK +++++ + T +
Sbjct: 650 PPEVVPQKRRIGHLEKLWAYLNIQQLMDQYELNKNENSTEKAKALELALKYSFVTPLTSL 709
Query: 625 IIVETDERNNAS 636
++V+ ++ ++
Sbjct: 710 VVVKPNDTTTSA 721
>gi|431917642|gb|ELK16907.1| Inter-alpha-trypsin inhibitor heavy chain H2 [Pteropus alecto]
Length = 698
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 95/202 (47%), Gaps = 13/202 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F++V FN +S + ATK
Sbjct: 298 KNILFVIDVSGSMWGIKMKQTVEAMKTILDDLRTDDQFSVVNFNHNVRTWSNDLVSATKT 357
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
+ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 358 QIADAKKYIE-KIQPSGGTNINEALLRAIFILNEADNLGMLDPNSVSLIILVSDGDPTVG 416
Query: 440 RQICDAMK-SRLTNGGSICPR----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
+ +K S++ R +++ GIG ++ FL+ L+ +RG Y
Sbjct: 417 KYHFRELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGMAQRIYGNQDT 476
Query: 495 EIQMQKLFTRGFSSVLANIAID 516
Q+++ + + + +L N+ +
Sbjct: 477 SSQLKQFYNQVSTPLLRNVQFN 498
>gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex]
Length = 891
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 111/508 (21%), Positives = 201/508 (39%), Gaps = 90/508 (17%)
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGEND-------GAGKSASVETG 169
SK+ + V + + I G EI GK Y + + AG+SA+ T
Sbjct: 84 SSKAQEATFHVVLPETAFISGFLMEIDGKEYKASVKEKEKAQEEYQSAVDAGQSAAQVTA 143
Query: 170 SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPA--IKKIP 227
+ N FT+++ I+ + L + + LS R G + + +VTP + K+
Sbjct: 144 NARNANQFTVSV-NIEPEQKIFFNLTYEELLSRRKGIYEQAI------HVTPGSVVPKMS 196
Query: 228 KREKIH--LNVNA-------GTGTEVLCNTSSHHLKQLRRD-----VGKLGYSYESEVLK 273
R I L +N G E N++ + + Q+ R V + E+E +K
Sbjct: 197 VRVNIFETLPINKITVPQLRGNDVEQDHNSTENEIAQIIRTNETTVVSVVYEPAEAEQIK 256
Query: 274 WSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIV 333
S + V + + +H VD F + +P + K VIF++
Sbjct: 257 MSKDGLQGQFVVQYDVDRSSIEKKGGEIHVVDG--YFVHFFVPADLPT---LPKHVIFVL 311
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPG-DSFNIVAFNG--------------ETYLFST 378
D SGSM G +E TK A+ L +L D F++V F+ + Y F++
Sbjct: 312 DTSGSMAGTRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKPYKGPDHYYFNS 371
Query: 379 SMELATKEA----------------------------VERAHQWIGINFIAGGSTNICAP 410
E T++A V+RA +++ + STNI
Sbjct: 372 PPEETTEDATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMDVT-SSTNINDA 430
Query: 411 LTKAVEMLTNSRGSI---PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTF 463
L A++ + + + PII +TDG +V D +I + R N + I+
Sbjct: 431 LLLALKNSQSVQSRVRLTPIIIFLTDGEPTASVTDTTEILKNV--RKGNSDDVV-SIFCL 487
Query: 464 GIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA-IDTLKDLD 522
G+ ++ FL ++ +RG+ Y+ +Q++ F S +L+N+ +
Sbjct: 488 AFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGFFDEVASPLLSNVNFVYNNNGPV 547
Query: 523 EFEMYPSRIPDLSSESPLIVSGRYQGKF 550
+ ++ + +P+ + +V+GR KF
Sbjct: 548 KHDVTETNVPNFFKGTEFVVAGRIDPKF 575
>gi|441625952|ref|XP_003257666.2| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy
chain H5 [Nomascus leucogenys]
Length = 955
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/288 (22%), Positives = 122/288 (42%), Gaps = 38/288 (13%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDYLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSDIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ ++ +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLVVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYP--------SRIP 532
+ G ++ + Q+ + + +L++I ID YP + P
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIXTPLLSDIRID----------YPPSSVCSHQTLFP 503
Query: 533 DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+ + S +I++G+ + D L + + FV+ L D+P+
Sbjct: 504 NYFNGSEIIIAGKLVDRKLDHLHVEVTASNSKKFVI-----LKTDVPV 546
>gi|328874977|gb|EGG23342.1| type A von Willebrand factor domain-containing protein
[Dictyostelium fasciculatum]
Length = 1697
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 15/211 (7%)
Query: 309 MFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
M Y P +K + ++IF+VD S SM+G + K L + L G FNI++
Sbjct: 271 MVVFY--PKLEPVKKQQEMELIFLVDCSESMEGYNITHAKKGLLLFSHSLPKGTYFNIIS 328
Query: 369 FNGETY--LFSTSMELATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
F G T+ LF S+ +E + +A+ +I G+ +G TN+ APL ++ S
Sbjct: 329 F-GSTHQKLFPQSLPYC-EENLAQANTFIKGLKADSGNDTNLLAPLK---DIYATSATRP 383
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
IFL+TDG V + QI + ++ N ++ G+G + + + +A G
Sbjct: 384 RKIFLLTDGGVNNTGQIAELVRQNADNTS-----VFPIGMGEFVSKQLVDAIASAGCGVA 438
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
A + ++IE ++ + R NI +D
Sbjct: 439 ELAIENETIENKVMRQLKRALQPAYTNIRMD 469
>gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens]
Length = 735
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix
jacchus]
Length = 940
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 279 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 335
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-------NSR 422
+ ++ + T ++V +I + G T+I L A+ +L +
Sbjct: 336 SNRIKVWKDHLISVTPDSVRDGKVYIH-HMSPTGGTDINEALQTAIRLLNKYVAHSDSGD 394
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 395 RSVSLIIFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 452
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 453 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIHID-YPPSSVVQATKTLFPNYFNGSEI 511
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 512 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 546
>gi|195539501|ref|NP_001124213.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Gallus
gallus]
gi|190576833|gb|ACE79193.1| inter-alpha inhibitor heavy chain 2 precursor [Gallus gallus]
Length = 948
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ LS+L D F+++ FN + ++ AT
Sbjct: 311 KNILFVIDVSGSMWGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRCWRDNLVSATPA 370
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
VE A ++I GG TNI L +A +L ++ S+ +I LV+DG
Sbjct: 371 QVEDAKKYIQTIHPNGG-TNINEALLRATFILNEAQNLGMLDPNSVSMIVLVSDGDPTVG 429
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + ++ GIG ++ FL+ +A +RG + QM+
Sbjct: 430 ELKLTTIQKNVKQSIKDEYSLFCLGIGFDVDYDFLQRIATDNRGMAQRIFGNQETSAQMK 489
Query: 500 KLFTRGFSSVLANIAID 516
+ + + + +L I +
Sbjct: 490 RFYNQVSTPLLKKIQFN 506
>gi|326911070|ref|XP_003201885.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Meleagris gallopavo]
Length = 948
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 96/205 (46%), Gaps = 24/205 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ LS+L D F+++ FN + ++ AT
Sbjct: 311 KNILFVIDVSGSMWGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRCWRDNLVSATPA 370
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG----- 434
VE A ++I GG TNI L +A +L ++ S+ +I LV+DG
Sbjct: 371 QVEDAKKYIQTIHPNGG-TNINEALLRATFILNEAQNLGMLDPNSVSMIVLVSDGDPTVG 429
Query: 435 ---AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
++ + ++K + ++ GIG ++ FL+ +A +RG +
Sbjct: 430 ELKLTTIQKNVKQSIKDEFS--------LFCLGIGFDVDYDFLQRIATDNRGMAQRIFGN 481
Query: 492 DSIEIQMQKLFTRGFSSVLANIAID 516
QM++ + + + +L I +
Sbjct: 482 QETSAQMKRFYNQVSTPLLKKIQFN 506
>gi|134300085|ref|YP_001113581.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1]
gi|134052785|gb|ABO50756.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1]
Length = 416
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 94/191 (49%), Gaps = 30/191 (15%)
Query: 313 YLLPGTAK----------SRKVFK----KDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
YLLPG + ++V K +++ F++D SGSM G+ L+ TK A+A A+ L
Sbjct: 14 YLLPGNKQVAYLMVKLTAPKQVEKERPVQNLSFVIDRSGSMAGEKLDYTKKAVAFAVGHL 73
Query: 359 DPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE-- 416
P D ++VAF+ + ++S ++A K+A++ A + I GGSTN+ + V
Sbjct: 74 SPQDYCSVVAFDDMVTMVASSHQVANKDALKMAVESI----YPGGSTNLSGGMLLGVREV 129
Query: 417 MLTNSRGSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHY 472
L + I + L+TDG V D + + KSR G + + TFG+G
Sbjct: 130 KLAHKENQINRVLLLTDGMANVGVTDHSALVE--KSREMAAGGV--NLSTFGLGEDFEED 185
Query: 473 FLRMLAMISRG 483
L+ AM+ G
Sbjct: 186 LLQ--AMVEAG 194
>gi|355782622|gb|EHH64543.1| hypothetical protein EGM_17788 [Macaca fascicularis]
Length = 947
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 95/203 (46%), Gaps = 18/203 (8%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVSDAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 R------QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
+ +I +K + + S +++ G+G ++ FL+ L+ +RG Y
Sbjct: 428 KCELKLSKIQKNVKENIQDNIS----LFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQD 483
Query: 494 IEIQMQKLFTRGFSSVLANIAID 516
Q++K + + + +L N+ +
Sbjct: 484 TSSQLKKFYNQVSTPLLRNVQFN 506
>gi|390353616|ref|XP_782776.3| PREDICTED: uncharacterized protein LOC577455 [Strongylocentrotus
purpuratus]
Length = 1951
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 158/362 (43%), Gaps = 31/362 (8%)
Query: 290 IFGGVLLQSPSL----HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLE 345
+ G V + P + H ++ CM ++ V K ++I ++D S SM+ L+
Sbjct: 598 LVGVVDIHRPRMWVEQHPENKHSQACMVTFYPDFEASNVEKPEIILLLDCSNSMKEGALK 657
Query: 346 DTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVERAHQWIGINFIAGGS 404
K L + L + FN+V F G LF++S + T+ + A +I G+
Sbjct: 658 QAKQILLLTLHHMSDDSIFNVVTFGTGFEELFASS-QAKTETTILAATTFINQACATQGN 716
Query: 405 TNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFG 464
T+ PL L S G + +FLV+DG + +E+ A+ + R++TFG
Sbjct: 717 TDAWRPLQSY--FLLRSEG-LQNLFLVSDGHINNEQSTLHAIAQNTHS------RVFTFG 767
Query: 465 IGSYCNHYFLRMLAMISRGYYGAAYDLDS---IEIQMQKLFTRGFSSVLANIAIDTLKDL 521
+ S N + L+ +A + G Y +D ++ E +++ ++ ++++ ++ K
Sbjct: 768 VSSSANRHLLKGMARVGCGAY-EFFDNNAKSRWENKVKAQLSKAKQPSVSSLEVE-WKQH 825
Query: 522 DEFEMYPSRIP----DLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD 577
D P + P L + S L+V G Q LKA ++S V ++ + K
Sbjct: 826 DNNPPTPVQAPCQLTSLFNGSRLVVYGFVQYCTEACLKASIDGEEVSTMVSTPEMCITKG 885
Query: 578 IPLDRICAKQQID-----LLTAQAWFSEDKRLEEK--VSKMSVQTGVLCEYTRMIIVETD 630
L R+ A+ I L E ++ E+K V +S + ++ + T + +ET
Sbjct: 886 QMLHRLTARAVIRDWEDGTLDPDKTHHEARKNEQKSYVIDLSKKYSIVTQLTSFVAIETR 945
Query: 631 ER 632
E+
Sbjct: 946 EK 947
>gi|118367268|ref|XP_001016849.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89298616|gb|EAR96604.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 1049
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 84/161 (52%), Gaps = 12/161 (7%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS-FNIVAFNGETY--LFSTSMEL 382
+ + IF++D SGSM G+P+ AL + L L P DS FN+++F G ++ LF +S +
Sbjct: 310 RSEFIFLLDRSGSMSGQPIRRACEALTLFLKSL-PNDSYFNVISF-GSSFDKLFPSSTKY 367
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
T E++E+A I G T I PL V + +G IFL+TDG V+ +Q+
Sbjct: 368 -TSESLEKAILLISKYQADLGGTEIYNPLNN-VFVQNKIQGYNKQIFLLTDGEVDSPQQV 425
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
RL + R+++ G GS + Y ++ A+ +G
Sbjct: 426 V-----RLIKKNNKYNRVHSIGFGSGADQYLIKESAIAGKG 461
>gi|355562281|gb|EHH18875.1| hypothetical protein EGK_19441 [Macaca mulatta]
Length = 947
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 95/203 (46%), Gaps = 18/203 (8%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVSDAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 R------QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
+ +I +K + + S +++ G+G ++ FL+ L+ +RG Y
Sbjct: 428 KCELKLSKIQKNVKENIQDNIS----LFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQD 483
Query: 494 IEIQMQKLFTRGFSSVLANIAID 516
Q++K + + + +L N+ +
Sbjct: 484 TSSQLKKFYNQVSTPLLRNVQFN 506
>gi|225030986|gb|ACN79500.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Nilaparvata
lugens]
Length = 315
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/295 (23%), Positives = 126/295 (42%), Gaps = 64/295 (21%)
Query: 272 LKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREM----------FCMYLLPGTAKS 321
+ WS D++ +S + G ++++ +DVD+++ F + P
Sbjct: 2 ISWSP-DYEQQKEISKDGLKGRLIIE----YDVDRKKHPSQILIEDGHFVHFFAPAELPP 56
Query: 322 RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS-- 379
+K V+F++DISGSM G+ ++ K+A+ LS L+P D F+IV F+ Y++S +
Sbjct: 57 ---LRKQVVFVLDISGSMFGEKIKQLKDAMLKILSDLNPQDHFSIVLFSDNAYVWSKAKT 113
Query: 380 ----------------------------MELATKEAVERAHQWIGINFIAGGSTNICAPL 411
+ AT + V+ A +++ + STNI L
Sbjct: 114 AVMKKILDEGFYNLDNETLAILDDHRNEILQATPDNVKTAKEFVEL-IKPTTSTNIIDGL 172
Query: 412 TKAVEML--------TNSRGSIPIIFLVTDGA----VEDERQICDAMKSRLTNGGSICPR 459
K ++++ T S PI+F +TDG + D +I + S N P
Sbjct: 173 RKGLKLVKEGKETLDTTKEPSQPIMFFLTDGEPNVDLTDPVEIVNETSS--LNEQLKTP- 229
Query: 460 IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
IY+ G + FL+ L+ + G+ Y+ +Q+ + S +LAN+
Sbjct: 230 IYSLAFGQGADITFLKKLSKANHGFARNIYEGSDATLQLNNFYKEISSPLLANVT 284
>gi|85117086|ref|XP_965177.1| hypothetical protein NCU00984 [Neurospora crassa OR74A]
gi|28926981|gb|EAA35941.1| predicted protein [Neurospora crassa OR74A]
gi|38567055|emb|CAE76353.1| hypothetical protein [Neurospora crassa]
Length = 1086
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 102/217 (47%), Gaps = 21/217 (9%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET--YLFSTSMELA 383
+ +++F+ D SGSM G +E K+AL + L + G FNI +F G T +LFS
Sbjct: 293 RPEIVFVCDRSGSMGGARIEGLKSALRIFLKSIPVGAKFNICSF-GSTFEFLFSDGSRSY 351
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
E++ A ++ G T + PL A E N +FL+TDG + ++ +
Sbjct: 352 DHESLRLAMDYVSRMDADLGGTEMYQPLEAAFEKRYNDMDL--EVFLLTDGEIWNQEHLF 409
Query: 444 DAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+ +++ G+I R++T GIG+ +H + A G+ + D + + ++ ++
Sbjct: 410 TMINKKVSESQGAI--RLFTLGIGNDVSHALIEGAARAGNGFAQSVTDSEKMNAKVVRML 467
Query: 503 TRGFSSVLANIAID-------------TLKDLDEFEM 526
G + + + ++ T +++D+FE+
Sbjct: 468 KAGLTPHIKDYTLEIKYAKDSDAGAKTTDEEMDDFEI 504
>gi|397482917|ref|XP_003812660.1| PREDICTED: LOW QUALITY PROTEIN: poly [ADP-ribose] polymerase 4 [Pan
paniscus]
Length = 1724
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 127/563 (22%), Positives = 228/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLNVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAAQALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|317419026|emb|CBN81064.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Dicentrarchus labrax]
Length = 836
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/252 (24%), Positives = 117/252 (46%), Gaps = 24/252 (9%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM+ K ++ T+ AL L L D F ++ F+ + S+ ATKE
Sbjct: 221 KNVVFVIDRSGSMRRKMVQ-TREALLAILKDLHEEDYFALIQFDDRIDSWQKSLTKATKE 279
Query: 387 AVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVED 438
V++A ++ IN+ G T+I + VEML R S+ +I L+TDG
Sbjct: 280 NVDQAMIYVQQINY--RGGTDINQAVLTGVEMLLKDRREKKLPERSVDMIILLTDGMPNS 337
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
++ + + ++ G G+ ++ FL +++ ++G ++ +Q+
Sbjct: 338 GESHLPRIQENVRSAIRGNMSLFCLGFGNDVDYSFLDVMSKQNKGLARRIFEGSDATLQL 397
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
Q + S +L+++ + + E + + L + S ++V+GR
Sbjct: 398 QGFYDEVSSPLLSDVDMRYPDNAVE-SLTTNHFNQLFNGSEIVVAGRLMD---------- 446
Query: 559 FLGDLSNFVVEL 570
DL NF+VE+
Sbjct: 447 --NDLDNFLVEV 456
>gi|395755785|ref|XP_002833596.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2, partial
[Pongo abelii]
Length = 363
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 90/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 20 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRTWRNDLISATKT 79
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 80 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNLGLLDPNSVSLIILVSDGDPTVG 138
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ +RG Y Q++
Sbjct: 139 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSSENRGIAQRIYGNQDTSSQLK 198
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 199 KFYNQVSTPLLRNVQFN 215
>gi|291233159|ref|XP_002736521.1| PREDICTED: poly (ADP-ribose) polymerase family, member 1-like,
partial [Saccoglossus kowalevskii]
Length = 1363
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 126/562 (22%), Positives = 230/562 (40%), Gaps = 76/562 (13%)
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL--- 172
S + + + P+ +++G EA I+GK ++ K A E G++L
Sbjct: 693 NSVAIEAKYVFPLDTMAAVVGFEAFINGKHIIGEVKEKETAHREYKEAISEGHGAYLMDE 752
Query: 173 -KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREK 231
P++FT+++ + + + I++ + +LS DGE + F P V P K ++
Sbjct: 753 ETPDVFTVSVGNLPPKASVLIKITYVTELSV-DGE---KILFFLPGSVAPWTKDTALSQE 808
Query: 232 IHLNVNA------GTGT-----------EVLCNTSSHHLKQLRRDVGKLGYSYESEVLKW 274
I +V G G+ ++ S H ++R K E++K
Sbjct: 809 IQKDVKTVKVDKVGHGSVSVQASIEMPFDIRNLESPTHEINIKRTATKAVV----ELVKD 864
Query: 275 SNIDFDFSYTVSPSHIFGGVLLQSPSL----HDVDQREMFCMYLLPGTAKSRKVFKKDVI 330
S + F + G + P + H Q CM ++ V +VI
Sbjct: 865 SVLGDSFQL------LIGLAEIHVPRMWVEKHPDKQDSQACMLTFYPEFEADSVADCEVI 918
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATKEAV 388
F+VD+S SM+ LE K + L+ D FNIV F G Y LF +S L +K +
Sbjct: 919 FLVDLSNSMKSCSLEAQKVIMLCVLNLPDTY-RFNIVVF-GTAYDELFPSS-RLKSKSTL 975
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A + F G+T PL + L S+G + +FL++DG + ++ + ++
Sbjct: 976 RAAKDFAARCFANMGNTEAWRPL-HSFYCLRPSQG-VRNVFLISDGHINNQESTIEDVRK 1033
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS 508
+ R++TFG+ NH+ L+ +A + G Y + + + + + R S
Sbjct: 1034 NCQH-----TRVFTFGVSCTANHHILQSMARVGAGAY------EYFDSKTKSKWERKVKS 1082
Query: 509 VLANIAIDTLKDLD-EFEMY----------PSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
++ A L ++ E++ + PS I L S S +V G TLKA+
Sbjct: 1083 QISKAAQPVLTSVNVEWQQFDSNSPIPVQAPSHIVSLFSGSRQVVYGFVPHCTQATLKAE 1142
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKRLEEK--VSK 610
+S V L + + L ++ A+ I L E K+ E+K +
Sbjct: 1143 IDDHFVSTMVSTADLNITEGKILHQLTARAIIRDWEDGTLDIDRTEHEVKKTEQKDYIIN 1202
Query: 611 MSVQTGVLCEYTRMIIVETDER 632
+S + ++ ++T + +E E+
Sbjct: 1203 ISKEFSIVTQFTSFVAIEKREK 1224
>gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens]
Length = 694
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|443708079|gb|ELU03372.1| hypothetical protein CAPTEDRAFT_220539 [Capitella teleta]
Length = 2146
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 116/549 (21%), Positives = 213/549 (38%), Gaps = 52/549 (9%)
Query: 117 GSKSCDCRIAVPMGDQGSILGVEAEISGKSY-------HTQLIALGENDGAGKSASVETG 169
+ S + + P+ + ++ G EA I+GK T E AG A +
Sbjct: 663 ANSSIEAKYVFPLSEMAAVCGFEAFINGKHIVGVVKEKETAHREYREAISAGHGAYLMDQ 722
Query: 170 SFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVP---------FKFPEYVT 220
P++FT+++ + + + I++ + ++ NVP E +
Sbjct: 723 DVETPDVFTVSVGNLPPNAQVLIKITYVAEVQVEGESIVFNVPGSVAPWKQDASLDEVIQ 782
Query: 221 PAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFD 280
+ + R N++ ++ + S + K E+ K ++
Sbjct: 783 SDLDTVKVRPSDFRNISVQVAIQMPFDIRSIKCPTHSVRIKKTATHAVVEMNKNQSLADG 842
Query: 281 FSYTVSPSHIF-GGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSM 339
+ + I + ++ HD CM ++ V + IFI+D+S SM
Sbjct: 843 LQLLIDLAEIHVPRMWVERHPTHD----SQACMLTFYPEFEASAVDVLEFIFILDLSNSM 898
Query: 340 QGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVERAHQWIGIN 398
+G L D + AL ++L + P +FN++ F +G LF + AT+ V A ++I
Sbjct: 899 RGDALRDGQKALLLSLRHVPPNAAFNVMVFGSGFDELFPCP-KPATRINVSHAEKFINEV 957
Query: 399 FIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
G+TN L L NS + +FL +DG + E + RL S
Sbjct: 958 EANMGNTNALRALMSC--SLLNS--CLQNVFLFSDGHLSIEDGLL-----RLVRERSHAT 1008
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYY------GAAYDLDSIEIQMQKLFTRGFSSVLAN 512
R++ G+GS N + LR +A G + + ++ QMQ+ G SSV
Sbjct: 1009 RVFAMGVGSSANKHMLRSVANAGCGAFEFFDSKSKSKWESKVKQQMQRARQPGLSSV--- 1065
Query: 513 IAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKL 572
++D + P I L S S ++ G T++A ++S V L
Sbjct: 1066 -SVDWSNARSPIQA-PETISSLFSGSRQVIYGFVNNCTMATIRAVVDDQEVSTVVSTSDL 1123
Query: 573 QLAKDIPLDRICAKQQI--------DLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRM 624
+ + L R+ A+ I D Q + KR ++ V +S+Q ++ ++T
Sbjct: 1124 SITEGQVLHRLAARAVIGDWEDGSLDAGRLQHEVMKKKR-KDYVINLSIQHSIVTQFTSF 1182
Query: 625 IIVETDERN 633
+ VE + N
Sbjct: 1183 VAVEERDAN 1191
>gi|358636443|dbj|BAL23740.1| hypothetical protein AZKH_1418 [Azoarcus sp. KH32C]
Length = 746
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/321 (23%), Positives = 134/321 (41%), Gaps = 46/321 (14%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREKIH 233
++T TL + G + I L++S+ L++ +P P Y P + +R +
Sbjct: 106 GLYTATLGNVLAGEAVRIELKYSEPLAWNGQNLRYRMPTTLAPRYGEPGHLEPWQRPDVS 165
Query: 234 LNVNAGTGTEV-----LCNTS----SHHLKQLRRDVGKLGYSYESEVLK------WSNID 278
+ G +V L S SH ++ S E VLK +++D
Sbjct: 166 VTAEYGLAVQVELFGELATASVASPSHAIR----------VSAEPGVLKVVLAGTGASMD 215
Query: 279 FDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGS 338
DF + S++ + + + +L + + LLP S +D + ++D SGS
Sbjct: 216 RDFVLEMLASNL-TSIAVTASALDT----HVAMLTLLPPAVPSDA--PRDTVLLLDCSGS 268
Query: 339 MQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGIN 398
M G + K + +AL L D F I+ F + ++ A+ A+++A +++
Sbjct: 269 MAGDSIRHAKEGVQLALGHLTSADRFGIIGFGSNVIALAPKLQAASDHALDQARRFVAEL 328
Query: 399 FIAGGSTNICAPLTKAVEM-LTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSIC 457
GG+ LT+A+E+ LT + G I L+TDG + + + +
Sbjct: 329 GDLGGTE-----LTRALEIALTYAAGRPLDILLLTDGEAWN-------VTGAIATAQRLG 376
Query: 458 PRIYTFGIGSYCNHYFLRMLA 478
RI+T GIG+ +R LA
Sbjct: 377 ARIFTVGIGAAVAEDTVRTLA 397
>gi|410920319|ref|XP_003973631.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Takifugu rubripes]
Length = 866
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 63/253 (24%), Positives = 119/253 (47%), Gaps = 25/253 (9%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V+F++D SGSM G ++ T+ AL L L D F +V F+ + + S+ ATKE
Sbjct: 276 KNVVFVIDRSGSMSGIKMQQTREALQAILKDLHEEDHFALVVFDTDISEWRKSLTKATKE 335
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR--GSIP-----IIFLVTDGA-VED 438
+ +A ++ GST + L +AV ML R G +P +I +TDG
Sbjct: 336 NILQATDYVK-KIKDRGSTALNGALLRAVNMLKEERKEGKLPQRSADMIITLTDGMPNHG 394
Query: 439 ERQICDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
E I K+ + GG++ ++ G G+ ++ FL +++ ++G + +Q
Sbjct: 395 ESNIMKIQKNVQSAIGGNMS--LFCLGFGNDVDYSFLDVMSRQNKGLARRIFLGSDAALQ 452
Query: 498 MQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
+Q + S +L ++ + ++ + + ++ L + S ++VSG+ P+
Sbjct: 453 LQGFYDEVSSPLLLDVDLRYPENAVDL-LTKNQYSQLFNGSEIVVSGQLTDNDPE----- 506
Query: 558 GFLGDLSNFVVEL 570
NF+VE+
Sbjct: 507 -------NFLVEV 512
>gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens]
Length = 942
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 120/280 (42%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 PNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPS-SVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|198423074|ref|XP_002122302.1| PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4
[Ciona intestinalis]
Length = 2278
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 122/560 (21%), Positives = 240/560 (42%), Gaps = 57/560 (10%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK----SASVETGSFL----K 173
+ + P+ D ++ G EA I+GK + + E + A K + + G++L +
Sbjct: 682 EAKYVFPLDDSAAVCGFEAFINGKHV---IGKVKEKEVAHKEYKEAVAAGHGAYLMDEER 738
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKK--IPKREK 231
++FT+++ + + + I++ + +LS ++ FK P + P +++ + K
Sbjct: 739 ADVFTISIGNLPPKAKVVIKITYVAELSMVSNNGLESIQFKLPGSIAPNVRENTLIKGPV 798
Query: 232 IHLNVNAGTGTEVLCNTSSHHLK-QLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHI 290
V+ T E SS L ++ + KL + ++K ++ + +
Sbjct: 799 TQYAVDRVTVQESAVRKSSLVLSIEMPFSIRKLECTTHKVLIKQTDTKATIRL-ANNEQL 857
Query: 291 FGGVLLQSPSLHDVDQREMF------------CMYLLPGTAKSRKVFKKDVIFIVDISGS 338
GG L +V M+ CM + V I ++D+S S
Sbjct: 858 GGGGFNFLVGLAEVHVPRMWVEEDDDGVGGQACMLAFYPEFEVEPVTNYRAIILIDMSQS 917
Query: 339 MQGK--PLEDTKNALAVALSKLDPGDSF-NIVAFNGETY--LFSTSMELATKEAVERAHQ 393
M+G+ LE +++ + K P + NIV F G +Y LF L+ E++ +
Sbjct: 918 MKGEGGKLETYSKKISLMILKQLPKNCLVNIVKF-GSSYQELFVYEQVLSENVVWEKSVE 976
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGS--IPIIFLVTDGAVEDERQICDAMKSRLT 451
+I + G++ + PLT ++ + + G+ + IFLV+DG + +E++ D
Sbjct: 977 FIQSCYANMGASEVWRPLT-SLSWVNDYEGTDRLSNIFLVSDGQLGEEKRTLDVAAML-- 1033
Query: 452 NGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGA-AYDLDS-IEIQMQKLFTRGFSSV 509
GG R++TF +G+ N Y L+ ++ S G + A+D S E ++ + + V
Sbjct: 1034 -GGGRKSRVFTFSLGTTANQYLLKKISRSSGGSHEHFAFDAKSKWERKVSQQLEKAQQPV 1092
Query: 510 LANIAID--TLKDLDEFEM-------YPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
L +I ++ T + +E ++ P I L ++S ++V G L AK
Sbjct: 1093 LTDIKVNWKTYNENEETQVSSALPKQAPEYISSLFNQSRVVVYGFVPHCRQAELSAKICG 1152
Query: 561 GDLSNFVVELKLQLAKDIPLDRICAKQQI-DLLTAQAWFSEDKRLEEK------VSKMSV 613
++S V +L + K L + ++ I D S +R E K + +MSV
Sbjct: 1153 KEISTVVSVSELSVTKGKTLHHLTSRALIRDYEDGLHHLSPLERAEAKKLRKPHIIEMSV 1212
Query: 614 QTGVLCEYTRMIIVETDERN 633
+ V+ YT + VE +++
Sbjct: 1213 KHSVVSPYTSFVAVEERKKD 1232
>gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor
[Homo sapiens]
gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens]
Length = 702
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 280 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 336
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T +++ +I + G T+I L +A+ +L
Sbjct: 337 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 395
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 396 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 453
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 454 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 512
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 513 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 547
>gi|118357564|ref|XP_001012031.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89293798|gb|EAR91786.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 896
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 83/160 (51%), Gaps = 10/160 (6%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELAT 384
+ + IF++D SGSM G+P++ AL + L L FN+V+F + LF TS++ T
Sbjct: 311 RSEFIFLLDRSGSMSGQPIQKACEALILFLKSLPIDSYFNVVSFGSSHEKLFQTSIKYDT 370
Query: 385 KEAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
++E+A + I N+ A G T I PL ++V T G IFL+TDG VE +++
Sbjct: 371 N-SLEKAIKIIK-NYTADLGGTEIYKPL-QSVFKETKIDGYNKQIFLLTDGEVESPKEVV 427
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+L + RI + G GS + Y + A+ +G
Sbjct: 428 -----KLIKKNNKYNRINSIGFGSGADKYLIEESAIAGKG 462
>gi|427730531|ref|YP_007076768.1| hypothetical protein Nos7524_3376 [Nostoc sp. PCC 7524]
gi|427366450|gb|AFY49171.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Nostoc sp. PCC 7524]
Length = 427
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 82/195 (42%), Gaps = 19/195 (9%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM GKPL+ A+ L +L PGD ++VAF G + + E+ E+++
Sbjct: 46 LILDQSGSMHGKPLKTVIEAVERLLDRLQPGDRLSVVAFAGSAQVIIPNQEVLDPESIKA 105
Query: 391 AHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVE------------ 437
I A G T I L + E++ +RG++ FL+TDG E
Sbjct: 106 Q---IRKKLTASGGTTIAEGLQLGITELMKGTRGAVSQAFLLTDGYGESSLKILRWEIGT 162
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
D+ + C + + I + T G G N L +A G + Q
Sbjct: 163 DDSKRCLELAKK---AAKINLTLNTLGFGDRWNQDLLEKIADAGGGTLTHIERPEQATHQ 219
Query: 498 MQKLFTRGFSSVLAN 512
+LFTR S L N
Sbjct: 220 FNRLFTRVQSIGLTN 234
>gi|222615393|gb|EEE51525.1| hypothetical protein OsJ_32720 [Oryza sativa Japonica Group]
Length = 53
Score = 66.6 bits (161), Expect = 5e-08, Method: Composition-based stats.
Identities = 29/51 (56%), Positives = 35/51 (68%), Gaps = 6/51 (11%)
Query: 36 MDKSGA------VDMLLPKSPMVYAVITDPKIVDNPDIPSYQPYVHGRCDP 80
MD+ G +LLP +PM YAV+TDP VD PD+PSYQPYV+GR DP
Sbjct: 1 MDRGGGGGDASVAALLLPSAPMAYAVVTDPGTVDTPDMPSYQPYVYGRLDP 51
>gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1]
Length = 471
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 80/167 (47%), Gaps = 15/167 (8%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE--TYLFSTSMELATK 385
D++ ++D+SGSM G+ L+ ++ L + L D F +V F+ + T M A K
Sbjct: 59 DLVAVIDVSGSMAGQKLKMVQSTLEFLMRNLKDTDRFALVTFDSDVKTVFDLRPMTTAHK 118
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN---SRGSIPIIFLVTDG----AVED 438
EA Q AG TN+ L + VE++ ++G++ I L+TDG V D
Sbjct: 119 EACLADVQ----KLRAGSCTNLSGGLFRGVELMQQRGATKGAVSSILLMTDGIANEGVRD 174
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+ +C A++ + G + IYTFG G N LR L+ G Y
Sbjct: 175 KDDMCRALRGLM--GPAPDYTIYTFGYGKDHNENMLRQLSETGNGMY 219
>gi|405968700|gb|EKC33746.1| Poly [ADP-ribose] polymerase 4 [Crassostrea gigas]
Length = 3371
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 120/264 (45%), Gaps = 22/264 (8%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+V+F++D+S SM+G D++ L +AL L FNI+ F G ++ LF + TK
Sbjct: 870 EVVFLMDLSNSMKGDAFRDSRKVLLLALENLPEKACFNIMTF-GSSFRELFQFPHRI-TK 927
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+ A ++I G +++ +PL K++ +L+ + GS IFL++DG V +E
Sbjct: 928 NSTNEAMKYIERANPVMGCSDVLSPL-KSLYLLSPAEGSRS-IFLISDGFVGNEGLFETV 985
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRG 505
K N R++ G+GS + + LR LA Y G Y E K +
Sbjct: 986 AKHSSKN------RLFCMGVGSNRDSHTLRSLA----NYGGGTY-----EYFNSKTKSNW 1030
Query: 506 FSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSN 565
S V T K LD P +IP L S + L+V G + LKAK +
Sbjct: 1031 QSKVQVEWGPLTDK-LDPPLQAPCKIPSLFSANRLVVYGFVKNCTQAILKAKVDEQEFET 1089
Query: 566 FVVELKLQLAKDIPLDRICAKQQI 589
V +L L + L ++ A+ I
Sbjct: 1090 IVSTTELNLTEGKMLHQLAARAVI 1113
>gi|449272883|gb|EMC82586.1| Inter-alpha-trypsin inhibitor heavy chain H2, partial [Columba
livia]
Length = 882
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 94/205 (45%), Gaps = 24/205 (11%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ LS+L D F+++ FN + ++ AT
Sbjct: 244 KNILFVIDVSGSMWGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRCWRDNLVSATPS 303
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG----- 434
VE A ++I G TNI L +A +L ++ S+ +I LV+DG
Sbjct: 304 QVEDAKKYIQ-TIQPNGGTNINEALLRATFILNEAQNLGMLDPNSVSMIVLVSDGDPTVG 362
Query: 435 ---AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDL 491
++ + ++K + ++ GIG ++ FL+ +A +RG +
Sbjct: 363 ELKLTTIQKNVKQSIKDEFS--------LFCLGIGFDVDYDFLQRIATDNRGMAQRIFGN 414
Query: 492 DSIEIQMQKLFTRGFSSVLANIAID 516
QM+ + + + +L I +
Sbjct: 415 QETSAQMKNFYNQVSTPLLKKIQFN 439
>gi|348510538|ref|XP_003442802.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Oreochromis niloticus]
Length = 1401
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 80/354 (22%), Positives = 155/354 (43%), Gaps = 38/354 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+V++++D SGSM G +E T+ ++ L L D F I+ F + + + ATK
Sbjct: 805 KNVVYVIDTSGSMHGAKIEQTRESMVAILQDLHEEDHFGILLFERKISYWKDYLTKATKR 864
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG----A 435
V A ++ GSTNI + + AV +L S+ +IFL+TDG
Sbjct: 865 NVSDAIDYVK-KIQDSGSTNINSAILDAVSLLKKERKEKKVPERSMDMIFLLTDGMPNSG 923
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V + I + S + GGS+ ++ G G+ ++ FL ++ ++G +D
Sbjct: 924 VTSPQLIQQNVFSAI--GGSMS--LFCLGFGNDVDYSFLDVMCRQNKGVARRIFDGSDAA 979
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLK 555
IQ++ + S +L + + ++ D + + L + S ++V+G+ + D
Sbjct: 980 IQLKGFYEEVSSPLLLEVDLRYSENTD--FLTKTHYSQLFNGSEIVVAGQLKENDMDNFS 1037
Query: 556 AKGFL-GDLSNFVVELKLQLAK-------------DIPLDRICA----KQQIDLLTAQAW 597
+ F G NFV + K + D ++R+ A +QQ++ +A +
Sbjct: 1038 VEVFAHGSEENFVAQGKASVTDWQTVYPEKGYIFGDF-IERMWAYLTIQQQLE-NSALST 1095
Query: 598 FSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQ 651
E +E K +S++ + T +++ + + N A+ S K ++ Q
Sbjct: 1096 KEERGNIEAKALNLSLKYNFVTPLTSLVVTKPENENGANRSLIADKMTESQRQQ 1149
>gi|242034233|ref|XP_002464511.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor]
gi|241918365|gb|EER91509.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor]
Length = 584
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 94/203 (46%), Gaps = 23/203 (11%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET------YLFSTSME 381
D++ ++D+SGSMQG L K A+ + L P D +IV+F+ + S +
Sbjct: 149 DLVTVLDVSGSMQGSKLALLKQAMGFVIDNLGPADRLSIVSFSNDASREIRLTRMSGDGK 208
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDE 439
+ KEAVE + +A GSTNI L A E+L + R ++ + L++DG
Sbjct: 209 ASAKEAVE--------SLVADGSTNISRGLLVASEVLADRRYRNAVTSVILLSDGQDNQS 260
Query: 440 ---RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
R + + + S I+TFG GS + + +A ++RG + +L I+
Sbjct: 261 GVGRNHQNLVPPLFRDADSRPGSIHTFGFGSDHDAAAMHAIAEVARGTFSFVENLAVIQD 320
Query: 497 QMQKLFTRGFSSVLAN---IAID 516
+ G SV+A IA+D
Sbjct: 321 SFAQCIG-GLLSVVAQNARIAVD 342
>gi|322435085|ref|YP_004217297.1| von Willebrand factor A [Granulicella tundricola MP5ACTX9]
gi|321162812|gb|ADW68517.1| von Willebrand factor type A [Granulicella tundricola MP5ACTX9]
Length = 794
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 73/155 (47%), Gaps = 7/155 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
++VI ++D S SMQ LE + AL L L P D F+++ FN + F AT E
Sbjct: 302 RNVILLLDTSLSMQWDKLERSYAALETVLRSLQPTDRFSLMLFNQDLSWFRPDPTQATPE 361
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD-- 444
+V+ A Q+I + + GG T++ L A LT ++ ++L TDG + I D
Sbjct: 362 SVQEALQFIRASRLRGG-TDLGKALAAA---LTQAKSPNQSLYLFTDGNSDRGVTILDNK 417
Query: 445 -AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
A S PR F IG N LR+LA
Sbjct: 418 IAAAYTQQWNHSTHPRTNVFAIGDDANLPLLRLLA 452
>gi|390342129|ref|XP_003725596.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like
[Strongylocentrotus purpuratus]
Length = 393
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 107/253 (42%), Gaps = 34/253 (13%)
Query: 277 IDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDIS 336
I +D +T + SHI Q H F + P + K V+F++D+S
Sbjct: 162 IKYDVEHTPAGSHI------QVQDNH-------FVHFFSPSELNA---LNKQVVFVIDVS 205
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE--AVERAHQW 394
SM G L TK AL L +L+P D F+I+ F+ + S LA +E A +
Sbjct: 206 ASMYGNKLSQTKEALKTMLDRLNPRDYFSIITFSDGVRYWKGSSRLAPAHPRYIEEAKAY 265
Query: 395 IGINFIAGGSTNICAPLTKAVEMLT-----NSRGS--IPIIFLVTDG----AVEDERQIC 443
+ TN+ + +A E+L N G + +I ++TDG V D ++I
Sbjct: 266 VD-GLRDDSETNLNEAILRADELLDSETVYNRPGDEYLSMIIVLTDGIPSVGVTDPQEIL 324
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
D + + S +YT G G + L LA + G Y+ S Q++ +
Sbjct: 325 DNAREAIAGEHS----LYTLGFGRLADFDLLVKLAYENDGIARMIYEDASAAEQLRDFYL 380
Query: 504 RGFSSVLANIAID 516
+ +L +++I+
Sbjct: 381 ELYRPLLFDVSIE 393
>gi|328866384|gb|EGG14768.1| Mast cell surface antigen-1 [Dictyostelium fasciculatum]
Length = 925
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 125/291 (42%), Gaps = 25/291 (8%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
+ ++I+++D SGSM G + ++AL + + L FNI F G ++ LF +S+E
Sbjct: 293 RSEIIYLLDCSGSMSGGSINSARSALQLLMRSLTENCKFNIYLF-GSSFKSLFPSSVEY- 350
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
E + A ++I G T + P+ + + +S+ +F++TDG + ++
Sbjct: 351 NDETLAEASRYIQTIDANLGGTELLPPVKSILALPYDSKYPRQ-LFILTDGEISQRDEMI 409
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
D + S RI+T GIG + + L+ GYYG + +E ++ +L +
Sbjct: 410 DYVGKE-----SNTTRIFTLGIGGGVDKELVVGLSKACMGYYGFIKENKDMETEVMRLLS 464
Query: 504 RGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS----GRYQGKFPDTLKAKGF 559
+AN+ + DLD + P + E +I + Q P + G
Sbjct: 465 IAMEPTVANVKVH-WGDLDVVQAPKVIRPIFNHERMMIYALLNKMPTQSSCPIKITCDGP 523
Query: 560 LGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSK 610
G +L+ D+ +D AK + L F K+LEE+ K
Sbjct: 524 QGT----------ELSFDLEIDFTAAKHDTNTLHTLTAFEIIKQLEEEERK 564
>gi|168065273|ref|XP_001784578.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663855|gb|EDQ50597.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1185
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 90/423 (21%), Positives = 176/423 (41%), Gaps = 40/423 (9%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSY--HTQLIALGENDGAGKSASVETGSFLK---PNI 176
D P+ + +++G EAE++G+ Q A + + ++ + S L+ P++
Sbjct: 242 DAVYTFPLYEGAAVVGFEAEVNGRKIVGKVQEKAAAKKEYEEAKSAGKLASLLEQESPDV 301
Query: 177 FTLTLPQIDGGSYLSIRLR-WSQ-KLSYRDGEFSVNVPFKF-PEYVTPA----IKKIPKR 229
F ++ I + + IR+ WS+ K + + +P P Y PA ++ P
Sbjct: 302 FQTSIGNIPASTAIIIRITLWSEMKHDAEENQVRFVLPTVIAPRYGWPAPEVNLELPPPF 361
Query: 230 EKIHLNVNAGTGTEVLCNTSSHHLKQ--LRRDVGKL--GYSYESEVLKW-----SNIDFD 280
+ + +N++ + S H + L G+ S+E + S ++ D
Sbjct: 362 KLLSVNMSCAMSKAITSIKSPSHTIEVHLGNSSGETPTAESFEPNRARVTLTSDSFLEKD 421
Query: 281 FSYTVSPSHIFGGVLLQSP----SLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDIS 336
F V + L P H D + LP A R + ++IF+VD S
Sbjct: 422 FVLVVQ------ALGLNEPCALVERHPRDGTHAITLTFLPRFAL-RPMSSSELIFVVDRS 474
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDS-FNIVAF-NGETYLFSTSMELATKEAVERAHQW 394
GSMQG P++ AL + L + D FNI+ F + LF S +E + + ++
Sbjct: 475 GSMQGTPIKQAGQALELFLRSIPCEDHYFNIIGFGDNHKTLFPKSTPY-NEETLTKGLRY 533
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVEDERQICDAMK-SRLTN 452
G T + + E+ + R +P IFL+TDG + D + + ++ ++
Sbjct: 534 AQALEADMGGTEMMSAFE---EIFEHRRRDVPTQIFLLTDGEIWDVDSLIECIRDAKKEE 590
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLAN 512
R+++ GIGS +H+ + + + GY + + +E ++ + + N
Sbjct: 591 KSDNFVRVFSLGIGSNVSHHLVESVGRAADGYAQLIVEGERMEKKVINMLKSALVPAVTN 650
Query: 513 IAI 515
+A+
Sbjct: 651 VAV 653
>gi|197102664|ref|NP_001125927.1| poly [ADP-ribose] polymerase 4 [Pongo abelii]
gi|55729699|emb|CAH91578.1| hypothetical protein [Pongo abelii]
Length = 1720
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 128/563 (22%), Positives = 228/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTKGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKIAYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + L + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIELIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITNNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDGPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|218670644|ref|ZP_03520315.1| hypothetical protein RetlG_02834 [Rhizobium etli GR56]
Length = 392
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 115/271 (42%), Gaps = 58/271 (21%)
Query: 138 VEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWS 197
+E +I + ++ + +G K+A +E +PNIFT + I G + +++ +
Sbjct: 138 IEGQIKPRQEAREIYEQAKAEGK-KTALLEQ---QRPNIFTNQVANIGPGEEIVVQIEYQ 193
Query: 198 QKLSYRDGEFSVNVPFKFPEYVTPAIKKIP---------------------KREKIH--- 233
Q + GEFS+ +FP V P P REKI
Sbjct: 194 QTVRQSGGEFSL----RFPMVVAPRYNPAPIVQTVEFNNGAGFATPSDPVENREKIEAPV 249
Query: 234 -------------LNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI--- 277
L V+ G + TSS H D+G+ G + LK ++
Sbjct: 250 LDPRENAKINPVSLTVDLKAGFPLGDVTSSFH----GVDIGQDGDQVRTISLKGDSVPAD 305
Query: 278 -DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF-KKDVIFIVDI 335
DF+ ++ +P L + +D + ++ P TA K++V+F++D
Sbjct: 306 KDFELTWKAAPGKTPSAGLFR----EVIDGKTYLLAFVTPPTAPDAAAPAKREVVFVIDN 361
Query: 336 SGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
SGSM G+ +E + +LA+A+SKL+P D FN+
Sbjct: 362 SGSMSGQSIEQARQSLALAISKLNPDDRFNV 392
>gi|440799217|gb|ELR20276.1| hypothetical protein ACA1_320100 [Acanthamoeba castellanii str.
Neff]
Length = 393
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 79/158 (50%), Gaps = 9/158 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGD-SFNIVAF-NGETYLFSTSMELATK 385
+++F++D SGSM G +E +NAL + L L G SFNIV F + LF +S+
Sbjct: 129 EIVFLLDRSGSMSGWSIEQARNALMLFLKSLPGGHVSFNIVGFGSSHVTLFGSSVAY-DD 187
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
++++A Q + G T I PL + + + G + L+TDG V + Q+
Sbjct: 188 TSLQKAQQHVLSLDADLGGTEILRPLEE-IFRVPEKPGVPRQVILLTDGEVSNTEQVIQF 246
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+K N R++T GIG+ +H ++ +A RG
Sbjct: 247 VKQHSKNT-----RVFTLGIGAGASHRLIKGMAHAGRG 279
>gi|168272920|dbj|BAG10299.1| poly [ADP-ribose] polymerase 4 [synthetic construct]
Length = 1724
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 127/563 (22%), Positives = 228/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYLEAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKEIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D+ E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDVPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|169844512|ref|XP_001828977.1| von Willebrand domain-containing protein [Coprinopsis cinerea
okayama7#130]
gi|116510089|gb|EAU92984.1| von Willebrand domain-containing protein [Coprinopsis cinerea
okayama7#130]
Length = 885
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 97/208 (46%), Gaps = 5/208 (2%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS-FNIVA 368
F + ++P T K + ++ IFIVD SGSM+G ++ K++L + L + +S FNI A
Sbjct: 259 FQLTMVPRT-KLPPIRSQEYIFIVDCSGSMEGPRIQTAKDSLVMMLQMIPSHNSIFNIFA 317
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPII 428
F E + + + + +E A ++ G T + L V + + S+G ++
Sbjct: 318 FGNECKSWVAHSQNYSGKTLEEAIRYTESMQADLGGTEMRNALR--VALSSCSKGLPTVV 375
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
FL+TDG D C +G + RI+T GIG + +A + G
Sbjct: 376 FLLTDGGCFDVDG-CKQEVKGFVDGCTAPKRIFTLGIGESASSDLCESIARVGNGESFMV 434
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAID 516
D S+ + KLFT G + + ++ ID
Sbjct: 435 IDTSSVVQKCAKLFTAGRTPFVRDVVID 462
>gi|27529698|dbj|BAA11494.2| KIAA0177 protein [Homo sapiens]
Length = 1725
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 127/563 (22%), Positives = 228/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E A + G++L P++
Sbjct: 655 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYLEAVTQGHGAYLMSQDAPDV 714
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 715 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 770
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 771 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 829
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 830 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAN-----ESEV 878
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + + A
Sbjct: 879 IICLDCSSSMEGVTFLQAKEIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTA- 937
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 938 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 988
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 989 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1048
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D+ E P+++P L L+V G TL A +
Sbjct: 1049 SPSCHSVSVKWQQLNPDVPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1108
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1109 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1168
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1169 LITQFTSFVAVEKRDEN---ESP 1188
>gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens]
Length = 460
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 120/280 (42%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 63 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGF 119
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ + + T +++ +I + G T+I L +A+ +L
Sbjct: 120 SNRIKVRKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHSGIGD 178
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 179 RSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSLE 236
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G ++ + Q+ + + +L++I ID + + P+ + S +
Sbjct: 237 NCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRID-YPPSSVVQATKTLFPNYFNGSEI 295
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
I++G+ + D L + + F++ L D+P+
Sbjct: 296 IIAGKLVDRKLDHLHVEVTASNSKKFII-----LKTDVPV 330
>gi|440297578|gb|ELP90240.1| hypothetical protein EIN_394270 [Entamoeba invadens IP1]
Length = 445
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 141/317 (44%), Gaps = 51/317 (16%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFS-------TS 379
DVIFI D SGSM GK +E K+AL + L +L +F+I++F + LFS TS
Sbjct: 10 DVIFICDRSGSMCGKGIESLKSALQLFLRQLPTDSTFDIISFGSQFDSLFSGLKKYDETS 69
Query: 380 MELATKEAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
+ A+K AVE NF + G TNI PL A+ + I L+TDG V
Sbjct: 70 FDFASK-AVE--------NFESNYGGTNILEPLKSAMAQKSQ-------IILLTDGQVRQ 113
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
++I + +K N G ++ G+G+ + ++ + + G S+ ++
Sbjct: 114 NKKI-EILKYLEDNRGKAI--VHCIGLGNGVDSDLIKDIGNVGGGI--------SVVVRD 162
Query: 499 QKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
K R S + + + +K+ + + D SE + + + F DT
Sbjct: 163 TKNLRRELSKITNKVLLPAIKN-GKVTLTTGVFEDGKSEEVVGIFNKETFAFFDTTSDSK 221
Query: 559 FLGDLSNFVVELKLQLA-------KDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKM 611
G L+ V E K+ L KD L + A +++ +L E K +EK K+
Sbjct: 222 VEGTLTGNVGEKKVTLTSTEKVRIKDTILGQFKAFEKLKVL-------ESKEEKEKCVKL 274
Query: 612 SVQTGVLCEYTRMIIVE 628
S++ VL +YT + ++
Sbjct: 275 SLKYNVLSKYTTFVAID 291
>gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona
intestinalis]
Length = 1586
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 104/237 (43%), Gaps = 18/237 (7%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA--- 383
K V+F++D+SGSM G ++ + A+ L L D F+++AFN +S S A
Sbjct: 946 KRVVFLIDVSGSMFGIKIDQVRQAMNTILHGLAETDFFSVIAFNSSVSRWSPSGTAAVLA 1005
Query: 384 --TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML-------TNSRGSIPIIFLVTDG 434
T + A ++ + G T+I + A+++ TN+ + L+TDG
Sbjct: 1006 SGTTANINSAMNFLNTTVVTRGGTDILQAVEAAIQLFDSAATGGTNTASD--FMVLLTDG 1063
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAY-DLDS 493
D A+ S + N I T G G+ + LR +A + G + DL+S
Sbjct: 1064 RPTDGTVSSTAIISAIRNLNRGRFGINTIGFGTLVDMNLLRKIAAQNSGTSIQIFIDLNS 1123
Query: 494 IEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF 550
Q+ + +L+N + +++D F Y + P L + LI +G + +F
Sbjct: 1124 YA-QISNFYEEISQPILSNTTMTYEQEVDHF--YYQQFPALMQDGELITAGVIEARF 1177
>gi|113931618|ref|NP_001039260.1| inter-alpha-trypsin inhibitor heavy chain 2 precursor [Xenopus
(Silurana) tropicalis]
gi|72679293|gb|AAI00666.1| hypothetical protein MGC107982 [Xenopus (Silurana) tropicalis]
Length = 942
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 94/198 (47%), Gaps = 10/198 (5%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L P D F+I+ FN + + A+
Sbjct: 304 KNILFVIDVSGSMWGLKMKQTVEAMKSILEDLRPDDQFSIIDFNHNIRCWKDELVYASSV 363
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDG-AVED 438
+ A +++ + G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 364 EKQDASKYVQ-SIQPNGGTNINEALLRAIFILKEASNKGMLEPNSVSLIVLVSDGDPTVG 422
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
E ++ K+ TN +++ GIG ++ FL LA + G Y Q+
Sbjct: 423 EIKLSKIQKNVRTNIRDDFA-LHSLGIGFDVDYDFLERLAQENHGMAQRIYGKQDTAAQL 481
Query: 499 QKLFTRGFSSVLANIAID 516
++ + + + +L NI ++
Sbjct: 482 KEFYKKVSTPLLKNIIVN 499
>gi|336377705|gb|EGO18866.1| hypothetical protein SERLADRAFT_412065 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1151
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 122/498 (24%), Positives = 208/498 (41%), Gaps = 71/498 (14%)
Query: 69 SYQPYVHGRCDPPSLI--PLQMNAVE---LDVDC--YLDTAFVRVSG--TWRVHCV--MG 117
SY VH D L+ PL+ VE +DV L FV S T R V +
Sbjct: 3 SYAGIVHRPADQQGLVHLPLEQVIVEAWIVDVSARVQLSQLFVNPSDAPTGRAKYVFPIP 62
Query: 118 SKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVETGSFLKPNIF 177
+++ C + D I G E S ++ T A+ + AG VE ++ +IF
Sbjct: 63 ARAAVCAFELKHADGKIIRGTAKEKS-EANQTFNSAIQAGNMAGL---VE---WVTDDIF 115
Query: 178 TLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIP---------- 227
T+++ I ++ +L + L D +V + P + ++P
Sbjct: 116 TISVGSIPAKETVTSKLTFVMDL--MDDGIRDHVRLQLPVSIGMRYGEVPAAMVNAQAAA 173
Query: 228 KREKIHLNVNAGTGTEVL-CNTSSHHLKQLRRD--VGKLGYSYESEVLKWSNIDF---DF 281
++ +IH++V+ E+ + SH +K LR +GK +S + WS+ F DF
Sbjct: 174 EQTRIHISVDVQMSEEIYDIRSPSHEIKVLRYKTRIGKKSMRRKSAL--WSSPTFMERDF 231
Query: 282 SYTVSPSHIFGGVLLQSPSLHDVD----QREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
TV H G L + ++D + L+P K +V ++ IF+VD SG
Sbjct: 232 VLTV---HSHG--LDNARCFAEIDPMGSNSVAMQLTLVP-RFKMPRVSSQEYIFVVDRSG 285
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDS--FNIVAFNGETYLFSTSMELATKEAVERAHQ-W 394
SM G +E K+ L++ L +L P D+ FNI +F + L + A+ +A + W
Sbjct: 286 SMGGGRIETAKSTLSMLL-RLLPNDNTLFNIFSFGSTVDGLWQNSVLYNQNALSQAVRIW 344
Query: 395 I------GINFIAG----------GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
G+ F+ G T I LT A+ R ++ F++TDG D
Sbjct: 345 YNPLDLSGLTFLKTSHIRSMDANYGGTEIANALTGALTSRRRDRSTVA--FILTDGESSD 402
Query: 439 ERQICDAMKSRLTNGGSICP-RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
+ +++ L+ CP R++T GIG + +A G A +SI +
Sbjct: 403 LHKTFSVVRNALSAANPNCPLRVFTLGIGEQVSTAMCEGIARAGNGECLLAVTTESITAK 462
Query: 498 MQKLFTRGFSSVLANIAI 515
+L G + ++ +I++
Sbjct: 463 CAQLLRAGRTRLIEHISV 480
>gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Bermanella marisrubri]
gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65]
Length = 731
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 22/331 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
M L PG D ++DISGSMQGK + + L + +P D +V F
Sbjct: 334 LMMTLTPGDDLQPIQRGTDWTLLLDISGSMQGK-FQTLIEGVKKGLKRFNPQDRVRVVLF 392
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIF 429
N + AT++ + + + + + G T++ + A+ L R S I+
Sbjct: 393 NDYASNLTGGFLPATQKNIAEIIRKLDL-VLPNGGTHLMDGVRFALSGLDADRTSA--IW 449
Query: 430 LVTDGAVE----DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
LVTDG +R+ D +K + R++TF +G+ N L+ + S G+
Sbjct: 450 LVTDGVTNVGETKQRKFVDLLKQKDI-------RVFTFIMGNGANRPLLKAITKASNGFA 502
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
+ D I Q++K ++ L +I + K L+ ++ P+ I L + L +
Sbjct: 503 INVSNSDDIIGQLEKAASKVTHEALRDIKV-RFKGLNVKDVEPTEITSLYRGAQLQLFAH 561
Query: 546 Y--QGKFPDTLKAKGFLGDLSNFVVELKL-QLAKDIP-LDRICAKQQID-LLTAQAWFSE 600
Y GK + AK G + V L ++ + P L+R+ A QI LL+ Q F E
Sbjct: 562 YWQAGKGIIEVTAKRN-GTPVRYEVPFTLPEVDTEYPELERMWAFSQIQRLLSEQEDFGE 620
Query: 601 DKRLEEKVSKMSVQTGVLCEYTRMIIVETDE 631
++ V ++ + G++ YT M+++E +
Sbjct: 621 SADRKDAVLSLAKEYGLVTPYTNMLVLEEQQ 651
>gi|119606784|gb|EAW86378.1| inter-alpha (globulin) inhibitor H2, isoform CRA_b [Homo sapiens]
gi|124376332|gb|AAI32686.1| Inter-alpha (globulin) inhibitor H2 [Homo sapiens]
gi|158256194|dbj|BAF84068.1| unnamed protein product [Homo sapiens]
Length = 946
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|332841041|ref|XP_001153521.2| PREDICTED: poly [ADP-ribose] polymerase 4 isoform 3 [Pan troglodytes]
gi|410211052|gb|JAA02745.1| poly (ADP-ribose) polymerase family, member 4 [Pan troglodytes]
Length = 1724
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYVFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLNVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K LS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHVLSLVGEKQKVNIIQFGTGYKELFSYPKHMTSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|332217052|ref|XP_003257667.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Nomascus
leucogenys]
Length = 946
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDRFSVIDFNQNVRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|70778918|ref|NP_002207.2| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Homo
sapiens]
gi|229462889|sp|P19823.2|ITIH2_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2;
Short=ITI heavy chain H2; Short=ITI-HC2;
Short=Inter-alpha-inhibitor heavy chain 2; AltName:
Full=Inter-alpha-trypsin inhibitor complex component II;
AltName: Full=Serum-derived hyaluronan-associated
protein; Short=SHAP; Flags: Precursor
Length = 946
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|390346545|ref|XP_787130.3| PREDICTED: uncharacterized protein LOC582064 [Strongylocentrotus
purpuratus]
Length = 2399
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 68/263 (25%), Positives = 116/263 (44%), Gaps = 44/263 (16%)
Query: 278 DFDFSYTVSPSHIFGGV-LLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDIS 336
DF Y V+P GGV +QS + F P +S K V+F++D S
Sbjct: 329 DFIIRYQVAPR---GGVGWVQSDGSN-------FIHSFAP---RSLAPVNKRVVFVLDFS 375
Query: 337 GSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF-STSMELATKEAVERAHQWI 395
SM + L TK A+ L ++ D FNI+ F+ + Y + +M A+ +++ A +I
Sbjct: 376 ASMYPRKLRQTKAAMDAILDQMSENDQFNIMPFSDDIYWWKENAMIPASPDSITEAKNYI 435
Query: 396 G-INFIAGGSTNICAPLTKAVEMLTNSRGSIP---------IIFLVTDGAVED------- 438
N+ G TN+ + + +L + GS+ I+FL+TDG
Sbjct: 436 KRFNYKRG--TNLNDAMLEGFGLL-EATGSMDSSSENPMVCILFLLTDGKPSTGTTDINV 492
Query: 439 -ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
ER + A + R + + T G G + FL LA+ +RG Y+ S +Q
Sbjct: 493 IERNVRAANRKRCS--------LVTLGFGKLVDFNFLGRLALQNRGIARKIYETSSATVQ 544
Query: 498 MQKLFTRGFSSVLANIAIDTLKD 520
+Q ++ + +L +I ++ L +
Sbjct: 545 LQGVYNEVATPLLFDIVVEYLNN 567
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 118/273 (43%), Gaps = 35/273 (12%)
Query: 270 EVLKWSNIDFDFSYTVSPSH--------IFGGV-----LLQSPS-LHDVDQREMFCMYLL 315
++++ SN + F Y +P I G V L+ +P+ H Q + F +
Sbjct: 1114 DLVRESNSRYKFRYMPTPDEQSDFSEKGIDGDVVLTYNLMDAPTGSHTQVQNDYFVHFFS 1173
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
P V K ++F++D+S SM G L TK AL L L+P D FNI+ F+
Sbjct: 1174 P---IGLDVISKQIVFVIDVSASMYGTKLSQTKEALKTMLDNLNPTDYFNIITFSDGVQY 1230
Query: 376 FSTSMEL--ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML-TNSRGSIP------ 426
+ + L A + ++ A ++ + TN+ + KA E+L + +R + P
Sbjct: 1231 WRENNRLAPAQRRYMDDAMAYVD-SLRDDSETNLNEAIVKAGELLDSEARYNRPGDSVYS 1289
Query: 427 IIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
++ L+TDG D+++I D + + S+ G G + L LA +
Sbjct: 1290 MMILLTDGRPSVGTTDQQEILDNAREVIAGKHSLN----ILGFGRLVDFDLLVKLAYENN 1345
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
G Y+ + Q+++ + + +L ++ +
Sbjct: 1346 GTAKMIYEGTTAAEQLREFYFELYRPLLFDVTM 1378
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 80/177 (45%), Gaps = 26/177 (14%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMEL 382
V +K+VIFI+D+SGSM G L K+AL L+ + D FNI+ F+ + +L M
Sbjct: 1804 VLRKNVIFIIDVSGSMAGVKLRQVKDALTTILNDMPETDKFNIIPFSDDVNFLDRNKMLF 1863
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--------RGSIPIIFLVTDG 434
+T V RA +++ + +TN+ + V ML + + ++ +++DG
Sbjct: 1864 STSSNVRRAKRFVK-SLQERDNTNLHKAIIAGVRMLRDESDQNVRPDENVVSMLIVLSDG 1922
Query: 435 AVED--------ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
ER + +A++ + ++ G G + FL +A + G
Sbjct: 1923 NPNHGEIDKEIIERNVEEAIRGDFS--------LFNLGFGEDLDFPFLERMAYQNHG 1971
>gi|118370542|ref|XP_001018472.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89300239|gb|EAR98227.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 901
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 85/187 (45%), Gaps = 30/187 (16%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+ + IF++D SGSM+GKPL AL + L L P FNIV+F G + K
Sbjct: 320 RSEYIFLIDRSGSMRGKPLTKALEALQLFLQSLPPDSYFNIVSF-GSNF----------K 368
Query: 386 EAVERAHQW------IGINFIAGGS-----TNICAPLTKAVEMLTNSRGSIPIIFLVTDG 434
+ ER+ ++ N I S T+I +PL N RG IF++TDG
Sbjct: 369 KLYERSQKYNSQTLKFACNKIKDYSADMNGTDILSPLNNIFYYGQNIRGYNRQIFVLTDG 428
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA---YDL 491
AV++ + + +K R++ G GS + ++ A+ +G + DL
Sbjct: 429 AVQNRQSVVREIKRNNKKN-----RVHFIGFGSSADKILIQESAIAGKGIHEMVQFEQDL 483
Query: 492 DSIEIQM 498
SI I++
Sbjct: 484 SSIVIKI 490
>gi|330805542|ref|XP_003290740.1| hypothetical protein DICPUDRAFT_155275 [Dictyostelium purpureum]
gi|325079126|gb|EGC32742.1| hypothetical protein DICPUDRAFT_155275 [Dictyostelium purpureum]
Length = 2350
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 95/406 (23%), Positives = 179/406 (44%), Gaps = 64/406 (15%)
Query: 118 SKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK----SASVETGSFL- 172
S + + + P+ + G++ G EA I+GK + + E + A + + + G++L
Sbjct: 732 SSTIEAKYVFPLDEMGAVCGFEAFINGKHI---IGEVKEKEKAHREYREAVAAGHGAYLM 788
Query: 173 ---KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIK----- 224
KP++FT+++ + + I++ + +LS DG +++ F P +TP +
Sbjct: 789 DEDKPDVFTVSVGNLPPKCDVLIKITYVTELSV-DG---LDISFVLPRSITPRQRLSSGS 844
Query: 225 ----------KIPKREKIHLNVNAGTGTEVLCN----TSSHHLKQLRRDVGKLGYSYESE 270
I +++ ++ G ++ N TS H +++R K + E +
Sbjct: 845 NVTQNVTKTVSISDQDQKDSDLTIQVGIDMPYNIVKLTSPTHNIKIKRTHTKA--TIELQ 902
Query: 271 VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLH-DVDQR----EMFCMYLLPGTAKSRKVF 325
S +D +F + + SP + +VD + M Y +++ F
Sbjct: 903 PGDKSYLDKNFQLLIGLEEPY------SPRMWVEVDDKGHHASMLAFYPKLDIDSTQESF 956
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLD--PGDSFNIVAFNGETY--LFSTSME 381
+ ++D+S SM G ED A+ + + L P FNI F G+ Y LF +
Sbjct: 957 TH-LTLLIDLSASMAGDTFEDLLRAVRLTVRNLRGVPKLLFNICDF-GDNYDWLFVEDV- 1013
Query: 382 LATKEAVERAHQWIGINFIA--GGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVED 438
L T+E ++ A W IN + G T + +PL + ++ + P I L TDG V +
Sbjct: 1014 LPTEENLQIA--WNHINKLKPISGGTKLHSPLESIYLLSEKAKPTNPHNILLFTDGNVSN 1071
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
E + L S R++TFGIG +C+ +F++ ++ I GY
Sbjct: 1072 EE-----LSLMLAKKASNFCRLFTFGIGEHCSRHFIKQISRIGSGY 1112
>gi|194386850|dbj|BAG59791.1| unnamed protein product [Homo sapiens]
Length = 543
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 71 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKT 130
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 131 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 189
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 190 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 249
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 250 KFYNQVSTPLLRNVQFN 266
>gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like
[Anolis carolinensis]
Length = 955
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 125/281 (44%), Gaps = 22/281 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D FNI+ F
Sbjct: 292 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILQDLRPEDHFNIIGF 348
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------S 421
+ ++ + T + A +I N G TNI L + ++L +
Sbjct: 349 SNRIKVWQHDQLVPVTPNNIRDAKVYIH-NMSPSGGTNINGALQISTKILNDYIAQNDIE 407
Query: 422 RGSIPIIFLVTDGAVE----DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRML 477
S+ +I +TDG + +I + K + N C ++T GIG+ ++ L L
Sbjct: 408 ARSVSLIIFLTDGRPTFGEIEPAKIINNTKEAIRN--KFC--LFTIGIGNDVDYKLLERL 463
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A+ + G + + Q++ + + +L++I +D D + ++ + P+ +
Sbjct: 464 ALENCGMMRRVREEEDAAEQLKGFYYEIDTPLLSDIRVDYPTD-NVAQVTQNLFPNYFNG 522
Query: 538 SPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDI 578
S ++++G+ K + L + + +V+ LK +A D+
Sbjct: 523 SEIVIAGKLFNKTANRLHVEVTASNSDKYVL-LKTDIAIDL 562
>gi|402879594|ref|XP_003903419.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Papio
anubis]
Length = 946
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEASNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|323650214|gb|ADX97193.1| inter-alpha-trypsin inhibitor heavy chain h2 [Perca flavescens]
Length = 550
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 83/172 (48%), Gaps = 24/172 (13%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+I+ FN +S + +
Sbjct: 262 KNIVFVIDVSGSMWGVKMKQTVEAMQAILDDLTIDDHFSIIDFNHNVRCWSEELVAGSSI 321
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG-------SIPIIFLVTDG--AVE 437
+ A ++I N G TNI L +AV+ML + S+ +I LV+DG V
Sbjct: 322 QIADAKKYIQ-NIKPNGGTNINEALMRAVQMLLKASNQGLIDPRSVSMIILVSDGDPTVG 380
Query: 438 D------ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ ++ + M+ + +++ GIG ++ FL +AM +RG
Sbjct: 381 EIKLSTIQKNVKRVMREEFS--------LFSLGIGFDVDYDFLERIAMENRG 424
>gi|410336981|gb|JAA37437.1| poly (ADP-ribose) polymerase family, member 4 [Pan troglodytes]
gi|410336983|gb|JAA37438.1| poly (ADP-ribose) polymerase family, member 4 [Pan troglodytes]
Length = 1724
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLNVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K LS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHVLSLVGEKQKVNIIQFGTGYKELFSYPKHMTSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|336463979|gb|EGO52219.1| hypothetical protein NEUTE1DRAFT_105213 [Neurospora tetrasperma
FGSC 2508]
Length = 1073
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 95/202 (47%), Gaps = 10/202 (4%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
+ +++F+ D SGSM G +E K+AL + L + G FNI +F G TY LFS
Sbjct: 293 RPEIVFVCDRSGSMGGARIEGLKSALRIFLKSIPVGAKFNICSF-GSTYEFLFSDGSRSY 351
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
E++ A ++ G T + PL A E + +FL+TDG + ++ +
Sbjct: 352 DHESLRLAMDYVSRMDADLGGTEMYQPLEAAFE--KRYKDMDLEVFLLTDGEIWNQEHLF 409
Query: 444 DAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+ +++ G+I R++T GIG+ +H + A G+ D + + ++ ++
Sbjct: 410 TMINKKVSESQGAI--RLFTLGIGNDVSHALIEGAARAGNGFAQTVTDSEKMNAKVVRML 467
Query: 503 TRGFSSVLAN--IAIDTLKDLD 522
G + + + + I KD D
Sbjct: 468 KAGLTPHIKDYTLEIKYAKDSD 489
>gi|410299132|gb|JAA28166.1| poly (ADP-ribose) polymerase family, member 4 [Pan troglodytes]
Length = 1724
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLNVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K LS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHVLSLVGEKQKVNIIQFGTGYKELFSYPKHMTSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|410259980|gb|JAA17956.1| poly (ADP-ribose) polymerase family, member 4 [Pan troglodytes]
Length = 1724
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLNVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K LS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHVLSLVGEKQKVNIIQFGTGYKELFSYPKHMTSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|359318907|ref|XP_003638936.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Canis
lupus familiaris]
Length = 982
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/280 (22%), Positives = 121/280 (43%), Gaps = 21/280 (7%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 237 FVHYFAP---RDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 293
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T + V +I + G T+I L +A+++L +
Sbjct: 294 SNRIKVWKDHLVSVTPDNVRDGKIYIH-HMSPTGGTDINGALQRAIKLLNDYVAHNDIED 352
Query: 423 GSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
S+ +I +TDG V + + ++ G +C I+T GIG + L L++
Sbjct: 353 RSVSLIVFLTDGKPTVGETHTLKILNNTKEAARGQVC--IFTIGIGDDVDFKLLEKLSLE 410
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
+ G + + Q+ + + +L++I +D + E + + P+ + S +
Sbjct: 411 NCGLTRRVLEEEDAGSQLIGFYDEIRTPLLSDIRVDYPRAAVE-QTTRTLFPNYFNGSEI 469
Query: 541 IVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL 580
+++G+ + L + + FV+ L D+P+
Sbjct: 470 VIAGKLADSSAEQLHVEVTASNSKKFVI-----LKTDVPV 504
>gi|338728866|ref|XP_001915384.2| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor
H5-like [Equus caballus]
Length = 1321
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 97/222 (43%), Gaps = 14/222 (6%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + +++V+F++++ GSM G ++ TK A+ V L L D FNI++F
Sbjct: 267 FIHYFAP---RGLPPVERNVVFVIEVGGSMFGIKMKQTKKAMNVILGDLQANDYFNIISF 323
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR------ 422
+ ++ + AT + V A ++G + AGG T+I L A +L S
Sbjct: 324 SDTVSVWKAGCSIQATIQNVYDAKDYLG-HMEAGGWTDINTALLAAASVLYPSNXEPGRG 382
Query: 423 ---GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
G I +I +TDG + + S + +++ + LR L++
Sbjct: 383 PSVGRISLIIFLTDGEPTADMMTPSVILSNILQALGNRVNLFSLVFWDDADFPLLRHLSL 442
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDL 521
+ G Y + ++ L+ F +L ++ +D L DL
Sbjct: 443 ENWGAAWHIYKDTDAALHLEGLYKEIFMPLLTDVRLDYLWDL 484
>gi|410449542|ref|ZP_11303597.1| von Willebrand factor type A domain protein [Leptospira sp. Fiocruz
LV3954]
gi|410016767|gb|EKO78844.1| von Willebrand factor type A domain protein [Leptospira sp. Fiocruz
LV3954]
Length = 518
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 85/168 (50%), Gaps = 16/168 (9%)
Query: 326 KKDVIFIVDISGSMQG-KPLEDTKNALAVALSKLDPGDSFNIVAFNGE--TYLFSTSMEL 382
KK + F++D+SGSM+G LED+K AL L+++ GD F++V F+ + T++ +++L
Sbjct: 114 KKALCFVLDVSGSMEGDNKLEDSKAALIQFLTEMKVGDEFSLVVFDNDASTFISPQTIDL 173
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDG----AVE 437
+++ + + I AGG TNI A L EM S + L+TDG V
Sbjct: 174 NSRKKIISKVKDIQ----AGGGTNIEAGLILGYSEMSKFHPRSTRRLILLTDGISNVNVH 229
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+I K + G RI T G+G N LR LA +G+Y
Sbjct: 230 TPEEIAKKAKVQYLEGS----RISTIGLGYDVNQTLLRSLAENGKGHY 273
>gi|363746965|ref|XP_428421.3| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Gallus gallus]
Length = 964
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 103/228 (45%), Gaps = 12/228 (5%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATK 385
KD++F++D+SGSM G ++ TK A+ L L P D FNIVAF+ ++ + A+
Sbjct: 259 KDIVFVIDVSGSMSGTKMKQTKAAMRSILRDLRPRDRFNIVAFSEAACVWQEGGSIPASA 318
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN--SRGSIPIIFLVTDG----AVEDE 439
++ A ++I A G T+I L A +L + + ++ L+TDG V D
Sbjct: 319 SSIRSATRYID-ALQADGWTDINHALQVAAALLQHPAEEPRVALLLLLTDGEPTAGVTDG 377
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
+I + L+ ++ ++ G+ + LR LA S G + ++
Sbjct: 378 PRIVSNARRALSPSAAL---LFGLAFGADADFALLRRLAAASGGTARHIPEDADAAFRLT 434
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQ 547
+F S +L ++ + T + P P S L+V+GR +
Sbjct: 435 DVFQEIDSPLLQDVEL-TYPGTAAQLIAPKLFPGYFQGSELVVAGRLE 481
>gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 521
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 81/163 (49%), Gaps = 22/163 (13%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY----LFSTSMELA 383
D++ ++D+SGSM+G+ L+ K A+ + KL P D ++V+F+ L S +
Sbjct: 59 DLVVVLDVSGSMRGEKLQSMKRAMQFVIMKLTPVDRLSVVSFSSSATRHCPLRSVT---- 114
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEML---TNSRGSIPIIFLVTDGAVEDER 440
+A + + I +A G TNI A L A+ ++ ++ P +FL++DG
Sbjct: 115 --QAAQAELKGIVDGLVANGGTNIKAGLDTALAVIAGRATTKARTPNVFLMSDG------ 166
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
Q D +R + G++ +YTFG G +H L +A S G
Sbjct: 167 QQTDG-DARQVDPGNVA--VYTFGFGKDADHALLSDVAKKSPG 206
>gi|3694920|gb|AAC62491.1| putative poly(ADP-ribosyl) transferase [Homo sapiens]
Length = 1724
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 127/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYLEAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKEIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHVLRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|395529555|ref|XP_003766876.1| PREDICTED: poly [ADP-ribose] polymerase 4-like, partial
[Sarcophilus harrisii]
Length = 778
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 116/542 (21%), Positives = 222/542 (40%), Gaps = 51/542 (9%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSA-SVETGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A S G++L P +
Sbjct: 251 ETKYVFPLDDKAAVCGFEAFINGKHIIAEVKEKEEAHQEYREAISQGHGAYLMDQDSPGV 310
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFK---------FPEYVTPAIKKIP 227
FT+++ + + + I++ + +LS+ G + +P E + A++K+
Sbjct: 311 FTVSIGNLPPQAKVLIKITYITELSFHHGCANFFMPATVAPWQQDKALNENLQDAVEKVY 370
Query: 228 KRE-KIHLNVNAGTGTEV-----LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDF 281
+E +H + E+ L ++ +H LK + D + + ++ L F
Sbjct: 371 IKEVGVHQQFSLNISIEMPHLIELISSDTHQLKIKKTDCKAVISTLDNSTLDSKG----F 426
Query: 282 SYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQG 341
S + ++ + + + M + TA K + +VI +D S SM+G
Sbjct: 427 SLQIHIGEVYLPRMWVEKHPNKESEACMLVFHPDFSTALPEKAYTNEVIICLDCSNSMEG 486
Query: 342 KPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINF 399
L K ALS + NI+ F G +Y LFS + + ++I
Sbjct: 487 SILLQAKQIALHALSLVQEKQKVNIIKF-GTSYTELFSYPKYSTSNPDLT---EFIMSAT 542
Query: 400 IAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPR 459
G+T+ L + + +L S+G + I L++DG ++++ +K + + R
Sbjct: 543 PTMGNTDFWKTL-RFLNLLYPSQG-MRSILLLSDGHIQNKSLTLQIVKRNVRH-----TR 595
Query: 460 IYTFGIGSYCNHYFLRMLAMISRG---YYGAAYDLD---SIEIQMQKLFTRGFSSVLANI 513
++T GIGS N + LR L+ G Y+ IE QM ++F+ G SSV
Sbjct: 596 LFTCGIGSTANRHILRTLSQYGAGAFEYFNTKSKYSWKKQIEDQMTRIFSPGCSSVSVKW 655
Query: 514 AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ 573
L + + P++I L + L+V G L A + V +LQ
Sbjct: 656 QQLNLNSPEPMQA-PTQIQALFNNDRLLVYGFIPHCTQAILNALIQEREFQTMVSTTELQ 714
Query: 574 LAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTGVLCEYTRMII 626
L ++ A+ I +L E K+ L+ + K+S + ++ ++T +
Sbjct: 715 KTTGTMLHKLTARAIIRDYEDGILHEDETKHEMKKQILKSLIIKLSKENSIITQFTSFVA 774
Query: 627 VE 628
VE
Sbjct: 775 VE 776
>gi|336365136|gb|EGN93488.1| hypothetical protein SERLA73DRAFT_156401 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1182
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 140/318 (44%), Gaps = 41/318 (12%)
Query: 228 KREKIHLNVNAGTGTEVL-CNTSSHHLKQLRRD--VGKLGYSYESEVLKWSNIDF---DF 281
++ +IH++V+ E+ + SH +K LR +GK +S + WS+ F DF
Sbjct: 174 EQTRIHISVDVQMSEEIYDIRSPSHEIKVLRYKTRIGKKSMRRKSAL--WSSPTFMERDF 231
Query: 282 SYTVSPSHIFGGVLLQSPSLHDVD----QREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
TV H G L + ++D + L+P K +V ++ IF+VD SG
Sbjct: 232 VLTV---HSHG--LDNARCFAEIDPMGSNSVAMQLTLVP-RFKMPRVSSQEYIFVVDRSG 285
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDS--FNIVAFNGETYLFSTSMELATKEAVERAHQ-W 394
SM G +E K+ L++ L +L P D+ FNI +F + L + A+ +A + W
Sbjct: 286 SMGGGRIETAKSTLSMLL-RLLPNDNTLFNIFSFGSTVDGLWQNSVLYNQNALSQAVRIW 344
Query: 395 I------GINFIAG----------GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
G+ F+ G T I LT A+ R ++ F++TDG D
Sbjct: 345 YNPLDLSGLTFLKTSHIRSMDANYGGTEIANALTGALTSRRRDRSTVA--FILTDGESSD 402
Query: 439 ERQICDAMKSRLTNGGSICP-RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
+ +++ L+ CP R++T GIG + +A G A +SI +
Sbjct: 403 LHKTFSVVRNALSAANPNCPLRVFTLGIGEQVSTAMCEGIARAGNGECLLAVTTESITAK 462
Query: 498 MQKLFTRGFSSVLANIAI 515
+L G + ++ +I++
Sbjct: 463 CAQLLRAGRTRLIEHISV 480
>gi|115666324|ref|XP_798926.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like
[Strongylocentrotus purpuratus]
Length = 870
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 104/229 (45%), Gaps = 10/229 (4%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
KK V+F++D S SM G ++ TK A+ L +++ D FN++ F+ Y S M
Sbjct: 326 KKRVVFVLDFSASMYGNKIKQTKEAMYTILDEMNDSDRFNVLPFSDYVYSGWNSGQMVDV 385
Query: 384 TKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLT---NSRGSIP---IIFLVTDGAV 436
+ A +I ++ G + N ++ T NS S P I+F++TDG
Sbjct: 386 NPYNIRDAKDFIRQLDIQRGTNLNDALLGGLSLLESTGSMNSTSSNPMVCILFVLTDGKP 445
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEI 496
+ ++ + N + I T G G N+ FL LA+ +RG Y+ S
Sbjct: 446 SEGVTSLSEIERNVRNANNQRCSIVTLGFGRLVNYNFLVRLALQNRGMARKIYEDSSAAG 505
Query: 497 QMQKLFTRGFSSVLANIAIDTLKD-LDEFEMYPSRIPDLSSESPLIVSG 544
Q++ +++ + +L NI ++ L + + + P+ + S L VSG
Sbjct: 506 QLRGVYSEVATPLLFNIVVEYLNNAVRPGSTSSTTFPNYFNGSELAVSG 554
>gi|440801394|gb|ELR22414.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 1021
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 33/211 (15%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFST------- 378
+VIF+VD SGSM G + KNALA+ L L G FN++ F G Y LF T
Sbjct: 344 EVIFVVDRSGSMSGSRINQAKNALALFLHSLPIGTRFNVIGF-GSRYEKLFPTSRAYDDS 402
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVE 437
S+E+A + A GI+ GG T + APL + + + P +F++TDG V
Sbjct: 403 SLEIANRHAS-------GISADLGG-TELLAPLRDVLSSPADPQ--FPRQVFVLTDGEVG 452
Query: 438 DERQICD-----AMKSRLTNGG-------SICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+ + A +R+ G + R++T GIGS + + +A +G+
Sbjct: 453 NTEDVVSCVRKHAKGTRVFTLGIGSDASTELGTRVFTLGIGSDASTELVNGIANAGKGHA 512
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ +E ++ + R VL ID
Sbjct: 513 EFVVSGERLEAKVLRQLKRALQPVLKQPTID 543
>gi|296415385|ref|XP_002837370.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633235|emb|CAZ81561.1| unnamed protein product [Tuber melanosporum]
Length = 1101
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 81/161 (50%), Gaps = 6/161 (3%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
K ++IF+VD SGSM+ + K+AL + L L G FNI++F G Y F A
Sbjct: 339 KTEIIFLVDRSGSMRDT-IGHVKSALRIFLKSLPLGCFFNIISF-GSQYTFLWERSRAYT 396
Query: 386 EAVER-AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
E R A I G T + PL +A + G+ I ++TDGA+ ++ +
Sbjct: 397 EGTLRIASDHIETFHANYGGTELLPPLVQACSRRHSEPGTRTEIMVLTDGAIWRLPELLE 456
Query: 445 AMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
++ R +GG++ R+++ G+G +H ++++A GY
Sbjct: 457 YIQQMRKQSGGAV--RLFSIGVGERVSHALVQVIATAGGGY 495
>gi|119606782|gb|EAW86376.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens]
gi|119606783|gb|EAW86377.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens]
Length = 947
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 94/203 (46%), Gaps = 18/203 (8%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 R------QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
+ +I +K + + S +++ G+G ++ FL+ L+ + G Y
Sbjct: 428 KCELKLSKIQKNVKENIQDNIS----LFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQD 483
Query: 494 IEIQMQKLFTRGFSSVLANIAID 516
Q++K + + + +L N+ +
Sbjct: 484 TSSQLKKFYNQVSTPLLRNVQFN 506
>gi|118361736|ref|XP_001014096.1| Glutathionylspermidine synthase family protein [Tetrahymena
thermophila]
gi|89295863|gb|EAR93851.1| Glutathionylspermidine synthase family protein [Tetrahymena
thermophila SB210]
Length = 1547
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 75/158 (47%), Gaps = 6/158 (3%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+ + IF++D SGSM G+P++ AL + L L FN+++F L E
Sbjct: 305 RSEFIFLLDRSGSMSGQPIDRACQALTLFLKSLPTDSYFNVISFGSSFKLLFPQSEKYNS 364
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+++E+A I G T I PL K V + +G +FL+TDG V+ Q+
Sbjct: 365 QSLEKAISNISKYKADLGGTEIYKPL-KNVFVQNKIQGYNKQVFLLTDGEVDSPEQVISL 423
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ R N S R+++ G GS + Y + A+ +G
Sbjct: 424 I--RKNNKFS---RVHSIGFGSGADQYLINQSAIAGKG 456
>gi|322704409|gb|EFY96004.1| von Willebrand domain containing protein [Metarhizium anisopliae
ARSEF 23]
Length = 897
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 102/208 (49%), Gaps = 8/208 (3%)
Query: 324 VFKKDVIFIVDISGSMQG-KPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSME 381
V K +++FI D SGSM G K + + +AL + L L G FNI +F E T+L+ S+E
Sbjct: 280 VEKPEIVFICDRSGSMAGGKTMPNLISALNIFLKSLPVGVRFNICSFGWECTFLWPKSVE 339
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
T++ + A + + G T + AP+ K + S +FL+TDG + D+ +
Sbjct: 340 Y-TQDTLNEAVRHVATFKADYGGTEMLAPVRKTFKKRIQE--SNLEVFLLTDGEIWDQDR 396
Query: 442 ICDAMKSRLT-NGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+ + + + G+I R++T GIG +H + A G+ A + + I+ ++ +
Sbjct: 397 LFHLINHEVAASKGAI--RVFTLGIGWGVSHSLINGAARAGNGFAQVAGEDEKIDKKIVR 454
Query: 501 LFTRGFSSVLANIAIDTLKDLDEFEMYP 528
+ + +++ ++ D E + P
Sbjct: 455 MLKGALTPHVSDYTLEIKYDKSEQDTAP 482
>gi|281207774|gb|EFA81954.1| type A von Willebrand factor domain-containing protein
[Polysphondylium pallidum PN500]
Length = 1739
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 90/192 (46%), Gaps = 14/192 (7%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
++IF+VD S SM G L K L + L FNI++F G T+ LF+ SM+
Sbjct: 355 ELIFLVDCSESMAGYNLNHAKKGLHLFCHSLPKSTFFNIISF-GSTFKKLFAKSMQYG-G 412
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICD 444
E +E A Q+I G TN+ APL + + N+ P IF++TDG V + I +
Sbjct: 413 ENLEAATQYIKNLKSESGETNLLAPL----QDIYNAPAVRPRKIFILTDGGVSNVGPIVE 468
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
++ N ++ G+G + + + ++A G A + ++IE ++ + R
Sbjct: 469 LVRQNADNTS-----VFPIGMGEFGSRQLVDVVANAGCGVAELAIENETIENKVMRQLKR 523
Query: 505 GFSSVLANIAID 516
NI ID
Sbjct: 524 ALQPAYTNIRID 535
>gi|118367272|ref|XP_001016851.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89298618|gb|EAR96606.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 1633
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 85/163 (52%), Gaps = 16/163 (9%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS-FNIVAFNG---ETYLFSTSME 381
+ + IFI+D SGSM+G+P+ A+ + L L P DS FN+++F + + FST
Sbjct: 1045 RSEFIFILDRSGSMRGQPIRRACEAIILFLKSL-PNDSYFNVISFGSSFEKLFPFSTKY- 1102
Query: 382 LATKEAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
T E++E+A Q I N+ + G T I PL V ++ G IFL+TDG V+
Sbjct: 1103 --TSESLEKAVQIIN-NYDSDLGGTEIYNPL-HNVFIMKRISGYNRQIFLLTDGEVDSSE 1158
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
Q+ + +K + R+++ G G + Y ++ A+ +G
Sbjct: 1159 QVIELIKK-----NNKYNRVHSIGFGFKADQYLIKESAIAGKG 1196
>gi|397515209|ref|XP_003827849.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Pan
paniscus]
Length = 946
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 88/197 (44%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + ATK
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKT 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
V A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 QVADAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G + FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDFDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|47218289|emb|CAG04121.1| unnamed protein product [Tetraodon nigroviridis]
Length = 979
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 112/258 (43%), Gaps = 46/258 (17%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELAT 384
+K+++F++D+SGSM G ++ T A+ L L DSF+IV FN +S + +
Sbjct: 310 LRKNIVFVIDVSGSMWGVKMKQTVEAMQAILDDLTVDDSFSIVDFNHNVRCWSEDLVPGS 369
Query: 385 KEAVERAHQWI-------------------------GINFIAGGSTNICAPLTKAVEMLT 419
V+ A +I G++ +G TNI L +AV+ML
Sbjct: 370 SVQVKDAKTYIENIKPNGGETQTQEAPPPPSPHADCGVSVASG--TNINEALMRAVQMLV 427
Query: 420 NSRG-------SIPIIFLVTDGAVEDERQI-CDAMKSRLTNGGSICPRIYTFGIGSYCNH 471
+ S+ +I LV+DG E R+I A++ + +++ GIG +
Sbjct: 428 RASHQGLVDPRSVSMIILVSDG--EPHREIKLSAIQKNVKRAMREEFSLFSLGIGFDVDF 485
Query: 472 YFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI----DTLKDLDEFEMY 527
FL +A +RG Y Q++ + + + +L I + D++ D+ +
Sbjct: 486 DFLERIATENRGVAQRIYANQDTADQLRSFYRQVAAPLLRRIDVQFPEDSVSDVTQ---- 541
Query: 528 PSRIPDLSSESPLIVSGR 545
+R S L+V+G+
Sbjct: 542 -NRFDKYFEGSELVVAGK 558
>gi|355696817|gb|AES00467.1| inter-alpha inhibitor H5 [Mustela putorius furo]
Length = 868
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 62/283 (21%), Positives = 121/283 (42%), Gaps = 23/283 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 235 FVHYFAP---RDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 291
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRG------ 423
+ ++ + T + V +I + G T+I L + +++L +
Sbjct: 292 SNRIKVWKDHLVSVTPDNVRDGKVYIH-HMSPTGGTDINGALQRGIKLLNDYVAHNDIED 350
Query: 424 ---SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
S+ +I +TDG V + + ++ G +C I+T GIG+ + L L+
Sbjct: 351 RDRSVSLIVFLTDGKPTVGETHTLKILNNTKEAARGQVC--IFTIGIGNDVDFMLLEKLS 408
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ + G + D Q+ + + +L++I +D + E + + P+ + S
Sbjct: 409 LENCGLTRRVLEEDDAGSQLIGFYDEIRTPLLSDIRVDYPPAMVE-QATRTLFPNYFNGS 467
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLD 581
++++G+ L + + FV+ L D+P++
Sbjct: 468 EIVIAGKLADSKAGQLHVEVTASNSKKFVI-----LKTDVPVE 505
>gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis]
gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis]
Length = 514
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 120/263 (45%), Gaps = 23/263 (8%)
Query: 304 VDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS 363
+++ ++ M L G S D++ ++D+SGSM+G+ + K A+ + KL D
Sbjct: 39 LEETKLKVMLELTGGDSSNDRPGLDLVAVLDVSGSMEGEKISKVKTAMLFMIKKLSSIDR 98
Query: 364 FNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR- 422
+IV F+G+ +++ E +R + + A G+TNI A L + +L + R
Sbjct: 99 LSIVTFSGDARRLCPLRQIS--ETSQRELENLINGLKAEGATNITAGLKTGLNVLNDRRL 156
Query: 423 --GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMI 480
G + I L++DG ++ DA + + N P ++TFG G NH R+L I
Sbjct: 157 SGGRVVGIMLMSDG---EQNAGGDAAQVPVGN----VP-VHTFGFG--INHEP-RVLKAI 205
Query: 481 SRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPL 540
++ G + ++Q ++ FS LA + ++DL ++ +RI D S+ +
Sbjct: 206 AQNSVGGTFS----DVQNTDNLSKAFSQCLAGLLTVVVQDL---KLTVTRIEDESTIEQV 258
Query: 541 IVSGRYQGKFPDTLKAKGFLGDL 563
Q K D GDL
Sbjct: 259 SAGSYPQSKDDDAGSITIMFGDL 281
>gi|456063611|ref|YP_007502581.1| cell division protein FtsZ [beta proteobacterium CB]
gi|455440908|gb|AGG33846.1| cell division protein FtsZ [beta proteobacterium CB]
Length = 775
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 101/423 (23%), Positives = 162/423 (38%), Gaps = 46/423 (10%)
Query: 118 SKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLI--ALGENDGAGKSASVETGSFLKPN 175
+K+ + + PM + +G++AEISG+ +I A A +T ++ N
Sbjct: 47 NKTLEVVYSFPMAWGATFMGMQAEISGQRLVGTVIEKAKATEQYEKAIADGDTPIMIEKN 106
Query: 176 ---IFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREK 231
++T L + + I ++Q L Y G + +P P Y I
Sbjct: 107 SDGLYTANLGNLKPNEDVVIEYSYAQLLKYEQGSMRIGIPTVIAPRYGDAKKGGIEDHHS 166
Query: 232 IH----------LNVNAGTGTE-VLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFD 280
L++ G E SH +K ++D G L S E E +D D
Sbjct: 167 TESDFLTEYPFSLSLQLTGGLEKATAECPSHQVKVTKQDEGLL-VSIEREAF----LDRD 221
Query: 281 FSYTVSPSHIFGGVLLQSPSLHDVDQREMF----CMYLL---PGTAKSRKVFKKDVIFIV 333
F I LQ+ S + V + C L P T ++K D+ +V
Sbjct: 222 F--------ILNLTGLQNQSFYIVTPDKQLGPDGCTVLASFCPPTIDNQKDSSLDLKILV 273
Query: 334 DISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQ 393
D SGSM+G ++ K AL LS L D F+ F T + ++ A + + A
Sbjct: 274 DCSGSMEGDSIQSAKRALHHVLSHLTAEDRFSYSKFGNHTNHLFSKVKPADQFHISAASL 333
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
I G T + A L +++ ++G + + L+TDG V + A K
Sbjct: 334 LIENTNADMGGTEVEAALQSTFDLM-GAQG-MADVLLITDGEVWSTNETIAAAKES---- 387
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI 513
RI+ G+GS L+ LA +S G + IE + ++F R NI
Sbjct: 388 ---GHRIFAVGVGSAPADTLLKELAEMSGGACELISPKEDIEKAIVRMFHRIRQPRACNI 444
Query: 514 AID 516
ID
Sbjct: 445 EID 447
>gi|351700150|gb|EHB03069.1| Poly [ADP-ribose] polymerase 4 [Heterocephalus glaber]
Length = 1704
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 130/606 (21%), Positives = 244/606 (40%), Gaps = 94/606 (15%)
Query: 82 SLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAE 141
+L+PL+ ++ + ++ V + T + H + + + P+ D+ ++ G EA
Sbjct: 618 NLVPLESVHIKGRIIDFVAQVIVFQTYTNQSHVPI-----EAKYIFPLDDKAAVCGFEAF 672
Query: 142 ISGKSYHTQLIALGENDGAGKSA-SVETGSFL----KPNIFTLTLPQIDGGSYLSIRLRW 196
I+GK ++ E ++A S G++L P++FT+++ + + + I++ +
Sbjct: 673 INGKHIVGEIKEKEEAQREYRAAVSQGHGAYLMDQAAPDVFTVSVGNLPPKAKVLIKITY 732
Query: 197 SQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLC---------- 246
+LS R G +V F P V P ++ LN N E +C
Sbjct: 733 ITELSIR-GPVAV---FFMPASVAPW------QQDKALNENLQDAIEKICIKETGTKQSF 782
Query: 247 ---------------NTSSHHLKQLRRDVGKLGYSYESEVLKWSNIDFDFSYTVSPSHIF 291
++ +H LKQ R D + + E L SN F +S +++
Sbjct: 783 SLTVCIEMPYVIDFISSDTHELKQKRTDCRAVISTMEGSTLD-SN-GFSLHIGLSNAYLP 840
Query: 292 GGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDVIFIVDISGSMQGKP 343
+ + P ++ CM + LPG A K +V+ +D S SM+G
Sbjct: 841 RMWVEKHP-----EKESEACMLVFQPDLDATLPGPAN-----KNEVVICLDCSNSMEGVT 890
Query: 344 LEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIA 401
K ALS + N+V F G +Y LFS + ++I
Sbjct: 891 FTQAKQIALYALSLVGEEQKVNVVQF-GTSYKELFSYPKHITNSNV---PTEFIKSATPT 946
Query: 402 GGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIY 461
G+T+ L + + +L S+G + + L++DG +++E ++ +L + R++
Sbjct: 947 MGNTDFWKTL-RYLGLLYPSQG-VRNVLLISDGHLQNE-----SLTLKLVKRNVLHTRLF 999
Query: 462 TFGIGSYCNHYFLRMLAMISRG---YYGAAYDLD---SIEIQMQKLFTRGFSSVLANIAI 515
T G G N + LR LA G Y+ + + IE QM +L + SV +I
Sbjct: 1000 TCGTGPTANRHVLRNLAQCGAGVFEYFNSRSEHSWKKQIEDQMNRLHSPSCHSV--SIKW 1057
Query: 516 DTLK-DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQL 574
L D E P+++P L L+V G TL A ++ + V +LQ
Sbjct: 1058 QQLSADAQEPLQAPAQVPSLFHNDRLLVYGLISHCMQATLCALIQEKEICSMVSTTELQK 1117
Query: 575 AKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEK-------VSKMSVQTGVLCEYTRMIIV 627
+ ++ A+ I ++ E K + K+S + ++ ++T + +
Sbjct: 1118 TTGTMIHKLAARALIRDFEDGILHKDEANHEMKKQIMKSLIIKLSKENSLITQFTSFVAI 1177
Query: 628 ETDERN 633
E + N
Sbjct: 1178 EKRDGN 1183
>gi|350296059|gb|EGZ77036.1| hypothetical protein NEUTE2DRAFT_78055 [Neurospora tetrasperma FGSC
2509]
Length = 1067
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 95/202 (47%), Gaps = 10/202 (4%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
+ +++F+ D SGSM G +E K+AL + L + G FNI +F G TY LFS
Sbjct: 293 RPEIVFVCDRSGSMGGARIECLKSALRIFLKSIPVGAKFNICSF-GSTYEFLFSDGSRSY 351
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
E++ A ++ G T + PL A E + +FL+TDG + ++ +
Sbjct: 352 DHESLRLAMDYVSRMDADLGGTEMYQPLEAAFE--KRYKDMDLEVFLLTDGEIWNQEHLF 409
Query: 444 DAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
+ +++ G+I R++T GIG+ +H + A G+ D + + ++ ++
Sbjct: 410 TMINKKVSESQGAI--RLFTLGIGNDVSHALIEGAARAGNGFAQTVTDSEKMNAKVVRML 467
Query: 503 TRGFSSVLAN--IAIDTLKDLD 522
G + + + + I KD D
Sbjct: 468 KAGLTPHIKDYTLEIKYAKDSD 489
>gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex]
Length = 829
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 119/552 (21%), Positives = 224/552 (40%), Gaps = 93/552 (16%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYV 219
AG+SA+ T + N FT+++ I+ + L + + LS R G + + +V
Sbjct: 76 AGQSAAQVTANARNANQFTVSV-NIEPEQKIFFNLTYEELLSRRKGIYEQAI------HV 128
Query: 220 TPA--IKKIPKREKIH--LNVNAGTGTEVLCNTSSHHLKQLRRD-----VGKLGYSYESE 270
TP + K+ R I L +N T ++ N + + Q+ R V + E+E
Sbjct: 129 TPGSVVPKMSVRVNIFETLPINKITVPQLRGN--DNEIAQIIRTNETTVVSVVYEPTEAE 186
Query: 271 VLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVI 330
+K S + V + + +H VD F + +P + K VI
Sbjct: 187 QIKMSKDGLQGQFVVQYDVDRSSIEKKGGEIHVVDG--YFVHFFVPADLPT---LPKHVI 241
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPG-DSFNIVAFNG--------------ETYL 375
F++D SGSM G +E TK A+ L +L D F++V F+ + Y
Sbjct: 242 FVLDTSGSMAGTRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKPYKGPDHYY 301
Query: 376 FSTSMELATKEA----------------------------VERAHQWIGINFIAGGSTNI 407
F++ E T++A V+RA +++ + STNI
Sbjct: 302 FNSPPEETTEDATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMDVTS-STNI 360
Query: 408 CAPLTKAVEMLTNSRGSI---PIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRI 460
L A++ + + + PII +TDG +V D +I + R N + I
Sbjct: 361 NDALLLALKNSQSVQSRVRLTPIIIFLTDGEPTASVTDTTEILKNV--RKGNSDDVV-SI 417
Query: 461 YTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA-IDTLK 519
+ G+ ++ FL ++ +RG+ Y+ +Q++ F S +L+N+ +
Sbjct: 418 FCLAFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGFFDEVASPLLSNVNFVYNNN 477
Query: 520 DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFV----------VE 569
+ ++ + +P+ + +V+GR KF + G G +F+ V
Sbjct: 478 GPVKHDVTETNVPNFFKGTEFVVAGRIDPKFNLSASISG-TGASGSFLFPNVIPAAPNVT 536
Query: 570 LKLQLAKDIPLDRICAKQQI-DLLTAQAWFSEDKRLEE---KVSKMSVQTGVLCEYTRMI 625
+ + K L++I A I D+L + + KR+ E K +S+ + T ++
Sbjct: 537 VVTKPKKSFSLEKIWAYMTIQDMLRKRQIIDDAKRIVEINQKALNLSLAYDFVTPLTSLV 596
Query: 626 IVETDERNNASE 637
+V+ + A++
Sbjct: 597 VVKPNSTATAAD 608
>gi|421110938|ref|ZP_15571424.1| von Willebrand factor type A domain protein [Leptospira santarosai
str. JET]
gi|422002848|ref|ZP_16350083.1| von Willebrand factor type A [Leptospira santarosai serovar
Shermani str. LT 821]
gi|410803656|gb|EKS09788.1| von Willebrand factor type A domain protein [Leptospira santarosai
str. JET]
gi|417258593|gb|EKT87980.1| von Willebrand factor type A [Leptospira santarosai serovar
Shermani str. LT 821]
gi|456877621|gb|EMF92636.1| von Willebrand factor type A domain protein [Leptospira santarosai
str. ST188]
Length = 516
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 84/168 (50%), Gaps = 16/168 (9%)
Query: 326 KKDVIFIVDISGSMQG-KPLEDTKNALAVALSKLDPGDSFNIVAFNGE--TYLFSTSMEL 382
KK + F++D+SGSM+G LED+K AL L ++ GD F++V F+ + T++ +++L
Sbjct: 114 KKALCFVLDVSGSMEGDNKLEDSKAALIQFLMEMKVGDEFSLVVFDNDASTFISPQTIDL 173
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDG----AVE 437
+++ + + I AGG TNI A L EM S + L+TDG V
Sbjct: 174 NSRKKIISKVKDIQ----AGGGTNIEAGLILGYSEMSKFHPRSTRRLILLTDGISNVNVH 229
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+I K + G RI T G+G N LR LA +G+Y
Sbjct: 230 TPEEIAKKAKVQYLEGS----RISTIGLGYDVNQTLLRSLAENGKGHY 273
>gi|449302708|gb|EMC98716.1| hypothetical protein BAUCODRAFT_64769 [Baudoinia compniacensis UAMH
10762]
Length = 914
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 80/159 (50%), Gaps = 7/159 (4%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKE 386
+++FI D SGSM+G NAL + + L G FNI+AF N T L+ +S + +
Sbjct: 290 EIVFIADQSGSMRGSKNTALVNALQIFIKSLPLGVRFNIIAFGNNFTSLWPSS-QAYNET 348
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVEDERQICDA 445
+V++A ++ + G T I P+ A E +P+ + L+TDG V E +
Sbjct: 349 SVDKAIAFVNGFTASYGGTEILKPVKAAFEQRLK---DLPLEVMLLTDGEVWQENSVFSF 405
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
+ ++ + I R++ GIGS+ +H + +A G+
Sbjct: 406 INEQV-HDKKIDARVFALGIGSHVSHTLVEGVARAGNGF 443
>gi|409047759|gb|EKM57238.1| hypothetical protein PHACADRAFT_194803 [Phanerochaete carnosa
HHB-10118-sp]
Length = 868
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 90/195 (46%), Gaps = 5/195 (2%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL-DPGDSFNIVAFNGETYLFSTSMEL 382
V +++ IF+VD SGSM G +E K L+ L L G SFNI +F + E+
Sbjct: 274 VTRQEFIFVVDRSGSMSGSRVETAKRTLSTLLRTLPSAGTSFNIFSFGSYCNSLWQTSEV 333
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
T +A++ A + G T I A L +A SR + ++F++TDG D+ +
Sbjct: 334 YTNDALDVATRHAASMDANYGGTEIGAALLRACNGRDPSRPT--VMFVLTDGKAYDQ-MV 390
Query: 443 CDAMKSRLTNGGSICP-RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ + + + P R++T GIG+ + +A G A + I + L
Sbjct: 391 YTVVANAVARSTAAAPLRVFTLGIGATASSAMCEGIARAGNGKCLMALASEDIVGKCATL 450
Query: 502 FTRGFSSVLANIAID 516
G S +L N+++D
Sbjct: 451 MRAGRSFLLKNVSVD 465
>gi|440790803|gb|ELR12071.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 324
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 87/188 (46%), Gaps = 12/188 (6%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELAT 384
+ ++IF+VD SGSM G ++ K L + L L G FNI +F + + F++S+E
Sbjct: 18 QTELIFLVDRSGSMAGSRMDIAKATLQLFLRSLPEGVRFNIFSFGSKFSSQFASSVEY-D 76
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
++++ RA + G T++ PL + G IFL+TDG V + ++
Sbjct: 77 EQSLMRADAHVNAMSADMGGTDLLKPLQHVLRQ-REHEGLSRQIFLLTDGEVLNAEEVIQ 135
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ RI+TF IG+ + L LA+ RG +++ +++ + + +
Sbjct: 136 LAGT---------TRIFTFAIGNEADKNMLHELALCGRGAMEFVSEVEHMKVHVMRQLQK 186
Query: 505 GFSSVLAN 512
+ N
Sbjct: 187 ALEPAMTN 194
>gi|328865688|gb|EGG14074.1| type A von Willebrand factor domain-containing protein
[Dictyostelium fasciculatum]
Length = 908
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 84/192 (43%), Gaps = 8/192 (4%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELAT 384
K + +F++D SGSM G + K AL + + L FNIV F LF SM
Sbjct: 297 KSEFVFLLDCSGSMSGGAITKAKRALEILMRSLTENSKFNIVLFGSNFKSLFPESMPYDD 356
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+E A +I G T + P+ + + + +F++TDG V + Q+ D
Sbjct: 357 AN-LEIASTYIQKIQADLGGTELLPPIKSILSKPYDPQYPRQ-VFILTDGEVSERDQLID 414
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ RI+T GIGS + + L+ +GYY + +E Q+ KL +
Sbjct: 415 -----FVGKEANTTRIFTLGIGSGVDRELVIGLSKSCKGYYEFIEENSMMETQVVKLMSI 469
Query: 505 GFSSVLANIAID 516
++NI +D
Sbjct: 470 AMEPTISNIRVD 481
>gi|321479343|gb|EFX90299.1| hypothetical protein DAPPUDRAFT_309817 [Daphnia pulex]
Length = 840
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 162/391 (41%), Gaps = 36/391 (9%)
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREKI 232
P++F + + GS IRL + +L GE +P P YV P ++
Sbjct: 108 PDVFKAKVGNLAAGSGAKIRLTYVTELKVEGGEIRFYLPTTIAPRYVPPTDHSSVAKDLA 167
Query: 233 HLNVNAGTGTEVLCNTS---SHHLKQLRRDVGKLGYSYES------EVLKWSNI----DF 279
+N + V N + +K++R K+ S++S LK ++ D
Sbjct: 168 SINYTQESHYRVDFNATVRMGSAIKEIRSPTHKIVVSHDSVNKPMDGTLKLDDVVTSMDR 227
Query: 280 DFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSM 339
D + + L+ S + + L+P + + K ++IF+VD SGSM
Sbjct: 228 DLVVYIKVAEPNQPRLIHEKS---SNGSTAVMLSLVPSFKLAEQ--KTELIFLVDRSGSM 282
Query: 340 QGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSM---ELATKEAVERAHQW 394
G +E K AL + L L P D + +A G +Y LF +S E+ A A Q
Sbjct: 283 AGSGIEQAKQALKLFLHSL-PTDCYVNIAGFGFSYEELFPSSRKYDEVVLNHAKAYAEQ- 340
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGG 454
+ N G T I PL + + G + +F++TDG V + Q+ + R +G
Sbjct: 341 LQANL---GGTEIFNPLEQIFQK-PPIDGYLRQVFVLTDGEVSNADQVLGLV--RRHSGR 394
Query: 455 SICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIA 514
+ R++ G+GS +H+ + +A G A + +E ++ + L +I
Sbjct: 395 T---RLFALGLGSSASHHLVEGMARAGNGTALFASLEERLEKKVMQQLQDALQPALTDIK 451
Query: 515 IDTLKDLDEFEMYPSRIPDLSSESPLIVSGR 545
I+ + DE E + + +E L+ G+
Sbjct: 452 IN-WEGYDETEQSTTPAQPVQTEMTLMGYGK 481
>gi|147898495|ref|NP_001088330.1| inter-alpha-trypsin inhibitor heavy chain 2 precursor [Xenopus
laevis]
gi|71051796|gb|AAH98981.1| LOC495168 protein [Xenopus laevis]
Length = 935
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 94/199 (47%), Gaps = 12/199 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T +A+ L L+ D F I+ FN + EL
Sbjct: 304 KNILFVIDVSGSMWGLKMKQTVDAMKSILEDLNSDDQFGIIDFNHNIRCWKD--ELVYAS 361
Query: 387 AVERAHQWIGINFIA-GGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG-AVE 437
+VE+ + I G TNI L +A+ +L + + S+ +I LV+DG
Sbjct: 362 SVEKGDASKYVQRIQPNGGTNINDALLRAIFILKEASNKGLLEQNSVSLIVLVSDGDPTV 421
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
E ++ K+ TN +++ GIG ++ FL LA + G Y Q
Sbjct: 422 GELKLPKIQKNVRTNIQDDIA-LHSLGIGFDVDYDFLERLAQENHGMAQRIYGNQDTAAQ 480
Query: 498 MQKLFTRGFSSVLANIAID 516
+++ + + + +L NI ++
Sbjct: 481 LKEFYNKVSTPLLKNIVVN 499
>gi|118357562|ref|XP_001012030.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89293797|gb|EAR91785.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 894
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 10/160 (6%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
+ + IF++D SGSM G+P++ AL + L L FN+V+F G +Y LF +S++
Sbjct: 311 RSEFIFLLDRSGSMIGQPIQKACEALILFLKSLPIDSYFNVVSF-GSSYEKLFQSSIKYD 369
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
T ++E+A + I G T I PL ++V T G IFL+TDG VE +++
Sbjct: 370 TN-SLEKAIKIIKEYTADLGGTEIYKPL-QSVFKETKIDGYNKQIFLLTDGEVESPKKVV 427
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+L + RI + G GS + Y + A+ +G
Sbjct: 428 -----QLIRKNNKFNRINSIGFGSGADKYLIEESAIAGKG 462
>gi|380805229|gb|AFE74490.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor,
partial [Macaca mulatta]
Length = 448
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 82/179 (45%), Gaps = 15/179 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L TK+AL L L P D F+I+ F
Sbjct: 269 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGF 325
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT-----NSRG- 423
+ ++ + T +++ +I + G T+I L +A+ +L N G
Sbjct: 326 SNRIKVWKDHLISVTPDSIRDGKVYIH-HMSPTGGTDINGALQRAIRLLNKYVAHNDIGD 384
Query: 424 -SIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
S+ +I +TDG V + + +R G +C I+T GIG+ + L L++
Sbjct: 385 RSVSLIIFLTDGKPTVGETHTLKILNNTREAARGQVC--IFTIGIGNDVDFRLLEKLSL 441
>gi|75907531|ref|YP_321827.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413]
gi|75701256|gb|ABA20932.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413]
Length = 427
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 107/257 (41%), Gaps = 38/257 (14%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM GKPL+ A+ L +L PGD ++VAF+G + + + E+++
Sbjct: 46 LILDQSGSMHGKPLKMVIAAVERLLDRLQPGDRISVVAFSGSATVIIPNQIVEDPESIKT 105
Query: 391 AHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVEDERQIC------ 443
I A G T I L + + E++ +RG++ FL+TDG ED +I
Sbjct: 106 Q---IRKKLQASGGTVIAEGLQQGITELMKGTRGAVSQAFLLTDGHGEDSLKIWKWEIGP 162
Query: 444 -DAMKSR--LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
D+ + + I I T G G+ N L +A G + +
Sbjct: 163 DDSRRCQEFAKKAAKINLTINTLGFGNNWNQDLLETIADAGGGTLAHIERPEQAVHHFNR 222
Query: 501 LFTRGFSSVLAN----------IAIDTLK-------DLDEFEMYPS-------RIPDL-S 535
LF R S L N + + L+ D+ E + P R+ DL
Sbjct: 223 LFARVQSVGLTNAYLILSLAPQVRLAELRPIAQVAPDIIELPVEPEADGSFIIRLGDLMK 282
Query: 536 SESPLIVSGRYQGKFPD 552
E+ ++++ Y GK P+
Sbjct: 283 DENRVVLANLYLGKLPE 299
>gi|402084959|gb|EJT79977.1| hypothetical protein GGTG_05059 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1080
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/393 (21%), Positives = 169/393 (43%), Gaps = 58/393 (14%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL 234
++FT T+ + G+ + + + + +L + D E + + F P + P + + L
Sbjct: 153 DVFTTTIGNVPAGAEIRVEITYLGELKH-DAE-ADGMRFTIPTTIAPRYGSM-SHDTQPL 209
Query: 235 NVNAG----TGTEVLCNT-----SSHHLKQLR----------RDVGKLGYSYESEVLKWS 275
+AG G ++ + S+H+K ++ + G G S + L
Sbjct: 210 PFSAGWSRAGGIDITVDVEIPAVGSNHIKTIKSPSHPISVEMKAFGATGMSAATAELSIG 269
Query: 276 NIDFDFSYTV----SPSHIFGGVLLQSPSLHDVDQREMFCMYL----LPGTAKSRKVFKK 327
++ D + V + VL Q PS+ QR + ++ LP +
Sbjct: 270 ALELDKDFVVIVVLAKPDAPVVVLEQHPSI--AHQRALMATFVPRFNLP-------QIRP 320
Query: 328 DVIFIVDISGSM-QGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATK 385
+++F+ D SGSM GK + + AL V L L G FNIV+F + ++LF S ++
Sbjct: 321 EIVFVCDRSGSMGMGKRIPNLIAALHVFLRSLPVGPKFNIVSFGSSHSFLFERSRSY-SQ 379
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVEDERQICD 444
+++ A + + G T + AP+ ++ + + IF++TDG V ++ ++ D
Sbjct: 380 SSLDEASRHVDTFDSGYGGTELLAPID---DVFKRRYADMELEIFVLTDGEVWNQSKLFD 436
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
+ S + R++T GIG+ + + LA RG+ D + + ++ ++
Sbjct: 437 MVNSNIAKCQGAA-RLFTLGIGNDVSTALVEGLARAGRGFAQIVGDEEKLNSKVVRMLKA 495
Query: 505 GFSSVLANIAIDTL-----------KDLDEFEM 526
S+ + + +++ K LD+FEM
Sbjct: 496 SLSTHINDYSLEIQYAAKPKADTQPKPLDDFEM 528
>gi|405954843|gb|EKC22171.1| Poly [ADP-ribose] polymerase 4 [Crassostrea gigas]
Length = 984
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 134/322 (41%), Gaps = 58/322 (18%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+VI ++D S SM+ D K A + L +DP FN+
Sbjct: 332 EVILMIDSSNSMKDSSHRDAKKAALLVLHLMDPTWRFNVKL------------------- 372
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMK 447
+A+Q G+T +C PL L S S IFL +DG + ++ +
Sbjct: 373 --QANQ---------GNTELCRPLHSY--YLLKSESSTRNIFLFSDGYINNDEDTLVKI- 418
Query: 448 SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG---YYGAAYDL---DSIEIQMQKL 501
+ S R++T GI S N + L+ +A + G +Y + + D + Q+QK
Sbjct: 419 ----SQNSQHTRVFTMGISSVANRHLLKAIARVGAGSFEFYDSKFKSKWEDKVRSQLQK- 473
Query: 502 FTRGFSSVLANIAIDTLKDLDEFE---MYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKG 558
VL ++++D D D+ P +I L S S +V G + TLKAK
Sbjct: 474 ---AAQPVLTSVSVDWRHD-DQHPAPIQAPQQITALFSGSRQVVYGFVDNCYMATLKAKI 529
Query: 559 FLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKM 611
+ ++S V L + K L R+ AK I +L+ E K+ L++ V ++
Sbjct: 530 GVQEISTIVSTSDLSVTKGKMLHRLTAKGVIRDWEDGVLSPDRTGHEVKKMNLKQYVIEL 589
Query: 612 SVQTGVLCEYTRMIIVETDERN 633
S + ++ T + +E E++
Sbjct: 590 SKRYSIVTSLTSFVAIEKREKD 611
>gi|17231852|ref|NP_488400.1| hypothetical protein alr4360 [Nostoc sp. PCC 7120]
gi|17133496|dbj|BAB76059.1| alr4360 [Nostoc sp. PCC 7120]
Length = 427
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 13/192 (6%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM G+PL+ A+ L +L PGD ++VAF G + + + E+++
Sbjct: 46 LILDQSGSMHGQPLKMVVEAVEKLLDRLQPGDRISVVAFAGSATVIIPNQIVENPESIKT 105
Query: 391 AHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVEDERQIC------ 443
I A G T I L + + E++ +RG++ FL+TDG ED +I
Sbjct: 106 Q---IRKKLQASGGTVIAEGLQQGITELMKGTRGAVSQAFLLTDGHGEDSLKIWKWEIGP 162
Query: 444 -DAMK--SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
D+ + I I T G G+ N L +A G + +
Sbjct: 163 DDSRRCLEFAKKAAKINLTINTLGFGNNWNQDLLETIADAGGGTLAHIERPEQAVHHFNR 222
Query: 501 LFTRGFSSVLAN 512
LFTR S L N
Sbjct: 223 LFTRVQSVGLTN 234
>gi|119628741|gb|EAX08336.1| poly (ADP-ribose) polymerase family, member 4 [Homo sapiens]
Length = 1724
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 228/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYLEAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAS-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTT- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHVLRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D+ E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDVPEALQAPAQVPSLFLNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|54038464|gb|AAH84380.1| LOC495168 protein, partial [Xenopus laevis]
Length = 554
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 94/199 (47%), Gaps = 12/199 (6%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T +A+ L L+ D F I+ FN + EL
Sbjct: 304 KNILFVIDVSGSMWGLKMKQTVDAMKSILEDLNSDDQFGIIDFNHNIRCWKD--ELVYAS 361
Query: 387 AVERAHQWIGINFIA-GGSTNICAPLTKAVEMLTNS-------RGSIPIIFLVTDG-AVE 437
+VE+ + I G TNI L +A+ +L + + S+ +I LV+DG
Sbjct: 362 SVEKGDASKYVQRIQPNGGTNINDALLRAIFILKEASNKGLLEQNSVSLIVLVSDGDPTV 421
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
E ++ K+ TN +++ GIG ++ FL LA + G Y Q
Sbjct: 422 GELKLPKIQKNVRTNIQDDIA-LHSLGIGFDVDYDFLERLAQENHGMAQRIYGNQDTAAQ 480
Query: 498 MQKLFTRGFSSVLANIAID 516
+++ + + + +L NI ++
Sbjct: 481 LKEFYNKVSTPLLKNIVVN 499
>gi|345801850|ref|XP_536940.3| PREDICTED: von Willebrand factor A domain-containing protein 3A
[Canis lupus familiaris]
Length = 1171
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 92/200 (46%), Gaps = 17/200 (8%)
Query: 313 YLLPGTAKSRKVF----KKDVIFIVDISGSMQGKPLEDTKNALAVALSK--LDPGDSFNI 366
+LL G SR++F ++ V +VD SGSM G L+ K L + + + DSFN+
Sbjct: 930 WLLSG---SRRLFGTILERKVCILVDTSGSM-GPYLQQVKTELVLLIWEQLRKHCDSFNL 985
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
++F + L+ ++ T EA A QW+ + A GST++ L KA +
Sbjct: 986 LSFAEDLQLWRDTLVETTDEACHEAMQWV-THLRAHGSTSVLQALLKAFTF-----HDVE 1039
Query: 427 IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
++L+TDG + + RLT G + FLR LA ++ G Y
Sbjct: 1040 GLYLLTDGKPDTSCSLILREVQRLTETGDVKMHTIALNHSGRAAGDFLRNLAALTGGRYH 1099
Query: 487 AAYDLDSIEIQMQKLFTRGF 506
D D++ +++ L T+G
Sbjct: 1100 CPVDEDTL-LRIHGLLTKGI 1118
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 321 SRKVF----KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF 376
SR+++ +K V+ ++DIS + + + + +L D FNI+AF +
Sbjct: 487 SRRIWGTVCEKRVVILLDISVTNSMYIIHIQHSLRLLLEEQLSNKDCFNIIAFGSTIESW 546
Query: 377 STSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
+ M + E ++ A +W + GS N+ + L KAVE+
Sbjct: 547 RSEMVAVSHENLQSAWRW-ALTLQCRGSRNVLSALRKAVEV 586
>gi|327272010|ref|XP_003220779.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like
[Anolis carolinensis]
Length = 919
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/343 (22%), Positives = 144/343 (41%), Gaps = 33/343 (9%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P + K+++F++D+SGSM G + T A+ L L D F+++ F
Sbjct: 274 FIHYFAPDNLEP---LPKNILFVIDVSGSMWGLKMRQTVEAMKTILDDLRSNDQFSVLDF 330
Query: 370 NGETYLFSTSMELATKEAVERAHQWI-GINFIAGGSTNICAPLTKAVEMLTNSRG----- 423
N + S+ A+ E A ++I GI+ G TNI L +A+ +L +
Sbjct: 331 NHNVRCWRDSLVQASNAQTEAAKKYIEGIH--PNGGTNINDALLRAIFILKEASNMGMLD 388
Query: 424 --SIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
S +I LV+DG ++ + ++ GIG ++ FL+ LA +
Sbjct: 389 PSSTSMIVLVSDGDPTVGELKLPTIQKNVKKNIQDDISLFCLGIGFDVDYDFLKRLAQEN 448
Query: 482 RGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
G + QM+K F + + +L + + +++ ++ + + S ++
Sbjct: 449 NGMAHRVFGNQETSSQMRKFFNQVSTPLLKKLEFNYPQEMVS-DVTHNSFQNYFGGSEIV 507
Query: 542 VSGRYQGKFPDTLK---------AKGFLGDLSNFVVELKLQLAKDIPLD-----RICAKQ 587
V+G+ L+ A+ L L++ V L L KD D ++ A
Sbjct: 508 VAGKVDTANTQHLQSVITATASNAQLVLETLAD-VEGLDDFLNKDKHADPKFTEKLWAYI 566
Query: 588 QIDLLTAQAWFSEDKRLEEKVSK----MSVQTGVLCEYTRMII 626
I+ L A+ + L+ ++K MS+ ++ YT M+I
Sbjct: 567 TINQLMAERNMAPTAALKRNITKQILQMSLHHHIVTPYTAMLI 609
>gi|441509448|ref|ZP_20991366.1| hypothetical protein GOACH_11_01240 [Gordonia aichiensis NBRC
108223]
gi|441446546|dbj|GAC49327.1| hypothetical protein GOACH_11_01240 [Gordonia aichiensis NBRC
108223]
Length = 421
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 77/167 (46%), Gaps = 8/167 (4%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
++D SGSM G PLE K ALA +S+LDP D F +V F+ E + + LA K ER
Sbjct: 43 IVLDRSGSMSGPPLEGAKKALAGVISRLDPHDVFGVVTFDNEAQVAVPAGLLADK---ER 99
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI----IFLVTDGAVEDERQICDAM 446
A + + ++GGST++ + + ++ L + + I + +++DG V D
Sbjct: 100 AADLVE-SIVSGGSTDLSSGYLRGLQELRRAAKQVGISGGTVLVISDGHVNSGISDVDEF 158
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
S + T G G + L +A G + A D D+
Sbjct: 159 ASLTAKAAADGIVTSTLGYGRGYDETLLAAIARSGNGNHVFAEDPDA 205
>gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative
[Ricinus communis]
gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative
[Ricinus communis]
Length = 514
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 94/197 (47%), Gaps = 20/197 (10%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ ++D+SGSM G +E K A+ + KL P D ++V F+ + ++
Sbjct: 63 DLVAVLDVSGSMAGDKIEKVKTAMLFVIKKLSPIDRLSVVTFSADANRLCPLRQITENSQ 122
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTN---SRGSIPIIFLVTDGAVEDERQICD 444
+ G+N A G+TNI A L +++L++ S G + I L++DG ++ D
Sbjct: 123 KDLEKLINGLN--ADGATNITAGLQTGLKVLSDRSLSGGRVVGIMLMSDG---EQNAGGD 177
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
A + + N P +YTFG G NH R+L I+ G + ++Q ++
Sbjct: 178 AAQVPVGN----VP-VYTFGFG--INHE-PRVLKAIAHNSIGGTFS----DVQDTNNLSK 225
Query: 505 GFSSVLANIAIDTLKDL 521
FS LA + ++DL
Sbjct: 226 AFSQCLAGLLTVVVQDL 242
>gi|112789550|ref|NP_006428.2| poly [ADP-ribose] polymerase 4 [Homo sapiens]
gi|308153574|sp|Q9UKK3.3|PARP4_HUMAN RecName: Full=Poly [ADP-ribose] polymerase 4; Short=PARP-4; AltName:
Full=193 kDa vault protein; AltName:
Full=ADP-ribosyltransferase diphtheria toxin-like 4;
Short=ARTD4; AltName: Full=PARP-related/IalphaI-related
H5/proline-rich; Short=PH5P; AltName: Full=Vault
poly(ADP-ribose) polymerase; Short=VPARP
gi|189442372|gb|AAI67791.1| Poly (ADP-ribose) polymerase family, member 4 [synthetic construct]
Length = 1724
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYLEAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAS-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTM- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHVLRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D+ E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDVPEALQAPAQVPSLFLNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 RTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|5702306|gb|AAD47250.1| vault protein [Homo sapiens]
Length = 1724
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 226/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQQEYLEAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSAAYLPRMWVEKHP-----EKESEACMLVFQPDLDVDLPDLAS-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQITLHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTT- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --AAEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIGS N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGSTANRHVLRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|198438279|ref|XP_002126427.1| PREDICTED: similar to LOC779593 protein [Ciona intestinalis]
Length = 1012
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 94/434 (21%), Positives = 165/434 (38%), Gaps = 100/434 (23%)
Query: 283 YTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGK 342
Y V + +FG + + + F + P S F K V+F+VD SGSM G
Sbjct: 295 YDVERTQMFGDIAIHN---------GFFAHHFAP---PSLAAFPKLVVFVVDTSGSMFGY 342
Query: 343 PLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFI 400
L+ K ALA +L L+ D FNIV F G+T S + A+ ++ A ++
Sbjct: 343 KLKQVKQALADSLRSLNNEDHFNIVVF-GDTAEPWISGVLSTASTRSINDAITYVDAVSA 401
Query: 401 AGGSTNICA--------------------------------PLT---------KAVEMLT 419
GG+ + A PL +A E T
Sbjct: 402 RGGTNMLVALQTAFAIMEPYLPSLPENETMVEDTTPFPTPVPLQPETNHFIRKRATETQT 461
Query: 420 NSRGSIPIIFLVTDGAVEDERQICDAMKSRL--TNGGSICPRIYTFGIGSYCNHYFLRML 477
+I +TDG + D + SR+ NGG + ++T G GS + FL L
Sbjct: 462 ELSNYAKMIVFLTDGRPTKDDVGTDDIASRIEKINGGRV--NLHTIGFGSLVDMRFLEKL 519
Query: 478 AMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
A ++ G ++ Q++ F + VL ++ I+ D ++ + + L S
Sbjct: 520 AALNGGVSRRVFESLDAATQIRHFFDEVSAPVLTDVTIEYAAD-KVGDVTTNDVDILFSG 578
Query: 538 SPLIVSG-----------------RYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKD--- 577
+ ++V+G R + + TL+ + F + +E+ + D
Sbjct: 579 TEMVVAGKILEGEDLSGLDELGESRPRRQITGTLRGRSFENE-----IEMDFDIDPDATS 633
Query: 578 --------IP--LDRICAKQQIDLLTAQAWFSED----KRLEEKVSKMSVQTGVLCEYTR 623
IP +R+ A ++ +L A + + K L E+ MS+ + T
Sbjct: 634 SITESTHQIPDFTERLWAFLKVKMLLRSANIAPNETLKKSLREEALNMSLSHNFVTPLTS 693
Query: 624 MIIVETDERNNASE 637
+++V ++R E
Sbjct: 694 LVVVRPEDRRKIEE 707
>gi|291392970|ref|XP_002712949.1| PREDICTED: poly (ADP-ribose) polymerase family, member 4 [Oryctolagus
cuniculus]
Length = 2694
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 133/598 (22%), Positives = 251/598 (41%), Gaps = 69/598 (11%)
Query: 82 SLIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAE 141
+LIPL+ ++ + ++ V + T + H + + + P+ D+ ++ G EA
Sbjct: 616 NLIPLENVHIKGRIIDFVAQVVVFQTYTNQSHVPI-----EAKYIFPLDDKAAVCGFEAF 670
Query: 142 ISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNIFTLTLPQIDGGSYLSIRLRW 196
I+GK ++ E + A E G++L P+IFT+++ + + + I++ +
Sbjct: 671 INGKHIVGEIKEKEEAQREYREAISEGHGAYLMDQDAPDIFTVSVGNLPPKAKVLIKITY 730
Query: 197 SQKLSYRDGEFSVNVPFKFPEYVTP------------------AIKKIPKREKIHLNVNA 238
+LS + G +V F P V P +K+I K++ L ++
Sbjct: 731 ITELSLQ-GPVAV---FFMPATVAPWQQDKALNENLQDTVAKVCVKEIGKQQSFSLTMSI 786
Query: 239 GTGTEVLCNTS-SHHLKQLRRDVGKLGYSYESEVLKWSNIDFD-FSYTVSPSHIFGGVLL 296
+ +S +H LKQ D + + E S++D + FS VS S + +
Sbjct: 787 EMPYVIKSISSDTHKLKQKSTDCKAVISTVEG-----SSLDSNGFSLHVSLSDAYLPRMW 841
Query: 297 QSPSLHDVDQREMFCMYLLPGTAKS--RKVFKKDVIFIVDISGSMQGKPLEDTKNALAVA 354
H + E + P A + K +VI +D S SM+G K A
Sbjct: 842 VEK--HPEKESEACMLVFQPHLAATLPELAVKSEVIICLDCSNSMEGVTFRQAKQIALHA 899
Query: 355 LSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
LS + N+V F G LFS +A+ A ++I G+T+ L +
Sbjct: 900 LSLVGEKQKVNVVQFGTGYKELFSYPKHIASNNA---PTEFILSATPTLGNTDFWKTL-R 955
Query: 414 AVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
+ +L ++G + I L++DG ++ E +K + + R++T G+GS N +
Sbjct: 956 YLSLLYPAQG-LRNILLISDGHLQSESLTLQLVKRNVHHT-----RLFTCGVGSTANRHI 1009
Query: 474 LRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK-DLDEFE--MYP 528
LR L+ G Y + S + Q+++ TR S ++++ + +LD E P
Sbjct: 1010 LRTLSQYGAGVFEYFNSNSKQSWKKQIEEQMTRLRSPSCHSVSVKWQQLNLDAPEPLQAP 1069
Query: 529 SRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQ 588
+++P L L+V G TL A + V +LQ + ++ A+
Sbjct: 1070 AQVPSLFHNDRLLVYGFIPHCTQATLCALIDEKEFQAMVSTTELQKTTGTMIHKLTARAL 1129
Query: 589 ID-----LLTAQAWFSEDKR--LEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESP 639
I +L E K+ L+ + K+S + ++ ++T + V E+ + +ESP
Sbjct: 1130 IRDYEDGILHENETSHEMKKQSLKSLIVKLSKENSLITQFTSFVAV---EKRDGNESP 1184
>gi|260824341|ref|XP_002607126.1| hypothetical protein BRAFLDRAFT_118667 [Branchiostoma floridae]
gi|229292472|gb|EEN63136.1| hypothetical protein BRAFLDRAFT_118667 [Branchiostoma floridae]
Length = 388
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
+ F Y P V K+VIF++D+SGSM G + TK A+AV L +L D FNIV
Sbjct: 295 DYFVHYFAP---TGLPVMPKNVIFVIDVSGSMSGDKILQTKEAMAVILDELRETDFFNIV 351
Query: 368 AFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGS 404
F+ T + + +++A +E++ A ++ GG+
Sbjct: 352 TFSDRTKRWKSELQVANQESIRAADTYVTSMVAQGGT 388
>gi|301617149|ref|XP_002938014.1| PREDICTED: poly [ADP-ribose] polymerase 4-like [Xenopus (Silurana)
tropicalis]
Length = 952
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 80/341 (23%), Positives = 145/341 (42%), Gaps = 27/341 (7%)
Query: 302 HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
H + E + LP SR +D+ ++D S SM+ + K +A S L+P
Sbjct: 567 HPDEDSEACMLVFLPSFETSRHY--RDLTILLDCSNSMEST-FQTAKFIALLAASSLNPW 623
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS 421
+ NI++F G Y K + Q+I + G+T + PL L
Sbjct: 624 HNINIISF-GTGYKEFRIRPKENKNLLPEVEQFIKMAKPNMGNTELWKPLQSLC--LLAP 680
Query: 422 RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+ + L++DG +++E + +K N G + R++T G+G+ N + LR LA
Sbjct: 681 PSDMHNVLLISDGHIQNESLVFQILKK---NAGKV--RLFTCGVGATANRHMLRCLARYG 735
Query: 482 RGYYGAAYD------LDSIEIQMQKLFTRGFSSVLANIAIDTLKDLD-EFEMYPSRIPDL 534
G++ D +E Q+QK+ + +S A++ D D E P+ IP L
Sbjct: 736 AGFFEFFEDKAKSSWKKKMEAQLQKMDSPACTS--ASVKWMQFSDNDPEPVQAPAHIPAL 793
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID---- 590
+ L+V G L+A +L V +LQ + L ++ A+ I
Sbjct: 794 FNPCCLLVYGFAHHCTQAKLQALIDDNELDTMVSTTELQKTRGTMLHKLTARAVIRDYED 853
Query: 591 -LLTAQAWFSEDKRLEEK--VSKMSVQTGVLCEYTRMIIVE 628
+L + E K+ + K + ++S + ++ ++T + VE
Sbjct: 854 GILHEKEHEHEMKKQQMKSFIIELSKKYSIITQFTSFVAVE 894
>gi|123509108|ref|XP_001329794.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121912842|gb|EAY17659.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 694
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 95/200 (47%), Gaps = 16/200 (8%)
Query: 321 SRKVFKKDVIFIVDISGSMQ-GKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFS 377
+RK K ++F++D SGSM +E+ A+ + L L+PG F IV F G T+ LF
Sbjct: 231 NRKTDVKSIVFLLDCSGSMTIDNRIENAIKAMDLFLHSLEPGVKFEIVRF-GSTFNSLFD 289
Query: 378 TSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE 437
+ +++ A +I G T I P+ + L S ++F++TDGAV+
Sbjct: 290 FKLTEYNDDSLNTALAFIKGTSANLGGTEIFNPIKQIYNEL-----SPDVLFVLTDGAVD 344
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
+ + + D ++ T +I++ G+G+ + +R LA + G D +
Sbjct: 345 NSQAVLDFVRDSST-------KIFSLGLGAGADMNLVRNLASFTGGVSEHVLDASQLRDS 397
Query: 498 MQKLFTRGFSSVLANIAIDT 517
+ +L + L+N+ I++
Sbjct: 398 IIRLLEDSTNPTLSNVQIES 417
>gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4
precursor (ITI heavy chain H4) (Inter-alpha-inhibitor
heavy chain 4) (Inter-alpha-trypsin inhibitor family
heavy chain-related protein) (IHRP) (Plasma kallikrein
sensitive glycoprotein 120) (P [Tribolium castaneum]
gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum]
Length = 824
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 116/621 (18%), Positives = 240/621 (38%), Gaps = 81/621 (13%)
Query: 93 LDVDCYLDTAFVRVSGTWRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLI 152
+ +D + F T +V + + + + +V + + I E EI GK Y +
Sbjct: 55 MHIDSNVTNRFATTLVTSKVRNLKKT-AAETTFSVVLPENAFISAFEMEIEGKIYKAHVK 113
Query: 153 ALGEND-------GAGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDG 205
E G+SA+ + FT+++ I+ S + RL + + L + G
Sbjct: 114 EKQEAKQIYQEAVSRGQSAAHVELNARDSKTFTVSV-NIEPESKTTFRLTYEELLKRQLG 172
Query: 206 EFSVNVPFKFPEYVTPAIKKIPKREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGY 265
++ + + + P ++ + +++N + S ++ ++ KL
Sbjct: 173 QYELVI------NIHPG--QLVDDLDVRVHINESRPLTFVKTPSLRTGNEISKNADKLDP 224
Query: 266 SYESEVLKWSNIDFDFSYTVSPSHIFGGVL------------------LQSPSLHDVDQR 307
S + E++ ++ +FS FG +L + P +V R
Sbjct: 225 SAKIEIMNSTSALVEFSPDKGKQKEFGQLLGSEKESGLAGQFIVQYDVERDPQGGEVLVR 284
Query: 308 EMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIV 367
+ + ++ + K K ++F++D SGSM G+ ++ A+ L+ L D FNIV
Sbjct: 285 DGYFVHFF--SPSELKPLPKHIVFVLDHSGSMSGRKIDQLIEAMQNILTDLKETDLFNIV 342
Query: 368 AF------------------------NGETYLFSTSMELA---TKEAVERAHQWIGINFI 400
F N E +L ++ A T+E +E A + I +
Sbjct: 343 RFGDLAMVWDVSQNQFTQLPNFNEYGNLEPHLREINLPRAVNGTEENIEAAKKIIE-DKS 401
Query: 401 AGGSTNICAPLTKAVEMLTNSRGSI-----PIIFLVTDGAVEDERQICDAMKSRLT--NG 453
G TN+ L + ++ ++ P+I +TDG D + + +T N
Sbjct: 402 RLGMTNMMYGLEVGLFLIKRTQEETPDKYQPMIVFLTDGHPNAGMSGRDEITNTVTSLNS 461
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANI 513
G I++ G + + FLR ++ + G+ Y+ +Q+Q + + +L+N+
Sbjct: 462 GKKKASIFSLSFGDFADKRFLRKISSKNSGFSRHIYESSDASLQLQDFYRAISAPLLSNV 521
Query: 514 AIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQ 573
+ D+ + + + P L + S L+V+GR P L + +
Sbjct: 522 NFKYVDDVKDVTV--THYPILFNGSELVVAGRINDN-PHVLNHVEAWCPIGPIKFNATIN 578
Query: 574 LAKDIPLDRICAKQQIDLLTAQAWFSED-KRLEEKVSKMSVQTGVLCEYTRMIIV----- 627
L+R+ A + + Q +E+ L +K +S++ + T +++V
Sbjct: 579 TEAVSSLERLWAYLTVKQMLKQKETAENAANLTKKALDLSLKYSFVTPITSLVVVKPNQT 638
Query: 628 ETDERNNASESPGTKKGSKKS 648
E + A SPG G++ +
Sbjct: 639 EAVDSEAAKSSPGRYDGARSA 659
>gi|428216179|ref|YP_007089323.1| hypothetical protein Oscil6304_5940 [Oscillatoria acuminata PCC
6304]
gi|428004560|gb|AFY85403.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Oscillatoria acuminata PCC 6304]
Length = 411
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 102/246 (41%), Gaps = 30/246 (12%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM G+PL K A + +L PGD +IVAF+ + ++ + E +R
Sbjct: 46 LILDHSGSMTGRPLTTVKEAAHRLIDRLKPGDRLSIVAFDHRAKVIVSNQAI---EDTDR 102
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTN-SRGSIPIIFLVTDGAVEDERQICDAMKSR 449
+ I A G T I + K +E L + G++ F++TDG E+E D
Sbjct: 103 IKKQID-KLKADGGTAIDEGMKKGIEELKKGATGTVSQAFILTDG--ENEHGSNDRCLKL 159
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
+ + T G G + N L +A +RG + + +LF+R S
Sbjct: 160 ASEAADANLTLNTLGFGDHWNLDVLESIATAARGALSHIERPEQAVEEFSRLFSRIQSVA 219
Query: 510 LAN----------IAIDTLKDLDEFE-------------MYPSRIPDLSSESPLIVSGRY 546
L N + + LK + + ++ R+ D+ + +I++ Y
Sbjct: 220 LTNAYLQLKLMPKVRLAELKPVAQVSPETIELPVEAKNGLFEVRLGDIMTTERVILANLY 279
Query: 547 QGKFPD 552
GKF +
Sbjct: 280 IGKFAE 285
>gi|194332733|ref|NP_001123856.1| uncharacterized protein LOC100170623 [Xenopus (Silurana) tropicalis]
gi|189442535|gb|AAI67735.1| LOC100170623 protein [Xenopus (Silurana) tropicalis]
Length = 1380
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 79/352 (22%), Positives = 152/352 (43%), Gaps = 30/352 (8%)
Query: 302 HDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPG 361
H + E + LP S + + D+ ++D S SM+ + K +A S L+P
Sbjct: 850 HPKEDSEACMLVFLPSFKTSIQSW--DLTILLDCSNSMEST-FQSAKRIALLAASSLNPW 906
Query: 362 DSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS 421
+ NI++F FS + ++ + Q+I + G+T + PL L
Sbjct: 907 HNINIISFGTGHKEFSIRPK-ESQNLIPELEQFIKMAKPNMGNTELWKPLQSLC--LLAP 963
Query: 422 RGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMIS 481
+ + L++DG +++E + +K N G + R++T G+G+ N + LR LA
Sbjct: 964 PSDMHNVLLISDGHIQNESLVFQILKK---NAGKV--RLFTCGVGATANRHMLRCLAQYG 1018
Query: 482 RGYYGAAYDLD------SIEIQMQKLFTRGFSSVLANIAIDTLKDLD-EFEMYPSRIPDL 534
G++ D +E Q++++ + +S A++ D D E P+ IP L
Sbjct: 1019 AGFFEFFEDKSKTSWKKKMEAQLERMDSPACTS--ASVKWMQFSDNDPEPVQAPAHIPAL 1076
Query: 535 SSESPLIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID---- 590
+ S +V G L+A +L V +LQ + L ++ A+ I
Sbjct: 1077 FNRSFFLVYGFAPRCTQAKLQALIDDDELDTMVSTTELQKTRGTMLHKLTARAVIKDYED 1136
Query: 591 -LLTAQAWFSEDKRLEEK--VSKMSVQTGVLCEYTRMIIVETDERNNASESP 639
+L + E K+ + K + ++S + ++ ++T + V E+ +A E P
Sbjct: 1137 GILHEKEHEHEMKKQQMKSFIIELSKKYSIVTQFTSFVAV---EKRDAQEKP 1185
>gi|119613594|gb|EAW93188.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_c [Homo
sapiens]
Length = 1350
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 129/320 (40%), Gaps = 67/320 (20%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF Y V I G V ++D + F Y P + +K+V+F++D+S
Sbjct: 245 DFLVQYDVVMEDIIGDV-----QIYD----DYFIHYFAP---RGLPPMEKNVVFVIDVSS 292
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMEL-ATKEAVERAHQWI- 395
SM G +E TK A+ V LS L D FNI++F+ ++ + AT + V A ++
Sbjct: 293 SMFGTKMEQTKTAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLH 352
Query: 396 -----GINFIAGGS------------------------------TNICAPLTKAVEMLTN 420
G + GG T++ + L A +L +
Sbjct: 353 CMEADGSRRLEGGVLALGACQSGSGMRHLPPRPCTTKETDKEQWTDVNSALLAAASVLNH 412
Query: 421 SR---------GSIPIIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGS 467
S G IP+I +TDG V I ++ L + S +++ G
Sbjct: 413 SNQEPGRGPSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQALGHRVS----LFSLAFGD 468
Query: 468 YCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMY 527
+ LR L++ +RG Y+ +Q++ L+ +LA++ ++ L L +
Sbjct: 469 DADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLADVRLNYLGGLVGASPW 528
Query: 528 PSRIPDLSSESPLIVSGRYQ 547
+ P+ S L+V+G+ Q
Sbjct: 529 -AVFPNYFGGSELVVAGQVQ 547
>gi|328875636|gb|EGG24000.1| hypothetical protein DFA_06138 [Dictyostelium fasciculatum]
Length = 832
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 90/197 (45%), Gaps = 17/197 (8%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELAT 384
K + I IVD SGSM G +E K AL + + L+ FNIV+F +G LF +S
Sbjct: 340 KMECIIIVDRSGSMSGDRIESAKRALEIMMRSLNENTMFNIVSFGSGFKKLFPSSTRY-N 398
Query: 385 KEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP----IIFLVTDGAVEDER 440
++E+A I G T + P+ + M ++P +F++TDGAV
Sbjct: 399 DASLEKASSLIASMDADMGGTELFEPIKDVLAM-----PALPEYPRQVFILTDGAVSQRE 453
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS-IEIQMQ 499
+ + S RI+T GIG+ + + L+ +G+Y D S E ++
Sbjct: 454 VLIKYVASECN-----TTRIFTLGIGNGVDKELVIGLSKACKGHYEMVSDTSSGFEERVL 508
Query: 500 KLFTRGFSSVLANIAID 516
L + +L+N+ I+
Sbjct: 509 SLLSIATQPMLSNVHIN 525
>gi|440804026|gb|ELR24909.1| von Willebrand factor type A domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 925
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 148/355 (41%), Gaps = 49/355 (13%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGE-NDGAGKSASVETGSFL----KPNIFTLTLP 182
P+ + ++ E E+ GK ++ + D G + + +G++L K IF + +
Sbjct: 84 PLDSRAAVCRFEVELDGKRVVGSVLPHAQARDAYGDAIAEGSGAYLLEQKKEEIFQMNVG 143
Query: 183 QIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP-------------AIKKIP-- 227
+ G +I + + +L +GE V F P + P A+K++
Sbjct: 144 NLPPGKECTIYITYVAELPL-EGEM---VRFTLPTTIAPRYQGQGTTASEREALKEVTTM 199
Query: 228 ----------KREKIHLNVNAGTGTEVL-CNTSSHHLK-QLRRDVGKLGYSYESEVLKWS 275
K + L++ + VL + SH ++ QL D G + + +
Sbjct: 200 SSTSSSSSSSKPYGLELDIAIEMPSPVLSVESPSHPIRFQLNGDTPTKGRV--TLATQET 257
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDI 335
+D DF + P L +R+ + L P + + K++ IF++D
Sbjct: 258 ALDRDFVLMIQQKDA------HQPGLWVDSERQAVMLSLYPKFIEVEQ--KREFIFLLDR 309
Query: 336 SGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQW 394
SGSM G + D KNAL + L L G FNIV F + +LF S + E +
Sbjct: 310 SGSMAGSSMNDAKNALQLFLRSLPEGCKFNIVGFGSKYEFLFRQSRAYDDRSLQEASDAV 369
Query: 395 IGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSR 449
+N GG T + AP+T+ +E ++ S IF++TDG V + + ++ R
Sbjct: 370 PRMNANMGG-TELLAPITEILEDHSDPSYSRQ-IFVLTDGEVSNTEAVLRTIRER 422
>gi|149920555|ref|ZP_01909022.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1]
gi|149818599|gb|EDM78045.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1]
Length = 820
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 70/157 (44%), Gaps = 7/157 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
+D+I ++D SGSM+G+PL + + L D +V F+ +S + +
Sbjct: 302 RDLIVLLDTSGSMRGEPLAHAQAVTEALIRSLRDRDRLELVEFSSRVRRWSQAPASMSAA 361
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
E A +W+G +GG+ L + ++ I L+TDG + E +I A
Sbjct: 362 KREEALRWVGALRASGGTHMRDGILAALASLRPEAQRQ---ILLITDGLIAFESEIVQAA 418
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ G C R++T GIGS N R +A+ G
Sbjct: 419 RQHRPPG---C-RVHTLGIGSSVNRSLTRPVALAGGG 451
>gi|402901592|ref|XP_003913730.1| PREDICTED: poly [ADP-ribose] polymerase 4 [Papio anubis]
Length = 1717
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 123/557 (22%), Positives = 225/557 (40%), Gaps = 71/557 (12%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGKIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L V+ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTVSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVHLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + + A
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTGYKELFSYPKHITSNIA- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG +++E ++
Sbjct: 937 --ATEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQNE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIG N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERN 633
++ ++T + VE + N
Sbjct: 1168 LITQFTSFVAVEKRDEN 1184
>gi|355754576|gb|EHH58477.1| Poly [ADP-ribose] polymerase 4 [Macaca fascicularis]
Length = 1724
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 125/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVYLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTGYKELFSYPKHITSNIV- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --ATEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIG N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRVFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|398409016|ref|XP_003855973.1| hypothetical protein MYCGRDRAFT_29785, partial [Zymoseptoria
tritici IPO323]
gi|339475858|gb|EGP90949.1| hypothetical protein MYCGRDRAFT_29785 [Zymoseptoria tritici IPO323]
Length = 899
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 92/207 (44%), Gaps = 12/207 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
+++FI D SGSM G + AL V L L G FNI AF E +E
Sbjct: 276 EIVFIADQSGSMSGAKNDSLVAALKVFLKSLPFGVRFNICAFGNSYKFLWPESEAYNEEN 335
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVEDERQICDAM 446
+ A ++ + G T + P+ +A R +P+ + L+TDG + +E+ + + +
Sbjct: 336 LNTAVTFVELLRADYGGTQMLEPVKEA---FLRKRADLPMEVMLLTDGEIWEEKTLFEFL 392
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
++ + R++ GIG+ +H + +A RG+ + + ++ ++
Sbjct: 393 NEQIQE-KKVDARVFALGIGNDVSHTLVEGVARAGRGFAQFVTQTEETDRKVMRMLKGAL 451
Query: 507 SSVLANIAID-------TLKDLDEFEM 526
+ + ++D T+ + D+FE+
Sbjct: 452 CAHTQDYSLDVHYEEEGTILEEDDFEL 478
>gi|327269054|ref|XP_003219310.1| PREDICTED: poly [ADP-ribose] polymerase 4-like [Anolis carolinensis]
Length = 1734
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 117/562 (20%), Positives = 226/562 (40%), Gaps = 80/562 (14%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D ++ G EA I GK ++ + + A ++ G++L P+I
Sbjct: 674 EAKYVFPLDDTAAVCGFEAFIKGKHIIGEVKEKEKAHKEYRDAVIQGHGAYLMDQDAPDI 733
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP-------------AI 223
FT+++ + + + I++ + +L+Y G FS F P +TP A+
Sbjct: 734 FTVSVGNLPPSTKVLIKITYVTELNYDYGRFS----FHLPAALTPWQQDKALNENTQDAV 789
Query: 224 KKIPKRE--------KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWS 275
+K+ R+ + ++V E + + +H LK + D + + ++ L
Sbjct: 790 RKVAIRQVGTNIGEFSLQMSVEMPFKIESI-RSWTHELKIKKTDCKAVVCTKDASSLGME 848
Query: 276 NIDFDFSYTVSPSHIFGGVLLQSPSLH-DVDQREMFCMYLLP---GTAKSRKVFKKDVIF 331
D + G V L + D++ CM + T + +
Sbjct: 849 GFTMDI--------VNGNVYLPRMWIEKHPDKKSEACMLVFQPEFDTPFDPIHTSGETVI 900
Query: 332 IVDISGSMQGKPLEDTKNALAVALSKLDP---GDSFNIVAFNGETYLFSTSMELATKEAV 388
+D S SM G ++ K +ALS L+ + ++ F F + E
Sbjct: 901 CLDCSNSMAGSKIQHAKQ---IALSYLNSCYDTEKLTVIKFGTNFIEFPYNFE-----DF 952
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
E Q+I G+++ L + +L S+G I+F ++DG +++E A+
Sbjct: 953 EAMKQFIMSATATMGNSDFWKTL-HYLSLLYPSKGKRNILF-ISDGHIQNE-----AVTL 1005
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY-----GAAYDLDSIEIQMQKLFT 503
+L + RI+T G+G N + LR L+ G + + YD + +Q QK
Sbjct: 1006 QLVKQNAKHTRIFTCGVGPAANRHMLRSLSQYGAGAFEYFDEKSKYDWERKVMQQQK--- 1062
Query: 504 RGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFL 560
R +S + I+I + D E P++I L L+V G L A
Sbjct: 1063 RRWSPECSAISIKWQQFDTDAPELMYAPAQIQSLFEHERLLVYGFISNCTQAALYALLNS 1122
Query: 561 GDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSV 613
+L V +LQ L ++ A+ I +L +E K+ L+ + K+S+
Sbjct: 1123 QELQTIVSTTELQKTTGTILHKLAARAFIRDYEQGILHEDEAENEMKKQLLKSMIIKLSM 1182
Query: 614 QTGVLCEYTRMIIVET-DERNN 634
+ ++ ++T + +E DE+ +
Sbjct: 1183 ENSIVTQFTSFVAIEKRDEKED 1204
>gi|443705946|gb|ELU02242.1| hypothetical protein CAPTEDRAFT_227853 [Capitella teleta]
Length = 856
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 102/240 (42%), Gaps = 7/240 (2%)
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
+++ + P +S +++ IFIVD SGSM G ++ + AL + L L FNI
Sbjct: 261 QDVIMLNFFPHLLESDYSEQQEFIFIVDRSGSMGGDRIQSAREALLLFLKSLPISCYFNI 320
Query: 367 VAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
V+F + LF ++E+++ A + G T I PL E +
Sbjct: 321 VSFGSQFQALFPEGSRAYSEESLQAALELQSSMQANMGGTEILRPLRAVFEEMQLIDDFP 380
Query: 426 PIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+FL+TDG V + + + +K S R++TFGIG + ++ A + RG
Sbjct: 381 RQLFLLTDGEVSNTEAVLNLIKVN-----SDHTRVFTFGIGDGASTSLVKGAARVGRGRA 435
Query: 486 GAAYDLDSIEIQMQKLFTRGFSSVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSG 544
D ++ ++ + + ++ +D L P ++ L S LI+ G
Sbjct: 436 EMVIGRDRLQAKVMRSLKASMQPPICDLHLDWQLPGCSSVVQIPHQLRPLVSGDALILFG 495
>gi|358378271|gb|EHK15953.1| hypothetical protein TRIVIDRAFT_232424 [Trichoderma virens Gv29-8]
Length = 1413
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 89/186 (47%), Gaps = 27/186 (14%)
Query: 317 GTAKSRKVFKKDVIFIVDISGSM--------QGKPLEDTKNALAVALSKLDPGDSFNIVA 368
G+A + ++FK +++FI+D SGSM Q ++ KNA+++ALS L FNI++
Sbjct: 368 GSAVNPQLFKGEILFILDQSGSMGWTQSARDQRLKIDTMKNAMSLALSGLPSTCRFNIIS 427
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAG-----GSTNICAPLTKAVEMLTNSRG 423
F E M L ++ A E A+ + ++ G T I L AV S
Sbjct: 428 FGSEV----RGMWLRSRGADEPANLSYAMEYLDTVGANMGGTEILLALKGAVNNCEPSLP 483
Query: 424 SIPIIFLVTDGAVEDERQICDAM-----KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
S II L+TDG V E DA+ ++R G I R +T GIG +H + +A
Sbjct: 484 STQII-LITDGEVHHEPH--DAIMEFVWETRQKFGEKI--RFFTLGIGDRVSHRVVEGIA 538
Query: 479 MISRGY 484
+ GY
Sbjct: 539 ELGGGY 544
>gi|374309290|ref|YP_005055720.1| von Willebrand factor type A [Granulicella mallensis MP5ACTX8]
gi|358751300|gb|AEU34690.1| von Willebrand factor type A [Granulicella mallensis MP5ACTX8]
Length = 806
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 15/208 (7%)
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
P A + +DV+ ++D S SMQ LE + AL L L P D F+++ FN + L
Sbjct: 302 PVAAAPTQTAPRDVVLLLDTSLSMQWDNLERSFAALQAVLLSLKPADHFSLLLFNQDVTL 361
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK--AVEMLTNSRGSIPIIFLVTD 433
F AT EAV++A ++ + + GG T++ LT A L NS +FL+TD
Sbjct: 362 FKPEPLAATPEAVQQALDFVRASKLRGG-TDLGKALTAGLAQAKLPNSS-----LFLLTD 415
Query: 434 GAVED-----ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAA 488
G + +I A + + PR F +G N L++LA + G
Sbjct: 416 GNSDRGTTVITGKIATAYAQQWKQSPT-HPRTNIFAVGDDANLPLLKLLAQ-NDGVLENV 473
Query: 489 YDLDSIEIQMQKLFTRGFSSVLANIAID 516
+ +E ++ S + + +D
Sbjct: 474 LATEPVEAKLNTFLAHSTSHPVGGLHLD 501
>gi|218185115|gb|EEC67542.1| hypothetical protein OsI_34867 [Oryza sativa Indica Group]
Length = 59
Score = 62.0 bits (149), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/34 (70%), Positives = 28/34 (82%)
Query: 47 PKSPMVYAVITDPKIVDNPDIPSYQPYVHGRCDP 80
P +PM YAV+TDP VD PD+PSYQPYV+GR DP
Sbjct: 24 PSAPMAYAVVTDPGTVDTPDMPSYQPYVYGRLDP 57
>gi|321472034|gb|EFX83005.1| hypothetical protein DAPPUDRAFT_302138 [Daphnia pulex]
Length = 843
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 149/360 (41%), Gaps = 33/360 (9%)
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKF-PEYVTPAIKKIPKREKI 232
P++F + + GS IRL + +L E +P P Y+ P K +
Sbjct: 108 PDVFKAKVGNLAPGSGAKIRLTYVTELKVEGKEIRFYLPTTIAPRYIPPTDKSSTAADLA 167
Query: 233 HLNVNAGTGTEVLCNTS---SHHLKQLRRDVGKLGYSYES-EVLKWSNIDFDFSYTVSPS 288
+ + + +V N + + + ++R K+ + + ++ + FD T
Sbjct: 168 SIKYSTESNYQVDFNATVQMASAITEIRSPTHKIAVTRDVVNKPQYGTLRFDNVVTSMDR 227
Query: 289 HIFGGVLLQSPSLHDVDQREMFC-----------MYLLPGTAKSRKVFKKDVIFIVDISG 337
+ + + P +Q + C + L+P + + K ++IF+VD SG
Sbjct: 228 DLVVYIQVAEP-----NQPRLICEKSPKGTTALMLSLVPSFKLTEQ--KTELIFLVDRSG 280
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWI 395
SM G + K AL + L L P D + +A G +Y LF TS + + + A Q +
Sbjct: 281 SMGGSGINQAKQALQLFLHSL-PLDCYVNIAGFGSSYEELFPTSRKY-DETVLNLAKQHV 338
Query: 396 GINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGS 455
G T I PL + + + G + IF++TDG V + +I + R NG +
Sbjct: 339 DSIDANLGGTEIFHPL-EIIFKKPSIEGYLRQIFVLTDGEVSNADEILGLI--RRQNGQA 395
Query: 456 ICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
R++ G+GS +HY + +A G A + +E ++ + L++I I
Sbjct: 396 ---RVFALGLGSSASHYLVEGMARAGNGTALFASLEERLEKKVMQQLQDSLQPALSDIKI 452
>gi|393240751|gb|EJD48276.1| hypothetical protein AURDEDRAFT_113100 [Auricularia delicata
TFB-10046 SS5]
Length = 842
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 92/208 (44%), Gaps = 7/208 (3%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL-DPGDSFNIVAFNGE-TYLFSTSME 381
V ++ IF+VD SGSM G + ++AL + L L G +FN+ +F T L+ S
Sbjct: 277 VPSQEYIFLVDRSGSMSGGRMSAVRSALQIMLRSLPTKGTTFNVFSFGDNCTSLWPASAP 336
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
A E A G++ GG T I L K V+ + +F+VTDG D R
Sbjct: 337 YAKDSVDEAARHVEGMSANYGG-TEIADALRKVVQSRKQNLKRPTALFVVTDGEAWDVRA 395
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKL 501
+ + + + R++ G+G+ + +A RG + + + ++ L
Sbjct: 396 VFKVVADGVADSDKAL-RVFVLGVGNMVSTEMCDGIARAGRGVVTYVREQEKPDAKLVGL 454
Query: 502 FTRGFSSVLANIAID---TLKDLDEFEM 526
S+V+ +I +D L+ +FEM
Sbjct: 455 LKAARSAVVQDITVDWGVPLEPDQDFEM 482
>gi|390369509|ref|XP_795437.3| PREDICTED: poly [ADP-ribose] polymerase 4-like [Strongylocentrotus
purpuratus]
Length = 364
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 91/191 (47%), Gaps = 20/191 (10%)
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKL 358
PS + +FC T +S V +V+ ++D S SM+G+P +D K + L L
Sbjct: 60 PSSNSYASMAVFC-----PTIQSELVADPEVVLLLDCSTSMKGEPKQDAKKICKMILQSL 114
Query: 359 DPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT--KAV 415
FN++ F + T LF T + ++ +E A ++I GGS+ PL +
Sbjct: 115 PEKSRFNVITFGTDFTELFPTVEPVGQRQLLE-ALEFIEGARSVGGSSEAWRPLRSLSLL 173
Query: 416 EMLTNSRGSIPIIFLVTDGAVEDERQICD-AMKSRLTNGGSICPRIYTFGIGSYCNHYFL 474
M+ ++R + LV+DG + +E+ + A K + N RI+T + S N + L
Sbjct: 174 PMMNSARN----VLLVSDGHLTNEKLTLEIASKYKHVN------RIFTCAVSSAGNRHIL 223
Query: 475 RMLAMISRGYY 485
R LA +S G +
Sbjct: 224 RALADVSGGAF 234
>gi|118370680|ref|XP_001018540.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89300307|gb|EAR98295.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 930
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 116/257 (45%), Gaps = 19/257 (7%)
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISG 337
DF+F + + + ++ ++ + F ++ T S+ F IF++D SG
Sbjct: 280 DFNFKFKSQINDAIVQSIAENQNIFSQEYLNSFNQQIIDQTDSSKCEF----IFLLDRSG 335
Query: 338 SMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKEAVERAHQWIG 396
SM G+ +++ AL + + L FNI +F E + LF S + + E VE A I
Sbjct: 336 SMSGQSIQNAIEALILFIKSLPLDSYFNIYSFGTEFSKLFDQSQKYSN-ENVELALNEI- 393
Query: 397 INFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGS 455
I + A G TNI PL++ +G I+++TDG +E++ + ++S +
Sbjct: 394 ITYSANYGGTNIYQPLSEIFNQ-PYVKGYGRQIYILTDGQIENKENVMHLIQS-----NN 447
Query: 456 ICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI 515
I R++ GIG Y + + A +G + D I+ + + S +L ++ +
Sbjct: 448 ISNRVHAIGIGLYVDKDLIIQSAKSGKGCHAHVTDQSLIQESIINILQNSISPILEDVKL 507
Query: 516 DTLKDLDEFEMYPSRIP 532
K E++ S+ P
Sbjct: 508 SYNK-----EIFNSQYP 519
>gi|414864757|tpg|DAA43314.1| TPA: putative RING zinc finger and VWF domain family protein [Zea
mays]
Length = 691
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 95/201 (47%), Gaps = 17/201 (8%)
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKK----DVIFIVDISGSMQGKPLEDTKNALAVA 354
P++ +E+F + + KS D++ ++D+SGSM G + KNA++
Sbjct: 205 PAIEQSTTKEIFAILIHLRAPKSSHSASSRAPLDLVTVLDVSGSMAGTKIALLKNAMSFV 264
Query: 355 LSKLDPGDSFNIVAFNGET-YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
+ L P D ++VAF+ LF + T ++A Q + + +A G TNI L K
Sbjct: 265 IQTLGPNDRLSVVAFSSTARRLF--PLRRMTLSGRQQALQAVS-SLVASGGTNIADGLKK 321
Query: 414 AVEMLTNSR--GSIPIIFLVTDG----AVEDERQICD---AMKSRLTNGGSICPRIYTFG 464
A +++ + R S+ I L++DG + +R + D + + G +I+TFG
Sbjct: 322 AAKVIEDRRLKNSVCSIILLSDGQDTYTLPSDRNLQDYSAVVPPSILPGTCQHVQIHTFG 381
Query: 465 IGSYCNHYFLRMLAMISRGYY 485
GS + + +A IS G +
Sbjct: 382 FGSDHDSAAMHAIAEISSGTF 402
>gi|444706587|gb|ELW47921.1| von Willebrand factor A domain-containing protein 5A [Tupaia
chinensis]
Length = 880
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 150/365 (41%), Gaps = 44/365 (12%)
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGK---------PLEDTKNALAVAL 355
D M Y P +++ + + +F++D SGSMQ +E K L + L
Sbjct: 339 DPSAMVSFY--PDIPEAQSPVRGEFVFLMDRSGSMQSPMSVQDRSQLRIEAAKETLILLL 396
Query: 356 SKLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
L G FN+ F G +Y LF S+E T+E +++A + I + G T I APL K
Sbjct: 397 KSLPIGCYFNVYGF-GSSYKALFRKSVEY-TQETMKKAVERIQVMQADLGGTEILAPLQK 454
Query: 414 AVEMLTNSRGSIPI-IFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHY 472
+ S P+ +F+ TDG V D + +K+ R + FGIG +
Sbjct: 455 IYK--APSIPGYPLQLFVFTDGEVTDTFSVIQEVKTNRQKH-----RCFAFGIGEGASTS 507
Query: 473 FLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI--DTLKDLDEFEMYPSR 530
++ +A +S G D ++ ++ + + V+ + ++ D L + P +
Sbjct: 508 LIKGIARVSGGASEFITGKDRMQSKVLRALKQSLQPVVQDFSLSWDLSPGLSAKMLSPEQ 567
Query: 531 IPDLSSESPLIVSGRYQGKFPDTLKAKG-------FLGDLSNFVVELKLQLAKD--IPLD 581
+ LI+ + G P +A G G V LQ D +
Sbjct: 568 TVIFRGQR-LIIYAQLTGAMPPA-EATGKVCLKYSLQGQSLEEKVTFSLQPKPDANFTIH 625
Query: 582 RICAK---QQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASES 638
R+ AK Q D+ + + S+ K V +S+ +GVL +T I + E N +
Sbjct: 626 RLAAKSLIQAKDMGSGETPASDKK----DVLNISMASGVLSSFTAFIAI-NKELNKPVQG 680
Query: 639 PGTKK 643
P T++
Sbjct: 681 PMTRR 685
>gi|403357978|gb|EJY78623.1| Ubiquitin [Oxytricha trifallax]
Length = 705
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 84/170 (49%), Gaps = 10/170 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATK 385
+ IF++D SGSM G ++ AL + L L G FN+V+F G Y LF S +L +
Sbjct: 131 EYIFLIDRSGSMDGDRIQLAVQALKLFLHSLPMGSKFNVVSF-GSKYNKLFEES-QLYNQ 188
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDA 445
+ ++A I G T I PL KA+ L + ++L+TDG V D +++ D
Sbjct: 189 KTFKQAIDAIKSFKANMGGTEIFLPL-KAIFSLPVDQELPRHLYLLTDGEVNDTKKVVDM 247
Query: 446 MKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
+K S +++ FGIG + ++ A+ +G+Y + + IE
Sbjct: 248 IKFH-----SDTTKVHCFGIGDGVSSELIQNAALAGKGHYSFISNPEDIE 292
>gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens]
Length = 946
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/197 (21%), Positives = 88/197 (44%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + + K
Sbjct: 309 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLFQLQKH 368
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
++ A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 369 RLQIAKRYIE-KIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVG 427
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ G+G ++ FL+ L+ + G Y Q++
Sbjct: 428 ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLK 487
Query: 500 KLFTRGFSSVLANIAID 516
K + + + +L N+ +
Sbjct: 488 KFYNQVSTPLLRNVQFN 504
>gi|414864756|tpg|DAA43313.1| TPA: putative RING zinc finger and VWF domain family protein [Zea
mays]
Length = 692
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 95/201 (47%), Gaps = 17/201 (8%)
Query: 299 PSLHDVDQREMFCMYLLPGTAKSRKVFKK----DVIFIVDISGSMQGKPLEDTKNALAVA 354
P++ +E+F + + KS D++ ++D+SGSM G + KNA++
Sbjct: 206 PAIEQSTTKEIFAILIHLRAPKSSHSASSRAPLDLVTVLDVSGSMAGTKIALLKNAMSFV 265
Query: 355 LSKLDPGDSFNIVAFNGET-YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTK 413
+ L P D ++VAF+ LF + T ++A Q + + +A G TNI L K
Sbjct: 266 IQTLGPNDRLSVVAFSSTARRLF--PLRRMTLSGRQQALQAVS-SLVASGGTNIADGLKK 322
Query: 414 AVEMLTNSR--GSIPIIFLVTDG----AVEDERQICD---AMKSRLTNGGSICPRIYTFG 464
A +++ + R S+ I L++DG + +R + D + + G +I+TFG
Sbjct: 323 AAKVIEDRRLKNSVCSIILLSDGQDTYTLPSDRNLQDYSAVVPPSILPGTCQHVQIHTFG 382
Query: 465 IGSYCNHYFLRMLAMISRGYY 485
GS + + +A IS G +
Sbjct: 383 FGSDHDSAAMHAIAEISSGTF 403
>gi|348521288|ref|XP_003448158.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5
[Oreochromis niloticus]
Length = 975
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/297 (22%), Positives = 134/297 (45%), Gaps = 24/297 (8%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G + TK+AL L L PGD+FN V+F
Sbjct: 285 FVHYFAP---KDLPAVPKNVVFVIDTSASMLGTKIRQTKDALFTILRDLRPGDNFNFVSF 341
Query: 370 NGETYLFSTSMEL-ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------S 421
+ + ++ + + T + A ++I + GG TNI + +L + S
Sbjct: 342 SNKIKVWQPNRLVPVTPLNIRDAKKFIYMLAPTGG-TNIDGAIQTGSSLLRDYLLGPDSS 400
Query: 422 RGSIPIIFLVTDG--AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAM 479
S+ +I +TDG V + + + +R C I+T GIG+ ++ L +A+
Sbjct: 401 PNSVSLIIFLTDGRPTVGEIQSVAILGNTRSAVQEKFC--IFTIGIGNDVDYRLLERMAL 458
Query: 480 ISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESP 539
+ G + ++ + + +L++I I+ ++ ++ + + + + S
Sbjct: 459 ENCGMMRRINEEADASAMLKGFYNEIGTPLLSDIRINYTENSVQY-VTQNLFTNYFNGSE 517
Query: 540 LIVSGRYQGKFPDTLKAKGFLGDLSNFVVELKLQLAKDIPL--DRICAKQQIDLLTA 594
++V+G+ + ++L + + ++ L D+PL +I ++ + L TA
Sbjct: 518 IVVAGKLTNESAESLHVQVTASNNDKSII-----LETDVPLRHRQIDTEKNVKLATA 569
>gi|380809832|gb|AFE76791.1| poly [ADP-ribose] polymerase 4 [Macaca mulatta]
Length = 1717
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 125/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVHLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTGYKELFSYPKHITSNIV- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --ATEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIG N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLR 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|357144074|ref|XP_003573161.1| PREDICTED: uncharacterized protein LOC100844482 [Brachypodium
distachyon]
Length = 719
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 93/207 (44%), Gaps = 36/207 (17%)
Query: 325 FKKDVIFIVDISGSM-QGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA 383
F D++ ++D+S SM Q L K A+ + KL P D +IV F+ + + L
Sbjct: 63 FGLDLVAVLDVSESMGQWDRLAKVKTAMQFVIKKLRPIDRLSIVTFS------TNAKRLC 116
Query: 384 TKEAVERAHQWIGINFI----AGGSTNICAPLTKAVEMLTNSR---GSIPIIFLVTDGAV 436
+V +A Q + AGGSTNI L ++LT+ R G + IF+++DG V
Sbjct: 117 HLRSVTKAFQAHLKELVDGLKAGGSTNIKHGLQTGQQVLTDRRLTGGRVASIFVLSDGEV 176
Query: 437 EDER--QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
D + + D +Y FG G+ C+H L +A S+G G +D
Sbjct: 177 NDAKVGDVSDVA-------------VYMFGFGADCDHKVLEGIARESKG--GTFNYVDDT 221
Query: 495 EIQMQKLFTRGFSSVLANIAIDTLKDL 521
E ++ FS +L + ++DL
Sbjct: 222 ENMIEP-----FSQILGGLLSVVIQDL 243
>gi|384945416|gb|AFI36313.1| poly [ADP-ribose] polymerase 4 [Macaca mulatta]
Length = 1717
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 125/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVHLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTGYKELFSYPKHITSNIV- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --ATEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIG N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLR 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|363737367|ref|XP_422758.3| PREDICTED: von Willebrand factor A domain-containing protein
5B2-like [Gallus gallus]
Length = 1264
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 6/206 (2%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
++V+F+VD +G+M G ++ K AL VAL L PG N+ F + S L + E
Sbjct: 349 REVLFLVDRTGAMSGPDVDRVKEALLVALKSLPPGTLLNMATFGTDLQPLFPSSRLCSNE 408
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAM 446
++RA +G +A GS ++ A L+ A+ + + AV D I +
Sbjct: 409 TLQRACCHLGELRVAAGSPDLLAALSWALAQPLHHGYPRQLFLFTAAAAVGDVGHILQLV 468
Query: 447 KSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGF 506
+ + + R ++FG+G L +A +SRG + ++ ++ K +
Sbjct: 469 RRQASTV-----RCFSFGMGPRACRRLLTAIAKLSRGRAEFLDLAERLQPKLIKALKKAM 523
Query: 507 SSVLANIAIDT-LKDLDEFEMYPSRI 531
+++I ID + D E + P+ I
Sbjct: 524 EPAVSDITIDWYVPDSTEALLSPTEI 549
>gi|380809830|gb|AFE76790.1| poly [ADP-ribose] polymerase 4 [Macaca mulatta]
Length = 1717
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 125/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVHLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTGYKELFSYPKHITSNIV- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L +RGS I+ LV+DG ++DE ++
Sbjct: 937 --ATEFIMSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIG N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLR 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 787
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 84/172 (48%), Gaps = 15/172 (8%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D+I ++D SGSM G+ +E+ KN + + L+ D +I+ FN S + +L
Sbjct: 243 DLICVIDNSGSMDGEKIENVKNTILQLIDMLNDHDRLSIITFN------SYAKQLCGLRK 296
Query: 388 VERAH----QWIGINFIAGGSTNICAPLTKAVEMLTN--SRGSIPIIFLVTDGAVED-ER 440
V + + Q I + A G TNI + L A +L N R S+ +FL++DG + +
Sbjct: 297 VNKDNKENLQKITKSIQADGGTNITSGLQTAFSILQNRKQRNSVSSVFLLSDGQDNNSDS 356
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG--YYGAAYD 490
+I + +++ C I++FG G+ + ++ +A I G YY D
Sbjct: 357 RIRNLLQTTYQQLQEECFTIHSFGFGNDHDGPLMQRIAQIKDGSFYYVERND 408
>gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 2033
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 83/167 (49%), Gaps = 15/167 (8%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D+I ++D SGSM G+ +E+ KN + + L+ D +I+ FN S + +L
Sbjct: 1605 DLICVIDNSGSMSGQKIENVKNTILQLIDMLNDNDRLSIITFN------SHAQQLCGLRK 1658
Query: 388 VERAH----QWIGINFIAGGSTNICAPLTKAVEMLTN--SRGSIPIIFLVTDGAVED-ER 440
V + + Q I + A G TNI + L A +L + R S+ IFL++DG + +
Sbjct: 1659 VNKDNKENLQKITKSIYANGGTNITSGLQTAFSILQSRKQRNSVSSIFLLSDGQDNNSDS 1718
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG--YY 485
+I + +++ C I++FG G+ + ++ +A I G YY
Sbjct: 1719 RIRNLLQTTYQQLQEECFTIHSFGFGNDHDGPLMQRIAQIKDGSFYY 1765
>gi|403335258|gb|EJY66804.1| von Willebrand factor type A domain containing protein [Oxytricha
trifallax]
Length = 1000
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 82/165 (49%), Gaps = 21/165 (12%)
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS--FNIVAFNGETYLFSTSMELATKEA 387
IFIVD SGSM+G+ ++ T +L + L L P D+ F I++F G Y +++ +K+
Sbjct: 335 IFIVDRSGSMEGQKMQITNQSLMLFLQSL-PSDNSMFQILSF-GTNY---SNLHSDSKKD 389
Query: 388 VERAHQWIGINF---------IAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
A+ ++F + G TNI PL A +M + +IFL+TDG ED
Sbjct: 390 EALAYNQENLDFALKKVEKFKASKGGTNIFKPLKAAFDMKSPDDTYEKVIFLLTDGQTED 449
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+ + +++ N +I+ FG+G C+ L+ L +G
Sbjct: 450 PYKCRELIQNMPKN-----TQIHCFGVGYSCDMRLLQQLTSKGKG 489
>gi|403335667|gb|EJY67012.1| von Willebrand factor type A domain containing protein [Oxytricha
trifallax]
Length = 892
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 82/166 (49%), Gaps = 21/166 (12%)
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS--FNIVAFNGETYLFSTSMELATKE 386
IFIVD SGSM+G+ ++ T +L + L L P D+ F I++F G Y +++ +K+
Sbjct: 334 AIFIVDRSGSMEGQKMQITNQSLMLFLQSL-PSDNSMFQILSF-GTNY---SNLHSDSKK 388
Query: 387 AVERAHQWIGINF---------IAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE 437
A+ ++F + G TNI PL A +M + +IFL+TDG E
Sbjct: 389 DEALAYNQENLDFALKKVEKFKASKGGTNIFKPLKAAFDMKSPDDTYEKVIFLLTDGQTE 448
Query: 438 DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
D + + +++ N +I+ FG+G C+ L+ L +G
Sbjct: 449 DPYKCRELIQNMPKN-----TQIHCFGVGYSCDMRLLQQLTSKGKG 489
>gi|146295744|ref|YP_001179515.1| von Willebrand factor type A domain-containing protein
[Caldicellulosiruptor saccharolyticus DSM 8903]
gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus
DSM 8903]
Length = 909
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 81/179 (45%), Gaps = 20/179 (11%)
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQG------KPLEDTKNALAVALSKLDPGDSFNIVA 368
L K R V V+ ++D SGSM G LE K+A A + L+ DS ++A
Sbjct: 397 LKNKEKERNV---AVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDHLESSDSVGVIA 453
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPII 428
F+ Y S +L +K V I + GG T I PLT+AV +L S+ +I
Sbjct: 454 FDHNFYWASKFGKLKSKNEVIENISTIQV----GGGTAIIPPLTEAVNLLKKSKAKDKVI 509
Query: 429 FLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG--YY 485
L+TDG E+ A ++ N +I T G+GS N L +A + G YY
Sbjct: 510 VLLTDGYGEEGGYEYPASIAKRNN-----IKITTIGVGSSINAPILSWMAAYTSGRFYY 563
>gi|344284561|ref|XP_003414034.1| PREDICTED: poly [ADP-ribose] polymerase 4 [Loxodonta africana]
Length = 1718
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 124/563 (22%), Positives = 229/563 (40%), Gaps = 83/563 (14%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK----SASVETGSFL----K 173
+ + P+ D+ ++ G EA I+GK + + E +GA + + S G++L
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHI---VGEVKEKEGARQEYQEAISQGHGAYLMDQDA 710
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP------------ 221
P +FT+++ + + + I++ + +LS + G +V F P V P
Sbjct: 711 PEVFTVSVGNLPPKAKVLIKITYITELSVQ-GTAAV---FCMPATVAPWQQDKALNENIQ 766
Query: 222 ------AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKW 274
+K+I ++ L++ + ++ +H LKQ R D + + E L
Sbjct: 767 DTVEKICVKEIGTKQSFSLSIYIEMPYVIEFISSDTHKLKQKRTDCKAVISTMEGSSLDS 826
Query: 275 SNIDFDFSYTVSPSHI-FGGVLLQSPSLHDVDQREM-FCMYL--------LPGTAKSRKV 324
+ HI F L + ++E CM + LP TAK
Sbjct: 827 NGFSL---------HIGFSDAYLPRMWVEKYPEKESEACMLVFQPDFNITLPETAK---- 873
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMEL 382
K +VI +D S SM+G+ K ALS +D NIV F G +Y LFS +
Sbjct: 874 -KSEVIICLDCSNSMEGEAFLQAKKIALYALSLVDKNQKINIVKF-GTSYKELFSYPKYI 931
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
+ + ++I G+T+ L + + +L S+G + I L++DG +++E
Sbjct: 932 TSNDM---PTEFIMAATPTMGNTDFWKTL-RYLNLLYPSQG-LRNILLISDGHIQNE--- 983
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQK 500
++ +L R++T G GS N + LR L+ G Y + + Q++
Sbjct: 984 --SLTFKLVKRNVQHTRMFTCGTGSTANRHILRTLSQCGAGVFEYFNSKSKHRWKTQIED 1041
Query: 501 LFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
TR S ++++ + +L E P+++ L L+V G TL A
Sbjct: 1042 QMTRLHSPSCHSVSVKWQQLSTNLPEPLQAPAQVQSLFHNDLLLVYGFIPHCTQATLSAL 1101
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSK 610
+ V +LQ + ++ A+ I +L + K+ L+ + K
Sbjct: 1102 IQEKEFCTMVSTTELQKTTGTMIHKLTARALIRDFEYGILHENETHHQMKKQVLKSLIIK 1161
Query: 611 MSVQTGVLCEYTRMIIVETDERN 633
+S + +L ++T + VE N
Sbjct: 1162 ISKENSLLTQFTSFVAVEKRGEN 1184
>gi|149437045|ref|XP_001515975.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5
[Ornithorhynchus anatinus]
Length = 949
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/385 (21%), Positives = 148/385 (38%), Gaps = 49/385 (12%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F Y P K K+V+F++D S SM G L+ TK AL L L P D+FNIV F
Sbjct: 290 FVHYFAP---KDLPPLPKNVVFVLDSSASMVGAKLKQTKEALFTILHDLRPEDNFNIVGF 346
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN-------SR 422
+ ++ + T ++ +I + G TNI L + +L +
Sbjct: 347 SSRIKVWKDQLVPVTPNSIRDGKVYIH-HMSPSGGTNINGALQTGIRLLNDFVAHNDIDA 405
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPR----IYTFGIGSYCNHYFLRMLA 478
S+ +I +TDG + + ++ N R ++T GIG + L L+
Sbjct: 406 RSVSLIVFLTDG----RPTVGEIQTPKILNNTKEAARDRVCLFTIGIGDDVDFKLLEKLS 461
Query: 479 MISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSES 538
+ + G Q++ + + +L++I +D E + P+ + S
Sbjct: 462 LENCGMTRRFQVEADAAAQLKGFYDEIGTPLLSDIRVDYPAGSVE-HVTKKLFPNYFNGS 520
Query: 539 PLIVSGRYQGKFPDTLKAKGFLGDLSNFV-----VELKLQLAKDIP-------------- 579
++++G+ DTL + + FV V + ++P
Sbjct: 521 EIVIAGKLVDPTLDTLHVEVTASNGKKFVMLTTDVAVGAPKGNEVPGAVPGSEAEDGEDP 580
Query: 580 --LDRICAKQQI-DLLTAQAWFSEDKRLE-----EKVSKMSVQTGVLCEYTRMIIVETDE 631
++R+ + I +LLT AW D E E+ ++V L T M + ET
Sbjct: 581 NYVERLWSYLTIKELLT--AWLKSDNEQEKADLMERARALAVTYHFLTPVTTMKVRETPA 638
Query: 632 RNNASESPGTKKGSKKSDHQKILDS 656
+ + G S + + ++ S
Sbjct: 639 QAGKRKEAGGDGDSAATGPRDVMQS 663
>gi|426374946|ref|XP_004054315.1| PREDICTED: poly [ADP-ribose] polymerase 4 [Gorilla gorilla gorilla]
Length = 1645
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 142/332 (42%), Gaps = 29/332 (8%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFS-TSMELAT 384
+ +VI +D S SM+G K ALS + NI+ F G + FS SM A
Sbjct: 850 ESEVIICLDCSSSMEGVTFLQAKQIALHALSLVGEKQKVNIIQF-GTGHKFSYVSMMTAN 908
Query: 385 KEAVERAHQWIGINFIAG-----GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDE 439
+ +++ + ++F+ G+T+ L + + +L +RGS I+ LV+DG ++DE
Sbjct: 909 GDCIQQGEHNLSLHFLQSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE 966
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQ 497
++ +L R++ GIGS N + LR+L+ G Y A S Q
Sbjct: 967 -----SLTLQLVKRSRPHTRLFACGIGSTANRHILRILSQCGAGVFEYFNAKSKHSWRKQ 1021
Query: 498 MQKLFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTL 554
+ TR S ++++ + D E P+++P L L+V G TL
Sbjct: 1022 TEDQMTRLCSPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATL 1081
Query: 555 KAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEK 607
A + V +LQ + ++ + I +L E K+ L+
Sbjct: 1082 CALIQEKEFCTMVSTTELQKTTGTMIHKLATRALIRDYEDGILHENETSHEMKKQTLKSL 1141
Query: 608 VSKMSVQTGVLCEYTRMIIVETDERNNASESP 639
+ K+S + ++ ++T + VE + N ESP
Sbjct: 1142 IIKLSKENSLITQFTSFVAVEKRDEN---ESP 1170
>gi|118357570|ref|XP_001012034.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89293801|gb|EAR91789.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 894
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 79/160 (49%), Gaps = 10/160 (6%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELA 383
+ + IF++D SGSM G+P++ AL + L L FN+V+F G +Y LF +S++
Sbjct: 311 RSEFIFLLDRSGSMIGQPIQKACEALILFLKSLPTDSYFNVVSF-GSSYEKLFQSSIKYD 369
Query: 384 TKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
T+ ++E+A I T I PL + T G IFL+TDG VE +++
Sbjct: 370 TR-SLEKAINQIKEYTANLLGTEIYKPLQNIFKE-TKIDGYNKQIFLLTDGEVESPKKVV 427
Query: 444 DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
+L + RI + G GS + Y + A+ +G
Sbjct: 428 -----QLIRKNNKFNRINSIGFGSGADKYLIEESAIAGKG 462
>gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor]
gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor]
Length = 698
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/229 (25%), Positives = 106/229 (46%), Gaps = 23/229 (10%)
Query: 276 NIDFDFSYTVSPSHIFGGVLLQS----PSLHDVDQREMFCMYLLPGTAKSRKVFKK---- 327
+ID + I G V +++ P++ +E+F + + KS
Sbjct: 181 HIDLQSDMNDEHNAITGSVKIKAYSEFPAIEQSVTKEIFAILIHLRAPKSSHSASSRAPL 240
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET-YLFS-TSMELATK 385
D++ ++D+SGSM G + KNA++ + L P D +++AF+ LF M LA +
Sbjct: 241 DLVTVLDVSGSMAGTKIALLKNAMSFVIQTLGPNDRLSVIAFSSTARRLFPLRRMTLAGR 300
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDG----AVEDE 439
++A Q + + +A G TNI L K +++ + R P+ I L++DG + +
Sbjct: 301 ---QQALQAVS-SLVASGGTNIADGLKKGAKVIEDRRLKNPVCSIILLSDGQDTYTLPSD 356
Query: 440 RQICD---AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
R + D + + G +I+TFG GS + + +A IS G +
Sbjct: 357 RNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMHAIAEISSGTF 405
>gi|311264202|ref|XP_003130043.1| PREDICTED: von Willebrand factor A domain-containing protein 5A
[Sus scrofa]
Length = 780
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 147/366 (40%), Gaps = 47/366 (12%)
Query: 293 GVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQG---------KP 343
G +Q P D M Y P ++ V + +F++D SGSMQG
Sbjct: 247 GKPMQKPDSLMGDTAAMVSFY--PDIPEAEPVTCGEFVFLMDRSGSMQGPMSKQDKSQSR 304
Query: 344 LEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGG 403
++ K L + L L G FN+ F F + T++ +E A + + + G
Sbjct: 305 IDAAKETLTLLLKSLPIGCYFNVYGFGSSYEAFFPNSVQYTQQTMEEALRRVKLMRADLG 364
Query: 404 STNICAPLTKAVEMLTNSRG-SIP----IIFLVTDGAVEDERQICDAMKSRLTNGGSICP 458
T I PL RG SIP +F+ TDG V D + ++S
Sbjct: 365 GTEILIPLQFIY------RGPSIPGHPLQLFVFTDGEVTDTFSVIREVRSNRERH----- 413
Query: 459 RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI--D 516
R ++FGIG + ++ +A +SRG + ++ + + R V+ ++++ D
Sbjct: 414 RCFSFGIGEGASTSLIKGIARVSRGTSEFIIGKERMQAKALRALKRSLQPVVEDVSLSCD 473
Query: 517 TLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD-----------TLKAKGFLGDLSN 565
+ L + P + + LIV + G P TL+ K F ++
Sbjct: 474 LPQGLSAKMLSPEQTVIFRGQR-LIVYAQLTGTMPSAEATGEVCLKYTLQGKSFENKVT- 531
Query: 566 FVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEK-VSKMSVQTGVLCEYTRM 624
L+ + + + R+ AK L T F E ++K V ++S+++GV+ +T
Sbjct: 532 --FSLQPKHDDNFTIHRLAAKS--FLQTKDMGFRETSAGDKKEVLEVSIESGVISSHTAY 587
Query: 625 IIVETD 630
I + D
Sbjct: 588 IAINKD 593
>gi|389749401|gb|EIM90572.1| hypothetical protein STEHIDRAFT_107352 [Stereum hirsutum FP-91666
SS1]
Length = 951
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 117/287 (40%), Gaps = 30/287 (10%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKL-DPGDSFNIVAFNGETYLFSTSMELATK 385
++ +F+VD SGSMQG +E K+ L + L + G + NI +F + + E ++
Sbjct: 285 QEYLFLVDRSGSMQGGRIETAKDTLIMLLKAIPSSGAALNIFSFGSQCTGLWQNSEAYSQ 344
Query: 386 EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICD 444
+V++A G T LT + + + S+P +IF++TDG D Q
Sbjct: 345 TSVDQATTHTQSMQANCGGTETRLALT---HVFSRRKRSMPTVIFILTDGEDHDIEQTLK 401
Query: 445 AMKSRLTNGGSICP-RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFT 503
A+ + + P RI+ GIG+ + +A G A + I + +L
Sbjct: 402 AVATAVAQSTPSAPLRIFVLGIGNTVSTAMCEGIARAGNGLCLMALNSTEIVGKCAQLLR 461
Query: 504 RGFSSVLANIAID----TLKDL--DEFEMYPSRIPDLSSESPLIVSGRYQG--------- 548
G + +L + ID DL + F P ++P ++ Y G
Sbjct: 462 AGRTRLLKGVTIDWGVTEEADLNTNSFRRIQFPPPPFIRQTPSVIQTLYPGLRLVVYAIL 521
Query: 549 ---KFPDTLKAKGFLGD------LSNFVVELKLQLAKDIPLDRICAK 586
K P T+ KG LGD LS V ++L L R AK
Sbjct: 522 QTDKIPSTVTIKGTLGDGQEVLNLSVPVEPIRLSLPSPSSRARSAAK 568
>gi|186681468|ref|YP_001864664.1| von Willebrand factor A [Nostoc punctiforme PCC 73102]
gi|186463920|gb|ACC79721.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102]
Length = 426
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 111/265 (41%), Gaps = 38/265 (14%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM G+P++ A+ + +L GD ++VAF+G + + + E+++
Sbjct: 46 LILDRSGSMHGQPIKTVIQAVEGLIDRLKVGDRISVVAFSGSVEVIIPNQVIEDPESIKS 105
Query: 391 AHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSR 449
I A G T I L + E++ +RG++ FL+TDG E +I R
Sbjct: 106 Q---IKSKLSASGGTAIAEGLELGITELMKGTRGAVSQAFLLTDGHGESSLRIWKWDIGR 162
Query: 450 LTN---------GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
N + I TFG G+ N L +A + G + Q +
Sbjct: 163 DDNKRCLKLAQKAAKLNLTINTFGFGNSWNQDLLEKIADVGGGTLAHIEHPEQAVEQFSR 222
Query: 501 LFTR----GFS------SVLANIAIDTLKDL-----DEFEM---------YPSRIPDLSS 536
LF R G + S++ N+ + LK + D E+ + R+ DL
Sbjct: 223 LFGRIQSIGLTNAYLLLSLVPNVRLAELKPIAQVAPDTIELPVERETDGSFAVRLGDLMQ 282
Query: 537 E-SPLIVSGRYQGKFPDTLKAKGFL 560
+ ++++ Y G+ P+ +A G L
Sbjct: 283 DVERVVLANIYLGQLPEGKQAIGHL 307
>gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Moorea producens 3L]
gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Moorea producens 3L]
Length = 413
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 104/247 (42%), Gaps = 31/247 (12%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
++D SGSM G+PLE K A + +L P D +IVAF+ + + + + ++R
Sbjct: 46 LVLDHSGSMHGQPLETVKQAAVGLIERLQPDDRLSIVAFDHRAKVLVRNQPMGNLDQIKR 105
Query: 391 AHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSR 449
+G A G T I L V E++ + ++ +FL+TDG E +
Sbjct: 106 KINRLG----ADGGTAIDEGLKLGVKELIKAKQDTVSQVFLLTDGENEHGNNESCIKLAE 161
Query: 450 LTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSV 509
L ++ I + G G+ N L +A I+ G + + + +LF R S
Sbjct: 162 LAAENNLT--INSLGFGANWNQDILEKIADIATGSLSYIEEPEQALSEFARLFNRMQSVG 219
Query: 510 LAN--IAIDTLKDLDEFEMYP---------------------SRIPDLSSESP-LIVSGR 545
L N + +D + + E+ P R+ DL ++S +I++
Sbjct: 220 LTNAYLLLDLMPKVRLAELKPIAQVEPDAIELPAQSEGDRFMVRLGDLMTDSQRVILANL 279
Query: 546 YQGKFPD 552
Y GK P+
Sbjct: 280 YVGKMPE 286
>gi|390337293|ref|XP_781157.3| PREDICTED: poly [ADP-ribose] polymerase 4-like [Strongylocentrotus
purpuratus]
Length = 1795
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 87/179 (48%), Gaps = 15/179 (8%)
Query: 311 CMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFN 370
M + T +S + +V+ ++D S SM+G+P +D K + L L FN++ F
Sbjct: 796 SMAVFYPTIQSELIADPEVVLLLDCSTSMKGEPKQDAKKICKMILQSLPEKSRFNVITFG 855
Query: 371 GE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLT--KAVEMLTNSRGSIPI 427
+ T LF T + ++ +E A ++I GGS+ PL + M+ ++R
Sbjct: 856 TDFTELFPTVEPVGQRQLLE-ALEFIEGARSVGGSSEAWRPLRSLSLLPMMNSARN---- 910
Query: 428 IFLVTDGAVEDERQICD-AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
+ LV+DG + +E+ + A K + N RI+T + S N + LR LA +S G +
Sbjct: 911 VLLVSDGHLTNEKLTLEIASKYKHVN------RIFTCAVSSAGNRHILRALADVSGGAF 963
>gi|123498822|ref|XP_001327483.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121910413|gb|EAY15260.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 729
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 106/232 (45%), Gaps = 22/232 (9%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
FI+D SGSM+ +++ K L + + L G F+I+ F G Y T E V +
Sbjct: 237 FIIDCSGSMEESRIKNAKFCLNLLIHSLPVGCRFSIIKF-GSMYEVVLPTCDYTDENVAK 295
Query: 391 AHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRL 450
A + I T+I +PL K V + G I +FL+TDG QI +++
Sbjct: 296 AMEQINQMDANMEGTDILSPL-KFVSDQSTKEGFIKQVFLLTDGEDIHTDQIYALVQANR 354
Query: 451 TNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS-IEIQMQKLFTRGFSSV 509
TN RI+T GIGS + ++ +A IS G D D + ++ +L + S
Sbjct: 355 TNN-----RIFTIGIGSGADRNLIKNIARISGGNNALIEDNDEKMNEKIIELLRKAISYA 409
Query: 510 LANIAIDTLKDLDEFEMYPSRIPDLSS--------ESP----LIVSGRYQGK 549
+ + I T + + +M+PS P L + +SP +++SG GK
Sbjct: 410 MTHFCIKT--ENTQPDMWPSPCPPLFAVNSQNFIIKSPFVEKIVISGLVDGK 459
>gi|443310467|ref|ZP_21040118.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Synechocystis sp. PCC 7509]
gi|442779505|gb|ELR89747.1| uncharacterized protein containing a von Willebrand factor type A
(vWA) domain [Synechocystis sp. PCC 7509]
Length = 413
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 134/299 (44%), Gaps = 56/299 (18%)
Query: 294 VLLQSPSLHDVD--------QREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLE 345
VLLQ P L+DV+ QR++ + + + S + ++ I+D SGSM+G+P+E
Sbjct: 3 VLLQ-PVLNDVNLDANQLNSQRQL-AISISAVSPSSNRTVPLNLCLILDRSGSMKGRPME 60
Query: 346 DTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFI-AGGS 404
K A + KL GD +IV F+ + + + EA++ IN + A G
Sbjct: 61 TVKQAANRIVDKLSYGDRLSIVVFDHRAKVLVPNQLVEDPEAIKAQ-----INKLKADGG 115
Query: 405 TNICAPLTKAVEML-TNSRGSIPIIFLVTDGAVE--DERQICD----AMKSRLTNGGSIC 457
T I + +E L + +I FL+TDG E D+++ + A + LT
Sbjct: 116 TAIDEGMKLGIEELGKGKKEAISQAFLLTDGENEHGDDKRCLNFAQLAAEYNLT------ 169
Query: 458 PRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR----GFS------ 507
+ T G G + N L +A I+ G +++ +E + +LF+ G +
Sbjct: 170 --LNTLGFGDHWNQDILEKIADIAGGKLSYIEEVNRVEEEFGQLFSSIQSIGLTNAYLLF 227
Query: 508 SVLANIAIDTLKDL-----DEFEM---------YPSRIPDLSSE-SPLIVSGRYQGKFP 551
S+ AN+ + LK + D E+ R+ DLS + S +I++ Y GK P
Sbjct: 228 SLAANVRLAELKPIAQVAPDTVELSSIEEADGRLAVRLGDLSKDVSRVILANMYVGKLP 286
>gi|403354535|gb|EJY76824.1| von Willebrand domain-containing protein [Oxytricha trifallax]
Length = 1103
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 99/215 (46%), Gaps = 33/215 (15%)
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF---------N 370
KS K+ IFIVD SGSM+G+ ++ TK AL + +S L P F I++F N
Sbjct: 440 KSTDTSKQAYIFIVDRSGSMRGEKMDMTKEALKLFVSSLPPDCLFEIISFGEEFSTTSPN 499
Query: 371 GETYLFSTSMELATKEAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTN--SRGSIPI 427
G+ + + + K +++ FIA TN+ PL A+ + + S PI
Sbjct: 500 GDGFHNNDWEIVEVKSKIDK--------FIANMRGTNMYPPLNYALNAIFDIVSEPEDPI 551
Query: 428 ---IFLVTDG--------AVEDERQI-CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLR 475
IFL+TDG V + I D++ ++ C ++TFGIG+ N ++
Sbjct: 552 QKKIFLLTDGYDGYIGSRRVPHNQMIEFDSVIDKIRKRDHDCA-VHTFGIGNDYNKQLVK 610
Query: 476 MLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVL 510
A RG Y D+D+++ ++ K + L
Sbjct: 611 GFAKAGRGSYSFVEDIDNLKGKIIKALQKAIEPSL 645
>gi|345569946|gb|EGX52771.1| hypothetical protein AOL_s00007g107 [Arthrobotrys oligospora ATCC
24927]
Length = 1543
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 79/162 (48%), Gaps = 8/162 (4%)
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMEL 382
V + +V+FI D SGSM G + + ALAV L L G FNI +F + ++L+S S
Sbjct: 304 VARPEVVFICDRSGSMSGNETQ-LRKALAVFLQSLPVGCKFNICSFGSSHSFLWSKSKTY 362
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
+ + E G GG T + P+ AVE + L+TDG + Q+
Sbjct: 363 SEETLKEAQKHVAGFEADMGG-TEMFPPVKAAVERRYKDVNL--ELLLLTDGDIWQRDQL 419
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
D ++ + G ++ R++T GIG+ +H + LA + GY
Sbjct: 420 MDYVREQC-KGKTV--RMFTLGIGNGISHALIDGLATVGGGY 458
>gi|340369127|ref|XP_003383100.1| PREDICTED: hypothetical protein LOC100635885 [Amphimedon
queenslandica]
Length = 455
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 80/179 (44%), Gaps = 17/179 (9%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ ++D SGSM G+ L+ K L + +L D ++V ++ + L T
Sbjct: 46 DIVAVIDKSGSMHGEKLDLVKETLEFVIDQLKDTDRLSVVTYDTNVKV---DFGLTTMNQ 102
Query: 388 VERAHQWIGINFIA-GGSTNICAPLTKAVEML---TNSRGSIPIIFLVTDG----AVEDE 439
+ + + I G STN+C L K +E + T ++ + + L+TDG + +
Sbjct: 103 ENKERSKVTVKAIRDGSSTNLCGGLLKGMEQIIGRTGTKAQVASVLLLTDGLANVGISNR 162
Query: 440 RQICDAMKSRLTNGGSICP---RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
I + M+ L P IYTFG GS + R+L +S G Y +DS E
Sbjct: 163 EGIMEEMRKVLDPKPGNTPFDGTIYTFGFGS---DHDARLLESVSTQGGGVYYYIDSAE 218
>gi|301781376|ref|XP_002926102.1| PREDICTED: von Willebrand factor A domain-containing protein 3A-like
[Ailuropoda melanoleuca]
Length = 1184
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 94/200 (47%), Gaps = 17/200 (8%)
Query: 313 YLLPGTAKSRKVF----KKDVIFIVDISGSMQGKPLEDTKNALAVALSK--LDPGDSFNI 366
+LL G SR++F + V +VD SGS+ G L+ K L + + + DSFN+
Sbjct: 943 WLLSG---SRRLFGTILESKVCILVDTSGSV-GPHLQQVKTELVLLIWEQLRKHCDSFNL 998
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
++F + L+ ++ T A A QW+ + A GST+I L KA+ S +
Sbjct: 999 LSFAEDLRLWRDTLVETTDAACHEAMQWV-THLRAHGSTSILHALLKAL-----SFHDVE 1052
Query: 427 IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
++L+TDG + + + RLT + + FLR LA ++ G Y
Sbjct: 1053 GLYLLTDGKPDTSCSLILSEVQRLTEKRDVKMHTISLNGSGRAAADFLRNLAALTGGRYH 1112
Query: 487 AAYDLDSIEIQMQKLFTRGF 506
D D++ +++ L T+GF
Sbjct: 1113 CPVDEDTL-LKIHGLLTKGF 1131
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 35/152 (23%), Positives = 66/152 (43%), Gaps = 7/152 (4%)
Query: 321 SRKVF----KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF 376
SR+++ +K V+ ++DIS + + + + +L D FNI+AF T +
Sbjct: 506 SRRIWGTVCEKRVVILLDISVTNSMYIIHIQHSLRLLLEEQLSNKDCFNIIAFGSTTESW 565
Query: 377 STSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM--LTNSRGSIPIIFLVTDG 434
M + E ++ A +W + GS N+ + L KAVE+ + I+L T G
Sbjct: 566 RPEMVAVSHENLQSAWRW-ALTLRCQGSRNVLSALRKAVEVDFRDKDKHQSQGIYLFTGG 624
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIG 466
+ + + A + G + + F +G
Sbjct: 625 IPDQDMPVLSAYMAEACGGCDLQLNVCLFYVG 656
>gi|345319454|ref|XP_001517973.2| PREDICTED: von Willebrand factor A domain-containing protein 5B1,
partial [Ornithorhynchus anatinus]
Length = 564
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 82/168 (48%), Gaps = 6/168 (3%)
Query: 349 NALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNIC 408
+AL V L L P FN++ F S + ++E++ A + I G TNI
Sbjct: 1 DALLVILKSLMPACLFNVIGFGSTFKTLFPSSQTYSEESLATACKNIKRLRADMGGTNIL 60
Query: 409 APLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSY 468
+PL V + RG ++FL+TDGAV + ++ + +++ S R Y+FGIG
Sbjct: 61 SPLKWIVRQPIH-RGHPRLLFLLTDGAVSNTGKVLELVRNH-----SFSTRCYSFGIGPN 114
Query: 469 CNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAID 516
++ LA +S+G + + ++ +M K + + +L+++ ++
Sbjct: 115 ACRRLVQGLAAVSKGSAEFLAEGERLQPKMIKSLKKAMAPILSDVTVE 162
>gi|225543415|ref|NP_001139452.1| poly [ADP-ribose] polymerase 4 [Canis lupus familiaris]
Length = 1677
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 126/578 (21%), Positives = 238/578 (41%), Gaps = 80/578 (13%)
Query: 110 WRVHCVMGSKSCDCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGK----SAS 165
++ + S + R P+ D+ ++ G EA I+GK + + E + A + + S
Sbjct: 629 FQTYTNQSSVPIEARYIFPLDDKAAVCGFEAFINGKHI---VGEVKEKEAAHREYREAIS 685
Query: 166 VETGSFL----KPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP 221
G++L P++FT+++ + + + I++ + +LS + G +V F P V P
Sbjct: 686 QGHGAYLMDQDTPDVFTVSVGNLPPKAKVLIKITYITELSIQ-GSKAV---FFMPATVAP 741
Query: 222 ------------------AIKKIPKREKIHLNVNAGTGTEVLCNTS-SHHLKQLRRDVGK 262
+IKKI ++ L+++ + +S +H L+Q R D
Sbjct: 742 WQQDKVLNENIQDTVEKISIKKIGTKQSFSLSMSIEMPYMIESISSDTHELRQKRTDCKA 801
Query: 263 LGYSYESEVLKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL-------- 314
+ + E L + F +S +++ + + P ++ CM +
Sbjct: 802 VISTVEGSSL--DSDGFTLHVGLSDAYLPRMWVEKHP-----EKESEACMLIFQPNLNIT 854
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-T 373
LP K+ +VI +D S SM+G K ALS + NIV F + T
Sbjct: 855 LPDLTKNN-----EVIICLDCSNSMEGATFLQAKQIALYALSLMGKKQKINIVKFGTDYT 909
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
LFS +A+ + ++I G+T+ L + + +L S+G + I L++D
Sbjct: 910 ELFSYPKYIASNNMLT---EFIMSATPTMGNTDFWKTL-RYLNLLYPSQG-LRSILLISD 964
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDL 491
G +++E +K + + R+++ GIGS N + LR L+ G Y +
Sbjct: 965 GHLQNESLTLQLVKRNVQHT-----RLFSCGIGSTANRHMLRTLSQYGAGVFEYFNSKSK 1019
Query: 492 DSIEIQMQKLFTRGFSSVLANIAIDTL---KDLDEFEMYPSRIPDLSSESPLIVSGRYQG 548
S + Q++ TR S +++I D E P+ + L L+V G
Sbjct: 1020 HSWKKQIEDQMTRLHSPSCHSVSIKWQLLSTDAPEPLQAPAHVQSLFHNDRLLVYGFVPH 1079
Query: 549 KFPDTLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR 603
TL A + S V +LQ + ++ A+ I +L E K+
Sbjct: 1080 CTQATLCALIQEKEFSTIVSTTELQKTTGTMIHKLTAQGLIRDYEDGILHENETNHEMKK 1139
Query: 604 --LEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESP 639
L+ + K+S + ++ ++T + V E+ + ESP
Sbjct: 1140 QILKSLIIKLSKENSLITQFTSFVAV---EKRDGKESP 1174
>gi|123454693|ref|XP_001315098.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
gi|121897764|gb|EAY02875.1| von Willebrand factor type A domain containing protein [Trichomonas
vaginalis G3]
Length = 665
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 127/304 (41%), Gaps = 26/304 (8%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELA-- 383
K D F++D SGSM+G +E + + L L F+IV F E F T + +
Sbjct: 238 KADYFFVIDCSGSMKGDRIEKAVKCMRLILQSLPMKCRFSIVCFGSE---FQTLLPIVEY 294
Query: 384 TKEAVERAHQWIG-INFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
E V A I I I GG T+I PL E + + G IFL+TDG + +I
Sbjct: 295 NNENVLLAMNLIKYIQAIMGG-TDIYHPL----EYIFSQNGMTKKIFLLTDGEDSNSEEI 349
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
RL IYT GIG + +R LA ++ G + D ++ ++ L
Sbjct: 350 I-----RLVQENKQFGNIYTVGIGIGADSGLIRNLAEVTNGKWTYVLDNENFNEKLISL- 403
Query: 503 TRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGD 562
SS + N ++D + E IP S+ PL S Q F + L
Sbjct: 404 ---LSSSIENNSVDIT--IHANETIAETIP--SNPYPL-CSNISQNYFIKAPFCENVLIS 455
Query: 563 LSNFVVELKLQLAKD-IPLDRICAKQQIDLLTAQAWFSEDKRLEEKVSKMSVQTGVLCEY 621
F + + + +D I + + K I D+ +++K +S+ +G+LC+Y
Sbjct: 456 SEEFDLLVPVMKIEDGIGMKSLFTKMIIHEYETYLRTHNDEAMKQKCIDLSISSGILCKY 515
Query: 622 TRMI 625
T I
Sbjct: 516 THYI 519
>gi|432893918|ref|XP_004075917.1| PREDICTED: von Willebrand factor A domain-containing protein
5A-like [Oryzias latipes]
Length = 786
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 121/532 (22%), Positives = 217/532 (40%), Gaps = 65/532 (12%)
Query: 160 AGKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYR-DGEFSVNVPFKFPEY 218
+G+ A + S P+IF+L++ + G SI L++ +L+ DG F P
Sbjct: 92 SGQQAFLLEESEQSPDIFSLSVGSLPPGESASIMLQYVIELAVEADGALR----FCLPAV 147
Query: 219 VTPAIKKIPK-REKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
+ P + P+ E + V++ +E+ + S R V K+ S E L++ N
Sbjct: 148 LNPRYQ--PQGSEGPGVPVSSVPASEIPYSLSFSARVSSPRPVSKVESSCSLEPLQYLNT 205
Query: 278 DF-DFSYTVSPSHIF---------------------GGVLLQSPSLHDVDQREMFCMY-L 314
D + + ++ H F G P D M +Y
Sbjct: 206 DHTEAAVKLAAGHKFDRDVELLIYYKDAHQPSAVVEAGQDSAKPGTLMGDPVVMVSLYPE 265
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQ---------GKPLEDTKNALAVALSKLDPGDSFN 365
P + S++ + +F+VD SGSMQ + ++ L + L L G FN
Sbjct: 266 FPESVMSKRASCGEFVFLVDRSGSMQCPTNNWNQQETRISSARDTLLLLLKSLPMGCYFN 325
Query: 366 IVAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM--LTNSR 422
I F ++F S+E + K ++E A + + + G T I PL ++N
Sbjct: 326 IYGFGSRYDHIFPKSVEYSEK-SMEEALKKVEVMQADLGGTEILQPLQHIYSQPCISNQP 384
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
+ F+ TDG V + +++ D ++ N GS R ++FGIG + + LA R
Sbjct: 385 RQL---FVFTDGEVGNTKEVLDLVRK---NAGSH--RCFSFGIGEGASSALINGLAQEGR 436
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPL 540
G+ D ++ ++ + V+ +I++ D K + + P + L
Sbjct: 437 GHAQFITGSDRMQAKVMQSLKFALQPVVKDISVTWDLPKGVSATVLSPPITTIFQGQRSL 496
Query: 541 IVSGRYQGKFPDT---LKAKGFLGDLS-NFVVELKLQLAKD--IPLDRICAKQQIDLLTA 594
+ + + GK T + LGD + L AKD + + R+ A+ I L +
Sbjct: 497 VYA-QLSGKSSQTDCSVTVNYTLGDHPYESKQQFNLNPAKDTGLTVHRLAARILIRSLES 555
Query: 595 QAWFSEDKRLEE---KVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ + K E+ KV ++SVQ+GV +T I V D +A + P ++
Sbjct: 556 EERENRGKSNEDMKRKVVELSVQSGVSSHHTAFIAVNKD-NGDAVQGPLVRR 606
>gi|395848258|ref|XP_003796771.1| PREDICTED: poly [ADP-ribose] polymerase 4 [Otolemur garnettii]
Length = 1612
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 128/563 (22%), Positives = 230/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSA-SVETGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK + E + A S G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGVIKEKEEAQQEYREAVSQGHGAYLMDQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS + G +V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSIQ-GTAAV---FFMPAGVAPWQQDKALNENLQDAV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + ++ +H LKQ R D + + E L
Sbjct: 770 KTICIKEIGTKQSFSLTMSIEMPYVIEFISSDTHELKQKRTDCKAVISTVEGSSLDSGG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLL-PGTAKSRKVF--KKDVIFIVD 334
F +S +++ + + P ++ CM + P S K +VI +D
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPKLDVSLPDLTDKSEVIICLD 882
Query: 335 ISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAVERAHQ 393
S SM+G K ALS + NI+ F G LFS + + +
Sbjct: 883 CSNSMEGVTFLQAKQIALYALSLVGEMQKVNIIQFGTGYKELFSYPKYITSSNV---PTE 939
Query: 394 WIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNG 453
+I G+T+ L + + +L S+GS I+ LV+DG +++ER +K L +
Sbjct: 940 FIMSATPTMGNTDFWKTL-RYLGLLYPSQGSRNIL-LVSDGHLQNERLTVQLVKRSLPH- 996
Query: 454 GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS-------IEIQMQKLFTRGF 506
R++T G+GS N + LR L+ G + ++ S IE QM +L +
Sbjct: 997 ----TRLFTCGVGSTANRHVLRTLSQCGAGVF-EHFNTKSKHSWKKQIEDQMSRLHSPSC 1051
Query: 507 SSV---LANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
SV ++ D L+ L P+++P L L+V G TL A +
Sbjct: 1052 HSVSVKWQQLSADALEPLQA----PAQVPSLFHGDRLLVYGFIPHCMQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 HTMVSTTELQKTTGTMIHKLTARALIRDYEDGILHEDETSHEMKKQILKSVIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDDN---ESP 1187
>gi|358418909|ref|XP_003584073.1| PREDICTED: von Willebrand factor A domain-containing protein 3A-like
[Bos taurus]
gi|359079644|ref|XP_003587863.1| PREDICTED: von Willebrand factor A domain-containing protein 3A-like
[Bos taurus]
Length = 1171
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 95/202 (47%), Gaps = 21/202 (10%)
Query: 313 YLLPGTAKSRKVF----KKDVIFIVDISGSMQGKPLEDTKNALAVALSK--LDPGDSFNI 366
+LL G SR++F + V ++D SGSM G L+ K L + + + DSFN+
Sbjct: 930 WLLSG---SRRLFGVILESKVCILLDTSGSM-GPYLQQVKTELTLLIWEQLRKHCDSFNL 985
Query: 367 VAFNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP 426
++F + ++ T A +A QW+ A GST++ L KA S +
Sbjct: 986 LSFAEGLQPWQDTLVETTDAACHKAMQWVTC-LQAQGSTSVLQALLKAF-----SFPDVE 1039
Query: 427 IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCN--HYFLRMLAMISRGY 484
++L+TDG + + + RL I +++T C FLR LA ++RG
Sbjct: 1040 GLYLLTDGKPDTSCSLILSEVQRLKEKRDI--KVHTISPDCSCRTAASFLRDLAALTRGR 1097
Query: 485 YGAAYDLDSIEIQMQKLFTRGF 506
Y DS+ ++MQ L TRGF
Sbjct: 1098 YHCPVGEDSL-LEMQGLLTRGF 1118
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 321 SRKVF----KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLF 376
SR+++ +K V+ ++DIS + + + + +L D FNI+AF +
Sbjct: 487 SRRIWGTVCEKRVVILLDISVTNSMYIIHIQHSLRRLLEEQLSNKDCFNIIAFGSTIESW 546
Query: 377 STSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM 417
M A+ ++ A +W+ +N GS N+ + L KAVE+
Sbjct: 547 RPEMVAASHSNLQSAWRWV-LNLQCQGSRNVLSALRKAVEV 586
>gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio]
gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio]
Length = 868
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 114/260 (43%), Gaps = 32/260 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K V+F++D S SM G + TK AL L +L D F I+ F+ ++ + AT+E
Sbjct: 255 KMVVFVIDNSYSMYGNKMAQTKEALGTILGELPEDDYFAIIVFSTTFVVWRPYLSKATEE 314
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEML-------TNSRGSIPIIFLVTDGAVED- 438
V+ A +++ + GG T + VEML T + + ++ L+TDG
Sbjct: 315 NVKEAQEYVKTIEVIGG-TELHDATIHGVEMLYAAQRNGTAPKNMVLMMILLTDGQPNQY 373
Query: 439 ERQICDAMKS-RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQ 497
R + + +S R G+I ++ G+ ++ FL L+ + G Y+ +Q
Sbjct: 374 PRSLPEIQESIRKAIDGNIT--LFGLAFGNDADYGFLDTLSKQNNGIVRRIYEDSDAPLQ 431
Query: 498 MQKLFTRGFSSVLANIAI----DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDT 553
++ + S +L+ + +T+ L S L + S ++V+GR
Sbjct: 432 LKGFYEEVSSPLLSEVNFHYPDNTVNSLTR-----SHFKQLFNGSEIMVAGRLDD----- 481
Query: 554 LKAKGFLGDLSNFVVELKLQ 573
L D++NF +E+ Q
Sbjct: 482 ------LNDINNFDIEVSAQ 495
>gi|400598432|gb|EJP66149.1| vault protein inter-alpha-trypsin [Beauveria bassiana ARSEF 2860]
Length = 1012
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 80/168 (47%), Gaps = 16/168 (9%)
Query: 324 VFKKDVIFIVDISGSMQ-GKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSME 381
V K +V+FI D SGSM G L + K++L V L L G FNI +F + ++L+ S
Sbjct: 292 VEKPEVVFICDRSGSMDTGTRLPNLKSSLQVFLRSLPVGVLFNICSFGSSHSFLWPKSQP 351
Query: 382 LATK---EAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI-IFLVTDGAVE 437
+ K EA + N G T I P+ N GS + +FL+TDG
Sbjct: 352 YSQKTLDEAAAHVRTFTANN----GGTRIREPIEA---TFANREGSRNLEVFLMTDGETW 404
Query: 438 DERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
DE +A+ + G+I R++ G+G+ +H + +A RG+
Sbjct: 405 DENATFEAINRAVKEADGAI--RVFALGVGADASHSLIEGVARAGRGF 450
>gi|432894395|ref|XP_004075972.1| PREDICTED: von Willebrand factor A domain-containing protein
5A-like [Oryzias latipes]
Length = 826
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 154/352 (43%), Gaps = 34/352 (9%)
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQ---------GKPLEDTKNALAVALSKLDPGDSFN 365
P + S++ + +F+VD SGSMQ + ++ L + L L G FN
Sbjct: 266 FPESVMSKRASCGEFVFLVDRSGSMQCPTNNWNQQETRISSARDTLLLLLKSLPMGCYFN 325
Query: 366 IVAF-NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEM--LTNSR 422
I F +G ++F S+E + K ++E A + + + G T I PL ++N
Sbjct: 326 IYGFGDGYDHIFPQSVEYSEK-SMEEALKKVEVMQADLGGTEILQPLQHIYSQPCISNQP 384
Query: 423 GSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
+ F+ TDG V + +++ D ++ N GS R ++FGIG + + LA R
Sbjct: 385 RQL---FVFTDGEVGNTKEVLDLIRK---NAGS--HRCFSFGIGEGASSALINGLAQEGR 436
Query: 483 GYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPL 540
G+ D ++ ++ + V+ +I++ D K + + P + L
Sbjct: 437 GHAQFITGSDRMQAKVMQSLKFALQPVVKDISVTWDLPKRVSATVLSPPITTIFQGQRSL 496
Query: 541 IVSGRYQGKFPDT---LKAKGFLGDLS-NFVVELKLQLAKD--IPLDRICAKQQIDLLTA 594
+ + + GK T + LGD + L AKD + + R+ A+ I L +
Sbjct: 497 VYA-QLCGKSSQTDCSVTVNYTLGDHPYESKQQFNLNPAKDTGLTVHRLAARILIRSLES 555
Query: 595 QAWFSE---DKRLEEKVSKMSVQTGVLCEYTRMIIVETDERNNASESPGTKK 643
+ + D+ ++ KV ++SVQ+GV +T I V D +A + P ++
Sbjct: 556 EERENRGKSDEDMKRKVVELSVQSGVSSHHTAFIAVNKD-NGDAVQGPLVRR 606
>gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 700
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 85/198 (42%), Gaps = 23/198 (11%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ ++DISGSM G L K A+ + L P D +++AF+ L
Sbjct: 263 DLVTVIDISGSMAGTKLALLKRAMGFVIQHLGPSDRLSVIAFSSSARRL---FHLQRMSH 319
Query: 388 VERAHQWIGINFI-AGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDERQICD 444
R IN + AGG TNI L KA +++ + + S+ I L++DG +D IC
Sbjct: 320 YGRQQALQAINSLGAGGGTNIADALKKATKVIEDRSYKNSVCSIILLSDG--QDTYNICS 377
Query: 445 AMKSRLTNGGSICP-----------RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
++ + S+ P I+ FG G+ + L +A S G + D
Sbjct: 378 NVRGGSKDYSSLVPPSILSDTRRMLPIHAFGFGADHDSDSLHSIAEASGGTFSFIED--- 434
Query: 494 IEIQMQKLFTRGFSSVLA 511
E MQ F + +L+
Sbjct: 435 -EGVMQDAFAQCIGGLLS 451
>gi|281208413|gb|EFA82589.1| type A von Willebrand factor domain-containing protein
[Polysphondylium pallidum PN500]
Length = 904
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 144/324 (44%), Gaps = 35/324 (10%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG---ETYLFSTSMEL 382
K + IF++D SGSM G+ ++ K L + + L+ FNIV F + Y ST
Sbjct: 338 KAEFIFVLDCSGSMSGRRIQKAKAVLQLIMRSLNENIKFNIVRFGSSFVKVYPRSTVYND 397
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQ 441
T +V Q+ + GG+ P + + + T + + P +F++TDG + D +Q
Sbjct: 398 ETLSSVSTQIQFFDADL--GGTE--LFPAIRDI-LSTEADPAYPRQVFVLTDGEIYDRKQ 452
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS-IEIQMQK 500
+ + S RI+T GIG + + L+ +G+Y + D+ E ++ K
Sbjct: 453 LISYVAKE-----SSSTRIFTVGIGGQVDRELVIGLSNSCKGFYDFIAENDTDYEGKIMK 507
Query: 501 LFTRGFSSVLANIAID--TLKDLDEFEMYPSRI-PDLSSESPLIVSGRYQGKFPD----- 552
L F +++NI +D LK + P+ I P + E +I + F +
Sbjct: 508 LLEISFQPMISNIKVDWGQLKTIQS----PAAIRPIYAGERMMIYGVLNKQDFKNHKTTV 563
Query: 553 TLKAKGFLGDLSNFVVELKLQLAKD----IPLDRICAKQQIDLLTAQAWFSEDKRLEEKV 608
T+ G G+ + E+ + L KD + L + +I+ L + + K +E +
Sbjct: 564 TITGDGPTGETLKY--EIPIVLTKDNDATVNLHSMATYMRINDLEREE--EKGKNNKELI 619
Query: 609 SKMSVQTGVLCEYTRMIIVETDER 632
++S + G++ ++T I+ E
Sbjct: 620 IRLSKRFGLVSKHTSYIVTNESEE 643
>gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 674
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 86/200 (43%), Gaps = 23/200 (11%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ ++DISGSM G L K A+ + L P D +++AF+ L
Sbjct: 263 DLVTVIDISGSMAGTKLALLKRAMGFVIQHLGPSDRLSVIAFSSSARRL---FHLQRMSH 319
Query: 388 VERAHQWIGINFI-AGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDGAVEDERQICD 444
R IN + AGG TNI L KA +++ + + S+ I L++DG +D IC
Sbjct: 320 YGRQQALQAINSLGAGGGTNIADALKKATKVIEDRSYKNSVCSIILLSDG--QDTYNICS 377
Query: 445 AMKSRLTNGGSICP-----------RIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDS 493
++ + S+ P I+ FG G+ + L +A S G + D
Sbjct: 378 NVRGGSKDYSSLVPPSILSDTRRMLPIHAFGFGADHDSDSLHSIAEASGGTFSFIED--- 434
Query: 494 IEIQMQKLFTRGFSSVLANI 513
E MQ F + +L+ +
Sbjct: 435 -EGVMQDAFAQCIGGLLSVV 453
>gi|405978194|gb|EKC42604.1| Putative sulfite oxidase, mitochondrial [Crassostrea gigas]
Length = 1766
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/240 (21%), Positives = 101/240 (42%), Gaps = 8/240 (3%)
Query: 310 FCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF 369
F +L P K + D++F++D SGSM G + ++++ L + D I+AF
Sbjct: 822 FVHFLAPEGVKPMPM---DIVFVLDKSGSMGGTKMTQLQDSMKKILDDVKADDKIMIIAF 878
Query: 370 NGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTN---SRGSIP 426
+ + T T E + A +I N AGG TNI L ++ LT + G P
Sbjct: 879 DSYLSYWKTDFVQVTPENINNAKNYIR-NTHAGGGTNIDLGLRDGIQKLTQISGNNGRAP 937
Query: 427 IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
++ +TDG + + + L I++ G + ++ +A + G+
Sbjct: 938 VLVFLTDGEATVGETNTERILNNLKKENEADIPIFSLAFGRGADFDIVKRVAAQNNGFAR 997
Query: 487 AAYDLDSIEIQMQKLFTRGFSSVLANIAIDTLK-DLDEFEMYPSRIPDLSSESPLIVSGR 545
Y+ +Q+ + + +L ++ + + L + E+ + S ++VSG+
Sbjct: 998 KIYEDSDAALQIAGFYKEISTVLLKDVKFNYVDGTLYDTEVTNTEFKTYFRGSEMVVSGK 1057
>gi|345310089|ref|XP_001515962.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2
[Ornithorhynchus anatinus]
Length = 951
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/197 (22%), Positives = 87/197 (44%), Gaps = 8/197 (4%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKE 386
K+++F++D+SGSM G ++ T A+ L L D F+++ FN + ++ AT
Sbjct: 315 KNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDQFSVIDFNHNVRSWKDNLVPATDL 374
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSR-------GSIPIIFLVTDGAVEDE 439
A ++I G TNI L +A+ +L + S+ +I LV+DG
Sbjct: 375 MTTDAKKYIE-KIQPNGGTNINEALLRAIFILREASNLGMLDPNSVSLIILVSDGDPTVG 433
Query: 440 RQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
++ + +++ GIG ++ FL L+ + G Y Q++
Sbjct: 434 ELKPTVIQKNVKKNMRDNISLFSLGIGFDVDYDFLERLSRENHGMAQRIYGNQDTSSQLK 493
Query: 500 KLFTRGFSSVLANIAID 516
+ + + + +L N+ +
Sbjct: 494 QFYNQVSTPLLQNVQFN 510
>gi|168702570|ref|ZP_02734847.1| LPXTG-motif cell wall anchor domain protein [Gemmata obscuriglobus
UQM 2246]
Length = 874
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 137/344 (39%), Gaps = 49/344 (14%)
Query: 128 PMGDQGSILGVEAEISGKSYHTQLIALGENDG---AGKSASVETGSFLK--PNIFTLTLP 182
P+ ++ GV+ T+L + +G A KSA + + P++FTL++
Sbjct: 75 PLPGDAAVTGVQVRFGAAEIRTELKEREKAEGEYKAAKSAGRQAALVTRESPDVFTLSVA 134
Query: 183 QIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFK-FPEYVTP-------------AIKKIPK 228
I G + +R + Q +S+ VP P YV A+ + P
Sbjct: 135 GIRTGEEVVVRTDYVQVAKAEGAGWSLRVPLTTAPRYVRADESTSRHADGQPLALLRDPG 194
Query: 229 REKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEV-------LKWSNIDFDF 281
+ LNV V TS+ H + RD +L E EV L W D
Sbjct: 195 -HRFALNVTFAEAERV---TSATHALAVERDRAQL---REGEVIPDRDCVLIWRAKAEDR 247
Query: 282 SYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--LPGTAKSRKVFKKDVIFIVDISGSM 339
S L+ + D + + + L P A ++KV ++VI +VD SGSM
Sbjct: 248 SA------------LRVWTQPDPATGKAYFLALCAPPKFADAKKV-PREVILLVDHSGSM 294
Query: 340 QGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAHQWIGINF 399
G E A+ L+ L D+F++ F+ T F AT E V A +++ +N
Sbjct: 295 SGAKWEAADWAVERFLAGLSEDDAFSLGLFHSTTKWFGERTRKATPENVRAAVEFLKLNR 354
Query: 400 IAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQIC 443
GG T + L +A+ ++ + ++TD V D +I
Sbjct: 355 DQGG-TELGVALEQALARSRSAETPARHVLILTDAEVTDAGRIL 397
>gi|297274889|ref|XP_001110102.2| PREDICTED: poly [ADP-ribose] polymerase 4-like, partial [Macaca
mulatta]
Length = 759
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 101/465 (21%), Positives = 190/465 (40%), Gaps = 70/465 (15%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 203 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 262
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS G V F P V P
Sbjct: 263 FTVSVGNLPPKAKVLIKITYITELSIL-GTVGV---FFMPATVAPWQQDKALNENLQDTV 318
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 319 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVISTMEGSSLDSSG- 377
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 378 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVHLPDLAN-----ESEV 426
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVE 389
I +D S SM+G K ALS + NI+ F + S+ A + ++
Sbjct: 427 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTD-----VSVMTANGDCIQ 481
Query: 390 RAHQWIGINFIAG-----GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+ + ++F+ G+T+ L + + +L +RGS I+ LV+DG ++DE
Sbjct: 482 QGEHNLSLHFLQSATPTMGNTDFWKTL-RYLSLLYPARGSRNIL-LVSDGHLQDE----- 534
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLF 502
++ +L R++ GIG N + LR+L+ G Y A S Q++
Sbjct: 535 SLTLQLVKRSRPHTRLFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQM 594
Query: 503 TRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSG 544
TR S ++++ + D E P+++P L L+V G
Sbjct: 595 TRLRSPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYG 639
>gi|434398532|ref|YP_007132536.1| von Willebrand factor type A [Stanieria cyanosphaera PCC 7437]
gi|428269629|gb|AFZ35570.1| von Willebrand factor type A [Stanieria cyanosphaera PCC 7437]
Length = 413
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 101/221 (45%), Gaps = 12/221 (5%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM GKPLE K A + KL D ++VAF+ + + ++ +ER
Sbjct: 46 LILDRSGSMNGKPLEKVKAAAVNLIKKLTNRDRISVVAFDHRAKVIIPNQLVSDSSQIER 105
Query: 391 AHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVED-ERQICDAMKS 448
+A G T I L + E+ + ++ IFL+TDG E + Q C + +
Sbjct: 106 QID----TLVADGGTAIDEGLKLGIKEIAAGKQNTVSQIFLLTDGENEHGDNQRCLKL-A 160
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSS 508
+L + +I I T G G + N L +A + G + + +KLFTR +
Sbjct: 161 QLASEHNIT--INTLGFGEHWNQDVLEQIADSAGGSLTYIEQPEQAIAEFEKLFTRVQTV 218
Query: 509 VLAN--IAIDTLKDLDEFEMYP-SRIPDLSSESPLIVSGRY 546
L N + ++ + ++ E+ P +++ + E P + G Y
Sbjct: 219 GLTNAYLMMELMPEVRLAELKPVAQVSPETVELPTQLEGNY 259
>gi|171684709|ref|XP_001907296.1| hypothetical protein [Podospora anserina S mat+]
gi|170942315|emb|CAP67967.1| unnamed protein product [Podospora anserina S mat+]
Length = 1075
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 81/372 (21%), Positives = 156/372 (41%), Gaps = 44/372 (11%)
Query: 161 GKSASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVT 220
G+ A + SF ++FT T+ I + L + + + +L + D + + F P +
Sbjct: 116 GQVAGLLEQSFEASDVFTTTIGNIPADASLKVEITYLGELKH-DAQVD-GIRFTIPTSIA 173
Query: 221 PAIKKIP------------KREKIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYE 268
P P K I ++ +G+++ S H + +G +
Sbjct: 174 PRYGSYPGDLLQSTQFGATKGISITVDAEMPSGSQIKSVQSPSHPIAVTVGSTSVGAAKG 233
Query: 269 SEV----------LKWSNIDFDFSYTVSPSHIFG--GVLLQSPSL--HDVDQREMFCMYL 314
+E+ L S +D DF V ++ VL P++ H + +
Sbjct: 234 AEMSLQKASATLSLGTSELDKDFILHVVATNTANPVAVLETHPTIPNHRALMATLVPKFN 293
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSM-QGKPLEDTKNALAVALSKLDPGDSFNIVAFNGET 373
LP + K +++F+ D SGSM GK + + + AL + L L G FNI +F G
Sbjct: 294 LPSS-------KPEIVFVCDRSGSMGDGKRIPNLQTALHLFLKSLPLGVKFNICSF-GSH 345
Query: 374 YLFSTSMELATKEAVERAHQWIGINFIAG--GSTNICAPLTKAVEMLTNSRGSIPIIFLV 431
+ F T +A AH +N I+ G T + PL + + +F++
Sbjct: 346 WDFMFPEGSRTYDASSLAHATQYVNSISANYGGTEMRMPLQDTFKR--RYKDMDLEVFML 403
Query: 432 TDGAVEDERQICDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD 490
TDG + D++Q+ + + + G+I R+++ GIG+ +H + +A G+ + D
Sbjct: 404 TDGEIWDQQQVFGMINTHVAESEGAI--RVFSLGIGNDVSHALIEGIAQAGNGFSQSVAD 461
Query: 491 LDSIEIQMQKLF 502
+S+ ++ ++
Sbjct: 462 DESMNSKVIRML 473
>gi|403347255|gb|EJY73047.1| Ubiquitin [Oxytricha trifallax]
Length = 935
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 100/213 (46%), Gaps = 18/213 (8%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGE-TYLFSTSMELATKE 386
+ IF++D SGSM GK ++ AL + L L G FN+V+F T LF S E E
Sbjct: 368 EYIFLIDRSGSMSGKRIKLAVQALQLFLHSLPMGCKFNVVSFGSNYTKLFEESQEY-DDE 426
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIP-IIFLVTDGAVEDERQICDA 445
++ERA + IG G T + P+ + + ++P ++L+TDG +ICD
Sbjct: 427 SLERAIEQIGKFDANMGGTELFQPVQDILSQPCDP--NLPRHLYLLTDG------EICDT 478
Query: 446 MK-SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
K + +++FGIG + ++ A +G+Y + + IE ++ + +
Sbjct: 479 QKLIEMIRANRATTTVHSFGIGDGVSSELIKNSASAGQGHYCFIMNPEEIERKVLQALQK 538
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSE 537
F + +KD++ + + + I L +E
Sbjct: 539 DFLEYI------IVKDINILDQHKNIIEHLVTE 565
>gi|156358341|ref|XP_001624479.1| predicted protein [Nematostella vectensis]
gi|156211262|gb|EDO32379.1| predicted protein [Nematostella vectensis]
Length = 614
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 139/345 (40%), Gaps = 34/345 (9%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
+ + IF+VD S +M+G+ ++ + + + L L G FN++++ T E+
Sbjct: 275 RGEFIFLVDRSKNMKGENIDRAREVVLLFLKSLPDGCHFNVISYGASYIKLFTQSEVCDH 334
Query: 386 EAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+ ERA ++ + AG + PL + G +FLVT G V + Q+ D
Sbjct: 335 SSRERASDFVW-DLKAGIDNARPIDPLREVYSHNLIDTGYPRQVFLVTGGEVGNAEQVTD 393
Query: 445 AMKSRLTNGGSICPRIYTFGI--GSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLF 502
++ + R +T GI G + +H +R +A RG LD + ++Q ++
Sbjct: 394 EVRK-----NAHTARCFTLGINVGKHSDH--VRRIARAGRGTCELVSQLDRVAPKVQSMW 446
Query: 503 TRGFSSVLANIAID-TLKDLDEFEMYPSRIPDLSSESPLIVSG---RYQGKFPD------ 552
V+ N + TL D P +P + LI+ G R K P
Sbjct: 447 DEATQPVVTNFHVSWTLPDGWVARCIPYNLPPIYRGHTLILYGLLMREDNKQPSYIVAVE 506
Query: 553 ---TLKAKGFLGDLSNFV-VELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLEEKV 608
T+ A GD + V E++ P+DR + LTA+A E EE
Sbjct: 507 GKATISASIQHGDTTKDVHQEVRFSALSGNPVDRDIFLHR---LTAKALIEE----EEDK 559
Query: 609 SKMSVQTGVLCEYTRMIIVETDERNNASESPGTKKGSKKSDHQKI 653
++ +G L TR IV + N T KS H+ I
Sbjct: 560 RVWNLTSGGLA--TRAPIVSLSKSTNIISKATTFVAVDKSTHEVI 602
>gi|333922873|ref|YP_004496453.1| von Willebrand factor type A [Desulfotomaculum carboxydivorans
CO-1-SRB]
gi|333748434|gb|AEF93541.1| von Willebrand factor type A [Desulfotomaculum carboxydivorans
CO-1-SRB]
Length = 421
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 88/185 (47%), Gaps = 20/185 (10%)
Query: 313 YLLPGTAKSRKVFK----KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVA 368
YL+ +V K +++ F++D SGSM G+ L+ TK A+ A+ L P D ++VA
Sbjct: 24 YLMVKLTAPEQVVKERPVQNLSFVIDRSGSMSGEKLDYTKKAVTFAIGHLSPQDYCSVVA 83
Query: 369 FNGETYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVE--MLTNSRGSIP 426
F+ + + S ++ K+A++ A + I GGSTN+ + + L + I
Sbjct: 84 FDDMVTMVAPSHKVENKDALKMAVESI----YPGGSTNLSGGMLLGLREVKLAHKENQIN 139
Query: 427 IIFLVTDG----AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISR 482
+ L+TDG V D + + K+R G + + TFG+G L+ +A
Sbjct: 140 RVLLLTDGMANVGVTDHGALVE--KAREMAAGGV--NLSTFGLGDDFEEDLLQAMAEAGG 195
Query: 483 G--YY 485
G YY
Sbjct: 196 GNFYY 200
>gi|159129667|gb|EDP54781.1| von Willebrand domain protein [Aspergillus fumigatus A1163]
Length = 946
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/208 (23%), Positives = 104/208 (50%), Gaps = 16/208 (7%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELA 383
K ++IF++D SGSM K ++ K+AL V L L G FNI +F ++L+ S+
Sbjct: 291 IKPEIIFVIDRSGSMMDK-IDTLKSALRVFLKSLPVGVCFNICSFGSRHSFLWKQSL-FY 348
Query: 384 TKEAVERAHQWIGINFIAGGSTNICA-PLTKAVEMLTNSRGSIP--IIFLVTDGAVEDER 440
T E+++ A ++F+ G N+ + +AVE SR + ++TDG + +++
Sbjct: 349 TAESLQEA-----LSFVDGVRANMGGTEMQEAVEATVRSRMKDKELEVLILTDGQIWNQQ 403
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+ ++ + G+ R ++ GIG+ +H + +A G+ + + ++ ++ +
Sbjct: 404 TLFGFIRETAADNGA---RFFSLGIGNGASHSLVEGIARAGNGFSQMVVNYEELDRKVVR 460
Query: 501 LFTRGFSSVLANIAIDTLKD--LDEFEM 526
+ + + + ++ D ++EFE+
Sbjct: 461 MLKGALTPHIFDYKLEIEHDDPVEEFEV 488
>gi|401881874|gb|EJT46156.1| hypothetical protein A1Q1_05367 [Trichosporon asahii var. asahii
CBS 2479]
Length = 987
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 63/272 (23%), Positives = 114/272 (41%), Gaps = 13/272 (4%)
Query: 273 KWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFI 332
K +N+D D ++PS L +QR + L+P A + ++IF+
Sbjct: 231 KETNLDTDLVIVITPSKTGEPTALLETHATLPNQRALMAS-LVPKFALPAET--PEIIFV 287
Query: 333 VDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVERAH 392
D SGSMQ + D K AL + L L G +FNI +F L +++V+ A
Sbjct: 288 CDRSGSMQSN-IPDLKRALNIFLKSLPQGVAFNICSFGSRFSFLWPKSRLYDQKSVDEAT 346
Query: 393 QWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTN 452
+ + G T + PL + + G +FL+TDG + DE + +
Sbjct: 347 RHVATFDANYGGTEMYRPLEATFKQRLH-HGYNLEVFLLTDGEIWDEENVFRLINEASEK 405
Query: 453 GGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLAN 512
G R+++ GIGS + + +A G+ D ++++ ++ ++ + N
Sbjct: 406 GNGDL-RVFSLGIGSGASTSLVEGVARAGNGFAQFVADGENMDKKVVRMLKASLFPHINN 464
Query: 513 IAID-----TLKDLDEFEMY--PSRIPDLSSE 537
++ + D+FE+ S + D SE
Sbjct: 465 FKLEIDYEGAGGESDDFELIDAASTVVDFESE 496
>gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908]
gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908]
Length = 640
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 108/226 (47%), Gaps = 37/226 (16%)
Query: 329 VIFIVDISGSMQGK---PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSME---- 381
++F++D+SGSM + PL KNAL + +LD GD +IV + G + + ++
Sbjct: 254 LVFLLDVSGSMSSQDKLPL--LKNALKMLSQQLDEGDRISIVVYAGASGVVLDGVKGNDT 311
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNS--RGSIPIIFLVTDG----A 435
LA +A+++ AGGSTN A + A ++ G + + L TDG
Sbjct: 312 LAISQALDK--------LKAGGSTNGGAGIELAYQLAQKHFIAGGVNRVILATDGDFNVG 363
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
V D++ + D ++ + G I FG G+Y +H + LA G+Y AY +D++
Sbjct: 364 VSDQQALEDMIEEKRKQG--IALTTLGFGQGNYNDH-LMEQLADKGNGHY--AY-IDTLN 417
Query: 496 IQMQKLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLI 541
+ +K+ S+ L IA D +K EF P L SE LI
Sbjct: 418 -EARKVLVDEISATLLTIAKD-VKVQIEFN------PALVSEYRLI 455
>gi|71001856|ref|XP_755609.1| von Willebrand domain protein [Aspergillus fumigatus Af293]
gi|66853247|gb|EAL93571.1| von Willebrand domain protein [Aspergillus fumigatus Af293]
Length = 946
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 48/208 (23%), Positives = 104/208 (50%), Gaps = 16/208 (7%)
Query: 325 FKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELA 383
K ++IF++D SGSM K ++ K+AL V L L G FNI +F ++L+ S+
Sbjct: 291 IKPEIIFVIDRSGSMMDK-IDTLKSALRVFLKSLPVGVCFNICSFGSRHSFLWKQSL-FY 348
Query: 384 TKEAVERAHQWIGINFIAGGSTNICA-PLTKAVEMLTNSRGSIP--IIFLVTDGAVEDER 440
T E+++ A ++F+ G N+ + +AVE SR + ++TDG + +++
Sbjct: 349 TAESLQEA-----LSFVDGVRANMGGTEMQEAVEATVRSRMKDKELEVLILTDGQIWNQQ 403
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+ ++ + G+ R ++ GIG+ +H + +A G+ + + ++ ++ +
Sbjct: 404 TLFGFIRETAADNGA---RFFSLGIGNGASHSLVEGIARAGNGFSQMVVNYEELDRKVVR 460
Query: 501 LFTRGFSSVLANIAIDTLKD--LDEFEM 526
+ + + + ++ D ++EFE+
Sbjct: 461 MLKGALTPHIFDYKLEIEHDDPVEEFEV 488
>gi|451996278|gb|EMD88745.1| hypothetical protein COCHEDRAFT_1182029 [Cochliobolus
heterostrophus C5]
Length = 914
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 85/359 (23%), Positives = 149/359 (41%), Gaps = 48/359 (13%)
Query: 175 NIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHL 234
++FT T+ I + + +++ Q+L + D E V F P ++ P P R I L
Sbjct: 116 DVFTTTIGNIPHNAAVKTTIKYVQELKH-DAEVD-GVRFTMPTWIAPRYGGYPGRYPIEL 173
Query: 235 ----------------NVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESE-------- 270
+++ G + S H Q+ +G L S E
Sbjct: 174 QQSLKPVPSEGISVTVDISMAEGIPIKKIVSPSHPIQV--TLGSLSSSTIDEDQSLSRGS 231
Query: 271 ---VLKWSNIDFDFSYTVSPSHIF--GGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVF 325
L + ++ DF V I +L + P+L + QR + + KS+K
Sbjct: 232 ATLALGTAVLEKDFILQVVAKDIGVPQAILERHPTLPN--QRALMTTLVPKFNLKSQK-- 287
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATK 385
++I I D SGSMQG+ + K+AL + L + G +FNI +F ++ +
Sbjct: 288 -PEIILIADRSGSMQGENITTLKSALKIFLKSIPLGCTFNICSFGSRHEFLWHESQVYDE 346
Query: 386 EAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICD 444
+++A + + NF A G T A + VE S+ S + L+TDG + + ++
Sbjct: 347 RTLQQAIKCVQ-NFDANLGGTETLAAVKACVE--ARSKDSPTELILLTDGDIWSQDELFS 403
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD---LDSIEIQMQK 500
+ SR T G + R++ GIG + + +A G+ D +DS ++M K
Sbjct: 404 YV-SRETESGDV--RVFPLGIGGGVSSALIEGVARAGCGFAQMVTDNEKMDSKVVRMLK 459
>gi|443722621|gb|ELU11382.1| hypothetical protein CAPTEDRAFT_82870, partial [Capitella teleta]
Length = 525
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 82/179 (45%), Gaps = 8/179 (4%)
Query: 307 REMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNI 366
+++ + P +S +++ IFIVD SGSM G ++ + AL + L L FNI
Sbjct: 243 QDVIMLNFFPHLLESDYSEQQEFIFIVDRSGSMGGDRIQSAREALLLFLKSLPISCYFNI 302
Query: 367 VAFNGE-TYLFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSI 425
V+F + LF +K++++ A G T I PL E + G
Sbjct: 303 VSFGSQFQALFPEGSRAYSKKSLQAALGLQSSMKANMGGTEILGPLRAVFEEM-QPIGDF 361
Query: 426 P-IIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
P +FL+TDG V + + + +K S R++TFGIG + ++ A + RG
Sbjct: 362 PRQLFLLTDGEVSNTETVLNLIKV-----NSDHTRVFTFGIGDGASTSLVKGAARVGRG 415
>gi|298491707|ref|YP_003721884.1| von Willebrand factor type A ['Nostoc azollae' 0708]
gi|298233625|gb|ADI64761.1| von Willebrand factor type A ['Nostoc azollae' 0708]
Length = 426
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 79/162 (48%), Gaps = 21/162 (12%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVE- 389
I+D SGSM G+P+ A+ L++L PGD +IVAF G + + + + E+++
Sbjct: 46 LILDKSGSMHGEPINTVIQAVEQLLAQLQPGDHISIVAFAGTSEVIIPNQIVQDAESIKC 105
Query: 390 RAHQWIGINFIAGGSTNICAPLTKAV-EMLTNSRGSIPIIFLVTDGAVE----------- 437
+ H+ AGG T I L+ + E+L ++G++ FL+TDG +
Sbjct: 106 QLHK----RLKAGGGTIIAEGLSLGITELLKGTKGAVSQAFLLTDGHGDRGLKIWKWEMG 161
Query: 438 -DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLA 478
++++ C + + T + + TFG G+ N L +A
Sbjct: 162 PNDKKRCLELAQKATR---VSLTLNTFGFGNDWNQDLLEKIA 200
>gi|451851062|gb|EMD64363.1| hypothetical protein COCSADRAFT_142782 [Cochliobolus sativus
ND90Pr]
Length = 934
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 99/430 (23%), Positives = 179/430 (41%), Gaps = 61/430 (14%)
Query: 115 VMGSKSC--DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGA---------GKS 163
V GSK + R A P+ D S++ E+ ++ + + +ND G+S
Sbjct: 46 VNGSKDALDEIRYAFPLYDGVSVVDFYCEVGERTIYG--LVKEKNDAKKTYEEAKQRGES 103
Query: 164 ASVETGSFLKPNIFTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP-- 221
A++ ++FT T+ I + + I +++ Q+L + D E V F P ++ P
Sbjct: 104 AALLEQLPDAADVFTTTIGNIPHNAAIKITIKYVQELKH-DAEVD-GVRFTIPTWIAPRY 161
Query: 222 -------------AIKKIPKRE-KIHLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSY 267
++K IP I ++++ G + S H Q+ +G L S
Sbjct: 162 GSYRRGYPSELQESLKPIPSEGISITVDISMAEGIPIKKIVSPSHPIQV--TLGSLSSST 219
Query: 268 ESE-----------VLKWSNIDFDFSYTVSPSHIF--GGVLLQSPSLHDVDQREMFCMYL 314
E L + ++ DF V + +L P+L + QR + +
Sbjct: 220 IDEDQSLSRGSATLALGTAALENDFILQVVAKDVGVPQAILESHPTLPN--QRALMATLV 277
Query: 315 LPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY 374
KS+K ++I I D SGSMQG+ + +AL + L + G +FNI +F
Sbjct: 278 PKFNLKSQK---PEIILIADRSGSMQGENITTLISALKIFLKSIPLGCTFNICSFGSSHE 334
Query: 375 LFSTSMELATKEAVERAHQWIGINFIAG-GSTNICAPLTKAVEMLTNSRGSIPIIFLVTD 433
++ + +++A +++ F AG G T A L E T + S + L+TD
Sbjct: 335 FLWHESQIYDERTLQQAIKYVQ-EFDAGFGGTETLAALKACFEACT--KDSPTELILLTD 391
Query: 434 GAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYD--- 490
G + + ++ + SR T G + R++ GIG + + +A G+ D
Sbjct: 392 GDIWSQDELFSYV-SRETESGDV--RVFPLGIGGGVSSALIEGVARAGCGFAQMVTDNEK 448
Query: 491 LDSIEIQMQK 500
+DS ++M K
Sbjct: 449 MDSKVVRMLK 458
>gi|413934094|gb|AFW68645.1| putative RING zinc finger and VWF domain family protein [Zea mays]
Length = 725
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 90/205 (43%), Gaps = 25/205 (12%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETY-LFSTSMELATKE 386
D++ ++D+SGSM G L K A+ + LDP D ++VAF+ + LF + T
Sbjct: 260 DLVTVLDVSGSMSGPKLALVKRAMRFVIENLDPSDRLSVVAFSSAAWRLF--PLRRMTAS 317
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDGAVEDERQICD 444
+++ Q + +A G TNI L KA ++ + P+ I L++DG + +
Sbjct: 318 GQQQSLQAVNC-LVADGGTNIADGLRKATRVVQDRHARNPVCSIMLLSDGV---DSHVLP 373
Query: 445 AMKSRLTNGGSICPR------------IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLD 492
+ + G + PR I+ FG G + L +A +S G +
Sbjct: 374 PRRGSAPDYGRLVPRSALLPGSEHHVPIHAFGFGIDHDSRALHAVAQMS----GGTFSFI 429
Query: 493 SIEIQMQKLFTRGFSSVLANIAIDT 517
+ +Q F + +L+ +A +T
Sbjct: 430 DVVGSIQDAFAQCIGGLLSVVAQET 454
>gi|194212847|ref|XP_001501860.2| PREDICTED: von Willebrand factor A domain-containing protein 5A
[Equus caballus]
Length = 781
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 136/326 (41%), Gaps = 35/326 (10%)
Query: 328 DVIFIVDISGSMQGK---------PLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFST 378
+ +F++D SGSMQG +E K L + L L G FNI F F
Sbjct: 281 EFVFLMDCSGSMQGPMSKQDKSQLRIEAAKETLILLLKSLPIGCYFNIYGFGSSYEAFFP 340
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVED 438
+ + +E A + + + G T I PL K + ++ G +F+ TDG V D
Sbjct: 341 NSVKYNQHTMEEALRRVNLMQADLGGTEILRPL-KTIYGASSIPGHPLQLFVFTDGEVID 399
Query: 439 ERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQM 498
+ ++S S+ R ++FGIG + ++ +A ++ G + ++ ++
Sbjct: 400 SFSVIKEVRS-----NSLRHRCFSFGIGEGASTSLIKGIARVAGGTSEFITGKERMQSKV 454
Query: 499 QKLFTRGFSSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPD---- 552
+ R V+ ++++ D + L + P + + LIV + G P
Sbjct: 455 LRALKRSLQPVVEDVSLSWDLPQGLSAKMLSPEQTVIFRGQR-LIVYAQLTGMMPPAEAT 513
Query: 553 -------TLKAKGFLGDLSNFVVELKLQLAKDIPLDRICAKQQIDLLTAQAWFSEDKRLE 605
L+ K F ++ L+ +L ++ + R+ AK L T F E +
Sbjct: 514 GEVCLKYVLQGKSFENKVT---FSLQPKLDANLTIHRLAAKS--FLQTKDMGFRETPAND 568
Query: 606 EK-VSKMSVQTGVLCEYTRMIIVETD 630
+K V K+S++ GV+ +T I + D
Sbjct: 569 KKDVLKVSLECGVISSHTAFIAINKD 594
>gi|380489242|emb|CCF36836.1| von Willebrand domain-containing protein [Colletotrichum
higginsianum]
Length = 610
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 118/504 (23%), Positives = 195/504 (38%), Gaps = 87/504 (17%)
Query: 59 PKIVDNPDIPSYQ-PYVHGRCDPPS-LIPLQMNAVELDVDCYLDTAFVRVSGTWRVHCVM 116
P P PS Q P D P+ +PL +V ++VD + R + R
Sbjct: 3 PNHALTPRAPSLQQPQSSSDTDIPNDCLPLLHASVVVNVDGNI----TRTTLLQRFSNPS 58
Query: 117 GSKSCDCRIAVPMGDQGSIL------GVEAEISGKSYHTQLIALGENDGAGKSASVETGS 170
+ R P+ D ++ G E E+ GK Q N ET +
Sbjct: 59 EQSIPEARYMFPLYDGAVVVAFRCTVGQERELFGK---VQAKEKARNTFEKAVKRHETAA 115
Query: 171 FLK---PNIFTLTLPQIDGGSYLSIRLRW--SQKLSYRDGE---FSVNVPFK-------- 214
L+ P IF + I + + +R+ + K+S D +N+P
Sbjct: 116 LLEEHTPEIFETVIGNIPAQTDIEVRIEYLGEAKISVLDHTGDVIDINIPMSIAPRYGKS 175
Query: 215 --FPEYVTPAIKKIPKREKIHLNV---NAGTGTEVLCN-------------TSSHHLKQL 256
FPE + + RE + + V ++GT ++ CN + L +L
Sbjct: 176 SSFPENTSTMVN----REGLSIEVKVTDSGTVKKLHCNHQVKIEKNVPFNPMKASSLAEL 231
Query: 257 RRDVGKLGYSY----ESEVLKWSN----IDFDFSYTV--SPSHIFGGVLLQSPSLHDVDQ 306
+ + S V+++++ +D DF +V +PSH + SPS D +
Sbjct: 232 SKSDVRASSSAVPPPTQSVIRYASDVAIMDEDFVVSVENNPSHPLRSRAVLSPS--DENG 289
Query: 307 REMFCMYLLPGTAKSRKV----FKKDVIFIVDISGSM--------QGK----PLEDTKNA 350
+ + P R F +++F++D SGSM QG L K A
Sbjct: 290 LAAVMINIKPAEIFKRATRADRFAGELLFVLDRSGSMGSLDYRGRQGNTGRTKLATLKRA 349
Query: 351 LAVALSKLDPGDSFNIVAFNGET-YLFSTSMELATKEAVERAHQWIGINFIAGGSTNICA 409
+ +ALS L FN+V+F + +L+ SM T E ++ A +++ G T +
Sbjct: 350 MPLALSSLPETCHFNVVSFGSDAEFLWRRSMPY-TNENLDFAQRYLQRLEANMGGTELLH 408
Query: 410 PLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYC 469
L KAV+ + S II LVTDG V+ + + +R G I R + GIG
Sbjct: 409 ALKKAVDSRRAEQSSTQII-LVTDGEVQSDSILEFVWNTRQELGDKI--RFFALGIGFQV 465
Query: 470 NHYFLRMLAMISRGYYGAAYDLDS 493
H + + G YG D+++
Sbjct: 466 PHRLINRIGEFG-GGYGEVVDIEA 488
>gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1]
gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel
[Rhodopirellula baltica SH 1]
Length = 900
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 77/169 (45%), Gaps = 15/169 (8%)
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
K R+ ++ ++D SGSM G+ +E K+A A+ L P D+ ++AF+G++Y S
Sbjct: 456 KEREKPSLAMMLVIDKSGSMGGQKIELAKDAAQAAVELLGPKDAIGVIAFDGDSYTVSEL 515
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE-- 437
+ + A+ A I A G TN+ + A E L + + + L+TDG
Sbjct: 516 RSTSDRGAISDAISTIE----ASGGTNMYPAMADAYEALLGATAKLKHVILMTDGVSSPG 571
Query: 438 DERQIC-DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
D + + D SR+T + T +G + L LA I G Y
Sbjct: 572 DFQGVAGDMSASRIT--------LSTVALGQGSSEDLLEELAQIGGGRY 612
>gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790]
gi|109625454|emb|CAJ51881.1| von Willebrand factor type A domain protein [Haloquadratum walsbyi
DSM 16790]
Length = 799
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 74/181 (40%), Gaps = 20/181 (11%)
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
PG+A+ VI +D+SGS G ++ K AL +L S +V FN + Y
Sbjct: 383 PGSAR--------VILAIDVSGST-GSGMQIQKAVALNALGQLGDSTSVGVVGFNRQAYE 433
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
+L R AGG TNI L A EML RG+ + L++DG
Sbjct: 434 VVGLEQLTENRDTTRQRIR---QLRAGGGTNIANGLRGAEEMLDGQRGT---VILISDGV 487
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
D R + L G R+ T G G N L +A IS G Y A + D +
Sbjct: 488 --DARSRATVVAESL---GRRGVRVITVGAGQRVNEPLLEQIADISGGTYFQANETDRLR 542
Query: 496 I 496
I
Sbjct: 543 I 543
>gi|66827863|ref|XP_647286.1| hypothetical protein DDB_G0267758 [Dictyostelium discoideum AX4]
gi|74859474|sp|Q55G98.1|Y7758_DICDI RecName: Full=von Willebrand factor A domain-containing protein
DDB_G0267758
gi|60475397|gb|EAL73332.1| hypothetical protein DDB_G0267758 [Dictyostelium discoideum AX4]
Length = 878
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 91/196 (46%), Gaps = 15/196 (7%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDS-FNIVAFNGETY--LFSTSMEL 382
K + IF++D SGSM G+P+ K A+ + + L+ N+V F G ++ +F S +
Sbjct: 314 KSEFIFLIDCSGSMVGEPMRKVKRAMEIIIRSLNENQHRVNVVCF-GSSFKKVFKVSRDY 372
Query: 383 --ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDER 440
T E + + Q I N G T + P+ + N + F++TDG
Sbjct: 373 NDETLECLSKYIQSIEANL---GGTELLTPIKNILSSPPNPEYPRQL-FILTDGEAPHRD 428
Query: 441 QICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQK 500
+I + S+ +N RI+T+GIG + + L+ +G+Y D D+ E Q+ K
Sbjct: 429 KIIHYL-SKESN----TTRIFTYGIGDSVDIDLIIGLSNACKGHYEFITDNDNFEKQVMK 483
Query: 501 LFTRGFSSVLANIAID 516
L + +NI +D
Sbjct: 484 LLNISLKPMFSNIKLD 499
>gi|385803148|ref|YP_005839548.1| hypothetical protein Hqrw_1882 [Haloquadratum walsbyi C23]
gi|339728640|emb|CCC39800.1| von Willebrand factor type A domain protein [Haloquadratum walsbyi
C23]
Length = 799
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 74/181 (40%), Gaps = 20/181 (11%)
Query: 316 PGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL 375
PG+A+ VI +D+SGS G ++ K AL +L S +V FN + Y
Sbjct: 383 PGSAR--------VILAIDVSGST-GSGMQIQKAVALNALGQLGDSTSVGVVGFNRQAYE 433
Query: 376 FSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGA 435
+L R AGG TNI L A EML RG+ + L++DG
Sbjct: 434 VVGLEQLTENRDTTRQRIR---QLRAGGGTNIANGLRGAEEMLDGQRGT---VILISDGV 487
Query: 436 VEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIE 495
D R + L G R+ T G G N L +A IS G Y A + D +
Sbjct: 488 --DARSRATVVAESL---GRRGVRVITVGAGQRVNEPLLEQIADISGGTYFQANETDRLR 542
Query: 496 I 496
I
Sbjct: 543 I 543
>gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative
[Ricinus communis]
gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative
[Ricinus communis]
Length = 514
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 100/212 (47%), Gaps = 23/212 (10%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ ++D+SGSM G + K A+ + KL P D ++V F+ + ++
Sbjct: 63 DLVAVLDVSGSMAGDKIAKVKTAMLFVIKKLSPIDRLSVVKFSADASRLCPLRQITEDSQ 122
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTN---SRGSIPIIFLVTDGAVEDERQICD 444
+ + G+N A G+TNI A L +++L + S G + I L++DG ++ D
Sbjct: 123 KDLENLINGLN--ADGATNITAGLQTGLKVLNDRSLSSGRVVGIILMSDG---EQNAGGD 177
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR 504
A + + N P +YTFG G NH R+L I+ G + ++Q +
Sbjct: 178 AAQVPIGN----VP-VYTFGFG--INHE-PRVLKAIANNSMGGTFS----DVQNTDNLSL 225
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSS 536
FS LA + ++DL ++ +R+ D S+
Sbjct: 226 AFSQCLAGLLTVVVQDL---KLTVTRVKDDST 254
>gi|421610510|ref|ZP_16051682.1| von Willebrand factor type A [Rhodopirellula baltica SH28]
gi|408498685|gb|EKK03172.1| von Willebrand factor type A [Rhodopirellula baltica SH28]
Length = 890
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 77/169 (45%), Gaps = 15/169 (8%)
Query: 320 KSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTS 379
K R+ ++ ++D SGSM G+ +E K+A A+ L P D+ ++AF+G++Y S
Sbjct: 446 KEREKPSLAMMLVIDKSGSMGGQKIELAKDAAQAAVELLGPKDAIGVIAFDGDSYTVSEL 505
Query: 380 MELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVE-- 437
+ + A+ A I A G TN+ + A E L + + + L+TDG
Sbjct: 506 RSTSDRGAISDAISTIE----ASGGTNMYPAMADAYEALLGATAKLKHVILMTDGVSSPG 561
Query: 438 DERQIC-DAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
D + + D SR+T + T +G + L LA I G Y
Sbjct: 562 DFQGVAGDMSASRIT--------LSTVALGQGSSEDLLEELAQIGGGRY 602
>gi|338715233|ref|XP_001488642.3| PREDICTED: poly [ADP-ribose] polymerase 4 [Equus caballus]
Length = 1715
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 120/565 (21%), Positives = 228/565 (40%), Gaps = 83/565 (14%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSA-SVETGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ + ++A S G++L P++
Sbjct: 654 EAKYVFPLDDKAAVCGFEAFINGKHIVGEVEEKEKAHREYRTAISQGHGAYLMDQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTPAIKKIPKREKIHLNV 236
FT+++ + + + I++ + +LS + G +V F P V P R+ LN
Sbjct: 714 FTVSVGNLPPKAKILIKITYITELSIQ-GTTAV---FFMPATVAPW------RQDKALNE 763
Query: 237 NAGTGTEVLC-------------------------NTSSHHLKQLRRDVGKLGYSYESEV 271
N E +C ++ +H L+Q D + + E
Sbjct: 764 NIQDTVEKICIKEIGAQQSFSLSMSIEMPYVIEFISSDTHELRQKSTDCKAVISTVEGSS 823
Query: 272 LKWSNIDFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRK 323
L S F +S +++ + + P ++ CM + LP A+
Sbjct: 824 LDSSG--FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDIALPDLAE--- 873
Query: 324 VFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMEL 382
K +V+ +D S SM+ P K ALS + NIV F G LFS +
Sbjct: 874 --KSEVVICLDCSNSMESVPFLQAKQIALYALSLVGKKQKINIVKFGTGYQELFSYPKHV 931
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
+ ++I G+T+ L + + +L S+G + I L++DG +++E
Sbjct: 932 TSNSM---PAEFIMSASPTMGNTDFWKTL-RYLNLLYPSQG-LRNILLISDGHLQNESLT 986
Query: 443 CDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQK 500
++ + + R+++ G+GS NH+ LR+L+ G Y + S + Q++
Sbjct: 987 LQLVRRNVQHT-----RLFSCGVGSTANHHILRILSQCGAGVFEYLNSKSKHSWKKQIED 1041
Query: 501 LFTRGFSSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAK 557
TR S ++++ + D E P+++ L L+V G TL A
Sbjct: 1042 QMTRLCSPSCHSVSVKWQQLSPDAPEPLQAPAQVQSLFHNDRLLVYGFIPHCTQATLYAL 1101
Query: 558 GFLGDLSNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSK 610
+ S V +LQ + ++ A+ I +L E K+ L+ + K
Sbjct: 1102 IQEKEFSTMVSTTELQKTTGTMIHKLTARALIRDYEDGILHENERNHEMKKQVLKSLIIK 1161
Query: 611 MSVQTGVLCEYTRMIIVETDERNNA 635
+S + +L ++T + VE + N A
Sbjct: 1162 LSKENCLLTQFTSFVAVEKRDDNEA 1186
>gi|115488386|ref|NP_001066680.1| Os12g0431700 [Oryza sativa Japonica Group]
gi|77554879|gb|ABA97675.1| von Willebrand factor type A domain containing protein, expressed
[Oryza sativa Japonica Group]
gi|113649187|dbj|BAF29699.1| Os12g0431700 [Oryza sativa Japonica Group]
gi|125536450|gb|EAY82938.1| hypothetical protein OsI_38156 [Oryza sativa Indica Group]
gi|125579179|gb|EAZ20325.1| hypothetical protein OsJ_35934 [Oryza sativa Japonica Group]
Length = 524
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 79/168 (47%), Gaps = 29/168 (17%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ +VD+SGSM+G +E K AL + KL P D +IV F S++ L A
Sbjct: 62 DLVAVVDVSGSMRGHKIESVKKALQFVIMKLTPVDRLSIVTFE------SSAKRLTKLRA 115
Query: 388 VER--AHQWIGI--NFIAGGSTNICAPLTKAVEMLTN---SRGSIPIIFLVTDGAVE--- 437
+ + + GI + IA G T+I A L + +L + + IFL++DG +E
Sbjct: 116 MTQDFRGELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLEGKT 175
Query: 438 --DERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG 483
D Q+ N G + +YTFG G +H L +A S G
Sbjct: 176 SGDPTQV---------NPGEVS--VYTFGFGHGTDHQLLTDIAKNSPG 212
>gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii
CS-505]
gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii
CS-505]
Length = 418
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 100/233 (42%), Gaps = 16/233 (6%)
Query: 331 FIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAVER 390
I+D SGSM+G+P+E+ K A + + KL D +IV FN + L+ + V+R
Sbjct: 46 LILDHSGSMKGQPVENVKRAAWLLVDKLRDQDRLSIVVFNHRAEVL-----LSNQNVVDR 100
Query: 391 AHQWIGINFI-AGGSTNICAPLTKAVEMLTNSR-GSIPIIFLVTDGAVEDERQICDAMKS 448
H IN + A G T+I L +E L R +I FL+TDG E+E +
Sbjct: 101 DHIKQQINRLSANGGTSIDEGLRLGIEELAKGRKDTISQAFLLTDG--ENEHGDNNRCLK 158
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTR---- 504
+ T G G+ N + L ++ G D + L TR
Sbjct: 159 FAQLAADYNLTVNTLGFGNNWNQHILEKISDAGLGSLSHIEHPDQAMDKFDSLLTRMQTV 218
Query: 505 GFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVS--GRYQGKFPDTLK 555
G ++ +++ L EF+ + PD + E P+ + GR+ K D +K
Sbjct: 219 GLTNAYLLLSLAPNVRLAEFKPFAQVAPD-TIELPVQMEPDGRFGVKLGDLMK 270
>gi|348508316|ref|XP_003441700.1| PREDICTED: von Willebrand factor A domain-containing protein 5A
[Oreochromis niloticus]
Length = 742
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 116/507 (22%), Positives = 208/507 (41%), Gaps = 67/507 (13%)
Query: 174 PNIFTLTLPQIDGGSYLSIRLRWSQKLSYR-DGEFSVNVPFKFPEYVTPAIKKIPKREKI 232
P+IF+L++ + G SIRL + +L+ + DG + F P + P + P E
Sbjct: 77 PDIFSLSVGSLPPGESASIRLEYVTELAVQADG----GLRFCLPGVLNPRYQP-PGSEGA 131
Query: 233 HLNVNAGTGTEVLCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNID-FDFSYTVSPSHIF 291
+ V + + V + S R + K+ + + L++ N D + ++ H F
Sbjct: 132 GVQVTSVPASLVPYSLSFSARVSSPRPISKVESNCSLDPLQYLNTDQTQATVKLAAGHKF 191
Query: 292 ---------------------GGVLLQSPSLHDVDQREMFCMY-LLPGTAKSRKVFKKDV 329
G P D M +Y P + S +
Sbjct: 192 DRDVELLIYYKDAHQPTAVVEAGQASAEPGSLMGDAVVMVSLYPEFPQSVMSSMASCGEF 251
Query: 330 IFIVDISGSMQGKPLEDTK----------NALAVALSKLDPGDSFNIVAF-NGETYLFST 378
+F++D SGSM G P+ ++K + L + L L G FNI +F +G ++F
Sbjct: 252 VFLMDRSGSM-GCPMTNSKQQETRISSARDTLLLLLKSLPMGCYFNIYSFGSGYEHIFPK 310
Query: 379 SMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI----IFLVTDG 434
S+E +++ +E A + + G T I PL S+ IP +F+ TDG
Sbjct: 311 SVEY-SQQTMEEALKTVEQMQANLGGTEILQPLKH-----IYSQPCIPSQPRQLFVFTDG 364
Query: 435 AVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSI 494
V + +++ D +K TN GS R ++FGIG + + +A G+ D +
Sbjct: 365 EVGNTKEVIDLVK---TNSGSH--RCFSFGIGEGASSALISGMAKEGGGHAQFITGTDRM 419
Query: 495 EIQMQKLFTRGFSSVLANIAI--DTLKDLDEFEMYPSRIPDLSSESPLI---VSGRYQGK 549
+ ++ + + +I++ D K++ + P + LI ++G+
Sbjct: 420 QPKVMQSLRFALQPAVVDISVTWDLPKEVSVTVLSPPITALFQGQRSLIYAQLTGQSSEA 479
Query: 550 FPDTLKAK-GFLGDLSNFVVELKLQLAKD--IPLDRICAKQQIDLLTAQAWFS---EDKR 603
+ K G S + L+ A+D + + R+ A+ I L Q + +D
Sbjct: 480 AEGCVTVKYSLAGHPSENQLHFSLRPAEDTGLTVHRLGARALIRSLEMQERENRGQQDGG 539
Query: 604 LEEKVSKMSVQTGVLCEYTRMIIVETD 630
++EKV K+SVQ+GV +T I V D
Sbjct: 540 VKEKVVKLSVQSGVSSSFTAFIAVNKD 566
>gi|145500000|ref|XP_001435984.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403121|emb|CAK68587.1| unnamed protein product [Paramecium tetraurelia]
Length = 981
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 104/230 (45%), Gaps = 21/230 (9%)
Query: 299 PSLHDVDQREMFCMYLLP-GTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSK 357
P + + + YL A+++ + + +F +D SGSM G ++ K +L + L
Sbjct: 319 PKFNQTTLDDAYSQYLEELNIAENQAINRGTYLFFIDRSGSMSGGRIKKAKQSLILFLRS 378
Query: 358 LDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGSTNICAPLTKAV 415
L FNI++F G + L+S S + + ++ ++ A + + T I P +
Sbjct: 379 LPDNCRFNIISF-GTMFRSLWSDSKQYS-QDTLDEAIKHVNAMEANMQGTEIFKPFQDVI 436
Query: 416 --EMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFGIGSYCNHYF 473
S+ + IFL+TDG V D I D +K R+YT GIG C+ Y
Sbjct: 437 YNNQYGKSKTTTLNIFLLTDGEV-DVFPIIDLVKR----NNKAETRVYTLGIGEGCSQYL 491
Query: 474 LRMLAMISRGYYGAAYDLDSIEIQMQKL-------FTRGFS--SVLANIA 514
++ LA + G + D + I ++ L + +GF+ S ++NIA
Sbjct: 492 IKNLADVGNGKFQFVADDEDINAKVIDLLEDSMTPYLQGFNLESNISNIA 541
>gi|429858179|gb|ELA33009.1| von willebrand domain containing protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 1292
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 81/163 (49%), Gaps = 12/163 (7%)
Query: 326 KKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLF--STSMEL 382
K +V+F+ D SGSMQGK + D KNAL + + L G FNI +F T+LF S + +
Sbjct: 378 KPEVVFVCDRSGSMQGK-IADLKNALQIYMKSLPVGCMFNICSFGSRHTFLFPKSQTYDQ 436
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQI 442
T + G ++ G T + AP+ + + + +F++TDG V + +
Sbjct: 437 NTLNTAMKHIDKFGSDY---GGTELYAPIEQTFKR--RHKDLDLEVFVLTDGEVWNHENL 491
Query: 443 CDAMKSRLTNG-GSICPRIYTFGIGSYCNHYFLRMLAMISRGY 484
+ + + GS+ R++T GIG +H + LA +G+
Sbjct: 492 FGMVNKHVEDSKGSV--RLFTLGIGRGVSHALVEGLARAGQGF 532
>gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii
177R1B]
gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii
177R1B]
Length = 900
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 77/164 (46%), Gaps = 15/164 (9%)
Query: 328 DVIFIVDISGSMQ-----GKP-LEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSME 381
DV+ ++D SGSM G P LE K+A A + L+ D ++AF+ Y +
Sbjct: 407 DVMLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDGVGVIAFDHNYYWAYKFGK 466
Query: 382 LATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQ 441
L KE V + I + GG T I PL++AV+ L S+ ++ L+TDG E
Sbjct: 467 LVRKEDVIESISSIEV----GGGTAIIPPLSEAVKTLKKSKAKNKLVVLLTDGMGEQSGY 522
Query: 442 ICDAMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYY 485
A +++ N +I T G+G + N L +A + G +
Sbjct: 523 EIPADEAKRNN-----IKITTIGVGKFVNASVLSWIAAYTSGRF 561
>gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila]
gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena
thermophila SB210]
Length = 2138
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 81/173 (46%), Gaps = 9/173 (5%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D+I ++D SGSM G+PL+ K L + L GD ++ F+ + + + +K+
Sbjct: 1447 DLICVIDTSGSMNGQPLDLLKETLLFLVDLLQTGDRICLIQFSTNAQRLTPLLSIESKDN 1506
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDGAVED-ERQICD 444
++ I +A G TNIC + A ++L R PI +FL++DG + E +I D
Sbjct: 1507 IKSIKNEIN-RLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDGLNDGAENKIRD 1565
Query: 445 AMK--SRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRG---YYGAAYDLD 492
+K + N I TFG G + + ++ + G Y G + +D
Sbjct: 1566 LLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKISQLMDGNFYYIGDIHRID 1618
>gi|156395736|ref|XP_001637266.1| predicted protein [Nematostella vectensis]
gi|156224377|gb|EDO45203.1| predicted protein [Nematostella vectensis]
Length = 756
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 39/371 (10%)
Query: 287 PSHIFGGVLLQSPSLHDVDQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLED 346
P F + S D ++ + + +P K + + IF++D SGSM G +++
Sbjct: 233 PQATFENGVTGSNITQDFLEKPLVTLNFMPDFGKQEALETGEFIFVIDRSGSMSGDRIKN 292
Query: 347 TKNALAVALSKLDPGDSFNIVAFNGETY--LFSTSMELATKEAVERAHQWIGINFIAGGS 404
+ L + L L FN+V F G +Y LFS+S + + + + + GG
Sbjct: 293 ARETLFLFLKSLPEHCHFNVVGF-GSSYEKLFSSSTKYSDSSVNKACNHAKNLEANLGG- 350
Query: 405 TNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKSRLTNGGSICPRIYTFG 464
T I PL K V +GS +FL+TDG V + +Q+ +K T+ R +TFG
Sbjct: 351 TEILEPL-KYVFSQPVIKGSPRQVFLMTDGEVGNTQQVITLVKKNSTHA-----RCFTFG 404
Query: 465 IGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQKLFTRGFSSVLANIAI--DTLKDLD 522
IG + ++ +A +G ++ ++ K + +I + D D
Sbjct: 405 IGQGASTALIKGVARAGQGTAEFITSSHQMQAKVVKTLRNALQPSMEDIKVTWDLPHGWD 464
Query: 523 EFEMYPSRIPD-LSSESPLIVSG---RYQGKFPDTLKAKGFL-----GDLSNFV-----V 568
+ P +P L S + +IV G + G + L+ K L + SN V
Sbjct: 465 S-QTIPKNLPQALFSGNRIIVYGLLSKQGGGRDEELQGKATLQATTRDNTSNMRTITHEV 523
Query: 569 ELKLQLAKD---IPLDRICAK----QQIDLL--TAQAWFSEDKRLEEKVSKMSVQTGVLC 619
KL+L + + R+ AK +Q D + A +W S+D L+ KV +S V+
Sbjct: 524 AFKLELKESDGCSMMHRVAAKRFIQEQQDAMGEDASSW-SDD--LKAKVINISKAANVVS 580
Query: 620 EYTRMIIVETD 630
+ T + V+++
Sbjct: 581 KVTSFVAVDSE 591
>gi|27754463|gb|AAO22679.1| unknown protein [Arabidopsis thaliana]
Length = 641
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/259 (24%), Positives = 118/259 (45%), Gaps = 30/259 (11%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELATKE 386
D+I ++D+SGSM G +E KNA++ + L D ++++F+ LF + L ++
Sbjct: 204 DLITVLDVSGSMDGVKMELMKNAMSFVIQNLGETDRLSVISFSSMARRLF--PLRLMSET 261
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDGAVEDERQICD 444
+ A Q + + +A G TNI L ++ R P+ + L++DG +D
Sbjct: 262 GKQAAMQAVN-SLVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDG--QDNFTFSH 318
Query: 445 AMKSRLTNGGSICPR-----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
A T+ S+ P I+TFG GS + + ++ +S G + E +Q
Sbjct: 319 AGVRLRTDYESLLPSSCRIPIHTFGFGSDHDAELMHTISEVSSG----TFSFIETETVIQ 374
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF-PDTLKAKG 558
F + +L+ + L+ + E E + +SS I +G Y+ + PD A
Sbjct: 375 DAFAQCIGGLLSVV---ILEQVVEIECIHEQGLKISS----IKAGSYRSRIAPDARTATI 427
Query: 559 FLGDL-----SNFVVELKL 572
+GD+ +F+V L++
Sbjct: 428 DVGDMYAEEERDFLVLLEI 446
>gi|15223093|ref|NP_172283.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana]
gi|8778841|gb|AAF79840.1|AC026875_20 T6D22.13 [Arabidopsis thaliana]
gi|332190114|gb|AEE28235.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana]
Length = 641
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/259 (24%), Positives = 118/259 (45%), Gaps = 30/259 (11%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNG-ETYLFSTSMELATKE 386
D+I ++D+SGSM G +E KNA++ + L D ++++F+ LF + L ++
Sbjct: 204 DLITVLDVSGSMDGVKMELMKNAMSFVIQNLGETDRLSVISFSSMARRLF--PLRLMSET 261
Query: 387 AVERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDGAVEDERQICD 444
+ A Q + + +A G TNI L ++ R P+ + L++DG +D
Sbjct: 262 GKQAAMQAVN-SLVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDG--QDNFTFSH 318
Query: 445 AMKSRLTNGGSICPR-----IYTFGIGSYCNHYFLRMLAMISRGYYGAAYDLDSIEIQMQ 499
A T+ S+ P I+TFG GS + + ++ +S G + E +Q
Sbjct: 319 AGVRLRTDYESLLPSSCRIPIHTFGFGSDHDAELMHTISEVSSG----TFSFIETETVIQ 374
Query: 500 KLFTRGFSSVLANIAIDTLKDLDEFEMYPSRIPDLSSESPLIVSGRYQGKF-PDTLKAKG 558
F + +L+ + L+ + E E + +SS I +G Y+ + PD A
Sbjct: 375 DAFAQCIGGLLSVV---ILEQVVEIECIHEQGLKISS----IKAGSYRSRIAPDARTATI 427
Query: 559 FLGDL-----SNFVVELKL 572
+GD+ +F+V L++
Sbjct: 428 DVGDMYAEEERDFLVLLEI 446
>gi|390335189|ref|XP_784969.3| PREDICTED: uncharacterized protein LOC579780 [Strongylocentrotus
purpuratus]
Length = 2137
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 92/201 (45%), Gaps = 28/201 (13%)
Query: 305 DQREMFCMYLLPGTAKSRKVFKKDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSF 364
D + CM + + +V+ ++D S SM+G+P++D K + + L L G +
Sbjct: 758 DNKRQACMVVFYPEFPVTTIANPEVVIMLDRSTSMKGQPMKDAKKLVHMILESLPAGCRY 817
Query: 365 ---------------NIVAFNGETY---LFSTSMELATKEAVERAHQWIGINFIAGGSTN 406
+I+ Y L+ TS+ LAT E ++ ++ G+T
Sbjct: 818 KLGIKCRGLEMDSRRDIIETQARHYFDELYPTSV-LATDENIKHTETFLMTVTWERGATE 876
Query: 407 ICAPLTKAVEMLTNSRGSIPII-FLVTDGAVEDERQ-ICDAMKSRLTNGGSICPRIYTFG 464
+ PL +A+ +L++S + FL++DG V +E+ + + +R N R++TFG
Sbjct: 877 VWRPL-RAMSLLSSSDVRLTSNGFLISDGHVTNEQDTLLEVASNRQIN------RLFTFG 929
Query: 465 IGSYCNHYFLRMLAMISRGYY 485
+ S N + LR LA G Y
Sbjct: 930 VSSDSNRHVLRTLADTGGGAY 950
>gi|343926523|ref|ZP_08766027.1| hypothetical protein GOALK_061_00090 [Gordonia alkanivorans NBRC
16433]
gi|343763526|dbj|GAA12953.1| hypothetical protein GOALK_061_00090 [Gordonia alkanivorans NBRC
16433]
Length = 413
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 79/162 (48%), Gaps = 11/162 (6%)
Query: 329 VIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEAV 388
V ++D SGSM G PLE K ALA +++LDP D+F +V F+ + + + L+ K
Sbjct: 31 VQVVLDRSGSMSGSPLEGAKKALAGVVAQLDPSDAFGVVTFDDDAQVVVPAAPLSDK--- 87
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI----IFLVTDGAVEDERQICD 444
+RA IG + +AGG T++ + + L S + I + +++DG V R I +
Sbjct: 88 QRAVDAIG-SIVAGGCTDLSSGYLLGLRELRRSAQAAGIKGGTVLIISDGHV--NRGIVE 144
Query: 445 AMKSRLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGYYG 486
K + I T +G Y Y +L+ ++R G
Sbjct: 145 VDKFESVTSKAAGDGIVTSTLG-YGRGYDETLLSAMARAGNG 185
>gi|403362523|gb|EJY80988.1| von Willebrand domain-containing protein [Oxytricha trifallax]
Length = 922
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 80/167 (47%), Gaps = 31/167 (18%)
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSM-------EL 382
IF+VD SGSM G ++ T +AL + L L G F I++F G Y+ + + +
Sbjct: 296 IFLVDRSGSMSGSKMKTTNDALILFLKSLPVGSYFEIISF-GSNYVSLSHLAGGYENNDQ 354
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT---NSRGSIPIIFLVTDGAVEDE 439
A+++ Q G + G T I PL A++ + N R IF++TDG V +
Sbjct: 355 NLNSAIQQV-QRFGADM---GGTEIFQPLQHAIQNIQTQLNKR-----IFMLTDGEVSNP 405
Query: 440 RQICDAMKSRLTNGGSICP---RIYTFGIGSYCNHYFLRMLAMISRG 483
+++ + + CP RI+T GIG C+ ++ +A RG
Sbjct: 406 QKVQEL--------AAQCPFDIRIHTIGIGRDCDVRLVQSVAKAGRG 444
>gi|355700874|gb|EHH28895.1| Poly [ADP-ribose] polymerase 4 [Macaca mulatta]
Length = 1724
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 124/563 (22%), Positives = 227/563 (40%), Gaps = 74/563 (13%)
Query: 122 DCRIAVPMGDQGSILGVEAEISGKSYHTQLIALGENDGAGKSASVE-TGSFL----KPNI 176
+ + P+ D+ ++ G EA I+GK ++ E + A + G++L P++
Sbjct: 654 EAKYIFPLDDKAAVCGFEAFINGKHIVGEIKEKEEAQREYREAVTQGHGAYLMSQDAPDV 713
Query: 177 FTLTLPQIDGGSYLSIRLRWSQKLSYRDGEFSVNVPFKFPEYVTP--------------- 221
FT+++ + + + I++ + +LS +V V F P V P
Sbjct: 714 FTVSVGNLPPKAKVLIKITYITELSILG---TVGV-FFMPATVAPWQQDKALNENLQDTV 769
Query: 222 ---AIKKIPKREKIHLNVNAGTGTEV-LCNTSSHHLKQLRRDVGKLGYSYESEVLKWSNI 277
IK+I ++ L ++ + + +H LKQ R D + + E L S
Sbjct: 770 EKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDCKAVIGTMEGSSLDSSG- 828
Query: 278 DFDFSYTVSPSHIFGGVLLQSPSLHDVDQREMFCMYL--------LPGTAKSRKVFKKDV 329
F +S +++ + + P ++ CM + LP A + +V
Sbjct: 829 -FSLHIGLSDAYLPRMWVEKHP-----EKESEACMLVFQPDLDVHLPDLAN-----ESEV 877
Query: 330 IFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAF-NGETYLFSTSMELATKEAV 388
I +D S SM+G K ALS + NI+ F G LFS + +
Sbjct: 878 IICLDCSSSMEGVTFLQAKQIALHALSLVGEKHKVNIIQFGTGYKELFSYPKHITSNIV- 936
Query: 389 ERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPIIFLVTDGAVEDERQICDAMKS 448
A ++I G+T+ L + + +L ++GS I+ LV+DG ++DE ++
Sbjct: 937 --ATEFIMSATPTMGNTDFWKTL-RYLSLLYPAQGSRNIL-LVSDGHLQDE-----SLTL 987
Query: 449 RLTNGGSICPRIYTFGIGSYCNHYFLRMLAMISRGY--YGAAYDLDSIEIQMQKLFTRGF 506
+L R++ GIG N + LR+L+ G Y A S Q++ TR
Sbjct: 988 QLVKRSRPHTRLFACGIGPTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRLC 1047
Query: 507 SSVLANIAIDTLK---DLDEFEMYPSRIPDLSSESPLIVSGRYQGKFPDTLKAKGFLGDL 563
S ++++ + D E P+++P L L+V G TL A +
Sbjct: 1048 SPSCHSVSVKWQQLNPDAPEALQAPAQVPSLFRNDRLLVYGFIPHCTQATLCALIQEKEF 1107
Query: 564 SNFVVELKLQLAKDIPLDRICAKQQID-----LLTAQAWFSEDKR--LEEKVSKMSVQTG 616
V +LQ + ++ A+ I +L E K+ L+ + K+S +
Sbjct: 1108 CTMVSTTELQKTTGTMIHKLAARALIRDYEDGILHENETSHEMKKQTLKSLIIKLSKENS 1167
Query: 617 VLCEYTRMIIVETDERNNASESP 639
++ ++T + VE + N ESP
Sbjct: 1168 LITQFTSFVAVEKRDEN---ESP 1187
>gi|15239414|ref|NP_200879.1| C3H4 type zinc finger protein [Arabidopsis thaliana]
gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana]
Length = 704
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 81/168 (48%), Gaps = 14/168 (8%)
Query: 328 DVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYLFSTSMELATKEA 387
D++ ++D+SGSM G L K A+ + L P D ++++F+ T + + L T+
Sbjct: 252 DLVTVLDVSGSMAGTKLALLKRAMGFVIQNLGPFDRLSVISFS-STARRNFPLRLMTETG 310
Query: 388 VERAHQWIGINFIAGGSTNICAPLTKAVEMLTNSRGSIPI--IFLVTDGAVEDERQICDA 445
+ A Q + + ++ G TNI L K +L + R P+ I L++DG +D +
Sbjct: 311 KQEALQAVN-SLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVLLSDG--QDTYTMTSP 367
Query: 446 MKSRLTNGGSICPR--------IYTFGIGSYCNHYFLRMLAMISRGYY 485
SR T+ ++ P+ ++ FG G+ + + +A S G +
Sbjct: 368 NGSRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTF 415
>gi|395538858|ref|XP_003771391.1| PREDICTED: voltage-dependent calcium channel subunit
alpha-2/delta-4 [Sarcophilus harrisii]
Length = 1086
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 84/172 (48%), Gaps = 21/172 (12%)
Query: 327 KDVIFIVDISGSMQGKPLEDTKNALAVALSKLDPGDSFNIVAFNGETYL----FSTSMEL 382
KD++ +VD+SGSM+G + K+ ++ L L D NI+A+N + F ++
Sbjct: 248 KDIVILVDVSGSMKGLQIAIAKHTISTILDTLGENDFVNIIAYNDYVHYVEPCFKGTLVQ 307
Query: 383 ATKEAVERAHQWIGINFIAGGSTNICAPLTKAVEMLT----NSRGSI--PIIFLVTDGAV 436
A ++ E Q +G + G + L++A +L N +G + I L+TDGAV
Sbjct: 308 ADRDNREHFKQLVG-ELMVKGVGIVDKALSEAFRILMQFRENGQGGLCNQAIMLITDGAV 366
Query: 437 EDERQICDAMKSRLTNGGSICPRIYTFGIG---SYCNHYFLRMLAMISRGYY 485
ED I + N R++T+ IG ++ + ++ +A ++GYY
Sbjct: 367 EDYETIFEKY-----NWPDRRVRVFTYLIGREVTFADK--MKWIACNNKGYY 411
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,063,863,183
Number of Sequences: 23463169
Number of extensions: 525160731
Number of successful extensions: 1166187
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1016
Number of HSP's successfully gapped in prelim test: 1858
Number of HSP's that attempted gapping in prelim test: 1161476
Number of HSP's gapped (non-prelim): 3992
length of query: 759
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 608
effective length of database: 8,816,256,848
effective search space: 5360284163584
effective search space used: 5360284163584
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)