BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] (458 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 315 bits (807), Expect = 7e-84, Method: Composition-based stats. Identities = 458/458 (100%), Positives = 458/458 (100%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA Sbjct: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV Sbjct: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ Sbjct: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL Sbjct: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT Sbjct: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR Sbjct: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS Sbjct: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK Sbjct: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 210 bits (533), Expect = 4e-52, Method: Composition-based stats. Identities = 99/470 (21%), Positives = 175/470 (37%), Gaps = 75/470 (15%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + +++ G+F I+TA+L PV+LG GM + V + L++AA +A Sbjct: 1 MSTSFLHPCLRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAADSAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + L + S + F ++ YL + T Sbjct: 61 LATATALANGTIQTSQAEAFARNFVAGQMANYLQSGVDIKSATGVT----------VQTN 110 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + ++YQV +S YDL +NPL ++++G + + T+ + +S+ Sbjct: 111 TSGNSTSYQVTVSPSYDLTVNPL---MQAVGFTTQHLS--TSGTTIGGHSQTQGSISMYL 165 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + + + Y CN Sbjct: 166 ALDKSGSMGEDTATVNEE---------------------------DPTESYTYDCNGH-- 196 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + + S + K ++ A ++ + D VR GA Sbjct: 197 -------YNKKGKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRTGAVS 249 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------E 355 ++ + + +WG + G T + AM TAY ++ + N E Sbjct: 250 YDIVQYTPSALAWGTSGVSTY-----VNALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAE 304 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT-------QDNEEGIAICNKAKSQGIRIMTIAFSVN 408 D H++K KKYIV +TDG+N + A C+ AKS+GI I TIAF Sbjct: 305 DAAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAP 364 Query: 409 KTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + Q L CAS + +F+A +L F+ IG + ++ R+T Sbjct: 365 EGGQA----LLHYCASDDSHYFQAEKMEDLLAAFK-AIGAKASSQLTRLT 409 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 208 bits (528), Expect = 2e-51, Method: Composition-based stats. Identities = 95/470 (20%), Positives = 172/470 (36%), Gaps = 74/470 (15%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + +++ G+F I+TA+L PV+LG GM + V + L++AA +A Sbjct: 1 MSTSFLRPCLRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAADSAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + L + + + F ++ YL + + + Sbjct: 61 LATATALANGTIQTTEAEAFARNFVAGQMANYLQSGTDIKSTTS----------VNVQTT 110 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + ++YQV +S Y L +NPL ++++G + + T+ + +S+ Sbjct: 111 TSGKSTSYQVTVSPAYVLTVNPL---MQAVGFTTQHLS--TSGTTIGGHSQTQGSISMFL 165 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + + Y CN Sbjct: 166 ALDKSGSMGEDTATVNEE---------------------------SPTESYTYDCN---- 194 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + + S + K ++ A ++ + D VR GA Sbjct: 195 ----LHYNTKNNKWVYDKCTGSRTNYYTKIEALKMAAGNLFSQLNSADPNAQYVRTGAVS 250 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------E 355 ++ + S +WG+ + G T + AM TAY ++ + N E Sbjct: 251 YDINQYAPSSLAWGITGVSSY-----VNALQANGGTNSSGAMNTAYTSLTAKNAAGNDVE 305 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT-------QDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + H+ K KKYIV +TDG+N + C+ AKS+GI I TIAF Sbjct: 306 NSAHQQKTGQVPKKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKSKGIEIYTIAFMAP 365 Query: 409 KTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 Q L CAS + +F+A +L F IG + ++ R+T Sbjct: 366 AGGQA----LLHYCASDDSHYFQAEKMEDLLAAF-QAIGAKASAQLTRLT 410 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 207 bits (525), Expect = 4e-51, Method: Composition-based stats. Identities = 94/470 (20%), Positives = 167/470 (35%), Gaps = 70/470 (14%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + +++ G+F I+TA+L PV+LG G+ + V + L++AA +A Sbjct: 1 MSTSFLHPCLRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAADSAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + L + S + F ++ YL + + Sbjct: 61 LATATALGNGTIQTSQAEAFARNFVAGQMANYLQNGVDI----------KNATAVNVQTS 110 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + ++YQV ++ YDL +NPL + T + + + + VS+ Sbjct: 111 NSGKSASYQVTVTPSYDLTVNPLMQAVGFSTQHLSTSSTTVSGPSQTPGSNSQGSVSMFL 170 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + + Sbjct: 171 ALDKSGSMG------------------------------------DPTETVNKDQPTETF 194 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 L+ + + S + K ++ A ++ + D VR GA Sbjct: 195 TYDCNPHLNKKGKWVYDTCTGSRTNYYTKIEALKMAAGNLFGQLTSADPDAQYVRTGAVS 254 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------E 355 ++ + + +WG + G T + AM TAY ++ + N E Sbjct: 255 YDIDQYTPSTLAWGTSGVSSY-----VNALQAGGGTNSSGAMGTAYSSLTAKNAAGNDAE 309 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT-------QDNEEGIAICNKAKSQGIRIMTIAFSVN 408 D H++K KKYIV +TDG+N + A C+ AKS+GI I TIAF Sbjct: 310 DAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAP 369 Query: 409 KTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 Q L CAS +F+A +L F+ IG + ++ R+T Sbjct: 370 PGGQA----LLQYCASDAAHYFQAEQMEDLLAAFK-AIGAKASAQLTRLT 414 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 206 bits (523), Expect = 7e-51, Method: Composition-based stats. Identities = 174/462 (37%), Positives = 274/462 (59%), Gaps = 7/462 (1%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 MVF+ F K+L K G FF+I+ALL+PV+ V G+L+D+VRW YY ++L QA TA Sbjct: 1 MVFNKSLFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ASV L+ S+E+ S S + I++YL+ N + +L NF + + + I++ T V Sbjct: 61 ALSASVQLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKV 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLS-LFLRSMGIKSWLIQTKAEAE-TVSRSYHKEHGVS 178 + RK + + + S Y+L LNP S F+ + IKSW I T EAE T ++YHKE GVS Sbjct: 121 NIYNRKGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVS 180 Query: 179 IQWVIDFSRSMLDY-QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +QW+ID S SM R G + ++N + +VSC+ Sbjct: 181 VQWLIDDSGSMGSIIDRACFGSKQLKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCD 240 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +SLYY+L + E + ++ I+K++LVRDALA+ I+ ++KIDN+ D +RM Sbjct: 241 RSLYYVLNDKKILEDDDLEEKNLDNHSQYYIRKRYLVRDALATFIKRVRKIDNLKDKLRM 300 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 +FN+R+ +WG+ + + + + ++E +T I+ +Q AY+ + S NED+ Sbjct: 301 SFMYFNERIDHYFPMTWGIKEFKQEVSSHY-KRKHENTATDIHPILQEAYNKLHSKNEDD 359 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEK 414 H+ KN++E KK+IVLLTDG + + IC+ AK +GI+I TI++SV+ ++++K Sbjct: 360 EHKKKNSVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIFTISYSVDSSERKK 419 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 A FLS CASP+ FFEA +LN IF++ IG+ IFER+++I Sbjct: 420 ANDFLSRCASPDKFFEAYDADKLNMIFKEHIGDAIFERLVKI 461 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 205 bits (522), Expect = 9e-51, Method: Composition-based stats. Identities = 98/470 (20%), Positives = 178/470 (37%), Gaps = 74/470 (15%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + +++ G+F I+TA++ PV+LGV G+ + V + L++AA +A Sbjct: 1 MSTSFLYPRLRRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAADSAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + L + S + F ++ Y+ + + Sbjct: 61 LATATALANGTIQTSQAEAFAQNFVAGQMANYVQSGVD----------FKSGTSVNVQTS 110 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + ++YQV +S YDL +NPL ++++G K+ + TV + +S+ Sbjct: 111 TSGKSTSYQVTVSPSYDLTVNPL---MQAVGFKTQHLS--TSGTTVGGHSQTQGSISMFL 165 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + D+ Y CN Sbjct: 166 ALDKSGSMGEATATVN---------------------------ADDPTESYTYDCN---- 194 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + + S + K ++ A ++ + D + VR GA Sbjct: 195 ----LHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFGQLNSADPNAEYVRTGAVS 250 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------E 355 ++ + + +WG + G T + AM TAY ++ + N E Sbjct: 251 YDINQYTPSNLAWGTAGVTSY-----VNALQANGGTNSSGAMSTAYSSLTAKNAAGNDAE 305 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT-------QDNEEGIAICNKAKSQGIRIMTIAFSVN 408 D H++K KKYIV +TDG+N + A C+ AKS+GI I TIAF Sbjct: 306 DSAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAP 365 Query: 409 KTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L CAS + +F+A +L F+ IG + ++ R+T Sbjct: 366 AGG----QTLLHYCASDDSHYFQAEKMEDLLAAFK-AIGAKASAQMTRLT 410 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 205 bits (520), Expect = 2e-50, Method: Composition-based stats. Identities = 93/460 (20%), Positives = 178/460 (38%), Gaps = 64/460 (13%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ + ++++ G+F I+TA+++PV+ G GM + V + L++AA +A Sbjct: 1 MMSTSFLHPCLRRMLGDRGGNFGIMTAIVLPVLFGAAGMAIQVGDLLLSKQQLQEAADSA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + L + S + F ++ YL + Sbjct: 61 ALATATALANGTIQTSQAEAFARDFVAGQMANYLQSGIDIKSTTGVD----------VRT 110 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + ++YQV +S Y++ +NPL ++++G + I T + + + VS+ Sbjct: 111 TTSGKSTSYQVTVSPDYNIAVNPL---MQTIGFTTQNISTSSTTTSGNSQTQGS--VSMF 165 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V+D S SM + + Y C++ Sbjct: 166 LVLDRSGSMGEDTATVN---------------------------ASDPTEEYNYDCSEKD 198 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y + + H K ++ A+ ++ + +D + VR GA Sbjct: 199 RYGNVTKKKT---------CTDTRPHYYTKIEALKLAVGTLTGELDAVDPEKEYVRTGAV 249 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 +N + + WG + + T +A +TAY+ + + ED+ H Sbjct: 250 SYNIEMQKAKALDWGTAHVTKY-----VNKLTATDGTDSGEAFKTAYNKLADAAEDKAHV 304 Query: 361 MKNNLEAKKYIVLLTDGENTQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K KYIV +TDG+N + E C+KA+ +++ TIAF + + Sbjct: 305 DKTGQVPTKYIVFMTDGDNNYTSADTETKTWCDKARDAKMQVYTIAFMA----PARGQAL 360 Query: 419 LSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 LS CA+ P ++F A L K F++ IG + +V R+T Sbjct: 361 LSYCATAPGNYFPAGDMTALLKAFKE-IGMKASNQVTRLT 399 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 203 bits (515), Expect = 6e-50, Method: Composition-based stats. Identities = 63/474 (13%), Positives = 140/474 (29%), Gaps = 39/474 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + G I+ ++ +M+ VGG+ +D +R L+ + A++ A+ Sbjct: 24 LRKRLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAAD 83 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + + + ++ + N + D T Sbjct: 84 LDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKDMD----TQFIGRFGF 139 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +V S+ + + + + L S + + + + + S Sbjct: 140 PTLEVPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQDAADIFLDTVLKDENEDLIS 199 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S++ Y PL +R + G D N+ ++ Sbjct: 200 VSLVPYSEQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQWNY 259 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + + D+ R+ ++ ++ I + T + + Sbjct: 260 DGRN-----NYRDDTVCPRYDYERITPFSQNKRTLKNQIDDLVPRAGTSIFLGMKWAAAM 314 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI---------NDAMQTAYDTIISS--NE 355 + + V + T D I ++ NE Sbjct: 315 LDPAFRDINNSLVNAGHVDREFYN-RPASYTDSETLKTVILMTDGANDNSFRISNTYYNE 373 Query: 356 DEVHRMKNNLEAKKYI-----------VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 D + N Y+ + N IC+ AK++ I I +I Sbjct: 374 DSEYVHWNRYNLWWYLRREVNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKRIVIWSIG 433 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F V+ ++ + +CAS S F EL++ FR I +I + +R+T+ Sbjct: 434 FEVD----DEDVPAMQDCASSPSHFFRVEGVELSEAFR-AIARQINQ--LRLTQ 480 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 202 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 62/474 (13%), Positives = 140/474 (29%), Gaps = 39/474 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + G I+ ++ +M+ VGG+ +D +R L+ + A++ A+ Sbjct: 24 LRKRLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAAD 83 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + + + ++ + N + D T Sbjct: 84 LDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKNMD----TQFIGRFGF 139 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +V S+ + + + + L S + + + + + S Sbjct: 140 PTLEVPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQNAADIFLDTVLKDENQDLIS 199 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S++ Y PL +R + G D N+ ++ Sbjct: 200 VSLVPYSEQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQWNY 259 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + + D+ R+ ++ ++ I + T + + Sbjct: 260 DGRN-----NYRDDTVCPRYDYERITPFSQNKRTLKNQIDDLVPRAGTSIFLGMKWAAAM 314 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI----------NDAMQTAYDTIISSNE- 355 + + V + T + A +Y + + Sbjct: 315 LDPAFRDINNSLVNAGYVDREFYN-RPASYTDSETLKTVILMTDGANDNSYRIRSNYYDS 373 Query: 356 DEVHRMKNNLEAKKYI-----------VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 D + N Y+ T N IC+ AK++ I I +I Sbjct: 374 DSEYVHWNKYNLWWYLRREVDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIVIWSIG 433 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F V+ ++ + +CAS S F EL++ FR I +I + +R+T+ Sbjct: 434 FEVD----DEDVPAMQDCASSPSHFFRVEGVELSEAFR-AIARQINQ--LRLTQ 480 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 198 bits (502), Expect = 2e-48, Method: Composition-based stats. Identities = 101/468 (21%), Positives = 187/468 (39%), Gaps = 79/468 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-------IT 63 ++++ G+F ++TAL+ P++L VGG+ VDV ++ L+ A A ++ Sbjct: 11 FMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVS 70 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQK---------IEEYLIRNFENNLKKNFTDREVRDI 114 + P I+ ++++ + + I N ++ + +V Sbjct: 71 DARPDIEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWEDVNAT 130 Query: 115 V-RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 TA + ++QV +++++ L N ++ R +G +S I+T++ AE+ + S + Sbjct: 131 EIDITATPNGAKGKSFQVTVANKHLLQFNAMT---RLLGPESIEIETRSTAESATESKN- 186 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 +S+ V+D S SM C P Sbjct: 187 --ALSMYLVLDRSGSMAWKTNTINTGKAKC---------------------------PNY 217 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 N S Y L ++ + K ++ A+ ++ + D + Sbjct: 218 TEANWSKYPDLK----------------ATGPCYVTKIDALKTAVGDLLAQLVTADPESA 261 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR GA +N + S SWG + +G TA +A +TA+ + ++ Sbjct: 262 YVRTGAISYNSAQDAASSLSWGTRGA-----AGYVDALVAIGGTASGNAFKTAFQKVTNA 316 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ED H KN KYIV +TDGEN D+ C+ AK+ ++I ++AF Sbjct: 317 AEDSEHGAKNGQVPTKYIVFMTDGENNHANDDTVTRQWCDTAKASKVQIYSVAFMA---- 372 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ + L +CA S + +FEA +L F+ IG V R+TK Sbjct: 373 PDRGQKLLKSCASSSSHYFEAEEASDLVAAFK-AIGERAAASVSRLTK 419 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 61/479 (12%), Positives = 136/479 (28%), Gaps = 55/479 (11%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + +++ ++ G+ +I AL + +LG G+ VD R + +L+ A +A + Sbjct: 6 LAAQLTRAARRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAAL 65 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 S L + + T+ V T Sbjct: 66 MLSRDLGVGTITPDQVSSKAQTYFNSLYTN---------------KETGAVTVTATYTAK 110 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + S+ + + F++ +G ++ I + + + Sbjct: 111 DGSGSS-----TIAMSGQGAVQTQFMKILGFQTMAIGSSTTTTWGGTR------LRVAMA 159 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D + SM + S + + R + + ++ S + + Sbjct: 160 LDVTGSMASAGKMSAMKTAAKNLVDSLRASAQTADDVYISVVPFAQMVNVGSSNRNANWV 219 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV---RDALASVIRSIKKIDNVNDT----- 294 + + S + V + D D Sbjct: 220 RWDLWDESNGSCSSWWYSTKSSCEYAGRTWTATSHNQWAGCVTDRDQPADTTKDVPTSYA 279 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R A ++ + T +K + G T M A+ ++ + + Sbjct: 280 TRFPAVDYDACPQQLLGMTSAYSLSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGD 339 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQD-------------NEEGIAICNKAKS-----Q 396 +N + I+LL+DG NT D + +C+ ++ Sbjct: 340 PLNTPAKDSNYKYTDAIILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTN 399 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + I TI + + + L CA +FF ++ + F + RV + Sbjct: 400 PVVIYTIQVNTDG---DPESTVLKYCADSGNFFATTTSSGIGTAFAQIGSSLSKLRVAK 455 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 192 bits (487), Expect = 9e-47, Method: Composition-based stats. Identities = 61/479 (12%), Positives = 135/479 (28%), Gaps = 55/479 (11%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + +++ ++ G+ +I AL + +LG G+ VD R + +L+ A +A + Sbjct: 6 LAAQLTRAARRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAAL 65 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 S L + + T+ V T Sbjct: 66 MLSRDLGVGTITPDQVSSKAQTYFNSLYTN---------------KETGAVTVTATYTAK 110 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + S+ + + F++ +G ++ I + + + Sbjct: 111 DGSGSS-----TIAMSGQGAVQTQFMKILGFQTMAIGSSTTTTWGGTR------LRVAMA 159 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D + SM + S + + R + ++ S + + Sbjct: 160 LDVTGSMASAGKMSAMKTAAKNLVDSLRASAQTVDDVYISVVPFAQMVNVGSSNRNASWV 219 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV---RDALASVIRSIKKIDNVNDT----- 294 + + S + V + D D Sbjct: 220 RWDLWDESNGSCSSWWYSTKSSCEYAGRTWTATSHNQWAGCVTDRDQPADTTKDVPTSYA 279 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R A ++ + T +K + G T M A+ ++ + + Sbjct: 280 TRFPAVDYDACPQQLLGMTSAYSLSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGD 339 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQD-------------NEEGIAICNKAKS-----Q 396 +N + I+LL+DG NT D + +C+ ++ Sbjct: 340 PLNTPAKDSNYKYTDAIILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTN 399 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + I TI + + + L CA +FF ++ + F + RV + Sbjct: 400 PVVIYTIQVNTDG---DPESAVLKYCADSGNFFATTTSSGIGTAFAQIGSSLSKLRVAK 455 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 189 bits (478), Expect = 1e-45, Method: Composition-based stats. Identities = 65/484 (13%), Positives = 130/484 (26%), Gaps = 60/484 (12%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + + G+ +I AL + +LG G +D R S A++ A + + Sbjct: 8 TRSRKVASRFVGDDGGNIAVIFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTALMV 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 S L + S ++ + T+ R + T Sbjct: 68 SKDLGADKIKTSEVSEKAQTYFNSLYTGTEARG----------------VTLTTNYTAKD 111 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + VV++ + + F++ G S I + A A + + + Sbjct: 112 DSGSSTVVVNGDGAVS----THFMKMFGFPSLAIGSAATATWGGTRLRVAMALDVTGSMV 167 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + S + +E + R + ++ S + + Sbjct: 168 LNGST----KLAEMKKAASALVDTLRASAQSKDDLYISVVPFAQMVNVGSSNIDASWIKW 223 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD---ALASVIRSIKKIDNVNDTVRMGATF 301 + + + + V D V + D D T Sbjct: 224 DVWDETEGSCSKSKFKTKTDCEDNGRTWTVTDRSKWKGCVTDRDQPADTTKDAPTSDDTR 283 Query: 302 F---------NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 F + +K D G T M A+ ++ Sbjct: 284 FPALRTLLGTTSCPAQIFPMTSAYAATDAQKIKDVIDDLVADGGTNQPIGMAWAWMSLQQ 343 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------------NEEGIAICNKAK-- 394 N N + I+LL+DG NT D + +C+ K Sbjct: 344 GNPLNTPAKDPNYKYTDAIILLSDGLNTMDRWPDYGDGQRQFDGKIDARQKLLCDNIKLP 403 Query: 395 ---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + TI + + L CA +FF + + F + Sbjct: 404 DSNGKRPVVYTIQVNTTG---DPESTILKYCADGGNFFATTTASGIGTAFAQIGSSLSKL 460 Query: 452 RVIR 455 R+ + Sbjct: 461 RIAK 464 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 187 bits (475), Expect = 3e-45, Method: Composition-based stats. Identities = 57/475 (12%), Positives = 130/475 (27%), Gaps = 42/475 (8%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++ + G + L++ +M+ +GG+ VD++R ++ A A++ A+ Sbjct: 18 MLRRFAREEDGLVTLFAILMILLMILLGGVGVDLMRHERERARVQAVADRAVLAAADLDQ 77 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 E +R + I + N + T + Sbjct: 78 TLSPEAVARDYFDKSGLADYISSVTVEEGLNYRRVTVDAS----RDLKTMFIDKFGQEKL 133 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V + + + + + + S K + + + + S S+ Sbjct: 134 HVPAKATAEEKVAKVEISMVLDISGSMRENDKMNNLHDASNVFIDTVIQTDTEDLISISV 193 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG----IRDEKLSPYMVSCNKSLYYMLY 245 + Y + + + + Y + Y Sbjct: 194 VPYTAQVNVGKDIMDELNVTQLHSYSHCVDFEDSDFNLTTISQTRSYEHMQHFEAGYYWN 253 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 D + ++ + + ++ A++ I +T + Sbjct: 254 GNDRDRTGHYDNISNPGCPKQSYEEIETFSQNAAALKSRIANFQPRANTAIHLGLKWGVA 313 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND--AMQTAYDTIISSNEDEVHRMKN 363 + + I I+ + D + + + Sbjct: 314 L-----LDPSFRAINEAIGGDAVFRGRPAEYNDIDTLKTVILMTDGVNVTTRRIAPEAYS 368 Query: 364 NLEAKK--------------------YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 N + + Y T Q + IC+ AK++GI I +I Sbjct: 369 NRDHYRHWSDYPFYWWLGRNVRSSEHYRWYRTKYTAGQADNLLDNICDAAKAKGIVIWSI 428 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F V + NCAS +S F E+ F + I +I + +R+T+ Sbjct: 429 GFEVTDHGAA----VMKNCASSDSHFFRVEGVEIVDAF-EAIARQINQ--LRLTQ 476 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 186 bits (472), Expect = 5e-45, Method: Composition-based stats. Identities = 98/468 (20%), Positives = 185/468 (39%), Gaps = 83/468 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-------ITASV 66 +++ G+F ++TAL+ P++L VGG+ VDV ++ L+ A A ++ + Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 67 PLIQSLEEVSSR---AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 P I+ + ++ + + + T E + D V Sbjct: 63 PDIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAAAGSTAPSWDDVNTSEVVIVETPNG 122 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + ++QV +++++ L N ++ R +G +S ++T++ A++ + S + +S+ V+ Sbjct: 123 TKGKSFQVSVANKHLLQFNAMT---RLLGKESIELETRSTADSATESKN---AISMYLVL 176 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D S SM + R + +S G+ +R Sbjct: 177 DRSGSMAWKTDTVDTSR--------PRCINWTASNWGESNVR------------------ 210 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 ++ + K ++ A+ + + K+D N+ +R GA +N Sbjct: 211 ------------------ATSPCYVDKITTLKSAVDKLFTPLAKMDPGNEYLRAGAASYN 252 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 DR +WG + G T + A A + ++ E+E H KN Sbjct: 253 DRQDRASKLTWGTKNAS-----AHVQGLDATGGTDSSSAFAAAVEELLLDGENEAHLAKN 307 Query: 364 NLEAKKYIVLLTDGENTQD------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +KYIV +TDGENT + A C AK+ GI I T+AF Sbjct: 308 GQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAKNNGIAIFTVAFMA---- 363 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ + L CA SP+ + EA+ L F ++IG + + R+TK Sbjct: 364 PQRGKDLLKACATSPDHYKEADDAAALVSEF-EKIGQKAAAMIARLTK 410 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 185 bits (468), Expect = 2e-44, Method: Composition-based stats. Identities = 62/474 (13%), Positives = 135/474 (28%), Gaps = 42/474 (8%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +G I ++ ++L +GG+ VD++R L+ AI++A+ Sbjct: 26 RSPLASFKDDESGSLVIFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAADL 85 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + + N + + TA Sbjct: 86 DQTQTPQAVVDDYFETAGLESFLSNVTVDQGINYKTVGAEAQSITT----TAFMKMAGVD 141 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + + + + + S I +K + + +S + S Sbjct: 142 TLNATAAGVAEERIANVEISMVLDISGSMGIGSKMTQLRSAATSFVNTVLSPENEDLVSV 201 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S++ Y + P + E V+ ++ ++ Sbjct: 202 SLVPYSQHVNAGPKIYNELNTNHRHNYSHCVEMADSAYSETELDLSVTYDQMQHFQWNY- 260 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN---- 303 D+ R+ ++ +++ I ++ T + Sbjct: 261 -----SGANQLTDTICPRYSYERITAFSQDASALNAQIAQLQPRAGTQIFMGMKWAAAML 315 Query: 304 ------------DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D F T + + +++ AYD+ Sbjct: 316 DPAFNPVVNALVTSNDIDSVFDNRPAAFDDTETLKTVVLMTDGKNSSSMRIKSWAYDSSS 375 Query: 352 SSNEDEVHRMK-------NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + N +Y D Q + ICN +K GI I +I Sbjct: 376 DYYHWSRYNLWYYLRRNVNRHYHSRYYWFTHDAA--QGDALLDDICNASKDAGIVIWSIG 433 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F V+ + ++NCAS S F E+++ F D I +I + +R+T+ Sbjct: 434 FEVDDHGAD----VMANCASSPSHFFRVEGIEISEAF-DAIARQINQ--LRLTQ 480 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 58/475 (12%), Positives = 126/475 (26%), Gaps = 56/475 (11%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 S + +K+ G+ II A+ + M+G G +D R + +++ A +A + S Sbjct: 11 RDVSSRFVKTDGGNVAIIFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKD 70 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L + + + ++ N D V Sbjct: 71 LASGVITAGQVSAKAQSYFASLYNNTEAPNITVTATYTAKDSTGSSTVLLKGTG------ 124 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + F+ G + I + + + + + +D + Sbjct: 125 --------------DISTEFMNMFGFPTLGIG------SAATATWGGTRLRVAIALDVTG 164 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + Q R + I ++ S + + Sbjct: 165 SMASAGKMPAMQSAAKTLVDNLRANAQTADDLYISIIPFAQMVNVGKSNKNASWIKWDYW 224 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLV---RDALASVIRSIKKID-----NVNDTVRMGA 299 ++ + S + V + D R A Sbjct: 225 EDTTGSCNWWWLTTKSSCESAGRTWSSTNQSQWGGCVTDRDQPADTTKDAPTTAATRFPA 284 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + T +K + G T M A+ ++ Sbjct: 285 ANYSACPEQILPMTSAYSSSNATTIKDKIDALSPNGGTNQPIGMHWAWMSLQDGAPLNTP 344 Query: 360 RMKNNLEAKKYIVLLTDGENTQD-------------NEEGIAICNKAKSQ------GIRI 400 + + I+LL+DG NT D + +C+ ++ I Sbjct: 345 AKDADYKYTDAIILLSDGMNTIDRWYGNGSSWSKDVDARQKLLCDNIRAASAASTTKTVI 404 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 TI + + + L CA +FF + ++ F + R+ + Sbjct: 405 YTIQVNTDG---DPESEVLKYCADSGNFFATTTASGISTAFAQIGASLSKLRIAK 456 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 61/474 (12%), Positives = 141/474 (29%), Gaps = 49/474 (10%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + ++ +G+ +I AL + ++G+ G +D + + ++QA +A++ A+ + Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAATKSMQDG 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR------DTAVEMNPR 125 + S + + + K + + E ++ +T + Sbjct: 66 KDRAYSLKE-ANDYFKGILNQSNNSGLNCTNIDLVYIDETEELEGHVECSQNTTLSKVAG 124 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 +SS + L + S + V+ + Sbjct: 125 IRHLDFNVSSAATYGIGKLEIAFVFDVSGSMANDNRMGNLKVAAREAVNTLL-------- 176 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + Y D E L V + E+ P+ + Y Sbjct: 177 --PVEGYAGDPEDVRLAMVSYDTM--VNAGPYFKAVTNQDPERTEPFYGYIRERTTCRRY 232 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD------ALASVIRSIKKIDNVNDTVRMGA 299 + R K V + + + + V+ T Sbjct: 233 RN-NGTCREWNYEWRGPYHRSYTIKSTCVWEREGAERYTDASPGHNRWLPPVSATFDSYN 291 Query: 300 TFFNDRVISDPSFSWGVH---KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 ++ +DP + R + F D +TA + + + Sbjct: 292 DSWSTDHQTDPWCNDNTPIPLTYNRNKLHDFIDDMTPRRNTAGHIGQAWGWYLVSPEWNS 351 Query: 357 EVHR-----MKNNLEAKKYIVLLTDGENT---------QDNEEGIAICNKAKSQGIRIMT 402 + +A K +++++DG+ + AIC+K K + + I T Sbjct: 352 VWPAGSKALPYDEPDATKVVIMMSDGQYNETRHNNAYPSSVTQAEAICDKMKEKEVVIYT 411 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIGNEIFERVIR 455 + F + L+ CAS +F ++ + EL + ++ + R+ R Sbjct: 412 VGFDAGY-----GQDVLNYCASNPAFAYKPTNGQELTEAYKSIARSISDLRISR 460 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 181 bits (458), Expect = 3e-43, Method: Composition-based stats. Identities = 68/467 (14%), Positives = 139/467 (29%), Gaps = 32/467 (6%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G I+T ++ +M+ VGG+ +D +R L+ AA A++ A+ Sbjct: 21 RFARHEDGSITILTIFIIIMMVMVGGIQLDFMRHEMERTKLQAAADRAVLAAADLDQTLA 80 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + I N N + T V Sbjct: 81 PADVVDEYFAKSGMSDYLSSVTIENGLNFRTVTV----KANNEMKTQFLGRFGFPTLDVP 136 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S+ + + + + L S +K + + + + + S S++ Y Sbjct: 137 ALSKAEERVEKVEISLVLDVSGSMKNNSKLTTMKDAAKTFIDTVLRPETKNNVSLSLIPY 196 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 P D + G + +++ ++ ++ Sbjct: 197 SEQVNVGPDIFNALWVDTRHDFSYCIDVPDGHFVQTQMTPGFPWDQTQHFQWNTYSIESG 256 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + D+ R V ++ + S+ I T + ++ Sbjct: 257 YQQNTLHDTVCPRAVYERVRPISQDGPSLKAQIDLFQPRAGTAIYMGMKWGTALLDPSFR 316 Query: 313 SWGVHKLIRTIVKTFAID---------------ENEMGSTAINDAMQTAYDTIISS---- 353 + ++V++ D G + + + TAY S Sbjct: 317 ETTASLVSDSVVESTFADRPADYSDRETLKTIVLMTDGQNSNSQRISTAYYNSSSEVVHW 376 Query: 354 --NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + E + T + N IC+ AK +GI I TI F V Sbjct: 377 SKWNFNYYLSQYIKEKDWHRYYYTRYTAEKGNTLMDNICSAAKDEGIVIWTIGFEV---- 432 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + + CAS S F EL F I ++I + +R+T+ Sbjct: 433 NDTGADVMKKCASSPSHFFRVEGVELTDAF-SAIASQINQ--LRLTQ 476 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 180 bits (455), Expect = 5e-43, Method: Composition-based stats. Identities = 50/476 (10%), Positives = 134/476 (28%), Gaps = 39/476 (8%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T +++ +G + +L +M+ GG+ +D++ ++ A++ A Sbjct: 21 TALATRARRFAGDESGSMTYMAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAA 80 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + + + N + + ++ Sbjct: 81 ADLDNELEAQGVVEDYMDKMALADALISVDVDEGLNYRTVVAEGYKTMPSNFMQILGVD- 139 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Q + +N + + L S K + + ++ Sbjct: 140 ---NLQAYGLAEATERINKVEVSLVLDISGSMDDNDKLANMQDAAGTFIDTLLAEGNEDL 196 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S S++ Y P A+ + + + + +Y Sbjct: 197 VSISLVPYSEQVNAGPEILSYLSANWKHGYSHCIEMPNSVFGSAALDFSRTYEQMQHYQW 256 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + +LS D+ R+ ++ +++ + ++ T + Sbjct: 257 NYDGYNNTLS-----DTVCPRYGYERIQAWSHDASALKAQVNQLQPRAGTSIFMGMKWGT 311 Query: 305 RVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTA---------------INDAMQTAY 347 ++ + G+ ++ + F T AY Sbjct: 312 ALLDPSTRPIASGM-IARGSVDQVFEGRPVAYDDTDVLKTVVLMTDGQHDRSYRIQDWAY 370 Query: 348 DTIISSNEDEVHRMKNN-----LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ + + ++ + +IC AK+QGI I + Sbjct: 371 NSESEYAHWNRYNLWYYLSRYVSSYERSSFYYQKYNADLGDALLGSICAAAKAQGIIIWS 430 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + F V + + +CAS + F E+ + F I + + + +R+T+ Sbjct: 431 VGFEVGDHGAD----VMESCASSPAHFFRVEGVEITEAF-STIAHTLNQ--LRLTQ 479 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 179 bits (453), Expect = 9e-43, Method: Composition-based stats. Identities = 98/447 (21%), Positives = 162/447 (36%), Gaps = 74/447 (16%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 TA+L PV+LG GM V V + L++AA +A + + L + S + F Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTSEAEAYARNF 60 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 ++ YL + + ++YQV +S YDL +NPL Sbjct: 61 VAGQMANYLQSGVDI----------KSATGVSVQTNTSGNSTSYQVTVSPSYDLTVNPL- 109 Query: 146 LFLRSMGIKSWLIQTKAEAETVS-RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 ++++G + + T + +S+ +D S SM + + Sbjct: 110 --MQAVGFTTQHLSTSGTTIGGGHSQTQGQGSISMYLALDKSGSMGEDTATVNEE----- 162 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + Y CN + E + S Sbjct: 163 ----------------------DPTESYTYPCN---------PHYNRKGKEVWDTCTGSR 191 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + K ++ A ++ + D VR GA ++ + S +WG + Sbjct: 192 ANYYTKIEALKMAAGNLFAQLSGADPNAQYVRTGAVSYDIVQYAPSSLAWGAIGVSSY-- 249 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVLLTDGE 378 G T + AM TAY ++ + N ED H++K+ +KYIV +TDG+ Sbjct: 250 ---VNALQAGGGTNSSGAMSTAYLSLTAKNAAGNDAEDSAHKLKSGQIPQKYIVFMTDGD 306 Query: 379 NT-------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFE 430 N + A C+ AKS+GI I TIAF Q L CAS +F+ Sbjct: 307 NNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQA----LLQYCASDASHYFQ 362 Query: 431 ANSTHELNKIFRDRIGNEIFERVIRIT 457 A +L F+ IG + +V R+T Sbjct: 363 AEKMEDLFAAFK-AIGAKASTQVTRLT 388 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 173 bits (437), Expect = 6e-41, Method: Composition-based stats. Identities = 62/508 (12%), Positives = 131/508 (25%), Gaps = 85/508 (16%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + + + G+ I A+ + MLG G VD R + +++ A +A + Sbjct: 6 IRERIRSSAVRFGQDLRGNIAPIFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAAL 65 Query: 63 TASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + + + + + + + Sbjct: 66 MVAKDANAASPQMTADQVTAAAQKYFNALYHNTDAQGASVSAVYTPYNNGT--------- 116 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + N + F++ +G +T + A + + Sbjct: 117 -----------PATVVLSGSGNVQTDFMKVVGFPQISFKTNSTATW------GNTKLRVA 159 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +D + SM + + + + + I + + N + Sbjct: 160 MALDVTGSMSSAGKLVQMKIAAKKLIDTLKASATAEGDVYISIIPFNVMVNVGANNNTAS 219 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL-----------VRDALASVIRSIKKID 289 + G D S S S K + V Sbjct: 220 WLEWEDGSYDNSSSNYGSCSGSGKSKPNTKSSCIAAGKTWTPKNISSWKGCVTDRGPVSK 279 Query: 290 NVNDTVRMGAT--------------FFNDRVISDPSFSWGV------HKLIRTIVKTFAI 329 + ++ S + + +K Sbjct: 280 PGSGDYDTTKDEPVASTPYTLYLARNYSTCPSSILPMTSAYDSKESDSSTDDSTLKGKIN 339 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------- 382 + G+T AMQ A+ + + + I+LL+DG NTQD Sbjct: 340 NLVANGATNQAIAMQMAWMMLQPTAPFPAPAKDEKYKYTDAIILLSDGLNTQDRWYGNGS 399 Query: 383 ------NEEGIAICNKAKSQGI---------RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + +CN K+ I RI TI + + + L NCA+ Sbjct: 400 DWSSQVDTRQALLCNNIKNDPISKTDPTRRTRIYTIQVNTDG---DPESTVLKNCATDG- 455 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIR 455 FF ++ + F + R+ + Sbjct: 456 FFPTSTASGIASAFAQIGASLSQLRIAK 483 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 65/504 (12%), Positives = 135/504 (26%), Gaps = 80/504 (15%) Query: 7 FIFYSKK---LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + ++ + G I +++ +ML V G+ VD+++ ++ AI+ Sbjct: 13 LRGFCRRLMGFRREEDGAMTIFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILA 72 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 AS + + + +++ I + NF + R ++ M Sbjct: 73 ASDLDQPLPADEVVDDYFAKAGMTEFLDDVQITPGAHLPTTNFRVVQAEARTRTPSIYMA 132 Query: 124 -PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V ++ + + + L S K + + Sbjct: 133 MTGVRSLPVYVAGTAEETIENTEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAA 192 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 S +++ Y + P+ +G + + N Sbjct: 193 NTTSLNIVPYAGQTNPGPIVFQRAGGRPFATFIEDSDGNEILYGQTFVDD--EGNSIDVP 250 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + + + A Sbjct: 251 YNTMSSCLDLTNGDFDNIDLPSGGY-------------------DQTPYFMNWPIDAPTM 291 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE----- 357 + S + ++ F D T M+ + S+ D Sbjct: 292 DWGWCPQNKSSIRYAQNNAGQLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRDTFVALN 351 Query: 358 -----------VHRMKNNLEAKKYIVLLTDGENT-------------------------- 380 + +K+IVL+TDG+ T Sbjct: 352 AAGLVPDGFKDRPADFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRIGDRD 411 Query: 381 ------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + +ICNKAK+ GI + TIAF + + CA+ +FF Sbjct: 412 TYATQSTNVANFYSICNKAKAAGITVYTIAFEAPANAITQ----MRTCATSPAFFYKVEG 467 Query: 435 HELNKIFRDRIGNEIFERVIRITK 458 E+ F+ I +I E +R+T+ Sbjct: 468 VEIKTAFKS-IARQINE--LRLTQ 488 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 171 bits (432), Expect = 2e-40, Method: Composition-based stats. Identities = 58/473 (12%), Positives = 134/473 (28%), Gaps = 26/473 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 +F++K Y K G + A + ++L V G VD R Y H + A A Sbjct: 1 MFESKNSSYFHKFGSDERGSLLPLVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 +T + L ++ + + + I + + D + ++V+ Sbjct: 61 LTVAKQLSTTVLTENQIRTGLKNAFRANLNAAGINSQGIDNLDFKVDPGEGTLDVWSSVD 120 Query: 122 MNPRKS--------AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + +V +S+ + + L L S A E + Sbjct: 121 IQTNFIKLGGIGPEKLEVGAASQVNYSRFDVELALVLDVTGSMRPDMNALKEASKSIVNI 180 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + + + + + N + G + +PY Sbjct: 181 LLP-DDSNSRESKVRISLVPYSQGVNLGSYATRVTNGGSTWRNCVNERSGPQKFTDAPYN 239 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++S ++ P + + ++R+I + + Sbjct: 240 YAGSRSDFFHGKPKQFVWDYGWTEQWQTRPEACPKTAVEPLTADRTKLLRAISGLKDGGG 299 Query: 294 TVRMGATFFNDRVISD----------PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 T + +S ++G K G Sbjct: 300 TGGQTGIAWGWYTLSPKWKNLWPRDSAPATYGTGSHTDDTKKFAL--IMTDGDFNAAYGW 357 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 I + ++Y + + +C++ KS+ I I T+ Sbjct: 358 DCGCRKIRDKPLYCRKKSNKKSWIERY--FSPSKISHAPAQRAKKLCDEMKSKNIEIFTV 415 Query: 404 AFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 F + +S CAS +++ A++++EL + F + + + Sbjct: 416 YF--DTGGATFGDDLMSYCASGSRNYYRADNSNELIQAFSNIANEIQSIYIAK 466 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 170 bits (429), Expect = 5e-40, Method: Composition-based stats. Identities = 62/491 (12%), Positives = 136/491 (27%), Gaps = 53/491 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS--VPLIQS 71 + G+ ++ L +++ VD + A A + + P Sbjct: 6 FFRDRRGNTAVMFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVAVLRPATVE 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + + + + + + + + +N ++ Y+V Sbjct: 66 QAQAAVKLRLDKELGDNPDKVVIGQFNYDTKTRTYYVTAKGTYKPFLLGVVNIKEIPYEV 125 Query: 132 --------VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + L+L+ + + S + A T + + Sbjct: 126 ISETIQAANGTLELALVLDNTDSMGQILNGSSTRLDVLKTAATNLVNTVMTSANKDYVKV 185 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK----- 238 + + AD TV + + + + +C+ Sbjct: 186 AVVPYADYVNVGLANRSQSWVSVGADYTVPAAAKTCTTISTKQVCTGGVYGTCDSIKDGV 245 Query: 239 --SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + P + ++ K VR + S + + D + Sbjct: 246 PIKVGCWKTPQTCTTVNITPYQSCNNPQPTYYKWYGCVRHQVDSKTKMLVLPDPLTAYTG 305 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + T + S + +I + T I + +T+ Sbjct: 306 VLETAQ-KCPTAIQPLSNDKTVVTNSIKGLVNSIGSYKPDTFIPGGLHWGVNTLSPPAPF 364 Query: 357 EVHRMK--NNLEAKKYIVLLTDGENTQDN--------------------------EEGIA 388 + N E KK IVL+TDG NT Sbjct: 365 KEGMAYDSKNKEPKKVIVLMTDGANTLYTNSSGQIVSAATGSPPTISSSLVAPTYTAQDN 424 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGN 447 C AK + I + I V A L +CA+ +F+A + ++L + F + IG Sbjct: 425 ACKYAKGKNIEVFVIGLGVTDP---TALSALKSCATDAQHYFDAQNANDLIEAF-EIIGG 480 Query: 448 EIFERVIRITK 458 ++ V+R+ + Sbjct: 481 KLS--VVRLMQ 489 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 169 bits (427), Expect = 9e-40, Method: Composition-based stats. Identities = 58/454 (12%), Positives = 136/454 (29%), Gaps = 52/454 (11%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL + GH I+ A+++P + G+ + D R + ++ AA+ A + S Sbjct: 2 KLRRHQKGHAAILFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAH--NDP 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + +Q + +Y+ + N R+ + + Sbjct: 60 NQDYGGGGSPSSANQQIVTDYINAYISDVDSINEIKVYKRNCEEIPECKAGLAVGEPRY- 118 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + S F + I +++R Y E V + + DFS SM D Sbjct: 119 FEHEVGVTTSQKSWFPGNDAIVGMGDSFSTSGHSLARKYQSE-AVDVMFAADFSGSMGDR 177 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 D K N ++ + ++ Y Y G Sbjct: 178 WTGGNK-KYEDLIDIIDSISKELQKFNDLEH--NDNDNTMGITAYNEYTYSQYSGSSGGW 234 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 ++ ++ + + + + +N +D Sbjct: 235 WGDDCYLSQAESDGFWGGVSISKTIDGLW---------NEKSKDHCNNSYNSGRFNDIPL 285 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + G T+ A+ + +++ ++ Sbjct: 286 TSNFDVVNQ-----DVSRFWPEGGTSSYQALIRGAQLLT-----------YGTNSRRLLI 329 Query: 373 LLTDGENTQDN----EEGIAICNKAK----------SQGIR--IMTIAFSVNKTQQEKAR 416 +L+DG +T +N +C + ++ IR + I F ++ Sbjct: 330 VLSDGMDTDNNLTSSLVNAGMCRDIQQGLESDKTLDNRPIRAQMAVIGFDYEPSEN---- 385 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L +C + ++A ++ ++ + I EI Sbjct: 386 QALKDCVGAENVYKAENSDDILNTILELISEEIG 419 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 169 bits (426), Expect = 1e-39, Method: Composition-based stats. Identities = 50/475 (10%), Positives = 129/475 (27%), Gaps = 38/475 (8%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + + ++ +G I+ L +M+ GG+ +D++ ++ A++ A Sbjct: 20 SALVAQLQGFRRAESGSMTIMAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAA 79 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + S + + N + + Sbjct: 80 ADLDNELDAQGVVEDYMSKMSLADALVSVNVDEGLNYRTVTADGYRTMPSNFMQLIGIE- 138 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Q S+ +N + + + S K + S + + Sbjct: 139 ---NMQAGGHSQAMERINKVEVSMVLDISGSMDDGDKMAELQTAASDFVDTLLDDGSEDL 195 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S S++ Y P + + ++ ++ ++ Sbjct: 196 VSISLVPYSEHVNAGPEILSYLNVNYMHDDSYCLEMPNSAFNSAALDLSLTYDQMQHFQW 255 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 D+ R+ ++ ++ I ++ T + Sbjct: 256 NY------SGSNSLTDTVCPRYAYEQIRPWSQDAGALKTQISQLQPRAGTSIFMGMKWAS 309 Query: 305 RVISDPSFSWGV-HKLIRTIVKTFAIDENEMGSTAINDAM---------------QTAYD 348 ++ + T+ F T + + AY+ Sbjct: 310 ALLDPSTRPIASGMIADGTVDAVFEGRPVAYSDTDVLKTIVLMTDGQHDRSFRIQNWAYN 369 Query: 349 TIISSNEDEVHRMKNNLEAK-----KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + + L + + + ++C AK QGI I +I Sbjct: 370 DENEVEHWSQYNLWHYLNYYVNSWNRSSFYYQKYDAATGDTLLSSVCTAAKRQGILIWSI 429 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F V+ + +CAS + F E+++ F I + + +R+T+ Sbjct: 430 GFEVSDHGAN----VMESCASSPAHFFRVEGVEISEAF-STIAQTLNQ--LRLTQ 477 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 166 bits (419), Expect = 9e-39, Method: Composition-based stats. Identities = 67/489 (13%), Positives = 128/489 (26%), Gaps = 75/489 (15%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L+ G II AL + + G+ VD R + L QA A + Sbjct: 3 LRRLMHDRRGTVAIIFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGS---- 58 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 T ++++ + F+ N K + D V T ++ Sbjct: 59 ------------STGTAAELQQIGQKFFDANFKDSGLDAAGSFSVSVTGDVVSANG---- 102 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + ++ +GI + + A+ + + + S ++ Sbjct: 103 ---------SAQVQTTLMQLVGIDTIAVSESAQVIRSIKGLELALVLDNTGSMTTSDNIG 153 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + ++ FG AD + + ++P ++S N + G Sbjct: 154 ALRDAAQELVDILFGGRADHPTLRVAVVPYSASVNPGPIAPTLISGNDAYAPTNLLGWKG 213 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + S K V G + Sbjct: 214 CVIERVGRAMEDSPASTAPWLRYQWLPAIDNYYDATKASTVRADPSQGNGGTGPNLGCPT 273 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + A+ G T + M + N + K Sbjct: 274 PITPLTGVKATVDSAIQALRAWSRGGTMGDIGMAWGLRVLSPEPPFTEGLAWNTPKWAKA 333 Query: 371 IVLLTDGEN-------------------------------------------TQDNEEGI 387 ++L+TDG+N + N Sbjct: 334 VILMTDGDNQFYKLTSTTGPNKVNSAVNSDYSGYGRLDQYGALGTTSTTTAKSVINTRLT 393 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIG 446 +C K +GI + TI F+ Q + CAS +F++ S +L FR Sbjct: 394 QVCQAMKDKGITVYTITFTSGINQAT--KDIYKACASSTAKWFDSPSQADLRASFRAIAT 451 Query: 447 NEIFERVIR 455 RV + Sbjct: 452 ELSQLRVSQ 460 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats. Identities = 52/468 (11%), Positives = 124/468 (26%), Gaps = 27/468 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+ +I + + +LG G VD R S A++ A + + + L Sbjct: 17 FPAASGGNIAVIFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTSGKI 76 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + T+ + + + + + +V+ Sbjct: 77 TAENVQSAANTYFTSLYKNTDAPSIDVTATYTP-KTSSENAKLTVGGTGSINTEFMKVMN 135 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S+ L + + + + + + ++ + + + S Sbjct: 136 ISQMSLGASSTTTWGGTRLRVALALDVTGSMDSAGKLSAMKTAAKQLIDTLKATSTTKED 195 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKV-GIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 P N + G K + + + Sbjct: 196 VYISIVPFNVMVNVGPGNKNATWLDWDTSYGSCKSKYTTKNACQAGGDSWNYWSNTCQSQ 255 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV----NDTVRMGATFFNDRVIS 308 + + + + + + ++ A ++D + S Sbjct: 256 KTLKSACQAGGHTWTASNVNSWKGCVTDRTQNYDTTKTEPTSATPDTLFLAQNYSDCMAS 315 Query: 309 DPSFSWGV------HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 T +K + G T M A+ T+ ++ Sbjct: 316 LLPMKSAYEATESDSSTDATTLKGRINTLDAQGGTNQGIGMFWAWMTLQATAPLYTPAKD 375 Query: 363 NNLEAKKYIVLLTDGENTQD-------------NEEGIAICNKA--KSQGIRIMTIAFSV 407 + + IVLL+DG NT++ ++ +C+ K G+ TI Sbjct: 376 SEYKYTDAIVLLSDGMNTKNRWYGNGSNWSPQVDDRQKILCDNITTKVNGVPETTIYTIQ 435 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 T + L C S FF + + F++ + R+ + Sbjct: 436 VNTSGDPESSVLKYCGSTGGFFSTTTASGIQSAFQEVGASLTKLRIAK 483 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 163 bits (411), Expect = 7e-38, Method: Composition-based stats. Identities = 57/475 (12%), Positives = 130/475 (27%), Gaps = 46/475 (9%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + K+ + G F I + + +ML GM +D++R+ + L+ A++ A+ Sbjct: 21 KRFVKEFARDEDGAFIIFSLFMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAADL 80 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + N + ++ T ++ Sbjct: 81 DQTLSPAAVVTDYFAKAGLSSFLTSTTVDQGLNYRIIS----AQGNMTMPTTFMRLSGQT 136 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + + ++ + + L S K + + + + S Sbjct: 137 ELAIRGDATAEERVSNVEISLVVDISGSMGRNNKLSTLRTASHTFIDTVIRPETEDLISL 196 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 +++ Y P D+ + + + + + S Sbjct: 197 NIIPYTAQVNAGPDIFDQLTVDQKHNFSHCIDFE---PADFNTAALDVPPVSTRTYKQMQ 253 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S + + ++ S+ ++ + +T + ++ Sbjct: 254 HFQYGWSSSYVNNPGCPMQSYERIVPFSQDATSLKSTVTSLRARANTAIHLGMKWGVSML 313 Query: 308 SDPSFSWGVHKLIRTIVKTFAID---------------ENEMG-------------STAI 339 + V G ST Sbjct: 314 DPTFRPIVTAMIANNKVDPEFAGRPVAYNDPETLKTIVLMTDGQNVDTYRISDEFYSTPS 373 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Y +N + N KK+ TQ + +IC+ AK++GI Sbjct: 374 QIAHWDRYQLFFFTNNYIDRDIDQNYYYKKF-------TATQADTMLQSICDAAKAEGIL 426 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + TI F V+ + + +CAS S F EL++ F R++ Sbjct: 427 VWTIGFEVSNHAAGE----MLDCASSPSHFFRVEGVELSEAFASIARQINQLRLV 477 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 162 bits (409), Expect = 1e-37, Method: Composition-based stats. Identities = 63/494 (12%), Positives = 133/494 (26%), Gaps = 77/494 (15%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G I +++ +ML V G+ VD+++ ++ AI+ AS Sbjct: 3 FKREEDGAMTIFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAASDLDQPLPA 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-PRKSAYQVV 132 + + + + + I + NF + R ++ M V Sbjct: 63 DEVVDDYFAKAGMTEFLNDVRITPGSDLPTTNFRIVQAEARTRTPSIYMAMTGVRTLPVY 122 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +S + + + + L S K + + S +++ Y Sbjct: 123 VSGTAEETIEKIEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAAKTTSLNIVPY 182 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + NG + + + N Sbjct: 183 AGQTNPGRIVFERAGGLPFATFIEDSNGDEILYGQTIVDD--EGNSIDVPYNTMSSCLDL 240 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + A + + Sbjct: 241 TNSDFDNIDLPSGGY-------------------DQTPYFMNWPIDAPTMDWGWCPQNNS 281 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED---------------- 356 S + ++ F D T M+ + S+ + Sbjct: 282 SIRYAQNDAGRLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRNTFLALNAAGLVPDGFK 341 Query: 357 EVHRMKNNLEAKKYIVLLTDGENT--------------------------------QDNE 384 + +K+IVL+TDG+ T + Sbjct: 342 NRPADFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRTGDRDTYSTQSTNVT 401 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++CNKAK++GI + TIAF + + CA+ +FF ++ F+ Sbjct: 402 NFYSVCNKAKAEGITVYTIAFEAPADAVTQ----MRTCATSPAFFYKVEGVQIKTAFKS- 456 Query: 445 IGNEIFERVIRITK 458 I +I E +R+T+ Sbjct: 457 IARQINE--LRLTQ 468 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 162 bits (409), Expect = 1e-37, Method: Composition-based stats. Identities = 63/481 (13%), Positives = 133/481 (27%), Gaps = 43/481 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +K + G+ ++ + + ++ G+ VD R + A++ AA +A + S Sbjct: 10 LRRTAKAFHAADDGNIAVLFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATLMVSK 69 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD-------TA 119 + S + + K I N V T+ Sbjct: 70 DYAAGVIRASDIQATAEKYFKALYTSPGINNVTVTATYTARSANGSSTVVMNTSGSMPTS 129 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA-ETVSRSYHKEHGVS 178 +A SS L + + S K A +T + + Sbjct: 130 FLKVAGFTALPFTASSTSTWGATRLRVAMALDVTGSMDWDDKLTAMKTAAIKLVNTLKAT 189 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 D S++ + N + D S ++ + + Sbjct: 190 ASTDADVYISIIPFNVMVNVGTANKDAEWLDWDTDYGSCKSNRTTQNSCQ--------AA 241 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-KIDNVNDTVRM 297 + + + + + + + + S + T Sbjct: 242 GETWSWWANSCTSRYTRKSTCVAGGETWIPSGVSNWKGCVTDRTTSNDYDVIKTPPTTAT 301 Query: 298 GATFFNDRVISDPSFSWGV-----------HKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 AT F + S S + +K + G+T + A Sbjct: 302 PATLFLAKSYSACPLSLLPMKAAYSSNESDTSTAESTLKGKINKLDAEGNTNQPIGLFWA 361 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---------NEEGIAICNKAKS-- 395 + ++ + + I+LL+DG+NTQ + +C+ K Sbjct: 362 WMSLQTGVPLNTPAKDTEYKYTDAIILLSDGDNTQSGNSNSVSAIDARQKKLCDNIKDPL 421 Query: 396 QG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 G I TI + + L CAS FF++ + ++ F+ + R+ Sbjct: 422 NGTTTIFTIQVNTDGDD---ESAVLKYCASDGQFFQSTTADQIEIAFQSIGSSLTKLRLA 478 Query: 455 R 455 + Sbjct: 479 Q 479 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 162 bits (409), Expect = 1e-37, Method: Composition-based stats. Identities = 93/472 (19%), Positives = 174/472 (36%), Gaps = 73/472 (15%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + ++K G+F ++TA+ P++L GG+ +D+ ++ L+ A A Sbjct: 1 MRRRSMIGRSFITMLKDRGGNFGMMTAVAAPLLLAAGGVSIDMANMLMTKNQLQDATDAA 60 Query: 61 IITASVPL-IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR--- 116 + A+ L ++++ + + F K + + + + + Sbjct: 61 ALAAASALVSDEQPDIAAAKEIARKFLKTQAGGTTTPDAPADSGEGASSGAASSTPDWDD 120 Query: 117 -------DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 T + +QV + ++ N ++ R +G S ++ + AE+ + Sbjct: 121 VNTLEVNITETPNGTKGKIFQVTVINKRVTEFNAMT---RLLGTDSIELEASSTAESATE 177 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 S ++ +S+ V+D S SM +C Sbjct: 178 S---KNALSMYLVLDRSGSMAWKTNTINAAKKSC-------------------------- 208 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 P N S Y L+ +S + K ++ A+ ++ + D Sbjct: 209 -PNYTESNWSRYPNLW----------------ASSPCYVTKIDALKTAVTDLLAQLLVAD 251 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 VR A +N + + +WG + G TA A +TAY Sbjct: 252 PDQIYVRTAAISYNSVQDTAGTLAWGTSGA-----AAYVNALVATGGTASAGAFKTAYQK 306 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ--DNEEGIAICNKAKSQGIRIMTIAFSV 407 +I++ E+ H KN KY+V +TDGEN D+ C+ AK+ + I ++AF Sbjct: 307 VIAATENTAHAAKNGQVPSKYMVFMTDGENNYANDDTVTKQWCDTAKANKVEIYSVAFMA 366 Query: 408 NKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 E+ + L CA S + +FEA +L F+ IG V R+TK Sbjct: 367 ----PERGQALLKYCASSSSHYFEAEEVTDLVAAFK-AIGERAAAVVSRLTK 413 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 162 bits (408), Expect = 1e-37, Method: Composition-based stats. Identities = 91/465 (19%), Positives = 178/465 (38%), Gaps = 83/465 (17%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP--LI 69 ++ + +G+F ++TA+L+PV+LGV G +++ + ++ A +A + A+ L Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQNTADSAALAAATEARLR 70 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + F ++E+ L + L+KN N R Y Sbjct: 71 EGKLSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKN--------SPTRVTTTENARGKTY 122 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V + ++ + LNP+ F +G K+ + A++ +K +S+ +D S SM Sbjct: 123 AVETTIKHQIQLNPMLGF---IGAKTLDLSVTGTAKST---INKGAPISMYLALDRSGSM 176 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +C S N S Y L Sbjct: 177 SFKTDTVDTTKTSC---------------------------QNYTSDNWSKYPNLA---- 205 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN------VNDTVRMGATFFN 303 + + K ++ A+ ++ ++ K D ++ VR GA+ + Sbjct: 206 ------------KTSPCYVNKAASLKTAVGFLVATLNKADPTYTVNGGSELVRTGASVYT 253 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--EVHRM 361 S WG + + K I E G T ++ AY+ + +N D H+ Sbjct: 254 HETYVAQSIGWGTSGVTSYVDKQ--IPEFPSGGTDARSSLNAAYNALKKANPDEARYHKE 311 Query: 362 KNNLEAKKYIVLLTDGE--------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 K + ++YIVL+TDGE N+ ++ C AK GI+I ++AF + Sbjct: 312 KGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIKIFSVAFMA----PD 367 Query: 414 KARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 K + L CAS ++++ + ++ F + I + + +T Sbjct: 368 KGKSLLQYCASSADNYYAPENMEQIVTAFGE-IARKAAGSIATLT 411 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 65/455 (14%), Positives = 143/455 (31%), Gaps = 52/455 (11%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L K GH I+ A+++P + G+ + D R + ++ A++ A + S Sbjct: 2 RNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAH--ND 59 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + + T +Q + +Y+ + R + K + Sbjct: 60 PDQPDNGSYTPSTRNRQIVVDYVNAYISDIDAVTDIKVAKRRCELISGCVAGLYKGDARY 119 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L D+ S F + I+ ++++R Y E V + + DFS SMLD Sbjct: 120 -LEHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSE-AVDVMFAADFSGSMLD 177 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 S + + + + + M S + + Sbjct: 178 TWSGSSNPK---YIDLIEIIRNISVELQKFNDLPENRDKSTMGISAFSTFTNSFTSDTGI 234 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 S V+S + + + +A++ + D + GA D + Sbjct: 235 QCSLSQGVNSKNKPGNWFRPVKPANTVANIWN-----EKTEDYCKSGAY----AGFHDVN 285 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + + L G TA A+ + +++ + Sbjct: 286 LTSNFNSLNGQ-----VGSFYAGGGTASYQALIRGAQLLD-----------RGRNSRRLL 329 Query: 372 VLLTDGENTQDNEEG----IAICNKAKS----------QGI--RIMTIAFSVNKTQQEKA 415 ++L+DG + N +C + ++ + I ++ I F + Sbjct: 330 IVLSDGMDNDRNLANGLVSNGMCREIQAGLESDRTPDGRPIAAKMAVIGFDYDPFAN--- 386 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L +C + ++A E+ I + I E+ Sbjct: 387 -KALKDCVGEKNVYKAEDADEVEDIILELINEEVG 420 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 160 bits (403), Expect = 5e-37, Method: Composition-based stats. Identities = 90/475 (18%), Positives = 183/475 (38%), Gaps = 83/475 (17%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + + +++ + G+F ++TA+L+PV+LG G +++ + L+ A +A Sbjct: 1 MISSAYQKMTRRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQNTADSAA 60 Query: 62 ITASVP--LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + A+ L + + + F ++E+ L + L+KN + Sbjct: 61 LAAATEARLKEGALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKN--------SPVNIG 112 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + R Y + + Y + LNPL F G K+ + A + +K +S+ Sbjct: 113 TTDDARGKTYTIQTTINYQMQLNPLLGF---FGAKTLDLAATGTAVST---VNKGAPISM 166 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 V+D S SM + +C D N Sbjct: 167 YLVLDRSGSMSFKTDTLNTKKTSCQNYTVD---------------------------NWG 199 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN------VND 293 Y L ++ + K ++ A+ ++ ++ K D ++ Sbjct: 200 SYPNLK----------------NTSPCYVNKATSLKTAVGYLVATLNKADPTYTANGGSE 243 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR GA+ + + +WG + + K I E G T ++ AY+ + + Sbjct: 244 LVRTGASVYTHETYAAQPITWGTSSVATYVDKQ--IPEFPSGGTDARSSLNAAYNALKKA 301 Query: 354 N--EDEVHRMKNNLEAKKYIVLLTDGENTQD--------NEEGIAICNKAKSQGIRIMTI 403 N E + H+ K + ++YIVL+TDGE T + ++ C+ AK GI+I ++ Sbjct: 302 NTVEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKKDGIKIFSV 361 Query: 404 AFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 AF +K + L +CAS ++++ + ++ F + I + + +T Sbjct: 362 AFMA----PDKGKSLLQHCASSLDNYYAPENMEQIVTAFGE-IARKAAGSLATLT 411 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 160 bits (403), Expect = 6e-37, Method: Composition-based stats. Identities = 62/477 (12%), Positives = 133/477 (27%), Gaps = 48/477 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+F I+ V+ G VD+ + + L+ A+ + + L + Sbjct: 14 FARHSGGNFAILFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTGVI 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + +K F N + + ++ V+ + Sbjct: 74 KEADASKAVQNFL------------VANSMAGILQPD-QIVLDRLVVDRTANTVQADAHV 120 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + R + L K + + D + Sbjct: 121 DVALFFPVFGMGNTQRVTASTTSLYSDKTIEVAMMLDVTGSMAANWWAKTDKIGDLQAAA 180 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + L+ P + V+ + V + Sbjct: 181 STAVENLLDNNIDPNNPRVRVAIVPYAEAVNTGGLADSVFVEQAGGSNLPPPVPSAGAPI 240 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 V + ++K A S + N + + +DR+ + P Sbjct: 241 PVGSSVTLRPDKCATERKDKDGYADYSSDGPSELRRNNQNQEYLAKVNRDDRMGTCPKPE 300 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--------DEVHRMKNNL 365 + + D G TA A+Q Y + S +N Sbjct: 301 LIPLTADKQKLLDTIADFKAAGVTAGGIAVQWGYYMLSPSWRSTIVNARLGSGPANFDNR 360 Query: 366 EAKKYIVLLTDGENTQ-----------------DNEEGIAICNKAKSQGIRIMTIAFSVN 408 + K +L+TDG+ +IC+ K GI I TI F ++ Sbjct: 361 KVGKVAILMTDGQFNTAFAAGRGAPRSQNAGQMSRSNAESICDNMKRDGIEIFTIGFDLD 420 Query: 409 KTQ-----QEKARYFLSNCASPN-----SFFEANSTHELNKIFRDRIGNEIFERVIR 455 +++A+ L +C++ + ++EA + EL++ F + N + + Sbjct: 421 DPSMTSTERDQAKSVLQDCSTADTSTLKHYYEAATGPELDEAFNAIVQNIERLTIAK 477 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 159 bits (401), Expect = 9e-37, Method: Composition-based stats. Identities = 62/469 (13%), Positives = 141/469 (30%), Gaps = 50/469 (10%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ G I ++ P + + + ++ + + L A + A + ++ Sbjct: 13 HFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALAIENN 72 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + +E N N + +V + N Y Sbjct: 73 EIPDEPQQIK----------------NNALVLSYVNAYLPSKKFLVPIININDNTHYLEY 116 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ Y S F ++ + A + + +V D+S SM Sbjct: 117 NAAVTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASE---PTDVIFVADYSGSM 173 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-P 248 L +++ + K + I P+ + ++ Sbjct: 174 LYNFNENKPRDHERIDALRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTY 233 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVR----DALASVIRSIKKIDNVNDTVRMGATFFND 304 S + + K+ D + +I K ID++ + +D Sbjct: 234 CHFPFSPKIHKPKGNYLSDEIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPMSD 293 Query: 305 ----RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 V S ++ + + I+ G T I+ + +A + + ++ Sbjct: 294 VRFGDVCLQGSNAYSLEQEQYINNIDNIIEMEPHGWTLISSGILSANNIFKNKAKNG--- 350 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEG-----------IAICNKAKSQGIRIMTIAFSV-- 407 KK +++L+DG +T D +C + K I++ IA + Sbjct: 351 ------HKKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSP 404 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + E C ++++EA++ HEL + + V R Sbjct: 405 DNNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQQAVSGSTTREVGRN 453 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 159 bits (401), Expect = 9e-37, Method: Composition-based stats. Identities = 62/500 (12%), Positives = 137/500 (27%), Gaps = 63/500 (12%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G+ +ITA ++P +L + G+ +D+ + ++ A +A++ ++ Sbjct: 14 LRSAEGGNVAMITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSAVLAGALGRQAGNT 73 Query: 74 EV----SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + F Q + N + T + Sbjct: 74 AAETTLDVQTYALALFTDQGGGLDCDPVAVTFDETNLDILGTVRCRQPTYLSSLIGHDEL 133 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + ++S + L + S + + + + D Sbjct: 134 EFNVASTSTYGVGKLDVAFIFDVSGSMNSYNRLAQLKTAAVAAVDELLPDSRERD---GT 190 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV------SCNKSLYYM 243 + S LN T S +G + + + K +Y Sbjct: 191 VRLAIASYNHSLNAGAYIGAVTETVTLSADGSNSTALSRYNSHNTKRMIDQDSGKRFFYY 250 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVI-------------KKKHLVRDALASVIRSIKKIDN 290 + + + R DA I + Sbjct: 251 QSGTCSSWNCGKYSSWSWDTKRRFFDDTGLADACVYERTGTQAATDAAPGSGAWIGAGNP 310 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGV-------------------HKLIRTIVKTFAIDE 331 ++ ++ + + G +T++K Sbjct: 311 RWSFYAGSSSKYDGWQNVENQNATGYGVGAYEGRHGTCMPSGPVPLTEDKTVLKDHVNAL 370 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRM-----KNNLEAKKYIVLLTDGENT------ 380 G TA + + + + + K ++L+TDG+ Sbjct: 371 VAEGGTAGHLGIAWGWYLVSPEWAAIWPEASEPLPYRQPQTSKAVILMTDGDFNIEHPTA 430 Query: 381 --QDNEEGIAICNKAKS--QGIRIMTIAFSVNKTQQEK--ARYFLSNCASPNSF-FEANS 433 + + +C+ K+ + I+I T+ F V + Q R L CA+ S F A+S Sbjct: 431 SRDSFRQSMDLCDGMKASSRRIQIYTVGFQVPSSVQRTGDGRTILEYCATSPSHAFSADS 490 Query: 434 THELNKIFRDRIGNEIFERV 453 EL +++R + R+ Sbjct: 491 GEELIEVYRSIARSISDLRL 510 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 82/507 (16%), Positives = 142/507 (28%), Gaps = 103/507 (20%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I Y + G I+T LL+ ML +GGM VD +R+ L+ + A++ A+ Sbjct: 19 IRYLHAFGRDEDGSVIIMTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAASL 78 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + A + + Sbjct: 79 NQTLDSRDVVEDYFAKAGFPNAL--------------------------VGAPIVVDNGN 112 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + V + S L+ + +LR G+ ++ A V I V+D S Sbjct: 113 SRTVTVRS----ALDVNTFYLRLAGMDRLTAPARSSATEGVGK------VEISLVLDISG 162 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM R Q A + G V + + + +M Sbjct: 163 SMRFSNRFVNMQAAA----IAFAEEVLDPANGGTVSLTIIPYAGATNPGPEMFAFMGGVR 218 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF------ 301 D L+ + + + + + + + S S V + Sbjct: 219 YPDTLLAGDDGILGTEDDYFFPQVSSCVEMVGSDWSSAGLPGAGRAQVPHFQVWDIARSV 278 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE------ 355 + S ++F T + AM+ A T+ S++ Sbjct: 279 MDWGWCPQDRSSIQYAMATPAQARSFINGLRMHDGTGTHYAMKYALATLDPSSQPAFMHL 338 Query: 356 ------------DEVHRMKNNLEAKKYIVLLTDGENTQ---------------------- 381 ++ E KK IVL+TDG+ TQ Sbjct: 339 SHPGRGLVPPQFANRPAAWDDPETKKIIVLMTDGDITQQERPRIAQQERDIDYIISRSIN 398 Query: 382 -------------DNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + AIC A ++ + + T+AF V + + NCAS Sbjct: 399 GRDNRGQFVDAATNVGRFEAICTLANQPARSVDVYTVAFEVQPNSAADLQ--MRNCASDP 456 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERV 453 S F S EL +F R+ Sbjct: 457 SMFFRTSGAELIDVFSGIAERITDLRL 483 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 51/471 (10%), Positives = 117/471 (24%), Gaps = 29/471 (6%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 K ++ G I A ++ V++ +GG +D+ R L A A ++ + Sbjct: 2 IKALISKFNRNQDGSILPIFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVAT 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L ++ + + + + + + D + + Sbjct: 62 DLSTTVLRDNQIKTRIENSFRANLSD--AEFLDQAIDNLDFDVDSNAGTVTVSSSAGLNN 119 Query: 127 SAYQVV---------------LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + S+ + + L L S A + Sbjct: 120 YFLNIPGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTGSMAGDMGALRDAAEEVV 179 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + + + N + G + + Sbjct: 180 DILIEDDASNSASKV-RISLVPYSQGVNLGSYASTVTNGSTSWRNCVNEREGQQKYTDAV 238 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 Y S Y+ S+ + ++I I+ + + Sbjct: 239 YNYDGTNSEYFHGLQSYFIWDYGSSENWSSARDDCPSSSLQPLTSDKNTLISDIRNLSSG 298 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLI--RTIVKTFAIDENEMGSTAINDAMQTAY-- 347 T + +S S T D + + Y Sbjct: 299 GGTGGQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNGTPDDDVKKFALIMTDGDFNAQYGK 358 Query: 348 -DTIISSNEDEVHRMKNN-LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + + N ++Y N +C+ K++ I I T+ F Sbjct: 359 EERTTCTGRGRNRVCTTNEYWVERYHRYSDY--NDPPATRARTLCDAMKAENIEIFTVFF 416 Query: 406 SVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + +S CAS + ++EA++ EL F + + + Sbjct: 417 --DTGGSAFGDDLMSYCASGSDYYYEADNKDELITAFSNIAKRIQQIYLSQ 465 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 158 bits (399), Expect = 1e-36, Method: Composition-based stats. Identities = 61/472 (12%), Positives = 135/472 (28%), Gaps = 52/472 (11%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ G I +++P + + + ++ + + L A + A + ++ Sbjct: 13 HFTLFKKNEQGTILISFMIILPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIENN 72 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 +E N N V + N Y Sbjct: 73 AIPDEPQQIK----------------NNALVLSYANAYLPSKEFSVPIININDNTYYLEY 116 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ Y + ++ + A + + +V D+S SM Sbjct: 117 NAAVTMAYPAKFLTQTSLTNAITDINITDNGVAIKNKAIEASDL---TDVIFVADYSGSM 173 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-P 248 L +E K + I P+ + ++ Sbjct: 174 LYNFDVNEPNDHERINALRSAFRKLHDIIMNNSNINAIGYIPFSWGTKRIVFENQQQKIY 233 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVR----DALASVIRSIKKIDNVNDTVRMGATFFND 304 S + + + KK D + +I K I+++ + +D Sbjct: 234 CHFPFSSKIYKPKGNYLSDEIKKSSNALLLLDYIGDIIDYDKTIESITGNAQPIDIPMSD 293 Query: 305 ----RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 V S ++ + + I G T I+ + +A + + Sbjct: 294 VRTKNVCLQASNAYSLEQEQYINNIDNIIKMEPYGWTLISSGILSANNLFKKEANNR--- 350 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEG-------------IAICNKAKSQGIRIMTIAFSV 407 +K +++L+DG +T + +C + S GI++ IA + Sbjct: 351 ------HRKLMIILSDGVDTYQDNFLPNKGLFISKTLVEKGMCERVISSGIQMAFIAIAY 404 Query: 408 --NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + E C ++++EA++ EL + + I V R T Sbjct: 405 SPDDDVNEPEYINWRQCVGKDNYYEAHNADELMRDIQQAISKSATSEVGRNT 456 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 158 bits (399), Expect = 2e-36, Method: Composition-based stats. Identities = 70/456 (15%), Positives = 141/456 (30%), Gaps = 42/456 (9%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K+ GH I+ A+++PV+ G+ + D R + ++ A + A + + Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--NDP 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRN---FENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S + ++ +YL ++ R DI ++ + + Sbjct: 60 NVNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRFF 119 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + S F + I + +++R Y E V + + DFS+SM Sbjct: 120 EYEVEALTT----QNSWFPGNNVISGFGDTFSTRGHSLARKYQSE-AVDVVFAADFSKSM 174 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + V S + + I D+K + + Sbjct: 175 EEPWTGG--RQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNY 232 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + ++ + + +S H KK ++ L ++ I+ ND+ + D V D Sbjct: 233 NSCFMKQDYFEKNSRDHRKKKYVDIKRTLNNIF-----IEKGNDSCGFKSDD-PDAVFHD 286 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + + TA + + + + + Sbjct: 287 IYLTNDFDTFNK-----EIMKFRPGNGTASYQGIIRSAQMLRKGTNSRRLLIIISDGNDW 341 Query: 370 YIVLLTDGENT---QDNEEGIAICNKAKSQ------------GIRIMTIAFSVNKTQQEK 414 Y E + +CNK + RI I F + + Sbjct: 342 YYPYSGYKETDKEIANKLVNAGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKN-- 399 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L NCA ++ F+A EL I EI Sbjct: 400 --KALLNCAGEDNVFKAQYRDELLDQILSLITEEIG 433 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 62/469 (13%), Positives = 141/469 (30%), Gaps = 50/469 (10%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ G I ++ P + + + ++ + + L A + A + ++ Sbjct: 13 HFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIENN 72 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + +E N N + +V + N Y Sbjct: 73 EIPDEPQQIK----------------NNALVLSYVNAYLPSKKFLVPIININDNTHYLEY 116 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ Y S F ++ + A + + +V D+S SM Sbjct: 117 NAAVTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASE---PTDVIFVADYSGSM 173 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-P 248 L +++ + K + I P+ + ++ Sbjct: 174 LYNFNENKPRDHERIDALRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTY 233 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVR----DALASVIRSIKKIDNVNDTVRMGATFFND 304 S + + K+ D + +I K ID++ + +D Sbjct: 234 CHFPFSPKIHKPKGNYLSDEIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPMSD 293 Query: 305 ----RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 V S ++ + + I+ G T I+ + +A + + ++ Sbjct: 294 VRFGDVCLQGSNAYSLEQEQYINNIDNIIEMEPHGWTLISSGILSANNLFKNKAKNG--- 350 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEG-----------IAICNKAKSQGIRIMTIAFSV-- 407 KK +++L+DG +T D +C + K I++ IA + Sbjct: 351 ------HKKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSP 404 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + E C ++++EA++ HEL + + V R Sbjct: 405 DNNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQQAVSGSTTREVGRN 453 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 65/455 (14%), Positives = 137/455 (30%), Gaps = 52/455 (11%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L K GH I+ A+++P + G+ + D R + ++ A++ A + S Sbjct: 2 RNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAH--ND 59 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + + T +Q + +Y+ + R + Sbjct: 60 PDQPDNGSYTPSTRNRQIVVDYVNAYISDVDAVTDIKVAKRRCELIPECVAGLYDGDMRY 119 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L D+ S F + I+ ++++R Y E V + DFS SMLD Sbjct: 120 -LEHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSE-AVDAMFAADFSGSMLD 177 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 S + + + + + + M S + + Sbjct: 178 TWSGSSNPK---YVDLIEIIRNISAELQKFNDLPENRNKSTMGISAFSTFTNSFTSDTGI 234 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 S V+ + + + +A++ D + GA D + Sbjct: 235 QCSLSQGVNGRNGPATWFRPVKAANTVANIWNP-----KTEDYCKSGAY----AGFHDVN 285 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + + G TA A+ + N ++ + Sbjct: 286 LTSNF-----NYLNGQVGSFYAGGGTASYQALIRGAQLLRKGNNS-----------RRLL 329 Query: 372 VLLTDGENT----QDNEEGIAICNKAKS----------QGI--RIMTIAFSVNKTQQEKA 415 ++L+DG + D +C ++ + I ++ I F N Sbjct: 330 IVLSDGMDNDTQLADGLVSAGMCRDIQNGLESDRTPDRRPIAAKMAVIGFDYNPFAN--- 386 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L +C + ++A E+ I + I EI Sbjct: 387 -KALKDCVGEKNVYKAEDADEVEDIILELINEEIG 420 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 157 bits (397), Expect = 3e-36, Method: Composition-based stats. Identities = 62/469 (13%), Positives = 141/469 (30%), Gaps = 50/469 (10%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ G I ++ P + + + ++ + + L A + A + ++ Sbjct: 13 HFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIENN 72 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + +E N N + +V + N Y Sbjct: 73 EIPDEPQQIK----------------NNALVLSYVNAYLPSKKFLVPIININDNTHYLEY 116 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ Y S F ++ + A + + +V D+S SM Sbjct: 117 NAAVTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASE---PTDVIFVADYSGSM 173 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-P 248 L +++ + K + I P+ + ++ Sbjct: 174 LYNFNENKPRDHERIDALRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTY 233 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVR----DALASVIRSIKKIDNVNDTVRMGATFFND 304 S + + K+ D + +I K ID++ + +D Sbjct: 234 CHFPFSPKIHKPKGNYLSDEIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPMSD 293 Query: 305 ----RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 V S ++ + + I+ G T I+ + +A + + ++ Sbjct: 294 VRFGDVCLQGSNAYSLEQEQYINNIDNIIEMEPHGWTLISSGILSANNIFKNKAKNG--- 350 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEG-----------IAICNKAKSQGIRIMTIAFSV-- 407 KK +++L+DG +T D +C + K I++ IA + Sbjct: 351 ------HKKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSP 404 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + E C ++++EA++ HEL + + V R Sbjct: 405 DNNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQQAVSGSTTREVGRN 453 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 157 bits (395), Expect = 5e-36, Method: Composition-based stats. Identities = 92/454 (20%), Positives = 181/454 (39%), Gaps = 59/454 (12%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M ++F FY KK I S +F II AL + L + G L+ V+ W Y +++++ A A Sbjct: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I+ + ++ +L + R ++ K+ + + R +N++K++ + Sbjct: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA--------- 111 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 V ++ ++N + + + + + ++ + Y+ + S Sbjct: 112 ----------VFYNTEIQNIVNSSRISM------THMANNRLDSSNNTIFYNMDVMTSYD 155 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + F +L Q + F R R L +V + S+ Sbjct: 156 YRLQFIEHLL---NQRYNQKIVSFIPALLRIEMGE---------RPIFLIELVVDLSGSM 203 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + P D + + K +++AL + SI + +V + V MG Sbjct: 204 HCAMNSDPEDVNSAPI------CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-------S 353 + RV + SWG K+ + + + +D + T AM+ AY + S + Sbjct: 258 GYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N +L +K+I+ LTDGEN + N I IC+KAK I+I+TI+ + + Sbjct: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 Query: 412 QEKARYFLSNCA-SPNSFFEANSTHELNKIFRDR 444 + L C SP + + L +F++ Sbjct: 376 ----QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 157 bits (395), Expect = 5e-36, Method: Composition-based stats. Identities = 70/456 (15%), Positives = 140/456 (30%), Gaps = 42/456 (9%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K+ GH I+ A+++PV+ G+ + D R + ++ A + A + + Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--NDP 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRN---FENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S + ++ +YL ++ R DI ++ + + Sbjct: 60 NVNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRFF 119 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + S F + I + +++R Y E V + + DFS+SM Sbjct: 120 EYEVEALTT----QNSWFPGNNVISGFGDTFSTRGHSLARKYQSE-AVDVVFAADFSKSM 174 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + V S + + I D+K + + Sbjct: 175 EEPWTGG--RQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNY 232 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + ++ + + +S H KK ++ L ++ I+ ND+ + D V D Sbjct: 233 NSCFMKQDYFEKNSRDHRKKKYVDIKRTLNNIF-----IEKGNDSCGFKSDDP-DAVFHD 286 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + TA + + + + + Sbjct: 287 IYLTNDFDTFNK-----EIRKFRPGNGTASCQGIIRSAQMLRKGTNSRRLLIIISDGNDW 341 Query: 370 YIVLLTDGENT---QDNEEGIAICNKAKSQ------------GIRIMTIAFSVNKTQQEK 414 Y E + +CNK + RI I F + + Sbjct: 342 YYPYSGYKETDKEIANKLVNAGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKN-- 399 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L NCA ++ F+A EL I EI Sbjct: 400 --KALLNCAGEDNVFKAQYRDELLDQILSLITEEIG 433 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 156 bits (393), Expect = 8e-36, Method: Composition-based stats. Identities = 60/538 (11%), Positives = 127/538 (23%), Gaps = 102/538 (18%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + ++ G ML VGG+ VD++R L+ A++ A+ Sbjct: 24 RSFLREEDGVLAKPMIGTFLAMLAVGGIGVDLMRMERDRTELQYTLDRAVLAAADLDQSL 83 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + +Q + + T D + + + Sbjct: 84 DADAVVLDYLTKAGLEQYYSDPDDQKGLGYKSVEATI----DTDFEAYLLKFAGGDNMSL 139 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK------------------ 173 +SR + ++ + + + S + + Sbjct: 140 YANSRAEEIIGSVEISMVLDISGSMNSGNRLVNLQAAAKSFVTQITSNTDVSNLSISIIP 199 Query: 174 ------------EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + D+S + + LN + + Sbjct: 200 YATQVNAGEKLLSKYTKVSQEHDYSYCVNFIKDQFSKHTLNQNEDLIRTAHFDTFTYSMN 259 Query: 222 VGIRD-------EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + R + P+ K Y+ + + S+L + + Sbjct: 260 MIDRPVCPTRPGSAILPFTNDAAKLHAYIDSLTASGNTSIDIGMKWGSALLDPTAQPVVN 319 Query: 275 RDALASVIRSIKKIDN---VNDTVRMGATFFNDRVISDPSF-----SWGVHKLIRTIVKT 326 VI + + +D ++ G+ + Sbjct: 320 ALVDDKVISENFRGRPKAYGSGDTLKIIILMSDGQNTNQYMVNEHRRDGISDVWYNEEAD 379 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-------- 378 + GS+ + S E HR K +D E Sbjct: 380 VFSVYDPSGSSGTGEDYYWYSSERFPSGRWEDHRFGEGTYKDKDCYYCSDKEEPGTSERL 439 Query: 379 --------------------NTQD------------------NEEGIAICNKAKSQGIRI 400 NT D ++ IC+ K QG+ + Sbjct: 440 TYQKLHARVSLASIARDLYRNTSDAWANWFTAGRTVNYRVAKDQHTKTICDITKDQGVIV 499 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ F L +CAS + F E++ F R +R+T+ Sbjct: 500 YSVGFEA----PSAGIKVLEDCASSPAHFFDVEGLEISDAFSSIAT---SIRQLRLTQ 550 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 156 bits (393), Expect = 8e-36, Method: Composition-based stats. Identities = 62/498 (12%), Positives = 142/498 (28%), Gaps = 83/498 (16%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 K + L++ +++ + G+ +DV R L A A ++ + Sbjct: 10 RPLLKGFTGDRKASILPVFGLMVVLIVVIAGITIDVSRTVNAREKLSFAIDAAALSVAAD 69 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L S+ K + + F D ++++ E K Sbjct: 70 LSTSVMSDEQIKAALADSFKANLAD-----------VEFLDEAIKNLSFVVDAENGTIKV 118 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + L + ++ ++++G +++ T ++ + V + V+D + Sbjct: 119 SSFATLDN---YFIDMGGYGMQALGPETFNFGTSSQVT------YSRFDVELALVVDVTG 169 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + T + S+ + + K + Sbjct: 170 SMRNDMDTLRDASKGLVNILIPETTEEADSKVRISLVPYSQGVNLGTYAAKVKGGVYGYA 229 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 ++E D + ++ + + +K D G + Sbjct: 230 DSSVCVTERQDYDDGEDIYKVRYTDMPYNYY------VKTDPPPKDVFYGGGSNRCSGTS 283 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH-----RMK 362 + R + D ++ G TA + +++I + D Sbjct: 284 KMIPLT-----ADRDTLLDAIADLDDNGGTAGQTGVVWGWNSISPNYSDVWPLASKPEPY 338 Query: 363 NNLEAKKYIVLLTDGENT------------------------------------------ 380 +N + K+ +++TDG+N Sbjct: 339 DNDDVLKFAIIMTDGDNNRFYEFVKEREECDWVYSRRYGWQWTCEMVSVNQWQERSESES 398 Query: 381 ---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ A+C K +GI I + F + +CAS ++++A S+ EL Sbjct: 399 YNNNSSKAQRALCQAMKDEGISIFGVYFGT--NDSSAGSKNMQSCASTGNYYKATSSDEL 456 Query: 438 NKIFRDRIGNEIFERVIR 455 F + V R Sbjct: 457 INAFANIAKKIQQIYVSR 474 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 156 bits (393), Expect = 9e-36, Method: Composition-based stats. Identities = 57/475 (12%), Positives = 131/475 (27%), Gaps = 37/475 (7%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 V + + ++ G + ++ M+ V G+ VDV+R ++Q + Sbjct: 13 VLRDRVRRHVVDFARAEDGVMLALVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDAST 72 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + A+ + I + + T Sbjct: 73 LAAAHKDNALDPKQVVLDYFDKAALASYISADD--ILVGGGETSTAVEVNLTAQVKTPFI 130 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + ++ V R + + L S + + + + ++ Sbjct: 131 RHLGNESFNVPARGRAEQAYGNSEVSLVLDISGSMDDNRRMSRLHRAANEFVDTVLTPDS 190 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 V S S++ Y D + + +P S Sbjct: 191 VDRVSVSLIPYTGDVNVGWDIFSRMNVRQLHDYSYC---------VQFTPDDFSTTAIDP 241 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 Y S + F S + A++ I ++ T Sbjct: 242 EDAYIQGQHFSHVDARFNYISCPTQSYETVTPFSQNNAALEAQINRLTGRERTSIHIGIK 301 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + ++ + + +IV D ++ + D + + + Sbjct: 302 WGAAMLDEAFRPLVNDLVDNSIVDEAFRDRPAPFTSNTLKVIVVMTDGMNTETKRIKEFA 361 Query: 362 KNNLEAKKYI-------------------VLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + + + + + T + N ICN AK+ GI I + Sbjct: 362 YDTPDMRAHWARHAMDDWDNDVDGSVEDHLFDTYYDTAIGNALLQNICNAAKANGIIIYS 421 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 I F +N ++ + +CAS S F ++++ F I ++ + +R+T Sbjct: 422 IGFEINNDAAQE----MEDCASSPSHFYRVEGVQISEAFSS-IAQQLKQ--LRLT 469 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 59/460 (12%), Positives = 130/460 (28%), Gaps = 36/460 (7%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + L + G+F I A+ + ++ G VD+ R E LK A + + Sbjct: 3 RIQKLLGALWQDRRGNFAAIFAIAIIPVVAAAGATVDISRAYIVESRLKAALDASALAVG 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++ + A++ F + + + N V + T + + Sbjct: 63 GATGMTTSQMQAMAQSFFNANYPASKLGVPGTLSVSQSGNVVSLSVHAQLPTTLMGVV-G 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + V +S+ + L + L S + + Sbjct: 122 INTLNVSATSQVTRMGKKLEVALVLDNTGSMASGGRMTVLKTAAKNLITTV--------- 172 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + + P + V ++ + ++ +P N + + Sbjct: 173 ------SAAATNPGDVKVAIVPFNVDVNIGTTNENVSWLHWDEFTPSGGGGNGNGNCNII 226 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 L + + + + + DA + N + N Sbjct: 227 QILLGLCNNNNNSNSHAGWEGCVMDRDQNYDAQNTFPPPNPGGSNATRYPASNSDSDNSN 286 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + G+T + ++ + Sbjct: 287 CNLQTIMPL---STNWSALNSHIDAMASAGNTNTTIGLAWGWNMLTQGGPLSS-AAAPAA 342 Query: 366 EAKKYIVLLTDGENTQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 K IV LTDG+NT++ N ICN K+ GI++ ++ Sbjct: 343 NLDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRVIEG------NA 396 Query: 417 YFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + NCA+ P ++ + EL +F + R+ R Sbjct: 397 TLIRNCATEPGMYYSVTTASELTSVFASIAQSLSNLRISR 436 >gi|197335948|ref|YP_002155278.1| hypothetical protein VFMJ11_0524 [Vibrio fischeri MJ11] gi|197317438|gb|ACH66885.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 463 Score = 153 bits (385), Expect = 7e-35, Method: Composition-based stats. Identities = 74/481 (15%), Positives = 141/481 (29%), Gaps = 66/481 (13%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K +GH I+ + +PV+ GV + D R + L+ AA+ A++ S Sbjct: 2 KLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVS------- 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S K + YL + + + E ++ TA + + YQV Sbjct: 55 -AYGEEDEVSTQTGKDYVAHYLH-DMSSLVDIKVEKLECSELPECTADDNDRPFVEYQVS 112 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++ +S F + + + +R + + I +++DFS SM Sbjct: 113 GRTKH------ISWFPGNDVTVGFGESFDVTGSSKARKFQSSQPMDITFILDFSGSMNYD 166 Query: 193 QRD---SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 S + S V + ++L Y S + + G Sbjct: 167 WEGHAPSYMEEEIPKVPGRYSPPSRLSDLKYVVQMVTDELQVYNNSTAGPKHRVAMTGYN 226 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +++E I K + K I + GA Sbjct: 227 RRTVNESSNGKFVIRDQRITKYNSDGYDAGDTFYPKKTI--NKQFMVKGAAARVPNGDEK 284 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ ++ G TA + A + + K+ Sbjct: 285 AEFTDIMYTSDFASFNHKIKSFEAFGGTASLQGIIRASQIVS----YHITNDGEEANPKQ 340 Query: 370 YIVLLTDGEN-TQDNEEGI-----AICNKAKSQ--------------------------- 396 I++L+DGE+ + +C+ ++ Sbjct: 341 LIIILSDGEDFNHYLGQTETLVDYGMCDNLRNAIEGGPVSSEDNKEDKIVFSSGSSSGLP 400 Query: 397 -------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +RI I F L NC + F AN+ E+ + + E+ Sbjct: 401 TNTGEDPSVRIAMIGFGDGYDIHAN--TGLLNCVGEENAFSANNKDEILNLIMSLVSEEV 458 Query: 450 F 450 Sbjct: 459 G 459 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 152 bits (384), Expect = 1e-34, Method: Composition-based stats. Identities = 97/461 (21%), Positives = 182/461 (39%), Gaps = 63/461 (13%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ + +LI TG+F I+TA+ +PV+ G+ VDV + L+QA A Sbjct: 1 MINGSGKFRSLIQLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQQATDAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + L S+ + + F ++ YL + T+ Sbjct: 61 ALATATALANGNATTSNAQQLATQFVTGQMSNYLSGDTNTADALK-----AGTTANVTSA 115 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + ++Y V +++ YD+ +N +S + +GIK+ + + + + S + K+ +S++ Sbjct: 116 TNSSGGTSYTVAVNASYDMSVNGMS---QLLGIKTMHVSAASTSTSGSAAAAKQAALSME 172 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +D S SML + + + Y N Sbjct: 173 IALDKSGSMLLNTD--------------------------VIDTSQKSCTQYYTEGNYL- 205 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + IKK ++ A+ +++ + D + VR A Sbjct: 206 ----------------YQYPKAKSPCYIKKIAALKTAVGTLLDQLDSADPKSQYVRTAAI 249 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 ++ V S + +WG + N G T + M AY + +S+E Sbjct: 250 AWSSEVDSSSALAWGTTTTRSNV----ISGLNANGGTESSAPMALAYKNVSASSEATAQA 305 Query: 361 MKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K N +K IVL+TDGEN + + +A C AK G+ I ++AF ++ + Sbjct: 306 AKGNTTFQKIIVLMTDGENNATSSDTKTLATCKAAKDAGVLIYSVAFMA----PDRGQTL 361 Query: 419 LSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L NCAS ++F+A +L F+ IGN+ +++ +TK Sbjct: 362 LKNCASSPSNYFDAQQMSDLIAAFK-TIGNQASKQITLLTK 401 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 151 bits (381), Expect = 2e-34, Method: Composition-based stats. Identities = 59/532 (11%), Positives = 124/532 (23%), Gaps = 83/532 (15%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ + ++L + G I ALL M + L+D+ R S H ++ A A + A Sbjct: 2 SRLTRFFRRLGRDDRGAIAIQFALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLMA 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK-KNFTDREVRDIVRDTAVEMN 123 + + E+ S +F + L + + + Sbjct: 62 ARSTAVTDAELESVGDPAFLAEIAGLNLGLSASNASFKAGAGNHIIGTATATVKPIIANL 121 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + +S L + + S + +T + Q Sbjct: 122 WTTDDFNLTATSDVVRSSKNLEVAVVLDITGSMSGSRITDLKTGASDLVDIVVKDQQAPF 181 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA----------------------------------- 208 +++ Y + A Sbjct: 182 YSKVAIVPYSVGVNVGTYADAVRGAVIARTITGVSKTNAAVVASAAHGFIVGDKVTISGV 241 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + + + N Y D S + + + Sbjct: 242 SGPTMLNGNTYNITAASADSFTINANTSNAPKYVSGGVATCDTSTNPGCLNFTFTSASNT 301 Query: 269 KKKHLVRDALASVIRSIKKID------NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 K+ + + + D V + + P+ + R Sbjct: 302 KETRTLSTCVTERTGTYAYTDIAPSIAPVGRNYPTTKLENSMQPNPCPTATITPLSSDRV 361 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK------KYIVLLTD 376 +K + GSTA + + + K +VL+TD Sbjct: 362 TLKGQINALSIGGSTAGQIGFAWGWYMVSPNFGYLWPNATQRPAPYNSKDLVKVVVLMTD 421 Query: 377 GENTQDN------------------------------EEGIAICNKAKSQ--GIRIMTIA 404 G + +C+ K + I T+ Sbjct: 422 GAFNTPYCKGVIAKDAGSGSGAVDDHINCVATNGDAFTQTRKLCDAMKDPSLKLTIFTVG 481 Query: 405 FSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIGNEIFERVIR 455 F V L CA+ + + EL F+ R+ + Sbjct: 482 FDVGGDANAVN--MLKYCATDAQHVYFPATGSELKTAFKSIAQEISSLRIAK 531 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 151 bits (381), Expect = 2e-34, Method: Composition-based stats. Identities = 66/576 (11%), Positives = 123/576 (21%), Gaps = 129/576 (22%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + I +G+ ++ + ++ G VD R A++ A + + Sbjct: 7 TQLRKSAARFIADRSGNIAVLFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTALMV 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD---------------- 108 + + S + + + E Sbjct: 67 AKDYSLNKISASEIDGKAKSIFSALYTNKSANSVEVVAVLTPNTGKGSTIKVDGTGKVPT 126 Query: 109 -----REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN--------------------- 142 + I + + +V L +N Sbjct: 127 DFMKLVNISQIDIGASSTTTWGSTRLRVALVLDTTGSMNDNGKIGALKTATQNLLTQLKD 186 Query: 143 ----PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 P +++ + + + + S S + Sbjct: 187 AAGKPEDVYVSIIPFSKDVNVGASNYTANWIDWTDWKSQPPVLDYAKSGSKSGLVDNVAW 246 Query: 199 ----------------------------QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 P++ T K + G + Sbjct: 247 KDVGPGSKCPFSDSKHGFVCTDRPATEANANKVSNIPSNGTRKGLICPSMDTGGPNGTTG 306 Query: 231 PYMVSCNKSLY------------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 P Y Y S + + A Sbjct: 307 PRNGLYYNGCYDSVQRTVASCTGYNSTECSCSGWPSVCKIYHTWRPADTVTPVTAADTAA 366 Query: 279 --------ASVIRSIKKIDNVNDTVRM--------GATFFNDRVISDPSFSWGVHKLIRT 322 V ND R A+ F IS + Sbjct: 367 TPSTSTWNGCVTDRGTYSGPSNDYDRNVSLPLSGVPASRFPAEQISSCAPKVTEMNNNWA 426 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN-LEAKKYIVLLTDGENTQ 381 + T +G T + + +++ +K+ + IVL++DG NT Sbjct: 427 TMNTTVDGLFPVGGTNQPIGLVWGWQSLVGGGPFPTPPVKDEQYTYQDIIVLMSDGLNTV 486 Query: 382 DN------------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 D C K+ GI++ T+ + N L NCA Sbjct: 487 DRWYGNGWDTNTSVDNRMYASATTGTCVNVKAAGIKVYTVHVNTNG---SPESTLLKNCA 543 Query: 424 SPN-----SFFEANSTHELNKIFRDRIGNEIFERVI 454 SP F S LN F RV Sbjct: 544 SPADDGGKEFQMVTSASGLNAAFNSIATKLTDLRVA 579 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 151 bits (381), Expect = 2e-34, Method: Composition-based stats. Identities = 81/379 (21%), Positives = 137/379 (36%), Gaps = 68/379 (17%) Query: 96 RNFENNLKKNFTDREVR---DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 RNF N+ V + ++YQV +S YDL +NPL ++++G Sbjct: 1 RNFVAGQMANYLQSGVDIKSATGVTVQTNTSGNSTSYQVTVSPSYDLTVNPL---MQAVG 57 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + + T+ + +S+ +D S SM + + Sbjct: 58 FTTQHLS--TSGTTIGGHSQTQGSISMYLALDKSGSMGEDTATVNEE------------- 102 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + Y CN + + + S + K Sbjct: 103 --------------DPTESYTYDCNGH---------YNKKGKWIYDTCTGSRANYYTKIE 139 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ A ++ + D VR GA ++ + + +WG + Sbjct: 140 ALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSGVSTY-----VNALQ 194 Query: 333 EMGSTAINDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVLLTDGENT------ 380 G T + AM TAY ++ + N ED H++K KKYIV +TDG+N Sbjct: 195 AGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTPKKYIVFMTDGDNNDDSSGG 254 Query: 381 -QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELN 438 + A C+ AKS+GI I TIAF + Q L CAS + +F+A +L Sbjct: 255 RSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQA----LLHYCASDDSHYFQAEKMEDLL 310 Query: 439 KIFRDRIGNEIFERVIRIT 457 F+ IG + ++ R+T Sbjct: 311 AAFK-AIGAKASSQLTRLT 328 >gi|59711129|ref|YP_203905.1| TadG-like protein [Vibrio fischeri ES114] gi|59479230|gb|AAW85017.1| TadG-like protein [Vibrio fischeri ES114] Length = 465 Score = 151 bits (380), Expect = 2e-34, Method: Composition-based stats. Identities = 74/481 (15%), Positives = 140/481 (29%), Gaps = 66/481 (13%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL K +GH I+ + +PV+ GV + D R + L+ AA+ A++ S Sbjct: 4 KLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVS------- 56 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S K + Y+ + N + E ++ TA + + YQV Sbjct: 57 -AYGEEDEVSTQTGKDYVAHYMH-DMSNLVDIEVEKLECSELPECTADDNDRPFVEYQVS 114 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++ S F + + + +R + + I +++DFS SM Sbjct: 115 GRTKHK------SWFPGNDVTVGFGESFDVTGMSKARKFQSSQPMDITFILDFSGSMNYD 168 Query: 193 QRD---SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 S + S V + ++L Y S + + G Sbjct: 169 WEGHAPSYMEEEVPKVPGRYSPPSRLSDLKDVVQMVTDELQVYNNSTTGPKHRVAMTGYN 228 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +++E I K + K I + GA Sbjct: 229 RRTVNESSNGKFVIRDQRITKYNSDGYDAGDKFYPKKTI--NKQFMVKGAAARVPNGDEK 286 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F+ ++ G TA + A + + K+ Sbjct: 287 AEFTDIMYTSDFASFNHKIKSFEAFGGTASLQGIIRASQIVS----YHITNDGEEANPKQ 342 Query: 370 YIVLLTDGEN-TQDNEEGI-----AICNKAKSQ--------------------------- 396 I++L+DGE+ + +C+ ++ Sbjct: 343 LIIILSDGEDFNHYLGQTETLVDYGMCDNLRNAIEGGPVSSEDNKADKIVFSSGSSSGLP 402 Query: 397 -------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +RI I F L NC + F AN+ E+ + + E+ Sbjct: 403 TNTGEDPSVRIAMIGFGDGYDIHAN--TGLLNCVGEENAFSANNKDEILNLIMSLVSEEV 460 Query: 450 F 450 Sbjct: 461 G 461 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 150 bits (378), Expect = 4e-34, Method: Composition-based stats. Identities = 98/455 (21%), Positives = 175/455 (38%), Gaps = 71/455 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L++ G+F ++TA+L+PV +GV G+ +D AL+ + A + A+ + + Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + A + +F ++ + R+ + T E V+ T ++N ++Y V Sbjct: 101 SEADAIA-LAKSFLSSQLANTMARDENTSSVDQITQAEPDISVKTT--QVNSSSTSYDVE 157 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L+ Y + +NPLS L + KA + + + E +S+ V+D S SM D Sbjct: 158 LTGSYTITMNPLSRVLGWE-----TVTLKAYGKAQAATTASESPLSMYLVLDRSGSMNDE 212 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + Sbjct: 213 TATTYTGTCTK------------------------------------------TTTSGYG 230 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +++ S S K ++ A+A + +KK D ++ VR GA +N + + Sbjct: 231 WNKKTTTTSYSCTKNYTKIESLKLAVADLAAQLKKADPNSEYVRTGADSYNASADTAQAM 290 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKY 370 SWG + T+ + G T A+ AY + +SN E H + + + +Y Sbjct: 291 SWGTANV-----VTYVNALSATGGTDARGALSAAYSALQTSNKTEITAHNVSSVSKIGRY 345 Query: 371 IVLLTDGENTQDNEEGIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 IV +TDGE T ++ + C K+ GI+I T+AF + LS C Sbjct: 346 IVFMTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAFMAPANG----KSLLSAC 401 Query: 423 ASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRI 456 AS ++EA L F + IG + R+ Sbjct: 402 ASDASHYYEATDAASLVAAFGE-IGKKATSTSTRL 435 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 149 bits (376), Expect = 7e-34, Method: Composition-based stats. Identities = 68/487 (13%), Positives = 146/487 (29%), Gaps = 53/487 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+F I+ L V+ G V+V + +L+ A+ + + L Sbjct: 14 FARDRGGNFAILFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTGAI 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN------PRKS 127 + + K+ F + +++ + L + +R + + D V++ Sbjct: 74 KEADANKSVQAFLDANSQAGILQADQIVLDRLIVNRTAKTVQADAHVDVGLYFPIFGTGD 133 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +V S+ + + + S + K + ++ K ++ D Sbjct: 134 MKRVAASTTALYSDKTVEVAMMLDITGSMAKRGKVDKIGDLKTAAKNAVQTMLQKQDPQN 193 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + P Y+ + + SP +V+ + Sbjct: 194 PRIRVA----IVPYASGVNAGKLAENVYAEKQASTELPPVAGSPLLVAKTGKNLLPSFSD 249 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND--- 304 +S ++K +A S + N +D Sbjct: 250 Y----ISIVGAAMPRPDNCATERKDKNGNADMSADGPDTVRTDGNGKKFYALVNRDDHLG 305 Query: 305 --RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + P + D G TA A+Q Y + + Sbjct: 306 DGDMNRCPDAKVIPLTADSDALLESIEDFRANGFTAGAIAIQWTYYMLSPQWRTAIRNAG 365 Query: 363 NNLEAK--------KYIVLLTDGENTQ---------------DNEEGIAICNKAKSQGIR 399 A K +L+TDG+ +C+ K+ GI Sbjct: 366 LGKGASDADPKKIAKVAILMTDGQFNTAFAGAGDSYNRQGTLARGNAETLCDNMKNDGIE 425 Query: 400 IMTIAFSVNK-----TQQEKARYFLSNCASPN------SFFEANSTHELNKIFRDRIGNE 448 I TI F ++ T++++A+ L +C+S + FF+ ++ EL+ F++ I N Sbjct: 426 IFTIGFDLDDKDMSTTERDQAKAVLKDCSSKDTSGAKRHFFDVSTGAELDDAFQEIIRNT 485 Query: 449 IFERVIR 455 + + Sbjct: 486 EKVALTQ 492 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 149 bits (375), Expect = 9e-34, Method: Composition-based stats. Identities = 69/534 (12%), Positives = 143/534 (26%), Gaps = 89/534 (16%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 V + + K+ G F +IT L + ++ G VD+ ++A++ A T+ Sbjct: 19 VMTSVIKEFLKQYRDDRRGQFAVITCLALVPLIAAAGGAVDLWNARRVQNAVQNAVDTSA 78 Query: 62 ITASVPLIQSLEEVSSRAK--NSFTFPKQKIEEYLIRNFENN---------LKKNFTDRE 110 + A + E RA IE+ + + Sbjct: 79 LAAVSYSGEEQTEREKRADTLFLNNTAGIAIEDTDLSEEDGAWVYKAEYKIKTNFLRVVG 138 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS-------------------- 150 + + ++ S VVL + + + Sbjct: 139 IDEFEMESQGAAALANSPMDVVLVLDSSGSMAQDNRMVELKASVKLFLEEFKSNDLTQVA 198 Query: 151 -MGIKSW-----LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + + + A +V+ +I +D + Sbjct: 199 LVPFDTQVKATSSLFGAAGNVSVANPLATGSCATISDPLDRDACYASQNAAPPVVDCSKL 258 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 D + ++ KVG Y+ + + + E + Sbjct: 259 TDLIDAVLCGVNNLGFKVGTTAITDLRYISDRRYDAFIDGNMFRITRKIGEADCSSVCTW 318 Query: 265 RHVIKKKHLVRDALASVIR----SIKKIDNVND----TVRMGATFFNDRVISDPSFSWGV 316 + + A ND F DR + + + Sbjct: 319 KKTYSTTTIFETAAGGGAPATSKPNDAETPNNDLIAQYPGPWPRCFVDRSQPYDANATAM 378 Query: 317 HKLIRTIV------------------------KTFAIDENEMGSTAINDAMQTAYDTIIS 352 + + + +T G+T + +Q + + + Sbjct: 379 NISQKDTIYPEAHCATGSLEPITGLTFDLQSVETAVNKLTPSGNTNVTIGVQWGMEALTA 438 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQD-----------NEEGIAICNKAKSQGIRIM 401 + ++ E +K +++LTDG NTQ+ + +A CN AK+ GI + Sbjct: 439 AAPLT--GVRTGSEVRKVMIVLTDGLNTQNRWWGSRDRNKIDARTLAACNNAKAMGIELY 496 Query: 402 TIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 T+ E L CA + + + S +L F D R+ Sbjct: 497 TVRL------VEGNEDLLKTCAETEDKYHYVTSASQLKTTFADLARQVKGVRLA 544 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 148 bits (374), Expect = 1e-33, Method: Composition-based stats. Identities = 63/486 (12%), Positives = 129/486 (26%), Gaps = 88/486 (18%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + G+ + A++ ++ G VD R + L+ A A + A Sbjct: 4 LSSLLARFRSDIQGNVAVTFAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAASVGAVS 63 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + I N+ +N + + T + P Sbjct: 64 RTSPAFIAA------------GAMTADGIITAGNDDARNIFNGNMNGTTGYTLNSVTPEV 111 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 VL++ + +F+ +GIK+ +Q ++A Y + ++D S Sbjct: 112 KKTGSVLTATVSFSASVPMMFMNIVGIKTMTLQGMSKATASMPKY-----IDFYLLLDNS 166 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM + + A Y + Sbjct: 167 PSMGVAATPDDVTKMVNATSDAKYGSNRYCAF---------------------------- 198 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN--D 304 +F + + V + ++R A ++ + + + RM F Sbjct: 199 -ACHDYNDSNNFYNLAKSIGVTTRIDVLRSATQQLMDTATQTQTYPNQFRMAIYDFGAAS 257 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA--INDAMQTAYDTIISS--NEDEVHR 360 + I + L ID + T + + + NE Sbjct: 258 KTIGLRALFALSANLSSAKSAAGNIDLMGVYGNNDAYTADKDTPFTAVFPAVNNEISTPG 317 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGI---------------AICNKAKSQGIRIMTIA- 404 KY+ ++DG + N + A+C K++GI+I + Sbjct: 318 DGTTGSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTTLKNRGIKIAVLYT 377 Query: 405 -----------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDR 444 F+ + CASP +FE + T + Sbjct: 378 TYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACASPGFYFEVSPTQGIADAMNAL 437 Query: 445 IGNEIF 450 + Sbjct: 438 FKKAVA 443 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 61/586 (10%), Positives = 135/586 (23%), Gaps = 143/586 (24%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + K+ + +G+ ++ L +P++ G VD+ + L+ + A + A Sbjct: 11 FRKRFWTAKSGNVAVVFGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAAGREYS 70 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA---------- 119 + ++ F + + + D V Sbjct: 71 NNQQDSHLSKVAEGYFFENAGADARANTDFSYDGIFNEDGSTVLQVSAARRHPTIFGDLL 130 Query: 120 ---VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 A+ + S+ + + L + S + K+ KE Sbjct: 131 SFVTAGELDWRAFPLAARSQIVVQNQSIELVMVLDNSGSMTGRPKSGGGKRKIDTIKEAA 190 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + + + + F + ++ ++ R + Y+ Sbjct: 191 IGLTGQFLK-GAASSTLKLPVQFGVVPFAAAVNIGPDNHDAKWMDTKGRSSIHNEYLDWA 249 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV--RDALASVIRS---------- 284 N + P+ + S + ++ L + Sbjct: 250 NWKTSRGVKLAERSPTGDYWQEISSKTPLTRFFVYENAHHKNELGPWLGCVESRPNGLAI 309 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGV---------------------------- 316 N + + F Sbjct: 310 TDAEPNYANPDTLFVPSFGPDEYDGSKGDNDYLEDEGRRSMPAETAMSVQAKVAKYFDGS 369 Query: 317 ---------------------HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + + + G T I + + + + + Sbjct: 370 DLQRGNHPGPNRGCLSTPVTPLTDNQATINAAINAMDADGETNIPEGIAWGWRLLSAREP 429 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQD--------------------------------- 382 R + + K +VL+TDG+N Sbjct: 430 FTQGRANDAKDNLKVLVLMTDGDNNYGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFD 489 Query: 383 ------------------NEEGIAICNKAKS--------QGIRIMTIAFSVNKTQQEKAR 416 NE+ AIC K GI I TIAF + + Sbjct: 490 DTSTTSKRANRSSFVSAMNEKVAAICQNIKDDGRKATGEDGIVIFTIAF--DLNDGSSVK 547 Query: 417 YFLSNCASPN-------SFFEANSTHELNKIFRDRIGNEIFERVIR 455 + CAS +++A S+ +L F R+ + Sbjct: 548 KLMEQCASYGITDPTKKLYYDAKSSSDLMAAFDSITEQVSSLRIAK 593 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 74/470 (15%), Positives = 135/470 (28%), Gaps = 87/470 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALL+ + G GM VD L AA A + + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ S ++ F + + TD + + T L Sbjct: 65 AVAAAMTMSGNGTISLGKDDARNIFMSQMSGELTDVHIDLGINVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R +G S I + Y + ++D + SM Sbjct: 116 NSQVSFSATVPTTFMRILGRDSITISG-----AATAEYQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 ++ L A + S N Sbjct: 171 TANDVSKLQAKTGCAFACHQMDQSTNNYTI------------------------------ 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--PS 311 + V + +VR A ++ + K +D RMG F + + Sbjct: 201 --------AKGLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE---DEVHRMKNNLEAK 368 S L + T +D + N T++D+ ++ D +N+ + Sbjct: 253 ISSPTSDLTKVKNYTDTVDLMTIPYQNYNQDQLTSFDSALTQMNTIIDPAGDGTSNISPE 312 Query: 369 KYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MTIAFSVNKT 410 K + ++DG + C K +G++I T + Sbjct: 313 KILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSN 372 Query: 411 Q---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 Q + + CASP +FE T + + I Sbjct: 373 DWYNKWISPFQSEIPTKMQACASPGFYFEVTPTEGITDAMKALFLKVIRS 422 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 65/488 (13%), Positives = 145/488 (29%), Gaps = 55/488 (11%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+F ++ V+ G V++ + + +L+ A+ + + L + Sbjct: 14 FARDRGGNFAVLFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTGVI 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN------PRKS 127 + + + F +++ + L K D+ + + + V++ Sbjct: 74 KEADADNSVKAFLVANSAAGILQPDQVVLDKLIVDKTAKTVQANVHVDVALYFPLFGIGD 133 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +V S+ + + + S + + R+ + ++ D R Sbjct: 134 MQRVAASTTALYSDKTVEVAMMLDITGSMAKRGNVDKIGDLRAAARNAVQTMLQNQDPKR 193 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + P Y+ G + S +V+ + Sbjct: 194 PRIRVA----IVPYASGVNAGKLAENVYAETQGSSELPPVAGSSLLVAKTGKALLPSFSD 249 Query: 248 PLD--PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN----VNDTVRMGATF 301 + + ++ ++ L D +V D G Sbjct: 250 YISIVGAAMPHPDNCTTERKNKNGDADLSADGPDTVRTDRNGKKYYALVNRDDHLDGGGM 309 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + D G TA A+Q Y + + + Sbjct: 310 NRCPDAEVIPLT-----ADSDALLDSIDDFRAAGYTAGAIAIQWTYYMLSPQWRAAIKNV 364 Query: 362 K--------NNLEAKKYIVLLTDGENTQ---------------DNEEGIAICNKAKSQGI 398 N + K +L+TDG+ A+C K+ GI Sbjct: 365 GLGNGASDANAKKIAKVAILMTDGQFNTAFAGAGGSYNGQGDLARGNAEALCGNMKNDGI 424 Query: 399 RIMTIAFSVNK-----TQQEKARYFLSNCASPN------SFFEANSTHELNKIFRDRIGN 447 I TI F +N T++++A+ L C+S + +FEA++ EL+ F++ I N Sbjct: 425 EIFTIGFDLNDKDMSATERDQAKAVLKGCSSKDASAAERHYFEASTGAELDAAFQEIIRN 484 Query: 448 EIFERVIR 455 + + Sbjct: 485 TEKVALTQ 492 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 147 bits (370), Expect = 4e-33, Method: Composition-based stats. Identities = 58/492 (11%), Positives = 138/492 (28%), Gaps = 107/492 (21%) Query: 2 VFDTK-FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 + DT+ ++ + +G+ +I A+ + +LG G +D + L+ A A Sbjct: 4 LLDTRHIGRLPRRFGRDRSGNIAVIFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAA 63 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ +V + + ++ F I Y + + Sbjct: 64 VL-LAVSNSEINRTTAQAKADAEQFFNATIGAYGLTATIKIEVTENDGKR---------- 112 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 S+ D + FL +G + I ++ + Y Sbjct: 113 -------------SATADFTSTVTTNFLNLIGYPTLAIGNRSTSTVSRPIYQ-----DFY 154 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 ++D S SM ++ + Sbjct: 155 LLLDNSPSMGVAATTADIATMVGNTSDKCA------------------------------ 184 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 ++ + + V + +VR A+ + + + VN+ RM Sbjct: 185 ------FACHDLSDSNNYYNLAKKLGVKMRIDVVRQAVQQLTSTATLMTAVNNQFRMAVY 238 Query: 301 FFNDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 S + + + AID + N+ T +++ +++ + Sbjct: 239 TLGGSCASLGLTTIASLSSAMSSVQTAAGAIDLMSIPKQNYNNDQCTDFNSALAAMNTTI 298 Query: 359 HRMKNNL--EAKKYIVLLTDGENTQDN----------------EEGIAICNKAKSQGIRI 400 + +K++ ++DG +N + C K +GI+I Sbjct: 299 PSSGTGTAAQPQKWLFFVSDGVADFNNPSGCTQPTVSGGRCQEPLTVTQCKAMKDRGIQI 358 Query: 401 MTIA------------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNK 439 + F+ + +CASP+ +FE + T +++ Sbjct: 359 AVLYTTYLALPTNQWYNDHIAPFNAGPYGPSVNSQIAAKMKSCASPDFYFEVSPTQGISE 418 Query: 440 IFRDRIGNEIFE 451 + + Sbjct: 419 AMDALFKKAVAK 430 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 147 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 84/444 (18%), Positives = 175/444 (39%), Gaps = 62/444 (13%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE--EVSSRAKNS 82 +TA+++ V +D+ Y + ++ A A+++ ++ + +++ + Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 T K++I+++L + + +++N D + + T + NP Q + S+ + Sbjct: 61 STIFKKQIKKHLKQG--SYIRENAGDIAQKAQINITKDKNNP----LQYIAESKAQYEIP 114 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 +LFL+ + I S L + + + +SI V+D SRSM D Sbjct: 115 TENLFLKGL-IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND--- 170 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 ++ NK L + + + Sbjct: 171 ----------------------------NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFFNDRV--ISDPSFSWGVHK 318 +K ++ ++ +++ SI+K + N +VR+G +N + S +++ Sbjct: 203 PAPA-NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + N +T AM AY + + E H + KK+++ +TDGE Sbjct: 262 VKSR-----LNKLNPYENTNTYPAMHHAYRELYNEKESS-HNTIGSTRLKKFVIFITDGE 315 Query: 379 NT-----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEAN 432 N+ Q+ + IC ++ G++I ++A S Q+ L C S FF N Sbjct: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD----LLRKCTDSSGQFFAVN 371 Query: 433 STHELNKIFRDRIGNEIFERVIRI 456 + EL + F D+I ++I E+ +RI Sbjct: 372 DSRELLESF-DKITDKIQEQSVRI 394 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 147 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 76/481 (15%), Positives = 135/481 (28%), Gaps = 99/481 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM ++G GM VD L AA A + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + + + V + T L Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSGELAEVHVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F++ G S I T + Y + ++D + SM Sbjct: 116 NSQVSFTATVPTTFMQIFGRDSITISG-----TATAEYQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S+ L A + S N Sbjct: 171 TPSDVSKLEAKVGCAFACHQMDKSTNNYTI------------------------------ 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-----IS 308 + V + +VR A ++ + K +D RMG F + + Sbjct: 201 --------AKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGS-----TAINDAMQTAYDTIISSNEDEVHRMKN 363 + + K+ + T + AM I + + + Sbjct: 253 ISGLTSDLTKVKSYTDAVDLMTIPYQNYNSDQITNFDSAMTQMNTIIDPAGDGT-----S 307 Query: 364 NLEAKKYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MTIAF 405 N A+K + ++DG + C K +G++I T Sbjct: 308 NTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYL 367 Query: 406 SVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + Q + + CASP +FE + T + + I R RI Sbjct: 368 PLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDGITDAMKALFLKVI--RAPRI 425 Query: 457 T 457 T Sbjct: 426 T 426 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 146 bits (368), Expect = 6e-33, Method: Composition-based stats. Identities = 72/483 (14%), Positives = 129/483 (26%), Gaps = 42/483 (8%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ ++S G+ +I L + ++ G +D + + + A +A++ A++ Sbjct: 9 RYFVNAFLRSRGGNTTMIFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAAIV 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + ++ A + F + K T + D + Sbjct: 69 NNSNEGDLKKLAADVFKENLGAADLDAKVTAFKYDAKARTVKATAQGSYDPVIMQLFGFK 128 Query: 128 AYQVVLSSRY-DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 ++S L + L S + + VS D Sbjct: 129 NLPYAVTSDAIKAADGTLEVALVLDNTWSMSATVNGTPKIDILKTAAQGLVSTILTKDNK 188 Query: 187 RS----MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV-------- 234 ++ Y N ++ Sbjct: 189 DYVKIAVVPYADYVNVGMANRNMPWVSVAADYSTTSTKTCKTVSTATQCTGGTKGTCTGN 248 Query: 235 --SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + + S K V++ +AS + D Sbjct: 249 QDGVPYTYSCWIVAQTCKTVNVTPYQSCSGGGTTNYKWYGCVKNQVAS--SKVVMPDPTT 306 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + T + S + TI T I M + + Sbjct: 307 PYGGLVQTSQ-TCLNPILPLSNDATVVTNTIKGLVVNIGGYKPETYIPGGMIWGVNALTP 365 Query: 353 SNEDEVHRMK--NNLEAKKYIVLLTDGENTQDNEEGIAI------------------CNK 392 + NN E +K IVL+TDG NT I C+ Sbjct: 366 PAPFTEGKPYDANNKEPRKTIVLMTDGANTLYANTSGGIAVANATQVAVTYSDQIRVCDY 425 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFE 451 AKS+ I I TI F V KA L CA+ +F+A S+ +L K F G Sbjct: 426 AKSKKIEIYTIGFDVTD---SKALSTLKACATDAQHYFDAKSSADLIKAFETIGGKLSKV 482 Query: 452 RVI 454 R+ Sbjct: 483 RLT 485 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 146 bits (368), Expect = 7e-33, Method: Composition-based stats. Identities = 64/520 (12%), Positives = 120/520 (23%), Gaps = 84/520 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + +G T ML VGG+ VD++R L+ A++ A+ Sbjct: 23 LRSFRSDESGALAYPTIAFFLAMLAVGGVGVDLMRLERDRTVLQYTLDRAVLAAADLDQT 82 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 V + + + Y E L + ++ S Sbjct: 83 QEPAVVVQDYLNKAGLGEY---YEAPEVETGLGYKKVKATIDATFDAHLLQ-FAGGSDLP 138 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSW-----------------------LIQTKAEAETV 167 V SS + ++ L + L S + V Sbjct: 139 VYASSTAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAARDFIDTMVENTTDGRMSISIV 198 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQ---------------RDSEGQPLNCFGQPADRTV 212 + + S + SE + F Sbjct: 199 PYATQVSVSDELFDEYTTSGTNNFANCINFETSDYSTTALSTTSERERTMHFSPWYTSNT 258 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV------------- 259 ++ S D + + + L + + Sbjct: 259 RASGSPIDYEICDDRSSREILPLQKDATTLKSFITNLTAWGNTSIDIGMKWGVALLDPSA 318 Query: 260 ---------------DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR----MGAT 300 + S ++ + V D R Sbjct: 319 RPAISSLASGASVPSEFSVRPVDYSDPDTLKIIVLMTDGQNTSQYYVEDDHRAGDSNVWY 378 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ--TAYDTIISSNEDEV 358 F+ S + G + + + T+ + + Sbjct: 379 DFSANRYSTYNPDNGYYWRDGYSYWYSSPYGGNNAQQLSYPDLFAYTSLKYLYKYIYADW 438 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + Y V G N+ N +C+ AK+QGI + TI F Sbjct: 439 MGSYSARSEWYYGVYDYHG-NSTKNTRTSNVCSAAKAQGIIVYTIGFEAPSNGVA----V 493 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L +CAS +S + E+ F R +R+T+ Sbjct: 494 LQDCASSDSHYFDVDGLEIRDAFESIAT---SIRKLRLTQ 530 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 146 bits (367), Expect = 8e-33, Method: Composition-based stats. Identities = 57/457 (12%), Positives = 127/457 (27%), Gaps = 65/457 (14%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + IK +GH ++ A+++P + G+ + D R + ++ A++ A++ + Sbjct: 2 RKIKKQSGHAALLFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAH--NDD 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR---EVRDIVRDTAVEMNPRKSAY 129 + S A + +Q +YL ++ + I A + Sbjct: 60 NQDSQGAGSGSRVNRQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAECRAGLARGEPRFF 119 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 Q + ++ + F + I+ + A+ V+R Y E V I +V D+S SM Sbjct: 120 QYEI----EVSSVQDTWFPGNDSIEGFGDTFSAKGAAVARKYQSE-AVDIIFVSDYSGSM 174 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + N + V YY Sbjct: 175 AWNWSGGRNRKYIDLRNIIQEVTDELQKFNDLNNTDNN-----TVGLTAFNYYTKTV--P 227 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + + VR+ V R + D Sbjct: 228 SNRSNHCFMTQLVNPNGRFSASQTVRNIF------------VEKNNRYCVNHGDSSRFQD 275 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + ++ TA + + ++ Sbjct: 276 LPLTDNYSSFNNSVRS-----FYPNHGTASFQGIIRGAQMLR-----------KGRNPRR 319 Query: 370 YIVLLTDGENTQDNEEGI----AICNKAKSQ------------GIRIMTIAFSVNKTQQE 413 +++L+DGE+ + +C+ + R+ + F + + Sbjct: 320 LLIVLSDGEDGDPSRHMQLVNAGMCSTIVNTLSGDLTPDGHKVKARLAVVGFDYDVNKN- 378 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L C + ++A + ++ + I EI Sbjct: 379 ---RALQKCVGAENVYKAQNRDDILNKILELITEEIG 412 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 146 bits (367), Expect = 8e-33, Method: Composition-based stats. Identities = 59/490 (12%), Positives = 135/490 (27%), Gaps = 89/490 (18%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + + G+ +I AL++ ++ G VD R + L+ AA A + A Sbjct: 3 FAEAIARFHRDRRGNIAVIFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVS 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + + + ++ T + Sbjct: 63 RTSPAYIAA------------GSMSTDGAITSGADDALRIFNGNLANLTGYTLDSVAATV 110 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 S ++S+ ++F++++ + S + + A Y + ++D S Sbjct: 111 SKSGEAVTSKVTFSAQISTMFMKAVAVSSMTVGGVSTATASMPKY-----IDFYLLLDNS 165 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ + Sbjct: 166 PSMGVGATPTDVAAMVAATANKSSD-------------------------------DHCA 194 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 ++ ++ + + + + ++R A ++ + + RM F Sbjct: 195 FACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTAAATATYTNQFRMAIYDFGASA 254 Query: 307 ISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--EVHRMK 362 + + L AID + N+ T Y I+ + + Sbjct: 255 QTAGLRNLFALSASLSSAKTAAGAIDLMTVKGQNDNNDQDTQYTAILPAIDKLIAAPGTG 314 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGI--------------AICNKAKSQGIRIMTIA---- 404 KY+ ++DG + N + A+C K +G+R+ + Sbjct: 315 AAGSPLKYLFFVSDGVADEYNPACLKPKTGNRCQSPINPALCKTLKDRGVRVAVLYTTYL 374 Query: 405 --------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDRIGN 447 F+ + CASP +FE + T + + Sbjct: 375 DLPSNDWYKKWIAPFNEGPYGPSPNSEIAKNMEACASPGFYFEVSPTQGIAEAMNALFKR 434 Query: 448 EIFERVIRIT 457 + + RIT Sbjct: 435 AVAD--ARIT 442 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 146 bits (367), Expect = 8e-33, Method: Composition-based stats. Identities = 70/472 (14%), Positives = 121/472 (25%), Gaps = 53/472 (11%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP---------------LIQ 70 AL + ML G VDV R S L++A A + + Sbjct: 1 MALAILPMLLAVGGTVDVGRQSSLATDLQEAIDIAALHIAKAPSDAIPGEEDVLQLIKSN 60 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + S A ++ + + + + + Sbjct: 61 ITTKDSRIALKKLDVTEKDVSLHATAEITPFFLGLAGIKNLTAQRATKTAREARGEIEVA 120 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 +VL + + + S R +K K + V + Sbjct: 121 LVLDTTWSMSEKDSSGKSRLDSLKGAA--AKLVDTIFTEDGKTRVAVVPYADYVNVGTQH 178 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 Q + P R + + + + ++S + Sbjct: 179 RNQSWLDVPPSYSTTPSERRCETRTTRTQCTSYAPTYQCTRTVDGVSESTTCGGGCTSSE 238 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + D V + D + Sbjct: 239 TVQVAPYEYCTGGGSSRT------YDWYGCVASRTVGDYRLTDARPDIRYPGFLGTSREC 292 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGS-----TAINDAMQTAYDTIISSNEDEVHRMK-NN 364 VKT + + G T I + + + E NN Sbjct: 293 PGPLLSLSTREADVKTSISNLSYGGGGYRPSTFIPAGLIWGLNVLSPPAPFEEQAYDPNN 352 Query: 365 LEAKKYIVLLTDGENT--------------------QDNEEGIAICNKAKSQGIRIMTIA 404 +K +VL+TDG NT Q + + I+ICN K GI I T+ Sbjct: 353 KLPRKALVLMTDGANTMVFNSSDGRHRNARSGTEVAQSDRDTISICNNIKRSGIEIFTVG 412 Query: 405 FSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIR 455 F VN + L CA+ +F+A S EL+ F R+I+ Sbjct: 413 FMVNSSSAL---DLLKECATDGEHYFDATSPEELHSAFGRIADGLTQIRLIQ 461 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 146 bits (367), Expect = 9e-33, Method: Composition-based stats. Identities = 63/483 (13%), Positives = 138/483 (28%), Gaps = 87/483 (18%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + + G+ +I AL++ ++ G VD R + L+ AA A + A Sbjct: 3 FSRSIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVS 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + S + +R F NL + T + Sbjct: 63 RTS---PAYVAAGAMSGDGVISSGADDALRIFNGNL---------NGLTGYTLASSSATV 110 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + V++S+ ++F++ +G+ + + + A Y + ++D S Sbjct: 111 TKASDVVTSQVTFSAQISTMFMKVVGMSAMTVGGTSTATASMPKY-----IDFYLLLDNS 165 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ + Sbjct: 166 PSMGVGATPTDVSAMIAATANKSSD-------------------------------DHCA 194 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 ++ ++ + + + + ++R A ++ + ++ RM F Sbjct: 195 FACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTASATATYSNQFRMAIYDFGASA 254 Query: 307 ISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE--VHRMK 362 + + L AID + N+ T+Y I + +E Sbjct: 255 QTAGLRNLFSLSASLSSAKTAASAIDLMTVKGQNDNNDQDTSYTAIFPAINNEISSPGSG 314 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGI--------------AICNKAKSQGIRIMTIA---- 404 + +KY+ ++DG + N + A+C K +GI+I + Sbjct: 315 VSGSPQKYLFFVSDGVADEYNPSCLKPKTGNRCQSPINPALCKTLKDRGIKIAVLYTTYL 374 Query: 405 --------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDRIGN 447 F+ + CASP +FE + T + + Sbjct: 375 NLPSNDWYKKWIAPFNAGPYGPSPNSEIAQNMEACASPGFYFEVSPTQGIAEAMNALFKK 434 Query: 448 EIF 450 + Sbjct: 435 AVA 437 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 146 bits (367), Expect = 9e-33, Method: Composition-based stats. Identities = 64/490 (13%), Positives = 136/490 (27%), Gaps = 94/490 (19%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + G+ II AL V++ G +D R + L+ A + + A Sbjct: 2 SVLERIFSRWSDDRRGNVAIIMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSGALAA 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPK-QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + E RA ++ L + ++ N Sbjct: 62 ASLTQDRNPEDVVRAYVEAALADHPQLLASLQLDVVADISLNSRVVN------------- 108 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + L +GI + ++ +EA + V I V+ Sbjct: 109 -------------ATASVAMPTTMLGLVGINTLTLEHASEA------IEQVRDVEISLVL 149 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D S SM + ++ F + + PY Sbjct: 150 DVSGSMGGSKINALQDAAIEFVEIVLAA--------DAAERTSISVIPYNGGVR------ 195 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P ++ + + V + ++ ++ F Sbjct: 196 -TPREVNQDIVSGNNNHRRQSGCVDMGTDYPVEMTLPYREMEFTEYYGSEQTGNSSSAFC 254 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE------ 357 R + ++ G+T ++ A + + Sbjct: 255 PR----SNMESEFLSQNEGRMRGLINSLRAEGNTGLDVATMWGARALDPAWRGNLGGSFS 310 Query: 358 -VHRMKNNLEAKKYIVLLTDGENT------------------------------QDNEEG 386 ++ + K +V++TDGE T Q E Sbjct: 311 DRPASYDDRDTIKILVVMTDGEATAQIRSEEYTYYDWWGRERTGTRSYELYSARQARENM 370 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 C+ A+ G++I TIAF ++ + R + NCA+ P ++++ + ++ + F Sbjct: 371 AEACDIAEGNGVQIYTIAFQLSG---QTNRDLMRNCANKPQNYYQVEN-LDIAEAFSSIA 426 Query: 446 GNEIFERVIR 455 + R+ R Sbjct: 427 ADINRLRLTR 436 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 63/483 (13%), Positives = 138/483 (28%), Gaps = 87/483 (18%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + + G+ +I AL++ ++ G VD R + L+ AA A + A Sbjct: 3 FSRSIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVS 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + S + +R F NL + T + Sbjct: 63 RTS---PAYVAAGAMSGDGVISSGADDALRIFNGNL---------NGLTGYTLASSSATV 110 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + V++S+ ++F++ +G+ + + + A Y + ++D S Sbjct: 111 TKASDVVTSQVTFSAQISTMFMKVVGMSAMAVGGTSTATASMPKY-----IDFYLLLDNS 165 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ + Sbjct: 166 PSMGVGATPTDVSAMIAATANKSSD-------------------------------DHCA 194 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 ++ ++ + + + + ++R A ++ + ++ RM F Sbjct: 195 FACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTATATATYSNQFRMAIYDFGASA 254 Query: 307 ISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE--VHRMK 362 + + L AID + N+ T+Y I + +E Sbjct: 255 QTAGLRNLFSLSASLSSAKTAASAIDLMTVKGQNDNNDQDTSYTAIFPAINNEISSPGSG 314 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGI--------------AICNKAKSQGIRIMTIA---- 404 + +KY+ ++DG + N + A+C K +GI+I + Sbjct: 315 VSGSPQKYLFFVSDGVADEYNPSCLKPKTGNRCQSPINPALCKTLKDRGIKIAVLYTTYL 374 Query: 405 --------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDRIGN 447 F+ + CASP +FE + T + + Sbjct: 375 NLPSNDWYKKWIAPFNAGPYGPSPNSEIAQNMEACASPGFYFEVSPTQGIAEAMNALFKK 434 Query: 448 EIF 450 + Sbjct: 435 AVA 437 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 73/549 (13%), Positives = 152/549 (27%), Gaps = 112/549 (20%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP-- 67 ++ +K +G+F ++ +L ++ GG+ VD V S + L+ A +A + + Sbjct: 3 HTSSFLKDTSGNFALVFGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAALLIARAGD 62 Query: 68 ----------------------------LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 ++ V + + F + + E Sbjct: 63 MSETQAMKLAKTTITTNYGINVAKVAVSMVDGDATVKASMDQALVFGGFMGRKNAAVSAE 122 Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD----LLLNPLSLFLRSMGIKS 155 +T E+ ++ T + + ++ Q + D L LN L + Sbjct: 123 ATATYAYTKYEIALVLDTTGSMLGGKLTSLQNAVIGLVDGMEALGLNKEQLKFAVVPYAG 182 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG-------QPA 208 ++ T++ + + + D + S+ F Sbjct: 183 FVNVGPEYGPTINGAGKVKKPAAAWIDQDAKAPIPQSDLPSDFSRFAMFNHLKVEWPGCV 242 Query: 209 DRTVKSYSSQNGKVGIRDEKLSP-------YMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + V + + + P + + + + D + + Sbjct: 243 ETRVPADKILHDVKDTVPDITDPKSLFTPFFAIDEPDNKWGYPNSYLPDGGKPVKGNKAT 302 Query: 262 ---------SSLRHVIKKKHLVRDALASVIRSIKKIDN-------VNDTVRMGATFFNDR 305 + KK D ++ KK+ ++ F Sbjct: 303 EAEKQDQLARYGKTGEYKKPKNTDDAIALTGKWKKVKVDNSPSNFYSNKKDPKGPGFGCE 362 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + K ++T GST + + + + + K++ Sbjct: 363 MEPLVPLTTDFSK-----IRTTVKALEANGSTNMLEGVMWGWRVLSDREPFAQGAPKSDA 417 Query: 366 EAKKYIVLLTDGENTQDNEE------------------------------------GIAI 389 +K ++ LTDG+N+ N A Sbjct: 418 SVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTNNALDKKTKAA 477 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNE 448 C AK G+ I TI L CA S +F+A S +L IF D I Sbjct: 478 CENAKEDGVTIYTIRLEEADVGTG---KMLEECATSSAHYFDAPSRQQLTPIF-DAIKK- 532 Query: 449 IFERVIRIT 457 +R+T Sbjct: 533 -GVVKLRLT 540 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 60/458 (13%), Positives = 132/458 (28%), Gaps = 67/458 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ +GH I+ A+++P + G+ + D R + ++ A++ A++ + + Sbjct: 4 LRKQSGHAAILFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAAH--NDDNK 61 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR---EVRDIVRDTAVEMNPRKSAYQV 131 S + + ++ +YL + N I A +Q Sbjct: 62 NSQGSGSGSAVNRKIATDYLEAYLHDVDSVNNLKIHKYNCDQIPECVAGLARGEPRFFQY 121 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + +S F I + A+ +R Y E V I +V D+S SM Sbjct: 122 EVEATS----RHVSWFPGDSSIPGFGKTFDAKGAATARKYQSE-AVDILFVADYSGSMAG 176 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + N + + + Sbjct: 177 GWNGGSNRKYIDLRNIIKVVTDELQKFNDLNNTDNNTVGMTGFNYYTKTKPTN------- 229 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 S + + + + +++ I N + + + + D Sbjct: 230 ----------RSNSCFMTQLVYNNNYNINYTKTVNNIFNEKNN-KYCVSHSDSSRFRDID 278 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + T TA + + K ++ + Sbjct: 279 LTDNYSSFNTT-----VNGFYPNHGTASYQGIMRGAQML-----------KKGTNPRRLL 322 Query: 372 VLLTDGENTQ-------DNEEGIAICNKAKSQ---GI---------RIMTIAFSVNKTQQ 412 ++L+DG+++ +C K K + GI R+ + F N Sbjct: 323 IVLSDGDDSGTSQKNIHKQLVNAGMCTKIKQELSTGISSSGQSIKARLAVVGFDYNVNNN 382 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L +CA + F+A +T ++ + I EI Sbjct: 383 ----TALRDCAGAENVFKAQNTDDILNKILELITEEIG 416 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats. Identities = 56/500 (11%), Positives = 126/500 (25%), Gaps = 105/500 (21%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + G+ I+ A + +L G +D R + ++ AA A + Sbjct: 1 MAQRLSGLFDSFHHDRRGNIAILFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAASV 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + + + N + + T ++ Sbjct: 61 GSVSKASPAFLAA------------GSMTTDGPIAVGSTDATNIFNGNMASQSGYTLSKL 108 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + + L+S + + FL +G + I + + + + + Sbjct: 109 DAAVTKSGATLTSTVTFSASVATTFLTIIGKTALAIGGTSVSTSSMP-----VYIDFYLL 163 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM ++ + Sbjct: 164 LDNSPSMGVGATPTDVATMVDNTSDKCA-------------------------------- 191 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 E ++ + + V + ++RDA ++ + + RM F Sbjct: 192 ----FACHDVNDEHNYYELAKTLGVKTRIDVLRDATQQLMDTAAATATYPNQFRMAIYDF 247 Query: 303 NDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--EV 358 S L AID + ND T+Y ++ + + Sbjct: 248 GASAQSAALRRLFALSSSLSSAKTAAGAIDLMTVKGQNDNDDRDTSYSKLLPAIDKQITA 307 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGI---------------------AICNKAKSQG 397 + +KY++ ++DG + N A+C +G Sbjct: 308 AGAGTSDAPQKYLLFVSDGVADETNAGCAKTMKNAFWGNKSPRCQSPIDPALCKAMTDRG 367 Query: 398 IRIMTIA------------------------FSVNKTQQEKARYF---LSNCASPNSFFE 430 +++ + F+V + CASP +FE Sbjct: 368 VKVAVLYTTYLALPLKQANGDPSWYASWIAPFNVGPYGPSPNSEIANNMKACASPGFYFE 427 Query: 431 ANSTHELNKIFRDRIGNEIF 450 + T + + Sbjct: 428 VSPTDGIADAMNAIFRKAVA 447 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats. Identities = 73/481 (15%), Positives = 138/481 (28%), Gaps = 97/481 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALL+ + G GM VD L AA A + + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ S ++ F + + TD +V + T L Sbjct: 65 AVAAAMTMSGNGTISLGKDDARSIFMSQISGELTDVQVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R +G S I T + Y + ++D + SM Sbjct: 116 NSQVSFSATVPTTFMRVLGRDSITISG-----TATAEYQTASFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 ++ + + Sbjct: 171 TATDVSTMEKNTSDTCA------------------------------------FACHETQ 194 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-----IS 308 + ++ + + V + +VR A + + K ++ RMG F + + Sbjct: 195 NNNNYYNLAKKLGVSMRIDVVRQATKELTVTAKSTRVSSNQFRMGVYTFGTKAEDAKLTT 254 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGS-----TAINDAMQTAYDTIISSNEDEVHRMKN 363 + + K+ + G T+ + A+ I + + Sbjct: 255 ISDPTDDLDKVRSYTDAVDLMTIPFQGYNNDQQTSFDSALTQMKTIITTPGDGS-----T 309 Query: 364 NLEAKKYIVLLTDGENTQDNEE---------------GIAICNKAKSQGIRI---MTIAF 405 +K + ++DG + + + C K + IRI T Sbjct: 310 ATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCQPLKDKSIRIAVLYTTYL 369 Query: 406 SVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + K Q + + CASP +FE T + + I R RI Sbjct: 370 PLPKNSWYNTWIKPFQGEIPTKMQACASPGLYFEVTPTEGIADAMKALFLKVI--RAPRI 427 Query: 457 T 457 T Sbjct: 428 T 428 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 143 bits (361), Expect = 4e-32, Method: Composition-based stats. Identities = 81/476 (17%), Positives = 139/476 (29%), Gaps = 89/476 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM ++G GM VD L AA A + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + + + V + T L Sbjct: 65 AVAAAMAMNGNGTISLGKTDARDIFMSQVSGELAEVHVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R G S I T + Y + ++D + SM Sbjct: 116 NSQVSFTATVPTTFMRIFGRDSITISG-----TATAEYQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S+ L A + S N Sbjct: 171 TPSDVSKLEAKVGCAFACHQMDKSTNNYTI------------------------------ 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--PS 311 + V + +VR A ++ + K +D RMG F + + Sbjct: 201 --------AKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE---DEVHRMKNNLEAK 368 S L + T A+D + N T +D+ ++ D +N A+ Sbjct: 253 ISGLTSDLTKVKNYTDAVDLMTIPYQNYNSDQITNFDSAMTQMNTIIDLAGDGTSNTSAE 312 Query: 369 KYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MTIAFSVNKT 410 K + ++DG + C K +G++I T + Sbjct: 313 KILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSN 372 Query: 411 Q---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 Q + + CASP +FE + T + + I R RIT Sbjct: 373 SWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDGITDAMKALFLKVI--RAPRIT 426 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 143 bits (360), Expect = 6e-32, Method: Composition-based stats. Identities = 75/478 (15%), Positives = 133/478 (27%), Gaps = 51/478 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L G I+ AL + LG+ G+ +D + L AA A++ + Sbjct: 18 LASDRGGSINIMFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGV------VT 71 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A N+ RN T + V + +++ + Sbjct: 72 AKEYIASNAKQGDATAAGLTAGRNQATKAFAINTGKVPFATVSVSRLDVTRSGQTLTATV 131 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + F + +G+ S A SY + ++D S SM Sbjct: 132 IYTATIQ----NTFGKILGLSSTTFTNTITASADLASY-----LDFYLMVDVSGSMGLPT 182 Query: 194 RDSEGQPL---------NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 ++ + L NC K +++ GK+ +R + ++ + K + Sbjct: 183 AAADAEKLASITKEDQGNCQFACHFPGRKGWNNAAGKIQLRSDAVNNAVCELLKRAATPV 242 Query: 245 YPGPLDPSLSEEHF--VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P S L +R A N+ DT Sbjct: 243 VPNQYRIGFYPFINRLATLSPLSDTTTSMTALRTAAQCDKTWPLAFTNLLDTGSTQLFTG 302 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N+ S K + + T + +T + D + +S + Sbjct: 303 NNPTTGTGSGGTHFEKALPQMKATIQPYGDGSSTTNSKPFVFLITDGMQNSQSYSTNND- 361 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAI----CNKAKSQG--IRIMTIAFSVNKTQQE--- 413 + G D + I C + K G I I+ I ++ K Sbjct: 362 ARTFPGSPSLFKGYGNAGWDGSQPAQIDPSKCKELKDAGAIISILYIPYNQVKNYTNDSY 421 Query: 414 -------------KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CAS F+ ANS ++ RV RIT+ Sbjct: 422 IVWENNRVNGFSPTLADPLRKCASQGFFYTANSADDITASLGAMFDQ--ALRVARITQ 477 >gi|218708116|ref|YP_002415737.1| hypothetical protein VS_0028 [Vibrio splendidus LGP32] gi|218321135|emb|CAV17085.1| Conserved hypothetical protein, putative exported, TadG [Vibrio splendidus LGP32] Length = 435 Score = 143 bits (359), Expect = 8e-32, Method: Composition-based stats. Identities = 64/475 (13%), Positives = 142/475 (29%), Gaps = 87/475 (18%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ +GH ++ A+++P + GV + D R + L++A++ A++ Sbjct: 5 MRKQSGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAV---------- 54 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S++ + ++ I+ YL + ++ L ++ A ++ ++ Sbjct: 55 -SAKDEQDHQLAERYIQHYLY-DMDSILDIEVKKLGCDEMPECIAATERGEARYFEYRVA 112 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 S F + I + + +R Y + + I +++DFS SM D Sbjct: 113 G----QTLHKSWFPGNDVISGFGDSFNVTGSSKARRYQSQ-PIDITFIVDFSESMNDSWS 167 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 LN + + N P+ V+ + D + Sbjct: 168 GGRHSKLNDLKDIIEDVADELGAYNDLY-----PEHPHRVALTGFNRRTINKDKNDNLVV 222 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + V S + +A + V + +D D F+ Sbjct: 223 RDQRVVSREGEYDKDDTVNFNKTIAQQFIVKGEASRVPNGD-------DDARFYDLYFTT 275 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + K G TA + A + S + K+ I++L Sbjct: 276 DFSSFTKKVKK-----FKAGGGTASLQGIIRAGQIVTSM----------SKNQKQLIIIL 320 Query: 375 TDGENT------QDNEEGIAICNKAKS--------------------------------- 395 +DGE+ + +C+ + Sbjct: 321 SDGEDWNHYAGQTNKLVSKGMCSNILNMVNGGKVSADNTHDDVEVIGGVSQGMMTPDGER 380 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 R+ I F + L NC ++ ++A + ++ I E+ Sbjct: 381 MNARMAVIGFDYELNKNVG----LRNCVGRDNVYKAENKEDILNKILGLITEEVG 431 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 142 bits (358), Expect = 8e-32, Method: Composition-based stats. Identities = 58/486 (11%), Positives = 128/486 (26%), Gaps = 95/486 (19%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + ++ G+ + A++ ++ G VD R + L+ A A + A Sbjct: 4 LSLLLSRFRRNAGGNVAVTFAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASVGAVS 63 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + R N T + + + + Sbjct: 64 RTSPAFIAAGAMTTD----GVIAAGNDDARKIFNGNMSGTTGYTLDSLTPEVKKTGS--- 116 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 VL++ +LF+ +G K+ +Q + A+ Y + ++D S Sbjct: 117 -----VLTATVSFSATVPTLFMSIVGYKTMSLQGSSTAKASMPKY-----IDFYLLLDNS 166 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ + Sbjct: 167 PSMGVAATPADVTKMVSATSDKCA------------------------------------ 190 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 ++ + + V + ++R A ++ + ++ ++ RM F Sbjct: 191 FACHDYNDANNYYNLAKTLGVTTRIDVLRSATQQLMDTAQQTQTYSNQFRMAIYDFGASS 250 Query: 307 ISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTA--INDAMQTAYDTIISS--NEDEVHR 360 + + L ID + T Y T + + NE Sbjct: 251 KTIGLRALFALSSSLTSAKSAAGNIDLMGVYGNNDSFTADKDTPYTTALPAINNEIATPG 310 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGI---------------AICNKAKSQGIRIMTIA- 404 + KY+ ++DG + N + A+C K++GI+I + Sbjct: 311 DGTSGSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTALKNRGIKIAVLYT 370 Query: 405 -----------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDR 444 F+ + CAS +FE + T + Sbjct: 371 TYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACASDGFYFEVSPTQGIADAMNAL 430 Query: 445 IGNEIF 450 + Sbjct: 431 FKKAVA 436 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 66/550 (12%), Positives = 147/550 (26%), Gaps = 103/550 (18%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + + + G ++ L ++G G+ VD++R+ AL+ A++ Sbjct: 17 RNRLKDRANAFAREEDGLMTVMALFLFLALVGAAGIGVDLMRYEQKRAALQYTMDRAVLA 76 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV-----------R 112 A+ Q E R+ + + ++ K + T Sbjct: 77 AADLDQQVSPETVVRSYLEKAGLLEYLSSVTVQEGLGYRKVSATATAELPTHFMKLSGYD 136 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS--------------LFLRSMGIKSWLI 158 + A ++ L +N S L + + I Sbjct: 137 SLTIPAASTAEESIGNVEISLVLDVSGSMNSNSRLYNLKNAAKEFVDHMLSATEPGTVSI 196 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR------------------------ 194 A V+ ++ ++S + Sbjct: 197 SIVPYATQVNAGADILSYYNVSTEHNYSHCVNFIDDEFSQPGLSRVTPLERTMHFDPFSY 256 Query: 195 ---------------------DSEGQPLNCFGQPADRTVKSYSSQNGKVGI------RDE 227 ++ LN + + K G+ Sbjct: 257 TKDPISTPVCPVRASTEILPFSNDQTVLNNYIDGLTGRGNTSIDIGTKWGVVMLDPGTQS 316 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI----------KKKHLVRDA 277 +S + + P D + + S + + +A Sbjct: 317 VISGLISDNKVPASFQGRPSAYDSGDVLKVLIVMSDGENTNQYMLNPSLRDGDSPVWYNA 376 Query: 278 LASVIR------SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL---IRTIVKTFA 328 VI + +D + + +D + G + + + A Sbjct: 377 AEDVISGSPDNNTTNAFSIYHDNGNNSYYWPDQNRWADHPYGNGQSEACGYNSSGYYSCA 436 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + + + A A ++ + N +N A+ Y +T E + ++ Sbjct: 437 MRDEPGEAVRLTYAELFAKVSL-AYNAYYNFEFNSNAWAEWYTAAMTHKEASAKDQRTDH 495 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 +C+ AK +GI + T+ F + R L CAS +S + E++ F Sbjct: 496 VCDAAKDEGIIVYTVGFEAPYSG----RRVLKRCASSDSHYYDADGLEISDAFTSIAS-- 549 Query: 449 IFERVIRITK 458 R +R+T+ Sbjct: 550 -SIRKLRLTQ 558 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 82/476 (17%), Positives = 140/476 (29%), Gaps = 89/476 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I + +G+F I+TALLM ++G GM VDV L AA A + + Sbjct: 8 FISNRSGNFGIMTALLMVPLMGAAGMAVDVAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + TD + + T L Sbjct: 65 AVAAAMTMNGNGTVSLGKTDARNIFMSQTSGELTDIHIDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R G S +I T + Y + ++D + SM Sbjct: 116 NSQVSFTATVPTTFMRIFGRDSIIISG-----TATAEYQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S+ L A + S N Sbjct: 171 TASDVSKLQAKTGCAFACHQMDQSTNNYTI------------------------------ 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--PS 311 + V + +VR A ++ + K +D RMG F + + Sbjct: 201 --------AKSLGVTMRIDVVRQATQALTDTAKAERVSSDQFRMGVYTFGTKAEDAKLTT 252 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE---DEVHRMKNNLEAK 368 S L + T A+D + N T++D+ ++ D +N+ + Sbjct: 253 ISGLTSDLTKVKNYTNAVDLMTIPYQNYNSDQLTSFDSAMTQINTIIDPAGDGTSNISPE 312 Query: 369 KYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MTIAFSVNKT 410 K + + DG C K +G++I T + Sbjct: 313 KILFFVADGVGDSYKPSTCTKKTTGGRCQEPIDTTFCKPLKDRGVKIAVLYTTYLPLPSN 372 Query: 411 Q---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 Q + + CASP +FE T + + I R RIT Sbjct: 373 SWYNTWIKPFQNEIPTKMQACASPGLYFEVTPTDGIADAMKALFLKVI--RAPRIT 426 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 142 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 75/481 (15%), Positives = 140/481 (29%), Gaps = 97/481 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM +LG GM VD L AA A + + Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAA---DAAAVGSIAEKSG 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + + D V + T L Sbjct: 65 AVAAAMTMTGNGTISLGKTDARSIFLSQVSGELADVNVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F+R +G S I T + Y + ++D + SM Sbjct: 116 NSQVSFTAVVPTTFMRVLGKDSITISG-----TATAEYLTASFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + + + Sbjct: 171 TAKDVATMEKNTSDSCA------------------------------------FACHETE 194 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND-----RVIS 308 ++ ++ + + V + +VR A + + K + RMG F + + Sbjct: 195 NKNNYYNLAKTLGVSMRIDVVRQATKELTLTAKSTRVSTNQFRMGVYTFGTKAEDANLTT 254 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGS-----TAINDAMQTAYDTIISSNEDEVHRMKN 363 + + K+ + + G T+ ++A+ D I + + Sbjct: 255 ISDPTDDLDKVRTYTDAVDLMTIPKQGYNNDQQTSFDNALTQMKDIITTPGDGS-----T 309 Query: 364 NLEAKKYIVLLTDGENTQDNEE---------------GIAICNKAKSQGIRI---MTIAF 405 +K + ++DG + + + C K +GIRI T Sbjct: 310 ATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCKPLKDKGIRIAVLYTTYL 369 Query: 406 SVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + K Q + + CASP +FE T + + I R RI Sbjct: 370 PLPKNSWYNTWISPFQSQIPTKMQECASPGLYFEVTPTEGIADAMKALFLKAI--RAPRI 427 Query: 457 T 457 T Sbjct: 428 T 428 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 142 bits (356), Expect = 2e-31, Method: Composition-based stats. Identities = 57/483 (11%), Positives = 141/483 (29%), Gaps = 92/483 (19%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ +++ + ++ +G+ II AL + +L G +D + L + A Sbjct: 1 MLSGHRYLGLLSRFRRNESGNIAIIFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ + + ++ ++ + + T+ Sbjct: 61 MLA-ATGYTAMRGTAADAKTSATNMYNGQMSSH----------------------KLTSN 97 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +N + + + + F+ G + + + A + Sbjct: 98 SLNITVTDSVTARTVTGTASVVVNTAFMYMFGFPTMTVTASSSASASFP-----TYMDFY 152 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 ++D S S ++ L Q A +++ + + L Sbjct: 153 VLVDNSPSQGLGATTADMTTL----QNATSDKCAFACHDTYTSSTKKTLQTNSY------ 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + V + +VR A S+ + V++ RM Sbjct: 203 ------------------YQIAKNKGVTMRIDVVRSATQSLTDTATSSQVVSNQYRMAVY 244 Query: 301 FFNDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED-- 356 S + + + A+D + + N+ M T +D +S Sbjct: 245 SLGSDCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGAMSGMNGVI 304 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQD----------------NEEGIAICNKAKSQGIRI 400 ++ +K++ ++DG C+ K++GI+I Sbjct: 305 PAQGDGSSTSPQKWLFFVSDGVADYSYPTTCSKTVLSGGRCQEPLNTTTCDTLKARGIKI 364 Query: 401 ---MTIAFSVNKTQ---------QEKARYFLSNCASPNSFFEANS----THELNKIFRDR 444 T ++ ++ + +CASP ++E +S L +F+ Sbjct: 365 AVLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCASPGYYYEVDSSGSIGAALTALFQQA 424 Query: 445 IGN 447 I + Sbjct: 425 IAS 427 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 142 bits (356), Expect = 2e-31, Method: Composition-based stats. Identities = 56/473 (11%), Positives = 123/473 (26%), Gaps = 34/473 (7%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +L+ G+ II A+ + ++ G+ VDV R + + A A + Sbjct: 2 IRNSLSRLMTCRKGNMAIILAIGLLPIITTIGLGVDVARAYAVKSRMSAALDAAALAVGS 61 Query: 67 PLIQS-LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 ++ +P + + DT Sbjct: 62 SSGTDAQLSAVAQKFFDANYPTGALGAHPS---VAVKVTGDVISASAVAEVDTVFMKVVG 118 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE----------- 174 + V S + + L L + S +A + + + Sbjct: 119 LNDVPVHADSTVNRQIAGLELAMVLDNTGSMTTNNNIQAVRDAANQLTDILFGTATVHPY 178 Query: 175 --HGVSIQWVIDFSRSMLDY--QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 + S+ P + G ++ ++ G + Sbjct: 179 LKIALVPYSAAVNVGSVAPSLITTGDTYAPNDLLGWKGCVVERAGANGVGDTSAATAPWT 238 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 Y Y + + + + A++ +I ++ Sbjct: 239 RYKWLPAVDNNYDATKSSTVLANPSNGNASTGPNLGCPTAITPLTNVKATLTPAINAMEA 298 Query: 291 VNDTVRMGATFFNDRVISDPS-------FSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + WG K + ++ D T+ Sbjct: 299 WSRGGTLSDVGMAWGLRVLSPEPPFTEGLPWGTPKWSKAVILMTDGDNQFYKLTSTTGG- 357 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + S+ + E + T N ++CN K++ I + T+ Sbjct: 358 ----NKVNSAVNSDYGAYGRLDELGRIGTTNATTAKTTINTRLTSVCNAMKAKNIIVYTV 413 Query: 404 AFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIR 455 F+ Q + CA+ S +F++ S EL FR + RV + Sbjct: 414 TFTSGINQAT--KDIYKACATDASKYFDSPSQDELKSAFRAIATSLSNLRVSQ 464 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 141 bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 61/518 (11%), Positives = 132/518 (25%), Gaps = 76/518 (14%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + +G +I AL + M + GM +D + + + A + + Sbjct: 8 LTARLARFRHTESGSVLVIFALALVPMAFLAGMTIDYAQNTNLRQQAQVAVDATALALAK 67 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L + AK + T++ Sbjct: 68 -LPLDTTDKDLAAKAEAQVLTALKGLPIDALTVTMRHNGDLIEVAAKGATPTSLTRLAGF 126 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH----------- 175 + + +S+ + + L + L S + +R Sbjct: 127 MSMPLSVSAISNRSMTNLEIALVLDNTGSMKGTKLTNLKAAARDLVTSLFQQADPAKPNA 186 Query: 176 ------------------GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 S I+ + +++G P N F AD Sbjct: 187 LKIGVVPFSMTVNVGSGFAGSDWLDINAKSPIHQQIFNAQGVPANRFSLFADMGKPWAGC 246 Query: 218 QNGKVGIRDEK-LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + D + +P + + P D + + R Sbjct: 247 VESRPAPYDVQDTAPSQATPSTLFVPFFAPDESDNDSRAVNDYMADLPSGGSAGGASNRQ 306 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFF---NDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + K V+ T R T + + + +T + Sbjct: 307 LQGMTAKYDKNAFKVSTTARQDGTNYLFGPNAGCEIQPLTRLT--TSQTQLTNAIAAMTV 364 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------- 382 +G T I + + + + + ++ KK+IVL+TDG+N Sbjct: 365 IGDTNIPIGLAWGWHLLSPNGPFKDGVAYGEIKTKKFIVLMTDGQNQSAVSSSDNRSYYS 424 Query: 383 ------------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +C+ + I++ + V + Sbjct: 425 GLGFIWQNRIGTTSNDNAVRTKAIDTRLTLLCDNIRKARIQVFAVRVEV----NDGDSAV 480 Query: 419 LSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIR 455 L CA+ + FF+ ++ L +FR R+ + Sbjct: 481 LKACATSPNMFFDVKNSSGLPAVFRAIADQISELRISQ 518 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 141 bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 74/464 (15%), Positives = 146/464 (31%), Gaps = 73/464 (15%) Query: 4 DTKFIFYSKKL---IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 +F + ++L + +G+ +I AL + V+ G G +D R Y L+ AA +A Sbjct: 8 THRFAGFRRRLGESCRDQSGNVIMIFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSA 67 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ A + E A +F +NF V D+ Sbjct: 68 VLRAMALKSATDESRGVAADKAFA-------------------ENFGHPGVYDLNGALKR 108 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 E+N + + + S F G S+ + ++A+T I Sbjct: 109 EVNENIISQTYTVHATVS------SYFGAFFGKDSYPVTVVSQAKTSLD------VFEIA 156 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +V+D + SM + + + D + + + P+ S Sbjct: 157 FVLDTTGSMAEANKMPNLKSS------VDSAMAGLLQNGKNLSGSKIAVVPFNTQVRLSD 210 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + S + V R + + A T+ T Sbjct: 211 ATVTTMSSQGLSSGWGNCVH---DRDLATSHDVSASAAQKGKAQ---------TLYPLET 258 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + S + F G T + +Q + + + Sbjct: 259 CDEASLKPVQGLSDNISSAR-----NFIKTLQPGGYTNVTMGVQWGMEVLSPNQPFSDAT 313 Query: 361 MKNNLEAKKYIVLLTDGENTQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + +A+K+++++TDG+NT+ ++ C AK++GI + T+ Sbjct: 314 EFGSTKARKFMIVVTDGDNTKSFTSWSASVIDKRTALACENAKAKGITVYTVKIIQG--- 370 Query: 412 QEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 L CAS P F++ S ++LN + R+ Sbjct: 371 ---NSNMLRKCASAPEYFYDLTSANQLNAAMSGIFKSINKTRLS 411 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats. Identities = 57/477 (11%), Positives = 132/477 (27%), Gaps = 85/477 (17%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + + G+ ++ A+L+ ++ G +D R ++ A + A+ Sbjct: 21 RSLLRHFTQDVRGNVAMMFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAAASL 80 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 E R + + + ++ + Sbjct: 81 SQGEDPETIVRNYITAALSEHNGVLERLNVQVSSDLAINSR------------------- 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 D +++ +L L +G + + +EA R+ + I V+D S Sbjct: 122 ------EVTADAVISVPTLMLGIIGYDALTLNRVSEANERVRN------LEISLVLDISG 169 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + + F G+ + PY Sbjct: 170 SMSGSKITALRDAAEEFVGVMM--------DPDLEGLTSLSVIPYNGGVRLPQTVTNDLV 221 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P + ++ ++ V D ++ R + + Sbjct: 222 P--GTPNDSGCLELGVSDPVTMDLAA------------NGYDWLDWQDRDQRGWRSSAFC 267 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED-------EVHR 360 + + + + +++ D + G+T ++ A + + Sbjct: 268 PEENEATVFLEQTPSVLVNLIRDLDAGGNTGLDVATAWGARALDPAWRGRLGGDFASRPA 327 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAI-----------CNKAKSQ-----------GI 398 ++ K +V++TDG T ++A+ G+ Sbjct: 328 AYDDPSTMKVLVVMTDGAATAQIRRAQNWYGDWYSYEIYSASQARDNMADACDAAEAEGV 387 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I TIAF V+ R + +CAS + A +++ F + R+ R Sbjct: 388 HIYTIAFQVSG---STNRNLMRDCASRPENYYAVENLDISAAFNSIAADLNNLRLAR 441 >gi|86147193|ref|ZP_01065509.1| TadG-like protein [Vibrio sp. MED222] gi|85835077|gb|EAQ53219.1| TadG-like protein [Vibrio sp. MED222] Length = 435 Score = 140 bits (352), Expect = 4e-31, Method: Composition-based stats. Identities = 65/475 (13%), Positives = 142/475 (29%), Gaps = 87/475 (18%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ GH ++ A+++P + GV + D R + L++A++ A++ Sbjct: 5 MRKQFGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAV---------- 54 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S++ + ++ I+ YL + ++ L +I A ++ ++ Sbjct: 55 -SAKDEQDHQLAERYIQHYLY-DMDSILDIEVKKLGCDEIPECIAATERGEARYFEYRVA 112 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 S F + I + + +R Y + + I +++DFS SM D Sbjct: 113 G----QTLHKSWFPGNDVISGFGDSFNVTGSSKARRYQSQ-PIDITFIVDFSESMNDSWS 167 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 LN + + N P+ V+ + D + Sbjct: 168 GGRHSKLNDLKDIIEDVADELGAYNDLY-----PEHPHRVALTGFNRRTINKDKNDNLVV 222 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + V S + +A + V ++ +D D F+ Sbjct: 223 RDQRVVSREGEYDKDDTVNFNKTIAQQFIVKGEASRVPNSD-------DDARFYDLYFTT 275 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + K G TA + A + S + K+ I++L Sbjct: 276 DFSSFTKKVKK-----FKAGGGTASLQGIIRAGQIVTSM----------SKNQKQLIIIL 320 Query: 375 TDGENT------QDNEEGIAICNKAKS--------------------------------- 395 +DGE+ + +C+ + Sbjct: 321 SDGEDWNHYAGQTNKLVSKGMCSNILNMVNGGKVSADNTHDDIEVIGGVSQGMMTPDGER 380 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 R+ I F + L NC ++ ++A + ++ I E+ Sbjct: 381 MNARMAVIGFDYELNKNVG----LRNCVGRDNVYKAENKEDILNKILGLITEEVG 431 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 140 bits (351), Expect = 5e-31, Method: Composition-based stats. Identities = 56/479 (11%), Positives = 134/479 (27%), Gaps = 71/479 (14%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +K + G I+ L+ V+ + G+ VD R+ A A++ + L Sbjct: 27 RAIEKFSRDTRGDVAILFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAAVLAGARAL 86 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + ++ + + ++ Q + L + + + + + Sbjct: 87 QTNGGDQAAALRVAQSYYAQATKNRLSLSNDTINFAIADNATAMVTTGNAVIT------- 139 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + F+ G S I K ++ + I Sbjct: 140 ----------------TPFMGLAGTGSLPILRKDGSDYSKAVLAVGGNAELNLEIAMMLD 183 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + R + + V + + Sbjct: 184 ITGSMRGQKLTDMKAAASDLLNIVVWTDQSKFTSKVAIVPFA-------------YDVRL 230 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + S++ V+++ + A+ + + + + T ++ Sbjct: 231 PAAAFKKATGTTSTNYPCVVERTGTEKYTDAAPATGKYVMVHNTSSTKKNKTTYSPTCDV 290 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV------HRMK 362 S ++ + + GSTA + A+ + + Sbjct: 291 ASSAEVLPLTSDKSTLLAKVNGLSTAGSTAGHIGTAWAWYMLAPNWSSLWTSASSTPAAY 350 Query: 363 NNLEAKKYIVLLTDGENTQ-------------------------DNEEGIAICNKAKSQG 397 N +K VL+TDGE + + ++ C K++G Sbjct: 351 NADNLRKIAVLMTDGEYNTQYTTNGVPDDSSSLTRCPNAANGVCSSAQAVSQCTAMKAKG 410 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + T+ F ++ + A LS CA+ + F+ + + L FRD + + Sbjct: 411 IEVYTVGFQLD---NQTAIDTLSQCATDSSHFYNSTTGDALKAAFRDIALKISTLYLSQ 466 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 140 bits (351), Expect = 6e-31, Method: Composition-based stats. Identities = 74/445 (16%), Positives = 137/445 (30%), Gaps = 36/445 (8%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G I+ AL V+LG+ G+ +D R + + AA A + S Q Sbjct: 21 VAADRRGSVAIMVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLAVSRENFQVS- 79 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 +++ F+ N V + P + V + Sbjct: 80 ------------TNDQLKALAQSYFDANFPPGTMGATTSLSVATSGTP--PTVQGFTVTV 125 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 ++ L+ PL L I S I KA + + G+ + V+D S SM Q Sbjct: 126 TATLPLVFAPLVETLGGPTIGSVGIS-KASGAVFTTQTSNQGGMELVIVLDNSASMKGSQ 184 Query: 194 ---RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP--YMVSCNKSLYYMLYPGP 248 R L+ AD Y G + S + + P Sbjct: 185 EDLRGGVKALLDMLYGNADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMANCP 244 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDAL------ASVIRSIKKIDNVNDTVRMGATFF 302 + + + S+ K D + AS + + + + Sbjct: 245 MATVNGKLNGSRLSNAPPKTFKFDSTTDGVEIQYCGASTLGTSSALSPNRGDADKAIKSY 304 Query: 303 NDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + WG L + + + S ++ + ++ + H Sbjct: 305 VAGGDTLIGEGLVWGWRMLTPSWRGLWNTKDQPGASLPLDYDLPYMKKVLVLMTDGVNHI 364 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFL 419 N A T + ++ + + + ICN A K + + TI + + +Q+ + Sbjct: 365 AGRNYTAYYSDPYQTVADASKADADLMTICNAAKKDHNVVLYTITYGSDTDEQQ-----M 419 Query: 420 SNCASPNS-FFEANSTHELNKIFRD 443 S+CAS S + A +L K F Sbjct: 420 SDCASDPSKHYHAALPQDLAKAFTQ 444 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 139 bits (350), Expect = 8e-31, Method: Composition-based stats. Identities = 62/512 (12%), Positives = 152/512 (29%), Gaps = 69/512 (13%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + FI S+K IK+ G + +++P + + + ++ + + L A + A + Sbjct: 24 SLFIIISRKFIKNDKGAILLPFIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQATLAL 83 Query: 65 SVPLIQSLEEVSSRAKNS--FTFPKQKI-----------------EEYLIRNFENNLKKN 105 +V +E+ R F + + + N Sbjct: 84 TVENDDLPDELQMRKNVDLVSNFSSAYLPLEHFSVPEIDIKNNCGQLTYNAKITMSYFAN 143 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVV--------LSSRYDLLLNPLSLFLRSMGIKSWL 157 F + T + + QV + D + F + + Sbjct: 144 FLSKTAMTNAITTIGTEDNGAAIKQVSTIQDKATDVIFVADYSGSMNEGFHGKVPRGEKI 203 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + ++ S K +++ + FS ++ + C + ++ ++ Sbjct: 204 NALRDVFNRLNGSILKNSNINLIGFVPFSWGTKRIVIENSQEKKYCHFPFVPKQYRADNN 263 Query: 218 Q---NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 G++ + +K Y L G + + + R+ + + Sbjct: 264 YFRQYTVSGLKKFPGLEGLTDIDKINYGELTLGEYNTLTNVIKNMAKQEYRNKALEFLRI 323 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-----------WGVHKLIRTI 323 + + ++ + I D + +D D + L Sbjct: 324 TLNIPTYMQQMIFITTTIDYDATIKSINSDAQYIDIPLDDIINESICLNNSNAYSLDSHN 383 Query: 324 VKTFAID----ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ID + +G T ++ + A + ++N K +V+++DG + Sbjct: 384 SHDDLIDKMIAMSPLGQTLVSSGILYANTLF---------KKESNNSNNKLMVIISDGID 434 Query: 380 TQDNEEG-------------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCAS 424 N+ +C + K I+++ IA + + + C Sbjct: 435 VFINDTTIQQSIYISKTLIDKGMCERIKENNIKMVFIAIKDGSNETNEPANYIDWKKCVG 494 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++++ + HEL R + E V R Sbjct: 495 EDNYYYVSDAHELEAALRQSLTTTSSEVVGRN 526 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 53/475 (11%), Positives = 136/475 (28%), Gaps = 46/475 (9%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ TG + +L+ ML ++DV+R + + A A + +I + E Sbjct: 17 MRKQTGGISVFMLVLLMSMLVFAAWVMDVMRIYSVHNQMANATDAA---LASAIISEVPE 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ Y+ +L+ + + + + + + Sbjct: 74 STAVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVVLDFVPNSLNIAAQESVPIRTN 133 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 ++ + N + S + + + ++ + ++ S++ Sbjct: 134 AKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASG 193 Query: 195 DSE----------------------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + A + + RD Sbjct: 194 NVNTGPMEEIYLGSFRRYDHAQVKRENRWSDMFDRASGRTPAVPGRQRNAMCRDLDFEGN 253 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + Y+ + + + + + V S +N Sbjct: 254 NPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPSTNPSNNYR 313 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 A F + +P + R ++ +T + M A + Sbjct: 314 PFHEDKAIFDDIECHVNPIVPFITE---RRHFESTVQRLVPGMNTNNAEGMVWAMRLLSP 370 Query: 353 SNEDEV-------HRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKS--QGIR 399 + R ++ + KY+V+ +DG + D +++ IC + K +G++ Sbjct: 371 YWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVK 430 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +MT+ F + + +CAS ++ S + K+F I +I Sbjct: 431 VMTVNFGG-----AASERLMQSCASGPEYYHVASLFSVEKVFEQIAEQVISSSLI 480 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 68/534 (12%), Positives = 131/534 (24%), Gaps = 100/534 (18%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA------ 57 + ++L G+ I A M + +D+ R + + L+ A Sbjct: 6 TSWLRRALRRLA-DDGGNVAITVAFAMVPLAIGTLGAIDLARGASAKVQLQDALDAAALG 64 Query: 58 ---------QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 T L Q+L S +F + L R Sbjct: 65 AARSSANSPDTLQAAGERLLRQNLALGGDFELVSSSFTFGPDNKVLARAQVRVEPYVAGL 124 Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 V ++ A E+ ++ L +N + + +K+ ET + Sbjct: 125 AGVNNMDIAAATEVVRAGMQLEIALVLDNTGSMNQNN---KLYHLKTAAKAFVTAMETAA 181 Query: 169 RSYHKEHGVSIQWV------------------IDFSRSMLDYQ----RDSEGQPLNCFGQ 206 + + I V ID + S + Q N F Sbjct: 182 EGNTVPNSIKISLVPFSHTVRVDSDAYRNAAWIDQNGSSPINNEIFPTATGTQWANRFTL 241 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + + D + +P + P D ++S Sbjct: 242 FSQLGTSWRGCVESRQAPYDIQDTPPT-TGATLFTPYFAPDEPDYPAEWYGTKFANSYVD 300 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 R ++ + + + + L Sbjct: 301 DRTSSTNWRVRQGNLTKYVN----TKGLGTSKGPNAGCGLRPIIRLTTDFDGLR-----D 351 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------ 380 D GST I + + T+ + KK +VL+TDGENT Sbjct: 352 AVDDLVADGSTNIPMGLVWGWHTLAPMAPFPDGVPYLTEKHKKIVVLMTDGENTILYKDT 411 Query: 381 ------------------------------------QDNEEGIAICNKAK--SQGIRIMT 402 ++ + +C K ++ I I Sbjct: 412 PNGSDYSGVGHARQGRVLDPAGRPITESSSQRERTAALDDRLLKLCANMKAPAKDIEIYA 471 Query: 403 IAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + L CAS + +++ + ++ F+ G + R Sbjct: 472 I----RVEVSSGSSSVLQTCASSADHYYDVQNAADMTMAFQSIAGQIAALHLSR 521 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 59/253 (23%), Positives = 98/253 (38%), Gaps = 25/253 (9%) Query: 220 GKVGIRDEKLSPYMVS-CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 K G + + + Y P + + S H K ++ A Sbjct: 26 DKSGSMGDPTATVNADDPTEPFTYDCNPHLNKKGTKIIYDTCTGSRAHYYTKIEALKIAA 85 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ + D + VR GA ++ + +WG+ + G T Sbjct: 86 GNLFSQLNSADPNAEYVRTGAVSYDLVEYTPSKLAWGITAVTSY-----VNALESGGGTN 140 Query: 339 INDAMQTAYDTIISSN------EDEVHRMKNNLEAKKYIVLLTDGENT-------QDNEE 385 + A+ TAY ++ + N ED H++K KKYIV +TDG+N + Sbjct: 141 SSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLPKKYIVFMTDGDNNDDSRGGRSYDTL 200 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDR 444 A C+ AK++GI TIAF + Q L CAS + +F+A +L F+ Sbjct: 201 TKATCDTAKAKGIETYTIAFMAPEGGQA----LLHYCASDDAHYFQAEKMEDLLAAFK-A 255 Query: 445 IGNEIFERVIRIT 457 IG + +V R+T Sbjct: 256 IGAKASAQVTRLT 268 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 138 bits (346), Expect = 2e-30, Method: Composition-based stats. Identities = 53/475 (11%), Positives = 135/475 (28%), Gaps = 46/475 (9%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ TG + +L+ ML + DV+R + + A A + +I + E Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQMANATDAA---LASAIISEVPE 57 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ Y+ +L+ + + + + + + Sbjct: 58 STAVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVALDFVPNSLNIAAQESVPIRTN 117 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 ++ + N + S + + + ++ + ++ S++ Sbjct: 118 AKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASG 177 Query: 195 DSE----------------------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + A + + RD Sbjct: 178 NVNTGPMEEIYLGSFRRYDHAQVKRENRWSDMFDRASGRTPAVPGRQRNAMCRDLDFEGN 237 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + Y+ + + + + + V S +N Sbjct: 238 NPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPSTNPSNNYR 297 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 A F + +P + R ++ +T + M A + Sbjct: 298 PFHEDKAIFDDIECHVNPIVPFITE---RRHFESTVQRLVPGMNTNNAEGMVWAMRLLSP 354 Query: 353 SNEDEV-------HRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKS--QGIR 399 + R ++ + KY+V+ +DG + D +++ IC + K +G++ Sbjct: 355 YWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVK 414 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +MT+ F + + +CAS ++ S + K+F I +I Sbjct: 415 VMTVNFGG-----AASERLMQSCASGPEYYHVASLFSVEKVFEQIAEQVISSSLI 464 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 59/485 (12%), Positives = 135/485 (27%), Gaps = 106/485 (21%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + +G+ +I A+ + +LG G VD S L+ A A++ +V Sbjct: 11 RGSLGRFASDRSGNIAVIFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVL-LAVS 69 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + V+ ++ F ++ Y + + + ++V Sbjct: 70 NSAINKTVADAQADAVQFFNATLDGYGLSATIDLSVSENDGKRSAVSSFSSSV------- 122 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + FL +G + I ++ + V ++D S Sbjct: 123 ----------------KTHFLDMIGYPTLAIGNRSTSTVSLP-----VYVDFYLLLDNSP 161 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM S+ + + Sbjct: 162 SMGVAATTSDIATMV------------------------------------ANTSDQCAF 185 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + ++ + + V + +VR A+ + + + V + RMG F Sbjct: 186 ACHDLSTSNNYYNLAKKLGVTMRIDVVRQAVQRLTTTATAMSAVTNQFRMGVYTFGSSCT 245 Query: 308 --SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN- 364 + + + ID + N+ T +D +++ + + Sbjct: 246 AIGLTTVANLSSSMSSVQTSVGTIDLMTIPYQGYNNDQCTDFDGSLTAINSAIPSPGSGI 305 Query: 365 -LEAKKYIVLLTDGENTQD----------------NEEGIAICNKAKSQGIRIMTIA--- 404 + +K++ ++DG + +A C KS+GI+I + Sbjct: 306 STQPQKWLFFVSDGVADANYPSTCTKPTVSGGRCQEPLTVAQCTAIKSRGIQIAVLYTTY 365 Query: 405 ---------------FSVNKTQQEKARYF---LSNCASPNSFFEANSTHELNKIFRDRIG 446 F+ + +CASP +FE + T + + Sbjct: 366 LALPTNSWYNTYIAPFNPGPYGPSTNSQIAANMQSCASPGFYFEVSPTQGIAEAMDALFK 425 Query: 447 NEIFE 451 + + Sbjct: 426 KAVAK 430 >gi|86144309|ref|ZP_01062641.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218676258|ref|YP_002395077.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] gi|85837208|gb|EAQ55320.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218324526|emb|CAV26007.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] Length = 438 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 60/467 (12%), Positives = 138/467 (29%), Gaps = 61/467 (13%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K IK G I+ L+P M+ + + + L +AA+ A + + Sbjct: 2 KSIKKNRGVAGILFVGLLPAMVIFMAFSMQMSQQMLAHSRLLEAAEVASLAL-------I 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + ++ Y++ N E+ FT + A S + V Sbjct: 55 ASPKEDEDKNVKYARYLVDRYILDNSEDVDVAVFTRKCEYKDGCVQASGELAPFSDFVVS 114 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +++Y ++ + + V+R Y + V + ++ DFS SM + Sbjct: 115 ATAKYTSWISYEDV--------DLEPEFTVSGRAVTRKYLPQS-VDVYFIGDFSGSMGNS 165 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLD 250 + + L+ + R V N + R +P+ V + G Sbjct: 166 WK-NGKMKLDVVKETIKRVVDDIEKFNTEEKSRVALLGYNPFHVKQTDKTVRVNAYGYYG 224 Query: 251 PSLSEEHFVDSSSLRHVIKK--KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-I 307 + + + S + + + + + F Sbjct: 225 SWRKKYAYNYARSSPGTTVRRMFDKPKLYNEILEPKRGMSRYEVERLHTHNVNFAKYYKF 284 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 D + + ++ T G T+ + + A + NL Sbjct: 285 YDIPLTEDYDEFRSQLMNT---KLQAGGGTSSWNGIIAAAQEANKA---------TNLNP 332 Query: 368 KKYIVLLTDGENTQDNEEG----IAICNKAKS-------------------QGIRIMTIA 404 ++ ++L+DG++ N +C K +S + + I Sbjct: 333 EQVFIVLSDGQDGDKNYLQKLVDQGLCKKLRSTISAKRNRFQSNSPTEAEKTKVTMGVIG 392 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + + +C + + A ++ K + I E + Sbjct: 393 INYKVNESDG----FGDCFGKKNIYHAKDGEDVYKYILNLINEETGK 435 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 47/437 (10%), Positives = 120/437 (27%), Gaps = 61/437 (13%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 11 SALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASLT 70 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 R + + ++E ++ N++ Sbjct: 71 QSRSPAEVVRDYVAKAGLEDYLDEPVVNANTLNVR------------------------- 105 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +++ + ++F++ + I + AE + + + + + Sbjct: 106 ---SVTATAAYSMP--TVFMKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGT 160 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + D+ F S + + + + + L Sbjct: 161 NPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNV-- 218 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 V+ ++ S + + T ++ + Sbjct: 219 -SHRQPYSSCVEFAASDFTTTALANGATLTGSGNSELFSSSSSTQTPT---YYWCPEETA 274 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE------------- 355 + + +K + GSTAI+ M+ + S + Sbjct: 275 AGNPTVTPFSHDPEALKAAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVN 334 Query: 356 ---DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K +VL+TDG++ N K+ I + + + Sbjct: 335 GAFAGRPLAYQSGNVMKVVVLMTDGQHVNQEPRLS---NAVKTGDSDIWATGSATDTNRS 391 Query: 413 EKARYFLSNCASPNSFF 429 R S +++ Sbjct: 392 WSVRL------SNGTYY 402 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/222 (12%), Positives = 65/222 (29%), Gaps = 12/222 (5%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + P + + + + ++ + VR + + + + Sbjct: 357 TDGQHVNQEPRLSNAVKTGDSDIWATGSATDTNRSWSVRLSNGTYY-WKRDGTGGHSGPD 415 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + V + + + ST + + Sbjct: 416 TTIVTRSCTVEQVCTRTRNGSCTRWQDQQVCTDT----PSTVTAHRLSYPDLWHQARVGW 471 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + +Y ++ + T NE IC+ A++QGI + ++AF Q Sbjct: 472 VSGLYSSAGVSGRYSSWVSTLDPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQA--- 528 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CAS + A ++ +F + +R+T+ Sbjct: 529 -LLQYCASTTGHYYATVGPQIRTVFHSIASHITQ---LRLTQ 566 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 74/492 (15%), Positives = 132/492 (26%), Gaps = 53/492 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G+ ++ +L++ + + G VD+ + L A A + V S Sbjct: 12 FPRNRRGNVAMLFSLMLIPITVLSGGAVDINQALNARARLSAALDAAALAVGVHTSVSET 71 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY-QVV 132 E + A E L+ N L + V R + + Y V Sbjct: 72 EAAGIASEFIAANYPDRELGLVGNIVVQLDPDQDRVTVGAESRVETIVLGLIGIEYITVH 131 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS---RSYHKEHGVSIQWVIDFSRSM 189 S + L L + S + + + + I Sbjct: 132 WESEVQRARSSLELVMVLDNTGSMGGSKISSLRSAGLLLTDILFDGADPNRLKIGLVPFS 191 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQN--GKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + + A + + + + + D + C ++ Sbjct: 192 ATVNVGTWHERAWWLDANAQSPLHAENFDPAANRWDLYDSLQNRAWEGCVEARAIPHDIE 251 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P + S + N +R + N + Sbjct: 252 DTAPDTGYPETLFLPYFAPDESNYANNAGYANSYLNDGMGGSNERARMRNTPKYTNAWIN 311 Query: 308 SDP-SFSWGVH-------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 S WG R ++ D G+T I + + Sbjct: 312 SSSRGPEWGCTARPITPLTNQRNVIDDAIEDMIASGTTNIPIGISWGVRVLSPGMPFTEG 371 Query: 360 RMKNNLEAKKYIVLLTDGEN-----------------------------------TQDNE 384 + K +V+LTDGEN N+ Sbjct: 372 VSYDEEGTIKAMVVLTDGENYLDGRNNPNYSHYSGYGYMRDGRLGIQTSSDSTIRNALND 431 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRD 443 A C AKS GIR+ TI F VN + R + +CA+ P +F++ S L F Sbjct: 432 RTEAACEYAKSLGIRVYTITFQVNSSST---RDMMRDCATHPTLYFDSPSDDALRSAFEM 488 Query: 444 RIGNEIFERVIR 455 G+ R+ R Sbjct: 489 IAGDLTNLRLSR 500 >gi|238759128|ref|ZP_04620297.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702676|gb|EEP95224.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 448 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 59/473 (12%), Positives = 139/473 (29%), Gaps = 58/473 (12%) Query: 6 KFIFYSKK-LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 +F YS IK G + +P+ +G+ + ++ + L A + A + Sbjct: 8 RFKQYSVGIFIKDEIGAILWPFIIFLPLFIGLLYLSFEISHYLQKAAKLSDAIEQATLAL 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ + + + + + + YL + + P Sbjct: 68 TIENNTNNPDETQ-TEKNISLVNAYARAYLPSESFSAPVIDIISH--------------P 112 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Y+ + Y L A + + +V+D Sbjct: 113 NYIEYRAATTLNYTPKFLTKELITNIDRRIIVSDNGVAIKNKFTSPGEI---TDVVFVVD 169 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 +S SM D + + + + +N P+ K + + Sbjct: 170 YSVSMDGNFGDEKKTTKIQELRRIFEDLNNTILKNNNTHT--IGFVPFSWGTKKIIGKGI 227 Query: 245 YPG-PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + + + S +K + ++ +V +I + + + F N Sbjct: 228 HRKIYCHFPFVPKTPMPPSYYLGDLKSYNPAKELTDAVKNNIDYDETIKS-ITANYNFIN 286 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDEN---------EMGSTAINDAMQTAYDTIISSN 354 + S+ + +++ I + G T I+ + A D Sbjct: 287 IPIDDIKPSSFCLKGSDAYTLRSDDITNDNIQENIEHEVNGLTLISSGILVANDIFR--- 343 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-----------IAICNKAKSQGIRIMTI 403 K++ K +++++DG + + + + +C + K IR++ I Sbjct: 344 -------KDSKNKDKLMIIISDGNDQEISSDLTQEKITKTLIEKGMCERIKENNIRMVFI 396 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + +C +++EA + HEL R +G V R Sbjct: 397 GIAYTV-----KEIKWEDCVGKRNYYEAQNAHELEADLRQALGTIEASEVGRN 444 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 71/564 (12%), Positives = 154/564 (27%), Gaps = 113/564 (20%) Query: 1 MVFDTKF-IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 M + +L G + ALL+ + + L+D+ R S + L+ A Sbjct: 3 MTMWKRLVTRLVTRLGADERGAIAVQFALLLIPIAVLTFGLIDISRASVQKRQLQDALDA 62 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL--KKNFTDREVRDIVRD 117 A + A+ + ++ + + + + + N ++++V Sbjct: 63 ATLMAARSTATTNADLDTIGDAALATEMAGLGVTFGPGNSSFVLGDNNTVVGTIQNVVIK 122 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + V ++ +N L + L S + ++ + Sbjct: 123 PIISNLWSSTNTPVSATATVMRSINHLEVALVLDNTGSMASSLGSGGSKITALITASKSL 182 Query: 178 SIQW----------------VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 V+ FS ++ QPA V ++++ + Sbjct: 183 VDVLSAAAARATEADAVKISVVPFSMTVNIGSTYQTQTSWLTGTQPAAYGVDNFATSQNR 242 Query: 222 -----------VGIRDEKLSPYMVS-------------------------------CNKS 239 G + + +P+ V+ + + Sbjct: 243 FTLLSNLGLTWGGCVESRPAPFDVTDDAPNPAIGASMFVPFFAPDEPDDNTVNISASSST 302 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID--NVNDTVRM 297 Y ++ + D+ + R + + + K ++ T Sbjct: 303 KYRDARRYSTSYPITNTYLTDTVTPTGTATNAWSTRSTVVAKYATSNKATLLSLAKTGTA 362 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + S + K R VK G+T + + + T+ + Sbjct: 363 YGPNAGCGMTSLMRLTNVKAKADRDTVKGKLDQMIASGNTNVAMGLIWGWHTLSKNAPFA 422 Query: 358 ---VHRMKNNLEAKKYIVLLTDGENTQD-------------------------------- 382 K IVLLTDG+NT D Sbjct: 423 DGVDPATTVGKRTTKVIVLLTDGDNTNDTYNNPNASIYTGYGYITQGRLLNASNSPLGAT 482 Query: 383 ----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEA 431 + C AK+ G++I I V+ + L +CAS P +++ Sbjct: 483 STATNRRDAIDSREARACTNAKAAGVQIYAIGVGVSSHSRG----ILQDCASKPEMYYDV 538 Query: 432 NSTHELNKIFRDRIGNEIFERVIR 455 +L +F G+ R+ + Sbjct: 539 TDAAQLASVFNTIAGSIQNLRITK 562 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 47/437 (10%), Positives = 119/437 (27%), Gaps = 61/437 (13%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 11 SALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASLT 70 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + ++E ++ N++ Sbjct: 71 QSRSPAEVVEDYVTKAGLEDYLDEPVVNANTLNVR------------------------- 105 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +++ + ++F++ + I + AE + + + + + Sbjct: 106 ---SVTATAAYSMP--TVFMKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGT 160 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + D+ F S + I + + + L Sbjct: 161 NPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISISIVPYTGQVNAGADLLATYPNV-- 218 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 V+ ++ S + + T ++ + Sbjct: 219 -SHRQPYSSCVEFAASDFTTTALANGATLTGSGNSELFSSSSSTQTPT---YYWCPEETA 274 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE------------- 355 + + +K + GSTAI+ M+ + S + Sbjct: 275 AGNPTVTPFSHDPEALKAAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVN 334 Query: 356 ---DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K +VL+TDG++ N K+ I + + + Sbjct: 335 GAFAGRPLAYQSGNVMKVVVLMTDGQHVNQEPRLS---NAVKTGDSDIWATGSATDTNRS 391 Query: 413 EKARYFLSNCASPNSFF 429 R S +++ Sbjct: 392 WSVRL------SNGTYY 402 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/222 (12%), Positives = 65/222 (29%), Gaps = 12/222 (5%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + P + + + + ++ + VR + + + + Sbjct: 357 TDGQHVNQEPRLSNAVKTGDSDIWATGSATDTNRSWSVRLSNGTYY-WKRDGTGGHSGPD 415 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + V + + + ST + + Sbjct: 416 TTIVTRSCTVEQVCTRTRNGSCTRWQDQQVCTDT----PSTVTAHRLSYPDLWHQARVGW 471 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + +Y ++ + T NE IC+ A++QGI + ++AF Q Sbjct: 472 VSGLYSSAGVSGRYSSWVSTLDPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQA--- 528 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CAS + A ++ +F + +R+T+ Sbjct: 529 -LLQYCASTTGHYYATVGPQIRTVFHSIASHITQ---LRLTQ 566 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 136 bits (342), Expect = 7e-30, Method: Composition-based stats. Identities = 53/475 (11%), Positives = 135/475 (28%), Gaps = 46/475 (9%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ TG + +L+ ML + DV+R + + A A + +I + E Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQIANATDAA---LASAIISEVPE 57 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ Y+ +L+ + + + + + + Sbjct: 58 STAVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVALDFVPNSLNIAAQESVPIRTN 117 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 ++ + N + S + + + ++ + ++ S++ Sbjct: 118 AKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASG 177 Query: 195 DSE----------------------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + A + + RD Sbjct: 178 NVNTGPMEEIYLGSFRRYDHAQVKRENRWSDMFDKASGRTPAVPGRQRNAMCRDLDFEGN 237 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + Y+ + + + + + V S +N Sbjct: 238 NPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPSTNPSNNYR 297 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 A F + +P + R ++ +T + M A + Sbjct: 298 PFHEDKAIFDDIECHVNPIVPFITE---RRHFESTVQRLVPGMNTNNAEGMVWAMRLLSP 354 Query: 353 SNEDEV-------HRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKS--QGIR 399 + R ++ + KY+V+ +DG + D +++ IC + K +G++ Sbjct: 355 YWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVK 414 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +MT+ F + + +CAS ++ S + K+F I +I Sbjct: 415 VMTVNFGG-----AASERLMQSCASGPEYYHVASLFSVEKVFEQIAEQVISSSLI 464 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 136 bits (341), Expect = 8e-30, Method: Composition-based stats. Identities = 58/477 (12%), Positives = 138/477 (28%), Gaps = 92/477 (19%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ + ++ +G+ +I A+ + +L G +D + L + A++ + Sbjct: 7 YLSLLSRFRRNDSGNIAVIFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAALLA-AT 65 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 S + ++ + L N ++ D V Sbjct: 66 GYTAMRGSSSDAKTAATNMFNGQMSAH-------KLTSNSLSIDITDSVSAR-------- 110 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + F+ G + + + A + ++D S Sbjct: 111 -------TVTGSATVVVKTSFMYMFGYPTMTVSASSSASASFP-----TYMDFYVLVDNS 158 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S ++ L Q A +++ + + L Sbjct: 159 PSQGLGATTADMTTL----QNATTDTCAFACHDTYTSSSKKTLQTNSY------------ 202 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 D + V + +VR A S+ + V++ RM Sbjct: 203 ------------YDKAKKLGVTMRIDVVRSATQSLTDTATSSQIVSNQYRMAVYSMGADC 250 Query: 307 IS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH--RMK 362 S + + + A+D + + N+ M T +D +S+ + Sbjct: 251 GSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGTMSAMSGVIPTQGDG 310 Query: 363 NNLEAKKYIVLLTDGENTQD----------------NEEGIAICNKAKSQGIRI---MTI 403 ++ +K++ ++DG C K++GI+I T Sbjct: 311 SSTNPQKWLFFVSDGVADYAYPTTCSKTTQSGGRCVEPLTTTTCTALKARGIKIAVLYTT 370 Query: 404 AFSVNKTQ---------QEKARYFLSNCASPNSFFEANS----THELNKIFRDRIGN 447 ++ ++ + +CASP ++E +S L +F+ I + Sbjct: 371 YLAITSNGYYNTWVKPWRDSIGTIMKSCASPGYYYEVDSSGSIGSALTALFQQAIAS 427 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 40/397 (10%), Positives = 113/397 (28%), Gaps = 52/397 (13%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 11 YALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASLT 70 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 R + + ++E ++ N++ Sbjct: 71 QSRSPAEVVRDYVAKAGLEDYLDEPVVNANTLNVR------------------------- 105 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +++ + ++F++ + I + AE + + + + + Sbjct: 106 ---SVTATAAYSMP--TVFMKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGT 160 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + D+ F S + + + + + L Sbjct: 161 NPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNV-- 218 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 V+ ++ + + ++ + + + + ++ + Sbjct: 219 -SHRQPYSSCVEFAASDFTTT---ALANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETA 274 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE------------- 355 + + +K + GSTAI+ M+ + S + Sbjct: 275 AGNPTVTPFSHDPEALKAAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVN 334 Query: 356 ---DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + K +VL+TDG++ Sbjct: 335 GAFAGRPLAYQSGNVMKVVVLMTDGQHVNQEPRLSNA 371 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 65/222 (29%), Gaps = 12/222 (5%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + P + + + V ++ + VR + + + + Sbjct: 357 TDGQHVNQEPRLSNAVKTGDSDVWATGSATDTNRSWSVRLSNGTYY-WKRDGTGGHSGPD 415 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + V + + + ST + + Sbjct: 416 TTIVTRSCTVEQVCTRTRNGSCTRWQDQQVCTDT----PSTVTAHRLSYPDLWHQARVGW 471 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + +Y ++ + T NE IC+ A++QGI + ++AF Q Sbjct: 472 VSGLYSSAGVSGRYSSWVSTLDPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQA--- 528 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CAS + A ++ +F + +R+T+ Sbjct: 529 -LLQYCASTTGHYYATVGPQIRTVFHSIASHITQ---LRLTQ 566 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 135 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 63/523 (12%), Positives = 142/523 (27%), Gaps = 77/523 (14%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 V DT +K + G + L+ +ML +GGM +D++R L+Q + Sbjct: 18 VLDTGLAAALQKFRRVEEGALYPFGLTLLTLMLMMGGMALDLMRHEQKRTTLQQTLDRST 77 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + A+ E R + Q + + N + N + V Sbjct: 78 LAAASLQQSLDPESVVRDYFAKANMTQYLSGVTVDEGMNYREVNALAAADTNPFFMQMVG 137 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 ++ ++ +S + ++ + + + S ++ + + ++ Sbjct: 138 ID----SFDAKAASTAEQRISNVEVSMVLDISGSMASNSRLTRLRPAAKEFIDTVINGSD 193 Query: 182 VIDFSRSMLDYQRDSEG--QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 S S++ Y ++ F A + V N Sbjct: 194 PGRVSISVVPYNAQVNLGAGMMSQFNVNALHSTSYCVELPNSVFGSTGLSQATSFVHNGH 253 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHV---------------IKKKHLVRDALASVIR- 283 + + V + + + ++ Sbjct: 254 FDPWGTGNSSNYNCPPTANVAVTPMSGDAAYLKGRVDLLASMGYTSIDVGVKWGTLLLDP 313 Query: 284 -------------------SIKKIDNVNDTVRMGATFFNDRVIS---------------- 308 + + +D V +D + Sbjct: 314 SAQPLINGLVGLGQVDEDFTDRPLDPDEANVLKVLVVMSDGENTEEYKLTAPYRSGPSAI 373 Query: 309 -----DPSFSWGVHKLIRTIVKTFAIDENE----MGSTAINDAMQ----TAYDTIISSNE 355 + + + T + D G T + M A ++ + Sbjct: 374 YKKTSNSKLTLYSDRASTTSDYYWFSDSKWHTTIDGGTTGSVQMTWPEVWAKWSVRYVAK 433 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 D + E + + Q + IC+ AK GI I +I F + + + Sbjct: 434 DIYTKALGGSENSWFETFTDEISYGQKDVRLQQICDAAKDSGIVIFSIGFEAPENGRNQ- 492 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L +CAS S + + ++ FR I ++ +R+T+ Sbjct: 493 ---LRDCASQPSNYFNATGVQITTAFR-AIATQLSH--LRLTQ 529 >gi|170724979|ref|YP_001759005.1| hypothetical protein Swoo_0614 [Shewanella woodyi ATCC 51908] gi|169810326|gb|ACA84910.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908] Length = 503 Score = 135 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 64/497 (12%), Positives = 127/497 (25%), Gaps = 75/497 (15%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 K G I L+ + G+ + ++ R+ ++ L A + A + S+ Sbjct: 9 AKRQRGAAAIYLVFLLIPLFGMVFLALEGTRYIQKKNRLGDATEAASLAVSMANRDD--- 65 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 K I Y+ E + K ++ + + Y+V Sbjct: 66 ----KGYETQLAKDYISSYMRNIKEISQVKVERKEDIDHYPMADGSFEDREYTQYRVTAK 121 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + + L+ K Y + V I +V DFS SM + Sbjct: 122 TEHTSWLHSDL----IPSFKETETLANRALARAYPEYLGDRDVDIVFVSDFSGSMKGSRI 177 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD-------------------------EKL 229 +S + + ++ + K Sbjct: 178 NSLKDAITEISNEILVPRDGETEIRNRIALVPYNMRVVEGDSGRSVCMTQLKYRNPSGKT 237 Query: 230 SPYMVSCNKSLYYMLYPGPL----DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + R + V + S RS Sbjct: 238 GSNYTNYESINWREWANKSYNQVSSCVSNSRKCNGLPGPRADARTIKSVVNDAPSSYRSS 297 Query: 286 KKIDNVN--DTVRMGATFFNDRVISDPSFS---------------WGVHKLIRTIVKTFA 328 K D+ N D R FN+ + W V + Sbjct: 298 KWPDSSNWIDYSRSVDQLFNENSNNVQHHPSYQRLYSGSMCNGKFWTVPLTNQKSEIMKV 357 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLTDGENTQDNE-- 384 + G T++ + + + + K K I++L+DG + Sbjct: 358 NQMSPDGGTSVYQGLLRGAQILDKGRPVNPNEEELEEYNKRLKMILILSDGMESPYESTF 417 Query: 385 ----EGIAICNKAKSQ------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 +CNK ++Q + + I + + Q NC ++ + N Sbjct: 418 SKLVNNYGMCNKIRAQFNDGELPLHMGVIGIKFSASGQN----AFKNCVGADNIIDVNDL 473 Query: 435 HELNKIFRDRIGNEIFE 451 +L + D I Sbjct: 474 DDLIQEILDLIKKGAKS 490 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 135 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 45/437 (10%), Positives = 123/437 (28%), Gaps = 61/437 (13%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ +S G I ++ +ML +GG+ VDV+R+ + L+ A++ A+ Sbjct: 11 SALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASLT 70 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 R + ++E ++ N++ Sbjct: 71 QSRSPAEVVRDYVTKAGLADYLDEPVVNANTLNVR------------------------- 105 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +++ + ++F++ + I + AE + + + + + Sbjct: 106 ---SVTATAAYSMP--TVFMKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGT 160 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + D+ F S + + + + + L Sbjct: 161 NPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLSTYPNV-- 218 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 V+ ++ + + ++ + + + + ++ + Sbjct: 219 -SHRQPYSSCVEFAASDFTTT---ALANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETA 274 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE------------- 355 + + +K + GSTAI+ M+ + S + Sbjct: 275 AGNPTVTPFSHDPEALKLAIDRLSGEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVN 334 Query: 356 ---DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K +VL+TDG++ N K+ I + + + Sbjct: 335 GAFAGRPLAYQSGNVMKVVVLMTDGQHVNQEPRLS---NAVKTGDSDIWATGSATDTNRS 391 Query: 413 EKARYFLSNCASPNSFF 429 R S +++ Sbjct: 392 WSVRL------SNGTYY 402 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/222 (12%), Positives = 65/222 (29%), Gaps = 12/222 (5%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + P + + + + ++ + VR + + + + Sbjct: 357 TDGQHVNQEPRLSNAVKTGDSDIWATGSATDTNRSWSVRLSNGTYY-WKRDGTGGHSGPD 415 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + V + + + ST + + Sbjct: 416 TTIVTRSCTVEQVCTRTRNGSCTRWQDQQVCTDT----PSTVTAHRLSYPDLWHQARVGW 471 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + +Y ++ + T NE IC+ A++QGI + ++AF Q Sbjct: 472 VSGLYSSAGVSGRYSSWVSTLDPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQA--- 528 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CAS + A ++ +F + +R+T+ Sbjct: 529 -LLQYCASTTGHYYATVGPQIRTVFHSIASHITQ---LRLTQ 566 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 134 bits (337), Expect = 3e-29, Method: Composition-based stats. Identities = 57/507 (11%), Positives = 123/507 (24%), Gaps = 91/507 (17%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +++ G ++TAL++ M+ + G +D++ L+ ++ A+ Sbjct: 11 LRSSFLRAKAGSIPVMTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAASLTQ 70 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + K + + +N K+ V DTA Sbjct: 71 TRPPRETIESFLKAAVTKGSYALDVKADELSNAKRV---EASATAVTDTAFLRLIGIDKL 127 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ-----------TKAEAETVSRSYHKEHGVS 178 V + + + + L S + + + ++ Sbjct: 128 TVEAHAEAEEKRKNIEISLLLDMSGSMRFDKSGSYPGPSGAMRINYLRPAAKSFMDMVLA 187 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT-VKSYSSQNGKVGIRDEKLSPYMVSCN 237 + S++ Y P+ +R S Sbjct: 188 DGAEDYTTVSIVPYAGQVSIGPVLFDALARNRRQHDRSSCF------------------- 228 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + G D + + + + H KK D T Sbjct: 229 QFGRNDFTLGVPDFANLPQTQHFTQANHHDALKKAGEAQITEPWWCPDDPHDPRPGTTPD 288 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE-- 355 R +K + T A++ + + + Sbjct: 289 FVAGEGKDTDRTS---VSFLSNDREYLKRQIDNYKLYDGTGTPIALKWGLLLLDPAIQPM 345 Query: 356 --------------------DEVHRMKNNLEAKKYIVLLTDGE----------------N 379 + + K++VL+TDG Sbjct: 346 LREAARYRALSEELDIDARFSNRPASFTDPDTMKFLVLMTDGAISSQRIPKDASKPVQYY 405 Query: 380 TQDNEEGI------------AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + A+C AK + + + TI F + A +SNCAS Sbjct: 406 NNGSLNTDLYSVGDAERFAAALCTAAKQKNVIVFTIGF----DVNDTAAKQMSNCASGAE 461 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVI 454 F + ++ F+ ++I Sbjct: 462 RFYRVNALDIQDAFKSIATAIQKIKLI 488 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 69/523 (13%), Positives = 138/523 (26%), Gaps = 83/523 (15%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 K I G+ II L +++G G VD++R+ L+ A A++ A+ Sbjct: 4 ISQSLTKFIGDRQGNTLIIFGLCAVILVGAAGGAVDMMRYFDTSSRLQDATDAAVLKATQ 63 Query: 67 PLIQSLE--------------------EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF 106 + S + +S T K+ Y + Sbjct: 64 KIEVSEAAAKTAAAMAFEMNLSDHPELQTASHTFAIETSDNAKVVHYTSEITQRPYFLQL 123 Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + I ++ + VL + + N +L++ + Sbjct: 124 LGLGEQTIRVASSAQSESDPFELLFVLDTTGSMASNNKMTYLKTSVSSVLSSLISTYGDG 183 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + Q + S S + ++Y + +VG Sbjct: 184 NEDVKVGVVAFNTQVRLPASTSYSFVDYTQCYLRTSVNYYACRTVWRAYEALCDQVGSTI 243 Query: 227 EKLSP----YMVSCNKSLYYMLYP-----------------------------GPLDPSL 253 + Y V + + Y + L Sbjct: 244 CATATSKAFYRVYTSGGVTYYAASLMGYAKSGSKYTLYTYNLTTTVNASTLVMSTVSSGL 303 Query: 254 SEEHFVDSSSLRHVIKKKHLV-------RDALASVIRSIKKID-----NVNDTVRMGATF 301 + + + + D D V Sbjct: 304 TTSTGTSVAYSPSGYTAFSSSYFSSLATSSWGGCLTDRNQSFDVSVAPYATDVVDSNYIA 363 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + V T+ + G+T I +Q + + + Sbjct: 364 ASCSTTALAKVLDLTSDFTS--VNTYLSSLSPGGNTNITLGVQFGMEMLSPAEPYTKATA 421 Query: 362 KNNLEAKKYIVLLTDGENTQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + KKY++++TDG NTQ+ N C AK+QGI + + Sbjct: 422 FGDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQGITLFVVRVE------ 475 Query: 413 EKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L CAS + +++ + +L K +D R+I Sbjct: 476 DGDSSLLEACASQSSYYYDLSQASDLTKTMQDIFATINKLRLI 518 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 71/517 (13%), Positives = 149/517 (28%), Gaps = 99/517 (19%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +F K I G+F I++A+L+ +L G VD + + ++ Sbjct: 2 RFKDLLSKFIADDNGNFAIMSAILLMPLLLAVGAAVDYSSARDHRNDIQ----------- 50 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S+ + ++ + L ++ L TD+ D+ + Sbjct: 51 ------VTADSAILAAASSYSSSSGVDSLAAGIDSYLDSKLTDQGSNDVDTAAVPKRLSG 104 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + F++ G+K+ + K+ A + + V+D Sbjct: 105 PTLSADGKEICIVVGEGVPTSFMQLAGVKTVDVSAKSCAALPG-----NIDLEVSLVLDV 159 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SM++ R Q + ++ ++P+ N L + + Sbjct: 160 SSSMIEEGRFVPMQTAVKSFLTSF------ANDATVAKRSKIAIAPFSSRFNIGLTHKDW 213 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND------------ 293 + + + + K+ + +V N Sbjct: 214 LKAYGGNDAVPSRW-TDPKSYYKDSKYSFSQWIDNVTTLAYTSSNYYWIGCVEPRADVEM 272 Query: 294 -----------------TVRMGATFFNDRVISD-PSFSWGVHKLIRTIVKTFAIDENEMG 335 T A +N + + +++ D G Sbjct: 273 KDNGAIGTYGLSDAPPSTEAFVAQDYNTGSSTSFCPPPIVPLTSSFSTLQSAIADMTSEG 332 Query: 336 STAINDAMQTAYDTIISSNEDEVH----RMKNNLEAKKYIVLLTDGE------------- 378 ST ++ M + T+ + + KK IV +TDGE Sbjct: 333 STRLDAGMLAGWYTLSPKWRSAWGGGTAPADYSEKVKKVIVFMTDGEMNVKFGSTDPAKS 392 Query: 379 -----------------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 N + C+ KS I I I++S Q L Sbjct: 393 STEKLDWICDKNRTKSCNDTATNALLTTCDSIKSNNIEIYAISYSSEADVQN-----LQT 447 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 C+S ++ + ST + ++ I I +R+T+ Sbjct: 448 CSSGTKYYFSASTTNIKDVY-TAISKNIIGSTVRLTQ 483 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 133 bits (334), Expect = 5e-29, Method: Composition-based stats. Identities = 68/513 (13%), Positives = 137/513 (26%), Gaps = 88/513 (17%) Query: 3 FDTKFIFYSK---KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 + F + + +L + II A+ + +L G VD + + L+ Sbjct: 14 MSSAFRGFLQQALRLGHDERANISIIFAMALLPILSAIGCAVDYTQATRLRSKLQS---- 69 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + S ++ ++ + + + T Sbjct: 70 --------AADAASVASISQQSLGYNAALQMTSDGTVQVAVEEATKLFNGNAANSLGYTN 121 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + +N + V L++ + + F+ +G + + +++ + Y + Sbjct: 122 LSLNAQVMKTGVKLAATVAFSADVPTTFMTVVGYRKLTVTGTSKSTSSLPPY-----LDF 176 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQP-----------------ADRTVKSYSSQNGKV 222 ++D S SM D+E L A + N Sbjct: 177 YLMLDVSGSMGLPSTDAEQTRLAAINPDNYKQYPNGCTFACHFTSASSCPAANQKYNTNG 236 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI----KKKHLVRDAL 278 +S S K L LS V S + V A+ Sbjct: 237 SCMGYPMSRVSYSGVKKLLTNNGGKLPSSLLSSLTAVTSCPTDGSDACIQLRADAVGAAV 296 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-------------WGVHKLIRTIVK 325 ++ + + R+G F + + + + + + Sbjct: 297 QQLLVTANATQKTPNQFRIGLYPFVRYLYAYSPLTASINGSPTTPGTINHAAANLASQLD 356 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------- 378 T A G T +A T I S + + + Y+ L+TDG Sbjct: 357 TGANASLGSGGTHFENAFPTMNGIITSVGDGSASN-----KTQPYVFLITDGAQNPQVYW 411 Query: 379 ------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK----------------AR 416 + + C KS+GI + + Q Sbjct: 412 NGSWSGSNSATTMDTSKCTTLKSRGIIVSVLYIPYQPIQNPTSFANSEDFYANANIPKIP 471 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 L CASP F+ ANS ++ + + Sbjct: 472 PSLQACASPGYFYTANSPADITAALNAMFNHAL 504 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 133 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 70/475 (14%), Positives = 148/475 (31%), Gaps = 110/475 (23%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 ++ A+L+PV+LG+ G+ +D + L+ A A + S+ L + Sbjct: 1 MAMLMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGSLELPYDP-------QL 53 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 K + +Y+ N+ + K + P V + + + Sbjct: 54 DKGLVKGAVNQYMAANYPAAVLKG----------------VTPGTEERSVTVKAEATV-- 95 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 ++F+ ++GI S ++ +A A + + + +VID + SM Sbjct: 96 --DTIFMGALGIGSSTVRAQATAGYNN--------LEVVFVIDNTGSMKGTAIQQANAAA 145 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + S + G V R + P V + G L PS E + + Sbjct: 146 TQLAELIMPDGMETSVKVGLVPFRGKVHIPAGVDGL-ADGCRNADGTLAPSWILEEYKQT 204 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + D + + + ++ Sbjct: 205 KYRYPTGSSLNVPKGT-------------------------CDSIPRVQALTSNRTTIVS 239 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I K + T I++ ++ + +N + +K +++LTDG+ Sbjct: 240 AIAKQ--DALGDASGTVISEGIKWGRHVLTPEAPFTQG--SSNKDMRKVMIVLTDGDTED 295 Query: 382 D------------------------------------NEEGIAICNKAKSQGIRIMTIAF 405 N ++ AK +GI I I + Sbjct: 296 GKCGGNYALNYTPNAYWTNAYYGMFDMNTHCENGGKLNAAMLSEAQIAKDKGIEIFAIRY 355 Query: 406 SVNKTQQEKARYFLSNCAS-----PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + AS + ++ A S ++L +IF+ IG ++ R++R Sbjct: 356 G---DSDSTDISLMKAIASSKAGTDDHYYNAPSAYDLEEIFKK-IGRQLGWRLLR 406 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats. Identities = 47/375 (12%), Positives = 105/375 (28%), Gaps = 30/375 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + +G + ML VGG+ VD++R L+ A++ A+ Sbjct: 20 RSGAQLRAFRQDDSGVLLKPMVGFLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA 79 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S + + K T D +T + Sbjct: 80 DLDQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFKKVQSTI----DTTFETHMLKFSS 135 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + +SR + ++ L + L S ++ V+ + ++ Sbjct: 136 GQDMPLYATSRAEESIDGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTMIANTIDNKM 195 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S S++ Y + N ++++++ ++ Sbjct: 196 SISIIPYATQVSLPTELMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTQELDRTMHFSVW 255 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G D + + + A ++ K + + + N Sbjct: 256 SGS-----------DYRASANPLDSPTCEDSANREILPFQKDANTLKGFIDGLQAEGNTS 304 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + WG L + + G ++ + + N+ Sbjct: 305 IDV--GMKWGTALLDPSA-RPAISALASGGG------------AMVPATFNNRPAAFNDH 349 Query: 366 EAKKYIVLLTDGENT 380 E K IVL+TDG+NT Sbjct: 350 ETVKVIVLMTDGKNT 364 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 41/425 (9%), Positives = 101/425 (23%), Gaps = 29/425 (6%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 +++DV L A + + ++ + + Sbjct: 158 LVLDVSGSMGSNSRL------ANLKVAAKDFVDTMIANTIDNKMSISIIPYATQVSLPTE 211 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + + + V + + + D ++ + + Sbjct: 212 LMDQYNTTDEHAYSNCVNF----VGSHFQTTALSTTQELDRTMHFSVWSGSDYRASANPL 267 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ-PADRTVKSYSS 217 + ++ +R + + + + Sbjct: 268 DSPTCEDSANREILPFQKDANTLKGFIDGLQAEGNTSIDVGMKWGTALLDPSARPAISAL 327 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +G + + + N + D + +++V+S A Sbjct: 328 ASGGGAMVPATFNNRPAAFNDHETVKVIVLMTDGKNTNQYYVESDHREGS-SDVWYNSAA 386 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAIDENE 333 S R + + D S+ S+ + Sbjct: 387 TGSNPRYSLYDPQRDKYWWDRMNRWEDHAYGQGSYRKCGSYSCWWQDEQGEPENVKRLSW 446 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 +T+ + + + Y V + + A+CN A Sbjct: 447 GEL-----FGETSLKYLYKYLYGDWMGQSSARSVWYYGVY-DYWNTSTKDARTRAVCNAA 500 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 K+QGI + TI F + L +CAS ++ E+ F R Sbjct: 501 KNQGIVVYTIGFEAPSSGTA----VLKDCASSDAHHFDVRGLEIRDAFASIAT---SIRQ 553 Query: 454 IRITK 458 +R+T+ Sbjct: 554 LRLTQ 558 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 65/476 (13%), Positives = 142/476 (29%), Gaps = 45/476 (9%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +LI G II L+ ML G +D +R L+ AA +A++ A+ Sbjct: 1 LSRLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAKYKS 60 Query: 71 SLEEVSSRAK-NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + E + N+F + E + + ++ +V D V T +++ +S Sbjct: 61 GMPEATIAKTINAFLSANGEFETAVAGKPQVASDESELCLDVADAVPTTFMKLANIQSVP 120 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + S + L + L S + + + + + + S ++D ++ Sbjct: 121 ISIRSCAALPGVKQLEIALVLDVSSSMIEENRFTPMQTAVAGFLQAFSSNTSLVDKTKIS 180 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + L + ++ + S L Y + Sbjct: 181 IVPFSSRVNFGLANTAWLK--SYNGTAAVPKRWTDP----ESVYTSSGYKLSYWIDGVTP 234 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 S S+ ++ + + + ++ + Sbjct: 235 VMSTSKNYYWMG----CIEPRADVEVRDTGAIGDGMGDAPPSTSAFVAMDANPKSGTSFC 290 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN----NL 365 P + +K+ + GST ++ + + T+ + + + Sbjct: 291 PPPVTPLTG-DFAYLKSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDQSSPAPVSD 349 Query: 366 EAKKYIVLLTDGENTQD-----------------------NEEGIAICNKAKSQGIRIMT 402 K +V +TDGE C K GI I T Sbjct: 350 SVHKVMVFMTDGEMNTKYDPNDKFDWICSQTQSSACNAFATAARQTACTAMKKSGIEIYT 409 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +++S + NCA+ + F S + ++ + I I +R+T+ Sbjct: 410 LSYSADADVVNI-----RNCATNTAHFFTASPATIKTVY-ETIAAAIRGDTLRLTQ 459 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 132 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 58/457 (12%), Positives = 119/457 (26%), Gaps = 30/457 (6%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ---- 70 IK+ G II ALL+ V+LG + VDV RW L ++ I + + Sbjct: 5 IKNQKGAVLIIFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISNPYLG 64 Query: 71 -----------SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + E S+ + ++ + ++ ++ T ++ Sbjct: 65 EDGHLRLAEEVARENFSAGYLMTPDSGERSATFTAYADEDHRIRVEGTVSSPGNLAGLFG 124 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 V+ + + +L+ S + + ++ + ++ + Sbjct: 125 VDWVATSAMGVAKKNEVEIMLVLDRSGSMDGTPMNDLKKAARSFVSFFEETQDQDKMGLV 184 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + A + S + L+ + Sbjct: 185 SFATSVKVDVPLGNNYVSSMTSKINAMDAVGATNAEDSLSQAGNPAKGGLTDQS-GVPGN 243 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + K + Sbjct: 244 KRVQQFVIFFSDGNPTAFRGKFKYNGTDNIDAVVCGTGNDCGTVYTKLGKPEREEWLSYN 303 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F G K ++ + + T S + Sbjct: 304 PRF-TGDGKPKPPGTGTSKCTTRYGGSYVNTTKWYVLDDPDYRLTYRGTTYNSESCFIPT 362 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +N A L+ T + + K ++I TI R FL Sbjct: 363 VGSSNTTAP-----LSTYICTTARGMAVEHAQELKDNNVKIYTIGLG------NIDRDFL 411 Query: 420 SNCASPNSF-FEANSTHELNKIFRDRIGNEIFERVIR 455 S AS SF F A ++ EL IF ++I +I R+++ Sbjct: 412 SQIASGPSFEFYAPTSGELQAIF-NKIAKDIKLRLVQ 447 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 132 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 66/505 (13%), Positives = 149/505 (29%), Gaps = 123/505 (24%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + L + TG+ I+ AL + ML G D +R ++ A+I A Sbjct: 24 RLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAV 83 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + E+ + + + + ++E N Sbjct: 84 KQI-NNSEDTDALKQKVYDWFHAQVENSYALGEIEIDTTNH------------------- 123 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + F++ I + + + + + SY +++ VID Sbjct: 124 --------NITATASGTVPTTFMKIANIDTVPVSVGSAVKGPATSY-----LNVYIVIDR 170 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + + GK Sbjct: 171 SPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGK------------------------ 206 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + D S+ +++ + + DA+ V+ I + D+ ++ +++G D Sbjct: 207 -------KTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDT 259 Query: 306 VISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + M T + A+ + + + + Sbjct: 260 TKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFDVALAALQKIVGTGGDGT-----S 314 Query: 364 NLEAKKYIVLLTDGENTQDN-------------EEGIAICNKAKSQGIRIMTI------- 403 + K ++LLTDG +Q C K++ + + Sbjct: 315 SANPLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNPDWCGYVKNKSATMAVLYTEYLPI 374 Query: 404 -----------------------------AFSVNKTQQEKARYFLSNCASPNSFF-EANS 433 S + T+++ Y L++CAS S F A+S Sbjct: 375 TTDWGYNDTVGSTMASANWKSNWGGTMRSGVSTSITRRDYIPYALADCASSKSLFLSASS 434 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 + E+ + + +R+T+ Sbjct: 435 STEITEGLSALFTQYLSS--VRLTQ 457 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 47/375 (12%), Positives = 106/375 (28%), Gaps = 30/375 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + +G + ML VGG+ VD++R L+ A++ A+ Sbjct: 22 RSGAQLRAFRQDDSGVLLKPMVGFLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA 81 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S + + K T D +T + Sbjct: 82 DLDQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFKKVQSTI----DTTFETHMLKFSS 137 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + +SR + ++ L + L S ++ V+ + ++ Sbjct: 138 GQDMPLYATSRAEESIDGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTMIANTIDNKM 197 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S S++ Y + N ++++++ ++ Sbjct: 198 SISIIPYATQVSLPTELMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTEELDRTMHFSVW 257 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G D + + + A ++ K + + + + N Sbjct: 258 SGS-----------DYRASANPLGSPTCEDRADREILPFQKDANTLKGFIDGLSAKGNTS 306 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + WG L + + G ++ + + N+ Sbjct: 307 IDV--GMKWGTALLDPSA-RPAISALASGGG------------AMVPATFNNRPAAFNDH 351 Query: 366 EAKKYIVLLTDGENT 380 E K IVL+TDG+NT Sbjct: 352 ETVKVIVLMTDGKNT 366 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 49/421 (11%), Positives = 107/421 (25%), Gaps = 31/421 (7%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 + +++ + S +++ + F I + ++ T + Sbjct: 154 DGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTMIANTIDNKMSISIIPYATQVSLPTE 213 Query: 115 VRDTAVEMNPRKSAYQVVLSSRY--DLLLNPLSLFLRSMGIKSWLIQT-KAEAETVSRSY 171 + D + + V + L+ R+M W +A A + Sbjct: 214 LMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTEELDRTMHFSVWSGSDYRASANPLGSPT 273 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ----------PADRTVKSYSSQNGK 221 ++ + L D N + +G Sbjct: 274 CEDRADREILPFQKDANTLKGFIDGLSAKGNTSIDVGMKWGTALLDPSARPAISALASGG 333 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + + + N + D + +++V+S A S Sbjct: 334 GAMVPATFNNRPAAFNDHETVKVIVLMTDGKNTNQYYVESDHREGS-SDVWYNSAATGSN 392 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAIDENEMGST 337 R + + D S+ S+ + Sbjct: 393 PRYSLYDPQRDKYWWDRMNRWEDHAYGQGSYRKCGSYSCWWQDEQGEPANVKRLSWGEL- 451 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +T+ + + + Y V + + A+CN AK+QG Sbjct: 452 ----FGETSLKYLYKYLYGDWMGQSSARSVWYYGVY-DYWNTSTKDARTRAVCNAAKNQG 506 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 I + TI F + L +CAS ++ E+ F R +R+T Sbjct: 507 IVVYTIGFEAPSSGTA----VLKDCASSDAHHFDVRGLEIRDAFASIAT---SIRQLRLT 559 Query: 458 K 458 + Sbjct: 560 Q 560 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 56/515 (10%), Positives = 137/515 (26%), Gaps = 106/515 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + ++P+ +G+ + ++ + + L A + A + +V +S + Sbjct: 17 FIKNENGAILLSFIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQATLALTVDNDESPD 76 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + + F + YL + N T YQV + Sbjct: 77 DDNIKKEKNSKFIINYAKAYLPNEKFSKPVINITSHS--------------DYINYQVDM 122 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + Y + + S + A + + + +V D+S SM +Y Sbjct: 123 TIYYPTKILN-----KIFQTVSPEVSISDNARALKYTTTDSKPTDVVFVADYSGSMNEYF 177 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKV-GIRDEKLSPYMVSCNKSLY----------- 241 +S+ + I P+ Sbjct: 178 DESDESDEKKIVALRRIFKDIQNEIKYNNVNIDIIGFVPFSWGTKNFYSSTLANMEKESF 237 Query: 242 ----YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD------------ALASVIRSI 285 ++ ++ +KK +++ + + I Sbjct: 238 CHFPFVPNQFSPSSDYLAKYNFSEKYNISDLKKFPELKNLDIVDRIKNGECTYNNYRKII 297 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +I++ + + + + + + + +IN A T Sbjct: 298 NEINDKLHYEVDLDESYVNEIYDSLDLACNMAYFDEIADIVESNIDYTKTLESINKADNT 357 Query: 346 AYDTIISSNE-------------------------------------------DEVHRMK 362 ++ ++ Sbjct: 358 INIKMVDMPNNSICLRGSKAFTFDRHNRNNTSISKILGTRATGGTLISSGILTGNNIFLE 417 Query: 363 NNLEAKKYIVLLTDGENT------------QDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K +++++DG+++ N +C K K GI++ IA Sbjct: 418 TDNSYNKLMIIISDGDDSRQTDKEKRYYNISKNLIKDGMCEKIKDNGIKMAFIAIGYVP- 476 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + C +F+ A + HEL R + Sbjct: 477 ---REDIDWRRCVGEENFYFAKNAHELELDIRQAL 508 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats. Identities = 74/555 (13%), Positives = 149/555 (26%), Gaps = 114/555 (20%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 T+ K + G + ++TALL +L + VD + L QA A + Sbjct: 17 STRCFQKLKDFYQEEKGVYAVMTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAALL 76 Query: 64 ASVPLIQ-----------------------SLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 S +++S++ Q + + +R+ Sbjct: 77 LVAENNAYRKNPMHDDVTKQSVSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSENK 136 Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV----LSSRYDLLLNPLSLF-----LRSM 151 K N + E++ Q ++ +N L Sbjct: 137 AQKDNHLPVTIDQPFDYKCEELDLINPKNQYSRRKPVTCYVQGSVNREFWIPLSADLVKT 196 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ------------ 199 K+ + + + V + V DFS SML +++E Sbjct: 197 HTKNGRLPINSGISYAVKEKAIVIPVDLMLVSDFSGSMLWDLKNNENAQYPNRKIDILRS 256 Query: 200 ----------PLNCFGQPADRTVKSYSSQNGKVGIRDEKLS---PYM-----------VS 235 P + +++ G R +K S PY Sbjct: 257 VVSDIQNILFPTKLSEDASPYNRMGFAAFAGGTRQRGDKNSCVMPYYLKSGVHDFRVAYW 316 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK---HLVRDALA------------- 279 S Y P + + + + + K +R AL Sbjct: 317 QLDSFNYRGSPWDCKDTNVLDDRGNPRPVNACLIKGNPEDALRTALNDRHLSTSMKLIFE 376 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT---------IVKTFAID 330 V+ K I V + + + G + + V Sbjct: 377 DVLDVDKTIKQVENFDGNRVNDYKLTYNNPDHCLGGNEGVETSQAWFTKSKPKVAEALSK 436 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNLEAKKYIVLLTDGENTQDNEEG--- 386 GSTA + + ++ N K ++ +++L+DGE+ + + Sbjct: 437 IKPTGSTAASSGFIIGANLLMDKNTVPEAQPAKLGTNTQRILMVLSDGEDNRPTFDTLTT 496 Query: 387 ---IAICNKAKSQG------------IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +C+ + + +I AF + + C N+++E Sbjct: 497 LLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQP--PPEQKAAWQKCVGENNYYEP 554 Query: 432 NSTHELNKIFRDRIG 446 +S L F+ + Sbjct: 555 SSKEALLDAFKQILS 569 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats. Identities = 63/567 (11%), Positives = 143/567 (25%), Gaps = 132/567 (23%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +K ++ G + ++TALL +L + G VD + L Q A + Sbjct: 11 CDSIRKFYENELGVYTVMTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQAALALVAE 70 Query: 68 LIQ------------------------SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 E ++ + K + + + YL N Sbjct: 71 NNDYRENKKHGDVNRQVVSPQDKAKFGGNEFMAKQEKRNQELIQGIAKLYLRSENANASS 130 Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAY-------QVVLSSRYDLLLNPLSLFLRSMGIKSW 156 + +++ +V L + + +K Sbjct: 131 DAPITIDKPFHYSCEELDLPTGNEYARRKPIVCEVQGGVNRKFWLPVSESLVSADKLKQD 190 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--------------- 201 I+ +++ + V + V DFS SM + +D G+ L Sbjct: 191 RIRMESDTSYAIKEKGIVIPVELMLVSDFSGSMNSHLQDKNGRSLGKTKITILREVVSEI 250 Query: 202 -------------NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + F + T Q + + + + Sbjct: 251 SKILLPEDVSEGVSPFNRIGFTTFSGGVRQRDVTEGCVFPYEGKVSTTAQYFTIPYWITG 310 Query: 249 LDPSL--SEEHFVDSSS------LRHVIKKKHLVR-DALASVIRSIKKIDNVN------- 292 + + + S+ + ++ + + Sbjct: 311 NNTPWRRGDNGRWERSTVRFEDHYKGYYERFDRRSCRGSGNNTKCTIYAYPKKIMDYALK 370 Query: 293 -DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA----------------------- 328 + FN+ + ++ + + + Sbjct: 371 INDWTTVREIFNNYMNTEGTLAQISEFNGKNRNYDMVFTDEDRCLGGNIGRRTTRAWFDQ 430 Query: 329 ---------IDENEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNLEAKKYIVLLTDGE 378 G T+ + + + ++ N+ K ++ I++L+DGE Sbjct: 431 KNKDITRELNIVRPSGWTSASSGLLVGANIMMDENKSPDAKPSKLGTNIQRVILVLSDGE 490 Query: 379 NTQDNEEGI------AICNKAKSQ-------------GIRIMTIAFSVNKTQQEKARYFL 419 + + +C+K + Q G RI +AF + Sbjct: 491 DNWPTYSTLTTLLNNGMCDKIREQLGKLQDPNLRELPG-RIAFVAFGYSP--PANQVAAW 547 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIG 446 C + ++ A S EL + F+ IG Sbjct: 548 KKCVG-DQYYTAYSKEELLESFKQIIG 573 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats. Identities = 47/376 (12%), Positives = 109/376 (28%), Gaps = 25/376 (6%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + +G T ML VGG+ VD++R L+ A++ A+ Sbjct: 18 RGAQSLRAFRRDDSGVMAYPTVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRAVLAAA 77 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V + + + EY + + D + + Sbjct: 78 DLDQTQPPAVVVQDYLNKAG----LGEYYQEPIVESGLGYKRVQATIDATFEAHLLRFSN 133 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + V +S+ + ++ L + L S ++ V+ + V+ Sbjct: 134 GNDLPVFATSKAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAAKDFIDTMVANTTDGKM 193 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S S++ Y + N + + + + +S+++ + Sbjct: 194 SISVVPYATQVSLPDDLIDQYTTVGENPYSNCINFEAAEYNSASLSTLDTLERSMHFTPW 253 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ A V+ K + + ++ + N Sbjct: 254 GYSNRDMRTYYSSPRLVRSPVCDER------ASREVLPLQKDATTLKNFIQNLSAGGNTS 307 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + WG L + A+ T + + + ++ Sbjct: 308 IDV--GMKWGTALLDPSARPAI-------------SAISTGIGASVPGDFSDRPAEYSDS 352 Query: 366 EAKKYIVLLTDGENTQ 381 + K IVL+TDG+NT Sbjct: 353 DTIKIIVLMTDGQNTS 368 Score = 117 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 58/418 (13%), Positives = 111/418 (26%), Gaps = 38/418 (9%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV--R 112 + +++ + S +S+ + F + ++ T + Sbjct: 150 DGLEISLVLDVSGSMNSNSRLSNLKVAAKDFIDTMVANTTDGKMSISVVPYATQVSLPDD 209 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 I + T V NP + + L+ L RSM W + S Sbjct: 210 LIDQYTTVGENPYSNCINFEAAEYNSASLSTLDTLERSMHFTPWGYSNRDMRTYYSSPRL 269 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 V D S + L F Q + K G S Sbjct: 270 VRSP-----VCDERASREVLPLQKDATTLKNFIQNLSAGGNTSIDVGMKWGTALLDPSA- 323 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + D S ++ + V+ Sbjct: 324 ----------RPAISAISTGIGASVPGDFSDRPAEYSDSDTIKIIVLMTDGQNTSQYYVD 373 Query: 293 DTVRM--GATFFNDRVISDPSFSWGVHKL-IRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D R ++N S ++ + + GS A++ Y Sbjct: 374 DDHRDGPSGVWYNSHYKSYSTYDSRYGGRYFFHYNNNWYNEPYGNGSGQSGTAVELNYAE 433 Query: 350 IISSNE---------DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + + E + + Y + + G N+ N ++C AK++GI + Sbjct: 434 LFARTSLKYIYRYIFYEWMNFYDARDDWYYGIYSSHG-NSTKNARTRSVCEAAKAKGIVV 492 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 TI F L +CAS ++ + E+ F R +R+T+ Sbjct: 493 YTIGFEAPSNGVA----VLRDCASSDAHYFDVDGLEIKDAFASIAT---SIRQLRLTQ 543 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 130 bits (327), Expect = 4e-28, Method: Composition-based stats. Identities = 61/483 (12%), Positives = 128/483 (26%), Gaps = 80/483 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + I +G+F I+TALL+ +LG G VD L A+ Sbjct: 4 RFIFDRSGNFGIMTALLVVPLLGAAGTAVDFASALSLRTELYA--------AADAAAVGS 55 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +S A L ++ + + ++ D + + + Sbjct: 56 ITPTSEAAAQANTMSGDGSLTLGKSEAQKIFFSQMSKKQGDAPVTVDISVQKKGD----T 111 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 LSS + F++ MG + + Y + ++D + SM Sbjct: 112 LSSTVSFNATMPTTFMQVMGFDEIAVTG-----AATAQYQTPSYMDFFMLLDNTPSMGVA 166 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + K + + + Sbjct: 167 ATTDDITAMKKATANGHDGGKDKNC-----------------------AFACHIVSEKGV 203 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS- 311 + + + + V + +V A+ +++ K ++ R+ A + Sbjct: 204 EDKNSYYNVARNNGVTIRIDVVASAVKALMAKAKDTQSMPSQFRVAAYTSGKTAQDAKAA 263 Query: 312 ----FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT----AYDTIISSNEDEVHRMKN 363 S + L I + QT A I + + + Sbjct: 264 KLFKVSDLNYDLGAVAAAANMIKLMSIPYQNYYSDQQTSFDEALKGIEGEIKGNIGTGTS 323 Query: 364 NLEAKKYIVLLTDGENTQDNEE-----------------GIAICNKAKSQGIRI---MTI 403 N + +K + + DG C K K +GI++ T Sbjct: 324 NADRQKIVFFVADGVGDSYKPTGCTSPKGANGGRCIEPIDTTYCKKLKDRGIKVAVLYTT 383 Query: 404 AFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + + + CA+P +F + T + + + Sbjct: 384 YLPLPDNGFYKDWVKPFETRIAAKMEECATPGFYFAVSPTEGIEEAMEALFRKIVSAP-- 441 Query: 455 RIT 457 RIT Sbjct: 442 RIT 444 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 130 bits (326), Expect = 4e-28, Method: Composition-based stats. Identities = 66/517 (12%), Positives = 146/517 (28%), Gaps = 79/517 (15%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+ + + + AL + V+L + G +D R + L+ A +A++ A+ + Sbjct: 8 LKQFLNATNAGVAPMFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLAAARAYLT 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S + QKI + + NF + +++ + ++ + + + Sbjct: 68 SS---VQVKETKRQEDSQKIASDYLTANLLSSSNNFENNQIQLVFKEDGEIVGNASTKIK 124 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV-IDFSRSM 189 ++ + L + S ++ +T + ++ I+F S+ Sbjct: 125 LIFGGLFGKSDVVLPALAAATVGDSRKLEIVLVLDTSGSMSSQNRMKQLRTASINFVNSV 184 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D + + P + TV + G N + + Sbjct: 185 FDNA--VYERTVQVGVVPWNATVNINMDRPGTWDASPGPAIHNSNYGNGTNQVTSFQDFT 242 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK--------KIDNVNDTVRMGATF 301 + S I L + + + + D + Sbjct: 243 ENLYPPGFSDFGSYSDSDIDDDFGSSGWLGCITATKDERKISSSGNVTPLTDVPPSKMKW 302 Query: 302 F-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 SD R + N G+T + + Y Sbjct: 303 PARKVAGWDPNSDCPSPMLAMSQSRPQIIKKLNQLNPSGNTHADIGLMWGYRMFSQQANW 362 Query: 357 EV---------HRMKNNLEAKKYIVLLTDGENTQD------------------------- 382 ++ +++K +++LTDGENT Sbjct: 363 NNFFGYNSDTKPDSFHSTKSRKIMIMLTDGENTATNSEGYSYYGWCTYTNHYNKWGRYTG 422 Query: 383 ---------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 N + C +S+ + + TIA ++ A L Sbjct: 423 STKDCEVPKGINKDEISNNDLNSLMLDACEVIRSKDVELFTIALDLHSYYDSTAIALLRE 482 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 CA +S +EL++ F E+ + +R++K Sbjct: 483 CAGSDSHAYNIKGNELDETF-----QELASKALRLSK 514 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 130 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 60/482 (12%), Positives = 132/482 (27%), Gaps = 98/482 (20%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+ + G+ +I L +PV+ G VD R + L+ AA A + Sbjct: 5 LKRFFRDTRGNVIMIIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASV-------- 56 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 V+S A + T + + ++ +I ++ + Sbjct: 57 GSVAVNSYAYKANTKGHSSFKTGENQALAIFNSNVKKHNDLNNIKVKAKIKKQSTNLVSE 116 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + +++ Y L MG+ + I K+ + + Y + ++D S SM Sbjct: 117 IGVTADY------RPYLLGLMGMNTMPITIKSTSSSTFPPY-----IDFYLLLDNSPSMG 165 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + + + G Sbjct: 166 VGATTKDIDTMVANTSDKCAFACHQMDKAGN----------------------------- 196 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-------- 302 + + V + +VR A +++ + K + D RM F Sbjct: 197 ------DYYALAKKLKVTTRIDVVRQATQNLMTTAKNTQTLTDQYRMAIYHFGMAADQID 250 Query: 303 --NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH- 359 N + + + K ID + N QT + + + + Sbjct: 251 SKNPAPYEVSALTTNLSTSASNAAK---IDLMTIPYQNYNSDRQTNFPSYLLGMNKVIPS 307 Query: 360 --RMKNNLEAKKYIVLLTDGENTQDN-------------EEGIAICNKAKSQGIRI---M 401 ++ + ++ + ++DG N + C K++G++I Sbjct: 308 SGDGSSSSKPQQVLFFVSDGANDGYDCAYSNGASCRRISPLDTPQCKAMKARGVKIAVLY 367 Query: 402 TIAFSVNKT------------QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 T + + + CA+ +FE +++ I Sbjct: 368 TTYLPLPTNAFYNSHLAKYVSPTSQLAAKMQECATEGLYFEVGPNEGISEAMNALFAKVI 427 Query: 450 FE 451 Sbjct: 428 ST 429 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 129 bits (324), Expect = 8e-28, Method: Composition-based stats. Identities = 55/217 (25%), Positives = 95/217 (43%), Gaps = 13/217 (5%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 G +L + + KK ++++ A+ +I ++ D VR+GA+ +N Sbjct: 157 AQGAFSMALVLDRSGSMDWNLNGQKKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNS 216 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 ++ W K F G T DA AY + E+ H K+ Sbjct: 217 KLTGSTKLRWN-----PGKTKEFVDALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSG 271 Query: 365 LEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KK+IV +TDG+N + +C+ AK GI + T+AF+ + + LS C Sbjct: 272 QVPKKFIVFMTDGDNNYSSADSSTKHLCDDAKDDGIEVYTVAFAA----PNRGKQLLSYC 327 Query: 423 AS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 AS FF+A ++ +L + F++ IG + V R+T+ Sbjct: 328 ASTEEHFFDAQNSAQLIEAFKN-IGYAASKVVSRLTE 363 Score = 93.5 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 53/361 (14%), Positives = 109/361 (30%), Gaps = 31/361 (8%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + T+ +KL+++ G+F +I A +PV+ G + VD + L+ A +A Sbjct: 1 MMSTRITSKIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAA 60 Query: 62 ITASVPLIQSLE-EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + L + + + F +++E ++ V Sbjct: 61 LATAARLSEEENLTAAQAQAFALKFVNGQVKEDFGAFNGFSVTPTVNIDPVET------- 113 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 ++ ++V +S L + R MG + ++E+ + + S+ Sbjct: 114 ---GGRTVWKVAVSMEGSQSL---TPMARIMGKDKLTVSVVGKSESAGEA---QGAFSMA 164 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V+D S SM + + +N ++ + + + S Y S Sbjct: 165 LVLDRSGSMD--WNLNGQKKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGST 222 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI--RSIKKIDNVNDTVRMG 298 PG + FVD+ A +V R D + V Sbjct: 223 KLRWNPG------KTKEFVDALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKK 276 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F ++ + + A D+ T A + E Sbjct: 277 FIVFMTDGDNN----YSSADSSTKHLCDDAKDDGIEVYTVAFAAPNRGKQLLSYCASTEE 332 Query: 359 H 359 H Sbjct: 333 H 333 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 66/505 (13%), Positives = 148/505 (29%), Gaps = 123/505 (24%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + L + TG+ I+ AL + ML G D +R ++ A+I A Sbjct: 11 RLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAV 70 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + E+ + + + ++E N Sbjct: 71 KQI-NNSEDTDALKQKVSDWFHAQVENSYALGEIEIDTTNH------------------- 110 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + F++ I + + + + + SY +++ VID Sbjct: 111 --------NITATASGTVPTTFMKIANIDTVPVSVGSAVKGPATSY-----LNVYIVIDR 157 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + + GK Sbjct: 158 SPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGK------------------------ 193 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + D S+ +++ + + DA+ V+ I + D+ ++ +++G D Sbjct: 194 -------KTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDT 246 Query: 306 VISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + M T + A+ + + + + Sbjct: 247 TKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFDVALAALQKIVGTGGDGT-----S 301 Query: 364 NLEAKKYIVLLTDGENTQDN-------------EEGIAICNKAKSQGIRIMTI------- 403 + K ++LLTDG +Q C K++ + + Sbjct: 302 SANPLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNPDWCGYVKNKSATMAVLYTEYLPI 361 Query: 404 -----------------------------AFSVNKTQQEKARYFLSNCASPNSFF-EANS 433 S + T+++ Y L++CAS S F A+S Sbjct: 362 TTDWGYNDTVGSTMASANWKSNWGGTMRSGVSTSITRRDYIPYALADCASSKSLFLSASS 421 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 + E+ + + +R+T+ Sbjct: 422 STEITEGLSALFTQYLSS--VRLTQ 444 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 63/425 (14%), Positives = 125/425 (29%), Gaps = 72/425 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 S +G I+T L+ ++ G VDV+R+ L+ A A++ A+ + Sbjct: 24 SFATSESGAMTILTLFLIMIVFVASGFAVDVMRYDRERAKLQYALDRAVLAAADLDQELC 83 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + + + K +V Sbjct: 84 PKDVVIDYLKKEGLDKYLT-----------------GDPKVEPDVCGSTAAVLKGYRRVE 126 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++ D+ ++ F++ GI++ + AE + V I V+D S SM Sbjct: 127 ANADMDIEMH----FMKWRGIETIASAATSVAEESIGN------VEISLVLDVSGSMRGS 176 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + ++ + N F + G + PY + Y M + Sbjct: 177 KLENLKKAANLFIDDMFA--------KTEDGKVSISIVPYSEQVSIPDYLMNKLNTQG-T 227 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDAL--------ASVIRSIKKIDNVNDTVRMGATFFND 304 S + VD +S + + + D + + R F Sbjct: 228 NSIANCVDFASADFATTRFTAFDVTDPVTGIVTPGTTLARTIHHDIGDGSDRRPYNGFVS 287 Query: 305 RVISDPSFSWGVHKL-----IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE---- 355 I P+ S ++ +K N G T+I+ + + S + Sbjct: 288 STICRPNTSTNHREITILQKDPVALKKEINLLNASGWTSIDVGAKWGVTLLDDSFQPLTK 347 Query: 356 ------------DEVHRMKNNLEAKKYIVLLTDGENTQ-------DNEEGIAICNKAKSQ 396 + + K ++L+TDGENT+ N I A + Sbjct: 348 KLVTESKVPSIFKDRPDQNKGYDTMKVMILMTDGENTKQHKVNPPYNHGTSDIWWNADKE 407 Query: 397 GIRIM 401 + Sbjct: 408 KYSVY 412 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/101 (23%), Positives = 42/101 (41%), Gaps = 9/101 (8%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + + Y T Q + +IC KAK + I I +IAF + + L Sbjct: 507 AFGTSYANEWYNTSTTVLNQVQKDPRLTSICQKAKDEKIIIFSIAF----DAPDGVKPLL 562 Query: 420 SNCAS-PNSFFEANSTH-ELNKIFRDRIGNEIFERVIRITK 458 C S +++EA ++ +F IG+ I +R+T+ Sbjct: 563 KGCVSDDGAYYEAKDNDKDIISVFSS-IGSTIQN--LRLTQ 600 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 128 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 71/478 (14%), Positives = 136/478 (28%), Gaps = 83/478 (17%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K + +G+F I+TALLM + G G+ +D+ R + L+QAA S Sbjct: 2 KNFWQEKSGNFGILTALLMVPLCGAAGVALDITRGMSVKADLQQAAD------------S 49 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ +K+ + N + F D R T ++ + Sbjct: 50 AALAAVADMSASVQAAKKMSGDGVIPVGNEEARAFFDGNQRGDADYTITSVDVSVIKHGN 109 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 V+ S + + +G + T + Y E ++D + SM Sbjct: 110 VVESSVSFKASVSTTLSGLLGKDFVSVAG-----TATAKYETETFSDFYLLLDNTPSMGV 164 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ L D Sbjct: 165 GATPTDVATLVANTGDKCAFACHIV--------------------------------KDG 192 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRV 306 + + V + +V A AS++ + K ++ RM F + ++ Sbjct: 193 VADPNSYYFKAKKLGVTTRIDVVAKATASLMDTAKSTRKSSNQYRMAVYTFGERAEDTKL 252 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + S + + + + + G I ++ Sbjct: 253 LEVVSLTSDLDAAKKKAGEINLMSIPYQGYNNDQQTDFDRALIQIGDKVGSSGTGASSAN 312 Query: 367 AKKYIVLLTDGENTQDN-------------EEGIAI--CNKAKSQGIRI---MTIAFSVN 408 K I ++DG +E I I C K K +G RI T + Sbjct: 313 PDKVIFFVSDGVGDSYKPSSCTKKLTGGRCQEPIDIKDCTKLKEKGFRIAVLYTTYLPLP 372 Query: 409 KTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 Q + + +CASP +FE + + ++ I R+T Sbjct: 373 TNDWYNSWIKPFQAEIGSRMQSCASPGLYFEVSPSQGISDAMTVLFKKAITSP--RLT 428 >gi|84385675|ref|ZP_00988706.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] gi|84379655|gb|EAP96507.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] Length = 520 Score = 128 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 52/509 (10%), Positives = 136/509 (26%), Gaps = 72/509 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G + LL+ ++G+ V+ R+ L+ +A+ A I ++ Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAVTIEDQPDQAR 75 Query: 75 VSSRAKNSF-----------------------------TFPKQKIEEYLIRNFENNLKKN 105 + + + + + Sbjct: 76 GLATKYVENYVRDIKSTNLSADRFHQAEDEGAGVLEYIQYTVNAKTTHDSWFASSFIPSF 135 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 +++ + + +V S + +N R + I A + Sbjct: 136 DQQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMNDRWGSSRHIKIDDLKTAIDAISS 195 Query: 166 TVSRSYHKEHGVS--IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------ 217 + + K+ V ++V D + F V + Sbjct: 196 KILCTSIKQDYVDGEWKYVCDEPGEDTTGDKLLNRVGFVPFNVRTREIVSGNQARATSQL 255 Query: 218 ---QNGKVGIRDEKLSPYMVSC------NKSLYYMLYPGPLDPSLSEEH---------FV 259 N K + + N + SE Sbjct: 256 SYKDNYKTNVSPYSYNDVNWDYWRTYSQNDVIRCADRQSRCSYPKSENQKYAKRIKDVIY 315 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + L + + +++++ + G + FN S+ S + Sbjct: 316 QDNYLVADVYNYVDLSTSVSTMFTDKSGLQPDF-YSVSGTSLFNAHGSSNSSQFSNIRLS 374 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDG 377 + G TA + + + + + + + KK +++L+DG Sbjct: 375 NKLSDLNPINSMWADGGTAAFQGILRGSQVLHDGDPNSSDQEEQQVYNKKIKMLLILSDG 434 Query: 378 ENTQDNEEGI-----AICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNS--F 428 + + +N +C+KA+++ G+ I I + + +C + Sbjct: 435 QESPNNGILKGLVDRGMCDKARNEIPGLYIGVIGI----DFRASQQSGFQDCVVDPNEDI 490 Query: 429 FEANSTHELNKIFRDRIGNEI-FERVIRI 456 + ++ EL + + I + ++ Sbjct: 491 IDVSNLDELIEKIEELIRKGSKTSGITKL 519 >gi|170743327|ref|YP_001771982.1| hypothetical protein M446_5224 [Methylobacterium sp. 4-46] gi|168197601|gb|ACA19548.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 478 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 64/495 (12%), Positives = 133/495 (26%), Gaps = 59/495 (11%) Query: 5 TKFIFYSKKL------IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 ++ + ++L + G I+ AL + +LG+ G+ VD + + L Sbjct: 2 SRLLSALRRLGRWAARARRPDGSVNILFALSLIPVLGLVGLAVDYGLAAADKTTLDH--- 58 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + V++++ + + + I N + Sbjct: 59 ------AADTAALAAVVTAKSYIAANQGQANLTANAIA-AGLAQAANVFAVNAGSVPFAQ 111 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 P+ L++ + F + +G + L+ A SY + Sbjct: 112 VTLQPPQLVRSGQTLTATVSYGATIQNSFGKLLGTPTTLLGNSVTASADLPSY-----LD 166 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLN-------------CFGQPADRTVKSYSSQNGKVGIR 225 ++D S SM L C + GK+ +R Sbjct: 167 FYLLVDVSGSMGLPATPGGMTQLASVNKDMWSDYQQGCQFACHFPGFTGWGLAAGKIQLR 226 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + ++ + S + P + ++L + + A + Sbjct: 227 SDAVNAAVCSLIQRASTPAVPNQYRVGI-YPFINQMATLVGITGSVASLNAAAQCALSWP 285 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 N+ DT + D S ++ + + ST+ + Sbjct: 286 LAFTNLLDTGTTQLFAYGDPTTGTASGGTHFEVVMPQLQAAIKAFGDGSSSTSPKPFVFL 345 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----CNKAKSQG--IR 399 D + + + D + I C KS G I Sbjct: 346 ITDGMQNGQHYGAPANGTYAYPGNPSSFWGYADAWWDGSQPSQIDPTVCAGLKSAGATIS 405 Query: 400 IMTIAFSVNKTQQEKA----------------RYFLSNCASPNSFFEANSTHELNKIFRD 443 I++I +++ L CASP F AN+ ++ Sbjct: 406 ILSIPYNLITFVNNGGGVAWENNRVSGFSPTLATPLKACASPGFFATANTPADITASLNA 465 Query: 444 RIGNEIFERVIRITK 458 RV R+T+ Sbjct: 466 MFDQ--ALRVARLTR 478 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 86/471 (18%), Positives = 170/471 (36%), Gaps = 89/471 (18%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ-TA 60 + + ++++ G+F I+TA++MPV+LG G+ +D + + L++A A Sbjct: 1 MSTSFLHPCLRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + A+ + + + F ++ Y+ + +++K D V Sbjct: 61 LAAATALASGAASTTADAEAIAKDFVSGQMANYVDTDAISSIKAG--------TSVDIDV 112 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +Y+V +++ Y + + F+ +G K+ A T S + + +S++ Sbjct: 113 SATATSKSYKVTVATSYGIAA---TPFMSVLGYKTL--NIGASTSTSSGTSDTKTALSME 167 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V+D S SM + ++ NGK + +L Sbjct: 168 LVLDQSGSMGEKTTTC-------------------ATYNGKNCKTYVTKIDALKKAADAL 208 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + L D SL ++ +K ++ Sbjct: 209 FDALDTADPDHSLVRTGAYSYNNGLIYNSQKTQIKSM----------------------- 245 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE--- 357 +WG T+ G T + M+ A +I +++ Sbjct: 246 ---------SGMAWGTATT-----ATYVSGITASGGTDATEPMRQATLSIAKASDGSDVE 291 Query: 358 --VHRMKNNLEAKKYIVLLTDGENTQD--------NEEGIAICNKAKSQGIRIMTIAFSV 407 H +K N +YI+L+TDGE T + ++ C+ K+ GI+I T+AF Sbjct: 292 TQAHAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCDATKTAGIKIFTVAFMA 351 Query: 408 NKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +K + L CASP +++EA + +L F I E + V +T Sbjct: 352 ----PDKGKQLLQYCASPGGNYYEAETMEKLVASFTS-IAKEATKAVTLLT 397 >gi|260775644|ref|ZP_05884540.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] gi|260608060|gb|EEX34229.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] Length = 407 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 62/456 (13%), Positives = 127/456 (27%), Gaps = 69/456 (15%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 GH I+ A+L+P++ GV + D R + ++ A+ A+ + + ++ Sbjct: 3 HKQQGHASILFAMLIPLLFGVFALGSDGARAIQSKARIEDAS-----EAAALALSARDDE 57 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + T + IEEYL + E D+ Q + Sbjct: 58 HAMSDENKTIVQAYIEEYLPVEDSDVTILGIERLECDDMPECRQGSGRGEARYTQYSVR- 116 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + F +++ +R Y V I + DFS SM Sbjct: 117 ---VSADQTPWFGGGSPEVEVPEVWRSQGGAKARKYQSN-AVDIVFAADFSGSMASPWTG 172 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 ++ + N + VS +L Y ++ + Sbjct: 173 GSQPKYRDLIDILEKVTVELAPYNFDSQRYNSS---VGVSGFNALTYRNELCAVNNLEKQ 229 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 R V + + I N D + Sbjct: 230 GLLGVVDYSRTVA-RMWETKSCRPPSIS-------------------NSAGFHDVPLTDD 269 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 RT G TA A+ + + + ++ +++++ Sbjct: 270 YSTFNRT-----VDRFTARGGTASYQAVMSGARLLD-----------HGSNNRQILIVIS 313 Query: 376 DGENT----QDNEEGIAICNKA------------KSQGIRIMTIAFSVNKTQQEKARYFL 419 DG++ + +C + R+ I F + + Sbjct: 314 DGQDNNLNHTNGLVNAGMCRDIISRLEGRPSANGRDVSARLAFIGFDFEPSMN----PAM 369 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 C ++ F+A +T EL + I E+ R Sbjct: 370 VRCVGEDNVFKAENTDELFEQIMFLIREEVGHLATR 405 >gi|149909171|ref|ZP_01897828.1| hypothetical protein PE36_09171 [Moritella sp. PE36] gi|149807695|gb|EDM67641.1| hypothetical protein PE36_09171 [Moritella sp. PE36] Length = 402 Score = 127 bits (317), Expect = 5e-27, Method: Composition-based stats. Identities = 65/449 (14%), Positives = 131/449 (29%), Gaps = 62/449 (13%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +I+ G + ++P ++ + + V +S QAA +A++ + Sbjct: 2 SVIQRQRGAITLTFTFMLPAIVSLLAITVFFAMYSQVVIRAGQAADSAVLACA------- 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +N + I +Y NF N + + Q Sbjct: 55 -----YQQNDTGVVTEGILDYYRPNF----------VLPELNKSVKLNSNNGCQISAQYR 99 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L + S + V+ ++ V V+D S SM + Sbjct: 100 FEPAMVNALPVAIDSDTEVVSNSQSSAKLVQNVNVNG---IQNPVDFSLVLDISGSMTWH 156 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + + P+ S L P Sbjct: 157 LPELKKIITDVISDIVPS-----------SNQVRFSIVPFQTGVGVSGAPWLLSSEASPK 205 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 VD R+ +L + N+ Sbjct: 206 -----CVDGLVYRNGNLDADKTVQSLNYSSDRLDF----NEVTPGRWLDRCSETSFILPL 256 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH--RMKNNLEAKKY 370 + ++++IR + GSTA T+ + E ++++ ++ Sbjct: 257 TNNLNRVIRYVESLD----TSGGSTASYQGFIWGVRTLTDQWQKEWQVTPVQSSSLTQR- 311 Query: 371 IVLLTDGENTQ----DNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCAS- 424 ++L TDG++ + ++ +C+ + I++ I F V+ + CA Sbjct: 312 LILFTDGDDNRRDYFNDLMSAGLCDVIQQDLNIQVSFIGFGVSADR----IKQFKQCAGR 367 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERV 453 S F+AN+T EL F D I I +V Sbjct: 368 NGSVFDANNTAELADYFEDAININIETKV 396 >gi|37676262|ref|NP_936658.1| hypothetical protein VVA0602 [Vibrio vulnificus YJ016] gi|37200803|dbj|BAC96628.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 442 Score = 126 bits (316), Expect = 7e-27, Method: Composition-based stats. Identities = 44/472 (9%), Positives = 129/472 (27%), Gaps = 64/472 (13%) Query: 7 FIFYSKKL--------IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 Y + IK G II ++P+++ + + + + + +AA+ Sbjct: 3 LKRYPRHFRLRGSSMSIKKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAE 62 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + + + + + ++ ++ Y+ N + + F R Sbjct: 63 VASLALIASPKEG-------DEKNQEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQ 115 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + V +++D ++ + + + SR + + + Sbjct: 116 RSGELAPFTDFVVSAKTKHDSWISYNEGEMGL------TKDFEVMGTSTSRKFLPQ-PLD 168 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I ++ID S SM++ S + +R V R + + + + Sbjct: 169 IYFIIDMSGSMVNPWGGSGKTKYDVVADTINRIVDDLREFKTDRKSRVAVIGFHHTAVKQ 228 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + + + + ++ + + Sbjct: 229 VGRQRTAFDYSSYRTPSATVNNMFTAPKIHSRNDS-----GNIKTFEDI--PLTEDYDAF 281 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 T FN + L + + +T +N + + + + + Sbjct: 282 LTKFNSSNYYAS-----RYGLTESWQGIIGAAQMAEQATDLNP--EQVFILLSDGRDGDF 334 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ--------------------GI 398 R + + + +C K K++ + Sbjct: 335 VRYYLEGRQWREVRYNKY----LNRLVKAGLCEKLKTRISQKRNVFQSENPSDKASKTKV 390 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + I + + + +C ++ + A +++ K + I E Sbjct: 391 TMGVIGVNYVVDKSDGI----GDCFGHDNIYHAKEGNDVYKYILNLINEETG 438 >gi|186896818|ref|YP_001873930.1| hypothetical protein YPTS_3520 [Yersinia pseudotuberculosis PB1/+] gi|186699844|gb|ACC90473.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 518 Score = 126 bits (315), Expect = 9e-27, Method: Composition-based stats. Identities = 61/505 (12%), Positives = 136/505 (26%), Gaps = 89/505 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QS 71 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 76 Query: 72 LEEVSSRAKNSFTFPKQKI--------------------EEYLIRNFENNLKKNFTDREV 111 + ++ + + EY N Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNEILGYTEYNASLQMNYQLALLNSYLK 136 Query: 112 RDIVRDTAVEMNPRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + V N Y VV + + +N + + L + KA Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFGDIELNNRITKLDELKAI 196 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS---SQNG 220 ++ G++ + FS + + C + + + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 256 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE------------------EHFVDSS 262 +++ + + Y L LSE ++F+D + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDPSFSWGVH 317 + I + + +I ID +T+ + FF + + S ++ Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDP- 375 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ G T + + + S + K +++L+DG Sbjct: 376 --NSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KLMIILSDG 422 Query: 378 ENTQDN-----------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 ++ +C K K GI+++ I + Sbjct: 423 DDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEK 480 Query: 421 NCASPNSFFEANSTHELNKIFRDRI 445 +C +F+ A + HEL + Sbjct: 481 DCVGTGNFYLAKNAHELEISIERAL 505 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 51/482 (10%), Positives = 134/482 (27%), Gaps = 100/482 (20%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + Y +K G+ ++ A+ +L G +D + L+ A A++ Sbjct: 3 RLLRYLRKFGHDQRGNIAVLFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLAGV 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + +A + + N + + T + Sbjct: 63 ---------AAGKAALDSGATQAAAIAMAQAASSSYFTGNTAKIDATPTINFTTMGR--- 110 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 LS + F+R +G + + + + S + +++ ++D Sbjct: 111 ------TLSGTGSATSVMNTSFMRLVGFPTMTLN-----ASSASSATMQPYLNVYLLVDI 159 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML + + A ++ + + Sbjct: 160 SSSMLLPATQAGITQMRNGTGCALACHETTNGTD-------------------------- 193 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +V+ + +V + +++ + + V++G F+++ Sbjct: 194 ------------SYSYALKNNVLLRYQVVNQGVQNLLTYLNSSAVYKNYVKVGLWSFDNQ 241 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S + + + +T + + + ++ ++ + + Sbjct: 242 LTQLSSLTSSFSSVAANFPAPGLAYNDAAAATPFDSLIGSFVSSVGTAGDGS-----TSA 296 Query: 366 EAKKYIVLLTDGENTQDNEEGIAI-------------CNKAKSQGIRIMTI--------- 403 +K +++ TDG N CN KS G+ + I Sbjct: 297 TPQKLVIIATDGVNDPTRAWTSQTSLRSQVRVFNTAFCNTFKSNGVTVAIINTPYYPMTW 356 Query: 404 --AFSVNKTQQ---------EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 ++ Q + L +CA +F A+ + F R Sbjct: 357 DWGYNATLGQPGSLGGATRVDDIPIALKSCAGS-NFIIASDVATIQNAFTTLFNKASPVR 415 Query: 453 VI 454 + Sbjct: 416 LT 417 >gi|320158392|ref|YP_004190770.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319933704|gb|ADV88567.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 442 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 48/472 (10%), Positives = 132/472 (27%), Gaps = 64/472 (13%) Query: 7 FIFYSKKL--------IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 Y + IK G II ++PV++ + + + + + +AA+ Sbjct: 3 LKRYPRHFRLRGSSMSIKKQQGVAGIIFMGMLPVLVIIMVFSMQMTQRHMAHAKITEAAE 62 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + + + + + ++ ++ Y+ N + + F R Sbjct: 63 VASLALIASPKEG-------DEKNQEYAQKIVDHYIPDNKGEVVARVFNRRCEYKDGCVQ 115 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + V +++D ++ + + + SR + + + Sbjct: 116 RSGELAPFTDFVVSAKTKHDSWISYNDGEMGL------TKDFEVMGTSTSRKFLPQ-PLD 168 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I ++ID S SM++ S + +R V R + + + K Sbjct: 169 IYFIIDMSGSMVNPWGGSGKTKYDVVADTINRIVDDLREFKTDRKSRVAVIGFHHTAVKK 228 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + + + + I++ + I + + Sbjct: 229 VGRQRTAFDYSSYRTPSATVNNMFTAPKIHSRNDSSN------IKTFEDI-PLTEDYDAF 281 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 T FN + L + + +T +N + + + + + Sbjct: 282 LTKFNSSNYYAS-----RYGLTESWQGIIGAAQMAEQATDLNP--EQVFILLSDGRDGDF 334 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ--------------------GI 398 R + + + +C K K++ + Sbjct: 335 VRYYLEGRQWREVRYNKY----LNRLVKAGLCEKLKTRISQKRNVFQSENPSDKASKTKV 390 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + I + + + +C ++ + A +++ K + I E Sbjct: 391 TMGVIGVNYVVDKSDGI----GDCFGHDNIYHAKEGNDVYKYILNLINEETG 438 >gi|317491692|ref|ZP_07950127.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920126|gb|EFV41450.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 416 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 55/467 (11%), Positives = 131/467 (28%), Gaps = 65/467 (13%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + ++ + +G F I ++ +L + ++ R+ L A + A Sbjct: 1 MFMQQPLLASIRRFKQDRSGAFAISFVMMSGFLLSMAAFGLEGSRYITERARLSDAMEQA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + + + S + + + + + K+ +++ Sbjct: 61 ALALTAEDNGDGAQRNY--TLSSDYFRAYMRHDVDVFKPTVIVKSGISPNNQNLS----- 113 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + R S F + S+ Q +R + + + Sbjct: 114 -----------YVEYRVSGQTLQDSWFSSTF-FPSFDKQVVIGDNGAARKFRSN--MDVI 159 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +V DFS SM + S + + KVG P+ + + Sbjct: 160 FVTDFSGSMNEGFGGSTKLAELKRIVLKLSDELFSYNIDNKVG-----FVPFGWGGKEGI 214 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P S+ + A+ + V+D + Sbjct: 215 DCDFPFVSHGPVPSDILAGGNYKALEKYIDISGSVAAIPN---------PVHDIQIPLSN 265 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + + SW V + G T ++ + + S Sbjct: 266 VNGSTCLRNSH-SWKVPLTSSLAEINQINGMSAEGGTLVSSGVLLGVPYLASGTAS---- 320 Query: 361 MKNNLEAKKYIVLLTDGENT------QDNEEGIAICNKAK-----SQGI-RIMTIAFSVN 408 +K +V+++DG + N +C+K + + + +I I + Sbjct: 321 -------RKVMVIVSDGTDDPKNVMITPNLINAGMCDKIRQVLSTDESVGKISFIGIAYY 373 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 T +C +F+ + EL + R + E+ +++ Sbjct: 374 PTVD------WKSCVGDKNFYLPQTIDELEEDLRRAVFEEVGHNILK 414 >gi|153946957|ref|YP_001399586.1| hypothetical protein YpsIP31758_0593 [Yersinia pseudotuberculosis IP 31758] gi|152958452|gb|ABS45913.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP 31758] Length = 518 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 61/505 (12%), Positives = 136/505 (26%), Gaps = 89/505 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QS 71 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDAIEQASLALSTENNYRND 76 Query: 72 LEEVSSRAKNSFTFPKQKI--------------------EEYLIRNFENNLKKNFTDREV 111 + ++ + + EY N Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNEILGYTEYNASLQMNYQLALLNSYLK 136 Query: 112 RDIVRDTAVEMNPRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + V N Y VV + + +N + + L + KA Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFGDIELNNRITKLDELKAI 196 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS---SQNG 220 ++ G++ + FS + + C + + + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 256 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE------------------EHFVDSS 262 +++ + + Y L LSE ++F+D + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDPSFSWGVH 317 + I + + +I ID +T+ + FF + + S ++ Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDP- 375 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ G T + + + S + K +++L+DG Sbjct: 376 --NSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KLMIILSDG 422 Query: 378 ENTQDN-----------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 ++ +C K K GI+++ I + Sbjct: 423 DDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEK 480 Query: 421 NCASPNSFFEANSTHELNKIFRDRI 445 +C +F+ A + HEL + Sbjct: 481 DCVGTGNFYLAKNAHELEISIERAL 505 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 88/449 (19%), Positives = 176/449 (39%), Gaps = 54/449 (12%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F+F KK+I S +F I+ ++++ +L G+L+ V+ + + ++A++ A +AI++ + Sbjct: 7 FLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSAILSGAS 66 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 +I +S N + + I + + R ++ +K++ + + + Sbjct: 67 KII---SRISYFGDNMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNI------ 117 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + +S+ + P +S L K I + Sbjct: 118 ----ISQNSKVSITREPHPNVFHEFNNQSILQNKKTFYH-------------ISVETFYD 160 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + + VK G +V + S+ ++ Sbjct: 161 YHIKFFDNLLNKKINSKIISFVPALVKI------DTGEHPFFFVQLVVDLSASMSCLMNS 214 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV-NDTVRMGATFFNDR 305 P + E V S ++ K ++ A+ + S+ + DT +G T + R Sbjct: 215 DPEHAT---EFSVCGKSKKN--SKMDALKKAVLLFLDSVDRGSKTQKDTHYIGLTGYTTR 269 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + SWG K+ + IV+ ID N +G T AM+ AY + S + R + Sbjct: 270 VEKNIEPSWGTGKVRKYIVEE--IDVNMLGQTDSTPAMKKAYQILTSDKKRNFIRNILHK 327 Query: 366 EA-------KKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +K+++ LTDGEN + + I IC KAK I+I+TI+ + + + Sbjct: 328 RIKIPPLPFQKFLIFLTDGENNDPKSDVKTIKICEKAKKNSIKILTISINASANG----K 383 Query: 417 YFLSNCAS-PNSFFEANSTHELNKIFRDR 444 L C S P ++ T L ++F+D Sbjct: 384 RLLKKCVSAPEYYYNVVDTGSLLRVFQDI 412 >gi|261251272|ref|ZP_05943846.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] gi|260938145|gb|EEX94133.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] Length = 436 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 63/463 (13%), Positives = 147/463 (31%), Gaps = 61/463 (13%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I+ G I+ L+P+M+ + + + + +AA+ A + +S ++ Sbjct: 4 IRKNRGVAGILFIGLLPIMVIFMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDD 63 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + +Q ++ Y++ N + + +T + A S + V Sbjct: 64 NV-------KYARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQASGEVAPFSDFVVSAK 116 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + + + L+ + + ++V+R Y + V + ++ DFS SM + + Sbjct: 117 AEHKSWIAYEEAELKP--------EFEVAGKSVTRKYLPQ-PVDVYFIGDFSGSMTGHWK 167 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLDPS 252 L+ Q +R V+ + N + R +P V + ++Y Y Sbjct: 168 G-GKTKLDVVKQTIERVVEDIADFNTEEKSRVALLGYNPLHVKQSGTVYLNAYGYRRSWP 226 Query: 253 LSEEHFVDSSSLRHVIKK---KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + + + K V + +R + ++D V + V F + D Sbjct: 227 KKIAYDYARGTSAQTVAKMFDPPSVYSRVQEYVRGMSRLD-VENLVVNNDRFVDYYKFYD 285 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + + G T+ + + A N+ ++ Sbjct: 286 IPLTEDYTHFKARLASAW---LGAEGGTSSWNGIIAAAQ---------EANRATNINPEQ 333 Query: 370 YIVLLTDGENTQDN----EEGIAICNKAKS------------------QGIRIMTIAFSV 407 ++L+DG + N +C K +S + + I Sbjct: 334 VFIVLSDGADNDTNYLQRLVNQGLCTKLRSTISAKRNRFQSKTGSAGKTKVTMGVIGV-- 391 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + K +C + + A ++ K + I E Sbjct: 392 --DYRVKESDGFGDCFGRKNIYHAKDGDDVYKYILNLINEETG 432 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 58/549 (10%), Positives = 148/549 (26%), Gaps = 105/549 (19%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ I +K+ I++ +G + I ALL ++ + + ++V + L A + A Sbjct: 1 MIMRKYVITQTKRFIQNQSGVYIIFGALLTLPIVALLFVSLEVAGIIQDKARLNDALEQA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-- 118 +++ + + N+ + + + ++ + K F + I +T Sbjct: 61 VLSLTAENNSGRKSYDYALTNAEKANGKYLADSEAGKRDSQIVKTFVKLYLPQIDENTMK 120 Query: 119 ------------AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEA 164 + + + ++ +N SLF ++G + + Sbjct: 121 FEPICTTQNNAITPKNGKQYAYSSSHVTCTVTGSINHRSLFPMTVGKSKIIPEQVSLSSG 180 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQ------RDSEGQPLNCFGQPADRTVKSYSSQ 218 + + + + V D S SM ++E L Q + Y Sbjct: 181 SMAQKINNVNLPLDLMVVADLSGSMDYNINNHKVYSNTEASKLTLLKQVLEELTDKYLLS 240 Query: 219 NGKVGIRDEKLSPYMVSCNK---------SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + P+ + + + G D + ++ + I+ Sbjct: 241 EEANPNNRISMIPFAMGAQHPIRNSCVLPFEWNQSHIGYNDSQKVSPNEIEYNLRNLPIR 300 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS----WGVHKLIRTIVK 325 + + L ++ + K ++ + + I K + Sbjct: 301 SRTIFTHNLVYLLDTKKTLEKIGTRFNNYDVEYQKSAICLEGSDKFQQQWYEKNQKINFI 360 Query: 326 TFAIDENEMGSTAINDAMQTAYD---------------------TIISSNEDEVHRMKNN 364 G+T + + A + + ++ N Sbjct: 361 NEVKRLKAAGATLASSGLIVAVNNMLNEPARSDVLKQQTRRTILILSDGSDSIGDDSGEN 420 Query: 365 LEAKKYIVLLT-----------------------------DG---------ENTQDNEEG 386 +K I + G N + + Sbjct: 421 NWYQKEIPFMNFSRITENLILGKQELFNKSPQSKNLENHIYGYRYNYPIYLTNNTEKIQT 480 Query: 387 IAICNKA--------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 +C+ K +I+ + N + ++ L +++ A S L Sbjct: 481 KGLCDVIRDKLNTKNKDNNTKIIFVELGYNSSSKDTW---LHCVGGTQNYYSATSKESLL 537 Query: 439 KIFRDRIGN 447 + F+ I Sbjct: 538 EAFKQAISK 546 >gi|238796988|ref|ZP_04640492.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] gi|238719248|gb|EEQ11060.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] Length = 536 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 69/538 (12%), Positives = 159/538 (29%), Gaps = 101/538 (18%) Query: 4 DTKFIFY--SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + Y IK+ G + + +P+ +G+ + ++ ++ + L A + A Sbjct: 13 RSILRKYGLLSLFIKNDKGAILLPFVIFLPIFIGLLFLSFEISQFLQKKAKLSDAIEQAT 72 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL--------------------------- 94 + +V + +++++ + YL Sbjct: 73 LALTVE--NNDIPDANQSQKNRDLVTHYATAYLPSEKFSTPIIDISNNKGHLLYKAETTM 130 Query: 95 ---IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + N+ N V + P + V + D + ++ F + Sbjct: 131 SYPAQFLANSPLANTKISIADSGAARKDVAVGPSELTDVVFV---VDYSSSMINPFHGTY 187 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G +S + + + ++ + K ++ I FS + + C A Sbjct: 188 GSRSKIDELRDIFYKLNGNILKNDNINTIGFIPFSWGIKKIVGTGQQSKTYCHFPYAPIK 247 Query: 212 VKSYSSQ---------------NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 KS + + + N+ Y + + Sbjct: 248 HKSTGDYLRQYTASNLKQFLAPENFNYVDNIEYGELSNRNNRVNYLKIKDEISHSKSNLA 307 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + H I K +++ + L S I K I+ ++ + +D + + S Sbjct: 308 NEFMIKT--HYINKYYIISNILTSNIDYDKTINLMSKKYKSIDIPIDDVLDGNICLSSST 365 Query: 317 HKLI-----RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + T ++ +G T ++ + A + + N + KK + Sbjct: 366 TYSLEFNKVSDESITESLATEPVGLTLVSSGILAANNLFKEA----------NDKNKKLM 415 Query: 372 VLLTDGENT------------------------------QDNEEGIAICNKAKSQGIRIM 401 ++L+DGE++ N +C + IR++ Sbjct: 416 IVLSDGEDSDNTTTFDKDGNLIAVDDYIKNDEDRKPFRITKNLIDKGMCEAIAANKIRMV 475 Query: 402 TIAFSVNKTQQEKARYF--LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 IA + + C ++F+ A HEL + +G E V R T Sbjct: 476 FIAIGYTPVNDAYSPTYIDWEKCVGKDNFYLAKDAHELEADLQQALGGENIRDVGRNT 533 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 70/550 (12%), Positives = 135/550 (24%), Gaps = 136/550 (24%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +L + G+ I AL + + + G VD+ + + Sbjct: 6 LFNLGARLCRETRGNVATIFALTLVPVALLSGGAVDLSQSMNARSR-----------LAQ 54 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L + V S + +++ N+ + V Sbjct: 55 ALDAAALAVGVNTNLSSSEATGIANDFIAANYPGRELGVVQNVNVYIDDETD-------- 106 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + L +G+ + ++E + + + V+D + Sbjct: 107 -------TVTVSGEARVRTTMLGMIGLDYITVHWESEVQRA------RQRLELVMVLDNT 153 Query: 187 RSMLDYQRDSEGQPLNCFGQPADR-----------------TVKSYSSQNGKVGIRDEKL 229 SM + + + TV ++ + + L Sbjct: 154 GSMGGSKIRNLRESAELLTGILFDAADDPSDVKIGLVPFAATVNVGTNHARAWWMDPDAL 213 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-------LASVI 282 SP + + +R+ + Sbjct: 214 SPVHAEWAGGNPVEIETCSGRGRGRRRRCQTEEIWVNHWDLFDQLRNTGWEGCVEARPIP 273 Query: 283 RSIKKIDNVNDTVRMGATFF------NDRVISDPSFSWGVH------------------- 317 I + ++ S+ S GV Sbjct: 274 MDIDDTPPSIGNPSTLFVPYFAPDEPDNGSYSNSYLSDGVSGGVSERLQALDKYDNGRPN 333 Query: 318 ----------------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V D G+T I + + I Sbjct: 334 REGPNRSCTTTPVTPLTSTERTVLNAIGDMGASGTTNIPNGVGWGIRLISPGAPFTEGSA 393 Query: 362 KNNLEAKKYIVLLTDGEN-----------------------------------TQDNEEG 386 ++ E K +V+LTDG+N + ++ Sbjct: 394 WDDDEYIKAMVILTDGDNVMRGRNTDQMSDYEAYGFVADGRLGRRSSSSNVLSNELDDRT 453 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 A C A+S GIR+ TI F VN + R + NCAS S +F++ S+ L F Sbjct: 454 EAACAYARSLGIRVYTITFQVNSSST---RSLMQNCASNPSLYFDSPSSEALEDAFEMIA 510 Query: 446 GNEIFERVIR 455 G+ R+ R Sbjct: 511 GDLTNLRLSR 520 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 45/382 (11%), Positives = 104/382 (27%), Gaps = 33/382 (8%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + +G +++ +M +GG+ +D+VR L+ A++ A+ Sbjct: 24 LRAFRSDESGVLAKPMIMILVLMFALGGLGMDLVRMERDRTNLQYTLDRAVLAAADLDQP 83 Query: 71 SLEEVSSRAKNSFTFPKQK----------IEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 E S + + + + + + + + Sbjct: 84 LDPEAVVIDYMSKSGLSDYTTVVVPEVSPTAKRVKASVDTEFTAGWMNSIFYEDYMRNPD 143 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 ++ SS + + + L S + + + Sbjct: 144 TYELEPITLPLLASSTAVESIGNVEISLVLDVSGSMRSNNRLVNLKRAAKEFVQTMDDNT 203 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 S S++ Y + N + +++ Sbjct: 204 EDGKMSISIVPYSTQVSMPAAFLDEMRVSDEHSYSNCINFDGSDFNTTGLNLSREYERTM 263 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVRMGA 299 ++ S ++ D +++ DA ++ DNV + A Sbjct: 264 HF-----------SVWNYYDYRDDDEHVRQPTCASDADNPERTALLMSDNVAQLQSYIDA 312 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ D WG L ++ A N+ N +++ Y Sbjct: 313 FEHSENTSIDLGMKWGTALLDPSVQPVIATLANDA---NPNQSIEARY--------ANRP 361 Query: 360 RMKNNLEAKKYIVLLTDGENTQ 381 + E K IV++TDG+NT Sbjct: 362 VSYQDTETLKVIVMMTDGQNTA 383 Score = 93.1 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 31/247 (12%), Positives = 71/247 (28%), Gaps = 43/247 (17%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA--TFFNDR 305 D + ++ ++ + ++ + + + R G ++N Sbjct: 344 ANDANPNQSIEARYANRPVSYQDTETLKVIVMMTDGQNTAQYYIKNDYREGLTPVWYNSE 403 Query: 306 VISDPSFSWGVHKLIRTIVKTFAI--DENEMGSTA------------------------- 338 ++ + + D T Sbjct: 404 ENVYSTYDPNRYGSDKYYWHQTGQWEDHPYGNGTYQETYCDGREYYGRCYDGSWKTRTVD 463 Query: 339 -INDAMQTAYDTIISSNEDEV------HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 +A+Q +Y + + +N Y L + ++ + +A+C+ Sbjct: 464 EPGEALQLSYADLFAETSLRYLYRDLFGDWMSNASWYWYNRLYSYVGDSTKDSRTLAVCD 523 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 AK +GI + TI F + + L CAS S + E++ F Sbjct: 524 AAKEKGIVVFTIGFEA----PWRGQQVLQQCASSASHYYDVDGLEISDAFASIASAI--- 576 Query: 452 RVIRITK 458 R +R+T+ Sbjct: 577 RQLRLTE 583 >gi|148975971|ref|ZP_01812760.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] gi|145964716|gb|EDK29969.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] Length = 520 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 49/509 (9%), Positives = 127/509 (24%), Gaps = 72/509 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G + LL+ ++G+ V+ R+ L+ +A+ A I ++ L Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAVTIEDQPDLAR 75 Query: 75 VSSRAKNSF-----------------------------TFPKQKIEEYLIRNFENNLKKN 105 + + + + + Sbjct: 76 GLATQYVENYVRDIKSTNLSAQRFHQTEDEGAGILEYIQYTVNAKTTHDSWFASSFIPSF 135 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 +++ + + +V S + ++ R I + Sbjct: 136 DEQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMDDRWGSSRHKKIDDLKTAIDQISS 195 Query: 166 TVSRSYHKEHGVS--IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ----- 218 + + V + V D + F V + Sbjct: 196 KILCTSTDLEYVDGEWKEVCDEPGEDTTGDKLLNRVGFVPFNVRTREIVSGGQANATSQL 255 Query: 219 ----NGKVGIRDEKLSPYMVSC------NKSLYYMLYPGPLDPSLSEEHFVDSS------ 262 N K + + N+ L + S+ Sbjct: 256 SYKPNYKPNVSPYSYNDVNWDYWRAYSQNEVLNCANWQSYCPNPKSDNQKYAKRIKDVIY 315 Query: 263 ---SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A++ + G FN S + + Sbjct: 316 LDNYHVADVYNYVDLSTSVATMFTDKSGLRPNF-YGVNGTDLFNAHGSSSSTQFKNIRLS 374 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDG 377 + G TA + + + + + KK +++L+DG Sbjct: 375 NKLSDLNPISSMWADGGTAAFQGILRGSQILKDGDPNSSDDEEQQAYNKKIKMLLILSDG 434 Query: 378 ENTQDNEEGI-----AICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNS--F 428 + + +N +C+KA+ + G+ I I + + +C + Sbjct: 435 QESPNNGILKGLVDRGMCDKAREEIPGLYIGVIGI----DFRASQQSGFQDCVIDPNEDI 490 Query: 429 FEANSTHELNKIFRDRIGNEI-FERVIRI 456 + ++ EL + + I + ++ Sbjct: 491 IDVSNLDELIEKIEELIRKGSKTSGITKL 519 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 124 bits (310), Expect = 4e-26, Method: Composition-based stats. Identities = 66/556 (11%), Positives = 128/556 (23%), Gaps = 110/556 (19%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ + L + G+F +I AL + G G+ VD S L+ A A Sbjct: 1 MLSIRRLRSACSALCRDRRGNFAVILALSALPVFGAAGLAVDYTNMSRTRSELQNALDAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + +V + + +F + + + Sbjct: 61 V--LAVAQRGDKISDAEARSIAASFLTGNLSSAYKNMAVERNGTSVKLSAEATMPLSFGG 118 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + V SS D+ + L S + + + Sbjct: 119 LIGRK--EATVGASSTADMAFAYYEIALVLDTTGSMRGGKLQAMKEAVNGLIDDLSSRVT 176 Query: 181 -----------------------WVIDFSRSMLDYQRDSEGQ--------------PLNC 203 D + ++ L+ Sbjct: 177 DKERLKFALVPFASFVNVGPQFGPEFDRNGRIVPGTGADWLDLQGISPISQLDLLPGLSR 236 Query: 204 FGQPADRTVKSYSSQNGK----VGIRDEKLSPYMVSCNKSLY-YMLYPGPLDPSLSEEHF 258 F + D +P + + SL+ D + Sbjct: 237 FEVAHHLGQDWKGCVETRMPTSSSAYDVDDAPVVATDRYSLFVPTFAIDEPDGGRLYANN 296 Query: 259 VDSSSLRHVIKK------------KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR- 305 +S+ A + ++ +D + R ++D Sbjct: 297 YIASNTSAFGNSAVAIARRLLRYGLDDAAQAALTGATNLIGLDIRPERWRKVEHEYSDGR 356 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S +K G+T I + + + + Sbjct: 357 GPAYGCLSRPITPLSNDYAALKREVSRFTADGNTNIMEGVAWGMRVLSPREPFTEGKE-P 415 Query: 364 NLEAKKYIVLLTDGENTQD-------------------------------------NEEG 386 + +K +++LTDG N N Sbjct: 416 ASDVEKIMIVLTDGANNMGLSNNRNHALGSSYSSFGYLVEDRLTRERSQRRVTEEMNRRT 475 Query: 387 IAICNKA-------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELN 438 +A C A K + I TI L CA P +F++ S +LN Sbjct: 476 LAACENAKREYTPSKEDDVTIYTIRLEEPDVATG---TLLQECATGPGYYFDSPSRTQLN 532 Query: 439 KIFRDRIGNEIFERVI 454 IF++ R+ Sbjct: 533 AIFKEIRDGITKLRLS 548 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 64/559 (11%), Positives = 140/559 (25%), Gaps = 115/559 (20%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + + G+ ++ AL +P M+ + VD+ R S ++ A A + A Sbjct: 9 SALKTFGTRFSDDRRGNVAMMFALALPPMMLMTLGGVDIARVSTVRMNVQDALDAATLAA 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEY--------LIRNFENNLKKNFTDREVRDIVR 116 + +++ + + N + VR +V Sbjct: 69 ARSQYTDNPRINAVGLAALQANLAPYGDVTLDTTQTNFRLNTATGAVEADAKVNVRALVA 128 Query: 117 DTAVEMNPRKSAYQVVLSSRYDL--LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 + + + Q+ ++ ++ N + + L S +T + Sbjct: 129 NIFLPPYGQFFDDQLPANAHSEVLRSNNRIEVALVIDNTGSMDGAKLTNTKTAAIDLINR 188 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + I+ + S + G + S + + G + +PY Sbjct: 189 LEAADGRSIEQDAIKISLVPFSMTVRVAQGGTNTPPSFMSNADTHTGGGAWNSGSNPYSA 248 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR-DALASVIRSIKKIDNVND 293 + L+ + + V+S + I+ + + Sbjct: 249 FDTAVGRFTLFGRL---NTTWGGCVESRPQPYDIRDTAPSSGTQATMFVPYFAPDEPDRA 305 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRT----------------------IVKTFAIDE 331 +T+ N + + G T Sbjct: 306 DYPNHSTWQNWQYEGNDYLDDGRPGSNANSPFANTAARTTEWFARVRSVSRYSTTPRNTL 365 Query: 332 NEMGS-------------TAINDAMQ-------------------TAYDTIISSNEDEVH 359 N T A++ + T+ + Sbjct: 366 NTGFGPNRGCDLQPIIRLTDNYTALRTAVNNMIASGNTNVPLGTMWGWHTLSPNAPFGDG 425 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNE----------------------------------- 384 R KK I+++TDG N + Sbjct: 426 RPYGTERLKKIIIIMTDGANVMSDTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRRTRMDN 485 Query: 385 -------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 +C K + I + T+A V+ A+ L CA+ + +F +S Sbjct: 486 RFDHATAATEDMCGNMKDKDIEVYTVAVQVD----STAQTLLRRCATDTDHYFPVDSAAG 541 Query: 437 LNKIFRDRIGNEIFERVIR 455 + F G R+ R Sbjct: 542 IGAAFDRIAGAIENLRISR 560 >gi|51597679|ref|YP_071870.1| membrane protein. [Yersinia pseudotuberculosis IP 32953] gi|51590961|emb|CAH22619.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 518 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 60/505 (11%), Positives = 136/505 (26%), Gaps = 89/505 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QS 71 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 76 Query: 72 LEEVSSRAKNSFTFPKQKI--------------------EEYLIRNFENNLKKNFTDREV 111 + ++ + + EY N Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLK 136 Query: 112 RDIVRDTAVEMNPRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + V N Y VV + + ++ + + L + KA Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAI 196 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS---SQNG 220 ++ G++ + FS + + C + + + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDRNGHYLQRYT 256 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE------------------EHFVDSS 262 +++ + + Y L LSE ++F+D + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDPSFSWGVH 317 + I + + +I ID +T+ + FF + + S ++ Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDP- 375 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ G T + + + S + K +++L+DG Sbjct: 376 --NSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KLMIILSDG 422 Query: 378 ENTQDN-----------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 ++ +C K K GI+++ I + Sbjct: 423 DDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEK 480 Query: 421 NCASPNSFFEANSTHELNKIFRDRI 445 +C +F+ A + HEL + Sbjct: 481 DCVGTGNFYLAKNAHELEISIERAL 505 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 123 bits (309), Expect = 5e-26, Method: Composition-based stats. Identities = 61/494 (12%), Positives = 134/494 (27%), Gaps = 90/494 (18%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ + + G +I L + M + G VD + L A A + A Sbjct: 13 SRVRKRLRNFRANERGSIAMIFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAA 72 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 Q + + + Q + I + + + D + Sbjct: 73 VKAAAQKESDCVANPAGNNVSNFQGCGQKDIIKAGVAAGVQYMNGDPLMRGADRKPTIEL 132 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + S+ + + + R MG+++ + K + ++ ++ ++D Sbjct: 133 S--SSEGSWSATVNYSADIPTNIARLMGVQTIPVNGKVTSNIALGTH---MYLNFHLLLD 187 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SM + L A Sbjct: 188 RSMSMGIGATSDDISRLQALTGCAFACHSEGY---------------------------- 219 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK--KIDNVNDTVRMGATFF 302 E + D + + + +RDA +++ K N + ++MG F Sbjct: 220 ----------EAQYYDQPKAQGIRFRIDDLRDATGALVAQAKMVASANAREHIQMGVYAF 269 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N V + + + + T DA+ ++++ Sbjct: 270 NHHVSPLVEMTSDLTNVANAVKNLDL--PTHDDGTQAADAVTW----LVANKIKGNGTGL 323 Query: 363 NNLEAKKYIVLLTDGENTQD---------------------------NEEGIAICNKAKS 395 + + + L+TDG + + C+ KS Sbjct: 324 TSAAPLEIVFLVTDGVEDGIYTGWNKMVGPTGLPLPWWPSWMTKAPTSAFPVTACDALKS 383 Query: 396 QG---IRIMTIAFSVNKTQQE---------KARYFLSNCASPNSFFEANSTHELNKIFRD 443 +G + T T Q L CAS FF A+ ++ + + Sbjct: 384 KGAIVAVVYTTYVPFPGTVQYDRLIGPFAPNISPNLQGCASQGYFFTASEPGDITRGMQS 443 Query: 444 RIGNEIFERVIRIT 457 + E +++T Sbjct: 444 LFNRALQELALKLT 457 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats. Identities = 36/382 (9%), Positives = 96/382 (25%), Gaps = 35/382 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + +G +++ +M +GG+ +D+VR L+ A++ A+ Sbjct: 25 RQFRRDESGVLAKPMIMIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAAADLDQPL 84 Query: 72 LEEVSSRAKNSFTFPKQK----------IEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 E S + + + + + N ++ + D Sbjct: 85 DPEAVVLDYMSKSGLGDYTTVVVPEVSPTAKRVKASVDTNFTASWMNNVFYDDYIRNPDT 144 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 ++ SS + + + L S + + + Sbjct: 145 YQLEPITLPLLASSTAVESIGNVEISLVLDVSGSMRSNDRLVNLKRAAKEFVQTMDDNTE 204 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 S S++ Y N + + ++++ Sbjct: 205 DGKMSISIVPYSTQVSMPEAFLDELNVSSEHDYSHCINFSGSDFNNAGISTTQAYERTMH 264 Query: 242 YMLY--PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + ++ + ++ + + L+ D + + I +T Sbjct: 265 FTVWNSGDYRSRTRLVRQPTCAAHSDNPERTALLLSDNVTQLQNYIDAFVPSENTSIDLG 324 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + ++ +S Sbjct: 325 MKWGSALLDPSVQPVIASLADDANPNQSI-----------------------ASRFANRP 361 Query: 360 RMKNNLEAKKYIVLLTDGENTQ 381 + E K IV++TDG+NT Sbjct: 362 VPYTDTETLKVIVMMTDGQNTS 383 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 38/403 (9%), Positives = 95/403 (23%), Gaps = 18/403 (4%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + P + + D + Sbjct: 191 AAKEFVQTMDDNTEDGKMSISIVPYSTQVSMPEAFLDELNVSSEHDYSHCINFSGSDFNN 250 Query: 123 NPRKSAYQVVLSSRYDLLLNPL-SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + + + + R + + + T + Sbjct: 251 AGISTTQAYERTMHFTVWNSGDYRSRTRLVRQPTCAAHSDNPERTALLLSDNVTQLQNYI 310 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 F S + S + + + V + Sbjct: 311 DA-FVPSENTSIDLGMKWGSALLDPSVQPVIASLADDANPNQSIASRFANRPVPYTDTET 369 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + D + ++++ +S + +A V + Sbjct: 370 LKVIVMMTDGQNTSQYYLRNSYREG---DSPVWYNAQERVYSTYDPNRGSKPYYWDNLQR 426 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--- 358 + D + ++ + A+Q +Y + + Sbjct: 427 WADHPYGNGTYEETYCTGTLYYGNCYYGSWQTRTVDEPGTAVQLSYPDLFADTSLRYLRD 486 Query: 359 ---HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +N + L + +T + + IC AK++G+ + TI F + Sbjct: 487 RLFGDWMSNANYYWFSGLFSSVGSTTKDARTLDICEAAKAKGVVVFTIGFEA----PSRG 542 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L CAS S + E++ F R +R+T+ Sbjct: 543 QEVLQACASSASHYYDVDGLEISDAFASIASAI---RQLRLTE 582 >gi|238762416|ref|ZP_04623387.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] gi|238699401|gb|EEP92147.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] Length = 459 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 55/484 (11%), Positives = 144/484 (29%), Gaps = 55/484 (11%) Query: 1 MVFDTKFIFYSK--KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 M +++ + K+ G + ++ P + + ++++V + + L A + Sbjct: 1 MTNTDIIRKFNRLTQFKKNEHGAILVSFIIIFPFFIALTFIILEVSIFLQKKAKLSDAIE 60 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + +V +++ + YL ++ N D Sbjct: 61 QATLALTVE--NDGIPNAAQQTKNRELVLSYANAYLPSEGFSDPIINIDD---------- 108 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 N Y ++ Y + S S+ +A Sbjct: 109 ----NTNYLGYNAAVTMTYPVEFLGRSPLTNSISNIQTTDNGEAIKNKT---IEVSEPTD 161 Query: 179 IQWVIDFSRSM-LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + +V D+S SM L + D + +++ + P+ Sbjct: 162 VVFVADYSGSMLLSFSDDVSIKNGERINALRSAFRILHNTIKNNSNVNTIGFIPFGSGTK 221 Query: 238 KSLYYMLYPGPL---------DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + + ++ + ++ ++ D + + Sbjct: 222 RKVSENGENKEYCHLPFSPKIYKPNGDYLSENAEATKNAWTFLDVIGDHIDYKKTIMSIT 281 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 +NV + + + S+ + + I+ +G T I+ + +A + Sbjct: 282 ENVQPIDIPMRDIKHKEICLSGTNSYSLEREQFDYSIENIIEMAPLGGTLISSGILSANN 341 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-------------IAICNKAKS 395 + ++ KK +++L+DG ++ ++ +C Sbjct: 342 IFKETADNG---------HKKLMIILSDGMDSYNSTMLPNKGFFISKTLIDEGMCEMIIK 392 Query: 396 QGIRIMTIAFSVNK--TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 GI++ IA + + C ++++EA++ HEL + + V Sbjct: 393 NGIQMAFIAIAYSPENNVNAPEYINWKQCVGEDNYYEAHNAHELELELQQAVSVSATSEV 452 Query: 454 IRIT 457 R T Sbjct: 453 GRNT 456 >gi|22127367|ref|NP_670790.1| hypothetical protein y3493 [Yersinia pestis KIM 10] gi|45442761|ref|NP_994300.1| hypothetical protein YP_2999 [Yersinia pestis biovar Microtus str. 91001] gi|108809099|ref|YP_653015.1| hypothetical protein YPA_3108 [Yersinia pestis Antiqua] gi|108810706|ref|YP_646473.1| hypothetical protein YPN_0541 [Yersinia pestis Nepal516] gi|150260286|ref|ZP_01917014.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162419964|ref|YP_001604884.1| hypothetical protein YpAngola_A0266 [Yersinia pestis Angola] gi|165939877|ref|ZP_02228416.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009017|ref|ZP_02229915.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211928|ref|ZP_02237963.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167466384|ref|ZP_02331088.1| hypothetical protein YpesF_00480 [Yersinia pestis FV-1] gi|218927875|ref|YP_002345750.1| hypothetical protein YPO0684 [Yersinia pestis CO92] gi|229837366|ref|ZP_04457529.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|229840578|ref|ZP_04460737.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842872|ref|ZP_04463024.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229900904|ref|ZP_04516028.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|294502750|ref|YP_003566812.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|21960452|gb|AAM87041.1|AE013952_8 hypothetical [Yersinia pestis KIM 10] gi|45437627|gb|AAS63177.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|108774354|gb|ABG16873.1| membrane protein [Yersinia pestis Nepal516] gi|108781012|gb|ABG15070.1| putative membrane protein [Yersinia pestis Antiqua] gi|115346486|emb|CAL19360.1| putative membrane protein [Yersinia pestis CO92] gi|149289694|gb|EDM39771.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162352779|gb|ABX86727.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912188|gb|EDR30826.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|165992356|gb|EDR44657.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206674|gb|EDR51154.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|229682243|gb|EEO78335.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|229690139|gb|EEO82196.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229696944|gb|EEO86991.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705489|gb|EEO91499.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|262364727|gb|ACY61284.1| hypothetical protein YPD8_0594 [Yersinia pestis D182038] gi|294353209|gb|ADE63550.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|320016753|gb|ADW00325.1| putative fimbrial anchor [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 518 Score = 123 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 60/505 (11%), Positives = 136/505 (26%), Gaps = 89/505 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QS 71 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 76 Query: 72 LEEVSSRAKNSFTFPKQKI--------------------EEYLIRNFENNLKKNFTDREV 111 + ++ + + EY N Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLK 136 Query: 112 RDIVRDTAVEMNPRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + V N Y VV + + ++ + + L + KA Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAI 196 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS---SQNG 220 ++ G++ + FS + + C + + + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 256 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE------------------EHFVDSS 262 +++ + + Y L LSE ++F+D + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDPSFSWGVH 317 + I + + +I ID +T+ + FF + + S ++ Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDP- 375 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ G T + + + S + K +++L+DG Sbjct: 376 --NSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KLMIILSDG 422 Query: 378 ENTQDN-----------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 ++ +C K K GI+++ I + Sbjct: 423 DDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEK 480 Query: 421 NCASPNSFFEANSTHELNKIFRDRI 445 +C +F+ A + HEL + Sbjct: 481 DCVGTGNFYLAKNAHELEISIERAL 505 >gi|145597778|ref|YP_001161854.1| hypothetical protein YPDSF_0468 [Yersinia pestis Pestoides F] gi|145209474|gb|ABP38881.1| membrane protein [Yersinia pestis Pestoides F] Length = 513 Score = 123 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 60/505 (11%), Positives = 136/505 (26%), Gaps = 89/505 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QS 71 IK+ G + L+PV +G+ + + + L A + A + S Sbjct: 12 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 71 Query: 72 LEEVSSRAKNSFTFPKQKI--------------------EEYLIRNFENNLKKNFTDREV 111 + ++ + + EY N Sbjct: 72 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLK 131 Query: 112 RDIVRDTAVEMNPRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + V N Y VV + + ++ + + L + KA Sbjct: 132 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAI 191 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS---SQNG 220 ++ G++ + FS + + C + + + + Sbjct: 192 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 251 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE------------------EHFVDSS 262 +++ + + Y L LSE ++F+D + Sbjct: 252 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 311 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDPSFSWGVH 317 + I + + +I ID +T+ + FF + + S ++ Sbjct: 312 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDP- 370 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ G T + + + S + K +++L+DG Sbjct: 371 --NSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KLMIILSDG 417 Query: 378 ENTQDN-----------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 ++ +C K K GI+++ I + Sbjct: 418 DDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEK 475 Query: 421 NCASPNSFFEANSTHELNKIFRDRI 445 +C +F+ A + HEL + Sbjct: 476 DCVGTGNFYLAKNAHELEISIERAL 500 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 123 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 74/547 (13%), Positives = 144/547 (26%), Gaps = 113/547 (20%) Query: 21 HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT-AIITASVPLIQSLEEVSSRA 79 + ++ A L M+ +GG +D + E L+ A A++ PL S E+++ A Sbjct: 26 NVALLFAFLSVPMVMIGGAAIDYGFATRLETKLQTATDATALLLCQTPLTTSEAELNTLA 85 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + T + R + + T + ++ +S Sbjct: 86 QTTMTGAMGAANLVVDRLAITSSPRKITLTAHKQSTTFFGGLTGTQRIN-PGAVSQCATP 144 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEA------------------ETVSRSYHKEHGVSIQW 181 L + L S + ++ S K V Sbjct: 145 LPKTFEIALVLDNTGSMAASSGGQSKLRAVQTAATDFVNYVYTSPAFSSATKVAIVPFAA 204 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQP---------------ADRTVKSYSSQNGKVGIRD 226 + S Y + L+ + +++ + G G + Sbjct: 205 AVAVDPSAYRYASWIDQNGLSSYHWTNIVQSNPAPFKNRFEVFAKLQAANRNWGWAGCLE 264 Query: 227 EKLSPYMVSCNKSLYYMLYP----------------GPLDPSLSEEHFVDSSSLRHVIKK 270 P+ V + +S Sbjct: 265 SLPYPFNVQDGAPTTKDSFYVPLLAPDEPGDGTSGGASFPVGNQRSTAYSYNSYIDDETG 324 Query: 271 KHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ ++ K ++ + N + +KT Sbjct: 325 PDCRNSNTYNTALGQACKYVNPKDPASGSPLGIPNGPNFGCTTQPLQRLTNNTNALKTLI 384 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK-------NNLEAKKYIVLLTDGENTQ 381 + GST I++ + T+ ++ + N K I+L+TDG N+ Sbjct: 385 NNMAPSGSTNIHEGFMWGWRTLSPNSVFADGQPYASSANSSNATNINKIIILMTDGTNSW 444 Query: 382 D------------------------------------------------NEEGIAICNKA 393 + C Sbjct: 445 GTNSSAPTGSLYFAAGYFRNANGTTPNPRLTTAYQNTNIADGNTARKALDALTAEACANT 504 Query: 394 KSQGIRIMTIAFSVNKTQ-QEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFE 451 K+ I I TI FSV + L NCA SP+ F+ ANS+ +L K F+ + Sbjct: 505 KAVNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQFYLANSSDDLIKAFKSI---QASI 561 Query: 452 RVIRITK 458 +R+T+ Sbjct: 562 GALRLTQ 568 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 123 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 56/448 (12%), Positives = 125/448 (27%), Gaps = 35/448 (7%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + S G ++ A ++P++ + +++ + + L A A Sbjct: 71 VRFALTQRDDSRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGR 130 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + +++A T ++ + ++ F +T + Sbjct: 131 AFSEEQTVEAAKAAARLTAAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKV 190 Query: 127 SAYQVVLSSRYDLLL-----NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 V + + L + + + V+ + + I Sbjct: 191 PTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISL 250 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 V+D S SM + + +G+ + + + + + R+ + P + Sbjct: 251 VLDRSGSMDW-KTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQ--PQYIRRVSYNR 307 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 D + E + + + L DA V+ + + V+ Sbjct: 308 GYDEYDLYDHAWEEVFGLGPAPN-TPWEDLVLAVDAFLRVLDQTPQNEQVS------IAS 360 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +N D + GST I + M + N Sbjct: 361 YNSHGTLDCWLLDDFDSVR-----AAVAQLGPNGSTGIGNGMNSGKTAFTHEN------- 408 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 A K +V++TDG + + S + I T+ F + + Sbjct: 409 -ARPYASKTMVVMTDGNHNYGTQPNTVAQQLMSSSNLNIQTVTFGGGAD-----QETMQE 462 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGN 447 A + A+S EL F + N Sbjct: 463 VAVTGLGRHYHADSGDELVSAFEEIANN 490 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 123 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 45/384 (11%), Positives = 108/384 (28%), Gaps = 57/384 (14%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G + + +M+ + G+ +D++R+ L+ + A++ A+ + Sbjct: 24 FCRDEAGVLTGFSLYIFILMMMIAGLTIDLMRYEAVRTRLQATSDRAVLAAADLDQTTNA 83 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + Q + V + + Sbjct: 84 KAVVEDYFAKAGMSQYL-------------------------DGVQVSKGLNFKEVEAQV 118 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S+ + F+ GI++ +++AE ++ + + V+D S SM Sbjct: 119 SATIP------TWFMNMSGIETLDAFARSKAEERIQN------IEVSLVLDISGSMGWDG 166 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + + + +G G+ + PY N S Sbjct: 167 KLANMRTAADQFVRTMMAGNDNVAADG-TGLTSVSIIPYHAVVN---VPDELLDEYAVST 222 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + + R + + ++ T + Sbjct: 223 QQTVSNCVRFTATDFQSISIDRTKTLDRLAHFDRNNSNLHTFNGDRLIGRPWCQVGTYGA 282 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED----------------E 357 T + + G+TA + M+ A + ++ Sbjct: 283 ILPWSTSVTDLTNKVAELGASGNTATDIGMKWAAALLDPGTQNIVDDMIDGGHLEADLAG 342 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQ 381 + ++ E K +VL+TDGENT Sbjct: 343 RPVLYSDPETIKVVVLMTDGENTS 366 >gi|54309668|ref|YP_130688.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] gi|46914106|emb|CAG20886.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] Length = 494 Score = 123 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 51/492 (10%), Positives = 119/492 (24%), Gaps = 79/492 (16%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G I TAL + + G+ ++ R+ ++ L A + A + + Sbjct: 5 RQQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAVTTANQDD---- 60 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN---PRKSAYQVV 132 T+ Q Y+ N N E + + Y+V Sbjct: 61 -------KTYENQLATNYVQTYIRNIAIINDIKVERSEGIDYYPTPDGNEEREYFQYRVT 113 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + L+ Y + + I +V DFS SM Sbjct: 114 AKTDHTSWLSSDI----IPSFSPTETVANRALARNYPIYLGDKDIDIVFVSDFSGSMKGN 169 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD-------------------------- 226 + + + ++ Sbjct: 170 KIRALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCLTQLDYRPNFN 229 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + S + + + + R ++ + + Sbjct: 230 GGNYSSYEDIDWSTWSTWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNTSKSETGSGWY 289 Query: 287 KIDNVN--DTVRMGATFFNDRVISDPSFSWG----------------VHKLIRTIVKTFA 328 D + + A F + + S +T + Sbjct: 290 FPDPYSYINFPDSVAKTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLTSEKTTLSPIQ 349 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGE-----NTQ 381 + + G T++ + + + + I++L+DG+ +T Sbjct: 350 NNMSPDGGTSVYQGLIRGAQILEQGRPTSPSPETSAAYNSRIKMILMLSDGQEMPYVSTF 409 Query: 382 DNEEGIAICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + +CN K Q + + + + + + NC N+ + Sbjct: 410 NQLVNQGLCNTIKAQFNDSDQQLYMGVLGIEFD----AQGQQGFKNCVGQNNITNVDDVD 465 Query: 436 ELNKIFRDRIGN 447 +L K + I Sbjct: 466 DLIKEILEMIKE 477 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 123 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 50/471 (10%), Positives = 129/471 (27%), Gaps = 49/471 (10%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G + AL++ ML ++ V+ S + L A A++ ++ + + Sbjct: 13 QHGVISVTAALMLLGMLTFFSFVLLVIVLSTTDSRLSMLAD-AVLYSTTNSYNAKADAQQ 71 Query: 78 RAKNSF---TFPKQKIEEYLIRNFENNLKKNFTDRE-----------VRDIVRDTAVEMN 123 ++ + ++ N + + D++ + Sbjct: 72 MSEANTPQPNLGLSSLQVDTGNNENAAQVQVSGRVDRGSLALTDTLGTSDVLVTHQAQSK 131 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 ++ ++V+ + + G++ + AE + I Sbjct: 132 IHQTTLEIVVMLDVSNSMKGEPMTQSIKGLRDFADILYAEERRDFSKVVSIVPATGLVNI 191 Query: 184 DFSR---SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 S + + + L D + K ++ +L Sbjct: 192 GHRPEFFSASAFAIPRDWRSLAKERGWKDLLHPEVPGRWRKAMCTALPEEQDELTSVSAL 251 Query: 241 YYMLYPGPLDPSLSEE-----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + ++ ++ H L + +K Sbjct: 252 TPNWIRRLELSPPDQNLRLHMEWMSKPAIEHYENDMPLFTYYYSG--NPKEKYSPNKHEQ 309 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R + V I+ +T + + A+ + Sbjct: 310 RGLFDSPDCGVSQIQPLLSTRRAFIKA-----LDTLYPEFNTNNAEGVMWAWRLLSPHWR 364 Query: 356 DEV-------HRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTIA 404 R + +K ++L TDG + D + + +A+C + K QGI I++I Sbjct: 365 GYWDKGKSELPRDYQHPNNRKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQGIEIISID 424 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 F+ + +CAS ++ A++ + + + + + Sbjct: 425 FN-------NRSQVMKSCASAGQYYIADNRT-IRSVLKQVATTLSKIELTQ 467 >gi|86361153|ref|YP_473040.1| hypothetical protein RHE_PF00423 [Rhizobium etli CFN 42] gi|86285255|gb|ABC94313.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 545 Score = 122 bits (306), Expect = 1e-25, Method: Composition-based stats. Identities = 62/515 (12%), Positives = 144/515 (27%), Gaps = 133/515 (25%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + G+ I+ AL + ML G D +R ++ A+I A Sbjct: 102 RITGALRSLERDRGGNVGIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAV 161 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + E+ + + + +++ + N Sbjct: 162 KQI-NNTEDTDALKEKVSDWFHAQVDNSYTLGDIDIDTVNH------------------- 201 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + F++ I++ + + + + SY +++ VID Sbjct: 202 --------NITATANGTVPTTFMKIANIETVPVSVASAVKGPATSY-----LNVYVVIDT 248 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + + GK Sbjct: 249 SPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGKTK---------------------- 286 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + S+ + + + + DA+ V+ I + D ++ +++G D Sbjct: 287 ---------YANNYAYSTAKKIKLRADVAGDAVREVLDMIDESDENHERIKVGLYSLGDT 337 Query: 306 VISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + +T + ++ T + + + + Sbjct: 338 LSEVLAPTLSTDTARTRLADASYGLTSATSKAATYFDVSLATLKQKVGAGGDGT-----S 392 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI-----------------------CNKAKSQGIRI 400 + K ++LLTDG ++ + C K+Q + Sbjct: 393 SGSPLKLVLLLTDGVQSKREWVTDGVVWSSGKAISGSYWNKVAPLNPDWCGYLKNQSNTM 452 Query: 401 MTI------------------------------------AFSVNKTQQEKARYFLSNCAS 424 + S + T+++ Y L++CAS Sbjct: 453 AVLYTEYLPITSDWGYNATVGSTMASAKWKSTWGGTMDSGVSTSTTRRDYIPYALADCAS 512 Query: 425 PNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 S F A+S+ E+ + +R+T+ Sbjct: 513 SKSLFISASSSTEITAGLSTLFTQYLSS--VRLTQ 545 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 57/397 (14%), Positives = 109/397 (27%), Gaps = 72/397 (18%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ ++ G +L +M+G G+ +DV+R+ L+ A++ A+ Sbjct: 38 RLAPRQFMRREEGTITAFATMLFILMVGASGIAIDVMRYETQRSQLQYTLDRAVLAAASL 97 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 E R + + + N + + Sbjct: 98 TQPYDPEGVVRDYFAIAGIDGYRLDVRVEEGLNFRRVHAY-------------------- 137 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 L S+F++ G+++ AE R + + V+D S Sbjct: 138 -----------AELEVRSIFMQMFGVRAMTSPAIGAAEERVRR------IEVSMVLDISG 180 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM + R + +P V S + + + PY N Sbjct: 181 SMGENNRMTNMRPAAREFV---TEVLSANENVNNELLVSVSIVPYNGRVNGGDLIES-VF 236 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 D SE + + ++ I D N+ + Sbjct: 237 TYDDLHSESNCTRFAEADFTSTAIDPAVP-----LQRIAHWDRGNEEEDESFQW--AHCQ 289 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------------- 354 +D + + + N G TAI+ M A + + Sbjct: 290 TDQYGAILPWQHTEAALHAHIDSLNTGGWTAIDLGMNWAVGLLDPAAAPALTGLIASGHV 349 Query: 355 -----------EDEVHRMKNNLEAKKYIVLLTDGENT 380 D + E K +VL+TDG+NT Sbjct: 350 HPEFSDRPAPYRDGDRATTIDDETIKVVVLMTDGDNT 386 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 37/274 (13%), Positives = 80/274 (29%), Gaps = 34/274 (12%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM--VSCNKSLY 241 D + ++ D + D + + G I + ++ + Sbjct: 371 DETIKVVVLMTDGDNTRQYDLRDIYDSAGRYVGFREGYAPIFRNTSTGQYSIWWEDQGAF 430 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 ++ DP S + D R+ + D + + N V + A Sbjct: 431 WIPTGNTRDPGGSWQAQPDGGWSRYGMTALEFTEDRANAF----DPAEAGNGEVLLWADL 486 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+D + W + F E S + D + Sbjct: 487 FSDHTAGYIAAEWFHAPADESEQWDFYNQLAENPSHNY---IGWDSDGVRPDGVVGQ--- 540 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 +Q + +AIC+ A + GI + I F ++ + + + Sbjct: 541 ------------------SQADTNLMAICDVANAAGIIVYAIGFEA----PDRGQRVMEH 578 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 CAS ++ + E+++ F + R+I+ Sbjct: 579 CASVDANYFDVEGREISEAFASIARSINQLRLIQ 612 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 68/528 (12%), Positives = 140/528 (26%), Gaps = 106/528 (20%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+ ++ G + +I +L ++G+ + VD + Q + A + L+ Sbjct: 10 LKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLA----LMA 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR-KSAY 129 + + + +Q + + + F + R + P Sbjct: 66 ENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPSTYFKD 125 Query: 130 QVVLSSRYDLLLNPLSL------------FLRSMGIKSWLIQTKAEAETVSRSYHKE--- 174 + +S +YD N L SWL K + + + Sbjct: 126 NLKISDKYDYQCNNLQGPNGEQLKSIACEISGKFERPSWLYLGKNNGLSFAETTTINANK 185 Query: 175 ----------HGVSIQWVIDFSRSMLDYQRDSEGQPL----------------------- 201 + + V D S SM ++ Sbjct: 186 IYIQKNLDEIIPIDLMLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQNNTE 245 Query: 202 -----NCFGQPADRTVKSYSSQNGKVGIRDEKLS--------PYMVSCNKSLYYMLYPGP 248 + + + + + Q M Y Sbjct: 246 EGKVISQYNRIGFTSFAFGAQQQNNTAQCYLPYEIKPSITIRNNNYYGGYYNTTMQYSEL 305 Query: 249 LDPSLSEEHFVDSSSLRHVI-------KKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 L S + ++L K + + S+ + + T Sbjct: 306 LSYVGSNQQRYSYATLAQYFDAFVDYDKTIESINSFDGKDLSSLMYFSKNSWCLGSANTR 365 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED-EVHR 360 N I K + + +G+T + + + ++++N D Sbjct: 366 INSTYIWAG-------KNESADLVSRFNRVPALGATLSSSGLLIGANLLMNTNPDENAQP 418 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEE----------GIAICNKAK------------SQGI 398 K ++ I++L+DGE+ +N +C K K Q Sbjct: 419 SKLGANTQRIILVLSDGEDQINNASSSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPT 478 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 RI +AF + + C ++ AN+ EL + FR IG Sbjct: 479 RIGFVAFGYGPSG--TQKAAWEKCVG-KYYYVANNKEELLESFRKIIG 523 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 63/552 (11%), Positives = 127/552 (23%), Gaps = 113/552 (20%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 K +S +G+F ++ L +P +L VDV + L+ A A + +S Sbjct: 2 LLNKFWRSKSGNFALLMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLASSHL-- 59 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK----NFTDREVRDIVRDTAVEMNPR 125 ++S F + N + L NF + Sbjct: 60 -GDLDISRTDAFDRYFQANIAGHGELANAQATLTVDRGVNFIKTKAVASADVNLNFGFLF 118 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL---------------------------- 157 + + + N L + L S Sbjct: 119 GHNRHIAVDASAVESDNQLEVVLVLDNTGSMAGARMTALRTATKSLLDTLEATKSPTRQI 178 Query: 158 ---IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 + A V+ + + + + F Q D Sbjct: 179 RASLVPFVTAVNVNGDEFDPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKDTGWTE 238 Query: 215 YSSQNGKV-GIRDEKLSPYMVSCNKSLYYM-----------------LYPGPLDPSLSEE 256 G + + Y +S P + ++ Sbjct: 239 AGWNGTGWKGCVEARPGAYNISDTPPDPDKPDTLFVPYFAPDDPEDAQKPSSSYGNAAKY 298 Query: 257 HFVDSSSLRHVIKKKHLVRD-----ALASVIRSIKKIDNVND--------------TVRM 297 + K ++ L+S+ + D T Sbjct: 299 YNNSYLDDVSDKTKTAKLKGNRLGIDLSSLADPVPPADKDAKEKVAKYVAPTKALITETG 358 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + E T +++ + + + Sbjct: 359 SPITVGPNRACPTPVVSLTDDFDKLRKAASEMTEWNGSGTNVSEGLSWGMRVLSPAAPYT 418 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQD----------------------------------N 383 K ++LLTDGEN + Sbjct: 419 DGAPWKTPGISKIVLLLTDGENVVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVD 478 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFR 442 ++C + K+QG++I T+ + R S CAS S ++ N +L +F+ Sbjct: 479 GWTKSVCTQLKNQGVQIYTMVLQSDT---AANRALYSACASDPSGYYAVNDPAKLPDVFQ 535 Query: 443 DRIGNEIFERVI 454 ++ Sbjct: 536 HIANKFSRLQLT 547 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 70/470 (14%), Positives = 149/470 (31%), Gaps = 111/470 (23%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 ALL+PV+LGV G+ VD+ L+ A + S+ L Sbjct: 1 MALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLELP---------------- 44 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 Y + + + D ++ E++ V ++++ ++ + Sbjct: 45 -------YDPDLSKGIVTQAVNDMVETNMEEAVVTEISAGTEIRSVKVTAQAEVRM---- 93 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 L + +G+ ++ A A + + + +VID S SM D Q Sbjct: 94 LLMEVLGMADKTVEASAMA--------GFNKLEVVFVIDNSGSMKGTPIDLVKQASEELT 145 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + ++ G V R + V + + G L+ ++EE + ++L Sbjct: 146 DLLIPDGTTPDTKVGLVPFRGKIRLGEAVDGY-AEGCVNADGSLNTGINEEFMDEYNALP 204 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + K+ + + + S +I I Sbjct: 205 YYYKRYITLDT--------------------------CSDIPTVLPLSKNKSTIIAAIGS 238 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--- 382 A T I++ ++ + + + + + +K +++LTDG+ Sbjct: 239 QTATG--AASGTVISEGIKWGRNILTPDAPFT--QAGSKEDFRKIMIVLTDGDTEDGECG 294 Query: 383 ---------------------------------NEEGIAICNKAKSQGIRIMTIAFSVNK 409 N + ++ AK GI I +I F + Sbjct: 295 GTYRATYRPNNYWTNAYYGMGVDTAHCNDGGVLNADMLSEAQLAKDAGIEIFSIRFGSSD 354 Query: 410 TQQEKARYFLSNCAS-----PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + AS + +F+A S +++ IF+ IG ++ R++ Sbjct: 355 ---TTDINLMKEIASSKAGTDDHYFDAPSVYDIPDIFKQ-IGKQLGWRLL 400 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 45/411 (10%), Positives = 114/411 (27%), Gaps = 69/411 (16%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M T+ Y+++ K G ++ + + L +GG+ +D++R +L+ A Sbjct: 1 MRETTRAAEYAERFAKDEEGTVTVLAFAIFVMFLVMGGIGIDMMRQEMARASLQATLDRA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ + + + + + + + + L + + T + Sbjct: 61 VLAGATAVNNATARAVIEDYFAKSGQSDYLAAQEAGDIDIRLNSSKVTARATQTLD-TYL 119 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +S ++ + L + + S I + +A + + + Sbjct: 120 MRLAGVDTLTSAGNSTAEVTIPKLEIAMALDVSGSM-IGARIDALKPAAIEFVDSILDST 178 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S++ + + T K S + + N+ + Sbjct: 179 EPNDAVISVVPFSWGVTPSKEIYEALTVNETHKYSSCLELNDSHFTDTTIDPNTAYNQLI 238 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y + R + L L + + Sbjct: 239 Y---------------------TSREGVTFGDLTTTPLGDFLDT---------------- 261 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE----- 355 +N + F+ + +T + GST+ ++ ++ A + + + Sbjct: 262 -YNQTCYTQDYFNILPYATTKTALHDKINGLQAGGSTSNDEGVKWAAALLDPAFQPVVTS 320 Query: 356 ------------------------DEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ + N E K IVL+ DG N Sbjct: 321 LQQPIQVPQDDGTILTYSLVEPALSDMPAVFNESETLKVIVLMGDGANDNS 371 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 48/142 (33%), Gaps = 11/142 (7%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + + + ++ + +TA+ + L Sbjct: 453 SVTWSTFNNYNSNTLPGFISSERLSWETAWGLMSPRFYGNTTGNWGPWNN----FLNNPI 508 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 ++ +E IC +AKS+GI I TIAF + Q + CAS N + A + Sbjct: 509 NRSKKDERLDDICREAKSEGIVIYTIAFEMG--SQPTGADKIKKCASSVNHHYNATT--- 563 Query: 437 LNKIFRDRIGNEIFERVIRITK 458 I + +R+T+ Sbjct: 564 -VNIASAFSAIAANVKQLRLTQ 584 >gi|170746808|ref|YP_001753068.1| hypothetical protein Mrad2831_0362 [Methylobacterium radiotolerans JCM 2831] gi|170653330|gb|ACB22385.1| conserved hypothetical protein; putative vWFA domain protein [Methylobacterium radiotolerans JCM 2831] Length = 437 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 54/473 (11%), Positives = 123/473 (26%), Gaps = 94/473 (19%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ +G + AL+ ++ VD R + + L A A++ ++ Sbjct: 19 FFRARSGQVAVTFALVTLPVMFATAAAVDYGRRNAAKTQLDAALDGAVLAVMSQKTNTIP 78 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 L + V +N K L Sbjct: 79 TT-----------------TLQNMETQFRTEAAKVPGVTVTSFTPGAPVNTSK-----TL 116 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S + M I + + + A + Y ++ ++D S SM Sbjct: 117 SLTASYTATVKTSLASMMQIPAMPVSGTSSATRNTSQY-----INYYLLLDNSPSMGLAA 171 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 D++ Q + ++ D++ Y ++ Sbjct: 172 TDADVQNMKIATNGCAFACHQHTFDKKGNITGDDQNDNYHIALRN--------------- 216 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---- 309 ++ + ++R+A+++++ + +M FND V Sbjct: 217 ------------NIKLRIQVLREAVSALVDQANVSMLLPQQFQMEMWTFNDSVTQTKLQA 264 Query: 310 -PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + I +A T A+ TI +S + + Sbjct: 265 MTPTLNNIKNAAPNIDIAYAYYNQSDNQTDFERAIARMNTTIPASGDGLTP-----DKPI 319 Query: 369 KYIVLLTDGENTQDNEEGIA------------------ICNKAKSQGIR---IMTIAFSV 407 +++ L+TDG C+ K++ ++ I T + Sbjct: 320 RFLFLVTDGVEDTGGSVTNQSAGFQIQSNRFIGPLSPSTCSALKNKNVKIGIIYTQYLPI 379 Query: 408 NKT---------QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + L CAS +F + ++ + Sbjct: 380 YDNDFYNRYVRPYESQIGPSLQACASDGMYFPVTTNGDITAAMLKLFSTTLAS 432 >gi|27366553|ref|NP_762080.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] gi|27358119|gb|AAO07070.1| Protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] Length = 426 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 45/456 (9%), Positives = 130/456 (28%), Gaps = 56/456 (12%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK G II ++P+++ + + + + + +AA+ A + + Sbjct: 3 IKKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEG--- 59 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + ++ ++ Y+ N + + F R + + V Sbjct: 60 ----DEKNQEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVSAK 115 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 +++D ++ + + + SR + + + I ++ID S SM++ Sbjct: 116 TKHDSWISYNDGEMGL------TKDFEVMGTSTSRKFLPQ-PLDIYFIIDMSGSMVNPWG 168 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 S + +R V R + + + + Sbjct: 169 GSGKTKYDVVADTINRIVDDLREFKTDRKSRVAVIGFHHTAVKQVGRQRTAFDYSSYRTP 228 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + + I++ + I + + T FN Sbjct: 229 SATVNNMFTAPKIHSRNDSSN------IKTFEDI-PLTEDYDAFLTKFNSSNYYAS---- 277 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + L + + +T +N + + + + + R + + Sbjct: 278 -RYGLTESWQGIIGAAQMAEQATDLNP--EQVFILLSDGRDGDFVRYYLEGRQWREVRYN 334 Query: 375 TDGENTQDNEEGIAICNKAKSQ--------------------GIRIMTIAFSVNKTQQEK 414 + +C K K++ + + I + + + Sbjct: 335 KY----LNRLVKAGLCEKLKTRISQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDG 390 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 +C ++ + A +++ K + I E Sbjct: 391 I----GDCFGHDNIYHAKEGNDVYKYILNLINEETG 422 >gi|238750905|ref|ZP_04612402.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] gi|238710819|gb|EEQ03040.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] Length = 520 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 55/537 (10%), Positives = 130/537 (24%), Gaps = 129/537 (24%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 IK+ G + +P+ +G+ + ++ + + L A + A + +V + Sbjct: 17 FIKNENGTILMSFIFFLPIFIGLIFLSFEISCFIQKKAKLSDAMEQATLALTVE--NNNI 74 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 S + + YL + + + +N S Sbjct: 75 PSSEQEVKNNILISSFAHAYL----------------PEETFSEPVITINSSASHMDYHA 118 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 L+ + I + A A+ + + +V D+S SM Sbjct: 119 DITMSYPAKFLNKAFNLISISDIKLDESAIAKKNT--SITAIPTDVVFVTDYSGSMNRDF 176 Query: 194 RDSEGQPLN----CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS---------- 239 ++ + ++ I P+ ++ Sbjct: 177 DGTDIDSTDISKVRIVALRRIFKNLHNEIQQNENINLVGFVPFTWGTKRTIDNNNTTPTL 236 Query: 240 ---LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR----------------DALAS 280 ++ D + ++ + + K + Sbjct: 237 LCHFPFVPKKHSPDGNYLTKYNISGLKELPGMDKIVSLNMINDINYGKLSDKEYISISEE 296 Query: 281 VIRSIKKIDN-------------------------VNDTVRMGATFFNDRVISDPSF--- 312 + D D + + D Sbjct: 297 ITDKANYTDREKALRYLSRIYYVKLENVISPVIEENIDYEKTIKSINGSDHTIDIPLSDV 356 Query: 313 -------------SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + + + G T I+ + + + +N + Sbjct: 357 RVNDFCLRTSMAHTFDSNNMDDSQFSKILNS-RANGGTLISSGILSGNNLFKETNNNNR- 414 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGI-------------------AICNKAKSQGIRI 400 K +V+L+DG++ + G +C + K IR+ Sbjct: 415 ---------KIMVILSDGDDNDNTHAGDNRINKDAPYLNITKKLIDNGMCERIKDNDIRM 465 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + IA + C +F+ A++ EL + E V R T Sbjct: 466 VFIAIGYTPDENID----WKKCVGEGNFYLASNAQELELDINRALATE-DTEVGRNT 517 >gi|262275460|ref|ZP_06053270.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] gi|262220705|gb|EEY72020.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] Length = 453 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 58/462 (12%), Positives = 122/462 (26%), Gaps = 56/462 (12%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G +I + P++ GV + V+ R+ + + A + + + + E Sbjct: 8 SRQQGVAVVIFVIAYPLLFGVFVLAVESTRYLQTHARIGDGVEVASLAVAANISSDITEN 67 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + AKN L + K + V E + Y+V LSS Sbjct: 68 KTLAKNYVDGFVPDGTISLA-DINIERKSCDEIYGSQCGVAGVYDEEGLVFTQYKVTLSS 126 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 ++ I T ++ + + +V DFS SM Sbjct: 127 EFESWYPEDDFAPGFEEIVEL-------GGTAVARKYQGFTIDVAFVADFSGSMQQTWNR 179 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + + + + +K++ + + Y Sbjct: 180 EIKYKGVVNVISDITRKLETFNDHTEQELNGKKVANKVAFIGYNFYPHNGSTFYSNV-DY 238 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-SFSW 314 + S + + A + K+ + V ++D + Sbjct: 239 KANYSRLSYKWQENIPEINYRRTARDPINNKRTPIIGRYVNNTIPLYSDDSYFYTLDLTD 298 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + TA + + A + N +K I++L Sbjct: 299 NFTQFR-----NTISTFYPDYGTASYEGIIEAAKIVN-----------NGENIRKLIIVL 342 Query: 375 TDGENTQDNEEG---------------IAICNKA-----------KSQGIRIMTIAFSVN 408 +DGE++ + +C ++ +I I F + Sbjct: 343 SDGEDSINENNPYDNRYPGFIAPLIYQSGLCQNIINDLESKEINGRNVEAKIFVIGFGYD 402 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + L CA + A+S E+ I E+ Sbjct: 403 LEKN----PGLKICAGEENVQSADSYQEIFDTVLQLISEEVG 440 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 76/452 (16%), Positives = 160/452 (35%), Gaps = 77/452 (17%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL- 72 K+ +G F I++A ++ V L ++D+ + ++ ++ + AII+ ++ Sbjct: 16 FSKNKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIVVSDPK 75 Query: 73 -EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +++ + + K+ +++NF I + Y+ Sbjct: 76 INDLNPQEERIRDVIKKNAYVNMVQNFPAEHAA-------YIIENANISFSKDLTNKYEY 128 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ L+ + L + + + + + + + + + S++ V+D S SM Sbjct: 129 KITMEAKHQLSGKNFILGFL-MPNVITHISSISTGIIQKPSDKKAFSVEMVLDCSGSM-- 185 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 D Sbjct: 186 --------------------------------------------------LDSMQESCDL 195 Query: 252 SLSEEHFVDSSSLRH-VIKKKHLVRDALASVIRSIKKIDNVNDTV--RMGATFFNDRVIS 308 S + S + K + ++ A + + I++ + R+G FN ++ Sbjct: 196 SSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQTFPQISARIGLITFNHYIMQ 255 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEA 367 D S ++K G T M AY+ + + NE + H + +N+ Sbjct: 256 DSKLSNNF-----NVIKKTISRMKPKGGTDTFLPMNAAYEYLNNIPNETKAHNISDNVPL 310 Query: 368 KKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K+YI+L+TDGEN + + I +C+ A+ GI I +I + + CAS Sbjct: 311 KRYIILMTDGENNHPSYDLKTINVCDNARKNGIIIYSIFLNYYEYTD--GYELARKCASS 368 Query: 426 N-SFFEANSTHELNKIFRDRIGNEIFERVIRI 456 FF AN+T L F+ I + I ++ +RI Sbjct: 369 EKHFFYANNTKALLDSFKS-IAHAIQDKAVRI 399 >gi|240137440|ref|YP_002961911.1| hypothetical protein MexAM1_META1p0705 [Methylobacterium extorquens AM1] gi|240007408|gb|ACS38634.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 473 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 77/490 (15%), Positives = 143/490 (29%), Gaps = 49/490 (10%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + + L + G ++ AL + +G+ G+ VD + L AA A Sbjct: 1 MTRAGRLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + V + + + ++ + + K + L N K F + + Sbjct: 61 ALAGVVTAKEFIAANAQQSDVTASGIKAGESQALKAFNANASKVPFATVSLSQLEIVRTG 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + L + S F R G+ + + + A SY + Sbjct: 121 Q----------TLDATVSYTATVQSTFGRIFGLSATTLTNRVNASVDLASY-----LDFY 165 Query: 181 WVIDFSRSMLDYQRDSE---------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 ++D S SM +DS+ + NC ++ GK+ +R + ++ Sbjct: 166 LMVDVSGSMGLPTKDSDAEVLAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNN 225 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSE--EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + K + P + + L +R A Sbjct: 226 AVCELLKRASTPVVPNQYRIGIYPFINQLATLAPLTDTTTSLAALRTAAQCDKVWPLAFT 285 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ DT ND S + + T N ST + D Sbjct: 286 NLLDTGSTQLFTNNDPKTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDG 345 Query: 350 IISSNEDEVHRMKN--NLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRIMT--IA 404 + +S + + K+ D +Q + + C K+ G I I Sbjct: 346 MQNSQSYSAWKDTKTFSGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIP 405 Query: 405 FSVNKTQQE----------------KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 +++ K L CASP F+ AN+ ++ Sbjct: 406 YNIVKNYNNDSYIVWENGRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQ- 464 Query: 449 IFERVIRITK 458 RV RIT+ Sbjct: 465 -ALRVARITQ 473 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 119 bits (298), Expect = 7e-25, Method: Composition-based stats. Identities = 66/525 (12%), Positives = 143/525 (27%), Gaps = 84/525 (16%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K K+ +++ G + IITALL +L VD + L QA A + Sbjct: 7 LTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAAL 66 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV-- 120 Q + +Q+IE ++ + R + + Sbjct: 67 LLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRDFSSAKVQAQWKKRNQELVQGLVKLYL 126 Query: 121 ------------------------------EMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 N +A + + + + ++ Sbjct: 127 RSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQT 186 Query: 151 MGIKSWLIQTKAEAET----VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 + S L + + + + + V D SRSM+ + P Sbjct: 187 LVSSSQLYDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMMWAINATGNNPPEVNYP 246 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY-------------------YMLYPG 247 + + + + + + N+ + Y + Sbjct: 247 NRRIDALREAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTNCVLPYYVKSE 306 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA--SVIRSIKKIDNVNDTVRMGAT----- 300 +S + + H+ K L+ L I I + D T Sbjct: 307 DKKREISAKFKKGYNGGNHIAKGFELLDRDLDIPKTIDQISQFDGQKRTYDFTLDSKTSR 366 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN-EDEVH 359 + + + V + G TA+ + + ++ N + E Sbjct: 367 NYCLEDNVNKKTTQAWFDKNNRAVASALKQIIPRGGTAVTSGIFIGTNLMMEKNKDFEAM 426 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEG------IAICNKAKSQ------------GIRIM 401 K ++ +++L+DGE+ +++ +C + K + RI Sbjct: 427 PNKIGTNTRRILMILSDGEDNIPSKDTLVKLMEAGLCTRVKEKIDGLQDSNYPKVETRIA 486 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 +AF +K + C + ++ +S L F+ I Sbjct: 487 FVAFGF--NPPQKQQEVWKKCVG-DQYYSVSSKQALFDAFKQIIS 528 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 56/448 (12%), Positives = 126/448 (28%), Gaps = 35/448 (7%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + +S G ++ A ++P++ + +++ + + L A A Sbjct: 8 VRFALTQRDESRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGR 67 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + +++A T ++ + ++ F +T + Sbjct: 68 AFSEEQTVEAAKAAARLTAAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKV 127 Query: 127 SAYQVVLSSRYDLLL-----NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 V + + L + + + V+ + + I Sbjct: 128 PTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISL 187 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 V+D S SM + + +G+ + + + + + R+ + P + Sbjct: 188 VLDRSGSMDW-KTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQ--PQYIRRVSYNR 244 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 D + E + + + L DA V+ + + V+ Sbjct: 245 GYDEYDLYDHAWEEVFGLGPAPN-TPWEDLVLAVDAFLRVLDQTPQNEQVS------IAS 297 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +N D + GST I + M + N Sbjct: 298 YNSHGTLDCWLLDDFDSVR-----AAVAQLAPNGSTGIGNGMNSGKTAFTHEN------- 345 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 A K +V++TDG + + S + I T+ F + + Sbjct: 346 -ARPYASKTMVVMTDGNHNYGTQPNTVAQQLMSSSNLNIQTVTFGGGAD-----QETMQE 399 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGN 447 A + A+S EL F + N Sbjct: 400 VAVTGLGRHYHADSGDELVSAFEEIANN 427 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 69/499 (13%), Positives = 144/499 (28%), Gaps = 103/499 (20%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + +G+ I+TAL ++ + +DVVR + + L+ +A + A+ + Sbjct: 1 MGADRSGNVAILTALAFVPLMLITIGSLDVVRMTTAQAKLQSTLDSATLAAASLSNTADI 60 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 E + ++ N D ++ T ++ +A V + Sbjct: 61 EDTVDEY---------------------IQANLPDTAPWTTLKLTMGDVTDSLNAKSVEI 99 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 ++ D+ + L+ GI + + A+ +++ + + V+D S SM + Sbjct: 100 TATVDIEM----TILKLAGIDKTSVLASSVAQQAAQN------IEVSVVLDISSSMGGSK 149 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S + F + + + P+ + N +Y + Sbjct: 150 ITSLREAAKGFIDTMLKE-------DEDKEYTSLSIIPFGGTVNIGDFYD----TYAVNS 198 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S +DS S + K++ I+ +D M A N R Sbjct: 199 STPGVIDSPSSANYYVNKNVPYGKFMFSTEREGCIEYTDDDFDMAAIPANSRPQVPDFTK 258 Query: 314 WG--------------VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--- 356 W ++ T +K D + T ++ + S Sbjct: 259 WVATNPWCPSEDSAMVLNSNNTTDLKALIDDMDLSDGTGMDIGALWGAKVLSGSMRGQLG 318 Query: 357 ----EVHRMKNNLEAKKYIVLLTDGENTQ------------------------DNEEGIA 388 + N+ + K V++TDG T N + Sbjct: 319 GDFSDRPADFNDEDTLKVAVIMTDGAITAQFRPRDYTTTGKIKNKTQQTIVSKGNINTAS 378 Query: 389 ------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTH 435 +C +++ TI F L CAS ++++ Sbjct: 379 TKADDAVAYFKRVCEYLNDNNVQVYTIGFQ--INSGSLPDQLLKYCASSLSNYYFVE-GL 435 Query: 436 ELNKIFRDRIGNEIFERVI 454 + F RV Sbjct: 436 NIEDAFNAIASAVNNLRVS 454 >gi|258620794|ref|ZP_05715829.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258586992|gb|EEW11706.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 419 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 45/462 (9%), Positives = 122/462 (26%), Gaps = 72/462 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K IK G II ++ P M+ + + + + L +A++ A + + Sbjct: 2 KSIKKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL-------I 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +N+ ++ ++ ++ Y++ N ++ R + + + V Sbjct: 55 ASPKEDDENNVSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVA 114 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++++ ++ ++ L+ + + + V + +++D S SM Sbjct: 115 ATAKHKSWISYENISLK----PEFTVNGSSVTRKFLPQ-----PVDVYFIVDMSASMRAT 165 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLD 250 + ++ R V + +V R + + + Sbjct: 166 W-QNGKSQIDEVKNVITRVVNDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYALYN 224 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + R L + + + + Sbjct: 225 TPQQTVSNMFFPPKRVNPSDSGLFSHRDIDLTQ---------NYSSFLQIMNDRNFYPP- 274 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH---RMKNNLEA 367 + A + +T IN + + + + R Sbjct: 275 -----RSACTESWQGIIAAAQAADKATDINP--EQVFIILSDGADCPWQRQDRWGRLRTT 327 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQ-------------------GIRIMTIAFSVN 408 + Y+ L DG +C K + + + I + Sbjct: 328 EYYLKKLVDG----------GLCKNLKQRIRQKPNRFQSSTPTENEKTKVTMGVIGVNYQ 377 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + +C + + A ++ K + I E Sbjct: 378 VNPNDG----FGDCVGRENIYHATQGEDVYKYILNLINEETG 415 >gi|258624772|ref|ZP_05719703.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] gi|258582934|gb|EEW07752.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] Length = 419 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 45/462 (9%), Positives = 122/462 (26%), Gaps = 72/462 (15%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K IK G II ++ P M+ + + + + L +A++ A + + Sbjct: 2 KSIKKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL-------I 54 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +N+ ++ ++ ++ Y++ N ++ R + + + V Sbjct: 55 ASPKEDDENNVSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVA 114 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++++ ++ ++ L+ + + + V + +++D S SM Sbjct: 115 ATAKHKSWISYENISLK----PEFTVNGSSVTRKFLPQ-----PVDVYFIVDMSASMRAT 165 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLD 250 + ++ R V + +V R + + + Sbjct: 166 W-QNGKSQIDEVKNVITRVVNDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYALYN 224 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + R L + + + + Sbjct: 225 TPQQTVSNMFFPPKRVNPGDSALFSHRDIDLTQ---------NYSSFLQIMNDRNFYPP- 274 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH---RMKNNLEA 367 + A + +T IN + + + + R Sbjct: 275 -----RSACTESWQGIIAAAQAADKATDINP--EQVFIILSDGADCPWQRQDRWGRLRTT 327 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQ-------------------GIRIMTIAFSVN 408 + Y+ L DG +C K + + + I + Sbjct: 328 EYYLKKLVDG----------GLCKNLKQRIRQKPNRFQSSTPTENEKTKVTMGVIGVNYQ 377 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + +C + + A ++ K + I E Sbjct: 378 VNPNDG----FGDCVGRENIYHATQGEDVYKYILNLINEETG 415 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 118 bits (296), Expect = 2e-24, Method: Composition-based stats. Identities = 50/454 (11%), Positives = 124/454 (27%), Gaps = 53/454 (11%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G AL++ +L + G+++++VR + L AA A++ ++ + E+ Sbjct: 6 RQGGGLSPAFALMLTGILALTGVVIELVRGYSGQSLLSAAAD-AVLYSAADSDTAAEDAV 64 Query: 77 SRAKNSFTF-----------PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + ++ +++ L E D+ A + Sbjct: 65 ALVQANLAGRPLQVGPPSLSQSEQGARVILQGHVPALMDLSVIGEGGDMPVAAAARASSA 124 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG------VSI 179 ++ ++ L ++ + G+ + + Sbjct: 125 RTRIEIALVLDVSNSMSGAPMKAIKQGLTEFGEVLFGRERRNQDRVVSIIPATGLVNIGD 184 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + S+ + R V + + Sbjct: 185 HPELFHPESLAFPFGLQTLAHERGWSNLLTRDVPGRQRKAF---CARLPEHVDGIDRLAE 241 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-----LASVIRSIKKIDNVNDT 294 L + ++ H K ++ L + ++ + Sbjct: 242 LTPGWIRKLEQAPVG-----ETQPRLHYSTKPPAIKQYEDGTPLRAFAPRENPLERYLEN 296 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F+D + R + +T + + + + Sbjct: 297 RRDKLGIFDDADCGVSPIQAHLS--TRAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQW 354 Query: 355 EDEV-------HRMKNNLEAKKYIVLLTDGENTQDNEEGIA-----ICNKAKSQGIRIMT 402 + R + +K +VL +DGE+ +C + K +GI++ T Sbjct: 355 QGRWGQGAAELPRPYGQADNRKIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYT 414 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSF-FEANSTH 435 +AF + CAS S ++A S + Sbjct: 415 VAFEGDARFVA-------QCASDRSHAYKATSGN 441 >gi|254559618|ref|YP_003066713.1| hypothetical protein METDI1076 [Methylobacterium extorquens DM4] gi|254266896|emb|CAX22695.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 473 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 78/490 (15%), Positives = 146/490 (29%), Gaps = 49/490 (10%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + + L + G ++ AL + +G+ G+ VD + L AA A Sbjct: 1 MTRAGRLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + V + + + ++ + + K + L N K F + + + Sbjct: 61 ALAGVVTAKEFIAANAQQSDVTASGIKAGESQALKAFNANASKVPFATVSLSQLEIVRSG 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + L + S F R+ G+ + + + A SY + Sbjct: 121 Q----------TLDATVSYTATVQSTFGRTFGLSATTLTNRVNASVDLASY-----LDFY 165 Query: 181 WVIDFSRSMLDYQRDSEGQPL---------NCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 ++D S SM +DS+ + L NC ++ GK+ +R + ++ Sbjct: 166 LMVDVSGSMGLPTKDSDAEALAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNN 225 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSE--EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + K + P + + L +R A Sbjct: 226 AVCELLKRASTPVVPNQYRIGIYPFINQLATLAPLTDTTTSLAALRTAAQCDKIWPLAFT 285 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ DT ND S + + T N ST + D Sbjct: 286 NLLDTGSTQLFTNNDPKTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDG 345 Query: 350 IISSNEDEVHRMKN--NLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRIMT--IA 404 + +S + + K+ D +Q + + C K+ G I I Sbjct: 346 MQNSQSYSAWKDTKTFSGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIP 405 Query: 405 FSVNKTQQE----------------KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 +++ K L CASP F+ AN+ ++ Sbjct: 406 YNIVKNYNNDSYIVWENGRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQ- 464 Query: 449 IFERVIRITK 458 RV RIT+ Sbjct: 465 -ALRVARITQ 473 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 76/485 (15%), Positives = 139/485 (28%), Gaps = 49/485 (10%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + G ++ AL + +G+ G+ VD + L AA A + Sbjct: 10 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 69 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V + + + ++ K + L N K F + + + + Sbjct: 70 VTAKEFIAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRSGQ---- 125 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 L + S F R G+ + + A Y + ++D Sbjct: 126 ------TLDATVSYTATVQSTFGRLFGLSVTTLTNRVNASADIAGY-----LDFYLMVDV 174 Query: 186 SRSMLDYQRDSE---------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S SM DS+ NC K ++ GK+ +R + ++ + + Sbjct: 175 SGSMGLPTTDSDAALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCAL 234 Query: 237 NKSLYYMLYPGPLDPSLSEEHF--VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + P + + L ++ N+ DT Sbjct: 235 LDRASKPIVPNQYRIGIYPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDT 294 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ND S + + T N S + D + +S Sbjct: 295 GSTQLFTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQ 354 Query: 355 EDEVHRMKNNL--EAKKYIVL-LTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNK 409 + K K+ D + +Q + A C K G I I+ I ++ K Sbjct: 355 TYSSWKDKKTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVK 414 Query: 410 TQQEKA----------------RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + L CASP FF ANS ++ +V Sbjct: 415 SYTNEGTIVWENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQ--ALKV 472 Query: 454 IRITK 458 RIT+ Sbjct: 473 ARITQ 477 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 118 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 59/392 (15%), Positives = 114/392 (29%), Gaps = 56/392 (14%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ K G I + ++M ++L GGM VD++R+ L+ + A + A+ Sbjct: 16 RGIFQRFRKDEDGALIIFSLMMMVMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAADL 75 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 R ++ D V + V N Sbjct: 76 DQVMAPADVVRDYMDKAGMLHFLQG---------------DPIVDQGINYRIVTANASAP 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + L + + G+ S + + AE V I V+D S Sbjct: 121 M------PLFFYDLPKVFSSPFTPGMSSLTVSGSSTAEERVSD------VEISLVLDVSS 168 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM R + +P T ++ N G+ + PY N Sbjct: 169 SMNSNNRMTNLRPAAR----EFVTTVLANNTNAPQGLITISMIPYSAVVN-----PGTDI 219 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +++ H + + L + + + Sbjct: 220 APHLNINRTHEYSTCP-MFDDTEFTTTALNLGASYDHVSHFSYGGSNDMPINPNYTWCFA 278 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE------------ 355 D + + H + T + + G+TAI+ ++ + SS + Sbjct: 279 GDLN-AIKPHTTNEADLHTAINNLHAYGNTAIDMGVKWGVALLDSSTQSLISSLAGASGT 337 Query: 356 ------DEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + + K +VL+TDG+NTQ Sbjct: 338 GVPAIANGRPELHTQADVLKVLVLMTDGQNTQ 369 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 71/245 (28%), Gaps = 20/245 (8%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + Q V +SP + + S+ + +S Sbjct: 366 QNTQQWDLVEPYKSGMSPVWFDLDDVNQPLWDVDFNKTSVQYQGEATNSRWDDWF--YWN 423 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ V + G + + FA Sbjct: 424 GYSGTLRYRDYPNGFNSRLAYVNASPVDASGPGEGTRYVDNGDELYHASWQQLFAD--RS 481 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 + + Y + + + ++++ ++ N IC A Sbjct: 482 YFLINNHYFLDAYY-----AGAWSWNEYWGTDNSIDHLIV----NGSEANTRLSNICAAA 532 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 ++QGI I T+AF + L +CAS +S + +++ F I ++I R Sbjct: 533 RAQGIVIYTVAFEAPSGG----QTALQDCASSSSHYFDVDGTDISGAF-SAIASDI--RN 585 Query: 454 IRITK 458 +++T+ Sbjct: 586 LKLTQ 590 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 118 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 51/453 (11%), Positives = 122/453 (26%), Gaps = 38/453 (8%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 Y + + T +++ L + ++LG+ G D L A + A Sbjct: 2 RYPSSVFRRKTLSSIVVSGLTLAILLGLNGCS-DKSDDQQKRAEL---ADQTKLAA---- 53 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + +++ + + T + + + Sbjct: 54 --EQQAELKQQVELKDSVERQANRQRDAAIAIHEQATSTKLRTMNAEHRAYIAQPAATIS 111 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 L+ + + P ++ + T + +++ ++ Sbjct: 112 AAPALNGDWPGAVPPERNRFEKQVQNGIMVAGETPVSTFAIDVDTGSYTTLRRML-KEGR 170 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + LN F K+ + + + + M+ L Sbjct: 171 LPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPYNDDMMLLRIGLKGYEQSKA 230 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S K L++ AL + + + D V+ V + Sbjct: 231 ELGASNLVFLLDVSGSMASPDKLPLLQTALKMLTQQLGAQDKVSIVV---------YAGA 281 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + + GST +Q AY + + Sbjct: 282 AGVVLDGAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAYQL--AKKHLVEGGINR----- 334 Query: 369 KYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++L TDG+ + + K GI + T+ F + Sbjct: 335 --VILATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADK--GN 390 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + +S LN+ R + ++ ++ I K Sbjct: 391 GQYAYIDS---LNEA-RKVLVEQLSATLLTIAK 419 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 118 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 69/506 (13%), Positives = 135/506 (26%), Gaps = 66/506 (13%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K K+ ++ G + IITALL +L + VD + L QA A + Sbjct: 7 LTAKLFNSIKQFFQNEHGVYAIITALLAFQLLLLVAFTVDGTGILLDKARLAQATDQAAL 66 Query: 63 TASVPLIQ------------------------SLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 + + + + K + + ++ YL + Sbjct: 67 LLIAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSDD 126 Query: 99 ENNLKKN--------FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 +N K + F + + + + A V S + L + S Sbjct: 127 KNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSS 186 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + + + + + V D S SM+ D Sbjct: 187 SRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDV 246 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 K D M + D L + + Sbjct: 247 VKDIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQSKKEEISNLYR 306 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGA-----TFFNDRVISDPSFSWGVHKLIRTIVK 325 + + A + ++ I+ + + F + S G K Sbjct: 307 NNKLDQA-SKLLDQYMDIERTINQIDQFNGSNISYDFINTTKKCLGKSEGKETTRAWFDK 365 Query: 326 ------TFAIDENEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNLEAKKYIVLLTDGE 378 + + G TA+ M + + +N+D K N ++ +++L+DGE Sbjct: 366 KNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGE 425 Query: 379 NTQDNEEG------IAICNKAKSQ------------GIRIMTIAFSVNKTQQEKARYFLS 420 + + E +CNK K + R+ +A + Sbjct: 426 DNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGY--NPPQDQVNVWK 483 Query: 421 NCASPNSFFEANSTHELNKIFRDRIG 446 C ++ S L FR I Sbjct: 484 QCVG-KQYYTVFSKQGLLDAFRQIIS 508 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 118 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 69/506 (13%), Positives = 135/506 (26%), Gaps = 66/506 (13%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K K+ ++ G + IITALL +L + VD + L QA A + Sbjct: 7 LTAKLFNSIKQFFQNEHGVYAIITALLAFPLLLLVAFTVDGTGILLDKARLAQATDQAAL 66 Query: 63 TASVPLIQ------------------------SLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 + + + + K + + ++ YL + Sbjct: 67 LLIAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSDD 126 Query: 99 ENNLKKN--------FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 +N K + F + + + + A V S + L + S Sbjct: 127 KNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSS 186 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + + + + + V D S SM+ D Sbjct: 187 SRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDV 246 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 K D M + D L + + Sbjct: 247 VKDIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQSKKEEISNLYR 306 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGA-----TFFNDRVISDPSFSWGVHKLIRTIVK 325 + + A + ++ I+ + + F + S G K Sbjct: 307 NNKLDQA-SKLLDQYMDIERTINQIDQFNGSNISYDFINTTKKCLGKSEGKETTRAWFDK 365 Query: 326 ------TFAIDENEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNLEAKKYIVLLTDGE 378 + + G TA+ M + + +N+D K N ++ +++L+DGE Sbjct: 366 KNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGE 425 Query: 379 NTQDNEEG------IAICNKAKSQ------------GIRIMTIAFSVNKTQQEKARYFLS 420 + + E +CNK K + R+ +A + Sbjct: 426 DNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGY--NPPQDQVNVWK 483 Query: 421 NCASPNSFFEANSTHELNKIFRDRIG 446 C ++ S L FR I Sbjct: 484 QCVG-KQYYTVFSKQGLLDAFRQIIS 508 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 58/461 (12%), Positives = 133/461 (28%), Gaps = 61/461 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ G I + +P + + +L D + + L A + + + Sbjct: 25 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAENN---- 80 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + N + NF + + + +N +++ ++ Sbjct: 81 ----------------AKNDTRNNELISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYH 124 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSM-LD 191 + ++ S LI S + K+ + + +V DFS SM D Sbjct: 125 TQLSQYHIDANIEQPTLFPFTSLLIDHDNFIIGGSAAAIKDVPAMDVVFVTDFSGSMEGD 184 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY----YMLYPG 247 + + + L+ + K SP+ + P Sbjct: 185 FHNPDDPEVLSKLDELKRIFFKIADDIYTANKDSTISFSPFSWGTKSADNKKCSLHFMPK 244 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + +K + + +I+ I N+ + + +D + Sbjct: 245 EKNKIYPIPSNEIERNTEAHQEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELC 304 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S ++ + NE GST ++ + D ++ Sbjct: 305 LYSSNAYPISLAKDIDDLNVIKTMNEGGSTLVSSGIMQGADLLMDGV-----------NH 353 Query: 368 KKYIVLLTDGEN--------------------TQDNEEGI---AICNKAKSQGIRIMTIA 404 K +++L+DG + D + +C K + RI+ I Sbjct: 354 NKLMIILSDGHDYPTTAVVHDKTITSAKEVRVNVDISRQLVQHGMCKKIRETVGRIVFIG 413 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 N + + +C +F+ A +T EL R + Sbjct: 414 IGYNPSANHYINWA-EDCVDTENFYLAMNTKELEDSIRSAL 453 >gi|90412167|ref|ZP_01220173.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] gi|90326891|gb|EAS43276.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] Length = 504 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 50/491 (10%), Positives = 123/491 (25%), Gaps = 77/491 (15%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G I TAL + + G+ ++ R+ ++ L A + A + + Sbjct: 15 RHQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAITTANQDD---- 70 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY-QVVLS 134 T+ Q Y+ N N E + + + + Y Q ++ Sbjct: 71 -------KTYENQLATGYIQAYIRNITSINNIKIERSEGIDNYPTPDGNEEREYFQYRVT 123 Query: 135 S-RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + + ++ Y + + I +V DFS SM + Sbjct: 124 AKTNHISWLSSDIIPSFAPTETVA---NRALARNYPIYLGDKDIDIVFVSDFSGSMKGNK 180 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD--------------------------E 227 + + ++ Sbjct: 181 IRALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCITQLDYRPNFNG 240 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + S + + + + R ++ + + Sbjct: 241 GNYSSYEDIDWSTWSTWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNASKSETGSGWYF 300 Query: 288 IDNVN--DTVRMGATFFNDRVISDPSFSWGVH----------------KLIRTIVKTFAI 329 D + + A F + + S +T + Sbjct: 301 PDPYSYINFPGSVAKTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLTSEKTALSPIQN 360 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGE-----NTQD 382 + + G T++ + + + + I++L+DG+ +T + Sbjct: 361 NMSPDGGTSVYQGLIRGAQILEQGRPTSPSTETSAAYNSRIKMILMLSDGQEMPYVSTFN 420 Query: 383 NEEGIAICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 +CN K Q + + + + + + NC N+ + + Sbjct: 421 QLVNQGLCNTIKAQFNDSDQPLYMGVLGIEFD----AQGQQGFKNCVGQNNITNVDDVDD 476 Query: 437 LNKIFRDRIGN 447 L K + I Sbjct: 477 LIKEILEMIKK 487 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 77/490 (15%), Positives = 141/490 (28%), Gaps = 49/490 (10%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + + L + G ++ AL + +G+ G+ VD + L AA A Sbjct: 1 MTRAGRLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + V + + + ++ K + L N K F + + + Sbjct: 61 ALAGVVTAKEFIAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRSG 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + L + S F R G+ + + + A Y + Sbjct: 121 Q----------TLDATVSYTATVQSTFGRLFGLSATTLTNRVNASADIAGY-----LDFY 165 Query: 181 WVIDFSRSMLDYQRDSE---------GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 ++D S SM DS+ NC K ++ GK+ +R + ++ Sbjct: 166 LMVDVSGSMGLPTTDSDAALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNN 225 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF--VDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + + P + + L ++ Sbjct: 226 AVCALLDRASKPIVPNQYRIGIYPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFT 285 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N+ DT ND S + + T N S + D Sbjct: 286 NLLDTGSTQLFTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDG 345 Query: 350 IISSNEDEVHRMKNNL--EAKKYIVL-LTDGENTQDNEEGIAICNKAKSQG--IRIMTIA 404 + +S + K K+ D + +Q + A C K G I I+ I Sbjct: 346 MQNSQTYSSWKDKKTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIP 405 Query: 405 FSVNKTQQEKA----------------RYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 ++ K+ + L CASP FF ANS ++ Sbjct: 406 YNYVKSYTNEGTIVWENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQ- 464 Query: 449 IFERVIRITK 458 +V RIT+ Sbjct: 465 -ALKVARITQ 473 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 65/513 (12%), Positives = 131/513 (25%), Gaps = 72/513 (14%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ----- 58 ++ + + + AL + +L G D + L+ AA Sbjct: 7 KSRATALVARFVGDSKANIATTFALALLPILTAIGCGTDYSMAMRLKVKLQSAADAASIA 66 Query: 59 -----TAIITASVPLIQSLEEVSSRAKNSF--TFPKQKIEEYLIRNFENNLKKNFTDREV 111 +A A++ + + + Y + + + + K + Sbjct: 67 SISVNSAGYAAAMAMTSDGSVTAGVNEADNIFKGNASTFGGYTLTSETSTVTKTRSTLSS 126 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP----LSLFLRSMGIKSWLIQTKAEAE-- 165 + YQ + S L +L S + + + Sbjct: 127 QVQFTAAVPTTFLTVIGYQSITVSGSSSSSVTLPLYLDFYLTLDVSGSMGLPSTSAEAQR 186 Query: 166 --TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 +S ++++ FS DS Q + S K Sbjct: 187 MQAISPDNYRQYPTGCTLACHFSP-QNSACTDSGTQGYPTNNYCLGYAISRVSQSGYKSL 245 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + K +P V S+ L + + + + + + V A+ + Sbjct: 246 LTTNKNNPKGVQLPSSIVSGLPNSLYNKLPTVANCPTDGTDDCIQLRLDAVGYAVNQLFT 305 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR-------------TIVKTFAID 330 + V + R+G F + S + + T++ T Sbjct: 306 TANTTKKVANQFRIGLYPFIRYLYSYYPLTTNISGSTSDSSTINYAAANLATLLDTNTNA 365 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------------ 378 G T I+ A+ + I S + Y+ L+TDG Sbjct: 366 SLGSGGTHIDTALSSVNSLITSVGDGSA-----TTNTLPYVFLVTDGAQDPQVKGVPNGS 420 Query: 379 ----NTQDNEEGIAICNKAKSQGIRIMTIAF-----------------SVNKTQQEKARY 417 N C K++GI I + Sbjct: 421 WSGSNHATTINPTTSCTPLKNRGIIISVLYIPYQTINPVNASFAGDEDDYANNNIPNIPP 480 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L CASP F+ AN+ ++ + + Sbjct: 481 SLQACASPGFFYTANTPADITSALNAMFNHAVS 513 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 60/557 (10%), Positives = 130/557 (23%), Gaps = 118/557 (21%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 K +S +G+F ++ L +PV+L DV + L+ A +A + +S Sbjct: 2 LLNKFWRSKSGNFTLVLGLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLASSHL-- 59 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK----NFTDREVRDIVRDTAVEMNPR 125 +++ + F + + N + L NF + Sbjct: 60 -GDLDITRNDAFNRYFQANIVGHGELDNAQATLTVDKGVNFVKTKAVASADVHLNFAFLF 118 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + +V+ + N L + L S T ++S + Sbjct: 119 GDSKHIVVDASAVESNNQLEVVLVLDNTGSMAGARMTALRTATKSLLDTLEAAKSPTRKI 178 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS---------------------------- 217 S + + D KS ++ Sbjct: 179 RASPVPFVTAVNVNGDGFDPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKKTGWAD 238 Query: 218 -----------------QNGKVGIRDEKLSPYMVSCNKSLYYML----YPGPLDPSLSEE 256 + P + P + + Sbjct: 239 AGWNGTGWKGCVEARPGAYNISDTPPDPAKPDTLFVPYFAPDDPEDAQKPSGSYGNSDKY 298 Query: 257 HFVDSSSLRHVIKKKHL---------VRDALASVIRSIKKIDNVND-------------- 293 + + + + + L ++I + D Sbjct: 299 YNNSYLNDVSDKTRLAQSGINILGIDLSNLLGNLIELLSPDDRDAREKIAKYVAPAKELI 358 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T + + E T +++ + + + Sbjct: 359 TEIGSPVTVGPNRACPTPVVPLTDDFDKLRKAASQMTEWNGSGTNVSEGLSWGMRVLSPA 418 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQD------------------------------- 382 K ++LLTDGEN Sbjct: 419 APYTDGAPWKTPGISKIVLLLTDGENVVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTA 478 Query: 383 ----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHEL 437 + +C + K++G++I T+ + R S CAS +++ N +L Sbjct: 479 ARNVDGWTKNVCTQLKNEGVQIYTMVLQSDT---AANRALYSACASDPSNYYAVNDPTKL 535 Query: 438 NKIFRDRIGNEIFERVI 454 +F N ++ Sbjct: 536 PNVFLQIANNFTKLQLT 552 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 50/453 (11%), Positives = 121/453 (26%), Gaps = 38/453 (8%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 Y + + T +++ L + ++LG+ G D L A + A Sbjct: 2 RYPSSVFRRKTLSSIVVSGLTLAILLGLNGCS-DKSDDQQKRAEL---ADQTKLAA---- 53 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + +++ + + T + + + Sbjct: 54 --EQQAELKQQVELKDSVERQANRQRDAAIAIHEQATSTKLRTMNAEHRAYIAQPAATIS 111 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 L+ + + P ++ + T + +++ ++ Sbjct: 112 AAPALNGDWPGAVPPERNRFEKQVQNGIMVAGETPVSTFAIDVDTGSYTTLRRML-KEGR 170 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + LN F K+ + + + + M+ L Sbjct: 171 LPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPYNYDMMLLRIGLKGYEQSKA 230 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S K L++ AL + + + D V+ V + Sbjct: 231 ELGASNLVFLLDVSGSMASPDKLPLLQTALKMLTQQLGAQDKVSIVV---------YAGA 281 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + + GST +Q AY + + Sbjct: 282 AGVVLDGAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAYQL--AKKHLVEGGINR----- 334 Query: 369 KYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++ TDG+ + + K GI + T+ F + Sbjct: 335 --VIFATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADK--GN 390 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + +S LN+ R + ++ ++ I K Sbjct: 391 GQYAYIDS---LNEA-RKVLVEQLSATLLTIAK 419 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 117 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 51/450 (11%), Positives = 124/450 (27%), Gaps = 66/450 (14%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + + I G I ++ M+ + + + QA A + Sbjct: 2 HFNQSISKQHGSVAISYLAMLIPMIIAAASTIVIGYQVQLSNRGMQATDAASLAC----- 56 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + + + + + + + Sbjct: 57 ---------------EFSGEYDGTMAQGYLDYYRPKIDKVSGQIG------THSGCNVSL 95 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 LS+ + L + F+ + + A + + + V+D S SM Sbjct: 96 SYSLSTIFTSLTLSDASFV---------VSSTANEKAYVTEDVASEPLELILVLDISGSM 146 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D + A + ++ + VS N + + Sbjct: 147 ---ASDLDDLKAILKRGLASLKEQQNNALSKDHIKVSIVPFSDGVSVNNAPWLNETGTFC 203 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 ++E S++ V + Sbjct: 204 VEGITESGGKFSAAHTVANLDITH-------------DQTPVKTFQPDKWLMDCSAMSVT 250 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKNNLEA 367 + ++++ G TA + + + + +N + Sbjct: 251 LPLTADLNQVT-----NAVDSLRTEGGTASYQGLIWGLRQLTPNWQKAWEVGPNRNFDKV 305 Query: 368 KKYIVLLTDGEN---TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 ++ +VL+TDG + D +C++AK G+ + + F V + E+ + CA Sbjct: 306 ERKLVLMTDGADYGSHFDELINAGLCDRAKDYGVALNFVGFGVYGARLEQ----FTRCAG 361 Query: 425 PNS-FFEANSTHELNKIFRDRIGNEIFERV 453 + F A++T EL+ F + + R+ Sbjct: 362 DANGVFSASNTQELDSYFSQLLSVQYDTRL 391 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 117 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 65/582 (11%), Positives = 135/582 (23%), Gaps = 135/582 (23%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA----------- 56 S++ +G+ + A+ + +L G +D R + A++ A Sbjct: 22 RQVSRRFSGDISGNIATLFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTALMLSRD 81 Query: 57 --------AQTAIITAS---VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 A A ++ + + + + + + + Sbjct: 82 LSQGTITAADVAAKASTYFKALYTSTDAQSVAVTASYTASTSSSASNIQLNASGQIVTQF 141 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL--------------------NPLS 145 + +T +V L+ + + LS Sbjct: 142 MKLVGFPTMTFNTKATTTWGDVKMRVALALDNTGSMAYSGKMTALQNAVAGSGGLIDQLS 201 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS------------------- 186 +S G + A+ V S + + + + Sbjct: 202 ALAKSPGDVYISLIPFAKVVNVGASNYAQSWIDWTDWQNPPTIQPNNGSYQAAIPNASFT 261 Query: 187 ---RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS-LYY 242 M+ N F + + + +S + S Y+ S + Sbjct: 262 QSQWDMVGPGSSCPFTSGNGFPYFSCTSGPATASSSASKVPSSGSYSGYICPGYDSASHS 321 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKK------------------------------KH 272 S + S + Sbjct: 322 YYNGCWNSVQNSTRVNWCTGSYCSCPTTGSNVPNNTCSCTGSGSSTVCKVNTFTHTWIAN 381 Query: 273 LVRDALASVIRSIKKIDNVN--------------DTVRMGATFFNDRVISDPSFSWGVHK 318 V + D + + S Sbjct: 382 ATSTWTGCVADRTQPNDANAVSPASSDVATLFPANQHMENNVQYCSSSASTKLGQIVPLS 441 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 T +K+ G T M A ++I + N + I+LL+DG Sbjct: 442 YNWTSLKSAVNAMEPTGGTNQAIGMAWAVQSLIPNGVLGAPAEDANTTYNRVIILLSDGL 501 Query: 379 NTQD-----------------NEEGIAICNKAKSQG-------IRIMTIAFSVNKTQQEK 414 NT+D + +C+ K+ I TI + + + + Sbjct: 502 NTEDRWPDYGNGSTQASGNPIDARQALLCSNLKNTKDSKGNAMYTIYTIQVNTS-SPADP 560 Query: 415 ARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L NCA SP+ F+ S+ ++ F RV + Sbjct: 561 TSTVLQNCASSPDKFYMLTSSSQIVTTFNSIGTALSKLRVAK 602 >gi|170741048|ref|YP_001769703.1| hypothetical protein M446_2844 [Methylobacterium sp. 4-46] gi|168195322|gb|ACA17269.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 117 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 52/473 (10%), Positives = 117/473 (24%), Gaps = 96/473 (20%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 +G + LL+ M+ G+ VD R + ++++ Sbjct: 24 RSGSVGFVFGLLLLPMMVAMGVSVDYARV----------------------SAARSDLAA 61 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 A + K L ++ + ++ + + V + V ++ Sbjct: 62 AADAAVLSVTNKAAMSLDMLSAQARVRDAFLKNIQTMPDISGVSADAVVIDLLGVRAATL 121 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + F +G+++ + + + ++D S SM ++ Sbjct: 122 SYTASYRTAFSGILGMRTLSVSGN-----AASKSAVPIYMDFYLLLDNSPSMGVGATSAD 176 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + D S Sbjct: 177 ISTMVSRTPDKCAFACH-----------------------------------DLSAGNSD 201 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-----ISDPSF 312 + + V + +VR A ++ + V R + Sbjct: 202 YYHLAKSLGVTMRIDVVRQATQRLMDTAANTALVPGQFRTALYTMGADCASVGLTTVSPL 261 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + G ++ D L +K + Sbjct: 262 SSDLAAAKTNAQAIDLMTIQKPGYNNDQCTDFDGVFQSLNGKIDVAGDGSTALTPQKVVF 321 Query: 373 LLTDGENTQD---------------NEEGIAICNKAKSQGIRI---MTIAFSVNKTQ--- 411 L++DG +A C K++GI+I T + Sbjct: 322 LVSDGVADAYYPSTCTRKTTGGRCQEPLTLANCTTLKNRGIKIAVLYTTYLPLPTNDWYN 381 Query: 412 ------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 Q + CASP +FE + T + + + R+T+ Sbjct: 382 TWIAPFQATLPSAMQGCASPGLYFEVSPTQGIADAMTTLFQRTVSQP--RLTQ 432 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 116 bits (290), Expect = 6e-24, Method: Composition-based stats. Identities = 73/573 (12%), Positives = 156/573 (27%), Gaps = 132/573 (23%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + K+ G + ++TALL +L + VD L QA + A + Sbjct: 6 MTMSLFYMLKRFYHDEKGVYAVMTALLAFPLLFLIAFAVDGSGILLDRARLAQATEQAAL 65 Query: 63 TASVPLIQSLEEVS----------------------------SRAKNSFTFPKQKIEEYL 94 + Q + S ++A Q + + Sbjct: 66 LLTTENNQYRADKSNLSNVQVTDEEIKNAKGSFKTAQDRKKGAQALKRNQELVQGMVKLY 125 Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 +R+++ K + +D V + + + R + ++ + + S + Sbjct: 126 LRSYDKEQKSSSPITIPKDFVAECRTQTSTRTNGESSSVACLVEGDVKRKFWLPWSYTLT 185 Query: 155 S---WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP----------L 201 S + + + + + V D S SM +D P Sbjct: 186 SNNRNTVDINSGKSYAVKEKDILIPIDLMLVNDISTSMFKPPKDDPQGPKKIDSLKTVVK 245 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDE--------------------KLSPYMVSCNKSLY 241 S+ ++GI + N + Sbjct: 246 AVANILIPDEPPKNISKYNRIGITSFGLGAQQAGKQDANQIKKCVLPFRGVDKTVNIKVK 305 Query: 242 YMLYPGP----------------LDPSLSEEHFVDSSSLRHVIKKKHLVRDA---LASVI 282 Y P D S ++E +++R++ K V+ A L I Sbjct: 306 YDGKVYPGSAYNVLKEQHKLKLFEDSSHTQEFNPQKTTVRYMYIKDSAVKLAKIFLGQWI 365 Query: 283 RSIKKIDN--------------------------VNDTVRMGATFFNDR------VISDP 310 + D FN Sbjct: 366 DNTIGYDKHVTDNPLRDVFYQYHDINASLASIASFKGNDISYDITFNKSPLCLEKTQKST 425 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + G T + + + +++ N+D K ++ Sbjct: 426 ATDIWFSSKKSSELNKVMSGIHAGGWTLASAGVFVGTNLLMNINKDATP-DKIKTNTQRI 484 Query: 371 IVLLTDGENTQDNEEGI-----AICNKAKSQ------------GIRIMTIAFSVNKTQQE 413 +++L+DG +T +CNK +++ +I +AF + Sbjct: 485 LLVLSDGVDTALPTLTQELLKGGMCNKVRNKLDELQDKNYRILPTKIAFVAFGYEQD--S 542 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + R NC P ++ +A + L ++F+ IG Sbjct: 543 ELRKEWENCVGPGNYHQAKNEKALLEVFKQIIG 575 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 116 bits (290), Expect = 6e-24, Method: Composition-based stats. Identities = 64/512 (12%), Positives = 147/512 (28%), Gaps = 130/512 (25%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + L + TG+ I+ AL + ML G D +R ++ A+I A Sbjct: 8 RCLHTLRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAV 67 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + E+ + + + ++E N Sbjct: 68 KQI-NNTEDTDALKQKVSDWFHAQVENSYALGEIEIDTTNH------------------- 107 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + F++ I + + + + + SY +++ V+D Sbjct: 108 --------NITATASGTVPTTFMKIANIDTVPVSVASAVKGPATSY-----LNVYIVVDT 154 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML + + + + GK Sbjct: 155 SPSMLLAATTAGQSTMYSGIKCQFACHTGDTHTIGK------------------------ 190 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + D S+ + + + + DA+ V+ I + D+ ++ +++G D Sbjct: 191 -------KTYANNYDYSTEKGIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYGLGDT 243 Query: 306 VISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + +T + ++ T + + + Sbjct: 244 LTEVLAPTLSTDIARTRLADSSYGLTSATSKAATYFDVSLATLKQKVGAGGDGT-----T 298 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI--------------------CNKAKSQGIRIMTI 403 + K ++LLTDG +Q + C K Q + + Sbjct: 299 SGTPLKLVLLLTDGVQSQREWVTDKVNKNRTSGAYWNKVAPLNPDWCGYLKDQSNTMAVL 358 Query: 404 A------------------------------------FSVNKTQQEKARYFLSNCASPNS 427 S + T+++ Y L++CAS S Sbjct: 359 YTEYLPITSDWGYNATVGSTMASANWKSTWGGTMDSDVSSSITRRDYIPYALADCASSKS 418 Query: 428 -FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F A+S+ E+ + + +R+T+ Sbjct: 419 LFISASSSTEITEGLSALFTQYLSS--VRLTQ 448 >gi|294139879|ref|YP_003555857.1| hypothetical protein SVI_1108 [Shewanella violacea DSS12] gi|293326348|dbj|BAJ01079.1| hypothetical protein [Shewanella violacea DSS12] Length = 405 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 50/436 (11%), Positives = 122/436 (27%), Gaps = 62/436 (14%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ + +P +L + + + + + QA+ A + Sbjct: 9 RQRGDISLMFVICLPFILTMIAVSILLAMYLLTVTRAGQASDAASLAC------------ 56 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 Q+ ++ L+ + + F + +V N + Sbjct: 57 --------GYSQRADQDLLVGILDYYRPGFVVHDGEALVSIDG--KNRCSIEATYRFNPT 106 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 LL + + + A + + + V+D S SM Sbjct: 107 MMALLPESA-------RTHVSLSSDTGATSHLVINSTPLPMDLALVLDISSSMSAQLPQL 159 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + + +VG L P+ + + + Sbjct: 160 K-------LIINGALEEIRQQDPNEVGGVRFSLVPFETGVGVLNAPWMPKSAAKVTCVDG 212 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + + A++ N+ + + + Sbjct: 213 LSY-GQHSVDYARTVDDLAEPAANL--------NIKSVFASQWLDACSMDATILPLTQDL 263 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKNNLEAKKYIVLL 374 +VK G+T+ + T++ ++E +++ ++ +VL Sbjct: 264 -----NLVKQRVDALVTSGTTSSYQGLIWGVRTLLPQWQEEWQIPPVESPALIQR-LVLF 317 Query: 375 TDGEN---TQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFF 429 TDG + D+ +C + + I + I F V+ ++ CA + Sbjct: 318 TDGADQGFHLDDLIEQGLCRVIQDKHHIEMSFIGFGVS----DRRLQQFRECAGDKGKVY 373 Query: 430 EANSTHELNKIFRDRI 445 +A +T EL FR+ + Sbjct: 374 DAQNTQELEAFFREAL 389 >gi|153833207|ref|ZP_01985874.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148870478|gb|EDL69393.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 515 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 44/512 (8%), Positives = 122/512 (23%), Gaps = 74/512 (14%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G + LL+ ++G V+ R+ L+ +A+ A + ++ Sbjct: 9 RSVHRQKGVAAVWMGLLLVPIMGFTFWAVEGTRYVQETSRLRDSAEAAAMAVTIEDQAGS 68 Query: 73 EEVSSRAKNSF-----------------------------TFPKQKIEEYLIRNFENNLK 103 + + + + + Sbjct: 69 ASTLAAKYVESYVRDIKSMNVSAQRYYRAADDRAGVLEYIQYTVNAKTTHDSWFASSFIP 128 Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 +++ + +V S + + F R + I Sbjct: 129 SFDEQQDLAGRSLARKYPAYLADNNIDIVFVSDFSGSMREQWGFNRHIKIDDLKTAISQI 188 Query: 164 AETVSRSYHKEHGVS--IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + + + ++ V + V D + + + + Sbjct: 189 SNNILCTSTRQEYVDGEWKDVCDEPGEDTTSDKLLNRVGFVPYNVRTREIIGWNQANTTS 248 Query: 222 VGIRDEKLSPYM--VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH------------- 266 + ++ + N + + + Sbjct: 249 QLNYTNGYNTHLSPYTYNDIDWDYWSRFSQTQVFNCANNQIFCPRPGWNDQQYARRIADV 308 Query: 267 ----------VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 V L K N + FN +D + + Sbjct: 309 IRLDQYQVADVYNYVDLPSSVSTMFTD--KSARRSNYYGVNQSQLFNGHGRNDSRQFYNL 366 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLL 374 + G TA + + + + + + KK +++L Sbjct: 367 RLTNKLSDLNPINAMWADGGTAAFQGILRGAQILHEGDPNSSDQEEQQAYNKKIKMLLIL 426 Query: 375 TDGENTQDNEEGI-----AICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +DG+ + +N +C+KA+ + G+ I I + + +C Sbjct: 427 SDGQESPNNGILKGLVDWGMCDKARQEIPGLYIGVIGI----DFRASQQSGFQDCVVDPR 482 Query: 428 --FFEANSTHELNKIFRDRIGNEI-FERVIRI 456 + ++ EL + + I + ++ Sbjct: 483 EDIIDVSNLDELIEKIEELIRKGSKTSGITKL 514 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 116 bits (290), Expect = 8e-24, Method: Composition-based stats. Identities = 58/461 (12%), Positives = 133/461 (28%), Gaps = 61/461 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ G I + +P + + +L D + + L A + + + Sbjct: 38 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAENN---- 93 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + N + NF + + + +N +++ ++ Sbjct: 94 ----------------AKNDTRNNELISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYH 137 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSM-LD 191 + ++ S LI S + K+ + + +V DFS SM D Sbjct: 138 TQLSQYHIDANIEQPTLFPFTSLLIDHDNFIIGGSAAAIKDVPAMDVVFVTDFSGSMEGD 197 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY----YMLYPG 247 + + + L+ + K SP+ + P Sbjct: 198 FHNPDDPEVLSKLDELKRIFFKIADDIYTANKDSTISFSPFSWGTKSADNKKCSLHFMPK 257 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + +K + + +I+ I N+ + + +D + Sbjct: 258 EKNKIYPIPSNEIERNTEAHQEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELC 317 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S ++ + NE GST ++ + D ++ Sbjct: 318 LYSSNAYPISLAKDIDDLNVIKTMNEGGSTLVSSGIMQGADLLMDGV-----------NH 366 Query: 368 KKYIVLLTDGEN--------------------TQDNEEGI---AICNKAKSQGIRIMTIA 404 K +++L+DG + D + +C K + RI+ I Sbjct: 367 NKLMIILSDGHDYPTTAVVHDKTITSAKEVRVNVDISRQLVQHGMCKKIRETVGRIVFIG 426 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 N + + +C +F+ A +T EL R + Sbjct: 427 IGYNPSANHYINWA-EDCVDTENFYLAMNTKELEDSIRSAL 466 >gi|323493925|ref|ZP_08099042.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] gi|323311866|gb|EGA65013.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] Length = 427 Score = 116 bits (289), Expect = 8e-24, Method: Composition-based stats. Identities = 60/456 (13%), Positives = 140/456 (30%), Gaps = 61/456 (13%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ L+P+M+ + + + + + +AA+ A + +S +E Sbjct: 2 AGILFIGLLPIMVILMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDENV----- 56 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + +Q ++ Y++ N + + +T + S + V + + + Sbjct: 57 --KYARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQESGEVAPFSDFVVSAKAEHKSWI 114 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + + + ++V+R Y + V + ++ DFS SM + + L Sbjct: 115 --------AYEKVDLKPEFEVAGKSVTRKYLPQ-PVDVYFIGDFSGSMNGHWKG-GKTKL 164 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 + Q +R V+ + N + R +P V ++Y Y + Sbjct: 165 DVVKQTIERVVEDIENFNTEEKSRVALLGYNPLHVKQTGTVYLNSYGYRRSWPKKVAYDY 224 Query: 260 DSSSLRHVIKK---KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + K V + +R + + D V + V F + D + Sbjct: 225 ARGTTAQTVAKMFDPPSVYSRVQEYVRGMSRHD-VKNLVVNNDRFVDYYKFYDIPLTEDY 283 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + G T+ + + A N+ ++ ++L+D Sbjct: 284 TNFKSRLANA---SLGAEGGTSSWNGIIAAAQ---------EANRATNINPEQVFIVLSD 331 Query: 377 GENTQDN----EEGIAICNKAKS------------------QGIRIMTIAFSVNKTQQEK 414 G + N +C K +S + + I + K Sbjct: 332 GADNDTNYLQRLVNQGLCTKLRSTISAKRNRFQSKTGSAGKTKVTMGVIGV----DYRVK 387 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 +C + + A ++ K + I E Sbjct: 388 ESDGFGDCFGRKNIYHAKDGDDVYKYILNLINEETG 423 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 116 bits (289), Expect = 8e-24, Method: Composition-based stats. Identities = 46/453 (10%), Positives = 105/453 (23%), Gaps = 54/453 (11%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + G+ +I L ++ + G VD R + L+QA +A + Sbjct: 5 LADQNRTEPTTFSSCDRGNVAVIFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAAL 64 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + ++ + ++++ + T T + Sbjct: 65 AVAKTIVATTTNQQAQSQAQVYLLTNVRNAVAVVTKAEISADRLTLCLDSTAQIPTTIMK 124 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLF---LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 ++ + L S +++ + V+ Sbjct: 125 IAHIETITTKATTCAQTPGGMNGTYEIALVLDNSGSMSKSAGGKSKIAALRDAATSFVNN 184 Query: 180 ------------------QWVIDFSRSMLDYQRDSEGQPLNCFGQ--------------- 206 V+D S S + N Sbjct: 185 IYSKTTDVKMSIVPFSAGVRVLDPSVSSNRTLSWIDVNGNNSQHWLVFGDGSLVAATAKA 244 Query: 207 -----------PADRTVKSYSSQNGKVGIRDEKLSPYM-----VSCNKSLYYMLYPGPLD 250 +KS +S G + P V + + + D Sbjct: 245 AAKTAGFTSRFDIFTKLKSLNSSWDWGGCFEGPKYPLNVSDTAVDTSNAETLFVPFLAPD 304 Query: 251 PSLSEEHFVDSSSLRHV--IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +++ + +S + + N + Sbjct: 305 EPSTKDKYNNSLYTNNYLAETGGSCSGTVTGDWKLLTRACKYGKPKKDGSGAGPNSSCPT 364 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 S + ++ + T E G T +++ + TI + R + Sbjct: 365 SSSQTVLQLTATQSTITTKISGLTENGYTNLHEGFMWGWRTISPTGPFAAGRAYATKDNH 424 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 K IV +TDG N + + ++ G Sbjct: 425 KIIVFMTDGFNNWQSATSTVTGSAYQAAGYYSY 457 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 47/409 (11%), Positives = 95/409 (23%), Gaps = 18/409 (4%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + + + P L + +N ++ D + Sbjct: 172 AALRDAATSFVNNIYSKTTDVKMSIVPFSAGVRVLDPSVSSNRTLSWIDVNGNNSQHWLV 231 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET---VSRSYHKEHG 176 +A + SW E + Sbjct: 232 FGDGSLVAATAKAAAKTAGFTSRFDIFTKLKSLNSSWDWGGCFEGPKYPLNVSDTAVDTS 291 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + F +D L A+ + G + + Sbjct: 292 NAETLFVPFLAPDEPSTKDKYNNSLYTNNYLAETGGSCSGTVTGDWKL--LTRACKYGKP 349 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS-----VIRSIKKIDNV 291 K S S+ +++ + K + + + + + I Sbjct: 350 KKDGSGAGPNSSCPTSSSQTVLQLTATQSTITTKISGLTENGYTNLHEGFMWGWRTISPT 409 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 A D + G + G + N TA Sbjct: 410 GPFAAGRAYATKDNHKIIVFMTDGFNNWQSATSTVTGSAYQAAGYYSYNG---TANQRFP 466 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + + TD +T +E + C AK+ G+ I TI FSV Sbjct: 467 DGTATNGNGVNYQTTLEAAAGSSTDYHDTSRNMQDELTLEACTNAKTAGVEIYTIGFSVP 526 Query: 409 KTQ-QEKARYFLSNCASPNSFFE-ANSTHELNKIFRDRIGNEIFERVIR 455 + + +CA+ + + A LN F R+ + Sbjct: 527 VDPIDAQGLKMMQDCATDANHYFAATDVDSLNAAFASIGSGVGKLRLSK 575 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 52/472 (11%), Positives = 124/472 (26%), Gaps = 44/472 (9%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L+ G AL++ +L + G+ +++VR + L AA I +A+ + Sbjct: 3 LLSEQRGGLSPAFALMLVGVLALTGVAIELVRGYNGQSLLSAAADAVIYSAADSDTALQD 62 Query: 74 EVSSRAKNSF-----------TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + N + +Q + L + + D + A Sbjct: 63 AQALMQANLSGRNLQVATPRLSQGEQGAQVILQGKVPALMDLSAIDEGEEGMPIAAAARA 122 Query: 123 NPRKSAYQVVLSSRYDLLL---NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + +S +V L + ++ + L + + S G+ Sbjct: 123 SSARSRIEVALVLDISDSMSGAPMKAIKQGLVEFGEVLFGRERRNQERVVSIIPATGLVN 182 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + L + + + K + Sbjct: 183 IGDHPELFHPESIEIPFGLRTLAEERGWLNLLSREVPGRQRKAFCARLPEHVDGIDRVAE 242 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA-----SVIRSIKKIDNVNDT 294 + + K + + +D + Sbjct: 243 VTPGWIRKLEQAPRD-----ELRPHLFYSTKPPPIARYGDGTPLLAFAPKENPLDRYLEN 297 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F+D + R + +T + + + + Sbjct: 298 RRDKLGIFDDPDCGVSPIQAHLS--TRADYRQALDTLYAAFNTNTAEGVMWGWRLLSPEW 355 Query: 355 EDEV-------HRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTI 403 + R + +K +VL +DGE+ + + + + +C + K +GI+I T+ Sbjct: 356 QGRWRQGAAALPRPYELQDNRKIMVLFSDGEHMTEAALRDRKQLLLCREMKRKGIQIYTV 415 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 AF + CAS S + + + + + + Sbjct: 416 AFEGDTRFVA-------QCASDRSLAFKATKSNIRTVLTRLASSINDVVLTK 460 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 56/498 (11%), Positives = 137/498 (27%), Gaps = 118/498 (23%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G+ + A+ + M+ G +D R + ++ A++ A + Sbjct: 19 MVSDRGGNVALTVAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDAALVAAIKEI--DEY 76 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + A+ + + E+ T + ++ Sbjct: 77 DEDEIAEKIKDWFDAQSEKQSATY------------------DLTEITVDKSGHTITASA 118 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S + + IK+ + + E + SY + + VID S SML Sbjct: 119 SGTVP------TTLMTLADIKTVPVGVISAIEGPATSY-----LEVYIVIDKSPSMLLAA 167 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + L ++C + + P + ++ Sbjct: 168 TSEDQAMLRADA---------------------------NITCEFACHDTKDPVKKNGTV 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + V + + DA+ V+ + D + +++G + + + Sbjct: 201 IASTYYNYIKSLGVKLRTDVALDAVEEVLDMVDAADEDHARIKVGLYSLGETISEVLEPT 260 Query: 314 WGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + + + + M +T A++ + ++ + + K + Sbjct: 261 YSTSTARKKLSDDSSGLTSATSMSATYFQTALKALKKKVGTAGDGT-----SAASPLKLV 315 Query: 372 VLLTDGENTQDN--------------EEGIAICNKAKSQGIRIMTI-------------- 403 +LLTDG + + C+ K + + Sbjct: 316 LLLTDGVQSNRDWVIKWSGKYWGRVTPLNPDWCDYLKDNDATMAVLYTEYLAIPADWGYN 375 Query: 404 ----------------------AFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKI 440 S + T+ + L +CA S + F A S E+ Sbjct: 376 ATLAKSMGNSDWTSTWGGTLHSGVSSSTTRHDYIPIALQDCASSSDLFISAASEDEITAG 435 Query: 441 FRDRIGNEIFERVIRITK 458 + +R+T+ Sbjct: 436 LSTLFNQYLTS--VRLTQ 451 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 70/471 (14%), Positives = 149/471 (31%), Gaps = 110/471 (23%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 A+L+PV+LG+ G+ +D L+ A + ++ Sbjct: 1 MAVLLPVILGIMGLGLDSGMLYLSHSRLQAA-------VDAAALAGSLQLPYDPAMDKGL 53 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 + ++EY+ NF + ++ + P V +++ + + Sbjct: 54 VRAAVDEYMHANFPQAVVQS----------------VLPGAEERSVTVNAEATV----GT 93 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 +F+ ++GI S ++ +A A + + + +VID S SM + Sbjct: 94 IFMGALGIGSSTVRAQASAGYNN--------LEVVFVIDNSGSMKGSPINETNAAATRLV 145 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + S + G V R + P V + + ++ L Sbjct: 146 DLIMPEGMATSVKIGLVPFRGKVRIPADVDG-----------LPSGCRNADGSLNEDGLL 194 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 KK ND +R+ + + + + I + Sbjct: 195 DEYKKPE--------------YRYPYNDRLRVTPYS-CSSIPLTQGLTADRATITQAIGR 239 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-- 383 A ++ T I++ ++ A + + + +K I+LLTDG+ N Sbjct: 240 QDARGDSS--GTVISEGLKWARHVLTPEAPFTEG--SSAKDMRKVIILLTDGDTEDGNCG 295 Query: 384 ----------------------------------EEGIAICNKAKSQGIRIMTIAFSVNK 409 ++ AK GI I I + + Sbjct: 296 GNYSVYYRPNNYWTNAYYGMMDMDSHCEDGGVLNNAMLSEAALAKDAGIEIFAIRYGSSD 355 Query: 410 TQQEKARYFLSNCAS-----PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 R + AS + +F+A S ++++ +F+ IG ++ R++R Sbjct: 356 ---AVDRNLMRAVASSKEGTDDHYFDAPSPYDIDDVFK-LIGRQLGWRLLR 402 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 60/454 (13%), Positives = 131/454 (28%), Gaps = 73/454 (16%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I+ +GH ++ A+++P G+ + D R + L+ AA+ A++ + Sbjct: 4 IRKQSGHAAMLFAMMIPAFFGIFTLASDGARALQSKARLEDAAEAAVLAIAAH-NADNSG 62 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 SS + + I +Y+ ++ + T DI N +Q + Sbjct: 63 SSSGSAINKKIASDWIGQYM-QDMQAISDIKITKLNCNDIAECKEGLENGESRYFQYEIL 121 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 ++ N LS F + + +R + E V + +V DFS SM + Sbjct: 122 AKT----NHLSWFPGNNSTAGFGESFDVVGSATARKFQSES-VDVMFVSDFSGSMNNKWS 176 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 G V +K + + + + + Sbjct: 177 GGSNSRRYKDLIKII----------GDVIKELDKFNNAHTTTTNRVGFTGF--------- 217 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV--ISDPSF 312 + +K G + D + Sbjct: 218 -----------NTYTRKTADNSCYQDQYDRSAGRTVNKIFEVKGCKSRSSGGAKFHDIAM 266 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + K G TA + + ++ E ++ ++ Sbjct: 267 TDNY-----NEFKNTIKYFKPGGGTASYQGIIRGAQMMDAAPE---------PRPRRIMI 312 Query: 373 LLTDGENT----QDNEEGIAICNKA------------KSQGIRIMTIAFSVNKTQQEKAR 416 +L+DG ++ + +C+K K+ ++ + F N Sbjct: 313 ILSDGIDSKRSRANKLVEEGMCSKILLKLGNANTSDGKAIKTKMAVVGFDYNPASN---- 368 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 L+ C ++ + AN+ ++ + I EI Sbjct: 369 PSLAKCVGEHNVYGANNPEDVLNKILELISEEIG 402 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 115 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 71/526 (13%), Positives = 131/526 (24%), Gaps = 93/526 (17%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 TK + K+ K+ G + IITALL +L VD + L QA A + Sbjct: 7 LTTKLLSTVKQFSKNEHGVYAIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAAL 66 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV-- 120 Q + +Q+IE ++ + R + + Sbjct: 67 LLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRNFSNAKVQAQWKKRNQELVQGVCKLYL 126 Query: 121 ----------------------------------EMNPRKSAYQVVLSSRYDLLLNPLSL 146 + A V S + L Sbjct: 127 RSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNQNGTAKSVACVVQGSVQRKFWLPWGQT 186 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 + S + + + + + + V D S SM Y Sbjct: 187 LVSSNQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSGSMKWYIDRKGDAHKPNRRI 246 Query: 207 PADRTVKSYSSQNGKV---------------------------GIRDEKLSPYMVSCNKS 239 A V G + PY V +K+ Sbjct: 247 DALVEVVGEVQNILVTPRKIKEKCESLQSNGLLYLLQQVLAKKGDTSGCVLPYYVQQSKT 306 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y E + + K + +++ + + + Sbjct: 307 EYISELLRGRRGRTIREGLKEIERHMDIAKTVNQIKN--------FNNGEKQSYSFSFNN 358 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE-V 358 F + + V +G TA+ M + + S N + Sbjct: 359 GDFCLGGNEGKETTQAWFDQKKPNVSEALGKIEPLGGTAVTSGMLIGINLMTSKNSEPEA 418 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEG------IAICNKAKSQ------------GIRI 400 K N ++ +++L+DGE+ Q +E+ +C + K + R+ Sbjct: 419 APSKLNTNTRRVLLILSDGEDNQPSEKTLVNLMGAGLCREIKDKMNSLQDPKYGQVEPRV 478 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 IAF + C ++ S L F+ I Sbjct: 479 AFIAFGT--NLPDNQLNAWKQCVG-KHYYSVFSKQGLLDAFKQIIS 521 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 60/408 (14%), Positives = 118/408 (28%), Gaps = 57/408 (13%) Query: 14 LIKSCTGHFFIITALLMPVML-GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 ++ G F II + + +++ GG+ VD +R+ + ++ AI+ A+ Sbjct: 21 FLRDERGSF-IIFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAASLDQDLD 79 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + I + + ++ +N T+ EV D V T Sbjct: 80 PEEVVLDYFTKAGLGHVISQD-----DIDVFENQTNGEVADDVAVT-------------T 121 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + FLR + + T+ AE + + + S S Sbjct: 122 RRVEASVSALMPTTFLRLAHMYDLGLYTEGGAEEALSLSEISLVLDVSGSMGNSSSSGYS 181 Query: 193 QRDSEGQPLNCF------GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + F + + G + I + ++ + L + Sbjct: 182 KIYELRRAAKRFVNVMLCNPADADETEDCTLTEGDISINIVPYAEQVLLPSNLLQRFNHT 241 Query: 247 GPLDPSLSEEHF--------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S + V + SL + + I+ D T Sbjct: 242 SEHTESRCITFYEDEFDTVAVPTFSLDTFVTNGRPLPALYGDPIQLTGYFDPSGGTNSTP 301 Query: 299 ATFFNDRVISDPSFSWG-------VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N +D S S ++ D G+T+I+ M+ + Sbjct: 302 NPGSNSPCYNDYSGSTNDYWREIYPMGFSAEALRDEIDDLGASGNTSIDLGMKWGAALLD 361 Query: 352 SSNE----------------DEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + D +K IVL+TDGENT + Sbjct: 362 PAAQPAISDLVAANEVNEAFDGRPFEYTQRGIEKVIVLMTDGENTSQD 409 Score = 75.0 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 99/380 (26%), Gaps = 39/380 (10%) Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + + + N ++ + + + Sbjct: 253 YEDEFDTVAVPTFSLDTFVTNGRPLPALYGDPIQLTGYFDPSGGTNSTPNPGSNSPCYND 312 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 Y N + MG + ++ + + S + + G+ + + Sbjct: 313 YSGSTNDYWREIYPMGFSAEALRDEIDDLGASGNTSIDLGMKWGAALLDPAAQPAISDLV 372 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 +N + V + D + + + + S ++ Sbjct: 373 AANEVNEAFDGRPFEYTQRGIEKVIVLMTDGENTSQDY-LRRGYHSGPSGVYKANSSNDW 431 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 D+ R+ + + + + + WG Sbjct: 432 SVYDADDDRYYWV----------NGGQWLDHPYGEGEYEECHSVRQY----------WGY 471 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 V+ + G+T + + + + Sbjct: 472 WWWQYNWVEECEMVYEGNGATQLTFPQLW----LQKTANWYDQWNFLADAHDYF------ 521 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 ++ N+ IC AK+ G+ + TI F V+ +Q + + +CAS +++ + Sbjct: 522 -NYSEKNDNLDEICTAAKNAGMVVFTIGFEVSGSQHD----IMRSCASAPAYYFDVDGLD 576 Query: 437 LNKIFRDRIGNEIFERVIRI 456 ++ F I EI + +R+ Sbjct: 577 ISAAFA-AIAREISK--LRL 593 >gi|307249749|ref|ZP_07531728.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858257|gb|EFM90334.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 530 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 51/522 (9%), Positives = 142/522 (27%), Gaps = 90/522 (17%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSR------AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI----VRDT 118 ++ + +N ++ + + +K + T Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNEEKMHLIPTCKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHG 176 + N + ++ + S F +G A + Sbjct: 127 KTDTNKKGHTSSSEVTCTVSGTIEHKSWFPLKVGSVEVIPHEVNVASKSKAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + V D S SM + + + + + + + Sbjct: 187 IDLMVVADLSGSMNYDLSNKNEIVGSPMSKLGILQDVLSELAEKTLLSEEANHNNRIYVT 246 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-------LASVIRSIKKID 289 +L + S + ++ L + +++ + +++ + D Sbjct: 247 PFALGAEISSSNCAIPYSWDMNKNNQELENAKNTLSNSKNSQYYRAEFINNLVYKLNTRD 306 Query: 290 NVNDTVRMGAT--FFNDRVISDPSFSW----GVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + ++ + ++ + G+T + + Sbjct: 307 TLTNIGGQKDYKLKYSKGAFCLKNMRTQNKGWYSHNNKSDFIKYVKALKADGATLASSGL 366 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------------------------- 378 A + +I S + + K+ I++L+DG Sbjct: 367 LVAANNMIKSGSRTK---ELGEQTKRVILVLSDGNDEIVKGEISGIPFLNYTRITENLIY 423 Query: 379 -------------------------NTQDNEEGIAICNKAK--------SQGIRIMTIAF 405 NTQ E +CN + + +I+ + F Sbjct: 424 GKQEVFLSQKQKISLSHSTIDTYLTNTQPKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEF 483 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +AR +C +++ AN L F+ IG Sbjct: 484 GY----ASRARKAWEHCVGNGNYYSANDKESLLNSFKQAIGE 521 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 47/356 (13%), Positives = 102/356 (28%), Gaps = 31/356 (8%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 + M+ VGG+ +D++R L+ A++ A+ +V + + Sbjct: 6 MIAFLLSMVAVGGIGIDLMRMERDRTILQYTLDRAVLAAADLDQPLPPDVVVQDYLNK-- 63 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 + EY D +T + +SR + ++ L Sbjct: 64 --ANLSEYYQPPIAETGIGYKRVESTIDTTFETQWLDFSGGQDMPLYANSRAEESIDGLE 121 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + L S ++ + + V+ S S++ Y Sbjct: 122 ISLVLDVSGSMNSNSRLYNLKNAARDFIDTMVANTADNKMSVSIVPYATQVSLPKDMLDQ 181 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + + N S N+++++ + D+ Sbjct: 182 YNVTDEHEYSNCVNFTGTHFTSTGLSTTASLNRTMHFTPW-----------WSGDARPSN 230 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +I+ A V+ K + + D ++ + N + WG L + Sbjct: 231 GLIQYPVCDERAHREVMPFQKDANRLKDFIQNLQAWGNTSIDV--GMKWGTVLLDPSAQP 288 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + + + + N+ E K IVL+TDG+NT Sbjct: 289 VISALTSSSVN--------------VPGVFADRPAAYNDTETVKVIVLMTDGQNTS 330 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 37/426 (8%), Positives = 104/426 (24%), Gaps = 32/426 (7%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 +++DV L A + + + S + + ++ L + Sbjct: 124 LVLDVSGSMNSNSRLYNLKNAARDFIDTMVANTADNKMSVSIVPYATQVSLPKDMLDQYN 183 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + + + + + + P + ++ Sbjct: 184 VTDEHEYSNCVNFTGTHFTSTGLSTTASLNRTMHFTPWWSGDARPSNGLIQYPVCDERAH 243 Query: 159 QTKAEAETVSRSY-----HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + + + + + + + + ++ + Sbjct: 244 REVMPFQKDANRLKDFIQNLQAWGNTSIDVGMKWGTVLLDPSAQPVISALTSSSVNVPGV 303 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 K+ M + Y + + + +SS + + Sbjct: 304 FADRPAAYNDTETVKVIVLMTDGQNTSQYYVESDHRGGNAPVFYNSAASSDKARYSTYNP 363 Query: 274 VRDALA-SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + + R + + + + + Sbjct: 364 GNQRYYWDKMNRWEDHAYGQGEYRECGYYNCWWQDEQGTPATVTQLTHAELFARTS---- 419 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ++ Y + + + + Y V + G + N AIC+ Sbjct: 420 ----------LRYVYQRLFA----DWMGNSAAKNSWYYGVYDSWGT-STKNARTKAICDA 464 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 AK++GI + TI F L +CAS ++ + E++ F R Sbjct: 465 AKARGIVVYTIGFEAPSGGVS----VLKDCASSDAHYFDVQGLEISDAFASIAT---SIR 517 Query: 453 VIRITK 458 +R+T+ Sbjct: 518 QLRLTQ 523 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 38/372 (10%), Positives = 91/372 (24%), Gaps = 32/372 (8%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I+ G +++ + + L +GG+ +D +R +L+ A++ + ++ Sbjct: 19 FIQEEDGTVTVLSFFIFVMFLMMGGIGLDTMRQEMARASLQATLDRAVLAGATASTEAGA 78 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + L + T + Sbjct: 79 RTIVEDYFAKSGQSDYLLAQKDGDISTTLNAAKVTAGAELSLD-TYLMKLAGVPTLSASG 137 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 ++ ++ + L L S +K + + ++ D S++ + Sbjct: 138 TATAEVRIPKLEAILVLDVSGSMASNSKIQNLQTAAKDFVTTVMNSSKPGDTVMSIVPFS 197 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + T + K S S + Sbjct: 198 FSVTPPQSVFDALAVEETHNYSTCLEFKENDYQHATLSSGSSSLSSGIPV-NQMVYTSVY 256 Query: 254 SEEHFVDSSSLRHV---IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + +DS + ++ + I + + Sbjct: 257 GDFDNLDSGWRSCYTDEYIRILPYSTSITDLHAKIDALQP------------AGNTSGNE 304 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 +WG L T + A +S V + E K Sbjct: 305 GMNWGAALLDPTFREVTASMIAAGH---------------LSETLANVPSDYDEPETLKA 349 Query: 371 IVLLTDGENTQD 382 I+ + DG NT Sbjct: 350 IIFMGDGANTTS 361 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 26/237 (10%), Positives = 61/237 (25%), Gaps = 18/237 (7%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL----RHVIKKKHLVRDALASVIRSIKK 287 +M + Y + ++ ++ +S Sbjct: 352 FMGDGANTTSYFFDRSSPKYRGKFSDLYEVRFQERVFKYAYNIYNVDWKKYGDDGKSRCS 411 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN------D 341 + + V + G + A N ST Sbjct: 412 QNRWECVYDVAENSPEYSVYYLRNPDTGKFWSVAEEKWIEANTFNNFESTMDGFISRTQL 471 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A+ + + ++ + N +C K++G+ + Sbjct: 472 DWEMAWGLMSPEYYGQTTGNWGPWNDYIGSEFVS---GSMKNGLMQNVCKATKTEGVVVY 528 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +I F V + + LS CAS + + S ++ F + +R+T+ Sbjct: 529 SIGFEVPVNGTAENQ--LSACASSPAHYFRASGTDIKSAFSAIAA---NVKQLRLTQ 580 >gi|209546922|ref|YP_002278840.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538166|gb|ACI58100.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 462 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 62/515 (12%), Positives = 147/515 (28%), Gaps = 133/515 (25%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ + L + TG+ I+ AL + ML G D +R ++ A+I A Sbjct: 19 RYFHTLRGLRRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAV 78 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + +++ + N Sbjct: 79 KQI-NNTADADALKAKVSDWFHAQVDNSYTLGEIDIDTANH------------------- 118 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + F++ I + + + + + SY +++ VID Sbjct: 119 --------NITATASGTVPTTFMKIANIDTVPVSVGSAVKGPATSY-----LNVYIVIDT 165 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML S + + GK Sbjct: 166 SPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGK------------------------ 201 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + S+ +++ + + DA+ V+ I D+ ++ +++G D Sbjct: 202 -------KTYANNYEYSTAKNIKLRADVAGDAVKDVLSLIDTSDSNHERIKVGLYSLGDT 254 Query: 306 VISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + +T + ++ T + + + Sbjct: 255 LTEVLAPTLSTDTARTRLSTASYGLTSATSKAATYFDVSLATLKQKVGAGGDGT-----T 309 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI-----------------------CNKAKSQGIRI 400 + K ++LLTDG ++ ++ C K+Q + Sbjct: 310 SGTPLKLVLLLTDGVQSKREWVTDSVVWKSGQAVSGAYWNKVAPLNPDWCAYLKNQSNTM 369 Query: 401 MTI------------------------------------AFSVNKTQQEKARYFLSNCAS 424 + S + T+++ Y L++CAS Sbjct: 370 AVLYTEYLPITSDWGYNATVGSTMASANWKNTWGGTMDSGVSTSITRRDYIPYALADCAS 429 Query: 425 PNSFF-EANSTHELNKIFRDRIGNEIFERVIRITK 458 S F A+S+ E+ + +R+T+ Sbjct: 430 SKSLFLSASSSTEITAGLSTLFTQYLSS--VRLTQ 462 >gi|156977400|ref|YP_001448306.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156528994|gb|ABU74079.1| hypothetical protein VIBHAR_06187 [Vibrio harveyi ATCC BAA-1116] Length = 515 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 56/503 (11%), Positives = 127/503 (25%), Gaps = 70/503 (13%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I LL+ ++G V+ R+ L+ +A+ A + ++ + Sbjct: 16 GVAAIWMGLLLVPIMGFTFWAVEGTRYVQESSRLRDSAEAAAMAVTIEDQPGAARALATK 75 Query: 80 KNSF-----------------------------TFPKQKIEEYLIRNFENNLKKNFTDRE 110 + + + + +E Sbjct: 76 YVENYVRDIKSTNLSAQRFYQAEDKGTGALEYIQYTVNARTTHDSWFASSFIPSFDKQQE 135 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + + +V S + +N + I + + Sbjct: 136 LAGRSLARKYPAYLGDNNIDIVFVSDFSRSMNDKWGSSWNKKIDDLKTAIDQISNNILCK 195 Query: 171 YHKEHGVS--IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 ++ V + V D + F V + Sbjct: 196 STRQEYVDGEWKDVCDEPGDDTTSDKLLNRVGFVPFNVRTREIVAGNRANATSQLSYKNG 255 Query: 229 L----SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR---HVIKKK----HLVRDA 277 SPY + Y+ Y + + + + K+ R A Sbjct: 256 YKAYLSPYSYNDVDWNYWRTYTSSEVNNCAYWQSYCQNPRAENHNYAKRIRDVLEADRYA 315 Query: 278 LASVIRSIKKIDNVND------------TVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +A V + +V+ G FN S S + + Sbjct: 316 VADVYNYVDLPTSVSTMFTDKSGLKANFYGVSGTRLFNAHGSSYSSQFHNIQLSNKLSDL 375 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDN 383 G TA + + + + + + KK +++L+DG+ + DN Sbjct: 376 DSIKSMWADGGTAAFQGILRGSQVLHEGDPNSSDQEEQQAYNKKIKMLLILSDGQESPDN 435 Query: 384 EEGI-----AICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANST 434 +C+KA+ + G+ I I + + +C + ++ Sbjct: 436 GILKGLVDWGMCDKARQEIPGLYIGVIGI----DFRASQQSGFQDCVVDPREDIIDVSNL 491 Query: 435 HELNKIFRDRIGNEI-FERVIRI 456 EL + + I + ++ Sbjct: 492 DELIEKIEELIRKGSKTSGITKL 514 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 113 bits (282), Expect = 5e-23, Method: Composition-based stats. Identities = 62/464 (13%), Positives = 136/464 (29%), Gaps = 111/464 (23%) Query: 32 VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 ++L V G+ VD+ L+ A + S+ L + S +Q + Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGSLELPYDPD-------LSKGIVQQAVS 53 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + + N + + ++ ++P VV++++ + L M Sbjct: 54 DMIHTNMPDAVVES----------------VSPGTEVRSVVVTAKAKVNLL-------VM 90 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G + Q + + I +VID S SM + + Sbjct: 91 GFLNLADQWVEAGAAAGFNK-----LEIVFVIDNSGSMKGTPINLVKEASIGLTDLLIPD 145 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + ++ G V R + V + + V++ + Sbjct: 146 GQQPDTKVGLVAFRGKVRLGGDVDG-----------LEAGCRNADGSVNTGIHEDFMSMY 194 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + I + S ++ I A Sbjct: 195 WALSSYYRNQIDLDT----------------CSSIPESRPLSQDKGDIVEGINSQTA--L 236 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--------- 382 T I++ ++ A + + + + +K +++LTDG+ Sbjct: 237 GSASGTVISEGIKWARHMLTPEAPYT--QAGDKKDFRKIMIVLTDGDTEDGECGGSYRAS 294 Query: 383 ---------------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 N++ +A AK +GI I I F V+ Sbjct: 295 FRPNNYWTNAYYGMGVDTAHCQDGGVLNQDMLAEAQLAKDEGIEIFAIRFGVSDN---TD 351 Query: 416 RYFLSNCASP-----NSFFEANSTHELNKIFRDRIGNEIFERVI 454 + AS + +F+A S +++ +F+ IG ++ R++ Sbjct: 352 ISLMKQIASSKAGTNDHYFDAPSVYDIPDVFKK-IGKQLGWRLL 394 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 71/470 (15%), Positives = 136/470 (28%), Gaps = 78/470 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K L G+ IITAL+ ++ + GM +D + ++ A A + A P + Sbjct: 8 FKALCGDRRGNIAIITALVSLTLIFILGMGIDYGLAIDRKSQMESYADAAALAAVTPAMV 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + S+ F Q + + N++ + + Sbjct: 68 AAGQSSAITTAQNVFNAQALTMTGVTYNANDVTVSIATSGDKR----------------- 110 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 ++ ++ MG S I +A A T + ++D S SM Sbjct: 111 ---TATVQYQAQSQAMLPDVMGFGSIKIGGQATATTTIAPN-----IDFYLLLDDSPSMA 162 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 S + +N + +SL Sbjct: 163 IAATQSGINTMVANTTAQGGCAFGCHEENPSADKLGNPYGEDNYALARSL---------- 212 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-TVRMGATFFNDRVISD 309 V + ++R A ++ + + + T RM F+ + + Sbjct: 213 ---------------GVTLRIDMLRQATQDLMTTAQTTETQKGTTYRMAIYTFDIGLNTI 257 Query: 310 PSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL- 365 + + + + I N + ND T YDT ++ + N Sbjct: 258 GNLTSDLSQAQTEAGNIQLLEVYSNNWLTQNDYNDDEDTNYDTALNGINAIMPNPGNGTG 317 Query: 366 ----EAKKYIVLLTDGE-------NTQDNEEGIAICNKAKSQGIRI---MTIAFSVNKTQ 411 ++ + +TDG N Q + +C K++GIRI T + Sbjct: 318 AAGDTPQEVLFFVTDGVEDEDVNGNRQQSLLNTDLCTAIKNRGIRIAVLYTEYLPLPTNS 377 Query: 412 ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 Q L CASP +FE S +++ + Sbjct: 378 WYNTYIAPFQNSIAPTLQQCASPGLYFEVKSGGDISAAMSALFQTAVQSS 427 >gi|296446920|ref|ZP_06888856.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296255595|gb|EFH02686.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 486 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 62/507 (12%), Positives = 143/507 (28%), Gaps = 71/507 (14%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + T+ + + + G+ II AL +L G VD S + L +A Sbjct: 1 MIRTRIAAAAARFAGAAEGNVAIIFALAAIPLLIAAGGAVDFAIASRVQTQLYAICDSAT 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + A+ P + +++ + F Q + +R + T Sbjct: 61 LAATTPAMMQQTTATAKTVATSMFAAQVAQ---------------INRLTYNSANLTVTV 105 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + ++ + L + F + + + KA + + + Sbjct: 106 NDDTSASPVKTRTVTVSYLAQVGNAFGSFYHVPTSIFTVKASSTASTARN-----IDFYL 160 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 V+D S SM + + ++ S + + K+ Sbjct: 161 VLDNSPSMELPATTAGLASMTAATGCVFACHENTYSDPENTVQYPGYGTIDSYTYAKNAG 220 Query: 242 Y---------------------MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 M G + D++ L+ + A+++ Sbjct: 221 IALRIDNVREAAKRLASTSQAMMSANGATYRLAAYAFNYDTTQLQALTSTTSANVSAIST 280 Query: 281 VIRSIKK--IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 I ++ ++ N + + + + + G A+ + EM T Sbjct: 281 SINAMTPPLMEKNNYLPTGASYTYPTSASTWTTVTLGSDPTKTNYNVRDAMTDIEMTLTK 340 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---------------N 383 +N AM + +S + + + T+ T Sbjct: 341 VNAAMPNPGNGTTASGDKPQEVVMLVTDGMVDGSFYTNTSCTNYASSYSNSYGTFYRCLR 400 Query: 384 EEGIAICNKAKSQGIRIMT---IAFSVNKTQ---------QEKARYFLSNCASPNSFFEA 431 +C K++GIRI I + L +CAS + +FE Sbjct: 401 PLDTTLCTTIKNRGIRIAVLNLIYYPTPGYGFYDGAVAPFISTVSPALKSCASTDLYFEV 460 Query: 432 NSTHELNKIFRDRIGNEIFERVIRITK 458 ++ ++++ ++ +T+ Sbjct: 461 DTGSDISEAMTYLF-QKVVTTASYLTQ 486 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 37/389 (9%), Positives = 94/389 (24%), Gaps = 28/389 (7%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + + I A+ + +LG G VD R + +++ A +A++ S Sbjct: 9 RIRNSISRFDRDIRANIAPIFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVLMVS 68 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S + + +++ F + +V Sbjct: 69 RDAAANPAMTSQQ-------------------ITDAVQRYFNSLYNDKSAFNVSVSAAYT 109 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 S + F++ G T + + + + V+D Sbjct: 110 PSTSSAAAKILASGQGAIETDFMKIAGFPQLSFGTSSTSTW------GNSRMRVALVLDN 163 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + SM D + + Q + + I K S + + Sbjct: 164 TGSMRDNGKMAALQRAAKDMIDSLSAFAKTADDVYISIIPFAKDVNVDKSNYNAAWINWT 223 Query: 246 PGPLDPS--LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 +P + E + ++ K + + F Sbjct: 224 EWEAEPPILIEENYPINVRYNGLTYDKWEDIGPGAPCPFDTRNNGSPRPTQNSRVGKFSF 283 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMK 362 + S S + + I + + Sbjct: 284 ACMDRPGSLSGATDLSSLYTNRYLIPSDPSGKYYGMICPGIDGGAKFPRKRDIYYNGCYT 343 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + ++ + +V+ T + + C Sbjct: 344 SVVDTDRSVVVSTGSGASCPSTTPNCSCT 372 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 42/277 (15%), Positives = 85/277 (30%), Gaps = 34/277 (12%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-----SLR 265 K NG + +VS + S + + R Sbjct: 331 PRKRDIYYNGCYTSVVDTDRSVVVSTGSGASCPSTTPNCSCTGSGRNRKCTQAKYKHYWR 390 Query: 266 HVIKKKHLVRDAL---ASVIRSIKKIDNVND----------TVRMGATFFNDRVISDPSF 312 + +DA ++ I D D + F+ +++ Sbjct: 391 AHPTDTNQAKDAAPAHSTWTGCINDRDQAYDISNADPSSGSSGTPSTKFYAEQLNGCLPA 450 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN-EDEVHRMKNNLEAKKYI 371 + + +K + GST + + T+ ++N + Y+ Sbjct: 451 TITPVSSQSSTLKNQIDSMSPSGSTNQAIGLAWGWQTLSTTNGPFPAPAKDKAYVYQDYL 510 Query: 372 VLLTDGENTQD-------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 VLL+DG NT++ + +C K K G I T+ +V ++ Sbjct: 511 VLLSDGLNTRNRWSGNGSDHSPEVDVRQALLCQKVKDSGTVIFTVQVNVGN--RDPLSQV 568 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L +CAS +F S ++ F++ + R+ + Sbjct: 569 LQDCASNGNFQMITSANQTADAFQNILTQISQLRIAK 605 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 113 bits (281), Expect = 7e-23, Method: Composition-based stats. Identities = 54/449 (12%), Positives = 140/449 (31%), Gaps = 63/449 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK+ G I+ L+P ML + + + + QA+ + I + Sbjct: 5 IKAQQGSISIVFIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACA--------- 55 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ + Q +Y NF +++ T + + + S Sbjct: 56 ---YSQRANVSLTQGFAQYYKPNFISHVNAQSTFLSGQKQCKIQIGYAFTPLLKDLLPAS 112 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 S+ + + ++ + + + + + V+D S SM + Sbjct: 113 SQNKV-----------------HASVQIQSTSTLTVHSEIKPMDLSLVLDISGSM-SGRI 154 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + +N Q ++ + IR + P+ + S L Sbjct: 155 GLLKRIINQAIQNIEQ-----QNTKNNTQIRFS-IVPFSSGVSISNAPWLAKSK-----G 203 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + VD+ S + + + + ++ ++ + Sbjct: 204 KALCVDAMSYPGNVLNTAQTVADIDTHPSKLNIRAKEPLSLINDCNVYS----LLLPLTN 259 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH-RMKNNLEAKKYIVL 373 + K+ + + + GSTA T++ + + + + + + + ++L Sbjct: 260 NLSKVRKHVDSLSIL-----GSTASYQGFIWGVRTLLPNWQKAWNLQPETSSLLSQRLIL 314 Query: 374 LTDGENTQDNEEGI----AICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 TDGE+ ++ +C + + I I I F ++ + ++ C Sbjct: 315 FTDGEDDSRDQFDKLVRSGMCQRIQDDFNIDISFIGFGLSPRRLDQ----FKKCIGSNGK 370 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++A + +L K F + + + + Sbjct: 371 GVVYDAKNGSDLEKFFAEALLLDTSAHLS 399 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 50/455 (10%), Positives = 121/455 (26%), Gaps = 53/455 (11%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 Y + + F+++ L + ++LG+ + A +Q A + A Sbjct: 2 RYPSSVFRHKAHLSFVVSGLTLAILLGLSACS--------DKAAEQQTP--AELAA---- 47 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + +++ + + + + + Sbjct: 48 --------QAKLAAEQQAERQANRQRDAAIAMHEQASSAKLRTMSAESRAYIAQPTASIS 99 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 L+ + + P ++ + T + +++ ++ Sbjct: 100 AAPALNGDWPGAVPPERNRFEKQVQNGIMVAGEIPVSTFAIDVDTGSYTTLRRML-KEGR 158 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + LN F K+ + + + + M+ L Sbjct: 159 LPQKDTLRVEEMLNYFSYDYPLPGKNDAPFSVTTELAPSPYNDDMMLLRIGLKGYEQSKA 218 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S K L++ AL + + + D V+ V + Sbjct: 219 ELGASNLVFLLDVSGSMASPDKLPLLQTALKMLTQQLDAQDKVSIVV---------YAGA 269 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + + GST +Q AY + + Sbjct: 270 AGVVLDGAAGNDTQTLNYALEQLSAGGSTNGAQGIQLAYQL--AQKHFVEGGINR----- 322 Query: 369 KYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++L TDG+ + + K QGI + T+ F + Sbjct: 323 --VILATDGDFNVGTTNLDELIDLVSARKQQGIGLTTLGFGMGDYNDHLMEQLADK--GN 378 Query: 426 NSFFEANSTHELNKIFRD-------RIGNEIFERV 453 + +S +E K+ + I E+ +V Sbjct: 379 GQYAYIDSINEARKVLVEHLSATLLTIAKEVKVQV 413 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 112 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 66/489 (13%), Positives = 146/489 (29%), Gaps = 106/489 (21%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + +++ G I L + + + G VD + S + AL QAA ++ L ++ Sbjct: 3 RSFLQNRRGGVAIFFGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVL---TALKEA 59 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E++ + + +++ + N + T + + Sbjct: 60 REQLKQGKPDWQSIAEKQGGKAFTNNASKIGGVSGTGATINLSLSGG------------- 106 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 VLS + N + FLR G+ + ++ A A + Y I +VID S SM Sbjct: 107 VLSGSLNYAANAPTHFLRIAGLNTINLKGSASATMSAAQYR-----DIHFVIDVSASMGI 161 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ Q + A + ++ + + Sbjct: 162 GATKADQQAMQNSVGCAVACHHAEAADPATDNLAAVRAI--------------------- 200 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + +VR A+ + I + + R+ F++ + + Sbjct: 201 --------------GATLRIDVVRKAVMDALAKI----PNDGSTRVAIHSFSNSLKTVFP 242 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT-------AYDTIISSNEDEVHRMKNN 364 S + I +EN G T + ++ A + + +S + + Sbjct: 243 LSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLNNLLASAGNGLTASQPRGFVLLATD 302 Query: 365 LEAKKYIVLLTDGEN------------------------TQDNEEGIAICNKAKSQGIRI 400 + DG A C+ K++G + Sbjct: 303 AVEDSSLFFYADGVAPPFARQWVEPNFVVGNPSYFAWGLHYVQAPDAANCSAIKAKGYTM 362 Query: 401 MTI--------AFSVNKTQQEKAR------YFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 MT+ + +++CAS P+ +F A S E+++ + + Sbjct: 363 MTLETEYLIPDGVYNPTFDAVRGDMGPAMTKSMTDCASAPDYYFHAESPQEIDRAVQTMV 422 Query: 446 GNEIFERVI 454 + + Sbjct: 423 SKTVNLSLT 431 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 62/429 (14%), Positives = 115/429 (26%), Gaps = 60/429 (13%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A L+ VML + DV L + A L ++ + Sbjct: 16 RRGAMLVLIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRGQA 75 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 R F K I ++ +++ TD + A ++ ++ Sbjct: 76 RVVAKDIFSKNLIGGRELKLHNKDIEFGRTDANPDGTWEF--LPNERPFQAIRISVNLDD 133 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + G + ++ T S I +D S SM + + Sbjct: 134 NRQKGRNGSVPLLFGK----VLGQSSFATNHSSVAANLVHEIVLCLDRSHSMCFDETGVD 189 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 Y Y Y P +P S Sbjct: 190 ----------------------------------YAYPPGTPSYPAGYITPPNPVGSRWA 215 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + V L V+ I F V+ D ++ Sbjct: 216 KLQGAIQVFVDTLDDLQIVPDVGVVTWGSDITLSWSWYPFQGRSF-PAVMVDVPLGQNLN 274 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + I G T ++ + + + ++ + A+K I+L++DG Sbjct: 275 LVSPAIAAKLGDIMM--GGTNMSSGIDRSVSLLTAN--------GTHSLAQKTIILMSDG 324 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 + + N A + I I TIAF + + A + FF A Sbjct: 325 QWNAGR-NPLDAANDAADKNITIHTIAF------LNGDQSVMRQIAERTGGKFFNAPDGE 377 Query: 436 ELNKIFRDR 444 L F++ Sbjct: 378 SLEDTFKEL 386 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 49/189 (25%), Positives = 86/189 (45%), Gaps = 13/189 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + ++ ++ IK I +VN+ VR G F+ +++ +WGV ++ Sbjct: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV-----QHIQEKI 244 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +T ++ AY+ I + E H K + + KKYI+ LTDGEN+ N + Sbjct: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 Query: 389 ---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 CN+AK +G + I A FL NCASP+ F+ ++ +L+ F Sbjct: 305 SLFYCNEAKRRGAIVYAIGVQAEA-----ADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 Query: 446 GNEIFERVI 454 + +R++ Sbjct: 360 KEMVKQRIL 368 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 81/234 (34%), Gaps = 4/234 (1%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + +C G I+TA+L+PV+ V G++++ + + L +++ + ++ Sbjct: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + F + I+ +F N L++N +++ +I R T++ + Sbjct: 66 QENGNNGKKQK-NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 LS+ + + S + +S + + V+D S SM Sbjct: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG--LDMMMVLDVSLSMN 182 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 D+ L + + S + L + ++ Sbjct: 183 DHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 52/458 (11%), Positives = 117/458 (25%), Gaps = 61/458 (13%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT----------------- 59 G AL++ +L + G+++++VR + L AA Sbjct: 6 RQGGGLSPTFALMLTGVLALTGVVIELVRGYSGQSLLSAAADAVLYSAADSDSAAEDAAA 65 Query: 60 ---AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A + L +S + + + ++ + + R Sbjct: 66 LVRANLAG-RHLQVGPPALSQNEQEAQVILQGEVPALMALSAIGTSGDLPVAAAARASSA 124 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL-RSMGIKSWLIQTKAEAETVSRSYHKEH 175 T +E+ + + + L+ F G + + + Sbjct: 125 RTRIEIALVLDVSNSMSGAPMKAIKQGLAEFGEVLFGRERRNQDRVVSIIPATGLVN--- 181 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + + S+ + R V + + Sbjct: 182 -IGDHPELFHPESLTFPFGLQTLAHERGWSNLLTREVPGRQRKAF---CARLPEHVDGID 237 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-----LASVIRSIKKIDN 290 L E H K ++ L + ++ Sbjct: 238 RLAELTPGWIRKLELAPRGEAQ-----PRLHYSTKPPAIQQYEDGTPLRAFAPRENPLER 292 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R F+D + R + + +T + + + + Sbjct: 293 YLENRRDKLGIFDDPDCGVSPIQAHLS--TRAAYRQALDTLHAAFNTNTAEGVMWGWRLL 350 Query: 351 ISSNEDEV-------HRMKNNLEAKKYIVLLTDGENTQDNEEGIA-----ICNKAKSQGI 398 + R + +K +VL +DGE+ +C + K +GI Sbjct: 351 SPQWQGRWQQGAAELPRPYGQADNRKILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGI 410 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTH 435 ++ T+AF + CAS S ++A S + Sbjct: 411 QVYTVAFEGDARFVA-------QCASERSLAYKATSGN 441 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 61/445 (13%), Positives = 123/445 (27%), Gaps = 79/445 (17%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + + + ++ S G I+T L+ ++ V G VD++R+ L+ A A++ Sbjct: 16 ENRSSRHLRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLA 75 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI--------- 114 A+ + V S I + + + E L + +D + Sbjct: 76 AADLDQELCPRVVVNDYISKEGFDPGIIDEIKVDPETCLNTDSSDSDGDGTDSSDASGSD 135 Query: 115 ---------------------VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 Q LN + F++ G+ Sbjct: 136 SDPSDTASSGTESGSDGTSSGGDTAGTSTTTNAVELQGKRKVEASAQLNIETHFMKWSGV 195 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + + AE + V I V+D S SM + + + F Sbjct: 196 DTINSTAVSAAEESIGN------VEISLVLDVSGSMEGAKLTNLQKAAKDFV-----KEM 244 Query: 214 SYSSQNGKVGIRDEKLSP-YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 S + + I S V G + E D +++ Sbjct: 245 LEKSADDSLSISIIPYSEQVGVPDYMMDKINTTGGNKVANCIEFQPADFTAIPFTAFSIG 304 Query: 273 LVRDALAS---------VIRSIKKI--------DNVNDTVRMGATFFNDRVISDPSFSWG 315 +A + ND V + + + + + + Sbjct: 305 APSEATNPPPSVPQSLHFTNRSNDFRRGGNRDHRSTNDVVSRFSPWDANFPCREDTPTDR 364 Query: 316 VHKL----IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE---------------- 355 + + + GST+IN ++ + S + Sbjct: 365 REMVVIQNDLDTLNKQINNLVAAGSTSINIGLKWGLALLDESIQPLIKTVANDTNVPKIF 424 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT 380 ++ R N + K +VL+TDG+N Sbjct: 425 EDRPRPTNTTDTLKVVVLMTDGKND 449 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/315 (9%), Positives = 85/315 (26%), Gaps = 17/315 (5%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + + + S G+ + + + + + Sbjct: 367 MVVIQNDLDTLNKQINNLVAAGSTSINIGLKWGLALLDESIQPLIKTVANDTNVPKIFED 426 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 R + + V + D K + Y + ++E+ V Sbjct: 427 RPRPTNTTDTLKVVVLMTDGK--NDLQRAVVPPYNDGPSDVFWNAGAQEYSVLVDEDNSQ 484 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + ++ + + R Sbjct: 485 FLWPDV---SINEWTNKDTSNQKYWEREEYQNHAYGAGTTRYRRCKTFYTSTGRCASFYD 541 Query: 328 AIDENEMGSTAIND---AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + + T A+ I + + + + ++ NE Sbjct: 542 EVRDATELGTQPVRLEWPDVWAHTEIKAIESLFRRTKGDTYADDFIRNSIVTADISKKNE 601 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRD 443 + +++C KA+ + + I +IAF + + L +CA P ++EA ++ ++F Sbjct: 602 QVVSLCGKAEEKEVLIFSIAFEAPSSV----KQMLKDCAVKPARYYEAT-GTQIERVFDS 656 Query: 444 RIGNEIFERVIRITK 458 + +R+T+ Sbjct: 657 I---STSIQNLRLTQ 668 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 66/541 (12%), Positives = 145/541 (26%), Gaps = 95/541 (17%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT-- 63 + +F +L ++ G+ I A M + G+ G +D+ R + L+QA ++ Sbjct: 7 RALFILMRLYRNQAGNTLAIVAAAMLPLAGMVGGALDISRGYLAKTRLQQACDAGVLAGR 66 Query: 64 ---ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD-TA 119 S ++ R SF +P + L N + + TA Sbjct: 67 KVMGSSGVLSDSVRDEVRKYVSFNYPSGYLGSTLATTDINPTLGSNDQIALSLTTAIPTA 126 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL----IQTKAEAETVSRSYHKEH 175 V ++ + S + + + L S + Y + Sbjct: 127 VMRLFGRNNMSITASCTARNDYSNIDIVLVLDTTGSMACKPERNDSDCSTWAGSRYVTQW 186 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK------------VG 223 + F ++ + ++ + Sbjct: 187 VAGLGRDATFVPEEMNSGVNVSRMQGLRTALANLQSQMATIETQFNMTEESKRKRVRWAI 246 Query: 224 IRDEKLSPYMVSCNK---------SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK---- 270 + ++ S S ++ + ++ S + Sbjct: 247 VPFSQMVNAGFSQGSAGTTLYSRHSDWFNRTGKYYNSGYIYDNPTHSDTWLANTWDGCVE 306 Query: 271 ----------------KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + + D+ T + + + Sbjct: 307 ERRTSNAITLTSGHSIPNNLPNTADDLKFDSTPTDSNTRWTVADPTRASGQYACPKAMRE 366 Query: 315 G--VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM--KNNLEAKKY 370 + TF G T ++ + A + + +Y Sbjct: 367 LQQMTATDFNNYFTFNNGFIPNGGTWLDVGLLWAARLLSRDGLWSTENDELYHTYPVSRY 426 Query: 371 IVLLTDG---------------------------ENTQDNEEGIAICNKAKSQGIRIMTI 403 ++ +TDG +N + C K+ +I TI Sbjct: 427 VIFMTDGYMSIGSSNYAAYAQEDYWRRVAAAGASKNDNHYARMLMTCTAIKNMDTKIYTI 486 Query: 404 AFSVNKTQQEKARYFLSNCAS------PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +F T L NC+S P ++A+S+ +LN++FRD IG I +R++ Sbjct: 487 SFGAGSTLDSN----LINCSSSTNTTNPEFAYKADSSSDLNRVFRD-IGENIGS--LRLS 539 Query: 458 K 458 + Sbjct: 540 Q 540 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 52/440 (11%), Positives = 132/440 (30%), Gaps = 71/440 (16%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K +G ++ +++PV+L + +++V + A+ A IQ+ ++ Sbjct: 34 KQRSGAVIVLLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQTGDKN 93 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ A + + ++ ++L+ + RE D + + V L+ Sbjct: 94 AALAAARDAAERNPVAGKVVPINMSDLEFGISLRESLDEGYSFQPLSDDDEFGNAVRLT- 152 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 L +P +F ++ + + VS + + + VID S SM + Sbjct: 153 TLSLSNSPQPVFSPLFPTMGTNLEIRPQRVAVS----TQSTMDVALVIDRSGSMAYANDE 208 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + + + Sbjct: 209 A-----------------------------PDPYVN-----------------PAAAPPG 222 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + D +A + ++ ++D D + Sbjct: 223 WTYGDPVPPNSRWLDLVASVNAFNGFLAD------SPQYEKLCLATYSDNASRDCDLTHT 276 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ + + G T++ ++ + + + + +VL+T Sbjct: 277 YAEISNQLDA--ISYQFNGGGTSVGYGLEHGLAVLTDATHARKFAV-------RVMVLMT 327 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG + ++ ++ G+ + TI FS + Q + + C N F A Sbjct: 328 DGHHNTGKS-PESMTYHLQNHGVTLFTITFSDDAD-QSRMSNLANACGGEN--FHATDAS 383 Query: 436 ELNKIFRDRIGNEIFERVIR 455 +L F+ I ++ + + Sbjct: 384 QLQNAFQK-IAKKLPSLMTQ 402 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 52/463 (11%), Positives = 127/463 (27%), Gaps = 40/463 (8%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + ++S G+F ++ L+M ++ V G+ VD + + ++ A+ + Sbjct: 4 HSVSQYLRSRDGNFGLLAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRATRDI 63 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 I+ + ++ ++ + ++ + T + + + ++ + S Sbjct: 64 IEEGKTLAEAELSARKYFDAELA-----FSSGVGLEVSTFTLTQGVDGIVKLGVSGKTST 118 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID---- 184 + R ++ ++ + G I + E I ++ Sbjct: 119 SLLKAVGREEIPVSVDAAAHVGGGSVEIAIAFDVTNSMGFGTTWGEATSVIASALNALKA 178 Query: 185 FSRSMLDYQRDSEGQPLNCF-GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 S SM + + + G G + + + ++ Y+M Sbjct: 179 NSGSMALTFIPFTDRVNVGMGRANLLNPGDQTAVKKGGWGGCVDVRATKKKNKGETEYFM 238 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHL-------VRDALASVIRSIKKIDNVNDTVR 296 P + + + K ++ V + K+ Sbjct: 239 PDSAPEKGDRFTKFDNGTPAAHKSGYKLACNPQSIIGPTSNVSDVTSQLGKLTKGG--TG 296 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F + G + + ST A D + ++ Sbjct: 297 RFDLGFAWLWYALSPNWKGFWSGGAPADNGVNLADYPTASTNTRKIAVLATDGLTNAYVY 356 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQE 413 E + G NT + AIC +Q I + + + Sbjct: 357 EYGKTNLA------------GWNTGSKDHFENVVAICKSMAAQKIEVHVMHV----NGND 400 Query: 414 KARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 KA + CAS +++ S L R++ Sbjct: 401 KAEPYFRECASATGGGYYKVASKQTLVDALTGITNGGGNLRLV 443 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 51/440 (11%), Positives = 127/440 (28%), Gaps = 81/440 (18%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++S +G ++ +L+PVML V ++VV L+ + A A L + ++ Sbjct: 29 LRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVTGDK 88 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + ++ ++R + + + + + Sbjct: 89 AEAIEAAERLLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPDKKVNSVGLRAFGA 148 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 +L + + + IK + + + I V+D S SM Sbjct: 149 DDVPMLFPTMGVPIEFRPIKQ--------------AVATQVELDIAIVLDRSGSMAFSHD 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + G + Sbjct: 195 EVAKN-----------------------------------------------GSPSSAPP 207 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + +++ + R+ + ++D+ +D + Sbjct: 208 GWKMGHAVPKNARWLDTVAAVN------GFLDIMEDSSHDERVSLSTYSDKSKADVKLTG 261 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + + G+T I + T+ N A + ++++ Sbjct: 262 DYTEIRAAMNAHSTN--FKGGATNIGSGILEGGATLGDKN-------LARSWASRVLIVM 312 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 +DG + E I + ++ I I T+ FS QE + + F A + Sbjct: 313 SDGIHNTG-IEPIPAAQQVANEKIMIFTVTFSNEANVQEMEKV---AVSGGGQHFHAKDS 368 Query: 435 HELNKIFRDRIGNEIFERVI 454 +L + FR I + + Sbjct: 369 QQLAEAFRK-IAKSLPTLIT 387 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 110 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 50/440 (11%), Positives = 128/440 (29%), Gaps = 81/440 (18%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++S +G ++ +L+PVML V ++VV L+ + A A L + ++ Sbjct: 29 LRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVTGDK 88 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + ++ ++R + + + + + + Sbjct: 89 AEAIEAAERLLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPDKKVNSVSLRAFGA 148 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 +L + + + IK + + + I V+D S SM Sbjct: 149 DDVPMLFPTMGVPIEFRPIKQ--------------AVATQVELDIAIVLDRSGSMAFSHD 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + G + Sbjct: 195 EVAKN-----------------------------------------------GSPSSAPP 207 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + +++ + R+ + ++D+ +D + Sbjct: 208 GWKMGHAVPENARWLDTVAAVN------GFLDIMEDSSHDERVSLSTYSDKSKADVKLTG 261 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + + + G+T I + T+ A + ++++ Sbjct: 262 DYTEIRAAMNAH--STKFKGGATNIGSGILEGGATLGDKK-------LARSWASRVLIVM 312 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 +DG + E I + ++ I I T+ FS QE + + F A + Sbjct: 313 SDGIHNTG-IEPIPAAQQVANEKIMIFTVTFSDEANVQEMEKV---AVSGGGQHFHAKDS 368 Query: 435 HELNKIFRDRIGNEIFERVI 454 +L + FR I + + Sbjct: 369 QQLTEAFRK-IAKSLPTLIT 387 >gi|156975610|ref|YP_001446517.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156527204|gb|ABU72290.1| hypothetical protein VIBHAR_03343 [Vibrio harveyi ATCC BAA-1116] Length = 502 Score = 109 bits (272), Expect = 1e-21, Method: Composition-based stats. Identities = 59/510 (11%), Positives = 128/510 (25%), Gaps = 93/510 (18%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 +S G + AL + + G+ V+ R+ L+ AAQTA + ++ + + Sbjct: 15 RSQKGIAAVWFALSLVPVFGMTFFAVEGTRYIQETSRLRDAAQTAALAITIDDKSNQADA 74 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + + + + VR T N K Y V + Sbjct: 75 -----LATMYINDYVRDISHVDI----------QTVRTYEEPTEDNDNTEKIQYSVQAVT 119 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 ++ S ++ Y + + + V DFS SM + Sbjct: 120 THNSWFASNS----IPSFETQEKLAGQAVAAKYPFYLGDKIIDLVLVTDFSGSMNNSWDG 175 Query: 196 S------------------------------------EGQPLNCFGQPADRTVKSYSSQN 219 + T + N Sbjct: 176 EIKIDLLKDAVKQISNRILVPREGESEVLNRIAIIPFNLRVQEKINDNLYSTSQLRYKGN 235 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR---- 275 + + K S Y ++ S + +V Sbjct: 236 YRKSVSSVKYEQVNWD-YWSPYSEEAVEECANKRTDCPNKKSWERDQAKRVADVVNINNN 294 Query: 276 -----------DALASVIRSIKKIDNVNDTVRMGATF-FNDRVISDPSFSWGVHKLIRT- 322 ++ + +N+ R +N + + + L Sbjct: 295 RLEIPDYVGYSKSVRHMFDDKVANNNLTFHFRSNNNKLYNSSMTRTGNSGFYTIPLTANK 354 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK--KYIVLLTDGENT 380 + G+TA + M + + + + K +++++DG Sbjct: 355 ANLDKMQTMSSGGNTAAHQGMLRGLQIMEAGRPNGGSEEETEQYNDRLKMLIVISDGMEY 414 Query: 381 QDNEEG-----IAICNKAKSQ------GIRIMTIAFSVNKTQQEKARYFLSNCAS--PNS 427 E +C+KA+ + I I + + + Q +C Sbjct: 415 PYTEILPGLVNKGMCDKAREHFQTENGNLYIGVIGVNFSASSQSG----FQDCVLNPDED 470 Query: 428 FFEANSTHELNKIFRDRIGNEIFER-VIRI 456 + T + K + I V R+ Sbjct: 471 IIDVTETEDFIKKIEELIQKGSGGNGVSRL 500 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 47/384 (12%), Positives = 100/384 (26%), Gaps = 66/384 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +G + VM+ GG+ +D++ ++ A++ A+ Sbjct: 25 RFWADTSGSMSYVALAGSLVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAANLSNTRD 84 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + + V+ A +V Sbjct: 85 PQTVVEDYFRAMKLEDTLGD---------------------------VQTGDSLGAKRVR 117 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + S FL +G+ + A AE + + I V+D S SM Sbjct: 118 AEGNGSI----NSHFLGLIGVDQLDVYGAATAENATA------PLEISLVLDVSGSMQGQ 167 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + F +S+ I ++ + + Sbjct: 168 KIRDLKEAAKAFVDAVLGEG-GDNSRVTVSLIPYNATVNLGDDLSERFNLDRWQNYSSCA 226 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + E +S S+ + + + D D + Sbjct: 227 IFESSDYNSLSIDP------------NAGLEQLAHFDPY-DYSGNSPDLTAPWCAEGNNL 273 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE---------------DE 357 + H + G+TAI+ M+ + + Sbjct: 274 AIVPHSSDADYLSDVIDSFEAQGNTAIDLGMKWGLALLDPAARPVIGDMQADGLVPSSAR 333 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQ 381 + K++V++TDGENTQ Sbjct: 334 YRPSDYGTQTMKFVVVMTDGENTQ 357 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/251 (13%), Positives = 72/251 (28%), Gaps = 20/251 (7%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD-AL 278 + D + + Y + ++ + +S + + Sbjct: 367 NPNALSDVWVDDHGTPGKGDDRYSIRVKDNYGDSNDVFYWPHASRNNYRNGPYSWVTRTA 426 Query: 279 ASVIRSIKKIDNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A ++ + +D ++T + + + N + G+ + N+ Sbjct: 427 AQMVNGVAVVDGDSETTKAKCSSYKGAGHNAGQETLIENVLGMDYGTLDLDGDGIAGAND 486 Query: 334 MGSTAINDAMQTA-------YDTIISSNEDEVHRMKNNLEAKKYIVLL---TDGENTQDN 383 S + ++ + + Y T + +Q N Sbjct: 487 DCSNYPPVRLTWQELFGNVKTTYYANAWYWQAYMDGRASYNDYYNAYYSWETTVDASQAN 546 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 IC KAK Q + I TI + + NCAS S + S+++L FR Sbjct: 547 TNLATICAKAKQQDVTIFTIGVEAPQAGLN----AMRNCASSASHYYNVSSNQLVDTFRS 602 Query: 444 RIGNEIFERVI 454 + R+ Sbjct: 603 ISDVVVELRLT 613 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 43/184 (23%), Positives = 85/184 (46%), Gaps = 11/184 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A+ S+I+ ++ +D + R+GA ++ +W + V+++ Sbjct: 253 SRMSELKKAVRSLIKELQTVDPDDQFTRLGAYAYHWYYAGKKELTWN-----KNSVRSWV 307 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T A+Q A + +++++E H KN E +I+ +TDG + N Sbjct: 308 NSLPASGGTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPNWAKRE 367 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGN 447 C AK+ GI I T+AF + + L CA+ + +++A + +ELNK+F+D Sbjct: 368 -CTSAKNAGITIYTVAFKAPASGRN----LLKACATSDAHYYDAKNANELNKVFKDIARE 422 Query: 448 EIFE 451 Sbjct: 423 TTKS 426 Score = 91.2 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 37/296 (12%), Positives = 88/296 (29%), Gaps = 9/296 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G + +L+ ++L V + +D+ + L+ AA A + L+ + Sbjct: 76 KERDGSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALLAEEITI 135 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + ++ F V+ V+ T ++ + Y V ++ Sbjct: 136 AQANAYAKDAFNRIASGLSASGDGSSGTSIFGTMTVKPAVQITE-TVDGNTTNYVVKVNG 194 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + +PLS F+ G + + +S+ V+D S SM Sbjct: 195 TAKIPASPLS-FMFFDGETGKNTISLGFESETTAKAEAGASLSMALVLDRSGSMG----W 249 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 ++ + +K + + Y + S Sbjct: 250 ERPSRMSELKKAVRSLIKELQTVDPDDQFTRLGAYAYHWYYAGKKELTWN---KNSVRSW 306 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + +S ++ L + I+ + + D + DP+ Sbjct: 307 VNSLPASGGTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPN 362 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 57/578 (9%), Positives = 138/578 (23%), Gaps = 134/578 (23%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K +G+ ++ L + ++ + G D R++ AL+QA A++T + L + Sbjct: 14 KSFGADESGNVGMVFGLGLVPVMFMLGATADYTRYATTRSALRQATDVAVLTVASKLTAT 73 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++A+ Q + T ++ + R ++ Sbjct: 74 TTDAQAKAQAQVILNAQPRMSTASITTASIATTKQTFCATSEVTIQNSFMQMARVTSLTP 133 Query: 132 VLSSRYDLLLNPL-----SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV---- 182 ++S DL + L S L + + + + V + Sbjct: 134 SVTSCADLAWGANPNATYEVALVVDNSGSMLSSDGSVTKISALKTAAKSFVDTMFAKAPD 193 Query: 183 ---IDFSRSMLDYQRDSEGQPLNCFGQPADRTVK--SYSSQNGKVGIRDEKLSPYMVSCN 237 + N D + G + + Sbjct: 194 RVQFSVTPFAGAVVAVDPTVAANRTLPWIDTEGDNSQHWLVFGNGSLTPSTAKAAAAAQG 253 Query: 238 KSLYYMLYPGPLDPSL--SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + ++ + ++ + ++ + + D + Sbjct: 254 FNNRFDIFNKLKQRNSAMDWRGCFEAPAYPKNVQDIVVSSSDPETQFVPYLAPDEPSGYD 313 Query: 296 RMGAT---------------------------------------------------FFND 304 FF Sbjct: 314 NNNYIDDNGGVTTRTYYGSTTTYTCSDTASGSWSKLTHVCKYKPTAAKSGNYGPTSFFGP 373 Query: 305 RVISDPSFSWGVHKLIRT--IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + +L + +K G+T + + + TI + R Sbjct: 374 NAFCPDHTTQRLLQLTTSQTTIKNKIDQLVANGNTNLQEGFMWGWRTISPNGPFAAGRPY 433 Query: 363 NNLEAKKYIVLLTDGENTQD----------------------------------NEEGIA 388 +K +V +TDG N + Sbjct: 434 ATSNNRKVMVFMTDGFNHWGAYPNTVVGSDYEALGYYTYNGEKNLRLPDGSRGDRVDYQN 493 Query: 389 ICNKAKSQG--------------------------IRIMTIAFSVNKTQ-QEKARYFLSN 421 A++ + + TI FS + + L + Sbjct: 494 ALKAARNSNSSYLATARDAQDELTLQACTNAKNAGVEVFTIGFSTSTDPIDAQGLELLKS 553 Query: 422 CASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 CA+ + +F + ++LN F I +R+++ Sbjct: 554 CATNVDHYFAVENANQLNAAFSSI---GIGLGKLRLSQ 588 >gi|165924896|ref|ZP_02220728.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|167418907|ref|ZP_02310660.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425152|ref|ZP_02316905.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487722|ref|ZP_06204796.1| conserved hypothetical protein [Yersinia pestis KIM D27] gi|165923096|gb|EDR40247.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|166962901|gb|EDR58922.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167055915|gb|EDR65696.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360782|gb|ACY57503.1| membrane protein [Yersinia pestis D106004] gi|270336226|gb|EFA47003.1| conserved hypothetical protein [Yersinia pestis KIM D27] Length = 492 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 55/495 (11%), Positives = 129/495 (26%), Gaps = 89/495 (17%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEEVSSRAKN 81 + L+PV +G+ + + + L A + A + S + Sbjct: 1 MSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYL 60 Query: 82 SFTFPKQKI--------------------EEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ + + EY N + V Sbjct: 61 VTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVN 120 Query: 122 MNPRKSAY--------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 N Y VV + + ++ + + L + KA ++ Sbjct: 121 ENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAIFVKLNNRIFS 180 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 G++ + FS + + C + + + + + K P + Sbjct: 181 NDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGL 240 Query: 234 VSCNKSLYY---------------------MLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + + P ++F+D + + I Sbjct: 241 DNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTIT 300 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +I ID +T+ + FF + + S ++ Sbjct: 301 KIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDP---NSKGDINE 357 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---- 383 ++ G T + + + S + K +++L+DG++ Sbjct: 358 ILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KLMIILSDGDDNTQKMSSP 406 Query: 384 -------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +C K K GI+++ I + +C +F+ Sbjct: 407 HDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--NNIIDWEKDCVGTGNFYL 464 Query: 431 ANSTHELNKIFRDRI 445 A + HEL + Sbjct: 465 AKNAHELEISIERAL 479 >gi|190149857|ref|YP_001968382.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307263180|ref|ZP_07544801.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914988|gb|ACE61240.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871542|gb|EFN03265.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 530 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 56/525 (10%), Positives = 148/525 (28%), Gaps = 96/525 (18%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVS---------SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + + N + ++ + + L + D+ V T Sbjct: 67 NSGRKANDYKLGGSNPNDDSFNISSEVGKRDHAIVTTFVKTFLPQTNDDKMNLIPVCKTV 126 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHGV 177 + + ++ + S F L+ ++ Q +++ + + + Sbjct: 127 NNTSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVDVASKSKAFKKNTFNIPI 186 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + V D S SM ++E + + + + + D + Sbjct: 187 DLMVVADLSGSMNFDLDNNETKKTGKPSKISILKEVLVELADKTLLSEDANQHNRIYVTP 246 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDS---------SSLRHVIKKKHLVRD-----ALASVIR 283 +L + S + S + + L+ + + + Sbjct: 247 FALGAEINKNSCALPYSWDIESSSKIENIKKILNKENSQYNRADLINNLVYRISTKETLN 306 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 +I N T F + + + W ++ ++ G+T + + Sbjct: 307 NINGKQKYNVTFPKN--TFCLKDMKTSNQGWYTRS-DKSKFTSYVQSIKASGATLASSGV 363 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------------------------- 378 A + +I + K+ I++L+DG Sbjct: 364 LVAANNMIKDGSRTEQ---LKEQTKRVILVLSDGNDEIIKSDPNSKVPFLNYTRITENLI 420 Query: 379 ----------------------------NTQDNEEGIAICNKAK--------SQGIRIMT 402 +TQ E +C + + +I+ Sbjct: 421 YGKQEVFLSQKQKISLSLSHSTIETYLTDTQPRNETDGMCKVIRDRLDTLNNDKNTKIVF 480 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + F KA+ +C +++ AN L F+ IG Sbjct: 481 VEFGY----ASKAKQAWQHCVGNGNYYSANDKESLLNSFKQAIGE 521 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 65/518 (12%), Positives = 138/518 (26%), Gaps = 77/518 (14%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K K+ +++ G + IITALL +L VD + L QA A + Sbjct: 7 LTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAAL 66 Query: 63 TASVPLI----------------------------QSLEEVSSRAKNSFTFPKQKIEEYL 94 + + + K + + ++ YL Sbjct: 67 LLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYL 126 Query: 95 IRNFENNLKKN--------FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + N K + F + + + + A V S + L Sbjct: 127 RSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQT 186 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ------------- 193 + S + + + + + + V D SRSM Sbjct: 187 LVSSSQLHDGRVGINSGETYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNR 246 Query: 194 --RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 N + ++ S ++G + Y Sbjct: 247 RIDALREVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNKQ 306 Query: 252 SLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGAT--FFNDRVI 307 + +F + + + + + + + F Sbjct: 307 AEISNYFNSGQISQGFEELSRSMDIEKTINQITQFKNGEKKSYPFSLSSLSSRNFCLENN 366 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNLE 366 + + + V + +G TA+ + + + +N+D K N Sbjct: 367 KGKATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLNTN 426 Query: 367 AKKYIVLLTDGENTQDNEEGI------AICNKAKSQ------------GIRIMTIAFSVN 408 ++ +++L+DGE+ + ++ + +C K K + RI +A N Sbjct: 427 TRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVALGFN 486 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 Q + C ++ NS L F+ I Sbjct: 487 PPQDQLI--AWKKCVG-KQYYPVNSKQGLLDAFKQIIS 521 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 65/518 (12%), Positives = 138/518 (26%), Gaps = 77/518 (14%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K K+ +++ G + IITALL +L VD + L QA A + Sbjct: 7 LTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAAL 66 Query: 63 TASVPLI----------------------------QSLEEVSSRAKNSFTFPKQKIEEYL 94 + + + K + + ++ YL Sbjct: 67 LLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYL 126 Query: 95 IRNFENNLKKN--------FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + N K + F + + + + A V S + L Sbjct: 127 RSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQT 186 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ------------- 193 + S + + + + + + V D SRSM Sbjct: 187 LVSSSQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNR 246 Query: 194 --RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 N + ++ S ++G + Y Sbjct: 247 RIDALREVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNKQ 306 Query: 252 SLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGAT--FFNDRVI 307 + +F + + + + + + + F Sbjct: 307 AEISNYFNSGQISQGFEELSRSMDIEKTINQITQFKNGEKKSYPFSLSSLSSRNFCLENN 366 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNLE 366 + + + V + +G TA+ + + + +N+D K N Sbjct: 367 KGKATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLNTN 426 Query: 367 AKKYIVLLTDGENTQDNEEGI------AICNKAKSQ------------GIRIMTIAFSVN 408 ++ +++L+DGE+ + ++ + +C K K + RI +A N Sbjct: 427 TRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVALGFN 486 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 Q + C ++ NS L F+ I Sbjct: 487 PPQDQLI--AWKKCVG-KQYYPVNSKQGLLDAFKQIIS 521 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 56/450 (12%), Positives = 129/450 (28%), Gaps = 66/450 (14%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + + I G I ++ M+ + + + QAA A I Sbjct: 2 HFNQSIVKHHGSVAISYLAMLIPMVIAAASTIVIGYQVQLSNRAMQAADAASIAC----- 56 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 K + ++ L +++ + + + + + Sbjct: 57 ---------------EFKGEYDQALTQSYLDYYQPKIDKVRGQIR------TNSGCNMSL 95 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 LS+ + L + F+ + + A + + + V+D S SM Sbjct: 96 GYSLSTIFTSLTLSDTSFV---------VSSTANEKAYVTEDVVSDPLELVIVLDISTSM 146 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D + ++Q+ VS N + + Sbjct: 147 YGAINDL---KAILKRGIVSLKEQQNNAQSEDHIKVSIIPFSTGVSVNNAPWLNDARTFC 203 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +E ++ + ++ K A F Sbjct: 204 VDGTTESEDKFYAARTVANLDITHDQISV-------KLSQPNKWRESCSAASF------T 250 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKNNLEA 367 + + ++ G TA + + + + +N + Sbjct: 251 LPLTADLDQVT-----NTVDSLRTEGGTASYQGLIWGLRQLTPNWQKAWEVGPNRNVDKV 305 Query: 368 KKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 ++ +VL+TDG + D+ +C++AK GI + + F VN ++ E+ + CA Sbjct: 306 ERKLVLMTDGNDYGRYFDDLINAGLCDRAKDYGIALNFVGFGVNGSRLEQ----FTRCAV 361 Query: 425 PNS-FFEANSTHELNKIFRDRIGNEIFERV 453 F A+ T +L+ F + + ++ Sbjct: 362 DPKGVFSASDTQDLDHYFSQLLSVQYDTKL 391 >gi|126208028|ref|YP_001053253.1| tight adherence protein G [Actinobacillus pleuropneumoniae L20] gi|126096820|gb|ABN73648.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 520 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 59/515 (11%), Positives = 150/515 (29%), Gaps = 86/515 (16%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSRAKNSFTFPKQKI----------EEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + S + + + L + T Sbjct: 67 NSGRKDNDYKLSGSSNKENDSFDISSEVGKRDTQMVTTFVQAFLPQTNEKAMRLTPTCKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHG 176 N + ++ + S F L+ ++ Q +++ + + Sbjct: 127 VTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + V D S SM D + + + +P+ + + + S Sbjct: 187 IDLMVVADLSGSMNF---DLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRI 243 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS------------VIRS 284 + + + + + S R K A + ++ Sbjct: 244 AVAPFALGAQHSNNQCIIPFILKKYSKDRISEKNIKSYLSANNNISAKDFALSLSYLVDI 303 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL----IRTIVKTFAIDENEMGSTAIN 340 K I+++ T + FN + H + +F + GST + Sbjct: 304 DKTINSIGGTFSSNSIIFNKNKFCLGRSNKNTHHWYNRDESSNFFSFIKRLHAEGSTLAS 363 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-----------TQDNEEGIAI 389 + TA + ++ E + K+ I++L+DG + TQ + + Sbjct: 364 SGLITASNIMLKE---ESRSKSLGEQTKRVILVLSDGNDELRLNDEGTPFTQYSRITENL 420 Query: 390 -----------------------------CNKAK--------SQGIRIMTIAFSVNKTQQ 412 C++ + + +I+ + F Sbjct: 421 LLGQEEQTTDTYPYFMSKPPKKLTSNINVCDRIRNKLDEHNEDKNTKIVFVEFGY----A 476 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 KA+ +C +++ AN L F+ IG Sbjct: 477 SKAKQAWQHCVGNGNYYSANDKASLLNSFKQAIGE 511 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 60/395 (15%), Positives = 108/395 (27%), Gaps = 60/395 (15%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M+ ++ K G ++ ++ + GG+ VD+ + A Sbjct: 1 MMLISRIAH----FRKDEGGGMLVLMLIVFFGITIFGGLAVDLANHERTRTTFQTHLDNA 56 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ A+ E R+ + E R + Sbjct: 57 VLAAASLSQDLDAEEVVRSYLTSAGLDPSEVEIETREEKIG------------------- 97 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 V + L + F R I + + +EA + I Sbjct: 98 -------GILVGRTVEASLPAGLNTYFFRFFDIDTLGMTISSEATERVED------IEIS 144 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V+D S SM D D G ++ + A + + G + PY N Sbjct: 145 LVLDVSGSMGDITSDRSGIKMDLLKRAA-GDFVETILSDAEEGRVSISIVPYSTKVNPGS 203 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 L + S H VD + + + + I N T Sbjct: 204 AL-LGQYTVSQEHSYSHCVDFDADDFTHLRIDTATELQRTGHFLIGSESTSNRTAGQWVC 262 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE--- 357 F+ + S V +L K +GST+I+ + + S + Sbjct: 263 RFDSG-FAVTPLSSSVAEL-----KAQIAALTPLGSTSIDMGAKWGLALLDPSAQTPIAA 316 Query: 358 -------------VHRMKNNLEAKKYIVLLTDGEN 379 + + K +VL+TDGEN Sbjct: 317 MIASGQVNRAFQGRPHVYGADNSMKVLVLMTDGEN 351 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 72/243 (29%), Gaps = 22/243 (9%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R + P++ + S+ ++ + + +S + + + SV S Sbjct: 325 RAFQGRPHVYGADNSMKVLVLMTDGENREEYRLKPEFASGQSDLTRTTYGGYTYYSVASS 384 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +N +D + F+ + + ST + Sbjct: 385 ESNYENDSDWTYPESNFYAIHPFGTQRMWSNYTLANNSDFR------QARMSTEVRLDWP 438 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIV-----LLTDGENTQD----NEEGIAICNKAKS 395 + + + + Y + T D + IC A + Sbjct: 439 EVWAEMSPYY-YGYNMYGRRYNSSWYWYQRATDFRDYIQWTVDSVEKDRRLRQICGVANA 497 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 G+ I +I V+ + L +CAS S + E+ F D I I ++R Sbjct: 498 AGVVIYSIGMDVD---NTNSLNLLKDCASSESHYFDVEGLEIQTAF-DMIAASIS--MLR 551 Query: 456 ITK 458 +TK Sbjct: 552 LTK 554 >gi|46143335|ref|ZP_00135441.2| COG4961: Flp pilus assembly protein TadG [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 520 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 59/515 (11%), Positives = 150/515 (29%), Gaps = 86/515 (16%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSRAKNSFTFPKQKI----------EEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + S + + + L + T Sbjct: 67 NSGRKDNDYKLSGSSNKENDSFDISSEVGKRDNQMVTTFVQAFLPQTNEKAMRLTPTCKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHG 176 N + ++ + S F L+ ++ Q +++ + + Sbjct: 127 VTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + V D S SM D + + + +P+ + + + S Sbjct: 187 IDLMVVADLSGSMNF---DLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRI 243 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS------------VIRS 284 + + + + + S R K A + ++ Sbjct: 244 AVAPFALGAQHSNNQCIIPFILKKYSKDRISEKNIKSYLSANNNISAKDFALSLSYLVDI 303 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL----IRTIVKTFAIDENEMGSTAIN 340 K I+++ T + FN + H + +F + GST + Sbjct: 304 DKTINSIGGTFSSNSIIFNKNKFCLGRSNKNTHHWYNRDESSNFFSFIKRLHAEGSTLAS 363 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-----------TQDNEEGIAI 389 + TA + ++ E + K+ I++L+DG + TQ + + Sbjct: 364 SGLITASNIMLKE---ESRSKSLGEQTKRVILVLSDGNDELRLNDEGTPFTQYSRITENL 420 Query: 390 -----------------------------CNKAK--------SQGIRIMTIAFSVNKTQQ 412 C++ + + +I+ + F Sbjct: 421 LLGQEEQTTDTYPYFMSKPPKKLTSNINVCDRIRNKLDEHNEDKNTKIVFVEFGY----A 476 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 KA+ +C +++ AN L F+ IG Sbjct: 477 SKAKQAWQHCVGNGNYYSANDKASLLNSFKQAIGE 511 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 54/478 (11%), Positives = 118/478 (24%), Gaps = 94/478 (19%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + +G ++ + + +L + G VD + L A A++ A Sbjct: 12 VTRSRRVFAADRSGSIGMMFVVTLVPVLLLVGAAVDFTSYQKARTELDAVADQAVLAAVS 71 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + A + F + +A Sbjct: 72 AAGMKMSQADAEAAMAKLFTDAAAA----------------------LPNVSASPRAATA 109 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 V ++ + +R G + A A + + ++D S Sbjct: 110 PTTDGVRTASLTYSATIRTGIMRLAGFSTVAFGGTATAASPNP-----IFTDFYLLLDNS 164 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ + Sbjct: 165 PSMGVAATTADIATMVANTSDQCAFACH-------------------------------- 192 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 D S + + V + +VRDA ++ + RM F Sbjct: 193 ---DMSAGGNDYYAKAKNLGVKMRIDVVRDATQQLMDTASAKAIAAGQYRMAIYSFGTSC 249 Query: 307 I--SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 S L + A+D + N+ T +D I + V + Sbjct: 250 SGIGLNQVSALTANLSTSKTDAGALDLMTVPYQNYNNDQCTDFDGIFARLNSAVPNPGSG 309 Query: 365 ---LEAKKYIVLLTDGENTQD---------------NEEGIAICNKAKSQGIRI---MTI 403 +K + ++DG + +A C K +GIR+ T Sbjct: 310 ASAASPQKVVFFVSDGVADANYPSTCTKPTTNGRCQEPITLANCQALKDRGIRVAVLYTT 369 Query: 404 AFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + ++ CASP+ ++ + + + + + + Sbjct: 370 YLPLPTNGWYNTWIAPFSSQIATNMAACASPDLYWPVSPSEGIADAMKGLFKKVVDSQ 427 >gi|27379052|ref|NP_770581.1| hypothetical protein blr3941 [Bradyrhizobium japonicum USDA 110] gi|27352202|dbj|BAC49206.1| blr3941 [Bradyrhizobium japonicum USDA 110] Length = 472 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 59/502 (11%), Positives = 129/502 (25%), Gaps = 91/502 (18%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + + II AL+M + + GM +D L AA A I A Sbjct: 5 LRSAMLRFARDRKANVAIIFALMMVPTIFLLGMALDYTLALRKREQLNAAADAAAIAAVR 64 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 P + + + + + K N T V+ Sbjct: 65 PAMLTQSDTTVVKATAEAVFAA--------------KANLPGLSAVPTPTVTIVDSG--- 107 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + F +G ++W + A A S ++ ++D S Sbjct: 108 ----LARTITVSYTAQSTNNFPGVLGKQTWQVAGSATARASSAPN-----MNFYLLMDDS 158 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM ++ L + PA G ++ + L Sbjct: 159 PSMGIGATTTDISNLIKYTAPAY-------QSAGGSQNCGFACHETNIAHDGGTKDNLAI 211 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 V S+ + + + + ++ + N T + F+ + Sbjct: 212 ARQRNITLRIDLVTSAVNQLLNSWSNCPQSGVSGGVMQCMSA-LNNTTYKAALYTFDLGL 270 Query: 307 ISDPSFSWGVHKLIRTIVKT-----------FAIDENEMGSTAINDAMQTAYDTIISSNE 355 + + + + + T I A+++ D + + Sbjct: 271 NALATLTTPTSAGTQVSNIALMPVAYQNCVVVTTNCKTDNGTDIAGALKSLNDVMPTPGL 330 Query: 356 DEVHRMKNNLEAKKYIVLLTDG-------------------ENTQDNEEGIAICNKAKSQ 396 + ++ + L+TDG N IC K++ Sbjct: 331 GSN---ASGDTPQEVVFLVTDGVEDKIVSGASTCPNASLASNNRCQQPLDTTICTTIKNR 387 Query: 397 GIRIMTIA------------------------FSVNKTQQEKARYFLSNCASPNSFFEAN 432 GI+I + ++ + L +CASP + Sbjct: 388 GIKIAVLYTEYLQLKTPNIPVTNSWYMSWVDPYNEPTSLTGTIAQKLQSCASPGFYASVQ 447 Query: 433 STHELNKIFRDRIGNEIFERVI 454 + +++ + Sbjct: 448 TGGDISDALTNLFIKVASSTAS 469 >gi|165975965|ref|YP_001651558.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876066|gb|ABY69114.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 529 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 53/523 (10%), Positives = 148/523 (28%), Gaps = 93/523 (17%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEV---------SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + S + N + ++ + + L + + T Sbjct: 67 NSGRKTTDYKLGGSNPSDESFNISSEVGKRDHAIVTAFVKTFLPQTDEKNMHLTPLCKTI 126 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHGV 177 + + ++ + S F L+ ++ Q +++ + + + Sbjct: 127 NNTSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASQSRAIKKNTFNIPI 186 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + V D S SM ++E + + + + + D + + Sbjct: 187 DLMVVADLSGSMNFDLDNNEIKKTGKPSKISILKEVLVELADKTLLSEDANQNNRIYVTP 246 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSS---------LRHVIKKKHLVRDALASVIRSIKKI 288 +L + S S + + L+ + + + S K+ Sbjct: 247 FALGAEINNNNCALPYSWSVESSSRTQNIKNILNKQNSQYNRADLINNLVYKI--STKET 304 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSW----GVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + + F+ + + + G+T + + Sbjct: 305 LDNINGKQNYNVTFSKNAFCLKDMKTSNKGWYSRSDKLEFTNYVQSIKANGATLASSGVL 364 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-------------------------- 378 A + +I D + + K+ I++L+DG Sbjct: 365 VAANNMI---RDGSRTEQLKEQTKRVILVLSDGNDEIVKGDPNNKVPFLNYTRITENLIY 421 Query: 379 --------------------------NTQDNEEGIAICNKA--------KSQGIRIMTIA 404 +TQ + +CN K + +I+ + Sbjct: 422 GRQEEFSSEKKRVSFGHSTTIETYLTDTQPKKVTDGMCNVIRDKLDTLNKDKNTKIVFVE 481 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 F + ++ +C +++ AN L F+ IG Sbjct: 482 FGY----KSTSKQAWEHCVGNGNYYSANDKASLLNSFKQAIGE 520 >gi|260778153|ref|ZP_05887046.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] gi|260606166|gb|EEX32451.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] Length = 397 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 49/447 (10%), Positives = 123/447 (27%), Gaps = 62/447 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++K TG + L+ ++ + + + QA A + + Sbjct: 1 MLKQHTGSVSLSFLALLIPLVVLSAATIMIGFQVQLSSRAMQAVDAASLACAFADYSDPS 60 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 Q EY N + + ++ + + Q Sbjct: 61 ------------VNQAYLEYYQPNVKLVKSEIYSASGCELNMGYQLTGLFSSLKFAQASY 108 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S++ + + + + V+D S SM Sbjct: 109 SAQ-----------------------SGSVEQAHVNQSASVTPTEMTLVLDISSSMAGSI 145 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + V+ + + I VS + + L Sbjct: 146 DTLKSILTRAIERIEQDNVQIDGRRAISISIVPFS---DGVSARNADWLDDKGVFCIDGL 202 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 ++E ++ + + I V+ + + Sbjct: 203 TKESGGS------------VLVNETVQNLDRIHSEKAVSHRAPDEFLADCSASATLVPLT 250 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + ++ KT G T + +I E +L K+ ++L Sbjct: 251 DNMSEV-----KTAINALTTTGGTRSYQGVIWGARQLIPRWRQEWGYNPYSLAPKQKLIL 305 Query: 374 LTDGENTQ---DNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASP---N 426 +TDG ++ D+ +C++ ++ I + I F+V ++ + + ++ + Sbjct: 306 MTDGVDSGYVLDDLIDAGLCDRLANEFAIELNFIGFNVQDSRLAQFQSCINAANTDGIKG 365 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERV 453 F A +T +L++ F + + + Sbjct: 366 QVFSATNTEKLDEYFSKILEVQYDTTL 392 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 106 bits (264), Expect = 8e-21, Method: Composition-based stats. Identities = 57/449 (12%), Positives = 126/449 (28%), Gaps = 69/449 (15%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +S +L+ G I ++ M+ + + + QA TA + Sbjct: 2 HSSRLLSKQRGSVAISYLAMLIPMIIAAASTIVIGYQVLLSNRAMQAVDTASLAC----- 56 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + + + + K Sbjct: 57 ---------------EFRGEYDRSIAQGYLDYYKPKIDKVTA----------------TL 85 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRS 188 + +L + S+F + + ++ V + + + + V+D S S Sbjct: 86 GASSGCKVELGYSYSSIFTSLTFSDASYVAGVTASQKVYVTEVTDSDPIELVLVLDISGS 145 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M L T++S + + P+ + + L G Sbjct: 146 M-----MGALDELKSILNRGLTTLRSQQANVAGQDHIKVSIVPFSNGVSVTDAPWLKSGG 200 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + S S + + + D ++ Sbjct: 201 TLCVDATVNSGGSFSPANTVANLDVTHDQA--------------PVTTSSSSSDCSLTSV 246 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKNNLE 366 + ++ + +GSTA + + + + +N Sbjct: 247 ILPLTSNLNDV-----VDAVNRLQTIGSTASYQGLLWGLRQLTPNWQSAWRVGPNRNQDN 301 Query: 367 AKKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 ++ +VL+TDG + D +C +AK GI + I F V Q + F Sbjct: 302 VQRKLVLMTDGMDDNSHLDELINAGLCTRAKDLGIELNFIGFGV---QSWRLEQFTRCAG 358 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFER 452 S + F AN+T +L+ F + E + Sbjct: 359 SAGAVFSANNTQDLDDYFSQLLSIEYDTK 387 >gi|262173885|ref|ZP_06041562.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] gi|261891243|gb|EEY37230.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] Length = 403 Score = 106 bits (264), Expect = 8e-21, Method: Composition-based stats. Identities = 39/447 (8%), Positives = 116/447 (25%), Gaps = 72/447 (16%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 ++ P M+ + + + + L +A++ A + + +N+ ++ + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL-------IASPKEDDENNVSYAR 53 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 + ++ Y++ N ++ R + + + V ++++ ++ ++ Sbjct: 54 KLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYENIS 113 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 L+ + + + V + +++D S SM + ++ Sbjct: 114 LK----PEFTVNGSSVTRKFLPQ-----PVDVYFIVDMSASMRATW-QNGKSQIDEVKNV 163 Query: 208 ADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 R V + +V R + + + + R Sbjct: 164 ITRVVNDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYALYNTPQQTVSNMFFPPKR 223 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 L + + + + + Sbjct: 224 VNPSDSGLFSHRDIDLTQ---------NYSSFLQIMNDRNFYPP------RSACTESWQG 268 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH---RMKNNLEAKKYIVLLTDGENTQD 382 A + +T IN + + + + R + Y+ L DG Sbjct: 269 IIAAAQAADKATDINP--EQVFIILSDGADCPWQRQDRWGRLRTTEYYLKKLVDG----- 321 Query: 383 NEEGIAICNKAKSQ-------------------GIRIMTIAFSVNKTQQEKARYFLSNCA 423 +C K + + + I + + +C Sbjct: 322 -----GLCKNLKQRIRQKPNRFQSSTPTENEKTKVTMGVIGVNYQVNPNDG----FGDCV 372 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIF 450 + + A ++ K + I E Sbjct: 373 GRENIYHATQGEDVYKYILNLINEETG 399 >gi|328470527|gb|EGF41438.1| hypothetical protein VP10329_07002 [Vibrio parahaemolyticus 10329] Length = 461 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 55/499 (11%), Positives = 141/499 (28%), Gaps = 108/499 (21%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G II +P+++ V + +++AA+ A + + ++ Sbjct: 4 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 63 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + ++ ++ Y+ N + T + + + VV + Sbjct: 64 -------DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVAT 116 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + +D ++ + + K ++++R Y + V I +++D S+SM + Sbjct: 117 AEHDSWISHNEI--------GVEPKFKVSGDSITRKYLPQ-PVDIYFILDTSQSMSNPWY 167 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + V + + K P S L Y Y D Sbjct: 168 GERNKTQMQVVKDTITRV--------VKELENFKTGPDKKSRVALLTYNAYNAKFDKGAG 219 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 D +S + + + ++V + A+ +N + Sbjct: 220 RVKLYDYASEFSHTE--ASFESIVDKMFD-----ESVVEQKPHYASDYNKSQD--IPLTD 270 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + I + + G T ++ +I++ ++ K + ++ ++L Sbjct: 271 KYQEFIDILNSNKVMPAR-GGGT-------QSWLGLIAAAKEADKVKKEDRNPEQVFIIL 322 Query: 375 TDGENT---------------------------QDNEEGI----------------AICN 391 +DG +T Q + +C Sbjct: 323 SDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQYDGRTYYYQVYDKFLKSLVGEHGLCE 382 Query: 392 KAKSQ--------------------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 K++ + + I + N + + C + + A Sbjct: 383 SLKNRISSKENKFQSEHVKLEGEKTKVTMGVIGVNYNVQKDDG----FGECVGEKNIYHA 438 Query: 432 NSTHELNKIFRDRIGNEIF 450 + ++ K + I E Sbjct: 439 KNGKDVYKYILNLINEETG 457 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 53/554 (9%), Positives = 118/554 (21%), Gaps = 117/554 (21%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ I ++ G+ +D+ R + L+ A + Sbjct: 5 RFLSDRRGNIAIAFGSFAFLLTAGSGVGIDMSRVVTEKSRLQSA--ADATALAANYKSGT 62 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD-TAVEMNPRKSAYQV 131 +++ + + N+ V V T Sbjct: 63 YTAEQIRQHAEAYFDGLYTAPERGSVSRNVTVGDGTISVEAGVTMPTFFAPLLGVEEISF 122 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV-------SIQWVID 184 + + + + L S + + + + ++ Sbjct: 123 AVMAESKVGTASFDVVLVLDNSGSMAGSRMTTLKQAASDLIRTLMSINEISTEDDRVMVG 182 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 ++ + V + + Sbjct: 183 LVPFTAFVNIGADKATQPWMDREGRSPVHWTN-FQTGSDGTPVPSLFSSSALVNGRPSRF 241 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID-NVNDTVRMGATFFN 303 V++ + + + D AS+ D + ++ Sbjct: 242 SLYQQLGGTDWLGCVEARPMPYDVTDDAADPDVPASLYVPAFAPDEPDSSPDNRDGYRYS 301 Query: 304 DRV--------------------------------------------ISDPSFSWGVHKL 319 + + S+G Sbjct: 302 NNWLADNAGACSLTAKQAAQVNIYDQGDSPIHGSLATREVAQGRLCKYRNQPKSYGTSSS 361 Query: 320 -----------------IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM- 361 + + G T I+ + + + R Sbjct: 362 QGPNFLCKTQPITDLTNDKQALLDAVAAMRADGYTNIHQGVVWGWRVLTPQEPFSRGRSP 421 Query: 362 --KNNLEAKKYIVLLTDGENTQD------------------------------------N 383 K + ++ ++++TDG NT + Sbjct: 422 DQKREKDHRRIMIVMTDGANTYQDKSSSHNRTEYNAYGYGTEQRLGSGIDTAGEIAAKMD 481 Query: 384 EEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIF 441 E C A + + ++ TIAF V R L +CA SP F+A S EL F Sbjct: 482 ERTALACRNAATYEATQVYTIAFQVGDYAT---RKLLRDCASSPEMAFDAGSNSELVTAF 538 Query: 442 RDRIGNEIFERVIR 455 R+ R Sbjct: 539 ERIGKEISRLRLAR 552 >gi|116249091|ref|YP_764932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253741|emb|CAK12134.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 465 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 62/515 (12%), Positives = 140/515 (27%), Gaps = 134/515 (26%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + G+ I+ AL + M+ G D +R ++ TA+I A Sbjct: 23 RVFKAFRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAV 82 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++ + + + ++E + N Sbjct: 83 KEI--DTDDTDALKEKVADWFHAQVENSYTLGDIDIDTSNHKI----------------- 123 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + ++ I + + + + + SY +++ VID Sbjct: 124 ----------TATASGTVPTTLMKIANIDTVDVSVASAVKGPATSY-----LNVYIVIDT 168 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML + + + GK Sbjct: 169 SPSMLLAATTAGQSAMYSGIGCQFACHTGDAHTVGKTK---------------------- 206 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + S+ + + + + DA+ V+ I D+ + +++G D Sbjct: 207 ---------YANNYEYSAAKTIKLRADVAGDAVRDVLDMIDDSDSNHQRIKVGLYSLGDT 257 Query: 306 VISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + +T + ++ T + + + Sbjct: 258 LTEVLTPTLSTDTARNRLADASYGLTSATSKAATYFDVSLATLKQKVGTGG-----DGTS 312 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI-----------------------CNKAKSQGIRI 400 + K ++LLTDG +Q + C K+Q + Sbjct: 313 SGSPLKLVLLLTDGVQSQREWVTDKVTWKNGQATYGAYWNKVAPLNPDWCAYVKNQSATM 372 Query: 401 MTI------------------------------------AFSVNKTQQEKARYFLSNCAS 424 + S + T+++ Y LS+CAS Sbjct: 373 AVLYTEYLPITSDWGYNATVGSTMASASWKSTYGGTMQSGVSTSITRRDYIPYALSDCAS 432 Query: 425 PNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 S F A+S+ E+ + +R+T+ Sbjct: 433 SKSLFISASSSTEITAGLSALFTQYLAS--VRLTQ 465 >gi|312882153|ref|ZP_07741902.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309370199|gb|EFP97702.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 498 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 59/518 (11%), Positives = 134/518 (25%), Gaps = 89/518 (17%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +K G + LL+ ++GV V+ R+ L+ +A+ A I ++ Sbjct: 3 IRQTQNSSLKKQQGVAAVWMGLLLVPIMGVTFWAVEGSRYVQESSRLRDSAEAAAIAVTI 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + Y+ L + R D Sbjct: 63 EDRKGRADSMAEEYVRS---------YVRDIESTALTTSRDYRARDDAKGIRE------- 106 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + +R +S F+ S K L + + + I +V DFS Sbjct: 107 -YIEYTVDARTTHKSWFVSNFIPSFDRKQELAGRSVAGKEPIQPGEDT--IDIVFVSDFS 163 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYS--------SQNGKVGIRDEKLSPYMVSCNK 238 SM + ++ Y G+ P+ + + Sbjct: 164 TSMYGSKLRVLKSSIDQISIKILCESPRYDQQISSYICDDKRATGVNRIGFVPFNIRTRE 223 Query: 239 SLY----------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH-LVRDALASVIRSIKK 287 ++ L + + + ++ + + Sbjct: 224 MIWSRDGRAVSQLVYLNGDASVSTYTYNDIDWDRWRMSSKESVEKCAQNYFNCNVPQWQN 283 Query: 288 -------IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT-------------- 326 D V D + D + + +T Sbjct: 284 HQRAKRIFDVVGDYSDPDMFDYIDFDATVSDMFNDKSRSKSNFYRTTKGVRLYSGSLHAQ 343 Query: 327 --------------FAIDENEMGSTAINDAMQTAYDTI---ISSNEDEVHRMKNNLEAKK 369 + G+TA + + S + + + K Sbjct: 344 FQNILLTNDLLDLEAIQNMTAAGNTAAYQGILRGLQVLKQGQPSIDADEEVQQAYQNKNK 403 Query: 370 YIVLLTDGENTQ----DNEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCA 423 +++L+DG+ T + +C+KA+ + G+ I I + + Q C Sbjct: 404 MLLILSDGQETDERIINELVKNKMCDKAREEIPGLYIGVIGIKFSASDQNG----FKECV 459 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEI-FERVIRITK 458 + ++ +EL + + I + R+ + Sbjct: 460 ENEDEDIIDVSNLNELIEKIEELIMKGSRGLGITRLYQ 497 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 62/526 (11%), Positives = 128/526 (24%), Gaps = 88/526 (16%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K +S +G+F ++ AL P +L G DV + L+ + A +++S Sbjct: 3 INKFWRSESGNFALLFALAAPAILAAVGFAADVSSVMRAKVNLQNSLDAATLSSSHLSDD 62 Query: 71 SLEEVSSRA-KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + N+ + Sbjct: 63 EAARRLAFDGYFQANVANHPELTNAKLTLSVDKGFNYVKTKAIASADVNLYFAFLFGDNQ 122 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWL-------------------------------I 158 + + + N L + L S I Sbjct: 123 HIEVDAGGVEATNNLEVVLVLDNTGSMAGAKIKALRDATKVLLDNLDGAKSPDRKVTAAI 182 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSR-SMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 A ++ + + + + ++ F A T + Sbjct: 183 VPFVTAVNINGDKFDPSWIDMAGKSPNNGANFPLLPDGKRVNHMDLFRDLAQGTGWQGTG 242 Query: 218 QNGKVGIRDEK-----LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 G V R ++P N P + + + + Sbjct: 243 WKGCVEARPGSAAISDVAPDQADANTLFVPYFAPDDPGDATGPSASYGNDAKVYNN---S 299 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS---------DPSFSWGVHKLIRTI 323 + D + + K + A ND+ L + Sbjct: 300 YLSDDVQDGTDTKGKDKKIAKYENPKAKKINDKYAPLTVGPNRACPTPVVPLTADLDKLR 359 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD- 382 + E T +++ + + + K +VLLTDGEN Sbjct: 360 TAAAQMQEWNGSGTNVSEGLSWGMRVLSPAPPYTDGAPWKTPNTSKIVVLLTDGENVVYG 419 Query: 383 ---------------------------------NEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + +C+K K+Q ++I TI + Sbjct: 420 ASAEPEKSDYTSYGYLSSGRFGTSNQTDAARSVDRWTLDVCDKLKAQQVQIYTITLQSDT 479 Query: 410 TQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 R CA+ P ++ N +L +F+ G +++ Sbjct: 480 ---AANRTLYGKCATNPADYYAVNDPSKLPNVFQTIAGKFTTLQLV 522 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 47/441 (10%), Positives = 114/441 (25%), Gaps = 51/441 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM- 122 S+ + + A N+ T +K+ + + + ++V + Sbjct: 162 GSMDQNGKMTALKKAAANATTGLIKKLSAFNTNEGDVYISVVPFAKDVNVGTSNVGASWL 221 Query: 123 NPRKSAYQVVLSSRYDLLLN---PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 N + + + + + + ++ R + Sbjct: 222 NWSEWEAAPRILTDNSYPIKVKYNNITYEWADIGPGAPCPFDTKSNGNPRPSQQSSVSKF 281 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + L D + + P+ T + G + + + Sbjct: 282 SFACMDRPGSLSGATDLSNNSTDRYLIPSSGTYEGMICPGIDGGANYPGKTGVYYNGCYT 341 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR------------------------ 275 + + S + + Sbjct: 342 SVVDPDASIVLATGSNAQCPSNKPNCSCTGTGSSRKCVQAKYKHYWRDHPSDTTKAAAAA 401 Query: 276 ----DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 V + D N + + ++ S + + + Sbjct: 402 PAHSTWTGCVNDRDQDADTTNAVMSGSGRIYPEQWKDCLSATITPMSNQWATLNSKVNAM 461 Query: 332 NEMGSTAINDAMQTAYDTIISSN-EDEVHRMKNNLEAKKYIVLLTDGENTQD-------- 382 N G+T + + T+ ++N + N + YIV+L+DG NTQ+ Sbjct: 462 NPSGNTNQAIGLFWGWQTLNTANDPFKAPSKDPNWVYQDYIVILSDGLNTQNRWYTCPNA 521 Query: 383 ------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE--ANST 434 + +C+ K+ I I TI + ++ L +CAS S + S Sbjct: 522 GPCPTIDGREKTLCDNIKADKITIFTIQVN--INSKDPESQVLKDCASSGSGYFQLITSA 579 Query: 435 HELNKIFRDRIGNEIFERVIR 455 ++ F + + R+ + Sbjct: 580 NDTATAFDNVLNKIAKLRIAQ 600 Score = 98.5 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 37/392 (9%), Positives = 88/392 (22%), Gaps = 35/392 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +F + K G+ II L+ ++ + G VD R S A++ A +A + S Sbjct: 8 RFASLANGFRKDARGNVAIIFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAALMIS 67 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S + + N Sbjct: 68 KDAATMSD--SEITTRARQYVNSLYTNT---------------ETPIQTFSAVYTPNNGS 110 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + N + F++ +G + + T S + V+D Sbjct: 111 GATILLNAGG------NMPTYFMKIVGTNFSTLPINTASTTKWGSSRMRVAL----VLDN 160 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADR----TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + SM + + + + + K S + + Sbjct: 161 TGSMDQNGKMTALKKAAANATTGLIKKLSAFNTNEGDVYISVVPFAKDVNVGTSNVGASW 220 Query: 242 YML--YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + + + + + + + Sbjct: 221 LNWSEWEAAPRILTDNSYPIKVKYNNITYEWAD-IGPGAPCPFDTKSNGNPRPSQQSSVS 279 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + S S + + I + + + Sbjct: 280 KFSFACMDRPGSLSGATDLSNNSTDRYLIPSSGTYEG-MICPGIDGGANYPGKTGVYYNG 338 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + ++ IVL T + + C Sbjct: 339 CYTSVVDPDASIVLATGSNAQCPSNKPNCSCT 370 >gi|241113143|ref|YP_002972978.1| hypothetical protein Rleg_4788 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861351|gb|ACS59017.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 461 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 64/515 (12%), Positives = 142/515 (27%), Gaps = 134/515 (26%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + G+ I+ AL + M+ G D +R ++ TA+I A Sbjct: 19 RVFKTLRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAV 78 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++ + + + ++E N N Sbjct: 79 KEI--DTDDAVALKQKVSDWFHAQVENSYTLGDINIDTSNHKI----------------- 119 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + +R I + + + + + SY +++ VID Sbjct: 120 ----------TATASGTVPTTLMRIANIDTVDVSVASAVKGPATSY-----LNVYIVIDT 164 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SML + + + + G Sbjct: 165 SPSMLLAATTAGQATMYSGIGCQFACHTGDAHKIGN------------------------ 200 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + S+ +++ + + DA+ V+ I D+ + +++G D Sbjct: 201 -------KTYNNNYEYSTAKNIKLRADVAGDAVKDVLALIDTSDSNHQRIKVGLYSLGDT 253 Query: 306 VISDPSFSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + +T + ++ T + + + Sbjct: 254 LTEVLAPTLSTDTARNRLTDASYGLTSATSKAATYFDVSLTTLKQKVGTGGDGTASNS-- 311 Query: 364 NLEAKKYIVLLTDGENTQDNEEGI-----------------------AICNKAKSQGIRI 400 K ++LLTDG +Q A C K+Q + Sbjct: 312 ---PLKLVLLLTDGVQSQREWVTDKVTWSNNKAISGTYWNKVAPLNPAWCAYLKNQSATM 368 Query: 401 MTI------------------------------------AFSVNKTQQEKARYFLSNCAS 424 + S + T+++ Y LS+CAS Sbjct: 369 AVLYTEYLPLTSDWGYNATVGSTMASANWKGTYGGIMQSGVSTSITRRDYIPYALSDCAS 428 Query: 425 PNSFF-EANSTHELNKIFRDRIGNEIFERVIRITK 458 S F A+S+ E+ + +R+T+ Sbjct: 429 SKSLFLSASSSTEITAGLSALFTQYLAS--VRLTQ 461 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 29/315 (9%), Positives = 74/315 (23%), Gaps = 29/315 (9%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ +++ G+ II A++ ++ + G VD R AL+ A +A + Sbjct: 7 TRLTRLARRFQTDARGNVAIIFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAALMI 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 S S + + T + Sbjct: 67 SKDAATMS--ASQITTRARQYVDSLYTATDAPIQN---------------FTATYTPNSG 109 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 ++ + + + F+R +G + + T S + V+D Sbjct: 110 SGASILLSANGTMP------TYFMRVLGSNFNTLPVATSSTTKWGSTRMRVAL----VLD 159 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + SM + + Q + + + K S + + Sbjct: 160 NTGSMAQNGKMAALQSAATDMITKLSAFNTTTGDVYISIVPFAKDVNVSTSNVSASWLNW 219 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +P ++ + + + + Sbjct: 220 TEWMAEPPYLSQNGY--PNNLASLVGITTPGVWIDMTGPGSNCPFTNSSYGFSCTDRPAT 277 Query: 305 RVISDPSFSWGVHKL 319 ++ + Sbjct: 278 LSNANSASKIPSSGN 292 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 69/220 (31%), Gaps = 28/220 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT--------FFNDRVISDPSFS 313 S+L + + D DT T + ++ + Sbjct: 384 STLTQYAATNPSASVSNGGWTGCVNDRDQNFDTTNDAMTGSGIPSKLPYAEQWADCLPAT 443 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN-EDEVHRMKNNLEAKKYIV 372 + +K+ G+T + + T+ ++N + N K YIV Sbjct: 444 ITPMSNQWSTLKSQINAMTPSGNTNQAVGLFWGWQTLNTTNDPFKAPAKDPNWVYKDYIV 503 Query: 373 LLTDGENTQD---------NEEGIAICNKAKS------QGIRIMTIAFSVNKTQQEKARY 417 LL+DG NTQ+ + +C K I + +I + + ++ Sbjct: 504 LLSDGLNTQNRWTQTVSDIDARQELLCKNIKDPAQNGGNQITVFSIQVN--ISSKDPTSK 561 Query: 418 FLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L +CA+P F + + F + + R+ + Sbjct: 562 VLQDCATPGAGYFQMITQSSQTADAFNNVLATIAKLRISQ 601 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 105 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 61/429 (14%), Positives = 118/429 (27%), Gaps = 61/429 (14%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A L+ V++ + +DV L+ A+ ++ + Sbjct: 16 RRGAMLVLIAFLLVVVVCMAAFAIDVSYMQLVRSELRA--------ATDAAAKAGTLALA 67 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + + RN T +V+ +A Q +S Sbjct: 68 KTDGDAASARTAAIQAAARNKVAGRALVLTTDQVQVGRSAAQANGTWSFTANQTPYTSVK 127 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 L S S+ + + + + + I VID S SM E Sbjct: 128 ILSSMSDSTAAGSVPLFLGTFMGRGSFQPAQSATASQMEQEICLVIDRSHSMCFNMSGVE 187 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 P +T + V+ P ++ Sbjct: 188 WS-----YPPGTKTTPHTICYPPHATLSRWAALQSSVNLFMDTILETNNTPRVALITWGS 242 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + +++ + KK V A S Sbjct: 243 TIGTNTAEYSYTKKTEVAVANE------------------------------LGLSTDYA 272 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + I G T ++ + + + AKK ++L+TDG Sbjct: 273 AVKSKIAARTTKVML--GGTNMSAGIDAGRTLLNGNT--------VRALAKKTMILMTDG 322 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 + Q + I A +GI+I TI F A+ + A + ++ +++ Sbjct: 323 QWNQGR-DPIDAAEDAADEGIQIHTITFLSGS-----AQNTMRQVAEITGGKYYVSSNQA 376 Query: 436 ELNKIFRDR 444 EL + FRD Sbjct: 377 ELEEAFRDL 385 >gi|153836414|ref|ZP_01989081.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|260365465|ref|ZP_05778002.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] gi|260877530|ref|ZP_05889885.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|260897529|ref|ZP_05906025.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|260901731|ref|ZP_05910126.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|149750316|gb|EDM61061.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|308087122|gb|EFO36817.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|308090607|gb|EFO40302.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|308108829|gb|EFO46369.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|308114384|gb|EFO51924.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] Length = 461 Score = 105 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 55/499 (11%), Positives = 139/499 (27%), Gaps = 108/499 (21%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G II +P+++ V + +++AA+ A + + ++ Sbjct: 4 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 63 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + ++ ++ Y+ N + T + + + VV + Sbjct: 64 -------DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVAT 116 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + +D ++ + + K ++++R Y + V I +++D S+SM + Sbjct: 117 AEHDSWISHNEI--------GVEPKFKVSGDSITRKYLPQ-PVDIYFILDTSQSMSNPWY 167 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + V + + K P S L Y Y D Sbjct: 168 GERNKTQMQVVKDTITRV--------VKELENFKTGPDKKSRVALLTYNAYNAKFDKGAG 219 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 D +S + + + +V + A+ +N + Sbjct: 220 RVKLYDYASEFSHTE--ASFESIVDKMFD-----KSVVEQKPHYASDYNKSQD--IPLTD 270 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + I + + G T ++ +I++ ++ K + ++ ++L Sbjct: 271 KYQEFIDILNSNKVMPAR-GGGT-------QSWLGLIAAAKEADKVKKEDRNPEQVFIIL 322 Query: 375 TDGENT---------------------------QDNEEGI----------------AICN 391 +DG +T Q + +C Sbjct: 323 SDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQYDGRTYYYQVYDKFLKSLVGEHGLCE 382 Query: 392 KAKSQ--------------------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 K + + + I + N + + C + + A Sbjct: 383 SLKKRISSKENKFQSEHAKLEGEKTKVTMGVIGVNYNVQKDDG----FGECVGEKNIYHA 438 Query: 432 NSTHELNKIFRDRIGNEIF 450 + ++ K + I E Sbjct: 439 KNGKDVYKYILNLINEETG 457 >gi|28900585|ref|NP_800240.1| hypothetical protein VPA0730 [Vibrio parahaemolyticus RIMD 2210633] gi|28808965|dbj|BAC62073.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 466 Score = 105 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 55/499 (11%), Positives = 139/499 (27%), Gaps = 108/499 (21%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G II +P+++ V + +++AA+ A + + ++ Sbjct: 9 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 68 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + ++ ++ Y+ N + T + + + VV + Sbjct: 69 -------DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVAT 121 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + +D ++ + + K ++++R Y + V I +++D S+SM + Sbjct: 122 AEHDSWISHNEI--------GVEPKFKVSGDSITRKYLPQ-PVDIYFILDTSQSMSNPWY 172 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + V + + K P S L Y Y D Sbjct: 173 GERNKTQMQVVKDTITRV--------VKELENFKTGPDKKSRVALLTYNAYNAKFDKGAG 224 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 D +S + + + +V + A+ +N + Sbjct: 225 RVKLYDYASEFSHTE--ASFESIVDKMFD-----KSVVEQKPHYASDYNKSQD--IPLTD 275 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + I + + G T ++ +I++ ++ K + ++ ++L Sbjct: 276 KYQEFIDILNSNKVMPAR-GGGT-------QSWLGLIAAAKEADKVKKEDRNPEQVFIIL 327 Query: 375 TDGENT---------------------------QDNEEGI----------------AICN 391 +DG +T Q + +C Sbjct: 328 SDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQYDGRTYYYQVYDKFLKSLVGEHGLCE 387 Query: 392 KAKSQ--------------------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 K + + + I + N + + C + + A Sbjct: 388 SLKKRISSKENKFQSEHAKLEGEKTKVTMGVIGVNYNVQKDDG----FGECVGEKNIYHA 443 Query: 432 NSTHELNKIFRDRIGNEIF 450 + ++ K + I E Sbjct: 444 KNGKDVYKYILNLINEETG 462 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 51/468 (10%), Positives = 114/468 (24%), Gaps = 39/468 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + +L+ +M+ +GG +D R ++ A A++ + Sbjct: 8 RLPARLAAFGRDARASILPMVGVLVALMVVIGGAGLDYGRAIMLRASISHALDAAVLAVA 67 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L S+ S K + + D + I + Sbjct: 68 RQLSVSIMTDSELDKAIKDAFAANMASAGLSGATLGDLTYVLDPDAGTISATATALVPTY 127 Query: 126 KS--------AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + S+ + L + S + + Sbjct: 128 FIHVGGLGPENVAIAASADATYSRFDVELAMVVDVTGSMRNSMASLRTAA----QSVVDI 183 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 I S S + Q +N + + + V+ Sbjct: 184 LIPDGTKKSASKVRIALVPYSQGVNLGEYAPKVS-------------NGDAGTQNCVTER 230 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 S E F S+ + + ++ +I K+ + T Sbjct: 231 MGNEKYTDATYNYNGTSSEFFGGGSNSCASTPQMEPLTSKRNTLTSAISKLKDNGRTAGQ 290 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + + D + + YD + + + Sbjct: 291 TGIAWGW--YALSPKWSNLWPNDSVPGSYTDSDILKFALIMTDGDFNEYYDKATAQSNCK 348 Query: 358 VHRMKNN------LEAKKYIVLLTD--GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + ++ ++ G + + +C K GI++ +I F N Sbjct: 349 WQFNWSTFKWEQVCDSSYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQVYSIYFGSNA 408 Query: 410 TQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + +CAS +FF A S EL F + + Sbjct: 409 NSA--GAKVMKDCASSTKETFFMATSDSELIAAFAKIANKIQNIYLSK 454 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 65/614 (10%), Positives = 133/614 (21%), Gaps = 173/614 (28%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ ++ L +PV+ VD+ +L+QAA A + A ++ Sbjct: 25 RFRREKAGNVAVVFGLTLPVLALCFATAVDLSGIYGANRSLQQAADVAALAAGREYGRTQ 84 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA------------- 119 + + + F +E + D V Sbjct: 85 DADYLSSVSEAFFFHNAGDETRGTTQFSYDGVFREDGLTILKVTARRQLPTFFGDALMWV 144 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + S + + L L S + ++ K+ + Sbjct: 145 TGGKLDWRQFPLYAKSEIVVQNRSIELALVLDNSGSMQDRPRSGGSKSKIDIIKDAAEDL 204 Query: 180 Q------------------WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN-- 219 V+ FS S+ + ++ G+ Sbjct: 205 AKQFLSSDKGSTEEFPVQFAVVPFSSSVNVGPQYKNADWMDTQGRSPIHHENLDWGGWLS 264 Query: 220 ----------------GKVGIRDEKLSPYMVSC---------NKSLYYMLYPGPLDPSLS 254 ++ Y S Y + Sbjct: 265 GATSGGWEWIRDRGWVYTAPSSGAPMARYNGSYWTRITTGEPLTRFYVYDNARYKSQFGT 324 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF------------- 301 V++ V ++ D +D+ + Sbjct: 325 WRGCVEARPNGLAATDTAPVSSKPETLFVPTFAPDEYDDSDYGWNDYLDSGSGSPGSAKE 384 Query: 302 ----------FNDRVISDPSFSWGVHKLIRT----------------IVKTFAIDENEMG 335 + D S + S V G Sbjct: 385 AMAEQAKVAKYFDSGYSITTPSSNRSDWGPNSTCATTPITPLTKTLKTVTDAIDVMGAQG 444 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL--------------------- 374 +T I + + + + R + + K +VL+ Sbjct: 445 ATNIPHGLAWGWRLLTARPPFTEGRSHDEPDNLKVLVLMTDGNNTYNLNSGGRPLEIRDY 504 Query: 375 ---TDGENTQD-----------------------------------NEEGIAICNKAKSQ 396 T G ++ +C K+ Sbjct: 505 NRSTYGSYGYGAAYSHGSSSRKPGRIYDGTTGNAKDYSVDSYVAAMDQNVAKVCENVKAD 564 Query: 397 G--------IRIMTIAFSVNKTQQEKARYFLSNCASPNS-------FFEANSTHELNKIF 441 G I I TIAF + + + + +CAS +++A S EL F Sbjct: 565 GRKPGGTDGILIFTIAFDLRDGEP--VKKLMEDCASNGLIDASEKLYYDAQSQEELAAAF 622 Query: 442 RDRIGNEIFERVIR 455 + R+ R Sbjct: 623 QSITEQISSLRIAR 636 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 103 bits (255), Expect = 7e-20, Method: Composition-based stats. Identities = 46/285 (16%), Positives = 88/285 (30%), Gaps = 20/285 (7%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S S + + + T S G D S C + L+ Sbjct: 263 SGSKIPGKAGVFYNGCYRTVKNTIATGSGASC--GSTPHCDCSGSGRNKVCTQVLHPWRD 320 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + S+ + + D S +D F+ ++ Sbjct: 321 TDNPGAAAAAPAAPQPSAAANDGGWTGCINDRDREYDISNTAPSTGSDGTPSTK-FYAEQ 379 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN-EDEVHRMKNN 364 + + +K G+T + + + ++ ++N ++ Sbjct: 380 WKDCLPATITAMSSQWSTLKNQIDAMTPSGNTNQSIGLAWGWQSLSTTNGPIAAPGKESG 439 Query: 365 LEAKKYIVLLTDGENTQD--------------NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + YIVLL+DG NTQ+ + +C K K G+ I TI +V Sbjct: 440 YVYQDYIVLLSDGLNTQNRWYSCPPSGPCPTIDARQALLCQKVKDSGVTIFTIQVNVG-- 497 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 ++ L NCAS +F S E F++ + R+ + Sbjct: 498 SKDPLSQVLQNCASDGNFQMITSATETADAFQNILTQISQLRLAK 542 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 28/244 (11%), Positives = 60/244 (24%), Gaps = 25/244 (10%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 +G G VD R + +++ A +A++ S + S + Sbjct: 1 MGFVGAAVDYTRANAARSSMQAALDSAVLMVSKDAAANPTMTSQQ--------------- 45 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 N +++ FT V S+ + F++ G Sbjct: 46 ----ITNAVQRYFTSLYNDKSAFGVTVSATYTPSSSSAAAKILASGQGAIQTDFMKIAGF 101 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 T + + + + V+D + SM + S Q + Sbjct: 102 PQLSFGTSSTSTW------GNSRMRVALVLDNTGSMSSNGKMSALQRAAKDMIDSLSAFA 155 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + I K S + + +P + + S Sbjct: 156 KKTGDVYISIIPFSKDVNVDTSNYNAAWINWAEWLGEPPVLDPASSYGGSKPSNWDDIVE 215 Query: 274 VRDA 277 + Sbjct: 216 DSNC 219 >gi|148974876|ref|ZP_01811856.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] gi|145965385|gb|EDK30634.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] Length = 418 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 51/448 (11%), Positives = 130/448 (29%), Gaps = 61/448 (13%) Query: 32 VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 +M+ + + + L +AA+ A + + +N+ + + ++ Sbjct: 1 MMVIFMAFSMQMSQQMLAHTRLLEAAEVASLAL-------IASPREDEENNVKYARYLVD 53 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 Y++ N ++ +T A S + V +++Y + + L+ Sbjct: 54 RYVVDNTDDVDVAVYTSICEYKDGCVQASGELAPFSDFVVRATAKYTSWIAYEDVNLKP- 112 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + V+R Y + V + ++ DFS SM + + + L+ + R Sbjct: 113 -------EFSVSGRAVTRKYLPQ-PVDVYFIGDFSGSMGNPWK-NGKMKLDVVKETIKRV 163 Query: 212 VKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 V N + R +P V + + + G + + + + Sbjct: 164 VDDIEEFNSEEKSRVALLGYNPLHVKQSNEIVRLNAYGYRASWRKKHAYDYARNSPATTV 223 Query: 270 K--KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-ISDPSFSWGVHKLIRTIVKT 326 + + S + + F+D D + ++ Sbjct: 224 RRMFDEPTLYNEIIEPSHGMSRYEVERLYKRNNDFDDYFKFYDIPLTEDYDNFRAQLMSA 283 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T+ + + A + ++ ++L+DG ++ Sbjct: 284 ---QLKAGGGTSSWNGIIAAAQEANKATSL---------NPEQVFIVLSDGSDSDKTYLQ 331 Query: 387 ----IAICNKAKS-------------------QGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +C K +S + + I + +A +C Sbjct: 332 KLVDQGLCKKLRSTISAKRNRFQSNAPTEAEKTKVTMGVIGI----NYRVQASDGFGDCF 387 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + A ++ K + I E + Sbjct: 388 GKKNIYHAKDGEDVYKYILNLINEETGK 415 >gi|170744425|ref|YP_001773080.1| hypothetical protein M446_6382 [Methylobacterium sp. 4-46] gi|168198699|gb|ACA20646.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 482 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 53/467 (11%), Positives = 111/467 (23%), Gaps = 55/467 (11%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 +G+ +I A + ++G+ G+ VD + L AA + L + + Sbjct: 20 SGNVSLIVAFSLIPLIGMIGLGVDYGMAVGAKTKLDHAA------DAAALAAVVTAKAYV 73 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A N+ + I + N +++ ++ ++ Sbjct: 74 AANARNWNVWDIAVAEGQARAANAFAVNAGSVPFTHFALDPIQLTRSGQTFEATVTYTAT 133 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + + F GI++ + +A A T SY + +ID S SM Sbjct: 134 VS----NNFGPLFGIRTTAVSGRAVATTSVPSY-----LDFYLLIDVSGSMGLPSTTDGQ 184 Query: 199 QPLN--------------CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 L C Y V Sbjct: 185 AQLAALNRIDFFKVYQQGCQFACHFPGFVGYDLAVFNNIQLRSGAVNVAVCNLIKRAAQP 244 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +L+ + + + + DT D Sbjct: 245 EVANQYRVGLYPFITQMGTLQDLTADTSALNLKAGCAASNPMVFTQLLDTGATQLDSNGD 304 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI--NDAMQTAYDTIISSNEDEVHRMK 362 S + +++ T + GST I + D + ++ + Sbjct: 305 PSTGVGSGGTHFETSLTSMLATIKANGYGDGSTQIKPKPFVFLITDGMQNNQWYSIQING 364 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGI------AICNKAKSQGIRIMT-------IAF---- 405 + + C+ + G+ + I F Sbjct: 365 KRYYSGSPSKFSAYPDANWSPGGSDPVPMDWGYCDTLRQAGVTVSVLLIPYIKIDFTYVK 424 Query: 406 -------SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + CASP F A++ +++ Sbjct: 425 SDIADENNKVNGFSSGLPDVARQCASPGYFQMADTPEAIDRALDAMF 471 >gi|262164956|ref|ZP_06032694.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] gi|262027336|gb|EEY46003.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] Length = 403 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 39/442 (8%), Positives = 121/442 (27%), Gaps = 62/442 (14%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 ++ P M+ + + + + L +A++ A + + +N+ ++ + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL-------IASPKEDDENNVSYAR 53 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 + ++ Y++ N ++ R + + + V ++++ ++ ++ Sbjct: 54 KVVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYENIS 113 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 L+ + + + V + +++D S SM + ++ + Sbjct: 114 LK----PEFTVNGSSVTRKFLPQ-----PVDVYFIVDMSASMRATW-QNGKSQIDEVKEV 163 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 R V + +V R L+ + + + + + ++ + Sbjct: 164 ITRVVNDLKGFDTEVKSRVSLLAYHNYNIKQGSRTLEAYDYALYNTPQQTVSNMFLPPKR 223 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + ID + + + Sbjct: 224 VTPSDSA------LFSHRD-IDLTQNYSSFLQIMNDRNFYPPK------SACTESWQGII 270 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A + +T IN + + + + R L T +N Sbjct: 271 AAAQAADKATDINP--EQVFIILSDGADCSWQRQDR-----WGRWLTT--KNYLKKLVDS 321 Query: 388 AICNKAKSQ-------------------GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +C K + + + I + + +C + Sbjct: 322 GLCKNLKQRIRQKPNRFQSSTPTENEKTKVTMGVIGVNYQVNPNDG----FGDCVGRENI 377 Query: 429 FEANSTHELNKIFRDRIGNEIF 450 + A ++ K + I E Sbjct: 378 YHATQGEDVYKYILNLINEETG 399 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 61/516 (11%), Positives = 136/516 (26%), Gaps = 113/516 (21%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +F KK +S G ++ A LM +++ GM +D A+ A Sbjct: 11 QFSDQFKKFNRSEDGVVAVLVAFLMVLLIVFAGMAIDFGLGFNTR--------RAVNQAL 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++ S + S I++Y N +N+ + V +P+ Sbjct: 63 DAAVLAVANKLSTTELSSNTVDSLIDQYFEENLKNS-------VGGDVVHTKPVVTYDPK 115 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 ++ P+ L S + + + E V + Sbjct: 116 GDTVAATATATVKTSFLPVLKLLNSESGDFGELTVTSSSTARFPKTKVEVAVVVDVTGSM 175 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S S+ + + L+ + ++S + KL + + + Sbjct: 176 SGSIGSLKT-ASRDMLDTLLPDDNTRLQSRVRISYVPYNVGVKLD---KTLARKATFEKS 231 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + + + D + + ++ Sbjct: 232 QYGCVHARVRDLAYSGENHDYEDEDDDERVDYIGT--------------------NYSWC 271 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH-----R 360 + RT +++ +TA + + T+ Sbjct: 272 PNAQMVPLTN----DRTKIESSINALRASSATAGQIGIAWGWYTLSPEWRGFWPTESKPD 327 Query: 361 MKNNLEAKKYIVLL---------------------------TDGENTQDNEE-------- 385 +N +KY VL+ +D +N+QD + Sbjct: 328 FYDNNGVRKYAVLMTDGSFNAYYAADYSKADAEHKKLIKNKSDVQNSQDPMDSGKLDADD 387 Query: 386 -------------------------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +C K + I I T+ F + K + + Sbjct: 388 HKKIASKVKWEYDYSSSLSGVPFKTASNLCKNMKKEDIVIYTVFFGSDY----KGKKIME 443 Query: 421 NCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 CAS +F+ A + L + F + + + Sbjct: 444 ECASNSETFYHATNQSALIQAFSSIANDIKSIYLSQ 479 >gi|238897540|ref|YP_002923219.1| putative tight adherance operon protein G [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229465297|gb|ACQ67071.1| putative tight adherance operon protein G [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 450 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 49/447 (10%), Positives = 132/447 (29%), Gaps = 44/447 (9%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 K+ G F+ ++ V L + + R + +Q + + Sbjct: 24 FFKNQKGAIFVFFLIMFSVFLILIYATFEYCRLINEKIKTEQVLEQTSL----------- 72 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 +++ + +L + + + T + + + N K Y + L Sbjct: 73 ILTAEDNGDTSGTPGDRNRFLALSHIESFIPSTT--ATNESIEIISPNQNDLKRKYTITL 130 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + Y + N + + KA A+ ++E + + +V F R ++ Sbjct: 131 RNAYSSIFNL---------FSTMSVSGKAAAQKA----YQEEPMDVVFVFSFYRHTIENP 177 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + + ++ ++ + P+ + Y L P Sbjct: 178 LRMSWETRQEINELKEVLIEKINTILEMNPLNKVGFVPFSWGVKEGRYC-LQPFVFLFKP 236 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + ++ + ++ ++ ++ V + D+ + ++R+ + Sbjct: 237 NYDNMNFGPAQFKAPRQTEVISKSIDFVKTVNNIPFAMTDSNVISLASLDNRLCGLSLYR 296 Query: 314 WGVHKLI-RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED-----------EVHRM 361 + + + DE G I+ + + + E M Sbjct: 297 DPIPLTREKNEIFMIIKDELIPGYNLISSGILSGVKLLSRGTNSKKLLIIISDTEEAPPM 356 Query: 362 KNNLEAKKY--IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 N + K+ I + N ++ +C K K+ GI++ I+ + + Sbjct: 357 AINDDNKRMLRIRFNNNTLNISESLIKAGMCEKIKNSGIKMYFISTGGSIESHNYWKT-- 414 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIG 446 C + N+ + K G Sbjct: 415 -QCIGEKNMHNLNNVDDELKQILQASG 440 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 53/398 (13%), Positives = 105/398 (26%), Gaps = 64/398 (16%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + G I + ++L GGM VD +R L+ A++ A+ Sbjct: 3 RITTRAAALRRDERGGMIIFGLFVFLLLLLAGGMAVDFMRTETARGRLQATLDGAVLAAA 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 R + + + + Sbjct: 63 DLDQDKDPVEVVRDYVAKAGLDPFLIDVDVTEIAGQRI---------------------- 100 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 V S++ D+ ++ F++ +GI ++ A + + + V+D Sbjct: 101 -----VTASAKSDVTMH----FMKMVGIDFLPAPARSTASEAVSN------LDVSLVLDM 145 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SM + D F T+ + PY + Sbjct: 146 SGSMEGDKLDQLQAAAKNFVGIVYDTMGAEKILLNV--------VPYATQVAAPAGLLDM 197 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G S + V S+ + D + Sbjct: 198 LGAFLREHSYSNCVSFSAADFTETSILEAAA-----LPQGGHFDPFYTWGPLRYDDVTFV 252 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS------------- 352 DPS + ++ + G+T+I+ M+ I Sbjct: 253 CNPDPSTEVLTLASTQREIEDYIDGLVAEGNTSIDVGMKWGAALIDPDLGSTLNEFANGP 312 Query: 353 -SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + + K IVL+TDG+NT + + Sbjct: 313 SAAGINPVALWGDRSTDKVIVLMTDGKNTTEYRLPATL 350 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 39/293 (13%), Positives = 76/293 (25%), Gaps = 21/293 (7%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-----KLSPY 232 + +V + S S + + + + K G Sbjct: 248 DVTFVCNPDPSTEVLTLASTQREIEDYIDGLVAEGNTSIDVGMKWGAALIDPDLGSTLNE 307 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL-VRDALASVIRSIKKIDNV 291 + + D S + + + + + I + V Sbjct: 308 FANGPSAAGINPVALWGDRSTDKVIVLMTDGKNTTEYRLPATLGTWSDVYIDDARDEFWV 367 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 T A N R +D F R G ++ + Y+ + Sbjct: 368 RQTDDRDADDDN-RYSNDTWFWVDGRVAGRNNFWRRDNRFPHGGGRIEDNVRRLTYEELF 426 Query: 352 SSNEDEVHRMKNNL---------EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 S + ++ + + L + N AIC AK+QG+++ T Sbjct: 427 SRVSVYYNAYYHHYLQNFDRTELDTWYWDFLDMSLSTSAKNARLEAICTAAKNQGVQVFT 486 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANS-THELNKIFRDRIGNEIFERVI 454 + F V + +CAS + F S +L F R+ Sbjct: 487 VGFEVEDD----EAIIMEDCASSRAHFFRVSGGGDLTTAFESIARQITELRLT 535 >gi|307256565|ref|ZP_07538346.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864975|gb|EFM96877.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 531 Score = 100 bits (249), Expect = 5e-19, Method: Composition-based stats. Identities = 53/524 (10%), Positives = 146/524 (27%), Gaps = 93/524 (17%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSRAKNSFTFPKQK---------IEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 ++ + S + + + + L + D+ + T Sbjct: 67 NSGRKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTNDDKMNLIPICKTV 126 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHGV 177 + + ++ + S F +G A + + Sbjct: 127 NNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTLEVIPQQVNVASQSRAFKKNTFNIPI 186 Query: 178 SIQWVIDFSRSMLDYQRDSEGQ----------PLNCFGQPADRTVKSYSSQNGKVGIRDE 227 + V D S SM + + ++Q+ ++ + Sbjct: 187 DLMVVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTPF 246 Query: 228 KLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKK---KHLVRDALASVIR 283 L + + + +L Y D L ++ + + + + + + Sbjct: 247 ALGAEIDASSCALPYHWDKGKDPDTPLKIKNILSKKEGNNQQSRAEFIEHLVYKMNTKAT 306 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + N V F + + + + W + ++ G+T + + Sbjct: 307 LDNVGEKQNYKVTFPKGNFCLKNMKNKNHGW-FSRKDKSDFTNHVRSLRADGATLASSGV 365 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------------------------- 378 A + +I D + + K+ I++L+DG Sbjct: 366 LVAANNMI---RDGSRTEQLKEQTKRVILVLSDGNDEIVKGDPNNKVPFLNYTRITENLI 422 Query: 379 ---------------------------NTQDNEEGIAICNKAK--------SQGIRIMTI 403 +TQ E +C + + +I+ + Sbjct: 423 YGRQEVFLSQKQKISLSHSTIYTYLTTDTQPKETTNGMCEVIRNKLDTLNGDKNTKIVFV 482 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 F + ++ +C +++ AN L F+ IG Sbjct: 483 EFGY----KSTSKQAWEHCVGNGNYYSANDKESLLNSFKQAIGE 522 >gi|303252178|ref|ZP_07338346.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302648961|gb|EFL79149.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 532 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 53/526 (10%), Positives = 143/526 (27%), Gaps = 96/526 (18%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSR------AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR----DIVRDT 118 ++ + +N ++ + + +K V T Sbjct: 67 NSGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMATKFVKAFLPQTNEEKMHLTPVCKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHG 176 N + ++ + S F +G A + Sbjct: 127 INNTNGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPHEVNVASKSKAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQ----------PLNCFGQPADRTVKSYSSQNGKVGIRD 226 + + V D S SM + + ++Q+ ++ + Sbjct: 187 IDLMVVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTP 246 Query: 227 EKLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 L + + + +L Y D L ++ + + + + + + + Sbjct: 247 FALGAEIDASSCALPYHWDKGKDPDTPLKIKNILSKKEGNNQQSRAEFIDNFVYKMNTQA 306 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSW----GVHKLIRTIVKTFAIDENEMGSTAIND 341 + + F V + + ++ + G+T + Sbjct: 307 TL--DNIGEKQNYKVTFPKGVFCLKNMKNKNHGWFSRKDKSDFTGYVRSLRADGATLASS 364 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE----------------------- 378 + A + +I + + K+ I++L+DG Sbjct: 365 GVLVAANNMIKGGSRTK---ELKEQTKRVILVLSDGNDEIVKGDPNSKVPFLNYTRITEN 421 Query: 379 ----------------------------NTQDNEEGIAICNKAK--------SQGIRIMT 402 +TQ E +CN + + +I+ Sbjct: 422 LIYGRQEEFLSGKKKVSFGNTTIETYLTDTQPKEVTNGMCNVIRDKLDKLNNDKNTKIVF 481 Query: 403 IAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGN 447 + F K++ +C +++ AN L F+ IG Sbjct: 482 VEFGY----ASKSKEAWKHCVEGDKNYYSANDKASLLNSFKQAIGE 523 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 99.7 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 60/549 (10%), Positives = 135/549 (24%), Gaps = 118/549 (21%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +I L + ++ + G D R++ AL+QAA +A++T + + +S ++ + Sbjct: 1 MIFGLSLMPVMLMLGATADYTRFTTTRAALQQAADSAVLTVASKMTESTTNAQAKDQAQV 60 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL---- 139 Q I + T + + + + + S +L Sbjct: 61 VLNAQPRMTTAIVTGATVSEDKRTVCATAKVTIQNSFMQMAQLATLTPTVKSCANLAGGA 120 Query: 140 -LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + L S + +++ V + + S Sbjct: 121 DPGTTYEIALVLDNSGSMNSSSDGQSKISILKSAANSFVDTMFSKSNNVKFSVVPFSSGV 180 Query: 199 QPLNCFGQPADRT--VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 ++ + V + + K + + +L Sbjct: 181 AAVDPSEPSSRNAAWVDKNGANSQHWIAFGGKTAANAAGFTSRFDIFDKLKARNSALDWR 240 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND------------ 304 + + ++ D +++ G ++N+ Sbjct: 241 GCFEPQVYPLNVNDTTPNPSDAETLFVPFLAPDEPDNSGWGGNPYWNNYFGDNPSACSSS 300 Query: 305 --------------RVISDPSFSWGVHKLI--------------------RTIVKTFAID 330 S+G ++ V+ Sbjct: 301 ASGAWARLSRACKYNATGSLGGSFGPSDFKGSSSFCPDPGTQRILQLTQKKSDVQNKINQ 360 Query: 331 ENEMGSTAINDAMQTAYDTIIS-------------------------------------- 352 G+T +++ + T+ Sbjct: 361 LVANGATNLHEGFMWGWRTLSPNAPFSGGRAYQAPKNRKIMVFMTDGFNSWNSRVNTATG 420 Query: 353 ------------SNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------NEEGIAIC 390 E+E + Y LL N +E C Sbjct: 421 STYDTLGYYSYNGAENERFPDGSQGNGVNYRSLLAAAANNSSSYQTISRAMQDELTRQAC 480 Query: 391 NKAKSQGIRIMTIAFSVNKTQ-QEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNE 448 AK+ GI + TI FSV+ + + CA+ + +F+A +LN F Sbjct: 481 TNAKTAGIEVFTIGFSVSGDPIDAQGLALMKECATNEDHYFKAEDASQLNAAFSQI---G 537 Query: 449 IFERVIRIT 457 I +R++ Sbjct: 538 IGLGKLRLS 546 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 99.3 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 26/307 (8%), Positives = 71/307 (23%), Gaps = 34/307 (11%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L G I + ++ ++L + G VD++ L+ A++ A+ Sbjct: 44 RLTHFIADLAGDEEGTMTIFSTFMLVLILVITGASVDIMYQEAIRARLQATLDRAVLAAA 103 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-- 123 Q + + + + + DR V T Sbjct: 104 DLDQQQDPVAVVNDYVTKAGLVEHLTDVIA-------TPGLYDRTVAADAGLTLDTYFLR 156 Query: 124 -PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 V+ +S + + + + L S + + ++ Sbjct: 157 MSGWQTLPVIAASTAEERIANVEISLVMDISGSMRWNNRITNARNAAKDFVTKVLTEDSA 216 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + +++ + + G+R + N Sbjct: 217 GVTTLNLIPFAGQVN---------------PGDVMFDYFRGVRPKIQQGNNGWGNGD--- 258 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 PG + + E+ + + + +F Sbjct: 259 QDAPGGSLCTNNAENADEGAIDPSCTDGTDTAATDTGGFF------SPWPQPIANIVVYF 312 Query: 303 NDRVISD 309 + Sbjct: 313 DTNGDDI 319 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 39/304 (12%), Positives = 75/304 (24%), Gaps = 74/304 (24%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G+ R ++ + G + + + F+D + +A+ Sbjct: 369 GQEKNRYFQVKGDGNGPFSDIGPTKNNGKIPGDVIDFAFIDYDAWAAYYVAPVAPVEAVN 428 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRV-------ISDPSFSWG----------VHKLIRT 322 + S + +D V WG + Sbjct: 429 VNMPSSCVEIYDTEFTNTDLPQSDDYVPHFMFWPYVREVMDWGWCPGEDTAIQYYSDDAA 488 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE----------------VHRMKNNLE 366 + F + T I ++ A + + + E Sbjct: 489 TLSAFIDNMRMHDGTGIQYGLKYALALLDPATGSAVTELISAGLVDSRFLGRPIAWEDEE 548 Query: 367 AKKYIVLLTDGE-----------------------------------NTQDNEEGIAICN 391 +K+IV+++DG N DN C Sbjct: 549 TEKFIVVMSDGAVTDQYRPVDPFAPLNGETELQTQGSGSYTTFSTRGNNLDNLHTQ--CQ 606 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A+ G+ + +AF + L CAS +S F E+ F Sbjct: 607 LARDLGVTVFAVAFETT----DADADELRLCASSDSHFFHVQGTEIIDAFDTIARQINNL 662 Query: 452 RVIR 455 R+I+ Sbjct: 663 RLIQ 666 >gi|171921010|gb|ACB59193.1| TadG [Actinobacillus suis ATCC 33415] Length = 554 Score = 98.9 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 54/410 (13%), Positives = 122/410 (29%), Gaps = 47/410 (11%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+ I+ +G + +I LL ++ + + ++ + L + + A+++ S Sbjct: 9 IKRFIQDESGVYAVIGGLLALPIVALMFVSLESAGIIQDKARLSDSLEQAVLSLSAENNS 68 Query: 71 SLEEVSSR-----AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR----DIVRDTAVE 121 + + A+N P KI E + ++ + + V T + Sbjct: 69 GRKSNDYKLSNTDAENGHFNPNSKIGERDLEISKSFVTTYLPQTDPNKIKLQPVCTTTDK 128 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHGVSI 179 N + + S F +G A + + + Sbjct: 129 KNRQGHTASTETICTVAGTIEHKSWFPLKVGSTEVIPTEVNIASNSKAIKKNTISIPIDL 188 Query: 180 QWVIDFSRSM----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 D S SM D + ++ S+ N ++ + L Sbjct: 189 MVAADLSGSMRYDLENRYEPKDGTSKIDILKAVLTELSSNSLFSQESNDNNRIAVSPFAL 248 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD------------- 276 + +L + L + ++ + ++ I K +L + Sbjct: 249 GAEYSTTECTLPFALKNNNRTINYTKSLGIPTTENVQDIIKNYLTKSGSSNSQLSRAIFT 308 Query: 277 -------ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + SI +D V A D+ + + + TF Sbjct: 309 QSLVTQIDVTNTLSSIGSLDKVGLKFPKNAYCLGDKNRNQH---QWFTREEQDKFSTFVN 365 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +GST + A D ++ K E K+ +++L+DG + Sbjct: 366 SLEAIGSTFAGSGLLAAADKMLKETSRTQ---KLGEETKRVLLVLSDGND 412 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 98.5 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 64/457 (14%), Positives = 126/457 (27%), Gaps = 66/457 (14%) Query: 10 YSKKLIKSCTGHFFIIT-ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA-SVP 67 + +K G I ALL P + G + + Sbjct: 6 FGSNALKKQQGSMTISFLALLFPALFIAVG----------------------TLMVSAQV 43 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 ++ + ++ + ++R ++ K + + Sbjct: 44 MVSNRAAQAADSAALACAFADTATLPMMRAYQEYYKPTLKGVSGLEPEIIGS-------- 95 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 L+PL + + + T ++V S + V+D S Sbjct: 96 ----ECRISLGYSLSPLLPNFQYESYATKVTATGGGYKSVVESKQSSIPTELVLVLDVSG 151 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 SM Q L A T++S S+ + + P+ L Sbjct: 152 SMGS-----NIQSLKSILSNALNTIQSQSNNANDLDSVSISIVPFDSGVAAQRPPWLSKE 206 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + A + +K + Sbjct: 207 AAGIYCIDGLNY---RNGNFSAALTVDNLATLHSQQPVKFAKPNG---------WLSDCN 254 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH-RMKNNLE 366 R V+ G T + +I S + + E Sbjct: 255 QSSPMLPLTSVFSR--VRNSINSLTANGGTRSFHGLLWGVRQLIPSWQQAWGINVSTVPE 312 Query: 367 AKKYIVLLTDGEN---TQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNC 422 ++ +VL TDG + T D C A +Q GI + I + V+ ++ + C Sbjct: 313 TRRKLVLFTDGADEGDTFDQLVNAGFCTTAINQYGIEMNFIGYGVSSSRIAQ----FERC 368 Query: 423 AS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A P+ F A +T +LN+ F D + E + ++T+ Sbjct: 369 AGNPSRVFSATNTTQLNEYFSDILAVEYSATI-KLTR 404 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 98.5 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 71/496 (14%), Positives = 148/496 (29%), Gaps = 111/496 (22%) Query: 2 VFDT-KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 +F + + + L++ G + A+L+PV+LG+ G+ +D L+ Sbjct: 14 LFRSARPMRALSALLRRQKGSMATLVAVLLPVVLGLVGLGIDSGMLYLSHSRLQA----- 68 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + ++ + + + +Y+ N+ T V TA Sbjct: 69 --AVDAAALAGSLQLPYDPQLDKGLVRGAVTQYMDANYPEASLNGVTPGTEERSVTVTAT 126 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 P +F+ ++GI S + KA A + + + Sbjct: 127 ATVPT--------------------IFMNALGIGSSEVHAKATA--------GYNKLEVV 158 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +VID S SM + + S + G V R + P V Sbjct: 159 FVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKVGLVPFRGKVHLPAGVDGL-PD 217 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 G L+PS E + +S ++ ++ Sbjct: 218 GCRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNT----------------------- 254 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + + ++ I K + T I++ ++ + Sbjct: 255 --CTSIPRVQGLTEDRETILTAISKQ--NGLGDASGTVISEGLKWGRHVLTPEAPFTEG- 309 Query: 361 MKNNLEAKKYIVLLTDGENTQD------------------------------------NE 384 + + +K I++LTDG+ N Sbjct: 310 -SSAKDIRKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNA 368 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFEANSTHELNK 439 + KAK GI + I F + + AS + +++A S ++++ Sbjct: 369 AMLEEARKAKEAGIEVFAIRFG---DSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDD 425 Query: 440 IFRDRIGNEIFERVIR 455 +F+ IG ++ R++R Sbjct: 426 VFKK-IGRQLGWRLLR 440 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 98.1 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 64/341 (18%), Positives = 117/341 (34%), Gaps = 32/341 (9%) Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + ++ + T+ E + K G I I + +M Sbjct: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + +N T G +S Y + + Y + Sbjct: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 Query: 256 EHFVDSSSL---------------------RHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 V +S I K + ++ +++ +K I +VN+ Sbjct: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 V+ G F++++ WGV L R I ST ++ AY+ I Sbjct: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQ 274 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGEN--TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 H + KK IV +TDGEN T+++++ + CN+AK +G + I + Sbjct: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-----RV 329 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 ++ FL CASPNSF+ + H + F + + +R+ Sbjct: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 39/276 (14%), Positives = 97/276 (35%), Gaps = 12/276 (4%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + + G I+TA+ +P++ V GM+++V + + L +++ A+ ++ Sbjct: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA--VEMNPRKSA 128 + + + I+ +F N L+ N ++ DIVR T+ + + P+ Sbjct: 68 EGNGNNRKKLKGGDILCR-IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 Y + SRY + L + + ++ + + + + + + V+D SRS Sbjct: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK---VNSQTDARLDMMIVLDVSRS 183 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M + DS ++ + + ++ + L + + Sbjct: 184 MESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 L + S+ ++ A + Sbjct: 243 LQRKIKYLSKFGVSTN-----STPGLKYAYNQIFDM 273 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 97.8 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 56/435 (12%), Positives = 121/435 (27%), Gaps = 63/435 (14%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + I + + + + +I + + E D V Sbjct: 33 ANTKKAIAGVISAAMIVTVALVAIPAQAAGNMIADPTTFTQWEQGIGEPTDPRSTGRVWT 92 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + S +V L N +++ + G + + A+ + + + I V Sbjct: 93 DKSVSTQEVTLK---TYDDNHVTVAPK-DGSFLVGLSAMSSAQKLIGVSNVTKPLDIVLV 148 Query: 183 IDFSRSMLDYQRDSEGQPLN-CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + + + + + Y +G + + + S + Sbjct: 149 LDTSGSMAWGMDGDDEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRH 208 Query: 242 Y-MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI-----KKIDNVNDTV 295 Y D + F L + + ++ A+ I K D Sbjct: 209 YVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKN- 267 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + V + + + L T D G+T + MQTA + ++ Sbjct: 268 RIGLVTYASDVNTRSGLTDSLSGLKST-----VDDLKASGATRADLGMQTANTVLGNA-- 320 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFSVN 408 +A K ++ TDG+ T+ N + I K+ G + ++ Sbjct: 321 --------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTG 372 Query: 409 KTQQEKARYF-----------------------------LSNCASPNSFFEANSTHELNK 439 L A ++++ A S + Sbjct: 373 ANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLN 432 Query: 440 IFRDRIGNEIFERVI 454 + I +E+ Sbjct: 433 AVFNTIWSEVSSNPT 447 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 97.8 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 77/603 (12%), Positives = 155/603 (25%), Gaps = 163/603 (27%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + K G+F +I AL++ +L G + VD + ++ + A + Sbjct: 36 RFGKDERGNFAMIAALVLVPLLLAGMVAVDTANLMRVRNNVQASLDAAALAVGKRFSTGE 95 Query: 73 EEVSSRAKNSFTFPKQ--KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE--------- 121 + + F + I + TD++V+ T Sbjct: 96 SHTVVQDYGARIFYANVTALSADAINFQIAFPQDKTTDQQVQATAAFTYKSLFGVVASRL 155 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ-------------------TKA 162 +Q L++ L N + + L KS + Sbjct: 156 TGDNWDKHQYTLTASVRLK-NTIEVALVLDNSKSMDETRSGSSKKRIDLLKDAASQLVET 214 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN----------CFGQPADRTV 212 A + + E V V F+ S+ + ++ F P Sbjct: 215 MASQSALITYVEKPVQFSLV-PFAGSVNVGPQYLNAAWMDPEGKSSVNLENFTLPVTIDA 273 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKS------LYYMLYPGPLDPSLSEEHFVDSSSLR- 265 + R K+ N LY L +P L+ + V+S Sbjct: 274 SRKIEEKPAGSGRYYKVGTGWGDRNNKPFSRAELYADLTRRSSEPWLAWQGCVESRPGPF 333 Query: 266 --------------------------HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 +V + +++ L S + + + Sbjct: 334 ALDVTPPSDNNPDTLFVPMFGPAEYYNVDSRGNVISTVLNSWWQDDMSLAYSPRQSDLKK 393 Query: 300 TF--------------------FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + ++ + + + V+T G T + Sbjct: 394 YYLRDSLDKIYRKGRSEGGGPNYSCTTLPLTPLTDVTTEQGMKTVQTAIKAMVPNGGTNV 453 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT---------DGENTQD-------- 382 +AM + TI+ R K +++LT DG Sbjct: 454 PEAMAWGWRTIVQGAPFTEARASTERGNDKVVIVLTDGANTYYKYDGLAGSGPDRAGNLS 513 Query: 383 ---------------------------------------NEEGIAICNKAKSQGIRIMTI 403 N +C+ AK+ I +MT+ Sbjct: 514 YYSTHGYTARITKKYSQSRLFQESGVSVSQNNTTYTKALNARFAKLCDNAKAANIIVMTV 573 Query: 404 AFSVN--KTQQEKARYFLSNCASPNSF----------FEANSTHELNKIFRDRIGNEIFE 451 A +N + ++ L +C+S + F ++ EL++ FR Sbjct: 574 ALDLNEANSTEKAQIDLLRSCSSNSRVRMEGGKPAKLFWNSTGGELSETFRQIGDELSNL 633 Query: 452 RVI 454 R++ Sbjct: 634 RLV 636 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 97.8 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 71/488 (14%), Positives = 144/488 (29%), Gaps = 110/488 (22%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 L++ G + A+L+PV+LG+ G+ +D L+ A Sbjct: 2 RALSALLRRQKGSMATLVAVLLPVVLGLVGLGIDSGMLYLSHSRLQAA-------VDAAA 54 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + ++ + + + +Y+ N+ T V TA P Sbjct: 55 LAGSLQLPYDPQLDKGLVRGAVTQYMDANYPEASLNGVTPGTEERSVTVTATATVPT--- 111 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +F+ ++GI S + KA A + + + +VID S S Sbjct: 112 -----------------IFMNALGIGSSEVHAKATA--------GYNKLEVVFVIDNSGS 146 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M + + S + G V R + P V G Sbjct: 147 MKGTPIQQTNSAASQLVELIMPEGMMTSVKVGLVPFRGKVHLPAGVDGL-PDGCRNADGT 205 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L+PS E + +S ++ ++ + Sbjct: 206 LNPSWLHEEYFKTSYRYPSGSSLNVPKNT-------------------------CTSIPR 240 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + ++ I K + T I++ ++ + + + + Sbjct: 241 VQGLTEDRETILTAISKQ--NGLGDASGTVISEGLKWGRHVLTPEAPFTEG--SSAKDIR 296 Query: 369 KYIVLLTDGENTQD------------------------------------NEEGIAICNK 392 K I++LTDG+ N + K Sbjct: 297 KVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARK 356 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFEANSTHELNKIFRDRIGN 447 AK GI + I F + + AS + +++A S ++++ +F+ IG Sbjct: 357 AKEAGIEVFAIRFG---DSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKK-IGR 412 Query: 448 EIFERVIR 455 ++ R++R Sbjct: 413 QLGWRLLR 420 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 97.4 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 56/435 (12%), Positives = 121/435 (27%), Gaps = 63/435 (14%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + I + + + + +I + + E D V Sbjct: 19 ANTKKAIAGVISAAMIVTVALVAIPAQAAGNMIADPTTFTQWEQGIGEPTDPRSTGRVWT 78 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + S +V L N +++ + G + + A+ + + + I V Sbjct: 79 DKSVSTQEVTLK---TYDDNHVTVAPK-DGSFLVGLSAMSSAQKLIGVSNVTKPLDIVLV 134 Query: 183 IDFSRSMLDYQRDSEGQPLN-CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + + + + + Y +G + + + S + Sbjct: 135 LDTSGSMAWGMDGDDEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRH 194 Query: 242 Y-MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI-----KKIDNVNDTV 295 Y D + F L + + ++ A+ I K D Sbjct: 195 YVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKN- 253 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + V + + + L T D G+T + MQTA + ++ Sbjct: 254 RIGLVTYASDVNTRSGLTDSLSGLKST-----VDDLKASGATRADLGMQTANTVLGNA-- 306 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFSVN 408 +A K ++ TDG+ T+ N + I K+ G + ++ Sbjct: 307 --------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTG 358 Query: 409 KTQQEKARYF-----------------------------LSNCASPNSFFEANSTHELNK 439 L A ++++ A S + Sbjct: 359 ANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLN 418 Query: 440 IFRDRIGNEIFERVI 454 + I +E+ Sbjct: 419 AVFNTIWSEVSSNPT 433 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 97.4 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 56/435 (12%), Positives = 121/435 (27%), Gaps = 63/435 (14%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + I + + + + +I + + E D V Sbjct: 33 ANTKKAIAGVISAAMIVTVALVAIPAQAAGNMIADPTTFTQWEQGIGEPTDPRSTGRVWT 92 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + S +V L N +++ + G + + A+ + + + I V Sbjct: 93 DKSVSTQEVTLK---TYDDNHVTVAPK-DGSFLVGLSAMSSAQKLIGVSNVTKPLDIVLV 148 Query: 183 IDFSRSMLDYQRDSEGQPLN-CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 +D S SM + + + + + Y +G + + + S + Sbjct: 149 LDTSGSMAWGMDGDDEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRH 208 Query: 242 Y-MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI-----KKIDNVNDTV 295 Y D + F L + + ++ A+ I K D Sbjct: 209 YVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKN- 267 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G + V + + + L T D G+T + MQTA + ++ Sbjct: 268 RIGLVTYASDVNTRSGLTDSLSGLKST-----VDDLKASGATRADLGMQTANTVLGNA-- 320 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFSVN 408 +A K ++ TDG+ T+ N + I K+ G + ++ Sbjct: 321 --------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTG 372 Query: 409 KTQQEKARYF-----------------------------LSNCASPNSFFEANSTHELNK 439 L A ++++ A S + Sbjct: 373 ANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLN 432 Query: 440 IFRDRIGNEIFERVI 454 + I +E+ Sbjct: 433 AVFNTIWSEVSSNPT 447 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 97.4 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 50/435 (11%), Positives = 103/435 (23%), Gaps = 29/435 (6%) Query: 44 VRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 + + + S + + + + + + Sbjct: 145 AEAEMRKSEIALVLDSTGSM-SRDSRMTNLKAAVDSVLASLLVSGENVWDARVGIVPFDT 203 Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + + E S ++S L + S + A Sbjct: 204 QVSIRGVPAAGIAGEFGETPYTYSCTGSSMTSDRCTSLGTTRDLMCSGAADVAACRANAR 263 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + S + + + + +++ N Sbjct: 264 IYAYTTSGGTYRVFVTSNYVQNNGYCWPNYYTYTYCGYTVVSRTDYYSTTTWTRTNNASN 323 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + Y + + S P+ + + S++ D L Sbjct: 324 STPWPSASYYGTPSYSYAQYNGTITATPTSAGGYGSGSTTTIKDNSTITANSDLLGVGTD 383 Query: 284 SIKK--IDNVNDTVRMGATFFNDRVISDPS---------FSWGVHKLIRTIVKTFAIDEN 332 S ID G + + V+ A Sbjct: 384 SWNGCVIDRKQPYDVSGQSPIASNTDTLYPAAKCATNNLLPVMGLTTDIAAVRAHAQKLT 443 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---------N 383 G+T I +Q + + + ++ KY++++TDGENTQ+ N Sbjct: 444 PAGNTNITIGVQWGMELLSPELPFNTAKPYSDKTNYKYMIVITDGENTQNRWSTSASTIN 503 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + C AK GI + TI L +CAS +F + Sbjct: 504 ARTLLACQAAKDLGITVYTIRVMEG------NSDMLKSCASRPEYFYDVTASSQLT--ST 555 Query: 444 RIGNEIFERVIRITK 458 + R+TK Sbjct: 556 LAKVFYSIQSTRLTK 570 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 30/338 (8%), Positives = 79/338 (23%), Gaps = 9/338 (2%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + +G+ I L+ ++ G V+ + + L+ AA A + ++ Sbjct: 21 FWRDVSGNTMIAFGLIAATLVAAAGAGVEFSQAQEQTNRLQDAADAAALRGALMAKDETA 80 Query: 74 EVSSRAKNSF-TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ I + T T V Sbjct: 81 AKAAADTVFTLNLTDSGIAPTSKGLKFEISGSHKTAVYTATAQIKTTFLKLAGIETLTVG 140 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH-KEHGVSIQWVIDFSRSMLD 191 +S + + + L S ++ + VS + V D ++ Sbjct: 141 ATSTAEAEMRKSEIALVLDSTGSMSRDSRMTNLKAAVDSVLASLLVSGENVWDARVGIVP 200 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS-------CNKSLYYML 244 + + + G + Y+ + ++ + + Sbjct: 201 FDTQVSIRGVPAAGIAGEFGETPYTYSCTGSSMTSDRCTSLGTTRDLMCSGAADVAACRA 260 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + S + + +V + + + + T N+ Sbjct: 261 NARIYAYTTSGGTYRVFVTSNYVQNNGYCWPNYYTYTYCGYTVVSRTDYYSTTTWTRTNN 320 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 S P S + + ++A Sbjct: 321 ASNSTPWPSASYYGTPSYSYAQYNGTITATPTSAGGYG 358 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 97.4 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 52/372 (13%), Positives = 100/372 (26%), Gaps = 65/372 (17%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 I +G+F I+TALLM +LG GM VD L AA A + Sbjct: 4 LPSGFISDRSGNFGIMTALLMVPLLGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIA 60 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + V++ + + F + + + +V + T Sbjct: 61 EKSGAVAAAMAMNSNGTVSLGKTDARNIFMSQMSGELAEVQVDLGIDVTKTANK------ 114 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 L+S+ + F++ +G S I T + Y + ++D + SM Sbjct: 115 ---LNSQVSFTATVPTTFMQILGRDSITISG-----TATAEYQTAAFMDFYILLDNTPSM 166 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + L A + + N Sbjct: 167 GVGATPDDVSKLEAKAGCAFACHQMDKTINNYTI-------------------------- 200 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV--- 306 + V + +VR A ++ + K +D RMG F + Sbjct: 201 ------------AKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDA 248 Query: 307 --ISDPSFSWGVHKLIRTIVKTFAIDENEMGS-----TAINDAMQTAYDTIISSNEDEVH 359 + + + K+ + T + AM I + + + Sbjct: 249 KLTTISGLTSDLTKVKNYTDAVDLMTIPYQNYNNDQITNFDSAMTQMNTIIDQAGDGTSN 308 Query: 360 RMKNNLEAKKYI 371 + + Sbjct: 309 ISAEKSCSSYPM 320 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 97.0 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 54/441 (12%), Positives = 127/441 (28%), Gaps = 45/441 (10%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + + +AL + ++LG+ G +D S L Sbjct: 18 HKKRHNAAV-SALTLAILLGLSGC-IDKQTESESRTEL-------------------ATQ 56 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ A Q++E ++ + + +++R + S L+ Sbjct: 57 ANEAAEQQAALVQRVEAERQAKIQHENELSAKSQDMRAEHMPYIAQYAASSSVAAPGLND 116 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + P ++ + T S +++ ++ + Sbjct: 117 DWQGAVLPERNQFEKQVQNGIMVAGEIPVSTFSIDVDTGSYTTLRRML-KEGRLPQKDTL 175 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + LN F + K+ + + + + M+ L + + Sbjct: 176 RVEEMLNYFSYNYPQPNKNEAPFSVTTELAPSPYNDDMMLLRIGLKGYEQSKAELGASNL 235 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D S K L++ AL + + + + D V+ V + G Sbjct: 236 VFLLDVSGSMASDDKLPLLQTALKMLTQQLDEQDKVSIVV---------YAGAAGVVLDG 286 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 I+ GST + +Q AY + + ++L T Sbjct: 287 AAGNDIKILTYALEQLTAGGSTNGAEGIQLAYQL--AQKHFVKGGINR-------VILAT 337 Query: 376 DGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 DG+ + + K GI + T+ F + + + + Sbjct: 338 DGDFNVGTTNLDELVDLVEVQKKHGIGLTTLGFGMGNYNDHLMEQLANK--GNGQYAYID 395 Query: 433 STHELNKIFRDRIGNEIFERV 453 S +E K+ +++G + V Sbjct: 396 SVNEARKVLVEQLGATLLTIV 416 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 97.0 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 42/461 (9%), Positives = 108/461 (23%), Gaps = 46/461 (9%) Query: 38 GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN 97 G +D+ + A + + + L + + + ++ Sbjct: 22 GSGIDLTSALNARSKMANALDASALKLAGKLSVAKLSDDEIQAGLEKMFTANLSRFDLKA 81 Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKS--------AYQVVLSSRYDLLLNPLSLFLR 149 + + D + + V + V ++SR L L L Sbjct: 82 SALSELEFEVDWTKGILDVWSDVSVKTHFIGLGGLGPEKLDVGVTSRVSFASQALELALV 141 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 S + E + D + + + Sbjct: 142 LDVTGSMDGDISSLKEASQLLFEALVP-ENAGRHDQRIRVSIVPYSQGVNLGAKAWKVTN 200 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY--MLYPGPLDPSLSEEHFVDSSSLRHV 267 R S + + G + Y +S ++ Sbjct: 201 RQSDSSNCVATRGGPNAFTDAYYNYRGARSNFFVAPGALDYFVIRRGSNVSWYPPRNNCP 260 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI---- 323 + + ++ +++ ++ ++ T + + +S + Sbjct: 261 ESEILPLTNSRKTLLAAVDALEAQGGTAGQAGIAWGWKALSWTWHPFWPSGSDPAKSFSS 320 Query: 324 ----------------------------VKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 V+ +++ G D + Sbjct: 321 QVGKAAVIMTDGDFNVHYTERFNAGCAPVEETDGEDSTHGGKRNRRDNDDDDDDDDDDKK 380 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 E + L + + +A+C K Q + I T+ F + Sbjct: 381 RERGGRCSGGTYSIEEYLPGATKRDAPATQALALCEAMKEQDVVIYTVYFETT--GAKFG 438 Query: 416 RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + +CAS P+ F+ A L F + + + + Sbjct: 439 KDLMKSCASDPDKFYLAEDRDGLKAAFSAIAIDNLSIYLAK 479 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 96.2 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 70/488 (14%), Positives = 143/488 (29%), Gaps = 110/488 (22%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 L++ G + A+L+PV+LG+ G+ +D L+ A Sbjct: 2 RALSALLRRQKGSMATLVAVLLPVVLGLVGLGIDSGMLYLSHSRLQAA-------VDAAA 54 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + ++ + + + +Y+ N+ T V TA P Sbjct: 55 LAGSLQLPYDPQLDKGLVRGAVTQYMDANYPEASLNGVTPGTEERSVTVTATATVPT--- 111 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 +F+ ++GI S + KA A + + + +VID S S Sbjct: 112 -----------------IFMNALGIGSSEVHAKATA--------GYNKLEVVFVIDNSGS 146 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M + + S + G V R + P V G Sbjct: 147 MKGTPIQQTNSAASQLVELIMPEGMMTSVKVGLVPFRGKVHLPAGVDGL-PDGCRNADGT 205 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 L+PS E + +S ++ ++ + Sbjct: 206 LNPSWLHEEYFKTSYRYPSGSSLNVPKNT-------------------------CTSIPR 240 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + ++ I K + T I++ ++ + + + + Sbjct: 241 VQGLTEDRETILTAISKQ--NGLGDASGTVISEGLKWGRHVLTPEAPFTEG--SSAKDIR 296 Query: 369 KYIVLLTDGENTQD------------------------------------NEEGIAICNK 392 K I++LTDG+ N + K Sbjct: 297 KVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARK 356 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFEANSTHELNKIFRDRIGN 447 K GI + I F + + AS + +++A S ++++ +F+ IG Sbjct: 357 VKEAGIEVFAIRFG---DSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKK-IGR 412 Query: 448 EIFERVIR 455 ++ R++R Sbjct: 413 QLGWRLLR 420 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 96.2 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 49/461 (10%), Positives = 111/461 (24%), Gaps = 72/461 (15%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +L+++ G I++ + VML G+ +D+VR ++ A++ A+ Sbjct: 20 MLHRLLRNEDGALIILSLQVFLVMLITTGIAIDLVRVEERRTLIQNTIDRAVLAAASLTQ 79 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + + + Sbjct: 80 KRDPTLVVKDYLTKAGLGYIASD-------------------SSFTPKVEGSIALGWRRV 120 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V + ++F +G+ S A + + + + Sbjct: 121 SVEVDDDMP------TIFGPLLGVSSLAATGDTTAMQAVGNVEISLVLDLSGSMTEYVKD 174 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQ---------NGKVGIRDEKLSPYMVSCNKSL 240 + F ++ G+ + S + ++ Sbjct: 175 NPSCTKNCTSSKTRFQYLQVAAKSFINTVFASSGSGVAAGRTSVSVVPYSTNVYLGSEMQ 234 Query: 241 YYMLYPGPLDPSLSE---EHFVDSSSLRHVIKKKH-----------LVRDALASVIRSIK 286 + S D + + S+ + Sbjct: 235 EGYTLSSDFSVTGSSFAMPQCADFVANDYNTMVIDGTGPLTRTMYGSSYKYSDSLSALVS 294 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 N+ + N S L F G T+I+ + Sbjct: 295 DGSTSNNPGQDWHNCMNTPQNRVIPLSSDPTFLAADKTG-FIDKLTAGGWTSIDVGAKWG 353 Query: 347 YDTIISSNEDE---------------VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + S DE + + + K +VL+TDG NT + Sbjct: 354 LALLDPSARDEVAKMTSVSSAFRETKPRPINYDGDTMKVLVLMTDGANTTNFSTLPGY-- 411 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + GI + T+ + A AS +++A Sbjct: 412 RTGPSGI-MSTLGVDSLGSFGPSATT-----ASGIYYYDAT 446 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/366 (9%), Positives = 90/366 (24%), Gaps = 13/366 (3%) Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 + ++L +D + + LSS L + F+ + W Sbjct: 285 YSDSLSALVSDGSTSNNPGQDWHNCMNTPQNRVIPLSSDPTFLAADKTGFIDKLTAGGWT 344 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 ++ + S R + ++ Sbjct: 345 SIDVGAKWGLALLDPSARDEVAKMTSVSSAFRETKPRPINYDGDTMKVLVLMTDGANTTN 404 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLY-YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + G R SL + + S+ + R Sbjct: 405 FSTLPGYRTGPSGIMSTLGVDSLGSFGPSATTASGIYYYDATRSSTPYYKYETGTWVARS 464 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A+ + + + + S + + ++ + Sbjct: 465 AITQTTQEQQVQTATCTQYKNSWGQWKWNSCSVGTSDCTLISSTTSLKTYTCKKTTTVNV 524 Query: 337 TAINDAMQTAYDTIISSNEDEVH--------RMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T +YD + + ++ + + ++ + + + Sbjct: 525 TTEAPLYDVSYDHLYKTKNWNLNTVAGLLGKPYGRSAGTQYELMANAVYDTSVKDARTKK 584 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 +C+ AKS+GI I ++A + L C+S S++ L+ F + Sbjct: 585 LCDLAKSKGIYIFSVA----ADAPSGGKTLLKYCSSGTSYYYEVQGSNLSTAFASIAASI 640 Query: 449 IFERVI 454 R+ Sbjct: 641 SSLRLT 646 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 95.8 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 39/390 (10%), Positives = 93/390 (23%), Gaps = 59/390 (15%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 KS G ++ +M+ G+ +D +R ++Q TA+++A+ Sbjct: 25 RKVFSNWRKSEKGSMTAFGIFIVAIMVTSAGLSIDFMRQERTRVQMQQNLDTAVLSAASL 84 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L E S +Y + + +T Sbjct: 85 LQTLGAEAVVTDYMSK---ANIDVDYNLSVNVSEGINFRAVDATATATLETLFLGLLNID 141 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + ++S + + L + L S ++ + + +S + Sbjct: 142 SLGITVTSGAEERIPNLEISLVLDVSGSMGSNSRLTNLKTAATQFVSTIISGGSGGTVAM 201 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S++ + + + + ++++ Sbjct: 202 SIIPFSSSVTPSQSVIDAITMEDNHDYSTCIEFADDDFSSSSLDLDSTYKRAVFT----- 256 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + D+ N R Sbjct: 257 ---------------------------SRYSDTGSGDFDDADDFNQDWR--------SCY 281 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS--------------- 352 D F + T++ GSTA + M+ + Sbjct: 282 MDEYFELLAYSDDETVLYNKIQGLLAQGSTAGHTGMKWGTSLLDPEFQAVTNSMIAAGVV 341 Query: 353 -SNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + ++ K IV ++DG N Sbjct: 342 DAAHAGMPVAYSDTNTMKIIVFMSDGNNHT 371 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 43/132 (32%), Gaps = 17/132 (12%) Query: 336 STAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKYIVLLTDGENT-----------QD 382 A +T Y+++ + + Y ++ G T Q Sbjct: 448 GNYYGVATETWYNSMTGMTFENLSWEEAWGLMSIEYYESVMGSGAATDWGSTSARTGSQS 507 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + A C AK +GI I TIAF A L+NCA+ ++ + + +F Sbjct: 508 DTLMSANCTAAKDRGITIFTIAFEAPSN----AETQLNNCATSDNHYYDAQGTSITSVFS 563 Query: 443 DRIGNEIFERVI 454 ++ Sbjct: 564 SIATTIQKLKLT 575 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 95.4 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 60/205 (29%), Gaps = 32/205 (15%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + ++ L I R+ F + ++ + + Sbjct: 98 DFTPNRLEKAKEVLDEFID-------KRGNDRLSLIVFGGDAYTKVPLTFDHNVIKEMTR 150 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K D TAI + A + + S K I+LLTDGEN Sbjct: 151 KLTVDDITSNTRTAIGMGIGVALNRLKDS-----------EAKSKVIILLTDGENNSGEM 199 Query: 385 EGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA--SPNSFFE 430 A + AK GI+I TI E L + A + +F Sbjct: 200 SPSAAADIAKELGIKIYTIGIGAKEIKVPSFFGYKTVKNTELDENMLKSIAETTGGEYFR 259 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A+ + E +IF E + R Sbjct: 260 ASDSKEFKEIFNKIDALEKTKIDGR 284 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 95.1 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 58/570 (10%), Positives = 140/570 (24%), Gaps = 128/570 (22%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G+ ++ L + ++ + G VD R L+Q A TA + + + + Sbjct: 8 FVSDHRGNVVMLFGLSVIPVMMMAGAAVDYARGVTTHKVLQQGADTAALAVASRITAATS 67 Query: 74 EVSSRAK------------NSFTFPKQKIEEYLIRNFENNLKKNFT------DREVRDIV 115 + + + T I + T + Sbjct: 68 TADAIKQAQNVLRSASQRLAAATISNATISADRKTFCIDAQVSIPTMIMKIARIDSMAPA 127 Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLN----PLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + E+ + Y++ L+ +N + + + + Sbjct: 128 VMSCAEIGGGSTNYEIALALDNSGSMNESAGGATKIQSLKTAATNFVNSMFAKSPGKVKI 187 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS-------------- 217 + +D + + + L+ + ++ Sbjct: 188 AITPFAGLVIPVDPTVAANRALPWIDVNGLSSQHWITFGGKANANAAGFTSRFNVFSNLK 247 Query: 218 ----QNGKVGIRDEKLSPYMVSCNKSLY--------YMLYPGPLDPSLSEEHFVD----- 260 G + + P V+ L P D S E ++++ Sbjct: 248 SQRADWDFGGCYEPQPYPMNVTETAPTAGNAETLFVPYLAPDEPDSSAYENNYLNDDGGG 307 Query: 261 -------SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR------VI 307 + K + + T + A+ Sbjct: 308 CSLWTFGGWTDLTRTCKYKPATGWTSGIWSWFGATAGNGWTGGVFASRGGAFNGPNGMCP 367 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-------------- 353 + + + ++ + G T +++ + A+ +I + Sbjct: 368 NAATQTALQLTPTQSTITAKIAQLTAAGDTNLHEGVMWAWRSISPNPPFSAGSAYNTAGV 427 Query: 354 -----------NEDEVHRMKNNLEAKKYIVLLT------------------------DGE 378 N + + + + DG Sbjct: 428 RKILVLMTDGYNNWTSNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDYQSQLDGA 487 Query: 379 NTQ-----------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-QEKARYFLSNCAS-P 425 +E C AK++GI I +IAFSV+ L +CA+ Sbjct: 488 ANSWTDYKSVSRQAQDELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAGINLLKSCATNA 547 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + A + ++++ F N R+ R Sbjct: 548 DHYLLATDSTQIDRAFSQIAMNLSKLRLSR 577 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 94.3 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 53/450 (11%), Positives = 127/450 (28%), Gaps = 29/450 (6%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ A + ++ G+ VD+ R + L+ A A + S+ L + K S Sbjct: 1 MVVAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQLPDDPD--VDNGKVSQ 58 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 + T D + ++ +++ N Sbjct: 59 AVTTNLAANDPEAKATDISSGGATRSVCVTAEADVDMTLSKVVGLDATTVTAEACAGYND 118 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + L + S A + + + ++ +Q + Sbjct: 119 IELVMVLDATGSMRGTPIANVKEAAANLVDLIMPDSGANTRSKIGLVPFQGKVRIDGNDP 178 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 D +N + D KL S ++ + S Sbjct: 179 VTAERDPDGVGAGCRNADGTLNDGKLKTEYSDTRS---RNSIFYGYTISGVSTYYDRTCS 235 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 I+ ++A+ I +I + T+ + +V+S + + + Sbjct: 236 GMSPIRALSSDKEAILDNIGAINAGAVTSGTLISEGIKWGHKVLSPKAPYTEGNTDKKVR 295 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK--NNLEAKKYIVLLT------ 375 + + + +A T+ + + N + Y L T Sbjct: 296 KIMIVLTDGDTEDGRCGGRYASASRTVNAYWTNAYFGQGLRPNSASSPYDTLSTASATLA 355 Query: 376 ---DGENTQD-NEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCAS---- 424 D + N+ + + AK+ + I I F + AS Sbjct: 356 QIPDCTDGGKLNQYVLDEADDAKNDADYPVEIFAIRFG---DSDATDISLMKRIASSKSG 412 Query: 425 -PNSFFEANSTHELNKIFRDRIGNEIFERV 453 + +++A + ++ +F+ IG ++ +R+ Sbjct: 413 TDDHYYDAPDSSDIKDMFKK-IGQQLGQRL 441 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 94.3 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 48/350 (13%), Positives = 98/350 (28%), Gaps = 41/350 (11%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + + G I++ L VML V G+ +D +R L+ +A++ Sbjct: 2 RISRTAGRFVTRDDGAITILSLFLFVVMLAVAGLGIDTMRHEMARTHLQATLDSAVL--- 58 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++ + I E + + N D E + A ++ Sbjct: 59 --------AGAGAPADATAADVKLIVEDYFDAADLSQYLNTIDPETDIVASLNAKSVS-- 108 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + L + +R G+ + A A + + I +D Sbjct: 109 -----------ASVELEMDTFLMRLSGVDTLTTAGGATAAIAAPR------MEIVLALDV 151 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S SM + F G + PY S S M Sbjct: 152 SGSMAGERLTKMKSAAKQFVTDVMSASDQ--------GTTTISIVPYSWSVTPSD-EMFE 202 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFFN 303 +D + +D R ++ S+ + + T ++ Sbjct: 203 ALSVDVRHNYSTCIDFLESDFEKTAIDPARSYGQTIYTSLTGSFGNIGIGDPTVTNTAYD 262 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +D F + T + GST+ + M+ A + + Sbjct: 263 RTCYTDEYFRILPYSNSVTALHNKIDSLKAAGSTSTHLGMKWAAGLLDPA 312 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 48/397 (12%), Positives = 101/397 (25%), Gaps = 34/397 (8%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S K + + + + D V + D + S Sbjct: 221 SDFEKTAIDPARSYGQTIYTSLTGSFGNIGIGDPTVTNTAYDRTCYTDEYFRILPYSNSV 280 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 L+ L++ G S + + + + +R+ D + Sbjct: 281 TA---LHNKIDSLKAAGSTSTHL-----GMKWAAGLLDPAFAPVVSSLQQTRTKTDSSGN 332 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + + S + + + + + M + + + Sbjct: 333 LVTYSIVDPSINNTPALYSTGQVLKVAIVMGDGANDWTYGLDDPNGLMNPDIVENHTQPD 392 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 DS+ R + + S +N + I++ + Sbjct: 393 YRGPDSNLYRVQYTD-DVFKYRYFVFNPSFIAYSEMNCNTGYWVCVYESEDITNYYL-YS 450 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + T + DA+ T S + Y Sbjct: 451 TYWNDYTDITNGVYLTPAQ-----FDALPTTLPNFESQERLSWEEAWGLMTPDFYSRTTF 505 Query: 376 DGENT--------------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 D ++ IC KS+GI + TIAF + + + A L N Sbjct: 506 DYAPDNMFSKNGTGSIAPETKDDRMADICGATKSKGIVVYTIAFEMGEF--DSAADRLEN 563 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 CAS S +T +++ F + +R+T+ Sbjct: 564 CASSPSQHFNATTLNISQAFGSIAA---NVQKLRLTQ 597 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 94.3 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 50/432 (11%), Positives = 108/432 (25%), Gaps = 85/432 (19%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G I+ A+L+PV+L + VDV L+ + Sbjct: 15 ISRRRGAVLILIAVLLPVILWMAAFCVDVAYMQLTRTELR---------IATDSAARAGA 65 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + EY +N Sbjct: 66 RTLSLEQDASLAHKSAIEYAAKNNVAGNTL------------------------------ 95 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + +V S + S ++ + Sbjct: 96 --------------------TLADSDVQIGLSVRTDDVGRFTFSSGGKLLNSVNVTGRRT 135 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + ++ +V + S+ + + + Sbjct: 136 QQAPDGAVRLYLTPIFGHEFFQPVADATASQIDRDIALVVDRSGSMTFRINRNSYESGWR 195 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 V S + + + D++ + + + + +N D + Sbjct: 196 NNDPVPSRA------RWWALVDSVDGFLTELGST---PQLELVSLSTYNSSAKIDEQLTD 246 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + GST I M T+ + A K +V++ Sbjct: 247 KYSRIEDALDDY--SRRYPDGSTNITAGMDRGISTLQNKK-------YARPYASKTMVVM 297 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 TDG + + A + A I + TI +S + + A + A Sbjct: 298 TDGNHNYGSSPTNAAYDAASDD-IVVHTITYSDGAN-----QSLMREVARIGGGQHWHAP 351 Query: 433 STHELNKIFRDR 444 EL +IFR+ Sbjct: 352 DGDELEEIFREI 363 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 94.3 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 58/539 (10%), Positives = 127/539 (23%), Gaps = 113/539 (20%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ AL + ++ + G +D+ ++ A A++ + + + RA+ + Sbjct: 1 MLWALFLIPLVALVGSGIDLGTRYVTRKQMQIACDAAVLAGRRAMTNGIVDDGVRAEATK 60 Query: 84 TFPKQ-----KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 F + + + T T++ V +S Sbjct: 61 FFNFNFQQGMFGSKPFTPSISSATTSKTTVVINAATTVPTSLMRIFGSDELPVSVSCNAS 120 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + S + + + +++ + ++ L Sbjct: 121 QDFVNTDIVFVLDTTGSMRDKATSSDSQTKIEALRSAVLALYDQLAPVQNQLAASGMRLR 180 Query: 199 QPLNCFGQPADRTVKSYSSQNGK----VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + ++ + + S Y + Sbjct: 181 YGVVPYASAVNIGAAIRAANPNYMLSGSWTYQSRQVVTENMTSWSCGYRSGSYDFGSGIC 240 Query: 255 EEHFV-----------------------DSSSLRHVIKKKHLVR-DALASV--------- 281 ++ + + V Sbjct: 241 TYFRYLPRAFDTSKYVSGASVDVAELVGTAAYNGVTPTRSSANKVTWAGCVEERQTARVS 300 Query: 282 --------------IRSIKKIDNVNDTVRMGATFFNDR------------VISDPSFSWG 315 I I D + + PS + Sbjct: 301 SNDTAIPPGATDLDIDLIPTNDATKWKPYWPEVEYASYQVEIYKNDPYKPQFACPSPAAS 360 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKYIVL 373 + R +K + G T ++ M S N + KKYI+ Sbjct: 361 MQGWSRDDLKKYLNTLTPDGGTYHDNGMMWGARWASSGGIFGGNNPEKYNMMPVKKYIIF 420 Query: 374 LTDGENTQDNEEGIA---------------------------------ICNKAKSQGIRI 400 +TDG + +C+KAKS G I Sbjct: 421 MTDGLFETGYSRLYSSYGVEQLDARATPGGAYSNQDDQLARHKQRFNLLCSKAKSMGYSI 480 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + F L+NCAS P+ +++ L F + IG I +R+T+ Sbjct: 481 WVLGF------ATTLDASLTNCASTPSQASTSSNQAALMARFVE-IGKNIGA--LRLTQ 530 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 93.9 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 73/212 (34%), Gaps = 38/212 (17%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S + + V+ ++ I K R+G F D ++ + + Sbjct: 102 TSNGDFVDRLTAVKQVVSDFIDQRKG-------DRLGLVLFGDHAYLQTPLTFDRNTVRE 154 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +T TAI + + A T I SN ++ I+LL+DG NT Sbjct: 155 QLDRTVLN--LVGQRTAIGEGLGLATKTFIESNA-----------PQRTIILLSDGANTA 201 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPN 426 E + AK +I T+ T ++ L+ A + Sbjct: 202 GVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGG 261 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +F A + EL +I++ E V + T+ Sbjct: 262 QYFRARNADELAEIYQTIDALE---PVTQATQ 290 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 93.5 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 53/444 (11%), Positives = 128/444 (28%), Gaps = 42/444 (9%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ ++ F++++L + + G+ G + + V Sbjct: 25 TLCRQSARNR---VFLLSSLTLAIFFGLSGCSDK-----------QSDGDK---SLVVHP 67 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + E E L + + + D + R +N A Sbjct: 68 LTEQAETVRLESEQSKVLAIYKSEELHQLKQVQSSERLIDASLAAKQRSAKHVINTHYVA 127 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + Y + + F + + ++ + T S +++ +I S Sbjct: 128 APIASDAWYGIKQPERNRFEKQI-QNGIMVAGEIPISTFSIDVDTGSYSTLRRMI-KEGS 185 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + + LN F K+ + + + + M+ L Sbjct: 186 LPEKGTIRIEEMLNYFTYDYPLPNKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYELTKS 245 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S K L++ AL + + + D V+ V + Sbjct: 246 ELGASNLVFLLDVSGSMASADKLPLLQTALKMLTQQLSAQDKVSIVV---------YAGA 296 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + GST + + AY + + Sbjct: 297 AGVVLDGASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL--AQKHFIQGGINR----- 349 Query: 369 KYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++L TDG+ I++ K K +GI + T+ F ++ + Sbjct: 350 --VILATDGDFNVGVTNFDLLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADK--GN 405 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 + ++ +E K+ D + + + Sbjct: 406 GHYAYIDTLNEARKVLVDELSSTL 429 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 93.1 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 76/183 (41%), Gaps = 21/183 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + ++ ++ +K+ N + R G+ FND V G+ + + Sbjct: 189 SRLEVAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPMPLKRGLKR-----IYNDI 243 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE--EG 386 GST M+ A++ + ++ + KK ++ LTDGEN N + Sbjct: 244 KKYRAFGSTNSYVGMKYAWEQL-------YGNPQDTKDRKKIVIFLTDGENMIINATRKT 296 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 I +CN K + I +IA +V+ + L C+S + + A+ L + + IG Sbjct: 297 IELCNDMKKKKAVIYSIALAVD------NKEVLQGCSSSGNVYAADDAQSLVQAY-SLIG 349 Query: 447 NEI 449 ++ Sbjct: 350 KDV 352 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 70/194 (36%), Gaps = 8/194 (4%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + ++ G I +A++ P+++ + ++ ++ + L+ A++ + Sbjct: 6 IRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALLDTVTMIKL 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 E + + +N + L+ + +V++++ DT++++ + Sbjct: 66 KNIEDVVK-------NVGPVNTIWTKNLKYELEHSDFSSDVQNVIDDTSMKLESDSNFKT 118 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 + +++ + + + K+ + + + + V+D S SM Sbjct: 119 LSITAISQYKMPFKICNIHLLCPKNKYVTVPVLSSMKIGRNEG-SDIDLMIVLDVSSSMD 177 Query: 191 DYQRDSEGQPLNCF 204 D E P + Sbjct: 178 DNFMKPEEAPCSRL 191 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 93.1 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 51/444 (11%), Positives = 126/444 (28%), Gaps = 42/444 (9%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ ++ F++++L++ + G+ G + + V Sbjct: 25 TLCRQSARNR---VFLLSSLILAIFFGLSGCSDK-----------QSDGDK---SLVVHP 67 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + E E L + + + D + R +N A Sbjct: 68 LTEQAETVRLESEQSKALAIHKSEELHQLKQVQSSERLIDASLAAKQRSAKHVINTHYVA 127 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + Y + + F + + ++ + T S +++ +I S Sbjct: 128 APIASDAWYGIKQPERNRFEKQI-QNGIMVAGEIPISTFSIDVDTGSYSTLRRMI-KEGS 185 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + + LN F K+ + + + + M+ L Sbjct: 186 LPEKGTIRIEEMLNYFTYDYPLPNKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYELTKS 245 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S K L++ AL + + + D V+ V + Sbjct: 246 ELGASNLVFLLDVSGSMASADKLPLLQTALKMLTQQLSAQDKVSIVV---------YAGA 296 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + GST + + AY + + Sbjct: 297 AGVVLDGASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL--AQKHFIQGGINR----- 349 Query: 369 KYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++L TDG+ + + K +GI + T+ F + + Sbjct: 350 --VILATDGDFNVGVTNFDQLISLIEKEKQRGIGLTTLGFGMGNYNDQLMEQLADK--GN 405 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 + ++ +E K+ D + + + Sbjct: 406 GHYAYIDTLNEARKVLVDELSSTL 429 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 93.1 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 61/205 (29%), Gaps = 32/205 (15%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + ++ L+ I R+ F + ++ + + Sbjct: 98 DFTPNRLEKAKEVLSEFID-------KRTDDRLALIVFGGDAYTKVPLTFDHNVIKEMTG 150 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K D TAI + A + + S K I+LLTDGEN Sbjct: 151 KLTVDDITSNTRTAIGMGIGVALNRLKDS-----------EAKSKVIILLTDGENNSGEM 199 Query: 385 EGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA--SPNSFFE 430 A + AK GI+I TI E L + A + +F Sbjct: 200 SPSAAADIAKELGIKIYTIGIGAKEIKVPSFFGYTTVKNTELDENMLKSIAETTGGEYFR 259 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A+ + E +IF E + R Sbjct: 260 ASDSKEFKEIFNKIDALEKTQIDGR 284 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 92.7 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 54/310 (17%), Positives = 94/310 (30%), Gaps = 35/310 (11%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 L +F GI S + + A + S DF + Y + Sbjct: 417 SLKDEEVFSEIYGIVSTPVDIEVYAPFKEATVFIPIDTSKIPNQDFQNVKMFYLDEDLMT 476 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 + Q D K ++ P + ++ + V Sbjct: 477 FVPLDEQGVDPINKVVWAKTDHFTTFVLFYIP-TWKAIWEVPINKGEREVNQQIKYIDLV 535 Query: 260 DSSSLRHVIKKKHLV---RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + + A S + ++ + D R F+D + Sbjct: 536 FVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------RAAVVDFDDYGYLLQPLTTDF 589 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 VK + G T I + ++ A +IS + D+ K I+LLTD Sbjct: 590 -----QTVKNAIDRIDSWGGTNIAEGIRIANHQLISQSSDDRI---------KVIILLTD 635 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST 434 GE DN +AK+ GI I TI + + L N A + +F +S Sbjct: 636 GEGYYDNNLT----TEAKNNGITIYTIGLGTSVDEN-----LLRNIATQTGGMYFPVSSA 686 Query: 435 HELNKIFRDR 444 +L ++F+ Sbjct: 687 SQLPQVFKRI 696 >gi|146340337|ref|YP_001205385.1| hypothetical protein BRADO3364 [Bradyrhizobium sp. ORS278] gi|146193143|emb|CAL77155.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 470 Score = 92.7 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 62/490 (12%), Positives = 138/490 (28%), Gaps = 84/490 (17%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + + +I AL+M + + GM +D + + L AA A I A P + S+ Sbjct: 8 RFARDRCANVAVIFALMMVPTIYLLGMALDYTQALRKQGQLDAAADAAAIAAVRPAMLSV 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S N + V + + + Sbjct: 68 TDTSVVK-----------------NTAAAVFATKAAMNGLTAVPTPDITVTDSGLQRTIQ 110 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 +S + ++ F +G SW ++ A A+ S ++ ++D S SM Sbjct: 111 VS----YVARSINNFPSVLGSPSWQVKGSATAQASSAPN-----MNFYLLLDDSPSMAIA 161 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ A + + S+N + P + S S S Sbjct: 162 ATQTDI-----DNLIAATSKQPGGSKNCGFACHEV--HPNLDSGASSSTVDNLSIARSKS 214 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 ++ + ++++ ++ + A S N T + F+ + + + Sbjct: 215 ITLRIDLVVNAVKQLLVGPYTCPQAGISGGVMQCMSAINNTTYKAAIYTFDYNLNTIQTL 274 Query: 313 SWGVHKLIRTIVKTFAIDE------NEMGSTAINDAMQTAYDTIISSNEDE-VHRMKNNL 365 + + + + +T + A + + D ++ Sbjct: 275 TSPSSAGTKISNIQLLTVDHQNCVTTAICNTDFGTDISGALGGVNAIMPDPGTGTNQSGD 334 Query: 366 EAKKYIVLLTDGENTQ-------------------------DNEEGIAICNKAKSQGIRI 400 ++ + L+TDG + A+C+ K + IRI Sbjct: 335 TPQEVVFLVTDGVEDKLIASSSGCDPKATYPLPAAGSQVRCQQPLNTAVCDTIKKRNIRI 394 Query: 401 MTIA------------------FSVNKTQQEKARYFLSNCASPNSFF-EANSTHELNKIF 441 + F+ + L CAS FF + +++ Sbjct: 395 AILYTEYLQLTTDNWYNSRIAQFNSPSSLSGTIAQRLQACASSPDFFATVQTGGDISDAL 454 Query: 442 RDRIGNEIFE 451 Sbjct: 455 TKLFLRVASS 464 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 92.7 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 48/430 (11%), Positives = 113/430 (26%), Gaps = 25/430 (5%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A+LM VM+G + VDV + L+++ +A + + LI+ E + Sbjct: 19 RRGVIVVLAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEG--EDVA 76 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + E N N + ++ QVV + + Sbjct: 77 TGTVHEYLTHNPVGLQWKEFTEGNTADNVDKFLTKYGDGLQLTIGEWNDTSGQVVAAEKN 136 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 ++ + +M + + + + S I V+D S SM D + Sbjct: 137 PTTVSVRMTY-ENMPFFFGHLLGRDSFDITAESIATYQSRDIMLVLDLSGSMNDDSEFNS 195 Query: 198 GQPLNCFGQPADRTVKSYSSQN--GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 L ++ + D + + +S + Sbjct: 196 IGKLGFDHIYSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSSVTYRGNSVV 255 Query: 256 EHFVDSSSLRHVIKKKHLVRDAL-ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + V + S + + V + + ++ SF + Sbjct: 256 VKSDKTIKQISVKTSNGSTYNYYPGSSLNYTANPNKEIRYVWVTSGKNSNNSDQVQSFDF 315 Query: 315 GVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + ND + + + L Sbjct: 316 DGQRINTIKTALGLDNLAYPYPGGSWND---YVNYCLGTGQNNNAGYRYRFGYFNWINYL 372 Query: 374 L--TDGENTQDNEEGIAI--CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 L N+ + + K+ + + I F +++ + Sbjct: 373 LERQYSSNSTPDLWKASAQPITAVKNS-VDLF-IHFMQEGDGRDRIGLA---------VY 421 Query: 430 EANSTHELNK 439 A + L + Sbjct: 422 NAPNGDGLLE 431 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/387 (10%), Positives = 94/387 (24%), Gaps = 41/387 (10%) Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 ++ + + N + + + + + R ++ V Sbjct: 205 YSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSSVTYRGNSVVVKSDKTIK- 263 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 ++ + + Y D +S + Sbjct: 264 QISVKTSNGSTYNYYPGSSLNYTANPNKEIRYVWVTSGKNSNNSDQVQSFDFDGQRINTI 323 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 S + + + Y+ L+ S Sbjct: 324 KTALGLDNLAYPYPGGSWNDYVNYCLGTGQNNNAGYRYRFGYFNWINYLLERQYSSNSTP 383 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + V++++ I +++ D + N + + + + + + Sbjct: 384 DL--WKASAQPITAVKNSVDLFIHFMQEGDGRDRIGLAVYNAPNGDGLLESTLTENLPFI 441 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + A + T I M + + + K +VLLTDG+ Sbjct: 442 MTQSRQRQA--GHYHNYTNIGGGMTVGREELQTRGRKGA---------VKMMVLLTDGQA 490 Query: 380 TQDNEEGIAICNKAK-----------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPN 426 N AK QG I+TI+ + + A + Sbjct: 491 NWVNGGVNNN--AAKNYVLNEAYLCADQGFTIITISLGAGAD-----KALMDQVAEITGG 543 Query: 427 SFFEANST-------HELNKIFRDRIG 446 F +L +IFR G Sbjct: 544 VHFNVPGGQTVDEYSEDLTEIFRQVAG 570 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 92.4 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 65/521 (12%), Positives = 139/521 (26%), Gaps = 79/521 (15%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + ++ G I L+M V+ + G++VDV + L+ ++ A + + Sbjct: 3 RLNKLFCLFLRDQRGQVLPIAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALAGA 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + AK + + + + ++ + Sbjct: 63 EIIPTATTAAAVYAKATAYSSTTGAANVYKNMTNITMVSGYPILKCLSTMQTQGISCVGP 122 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 S + + + + L F R +G S I + A S ++ V+D Sbjct: 123 LSYNSIQVMQQAVVPL----YFARIIGRSSMTISATSTAAKGGASSR---PYNVALVLDT 175 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + S + Y D + C Q + V S + + Sbjct: 176 TYSEISYDSDCGNSQMLCTLQGVQILLNQLDPCGTSVTTCSVTSGQATNSVVRVGIFTFP 235 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 S+ ++ V + S ++ + Sbjct: 236 QMVTSTVSSDYDCSSNTPANTVYTFPIP---GAGTYAPSSSTYRVLDFQSDYRVGDTSTS 292 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY-DTIISSNEDEVHRMKNN 364 + + + V A ++ A + T Y TI ++ +H Sbjct: 293 LNQASNLTKAVGGFSGCTGIYPATSASQSNFQATSGQYGTYYPSTIYAAQSSLIHEQTLF 352 Query: 365 LEAKKYIVLLTDGENTQDNEEG-------------------------------------- 386 +++ ++++ DG T Sbjct: 353 PDSQNVMIIIGDGNATAPQTNNGYPVMSTTASVSATPGASTLAGTSSGLYPSWNGECGQA 412 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------LSNCASPNSFFE 430 I N A SQG R+ T+A+ + ++ AS + +F Sbjct: 413 ITAANFATSQGTRVYTVAYGSPSAGCASDQSGQGKIPGLYPNVLPCNEMAQMASASYYFF 472 Query: 431 --------------ANSTHELNKIFRDRIGNEIFERVIRIT 457 A EL+ IF G+ R+I + Sbjct: 473 SDFKQSGSGSVCTAAQVMVELSDIFLQIAGDLTVARLIPNS 513 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 92.4 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 39/374 (10%), Positives = 90/374 (24%), Gaps = 53/374 (14%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + + +M+ GG+ VD++R+ ++ +A++ ++ S Sbjct: 1 MALVFFLIMIAAGGIAVDMMRYEMKRAQIQSTLDSAVLASAGAPYGSDHRAIIEDYFRVA 60 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + + + D+ DT + + S + L Sbjct: 61 NMTDYLAA-EKEGEIVVTVNSASVTANADMTMDTYLMKLSGIKELRTTGGSTAVRKVPKL 119 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + L S +K + ++ + S++ + Sbjct: 120 EVVLVLDVSGSMGSNSKLVNLKKAAKEFVTSLLNGSEPGNTVISIVPFSWSVSPSVATFE 179 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 DR + + K S S + Sbjct: 180 ALAVDRKHEFSTCIRFKANDHSHASLATGNSGFSSGQPLD-------------------- 219 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 ++ AL ++ +D A + + T + Sbjct: 220 -------QMIYTALYGNFDEFSGSESSSDYRSCYANDY---------MEILPFSVSETEL 263 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIIS----------------SNEDEVHRMKNNLEAK 368 G+T+ N M + S++ + E Sbjct: 264 HAKIDSLQASGNTSGNQGMIWGAALLDPSFRQITDDLIAAGEVASSQAAIPSNYGTAETL 323 Query: 369 KYIVLLTDGENTQD 382 K V++ DG+NT Sbjct: 324 KVAVVMGDGQNTTS 337 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/287 (9%), Positives = 69/287 (24%), Gaps = 23/287 (8%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S + + P +S + V+ Sbjct: 275 NTSGNQGMIWGAALLDPSFRQITDDLIAAGEVASSQAAIPSNYGTAETLKVAVVMGDGQN 334 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 K+ + + A + D + + Sbjct: 335 TTSYFFSNGGQWRGQNSDLYEVKSQKR--VFKYAYRKNKKDKISYDQSKCSNNSWECVYE 392 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + E ++ + D +S + Sbjct: 393 SSGEIESA----FYLHDNYGDNRYYNTEEGEYLSSSDWDDLQDSDEFVSMRRLDWEEAWG 448 Query: 364 NLEAKKYIVLLTDGEN------------TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + Y + D ++ + A C K++G+ + +I F +++ Sbjct: 449 YMSPYYYYQVTGDPNAYYDYYYYDRLDGSEKDTRMKASCTATKNEGVVVFSIGFEIDQGG 508 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L NCAS + + +N F N + +R+T+ Sbjct: 509 TA--EQVLKNCASSENHYFRAEGININDAFSAIASNVVN---LRLTQ 550 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 92.4 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 58/179 (32%), Gaps = 28/179 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 K +V+ + + +G F ++ L + + Sbjct: 97 KSKFEVVKSMAQNFF-------HKRFDDNIGIVIFGSFAYIAAPLTYDTKALD--FLINY 147 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +TAI + + + + + +K ++L+TDG + + Sbjct: 148 LEPSIAGNNTAIGEGLWQGIKALQA-----------DTAKQKVLILITDGHHNSGSISPR 196 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 KAK GI+I TI + ++ L A S FF A S +L IF + Sbjct: 197 QAVEKAKKLGIKIYTIGLG------DADKHLLEQIAKESGGKFFYAKSEEDLQSIFSEL 249 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 92.4 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 26/316 (8%), Positives = 79/316 (25%), Gaps = 4/316 (1%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ +K LI G+ F++TA + ++G G VD+ R + L+QA ++ Sbjct: 7 SRLCAGTKSLISDQRGNAFMLTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLAG 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN- 123 + + + +++A+ + F E + +V Sbjct: 67 RRAMGGASYDEAAQAEANKMFNFNFPEAKYGATGILFSSRALNASDVEGQASAVLPTELM 126 Query: 124 --PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 K +++ L ++ + + L S ++ + Sbjct: 127 FMFGKEEFRLSADCTAKLEISNVDVMLVLDVTGSMAQTNAGDSVNRITALKDATMDFFDT 186 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + + + + + + + Sbjct: 187 LTNADVGDGRLRFGVVPYSSTANVGQILLAKNPAWLADTVTLPSRTPIFREVYTETGTET 246 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 Y S + + + + D + Sbjct: 247 SDDYTDSPTTYSSNWTNDGTVPASNSAAC-AALTPPANTTPSPSGSPDYNQTGQYVDGDT 305 Query: 302 FNDRVISDPSFSWGVH 317 + ++++ + Sbjct: 306 RVTTYDTVQTYTFRTY 321 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 68/230 (29%), Gaps = 41/230 (17%) Query: 260 DSSSLRHVIKKKHLVRDA-LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + ++ ++ + + + + + Sbjct: 447 QYAPSSSDPMTVNVFNSDGTRNITSDTSSNGRWQRYSKYWGSGWGVCPAAAMKLTTMTAS 506 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK--NNLEAKKYIVLLTD 376 R T+ +G T + M + + N+ ++IV +TD Sbjct: 507 -DRATFNTYVQSLQPLGGTYHDAGMVWGARLLSPTGLFADENATAPNDRPISRHIVFMTD 565 Query: 377 GENTQD----------------------------NEEGIAICNKAKSQGIRIMTIAFSVN 408 G + N +C A+ +GI I ++F V Sbjct: 566 GAMAPNMGNLTFQGYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVG 625 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L+NCAS FEA++ ELN+ F I +I + +R+ + Sbjct: 626 SN------DSLNNCASSGQAFEADNAAELNEQF-QAIARQISK--LRLAQ 666 >gi|163801668|ref|ZP_02195566.1| hypothetical protein 1103602000597_AND4_09447 [Vibrio sp. AND4] gi|159174585|gb|EDP59387.1| hypothetical protein AND4_09447 [Vibrio sp. AND4] Length = 524 Score = 92.0 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 39/343 (11%), Positives = 100/343 (29%), Gaps = 22/343 (6%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + Y + + G I ALL+ ++G+ V+ R+ L+ +A+ A + + Sbjct: 2 RTHSYQNRSLHKQKGVAAIWMALLLVPIMGITFWAVEGTRYIQESSRLRDSAEAAALAVT 61 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + + + + N + ++ ++ E Sbjct: 62 IEDKPG-----AASVMAENYVRSYVRDIKSINVQAERRE----------PGNSRNEEAAD 106 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 Y V ++ +D + F+ S ++ I ++ A S + I +V DF Sbjct: 107 FIQYTVNATTTHDSWFA--NSFIPSFD-ETQDIAGRSLARK-YLSSVGGKNIDIVFVSDF 162 Query: 186 SRSMLDYQ-RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S SM + + ++ + Q+ + + +L P Sbjct: 163 SGSMNFDWMDPNGNKKIDDLKTAIRAISNKFICQDVRNEFVEGELKPVCHDQEDGYTADK 222 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + + +D + I S+ D + Sbjct: 223 LKNRVGFVPFNSRTREKRGSSVYATSQLKYKDNYKTDISSVSYKGVDWDYWTRFRSEEIK 282 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ++ +F + K T + + + Sbjct: 283 NCATNSNFCEAPRQERHLEAKRAIDVLEYH--TLPSYGIDWGF 323 >gi|323138937|ref|ZP_08073998.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] gi|322395783|gb|EFX98323.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] Length = 482 Score = 91.6 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 53/521 (10%), Positives = 128/521 (24%), Gaps = 124/521 (23%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G II L + ++ G+ D + L +A A + + Sbjct: 10 FTRDDKGGVAIIMGLAVIPLVLASGLAADYAIVQAAKSRLDASA---DAAALAAIKTAQT 66 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++ + + Q I + + ++ + V++ + + Sbjct: 67 TIAELSATNPNPRPQAIAAAMSQAEKSFYAQAGKRAADLLGKPAIDVQIKGQ------EV 120 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 ++ S F R G+K A A+ + + ++D S SM Sbjct: 121 TANVAYSAAMPSNFGRIAGVKLMNYNGGAGAQLTMAKF-----LDFYLLLDVSGSMGLPS 175 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + L L Sbjct: 176 TPAGEAALAAKNPD-------------------------------DLAQYPTGCRFACHF 204 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + S ++ + V A+A ++ K + R+G F + + Sbjct: 205 AGSQGYNVSRANNIQLRIDAVGAAVAQLMEKAKDTATLPKQYRVGVYPFVTHANAFVDLT 264 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN--------- 364 + + V++ + +T + D + + + + ++ N Sbjct: 265 DNLRG-DQYSVESAINYDPATRTTDFGRLLDAGKDWVFARDLNPNYKANPNIPADVTPMG 323 Query: 365 --------------------------LEAKKYIVLLTDGENTQD---------------- 382 + ++ ++DG Sbjct: 324 AGGSHIHNIFQDINAKIPSVGDGSGASSPQPFVFFVSDGMQNSQSFVSATGTWPGVTPYP 383 Query: 383 ---------NEEGIAICNKAKSQGIRIMTIAFSVN----------------KTQQEKARY 417 +CN K++GI + + Sbjct: 384 TPPGQTVSIRAMDPTLCNVLKARGITVSVLEIPYPTFTNPKPFAAAQEFKANDAVPNLSG 443 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + CASPN +F A++ + + + R+T+ Sbjct: 444 AMRACASPNFYFMADTPEGIADAMKKMFEQAVQS--ARLTQ 482 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 91.6 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 67/567 (11%), Positives = 125/567 (22%), Gaps = 128/567 (22%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I+ A+ ML + M V++ R + L+ A A I A+ + + Sbjct: 5 KRKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGT 64 Query: 77 SRAKNSFT--------FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV------EM 122 + F L + T + I V Sbjct: 65 CTGSCASDTTAIFWANFSSAHQANGLGPFQAVSTGPVITPQNASTITIQANVQLPLLFTK 124 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL------------------IQTKAEA 164 S + ++ + L L S A Sbjct: 125 ILGVSQIALSEHAQAVRSNMGMELALVLDNTDSLEAQGIEDLQCGAKILVDTVYGVAAPG 184 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD-----SEGQPLNCFGQPADRTVKSYSSQN 219 + ++ F+ M + + + + + N Sbjct: 185 SCGGGTGADTVPNLWVSIVPFAGEMNIFGSTYGGPSNWQSMPSGWLTAGSDISTTRYGSN 244 Query: 220 GKVGIRDEKLSPYM------VSCNKSL------YYMLYPGPLDPSLSEEHFVDSSSLRHV 267 G +G + S Y N + +P F +S + Sbjct: 245 GWMGCVMARYSGYNNSPAHIYDVNDANPIQAPFTPFYWPSTYHKYSQSSWFGGTSWVVGD 304 Query: 268 IKKK-----HLVRDALASVIRSIK--------KIDNVNDTVRMGATFFNDRVISDPSFSW 314 A S+ + + + G + Sbjct: 305 NDWILSGGVVTPSSAARSLYGQMAESPLITSFPTTSGSLVTESGLQVGPNLGCDPSPTLP 364 Query: 315 GVHKLIRTIVKTFAIDENEMGSTA--------------------INDAMQTAYD------ 348 ++ G T N A+ AY+ Sbjct: 365 ETASRSVVEAHISSMPMMSRGGTMLPQALQAGWFTISPNWQGFWPNPALPLAYNTPNMTK 424 Query: 349 -----------------TIISSNEDEVHRMKNNLEAKKYIVLLTD--------------G 377 + + + Y LL D G Sbjct: 425 VLVLMTDGNNQICPCFPVYNYYGPVAPPQSNGDTDMVAYGRLLQDELGVVSSYNGNGYYG 484 Query: 378 ENTQD-------NEEGIAICNKAKSQGIRIMTI-AFSVNKTQQEKARYFLSNCAS-PNSF 428 N N +C+ K+ GI I I + + L NCAS P ++ Sbjct: 485 SNGFSSNILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNY 544 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIR 455 ++A + + + F D G R+ + Sbjct: 545 YDAPTAASMKQAFSDLGGQLSALRISQ 571 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 91.2 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 61/197 (30%), Gaps = 36/197 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + V++ + I R+G F D + + + + Sbjct: 108 DYIDRLSAVKNVVTQFIEQ-------RQGDRLGLVLFADHAYLQTPLTADRQTVANQLNQ 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T TAI D + A T + S ++ ++LL+DG NT + Sbjct: 161 TIIGLI--GQKTAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLD 207 Query: 386 GIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFE 430 I N AK G++I TI T + L+ A + +F Sbjct: 208 PIEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFR 267 Query: 431 ANSTHELNKIFRDRIGN 447 A EL + I Sbjct: 268 ARDAQEL-QAIYQAINQ 283 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 91.2 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 53/374 (14%), Positives = 112/374 (29%), Gaps = 65/374 (17%) Query: 147 FLRSMGIKSWLIQTKAEA-ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 F MG + + V+ + + + D S +++ ++ E + Sbjct: 282 FSAPMGFGILPKSSTTAGCDWVTNPGVVSNYELFKRLRDNSGNVVRWKGCVEARANAVEV 341 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 A+ + + + P +P D S + Sbjct: 342 AAANAAFGRSYTTGVDYDVTETP--PVAGDPASLFVPYFWPDEPDYLPSTGAYAAPGPYV 399 Query: 266 HVIKKK----------HLVRDALASVIRSIKKIDNVNDTVRMGAT------FFNDRVISD 309 + ++ + +A+ + + I + + R + Sbjct: 400 AGSGRFHNNYLADFTVPILWNWVANPWNNGQSILKYDGSTRAAIISETSPETYGPNASCP 459 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + L +++ G T I++ + A+ T+ + + KK Sbjct: 460 EPLTRLTNNLSTVTAAIDSMNYWLNGGTVISEGLMWAWRTLSPQKPYADGAAYTDKKTKK 519 Query: 370 YIVLLTDGENTQDNE----------------------------------------EGIAI 389 IVL+TDG N + Sbjct: 520 VIVLMTDGVNGLADNGNAASANISDYSAYGYMGASRLSVADGVTTYAGLQTFLDDRLKKA 579 Query: 390 CNKAKSQGIRIMTIAFS----VNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDR 444 C+ AK++GI I T+ F+ ++ T+Q ++ LS CAS + F A + LN F Sbjct: 580 CDNAKAKGISIYTVMFNHNGFLSATEQARSATLLSYCASKPEYAFLATDSAALNSAFGQ- 638 Query: 445 IGNEIFERVIRITK 458 I + +R+T+ Sbjct: 639 IASSAAASPLRLTR 652 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 49/195 (25%), Gaps = 4/195 (2%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 I L M + + G +D R + +L++AA + A + AKN+ Sbjct: 2 IAFGLAMIPVTFMAGAAIDYGRATLLRSSLQKAADAGALAAGARTSLTQLAREQIAKNAV 61 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL---L 140 L + T++ R VV++S + Sbjct: 62 LANLGAKARNLSLTITETEPSAGVFQVQIQASIATSIMKVARFDTIPVVVTSEARVVGGS 121 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV-IDFSRSMLDYQRDSEGQ 199 NP+ + L S A + S V + + Sbjct: 122 TNPIEIALALDNTGSMRDDMPALKQAAKTLAQNVMSGSGGNVKVSVVPYVAAVNPGLTDL 181 Query: 200 PLNCFGQPADRTVKS 214 A T Sbjct: 182 TQVDTAAVAPATGNW 196 >gi|149188854|ref|ZP_01867144.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] gi|148837274|gb|EDL54221.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] Length = 505 Score = 90.8 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 59/527 (11%), Positives = 130/527 (24%), Gaps = 127/527 (24%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 +G V+ R+ L+ A++ A + ++ + + IE Y Sbjct: 1 MGFTFWAVEGTRYVQETSRLRDASEAAALAVTIEDQP---------TLANNLATKYIENY 51 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + L + Q ++++ S F+ S G Sbjct: 52 VRDIKSTALTAQRFHQAENQ--------NAGVLEYIQYTVNAKTTHDSWFASSFIPSFG- 102 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF--------- 204 + ++ A Y ++ + I +V DFS SM + + ++ Sbjct: 103 DQQDLAGRSLARK-YPVYLGDNNIDIVFVSDFSGSMNSRWGTNRNRKIDDLKTAINQISD 161 Query: 205 -------------------------------------GQPADRTVKSYSSQNGKVGIRDE 227 P + + S Sbjct: 162 KILCTSIRRDQVNGNVEDVCDEQGQDSTANKLLNRVGFVPFNIRTREVISSQNARSTSQL 221 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEH---------------FVDSSSLRHVIKKKH 272 + + Y P S + ++ Sbjct: 222 SYRNDTHANRSTYTYNQVPWDAWRGESWNQVSSCANNPGNCKPVRNWSWNRRQNCYYNAR 281 Query: 273 LVRDALASVIRSIKKIDNVND-TVRMGATFFNDRVISDPSFSWGVHKLIRTI-------- 323 ++AL + + + D ++ + D + + L Sbjct: 282 CPQEALNNQQYAKRINDVLSRSYYYPDDYNYVDFNQTVSTMFSDKSGLSSNYYRINGVQL 341 Query: 324 ----------------------VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 GSTA + + + + + Sbjct: 342 FAGFGDQSSTQFHNIALTNKLSDLNAISPMWANGSTAAFQGILRGAQILKQGDPNSSDQD 401 Query: 362 KNNLEAKKY--IVLLTDGENTQDNEEG-----IAICNKAKSQ--GIRIMTIAFSVNKTQQ 412 K KK +++L+DG+ + +N +CNKA+ Q G+ I I Q Sbjct: 402 KQQAYTKKIKMLLILSDGQESPNNGILRGLVNAGMCNKARQQIPGLYIGVIGI----NFQ 457 Query: 413 EKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEI-FERVIRI 456 + +C + + ++ EL K + I + ++ Sbjct: 458 ASQQSGFQDCVVDPNEDIIDVSNLDELIKKIEELIRKGSKTSGITKL 504 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 90.8 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 68/200 (34%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + V+ ++ I K R+G F D ++ + + Sbjct: 99 DMKTDSGFVDRLTAVKRVVSDFIEKRKG-------DRLGLVLFGDHAYLQTPLTFDRNTV 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +T TAI + + A T I SN ++ I+LL+DG N Sbjct: 152 QEQLNRTVL--GLVGQRTAIGEGLGLATKTFIESNA-----------PQRTIILLSDGAN 198 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + I AK +I T+ T ++ L+ A + Sbjct: 199 TAGVLDPIEAAQLAKDNNAKIYTVGIGAGEMQVRGFFGNQTVNTARDLDEDTLTKIATMT 258 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A + EL +I++ Sbjct: 259 GGQYFRARNADELAEIYQTI 278 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 90.8 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 64/204 (31%), Gaps = 35/204 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + L+ I+ R+G F + S+ + Sbjct: 104 DMALNGQPANRLEAAKSVLSDFIQE-------RRGDRIGIIVFGSKAYLQAPLSFDTKTI 156 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + TAI DA+ + D KK ++L+TDG N Sbjct: 157 NQLVQEAQI--GFAGEQTAIGDAIGLGIKRLEDKPSD-----------KKVLILMTDGAN 203 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCA--S 424 T + A SQ ++I TI + + L N A + Sbjct: 204 TAGRVQPQQAATFAASQNVKIHTIGIGADSMIVQSFFGPKAINPSSDLDETLLKNIAAQT 263 Query: 425 PNSFFEANSTHELNKIFRDRIGNE 448 +F A ST +L I++ E Sbjct: 264 GGEYFRAKSTEDLQAIYQTLDALE 287 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 90.8 bits (223), Expect = 5e-16, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 61/194 (31%), Gaps = 35/194 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + +V+ + I+ V R+G F + ++ + + + + Sbjct: 116 NQATRLDVVKSVVTDFIQ-------VRQGDRLGLILFGAQPYIQAPLTYDLVTVGELLNE 168 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 +TAI DA+ + D + +VLLTDG NT Sbjct: 169 ATL--GIAGNATAIGDAIGLGIKRLRERPADS-----------RVLVLLTDGANTGGEVS 215 Query: 386 GIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFE 430 A GI+I T+ + + L + A + +F Sbjct: 216 PEQAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIADETDGQYFR 275 Query: 431 ANSTHELNKIFRDR 444 A +T EL I+ Sbjct: 276 ARNTGELELIYESI 289 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 76/249 (30%), Gaps = 42/249 (16%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R + L PY + ML +E+ V + + +++ + + Sbjct: 81 RPQWLEPYTENRTAGYDLMLAVDTSRSMTAEDFTVHGRE----VSRLSVLKGIMGKFVD- 135 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 R+G F D ++ + + + + G TAI D + Sbjct: 136 ------GRVGDRIGLIIFGDTSYVLSPLTFDRNAI--HQLLDGIVPTLAGGGTAIGDGIG 187 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E + ++L+TDG+N + AK +GIRI TI Sbjct: 188 LGIKKLRERPEGS-----------RVLILVTDGKNETGTIPPLKAAQLAKQEGIRIYTIG 236 Query: 405 FSVNKT----------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 K L A + ++F AN T L K+++ Sbjct: 237 VGSTKNRVRLLSPDLRTYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQRIDE 296 Query: 447 NEIFERVIR 455 E E R Sbjct: 297 LEKSEAESR 305 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 83/296 (28%), Gaps = 55/296 (18%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I +G+F I+TALLM ++G GM VD L AA A + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSS 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V++ + + F + + + V + T L Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSGELAEVHVDLGIDVTKTANK---------L 115 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S+ + F++ G S I T + Y + ++D + SM Sbjct: 116 NSQVSFTATVPTTFMQIFGRDSITISG-----TATAEYQTAAFMDFYILLDNTPSMGVGA 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S+ L A + S N Sbjct: 171 TPSDVSKLEAKTGCAFACHQMDKSTNNYTI------------------------------ 200 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + V + +VR A ++ + K +D RMG F + Sbjct: 201 --------AKSLGVAMRIDVVRQATQALTDTAKTERISSDQFRMGVYTFGTKAEDA 248 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 90.4 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 63/198 (31%), Gaps = 19/198 (9%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 V S + + R + + R+G F S + + Sbjct: 122 SVSMSESDILPTRLAAARRVAQAFV-------RGRRNDRIGLVIFAGEAFSLCPLTTDYN 174 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK-----YIV 372 L + + TAI DA+ + + + E K I+ Sbjct: 175 LLNQY-LNDLNDGMIRTSGTAIGDALARCINRMRDRPAASSDTTQAKTEQWKSERSKVII 233 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----LSNCA--SPN 426 LL+DG+NT N + I + AK+ I+I TIA +A L A Sbjct: 234 LLSDGDNTAGNLDPITAASLAKAFNIKIYTIAVGQPVASASEASTVDEGILKKIATIGKG 293 Query: 427 SFFEANSTHELNKIFRDR 444 SFF A + L +F Sbjct: 294 SFFRAVDSGRLKTVFAQI 311 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 90.4 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 61/198 (30%), Gaps = 38/198 (19%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 I + +V+ + I ++ R+G F + ++ + + Sbjct: 112 DKQIARILVVKYVVNEFIER-------RESDRLGLILFGSQAYLQAPLTFDRKTVSTLLD 164 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + TAI DA+ A + ++ ++LLTDG NT Sbjct: 165 EAQL--GFAGEQTAIGDAVGLAIKRLRERPAS-----------QRVLILLTDGANTAGEV 211 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN----------------KTQQEKARYFLSNCA--SPN 426 + AK GI+I T+ + + L A + Sbjct: 212 APRQAADLAKQAGIKIYTVGVGADQMEQRMGLFGGFSRTVNPSSDLDEDTLRYMAETTGG 271 Query: 427 SFFEANSTHELNKIFRDR 444 +F A + EL I+ + Sbjct: 272 LYFRARNPQELQAIYEEL 289 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 90.4 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 62/197 (31%), Gaps = 36/197 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + V++ + I R+G F D + + + + Sbjct: 108 DYIDRLSSVKNVVTQFIEQ-------RQGDRLGLVLFADHAYLQTPLTADRQTVANQLNQ 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T TAI D + A T + S ++ ++LL+DG NT + Sbjct: 161 TIIGLI--GQKTAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLD 207 Query: 386 GIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFE 430 I N AK G++I TI T + L+ A + +F Sbjct: 208 PIEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFR 267 Query: 431 ANSTHELNKIFRDRIGN 447 A EL I+ I Sbjct: 268 ARDAQELQTIY-QAINQ 283 >gi|192289227|ref|YP_001989832.1| hypothetical protein Rpal_0799 [Rhodopseudomonas palustris TIE-1] gi|192282976|gb|ACE99356.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 468 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 66/501 (13%), Positives = 135/501 (26%), Gaps = 89/501 (17%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ + +ITAL+M ++ + GM +D + + L AA A I A P + Sbjct: 5 LLSRFVRDRKANIAVITALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAVRPAM 64 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + ++ F L + T +V Sbjct: 65 LMQTDAVAQNTAYAIFMST--ANRLASGLTSVPTPTITITDVGLQR-------------- 108 Query: 130 QVVLSSRYDLLLNPLSLFLR-SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + L+ F + M SW I + A+ S ++ ++D S S Sbjct: 109 ----TVKVSYNAASLNNFPQLLMNNVSWAISGASTAQASSAPN-----MNFYLLMDDSPS 159 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M ++ L P + S + ++ + L Sbjct: 160 MGIGATATDISNLIASTAPKYQKASSSQN-------CGFACHETNIAHDGGTKDNLAIAR 212 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + V S+ + + + + ++ + N T R F+ + + Sbjct: 213 ANNITLRIDLVTSAVNQLLNTWSNCPQSGVSGGVMQCMSA-LNNTTYRAALYTFDLSLNT 271 Query: 309 DPSFSWGVHKLIRTIVKTFA-----------IDENEMGSTAINDAMQTAYDTIISSNEDE 357 S + + + T I A+ + + S Sbjct: 272 LASLTTPTTAGAQVSNIALMPVAYQNCVVPTTNCKTDNGTDIAGALTSLNGIMPSPGLGS 331 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQ----------------DNEEGIAICNKAKSQGIRIM 401 + ++ + L+TDG + AIC K +GI+I Sbjct: 332 N---ASGDTPQEVVFLVTDGVEDKIASSCPNGSYASYSRCQQPLDTAICTTIKKRGIKIA 388 Query: 402 TI------------------------AFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + A+ + L CASP F + + Sbjct: 389 ILYTEYLQLKTPNVPVTDTWYMSWIDAYDEPTSSTGAIAKNLQACASPGFFSNVQTGGNI 448 Query: 438 NKIFRDRIGNEIFERVIRITK 458 + D +T+ Sbjct: 449 TQALTDLFLKVASSTAS-LTQ 468 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 73/212 (34%), Gaps = 38/212 (17%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +S + + V+ ++ I K R+G F D ++ + + Sbjct: 102 TSNGDFVDRLTAVKQVVSDFIDQRKG-------DRLGLVLFGDHAYLQTPLTFDRNTVRE 154 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +T TA+ + + A T I SN ++ I+LL+DG NT Sbjct: 155 QLDRTVLR--LVGQMTAMGEGLGLATKTFIESNA-----------PQRTIILLSDGANTA 201 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPN 426 E + AK +I T+ T ++ L+ A + Sbjct: 202 GVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGG 261 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +F A + EL +I++ E V + T+ Sbjct: 262 QYFRARNADELAEIYQTIDALE---PVTQATQ 290 >gi|39933805|ref|NP_946081.1| hypothetical protein RPA0728 [Rhodopseudomonas palustris CGA009] gi|39647652|emb|CAE26172.1| conserved unknown protein [Rhodopseudomonas palustris CGA009] Length = 468 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 65/501 (12%), Positives = 135/501 (26%), Gaps = 89/501 (17%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ + +I AL+M ++ + GM +D + + L AA A I A P + Sbjct: 5 LLSRFVRDRKANIAVIAALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAVRPAM 64 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + ++ F L + T +V Sbjct: 65 LMQTDAVAQNTAYAIFMST--ANRLASGLTSVPTPTITITDVGLQR-------------- 108 Query: 130 QVVLSSRYDLLLNPLSLFLR-SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + L+ F + M SW I + A+ S ++ ++D S S Sbjct: 109 ----TVKVSYNAASLNNFPQLLMNNVSWAISGASTAQASSAPN-----MNFYLLMDDSPS 159 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M ++ L P + S + ++ + L Sbjct: 160 MGIGATATDISNLIASTAPKYQKASSSQN-------CGFACHETNIAHDGGTKDNLAIAR 212 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + V S+ + + + + ++ + N T R F+ + + Sbjct: 213 ANNITLRIDLVTSAVNQLLNTWSNCPQSGVSGGVMQCMSA-LNNTTYRAALYTFDLGLNT 271 Query: 309 DPSFSWGVHKLIRTIVKTFA-----------IDENEMGSTAINDAMQTAYDTIISSNEDE 357 S + + + T I A+ + + S Sbjct: 272 LASLTTPTTAGAQVSNIALMPVAYQNCVVPTTNCKTDNGTDIAGALTSLNGIMPSPGLGS 331 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNE----------------EGIAICNKAKSQGIRIM 401 + ++ + L+TDG + + AIC K +GI+I Sbjct: 332 N---ASGDTPQEVVFLVTDGVEDKISSSCPNGSYASYSRCQQPLDTAICTTIKKRGIKIA 388 Query: 402 TI------------------------AFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + A+ + L CASP F + + Sbjct: 389 ILYTEYLQLKTPNVPVTDTWYMSWIDAYDEPTSSTGAIAKNLQACASPGFFSNVQTGGNI 448 Query: 438 NKIFRDRIGNEIFERVIRITK 458 + D +T+ Sbjct: 449 TQALTDLFLKVASSTAS-LTQ 468 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 51/309 (16%), Positives = 95/309 (30%), Gaps = 35/309 (11%) Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L +F GI S + + A + S DF + Y + Sbjct: 384 LKDEEVFSEIYGIVSTPVDIEVYAPFKEATVFIPIDTSKIPNQDFQNVKMFYLDEDLMTF 443 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + Q D K ++ P + ++ ++ V Sbjct: 444 VPLDEQGVDPVNKVVWAKTNHFTTFVLFYIP-TWKAIWEVPINKGEREINQQINYIDLVF 502 Query: 261 SSSLRHVIKKKHLV---RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + A S + ++ + D R F+D + Sbjct: 503 VLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------RAAVVDFDDFGYLLQPLTTDFQ 556 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + K + G T I + ++ A +IS ++ + K I+LLTDG Sbjct: 557 AV-----KNAIDRIDSWGGTNIAEGIRIANQQLIS---------LSSEDRIKVIILLTDG 602 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 E DN +AK+ GI I TI + + L + A + +F +S Sbjct: 603 EGYYDNNLT----TEAKNNGITIYTIGLGTSVDEN-----LLRDIATQTGGMYFPVSSAS 653 Query: 436 ELNKIFRDR 444 +L ++F+ Sbjct: 654 QLPQVFKRI 662 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 35/294 (11%), Positives = 92/294 (31%), Gaps = 28/294 (9%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSE--------GQPLNCFGQPADRTVKSYSSQNGKV 222 V + +V+D S SM + S + + + Sbjct: 546 SDTRPPVDVTFVLDRSGSMGPHNPTSWSAYEPDYEIDIGEEWEPIPTDEPFRNTHDWKSI 605 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +RD+ + + ++ + + ++ S + + + Sbjct: 606 QVRDDDGTIRTLEHRDFVHPDDWTEIRVHPYHQFGYIPGSIGIYPHPGNDPTNQRVEATR 665 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 I ++D D R+G F + S + ++V T G T + Sbjct: 666 NVIDELDPSAD--RVGVYDFASSGRALHPLSDDLESAKESVVGT------AYGGTNMAAG 717 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--ENTQDNEEGIAICNKAKSQGIRI 400 ++ A + + + + ++ ++LL+DG NT ++E + +++ + Sbjct: 718 LEAALN---------DYATRGTDDRERIVILLSDGKNSNTANDERMDELADRSDDLDYTL 768 Query: 401 MTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 T+ + + ++++ EL +F + + EI + Sbjct: 769 HTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELLDLFEEIVDEEIDLDM 822 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 67/191 (35%), Gaps = 38/191 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +V+ + I R+G F ++ + + +V++ Sbjct: 117 RIAVVKHIVGDFIER-------RVGDRLGLVLFGTSAYLQSPLTFDRTTVKQLLVESQI- 168 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +TAI DA+ + + + + ++LLTDG+NT Sbjct: 169 -GFAGPNTAIGDAIGLSIKRLRDRPAEN-----------RVVILLTDGQNTAGEVSPRQA 216 Query: 390 CNKAKSQGIRIMTIAFSVN----------------KTQQEKARYFLSNCA--SPNSFFEA 431 + AK G+++ TI N ++ L+ A + +F A Sbjct: 217 ADLAKQSGVKVYTIGVGANEMIVSDGFFGNFQRKINPSRDLDEDTLTYIAETTGGRYFRA 276 Query: 432 NSTHELNKIFR 442 +S ELN+I++ Sbjct: 277 HSPQELNQIYQ 287 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 50/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ H ++ + A +T+I DA+ A + Sbjct: 142 DRIGLILFGTRAYLQTPLTYDRHSVL--MRIDDATAGLAGKTTSIGDAVGLAVKRLQDV- 198 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---- 410 + I+LLTDG N + AK GI+I TI Sbjct: 199 ----------PSKGRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 248 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +F A L I++ Sbjct: 249 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTI 293 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 50/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ H ++ + A +T+I DA+ A + Sbjct: 134 DRIGLILFGTRAYLQTPLTYDRHSVL--MRIDDATAGLAGKTTSIGDAVGLAVKRLQDV- 190 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---- 410 + I+LLTDG N + AK GI+I TI Sbjct: 191 ----------PSKGRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 240 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +F A L I++ Sbjct: 241 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTI 285 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 50/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ H ++ + A +T+I DA+ A + Sbjct: 142 DRIGLILFGTRAYLQTPLTYDRHSVL--MRIDDATAGLAGKTTSIGDAVGLAVKRLQDV- 198 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---- 410 + I+LLTDG N + AK GI+I TI Sbjct: 199 ----------PSKGRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 248 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +F A L I++ Sbjct: 249 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTI 293 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 50/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ H ++ + A +T+I DA+ A + Sbjct: 134 DRIGLILFGTRAYLQTPLTYDRHSVL--MRIDDATAGLAGKTTSIGDAVGLAVKRLQDV- 190 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---- 410 + I+LLTDG N + AK GI+I TI Sbjct: 191 ----------PSKGRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 240 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +F A L I++ Sbjct: 241 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTI 285 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 90.0 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 56/207 (27%), Gaps = 36/207 (17%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + + L I R+ F + ++ + + Sbjct: 97 RDFKPNRLETAKKLLEEFID-------KRINDRISLVVFGGDAYTKVPLTFDHNVVKDIT 149 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K D TAI + + + + S K I+L+TDGEN Sbjct: 150 SKLTTDDITSNNRTAIGMGLGVSLNRLKDS-----------EAKSKVIILMTDGENNSGE 198 Query: 384 EEGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCAS--PNSFF 429 + AK GI+I TI E L N AS +F Sbjct: 199 MSPMGASEIAKELGIKIYTIGIGAREIQIRVPFGHTTVKNTELDENLLKNIASTTGGEYF 258 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRI 456 A S E F++ +I Sbjct: 259 RAGSEKE----FQEIFNRIDSLEKTKI 281 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 89.7 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 66/192 (34%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + LV+ +A I+ K R+G FF D ++ + + + + Sbjct: 109 VDRLSLVKTVVADFIQQRKG-------DRVGLIFFADNAYLQAPLTFDLKTVSGYMQQAV 161 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI + + A ++ +K ++LLTDG+N+ + + Sbjct: 162 L--GLVGEQTAIGEGIGLALKRFDAA-----------DNPQKVLILLTDGQNSAGEVKPL 208 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--SPNSFFEAN 432 A+ QG++I TI + ++ L A + +F A Sbjct: 209 DAAKFAQEQGVKIYTIGVGADAYYKRTLFGNQKVDPSRDLDEVTLKTIAAQTGGQYFRAR 268 Query: 433 STHELNKIFRDR 444 L I+ + Sbjct: 269 DASSLAAIYAEL 280 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 89.7 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 50/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ H ++ + A +T+I DA+ A + Sbjct: 134 DRIGLILFGTRAYLQTPLTYDRHSVL--MRIDDATAGLAGKTTSIGDAVGLAVKRLQDV- 190 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---- 410 + I+LLTDG N + AK GI+I TI Sbjct: 191 ----------PSKGRVIILLTDGANNSGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 240 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +F A L I++ Sbjct: 241 TGDFFAPTLSAELDEKTLEEMAKMTGGRYFRATDPESLQSIYQTI 285 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 89.3 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 44/405 (10%), Positives = 120/405 (29%), Gaps = 27/405 (6%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSR-----AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI----VRDTA 119 ++ + +N ++ + + +K + +++ + T Sbjct: 67 NSGRKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTDEKNMHLTPLCKTI 126 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQTKAEAETVSRSYHK-EHGV 177 + + ++ + S F L+ ++ Q +++ + + + Sbjct: 127 NNNSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASKSKALKKNTFNIPI 186 Query: 178 SIQWVIDFSRSMLDYQRDSE-GQPLNCFGQPADRTVKSYSSQ------NGKVGIRDEKLS 230 + V D S SM D + + N +K + + + + Sbjct: 187 DLMVVADLSGSMKDGIKGEKLEGGTNSKIYILREVLKELADKSLFTQESNEYNRIGITAF 246 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-LASVIRSIKKID 289 K +L D L + L R + + + + Sbjct: 247 AMGAEHPKENKCVLPFVLQDNLHGMSKIKIKQYLASRYNRTTLKRTEFVDNFVALLDIEK 306 Query: 290 N-VNDTVRMGATFFNDRVISDPSFSW----GVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + F I K + G T + + Sbjct: 307 TLNSIGQTNYDITFPKSSICLEGLKNASQFWYTKTENDKFRDRVNSLKANGGTLASSGLL 366 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TA + ++S + N E K+ I++L+DG + N + Sbjct: 367 TASNQMLSEKSRSE---ELNQETKRVILVLSDGNDDMSNLNLADL 408 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 34/104 (32%), Gaps = 15/104 (14%) Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA--------KSQGIRIMTI 403 SS +Y LTD N G +C+ K + +++ + Sbjct: 434 SSTPKTNKAYNTRSSTYQYNTYLTDK---TKNIIGKGMCSIIQEKLNALNKDENTKLVFV 490 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 F + ++ C ++F A++ L F+ IG Sbjct: 491 EFGY----RSESADAWKTCVGNGNYFYADNRESLLNSFKQAIGE 530 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 89.3 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 50/309 (16%), Positives = 95/309 (30%), Gaps = 35/309 (11%) Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L +F GI S + + A + S DF + Y + Sbjct: 384 LKDEEVFSEIYGIVSTPVDIEVYAPFKEATVFIPIDTSKIPNQDFQNVKMFYLDEDLMTF 443 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + Q D K ++ P + ++ ++ V Sbjct: 444 VPLDEQGVDPVNKVVWAKTNHFTTFVLFYIP-TWKAIWEVPINKGEREINQQVNYIDLVF 502 Query: 261 SSSLRHVIKKKHLV---RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + A S + ++ + D R F++ + Sbjct: 503 VLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------RAAVVDFDNFGYLLQPLTTDFQ 556 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + K + G T I + ++ A +IS ++ + K I+LLTDG Sbjct: 557 AV-----KNAIDRIDSWGGTNIAEGIRIANQQLISR---------SSEDRIKVIILLTDG 602 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 E DN +AK+ GI I TI + + L + A + +F +S Sbjct: 603 EGYYDNNLT----TEAKNNGITIYTIGLGTSVDEN-----LLRDIATQTGGMYFPVSSAS 653 Query: 436 ELNKIFRDR 444 +L ++F+ Sbjct: 654 QLPQVFKRI 662 >gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] gi|122140331|sp|Q3T052|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; Flags: Precursor gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 89.3 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 78/250 (31%), Gaps = 13/250 (5%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 K + +G +R + P Y ++ D + V + Sbjct: 226 YPEKQDTVIDGSFIVRYDVDRPLSGGSIQIENGYFVHYFAPDSLSTIPKNVIFVIDKSGS 285 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ ++I+ + + + F+ + K++A Sbjct: 286 MMGRKIKQTREALIKILDDLSP---HDQFDLISFSSEATTWKPLLVPASTENVNEAKSYA 342 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T INDAM A + +N++E+ + I+LLTDG+ T + Sbjct: 343 TGIQAQGGTNINDAMLMAVQLLEKANQEELLPEGSIT----LIILLTDGDPTVGETNPLN 398 Query: 389 ICNKAKSQGI----RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + I + + F + + + L N +E + + + F Sbjct: 399 IQKNVRKA-INGQHSLFCLGFGFDVSYAFLEKMALENGGLARRIYEDSDSALQLQDFYQE 457 Query: 445 IGNEIFERVI 454 + N + V Sbjct: 458 VANPLMTSVA 467 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 88.9 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 74/182 (40%), Gaps = 24/182 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + A +I K D + ++ + + +K D Sbjct: 397 LNETKTATREFASTIFKSD-----ADVCLVSYDSSARNVIDSTDN-----EYALKAAVRD 446 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIA 388 + G T I DA++ +Y+ + S D K+ IVL++DGE + ++ IA Sbjct: 447 LSAGGGTNIEDALRVSYERLEGSGSD-----------KRIIVLMSDGEANEGLVGDDLIA 495 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 N+ K G+ I T+ F + + + + + + ASP +E + +L F D IG++ Sbjct: 496 YANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPGCHYEVDDASQLRYFFGD-IGDD 554 Query: 449 IF 450 I Sbjct: 555 IN 556 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 88.9 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 43/255 (16%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYY--------MLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G G R+ K + Y+ ++ + + G + + Sbjct: 53 GNAGTRNFKRANYLKYLLGFIWILLIISGSGIQWLGKPVSLPQSGRDLIMAIDLSGSMAI 112 Query: 272 HLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIV 324 ++ A + + V + R+G F R ++ + + + + Sbjct: 113 QDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD 172 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 TAI DA+ A + D K ++LLTDGEN Sbjct: 173 DASIA--LPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGTL 219 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFF 429 + + AK I+I TI T ++ L A + +F Sbjct: 220 QPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYF 279 Query: 430 EANSTHELNKIFRDR 444 A ++ +L K++ Sbjct: 280 RAQNSSDLKKVYESI 294 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 88.9 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 43/394 (10%), Positives = 111/394 (28%), Gaps = 26/394 (6%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A++ +V L + + +Q + + + + ++ E+ + Sbjct: 32 AALLLVAVSLTACSGKGAEVE--HRQAKQQAEQRHQVASQRQAEMRDAAKVEMARVAAPM 89 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + N + R + ++ + T S + Sbjct: 90 QMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFSIDVDTGSYAT 149 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ ++ + + + LN F K+ + + + + M+ Sbjct: 150 LRRML-REGHLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRI 208 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L P + + +D S + K L++ AL + + D V+ V Sbjct: 209 GLKGYDLPKSQLGASNLVFLLDVSGSMASVDKLPLLQTALKLLTAQLSAQDKVSIVV--- 265 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G + + GST + AY + Sbjct: 266 ------YAGAAGVVLDGASGNDTQTLNYALEQLSAGGSTNGGQGITQAYQL--AKKHFIP 317 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKA 415 + + ++L TDG+ + + K GI + T+ F + + Sbjct: 318 NGINR-------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 370 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ D + + + Sbjct: 371 EQLADK--GNGNYAYIDTLNEARKVLVDELSSTL 402 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 88.9 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 61/168 (36%), Gaps = 29/168 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ H + + + +T STAI + + A T I S Sbjct: 128 DRLGLVLFGDHAYLQTPLTFDRHTVEQQLDRTVL--GLVGQSTAIGEGLGIATKTFIKS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ I+LL+DG NT + + AK G+ I T+ + Sbjct: 185 ----------KAPQRVIILLSDGANTAGVIDPLEAAKLAKESGVTIYTVGIGADEMLQRS 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 Q+ L+ A + +F A + EL+KI+ I Sbjct: 235 IFGVQKVNPSQDLDEKTLTKIAQMTGGKYFRARNPQELDKIY-QIINQ 281 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/359 (9%), Positives = 85/359 (23%), Gaps = 40/359 (11%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++L +G+ + A + +L + G VD+ R E L+QA + ++ A L Sbjct: 2 RRLASDRSGNTLALIAAGLLPLLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALGTE 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT--------DREVRDIVRDTAVEMN 123 + ++ ++ T ++ + F + T+V Sbjct: 62 IATLTDIPTDAGTRGQEFFNSNFQDGNYGTQNRTFNMVLENDYSVSGTATVDVPTSVMTV 121 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHGVSIQW 181 + V + + + + + + + S ++ S + Sbjct: 122 FGFTKIPVKVECQARISFSDVDVMMVLDVTGSMKHTNSGDTLSKIDSLKATVRNFYDQME 181 Query: 182 VIDFSRSMLDY---------------------QRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + + + Y + + K+ + Sbjct: 182 GAKSAGTRIRYGFVPYASNVNVGHLLKDEWVVNSWAYQSRAISGTTTVEAGTKTRENWAY 241 Query: 221 KVGIRDEKLSPYMV------SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 K G R + + + + + V V Sbjct: 242 KSGSRSAWIEESTYAATMNPGSGDTNPFSCDQSLPKSTYKWHSEKNGIPSTSVTMDPPGV 301 Query: 275 RDALASVIRSIKKIDNVNDTVRMGAT---FFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R V +ND V + + A+D Sbjct: 302 RTTQPMQSTENGVSYRVRLQGSNCIVERREYNDYVQTFDEVTVVPTLANLYNYLPVAMD 360 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 56/196 (28%), Gaps = 36/196 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V DT + N P+ + + + + ++ G+T + M Sbjct: 434 PVYDTKKEFIQTGNWWFSGCPAPAQKLKAMTSGELDSYLDSLTPHGATYHDGGMIWGGRL 493 Query: 350 IISSNEDEVHRM-KNNLEAKKYIVLLTDGENTQDNE------------------------ 384 + K ++++ LTDG+ + Sbjct: 494 LSQYGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRWTQTSSLTLA 553 Query: 385 -----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + CN+ K G + +AF + CA +FEA + +LN Sbjct: 554 QTVEERFLFACNEVKKLGATVWVVAFGTAAN------DKMKTCAGSGRYFEAANASQLND 607 Query: 440 IFRDRIGNEIFERVIR 455 F + R+ R Sbjct: 608 AFSTIAKSTGDLRIAR 623 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 66/196 (33%), Gaps = 25/196 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + ++ + +D + +G F + L+ + + Sbjct: 109 VDRLTAIKKVAGDFVTGGDNLD-GRLSDLVGLITFAGYADGVTPPTLDHAFLVSQLNHSQ 167 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + TAI DA+ A + + + + ++++ K I+LLTDGEN + E I Sbjct: 168 IVTNRSEDGTAIGDAISLAVEKLNALDARRKEKIQS-----KIIILLTDGENNAGDLEPI 222 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCAS--PNSF 428 A++ GI++ TI L AS + Sbjct: 223 QAAELAQTMGIKVYTIGVGTKGRAPMPVTDMFGRQSMQWMSVNIDEETLQKVASITGGKY 282 Query: 429 FEANSTHELNKIFRDR 444 F A T L KI+ + Sbjct: 283 FRATDTDSLAKIYGEI 298 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/334 (9%), Positives = 82/334 (24%), Gaps = 9/334 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G ++ + V L V + VD+ + L+ A +A + A+ L S ++ Sbjct: 10 KKQQGLVAVMITAALLVFLAVSALAVDINHMVVNKTRLQNAVDSATLAAATILDNSKDKD 69 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + A+ + + + + D T V + Sbjct: 70 AVDAEVGTAL--NAMAASTGNQEIDFSTASISIDYSNDPKDFTGTATFDSTDDVYVRVRV 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 L F++ G++ + + + +Y+ + + S + + + Sbjct: 128 DA---LEMDEFFIQLFGLEKVVSASAVAGPSSGLAYNNVVPIGVCIGDGTSDNDISPEDG 184 Query: 196 SEGQPLNCFG-QPADRTVKSYSSQNGKVGIRDEKLSPYM---VSCNKSLYYMLYPGPLDP 251 + ++ + G + + Y + G D Sbjct: 185 FHDETGEEITSVFGYEIGTVHALKVGDSSLSEMGNGNYHLLDFGSGGNTIKEGLGGSYDQ 244 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + + V + + D +D V T Sbjct: 245 PIKIGEDITTKPGGTVGPTGDGLNTRFGEYGGGLSASDYPSDYVTTEPTNEITIDAGTGE 304 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 GS ++ + Sbjct: 305 IDSDGSYTYAQYESDTNACIASGGSGCASNGVAW 338 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 63/168 (37%), Gaps = 29/168 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ + + + +T STAI + + A T I+S Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRKTVEKQLDRTVLGLI--GQSTAIGEGLGIATKTFINS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ I+LL+DG NT + + AK G++I T+ + Sbjct: 185 ----------KAPQRVIILLSDGANTSGVIDPLEAAKLAKESGVKIYTVGVGADQMVQQG 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 Q+ L++ A + +F A + +L KI+ D I Sbjct: 235 FFGDRIVNPSQDLDEKTLTDIAKMTGGEYFRARNPQQLEKIY-DIINK 281 >gi|296474801|gb|DAA16916.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] Length = 916 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 78/250 (31%), Gaps = 13/250 (5%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 K + +G +R + P Y ++ D + V + Sbjct: 226 YPEKQDTVLDGSFIVRYDVDRPLSGGSIQIENGYFVHYFAPDSLSTIPKNVIFVIDKSGS 285 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ ++I+ + + + F+ + K++A Sbjct: 286 MMGRKIKQTREALIKILDDLSP---HDQFDLISFSSEATTWKPLLVPASTENVNEAKSYA 342 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T INDAM A + +N++E+ + I+LLTDG+ T + Sbjct: 343 TGIQAQGGTNINDAMLMAVQLLEKANQEELLPEGSIT----LIILLTDGDPTVGETNPLN 398 Query: 389 ICNKAKSQGI----RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + I + + F + + + L N +E + + + F Sbjct: 399 IQKNVRKA-INGQHSLFCLGFGFDVSYAFLEKMALENGGLARRIYEDSDSALQLQDFYQE 457 Query: 445 IGNEIFERVI 454 + N + V Sbjct: 458 VANPLMTSVA 467 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 66/223 (29%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 79 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGDR 138 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + D Sbjct: 139 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKLKKYPGD 196 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K ++LLTDGEN + + AK I+I TI Sbjct: 197 S-----------KALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 245 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 246 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 288 >gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 77/250 (30%), Gaps = 13/250 (5%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 K + +G +R + P Y ++ D + V + Sbjct: 226 YPEKQDTVLDGSFIVRYDVDRPLSGGSIQIENGYFVHYFAPDSLSTIPKNVIFVIDKSGS 285 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ ++I+ + + + F+ + K++A Sbjct: 286 MMGRKIKQTREALIKILDDLSP---HDQFDLISFSSEATTWKPLLVPASTENVNEAKSYA 342 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T INDAM A + +N++E+ + I+LLTDG+ T Sbjct: 343 TGIQAQGGTNINDAMLMAVQLLEKANQEELLPEGSIT----LIILLTDGDPTVGETNPSN 398 Query: 389 ICNKAKSQGI----RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + I + + F + + + L N +E + + + F Sbjct: 399 IQKNVRKA-INGQHSLFCLGFGFDVSYAFLEKMALENGGLARRIYEDSDSALQLQDFYQE 457 Query: 445 IGNEIFERVI 454 + N + V Sbjct: 458 VANPLMTSVA 467 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 88.5 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ + + + + TAI DA+ A + Sbjct: 137 DRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKLKKYP 194 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 D K ++LLTDGEN + + AK I+I TI Sbjct: 195 GDS-----------KALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVET 243 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 244 TFGQRLINTSEDLDTTVLEKIAEMTGGKYFRAQNSSDLKKVYESI 288 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 88.1 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 121/404 (29%), Gaps = 26/404 (6%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSRAKNSFTFPKQKI----------EEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + S + + + L + D+ + T Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHG 176 + + ++ + S F +G A + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIKHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV----------KSYSSQNGKVGIRD 226 + + V D S SM D + + + R V +++ ++GI Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + NK + + L + +S +++ V + +A + Sbjct: 247 FAMGAEHPKENKCVLPFVLQNNLHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTEAT 306 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + S + K + + G T + + T Sbjct: 307 LNSIGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGLLT 366 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 A + ++S + N E K+ I++L+DG + N + Sbjct: 367 ASNQMLSEKSRSE---ELNQETKRVILVLSDGNDDMSNLNLADL 407 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 15/113 (13%) Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA--------K 394 + +SS N Y LT N + +C+ K Sbjct: 424 LILGKKEDLSSTTTSNKAYYNRHSTFNYNTYLT---NKTKDISRKGMCSIIQEKLNTLNK 480 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +++ + F + ++ C ++F A++ L F+ IG Sbjct: 481 DKNTKLVFVEFGY----RSESADAWKTCVGNGNYFYADNRESLLNSFKQAIGE 529 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 64/191 (33%), Gaps = 28/191 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++ I R+G F + ++ + + + + Sbjct: 119 TRLAAVKKVVSDFIDQ-------RQGDRIGLILFGTQAYLQTPLTFDTQSVNQFLQEAQL 171 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + + + K I+LLTDGENT E + Sbjct: 172 --GFAGKDTAIGDAIGLSVKRLKNQSSASSAK----PSNSKVIILLTDGENTAGEVEPLQ 225 Query: 389 ICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANS 433 A+ G +I T+ + L+ A + +F A + Sbjct: 226 AAKLAEKIGAKIYTVGIGADEMIVRGFFGNRRVNPSASLDEETLTAIANTTGGLYFRARN 285 Query: 434 THELNKIFRDR 444 T ELN I+ + Sbjct: 286 TQELNNIYSEL 296 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 121/404 (29%), Gaps = 26/404 (6%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSRAKNSFTFPKQKI----------EEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + S + + + L + D+ + T Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHG 176 + + ++ + S F +G A + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV----------KSYSSQNGKVGIRD 226 + + V D S SM D + + + R V +++ ++GI Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + NK + + L + +S +++ V + +A + Sbjct: 247 FAMGAEHPKENKCVLPFVLQNNLHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTEAT 306 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + S + K + + G T + + T Sbjct: 307 LNSIGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGLLT 366 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 A + ++S + N E K+ I++L+DG + N + Sbjct: 367 ASNQMLSEKSRSE---ELNQETKRVILVLSDGNDDMSNLNLADL 407 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 15/113 (13%) Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA--------K 394 + +SS N Y LT N + +C+ K Sbjct: 424 LILGKKEDLSSTTTSNKAYYNRHSTFNYNTYLT---NKTKDISRKGMCSIIQEKLNTLNK 480 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +++ + F + ++ C ++F A++ L F+ IG Sbjct: 481 DKNTKLVFVEFGY----RSESADAWKTCVGNGNYFYADNRESLLNSFKQAIGE 529 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 43/255 (16%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYY--------MLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G G R+ K + Y+ ++ + + G + + Sbjct: 53 GNAGTRNFKRANYLKYLLGFIWILLIISGSGIQWLGKPVSLAQSGRDLIMAIDLSGSMAI 112 Query: 272 HLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIV 324 ++ A + + V + R+G F R ++ + + + + Sbjct: 113 QDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD 172 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 TAI DA+ A + D K ++LLTDGEN Sbjct: 173 DASIA--LPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGTL 219 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFF 429 + + AK I+I TI T ++ L A + +F Sbjct: 220 QPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYF 279 Query: 430 EANSTHELNKIFRDR 444 A ++ +L K++ Sbjct: 280 RAQNSSDLKKVYESI 294 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 43/255 (16%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYY--------MLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G G R+ K + Y+ ++ + + G + + Sbjct: 47 GNAGTRNFKRANYLKYLLGFIWILLIISGSGIQWLGKPVSLAQSGRDLIMAIDLSGSMAI 106 Query: 272 HLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIV 324 ++ A + + V + R+G F R ++ + + + + Sbjct: 107 QDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD 166 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 TAI DA+ A + D K ++LLTDGEN Sbjct: 167 DASIA--LPGPQTAIGDAIGLAVKKLKKYPGDS-----------KALILLTDGENNSGTL 213 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFF 429 + + AK I+I TI T ++ L A + +F Sbjct: 214 QPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYF 273 Query: 430 EANSTHELNKIFRDR 444 A ++ +L K++ Sbjct: 274 RAQNSSDLKKVYESI 288 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 120/404 (29%), Gaps = 26/404 (6%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++ I+ +G + ++ LL +L + + ++ + L + + A+++ + Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 IQSLEEVSSRAKNSFTFPKQKI----------EEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + S + + + L + D+ + T Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHG 176 + + ++ + S F +G A + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV----------KSYSSQNGKVGIRD 226 + + V D S SM D + + R V +++ ++GI Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLNGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + NK + + L + +S +++ V + +A + Sbjct: 247 FAMGAEHPKENKCVLPFVLQNNLHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTEAT 306 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + S + K + + G T + + T Sbjct: 307 LNSIGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGLLT 366 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 A + ++S + N E K+ I++L+DG + N + Sbjct: 367 ASNQMLSEKSRSE---ELNQETKRVILVLSDGNDDMSNLNLADL 407 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 15/113 (13%) Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA--------K 394 + +SS N Y LT N + +C+ K Sbjct: 424 LILGKKEDLSSTATSNRAYYNRHSTFNYNTYLT---NKTKDISRKGMCSIIQEKLNTLNK 480 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +++ + F + ++ C ++F A++ L F+ IG Sbjct: 481 DKNTKLVFVEFGY----RSESADAWKTCVGNGNYFYADNRESLLNSFKQAIGE 529 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 43/394 (10%), Positives = 111/394 (28%), Gaps = 26/394 (6%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A++ +V L + + +Q + + + + ++ E+ + Sbjct: 47 AALLLVAVSLTACSGKGAEVE--HRQAKQQAEQRHQVASQRQAEMRDAAKVEMARVAAPM 104 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + N + R + ++ + T S + Sbjct: 105 QMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFSIDVDTGSYAT 164 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ ++ + + + LN F K+ + + + + M+ Sbjct: 165 LRRML-REGHLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRI 223 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L P + + +D S + K L++ AL + + D V+ V Sbjct: 224 GLKGYDLPKSQLGASNLVFLLDVSGSMASVDKLPLLQTALKLLTAQLSAQDKVSIVV--- 280 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G + + GST + AY + Sbjct: 281 ------YAGAAGVVLDGASGNDTQTLNYALEQLSAGGSTNGGQGITQAYQL--AKKHFIP 332 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKA 415 + + ++L TDG+ + + K GI + T+ F + + Sbjct: 333 NGINR-------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 385 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ D + + + Sbjct: 386 EQLADK--GNGNYAYIDTLNEARKVLVDELSSTL 417 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 88.1 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 71/205 (34%), Gaps = 18/205 (8%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + + + V+DA+ + + R+G F+D + Sbjct: 102 GSGQSLKTGTTAAGRTRMDAVKDAVRTFV-------RGRRDDRIGLVVFSDNAYVISPLT 154 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + L+ + G TAI D + A + + + + +VL Sbjct: 155 FDHQYLLDYLGFVDGEILLGEGQTAIGDGLALASAVL-------ARQAGRDARGHQVVVL 207 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE-KARYFLSN--CASPNSFFE 430 TDGE+ + + I + +AKS GIR+ I ++ + L A+ +F Sbjct: 208 FTDGESNRGR-DPIEVVGEAKSAGIRVHVIGVDLDAEVKTRPGVQLLRRGVVAAGGRYFA 266 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A+S +L R E V R Sbjct: 267 ADSERDLLTASRTIDAMEKGVLVSR 291 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 66/223 (29%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 85 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGDR 144 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + Sbjct: 145 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKL------ 196 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K K ++LLTDGEN + + AK I+I TI Sbjct: 197 -----KKFPGDSKALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 251 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 252 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 294 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 69/191 (36%), Gaps = 31/191 (16%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +DSS + A S + ++ + D R F+D + Sbjct: 40 VFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------RAAVVDFDDYGYLLQPLTTD 93 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 VK + G T I + ++ A +IS + D+ K I+LLT Sbjct: 94 F-----QTVKNAIDRIDSWGGTNIAEGIRIANHQLISQSSDDRI---------KVIILLT 139 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 DGE DN +AK+ GI I TI + + L N A + +F +S Sbjct: 140 DGEGYYDNNLT----TEAKNNGITIYTIGLGTSVDEN-----LLRNIATQTGGMYFPVSS 190 Query: 434 THELNKIFRDR 444 +L ++F+ Sbjct: 191 ASQLPQVFKRI 201 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 87.7 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 34/158 (21%), Positives = 55/158 (34%), Gaps = 18/158 (11%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + D + K I + ++ +TAI +A+ + Sbjct: 136 VLYGDFAFIASPIT--YEKNIIIEMLSYLNQGMAGQNTAIGEAIAMSLR----------- 182 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 K++ K +VLLTDGE+ + AK + I+I TI + E L Sbjct: 183 AFKHSKAKSKIVVLLTDGEHNSGDISPKDALVLAKEENIKIYTIGMG---NRGEADEALL 239 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A S FF A + EL +I+ E + R Sbjct: 240 KKIADESGGEFFYATNAKELKEIYEHIDELESSKIKSR 277 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 87.4 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 67/223 (30%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 85 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGDR 144 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + D Sbjct: 145 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKLKKYPGD 202 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K ++LLTDGEN + + AK I+I TI + Sbjct: 203 S-----------KALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGDQMIVETTF 251 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 252 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 294 >gi|218506166|ref|ZP_03504044.1| hypothetical protein RetlB5_00485 [Rhizobium etli Brasil 5] Length = 205 Score = 87.4 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 62/216 (28%), Gaps = 33/216 (15%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + TG+ II AL + ML G D +R ++ A+I A Sbjct: 5 RAFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAV 64 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + ++E + N Sbjct: 65 KQI-NNTGDTDALKLKVTDWFHAQVENSYTLGEIDIDTTNH------------------- 104 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + F++ I + + + + + SY +++ VID Sbjct: 105 --------NITATASGTVPTTFMKIANIDTVPVSVASAVKGPATSY-----LNVYIVIDT 151 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 S SML + + S + G Sbjct: 152 SPSMLLAATTAGQATMYSGIGCQFACHTGDSHKVGN 187 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 87.4 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 69/201 (34%), Gaps = 36/201 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIR 321 + I + + +V R+G F D ++ + + + Sbjct: 95 MSIPDMVTKNGQSIDRLTAVKHVLSDFIEKRKGDRLGLVLFADHAYLQTPLTFDRNTVEQ 154 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +T STAI + + A T I+S ++ I+LL+DG NT Sbjct: 155 QLDRTVLGLI--GQSTAIGEGLGIATKTFINS-----------KAPQRVIILLSDGANTS 201 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPN 426 + + AK G++I T+ + Q+ L+ A + Sbjct: 202 GVIDPLEAAKLAKESGVKIYTVGVGADQMVQKGFFGDRLVNPSQDLDEKTLTEIAKMTGG 261 Query: 427 SFFEANSTHELNKIFRDRIGN 447 +F A + +L KI+ D I Sbjct: 262 EYFRARNPQQLEKIY-DIINK 281 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 87.4 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 66/223 (29%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 79 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGDR 138 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + D Sbjct: 139 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKLKKYPGD 196 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K ++LLTDGEN + + AK I+I TI Sbjct: 197 S-----------KALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 245 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 246 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 288 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 87.4 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 66/223 (29%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 85 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGDR 144 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + D Sbjct: 145 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKLKKYPGD 202 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K ++LLTDGEN + + AK I+I TI Sbjct: 203 S-----------KALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 251 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 252 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 294 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 87.4 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 66/223 (29%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 79 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGDR 138 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + D Sbjct: 139 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKLKKYPGD 196 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K ++LLTDGEN + + AK I+I TI Sbjct: 197 S-----------KALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 245 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 246 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 288 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 87.4 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 62/194 (31%), Gaps = 35/194 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I + V++ + I R+G F D + + + + Sbjct: 108 DYIDRLSAVKNVVTQFIEQ-------RQGDRLGLVLFADHAYLQTPLTADRQTVANQLNQ 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T TAI D + A T + S ++ ++LL+DG NT + Sbjct: 161 TIIGLI--GQKTAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLD 207 Query: 386 GIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFE 430 I N AK G++I TI T + L+ A + +F Sbjct: 208 PIEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKVATMTGGQYFR 267 Query: 431 ANSTHELNKIFRDR 444 A EL I++ Sbjct: 268 ARDAQELQTIYQAI 281 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 87.4 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 66/194 (34%), Gaps = 37/194 (19%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +++ + I R+G F + ++ + + ++ Sbjct: 117 VDRLTVIKSVVDDFISH-------RKNDRIGLILFGTQAYLQTPLTFDHKTVRTLLNESR 169 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TAI DA+ A + KN+ K ++LLTDG NT + + Sbjct: 170 I--GIAGGQTAIGDAIGLALKRL-----------KNHKTGSKVLILLTDGANTAGSVSPV 216 Query: 388 AICNKAKSQGIRIMTIAFSVN---------------KTQQEKARYFLSNCAS--PNSFFE 430 A QG++I T+ + + + AS +F Sbjct: 217 QAAELAARQGMKIYTVGVGADEMRIPGVLGFGSQIVNPSADLDEVTMKKIASLTGAQYFR 276 Query: 431 ANSTHELNKIFRDR 444 A +T EL +I++ Sbjct: 277 ARNTDELRRIYQHI 290 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 87.0 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 92/309 (29%), Gaps = 31/309 (10%) Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN--------------CFGQP 207 + T + K V V D S SM + + Sbjct: 701 SSLATSQSTVEKIQTVDAMMVFDLSGSMNEIMSGQNQLKDIGEFSRVKNQMDINKVYYWN 760 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + K V + Y+ S + DS + Sbjct: 761 KYEKSGWWPWTYDKSVGMGTAAVSGNVYAKYPVKYIDGQWKKYVDGSYQSISDSDVMAVW 820 Query: 268 IKKKHLVRDALASVIRSIKKIDNVN--DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 K ++DA + + I + ++ + G+ K+ + + Sbjct: 821 TSKISALKDAASGFVTGISDTSPDSLVGIATFYGIGNGWNSSTEGKLNHGLSKVNKNEML 880 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T+ ++ AY + + + KKY++L +DGE + N++ Sbjct: 881 KSVNALFADGGTSPQKGLEHAYSELQKAEDGN----------KKYVILFSDGEPSDSNDK 930 Query: 386 GIAICNK--AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + K G ++T+ +N E A + AS F A++ ELNKIF++ Sbjct: 931 METEASAVKLKEAGYTVITVGLGLN---NETATWLGEKVASAGCAFTADTAEELNKIFQN 987 Query: 444 RIGNEIFER 452 R Sbjct: 988 IQSTITQSR 996 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 87.0 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 68/201 (33%), Gaps = 36/201 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIR 321 + I + + +V R+G F D ++ + + Sbjct: 95 MSIPDMVTKNGQSIDRLTAVKHVLSDFIEKRKGDRLGLVLFADHAYLQTPLTFDRKTVEQ 154 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +T STAI + + A T I+S ++ I+LL+DG NT Sbjct: 155 QLDRTVLGLI--GQSTAIGEGLGIATKTFINS-----------KAPQRVIILLSDGANTS 201 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPN 426 + + AK G++I T+ + Q+ L+ A + Sbjct: 202 GVIDPLEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDLDEKTLTEIAKMTGG 261 Query: 427 SFFEANSTHELNKIFRDRIGN 447 +F A + +L KI+ D I Sbjct: 262 EYFRARNPQQLEKIY-DIINK 281 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 87.0 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 26/201 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +++ + ++++ +G F + L+ + Sbjct: 109 VDRLTAIKNVAGKFVEGKEELE-GRFNDLVGLMTFAGYADGITPPTLDHPYLVSQLNNIQ 167 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + TAI DA+ A + + + + ++K+ K I+LLTDGEN E I Sbjct: 168 IVTNRSEDGTAIGDAISLAVEKLNALDARRDEKVKS-----KVIILLTDGENNAGEVEPI 222 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPNS 427 A++ GI++ TI L A + Sbjct: 223 QAAELAETLGIKVYTIGVGTKGEAPVPVTDPFSGKQVVQWMPVNIDEATLQKVADLTHGK 282 Query: 428 FFEANSTHELNKIFRDRIGNE 448 +F A T L KI+ + E Sbjct: 283 YFRATDTDSLEKIYHEIDALE 303 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 87.0 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 64/210 (30%), Gaps = 35/210 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V+D + + +D ++ F + + L+ + Sbjct: 116 TRLEVVKDVFRAFVAGEDGLDGRSNDTIG-LVSFAGFADTRCPLTLNHGSLLTILDDLEI 174 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + E TAI D + A + + S + I+LLTDG N E + Sbjct: 175 VRERAEDGTAIGDGLGLAVERLRESEASS-----------RVIILLTDGVNNAGIETPLE 223 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPNSF 428 A GI++ TI + E L A + + Sbjct: 224 AAELASRLGIKVYTIGAGTDGVAPVRVTNPLTGAEELRPMPVEIDEATLEAIAEHTGGRY 283 Query: 429 FEANSTHELNKIFRDRI---GNEIFERVIR 455 F A L +++ EI ER +R Sbjct: 284 FRATDGDGLRQVYEQIDRLERTEISERRLR 313 >gi|307247518|ref|ZP_07529563.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306856021|gb|EFM88179.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 510 Score = 87.0 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 48/497 (9%), Positives = 128/497 (25%), Gaps = 96/497 (19%) Query: 38 GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR------AKNSFTFPKQKIE 91 + ++ + L + + A+++ + ++ + +N ++ Sbjct: 14 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSSNKENDSFDISSEVG 73 Query: 92 EYLIRNFENNLKKNFTDREVR----DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 + + +K V T N + ++ + S F Sbjct: 74 KRDSQMATKFVKAFLPQTNEEKMHLTPVCKTINNTNGKGHTSSSEVTCTVSGTIEHKSWF 133 Query: 148 LRSMGIKSW--LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ------ 199 +G A + + + V D S SM + Sbjct: 134 PLKVGTVEVIPHEVNVASKSKAFKKNTFNIPIDLMVVADLSGSMRYDITNKYETNNETSK 193 Query: 200 ----PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML-YPGPLDPSLS 254 + ++Q+ ++ + L + + + +L Y D L Sbjct: 194 LGILKDVLIELAEKTLLSEDANQHNRIYVTPFALGAEIDASSCALPYHWDKGKDPDTPLK 253 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 ++ + + + + + + + + + F V + Sbjct: 254 IKNILSKKEGNNQQSRAEFIDNFVYKMNTQATL--DNIGEKQNYKVTFPKGVFCLKNMKN 311 Query: 315 ----GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + ++ + G+T + + A + +I + + K+ Sbjct: 312 KNHGWFSRKDKSDFTGYVRSLRADGATLASSGVLVAANNMIKGGSRTK---ELKEQTKRV 368 Query: 371 IVLLTDGE---------------------------------------------------N 379 I++L+DG + Sbjct: 369 ILVLSDGNDEIVKGDPNSKVPFLNYTRITENLIYGRQEEFLSGKKKVSFGNTTIETYLTD 428 Query: 380 TQDNEEGIAICNKAK--------SQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFE 430 TQ E +CN + + +I+ + F K++ +C +++ Sbjct: 429 TQPKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEFGY----ASKSKEAWKHCVEGDKNYYS 484 Query: 431 ANSTHELNKIFRDRIGN 447 AN L F+ IG Sbjct: 485 ANDKASLLNSFKQAIGE 501 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 87.0 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 64/200 (32%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + V+ ++ + K R+G +F D ++ + Sbjct: 101 DMAYQDDYIDRLTAVKHVVSDFVDRRKG-------DRVGLVYFADHAYLQTPLTFDRETV 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +T TAI D + A T + SN ++ ++LL+DG N Sbjct: 154 KTQLNQTVLKLI--GTQTAIGDGIGLATKTFVDSNA-----------PQRVMILLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 + + AK G I TI T ++ L A + Sbjct: 201 NAGVLDPVQAAEIAKKYGTTIYTIGVGAGEMQVKDFFMTRTVNTAEDLDEKTLIKIANIT 260 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A + EL I+ Sbjct: 261 GGQYFRARNADELATIYDTI 280 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 87.0 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 58/165 (35%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + S+ + + I +T TAI DA+ + Sbjct: 137 DRIGVIVFGTKAYLQAPLSFDTKTVRQLIQETQI--GFAGEKTAIGDAIGLGIKQLSELP 194 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 D KK ++L+TDG NT + N A QG+ I TI + Sbjct: 195 SD-----------KKVLILMTDGANTAGRVSPLQAANFAAEQGVTIHTIGIGADEMEVQG 243 Query: 409 -------KTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 ++ L N AS ++ A ST +L +I+ D Sbjct: 244 FFGPQTVNPSEDLDEALLENVASLTGGKYYRAKSTSDLEEIYGDI 288 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 87.0 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 86/325 (26%), Gaps = 70/325 (21%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + S + D N + ++ +D + + + + Sbjct: 384 TNPSYRPHWDDRIE--YNRGNLIRPTNTQQVWEDYQRIDEKDFQ----NNYLDDAHNFDD 437 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 G S + + + +V ++ +G + Sbjct: 438 KNGADHVKSEANTGYASGNQYKRQEWIN------KYFTDDGGNRPSVGNSNPLGMGAGPN 491 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + S S S + + G+T + + + T+ R + Sbjct: 492 SMCSSVSVSDLTD--NKNTTQAKLTSMQASGATNVQMGVAWGWRTLSPGEPFTEGRPYDA 549 Query: 365 LEAKKYIVLLTDGENTQD------------------------------------------ 382 + KK ++++TDG NT Sbjct: 550 EDNKKIMIIMTDGNNTYYPTNIYGNQYAQDNKSFYGGHGHSVKGRIFDGYDGEANPGHNS 609 Query: 383 -------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-----SFFE 430 +E C AK+ GI I +IAF V + L +CAS + +F+ Sbjct: 610 QTFTKAMDEHLTETCTNAKNAGITIYSIAFDVPNG--SSVKATLEDCASSDVGGGKLYFD 667 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 AN+ L F R+ + Sbjct: 668 ANNNAALIDTFEKIAERLADLRISK 692 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 50/378 (13%), Positives = 102/378 (26%), Gaps = 15/378 (3%) Query: 1 MVFDTK--FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 +V ++ K+ G+ F+ TAL +PVML G D + Sbjct: 11 LVQRSRAVLSNTVKRFRDDERGNVFVFTALSLPVMLMAIGAGADYAELYRARVNFQS--- 67 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 +V + A K EE N + +K + ++ + D Sbjct: 68 ------AVDAGAIAAAKNLAATGQVQTSKDIGEEVFRSNLSHLGEKAVREGQINFDMGDG 121 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + + + L S K Q + S + Sbjct: 122 DCAVQGVITTATLPHDRFFSLSFVDQSQQKGFGANKIVKGQEEFILSASSTVECGNDTIE 181 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I V+D S SM + + + T +++ + + + + + N+ Sbjct: 182 IALVLDNSGSMRWNGKIGTLRQASNSLVETLHTTMGSANKAIQFSVVPFAATVNVGTNNR 241 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + + S + +D S+ + + D++ +T G Sbjct: 242 N---EPWMDTQGRSSTHWEMIDPSTSTDFSFSGGRYLQSNGQALSRFTLYDSLPNTSWQG 298 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + + IV +FA D + N + Sbjct: 299 CVEQRPHPYHTQDDTPSISNPDTLIVPSFAPDTPDNWDNDYNKRLSNIVVGADPH-CTRF 357 Query: 359 HRMKNNLEAKKYIVLLTD 376 KN +Y +D Sbjct: 358 QGSKNGRRTIRYCNRWSD 375 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 87.0 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 52/273 (19%), Positives = 94/273 (34%), Gaps = 30/273 (10%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS----PYMVSCNKSL 240 + +L Y + ++ +P N F + ++++G + + C Sbjct: 150 NNGELLIYTKWAKNEPDNTFSDYGYYVRFNENAKDGTWKVDTFSGGETNFNNVFLCEWGD 209 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--HLVRDALASVIRSIKKIDNVNDTVRMG 298 Y + L + + V + + + + A A + SI +N +G Sbjct: 210 YSVTGNDGLKVTSKKRDIVLTLDISASMDGIPLDETKKAAAKFVDSI-----LNKNSNIG 264 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D S +K + +T I D + AY + Sbjct: 265 LVSYSDEATSLSGICSN-----DVFLKNTITSLSSAENTNIEDGLSRAYSMLQ------- 312 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAFSVNKT-QQEKA 415 KK IVL++DG T EE I K K QG+ I T+ F N + + Sbjct: 313 ----LGQSKKKLIVLMSDGLPTLGKDGEELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEG 368 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 +Y + AS +E +S+ +L F D G Sbjct: 369 QYLMEKIASEGCHYEVSSSEDLVFFFEDVAGQI 401 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 86.6 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 53/160 (33%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TAI DA+ A + Sbjct: 134 DRVGLILFGTQAFLQAPLTYDRRTVRVWLDEAKI--GIAGKNTAIGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I T+ + + Sbjct: 189 --------MRPANSRVLVLVTDGANNGGQIDPITAARLAADEGVKIYTVGIGSDPDKNAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L + A S +F A EL+KI Sbjct: 241 QGVLGLNPSLDLDEPTLKDIASLSGGQYFRARDGAELDKI 280 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 86.6 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 66/223 (29%), Gaps = 35/223 (15%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-------R 296 + G + + ++ A + + V + R Sbjct: 79 QWLGKPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGDR 138 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F R ++ + + + + TAI DA+ A + Sbjct: 139 VGLILFGTRAYLQTPLTFDIATVKKMLDDASIA--LPGPQTAIGDAIGLAVKKL------ 190 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------- 408 K K ++LLTDGEN + + AK I+I TI Sbjct: 191 -----KKFPGDSKALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 245 Query: 409 -----KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + +F A ++ +L K++ Sbjct: 246 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESI 288 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 86.6 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 43/394 (10%), Positives = 109/394 (27%), Gaps = 26/394 (6%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A++ +V L + + +Q + + + ++ E+ + Sbjct: 32 AALLLVAVSLTACGGKGAEVQ--HRQAKQQAEQRHQEASQRQAEMRDAAKVEMARVAAPM 89 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + N + R + ++ + T S + Sbjct: 90 QMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFSIDVDTGSYAT 149 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ ++ + + + LN F K+ + + + + M+ Sbjct: 150 LRRML-REGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRI 208 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L P + + +D S K L++ AL + + D V+ V Sbjct: 209 GLKGYDLPKSQLGASNLVFLLDVSGSMASTDKLPLLQTALKLLTAQLSAQDKVSIVV--- 265 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G + + GST + AY + Sbjct: 266 ------YAGAAGVVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL--AKKHFIP 317 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKA 415 + + ++L TDG+ + + K GI + T+ F + + Sbjct: 318 NGINR-------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 370 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ D + + + Sbjct: 371 EQLADK--GNGNYAYIDTLNEARKVLVDELSSTL 402 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 122 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLSDS 174 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 175 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGALNP 221 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ------------QEKARYFLSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + + L A + FF A Sbjct: 222 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRAR 281 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 282 DTEELAGIYAEL 293 >gi|254452693|ref|ZP_05066130.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267099|gb|EDY91369.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 173 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 49/156 (31%), Gaps = 6/156 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ G I + ++M ++L GGM VD++R+ L+ + A + A+ Sbjct: 8 KGIFQRFRNDEDGALIIFSLMMMIMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAADL 67 Query: 68 LIQSLEEVSSRAKNSFTFP------KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 R + + + + + D+ R Sbjct: 68 DQIMPPADVVRDYLDKAGMLHFLQGEPTVSQGINYRVVSAQASAPMALFFYDLPRIFTSP 127 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 +P +A V +S + + + + L S Sbjct: 128 FSPGMTAINVSGASTAEERVTDVEVSLVLDVSSSMN 163 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 58/191 (30%), Gaps = 34/191 (17%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + + LV+ L ++ + R+G F + ++ + + Sbjct: 110 RDEDVSRLSLVKHLLGDFLQQ-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTWL 162 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + +TAI DA+ A + + ++L+TDG N Sbjct: 163 DEARI--GIAGKNTAIGDAIGLALKRLR-----------LRPAQSRVLILITDGANNAGQ 209 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNC--ASPNSFF 429 + + A +G++I I + Q + L A+ +F Sbjct: 210 IDPLTAARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSLDLDEPALKAIAAATGGQYF 269 Query: 430 EANSTHELNKI 440 A EL I Sbjct: 270 RARDGEELQAI 280 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 50/444 (11%), Positives = 124/444 (27%), Gaps = 47/444 (10%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ ++ F++++L + + G+ G + + V Sbjct: 25 TLCRQSARNR---VFLLSSLTLAIFFGLSGCSDK-----------QSDGDK---SLVVHP 67 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + E E L + + D + +N A Sbjct: 68 LTEQAETVRLESEQSKTLAIHKSEELHQLKLVQSSERLIDASL-----AAKHVINTHYVA 122 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + Y + + F + + ++ + T S +++ +I S Sbjct: 123 APIASDAWYGIKQPERNRFEKQI-QNGIMVAGEIPISTFSIDVDTGSYSTLRRMI-KEGS 180 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + + LN F K+ + + + + M+ L Sbjct: 181 LPEKGTIRIEEMLNYFTYDYPLPNKNAAPFSATTELAPSPYNDDMMLLRIGLKGYELTKS 240 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S K L++ AL + + + D V+ V + Sbjct: 241 ELGASNLVFLLDVSGSMASADKLPLLQTALKMLTQQLSAQDKVSIVV---------YAGA 291 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + GST + + AY + + Sbjct: 292 AGVVLDGASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL--AQKHFIQGGINR----- 344 Query: 369 KYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 ++L TDG+ + + K +GI + T+ F ++ + Sbjct: 345 --VILATDGDFNVGVTNFDQLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADK--GN 400 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 + ++ +E K+ D + + + Sbjct: 401 GHYAYIDTLNEARKVLVDELSSTL 424 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 44/394 (11%), Positives = 109/394 (27%), Gaps = 26/394 (6%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A++ +V L + + +Q + + + ++ E+ + Sbjct: 32 AALLLVAVSLTACGGKGAEVE--HRQAEQQAEQRHQEASQRQAEMRDAAKVEMARVAAPM 89 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + N + R ++ ++ + T S + Sbjct: 90 QMSSNGAVMGMSIAPMPRDYAVIPLAQNKFEQQVQNGIMVAGEIPVSTFSIDVDTGSYAT 149 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ ++ + + + LN F K+ + + + + M+ Sbjct: 150 LRRML-REGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRI 208 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L P + + +D S K L++ AL + + D V+ V Sbjct: 209 GLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLLTAQLSAQDKVSIVV--- 265 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G + + GST + AY + Sbjct: 266 ------YAGAAGVVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL----AKKHF 315 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKA 415 N ++L TDG+ + + K GI + T+ F + + Sbjct: 316 IPNGINR-----VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 370 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ D + + + Sbjct: 371 EQLADK--GNGNYAYIDTLNEARKVLVDELSSTL 402 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 40/121 (33%), Positives = 57/121 (47%), Gaps = 13/121 (10%) Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-------QDNEEGIAICNKAKSQG 397 TA + + ED H++K KKYIV +TDG+N + C+ AKS+G Sbjct: 4 TAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKG 63 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRI 456 I I TIAF Q L CAS + +F+A +L F + IG + +V R+ Sbjct: 64 IEIYTIAFMAPAGGQA----LLHYCASDDSHYFQAEKMEDLLAAF-EAIGAKSAAQVTRL 118 Query: 457 T 457 T Sbjct: 119 T 119 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 86.2 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 54/165 (32%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F ++ + + + + TAI DA+ A + Sbjct: 137 DRVGLILFGTWAYLQTPLTFDIPTVKKMLDDASIA--LPGPQTAIGDAIGLAVKKL---- 190 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 K K +VLLTDGEN + + AK I+I TI Sbjct: 191 -------KRYPGDSKALVLLTDGENNSGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKT 243 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + FF A ++ +L +++ Sbjct: 244 TFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQNSADLKQVYESI 288 >gi|238793630|ref|ZP_04637253.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] gi|238727045|gb|EEQ18576.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] Length = 480 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 48/440 (10%), Positives = 115/440 (26%), Gaps = 96/440 (21%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 LIK+ G + +P+++G+ ++ + + L A + A + +V Sbjct: 12 RLLVSLIKNEQGAILFPFIIFLPLIIGLIFFSFELAHFLQKKTKLSDAMEQATLALTVEN 71 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 S + KN+ + YL + N + N + Sbjct: 72 NNSTPSAAQITKNAE-IVSSYAQAYLPAETFSTPTIN--------------IIYNNGRIE 116 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 Y ++ Y ++ + + + T A + + +V D+S S Sbjct: 117 YGAEINMSYSAKFLSN---IQVTNLSTIINATDRGAARKNIISAPIEKTDVVFVADYSNS 173 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS--------- 239 M +Y E +P + + + I P+ Sbjct: 174 MDEYFYHDENEP-KKIVALREIFNRLNDNVLKNKNIHTIGFIPFSWGTKNRVENGTRIIE 232 Query: 240 ---LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK------------HLVRDALASVIRS 284 L ++ + ++ + +++ + + I Sbjct: 233 YCHLPFVPKKHSPNGDYLRKYILSGLKQFPGLERLEHIDHIEYAKINDEIYSNTKNQIDE 292 Query: 285 IKKIDNVNDT-----------------VRMGATFFNDRVISDPSFSWGV----------- 316 + D + + ++ + S S G Sbjct: 293 LNIEDAESAYTFLFRSRHIIQPLTQFDIIEENIDYDATINSILHNSAGTPPKTINIPIND 352 Query: 317 -------------HKLIRTIVKTFAIDEN---EMGSTAINDAMQTAYDTIISSNEDEVHR 360 + L D G T I+ + +A + Sbjct: 353 ILNTYNCLYQRNSYSLNENHSNDIINDMIEMVPAGGTLISSGILSANNLF---------N 403 Query: 361 MKNNLEAKKYIVLLTDGENT 380 + + KK +++L+DG ++ Sbjct: 404 ENRSNDNKKLMIILSDGNDS 423 >gi|56477526|ref|YP_159115.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] gi|56313569|emb|CAI08214.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] Length = 441 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 38/353 (10%), Positives = 89/353 (25%), Gaps = 31/353 (8%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 S G IITAL + V++G G+ +D + L+ A + AS L Sbjct: 8 RDSQRGVVAIITALSLVVLVGFAGLALDGGHLYLTKTELQNGADACALAASYEL-TGSPI 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-AVEMNPRKSAYQVVL 133 + K E + + D + + ++ V Sbjct: 67 SPENFTRAENAGKTVGTENRVDFQGGAIAAADIDVTFSTSLAGSWLPAGGATGNSKYVRC 126 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + + F++ MG + A A + + + S + Sbjct: 127 TITRNGIAP---WFMQVMGFGDQTVSAIATATLAPSQNNCAIPMGL---CTHPSSSAPHF 180 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN----------KSLYYM 243 +G + + + + + + D + S + Sbjct: 181 GYVKGDWYSMNFKESGGGTMENLTGDFRWVDFDPSTTTPNCSGKGAQELSCLFEGAGQCN 240 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI----------RSIKKIDNVND 293 L P + + + +K + A + +K Sbjct: 241 LPPNGPSTCPTSGNSTPTPGCVGDAGQKTSIGQAFNTRFGICQGSACTDGELKSAPPDFT 300 Query: 294 TVRMGATFFN---DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + T ++ D + + ++ + + + A M Sbjct: 301 GLAYNTTTWSLGRDAYKGSSGGTPNFVAARSSHLQVQSSELPSGTNQASTAQM 353 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 62/199 (31%), Gaps = 28/199 (14%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 ++ +DSS L R+ I + D F+ + Sbjct: 25 DGINIAFVIDSSGSMFYNDPNGLRREVTHKFIDRLTDND------MAAVIGFDYKATVLE 78 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 F+ + + G T I A+ AYD + K+ Sbjct: 79 QFTSN-----KEKLHDAVDKIRSDGGTNIGRAVSIAYDL------FNNLDNNRKEKYPKF 127 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++LLTDG+ E I AK GI+I TI + L + A + + Sbjct: 128 LILLTDGDGDYSEEYTI----LAKKAGIKIYTIGLGNGVS-----EKLLKDIAKGTDGEY 178 Query: 429 FEANSTHELNKIFRDRIGN 447 F A +LNKIF Sbjct: 179 FHAKDASKLNKIFEKIADK 197 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 235 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLSDS 287 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 288 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNP 334 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ------------QEKARYFLSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + + L A + FF A Sbjct: 335 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRAR 394 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 395 DTEELAGIYAEL 406 >gi|218680121|ref|ZP_03528018.1| hypothetical protein RetlC8_15005 [Rhizobium etli CIAT 894] Length = 168 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 37/125 (29%), Gaps = 1/125 (0%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + L + TG+ II AL + ML G D +R ++ A+I A Sbjct: 11 RAFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAV 70 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + ++E + N T N + Sbjct: 71 KQI-NNTGDTDALKLKVTDWFHAQVENSYTLGEIDIDTTNHNITATASGTVPTTFMKNCQ 129 Query: 126 KSAYQ 130 Sbjct: 130 YRYRS 134 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 54/167 (32%), Gaps = 12/167 (7%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ + ++ + RT+ + ST I D + A + + Sbjct: 142 DRIGLVEFADQAYVAAAPTFDTATVARTLEEATI--GLVGRSTGIGDGLGLALKRLAPAQ 199 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQ 412 K +VLL+DG N + AK G+R+ TIA Sbjct: 200 VAAADGAGPPPSRDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNP 259 Query: 413 EKARYF-----LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L A S F +T +L + + I R Sbjct: 260 NNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENV-ANAIDELEGGR 305 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 66/193 (34%), Gaps = 34/193 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +V+ + + L+ + R+G F R + + + + + Sbjct: 121 NVVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLAD 173 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI DA+ + + + ++ +VLLTDG NT Sbjct: 174 SVV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLN 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQ------------QEKARYFLSNCA--SPNSFFEA 431 + AK++G+R+ TIAF + + L A + FF A Sbjct: 221 PLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRA 280 Query: 432 NSTHELNKIFRDR 444 T EL I+ + Sbjct: 281 RDTEELAGIYAEL 293 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 54/165 (32%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F ++ + + + + TAI DA+ A + Sbjct: 143 DRVGLILFGTWAYLQTPLTFDIPTVKKMLDDASIA--LPGPQTAIGDAIGLAVKKL---- 196 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 K K +VLLTDGEN + + AK I+I TI Sbjct: 197 -------KRYPGDSKALVLLTDGENNSGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKT 249 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + FF A ++ +L +++ Sbjct: 250 TFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQNSTDLKQVYESI 294 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 85.8 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 54/165 (32%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F ++ + + + + TAI DA+ A + Sbjct: 137 DRVGLILFGTWAYLQTPLTFDIPTVKKMLDDASIA--LPGPQTAIGDAIGLAVKKL---- 190 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 K K +VLLTDGEN + + AK I+I TI Sbjct: 191 -------KRYPGDSKALVLLTDGENNSGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKT 243 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T ++ L A + FF A ++ +L +++ Sbjct: 244 TFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQNSTDLKQVYESI 288 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 64/197 (32%), Gaps = 32/197 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L + I R+G F D + ++ + Sbjct: 103 DMAYQGRYVDRLSMVKAVLKNFIAQ-------RQGDRLGLILFGDTAFLQTPLTRDLNTV 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + +TAI DA+ A + + +VLLTDGEN Sbjct: 156 SKMLEEAQI--GLVGRATAIGDALGLAVKRFSQKQDSN-----------RILVLLTDGEN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK----------TQQEKARYFLSNCA--SPNS 427 T N A+ +GI++ T+ L A + Sbjct: 203 TAGNLAPEEALLLAREEGIKVYTVGVGSQGGNRFNLFSMSGSSSLDESLLQKIATETGGL 262 Query: 428 FFEANSTHELNKIFRDR 444 +F A L +I+++ Sbjct: 263 YFRATDVASLQQIYQEL 279 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 64/192 (33%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + ++V+ A +R ++G F R ++ H ++ + Sbjct: 114 TSRLNIVKSAAEQFVRERSG-------DKIGLILFGTRAYLQTPLTYDRHSIL--LRLED 164 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A +T+I DA+ A + S+ + + I+LLTDG N + Sbjct: 165 ATAGLAGKTTSIGDAVGLAVKRLDSA-----------PKKGRVIILLTDGANNSGVLAPL 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQ-------------QEKARYFLSNCA--SPNSFFEAN 432 AK +GI+I TI + L + + +F A Sbjct: 214 KAAELAKEEGIKIYTIGLGSEGDSRALVGDFLMQSPAADLDEETLKKMSDMTGGRYFRAT 273 Query: 433 STHELNKIFRDR 444 T L+ I++ Sbjct: 274 DTESLHLIYKTI 285 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 61/178 (34%), Gaps = 23/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V++ I R+G F+DR ++ + + + + Sbjct: 116 TRLSIVKETADDFISRRDG-------DRLGLVLFSDRAYLQAPLTFDREAVRKLLDQAQV 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + ED + +VLLTDG N + Sbjct: 169 --GLTGQKTAIGDAIAVSVKRLKDRPED-----------GRVLVLLTDGANNEGVMSPDK 215 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 + A GIRI TI ++ L A+ ++F A L +I+R Sbjct: 216 AADLAAKLGIRIYTIGVGS-ARSRDLDERTLRQIADATGGAYFRATDVQGLAQIYRAI 272 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 44/394 (11%), Positives = 111/394 (28%), Gaps = 26/394 (6%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A++ +V L + + +Q + + + ++ E+ + Sbjct: 47 AALLLVAVSLTACGGKGAEVE--HRQAEQQAEQRHQEASQRQAEMRDAAKVEMARVAAPM 104 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + N + R + ++ + T S + Sbjct: 105 QMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFSIDVDTGSYAT 164 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ ++ + + + LN F K+ + + + + M+ Sbjct: 165 LRRML-REGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRI 223 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L + + +D S K L++ AL + + D V+ V Sbjct: 224 GLKGYDLLKSQLGASNLVFLLDVSGSMASTDKLPLLQTALKLLTAQLSAQDKVSIVV--- 280 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G + + GST + AY + Sbjct: 281 ------YAGAAGVVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL--AKKHFIP 332 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + + ++L TDG+ + A+ K K+ GI + T+ F + + Sbjct: 333 NGINR-------VILATDGDFNVGVTDFDDLTALIEKEKAHGIGLTTLGFGLGNYNDQLM 385 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ D + + + Sbjct: 386 EQLADK--GNGNYAYIDTLNEARKVLVDELSSTL 417 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 61/195 (31%), Gaps = 35/195 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + LV+ + ++ + R+G F + ++ + +++ Sbjct: 113 DISRLDLVKALMGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLIEA 165 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + + +VL+TDG N Sbjct: 166 QI--GIAGKNTAIGDAIGLAVKRLR-----------ERPAQSRVLVLITDGANNGGQIHP 212 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCA--SPNSFFEAN 432 + A +G+RI TI N + L A + ++F A+ Sbjct: 213 LTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAH 272 Query: 433 STHELNKIFRDRIGN 447 EL D + Sbjct: 273 DGAEL-DAIGDTLDQ 286 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 85.4 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 61/195 (31%), Gaps = 35/195 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + LV+ + ++ + R+G F + ++ + +++ Sbjct: 113 DISRLDLVKALMGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLIEA 165 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + + +VL+TDG N Sbjct: 166 QI--GIAGKNTAIGDAIGLAVKRLR-----------ERPAQSRVLVLITDGANNGGQIHP 212 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCA--SPNSFFEAN 432 + A +G+RI TI N + L A + ++F A+ Sbjct: 213 LTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAH 272 Query: 433 STHELNKIFRDRIGN 447 EL D + Sbjct: 273 DGAEL-DAIGDTLDQ 286 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 55/410 (13%), Positives = 106/410 (25%), Gaps = 60/410 (14%) Query: 36 VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI 95 + VDV L+ A + L R Sbjct: 30 MTMFTVDVAYMQLVRTELRAATDASAKAGMEAL---------RRTQDTEAAIDAAIATAA 80 Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 N T ++ + V+ + +A Q+ ++ S ++ + Sbjct: 81 ANKVGGRSLTLTADQIEFGLAFRNVDNSVSFNAGQLPYTAVRVNSAMTESSAAGAVPLFF 140 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 I + E + V I + ID S SM Sbjct: 141 GSIFGTGQFEPTRSAVSASTEVEICFAIDRSHSM-------------------------- 174 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV-IKKKHLV 274 D + P P+LS + + V I Sbjct: 175 --------CFDLTGVDWSYPPGTPRNPDPVAFPPHPTLSRWASLSRAMQTFVSITASQEP 226 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + +A V + K ++ + V D + + L + I Sbjct: 227 KPRVAMVTWASKITQ--SNYEGKLTKTNSPEVFVDVPLTTNLADLNQAIKGRSEKVML-- 282 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T + + A + ++ A + I+L+TDG Q +A + A Sbjct: 283 GATNMAAGIDEARKILNATKS-------TRPYAHRIIILMTDGLWNQGRNPLLAAQDAA- 334 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++GI I +++ + + A ++ L F D Sbjct: 335 NEGIVIHSVSLLPRSGDITPQVSS----TTGGVNYPATNSAALEAAFADI 380 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 60/195 (30%), Gaps = 35/195 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + LV+ + ++ + R+G F + ++ + + + Sbjct: 113 DISRLDLVKALMGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEA 165 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + + +VL+TDG N Sbjct: 166 KI--GIAGKNTAIGDAIGLAVKRLR-----------QRPAQSRVLVLITDGANNGGQIHP 212 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCA--SPNSFFEAN 432 + A +G+RI TI N + L A + ++F A+ Sbjct: 213 LTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAH 272 Query: 433 STHELNKIFRDRIGN 447 EL D + Sbjct: 273 DGAEL-DAIGDTLDQ 286 >gi|229596191|ref|XP_001012539.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565545|gb|EAR92294.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 703 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 76/242 (31%), Gaps = 32/242 (13%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF-----VDSSSLRHVIKKKHLVRDALASV 281 ++ P MVS +P + +D+S + K V++ + + Sbjct: 180 DQTIPVMVSVKTLDQTNDMEVKSNPLEGRPNLDLICVIDNSGSMNDFSKIENVKNTILQL 239 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + + D R+ FN + V+ + ++T G T I Sbjct: 240 LEMLNEND------RLSLITFNTKAKQLCGLK-NVNNQNKKSLQTITKSIKADGGTDIIR 292 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----EGIAICNKAKSQG 397 ++ A+ + S + I LL+DG++ + + + + Sbjct: 293 GIEIAFQILQSRKQKNSVSS---------IFLLSDGQDNLADAGIKNLLKTTYKQLQEES 343 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + F + + A SF+ ++++ F D +G Sbjct: 344 FTIHSFGFGNDHDGP-----LMQKIAQIKDGSFYFVEKNDQVDEFFIDALGGLFSVVAQD 398 Query: 456 IT 457 +T Sbjct: 399 LT 400 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 53/377 (14%), Positives = 103/377 (27%), Gaps = 21/377 (5%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + + + L K T R + +V N ++ L + Sbjct: 96 KEKAEAQAQYSAAVAKGKSAGLVK-ATGRNMEQFQVSVSVAPNAKRRLGVYELLLKVRPQ 154 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L + + I T + + + Q + S+ Q Sbjct: 155 QLVKHLQMDIHIFEPQGISFLETESTFMTNQLVDALTTWQNK--TKAHIRFKPTLSQQQK 212 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + + ++Y G + +D Sbjct: 213 SPEQQETVLDGNLIIRYDVDRAISGGSIQIENGY----FVHYFAPEGLTTMPKNVVFVID 268 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S +K R+AL ++ + D + F+ Sbjct: 269 KSGSMSG-RKIQQTREALIKILDDLSPRD------QFNLIVFSTEATQWRPSLVPASAEN 321 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++FA +G T INDAM A + SSN++E + I+LLTDG+ T Sbjct: 322 VNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVS----LIILLTDGDPT 377 Query: 381 QDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 +I N + + + F + + + L N E + + Sbjct: 378 VGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQ 437 Query: 438 NKIFRDRIGNEIFERVI 454 + F + N + V Sbjct: 438 LQDFYQEVANPLLTAVT 454 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 61/453 (13%), Positives = 128/453 (28%), Gaps = 63/453 (13%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 L + G + L+ L + + + + + Sbjct: 2 VKSMFSSLARQQKGSITLSFLALLLPALFIAVGTLMISAQVMVSNR------------AA 49 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 S + A + + ++Y + ++ R + + + P Sbjct: 50 QAADSAALACAFADEATPRMMRAYQDYYHPTLKGVSGLTPENQGCRISLGYSVTPLLPHF 109 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + + V S H + V+D S Sbjct: 110 NYERYDQNVTATG----------------------GGFKGVVESKHSAIPTELVLVLDVS 147 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM Q L A T++S S+ + + P+ + + + Sbjct: 148 GSMSP-----NIQSLKSILSNALNTIQSQSNNANDLDSVSISIVPF--DSGVATHRPPWL 200 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + + +D S R+ L D LA++ V T ++ Sbjct: 201 SEETAGI---YCIDGLSYRNGDFSASLTVDNLATLHSE----RPVKFTPPSKWLSDCNQE 253 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH-RMKNNL 365 + ++ + G T + +I S + ++ + Sbjct: 254 SPMLPLTNVFSRV-----QNSINSLTANGGTRSYQGLVWGVRQLIPSWQQAWGMKVSSVP 308 Query: 366 EAKKYIVLLTDGENTQD---NEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 E ++ +VL TDG + D C K GI + I + V+ ++ + N Sbjct: 309 ETRRKLVLFTDGADEGDAFNQLVNAGFCTTAIKQYGIEMNFIGYGVSPSRITQ----FEN 364 Query: 422 CASPN-SFFEANSTHELNKIFRDRIGNEIFERV 453 CA F A +T +LN+ F D + E + Sbjct: 365 CAGNPLRVFSATNTTQLNEYFSDILAVEYSASI 397 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 58/187 (31%), Gaps = 34/187 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + LV+ + ++ R+G F + ++ + + + Sbjct: 114 ISRLDLVKALMGDFLQD-------RQGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQ 166 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +TAI DA+ A + + ++L+TDG N + Sbjct: 167 I--GIAGKNTAIGDAIGLAVKRLR-----------QRPAQSRVLILITDGANNGGQIHPL 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCAS--PNSFFEANS 433 A +G+RI TI N + L A ++F A+ Sbjct: 214 TAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIAGITHGTYFRAHD 273 Query: 434 THELNKI 440 ELN I Sbjct: 274 GAELNAI 280 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 77/208 (37%), Gaps = 26/208 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V++ + I + ++ + +G F ++ + ++ + +T Sbjct: 98 VDRLTAVKNVASKFITGGEDLE-GRFSDLVGLITFAAYADAETPPTLDHSFVVSRLNQTE 156 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + TAI DA+ + + + + + + ++++ K ++LLTDGENT + I Sbjct: 157 IVSRRDEDGTAIGDAIALSVEKLNALDARQERKVQS-----KILILLTDGENTAGELDPI 211 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPNS 427 A++ GI+I I L A + Sbjct: 212 QAAELAETLGIKIYAIGVGTKGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGK 271 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIR 455 +F A T L+ I+R+ E E R Sbjct: 272 YFRATDTDSLDAIYREIDQLEKTEVETR 299 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 54/167 (32%), Gaps = 12/167 (7%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ + ++ + RT+ + ST I D + A + + Sbjct: 142 DRIGLVEFADQAYVAAAPTFDTATVARTLEEATI--GLVGRSTGIGDGLGLALKRLAPAQ 199 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQ 412 K +VLL+DG N + AK G+R+ TIA Sbjct: 200 VAAADGEGPPPARDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNP 259 Query: 413 EKARYF-----LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L A S F +T +L + + I R Sbjct: 260 NNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENV-ANAIDELEGGR 305 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 45/269 (16%), Positives = 87/269 (32%), Gaps = 50/269 (18%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 + PV+LG GM V V + L++AA +A + + L + S + F Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTSEAEAYARNFVAG 60 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL 148 ++ YL + + + + ++YQV +S YDL +NPL + Sbjct: 61 QMANYLQSGVDI----------KGGTSVNVQTSTSGKSTSYQVTVSPSYDLSVNPL---M 107 Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA 208 +++G K+ + TV + +S+ +D S SM + Sbjct: 108 QAVGFKTQHLS--TSGTTVGGHSQTQGSISMFLALDKSGSMGESTATVNE---------- 155 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 D+ + CN + + + S + Sbjct: 156 -----------------DDPTETFTYDCN--------LHYNSKNNKWVYDKCTGSRTNYY 190 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRM 297 K ++ A ++ + D VR Sbjct: 191 TKIEALKIAAGNLFSQLNSADPNAQYVRT 219 >gi|312883317|ref|ZP_07743043.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309368933|gb|EFP96459.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 432 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 59/466 (12%), Positives = 134/466 (28%), Gaps = 74/466 (15%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 +K+ G I L++PV++ V + + + + + +A++ + S Sbjct: 9 LKNSCGAAAFIFILILPVLICVMALSLQASQILLAQSKITEASEVTSLALSAL------S 62 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + ++ + ++ YL D+ ++ + + +V Sbjct: 63 EERAQQKLSSYATRVLKHYL--------------VGTDDVKGQATMQSSTFQFQTDLVGE 108 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD--Y 192 + ++ K + SR + + + + ++ D S SM Sbjct: 109 ATHEFWFKHKP----------QADTFKVSGASTSRKHKPQ-PMDVYFITDLSESMNRSEP 157 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 R + + + + + + P+ Sbjct: 158 SRLTIVKDAIRQVVSKLPKGSRAAFIGYNTENVKLTGRYFDKRTGREITSKKPTELQGPN 217 Query: 253 -LSEEHFVDSSSLRHVIKKKH-----LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 +E+ D + RH + S I + + D + F Sbjct: 218 IWAEKSIYDYLTGRHPDFVIENLFDIALSKKKKSSIDKLLEGYGPPDLDQKIRDFETKYP 277 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 D S + + TI G+T ++ II++ + + + Sbjct: 278 FYDISLTTDLGSFKETIKSNAIN---ANGNTHS-------WNGIIAAAREAHRQPSSVFN 327 Query: 367 AKKYIVLLTDGENTQDNEEG--IAICNKAKSQ-------------------GIRIMTIAF 405 ++ VLLTDGE+++ +G +C K ++ + + I Sbjct: 328 PQQVFVLLTDGEDSKKFPKGYYAPLCEKIRNDISDKQNRSQIQNASVEEKTKVTMSVIGV 387 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 N + E C + FEA EL K E+ Sbjct: 388 EFNPYKNEGVTE----CFGRENIFEAKREDELVKKILQLFEYEMGS 429 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 122 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 174 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 175 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLDP 221 Query: 387 IAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SPNSFFEAN 432 + AK++G+RI TIAF + L A + FF A Sbjct: 222 LKAAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRAR 281 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 282 DTEELAGIYAEL 293 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 85.0 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 41/391 (10%), Positives = 105/391 (26%), Gaps = 27/391 (6%) Query: 65 SVPLIQSLEEVSSRAKNS--FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV-E 121 + + E + + T Q I + + ++ + R + Sbjct: 53 AEKDSAAATEHQQQEEVKEHRTLADQAISDDKSADMSISVPSGIYGAGKQASERMIKMKS 112 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 ++ Q+ ++ + T S +++ Sbjct: 113 LSRHHVMGQMSAPGVSAFSQASNRNNFARRTANGIMVVGEIPVSTFSIDTDTGSYTTLRR 172 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I+ + + + +N F + + + + + L Sbjct: 173 WINQ-GRLPEKGTVRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNEHKMLLRIGLK 231 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + K L++ +L + + + + D+V+ V Sbjct: 232 GYEVDKSQLGASNLVFLLDVSGSMNSRDKLPLLKTSLKMLSQQLSEQDHVSIVV------ 285 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + GV + GST +Q AY + + Sbjct: 286 ---YAGASGVVLDGVKGNDTQAINQALNSLKAGGSTNGGAGIQQAYRL--AQKHFIQGGV 340 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYF 418 ++L TDG+ + A+ + + QGI + T+ F Sbjct: 341 NR-------VILATDGDFNVGTTDHQALMDLIAAKRDQGIALTTLGFGQGNYNDHLMEQL 393 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + ++ +E K+ D + + + Sbjct: 394 ADK--GNGHYAYIDTLNEARKVLVDELSSTL 422 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 85.0 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 97 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 149 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 150 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNP 196 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ------------QEKARYFLSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + + L A + FF A Sbjct: 197 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRAR 256 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 257 DTEELAGIYAEL 268 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 85.0 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 122 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 174 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 175 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNP 221 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ------------QEKARYFLSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + + L A + FF A Sbjct: 222 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRAR 281 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 282 DTEELAGIYAEL 293 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 55/188 (29%), Gaps = 35/188 (18%) Query: 279 ASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 V I +V R+G F D + + + + + Sbjct: 106 GKVTDRFTLIQHVISQFIERRKGDRIGLILFADHAYLQSPLTQDRRTVAQYLNEAEI--G 163 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI +A+ + ++LLTDG N + + Sbjct: 164 LVGRQTAIGEAIALGVKRFDQVENSN-----------RVLILLTDGSNNAGSISPEQATD 212 Query: 392 KAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHE 436 A +GI I T+ + L A + S+F A +T E Sbjct: 213 IAAKRGITIYTVGVGAEVMERRTLFGKERVNPSMDLDETQLKQIAEKTGGSYFRARNTEE 272 Query: 437 LNKIFRDR 444 L +I+++ Sbjct: 273 LERIYQEI 280 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 34/386 (8%), Positives = 92/386 (23%), Gaps = 10/386 (2%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 TK Y +K G+ +I A + ++ G +D L+ A I A Sbjct: 2 TKVSRYLRKFSADIRGNVTMIVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIAA 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ + + + A F + ++N N T + Sbjct: 62 TIDPTATPTQTTREAVAKKAFCGNIKQSGGLQNSFCNTTTLDTLGTASATLSTATSNN-- 119 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 +++ Y + + L +GI + I A++ + + + + Sbjct: 120 -------IMTVTYSATAHVPTYLLGLVGIDTVDIDAVAKSGVSTSTAEVAFVLDNTGSMS 172 Query: 185 FSRSMLDYQRDSEGQPLNC-FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + M + + + + + + ++ + Sbjct: 173 SNNKMTYLKSSLDAVLASMLDSTGKNYAKTKVALVPFDTQVSLSNVAGMVDYAGDFSTVT 232 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 S ++ + ++ + + + + Sbjct: 233 PTYSCSGYSSAQCQVISENASAMCNGNATCLSNNRNYTRSWTSNGNTYFGVFATSYYQSS 292 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + S + + + R +V + ST T N + Sbjct: 293 NTYRSYGNTYYYTYIAWRQVVYRVNSSTLTLNSTNNGGDYYTYQAYYNQPNNYSRYYGAV 352 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI 389 + + I Sbjct: 353 TYSTPTAGGYNSTSTTIIKDNTTITA 378 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 48/385 (12%), Positives = 102/385 (26%), Gaps = 31/385 (8%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + L T + ++ + S +S +M + Sbjct: 200 AKTKVALVPFDTQVSLSNVAGMVDYAGDFSTVTPTYSCSGYSSAQCQVISENASAMCNGN 259 Query: 156 WL-IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 + S + S + ++ + Q R S Sbjct: 260 ATCLSNNRNYTRSWTSNGNTYFGVFATSYYQSSNTYRSYGNTYYYTYIAWRQVVYRVNSS 319 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + N D + + + + S+++ Sbjct: 320 TLTLNSTNNGGDYYTYQAYYNQPNNYSRYYGAVTYSTPTAGGYNSTSTTIIKDNTTITAN 379 Query: 275 R--------DALASVIRSIKKIDNVNDTVRMGAT---FFNDRVISDPSFSWGVHKLIRTI 323 + VI + D +D G + + ++ Sbjct: 380 SDLLGVGTSNWTGCVIDRTQSYDVTSDAPVAGTPATLYPAAKCATNTLLPVMALTQDIAA 439 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD- 382 +T+A G+T + +Q + + + + KY+++LTDG NTQ+ Sbjct: 440 ARTYAARMAPAGNTNVTIGVQWGMEVLSPTAPFSEGGAFTDKAVLKYMIVLTDGINTQNR 499 Query: 383 --------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANS 433 N C AK+ GI + T+ L NCAS ++ ++ Sbjct: 500 WTTNNSQINARLALACTNAKNLGITVFTVRVEQG------DSTTLQNCASQTAYYYNLSN 553 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 +L + R +R+T+ Sbjct: 554 ADQLPATMSKIMK---SIRKVRLTQ 575 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 76/205 (37%), Gaps = 26/205 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + V++ + I + ++ + +G F ++ + + Sbjct: 102 DFNIDGEPVDRLTAVKNVASKFITGGEDLE-GRFSDLVGLITFAAYADAETPPTLDHSFV 160 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +T + + TAI DA+ + + + + + + ++++ K ++LLTDGEN Sbjct: 161 VSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQERKVQS-----KILILLTDGEN 215 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSN 421 T + + A++ GI+I I L Sbjct: 216 TAGELDPVQAAELAETLGIKIYAIGVGTTGKAPVPVRDPFTGRQRLHYMEVNIDEATLQK 275 Query: 422 CA--SPNSFFEANSTHELNKIFRDR 444 A + +F A T L+ I+R+ Sbjct: 276 VAEITGGKYFRATDTDSLDAIYREI 300 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 59/201 (29%), Gaps = 39/201 (19%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + ++ S I D D R+G F + + L Sbjct: 102 ARDFKPDRITAAKEVAGSFI-----ADRYGD--RIGLVAFAGEAFTQSPLTTDQSTL--Q 152 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + TAI + + TA + + S K I+LLTDG N Q Sbjct: 153 TLLARIRSGLIEDGTAIGNGLATAINRLRES-----------DAKSKVIILLTDGVNNQG 201 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCA-- 423 + AK+QGIR+ TI E L + Sbjct: 202 QIAPMTAAEIAKAQGIRVYTIGVGTEGMAPYPAIDMFGNLTFVNQKVEIDEKVLKAISDM 261 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + +F A +L ++ + Sbjct: 262 TGGRYFRATDKEKLKAVYDEI 282 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 62/194 (31%), Gaps = 39/194 (20%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ + I + R+G F S+ + R +++ Sbjct: 115 VSRIRAVKQVGSRFIEQ-------REGDRLGLILFGSNAYVQSPLSFDTATVKRFLLEAQ 167 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + K + ++LL+DG++T + + + Sbjct: 168 I--GFAGQDTAIGDAIGLAVKRL-----------KERPAENRVLILLSDGKDTASSVQPL 214 Query: 388 AICNKAKSQGIRIMTIAFSVNK-----------------TQQEKARYFLSNCA--SPNSF 428 A GIRI TI + E L A + + Sbjct: 215 NAAKLAADLGIRIYTIGIGADSLTMPGLFGSSFGARQVNPSAELDEAGLQQIAKITDGKY 274 Query: 429 FEANSTHELNKIFR 442 F A + EL I++ Sbjct: 275 FRARNPEELANIYQ 288 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 37/277 (13%), Positives = 83/277 (29%), Gaps = 9/277 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +S + + G I+ AL++PV+LG + VD+ R + L+ AA A + + Sbjct: 4 THHSSRRLHRQRGGTAILFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGAHS 63 Query: 68 LIQSLEEV---SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 L + + S+ + + + + + +R P Sbjct: 64 LSDAGGQPYNWSAAVNAAQNVVQSNVANGAHIQDATIETGYWNLQNPSLGLRPAGTGSVP 123 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + + N + L I + + G + + Sbjct: 124 AAGDVPA-VRTTVAISANQNNGPLPLFFAPILGIAESSIQASAIAVIAPPSGGTGMFPFV 182 Query: 185 FSRSMLDY--QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + MLD+ RD+ L P + Y ++ + G+ + + + Sbjct: 183 IAAPMLDHYWDRDTNSPVLENGVAPTIKLGSIYHFEDSEDGVLSGEWTTFQTEDGNPSGR 242 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 L+ D + + D++ A Sbjct: 243 FLWDLLEDLTTNG---NDTALSIGDNTYIQPGTKAHG 276 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 61/196 (31%), Gaps = 35/196 (17%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + + V+ + + + + R+G F ++ ++ + + Sbjct: 110 SGQTVNRLMAVKAVVGNFVTE-------REGDRLGLILFGEKAYLQTPLTFDRKTMQTLL 162 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + TAI DA+ + + E + ++LLTDG N Sbjct: 163 YEAQL--GFAGNGTAIGDAIGLSVKRLQ-----------QRPENHRVVILLTDGANNAGE 209 Query: 384 EEGIAICNKAKSQGIRIMTIAFSV-------------NKTQQEKARYFLSNC--ASPNSF 428 + + A S ++I TI + L+ A+ + Sbjct: 210 LDPLKAAELASSAKVKIYTIGVGAETQEAWGLFGKRVTNPSADLDEQTLTAIAEATGGQY 269 Query: 429 FEANSTHELNKIFRDR 444 F A + EL I+++ Sbjct: 270 FRARNPEELMAIYQEL 285 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 54/167 (32%), Gaps = 12/167 (7%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ + ++ + RT+ + ST I D + A + + Sbjct: 142 DRIGLVEFADQAYVAAAPTFDTATVARTLEEATI--GLVGRSTGIGDGLGLALKRLAPAQ 199 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQ 412 K +VLL+DG N + AK G+R+ TIA Sbjct: 200 VAAADGEGPPPARDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNP 259 Query: 413 EKARYF-----LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L A S F +T +L + + I R Sbjct: 260 NNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENV-ANAIDELEGGR 305 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 66/193 (34%), Gaps = 34/193 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 V+ + + L+ + + R+G F R + + + + + Sbjct: 121 SVVDRLTAAKAVLSDFLDR-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLAD 173 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI DA+ + + + ++ +VLLTDG NT Sbjct: 174 SVV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLN 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQ------------QEKARYFLSNCA--SPNSFFEA 431 + AK++G+R+ TIAF + + L A + FF A Sbjct: 221 PLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRA 280 Query: 432 NSTHELNKIFRDR 444 T EL I+ + Sbjct: 281 RDTEELAGIYAEL 293 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 84.7 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 56/210 (26%), Gaps = 36/210 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + I R+G T F+ S + L Sbjct: 116 SMKIQDFTPNRLEAAKQVANDFID-------GRFQDRIGLTIFSGEAYSLSPLTTDYKML 168 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I G TAI A+ + + S K ++LL+DG+N Sbjct: 169 KNQITDIDFKMMEASG-TAIGSALAVGTNRMRES-----------DSKSKVLILLSDGDN 216 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT--------------QQEKARYFLSNCA-- 423 N + A + GI+I TIA + L N A Sbjct: 217 NAGNIDPETSAKLANAYGIKIYTIAIGKEGKVPYGKDFFGRTRYIENSMDVTGLKNIAKI 276 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 F+ A L ++F I + Sbjct: 277 GEGQFYRATDNQALEEVF-SIIDQYEKAEI 305 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 64/196 (32%), Gaps = 31/196 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + +++ I R+G F+ R ++ + + Sbjct: 107 DLSYDNIPVDRLTIIKGVADDFIA-------KRKGDRIGLILFSTRAYVQAPLTFDRNVV 159 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + TAI DA+ A T+ + ++ ++ +VLLTDG N Sbjct: 160 RDLLRTSSI--GMTGQETAIGDAIALAVKTLRTRPQE-----------QRVLVLLTDGAN 206 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ---------EKARYFLSNCA--SPNSF 428 I AK+ G++I TI + + L A + + Sbjct: 207 NSGMLSPIPAAEIAKANGVKIYTIGVGADAFAVGQRMVNPSFDLDEGALEQIAQMTGGRY 266 Query: 429 FEANSTHELNKIFRDR 444 F A L I+ D Sbjct: 267 FRARDAAGLAAIYNDI 282 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 48/158 (30%), Gaps = 25/158 (15%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + ++ + + TAI D + A + S Sbjct: 132 LVVFGSFADVQAPLTPDLNAIQS--LLADLRPGMADSRTAIGDGLALAVRQLRES----- 184 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV----------N 408 + +VLL+DGEN A ++ IR+ TI F Sbjct: 185 ------TTEDRVVVLLSDGENNSGEIRPDEATAVAAAENIRVYTIGFGSAGRDSLLQSFG 238 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + ++ A S+ EL ++FRD Sbjct: 239 LRSSSLDEQTLREIAEQTQGRYYRATSSAELAEVFRDI 276 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 82/276 (29%), Gaps = 7/276 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +S + ++S G I+ A+++PV+LG + VD+ R + L+ AA A + + Sbjct: 4 KHHSSRRLQSQRGGTAILFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGARS 63 Query: 68 LIQSLEEV---SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-MN 123 L + S+ + + + + + +R + Sbjct: 64 LSDPGGQPYNWSAASIKALDVARSNVANGGQIQDAAIETGYWNILNPALGMRPAGTPGVP 123 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 V +++ N L L I ++ + + +VI Sbjct: 124 ATGDVPAVRVTTAISATQNNGPLQLLFAPILGITERSIQASAIAVIAPPSGGTGMFPFVI 183 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + RD+ L P + Y ++ G+ + + + Sbjct: 184 ATPMLDHYWDRDTNSPVLENGVAPTIKLGSIYHFEDSDDGVLSGEWTTFQTEDGNPSGRF 243 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 L+ D + + D++ A Sbjct: 244 LWDLLKDLTTNG---NDTALYIGDNTYIQPGTKAHG 276 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 57/188 (30%), Gaps = 34/188 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + LV+ L + + R+G F + ++ + + + Sbjct: 113 DVSRLSLVKHLLGDFLE-------GREGDRVGLILFGSQAYLQAPLTFDRRTVRTWLDEA 165 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + + ++L+TDG N + Sbjct: 166 RI--GIAGKNTAIGDAIGLALKRLR-----------QRPAQSRVLILVTDGANNAGQIDP 212 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNC--ASPNSFFEAN 432 + A +G++I I + Q + L A+ +F A Sbjct: 213 LTAARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSLDLDEPALKAIAEATGGQYFRAR 272 Query: 433 STHELNKI 440 EL I Sbjct: 273 DGEELQAI 280 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 55/174 (31%), Gaps = 14/174 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ + ++ + RT+ + ST I D + A + + Sbjct: 142 DRIGLVEFADQAYVAAAPTFDTAAVARTLEEATI--GLVGRSTGIGDGLGLALKRLAPAQ 199 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQ 412 + K +VLL+DG N + AK G+R+ TIA Sbjct: 200 LADAEGGGPPPSRDKVVVLLSDGANNAGQTAPKDVAALAKDLGVRVYTIALGPIDMADNP 259 Query: 413 EKARYF-----LSNCA--SPNSFFEANSTHELN---KIFRDRIGNEIFERVIRI 456 + L A S F +T +L + G + + Sbjct: 260 NNEQDVVDVETLRAMAETSGGRAFRVKTTDDLESVAAAIDELEGGRAKAPPLPL 313 >gi|149637338|ref|XP_001510478.1| PREDICTED: similar to ITI-like protein [Ornithorhynchus anatinus] Length = 1374 Score = 84.3 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 58/422 (13%), Positives = 108/422 (25%), Gaps = 35/422 (8%) Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 + ++ A+ +KI + R + DRE Sbjct: 109 LDLPSLAFISNFTMTINNKVYVAEVKEKHQAKKIYDEARRQGKTAAHVGVKDRETEKFRV 168 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 T V L+ L R +G + + + + S Sbjct: 169 ST-----------SVEAGGTVTFTLSYEELLQRHLGKYQHAVSVRPQQVVKNLSVEVTIS 217 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + + P+ + K ++ + Y S Sbjct: 218 ERTGIDYIHVLPLRTSRLLTNTLRGEADIPPSTKIEKGEKCARIIFTPTPQEQAAYSSSG 277 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK------------KHLVRDALASVIRS 284 + + Y + + + + + + + + + ++ Sbjct: 278 IMGDFVVQYDVSMKDIIGDVQIYNGYFVHYFAPRGLPPVQKNVVFVIDVSGSMFGTKMKQ 337 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI------VKTFAIDENEMGSTA 338 KK +V +FN SD W I+ K + G T Sbjct: 338 TKKAMHVILNDLHHDDYFNIVTFSDAVSVWKASGSIQATPPNIKSAKVYVNKMEADGWTD 397 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN A+ A S + I+ LTDGE T I + AK Sbjct: 398 INAALLVAASVFNQSTGETGRGKGLKKIP--LIIFLTDGEATAGVTVASRILSNAKQSLK 455 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + +AF + R L N +E K F D I + + + Sbjct: 456 GNISLFGLAFGDDADYHLMRRLSLENRGVARRIYEDADATLQLKGFYDEIASPLLYDI-E 514 Query: 456 IT 457 +T Sbjct: 515 LT 516 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 83.9 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 54/448 (12%), Positives = 106/448 (23%), Gaps = 66/448 (14%) Query: 12 KKLIKS---CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 +++I G I+ A +M +L + G + + L+ A+ Sbjct: 9 RRVIHKPHPRRGAIAILAAFVMVALLALAGFFLSLSYVELTRAELR---------AATDA 59 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + S T + + R + D +++ + + Sbjct: 60 AARSAVIRLVETQSTTSGRAAARDIASRFEVGGKALSLNDNDIQFGRSTRQSNGSYSFAI 119 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 ++ + + + E T + I V+D S S Sbjct: 120 NGTPTNAARVFGRKTKTSAAGPVELPFGGFVGAPEYSTELNAVAMRLDYDIVIVLDRSGS 179 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M E +R L S Sbjct: 180 MGWDLSGVE--------------------FEYPEAVRQRPLVENYFSPPDPTGSRWAI-- 217 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + +++ + V G F Sbjct: 218 ----------------------LSASVNDFLTILNQRQVAARVGLVTYAGDYTFGKYSSV 255 Query: 309 DPSFSWGVHKLIRTIVK--TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + TI T +G T I + A + +S++ Sbjct: 256 KLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITAAQTMLTTSSQA------RLKT 309 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + I++ +DG Q E + I ++ F + + A Sbjct: 310 GQPIIIVFSDGMFNQGTEPVSLAASAYSQSSTIIHSVTFGATAQGRATMNSV-TATAGKG 368 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 AN+ EL + FR I N I V Sbjct: 369 LSLHANTAAELAESFRS-IANAIPIVVT 395 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 56/186 (30%), Gaps = 25/186 (13%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 K +VR + + I +G F ++ + Sbjct: 100 FDPTDPQKTKFDVVRSLVKAFIS-------KRRNDNIGVVIFGSFAYIASPLTFNKEAVK 152 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + TAI+DA+ + + S K ++LLTDG +T Sbjct: 153 KILDYLDI--GVAGSKTAIDDALIESVRLLKESQAKS-----------KIVILLTDGIDT 199 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 AK G++I TI ++ FL A +F A L Sbjct: 200 ASKTPPDVAVKMAKKYGVKIYTIGIG---DKRGIDEAFLRWLAQQGHGYYFYAKDASMLR 256 Query: 439 KIFRDR 444 KI+ + Sbjct: 257 KIYDEI 262 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 48/441 (10%), Positives = 129/441 (29%), Gaps = 31/441 (7%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 + G+ VD+ R + L+ A A + S+ L +V++ + + + Sbjct: 1 MAAAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQL-PDDPDVTNGKVKAAVTANLALNDP 59 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + + V +++ N + L L Sbjct: 60 DATDIQVTSGGATRSVCVDAKANVDMTLTKVIGIGDT-TVTAEACAGYNDIELVLVLDST 118 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 S A+ +R + ++ +Q + + Sbjct: 119 GSMKGSPIDSAKDAARDLVNLIMPASTSSTRSKIGLVPFQGKVRIDGSDPVTAERNPDGV 178 Query: 214 SYSSQNGKVGIRDEK-LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 +N + K Y + + + Y + +++ S I+ Sbjct: 179 GPGCRNADGTLNTGKLKVEYSRTATSTNIFYGYTLSGVSTFTDKTC----SGMSPIRALS 234 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ + + I +I + T+ + +V+S + + + + Sbjct: 235 SDKNTILNNIEAINAGAVTSGTLISEGIKWGRKVLSPEAPYVEGSTDKKVRKIMIVLTDG 294 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE--AKKYIVLLT---------DGENTQ 381 + +A T+ + + + Y L T D ++ Sbjct: 295 DTEDGRCGGNFASASKTVNTYWTNAYFGQGLKPDTATSPYATLSTATATLAQIPDCKDGG 354 Query: 382 D-NEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFEAN 432 N+ + + AK+ + I ++ F + + + AS + +++A Sbjct: 355 KLNQFVLDEADAAKNDLNYPVEIFSVRFGASD---ATDKSLMQKIASSKPGTTDHYYDAP 411 Query: 433 STHELNKIFRDRIGNEIFERV 453 S+ + +F+ IG ++ +R+ Sbjct: 412 SSTGIQDMFKK-IGQQLGQRL 431 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 83.9 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 71/197 (36%), Gaps = 33/197 (16%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ + + I+ RMG F + ++ ++ + + +T Sbjct: 127 VDRLTAVKSVVKNFIQ-------KRQGDRMGLVVFGSQAFLQSPLTYDLNTVETLLNETE 179 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +TAI DA+ A + N E K ++LLTDG NT + + Sbjct: 180 I--GMAGNNTAIGDAIGIALKHL-----------HQNSEKKAVLILLTDGSNTAGAVQPL 226 Query: 388 AICNKAKSQGIRIMTIAFSVN----------KTQQEKARYFLSNCA--SPNSFFEANSTH 435 +A+ G++I TI N + L A + FF A T+ Sbjct: 227 DAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNMDTTTLQKIAELTQGRFFMAKDTN 286 Query: 436 ELNKIFRDRIGNEIFER 452 +LN+I+ I + Sbjct: 287 QLNEIY-QLIDQLEASQ 302 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 83.5 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 52/390 (13%), Positives = 101/390 (25%), Gaps = 21/390 (5%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + E + + + + RN E + E+ R Sbjct: 95 IKEKAEAQAQYSTAVAKGKSAGLVKATGRNMEQFQVSVSVAPNAKITFELVYEELLKRCL 154 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 L + L + + I T + + + Q Sbjct: 155 GV-YELLLKVRPQQLVKHLQMDIHIFEPQGISFLETESTFMTNQLVDAFTTWQNK--TKA 211 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + S+ Q + + ++Y G Sbjct: 212 HIRFKPTLSQQQKYPEQQETVLDGNLIIRYDVDRAISGGSIQIENGY----FVHYFAPEG 267 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + +D S K R+AL ++ + D + F+ Sbjct: 268 LTTMPKNVVFVIDKSGSMSGK-KIQQTREALIKILDDLSPRD------QFNLIVFSTEAT 320 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++FA +G T INDAM A + SSN++E + Sbjct: 321 QWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPDGSVS-- 378 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+LLTDG+ T +I + + + F + + + L N Sbjct: 379 --LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGL 436 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 E + + + F + N + V Sbjct: 437 ARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 83.5 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 63/193 (32%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L I+ + R+G F ++ + + + + Sbjct: 112 SINRLQAVKRVLDDFIQR-------REGDRLGLLLFGTEPYIQAPLTFDLATVRTLLHEA 164 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + ++ ++ ++LLTDG NT Sbjct: 165 GI--GMAGRATAIGDALGLAVKRLRDRPQE-----------QRVVILLTDGANTAGEIAP 211 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 AK+ G+RI TI ++ L+ A + +F A Sbjct: 212 DKAAEIAKAAGVRIYTIGIGAETMVQRGLLGSRRVNPSRDLDEGLLTRIAQQTGGEYFRA 271 Query: 432 NSTHELNKIFRDR 444 S EL I+ Sbjct: 272 RSLPELELIYESI 284 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 83.5 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 46/438 (10%), Positives = 87/438 (19%), Gaps = 83/438 (18%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I+ A+ ML + M V++ R + L+ A A I A+ + + Sbjct: 2 KRKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGT 61 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS-- 134 + NF + + N N Q + Sbjct: 62 CTGSCASDTTA-----IFWANFSSAHQANGLGPFQAVSTGPVITPQNASTITIQANVQLP 116 Query: 135 -----------------SRYDLLLNPLSLFLRSMGIKSWL------------------IQ 159 ++ + L L S Sbjct: 117 LLFTKILGVSQIALSEHAQAVRSNMGMELALVLDNTDSLEAQGIEDLQCGAKILVDTVYG 176 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD------------------------ 195 A + ++ F+ M + Sbjct: 177 VAAPGSCGGGTGADTVPNLWVSIVPFAGEMNIFGSTYGGPSNWRSMPSGWLTAGSDISTT 236 Query: 196 ----SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + ++ +P+ + Sbjct: 237 RYGSNGWMGCVMARYSGYNNSPAHIYDVNDANPIQAPFTPFYW--PSTYQKYSQSSWFGG 294 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALAS-----VIRSIKKIDNVNDTVRMGATFFNDRV 306 + D V+ R + +I S T N Sbjct: 295 TSWVVGDNDWILSGGVVTPSSAARSSYGQMAESPLITSFPTTSGSLVTESGLQVGPNLGC 354 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR-----M 361 ++ G T + A+Q + TI + + Sbjct: 355 D-PSPTLPETASRSVVEAHISSMPMMSRGGTMLPQALQAGWFTISPNWQGFWPNPALPLA 413 Query: 362 KNNLEAKKYIVLLTDGEN 379 N K +VL+TDG N Sbjct: 414 YNTPNMTKVLVLMTDGNN 431 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 32/75 (42%), Gaps = 2/75 (2%) Query: 383 NEEGIAICNKAKSQGIRIMTI-AFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 N +C+ K+ GI I I + + L NCAS P ++++A + + + Sbjct: 494 NSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTAASMKQA 553 Query: 441 FRDRIGNEIFERVIR 455 F D G R+ + Sbjct: 554 FSDLGGQLSALRISQ 568 >gi|222082657|ref|YP_002542022.1| hypothetical protein Arad_9368 [Agrobacterium radiobacter K84] gi|221727336|gb|ACM30425.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 405 Score = 83.5 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 58/478 (12%), Positives = 127/478 (26%), Gaps = 123/478 (25%) Query: 31 PVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 ML G D +R +++ A+I A + + + + + + Sbjct: 1 MPMLLAVGASFDYIRAYNVRQSMQSDLDAALIAAVKNVDAG--DTDALKQKVSDWFHAQT 58 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 E N + ++ Sbjct: 59 ESSYSLGDIEIDTTNHRI---------------------------TATASGTVPTTLMKL 91 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 I + + + + + SY +++ VID S SML + Q + Sbjct: 92 ANINTVPVSVASAVKGPASSY-----LNVYIVIDKSPSMLLAATTAGQQAMYNGIGCQFA 146 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S G + + + S+ + + + Sbjct: 147 CHTGDSHTIG-------------------------------TATYSNNYAYSTEKKIKLR 175 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + DA+ VI I D ++ +++G D + + + + + Sbjct: 176 ADVAVDAVHEVIDMISASDTNHERIKVGLYSLGDTITEVLAPTLDTTAAGKRVDSD-LTS 234 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI- 389 T + ++ + + + + ++ K ++LLTDG +Q Sbjct: 235 ATSTTYTYFDVSLAALKNKVGTGGDGS-----SSATPLKLVLLLTDGVQSQREWVTSGAK 289 Query: 390 ------------CNKAKSQGIRIMTI---------------------------------- 403 C+ K Q + + Sbjct: 290 YQPKVAPLNPAWCDYIKKQSATMAVLYTEYLPITTDWGYNATVGSTMASANWKSTWGGTM 349 Query: 404 --AFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 S + T+++ Y LS+CAS S F A+S E+ + +R+T+ Sbjct: 350 DSGVSSSITRRDYIPYALSDCASSKSLFMSASSATEITTGLSALFTQYLAS--VRLTQ 405 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 83.5 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 69/217 (31%), Gaps = 27/217 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + +D L+ + + + + ++ + I Sbjct: 75 QTYNSWDNKDAEGIDIMLTMDISASMLTEDVFPNRIEVAKEVASDFIS-------GRPND 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + L+ ++ D G TAI + + + Sbjct: 128 NIGLTIFAGEAFTQCPMTVDHAALL-NLLHNVRTDLVVKGLIQDGTAIGMGLANSVSRLK 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S K I+LLTDG N + + + AK GIRI TI Sbjct: 187 DSKAKS-----------KVIILLTDGSNNVGSISPMTAASIAKKYGIRIYTIGLGKESEG 235 Query: 412 Q--EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L N A + F+ A S EL+KI++D Sbjct: 236 DLGAIDYKTLQNIAVSTNGEFYRAQSQAELSKIYQDI 272 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 83.5 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 62/195 (31%), Gaps = 38/195 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + R + + + R+ F + + + + ++ Sbjct: 102 NRLEVARKTIQDFVDQ-------RPSDRIALIAFAGTAYTRVPLTLDHNVVRESLQDISF 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 NE G TAI A+ + + K + K ++LLTDG+N + + Sbjct: 155 KSVNEEG-TAIGMAISVGLNRL-----------KKSTSPSKIMILLTDGDNNAGSIDPNT 202 Query: 389 ICNKAKSQGIRIMTIAFSVN-----------------KTQQEKARYFLSNCA--SPNSFF 429 AK GI+I TI + L A + ++ Sbjct: 203 ASTLAKDSGIKIYTIGVGSDKTIIPGTNEFGQTVYQEYESGLLNEDLLKKIAETTNGQYY 262 Query: 430 EANSTHELNKIFRDR 444 A ++ L+++F + Sbjct: 263 RAKDSNALSQVFANI 277 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 83.5 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 52/390 (13%), Positives = 103/390 (26%), Gaps = 21/390 (5%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + E + + + + RN E + E ++ Sbjct: 95 IKEKAEAQAQYSAAVAKGKNAGLVKATGRNMEQFQVSVSVAPNAKITFELVY-EELLKRR 153 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 L + L + + I T + + + Q Sbjct: 154 LGVYELLLKVRPQQLVKHLQMDIHIFEPQGISFLETESTFMTNQLVDALTTWQNK--TKA 211 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + S+ Q + + ++Y G Sbjct: 212 HIRFKPTLSQQQKSPEQQETVLDGNLIIRYDVDRAISGGSIQIENGY----FVHYFAPEG 267 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + +D S +K R+AL ++ + D + F+ Sbjct: 268 LTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD------QFNLIVFSTEAT 320 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++FA +G T INDAM A + SSN++E + Sbjct: 321 QWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVS-- 378 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I+LLTDG+ T +I N + + + F + + + L N Sbjct: 379 --LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGL 436 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 E + + + F + N + V Sbjct: 437 ARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 122 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 174 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 175 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLDP 221 Query: 387 IAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SPNSFFEAN 432 + AK++G+RI TIAF + L A + FF A Sbjct: 222 LKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRAR 281 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 282 DTEELAGIYAEL 293 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 122 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 174 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 175 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLDP 221 Query: 387 IAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SPNSFFEAN 432 + AK++G+RI TIAF + L A + FF A Sbjct: 222 LKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRAR 281 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 282 DTEELAGIYAEL 293 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 29/265 (10%), Positives = 76/265 (28%), Gaps = 27/265 (10%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G++ ++T + M ++G + VD + + + A A + L + Sbjct: 23 RQFRRDRRGNYALMTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLTEG 82 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + F + + + N+ + + P Sbjct: 83 -ATDDQLKAYALDFFNANLNDIDPASATLNVTLPSNTSGGGLLTMTAQLAYKPYFYPAFA 141 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L + N F +V+ ++ + + V+D S SM Sbjct: 142 QLVGKSATDANQKINF------------------SVTSQVRLKNTLEVALVLDNSGSMTT 183 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--------SPYMVSCNKSLYYM 243 S + ++ + + V + + Q + D+ + + V Sbjct: 184 LGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDKPVQFGLVPFAASVNVGPANGNASW 243 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVI 268 + L P +E + + + Sbjct: 244 MDTEGLSPVSNENFDWSTLNAANKY 268 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/242 (11%), Positives = 56/242 (23%), Gaps = 7/242 (2%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 D+ S + + + S S N +G+ D + Sbjct: 332 EYDYIDSYGPFASWQGCVEARPYPYNVNDASPSGGSANTGIGVGDPATMFVPMFAPDEPG 391 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM---- 297 E V ++ + + K Sbjct: 392 NHWKLT---QDPDEAAPVTYGAVNSWWNDDPTSGTGQSRLRNMAKYFQPRPIDAPALPAG 448 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 ++ + T +K G T + + M + + S Sbjct: 449 NGPNYSCTTNPITPLTDVSVADGATSIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFT 508 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 R + K +++LTDG NT + + A S+ + R Sbjct: 509 QGRRETEKGNDKVVIVLTDGANTYYTPSSLGYSDPANSKSTYASYGYLNPGYNGTSVGRM 568 Query: 418 FL 419 F+ Sbjct: 569 FM 570 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 66/199 (33%), Gaps = 34/199 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L+ I R+G F D + V + Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDVKTV 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + +TAI DA+ + + +E + +VLLTDG+N Sbjct: 156 SKMLSEAQI--GLVGRATAIGDALGLSVKRFANKDESN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SP 425 T N A+ +GI++ TI + + L A + Sbjct: 203 TAGNLNPEDALLLAREEGIKVYTIGVGSDNPRGFSLFNVGGSSGSNLDESLLKKIAEQTG 262 Query: 426 NSFFEANSTHELNKIFRDR 444 +F A L +I+ + Sbjct: 263 GLYFRAKDVAGLQQIYAEL 281 >gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Macaca mulatta] Length = 935 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 72/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T IN+AM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I + + + F + + Sbjct: 367 NQEERLPDGSVS----LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 59/194 (30%), Gaps = 35/194 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + LV+ L ++ + R+G F + ++ + + + Sbjct: 114 VSRLDLVKALLGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQ 166 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +TAI DA+ A + + +VL+TDG N + Sbjct: 167 I--GIAGKNTAIGDAIGLAVKRLR-----------QRPAQSRVLVLITDGANNGGRIHPL 213 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCA--SPNSFFEANS 433 A + +RI TI N + L A + ++F A+ Sbjct: 214 TAARLAAQEDVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIADLTHGAYFRAHD 273 Query: 434 THELNKIFRDRIGN 447 EL D + Sbjct: 274 GAEL-DAIGDTLDQ 286 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 83.1 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 35/342 (10%), Positives = 83/342 (24%), Gaps = 16/342 (4%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +L G+ + L + ++ + G +D+ R ++ +QA + Sbjct: 3 LRMLLGRLWSDQRGNTLALMTLALIPLVALMGSGLDMTRAYVAQNRFRQACDAGSLAGRR 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM---N 123 L ++R + + F + YL T ++ + T Sbjct: 63 MLAGLTLPQAARDEATKYFMFDFPQGYLQSAPYTLTMSVPTAGTLQISSQTTVPTTLMGL 122 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA-------ETVSRSYHKEHG 176 + + + S + Sbjct: 123 FGFDTLPISTTCSATQDFVNTDIMFVFDLSGSMNCAPGVTGYCGDVEQSGSRMGALRSAA 182 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S ++ ++S L + + + +N ++ Sbjct: 183 TSFYDTLETAQSQLAANNLRLRYGFVNYNSTVN-VGRILYEKNPDWMVQSWSYQSRTPDW 241 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + Y + + + + + + + DA A + K NV T Sbjct: 242 IDATAYFNNKSACNSAYTYDSQASQADANYTGSRTPDWIDATAYF--NGKSACNVAYTYD 299 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + N W IV++ + N G T Sbjct: 300 SQVSQANPNYTGIMGGWWLTSNRCSVIVQS---NGNPDGYTY 338 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 53/410 (12%), Positives = 106/410 (25%), Gaps = 74/410 (18%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 Y + + N+T D + TA S+ + + Sbjct: 256 SAYTYDSQASQADANYTGSRTPDWIDATAYFNGKSACNVAYTYDSQVSQANPNYTGIM-- 313 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 W + + + V + + + + +D + + + Q Sbjct: 314 ---GGWWLTSNRCSVIVQSNGNPDGYTYGRRSVDVRPFLASNLKATNVQS--------PT 362 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + N R + C + + ++ Sbjct: 363 PIWQITGTNDPSDDRPYEFKSVWNGCIEERKTN-------SAAINGGSSTTAPSDAYDLD 415 Query: 271 KH-LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFA 328 + + + D + R+ ++ Sbjct: 416 VDLVPYNDDTRWRPMWNDVSYYPDWSWSYGVGRQPVAYCPTEAKRLQNYHNNRSGFVSYL 475 Query: 329 IDENEMGSTAINDAMQTAYDTIISS-------------NEDEVHRMKNNLEAKKYIVLLT 375 G T + M + ++ N+ + KKY++ +T Sbjct: 476 NGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKIRGFSVKKYMIFMT 535 Query: 376 DGE-----------------------NTQDNEEGIA--------ICNKAKSQGIRIMTIA 404 DG+ T DN +A CN AK++GI I IA Sbjct: 536 DGDMSPTWSDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACNAAKAKGIDIWVIA 595 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERV 453 FS T ++NCAS P +S L F++ IG++I + Sbjct: 596 FSTTLTAD------MTNCASKPEQAAGLSSNAALIAKFKE-IGSKIATVL 638 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 39/340 (11%), Positives = 87/340 (25%), Gaps = 69/340 (20%) Query: 171 YHKEHGVSIQWVIDFSRSM----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 Y I V+D S SM D +C + + + Sbjct: 85 YEDSKSREIVLVLDTSGSMDQEINQLCEDCAYYCKDCDKWIYETRENHKNQKPYINHTIV 144 Query: 227 EKLSPYM------VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 + Y N+ + P +++ + + K H ++ A + Sbjct: 145 RRFGGYYCYDCNKYIENEETHINYRPYANHSFSDKKYCNNHKAYESYTTKIHELKKAAKN 204 Query: 281 VIRSIKKID-----NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT-----IVKTFAID 330 I S+ +++G +N+ + + +K + Sbjct: 205 FIDSLTSTKTDGQTPNVKNLKIGIVSYNNSGYINEGLVQVTDSDRKNNGNINELKDTIEN 264 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE----- 385 G T D ++ A + NE K ++ + DGE T + + Sbjct: 265 LRADGGTNTGDGLRKAAYLLNEENEAN-----------KTVIFMGDGEPTYYSSDRWGND 313 Query: 386 --------------------------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF- 418 I K + + ++ + + Sbjct: 314 YTNLDDTNQYVGGTGYSDADGKCLSYAKTIGEIIKGEQYNVFSVGYGLGDENSASNNKMK 373 Query: 419 -----LSNCASPNSF-FEANSTHELNKIFRDRIGNEIFER 452 + +S + F A+ ++K+F+ I Sbjct: 374 QIHESMGGISSGENSTFFASDEGAIDKVFQQIADTIIKTS 413 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Macaca mulatta] Length = 900 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 72/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T IN+AM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I + + + F + + Sbjct: 367 NQEERLPDGSVS----LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 73/225 (32%), Gaps = 48/225 (21%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S+ +++ + ++ + I R+G F + L Sbjct: 105 DLSTRENIVTRLDAAKEVVQEFIG-------KRPHDRIGLVAFAADAFVVSPLTLNHDWL 157 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + D N G TAI A+ + + + + + ++LLTDGEN Sbjct: 158 KKNVQRLELGDINLSG-TAIGTALGASVNRLRD-----------HESRSRIVILLTDGEN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIA-----------------FSVNKTQQ---------- 412 ++ AKS +++ TIA ++ Sbjct: 206 NSGTLSPLSAAEAAKSLNVKVYTIATGRKGRVEVAEMSRDGRVIRDRNGNPLYRGRSELS 265 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + L A + FF+A+ + +L I+ + E +R Sbjct: 266 DYDESELREIAKLTGGQFFKASESGDLENIYDEIDELEKTTVELR 310 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 83.1 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 45/396 (11%), Positives = 96/396 (24%), Gaps = 65/396 (16%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +S G + +L +ML +GG VDV+ + L+QA +TA+ Sbjct: 28 RFARSEEGSILVFGLMLFILMLMLGGFAVDVMSFEAKRTDLQQAVDRCALTAAALAQTRD 87 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + N Sbjct: 88 PEEVVEDCMLKAGKADYVTLIDHDEGLNYR------------------------------ 117 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 LF +GI S +AE + + + + Sbjct: 118 -EVVVTAQQPTKPLFAHMLGIDSLTAPAATKAEQKVTNVEIVMVLDVSGSMVRDSYSRPT 176 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + + + +++ K Y Sbjct: 177 DKLKNLKAAAKEFVDTMLAKDLNHRISIAIVPYNGQVNLGKSLRQKFNIYDNNGVTYMDC 236 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + V +S+ K + +A + + T ++ S+ Sbjct: 237 VDMPASVYASTGLSRTLKMPMTANA-----DTFSAAFSNARTGGTPPDTYSGPKTSEAQP 291 Query: 313 SWGVHKLIR-------------TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED--- 356 + G + + ++ G+T+IN M+ + S Sbjct: 292 TPGNRWCLPTAANVVRLPTGSISSLQASIDGLEGNGATSINAGMKVGLSLLDPSARPMFS 351 Query: 357 -------------EVHRMKNNLEAKKYIVLLTDGEN 379 + E K ++++TDGE+ Sbjct: 352 EFVGSGEIQSYFHGRPFDYTDEEVMKVMIVMTDGEH 387 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/414 (12%), Positives = 103/414 (24%), Gaps = 23/414 (5%) Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 + + +++ + + F + + L + + +R Sbjct: 161 LDVSGSMVRDSYSRPTDKLKNLKAAAKEFVDTMLAKDLNHRISIAIVPYNGQVNLGKSLR 220 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + Y + + + M + S + Sbjct: 221 QKFNIYDNNGVTYMDCVDMPASVYASTGLSRTLKMP---MTANADTFSAAFSNARTGGTP 277 Query: 177 VSIQWVIDFSRSML-DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 S + R N P +S +G G ++ M Sbjct: 278 PDTYSGPKTSEAQPTPGNRWCLPTAANVVRLPTGSISSLQASIDGLEGNGATSINAGMKV 337 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 L P + S E + +++ + + + VND Sbjct: 338 GLSLLDPSARPMFSEFVGSGEIQSYFHGRPFDYTDEEVMKVMIVMTDGEHFEEERVNDGY 397 Query: 296 R---MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA------ 346 R +D S G T T + A Sbjct: 398 RVGDSPIYRSSDGEYLSLKLSNGKFYW-PYDNTTTNSAAKGKSPTQLTWQQVWASYRTSY 456 Query: 347 -YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + S + + T + + + A+CN AKS + I IAF Sbjct: 457 IAWQLYSRRPGFRSSDRLAAYTAQMNAFRTLTPISTMDAQLQALCNLAKSNNVTIFGIAF 516 Query: 406 SVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + + NC++ S + S E+ FR I ++I +R+++ Sbjct: 517 EAPANG----KTQIQNCSTSRSSHYFDASGLEIQTAFR-AIASQISY--LRLSQ 563 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 83.1 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 83.1 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 37/336 (11%), Positives = 86/336 (25%), Gaps = 10/336 (2%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + ++ + G II L + VM+G G+ +D+ + L+ +A Sbjct: 89 MSSAARRPKAVRRGLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 148 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++A+ L S +S N + + + N T + + Sbjct: 149 ALSAARDL-TSAISLSVAEANGIAAGHMNFAFFQQKPVQMLTDSNVTFSDSLTNPFLSKT 207 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + V + + + L + + A ++ + +I Sbjct: 208 AVATPANVKYVKCVTSLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIP 267 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S Y+ + +++ +G +CN + Sbjct: 268 VFVCRGPSDPPYKVGDWISSPSGSSSTYGPGNFGWAALDGSTNETTIASELSGDTCNITS 327 Query: 241 YYMLYPG---------PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 L + + SS + V + K D Sbjct: 328 PPDLASTGLKSASLRAWNTRFGIYTNGANGSSGQPDFTGYAYVGRDYGPPGTAGTKGDAY 387 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V + + P G+ + Sbjct: 388 TQFVIDRTKYKPYQGDGAPPSGSGISTNGTATASNY 423 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 57/196 (29%), Gaps = 35/196 (17%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + + + + R+G F+ S + L R I Sbjct: 127 DFTPNRLEAAKLVAKNFV-------HGRKYDRIGLVIFSGEAYSVSPLTTDYKLLKRYIE 179 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 E G TAI A+ + S K ++L++DG+NT N Sbjct: 180 DIREDMIQENG-TAIGSALGMGTIRMQESASRS-----------KVVILISDGDNTAGNL 227 Query: 385 EGIAICNKAKSQGIRIMTI--A-----------FSVNKTQQEK-ARYFLSNCA--SPNSF 428 + I A + I+I TI F + L A F Sbjct: 228 DPITASRLATAHNIKIYTILVGRSGKVPYGRDMFGQPQYVNNTVDESVLREIAKIGEGKF 287 Query: 429 FEANSTHELNKIFRDR 444 + A+ L +F + Sbjct: 288 YRASDNQALKNVFAEI 303 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 74/224 (33%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + G + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGGYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|126173282|ref|YP_001049431.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125996487|gb|ABN60562.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 642 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 42/394 (10%), Positives = 109/394 (27%), Gaps = 26/394 (6%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A++ +V L + + +Q + + + + ++ E+ + Sbjct: 47 AALLLVAVSLTACGGKGAEVE--HRQAEQQAEQRHQVASQRQAEMRDAAKVEMARVAAPM 104 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + N + R + ++ + T + Sbjct: 105 QMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFFIDVDTGSYAT 164 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ ++ + + + LN F K+ + + + + M+ Sbjct: 165 LRRML-REGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRI 223 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L P + + +D S K L++ AL + + D V+ V Sbjct: 224 GLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLLTAQLSAQDKVSIVV--- 280 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + GV + + GS + AY + Sbjct: 281 ------YAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQL--AKKHFIP 332 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKA 415 + + ++L TDG+ + + K GI + T+ F + + Sbjct: 333 NGINR-------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 385 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ D + + + Sbjct: 386 EQLADK--GNGNYAYIDTLNEARKVLVDELSSTL 417 >gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca] Length = 901 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 43/386 (11%), Positives = 99/386 (25%), Gaps = 22/386 (5%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY-QV 131 + + ++ + + + V V K Sbjct: 68 KAAAQEQYSAAVARGESAGLVKAAGRKTEQFQVSVSVAPTAKVTFELVYEELLKRHLGAY 127 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L + L + + I T + + + + +++ + Sbjct: 128 ELLLKVQPQQLVKHLQMDIHIFEPQGISFLETESTFMTNELADA---LTVSQNKTKAHIQ 184 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ Q + + ++Y G Sbjct: 185 FKPVLSQQKSSGQQDTVLDGNFIVRYDVNRTISGGSIQIENGY----FVHYFAPEGLPTI 240 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + +D S +K R+AL ++ + D + F+ Sbjct: 241 PKNVIFVIDKSGSMSG-RKMQQTREALIKILDDLSPKD------QFNLISFSGDAAQWKP 293 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 +++A G T IN+A+ A + S+ + E+ I Sbjct: 294 LLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQLLNSAKQKELMPEGTVS----LI 349 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +LLTDG+ T I K + + F + + + L N Sbjct: 350 ILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVSYAFLEKLALDNGGLARRI 409 Query: 429 FEANSTHELNKIFRDRIGNEIFERVI 454 +E + + + F + + N + V Sbjct: 410 YEDSDSALQLQDFYEEVANPLLTAVT 435 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 59/195 (30%), Gaps = 35/195 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + LV+ L ++ + R+G F + ++ + + + Sbjct: 66 DISRLDLVKALLGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEA 118 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + + +VL+TDG N Sbjct: 119 QI--GIAGKNTAIGDAIGLAVKRLR-----------QRPAQSRVLVLITDGANNGGQIHP 165 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCA--SPNSFFEAN 432 + A +G+RI TI N + L + ++F A+ Sbjct: 166 LTAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALREIGEITHGAYFRAH 225 Query: 433 STHELNKIFRDRIGN 447 EL D + Sbjct: 226 DGAEL-DAIGDTLDQ 239 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 64/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 110 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 162 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 163 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNP 209 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + L A + FF A Sbjct: 210 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRAR 269 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 270 DTDELAGIYAEL 281 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 62/212 (29%), Gaps = 42/212 (19%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + ++ S I D D R+G F + + L Sbjct: 102 ARDFRPDRITAAKEVAGSFI-----ADRYGD--RIGLVAFAGEAFTQSPLTTDQGTL--Q 152 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + TAI + + TA + + S K I+LLTDG N + Sbjct: 153 TLLARIRSGLIEDGTAIGNGLATAINRLRES-----------EAKSKVIILLTDGVNNRG 201 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-------------------EKARYFLSNCA 423 AK+QGIR+ TI E L + A Sbjct: 202 EIAPQTAAEIAKAQGIRVYTIGVGTEGMAPYPAVDIYGTPTGGTVMAKVEIDEKTLRSIA 261 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + +F A +L K D+I +V Sbjct: 262 EQTGGQYFRATDKAKL-KAIYDQINQLEKSKV 292 >gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Macaca mulatta] Length = 888 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 72/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 242 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 296 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T IN+AM A + SS Sbjct: 297 --QFNLIVFSTEATQWRPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQLLDSS 354 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I + + + F + + Sbjct: 355 NQEERLPDGSVS----LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS 410 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 411 YAFLEKLALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 454 >gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens] Length = 900 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 72/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++ + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEGRLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 82.7 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 61/173 (35%), Gaps = 7/173 (4%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 IK +D+++ + F+ ++FA +G T INDAM Sbjct: 28 IKILDDLSPRDQFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAML 87 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 A + SSN++E + I+LLTDG+ T +I N + + Sbjct: 88 MAVQLLDSSNQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 143 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + F + + + L N E + + + F + N + V Sbjct: 144 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 196 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 49/163 (30%), Gaps = 28/163 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ Sbjct: 128 DRIGLILFADHAYLQSPLTQDRRSVAQYLKEAQI--GLVGKQTAIGEAIALGVKRFDKVE 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 + + ++LLTDG N A +GI I TI + Sbjct: 186 QSN-----------RVLILLTDGSNNAGAITPEQASQIAAQRGITIYTIGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L A + +F A +T EL +I++ Sbjct: 235 LFGKERVNPSMDLDESQLQEIAKVTGGQYFRARNTEELEQIYQ 277 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 62/193 (32%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + ++ + I R+G F ++ ++ + + R + + Sbjct: 113 SVDRLTALKAVAGAFIDQ-------RQGDRIGLILFGEQPYIQAPLTFDHNTVTRLLHEA 165 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ A + + + ++LLTDG + + + Sbjct: 166 VV--GLAGNKTAIGDAIGLAVKRLRKDPQAKN-----------VLILLTDGASNSGSLDP 212 Query: 387 IAICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCA--SPNSFFEA 431 + A +G+++ TI + L A + +F A Sbjct: 213 LKAAKLAAQRGLKVYTIGIGAEAVEVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRYFRA 272 Query: 432 NSTHELNKIFRDR 444 T EL +I++ Sbjct: 273 RDTEELAQIYQQL 285 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 64/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 235 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 287 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 288 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNP 334 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + L A + FF A Sbjct: 335 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRAR 394 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 395 DTEELAGIYAEL 406 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 64/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 235 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 287 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT Sbjct: 288 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNP 334 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEAN 432 + AK++G+R+ TIAF + L A + FF A Sbjct: 335 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRAR 394 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 395 DTEELAGIYAEL 406 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 66/193 (34%), Gaps = 34/193 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +V+ + + L+ + + R+G F R + + + + + Sbjct: 121 NVVDRLTAAKAVLSDFLDR-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLAD 173 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI DA+ + + ++ +VLLTDG NT Sbjct: 174 SVV--GLAGRETAIGDAIALSVKRLREQRHG-----------QRVVVLLTDGVNTAGVLN 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEA 431 + AK++G+R+ TIAF + + L A + FF A Sbjct: 221 PLKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRA 280 Query: 432 NSTHELNKIFRDR 444 T EL I+ + Sbjct: 281 RDTEELAGIYAEL 293 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 62/197 (31%), Gaps = 33/197 (16%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + D I + LV+ L I R+G F + ++ Sbjct: 103 DYADMQWDDEPISRLELVKRLLGDFIE-------GRRGDRVGLILFGSQAYLQAPLTFDR 155 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 H + + + +TAI DA+ A + + +VL+TD Sbjct: 156 HTVRTWLDEALI--GIAGKNTAIGDAIGLAVKRLR-----------QRPAQSRVLVLITD 202 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK-----------TQQEKARYFLSNCA-- 423 G N + + A +G+RI TI + + + L + Sbjct: 203 GANNGGEIDPMVAAQLAADEGVRIYTIGIGADPQQSGAFGSFGFSALDLDETSLRAISDT 262 Query: 424 SPNSFFEANSTHELNKI 440 + +F A + EL +I Sbjct: 263 TGGEYFRARNQAELEQI 279 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F ++ +I + + +TAI DA+ A + SN Sbjct: 151 DRVGLILFGSNAYVQTPLTFDRKTVITLLDEAAV--GLAGKATAIGDAIGLAVKRLEQSN 208 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 D+ K ++ ++LLTDG NT A G+ I TI + Sbjct: 209 RDKRIASK-----EQVLILLTDGVNTAGQLSAPQAAELAAEHGLTIYTIGIGADAMTVQS 263 Query: 410 --------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L++ A + +F A+ T EL KI+ Sbjct: 264 FFGTQRVNPSADLDEKMLTDIATKTGGRYFRAHDTQELQKIYAMI 308 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 78/266 (29%), Gaps = 1/266 (0%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + +++ + G I+ L + VM+G G+ +D+ + L+ +A Sbjct: 1 MSSAARHPKLTRRSLHRQRGAVAIVVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++A+ L ++ + A + + N T + T Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHVNFAF-FQKSAVQMLTDSNVTFSDALTNPFLTKT 119 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 ++ + V ++ + N L +M T+ A ++ + +I Sbjct: 120 AVSTPANVKYVKCTATLSGIANWFVGVLNAMPGVQVANATQVSASAIATVAAGQTTCAIP 179 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + S Y+ + +++ +G +CN + Sbjct: 180 VFVCRASSAAPYKVGDWISSPSGSSSTYGPGNFGWAALDGSTNETTIASELSGNTCNITS 239 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRH 266 L L + + Sbjct: 240 PPDLGSTGLKSASLRAWNTRFGIYTN 265 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 67/193 (34%), Gaps = 34/193 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +V+ + + L+ + + R+G F R + + + + + Sbjct: 121 NVVDRLTAAKAVLSDFLDR-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLAD 173 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI DA+ + + + ++ +VLLTDG NT Sbjct: 174 SVV--GLAGRETAIGDAIALSVKRLREQRQG-----------QRVVVLLTDGVNTAGVLN 220 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF------------LSNCA--SPNSFFEA 431 + AK++G+R+ TIAF + + L A + FF A Sbjct: 221 PLKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRA 280 Query: 432 NSTHELNKIFRDR 444 T EL I+ + Sbjct: 281 RDTEELAGIYAEL 293 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 44/128 (34%), Gaps = 3/128 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + ++L++ G+F ++ A+ ++G + +D + ++ A A + + Sbjct: 6 FRLHLRRLVRDRDGNFAVLGAIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVR 65 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + E + F L + + T+ + + +N Sbjct: 66 AYGEGATEDDAYDAAQKFFWSNYA---LPQESVVDALAVATEPSTQALSVKFTRNVNEDT 122 Query: 127 SAYQVVLS 134 + + VL Sbjct: 123 ATAEFVLD 130 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEI 279 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 63/223 (28%), Gaps = 39/223 (17%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + +D + + V+ +A I +T R+G F + + Sbjct: 92 CLDTSGSMQALDFKVEGKSVTRLEAVKKVVADFIG-------KRETDRIGLVVFGEEAFT 144 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + L+ + + TAI A+ + Sbjct: 145 QSPLTIDKGLLLELVNRMKI--GMAGDRTAIGSAIAIGGKRLKD-----------LKSKS 191 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ---------------- 412 K ++LLTDG N A + GI++ TI Sbjct: 192 KILILLTDGRNNAGEISPQAAARAVREFGIKLYTIGVGGKGPAPFRMKTLFGTRLVPQHV 251 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + L N A +F A ++ EL +I+ D I V Sbjct: 252 DLDEVTLRNVAKTGGGKYFRAANSQELQEIY-DIIDRAEKTDV 293 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 50/163 (30%), Gaps = 28/163 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A Sbjct: 128 DRIGLILFADHAYLQSPLTQDRRTVAQYLKEAQI--GLVGKQTAIGEAIALAVKRFDKVE 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 + + ++LLTDG N A +GI I TI + Sbjct: 186 QSN-----------RVLILLTDGSNNAGAISPEQATQIAAKRGITIYTIGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L A + +F A +T EL +I++ Sbjct: 235 LFGKERVNPSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQ 277 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 60/195 (30%), Gaps = 35/195 (17%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 I + V+ L+ I R+G FF D + + + + Sbjct: 106 DDYIDRLTAVKQVLSDFIAQ-------RQGDRLGLIFFADHAYLQTPLTLDRTTVAQQLN 158 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + TAI D + A T I S ++ ++LL+DG N Sbjct: 159 QAVLRLI--GTQTAIGDGIGLATKTFIES-----------DAPQRVMILLSDGSNNAGVL 205 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFF 429 + I + AK I T+ T ++ L A + +F Sbjct: 206 DPIEAAHIAKQYHTTIYTVGVGAGEMMVRDFFMTRRINTAEDLDEETLQKIADLTGGQYF 265 Query: 430 EANSTHELNKIFRDR 444 A H+L I++ Sbjct: 266 RARDKHDLQTIYQTI 280 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 36/159 (22%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + D ++ + + + + +TAI +A+ + S Sbjct: 107 VVLYGDFAFIASPITYEKEIVTQ--MLGYLTQGMAGQNTAIGEAIAMGVRSFKHSKAKT- 163 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K IVLL+DGE+ + AK QGI+I TIA + E Sbjct: 164 ----------KVIVLLSDGEHNSGSVSPKEATELAKEQGIKIYTIAMG---NKGEADEAL 210 Query: 419 LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L A S FF A+S EL I+ + E R Sbjct: 211 LETIAKDSNGEFFSASSAKELKNIYDEIDKLESSNIKSR 249 >gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Ailuropoda melanoleuca] Length = 849 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 43/386 (11%), Positives = 99/386 (25%), Gaps = 22/386 (5%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY-QV 131 + + ++ + + + V V K Sbjct: 101 KAAAQEQYSAAVARGESAGLVKAAGRKTEQFQVSVSVAPTAKVTFELVYEELLKRHLGAY 160 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L + L + + I T + + + + +++ + Sbjct: 161 ELLLKVQPQQLVKHLQMDIHIFEPQGISFLETESTFMTNELADA---LTVSQNKTKAHIQ 217 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ Q + + ++Y G Sbjct: 218 FKPVLSQQKSSGQQDTVLDGNFIVRYDVNRTISGGSIQIENGY----FVHYFAPEGLPTI 273 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + +D S +K R+AL ++ + D + F+ Sbjct: 274 PKNVIFVIDKSGSMSG-RKMQQTREALIKILDDLSPKD------QFNLISFSGDAAQWKP 326 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 +++A G T IN+A+ A + S+ + E+ I Sbjct: 327 LLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQLLNSAKQKELMPEGTVS----LI 382 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +LLTDG+ T I K + + F + + + L N Sbjct: 383 ILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVSYAFLEKLALDNGGLARRI 442 Query: 429 FEANSTHELNKIFRDRIGNEIFERVI 454 +E + + + F + + N + V Sbjct: 443 YEDSDSALQLQDFYEEVANPLLTAVT 468 >gi|114587348|ref|XP_001172665.1| PREDICTED: similar to PK-120 precursor isoform 1 [Pan troglodytes] Length = 849 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 53/368 (14%), Positives = 109/368 (29%), Gaps = 19/368 (5%) Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 + + +V+ T M + + V +++ L L R +G+ Sbjct: 98 KAEAQAQYSTAVAKGKSAGLVKATGRNMEQFQVSVSVAPNAKITFELVYEELLKRRLGVY 157 Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 L++ + + + S + + + Sbjct: 158 ELLLKVRPQQLVKHLQMDIHIFEPQGISFLETESTFMTNQLVDALTTWQNKTKQKSPEQQ 217 Query: 215 YSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + +G + IR + ++Y G + +D S + Sbjct: 218 ETVLDGNLIIRYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-R 276 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R+AL ++ + D + F+ ++FA+ Sbjct: 277 KIQQTREALIKILDDLSPRD------QFNLIVFSTEASQWRPSLVPASAENVNKARSFAV 330 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T INDAM A + SSN++E + I+LLTDG+ T +I Sbjct: 331 GIQALGGTNINDAMLMAVQLLDSSNQEEQLPEGSVS----LIILLTDGDPTVGETNPRSI 386 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 N + + + F + + + L N E + + + F + Sbjct: 387 QNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVA 446 Query: 447 NEIFERVI 454 N + V Sbjct: 447 NPLLTAVT 454 >gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Nomascus leucogenys] Length = 930 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTAMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ I ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEAIQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I + + + F + + Sbjct: 367 NQEERLPDGSVS----LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 60/192 (31%), Gaps = 38/192 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V+ L I+ K R+G F + ++ + R + + Sbjct: 116 RIVAVKTVLNEFIQRRKG-------DRLGLILFGSQAYVQAPLTFDQTTVQRFMREAQI- 167 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +TAI DA+ + + D ++LLTDG+N I Sbjct: 168 GFAGEENTAIGDAIGLSVKRLRDRPGDRH-----------VMILLTDGQNNGGKINPIPA 216 Query: 390 CNKAKSQGIRIMTIAFSVN-----------------KTQQEKARYFLSNC--ASPNSFFE 430 A + GI I TI + + L A+ +F Sbjct: 217 SKIAANNGIIIYTIGVGADEMVMPGVLGSSFGSRRVNPSADLDEKTLQQVATATGGQYFR 276 Query: 431 ANSTHELNKIFR 442 A + EL KI+R Sbjct: 277 ARNPQELEKIYR 288 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 63/227 (27%), Gaps = 38/227 (16%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + + +D +L+ + + + ++ I Sbjct: 78 QSSNSWQTYTSEGIDIALALDVSTSMLARDFTPDRLEAAKEVATKFI-------LERPQD 130 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++G F + + L + TAI + A + + S Sbjct: 131 KIGLVVFAGESFTQCPLTTDQAVL--VNLLREVKSGMIQDGTAIGLGLANAVNRLKDS-- 186 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--- 412 K ++LLTDG N Q + AK+ GIR+ TI Sbjct: 187 ---------PGKSKVVILLTDGINNQGAIAPVTAAELAKAFGIRVYTIGVGTYGEAPYPV 237 Query: 413 -------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +F A +L +I+ + Sbjct: 238 PTPFGVQLQNMPVEIDEGVLQQIANVTGGKYFRATDNDKLQQIYSEI 284 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 82.0 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 54/165 (32%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +++ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGESIALAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + +VLLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLVLLTDGSNNAGNIEPQQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 + L A+ +F A ++ EL +I+++ Sbjct: 235 LFGRERVNPSMDLDENQLQQIADATHGRYFRARNSEELEQIYQEI 279 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIGLAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNSGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEI 279 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/327 (12%), Positives = 85/327 (25%), Gaps = 57/327 (17%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM-VSCNKSL 240 V+D S SM++ R + Q A + S + + + + Sbjct: 4 VLDVSSSMIEENRFTPMQTAVAGFLQAFSSNTSLVDKTKISIVPFSSRVNFGLANTAWLK 63 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLR----------------------HVIKKKHLVRDAL 278 Y + SSS + + + + Sbjct: 64 SYNGTAAVPKRWTDPKSVYTSSSYKLSYWIDGVTPVMSTSKNYYWMGCIEPRADVEVRDT 123 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ + P + +K+ + GST Sbjct: 124 GAIGDGMGDAPPGTAAFVAMDANPKSGTSFCPPPVTPLTG-DFAYLKSVVKNLTSEGSTR 182 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAK----KYIVLLTDGENTQD------------ 382 ++ + + T+ + + E K +V +TDGE Sbjct: 183 LDAGVVAGWYTLSPKWQGVWGDETSPAEVSDSVHKVMVFMTDGEMNTKYDPNDKFDWICS 242 Query: 383 -----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 C K GI I T+++S + NCA+ + F Sbjct: 243 QTQSSACNAFATAAMQTACTAMKKSGIEIYTLSYSADADVVNI-----RNCATNTAHFFT 297 Query: 432 NSTHELNKIFRDRIGNEIFERVIRITK 458 S + ++ + I I +R+T+ Sbjct: 298 ASPATIKTVY-ETIAAAIRGDTLRLTQ 323 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIGLAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEI 279 >gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Nomascus leucogenys] Length = 900 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTAMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ I ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEAIQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I + + + F + + Sbjct: 367 NQEERLPDGSVS----LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEI 279 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L+ I + R+G F D + + + + Sbjct: 267 VDRLTAVKQVLSEFIA-------KREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAV 319 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI + + A T I S ++ ++LL+DG NT + + Sbjct: 320 LKLI--GTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDPL 366 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEAN 432 N AK I T+ T Q+ L A + +F A Sbjct: 367 EAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRAR 426 Query: 433 STHELNKIFRDR 444 + +L I+ Sbjct: 427 NQQDLQNIYDTI 438 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 57/195 (29%), Gaps = 39/195 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + LV+ L I R+G F + ++ + + A Sbjct: 115 TRLELVKVLLGDFIEQ-------RHGDRVGLILFGSKAYLQSPLTFDRRTVR--VWLDEA 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TAI DA+ A + + +VL+TDG N E + Sbjct: 166 SVGIAGSNTAIGDAIGLALKRLR-----------ERPANSRVLVLVTDGANNGGEIEPLL 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANST 434 A + +RI TI + L A + +F A S+ Sbjct: 215 AATLAAEENVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASS 274 Query: 435 HELNKIFRDRIGNEI 449 +L IG + Sbjct: 275 EQL-----QAIGEAL 284 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 81.6 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 166 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 220 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 221 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 278 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 279 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 334 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 335 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 378 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 66/247 (26%), Gaps = 10/247 (4%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + D + Y ++ V Sbjct: 257 TKISQNGILGDFIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASM 316 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 ++ ++ + + ++ F+ R+ V K + Sbjct: 317 VGAKLKQTKEALFTILHDLRPEDNF---NIVGFSSRIKVWKDQLVPVTPNSIRDGKVYIH 373 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T IN A+QT + N+ H + IV LTDG T + I Sbjct: 374 HMSPSGGTNINGALQTGIRLL---NDFVAHNDIDARSVS-LIVFLTDGRPTVGEIQTPKI 429 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 N K + + TI + + + L NC F K F D IG Sbjct: 430 LNNTKEAARDRVCLFTIGIGDDVDFKLLEKLSLENCGMTRRFQVEADAAAQLKGFYDEIG 489 Query: 447 NEIFERV 453 + + Sbjct: 490 TPLLSDI 496 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ H + + + +TAI DA+ A + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRTWLDEAMI--GIAGKNTAIGDAIGLAVKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 + +VL+TDG N + + A +G+RI I + Sbjct: 189 --------QRPAQSRVLVLITDGANNGGEIDPMVAAQLAAEEGVRIYAIGIGADPRQSGV 240 Query: 410 ------TQQEKARYFLSNC--ASPNSFFEANSTHELNKI 440 + + L A+ +F A + EL +I Sbjct: 241 LGAFGFSALDLDETSLRAIAEATGGEYFRARNQAELTQI 279 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 50/334 (14%), Positives = 95/334 (28%), Gaps = 72/334 (21%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + + T + Y + ++D + SM + L A + Sbjct: 1 MARDSITISGTATAEYQTAAFMDFYILLDNTPSMGVGATPDDVSKLEAKAGCAFACHQMD 60 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + N + V + +VR Sbjct: 61 KTINNYTI--------------------------------------AKSLGVAMRIDVVR 82 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISD--PSFSWGVHKLIRTIVKTFAIDENE 333 A ++ + K +D RMG F + + S L + T A+D Sbjct: 83 QATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDAVDLMT 142 Query: 334 MGSTAINDAMQTAYDTIISSNE---DEVHRMKNNLEAKKYIVLLTDGENTQDNEEG---- 386 + N+ T +D+ ++ D+ +N+ A+K + ++DG Sbjct: 143 IPYQNYNNDQITNFDSAMTQMNTIIDQAGDGTSNISAEKILFFVSDGVGDSYKPSTCTKK 202 Query: 387 -----------IAICNKAKSQGIRI---MTIAFSVNKTQ---------QEKARYFLSNCA 423 + C K +G++I T + Q + + CA Sbjct: 203 TTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQGEIPTKMQACA 262 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 SP +FE + T + + I R RIT Sbjct: 263 SPGFYFEVSPTEGITDAMKALFLKVI--RAPRIT 294 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIGLAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYKEI 279 >gi|114587340|ref|XP_516522.2| PREDICTED: similar to PK-120 precursor isoform 5 [Pan troglodytes] gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4 [Pan troglodytes] Length = 930 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 81/253 (32%), Gaps = 19/253 (7%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + +G + IR + ++Y G + +D S Sbjct: 225 SPEQQETVLDGNLIIRYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGS 284 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +K R+AL ++ + D + F+ Sbjct: 285 MSG-RKIQQTREALIKILDDLSPRD------QFNLIVFSTEASQWRPSLVPASAENVNKA 337 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 ++FA+ +G T INDAM A + SSN++E + I+LLTDG+ T Sbjct: 338 RSFAVGIQALGGTNINDAMLMAVQLLDSSNQEEQLPEGSVS----LIILLTDGDPTVGET 393 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +I N + + + F + + + L N E + + + F Sbjct: 394 NPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDF 453 Query: 442 RDRIGNEIFERVI 454 + N + V Sbjct: 454 YQEVANPLLTAVT 466 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 70/218 (32%), Gaps = 27/218 (12%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P+L +D+S K V++ + ++ + + D R+ FN Sbjct: 149 STSSRPNLDLVCIIDNSESMSGCSKIENVKNTILQLLEMLNEND------RLSLITFNSY 202 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V L + ++ G T I ++ A+ + S + Sbjct: 203 AKQLCGLK-KVSNLNKETLQAITNSIKAYGGTNITSGLEIAFQILQSRKKKNSVSS---- 257 Query: 366 EAKKYIVLLTDGENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I LL+DG++ + + + + + + I + +F + + Sbjct: 258 -----IFLLSDGQDDGADTKIKNLLKITYQQLQEESFTIHSFSFGSDHDCP-----LMQK 307 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 A SF+ ++++ F D +G +T Sbjct: 308 IAQIKDGSFYFVEKNDQVDEFFIDALGGLFSVVAQDLT 345 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTAVGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPANSRVLVLVTDGANNAGQIDPITAARLAADEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSALGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 62/469 (13%), Positives = 118/469 (25%), Gaps = 95/469 (20%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + K G+F +I AL++ +L G + VD + ++ + A + Sbjct: 4 RFAKDERGNFAMIMALVLVPLLLAGMVAVDSANLMRVRNNVQASLDAAALAVGRRFSTGE 63 Query: 73 EEVSSRAKNSFTFPKQ--KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE--------- 121 + + + F + + K TD++++ T Sbjct: 64 SQTVVQVYGAQVFTANLTALSADAVNFDVAFPKDKTTDQQIQATAGFTYKSLFGVIASRL 123 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ-------------------TKA 162 Q LSS L N + + L +S + Sbjct: 124 TGDDWDQNQYTLSSSVRLK-NTIEVALVLDNSRSMDETRSGSTKKRIDLLKEAASQLVET 182 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN----------CFGQPADRTV 212 A + H E+ V V F+ S+ + ++ F P Sbjct: 183 MAAQSTLITHVENPVQFSLV-PFAGSVNVGPQYLNAAWMDPEGRSPVNLENFTLPVTIDS 241 Query: 213 KSYSSQN------GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + LY L + L + V+S + Sbjct: 242 TRKIEEKPVGSGRYYKSGTGWGERNNKPYSRAELYADLSLRSKETWLPWQGCVESRPGTY 301 Query: 267 VIKKKHLVRDALASVI-RSIKKIDNVNDTVRMGA--TFFNDRVISDPSFSWGV--HKLIR 321 + + ++ + N R T N D S ++ L + Sbjct: 302 ALDVTPPSDNNPDTLFVPMFGPAEYYNTDSRGNVTSTVLNSWWQDDTSLAYSPRQSDLKK 361 Query: 322 TIVKTFAIDEN------------------------------------------EMGSTAI 339 ++ G T + Sbjct: 362 YYLRDSLDKIYRGGRSNDGGPNYSCTSSPLTPLTDVTTEQGMKTIQTAIKAMVPSGGTNV 421 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +AM + TI+ R K +++LTDG NT +G+A Sbjct: 422 PEAMAWGWRTIVRGAPFTEARPSTERGNDKVVIVLTDGANTYYKYDGLA 470 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 57/475 (12%), Positives = 120/475 (25%), Gaps = 85/475 (17%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 +++SV L ++E + + + + T V Sbjct: 134 TLSSSVRLKNTIEVALVLDNSRSMDETRSGSTKKRIDLLKEAASQLVETMAAQSTLITHV 193 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 E + S S + L S +E V Sbjct: 194 ENPVQFSLVPFAGSVNVGPQYLNAAWMDPEGRSPVNLENFTLPVTIDSTRKIEEKPVG-S 252 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 S + + + + + R+ +++ G V R + + + + Sbjct: 253 GRYYKSGTGWGERNNKPYSRAELYADLSLRSKETWLPWQGCVESRPGTYALDVTPPSDNN 312 Query: 241 -------------YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 YY S + + ++ ++ + S+ K Sbjct: 313 PDTLFVPMFGPAEYYNTDSRGNVTSTVLNSWWQDDTSLAYSPRQSDLKKYY--LRDSLDK 370 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I G ++ + + ++T G T + +AM + Sbjct: 371 I-YRGGRSNDGGPNYSCTSSPLTPLTDVTTEQGMKTIQTAIKAMVPSGGTNVPEAMAWGW 429 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLT---------DGENTQD---------------- 382 TI+ R K +++LT DG Sbjct: 430 RTIVRGAPFTEARPSTERGNDKVVIVLTDGANTYYKYDGLAGSGPDRAANFSYYSAHGYT 489 Query: 383 -------------------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N +C+ AKS I +MT+A +++T Sbjct: 490 ARITKHYSQARLFQESGVSVSQNNSTYTKAMNARFAKLCDNAKSANIIVMTVALDLSETN 549 Query: 412 QEKAR--YFLSNCASPNSF----------FEANSTHELNKIFRDRIGNEIFERVI 454 + L +C+S + F ++ EL++ FR R++ Sbjct: 550 STEKAQIDLLRSCSSNSRVRTESGRPAKLFWNSTGGELSETFRQIGDELSNLRIV 604 >gi|114587344|ref|XP_001172675.1| PREDICTED: similar to PK-120 precursor isoform 2 [Pan troglodytes] Length = 914 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 81/253 (32%), Gaps = 19/253 (7%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + +G + IR + ++Y G + +D S Sbjct: 225 SPEQQETVLDGNLIIRYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGS 284 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +K R+AL ++ + D + F+ Sbjct: 285 MSG-RKIQQTREALIKILDDLSPRD------QFNLIVFSTEASQWRPSLVPASAENVNKA 337 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 ++FA+ +G T INDAM A + SSN++E + I+LLTDG+ T Sbjct: 338 RSFAVGIQALGGTNINDAMLMAVQLLDSSNQEEQLPEGSVS----LIILLTDGDPTVGET 393 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +I N + + + F + + + L N E + + + F Sbjct: 394 NPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDF 453 Query: 442 RDRIGNEIFERVI 454 + N + V Sbjct: 454 YQEVANPLLTAVT 466 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 62/193 (32%), Gaps = 39/193 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ V+ + + R+G F R S+ + + R ++++ Sbjct: 117 RRIDAVKQLGSDFMSRRSG-------DRLGLILFGSRAYLQSPLSFDIQTVQRFLLESQI 169 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ A + + ++LLTDG++T + + Sbjct: 170 --GFAGQETAIGDAIGLAVKRLQ-----------ERPATSRVLILLTDGQDTASTVDPLE 216 Query: 389 ICNKAKSQGIRIMTIAFSVNK-----------------TQQEKARYFLSNCAS--PNSFF 429 N A G+RI TI + + L AS +F Sbjct: 217 AANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDENSLIAIASSTGGQYF 276 Query: 430 EANSTHELNKIFR 442 A EL ++R Sbjct: 277 RARDPEELATVYR 289 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 81.6 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 47/426 (11%), Positives = 106/426 (24%), Gaps = 62/426 (14%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ G ++T LL+ ML VGGM VD +R+ L+ A A++ A+ Sbjct: 57 FIRQFRNDEDGGLIVLTLLLLISMLVVGGMAVDFMRFESERTKLQSVADRAVLAAANLNQ 116 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + I + + T R + DT Sbjct: 117 EREAADVITDFFTAEGFGGSI---VGTPSIQKNRNGSTIRLESIVDVDTFYLRLVGIDTL 173 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW-------- 181 ++ + + L S ++ + E + Sbjct: 174 SAPANATAIEGTGNVEVSLVLDISGSMG--SRMTGDAYLYDSDGEIRLDPDGNPLTERRT 231 Query: 182 ---------------------VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 S+ + + + T S + Sbjct: 232 EDRMFFLIQEANKFIGDLLKDEYRDRVSINLVAYSQHVRLGDDLYTALNTTPDSIDEDDN 291 Query: 221 ----KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-SLRHVIKKKHLVR 275 I D +P+ + + + G D +L + K Sbjct: 292 LGSSYGSITDGYTAPFTYTWVNAEGDEVEEGTPDATLVGPFDSPIDLTWAGDNKVIRNPS 351 Query: 276 DALASV--------IRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLIRTIVKT 326 + + + + V F+ F + + ++ Sbjct: 352 RCVTFLDEEYTKLEFDTDRVYQQVEHVDFYNYETFSWPACPAEDFQGIIMMSQDKKQLQD 411 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNED--------------EVHRMKNNLEAKKYIV 372 +T+I+ M+ + + D ++ K++V Sbjct: 412 AISKYVPTLNTSIHRGMKWGVSLLDPTMRDLMAGLPTIDEAFRGTRPADYSDASTTKFVV 471 Query: 373 LLTDGE 378 ++TDG+ Sbjct: 472 IMTDGQ 477 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + + R ++ R+G F + + + L+ Sbjct: 102 AQDFSPSRFEVARRTAIQFVQ-------GRRADRIGLVVFAGQAFTQVPPTLDYRFLLTM 154 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + + TAI A+ TA + + KN+ K I+LLTDG+N + Sbjct: 155 LQRLQVGRLED--GTAIGTAIATAINRL-----------KNSEARSKVIILLTDGQNNRG 201 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--S 424 + + A+ GIRI TI S E + A + Sbjct: 202 EIDPLTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQPVPVEIDEAMMREVAEKT 261 Query: 425 PNSFFEANSTHELNKIFRDRIG-NEIFERVIR 455 +F A L I+ + + V R Sbjct: 262 GGRYFRATDARTLEAIYAEIDRLEKSPVAVTR 293 >gi|149410251|ref|XP_001508722.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 950 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 70/220 (31%), Gaps = 14/220 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 ++Y G S +D S +K R+AL V+ + D + Sbjct: 235 GYFVHYFAPTGLPTLPKSVVFVIDKSGSM-DGRKIVQTREALLKVLGDLNPED------Q 287 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FN + + K F +D G T IN+A+ A + SN+ Sbjct: 288 FNLVVFNSMISQWQPSLLKATQENVGSAKKFVLDIRASGGTNINEAVLAAVHLLDESNQR 347 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQE 413 E+ + I+LLTDGE T I + + + F + + Sbjct: 348 ELLPENSVS----MIILLTDGEPTVGETNPENIQQNIQRSLDGKYALFCLGFGFDVSYSF 403 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + L N +E + + F + + + V Sbjct: 404 LEKMALDNSGLARRIYEDSDAALQLQDFYQEVATPLLKHV 443 >gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Pan troglodytes] Length = 900 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 81/253 (32%), Gaps = 19/253 (7%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + +G + IR + ++Y G + +D S Sbjct: 225 SPEQQETVLDGNLIIRYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGS 284 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +K R+AL ++ + D + F+ Sbjct: 285 MSG-RKIQQTREALIKILDDLSPRD------QFNLIVFSTEASQWRPSLVPASAENVNKA 337 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 ++FA+ +G T INDAM A + SSN++E + I+LLTDG+ T Sbjct: 338 RSFAVGIQALGGTNINDAMLMAVQLLDSSNQEEQLPEGSVS----LIILLTDGDPTVGET 393 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +I N + + + F + + + L N E + + + F Sbjct: 394 NPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDF 453 Query: 442 RDRIGNEIFERVI 454 + N + V Sbjct: 454 YQEVANPLLTAVT 466 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 64/194 (32%), Gaps = 36/194 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + LA + R+G F + + + + + + + Sbjct: 128 VVDRLTAAKAVLADFLDRRDG-------DRIGLLVFGQQAYALTPLTADLATVRDQLRDS 180 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TA+ DA+ A + + ++ ++LLTDG NT + Sbjct: 181 VV--GLAGRETALGDAIALAVKRLREQPQG-----------ERVLILLTDGVNTAGVLDP 227 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ--------------QEKARYFLSNCA--SPNSFFE 430 + AK++ +R+ TIA + E L A + FF Sbjct: 228 LKAAELAKAEHVRVYTIALGGDGGGMSLFGMPIPGSGGDDEVDEDTLRKIAQDTGGRFFR 287 Query: 431 ANSTHELNKIFRDR 444 A T +L I+ + Sbjct: 288 ARDTAQLASIYAEL 301 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 66/190 (34%), Gaps = 33/190 (17%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + + LA + R+G F DR + + + + + + Sbjct: 123 VDRLTAAKAVLADFLDR-------RAGDRIGLLIFGDRAYTLTPLTADLASVRDQLRDSV 175 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + S E ++ ++LLTDG + E + Sbjct: 176 V--GLAGRETAIGDAIGLAVKRLRSQPEG-----------QRVLILLTDGVSNAGVLEPL 222 Query: 388 AICNKAKSQGIRIMTIAFSVNK-----------TQQEKARYFLSNCAS--PNSFFEANST 434 A+++G+RI T+AF + Q L AS FF A T Sbjct: 223 RAAEVAQAEGVRIHTVAFGGDGSMRFLGIPISADQDPVDEATLKKIASLTGGQFFRARDT 282 Query: 435 HELNKIFRDR 444 +L I+ + Sbjct: 283 AQLAGIYAEL 292 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 65/190 (34%), Gaps = 27/190 (14%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + + ++ + I S +G T F + + L+ Sbjct: 102 TDDVYPNRMAVAKEVASEFISS-------RPNDNIGLTIFAGEAFTQCPMTLDHAALL-N 153 Query: 323 IVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ D G TAI + A + S K ++LLTDG Sbjct: 154 LLHNVRPDLVTSGLMKDGTAIGMGLANAVSRLQDSKAKS-----------KIVILLTDGS 202 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--EKARYFLSNCA--SPNSFFEANST 434 N + + AK GIR+ TI F ++ L N A + F+ A S Sbjct: 203 NNVGSISPMTAAAIAKKFGIRVYTIGFGRETGEEIGAIDYRALQNIAVSTNGEFYRAQSQ 262 Query: 435 HELNKIFRDR 444 EL++I++D Sbjct: 263 AELSRIYQDI 272 >gi|74011918|ref|XP_848765.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Canis familiaris] Length = 826 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 55/382 (14%), Positives = 107/382 (28%), Gaps = 24/382 (6%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 +++ + R L K + + V K +++V Sbjct: 96 KEKAAAQEQYSAAVARGESAGLVKATGRKTEQFQVSVNVAP--AAKVTFELVYEELLKRQ 153 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L L L+ + + + + + S Sbjct: 154 LGVYELLLKVQP---QQLVKHLQMDIHIFEPQGISFLKTESTFMTSELANALTISQNKTK 210 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSE 255 + +P + +G +R + ++Y G + Sbjct: 211 AHIQFKPQKTAGQLDKVLDGNFIVRYDVNRTLSGGSIQIENGYFVHYFAPEGLPTIPKNV 270 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D S +K R+AL ++ +K D + F+ V Sbjct: 271 IFVIDKSGSMSG-RKIQQTREALIKILDDLKPND------QFNLISFSGDVTHWKPLLVP 323 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 K +A + G T INDAM TA + S+N+ E+ + I+LLT Sbjct: 324 ASPENVDQAKRYAANIEAHGGTNINDAMLTAVRLLQSANQKELLSDGSVS----LIILLT 379 Query: 376 DGENTQDNEEGIAICNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 DG+ T I A + + F + + + L N +E + Sbjct: 380 DGDPTVGETSPARIQKNVQKAIDGQYSLFCLGFGFDVSYVFLEKLALDNGGLARRIYEDS 439 Query: 433 STHELNKIFRDRIGNEIFERVI 454 + + F + N + V Sbjct: 440 DSALQLQDFYQEVANPLLTAVT 461 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 66/190 (34%), Gaps = 33/190 (17%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + + LA + R+G F DR + + + + + + Sbjct: 123 VDRLTAAKAVLADFLDR-------RAGDRIGLLVFGDRAYTLTPLTADLASVRDQLRDSV 175 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + S E ++ ++LLTDG + E + Sbjct: 176 V--GLAGRETAIGDAIGLAVKRLRSQPEG-----------QRVLILLTDGVSNAGVLEPL 222 Query: 388 AICNKAKSQGIRIMTIAFSVNK-----------TQQEKARYFLSNCA--SPNSFFEANST 434 A+++G+RI T+AF + Q L A + FF A T Sbjct: 223 RAAEVARAEGVRIHTVAFGGDGSMRLFGIPISADQDPVDEATLKKIATMTGGQFFRARDT 282 Query: 435 HELNKIFRDR 444 +L I+ + Sbjct: 283 AQLAGIYAEL 292 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 66/199 (33%), Gaps = 39/199 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++V+ + I N T +G F+ + + L++ + KT Sbjct: 111 NRLYVVKQVVKDFI-------NKRSTDPIGLVVFSANAYTQCPLTLDYGILLQFLEKTEI 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI A+ ++ D + ++ K IVLLTDG N + + Sbjct: 164 GLLED--GTAIGSAIASSVDRLRNTKAQS-----------KVIVLLTDGRNNSGQIDPLT 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-------------------SPNSFF 429 A++ I+I TI AR N + ++ Sbjct: 211 AAELAQAFNIKIYTIGAGSKGLVPYPARDLFGNRVMRQVKIDIDDESLAEIANITGGRYY 270 Query: 430 EANSTHELNKIFRDRIGNE 448 A T L +I++ E Sbjct: 271 RATDTGSLKEIYQQIDALE 289 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 63/198 (31%), Gaps = 35/198 (17%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + I + V++ ++ + K R+G F D + + + Sbjct: 104 DSSGNYIDRLTAVKNVVSQFAQQRKG-------DRLGLVLFADHAYLQTPLTLDRNTISE 156 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + TAI + + A T I S ++ ++LL+DG NT Sbjct: 157 QVNSLVLQLI--GQKTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTS 203 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPN 426 + I N AK I TI T Q+ L + A + Sbjct: 204 GVLDPIEAANIAKKYNATIYTIGVGAGEMMVKDFFMTRKVNTAQDLDEKTLMSIAKITGG 263 Query: 427 SFFEANSTHELNKIFRDR 444 +F A + EL I+ Sbjct: 264 QYFRARNAQELATIYDTI 281 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 64/197 (32%), Gaps = 37/197 (18%) Query: 264 LRHVIKKKHLVRDALASVI-RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + +V++A + + ++G F R ++ ++ Sbjct: 110 HDRPASRLTVVKNAAEQFVRDRLGD--------KIGLILFGSRAYLQTPLTYDRQTVL-- 159 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + A +T+I DA+ A + + + + I+LLTDG N Sbjct: 160 LRIEDATVGLAGKTTSIGDAVGLAVKRLDAVPQ-----------KGRVIILLTDGANNSG 208 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---------------SPNS 427 E + AK +GI+I TI + FL A + Sbjct: 209 ILEPLKAAELAKDEGIKIYTIGLGAATDPRALTNGFLMQAAAADLDEETLKEMSAMTGGR 268 Query: 428 FFEANSTHELNKIFRDR 444 +F A T LN I++ Sbjct: 269 YFRATDTATLNSIYKTI 285 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 74/243 (30%), Gaps = 30/243 (12%) Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYML--YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 G+ + + + ++Y P + + S + +K D Sbjct: 51 WFTKGLVIALWIGLVTAAARPVWYGDPVTTSPKHRDMMLVVDLSYSMSQQDMKSGDQFID 110 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 L++V + + R+G FF D + + + + + Sbjct: 111 RLSAVKQVLSDFIAKRQGDRLGLIFFADHAYLQTPLTLDRQTIAQQLNQAVLRLI--GTQ 168 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI + + A T I S ++ ++LL+DG NT + + AK Sbjct: 169 TAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTSGVLDPMEAAKIAKKY 217 Query: 397 GIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 I T+ T ++ L A + +F A + +L I+ Sbjct: 218 HTTIYTVGVGAGEMMVKEFFMTRKINTAEDLDEKTLQAIADETGGQYFRARNQQDLQHIY 277 Query: 442 RDR 444 Sbjct: 278 DTI 280 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFLQAPLTYDRQTVRVWLDEAKI--GIAGKNTAVGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + + A +G++I TI + + Sbjct: 189 --------MRPANSRVLVLVTDGANNAGQIDPLTAARLAADEGVKIYTIGIGSDPEKNAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F +L KI Sbjct: 241 QSALGLSASLDLDEPTLKEIARLSGGQYFRTRDGDQLEKI 280 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +++ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGESIALAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N + A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIDPDQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL +I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELEQIYQEI 279 >gi|126336622|ref|XP_001380249.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 923 Score = 81.2 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 52/162 (32%), Gaps = 7/162 (4%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+ V K F + MG+T INDA+ A + SN Sbjct: 307 DQFNLVIFDGHVFQWMPALLQASSQNVEQAKKFTSLISAMGATNINDAVLLAVKMLDDSN 366 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + ++LLTDG+ T I K+ + + F + Sbjct: 367 RKEKLPPGSVS----MVILLTDGDATDGETNPKKIQENVKAAIGGSYHLYCLGFGFDVNY 422 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + L N +E + + + F + N + +V Sbjct: 423 AFLEKLALENGGVARRIYEDSDSDLQLQDFYQEVANPLLTKV 464 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 81.2 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 62/217 (28%), Gaps = 10/217 (4%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y ++ V +R ++ + + R Sbjct: 277 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTILHDLRP---QDRFNI 333 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F++R+ V K + + G T IN A+QTA + N H Sbjct: 334 IGFSNRIKVWKDNLISVTPNSIRDGKIYIHHMSPTGGTDINGALQTAIRLL---NNYVAH 390 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKAR 416 + IV LTDG+ T + I N K + I TI + + + Sbjct: 391 NDIEDRSVS-LIVFLTDGKPTVGETHTLKILNNTKEAAQGRVCIFTIGIGNDVDFKLLEK 449 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E + F D I + + Sbjct: 450 LSLENCGLTRRVHEEENAGAQLIGFYDEIRTPLLSDI 486 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 81.2 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 27/261 (10%), Positives = 77/261 (29%), Gaps = 1/261 (0%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +++ + G II L + VM+G G+ +D+ + L+ +A ++A+ Sbjct: 6 RYPKVTRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAA 65 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L ++ + A + + + + + N T + T + Sbjct: 66 RDLTSAISLSVAEADGIAAGHLNFVF-FQKTSVQMSTNANVTFSDSLTNPFLTKNAVTTP 124 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + V ++ + + L + + A ++ + +I + Sbjct: 125 ANIKYVQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCR 184 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S Y+ + +++ +G +CN + L Sbjct: 185 GPSDPAYKVGDWISSPSGSSSTYGPGNFGWAALDGSTNETTLASELSGNTCNITSPPDLA 244 Query: 246 PGPLDPSLSEEHFVDSSSLRH 266 + + + Sbjct: 245 TTGMKSASQRAWNTRFGIYTN 265 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 81.2 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 66/192 (34%), Gaps = 35/192 (18%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +++ VRD A I + R+G F R S+ + + + Sbjct: 112 NQAVRRIDAVRDIGAEFIER-------REGDRVGLILFGSRAYMQSPLSFDRDTVKQFLS 164 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + TAI DA+ A + + + ++LLTDG++T + Sbjct: 165 EAQI--GFAGSETAIGDALGLAVKRLRDKEDG-----------DRVVILLTDGQDTASSV 211 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFF 429 + + A + G+++ TI + E LS A + +F Sbjct: 212 DPLDATALAANYGVKVYTIGIGADEMLVPSLFGNRRVNPSAELDEETLSAMAESTGGRYF 271 Query: 430 EANSTHELNKIF 441 A S EL KI+ Sbjct: 272 RARSPDELAKIY 283 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 81.2 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 37/344 (10%), Positives = 88/344 (25%), Gaps = 7/344 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++++L + G I ALL + + ++DV R S ++ A A + A+ Sbjct: 15 VAAFARRLRRDDRGAIAIQFALLALPLSILLFGVLDVSRLSLQRRQMQDALDAATLMAAR 74 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR-EVRDIVRDTAVEMNPR 125 + ++ + +F + L + + + Sbjct: 75 SAATASADLDTTGDAAFLAEIAGMNLGLTASSSTFSVGTGNRVIGTATATLKPIIANLWQ 134 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + V +S L + L S A+ + + Q Sbjct: 135 AGDFTVTATSEVVRSSKNLEVALVLDITGSMSGTRIADLKVAASDLVDIVIRDTQTPFYS 194 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 +++ Y + + G ++ + + S + L Sbjct: 195 KVALVPYAAGVNVDTYADMARGPIPVRNISNVAWLATGTSIRGVTKALPALLWSDNHGLV 254 Query: 246 PGPLDPSLSEEHFV-----DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 G V + + + + L + +K T+R T Sbjct: 255 TGDRVFISGISGGVLTSMASLNGAIYTVTRLDSNVVYLNGIDTRLKSNSPSGGTIRKCLT 314 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + ++ S + G + G T + Sbjct: 315 S-DCNLVVTTSAAHGFDTGDQISFANLGGLTTLNGQTYSITDLT 357 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/284 (12%), Positives = 76/284 (26%), Gaps = 43/284 (15%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 T + L + + P L+ ++ +R + Sbjct: 350 TYSITDLTSNTFDTGVPGLGTAAFTSGGTATCEQSTTPGCRRLAYVSNWGTNEVRALSTC 409 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 A A + + A ++ + +T +K + Sbjct: 410 VSERTGADAYTDAAPSTAFVGTNYPSTSADSYSPNPCPSAKITPLSS--DKTALKAQINN 467 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVH------RMKNNLEAKKYIVLLTDGENTQDN- 383 + GSTA + + + + + + K ++++TDG Sbjct: 468 YSVGGSTAGQIGLAWGWYMVAPNFGYIWPSASQRPAAYKSKDLMKVVIMMTDGAFNTPYC 527 Query: 384 -----------------------------EEGIAICNKAKS--QGIRIMTIAFSVNKTQQ 412 + A+C K+ I + T+ F+V Sbjct: 528 NGVIAANAGIGSGSDEDHINCNATNGDPFAQARALCTVIKNSANDITLYTVGFAVGSDYT 587 Query: 413 EKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEIFERVIR 455 + FL++CAS +S F + EL F R+ + Sbjct: 588 A--KTFLTDCASDSSKAFFPATGSELKASFTAIAREISSLRIAK 629 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 64/208 (30%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + I K R+G ++ + + +++++ Sbjct: 111 NRIEALKRVASQFIEERKS-------DRIGIVVYSGESYTKVPATTDKSIVLQSLK--DI 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + TA + + S K I+L+TDG N + ++ Sbjct: 162 KQGEIEDGTAIGMGLGTAINRLKDS-----------KTKSKVIILMTDGVNNTGVIDPLS 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 AK GIR+ TI N L A + + Sbjct: 211 AAELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSLQYDMVPVEIDEKLLGEIAQSTGGKY 270 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 F A +L +I+ + I ++ + Sbjct: 271 FRATDNKKLAQIYTE-IDKLEKSKIEEL 297 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 60/193 (31%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I K R+G F D + L + + + Sbjct: 133 YIDRLSAVKHVLSDFIERRKG-------DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQA 185 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI D + A T + S ++ ++LL+DG NT + Sbjct: 186 VLKLI--GTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 232 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ T Q+ L A + +F A Sbjct: 233 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 292 Query: 432 NSTHELNKIFRDR 444 + EL I+ Sbjct: 293 RDSKELATIYDTI 305 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALAVKRFDKID 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEI 279 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +A+ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALAVKRFDKID 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL++I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEI 279 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRFWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 27/261 (10%), Positives = 78/261 (29%), Gaps = 1/261 (0%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +++ + G II L + VM+G G+ +D+ + L+ +A ++A+ Sbjct: 6 RYPKVTRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAA 65 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L ++ + A + + ++ + + N T + T + Sbjct: 66 RDLTSAISLSVAEADGIAAGHLNFVF-FQKKSVQMSTNANVTFSDSLTNPFLTKNAVTTP 124 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + V ++ + + L + + A ++ + +I + Sbjct: 125 ANIKYVQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCR 184 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S Y+ + +++ +G +CN + L Sbjct: 185 GPSDPAYKVGDWISSPSGSSSTYGPGNFGWAALDGSTNETTLASELSGNTCNITSPPDLA 244 Query: 246 PGPLDPSLSEEHFVDSSSLRH 266 + + + Sbjct: 245 TTGMKSASQRAWNTRFGIYTN 265 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 63/197 (31%), Gaps = 41/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ + I+ T R+G ++ + + + ++ Sbjct: 108 DRLTALKEIARTFIKQ-------RTTDRIGLVEYSGEALMRVPLTSDHRVVEEELMS--F 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-GI 387 + G T I D + A + S K I+L+TDG NT DN + Sbjct: 159 NPMDLEGGTNIGDGLAVAVSHLRKSKAKS-----------KIIILMTDGVNTIDNAMSPL 207 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARY------------------FLSNCA--SPNS 427 A++ I++ TI N + L + A + Sbjct: 208 TAAELARNNDIKVYTIGIGSNGLALMPTQQDIFGNLVFTEEQVKIDEYLLRDVAQITGGK 267 Query: 428 FFEANSTHELNKIFRDR 444 +F A S L +I+ + Sbjct: 268 YFRATSNESLKQIYEEI 284 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 239 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 293 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 294 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 351 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 352 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 407 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 408 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 451 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 65/190 (34%), Gaps = 33/190 (17%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + + LA + R+G F DR + + + + + + Sbjct: 123 VDRLTAAKAVLADFLDR-------RAGDRIGLLIFGDRAYTLTPLTADLASVRDQLRDSV 175 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + S E ++ ++LLTDG + E + Sbjct: 176 V--GLAGRETAIGDAIGLAVKRLRSQPEG-----------QRVLILLTDGVSNAGVLEPL 222 Query: 388 AICNKAKSQGIRIMTIAFSVNK-----------TQQEKARYFLSNCAS--PNSFFEANST 434 A+++G+RI T+AF + Q L A FF A T Sbjct: 223 RAAEVARAEGVRIHTVAFGGDGSMRVFGISISADQDPVDEATLKKIAGMTGGQFFRARDT 282 Query: 435 HELNKIFRDR 444 +L I+ + Sbjct: 283 AQLAGIYAEL 292 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 61/193 (31%), Gaps = 39/193 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ V+ + + R+G F R S+ + + R +++ Sbjct: 109 RRIDAVKQLGSDFMSRRSG-------DRLGLILFGSRAYLQSPLSFDIQTVQRFLLEAQI 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ A + + +VLLTDG++T + + Sbjct: 162 --GFAGQETAIGDAIGLAVKRLQERPASS-----------RVLVLLTDGQDTASTVDPLE 208 Query: 389 ICNKAKSQGIRIMTIAFSVNK-----------------TQQEKARYFLSNCA--SPNSFF 429 N A G+RI TI + + L A + +F Sbjct: 209 AANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDESTLIEIARSTGGQYF 268 Query: 430 EANSTHELNKIFR 442 A EL ++R Sbjct: 269 RARDPEELATVYR 281 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 80.8 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 41/380 (10%), Positives = 107/380 (28%), Gaps = 25/380 (6%) Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV-EMNPRKSAYQVV 132 + + T +Q I + + ++ + + R + ++ Q+ Sbjct: 89 QANDAMIEHRTLAEQAISDDKSADMSIDVSSRSYEAGKQASERMIKMKSLSRNHIMGQMS 148 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S + ++ + T S +++ I+ + + Sbjct: 149 APGLPAFREASSSDNFKRQTANGIMVAGEIPVSTFSIDTDTGSYTTLRRWINQ-GRLPEK 207 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + +N F + + + + + L + Sbjct: 208 GTVRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNDHKMLLRIGLKGYEVDKSQLGA 267 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +D S + K L++ +L + + + + D+V+ V + Sbjct: 268 SNLVFLLDVSGSMNSRDKLPLLKTSLKMLSQQLSEQDHVSIVV---------YAGASGVV 318 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 GV + + GST +Q AY + + ++ Sbjct: 319 LDGVKGNDIYAINQALNNLKAGGSTNGGAGIQQAYGL--AQKHFIQGGVNR-------VI 369 Query: 373 LLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 L TDG+ + A+ + + QGI + T+ F + Sbjct: 370 LATDGDFNVGTTDHQALMDLIASKRDQGIALTTLGFGQGNYNDHLMEQLADK--GNGHYA 427 Query: 430 EANSTHELNKIFRDRIGNEI 449 ++ +E K+ D + + + Sbjct: 428 YIDTLNEARKVLVDELSSTL 447 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 62/165 (37%), Gaps = 7/165 (4%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R+G F D F+ + + +T ST + DA+ Sbjct: 136 LADFAKTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIG 193 Query: 345 TAYDTI-ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A S + + +N++E +K +++LTDG +T E I AK++G+RI I Sbjct: 194 LAIKVFEQSGKQMSAEQAQNDIEREKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVI 253 Query: 404 AFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 A + + A S FEA + EL+ + Sbjct: 254 AMGDPQTVGEVALDMETIKRIASESGGEAFEALNRDELSTAYEQI 298 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 51/167 (30%), Gaps = 28/167 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + TAI DA+ + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLEEAAI--GIAGKDTAIGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 + ++L+TDG NT A ++G+RI TI + Sbjct: 189 --------QRPAQSRVLILVTDGANTAGEIAPSVAARLAAAEGVRIHTIGIGADPRQDGP 240 Query: 410 -------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + L A + S+F A S+ EL + + Sbjct: 241 PGLLGLTPGLDLDEPTLRAIAEETGGSYFRARSSEELR-AIEETLAR 286 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 59/150 (39%), Gaps = 1/150 (0%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + +LI + G II L+ +L G +D +R L+ A +A++ A Sbjct: 2 TRRLSFFSRLIDNRDGAVAIIVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLAA 61 Query: 65 SVPLIQSLEEVS-SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + E S S N+F ++ +I E + + +V D V T +++ Sbjct: 62 AAKYKHGMPEASISGTVNAFLSANGTLKSAVIGKPEVSSDEAELCLDVGDAVPTTFMQVA 121 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 +S V S + L + L Sbjct: 122 NIQSVPISVRSCAALPGVKQLEIALVLDVS 151 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 65/199 (32%), Gaps = 34/199 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L I R+G F D + V + Sbjct: 103 DMAYNGQYVDRLTMVKAVLTDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDVKTV 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + +TAI DA+ + + ++ + +VLLTDG+N Sbjct: 156 SKMLSEAQI--GLVGRATAIGDALGLSVKRFANKDKSN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T N + A+ GI++ TI + + L A + Sbjct: 203 TAGNLKPEEALLLARDAGIKVYTIGVGSDNPRGFSLFNMGGMSGDTIDEGLLKRIAEQTG 262 Query: 426 NSFFEANSTHELNKIFRDR 444 +F A L +I+ + Sbjct: 263 GLYFRAKDVAGLQQIYAEL 281 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTAVGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPANSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 110 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTAVGDAIGLALKRLR--- 164 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 165 --------LRPANSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 216 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 217 QSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 256 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 80.8 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTAVGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPANSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 59/170 (34%), Gaps = 13/170 (7%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ + S+ + R + + ST I D + A + + Sbjct: 147 DRIGLVIFADQAYVAAAPSFDTAAVARALDEATI--GISGRSTGIGDGLGLALRRLDPRD 204 Query: 355 EDEVHRMKN--NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------ 406 + + K ++LL+DG N + A+ GI++ TIA Sbjct: 205 AGGEAASGSKPGEKPAKAVILLSDGANNAGQTAPKDVAELARELGIKVYTIALGPRDMAD 264 Query: 407 VNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + Q L + AS F +T +L ++ D I R + Sbjct: 265 ADGEQDVVDTETLRDMARASGGEAFRVRTTEDLVRV-ADAIDRLEGGRAL 313 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 60/193 (31%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I K R+G F D + L + + + Sbjct: 116 YIDRLSAVKHVLSDFIERRKG-------DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQA 168 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI D + A T + S ++ ++LL+DG NT + Sbjct: 169 VLKLI--GTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 215 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ T Q+ L A + +F A Sbjct: 216 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 275 Query: 432 NSTHELNKIFRDR 444 + EL I+ Sbjct: 276 RDSKELATIYDTI 288 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 53/327 (16%), Positives = 99/327 (30%), Gaps = 19/327 (5%) Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 Y + + P L R + L + + V + +H + + D S Sbjct: 183 EYMVSIRPQQLVGRLRVEVNILENSGIISLEVPPLRNSKHKGNGKAEGDVSPPPSTVIGQ 242 Query: 196 SEGQPLNCFGQPADRTVK-SYSSQNGKVGIRDEKLSPYMVS-----CNKSLYYMLYPGPL 249 ++ F K + + G +R + V ++Y Sbjct: 243 TKTLAKVTFSPNVVEQTKIARNGILGDFIVRYDVNRELSVGDVQILNGYFVHYFAPMDLP 302 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + +DSS+ V K ++AL ++++ ++ D+ N + Sbjct: 303 PLPKNVVFVLDSSASM-VGTKLRQTKEALFTILQDLRPEDHFNIIGFSNRIK-VWQQDRL 360 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + K + + + G T IN A+QT + ++ Sbjct: 361 VPVTPN----NIRDAKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNIDARSVS---- 412 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+ LTDG T + I + K + TI + + R L NC Sbjct: 413 LIIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGMVR 472 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERV 453 F E K F D IG + + Sbjct: 473 HFQEDEDAASHLKGFYDEIGTPLLSDI 499 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 70/220 (31%), Gaps = 13/220 (5%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 ++Y + +DSS+ V K ++AL ++++ ++ D+ N Sbjct: 294 GYFVHYFAPMDLPPLPKNVVFVLDSSASM-VGTKLRQTKEALFTILQDLRPEDHFNIIGF 352 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + K + + + G T IN A+QT + Sbjct: 353 SNRIK-VWQQDRLVPVTPN----NIRDAKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQ 407 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 ++ I+ LTDG T + I + K + TI + + Sbjct: 408 NNIDARSVS----LIIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKL 463 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L NC F E K F D IG + + Sbjct: 464 LERMALENCGMVRHFQEDEDAASHLKGFYDEIGTPLLSDI 503 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 55/186 (29%), Gaps = 34/186 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + LV+ L I R+G F + ++ + + + Sbjct: 115 TRLELVKVLLGDFIEQ-------RHGDRVGLILFGSKAYLQAPLTFDRRTVRVWLDEARV 167 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TAI DA+ A + + +VL+TDG N + + Sbjct: 168 --GIAGSNTAIGDAIGLAVKRLR-----------ERPTNSRVLVLITDGANNGGELDPLL 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFEANST 434 A + +RI TI + L A + +F A S+ Sbjct: 215 AATLAAEESVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASS 274 Query: 435 HELNKI 440 EL I Sbjct: 275 AELKAI 280 >gi|323137991|ref|ZP_08073065.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] gi|322396710|gb|EFX99237.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] Length = 296 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 26/270 (9%), Positives = 61/270 (22%), Gaps = 1/270 (0%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + L++ G +I A M+ + G VD R L+QA +++A+V Sbjct: 16 LRTLLRDRNGAVAVIFAFATLPMVLLIGGAVDYSRAIGARSNLQQALDAGVLSAAVKGGN 75 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP-RKSAY 129 +R NS P + + V ++M Sbjct: 76 PDSGQLARYLNSNMSPGGAATNVTLTRSVATGGAVTFVGDADFSVATNFLKMAGLGAIKL 135 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + S ++ ++ ++ + Sbjct: 136 HSHSEATLPAQIVTATFKPTSAQGAYSKDIFIWTKNAAGTVVSRQTVLTYRYNSSNGSKV 195 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 S K + +M G Sbjct: 196 TTPAIGSWSLTFTVPLYSTFGVGMVVYQDWQKYSGALVNPVELWSDAANASTFMRQSGLC 255 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + + + + + ++ A Sbjct: 256 TDDAGATYNWEDGGDGNFVDFVYTMKCATG 285 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 65/209 (31%), Gaps = 41/209 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I R+G F + + L + Sbjct: 108 DRIQAAKQVAGNFITD-------RPGDRIGLVAFAGEAFTQSPLTTDQGTL--QTLLGRL 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + + TA + + SN K I+LLTDGEN + + Sbjct: 159 RSGVVEDGTAIGNGLATAINRLRESNAKS-----------KVIILLTDGENNRGEIAPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCA--SPNSFF 429 A+ QGIR+ TI T + L A + +F Sbjct: 208 AAEIARDQGIRVYTIGVGTRGTAPYPTVDFFGNPTVVQAKVQIDEKILGEIADLTGGRYF 267 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 A +L I+ + N++ + + I++ Sbjct: 268 RATDNAKLQSIYDEI--NQLEKSKVEISQ 294 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 26/179 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + ++ I + +G F S ++ + L+ Sbjct: 105 KSRYEVAKEIAKRFI-------KNRFSDNIGIVIFGSFSFSASPLTYDLKALLEMFDLMS 157 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +TAI DA+ A + ++ K I+LLTDG++ + Sbjct: 158 -DVGIAGNNTAIGDAIFEAIKNL-----------ESGEAKSKVIILLTDGKHNFGKKSPK 205 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 +AK +GI+I T+ + + L A + F A ++ EL ++F++ Sbjct: 206 EGVVEAKKRGIKIYTVGIGTDYD-----KKLLEKMAKETNAKSFFAKNSKELEEVFKEI 259 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +++ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGESIALAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL +I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQEI 279 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +++ A + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGESIALAVKRFDKMD 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A ++ + I T+ + Sbjct: 186 ESN-----------RVLILLTDGSNNAGNIEPEQAAQIAANRKVTIYTVGVGADVMERRT 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + +F A ++ EL +I+++ Sbjct: 235 LFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQEI 279 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 80.4 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 64/204 (31%), Gaps = 35/204 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + L I R+G F + S+ + + Sbjct: 107 DMKINQQAANRLDAAKQVLNRFITE-------RQGDRIGIIVFGSKAYLQAPLSYDLDTI 159 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +T +TAI DA+ + + + D K+ ++L+TDG N Sbjct: 160 AQLVNETQI--GFAGENTAIGDAIGLGIKRLANIDAD-----------KRVMILMTDGAN 206 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + A QG++I TI + L A + Sbjct: 207 TAGRVKPDQAAQFAAKQGVKIHTIGIGAEQMVSQGFFGPRVINPSTDLDEELLQKVADLT 266 Query: 425 PNSFFEANSTHELNKIFRDRIGNE 448 +F A ST EL I+ E Sbjct: 267 QGQYFRAKSTQELASIYATLDALE 290 >gi|167399327|ref|ZP_02304851.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167051831|gb|EDR63239.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] Length = 253 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 66/212 (31%), Gaps = 38/212 (17%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT-----FFNDRVISDP 310 ++F+D + + I + + +I ID +T+ + FF + + Sbjct: 45 KNFLDKTYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAK 104 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S ++ ++ G T + + + S + K Sbjct: 105 SLNFDP---NSKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNN-----------KL 150 Query: 371 IVLLTDGENTQDN-----------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 +++L+DG++ +C K K GI+++ I Sbjct: 151 MIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPD--N 208 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + +C +F+ A + HEL + Sbjct: 209 NIIDWEKDCVGTGNFYLAKNAHELEISIERAL 240 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 66/199 (33%), Gaps = 34/199 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L+ I R+G F D + + + Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQRTG-------DRLGLILFGDTAFLQTPLTRDLKTV 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + +TAI DA+ + S ++ + +VLLTDG+N Sbjct: 156 TKMLNEAQI--GLVGRATAIGDALGLSVKRFASKDDSN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SP 425 T N A+ +GI++ TI + L N A + Sbjct: 203 TAGNLNPDDALLLAREEGIKVYTIGVGSDNPRGFSLFNMGGSGGSNLDERLLKNIADDTG 262 Query: 426 NSFFEANSTHELNKIFRDR 444 +F A L +I+ + Sbjct: 263 GLYFRAKDVAGLKQIYAEL 281 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 64/183 (34%), Gaps = 28/183 (15%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + L +V + R+G F ++ ++ + + R + + Sbjct: 111 LVNRLQTVKAVVTDFVEERKGDRLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVV--GLA 168 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI DA+ + E + ++LLTDG+NT E + A Sbjct: 169 GNKTAIGDAIGLGVKRLQDLPESN-----------RVLILLTDGQNTAGEIEPLKAAELA 217 Query: 394 KSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 + G++I I + ++ L+ A + ++ A + +EL Sbjct: 218 EKAGVKIYAIGIGADEMVIQGFFGPRRVNPSRDLDEDTLTAIAENTGGQYYRARNVNELE 277 Query: 439 KIF 441 +I+ Sbjct: 278 QIY 280 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARI--GIAGKNTALGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + + A +G++I I + + Sbjct: 189 --------MRPATSRALVLVTDGANNAGQIDPVTAARLAAEEGVKIYAIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 71/217 (32%), Gaps = 27/217 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + +D L+ + + + + ++ + I S + Sbjct: 75 QTYNTWDNKDTEGIDIMLTMDISASMLTEDVFPNRMEVAKEVASEFISS-------RPSD 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + L+ ++ D G TAI + + + Sbjct: 128 NIGLTIFAGEAFTQCPMTLDHAALL-NLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRLK 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNK 409 S K ++LLTDG N + + AK GIR+ TI + Sbjct: 187 DSKAKS-----------KVVILLTDGSNNVGSISPMTAATIAKKFGIRVYTIGLGRETGE 235 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L + A + F+ A S EL+KI++D Sbjct: 236 DIGAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQDI 272 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARI--GIAGKNTALGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + + A +G++I I + + Sbjct: 189 --------MRPATSRALVLVTDGANNAGQIDPVTAARLAAEEGVKIYAIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 60/193 (31%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I K R+G F D + L + + + Sbjct: 105 YIDRLSAVKHVLSDFIERRKG-------DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQA 157 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI D + A T + S ++ ++LL+DG NT + Sbjct: 158 VLKLI--GTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 204 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ T Q+ L A + +F A Sbjct: 205 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 264 Query: 432 NSTHELNKIFRDR 444 + EL I+ Sbjct: 265 RDSKELATIYDTI 277 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 14/224 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA +G T INDAM A + SS Sbjct: 309 --QFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 N++E + I+LLTDG+ T +I N + + + F + + Sbjct: 367 NQEERLPEGSVS----LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVS 422 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N E + + + F + N + V Sbjct: 423 YAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 80.4 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 60/217 (27%), Gaps = 10/217 (4%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y ++ V +R ++ + + Sbjct: 234 NGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDHFNV 290 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F++R+ V K + + G T IN A+Q A + N H Sbjct: 291 IGFSNRIKVWKDHLVSVTPDSVRDGKVYIHHMSPTGGTDINGALQRAIRLL---NNYVAH 347 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKAR 416 + I+ LTDG+ T + I N K I I TI + + + Sbjct: 348 NDIEDRSVS-LIIFLTDGKPTVGETNTLKILNNTKEATRGQICIFTIGIGDDVDFKLLEK 406 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC + + F D I + + Sbjct: 407 LSLENCGLTRRVHDEDDAGSQLIGFYDEIRTPLLSDI 443 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRIWLDEARI--GIAGKNTALGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 51/163 (31%), Gaps = 27/163 (16%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D R+G F + ++ + + + +TAI DA+ A + Sbjct: 131 RDGDRVGLILFGSQAYLQAPLTFDRRTVRVWLDEARI--GIAGKNTAIGDAIGLALKRLR 188 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++L+TDG N + + A S+G++I I + + Sbjct: 189 -----------MRPAQSRVLILVTDGANNGGEIDPLTAAKLAASEGVKIYPIGIGADPEE 237 Query: 412 Q------------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A + +F A +L I Sbjct: 238 SGATALLGGNPTLDLDEPALKAIAEVTGGRYFRARDGKQLQAI 280 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 66/243 (27%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D + Y ++ V Sbjct: 250 QNGILGDFIIRYDVSREQSVGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTK 309 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +R ++ + + R F++R+ V K + + Sbjct: 310 LRQTKDALFTILHDLRP---QDRFSIIGFSNRIKVWKDHLISVTPDSVRDGKVYIHHMSP 366 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN+A+QTA + N+ H + I+ LTDG+ T + I N Sbjct: 367 TGGTDINEALQTAIRLL---NKYVAHSDSGDRSVS-LIIFLTDGKPTVGETHTLKILNNT 422 Query: 394 KSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + I TI + + + L NC E F D I + Sbjct: 423 REAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLL 482 Query: 451 ERV 453 + Sbjct: 483 SDI 485 >gi|118394228|ref|XP_001029494.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89283721|gb|EAR81831.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 406 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 75/242 (30%), Gaps = 32/242 (13%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF-----VDSSSLRHVIKKKHLVRDALASV 281 ++ P MVS +P + +D+S K V++ + + Sbjct: 130 DQTIPVMVSVKTLDQTNDMEVESNPLEGRPNLDLICVIDNSGSMSGCSKIENVKNTILQL 189 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + + D R+ FN + V+ + ++T G T I Sbjct: 190 LEMLNEND------RLSLITFNTKAKQLCGLK-KVNNQNKESLQTITKSIKADGGTDITS 242 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN----EEGIAICNKAKSQG 397 ++ A+ + S + I LL+DG++ + + + + Sbjct: 243 GLEIAFQILQSRKQKNSVSS---------IFLLSDGQDDGADIKIKNLLKTTYQQLQEES 293 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + F + + A SF+ ++++ F D +G Sbjct: 294 FTIHSFGFGNDHDGP-----LMQKIAQIKDGSFYFVEKNDQVDEFFIDALGGLFSVVAQD 348 Query: 456 IT 457 +T Sbjct: 349 LT 350 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 34/192 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V+ + + L+ + R+G F R + + + + + + Sbjct: 122 VVDRLTAAKAVLSDFLDRRDG-------DRVGLLVFGQRAYALTPLTADLTSVRDQLRDS 174 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 175 VV--GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLDP 221 Query: 387 IAICNKAKSQGIRIMTIA-----------FSVNKTQQEK-ARYFLSNCA--SPNSFFEAN 432 + AK++G+RI TIA + + L A + FF A Sbjct: 222 LKAAELAKAEGVRIHTIAFGGGGGSSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRAR 281 Query: 433 STHELNKIFRDR 444 T EL I+ + Sbjct: 282 DTEELAGIYAEL 293 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 33/265 (12%), Positives = 83/265 (31%), Gaps = 27/265 (10%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G++ ++TA+ M ++G + +D + + + A A + L + Sbjct: 6 RQFRRDRRGNYALMTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLTEG 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + ++ N + N T + T ++ Sbjct: 66 ATD---------DQLRAYALDFFNANLNDLNPANAT-------LNLTLPSNTAGGGLLKM 109 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 Y P F + +G KS ++ + V+ ++ + + V+D S SM Sbjct: 110 TARLNYKPYFYP--AFAQLVG-KSATDANQSISFDVTSQVRLKNTLEVALVLDNSGSMTT 166 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--------SPYMVSCNKSLYYM 243 S + ++ + + V + + Q + D + + V Sbjct: 167 LGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDRPVQFGLVPFAASVNVGSGNGNASW 226 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVI 268 + L P +E + + Sbjct: 227 MDTEGLSPVSNENFDWSTLNAADKY 251 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 28/253 (11%), Positives = 60/253 (23%), Gaps = 7/253 (2%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 Y D+ + + + + S S N +G+ D Sbjct: 304 YSNNTCKRSHNEYDYVDTYGPFASWQGCVEARPYPYNVNDAPPSGGSANTGIGVGDPATM 363 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 + E V ++ + + K Sbjct: 364 FVPMFAPDEPGNHWKLT---QDPDEAAPVTYGAVNSWWNDDPSSTSGQSRLRNMAKYFQP 420 Query: 291 VNDTVRM----GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 ++ + + T +K G T + + M Sbjct: 421 RPINAPALPTGNGPNYSCTTNAITPLTDVSVTDGATAIKAAIDLMQPNGGTNVPEGMAWG 480 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + S R++ K +++LTDG NT ++ + A S+ + Sbjct: 481 WRVVSSGEPFTQGRLETEKGNDKVVIVLTDGANTYYTPSSLSYSDPADSKSTYASYGYLN 540 Query: 407 VNKTQQEKARYFL 419 R F+ Sbjct: 541 PGYNGTSVGRMFM 553 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 60/193 (31%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ + R+G F D + +++ I +T Sbjct: 120 YIDRLTAVKQVLSDFVA-------KRKGDRLGVVLFGDHAYLQTPLTADRKSVMQQINQT 172 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI D + T + S ++ ++LL+DG NT E Sbjct: 173 VI--GLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEP 219 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + AK I T+ T + L+ A + +F A Sbjct: 220 LEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRA 279 Query: 432 NSTHELNKIFRDR 444 T EL KI+ Sbjct: 280 RDTEELEKIYDTI 292 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 63/208 (30%), Gaps = 36/208 (17%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + A +I + D V F + S Sbjct: 95 DDSGSMQATDYSPNRLEATKSAAEELINDLDPKDYVG------IVVFESGASTASYLSPD 148 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 +I + + G+TAI D + + + KK ++LL+ Sbjct: 149 KDSVIENLE----NIMEKDGATAIGDGLSLGINM-----------ADSIPNRKKVVILLS 193 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA 423 DG N AK I++ TI N E L A Sbjct: 194 DGVNNAGVISPDEAIQFAKDSDIQVFTIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIA 253 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEI 449 + +F++ LN+I+ + I +EI Sbjct: 254 DETGGKYFKSVDDQTLNEIYSN-INSEI 280 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 47/148 (31%), Gaps = 17/148 (11%) Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F D + + + K TAI DA+ A + E Sbjct: 1 MFADHAYLQAPLTLDNLAVQSLLQKAVI--GMAGRDTAIGDAIGLAVKKLRERPEGS--- 55 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYF 418 + ++LLTDGEN + + AK IRI TI + Sbjct: 56 --------RILILLTDGENNAGALKPLQAAELAKQYDIRIYTIGVGGKGGMFSRGLNETE 107 Query: 419 LSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + ++F A + LN ++ Sbjct: 108 LKKIAQLTNGAYFPATNLGALNNVYEHI 135 >gi|260463262|ref|ZP_05811463.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030852|gb|EEW32127.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 644 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 74/271 (27%), Gaps = 28/271 (10%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +K G++ ++T L M ++G + +D A A I + ++ Sbjct: 1 MIRKFANDRRGNYALMTVLAMVPLMGALAIGIDYTEMVRERQNALNALDAAGIATAQQIV 60 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + A + F + + N + T N Sbjct: 61 AGATDAEAIAY-AKNFFEANLAHIDPAN---------------TTLAVTLPNNNTGGGTL 104 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ + Y P + L A ++ + + V+D S SM Sbjct: 105 KLCGTLTYKPYFLPTAKIL----AGGTSGNATTMAFNTCSEVRLKNTLEVSLVLDNSGSM 160 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--------SPYMVSCNKSLY 241 + + S + A + V + Q + + + + V + Sbjct: 161 KELGKGSNKVRFDLLKDAAKQLVDQLAGQAQLMKQVSKPVQFSLVPFAASVNVDPGNASA 220 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + + P E S S + K Sbjct: 221 AWMDTTGISPIHHENFDWTSMSSSYSSTKYA 251 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 39/144 (27%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + L S S + + + + + Sbjct: 347 FAPDETDLTDSNSRPANNNWRTDVTSNSSSAIRQRFMPKYFADPGSTTVTPSYGMDAGPN 406 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + + VKT G+T + + M + T+ S+ R Sbjct: 407 TSCSTTPIKPLTDVSTTAGASAVKTAIDAMAADGATNVPEGMAWGWRTLSSTAPFTEGRP 466 Query: 362 KNNLEAKKYIVLLTDGENTQDNEE 385 + K +++LTDG NT + Sbjct: 467 ETERGNDKVLIVLTDGANTYYTPD 490 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 42/322 (13%), Positives = 88/322 (27%), Gaps = 47/322 (14%) Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 ++ + + P + + +S+ R + S + + Sbjct: 322 SRPYPYNINDTPAATGTPATLFVPMFAPDETDLTDSNSRPANNNWRTDVTSNSSSAIRQR 381 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK-----KHLVRDA-LASVIRSIKKIDNVND 293 + P +++ + +D+ + A ++V +I + Sbjct: 382 FMPKYFADPGSTTVTPSYGMDAGPNTSCSTTPIKPLTDVSTTAGASAVKTAIDAMAADGA 441 Query: 294 TVRMGATFFNDRVISDPSFSW----------------GVHKLIRTIVKTFAIDENEMGST 337 T + R +S + I + G+ Sbjct: 442 TNVPEGMAWGWRTLSSTAPFTEGRPETERGNDKVLIVLTDGANTYYTPDSVIAQTYSGTN 501 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY------IVLLTDGENTQD----NEEGI 387 A A + I S V N + V+ +D N NE Sbjct: 502 YNYGANDLAGNKAIYSALGYVTPYSNGYSYGRMFLGTSSSVIKSDYSNANYTKAMNEHFT 561 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQE--KARYFLSNCASPNSF-------------FEAN 432 +CN AK+ + +MTIA ++ T L C+S + F F + Sbjct: 562 TLCNNAKAANVMVMTIALDLDATNTAEKTQMDALKACSSDSRFSKDPTDPSKPMKLFWNS 621 Query: 433 STHELNKIFRDRIGNEIFERVI 454 + L+ F+ R++ Sbjct: 622 TGATLSNDFKAIGNELSNLRIV 643 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 28/163 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F ++ ++ + + + + TAI DA+ A + + Sbjct: 134 DRIGLILFGEQAYLQVPLTFDRKTVEKLLDEAAI--GLAGDKTAIGDAIGLAIKRLRDNP 191 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 D ++ ++LL+DG NT + + A +G++I TI + Sbjct: 192 AD-----------QRVLILLSDGANTAGQVQPLQAAELAAREGLKIYTIGVGADEMIVRD 240 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 ++ ++ A + +F A +T EL++I+ Sbjct: 241 FFGTRRVNPSEDLDEAAMTAIAEKTGGRYFRARNTEELDRIYA 283 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 80.0 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 58/190 (30%), Gaps = 34/190 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V+ I R+G F+ R + + + + + + Sbjct: 116 NRLQVVKRVADDFIARRTG-------DRIGLILFSTRAYVQAPLTLDRNVVRQLLAEASI 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +T+I DA+ A T+ + ++LLTDG NT + + Sbjct: 169 --GMTGRNTSIGDAIGLAVKTLRD-----------RPAKDRVLILLTDGANTSGVLDPME 215 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCAS--PNSFFEANST 434 A + +RI TI + + L A +F A + Sbjct: 216 AAAIAAKENVRIHTIGVGADSNFTDIQPGMLMNPSGDLDEEALKKIAGLTGGQYFRARND 275 Query: 435 HELNKIFRDR 444 L I+ D Sbjct: 276 KGLAAIYADI 285 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 80.0 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 67/184 (36%), Gaps = 18/184 (9%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + ++ +++ D ++G F + + +T VK Sbjct: 901 SKLTTDSVAKTSAVSFVESRGDGDQVGVVSFYTSASLNSALKQMNSGTNKTTVKNAINSL 960 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + G T I+ ++ A + + K+YI++LTDG + + I + Sbjct: 961 SASGGTDISSGIKKAIAELDAHKRSTA---------KQYIIVLTDGYSQYPEFDLIE-AD 1010 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIF 450 KAK++G I TI L AS P ++ S +L + D IG EI Sbjct: 1011 KAKAKGYTIFTIGMG------MADEDTLKKIASKPEYYYRVLSPEQLEAAYYD-IGQEIG 1063 Query: 451 ERVI 454 + Sbjct: 1064 GVIA 1067 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 80.0 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 30/266 (11%), Positives = 77/266 (28%), Gaps = 1/266 (0%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + ++ + G II L + VM+G G+ +D+ + L+ +A Sbjct: 1 MNSAARHPKVMRRSLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++A+ L ++ + A + + ++ + + N T + T Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFVF-FQNKSVQMSTNANVTFSDSLTDPFLTRS 119 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + S V ++ + + L + + A ++ + +I Sbjct: 120 AVTTPSSIKYVQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIP 179 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + Y L+ +++ +G +CN + Sbjct: 180 VFVCRATGSPSYNVGDWITSLSGSSTTYGPGNFGWAALDGSTNEPTIASELSGNTCNITS 239 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRH 266 L L + + Sbjct: 240 PPDLGSTGLKSASLRAWNTRFGIYTN 265 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 80.0 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 59/176 (33%), Gaps = 28/176 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ + + + TAI DA+ A + Sbjct: 132 DRLGLILFADTAYVQAPLTYDRDTVSTLLSEAVI--GLVGEQTAIGDAIGLAVKRFDERD 189 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E ++LLTDG+NT N A ++G+++ TI + Sbjct: 190 ESNN-----------VLILLTDGQNTAGNITPEQAKELAINKGVKVYTIGVGADKMLIQS 238 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 QE L++ A + +F A + EL I++ E E R Sbjct: 239 FFGSREINPSQELDEGMLTDIATSTGGQYFRARNAQELEAIYQQLDALEPIEGETR 294 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 36/336 (10%), Positives = 91/336 (27%), Gaps = 10/336 (2%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + +++ + G II L + +M+G G+ +D+ + L+ +A + Sbjct: 1 MSSAARHPKLTRRGLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSC 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++A+ L ++ + A + + N T + T Sbjct: 61 ALSAARDLTSAISLQVAEADGIAAG-HANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKT 119 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + V +++ + + L ++ ++ A ++ + +I Sbjct: 120 AVANPANVKYVKCTAQLSNIAHWFIEVLNTIPGVQVANASQVAASAIATVGAGQTTCAIP 179 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S Y+ + +++ +G +CN + Sbjct: 180 VFVCKPASSAPYKVGDWISSPSGSSTTYGPGNFGWAALDGSTNEPTIASELSGNTCNITS 239 Query: 241 YYMLYPG---------PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 L + + SS + V D Sbjct: 240 PPDLGSTGLKAASLRAWNTRFGIYTNGANGSSGQPDFTGYAYVGPNYGPPGTPGIVGDAY 299 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V A+F + P G+ + Sbjct: 300 TQFVLDRASFKAYQGDGTPPTGSGISTNGTVTASNY 335 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 43/394 (10%), Positives = 104/394 (26%), Gaps = 16/394 (4%) Query: 16 KSCTGHFFIITALLMP-VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G ++ LL+ V++ + G+ +D R L A + A+ L Sbjct: 9 RRQGGQAIVMFTLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAARLLGSGSNA 68 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLK--KNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + T + + + F +L N D V S Y+V Sbjct: 69 AQQLSMAKATAAQFVNANFPAKFFGASLSGAANVCVDPGTDSSDPCGVGNGSGISTYKVR 128 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + +LF+R +G+ + + A R + + S ++ Sbjct: 129 -TVAVKATATMPTLFMRIIGMPTVTVSGSGTASR--RDVRVILVMDRSSSMGTYYSGINQ 185 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQ------NGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S F + G GI + Sbjct: 186 TPPSINDMALKFVNSFSGAGEFGGRDEVGLVVYGGSGIVAYPPRDITKDYTDYTKFTPPD 245 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 S + ++ + + A ++ ++ + + F + Sbjct: 246 NNFKASGNIPKYIADITSGSNTGTAEALYLAYMTLRADAATNPDLATKLNVIVLFTDGIP 305 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + + +G T+ + + + I+ + Sbjct: 306 NGVTAMANDKTIANQHYLMIPNCTNLGLGDTSRTPMLSGSPNPNIAGWFAQWGGNSYTDN 365 Query: 367 AKKYIV---LLTDGENTQDNEEGIAICNKAKSQG 397 + + ++ +T +G I + K+ G Sbjct: 366 SGPHGFHKPMMAY-ADTGYTGKGDDIDSYMKNPG 398 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 61/200 (30%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S I + V+ ++ + + R+G F D + + + Sbjct: 101 DMLSQDDYIDRLTAVKKVVSDFAQQ-------REGDRLGLVLFADHAYLQTPLTLDRNTI 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + TAI + + A T I S ++ ++LL+DG N Sbjct: 154 AKQVESLVLRLI--GDKTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + I AK I TI T ++ L A + Sbjct: 201 TSGVLDPIEAAKIAKKYNATIYTIGVGAGEMMVKEFFMTRKVNTAKDLDENTLQEIADLT 260 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A + EL I+ Sbjct: 261 GGQYFRARNADELATIYDTI 280 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 67/206 (32%), Gaps = 36/206 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+D LA I K R+G F ++ + + + + +T Sbjct: 107 SIDRLAAVKDVLAEFIEQRKG-------DRLGLVLFAQHAYLQTPLTFDRNTVKQQLERT 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 STAI + + A T I+S ++ I+LL+DG NT E Sbjct: 160 VLGLI--GQSTAIGEGLGIATKTFINS-----------EAPQRVIILLSDGANTAGVIEP 206 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + A + I T+ Q+ L+ A + +F A Sbjct: 207 LEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRA 266 Query: 432 NSTHELNKIFRDRIGNEIFERVIRIT 457 + EL I+ I + T Sbjct: 267 RNPQELEHIY-QLIDQLEPINTAQQT 291 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 67/206 (32%), Gaps = 36/206 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+D LA I K R+G F ++ + + + + +T Sbjct: 107 SIDRLAAVKDVLAEFIEQRKG-------DRLGLVLFAQHAYLQTPLTFDRNTVKQQLERT 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 STAI + + A T I+S ++ I+LL+DG NT E Sbjct: 160 VLGLI--GQSTAIGEGLGIATKTFINS-----------EAPQRVIILLSDGANTAGVIEP 206 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + A + I T+ Q+ L+ A + +F A Sbjct: 207 LEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRA 266 Query: 432 NSTHELNKIFRDRIGNEIFERVIRIT 457 + EL I+ I + T Sbjct: 267 RNPQELEHIY-QLIDQLEPINTAQQT 291 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 47/158 (29%), Gaps = 27/158 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TAI DA+ A + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRHWLDEARI--GIAGKNTAIGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + ++L+TDG N + + A +G++I I Q Sbjct: 189 --------QRPAHSRVLILVTDGANNGGEIDPLTAARLAADEGVKIYPIGIGAAPEQSGT 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELN 438 + L A + +F A +L Sbjct: 241 TGSLGVNPSLDLDEPTLKEIAEVTGGQYFRAQDGEQLL 278 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 30/266 (11%), Positives = 77/266 (28%), Gaps = 1/266 (0%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + +++ + G II L + VM+G G+ +D+ + L+ +A Sbjct: 1 MNSAARHPKVTRRGLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++A+ L ++ + A + + + + + N T + T Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFVF-FQKTSVQMSTNANVTFSDSLTNPFLTKS 119 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + S V ++ + + L + + A ++ + +I Sbjct: 120 AVTKPASIKYVQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIP 179 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S Y+ + +++ +G +CN + Sbjct: 180 VFVCRGPSDPAYKVGDWISSPSGSSSTYGPGNFGWAALDGSTNETTLASELSGNTCNITS 239 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRH 266 L L + + Sbjct: 240 PPDLSTTGLKSAAQRAWNTRFGIYTN 265 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 58/195 (29%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + LV+ I R+G F + ++ H + Sbjct: 106 DMRWQDDEISRLELVKKLFGDFIE-------GRRGDRVGLILFGSQAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + + +VL+TDG N Sbjct: 159 RVWLDEAQI--GIAGKNTAIGDAIGLALKRLRQRPAES-----------RVLVLITDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T A + ++I TI + + L A + Sbjct: 206 TGGQISPQTAARLAAEERVKIYTIGIGADPQQGGVIGLFGLNPGLDLDEPVLRGIAETTG 265 Query: 426 NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 266 GEYFRARSSAELESI 280 >gi|317406818|gb|EFV86930.1| von Willebrand factor type A domain-containing protein [Achromobacter xylosoxidans C54] Length = 252 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 56/185 (30%), Gaps = 20/185 (10%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + A+ +V+ R+G F + L + Sbjct: 1 SATREGRPVSRWSAVQAVVADFI---TRRGDDRLGLIVFGTAAYPQAPLTLDHAALQLLL 57 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 T +TAI DA+ A + + E K ++LLTDG +T Sbjct: 58 RHTAV--GMAGPNTAIGDAIGLAIRMLDAV-----------DEPDKVLILLTDGNDTGSA 104 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 A IRI TI + + L + A+ FF+AN L + Sbjct: 105 VPPQRAATLAAQHHIRIHTIGMGDPQARGDDKVDFDLLEHIAQATGGRFFQANDRESLQQ 164 Query: 440 IFRDR 444 ++ Sbjct: 165 VYATL 169 >gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] Length = 863 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 42/281 (14%), Positives = 89/281 (31%), Gaps = 19/281 (6%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V I + D E ++ G D V+ ++ VG D ++ Sbjct: 166 VEIARPSPNRAHIQYRPTDMEQMRMSPSGISGDFLVRYDVKRDLSVG--DIQIVNGYF-- 221 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 ++Y G + +D S K + A+ ++++ ++ +D Sbjct: 222 ---VHYFAPSGLPVVPKNIVFIIDKSGSMGG-TKMRQTKQAMNTILKDLRD----HDRFN 273 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + ++ + + I + N G T IN A+ A D + +D Sbjct: 274 VMPFSYSSTMWRPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDD 333 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 + + ++ I+ LTDG + + I K+ + + + F + Sbjct: 334 QPNSPRSAS----LIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGKDVDFPF 389 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + K F D + + V Sbjct: 390 LEKMALENRGLARRIYEDSDAALQLKGFYDEVATPLLFDVQ 430 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 71/217 (32%), Gaps = 27/217 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + +D L+ + + + + ++ + I S + Sbjct: 75 QTYNAWDNKDTEGIDIMLTMDISASMLTEDVFPNRMEVAKEVASEFISS-------RPSD 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + L+ ++ D G TAI + + + Sbjct: 128 NIGLTIFAGEAFTQCPMTLDHAALL-NLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRLK 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNK 409 S K ++LLTDG N + + AK GIR+ TI + Sbjct: 187 DSKAKS-----------KVVILLTDGSNNVGSISPMTAATIAKKFGIRVYTIGLGRETGE 235 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L + A + F+ A S EL+KI++D Sbjct: 236 DIGAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQDI 272 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ + Sbjct: 47 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKI--GIAGKNTALGDAIGLGLKRLR--- 101 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 102 --------LRPATSRVLVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGADPDKDAL 153 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 154 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 193 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 59/181 (32%), Gaps = 27/181 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ + R+G F R ++ + R I + Sbjct: 115 VSRIEAVKAIASDFTSQ-------RVGDRVGLILFGTRAYVQAPLTFDTATVTRFIREAQ 167 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + + + ++LLTDG++T + + Sbjct: 168 L--GFAGEDTAIGDALGLAIKRLRERPAES-----------RVLILLTDGQDTASTVDPM 214 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKAR-----YFLSNC--ASPNSFFEANSTHELNKI 440 A GI++ TI S + L+ A+ +F A + EL I Sbjct: 215 EATALAAESGIKVYTIGISRRIGARAGGSGEVDEALLNAIAEATGGEYFRARNPAELQSI 274 Query: 441 F 441 + Sbjct: 275 Y 275 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 60/200 (30%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + V+ L+ + R+G F D + + Sbjct: 132 DMNDNGEYIDRLTTVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLTADRKTV 184 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ I +T TAI D + T + S ++ ++LL+DG N Sbjct: 185 MQQINQTVI--GLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSN 231 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + + AK I T+ T + L+ A + Sbjct: 232 TAGVLDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMT 291 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A EL I+ Sbjct: 292 GGKYFRARDAKELETIYDTI 311 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 44/377 (11%), Positives = 109/377 (28%), Gaps = 40/377 (10%) Query: 48 YYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 ++ A++T + P + + + ++ + E + + Sbjct: 39 QSTSKIQSASKTDSSAQARPELSEVAGLVMASEPQADLYQGSKELHRSAKVMTSSMA--- 95 Query: 108 DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + IV V ++ R + + +++ ++ ++ + ++ I + + Sbjct: 96 ----QRIVSSQHVSVSDRNFSLAPTTNDKFESVVQNGNMVAGETPVSTFSIDVDTGSYST 151 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 +R + + + +N F S + + Sbjct: 152 TRRLINQ------------GQLPTKNTVRVEELVNYFSYDYPVPTNSEQPFSVNTELAPS 199 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + L + + +D S K L++ A+ + + + Sbjct: 200 PYNADTQLLRIGLKGFDVAPDKLSASNLVLLLDVSGSMSSADKLPLLKQAMLMLSQQLSA 259 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D V+ V + GV T +KT N G T + +Q AY Sbjct: 260 QDKVSIVV---------YAGASGVVLDGVAGNDFTAIKTALSQLNAQGGTNGSQGIQLAY 310 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QGIRIMTIA 404 + +N ++L TDG+ + + + S +GI + T+ Sbjct: 311 QL----AQKHFIENGSNR-----VILATDGDFNLGMTDHQQLVDFVASRSKKGIGLSTLG 361 Query: 405 FSVNKTQQEKARYFLSN 421 F + + L Sbjct: 362 FGLGSGSASYNDHLLEQ 378 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 40/276 (14%), Positives = 72/276 (26%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ + V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDYLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + +V LTDG+ T + I N + + I TI + + + Sbjct: 392 DIGDRSVS-LVVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIXTPLLSDI 486 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 64/198 (32%), Gaps = 33/198 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V+ L+ I R+G F D + V + Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDVKTV 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + +TAI DA+ + S E + +VLLTDG+N Sbjct: 156 SKMLSEAQI--GLVGRATAIGDALGLSVKRFASKKESN-----------RIVVLLTDGQN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-----------KTQQEKARYFLSNCA--SPN 426 T N A+ +GI++ TI + L A + Sbjct: 203 TAGNLNPEDALLLAREEGIKVYTIGVGSDNPRGFSLFNMGSGGSNLDEGLLKKIAEQTGG 262 Query: 427 SFFEANSTHELNKIFRDR 444 +F A L +I+ + Sbjct: 263 LYFRAKDVAGLQQIYAEL 280 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 55/170 (32%), Gaps = 27/170 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ R + ++ L + + + TAI D + + + Sbjct: 155 DRIGIVLFSGRAYTMAPLTFDHRWLGSQLERIKVGLIED--GTAIGDGLGVGLTRLEQAQ 212 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + + + ++VLLTDG N + + AK++GI + TI + Sbjct: 213 RESGGKRQGA-----FVVLLTDGANNRGSLTPQQAAELAKARGIPVYTIGAGQDGIVPFP 267 Query: 413 ----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L + A + FF A + FR Sbjct: 268 VFDDKGRKLGYRRIMSDLDEGALRDIAEMTGGHFFRAADVGTVESAFRAI 317 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 41/120 (34%), Gaps = 3/120 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + ++L++ G+F ++ A+ ++G + +D + ++ A A + + Sbjct: 6 FCLHMRRLMRDREGNFAVLGAIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVR 65 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + E + F L + + T+ + + R + + Sbjct: 66 AYGEGATEEEAYDAAQKFFWSNYA---LPQESVVDALSAATEASTQALPRCCRTPICGQW 122 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 52/174 (29%), Gaps = 32/174 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G F+ + + L + + TAI + TA + Sbjct: 128 KNRPDDRIGLVIFSGESFTQCPLTIDHDVL--INLYHDIKNGMIEDGTAIGMGLATAVNR 185 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + K ++LLTDG N + I AK GIR+ T+ Sbjct: 186 LR-----------GSEAKSKVVILLTDGVNNAGSIPPITAAEIAKQFGIRVYTVGIGTQG 234 Query: 410 TQQ-----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L+ A + +F A + L +I++ Sbjct: 235 YAPYPVPSPYGGVVYQRMEVQIDEPTLTKIAAITGGKYFRATNNDALTRIYKQI 288 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 57/194 (29%), Gaps = 22/194 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + ++ Sbjct: 266 TKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNFPLRLMTETGKQEALQA-V 318 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ----DNE 384 G T I + ++ +I IVLL+DG++T N Sbjct: 319 NSLVSNGGTNIAEGLKKGARVLIDR---------RFKNPVSSIVLLSDGQDTYTMTSPNG 369 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + I I + + + A S +F S + F Sbjct: 370 SRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVIQDAFA 429 Query: 443 DRIGNEIFERVIRI 456 IG + V + Sbjct: 430 QCIGGLLSVVVQEL 443 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 57/194 (29%), Gaps = 22/194 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + ++ Sbjct: 266 TKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNFPLRLMTETGKQEALQA-V 318 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ----DNE 384 G T I + ++ +I IVLL+DG++T N Sbjct: 319 NSLVSNGGTNIAEGLKKGARVLIDR---------RFKNPVSSIVLLSDGQDTYTMTSPNG 369 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + I I + + + A S +F S + F Sbjct: 370 SRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVIQDAFA 429 Query: 443 DRIGNEIFERVIRI 456 IG + V + Sbjct: 430 QCIGGLLSVVVQEL 443 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 58/197 (29%), Gaps = 22/197 (11%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + K +V++ + I R+G F ++ Sbjct: 96 NQWGFDPGDPNKSKFDVVKEVVGDFID-------KRKNDRIGLINFASVAFVASPLTFEK 148 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 L + + TAINDA+ Y+ + S K +LLTD Sbjct: 149 DFLRKILQMQE--PGIAGKRTAINDALLQTYNILSKS-----------DAKSKIAILLTD 195 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 G + I I++ TI + L+ A FF A+ Sbjct: 196 GIDNASRISFDEIRRLISDSDIKLYTIGIGSYRDFDAPYLKALAQ-AGHGRFFAASDRRS 254 Query: 437 LNKIFRDRIGNEIFERV 453 L KI+ + I ++ Sbjct: 255 LQKIY-EAIDRLETSKI 270 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 69/208 (33%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A I+ + R+G ++ + + +++ + + Sbjct: 111 NRIEALKRVAAQFIQQ-------RASDRIGIVVYSGESYTKVPATTDKSIVLQALKEIRQ 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + TAI + TA + + S K I+L+TDG N + ++ Sbjct: 164 GEIED--GTAIGMGLGTAINRLKDS-----------KTKSKVIILMTDGVNNTGVIDPLS 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 AK GIR+ TI N L+ + + + Sbjct: 211 AAELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSFQYDMVPVEIDEKLLAEISKITGGKY 270 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 F A ++L +I+ + I ++ + Sbjct: 271 FRATDNNKLAQIYTE-IDKLEKSKIEEL 297 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/299 (11%), Positives = 74/299 (24%), Gaps = 30/299 (10%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + Q S ++ + P V +S N + K Sbjct: 193 DDDEVLDXQXESTERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPL 252 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKID 289 N + V + + K L++ A+ VI+S+ D Sbjct: 253 TSGRQNSGTNQTNMQPTSQSCRAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQSLGPCD 312 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F+ + ++ G T I + ++ Sbjct: 313 ------RLSVISFSSTARRLFPLRRMTDTGRQQALQA-VNSLISNGGTNIAEGLRKGAKV 365 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK----------AKSQGIR 399 ++ I+LL+DG++T ++ G Sbjct: 366 MLDRKW---------KNPVSSIILLSDGQDTYTVCSPGGAHSRTDYSLLLPFSIHRNGGT 416 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 I + + + S +F + + F IG + V + Sbjct: 417 GFQIPVHAFGFGTDHDATSMHAISETSGGTFSFIEAEGVIQDAFAQCIGGLLSVVVQEL 475 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 65/198 (32%), Gaps = 34/198 (17%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +KK + + R+G F + + L++ + + N TAI A Sbjct: 122 DVLKKFVDKRQSDRIGLVVFGTQAYVAVPPTLDHEFLLKNLERLGIGSIN-GNQTAIGSA 180 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + T+ + + K I+L+TDG+N + A++ GI+I T Sbjct: 181 LSTSMNRLR-----------ELKSKSKIIILMTDGQNNAGKVPPLTAAEAARALGIKIYT 229 Query: 403 IAFS-------VNKTQQEKARYFLSNCA--------------SPNSFFEANSTHELNKIF 441 I T + + ++ A+ST L KI+ Sbjct: 230 IGVGTKGVARMAVGTDPFSGQKIYQQVPVDIDEGTLTSISKMTNAKYYRADSTATLEKIY 289 Query: 442 RDRIG-NEIFERVIRITK 458 D + V + T+ Sbjct: 290 ADIDRLEKTEAEVKKYTQ 307 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 49/165 (29%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + TAI +A+ A + Sbjct: 127 DRLGLILFADHAYLQAPLTLDRRSVATFLDDAQI--GLVGKQTAIGEAIALAVKRFDKVD 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LLTDG N N E A + I I TI Sbjct: 185 ESN-----------RVLILLTDGSNNAGNIEPEVAAQIAAKRNITIYTIGVGAEILERRT 233 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + +F A ++ EL I+++ Sbjct: 234 IFGKERINPSMDLDEDQLKKLAAMTKGRYFRARNSEELASIYQEI 278 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 29/275 (10%), Positives = 76/275 (27%), Gaps = 3/275 (1%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + +++ + G II L + VM+G G+ +D+ + L+ +A Sbjct: 1 MSSAARHPKLTRRGLHRQQGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKI---EEYLIRNFENNLKKNFTDREVRDIVRD 117 ++A+ L ++ + A + + ++N+ + + Sbjct: 61 ALSAARDLTSAISLQVAEADGIAAGHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTA 120 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 A N + LS+ + L+ + + A A + V Sbjct: 121 VATPANVKYVKCTAQLSNIAHWFIEVLNTIPGLQVANAAQVAASAIATVGAGQTTCAIPV 180 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + + S +G S N + + ++ Sbjct: 181 FVCKPASSAPYKVGDWISSPSGSSTTYGPGNFGWAALDGSTNEPTIASELSGNTCNITSP 240 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 L + + + ++ + Sbjct: 241 PDLGSTGLKSASLRAWNTRFGIYTNGANGSSGQPD 275 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 79.3 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 62/193 (32%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L I + R+G F ++ + + + Sbjct: 112 SINRLQAVKRVLDDFIDQ-------REGDRLGLILFGTEPYVQAPLTFDRETVRTLLFEA 164 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ + + ++ ++ ++LLTDG NT Sbjct: 165 GL--GMAGRATAIGDAIGLSVKRLRERPQE-----------QRVVILLTDGANTAGQVSP 211 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 A++ G+R+ TI + ++ L+ A + +F A Sbjct: 212 DKATEIAQAAGVRLYTIGIGADTMIQRGLLGSRRVNPSRDLDEELLTRMAEQTGGRYFRA 271 Query: 432 NSTHELNKIFRDR 444 S EL I+ Sbjct: 272 RSLPELEMIYDSI 284 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 78.9 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 63/206 (30%), Gaps = 39/206 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ H + ++ Sbjct: 280 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQA-V 332 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 G T I DA++ A I N I+LL+DG++T + Sbjct: 333 NLLGAGGGTNIADALKKAAKVIEDRN---------YKNPVCSIILLSDGQDTYNISSNVR 383 Query: 383 -------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + +I N I T+ + L + A S +F Sbjct: 384 GTRPDYRSLVPSSILNH------TICTVPVHGFGFGADHDSDALHSIAESSGGTFSFIED 437 Query: 434 THELNKIFRDRIGNEIFERVI--RIT 457 + F IG + V R+T Sbjct: 438 ESVIQDAFAQCIGGLLSVVVQDMRLT 463 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 78.9 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 63/206 (30%), Gaps = 39/206 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ H + ++ Sbjct: 280 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQA-V 332 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 G T I DA++ A I N I+LL+DG++T + Sbjct: 333 NLLGAGGGTNIADALKKAAKVIEDRN---------YKNPVCSIILLSDGQDTYNISSNVR 383 Query: 383 -------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + +I N I T+ + L + A S +F Sbjct: 384 GTRPDYRSLVPSSILNH------TICTVPVHGFGFGADHDSDALHSIAESSGGTFSFIED 437 Query: 434 THELNKIFRDRIGNEIFERVI--RIT 457 + F IG + V R+T Sbjct: 438 ESVIQDAFAQCIGGLLSVVVQDMRLT 463 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 64/227 (28%), Gaps = 39/227 (17%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +D ++ + + + +++ + I + R+ Sbjct: 81 DVSTRTNTTRGIDIVIAIDVSASMLARDLKPNRLEALKEVASQFIAD-------RPSDRI 133 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G + + + ++ ++ G TAI + T+ + + S Sbjct: 134 GLVEYAGESYTRTPITSDKSIVLSSLNDIQYNSII-EGGTAIGMGLATSVNRLKDS---- 188 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--- 414 K I+L+TDG N E A+ GI++ TI N T Sbjct: 189 -------RAKSKVIILMTDGVNNAGFIEPSTASELAQEFGIKVYTIGLGTNGTALSPVAL 241 Query: 415 ---------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F A L +I+ + Sbjct: 242 RPDGSFQYGSIPVEIDEALLQEIADKTGGLYFRATDNESLEEIYAEI 288 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 57/204 (27%), Gaps = 47/204 (23%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + R+G + + + +TIVK Sbjct: 114 NRLEALKKVAVDFVDR-------RPNDRIGIVVYAGESFTQTPITS-----DKTIVKRTI 161 Query: 329 IDENEM---GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G TAI + + + + S K I+LLTDG N N + Sbjct: 162 NRLQWGQLEGGTAIGMGLGSRVNRLKDSKAKS-----------KVIILLTDGVNNAGNID 210 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEK-------------------ARYFLSNCA--S 424 AK GI++ TI N L N A + Sbjct: 211 PTTATELAKELGIKVYTIGIGTNGMADFPWSKDPRTGMLNFRKQQVQIDEDLLKNIAEET 270 Query: 425 PNSFFEANSTHELNKIFRDRIGNE 448 +F A L +I+ + E Sbjct: 271 QGKYFRATDNTSLKEIYDEIDALE 294 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 56/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ R+G F D + +I+ I +T Sbjct: 125 SAVKKVLSDFVAKRKG---DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVI--GLVGQ 179 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + T + S ++ ++LL+DG NT + I AK Sbjct: 180 RTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPIEAAEIAKK 228 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T + L+ A + +F A +L KI Sbjct: 229 YNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKI 288 Query: 441 FRDR 444 + Sbjct: 289 YNTI 292 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 56/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ R+G F D + +I+ I +T Sbjct: 119 SAVKKVLSDFVAKRKG---DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVI--GLVGQ 173 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + T + S ++ ++LL+DG NT + I AK Sbjct: 174 RTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPIEAAEIAKK 222 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T + L+ A + +F A +L KI Sbjct: 223 YNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKI 282 Query: 441 FRDR 444 + Sbjct: 283 YDTI 286 >gi|115361035|ref|YP_778172.1| hypothetical protein Bamb_6294 [Burkholderia ambifaria AMMD] gi|115286363|gb|ABI91838.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 423 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 28/266 (10%), Positives = 77/266 (28%), Gaps = 1/266 (0%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + ++ + G II L + +M+G G+ +D+ + L+ +A + Sbjct: 1 MSSAARHPKLTRHSLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSC 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++A+ L ++ + A + + N T + T Sbjct: 61 ALSAARDLTSAISLQVAEADGIAAGHANYAF-FQQNAVQMQTDSNVTFSDSLTNPFLTKT 119 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + V +++ + + L ++ ++ A ++ + +I Sbjct: 120 AVATPANVKYVKCTAQLSNIAHWFIEVLNTIPGVQVANASQVAASAIATVGAGQTTCAIP 179 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S Y+ + +++ +G +CN + Sbjct: 180 VFVCKPASSAPYKVGDWISSPSGSSTTYGPGNFGWAALDGSTNEPTIASELSGNTCNITS 239 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRH 266 L L + + Sbjct: 240 PPDLGSTGLKAASLRAWNTRFGIYTN 265 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 74/228 (32%), Gaps = 27/228 (11%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + +D L+ + + + + ++ + I Sbjct: 75 QTYNAWDDKDTEGIDIMLTMDVSASMLTEDVYPNRMVVAKEVASEFIS-------GRPND 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + L+ ++ D G TAI + A + Sbjct: 128 NIGLTIFAGEAFTQCPMTLDHAALL-NLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLK 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S K ++LLTDG N + + A+ GIRI TI F + Sbjct: 187 DSKAKS-----------KIVILLTDGSNNAGSISPMTAAAIARKFGIRIYTIGFGKETGE 235 Query: 412 Q--EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + L + A + F+ A S EL++I++D E + R Sbjct: 236 EIGAIDYKTLQDIAVSTNGEFYRAQSQAELSRIYQDIDKLEKTKMNAR 283 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 63/177 (35%), Gaps = 17/177 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 D L+ V ++ R+G F D F+ ++ Sbjct: 110 AVDRLSMVKSYLQSFIKQRQGDRIGIILFADHAYLMVPFTQDWQAA--GLLLDEVNIGLA 167 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI +A+ A + + K ++LL+DG+++ + + A Sbjct: 168 GKFTAIGEAITLAVKKTLHEPKPIQ---------NKTLILLSDGKDSINTIQPTDAAALA 218 Query: 394 KSQGIRIMTIAFSVNKTQQ----EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 K+ G++I TI + T + L A + +F A S +L++I++ Sbjct: 219 KASGLKIYTIGIGSDSTDAEAESDLDETTLEEIANMTGGQYFRARSEQDLSEIYQQI 275 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 59/195 (30%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + +LV+ L + + R+G F + ++ H + Sbjct: 106 DMQWQDEDVSRLNLVKHLLGDFLEH-------REGDRVGLILFGSKAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + ++L+TDG N Sbjct: 159 RVWLDEAKI--GIAGKNTAIGDAIGLALKRLR-----------QRPAQSRVLILVTDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSNCA--SP 425 I A +G++I I + Q + L A + Sbjct: 206 NGGEIAPITAARLAAEEGVKIYPIGIGADPEQSATLGVLGINPSLDLDEPALKELAQVTG 265 Query: 426 NSFFEANSTHELNKI 440 +F A EL I Sbjct: 266 GRYFRARDGQELQAI 280 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 59/195 (30%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + L++ I R+G F + ++ H + Sbjct: 106 DMRWQDYEISRLELIKKLFGDFIED-------RRGDRVGLILFGSQAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + + +VL+TDG N Sbjct: 159 RVWLDEAQI--GIAGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T A Q ++I TI + + L A + Sbjct: 206 TGGQIAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTG 265 Query: 426 NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 266 GEYFRARSSAELESI 280 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 60/191 (31%), Gaps = 24/191 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + I ++ Sbjct: 224 KIQNVKTTLLQLLDMLNSND------RLSLILFNSYPTLLCNLRKVDDENTPNI-QSIIN 276 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M A++ + I LL+DG++ +E+ Sbjct: 277 SITADGGTDINSGMLMAFNILQKRQFF---------NPVSSIFLLSDGQDNGADEKIKKY 327 Query: 390 CN---KAKSQGIRIMTIAFSVNKTQQEKARYF-LSNCASPNSFFEANSTHELNKIFRDRI 445 N K++ I + F + R L +F+ +++++ F D + Sbjct: 328 INSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLK----DGNFYYVEKINQVDEFFVDAL 383 Query: 446 GNEIFERVIRI 456 G I Sbjct: 384 GGLFSVVAQEI 394 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 60/200 (30%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + V++ ++ ++ + R+G F D + + Sbjct: 101 DMQEGDQYIDRLSAVKNVVSDFVKQ-------REGDRLGLVLFADHAYLQTPLTLDRETI 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + TAI + + A T + S ++ +VLL+DG N Sbjct: 154 SDQVNSLVLRLI--GDKTAIGEGIGLATKTFVDS-----------EAPQRVMVLLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T E + AK I TI T ++ L A + Sbjct: 201 TSGVLEPLEAARIAKKYNATIYTIGIGAGEMMVKEFFMTRKVNTAKDLDEKTLKQIADLT 260 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A + EL I+ Sbjct: 261 GGQYFRARNADELATIYDTI 280 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 67/211 (31%), Gaps = 39/211 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + I + + R+G F+ + + + L I + Sbjct: 118 KNRIEAAKSVATDFI-------HQRLSDRIGLVVFSGKSFTQCPLTLDYRLLTNFISELK 170 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A E G TAI A+ TA + + S K I+LLTDG+N E + Sbjct: 171 AGTIEEDG-TAIGTAIATATNRLRESTAKS-----------KVIILLTDGQNNAGEIEPV 218 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCA--SPNSF 428 A + GI+I T+ + + L+ A S + Sbjct: 219 TAAELAAALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISGGRY 278 Query: 429 FEANSTHELNKIFRDRIG-NEIFERVIRITK 458 F A L K + + + V R T+ Sbjct: 279 FRATDLESLKKTYHEIDELEKTKVEVERYTE 309 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARI--GIAGKNTALGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------MRPATSRALVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 241 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 55/162 (33%), Gaps = 28/162 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ ++ + + +TAI DA+ A + Sbjct: 133 DRVGLILFGTQAYLQTPLTFDRKTVMTLLNEAVI--GLAGDNTAIGDAIGLAVKRL---- 186 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 K+ + +VL+TDG NT + A + ++I TI + Sbjct: 187 -------KSEQVNSRVLVLMTDGANTAGEVSPLKAAELAAANHLKIYTIGIGADEMIVRS 239 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + L A + ++ A +T ELN I+ Sbjct: 240 FFGNRKINPSVDLDEKTLIKIAESTGGQYYRARNTDELNNIY 281 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 63/243 (25%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D + Y ++ V Sbjct: 208 QNGVLGDLIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTK 267 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +R ++ + + F++R+ V K + + Sbjct: 268 LRQTKDALFTILHDLRP---QDHFSIIGFSNRIKVWKDHLVSVTPDNVRDGKVYIHHMSP 324 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN A+Q A + N+ H + I+ LTDG+ T + I N Sbjct: 325 TGGTDINGALQRAIKLL---NDYVAHNDIEDRSVS-LIIFLTDGKPTVGETHTLKILNNT 380 Query: 394 KSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K I I TI + + L NC E + F D I + Sbjct: 381 KEAARGQICIFTIGIGNDVDFMLLEKLSLENCGLTRRVLEEDDAGAQLIGFYDEIRTPLL 440 Query: 451 ERV 453 + Sbjct: 441 SDI 443 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 63/243 (25%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D + Y ++ V Sbjct: 206 QNGVLGDLIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTK 265 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +R ++ + + F++R+ V K + + Sbjct: 266 LRQTKDALFTILHDLRP---QDHFSIIGFSNRIKVWKDHLVSVTPDNVRDGKVYIHHMSP 322 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN A+Q A + N+ H + I+ LTDG+ T + I N Sbjct: 323 TGGTDINGALQRAIKLL---NDYVAHNDIEDRSVS-LIIFLTDGKPTVGETHTLKILNNT 378 Query: 394 KSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K I I TI + + L NC E + F D I + Sbjct: 379 KEAARGQICIFTIGIGNDVDFMLLEKLSLENCGLTRRVLEEDDAGAQLIGFYDEIRTPLL 438 Query: 451 ERV 453 + Sbjct: 439 SDI 441 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 56/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ R+G F D + +I+ I +T Sbjct: 125 SAVKKVLSDFVAKRKG---DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVI--GLVGQ 179 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + T + S ++ ++LL+DG NT + I AK Sbjct: 180 RTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPIEAAEIAKK 228 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T + L+ A + +F A +L KI Sbjct: 229 YNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKI 288 Query: 441 FRDR 444 + Sbjct: 289 YDTI 292 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 78.9 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + +T TAI + + A T I S Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQTVLKLI--GTQTAIGEGIGLATKTFIDS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 185 ----------DAPQRVMILLSDGSNTAGVLDPLEAANIAKQYNTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 58/176 (32%), Gaps = 28/176 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ + + + TAI DA+ A Sbjct: 132 DRIGLILFADTAYVQAPLTYDRDTVSTLLSEAVI--GLVGEQTAIGDAIGLAVKRFDERE 189 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E ++LLTDG+NT N A S+G+++ TI + Sbjct: 190 ESNN-----------VLILLTDGQNTAGNITPEQAKELAISKGVKVYTIGVGADKMLIQS 238 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 QE L+N A + +F A + EL I++ E E R Sbjct: 239 FFGSRQINPSQELDEGMLTNIATSTGGQYFRARNAQELQAIYQQLDALEPIEGETR 294 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 67/437 (15%), Positives = 137/437 (31%), Gaps = 61/437 (13%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + V+ + T+ ++I+ + +N K T + + V T + Sbjct: 971 IATATSDDSGNVTFDKLRAGTYYLKEIQAPVG-YVASNEKWTVTVTKENNNVTTTLKNSS 1029 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ------TKAEAETVSRSYHKEHGV 177 +L+ + ++N + ++ +K W + T S K Sbjct: 1030 GTAVKDNKILNETPEEIINSSMVTSKTAKVKDWDQRTYDITINATSTSTSSIIETKTSVA 1089 Query: 178 SIQWVIDFSRSML-DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 I V+D S SM D S N + D + + + + S Sbjct: 1090 DIMLVLDVSGSMGEDITSYSYTFVANNTSEARDDKKLLNRNVTYYIEVDGSYKEMWYYSS 1149 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK--KKHLVRDALASVIRSIKKIDNVND- 293 + + G D + +++ R + +++A+ I K + Sbjct: 1150 YNKGWRVGPRGSSDDAAKDKYNNCKIYTRTSTTETRLDALKNAVNQFIDDTAKKSPNSKI 1209 Query: 294 ------TVRMGATFFNDRVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTA 346 + + + S G + T +K F D G T ++ A Sbjct: 1210 GITVFSSTDDYNRPYGNHGTSVSLGEVGTADSAKVTELKNFVKDLKANGGTDPAVGLEDA 1269 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC---------------- 390 + + + + KY+VL TDG+ T + + Sbjct: 1270 KNKLDAMVD----------TNPKYVVLFTDGKPTGGGNKWNSNAQKNAETQAGELKTGLR 1319 Query: 391 ---NKAKSQGIRIMTIAFSVNKTQQEKARYFLSN-----------CASPNSFFEANSTHE 436 + AK+ + TI F++N ++A+ FLS +S + A+ Sbjct: 1320 NNVDNAKNP-YTVYTIGFALNDEG-DRAKTFLSGGTYDGKKDPGIASSSDCAKTADDAAS 1377 Query: 437 LNKIFRDRIGNEIFERV 453 L +IF+ I + I + V Sbjct: 1378 LTQIFQS-ISSTINKNV 1393 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 55/179 (30%), Gaps = 31/179 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + + TAI +++ A + Sbjct: 127 DRIGLILFGDHAYLQSPMTQDRRSVAQYLREAQI--GLVGKQTAIGESIALAVKRFENLE 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + +VLLTDG N + A + + I TI Sbjct: 185 ESN-----------RVLVLLTDGTNNAGSISPDKAAAIAAERKVTIYTIGVGAEMMERRS 233 Query: 409 -------KTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L A+ +F A S+ +L I+++ E V R T+ Sbjct: 234 FFGRDRVNPSMDLDEEQLQRIANATQGKYFRARSSEDLAAIYQEIDKLE---PVSRDTQ 289 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 59/196 (30%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ AS I N R+G + + + ++ + Sbjct: 112 NRLEALKKVAASFI-------NGRPNDRIGLIEYAGESFTKTPITSDKSIVLSALKSIQY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G TAI + T + + S K I+L+TDGEN + Sbjct: 165 NNII-EGGTAIGMGLATGVNRLKDSKALS-----------KVIILMTDGENNAGQIDPRI 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A+ GI++ TI N L A + + Sbjct: 213 AAELAQEFGIKVYTIGMGTNGMALSPYARNANGTFVYENIQVTIDEELLEEIAATTGGQY 272 Query: 429 FEANSTHELNKIFRDR 444 F A + +L +I+ + Sbjct: 273 FRATNNEKLQEIYDEI 288 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 51/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I S Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVASQLNQAVLKLI--GTQTAIGEGIGLATKTFIDS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 185 ----------DAPQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 63/203 (31%), Gaps = 35/203 (17%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L+ I + R+G F D + +I + + Sbjct: 108 VDRLTAVKQVLSEFIT-------KREGDRVGLILFADHAYLQTPLTLDRQTVISQLNQAV 160 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI + + A T I S ++ ++LL+DG NT + + Sbjct: 161 LKLI--GTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDPL 207 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCAS--PNSFFEAN 432 N AK I T+ T Q+ L AS +F A Sbjct: 208 EAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRAR 267 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 + +L I+ E + R Sbjct: 268 NQQDLQSIYDTINQLEPISQASR 290 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 66/198 (33%), Gaps = 41/198 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + I +G T F + + LI ++++ Sbjct: 108 NRLEAAKNVASEFISD-------RPNDNIGLTIFAGEAFTQCPMTTDHASLI-NMLRSVR 159 Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D G TAI + A + S K ++LLTDG N + Sbjct: 160 TDIAARGLISDGTAIGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNMGDI 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPN 426 + AKS GIR+ TI NK E L N A + Sbjct: 209 SPLTSAQIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPVEIDSKTLKNIAETTDG 268 Query: 427 SFFEANSTHELNKIFRDR 444 +++ A S ++L +I++D Sbjct: 269 NYYRATSNNQLKQIYKDI 286 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 57/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ R+G F D + +++ I +T Sbjct: 125 TAVKKVLSDFVAKRKG---DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVI--GLVGQ 179 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + T + S ++ ++LL+DG NT E + AK Sbjct: 180 RTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEPLEAAEIAKK 228 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T + L+ A + +F A T EL KI Sbjct: 229 YNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRARDTEELEKI 288 Query: 441 FRDR 444 + Sbjct: 289 YDTI 292 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 60/193 (31%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L+ I K R+G F D + L + + + Sbjct: 116 YIDRLSAVKHVLSDFIERRKG-------DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQA 168 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI D + A T + S ++ +VLL+DG NT + Sbjct: 169 VLRLI--GNQTAIGDGIGLATKTFVDS-----------DAPQRVMVLLSDGSNTAGVLDP 215 Query: 387 IAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEA 431 + + AK I T+ T Q+ L A + +F A Sbjct: 216 LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRA 275 Query: 432 NSTHELNKIFRDR 444 + EL I+ Sbjct: 276 RDSKELATIYDTI 288 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 93 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 152 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 153 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 209 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 210 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 266 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 267 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 325 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 326 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 361 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 100 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 159 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 160 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 216 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 217 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 273 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 274 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 332 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 333 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 368 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 78.5 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 58/195 (29%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + L++ I R+G F + ++ H + Sbjct: 106 DMRWQDDEISRLELIKKLFGDFIED-------RRGDRVGLILFGSQAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + + +VL+TDG N Sbjct: 159 RVWLDEAQI--GIAGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T A Q ++I TI + + L A + Sbjct: 206 TGGQIAPQIAAQLAAEQQVKIYTIGVGADPQQGGVPGLFGFNPGLDLDEPTLRGIAEITG 265 Query: 426 NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 266 GEYFRARSSAELESI 280 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 78.1 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 63/211 (29%), Gaps = 43/211 (20%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + I N +G T F + + Sbjct: 97 STSMLTPDLPPSRIETAKQVAYEFI-------NNRPDDNIGLTVFGGEAYTQCPLTTDHS 149 Query: 318 KLIRTIVKTFAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 L+ + K D + G TAI + +A + S K I+L Sbjct: 150 ALL-NMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQS-----------KSKSKVIIL 197 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------------------QQEKA 415 LTDGEN + AK GIRI TI+ + +Q Sbjct: 198 LTDGENNAGEISPLTAAEMAKRLGIRIYTISVGTDAAVNQTVATLPNGETYEAAIKQNTD 257 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + F++A S +L I+++ Sbjct: 258 PKTLEAIANSTGGKFYQARSKAKLRDIYQNI 288 >gi|126723120|ref|NP_001075478.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] gi|11041722|dbj|BAB17303.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] Length = 951 Score = 78.1 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 58/189 (30%), Gaps = 15/189 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K R+AL ++ + D R F+ + + +++A Sbjct: 289 RKIQQTREALLKILDDLNPRD------RFNLILFSSSATPWKTSLVQASLETVSEARSYA 342 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T IN+A+ A + E+ + ++LLTDGE TQ Sbjct: 343 GAIQAAGGTDINEALLLAVSLLDHEQEELRAGSVS------LLILLTDGEPTQGKTNPTE 396 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I + + + F N + L N +E + + F + Sbjct: 397 IQRNVREAIGGRYSLFCLGFGFNVNYPFLEKLALDNGGLARRVYEDSDAALQLQDFYQEV 456 Query: 446 GNEIFERVI 454 + V Sbjct: 457 AHPQLTSVT 465 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I S Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 185 ----------DAPQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L + A + +F A + +L I+ Sbjct: 235 FLFSRKVNTSQDLDEKTLQSIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|264678234|ref|YP_003278141.1| hypothetical protein CtCNB1_2099 [Comamonas testosteroni CNB-2] gi|262208747|gb|ACY32845.1| putative membrane protein [Comamonas testosteroni CNB-2] Length = 408 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 80/304 (26%), Gaps = 18/304 (5%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL--IQSLEE 74 G F I AL M +LG G+ +D+ R + L+ A + + A+ L Sbjct: 9 RQHGAFLITFALFMLFLLGFMGIALDLGRLFIVKTELQTAMDSCALAAARELNGQSDAIT 68 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + A + + N + L +D V T+ R + Q +S Sbjct: 69 RAQNAGMAAGNSNNANLQSANWNGQGKLPATGISFRKQDYVTPTSDGKLARYAECQYSMS 128 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW---VIDFSRSMLD 191 S LL + F +A A + +Q D S Sbjct: 129 SIKLWLLQAMGAFTGDSATWPNTGTVEARAVATRAPSQSACPIPVQLKKAKFDALNSSPG 188 Query: 192 YQRDSEGQPLNC----------FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + + + ++++ GK + P + K+ Sbjct: 189 GGKGTWIVAMTKSGGGSDFGWSNLDGSTSATETWAELEGKYCSTEFPQLPLDTNGLKASA 248 Query: 242 YMLYP---GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + + G + S+S + + + Sbjct: 249 FEYWNHRFGIYKKASDTPAQNSSASRPDYTGYIYTDTNWPSKFNAYDGPAQGGTSNFITQ 308 Query: 299 ATFF 302 F Sbjct: 309 RQSF 312 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 64/224 (28%), Gaps = 38/224 (16%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + +D ++ + + + ++ I R+G Sbjct: 82 FSWQNSTTEGIDIVIATDISGSMLAEDLKPNRLEAGKNIAIDFI-------KGRPEDRIG 134 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + + L + + + TAI + TA + + S Sbjct: 135 LVIFSGESFTQCPLTIDHDVL--INLFSDISNGMVEDGTAIGMGLATAVNRLKDS----- 187 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY- 417 K ++LLTDG NT + + AK +R+ TI + Sbjct: 188 ------EAKSKVVILLTDGSNTTGSIPPLTAAEIAKQMKVRVYTIGVGTKGYAPYPVKTP 241 Query: 418 ---------------FLSNCAS--PNSFFEANSTHELNKIFRDR 444 LS A +F A + +L +I++ Sbjct: 242 FGTQYQQVPVTIDEGVLSKIAGITGGKYFRATNNEKLKEIYQQI 285 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 74/228 (32%), Gaps = 27/228 (11%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + +D L+ + + + + ++ + I Sbjct: 75 QTYNAWDDKDTEGIDIMLTMDVSASMLTEDVYPNRMVVAKEVASEFIS-------GRPND 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + L+ ++ D G TAI + A + Sbjct: 128 NIGLTIFAGEAFTQCPMTLDHAALL-NLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLK 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S K ++LLTDG N + + A+ GIR+ TI F + Sbjct: 187 DSKAKS-----------KIVILLTDGSNNAGSISPMTAAAIARKFGIRVYTIGFGKETGE 235 Query: 412 Q--EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + L + A + F+ A S EL++I++D E + R Sbjct: 236 EIGAIDYKTLQDIAVSTNGEFYRAQSQAELSRIYQDIDKLEKTKMNAR 283 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 60/192 (31%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L+ I + RMG F D + +I + + Sbjct: 98 VDRLTAVKQVLSEFIT-------KREGDRMGLILFADHAYLQTPLTLDRQTVISQLNQAV 150 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI + + A T I S ++ ++LL+DG NT + + Sbjct: 151 LKLI--GTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDPL 197 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCAS--PNSFFEAN 432 N AK I T+ T Q+ L AS +F A Sbjct: 198 EAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRAR 257 Query: 433 STHELNKIFRDR 444 + +L I+ Sbjct: 258 NQQDLQSIYDTI 269 >gi|86147474|ref|ZP_01065786.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] gi|85834767|gb|EAQ52913.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] Length = 460 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 42/400 (10%), Positives = 97/400 (24%), Gaps = 21/400 (5%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE--VS 76 G +++ + +P +L V G+ VD R + L A A I A+ + + + Sbjct: 13 RGLVALMSIIALPFILLVVGLSVDAGRAYIVKSKLFAAVDAASIAAARAVANGEDAGRAA 72 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 ++ S P N +F + + + + Sbjct: 73 AQKYFSANIPADFYSATPSLGAVNFAYDSFGNISIDISATAQVPTIFLPLIGLD-TFNPG 131 Query: 137 YDLLL--NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 P+ L L S + + + K + D + Sbjct: 132 VSAQSIRRPVDLVLVIDNTTSLRLGSIGDVTQDVIDRSKSFVENFHEGFDRISLVKFAFG 191 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 N + ++KS + + + ++L + Sbjct: 192 AEVPVGFNATRGHSRSSIKSEIDSFNFGSTSNAQYTNASEGMYRALNELRTVTDPANLKV 251 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 F D + + + +I+ D + R + Sbjct: 252 IVFFTDGAPNTFATTF---DFEGGGTHTGAIRSSDGSSGRPRGLWRH----DTIATTLPG 304 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDA-MQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + T + + + + + N + Sbjct: 305 GYEGRDIDDYISEIPEHYTAHDTNATEFKILNPTHAVRPVAQYAPNSHSANDLYTRV--- 361 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 N I KA+++ I + T+ + T Sbjct: 362 -----NRVARNLVEDIAEKARTEDIYVFTLGLGSSLTSAT 396 >gi|319638170|ref|ZP_07992933.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] gi|317400443|gb|EFV81101.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] Length = 530 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 42/352 (11%), Positives = 96/352 (27%), Gaps = 25/352 (7%) Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 T D++ E + ++ + L+ R +++ A+ Sbjct: 15 AALTACSGPLDRSDSSTENLHGAPDSALPATAVAEENLSLTENTERYQDQPDQPVKSVAQ 74 Query: 164 AETVSRSYHKEH-GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 + S + + S + +N F Sbjct: 75 EPVSTFSIDVDTGSYANVRRFLNSGKQPPKDAVRIEEIINYFPYNYPLPTDGRP-FAVHT 133 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 D P + P + VD S K LV+ L + Sbjct: 134 ETIDSPWQPEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMDEENKLPLVQKTLRILT 193 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + ++ D V ++ + G + + GST+ A Sbjct: 194 QQLRPQDKVT------LITYSSGEELVLPPTSGS---DKETILKAIDKLKAEGSTSGESA 244 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIR 399 ++ AY+ + N I+L TDG+ + + + + GI Sbjct: 245 LRMAYE----EAQKAFVPNGINR-----ILLATDGDFNVGVSDTETLKSMVAEKRKTGIS 295 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + T+ F + ++ A ++ ++ E K+ + ++ + + Sbjct: 296 LSTLGFGTDNYNEDMMEQI--ADAGDGNYSYIDNEKEAKKVLQQQLTSTLAT 345 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 57/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ + R+G F D + + + + Sbjct: 112 TAVKQVLSEFIA---KREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GT 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + + A T I SN ++ ++LL+DG NT + + N AK Sbjct: 167 QTAIGEGIGLATKTFIDSNA-----------PQRVMILLSDGSNTAGVLDPLEAANIAKQ 215 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T Q+ L A + +F A + +L I Sbjct: 216 YHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNI 275 Query: 441 FRDR 444 + Sbjct: 276 YDTI 279 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 58/187 (31%), Gaps = 32/187 (17%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A+ +++ R+G F ++ ++ + + + +T Sbjct: 115 AVKALLTDFIARRKG---DRVGMILFGEQAYLQTPLTFDLKTVQTMLDETTI-GLAGSSR 170 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI D + A + + ++LLTDG+N + A+ Sbjct: 171 TAIGDGIGLAVKRLR-----------ERDANNRVLILLTDGQNNTGALNPLQAAELAEHA 219 Query: 397 GIRIMTIAFSVN--------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 GI I TI + E L A + +F A T E+ +I Sbjct: 220 GITIYTIGVGADEMIVKNRFFGNRRINPSLELDEESLIAVAEKTGGRYFRARDTKEMEEI 279 Query: 441 FRDRIGN 447 + I Sbjct: 280 Y-QIIDE 285 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 58/175 (33%), Gaps = 32/175 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + R+G ++ + + ++R + + E G TAI + T+ + Sbjct: 126 KGRPSDRIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESG-TAIGSGLATSVNR 184 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + S + K I+LLTDG N + AK GI++ TI N Sbjct: 185 LKDSKAES-----------KVIILLTDGVNNSGFIDPKVASELAKEFGIKVYTIGVGTNG 233 Query: 410 ------------------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 Q E L A + +F A + +L I+ + Sbjct: 234 MALTPVGIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNNEKLEDIYDEI 288 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 58/183 (31%), Gaps = 33/183 (18%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G + + + ++ I + G T I + TA + + S Sbjct: 130 NDRIGLVLYASEAYTKTPVTSDKPIILEAIKGIRYDTVLQDG-TGIGMGLATAVNRLKDS 188 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + I+LLTDG N E + AK GI++ TI N + Sbjct: 189 KAKS-----------RVIILLTDGVNNAGFIEPETAADIAKQYGIKVYTIGLGTNGMAES 237 Query: 414 K------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + A + ++F A S +L +I+ + I + Sbjct: 238 PYAYAPNGGFLFKMQKVEIDERLMKSIAKKTDGTYFRATSNDKLAEIY-NSINKLETTEI 296 Query: 454 IRI 456 + Sbjct: 297 QEL 299 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 62/204 (30%), Gaps = 37/204 (18%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + +D ++ + R+ F ++ + Sbjct: 102 DMDDNGQRRSRIAVTKDVAMDFVKQ-------REGDRIALVLFGTHPYLQTPLTFDHPTV 154 Query: 320 IRTIVKTFAIDE----NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ I + + +TAI DA+ A + K ++LLT Sbjct: 155 MQHIYEAQLTMADDLQRGIHATAIGDAIGLAVKRLRD-----------IDAPDKTLILLT 203 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-------------ARYFLSNC 422 DG + + A +G++I TI + Q L + Sbjct: 204 DGSDNASQVAPLKAAQIAAREGLKIYTIGLGAEQRQASLLGFDFGFGKNREIDEKTLKDI 263 Query: 423 --ASPNSFFEANSTHELNKIFRDR 444 A+ +F A + EL +I++ Sbjct: 264 AKATDGRYFRARNPEELREIYQHI 287 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 58/195 (29%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + L++ I R+G F + ++ H + Sbjct: 106 DMRWQDDEISRLELIKKLFGDFIED-------RRGDRVGLILFGSQAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + + +VL+TDG N Sbjct: 159 RVWLDEAQI--GIAGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T A Q ++I TI + + L A + Sbjct: 206 TGGQIAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTG 265 Query: 426 NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 266 GEYFRARSSAELESI 280 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 58/195 (29%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + L++ I R+G F + ++ H + Sbjct: 106 DMRWQDDEISRLELIKKLFGDFIED-------RRGDRVGLILFGSQAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + + +VL+TDG N Sbjct: 159 RVWLDEAQI--GIAGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T A Q ++I TI + + L A + Sbjct: 206 TGGQIAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTG 265 Query: 426 NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 266 GEYFRARSSAELESI 280 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 78.1 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 58/195 (29%), Gaps = 34/195 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + L++ I R+G F + ++ H + Sbjct: 106 DMRWQDDEISRLELIKKLFGDFIED-------RRGDRVGLILFGSQAYLQAPLTFDRHTV 158 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + +TAI DA+ A + + + +VL+TDG N Sbjct: 159 RVWLDEAQI--GIAGKNTAIGDAIGLAVKRLRQRPAES-----------RVLVLITDGAN 205 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNK------------TQQEKARYFLSNCA--SP 425 T A Q ++I TI + + L A + Sbjct: 206 TGGQIAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTG 265 Query: 426 NSFFEANSTHELNKI 440 +F A S+ EL I Sbjct: 266 GEYFRARSSAELESI 280 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 63/179 (35%), Gaps = 17/179 (9%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + L +V ++ D RM F R F+ + L + +T Sbjct: 117 TPKQRLEAVKDVLRAFIAARDGDRMALIIFGTRAFVQAPFTEDLQSLNGFLEQTAV--GM 174 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 +TA+ DA+ T SS D ++ +++L+DG +T + + Sbjct: 175 AGPNTALGDAIGLGIRTFESSEVD-----------QRMMIVLSDGADTSSRMTPVIAASI 223 Query: 393 AKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 A +G+ I TI + L + A + +F A+ L +++R Sbjct: 224 AADKGVVIYTIGVGDPDATGEDRVDLDALKDIANKTQGQYFFADDEAALTEVYRQIDAQ 282 >gi|229526203|ref|ZP_04415607.1| protein BatA [Vibrio cholerae bv. albensis VL426] gi|229336361|gb|EEO01379.1| protein BatA [Vibrio cholerae bv. albensis VL426] Length = 318 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I SN Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDSN 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + + AK I T+ Sbjct: 186 A-----------PQRVMILLSDGSNTAGVLDPLEAADIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 67/181 (37%), Gaps = 14/181 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ L ++S K R+G F D F+ + + +T Sbjct: 131 ISRLDAAKEVLTEFVQSRKG-------DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTD 183 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S + ++ + +K ++LTDG +T E I Sbjct: 184 VA--MAGQSTHLGDAIGLAIKVFEQS-DKSRGALEQDQNREKVAIVLTDGNDTGSFVEPI 240 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 AK++G+R+ IA + + A S FEA + EL+ + + Sbjct: 241 DAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDE 300 Query: 444 R 444 Sbjct: 301 I 301 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 67/181 (37%), Gaps = 14/181 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ L ++S K R+G F D F+ + + +T Sbjct: 131 ISRLDAAKEVLTEFVQSRKG-------DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTD 183 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S + ++ + +K ++LTDG +T E I Sbjct: 184 VA--MAGQSTHLGDAIGLAIKVFEQS-DKSRGALEQDQNREKVAIVLTDGNDTGSFVEPI 240 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 AK++G+R+ IA + + A S FEA + EL+ + + Sbjct: 241 DAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDE 300 Query: 444 R 444 Sbjct: 301 I 301 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 71/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 1 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 60 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 61 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 117 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 118 GFSNRIKVRKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 174 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 175 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 233 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 234 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 269 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 67/181 (37%), Gaps = 14/181 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ L ++S K R+G F D F+ + + +T Sbjct: 131 ISRLDAAKEVLTEFVQSRKG-------DRLGLVLFGDAAFVQTPFTVDQKVWLELLNQTD 183 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S + ++ + +K ++LTDG +T E I Sbjct: 184 VA--MAGQSTHLGDAIGLAIKVFEQS-DKSRGALEQDQNREKVAIVLTDGNDTGSFVEPI 240 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 AK++G+R+ IA + + A S FEA + EL+ + + Sbjct: 241 DAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDE 300 Query: 444 R 444 Sbjct: 301 I 301 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 50/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 40 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARI--GIAGKNTALGDAIGLALKRLR--- 94 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 95 --------MRPATSRALVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 146 Query: 413 ----------EKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A S +F A +L KI Sbjct: 147 QSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 186 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 60/217 (27%), Gaps = 10/217 (4%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y ++ V +R ++ + + Sbjct: 277 DGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDHFNI 333 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F++R+ V K + + G T IN A+Q + N+ H Sbjct: 334 VGFSNRIKVWKDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLL---NDYVAH 390 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKAR 416 + +V LTDG+ T + I N + + I T+ + + + Sbjct: 391 NDIEDRSVS-LVVFLTDGKPTVGETHTLKILNNTREAARGRVCIFTVGIGADVDFKLLEK 449 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E + F D I + + Sbjct: 450 LSLENCGLTRRVHEDHDARAQLIGFYDEIRTPLLSDI 486 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I SN Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDSN 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 186 A-----------PQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKLNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 65/194 (33%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + + RMG F+ + + H ++ ++ Sbjct: 112 DRLTAAKNVASDFV-------KGRPGDRMGLVIFSGETFTQVPLTTD-HGVMLNMLAEMK 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G TAI D + TA + S K ++LLTDG N + + Sbjct: 164 NGLIDDG-TAIGDGLATAISRLKDS-----------EAISKVVILLTDGMNNAGSVDPYT 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AK GIR+ TI T E L++ AS +F Sbjct: 212 AAEIAKLYGIRVYTIGVGSYGTAPYPVQTPFGTQIQQMKVEIDEKLLASVASMTGGKYFR 271 Query: 431 ANSTHELNKIFRDR 444 A S +L++I+ + Sbjct: 272 ATSNQKLDEIYEEI 285 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 77.7 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 63/178 (35%), Gaps = 20/178 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + + +V + R+G F R ++ V + + I + Sbjct: 111 NSLVSRITAVKAIAADFASRRTGDRVGLILFGTRAYVQAPLTFDVKTVKQFIEEAQL--G 168 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TAI DA+ A + D + ++LLTDG++T + + Sbjct: 169 FAGEDTAIGDALGLAVKRLRERPADS-----------RVLILLTDGQDTASTVDPMEAAA 217 Query: 392 KAKSQGIRIMTIAFS-----VNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFR 442 A ++I TI S + + E L+ A+ +F A + EL I++ Sbjct: 218 LASEMNVKIYTIGISRRLGTSSNSSGEVDEALLTAIAQATGGRYFRARTPKELQDIYQ 275 >gi|62531155|gb|AAH92555.1| LOC594926 protein [Xenopus (Silurana) tropicalis] Length = 895 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 7/166 (4%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K +D+V + F+ V K + + G T INDA+ Sbjct: 300 LKILDDVKEHDHFNFVIFDWGVEIWEQSLVKATPENLNRAKAYVRNLYPKGWTNINDALL 359 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 +A + +++ ++ I+ +TDG+ + I A++ + Sbjct: 360 SAISLLDQAHDARSVPKRSAS----LIIFMTDGQPSTGERNLDKIQENARNAIRGKYSLY 415 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ F V + L N +E + + F D + N Sbjct: 416 SLGFGVGVDYPFLEKLSLENSGVARRIYEESDAALQMEGFYDEVAN 461 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 70/210 (33%), Gaps = 15/210 (7%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 K + ++L+K +G+F ++TALL+PV+ G + +++ + ++ A A I A Sbjct: 3 KVLTAMQRLLKDQSGNFGLMTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIKAV 62 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + E + R + + F N + TD AVE Sbjct: 63 RSYGEGESENAVRTEANRLFFA------------NFQTPSATDGYNSASPESPAVEFTFS 110 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 ++ + S+ Y NP+ L+ I + + I+ Sbjct: 111 ETGQETRASASYAAQYNPVFWGLQPFVISRRSVAARLTDREACILALHPTA---SRAIEV 167 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 S S + + Q + + Sbjct: 168 SGSAAVDTSNCTITSNSDDAQSIYLSGTAT 197 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 67/181 (37%), Gaps = 14/181 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ L ++S K R+G F D F+ + + +T Sbjct: 131 ISRLDAAKEVLTEFVQSRKG-------DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTD 183 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S + ++ + +K ++LTDG +T E I Sbjct: 184 VA--MAGQSTHLGDAIGLAIKVFEQS-DKSRGALEQDQNREKVAIVLTDGNDTGSFVEPI 240 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 AK++G+R+ IA + + A S FEA + EL+ + + Sbjct: 241 DAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDE 300 Query: 444 R 444 Sbjct: 301 I 301 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 63/235 (26%), Gaps = 43/235 (18%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL----RHVIKKKHLVRDALASVIRSIKK 287 ++ + + + +D+S + + + + S K Sbjct: 81 FVFAEPYNNSIDPPKIDYKNGIDIILSIDASGSMLAQDFDPNRLEVAKRVAKKFVDSRKG 140 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 R+G + + + L I TAI + A Sbjct: 141 -------DRVGLVVYEGEAYTACPATLDYKLLKEQISAIEPGHLEP--GTAIGSGLGVAV 191 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + ++ K I+LLTDG + E + + AK++ R+ TI Sbjct: 192 TRLR-----------SDSLISKVIILLTDGSSNTG-PEPLEVAELAKAKKCRVYTIGVGA 239 Query: 408 NKTQQEK----------------ARYFLSNCASP--NSFFEANSTHELNKIFRDR 444 + L AS +F A L KI+ + Sbjct: 240 DGMAPTPVNTPFGVVYQNLPVEIDEGVLKEIASATNGKYFRAQDEKSLEKIYAEI 294 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 57/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ R+G F D + +++ I +T Sbjct: 125 TAVKKVLSDFVAKRKG---DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVI--GLVGQ 179 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + T + S ++ ++LL+DG NT E + AK Sbjct: 180 RTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEPLEAAEIAKK 228 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T + L+ A + +F A T +L KI Sbjct: 229 YNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEMTGGQYFRARDTDQLEKI 288 Query: 441 FRDR 444 + Sbjct: 289 YDTI 292 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 45/407 (11%), Positives = 106/407 (26%), Gaps = 58/407 (14%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + L+R+ + + + R + Q Sbjct: 374 PDTDGDGLSDGAEVGEYRSNPLVRDTDGDGISDGEAVAQGIDPTGIEENPAVRTQSVQTA 433 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L S L + +S+ + G + + + + + S + +S I S Sbjct: 434 LESTEKPLFDNVSVTMNVGGNLNQHLSIRNIEKEDTLSANVVGALSAPIEITSDASFTSA 493 Query: 193 Q-RDSEGQPLNCFGQPADRTVKSYSSQNGKV-----GIRDEKLSPYMVSCNKSLYYMLYP 246 + + + L + V Y +N + + + + L Sbjct: 494 KITFTYDEALLGDTPAENLAVMWYDRENRRYVILDKDTVVDPNAHTVSYTTTHFSTYLVV 553 Query: 247 GPLDPSLSEEHFVDSSSLRH--------------------VIKKKHLVRDALASVIRSIK 286 +D S + + AL + I ++ Sbjct: 554 DREVWYDCWRENIDYRSGDAGTSQLEPYDIGLCVDVSGSMYGDRLEKAKTALNTFIDAML 613 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 DN F+D + + ++++ ++ T + + Sbjct: 614 PQDN------ACMVSFSDNAYLVAG-----YGASKEVMRSRTNQLRDLYGTNTDVGLSKT 662 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + +A K I+++ DG+ + AK+ GI + TI Sbjct: 663 ISILADQGR---------SDASKMIIMICDGDVNYI----QGTVDAAKAAGIAVYTINVV 709 Query: 407 VNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 L A + ++ A +T E+ G + Sbjct: 710 SGDN------DLLQKIADETGGEYYYAATTEEVVSQVEAIRGETVSA 750 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I SN Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDSN 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 186 A-----------PQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 73/217 (33%), Gaps = 27/217 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + S + +D L+ + + + + + ++ + I Sbjct: 75 QSYSAWDNKDSEGIDIMLAMDISASMLTNDVIPNRLEVAKEVASDFIS-------GRPND 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + LI ++ + D G TAI + A + Sbjct: 128 NIGLTIFAGEAFTQCPLTTDHASLI-NLLNSVRTDLVVKGLIQDGTAIGMGLINAVGRL- 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--NK 409 K++ K ++LLTDG N + + AK IR+ TI N Sbjct: 186 ----------KSSKAKSKVVILLTDGSNNVGSISPMTAAEIAKKFNIRVYTIGLGTEQNN 235 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + F+ A S EL++I++D Sbjct: 236 GYSDIDYTTLRQIANVTNGKFYSAQSQTELSQIYKDI 272 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 60/200 (30%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + V+ L+ + R+G F D + + Sbjct: 113 DMNDNGEYIDRLTAVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLTADRKTV 165 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ I +T TAI D + T + S ++ ++LL+DG N Sbjct: 166 MQQINQTVI--GLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSN 212 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + + AK I T+ T + L+ A + Sbjct: 213 TAGVLDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMT 272 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A EL I+ Sbjct: 273 GGKYFRARDAKELETIYDTI 292 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 51/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I S Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + + AK I T+ Sbjct: 185 ----------DAPQRVMILLSDGSNTAGVLDPLEAADIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 57/179 (31%), Gaps = 24/179 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + VR +A + R+G F D F+ + + + Sbjct: 121 RVDAVRQVVADFVARRTG-------DRIGLIVFGDAPYPLAPFTLDHALVRE--LLADMV 171 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 ST++ DA+ S +K ++LLTDG +T Sbjct: 172 PGMAGASTSLGDAIGLGIKMFDQS-----------HAQEKVMILLTDGNDTASRMPPAQA 220 Query: 390 CNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 + AK++G+ + T+ +Q+ L + AS +F L I+ Sbjct: 221 ADIAKTRGVVVHTVGIGDPATTGEQKVDLDALKHIASTTGGRYFFGADQTSLASIYATL 279 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 77.7 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 71/211 (33%), Gaps = 41/211 (19%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +D A I +G T F + + L+ +++T Sbjct: 106 TPNRMEAAKDVAAEFIAD-------RPNDNIGLTIFAGEAFTQCPMTTDHTSLL-NMLQT 157 Query: 327 FAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 D G TAI + A + S K ++LLTDG N Sbjct: 158 VRTDIAAKGLIQDGTAIGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNMG 206 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--S 424 + + N AKS GIR+ TI NK + E L + A + Sbjct: 207 DLSPMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAATT 266 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 +F+ A + EL +I++D E + ++ Sbjct: 267 DGNFYRATNNQELKQIYKDIDKLEKSKMSVK 297 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I SN Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDSN 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 186 A-----------PQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 51/160 (31%), Gaps = 27/160 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ H + + + +TAI DA+ A + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQI--GIAGKNTAIGDAIGLAVKRLRQRP 191 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 + + +VL+TDG NT A Q ++I TI + Sbjct: 192 AES-----------RVLVLITDGANTGGQIAPQIAAQLAAEQQVKIYTIGIGADPQQGGV 240 Query: 410 -------TQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A + +F A S+ EL I Sbjct: 241 PGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 40/276 (14%), Positives = 70/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDHFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F++R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 53/173 (30%), Gaps = 32/173 (18%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + R+G + + + ++ I + + G T I + TA + + Sbjct: 127 RQSDRIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYDNVLQDG-TGIGMGLATAVNRLK 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S K I+L+TDG N E + AK GI++ TI N Sbjct: 186 DSKAKS-----------KVIILMTDGVNNAGFIEPVTAAEFAKEFGIKVYTIGIGTNGNA 234 Query: 412 QEK------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + A + +F A S L I+ + Sbjct: 235 PFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGKYFRAQSNSSLESIYSEI 287 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 60/196 (30%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A I N R+G + + + ++ ++ Sbjct: 112 DRLQALKQVAARFI-------NGRPNDRIGLVEYAGESYTKTPLTSDKTVVLSSLNSIEY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G TAI + TA + + K + K I+LLTDGEN + Sbjct: 165 NSII-EGGTAIGMGLATAVNRL-----------KESTAKSKVIILLTDGENNSGFIDPKI 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A GI++ TI N L A + + Sbjct: 213 ASELAVEFGIKVYTIGLGTNGMASSPIGILPNGRFQYGNQPVKIDETLLKEIAKTTGGQY 272 Query: 429 FEANSTHELNKIFRDR 444 F A S +LN+I+ + Sbjct: 273 FRATSNTKLNEIYEEI 288 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 59/196 (30%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ +S I N R+G + + + ++ + Sbjct: 112 NRLEALKKVASSFI-------NGRPNDRIGLVEYAGESFTKTPITSDKSIVLSALKGIQY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G TAI + T + I S K I+L+TDGEN + Sbjct: 165 NSII-EGGTAIGMGLATGVNRIKDSKALS-----------KVIILMTDGENNAGQIDPRI 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A+ GI++ TI N T L A + + Sbjct: 213 AAELAQEFGIKVYTIGMGTNGTALSPYARNPNGTFVYENIQVTIDEELLEEIAETTGGQY 272 Query: 429 FEANSTHELNKIFRDR 444 F A + +L +I+ + Sbjct: 273 FRATNNKKLQEIYDEI 288 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 51/434 (11%), Positives = 124/434 (28%), Gaps = 32/434 (7%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 D+ R S + L+ A +A + S+ L + S + + + Sbjct: 17 DLGRVSVEQSRLQNAVDSAALAGSLQLPDDPD--VSTGAVTAAATQNLLANDADATGILV 74 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 T + ++ +++ N + L + S Sbjct: 75 ESGGATRSVCVSAEAKVEMTLSQVIGIGDQTVTAEACAGYNDIELVMVLDATGSMKGTPI 134 Query: 162 AEAETVSRS-YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 A + + + + S ++ +Q + A+ +N Sbjct: 135 ANVKEAATNLVNLIMPSSSSTSTRSKIGLVPFQGKVRIDGNDPVTAEANPDGVGPGCRNA 194 Query: 221 KVGIRDEKL-SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + + KL + Y + K+ + Y + S++ S I+ + A+ Sbjct: 195 DGTLNNGKLRTEYSKTTTKTNIFYGYTLSGVSTTSDKTC----SGMSPIRALSSDKSAIL 250 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 S I ++ + T+ + V++ + + + + + Sbjct: 251 SNITALNAGQVTSGTIISEGIKWGRHVLTPTAPYVEGSTDTKVRKIMIVLTDGDTEDGRC 310 Query: 340 NDAMQTAYDTIISSNEDEVHRMK--NNLEAKKYIVLLTD-------------GENTQDNE 384 + +A TI + + + A Y L T G Sbjct: 311 GGSYASASKTINTYWTNAYFGQGLKPDSSASPYSTLSTAALTLAQIPDCKDGGLLNTYVV 370 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFFEANSTHELNK 439 + + I I +I F + + AS + +F+A + + Sbjct: 371 NEATLAKTDANYPIEIFSIRFG---DSDSTDKNLMKQIASSKPGTEDHYFDAPDEAGIKE 427 Query: 440 IFRDRIGNEIFERV 453 +F+ IG ++ +R+ Sbjct: 428 MFKK-IGQQLGQRL 440 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 66/191 (34%), Gaps = 24/191 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ +++ V+D + + + + R+ F + F+ + + Sbjct: 112 MTAPDGTRLQRLQAVKDVVGAFVAE-------REGDRISLIVFGAKPFIQAPFTEDLDSV 164 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +TAI DA+ A + S ++ ++LL+DG + Sbjct: 165 --VELLNQVQTGMAGPNTAIGDAIGLAIRSFEDS-----------EIEERLLILLSDGAD 211 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--ARYFLSNCA--SPNSFFEANSTH 435 T I A +GI I TI E+ L + A +F+ A+ Sbjct: 212 TASTMTPINAAQIAAQEGITIYTIGVGNPDGSGEERLDPATLEDIATRGGGAFYFADDVE 271 Query: 436 ELNKIFRDRIG 446 L++I+ + Sbjct: 272 GLSEIYAEIDA 282 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 56/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ + R+G F D + + + + Sbjct: 112 TAVKQVLSEFIA---KREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GT 166 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + + A T I S ++ ++LL+DG NT + + N AK Sbjct: 167 QTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDPLEAANIAKQ 215 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T Q+ L A + +F A + +L I Sbjct: 216 YHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNI 275 Query: 441 FRDR 444 + Sbjct: 276 YDTI 279 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 39/284 (13%), Positives = 68/284 (23%), Gaps = 15/284 (5%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS-----YSSQNGKVGIRDEKLSPY 232 + S V+ + D Sbjct: 16 EASPAMRNYDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQS 75 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + Y ++ V +R ++ + + Sbjct: 76 IGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP-- 133 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R F++R+ V K + + G T IN A+Q A + Sbjct: 134 -QDRFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL-- 190 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNK 409 N+ H + IV LTDG+ T + I N + + I TI + Sbjct: 191 -NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDV 248 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + L NC E F D I + + Sbjct: 249 DFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 292 >gi|255033973|ref|YP_003084594.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254946729|gb|ACT91429.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 625 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 52/442 (11%), Positives = 122/442 (27%), Gaps = 38/442 (8%) Query: 23 FIIT-ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 ++ A + G + D + A Q + + + R Sbjct: 10 AMLFSAFAFLPERLIKGKVTDGANALIAGVRV--AIQGSPVGTATDQGGHFTIRVPRLGR 67 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + F K + + +++ K + E+ + V +D+ Sbjct: 68 ALVFSKSGYKALTLPITSDSVMKVVLTQASVKKKDIAREELKDSEQPVMVRKMFTFDMAA 127 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID-----------FSRSML 190 P + + + K E S ++ + +D + M Sbjct: 128 APSTHSETILAMPQATESYKPINENGFLSVGQQPVTTFSVDVDRAAYSNVRRFLNNGQMP 187 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F + D +P + + L Sbjct: 188 PEDAVRIEEMINYFDYDYPQPR-GEHPVAIVAETTDSPWNPGLKLVHIGLQAKTVSAENL 246 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + +D S + K L++ A + ++ D ++ + Sbjct: 247 SASNLVFLIDVSGSMNEANKLPLLKQAFKLLADQLRVEDKIS------IVAYAGSAGMVL 300 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + G + +K GSTA + ++ AYD + NN Sbjct: 301 APTSGSE---KKTIKDALDKLEAGGSTAGGEGIELAYDL----AKKHFLPKGNNR----- 348 Query: 371 IVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ + E + + + GI + + F + + + Sbjct: 349 VILATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVETLADK--GNGN 406 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 + ++ E K+F G + Sbjct: 407 YAYIDNIQEARKVFVQEFGGTL 428 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 63/243 (25%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D + Y ++ V Sbjct: 208 QNGILGDFIIRYDVNREQSIGDIEVLNGYFVHYFAPRDLPPLPKNVVFVLDSSASMVGTK 267 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +R ++ + + F++R+ V K + + Sbjct: 268 LRQTKDALFTILHDLRP---QDHFSIIGFSNRIKVWKDHLVSVTPDNVRDGKIYIHHMSP 324 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN A+Q A + N+ H + IV LTDG+ T + I N Sbjct: 325 TGGTDINGALQRAIKLL---NDYVAHNDIEDRSVS-LIVFLTDGKPTVGETHTLKILNNT 380 Query: 394 KSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K + I TI + + + L NC E F D I + Sbjct: 381 KEAARGQVCIFTIGIGDDVDFKLLEKLSLENCGLTRRVLEEEDAGSQLIGFYDEIRTPLL 440 Query: 451 ERV 453 + Sbjct: 441 SDI 443 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 68/181 (37%), Gaps = 14/181 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ L+ ++S K R+G F D F+ + + +T Sbjct: 131 ISRLDAAKEVLSEFVQSRKG-------DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTD 183 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S + ++ + +K ++LTDG +T E I Sbjct: 184 VA--MAGQSTHLGDAIGLAIKVFEQS-DQSRGALEQDQNREKVAIVLTDGNDTGSFVEPI 240 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 AK++G+R+ IA + + A S FEA + EL+ + + Sbjct: 241 DAAKVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDE 300 Query: 444 R 444 Sbjct: 301 I 301 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 60/192 (31%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L+ I + R+G F D + +I + + Sbjct: 108 VDRLTAVKQVLSEFIT-------KREGDRVGLILFADHAYLQTPLTMDRQTVISQLNQAV 160 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI + + A T I S ++ ++LL+DG NT + + Sbjct: 161 LKLI--GTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDPL 207 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCAS--PNSFFEAN 432 N AK I T+ T Q+ L AS +F A Sbjct: 208 EAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRAR 267 Query: 433 STHELNKIFRDR 444 + +L I+ Sbjct: 268 NQQDLQSIYDTI 279 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 66/473 (13%), Positives = 141/473 (29%), Gaps = 90/473 (19%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 T + L + G I ALL ++L + +D+ + A + Sbjct: 1 MYTGIKNRIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSL 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + S+ + E R F K+ + ++ V+ Sbjct: 61 FAVTTFRKYVADGMSKNQARKR------AETDARKFLTARTKSLDGTTEKFSIKINIVDR 114 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V +S +++ + F A++ S + ++ Sbjct: 115 EAKVVKANVNISGKHESYMTHAMGF----------DNIDYTADSESTISFGQGKYEFIFL 164 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM + + Q + + Sbjct: 165 VDVSPSMGIGASNRDRQIMQRAIGCQFACHE----------------------------- 195 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 V + + +V+DAL S++ +++ V +R G F Sbjct: 196 -----------PWYSSVSRAKSAGARLRIDVVKDALKSLVTQLEEATEV--DLRTGLYSF 242 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++ + + G+ K R K E G T + + S + + Sbjct: 243 SNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTN--------FHGVFSDFNGVLRSL 294 Query: 362 KNNLEAKKYIVLLTDGENT--------------QDNEEGIA------ICNKAKSQGI-RI 400 K + K++I++++DG N N C++ K + + Sbjct: 295 KPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFKKGEVRTV 354 Query: 401 MTIAFSVN-KTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFE 451 T+ + + + CA S + F+ ANS E++K F+D + Sbjct: 355 HTMLVEPDRAHYVRASTSSMRACATSADFFYSANSAAEIDKAFKDLFEALLKS 407 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 71/211 (33%), Gaps = 41/211 (19%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +D A I +G T F + + L+ +++T Sbjct: 106 TPNRMEAAKDVAAEFIAD-------RPNDNIGLTIFAGEAFTQCPMTTDHTSLL-NMLQT 157 Query: 327 FAIDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 D G TAI + A + S K ++LLTDG N Sbjct: 158 VRTDIAAKGLIQDGTAIGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNMG 206 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--S 424 + + N AKS GIR+ TI NK + E L + A + Sbjct: 207 DLSPMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAAST 266 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 +F+ A + EL +I++D E + ++ Sbjct: 267 DGNFYRATNNQELKQIYKDIDKLEKSKMSVK 297 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 51/165 (30%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I S Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDS- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 185 ----------DAPQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 57/181 (31%), Gaps = 24/181 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +V+ +A I R+G F + L + T Sbjct: 117 VSRWTVVKAVVADFID-------KRTDDRLGLIVFGTGAFPQAPLTRDHKSLRLLLDHTA 169 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +TAI DA+ + S+ E K ++LLTDG +T + Sbjct: 170 V--GMAGPNTAIGDAIGMGIRMLDSAQER-----------DKVLILLTDGNDTGSAVPPL 216 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNC--ASPNSFFEANSTHELNKIFRD 443 N A + + TI + F L AS FF+A L+ ++ Sbjct: 217 RAANLAAQHHVTVHTIGIGSPTASGDDQVDFDTLRGISSASGGQFFQAQDGAALHDVYAT 276 Query: 444 R 444 Sbjct: 277 L 277 >gi|34783791|gb|AAH56811.1| Zgc:112265 protein [Danio rerio] Length = 927 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 52/354 (14%), Positives = 98/354 (27%), Gaps = 36/354 (10%) Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK---EHGV 177 + K++ V +S+ L L R +G LI + E+ Sbjct: 126 TLEDFKTSVTVAANSKVTFELTYEELLKRRLGKYELLINAQPMQPVADFKIDVHIQENPG 185 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV--KSYSSQNGKVG------------ 223 + + D + + + N Sbjct: 186 ISFLEVKGDLNTGDLASAVKTTRADKDAWVTFYPTRDQQTKCTNCAENGLNGDLIITYDV 245 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 R +S ++Y + +D S H +K R AL ++++ Sbjct: 246 NRGNPKGEVQISNGYFVHYFAPSDVPHIPKNVVFIIDRSGSMHG-RKIRQTRSALLTILK 304 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + D+ G F+ + K R ++F + G+T INDA+ Sbjct: 305 DLDEDDH------FGLITFDAEIDFWRRELLQATKANRENAESFVKRIQDRGATNINDAV 358 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRI 400 D I R ++LLTDG+ T I K + Sbjct: 359 LAGVDMIN-----RNPRKGTAS----ILILLTDGDPTAGETNIEKIMANVKEAIGSKFPL 409 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + F + + L N A +E + + F D + + + Sbjct: 410 YCLGFGYDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVAVPLLTDIQ 463 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 60/200 (30%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + V+ L+ + R+G F D + + Sbjct: 113 DMNDNGEYIDRLTAVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLTADRKTV 165 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ I +T TAI D + T + S ++ ++LL+DG N Sbjct: 166 MQQINQTVI--GLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSN 212 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + + AK I T+ T E L+ A + Sbjct: 213 TAGVLDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASELDEQTLTKIAEMT 272 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A EL I+ Sbjct: 273 GGKYFRARDAKELETIYDTI 292 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 56/194 (28%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + I ++ +G F + + L + Sbjct: 108 TRLEAAKSVATEFI----LSRPNDN---IGLVIFARESFTQCPLTTDHAVL--VNLFNGV 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI + A + I K I+LLTDG N + I Sbjct: 159 NNGMIEDGTAIGLGLANAVNRIKD-----------GKSKSKVIILLTDGSNNSGDIAPIT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNK----------------TQQEKARYFLSNCA--SPNSFFE 430 AK+ GIRI TI + Q E L N A + +F Sbjct: 208 AAEIAKTFGIRIYTIGVGTHGVINIPVSTPMGIQYQRVQSEFDAKSLENIANLTGGKYFG 267 Query: 431 ANSTHELNKIFRDR 444 A +L I+++ Sbjct: 268 ATDNSKLRNIYQEI 281 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 50/154 (32%), Gaps = 17/154 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 RM F + + +I + +T TA+ DA+ + T +S Sbjct: 139 DRMALIVFGTSAYLQAPLTDDLETIIALLDRTEV--GMAGPHTALGDAIGLSIRTFETSE 196 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQ 412 D ++ ++LL+DG +T + A +G+ I TI + Sbjct: 197 ID-----------QRLLILLSDGSDTASRMSPVNAAEIAADRGVEIYTIGVGDPDATGEN 245 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F A L ++ Sbjct: 246 RVDLTTLKEVAQRTGGQYFFAEDAASLEAVYDRI 279 >gi|327261941|ref|XP_003215785.1| PREDICTED: hypothetical protein LOC100567114 [Anolis carolinensis] Length = 1225 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 53/348 (15%), Positives = 98/348 (28%), Gaps = 23/348 (6%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + V + L L R +G I + + + S Sbjct: 126 VSTSVEAGGKVAFELTYEELLQRHLGKYQHAINIRPQQVVRNLSVEVTISERTGIDYVHV 185 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + + P+ R K ++ + + S + + Y Sbjct: 186 LPLRTSRLLTNAVRGDADVPPSTRVEKGTHCAWIVFNPTPQEQAAFSSSGIMGDFVVQYD 245 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKK------------KHLVRDALASVIRSIKKIDNVNDT 294 + + + + + + + + ++ KK +V + Sbjct: 246 VAMPDVAGDVQIYNGYFVHYFAPRGLPPVQKDVVFVIDISGSMYGTKMKQTKKAMHVILS 305 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTI------VKTFAIDENEMGSTAINDAMQTAYD 348 FFN SD W + I+ K + G T IN A+ A Sbjct: 306 DLHQDDFFNIVTFSDTVNVWKPSQSIQATPQNIKKAKDYVSKMEADGWTDINAALLAAAS 365 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 S+ M++ I+ LTDGE T G I + A+ I + +AF Sbjct: 366 VFNHSSPMAGKIMRDQRIP--LIIFLTDGEPTSGVTTGSRILSNAQQALKGTISLFGLAF 423 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + R L N +E K F D I + + V Sbjct: 424 GDDADYGLLRRLSLENRGVARRIYEDADATLQLKGFYDEIASPLLYDV 471 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 67/187 (35%), Gaps = 24/187 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D I + V+ + + + R+G F + ++ V + Sbjct: 105 DMVYNNRRITRLMAVKKVVGDFVAR-------RQSDRLGLVLFGTQAFLQAPLTFDVKTV 157 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++ +TAI DA+ + + AK+ I+LLTDGEN Sbjct: 158 QEMLIEAE--SGYAGEATAIGDAIALSIKRLREQ-----------PNAKRVIILLTDGEN 204 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 T + A +I TIAF + +E + + A + FF A +T +L Sbjct: 205 TAGELGIATATDLAVKANTKIYTIAF--SPYDREVDSHSMQQIAEQTGGEFFRARNTRDL 262 Query: 438 NKIFRDR 444 +I R Sbjct: 263 EEIHRQL 269 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 35/299 (11%), Positives = 74/299 (24%), Gaps = 30/299 (10%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + Q S ++ + P V +S N + K Sbjct: 193 DDDEVLDHQPESTERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPL 252 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKID 289 N + V + + K L++ A+ VI+S+ D Sbjct: 253 TSGRQNSGTNQTNMQPTSQSCRAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQSLGPCD 312 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F+ + ++ G T I + ++ Sbjct: 313 ------RLSVISFSSTARRLFPLRRMTDTGRQQALQA-VNSLVSNGGTNIAEGLRKGAKV 365 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK----------AKSQGIR 399 ++ I+LL+DG++T ++ G Sbjct: 366 MLDRKW---------KNPVSSIILLSDGQDTYTVCSPGGAHSRTDYSLLLPFSIHRNGGT 416 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 I + + + S +F + + F IG + V + Sbjct: 417 GFQIPVHAFGFGTDHDATSMHAISETSGGTFSFIEAEGVIQDAFAQCIGGLLSVVVQEL 475 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 53/161 (32%), Gaps = 28/161 (17%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F D + + + + + TAI +A+ + Sbjct: 131 LILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALSVKRFD------- 181 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN---------- 408 E+ + +VLLTDG N + + A +GI+I +I + Sbjct: 182 ----LVDESNRILVLLTDGSNNSGSISPEQAADIAAKRGIKIYSIGVGADVMERRTLFGK 237 Query: 409 ---KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L++ A + +F A + EL +I+++ Sbjct: 238 ERVNPSMDLDEEQLTSLAQTTGGRYFRARNAQELEQIYQEI 278 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 59/418 (14%), Positives = 116/418 (27%), Gaps = 44/418 (10%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I ++ ++ L V+S + ++ F+ + + E V + + Sbjct: 15 AILLAISMVVGLVPVNSITSYAKDKSAGNVKSQPRAAFDEESVSDESSLETWTKVVENST 74 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM-GIKSWLIQTKAEAETVSRSYHKEHGVSI 179 E R + V L + + G +L+ A + T + S + + I Sbjct: 75 ENIGRIWVDKTVSDKNIKLPASSQGKEIEIDKGTSDFLVGLSALSSTSNISTVADKPLDI 134 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 +V+D S SM D N K+ + Sbjct: 135 VFVLDTSGSMSDPMEYIYSPTYNVVTDGRVEYYAEVEGNYVKIDRITGLFGFFKHWEVAG 194 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTV 295 D + F + K ++ A+ + I + Sbjct: 195 KEVTPKKNESDT--NGIQFYTRREKPNSQSKMGALKIAVNQFAQETAKRNDSITDAAKQH 252 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 RM F+ S + + + N G+T N M+ A +++ + Sbjct: 253 RMSIVTFSSESYIRQSLK-AYNSNTVSEFERTINGLNANGATYANLGMEKAKESLKNV-- 309 Query: 356 DEVHRMKNNLEAKKYIVLLTDGE------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 +A+K ++ TDG + I K +I +I Sbjct: 310 --------REKAQKVVIFFTDGTPGRSGFDDDTANNTIQAAKSLKDDLTKIYSIGVFDQA 361 Query: 410 TQQEKARYF-------------------LSNCASPNSFFE-ANSTHELNKIFRDRIGN 447 + F L A +++++ A ELNKIF + Sbjct: 362 NPDNTSSSFNAYMHGVSSNYPNATKWTELGERAENSNYYKAAQDADELNKIFEEIFEE 419 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 61/197 (30%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + A I N +G T F + + + L +++ Sbjct: 108 NRIEAAKQVAAEFI-------NGRPNDNIGLTIFAGEAFTQCPLTVDHGVLLNLFQSVSC 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 TA+ + A + S K ++LLTDG N + + Sbjct: 161 DMVQRGMIEDGTALGMGLANAVSRLKDSKAKS-----------KVVILLTDGVNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIR+ TI N T E +S A + Sbjct: 210 PLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQMPVEIDEQTMSQIAGTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++R+ Sbjct: 270 YFRATSNTKLKEVYREI 286 >gi|47218290|emb|CAG04122.1| unnamed protein product [Tetraodon nigroviridis] Length = 993 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 60/184 (32%), Gaps = 15/184 (8%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS--FSWGVHKLIRTIVKTFAIDEN 332 ++AL +++ ++ D R F+ R+ +R K + Sbjct: 328 KEALLTILGDLRPAD------RFNFISFSSRIRVWQPGRLVPATPSAVRDAKKFVVMLPT 381 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T I+ A+QT + + I+ LTDG+ T AI Sbjct: 382 SGGGTDIDGAIQTGSSLLRDHLSGRDAGPNSVS----LIIFLTDGQPTVGEVRPGAILGN 437 Query: 393 AKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 A++ I TI + + R L NC E + K F D IG + Sbjct: 438 ARAAVRDKFCIFTIGMGDDVDYRLLERMALDNCGMMRRIPEEADASSMLKGFYDEIGTPL 497 Query: 450 FERV 453 + Sbjct: 498 LSDI 501 >gi|326797316|ref|YP_004315136.1| hypothetical protein Marme_4100 [Marinomonas mediterranea MMB-1] gi|326548080|gb|ADZ93300.1| hypothetical protein Marme_4100 [Marinomonas mediterranea MMB-1] Length = 528 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 57/514 (11%), Positives = 122/514 (23%), Gaps = 75/514 (14%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV- 66 ++S +G L++ L +D R LK+A A + Sbjct: 14 RNALNTFLRSDSGAALPFIVLILVGALVGFSFALDTTRMVNTAGQLKRATDAAALAIGQI 73 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L + ++ + + + + + + F + D V + Sbjct: 74 QLRNNNDDETDLDSIAQGYVLNNLGMDSGLIDQIDTTSIFVTQGTNDGHPTFTVTVTLNT 133 Query: 127 SAYQVVL-------SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + SS +++ P + L + + +S+ + Sbjct: 134 QSDLLNAQTEDQVISSTVEVVSTPTEVALLLPNTLTEDEPELVALRKLGKSFARNLLGED 193 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS------------SQNGKVGIRDE 227 D S S AD + K G Sbjct: 194 T---DASNSAQKVWLSLVPFSQAVNVYDADDPERISRWAAAGALNPPELRSLFKTGKVRS 250 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 P L M + + + DS + A+ I + Sbjct: 251 LADPRFPDRVAKLLCMYRGLGAGENFNWDQQPDSQFGVYYRHDLPQNGSPGATPISWVGP 310 Query: 288 IDNVNDTVRMGATFF--NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ++ + + D+ + + L + + + + AM Sbjct: 311 NPSLWPSSVAEDVRWIVADKGCPNAPLLPLTNDLD--AIDARLDEMSTRFNVNYAIAMGW 368 Query: 346 AYDTIISSNEDEVH--------RMKNNLEAKKYIVLL---------TDGENTQDNEEGIA 388 A + + + K +V+L TD N ++ + Sbjct: 369 AGHALSPNMRGSSGWGDSELPLDFSKSKSNVKVMVMLANTTGDWFDTDAYNFNRDQALDS 428 Query: 389 ----------------ICNKAKSQGIRIMTIAFSVNKTQQEKARYF--------LSNCAS 424 +C + + I+ I + R L CA Sbjct: 429 TGPGSAKAFATQRFHDVCRSFRDKNIKFFFIGV-RPGDPADFGRTLFTDIAGPGLRECAG 487 Query: 425 PN---SFFEANS---THELNKIFRDRIGNEIFER 452 F +A+S + I EI + Sbjct: 488 GGGGLYFADASSFVEGKSQIDSLLEEIAEEIRQN 521 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 70/276 (25%), Gaps = 13/276 (4%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 D S + N +P + + D + Sbjct: 218 DDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLN 277 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y ++ V +R ++ + + R Sbjct: 278 GYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSII 334 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F +R+ V K + + G T IN A+Q A + N+ H Sbjct: 335 GFPNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHS 391 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARY 417 + IV LTDG+ T + I N + + I TI + + + Sbjct: 392 GIGDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKL 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E F D I + + Sbjct: 451 SLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 486 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 72/231 (31%), Gaps = 12/231 (5%) Query: 15 IKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G IITAL ++ +L G+ +D+ R + ++ L+ A + ++ L Q+ Sbjct: 11 LRDECGSVAIITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQNGP 70 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA----- 128 LI + N+T+ +V++ + Sbjct: 71 ANVKSEAKEAAENNSIDNNGLILGDNDIKVGNWTEPNFFSKTPYNSVKIMVNNHSINSFF 130 Query: 129 -----YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-SRSYHKEHGVSIQWV 182 +Q +S+ ++ PLS + I + +E Y Sbjct: 131 ASALNFQQTVSAEATAVIGPLSGKRHLIPIAVTEDEADTMSEAGEGIIYQFGRSAGNWGT 190 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 +DF + Q + + T++ N + G+ Sbjct: 191 VDFDAYIDGDQGGGNPEIGEWLENGYNGTIRIGDDINTESGVGKIVGGNNG 241 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 61/197 (30%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + + I N +G T F + + + L +I Sbjct: 108 NRLEAAKQVASEFI-------NGRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKG 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TAI + A + S K I+LLTDG N + + Sbjct: 161 DIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS-----------KVIILLTDGSNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIRI TI N T E L+ A + Sbjct: 210 PLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++++ Sbjct: 270 YFRATSNSKLKEVYQEI 286 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 61/197 (30%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + + I N +G T F + + + L +I Sbjct: 108 NRLEAAKQVASEFI-------NGRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKG 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TAI + A + S K I+LLTDG N + + Sbjct: 161 DIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS-----------KVIILLTDGSNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIRI TI N T E L+ A + Sbjct: 210 PLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++++ Sbjct: 270 YFRATSNSKLKEVYQEI 286 >gi|68448495|ref|NP_001020335.1| inter-alpha (globulin) inhibitor H4 [Danio rerio] gi|67677852|gb|AAH96879.1| Zgc:112265 [Danio rerio] Length = 915 Score = 76.9 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 51/354 (14%), Positives = 98/354 (27%), Gaps = 36/354 (10%) Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK---EHGV 177 + K++ V +S+ L L R +G LI + E+ Sbjct: 120 TLEDFKTSVTVAANSKVTFELTYEELLKRRLGKYELLINAQPMQPVADFKIDVHIQENPG 179 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV--KSYSSQNGKVG------------ 223 + + D + + + N Sbjct: 180 ISFLEVKGDLNTGDLASAVKTTRADKDAWVTFYPTRDQQTKCTNCAENGLNGDLIITYDV 239 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 R +S ++Y + +D S H ++ R AL ++++ Sbjct: 240 NRGNPKGEVQISNGYFVHYFAPSDVPRIPKNVVFIIDRSGSMHG-RRIRQTRSALLTILK 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + D+ G F+ + K R ++F + G+T INDA+ Sbjct: 299 DLDEDDH------FGLITFDAEIDFWKRELLQATKANRENAESFVKRIQDRGATNINDAV 352 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRI 400 D I R ++LLTDG+ T I K + Sbjct: 353 LAGVDMIN-----RNPRKGTAS----ILILLTDGDPTAGETNIEKIMANVKEAIGSKFPL 403 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + F + + L N A +E + + F D + + + Sbjct: 404 YCLGFGYDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVAVPLLTDIQ 457 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 76.6 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 59/217 (27%), Gaps = 10/217 (4%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y ++ V +R ++ + + Sbjct: 277 DGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDHFNI 333 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F++R+ V K + + G T IN A+Q + N+ H Sbjct: 334 VGFSNRIKVWKDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLL---NDYVAH 390 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKAR 416 + +V LTDG+ T I N + + I T+ + + + Sbjct: 391 NDIEDRSVS-LVVFLTDGKPTVGETHTFKILNNTREATRGRVCIFTVGIGADVDFKLLEK 449 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L NC E + F D I + + Sbjct: 450 LSLENCGLTRRVHEDHDARAQLIGFYDEIRTPLLSDI 486 >gi|327265811|ref|XP_003217701.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Anolis carolinensis] Length = 914 Score = 76.6 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 64/189 (33%), Gaps = 13/189 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK +AL ++ + D+ F+ + S + + K + Sbjct: 283 KKIQQTIEALQKILEDLNPEDH------FNLVVFSGEISEWQSSLLKATEENVELAKQYV 336 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T IN A+ TA +++ + E+ ++ IVLLTDG+ T + Sbjct: 337 RTIMAQGGTDINGALLTAINSLDRATSAELLPEQSIS----MIVLLTDGQPTVGETNVNS 392 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I K + + F + + + L N +E + + F + Sbjct: 393 IQTNIKKANDGNYFLYCLGFGFDVSYTFLEKLALENRGIARRIYEDSDAALQLQDFYQEV 452 Query: 446 GNEIFERVI 454 I + ++ Sbjct: 453 ATPILKEIV 461 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 76.6 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 36/299 (12%), Positives = 84/299 (28%), Gaps = 24/299 (8%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 ++ + T S +++ I+ + + + +N F Sbjct: 154 NGIMVAGEIPVSTFSIDVDTGSYSTLRRSINH-GVLPERGTVRVEELINYFAYQYPAPDA 212 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + + +P+ + L + +D S K L Sbjct: 213 GEQPFSVNTELAPSPYNPHKMLLRIGLKGFEKEKADLGASQLVFLLDVSGSMSSQDKLPL 272 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++AL + + + + D ++ V + GV + Sbjct: 273 LKNALKMLSQQLDEGDRISIVV---------YAGASGVVLDGVKGNDTLAISQALDKLKA 323 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAIC 390 GST ++ AY + + ++L TDG+ D + + Sbjct: 324 GGSTNGGAGIELAYQL--AQKHFIAGGVNR-------VILATDGDFNVGVSDQQALEDMI 374 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + QGI + T+ F + ++ +E K+ D I + Sbjct: 375 EEKRKQGIALTTLGFGQGNYNDHLMEQLADK--GNGHYAYIDTLNEARKVLVDEISATL 431 >gi|296156498|ref|ZP_06839336.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295893097|gb|EFG72877.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 446 Score = 76.6 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 47/475 (9%), Positives = 125/475 (26%), Gaps = 67/475 (14%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G II A+ + +LG+ G+ VD + L A A+I A + + + Sbjct: 8 RRRQRGSVSIIVAVSLIALLGILGLAVDSGLGYVIKARLDAATDGAVIAAGEAVTRGSNQ 67 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 Y +++ ++ Sbjct: 68 TEQTNNAQQAATAFFTANYPAGFLGSSVAAGTPSIVFDAG-----------------TVT 110 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + F + +G K + + ++A + + + +VID + S+ Sbjct: 111 IGMTAQASVPVSFSKVLGFKVLNVSSTSQAIRKT--------LDMAFVIDTTGSLNTSGV 162 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + P+ + + + + Sbjct: 163 PAAVRSNAVAFLNNFDVTNDRVALMHFAYGTVV-DVPFSGNARGFDRTTMTADINKYTFN 221 Query: 255 EEHFVDSSSLRHVIKKKHLVRD-ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + + ++ + VI + ++FF + Sbjct: 222 GSTNSAEAIWNARNQLNTVISQPSSLRVIVFFSDGAPNS-----FSSFFTTNQSGCNKSA 276 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--DEVHRMKNNLEAKKYI 371 + T + + N + T + Q+ ++++ H + + Sbjct: 277 GTIASPDSTGTMSGLYNMNALNQTLGSPCFQSNVTKLVTAMPKWYNAHDVNEKTFPIWPV 336 Query: 372 V---------LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-----KTQQEKARY 417 + N A+ + A+ +G + T+ + E + Sbjct: 337 TAPRAVPNGNVTYVNVNRASRNLLEAMASAARDEGTYVFTLGYGPELVQPAGPDNELGQD 396 Query: 418 FLSNCASP--------------NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L A+ + A + +L F +++ +++RI+K Sbjct: 397 VLKCMANTADSLSRCYNPKQPVGVYCYAATPADLKPCF-----SQLASQILRISK 446 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 24/188 (12%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + + + V+ + I R+G F + L+ Sbjct: 108 TDANGAKSDRLSAVKSVVRDFIAR-------RKDDRIGLIVFGTGAYPQAPLTLDHASLL 160 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + +TA+ DA+ + + E +K ++LLTDG +T Sbjct: 161 LLLDEVGI--GMAGPNTALGDAIGLTIKALEKT-----------PEQEKVLILLTDGNDT 207 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHE 436 + A++ GI + TI + L A + FF A+ Sbjct: 208 SSAITPDHAAHLAQANGIVVHTIGIGDPQATGDAKVDLTTLQAIARTTGGQFFRADDRQA 267 Query: 437 LNKIFRDR 444 L +++ Sbjct: 268 LQQVYATL 275 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 38/335 (11%), Positives = 87/335 (25%), Gaps = 6/335 (1%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++++L + G I ALL + + L+DV R S ++ A A + A+ Sbjct: 15 VAAFARRLRRDDRGAIAIQFALLALPLSILLFGLLDVGRLSLQRRQMQDALDAATLMAAR 74 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK-KNFTDREVRDIVRDTAVEMNPR 125 S ++ + +F + L + N + + Sbjct: 75 STATSSADLDTTGDAAFLAEIAGMNLGLTASSSTFSAGTNNRVIGTATATLRPIIANLWQ 134 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV--SIQWVI 183 + V SS L + L S T+ V+ + + V + Sbjct: 135 SGNFTVTASSEVVRASKNLEIALVLDITGSMGNGTRIADLKVAAADLVDVLVRDTQTPFY 194 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-KLSPYMVSCNKSLYY 242 + + G + + R ++ + + + Sbjct: 195 SKMALVPYSAGVNVGATYADAVRGPVPVKTITGAAWASGSARSITGITRANPAVVTASGH 254 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L G + + + + + +L S S T++ T Sbjct: 255 GLSTGDYVYITGVRGMTSVNDKIYRVTRSDPDKVSLNSTNTSSASNYTNGGTIQKCLT-- 312 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + + + + I + T Sbjct: 313 STCQVVITTSTSHGFTTGDEIRFAGMSGLTSLNGT 347 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 47/424 (11%), Positives = 103/424 (24%), Gaps = 47/424 (11%) Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ + + ++ T + TA Y + Sbjct: 207 NVGATYADAVRGPVPVKTITGAAWASGSARSITGITRANPAVVTASGHGLSTGDYVYITG 266 Query: 135 SRYDLLLNPLSLFLRSMGIK--SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 R +N + S + A + + ++ + + S Sbjct: 267 VRGMTSVNDKIYRVTRSDPDKVSLNSTNTSSASNYTNGGTIQKCLTSTCQVVITTSTSHG 326 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + D L+ S S P Sbjct: 327 FTTGDEIRFAGMSGLTSLNGTTRTITVLTNTTFDSSLTGPGSSTYSSGGTATCEESTVPG 386 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + F + V + + + S ++ V + Sbjct: 387 CEKIRFTNVDGYERVNSQSTCATERIGS--QAYTDAAPSTAYVGSHYPTAGSSSSTVCPT 444 Query: 313 SWGVH-KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM------KNNL 365 + +T +K G+TA + + + + Sbjct: 445 ATITPLSTDKTALKAQINGLTVGGATAGQIGLAWGWYMVAPNFGYLWPNASQRPAAYKAR 504 Query: 366 EAKKYIVLLTDG-------------------------------ENTQDNEEGIAICNKAK 394 + K ++L+TDG N ++ +C+ K Sbjct: 505 DLMKVVILMTDGGFNMTYCNSVVARNIGSGTNIGDDERINCDATNGSSFDQAAELCDSIK 564 Query: 395 S--QGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFE 451 + I + T+ F+V Q + FL+NCAS + + + EL F+ Sbjct: 565 ASANDITLYTVGFTVGNDQTARN--FLTNCASSTDKAYFPATGSELKASFQAIAQEISNL 622 Query: 452 RVIR 455 R+ R Sbjct: 623 RIAR 626 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 38/200 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V + ++ +G F + + +++ + T + Sbjct: 110 EAARDVASEMIAARPNDN---IGLVVFAGESFTLCPLTVDHDVILQMLDATEIGQLED-- 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA +T+ + K I+LLTDG N + A+ Sbjct: 165 GTAIGLGLATAINTLR-----------GSDNKSKVIILLTDGSNNAGDITPSMAAELAQQ 213 Query: 396 QGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANSTHEL 437 GIRI T+A N + + L + A + ++ A +L Sbjct: 214 YGIRIYTVAAGTNGVAKFPVQTASGIEYVEADVQIDEGTLRHIAQQTGGKYYRATDETKL 273 Query: 438 NKIFRDRIGNEIFERVIRIT 457 ++I+++ R+T Sbjct: 274 HEIYKEID----SLEKSRLT 289 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 70/274 (25%), Gaps = 13/274 (4%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 S + N +P + + D + Y Sbjct: 6 SGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGY 65 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ V +R ++ + + R F Sbjct: 66 FVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSIIGF 122 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ++R+ V K + + G T IN A+Q A + N+ H Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGI 179 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFL 419 + IV LTDG+ T + I N + + I TI + + + L Sbjct: 180 GDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 238 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 NC E F D I + + Sbjct: 239 ENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 272 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 70/274 (25%), Gaps = 13/274 (4%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 S + N +P + + D + Y Sbjct: 6 SGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGY 65 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ V +R ++ + + R F Sbjct: 66 FVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSIIGF 122 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ++R+ V K + + G T IN A+Q A + N+ H Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGI 179 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFL 419 + IV LTDG+ T + I N + + I TI + + + L Sbjct: 180 GDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 238 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 NC E F D I + + Sbjct: 239 ENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 272 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 70/274 (25%), Gaps = 13/274 (4%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPA---DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 S + N +P + + D + Y Sbjct: 6 SGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGY 65 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ V +R ++ + + R F Sbjct: 66 FVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRP---QDRFSIIGF 122 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ++R+ V K + + G T IN A+Q A + N+ H Sbjct: 123 SNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGI 179 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFL 419 + IV LTDG+ T + I N + + I TI + + + L Sbjct: 180 GDRSVS-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 238 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 NC E F D I + + Sbjct: 239 ENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDI 272 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 52/165 (31%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + + + TAI + + A T I SN Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLI--GTQTAIGEGIGLATKTFIDSN 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 ++ ++LL+DG NT + + N AK I T+ Sbjct: 186 A-----------PQRVMILLSDGSNTAGVLDPLEAANIAKQYHTTIYTLGVGAGEMVVKD 234 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T Q+ L A + +F A + +L I+ Sbjct: 235 FLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTI 279 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 76.6 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 55/196 (28%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A I R+G + + + +R I + Sbjct: 115 NRLEATKKVAADFI-------KGRPNDRIGVVVYAGESYTKTPITTDEMISLRAINEIAF 167 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + TA + + S K I+L+TDG N + Sbjct: 168 DGVLENG-TAIGMGLATAVNRLKDS-----------EALSKVIILMTDGVNNSGFIDPKI 215 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A I++ TI N + A + + Sbjct: 216 ASELALEYDIKVYTIGIGTNGNAPSPVAQIGRNKFRMAMMPVEIDEELMKQIAVDTGGKY 275 Query: 429 FEANSTHELNKIFRDR 444 F A + +L +I+ + Sbjct: 276 FRATNNKKLEEIYGEI 291 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 41/173 (23%), Positives = 62/173 (35%), Gaps = 13/173 (7%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V+ K R+G F D F+ + + +T Sbjct: 135 NAAKEVLSDFVKTRKG---DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVA--MAGQ 189 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 ST + DAM A S R+ +K ++LTDG +T E I AK+ Sbjct: 190 STHLGDAMGLAIKVFEQSKS----RIGVEENKEKVAIVLTDGNDTGSFVEPIEAAKVAKA 245 Query: 396 QGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 +G+RI IA + + + A S FEA + EL K + D Sbjct: 246 KGVRIHVIAMGDPQTLGEAALDMKTIRRIAKESGGKAFEAMNRDELAKAYDDI 298 >gi|116251678|ref|YP_767516.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256326|emb|CAK07407.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 329 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 60/191 (31%), Gaps = 20/191 (10%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D+ K DA+ V+ R+G F D F+ Sbjct: 103 SQSMDTKDFPGADGKPLARVDAVKQVVADFVGRRPG---DRIGLVAFGDAPYPLAPFTMD 159 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + A+ T++ DA+ A + +K +++LT Sbjct: 160 HALVET--MIADAVPGMAGPRTSLGDALGLAVKMFEKT-----------TVPEKVLIVLT 206 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEA 431 DG +T + AKS+G+ + + + + L A + +F Sbjct: 207 DGNDTASRMPPLKAAEIAKSKGVVVHAVGIGDPLATGEDKLDTATLQKIAEKTGGRYFFG 266 Query: 432 NSTHELNKIFR 442 +L I++ Sbjct: 267 GDQAQLASIYQ 277 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 61/187 (32%), Gaps = 24/187 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + I + V++ + I+ R+G F + + L Sbjct: 111 DANGQRIDRLSAVKEVVQGFIQR-------RKDDRLGLIVFGSGAFAQAPLTLDHASLSL 163 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + +TAI DA+ A + + E +K ++LLTDG +T Sbjct: 164 LLEDSGI--GMAGPNTAIGDAIGLALKLLEQA-----------HEPEKVLILLTDGNDTS 210 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHEL 437 A ++G+ I TI + + L A + +F A L Sbjct: 211 SAITPQHAAAMAAARGVVIHTIGIGDPSAEGEAKVDLSALEQIARTTGGRYFRAEDRSAL 270 Query: 438 NKIFRDR 444 ++++ Sbjct: 271 DQVYATL 277 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 57/163 (34%), Gaps = 17/163 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + +DN+N R+ F+D + L + + E G+T + +A+ Sbjct: 140 LNLMDNMNTQNRVSIYKFDDTSKRIIPMTEVSESLKKNAEEELKQYEIPAGNTNMGEAID 199 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAKSQGIRIMTI 403 +AY+ I S+ ++LL+DGE+ N++ K I I TI Sbjct: 200 SAYNEINSTKRPGRKAA---------VILLSDGEDNFGLNKKFDETLKPFKDSNISIYTI 250 Query: 404 AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 S L A + ++ + +L F Sbjct: 251 GMSNENNF-----TTLKKIAKDTHGEYYNVKNASDLKGTFSKI 288 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 44/288 (15%), Positives = 90/288 (31%), Gaps = 50/288 (17%) Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 ++ + + V+D S SM D Sbjct: 218 GKNVVLQEATDVVLVLDRSGSMGQGVVDKNNPNAQKCTVLTCTNSNR------------- 264 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + C YY+L + + D + + + V+DA + + ++++ Sbjct: 265 -WHRHNADCYDEEYYILK---CTQNHTHTLPGDFIANSCYVSRADKVKDASYTFLDTLQE 320 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + V + + + +++ +++ G T ++ A Sbjct: 321 ----KEDVNISVVTYAGTASKV------TNSNLKSGIESAYNVLGTDG-TNTGRGIEIAS 369 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + +S K IV+L+DGE+ N N AK++G + TI Sbjct: 370 QILSNSTA-----------PNKMIVVLSDGESNAGNSRT--AANSAKNKGCIVYTIG--A 414 Query: 408 NKTQQEKARYFLSNCASPNS------FFEANS-THELNKIFRDRIGNE 448 L +CAS + F+ A+ + LN+IF + G Sbjct: 415 GIASGSNGAKELFDCASVDQSTNKAKFYLADDTGNALNEIFAEIAGEI 462 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 66/243 (27%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D K + Y ++ V Sbjct: 217 QNGILGDFIIQYDVKREQNIGDIQVLDGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAK 276 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ +++ + + R F++R+ V K + + Sbjct: 277 LQQTREALVTILNDLRP---QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSP 333 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN A+QTA + + ++ I+ LTDG+ T + I + Sbjct: 334 TGGTDINGALQTAIKLLNNYVAQNDIEDRSVS----LIIFLTDGKPTFGETNTLRILSNT 389 Query: 394 KSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K I I T+ + + + L NC E F D I + Sbjct: 390 KEATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEIRTPLL 449 Query: 451 ERV 453 + Sbjct: 450 SDI 452 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 76.6 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 66/243 (27%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D K + Y ++ V Sbjct: 251 QNGILGDFIIQYDVKREQNIGDIQVLDGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAK 310 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ +++ + + R F++R+ V K + + Sbjct: 311 LQQTREALVTILNDLRP---QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSP 367 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN A+QTA + + ++ I+ LTDG+ T + I + Sbjct: 368 TGGTDINGALQTAIKLLNNYVAQNDIEDRSVS----LIIFLTDGKPTFGETNTLRILSNT 423 Query: 394 KSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K I I T+ + + + L NC E F D I + Sbjct: 424 KEATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEIRTPLL 483 Query: 451 ERV 453 + Sbjct: 484 SDI 486 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 76.2 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 57/194 (29%), Gaps = 35/194 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + V+ ++ + R+G F D + + + + Sbjct: 107 DYVDRLTAVKKVVSDFAS-------KREGDRLGLVLFADHAYLQTPLTLDRKTIAEQVNQ 159 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 TAI + + A T + S ++ +VLL+DG NT + Sbjct: 160 LVLRLI--GEKTAIGEGIGLATKTFVDS-----------DAPQRVMVLLSDGSNTSGVLD 206 Query: 386 GIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFE 430 + AK I TI T Q+ L + A + +F Sbjct: 207 PLEAAKIAKKYNATIYTIGVGAGEMVVKEFFMTRKVNTAQDLDERTLMDIAQVTGGQYFR 266 Query: 431 ANSTHELNKIFRDR 444 A EL I+ Sbjct: 267 ARDAKELATIYDTI 280 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 76.2 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 66/191 (34%), Gaps = 37/191 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V ++ +G T F + + L+R +++ D G Sbjct: 111 EAAKDVATEFISGRPNDN---IGLTIFAGEAFTQCPMTTDHASLLR-LLQATRTDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + S K ++LLTDG N + Sbjct: 167 LIDDGTAVGMGLANAVSRLKDS-----------KSKSKVVILLTDGSNNMGEISPMTAAE 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANS 433 AKS GIR+ TI NK E LS+ A + +F+ A + Sbjct: 216 IAKSYGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPVEIDTKTLSDIAQTTDGNFYRATN 275 Query: 434 THELNKIFRDR 444 +EL KI+RD Sbjct: 276 NNELKKIYRDI 286 >gi|224824215|ref|ZP_03697323.1| conserved hypothetical protein [Lutiella nitroferrum 2002] gi|224603634|gb|EEG09809.1| conserved hypothetical protein [Lutiella nitroferrum 2002] Length = 421 Score = 76.2 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 36/340 (10%), Positives = 88/340 (25%), Gaps = 6/340 (1%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +F + + G II L + V++G G++ D+ R + L A+ + A+ Sbjct: 3 RFAQLPTR--RHQKGVAAIIVGLCIVVLIGFLGLVADLGRLFITKTELSDASDACSLAAA 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L E +S T ++ ++ N ++ T + + T + P Sbjct: 61 AELKGDAESLSRAESAGITVGQRNKVDFQANNVTIVPNQDVTFSDHLNGTYYTKNAVAPA 120 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 A + F++ MG + + ++A A + V + Sbjct: 121 NIANMKYAKCTLPRT-GIMPWFMQVMGAGAQSVTSQAVASLSPSQTNCAIPVGLCSQTPP 179 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG--KVGIRDEKLSPYMVSCNKSLYYM 243 + DS G + + ++ S + Sbjct: 180 ASCPDGSAPDSYGFCVGKWYGSRFTAGGGFTGSFNLIDYSPPSGGASELSGQLTGAGQCN 239 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 L + + +++ + + + Sbjct: 240 LNVTNPVGQTGMQQSI-ANAWNTRFGLYQGSYNVSNAPPDFTGYAYTAISWPSQYNAYDG 298 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + GS+A + + Sbjct: 299 TDGTRPNFQASRTAHTVYQGDAAAGLSLTGSGSSATSTQL 338 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 68/181 (37%), Gaps = 14/181 (7%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + ++ L+ ++S K R+G F D F+ + + +T Sbjct: 131 ISRLDAAKEVLSEFVQSRKG-------DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTD 183 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ST + DA+ A S + ++ + +K ++LTDG +T E I Sbjct: 184 VA--MAGQSTHLGDAIGLAIKVFEQS-DKSSGALEQDQNREKVAIVLTDGNDTGSFVEPI 240 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 AK++G+R+ IA + + A S FEA + EL+ + + Sbjct: 241 DAAKVAKAKGVRVHVIAMGDPETIGETALDMETIHRIAKESGGEAFEALNRDELSAAYDE 300 Query: 444 R 444 Sbjct: 301 I 301 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 40/322 (12%), Positives = 80/322 (24%), Gaps = 60/322 (18%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + + Q T +S G +++ Sbjct: 347 AADDYVRLTNMGGLAQLNDGGYRVASVSGSQVVLDTTRSLGLAGASKGGVTYSSGGKLMN 406 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + ++P S V + V Sbjct: 407 GRDGSEWRVFPTADGYVNVHASSTCVSERVGVERYTDA---------------RPSTAYV 451 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + S PS + +K+ + GSTA + A+ + Sbjct: 452 ---GRSYLSSSNSCPSAELSALSTSASSLKSKIDQMSAGGSTAGQIGIAWAWYALSPDFA 508 Query: 356 D-----EVHRMKNNLEAKKYIVLLTDGE------------------------------NT 380 + K +L+TDGE N Sbjct: 509 SLFSGEGQPGAYAPSDTLKVAILMTDGEFNTPFRDGVIALDAGTGSGGLDSHIDLNSSNG 568 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNK-----TQQEKARYFLSNCASPN--SFFEANS 433 + +A+C +++G+ + T+ F + + A + CA+ FF+A+ Sbjct: 569 DPFAQSVALCQAMQAKGVVVYTVGFDLGSATGREGVVDTALDVMRECATNEQTHFFQADD 628 Query: 434 THELNKIFRDRIGNEIFERVIR 455 +L + FR + R+ R Sbjct: 629 GTDLKEAFRAIGRDITRLRIAR 650 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/259 (9%), Positives = 78/259 (30%), Gaps = 13/259 (5%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + ++ G+ ++ A+ +PV+L + +D+ + S + L+ A A + A+ Sbjct: 13 LKSLASRFLRQTQGNVAMMFAMALPVLLMITLGAIDIHQASKVKAQLQDALDAAALAAAR 72 Query: 67 PLIQSLEE-----VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 +++ N ++ + + NN V+ +V + + Sbjct: 73 STFTDDVNINKVGLAALKANMPSYFGEASGDTASFVLLNNRVTGEATVNVKVLVANVVLP 132 Query: 122 MNPRKSAYQVVLSSRYDL--LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + +SSR ++ + + + S ++ T + + Sbjct: 133 PYGKLLDDFLPVSSRSEVLRASRNVEVAMALDITGSMDNCSRNCPPTSKLEDLQAAAKEL 192 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV------KSYSSQNGKVGIRDEKLSPYM 233 ++ + Y + + + + + +G + + Sbjct: 193 VDIVVQDQQTPFYSKVALIPYAAGVNVGSSAISARGPLDTTTHNISGATWLTGSVKTISS 252 Query: 234 VSCNKSLYYMLYPGPLDPS 252 +S L Sbjct: 253 ISGGSPLTVNASSHGFSNG 271 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 60/203 (29%), Gaps = 36/203 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + I + V+ L+ + R+G F D + + Sbjct: 113 DMNDNGEYIDRLTAVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLTADRRTV 165 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ I + TAI D + T + S ++ ++LL+DG N Sbjct: 166 MQQINQAVI--GLVGERTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSN 212 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T E + A+ I T+ T + L+ A + Sbjct: 213 TAGVLEPLEATEIAQKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMT 272 Query: 425 PNSFFEANSTHELNKIFRDRIGN 447 +F A EL + D I Sbjct: 273 GGKYFRARDAKEL-EAIYDTINQ 294 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 56/204 (27%), Gaps = 47/204 (23%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + R+G + + + + IVK Sbjct: 112 NRLEALKKVAIDFVDR-------RPNDRIGIVVYAGESFTQTPITS-----DKNIVKRTI 159 Query: 329 IDENEM---GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G TAI + + + + K + K I+LLTDG N N + Sbjct: 160 SELQWGQLDGGTAIGMGLGSGVNRL-----------KESTAKSKVIILLTDGVNNAGNID 208 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEK-------------------ARYFLSNC--AS 424 A+ I++ TI N L A+ Sbjct: 209 PRTATELARELEIKVYTIGIGTNGMADFPWSKDPRTGKLNFRKQQVEIDEKLLQEIATAT 268 Query: 425 PNSFFEANSTHELNKIFRDRIGNE 448 +F A L +I+ + E Sbjct: 269 DGKYFRATDNQSLKEIYDEIDALE 292 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 56/190 (29%), Gaps = 24/190 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + ++ Sbjct: 257 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQQALQA-V 309 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ + E I+LL+DG +T Sbjct: 310 NSLVANGGTNIFDGLRKGAKVMEDRRERNSVAS---------IILLSDGRDTYTTNHPDP 360 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 Q I + + F + + + + S +F S + IG Sbjct: 361 SYKAMLPQ-IPVHSFGFGSDHD-----ASVMHSVSEFSGGTFSFIESESVIQDALAQCIG 414 Query: 447 NEIFERVIRI 456 + V + Sbjct: 415 GLLSVAVQEL 424 >gi|85374101|ref|YP_458163.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] gi|84787184|gb|ABC63366.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] Length = 435 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 31/258 (12%), Positives = 75/258 (29%), Gaps = 2/258 (0%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +K+L +S TG+ +I AL +P ++G G +D+ +W + L+ A A + + Sbjct: 4 IRQTAKRLRQSNTGNAMMILALGLPALVGGAGYGLDMAQWYMLKRELQYAVDQAAVAGAY 63 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L + A+ + + ++ K + M+ Sbjct: 64 SLSYNGTAGDWSARAEQEYDANRSITTGYATANDSTKGVTDYGSFTQNSVTVSATMDVSL 123 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKS--WLIQTKAEAETVSRSYHKEHGVSIQWVID 184 ++LS+ + +N ++F ++ G ++ A T++ + + Sbjct: 124 PFSSILLSTPTTINVNSQAMFEKTDGATGCMIALKPNDTAITINGNVTINAPCGMAVSST 183 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S S+ P + + + Sbjct: 184 SSISINKSGGSGSINPGWVYTGGGVNDKFQELIDKFNATAPAADQVEVNEDASGFVDPFA 243 Query: 245 YPGPLDPSLSEEHFVDSS 262 + S + Sbjct: 244 SVTVPTVTGSGSKNCSTG 261 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 52/179 (29%), Gaps = 20/179 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A+ +V+ R+G F + L + +T Sbjct: 115 RPASRWQAVQAVVGDFIDKRP---DDRLGLIVFGAGAYPQAPLTRDHAALRLLLQRTAV- 170 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +TA+ DA+ + + E K ++LLTDG +T Sbjct: 171 -GMAGPNTALGDAIGLGIRMLDHAGER-----------DKILILLTDGNDTASAVPPARA 218 Query: 390 CNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 A + + TI + L + A + FF A L +++ Sbjct: 219 AELAAQHRVVVHTIGIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATL 277 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 57/185 (30%), Gaps = 24/185 (12%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S ++ + V+ +A + R+G F D F+ + Sbjct: 112 SDPQGNLQARVDAVKTVVADFVDR-------RPYDRLGLVAFGDAPYPLVPFTMDHATVR 164 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + T A+ TA+ DA+ + S K +V+LTDG +T Sbjct: 165 S--MLTGALPGMAGPKTALGDALGLSIKLFQQSQA-----------PDKVLVVLTDGNDT 211 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNC--ASPNSFFEANSTHE 436 A IRI T+ +++ L A+ +F Sbjct: 212 ASKMPPDKAAEIASQNHIRIHTVGIGNPDAQGEEKLDTETLQKIATATGGRYFFGQDQQA 271 Query: 437 LNKIF 441 L +I+ Sbjct: 272 LAEIY 276 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/306 (10%), Positives = 78/306 (25%), Gaps = 62/306 (20%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + I + + + L + S + + Sbjct: 16 FLDLMLEGDLKKYTSINLIPFGGTVNTGPRLLSRYFPTRGYMDGQPASEYGSGSQITSKL 75 Query: 269 KKKHLVRDALASVIRSIKKID---NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI-- 323 + + + D N + + +F+ ++ + Sbjct: 76 YRFLGAKHCIEYKPEDFDLNDIPLNSRAQLPHLYYWRKTNPWCPENFASRMYLNRNNLGG 135 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV----------HRMKNNLEAKKYIVL 373 +K T ++ + + ++ + +K IVL Sbjct: 136 LKAAVDRLTLSDGTGMDIGLLWEAKALSPKLRTAAALDGGLLPGHPTDWSDKQTQKVIVL 195 Query: 374 LTDG---------------------------------------ENTQDNEE------GIA 388 +TDG N+ N + Sbjct: 196 MTDGGITAQYRPKDPWKGLNPKDMRRGIVNARRNVQYVTTRGNMNSPANSKHNSVAYMKT 255 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 +C++AK++GI I T+ F + LS CA+ S + + +L+ F+ + Sbjct: 256 MCDQAKAKGIIIYTVGFQ--IRRNTLPDLSLSYCATSPSHYYFVESSDLSAAFKAIASSI 313 Query: 449 IFERVI 454 R++ Sbjct: 314 KSLRIV 319 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 62/207 (29%), Gaps = 35/207 (16%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 I + V+ L+ I R+G F D + H + + Sbjct: 54 NGDYIDRLSAVKQVLSDFIS-------KRQGDRLGLVLFADHAYLQTPLTLDRHTVAEQL 106 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 +T TAI + + A T + S ++ ++LL+DG NT Sbjct: 107 NQTVLRLI--GTKTAIGEGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAGV 153 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSF 428 + I AK I T+ T Q+ L A + + Sbjct: 154 LDPIEAAKIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDEKSLMEIAKLTGGQY 213 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIR 455 F A + EL I+ E + + Sbjct: 214 FRARDSKELATIYDTINNLEPVSQATQ 240 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 46/425 (10%), Positives = 112/425 (26%), Gaps = 49/425 (11%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + ++++ + T+ ++ + + + T Sbjct: 231 GSLDSNTQSITAATWLTGSVKTITSISRAYTAVVTASKHGFKTGDIVTIWSAETMAPLNG 290 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V L+ + + +G S + V++ + + I S Sbjct: 291 VALTV-----GSVTTNTFSLVGEDSRYYSAFSGQAYVAKCARTDCNIVITLARHGLSSEG 345 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 D L+ R + + + Sbjct: 346 DAAVLGNMGGLSQLNNIGFRVASVTPTTATLALDASQANLATTAKGGAAYTSGGQLICGV 405 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S FV++ V + S + + R + + P Sbjct: 406 DGCSNRDFVNAIGAWTRFPGTPCVSERAGSQAYTDAAPSASSWVGRS----YASGGNACP 461 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS----NEDEVHRMKNNLE 366 + ++ + +GSTA + + + + + + ++ + Sbjct: 462 ASQIVPLTNVKKTLTDAVDGMTAVGSTAGHIGLAWGWYLVSPNFGLWSGLGAPAAYDSSK 521 Query: 367 AKKYIVLLTDGENTQDNEE------------------------------GIAICNKAKSQ 396 K +VL+TDGE +C K+ Sbjct: 522 TLKAVVLMTDGEFNTPYFRGVIASDAGNGSGGADTHINQPATNGSSFEQAYRLCENMKAA 581 Query: 397 GIRIMTIAFSVNKTQQEKAR-----YFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIF 450 + + T+ F + + ++ CA+ P+ F+A+S+ +L+ FRD + Sbjct: 582 DVIVYTVGFDIGAARNMTGPIDSAGELMARCATNPDRAFQASSSTDLSDAFRDIGRDITR 641 Query: 451 ERVIR 455 R+ R Sbjct: 642 LRISR 646 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 43/320 (13%), Positives = 90/320 (28%), Gaps = 32/320 (10%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +L G+ +I L +PV++ + VD+ R + + Sbjct: 12 RLRKLVSRLRDDRRGNVAMIFGLSLPVIVMLALGGVDLHRITTARSQFQ----------- 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L + + ++ + K L N + + D +V + D +V Sbjct: 61 DALDAATLAAARSSETTPAGLKSVALATLHGNIQGTEVEPINDADVEVAMNDKSV----- 115 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + L + + S V + V+D Sbjct: 116 -------VIATAQGRVKTLVANIVLPPYGQL-LDDTLPISARSEVNRSSRDVEVALVLDI 167 Query: 186 SRSMLDYQRDS-EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + SM D G+ ++ A + N L+PY + N Y Sbjct: 168 TGSMNDCADSCSSGRKIDNLKSAAKELIDIVVQTNQSPFYSKVALAPYSMGVNVGDTY-- 225 Query: 245 YPGPLDPSLSEEHFVDSSSL-RHVIKKKHLVRDALASVIRS----IKKIDNVNDTVRMGA 299 S + +++ +K + A +V+ + K D V Sbjct: 226 ASRARGSLDSNTQSITAATWLTGSVKTITSISRAYTAVVTASKHGFKTGDIVTIWSAETM 285 Query: 300 TFFNDRVISDPSFSWGVHKL 319 N ++ S + L Sbjct: 286 APLNGVALTVGSVTTNTFSL 305 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 54/174 (31%), Gaps = 20/174 (11%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 A+ V+ S R+G F D F+ + I + Sbjct: 122 VQAVREVVSSFVARRPG---DRIGLIVFGDAPYPLAPFTLDHALVQTMIR--DLLPGMAG 176 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 STA+ DA+ S +K +++LTDG +T + AK Sbjct: 177 PSTALGDAVGLGIKMFDQS-----------PAPEKVLIVLTDGNDTASKMPPERAADIAK 225 Query: 395 SQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFRDR 444 + + + TI + E+ L A + +F + L I+ Sbjct: 226 QRHVTVHTIGIGDPSAEGEQRVDLGVLQRMAAQTGGRYFFGADQNSLESIYATL 279 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 41/194 (21%), Positives = 63/194 (32%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL-NLLKDMK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + A + S K I+LLTDG N + + + Sbjct: 160 CGLIEDG-TAIGMGIANAVTRLKDSKAKS-----------KVIILLTDGVNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIRI TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 76.2 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 57/197 (28%), Gaps = 41/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ + R+G ++ + + L+ + Sbjct: 112 DRLTALKNIAKKFVD-------KRPGDRIGLVTYSGEAFTKVPVTSDHAVLLEELEN--L 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + + A + S K I+L+TDG NT +N Sbjct: 163 NPLELQPGTAIGEGLSVAVSHLRHSKAKS-----------KIIILMTDGVNTIENAMPAQ 211 Query: 389 I-CNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNS 427 + AKS IR+ +I N L A + Sbjct: 212 VGAQLAKSNDIRVYSIGIGTNGYALMPTQTDIFGDLVFTEVEVKIDEPVLREIAQTTGGK 271 Query: 428 FFEANSTHELNKIFRDR 444 +F A S L +++ + Sbjct: 272 YFRATSNQSLEEVYEEI 288 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 63/197 (31%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + A I N +G T F + + + L ++I Sbjct: 108 NRLEAAKQVAAEFI-------NGRPNDNIGLTIFAGEAFTQCPLTVDHGVLLNLFQSIKC 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A M TA+ + A + S K I+LLTDG N + + Sbjct: 161 DIAQKGLIMDGTALGMGLANAVSRLKDSKAKS-----------KVIILLTDGVNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIR+ TI N T E L+ A + Sbjct: 210 PLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQVPVEIDEQTLTQIAGTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++++ Sbjct: 270 YFRATSNSKLKEVYQEI 286 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 55/190 (28%), Gaps = 20/190 (10%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D+ R DA+ V+ R+G F D F+ Sbjct: 103 SQSMDTKDFRAPDGNLEARVDAVRKVVGDFVARRPG---DRIGLIAFGDAPYPLAPFTMD 159 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I T T++ DA+ A + +K +++LT Sbjct: 160 HELVREIISGTL--PGIAGPRTSLGDAVGLAIKMFEKT-----------TVPEKVLIVLT 206 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEA 431 DG +T + AK G+ + T+ + L A + +F Sbjct: 207 DGNDTASKMPPLKAAEIAKRNGVVVHTVGIGDPQATGEDRLDATALEKIAETTGGRYFFG 266 Query: 432 NSTHELNKIF 441 +L + Sbjct: 267 GDQAQLAAAY 276 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 58/191 (30%), Gaps = 33/191 (17%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ LA I R+G F + ++ + I+ Sbjct: 111 RINRLQAVKQVLAEFIDQ-------RQGDRLGLILFGSQAYVQAPLTFDRTTVN--ILLQ 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 A +TAI DA+ A + ++ ++LTDG NT Sbjct: 162 EAGLGMAGNATAIGDAVGLAVKRLR-----------ERPLEQRVAIVLTDGANTAGEITP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFLSNCA--SPNSFFEANS 433 A++ +R+ TI + L+ A + +F A + Sbjct: 211 DKASELAQASAVRLYTIGIGAGADSAITGLLQRNPSRDLDEALLTRMAQQTGGQYFRARN 270 Query: 434 THELNKIFRDR 444 EL I+ Sbjct: 271 LAELGGIYTSI 281 >gi|110632968|ref|YP_673176.1| hypothetical protein Meso_0611 [Mesorhizobium sp. BNC1] gi|110283952|gb|ABG62011.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 427 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 56/227 (24%), Gaps = 3/227 (1%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + + + L K G +I A++ PV++G G+ V+ W + L+ AA A+ A Sbjct: 9 SNLFRFFRSLAKDQGGSVAVIAAIVFPVVVGAMGLGVESGYWYLKQRKLQHAADVAVYAA 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM-- 122 SV + + + + + Sbjct: 69 SVRYRAGDARALMETAALRSARVTGYQPSIGTITTGVQAGSTAGSGTVSVELTETHSRLF 128 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + + VVLS+R L S + T + VS Sbjct: 129 SSVFTTDPVVLSARAVAELKGGSRACVLALSPTAPGAVTVTGSTD-VQLNGCSVVSNSNA 187 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 D S G + V Sbjct: 188 SDAFLMRNGSALMSTDCVYTVGGAITTTGLNLTGCSKPVVSSPPVPD 234 >gi|84385695|ref|ZP_00988726.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] gi|84379675|gb|EAP96527.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] Length = 436 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 40/341 (11%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + +++A + A + + +E + K ++ Y+ N + Sbjct: 9 YTQRLLAHSKIEEATEIASLALIANPGGNNQE-------DKDYAKNLVDLYVTDNINDVE 61 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 T R + + V + + + + + K Sbjct: 62 IDVTTTRCEYADGCVQRNHELSPFTDFTVSARTEHKSWITHDEI--------GVEPEFKV 113 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 +V+R + + V I +++D S+SM + + + K Sbjct: 114 SGRSVTRKFLPQ-PVDIYFILDTSQSMSNRWSGEKNNKTQMDVVKETIVKVVEDLKTFKT 172 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 G + + Y Y D ++ D + R + + Sbjct: 173 GDEKKSQISLV-------TYNAYNAKFDKKSNQVKLYDYT--RDFKHTTETFEGIVDKMF 223 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + A +N + I+ + G T Sbjct: 224 DQDWPAETTS-----YAYMYNTSQD--IPLTDDYDNFIKLVKSDKL-KPATGGGTMS--- 272 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + +I++ ++ N ++ + L+DG + + N Sbjct: 273 ----WLGLIAAAKEANKVEDVNRNPEQVFIFLSDGADNKVN 309 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 59/192 (30%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L+ I + R+G F D + + + + + Sbjct: 22 VDRLTAVKQVLSDFIA-------KREGDRIGLILFADHAYLQTPLTLDRETVTQQLNQAV 74 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI + + A I S ++ I+LL+DG NT + + Sbjct: 75 LKLI--GTQTAIGEGIGLATKIFIDS-----------DAPQRVIILLSDGSNTAGVLDPL 121 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCAS--PNSFFEAN 432 N AK I T+ T Q+ L AS +F A Sbjct: 122 EAANIAKQYHSTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRAR 181 Query: 433 STHELNKIFRDR 444 + +L I+ Sbjct: 182 NQQDLQNIYDTI 193 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 50/169 (29%), Gaps = 28/169 (16%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G F D + + + TAI +A+ A Sbjct: 124 RRNGDRLGLILFADHAYLQAPLTQDRRSIATFLADAQI--GLVGKQTAIGEAIALAVKRF 181 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-- 408 +E + +VLLTDG N N E A + + I T+ Sbjct: 182 DQVSESN-----------RVLVLLTDGSNNAGNIEPDVAAEIAAKRNVTIYTVGVGAELM 230 Query: 409 -----------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + +F A ++ +L +I++ Sbjct: 231 ERRTIFGKERVNPSMDLDEAQLQRLATMTNGYYFRAKNSEDLAQIYQKI 279 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 38/200 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V + ++ +G F + + + + + + T + Sbjct: 110 EAARDVASEMIAARPNDN---IGLVVFAGESFTLCPLTVDHNVIQQMLETTEIGQLED-- 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA +T+ + K I+LLTDG N + A+ Sbjct: 165 GTAIGLGLATAINTLR-----------GSDNKSKVIILLTDGSNNAGDITPSMAAELAQQ 213 Query: 396 QGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANSTHEL 437 GIRI T+A N + + L + A + ++ A +L Sbjct: 214 YGIRIYTVAAGTNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAEQTGGKYYRATDETKL 273 Query: 438 NKIFRDRIGNEIFERVIRIT 457 ++I+++ R+T Sbjct: 274 HEIYKEID----SLEKSRLT 289 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 47/150 (31%), Gaps = 17/150 (11%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + F+ + L + TA+ D++ A SN Sbjct: 142 LILFGNAAFVQAPFTQDLDALEHLLDSLRV--GMAGPQTAMGDSIGLAVKMFRESN---- 195 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--AR 416 + +++++DG++T A G+ + TI K E Sbjct: 196 -------VTDRMLIVMSDGDDTGSKVPPKTSAELAAKNGVNVFTIGIGDPKNAGEHPIDT 248 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + F+ A + +L I++ Sbjct: 249 DTLKEIAAITGGKFYYAWNLDDLQDIYKQI 278 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 38/200 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A V + ++ +G F + + + + + + T + Sbjct: 110 EAARDVASEMIAARPNDN---IGLVVFAGESFTLCPLTVDHNVIQQMLETTEIGQLED-- 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI + TA +T+ + K I+LLTDG N + A+ Sbjct: 165 GTAIGLGLATAINTLR-----------GSDNKSKVIILLTDGSNNAGDITPSMAAELAQQ 213 Query: 396 QGIRIMTIAFSVNKTQQEKARYF----------------LSNCA--SPNSFFEANSTHEL 437 GIRI T+A N + + L + A + ++ A +L Sbjct: 214 YGIRIYTVAAGTNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAEQTGGKYYRATDETKL 273 Query: 438 NKIFRDRIGNEIFERVIRIT 457 ++I+++ R+T Sbjct: 274 HEIYKEID----SLEKSRLT 289 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 55/169 (32%), Gaps = 24/169 (14%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R G F + + + L + + TAI A+ + + +S Sbjct: 134 QDRFGLVLFRGKSFTLCPLTLDHRLLGMLVRQVSVDAI-SDKGTAIGSAILVGTNRLRAS 192 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF-------- 405 ++ ++LLTDGE+ + A+S+GIRI I Sbjct: 193 V-----------SKERVLLLLTDGEHNSGEVGPVTASEIAQSEGIRIYVIGVRNEEEAGS 241 Query: 406 --SVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIF 450 S++ ++ L A +F A+ + L F + E Sbjct: 242 PESMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLKDAFGEIDALERS 290 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 57/176 (32%), Gaps = 28/176 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G FF D ++ + + + ++ TAI DA+ A S Sbjct: 131 DRLGLIFFADTAYLQAPLTYDRETVSQLLGESLI--GLVGEQTAIGDAIGLAIKRFQSKK 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E K ++LLTDG+NT N A + G+ + TI + Sbjct: 189 ESN-----------KVLILLTDGQNTAGNISPQQANELAINNGVTLYTIGVGADQMMVQS 237 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 QE L+ A + +F A L I+ E R R Sbjct: 238 IFGSRQVNPSQELDESMLTQLAESTGGRYFRARDAESLKAIYDKLDELEPIARESR 293 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 75.8 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 36/243 (14%), Positives = 63/243 (25%), Gaps = 10/243 (4%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + D + Y ++ V Sbjct: 3 QNGILGDFIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTK 62 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +R ++ + + F++R+ V K + + Sbjct: 63 LRQTKDALFTILHDLRP---QDHFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSP 119 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T IN A+Q A + N+ H + IV LTDG+ T + I N Sbjct: 120 TGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILNNT 175 Query: 394 KSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + I TI + + + L NC E F D I + Sbjct: 176 REAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLL 235 Query: 451 ERV 453 + Sbjct: 236 SDI 238 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 75.8 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 57/191 (29%), Gaps = 20/191 (10%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D+ K +A+ V+ R+G F D F+ Sbjct: 103 SQSMDARDFPGADGKPLARVEAVRQVVADFVGKRPG---DRIGLVAFGDAPYPLAPFTMD 159 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I T T++ DA+ A +K +++LT Sbjct: 160 HELVRTMIADTV--PGMAGPRTSLGDALGLAIKMF-----------GKTTAPEKVLIVLT 206 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEA 431 DG +T + AKS+G+ T+ + + L A + +F Sbjct: 207 DGNDTASRMPPLKAAEIAKSKGVIFHTVGIGDPAATGEDKLDTATLQKIAASTGGRYFFG 266 Query: 432 NSTHELNKIFR 442 +L I+ Sbjct: 267 GDQSQLAAIYE 277 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 75.8 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 36/153 (23%), Positives = 53/153 (34%), Gaps = 14/153 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G F S L I T + D E G TA+ DA+ + + Sbjct: 131 YDRIGIVVFAGAAYLQSPLSSDRFALDELIAGTSSEDIEEQG-TAVGDALVLSSYRL--- 186 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 KN+ K I+LLTDG + + K+ GI++ I + Q E Sbjct: 187 --------KNSEAKSKVIILLTDGVSNTGKLDPDTAAYTTKTMGIKVYCIGIGKEEGQYE 238 Query: 414 KARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 L +S FF A S L + + Sbjct: 239 INYESLQKISSNTNGKFFRAESPEVLESVLNEI 271 >gi|33601708|ref|NP_889268.1| hypothetical protein BB2732 [Bordetella bronchiseptica RB50] gi|33576145|emb|CAE33224.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 336 Score = 75.8 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 52/179 (29%), Gaps = 20/179 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A+ +V+ R+G F + L + +T Sbjct: 115 RPASRWQAVQAVVGDFIDKRP---DDRLGLIVFGAGAYPQAPLTRDHAALRLLLQRTAV- 170 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +TA+ DA+ + + E K ++LLTDG +T Sbjct: 171 -GMAGPNTALGDAIGLGIRMLDHA-----------RERDKILILLTDGNDTASAVPPARA 218 Query: 390 CNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 A + + TI + L + A + FF A L +++ Sbjct: 219 AELAAQHRVVVHTIGIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATL 277 >gi|311106403|ref|YP_003979256.1| von Willebrand factor type A domain-containing protein 2 [Achromobacter xylosoxidans A8] gi|310761092|gb|ADP16541.1| von Willebrand factor type A domain protein 2 [Achromobacter xylosoxidans A8] Length = 340 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 58/187 (31%), Gaps = 24/187 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + V+ +A I R+G F + L Sbjct: 111 APDGRREDRLSGVKAVVADFIDR-------RQDDRLGLIVFGTAAYPQAPLTQDHATLK- 162 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ +TAI DA+ A + E + ++LLTDG +T Sbjct: 163 -LLLGQVSTRMAGPNTAIGDAIGVAIKQFEHAGE-----------HDQVLILLTDGNDTG 210 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + A ++ I + T+ + +++ L A + FF A L Sbjct: 211 SAVPPDRAASMAAARHIVVHTVGIGDPQAEGEEKVDFDALRAIAAKTGGRFFPAQDQASL 270 Query: 438 NKIFRDR 444 +++ + Sbjct: 271 RQVYAEL 277 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 60/190 (31%), Gaps = 34/190 (17%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G F + + L + + + + TAI + A + Sbjct: 121 KGRPHDRIGLVVFAGEAFTQCPLTTDHKILETFLEQLECGNLED--GTAIGMGLAGAVNR 178 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV-- 407 + K + K I+LLTDG N + + AK GI++ +I Sbjct: 179 L-----------KKSPAKSKVIILLTDGVNNVGYFKPLTAGELAKELGIKVYSIGVGTIG 227 Query: 408 ----------------NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG-NE 448 + Q E L A + +F A + +L +I+ + Sbjct: 228 EALTPVSRLSDGSFFLDYAQVEIDEELLREIARMTGGQYFRAKNNQDLRQIYNTIDRLEK 287 Query: 449 IFERVIRITK 458 +V RI K Sbjct: 288 TEIQVTRIKK 297 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 61/197 (30%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + A I N +G T F + + + L +I Sbjct: 108 NRLEAAKQVAAEFI-------NGRPNDNIGLTVFAGEAFTQCPLTVDHGVLLNLFNSIKG 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TAI + A + S K I+LLTDG N + + Sbjct: 161 DIAQRGMIEDGTAIGMGLANAISRLKDSKAKS-----------KVIILLTDGSNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIR+ TI N T E L+ A + Sbjct: 210 PLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVNVPVEIDEQTLTQIAGTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++R+ Sbjct: 270 YFRATSNSKLEEVYREI 286 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 64/194 (32%), Gaps = 31/194 (15%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S + + +D I S +G T F + + L+ Sbjct: 102 SDDVDPNRIEVAKDVAKEFIAS-------RPNDNIGLTIFAGEAFTQCPMTTDHASLL-N 153 Query: 323 IVKTFAIDENEM----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ D + TAI + A + E K ++LLTDG Sbjct: 154 LLAGIRADLSVNHLIQDGTAIGMGLANAVGRLKDVKEGS-----------KVVILLTDGS 202 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ------QEKARYFLSNCA--SPNSFFE 430 N + + + A+ G+R+ TI + E L + A + F+ Sbjct: 203 NNVGDISPLTAASIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYKTLQDIAMQTDGEFYR 262 Query: 431 ANSTHELNKIFRDR 444 A S EL++I++D Sbjct: 263 AQSRAELSQIYKDI 276 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 52/156 (33%), Gaps = 14/156 (8%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 T R+ F + + L + G TA+ A+ + + + Sbjct: 137 GRTTDRIALVAFAGETATLSPLTTDYTALKEYLASINTNIIRTSG-TALGMALSSCVNKL 195 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + ++++DG+NT AKS G+R+ TIA + Sbjct: 196 RDVAGKS-----------RIAIIISDGDNTAGTIPPETAVELAKSFGVRVYTIAIGKPGS 244 Query: 411 QQEKARYFLSNCASP--NSFFEANSTHELNKIFRDR 444 ++ L A FF+A L+KIF + Sbjct: 245 EEGVDEKTLRMLAGGPNGRFFKAADNSSLSKIFDEI 280 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 28/165 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ + + + + ++ TAI +++ A + Sbjct: 130 DRLGLILFADHAYLQTPLTFDLKTIQQMVDESEI--GLAGTRTAIGESIAMAIKRFV--- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 N ++ ++L++DG N + E I +A I I TI Sbjct: 185 --------ENKNEQRVLILVSDGANNSGSIEPIQAAKQAAKNNITIYTIGMGAEQMIKRG 236 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L+ A + +F A + EL I++ Sbjct: 237 LFGNQRINPSADLDEKTLTEIANLTGGKYFRARNQTELQNIYQTL 281 >gi|126336627|ref|XP_001380264.1| PREDICTED: similar to Inter-alpha trypsin inhibitor, heavy chain 3 [Monodelphis domestica] Length = 894 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 40/279 (14%), Positives = 79/279 (28%), Gaps = 14/279 (5%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + V+ S S + P T + E+ V Sbjct: 207 DLMNVLTTSFSGKKGHVSFKPTFDQQRSCPTCSTSLLNGDFIITYDVNREQPGNVQVVNG 266 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +++ + +D S +K ++AL ++ +KK D + Sbjct: 267 YFVHFFAPQNLPVVPKNVVFVIDVSGSMSG-RKLVQTKEALLKILSDVKKDD------FL 319 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F+ V + + F G T IND + + + + E Sbjct: 320 NFILFSSDVRTWKENLVPATPENLKAAEEFVHQIQATGGTNINDGLLRGIEMVNKAREMG 379 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEK 414 ++ I++LTDGE I ++ + + F + Sbjct: 380 TVLDRSTS----IIIMLTDGEANVGESRVEKIQENVRNAIGGKYPLYNLGFGYDVNYNFL 435 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L N +E + + + F D + N + V Sbjct: 436 ERMALENNGVARRIYEDSDANLQMQGFYDEVANPLLIDV 474 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 61/197 (30%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + + I N +G T F + + + L +I Sbjct: 108 NRLEAAKQVASEFI-------NGRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKG 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TAI + A + S K I+LLTDG N + + Sbjct: 161 DIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS-----------KVIILLTDGSNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIRI TI N T E L+ A + Sbjct: 210 PLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++++ Sbjct: 270 YFRATSNSKLKEVYQEI 286 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 59/196 (30%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ + I R+G + + + ++R++ + Sbjct: 112 NRLEALKNVASEFI-------KGRPNDRIGLVEYAGESYTKTPITSDKSIVLRSLQEIRY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G TAI + TA + + S K I+LLTDG N + Sbjct: 165 NNII-EGGTAIGMGLATAVNRLKDSKAKS-----------KVIILLTDGVNNSGSINPKI 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A GI+ TI N L A + + Sbjct: 213 ASELAVEFGIKTYTIGLGTNGMALSPIAIKQNGQFQYGRVKVEIDETLLKEIAQVTGGKY 272 Query: 429 FEANSTHELNKIFRDR 444 F A + +L +I+ + Sbjct: 273 FRATNNKKLAEIYDEI 288 >gi|126722991|ref|NP_062242.2| inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] gi|59808174|gb|AAH89806.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Rattus norvegicus] gi|149034208|gb|EDL88978.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_a [Rattus norvegicus] Length = 933 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 59/192 (30%), Gaps = 12/192 (6%) Query: 271 KHLVRDALASVIRS-----IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I+ IK + +++ + F+ + Sbjct: 279 IDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATEENLNRAV 338 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 +A G T IN A+ +A + + SN+ E+ K+ I+LLTDGE T Sbjct: 339 DYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPTVGETN 394 Query: 386 GIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I + + + F + + L N +E + + + F Sbjct: 395 PKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQLQDFY 454 Query: 443 DRIGNEIFERVI 454 + N + V Sbjct: 455 QEVANPLLSSVT 466 >gi|2292988|emb|CAA72155.1| Inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] Length = 932 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 59/192 (30%), Gaps = 12/192 (6%) Query: 271 KHLVRDALASVIRS-----IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I+ IK + +++ + F+ + Sbjct: 278 IDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATEENLNRAV 337 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 +A G T IN A+ +A + + SN+ E+ K+ I+LLTDGE T Sbjct: 338 DYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPTVGETN 393 Query: 386 GIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I + + + F + + L N +E + + + F Sbjct: 394 PKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQLQDFY 453 Query: 443 DRIGNEIFERVI 454 + N + V Sbjct: 454 QEVANPLLSSVT 465 >gi|194221223|ref|XP_001915876.1| PREDICTED: inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Equus caballus] Length = 834 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 40/253 (15%), Positives = 77/253 (30%), Gaps = 19/253 (7%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + NG +R + ++Y G + +D S Sbjct: 223 SPEQQDTVLNGNFIVRYDVNRTVSGGSIQIENGYFVHYFAPEGLPTIPKNVIFVIDQSGS 282 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +K R+AL ++ + D + FN+ Sbjct: 283 MAG-RKIQQTREALIKIVDDLGPKD------QFNLVCFNEEATQWKPSLVPASAENMKEA 335 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + FA G T INDA+ A + +N+ E+ + I+LLTDG+ T Sbjct: 336 RNFAAGIMARGGTNINDAVLLAVQLLERANKQELLPAGSVS----LIILLTDGDPTVGET 391 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I + + + F + + + L N +E + + + F Sbjct: 392 NRANIQKNVQEAISGQCSLFCLGFGFHVSYAFLEKLALDNGGLARRIYEDSDSALQLQDF 451 Query: 442 RDRIGNEIFERVI 454 + N + +V Sbjct: 452 YQEVANPLLMKVA 464 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 55/181 (30%), Gaps = 32/181 (17%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G F R + L R + TAI A+ ++ + + Sbjct: 130 DDRIGLVAFAGRPYMVSPLTLDHDWLGRRLQ--TIHPGMVEDGTAIGSAIGSSINRLRDQ 187 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 K ++LLTDG N + A++ GI+I TI Sbjct: 188 KAKS-----------KVVILLTDGMNNAGKILPVTAAEAAETLGIKIYTIGAGSRGEVPV 236 Query: 413 ----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L A + ++ A T L KI+ + E +R I Sbjct: 237 PITDKFGNQKIVRAKVDIDEATLEKVAQMTGAKYYRATDTDSLKKIYSEINKLETTKRKI 296 Query: 455 R 455 R Sbjct: 297 R 297 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 57/185 (30%), Gaps = 10/185 (5%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +R ++ + + R F++R+ V K + Sbjct: 4 TKLRQTKDALFTILHDLRP---QDRFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHM 60 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + G T IN A+Q A + N+ H + IV LTDG+ T + I N Sbjct: 61 SPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILN 116 Query: 392 KAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + + I TI + + + L NC E F D I Sbjct: 117 NTREAARGQVCIFTIGIGNDVDFRLLEKPSLENCGLTRRVHEEEDAGSQLIGFYDEIRTP 176 Query: 449 IFERV 453 + + Sbjct: 177 LLSDI 181 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 75.4 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 58/196 (29%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A I R+G + + + ++R++ Sbjct: 113 NRLEALKKVAADFIE-------GRPNDRIGLVEYAGEAYTKTPITSDKSIVLRSMRDIKY 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G TAI + T+ + + S K I+LLTDG N + Sbjct: 166 NTII-EGGTAIGMGLATSVNRLKDS-----------RAKSKVIILLTDGVNNGGFIDPKI 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A GI++ TI N T L A + + Sbjct: 214 ASELAVEYGIKVYTIGLGTNGTALSPVRINPNGSFQYGRQKVEIDEDLLKEIADVTGGKY 273 Query: 429 FEANSTHELNKIFRDR 444 F A + +L +I+ + Sbjct: 274 FRATNNKKLAQIYDEI 289 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 59/196 (30%), Gaps = 38/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A+ ++ R+G + + + +++++ Sbjct: 111 NRLEALKKVAATFVQD-------RINDRIGLVVYAGESYTRTPVTSDKTIILQSLKSVEF 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D T I + TA + I S + I+LLTDG N + Sbjct: 164 DDSIIADGTGIGVGLATAINRIKDSKAKS-----------RIIILLTDGVNNSGTIDPRT 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 + AK GI++ TI N + A + + Sbjct: 213 AASIAKEYGIKVYTIGIGTNGKAMFPVAKDANGKLVFKMMPVEIDEKLMQEIAKNTDAKY 272 Query: 429 FEANSTHELNKIFRDR 444 F A S +L I+ + Sbjct: 273 FRATSNKKLQAIYDEI 288 >gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 787 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 55/188 (29%), Gaps = 27/188 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V++ + +I + D R+ FN + K Sbjct: 258 KIENVKNTILQLIDMLNDHD------RLSIITFNSYAKQLCGLRKVNKDNKENLQKIT-K 310 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---- 385 G T I +QTA+ + + + + LL+DG++ + Sbjct: 311 SIQADGGTNITSGLQTAFSILQNRKQRNSVSS---------VFLLSDGQDNNSDSRIRNL 361 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + + + I + F + + A SF+ ++++ F D Sbjct: 362 LQTTYQQLQEECFTIHSFGFGNDHDGP-----LMQRIAQIKDGSFYYVERNDQVDEFFID 416 Query: 444 RIGNEIFE 451 +G Sbjct: 417 ALGGLFSV 424 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 70/191 (36%), Gaps = 16/191 (8%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +S I + ++ LA + K R+G F D F+ + Sbjct: 119 TSKRGDKISRLDATKEVLADFATTRKG-------DRLGLILFGDAAFVQTPFTADQDVWL 171 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDG 377 + +T ST + DA+ A S + ++++A +K +++LTDG Sbjct: 172 ELLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSEKQSAAVQDSSVDANEKEKVVIVLTDG 229 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANS 433 +T E I AK++G+RI IA + + A S FEA + Sbjct: 230 NDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALN 289 Query: 434 THELNKIFRDR 444 EL K + Sbjct: 290 RDELTKAYAQI 300 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 61/232 (26%), Gaps = 10/232 (4%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D + Y ++ V +R ++ Sbjct: 262 YDVSREQSIGDIQVLNGYFVHYFAPRDLPPLPKNVVFVLDSSASMVGAKLRQTKDALFTI 321 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + F++R+ V K + + G T IN A+Q Sbjct: 322 LHDLRP---QDHFNIIGFSNRIKVWKDHLVSVTPNNVRDGKVYIHHMSPSGGTDINGALQ 378 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 T + D V R + IV LTDG+ T I N + + I Sbjct: 379 TGIALL----HDSVARHDLEDRSVSLIVFLTDGKPTVGETHTPKILNNTREAARGRVCIF 434 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 T+ + + + L NC E F D I + + Sbjct: 435 TVGIGDDVDFRLLEKLSLENCGFTRHVHEDEDAGAQLIGFYDEIRTPLLSDI 486 >gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Anolis carolinensis] Length = 955 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 52/334 (15%), Positives = 95/334 (28%), Gaps = 23/334 (6%) Query: 128 AYQVVLSS---RYDLLLNPLSLFLRSMGIKSWLIQT--KAEAETVSRSYHKEHGVSIQWV 182 YQ +S + L L + GI S + K+ +++ V Sbjct: 181 KYQYTVSIRPQQLVGKLRVEVNILENSGINSLEVPPLQKSRSKSAGEGQDDASPPPSTVV 240 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + Q N ++ + D ++ ++Y Sbjct: 241 GQTKTLAKITFSPTVVQQAKIARNGILGDFTIRYDVNRELSVGDVQVFNGYF-----VHY 295 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + +DSS+ V K +DAL ++++ ++ D+ N Sbjct: 296 FAPKDLPPLPKNVVFVLDSSASM-VGTKLRQTKDALFTILQDLRPEDHFNIIGFSNRIK- 353 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + K + + + G T IN A+Q + + + Sbjct: 354 VWQHDQLVPVTPN----NIRDAKVYIHNMSPSGGTNINGALQISTKILNDYIAQNDIEAR 409 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFL 419 + I+ LTDG T E I N K + TI + + R L Sbjct: 410 SVS----LIIFLTDGRPTFGEIEPAKIINNTKEAIRNKFCLFTIGIGNDVDYKLLERLAL 465 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 NC E E K F I + + Sbjct: 466 ENCGMMRRVREEEDAAEQLKGFYYEIDTPLLSDI 499 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 73/208 (35%), Gaps = 29/208 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S+ K + +A+ + N ++G + D++ + + + Sbjct: 45 VDVSNSMTQSDKNKVSNEAMKMFVDMTSIQAN-----KVGVVAYTDKIEREKALLEINSE 99 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + +K F + T I + A + N IVLL DG Sbjct: 100 EDKNDIKAFIDSLQKGAYTDIAVGVTEAVKILD---------AGRNPNNAPIIVLLADGN 150 Query: 379 N----------TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPN 426 N + ++E +AK +G + TI + + + R L A + Sbjct: 151 NFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGLNADG---QLNRTTLQQIAAETNG 207 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 FFE ++ +L +I + N + +V+ Sbjct: 208 KFFETSTADKLPQILSEIFANHLKLKVV 235 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 47/391 (12%), Positives = 103/391 (26%), Gaps = 32/391 (8%) Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + P ++ +N + T EV+ ++ + + + + V S Sbjct: 25 TEAKKISQPSTLAKKTTSNTSKNLSNQVKTSPEVQQLISQVIKQASLKPKSVAVQKQSLA 84 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW-VIDFSRSMLDYQRDS 196 + + L+S K K++ ++ + + +D+ Sbjct: 85 QNPIKAKATQLKSAVSKQLTSFQKSDISIQKEEKIQKLDTKTMLREQEIKPDLQQMIKDA 144 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGI-------RDEKLSPYMVSCNKSLYYMLYPGPL 249 + + K + + Sbjct: 145 KKPSYDLEKGLTFEIKTLNKHFQFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQVKQAEQ 204 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + V +S +K V+ L ++ + D R+ FN Sbjct: 205 SRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSND------RLSLILFNSYPTLL 258 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + K I K T IN M A++ + Sbjct: 259 CNLRKVDDKNTPNIQK-IINSITAEEYTDINSGMLMAFNILQKRQFF---------NPVS 308 Query: 370 YIVLLTDGENTQDNEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYF-LSNCASP 425 I LL+DG++ +E+ N K++ I + F + R L Sbjct: 309 SIFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLK----D 364 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 +F+ +++++ F D +G I Sbjct: 365 GNFYYVEKINQVDEFFVDALGGLFSVVAQEI 395 >gi|171914502|ref|ZP_02929972.1| von Willebrand factor type A domain protein [Verrucomicrobium spinosum DSM 4136] Length = 424 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 57/192 (29%), Gaps = 26/192 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K R+A + + D + ++D V + R VK Sbjct: 55 DKMVHAREAAKQALDRLGAGD------MVSVVAYDDAVSLISPATDLT---DRDRVKAAI 105 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 GSTA+ + + + + +VLL+DG Sbjct: 106 DRIQAGGSTALFSGISKGAEELRRNKRPNQVNR---------VVLLSDGMANVGPSSPQD 156 Query: 389 I----CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + AK GI + T+ + + L S + ++ L IF+ Sbjct: 157 LGRLGASLAKE-GITVTTLGLGLGYNEDLMTELALR---SDGNHAFIENSQNLAGIFQTE 212 Query: 445 IGNEIFERVIRI 456 G+ + RI Sbjct: 213 FGDILSVVAQRI 224 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 52/161 (32%), Gaps = 28/161 (17%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F D + + + + + TAI +A+ A + + Sbjct: 138 LILFADHAYLQAPLTQDRRSVAQFLTEAQI--GLVGKQTAIGEAIALAVKRFDKAKQSN- 194 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV----------- 407 + ++LLTDG N + + A +G+ I TI Sbjct: 195 ----------RVLILLTDGSNNSGSITPEQAADIAAKRGVTIYTIGVGAEVMERRTLFGK 244 Query: 408 ----NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++A+ L + +F A ++ EL +I+++ Sbjct: 245 ERVNPSMDLDEAQLTLLAQKTKGRYFRARNSDELEQIYQEI 285 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 60/197 (30%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG---VHKLIRTIVK 325 + + A I N +G T F + + + L +I Sbjct: 108 NRLEAAKQVAAQFI-------NGRPNDNIGLTIFAGEAFTQCPLTIDHGVLLNLFGSIKG 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TAI + + + S K I+LLTDG N + + Sbjct: 161 DIAQRGLIEDGTAIGMGLANSISRLKDSKAKS-----------KVIILLTDGSNNRGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNS 427 + AK GIR+ TI N T E L AS + Sbjct: 210 PLTAAEIAKQFGIRVYTIGVGTNGTAPYPMPTYAGVQYVNVPVEIDEQTLIQIASTTNGN 269 Query: 428 FFEANSTHELNKIFRDR 444 +F A S +L +++ + Sbjct: 270 YFRATSNSKLKEVYEEI 286 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 62/185 (33%), Gaps = 17/185 (9%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 R + LA V ++ + RM F + + ++ + Sbjct: 108 TRDFTDASNENVQRLAGVRDVVRAFVEGREGDRMALIVFGSKAYLQSPLTEDTGTIVELL 167 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 +T TAI DA+ + T +S ++ ++LL+DG +T Sbjct: 168 DQTEV--GMAGPHTAIGDAIGLSIRTFEAS-----------EIEQRLLILLSDGADTASR 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + A+ G+ I TIA + L + A + S+F A L++ Sbjct: 215 MSPLNAAEIARGAGVEIFTIAVGDPDATGENRVDVAALQDIANRTSGSYFFAADQAALDE 274 Query: 440 IFRDR 444 I+ Sbjct: 275 IYARI 279 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 67/193 (34%), Gaps = 27/193 (13%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + L +V + N + R+G F R + ++ L R + + Sbjct: 110 SNRLQAVKPVLSAFINRRENDRIGLIAFAGRAYTVAPLTFDHKWLARQTERLQIGLIED- 168 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 TAI D++ A ++ ++ + +IVLLTDGENT + + AK Sbjct: 169 -GTAIGDSLAVATSRLLEGAKERAGEREGA-----FIVLLTDGENTAGMMDPMEGATLAK 222 Query: 395 SQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSFFEANST 434 GIR+ TIA N L A + FF A ++ Sbjct: 223 DAGIRVYTIAAGKNGYVPFPRRNERGERIGTTQEFLRVDTETLMKIANETNGEFFRAENS 282 Query: 435 HELNKIFRDRIGN 447 +++ F + Sbjct: 283 DTIDQAFEKIDAS 295 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 54/154 (35%), Gaps = 17/154 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 RM F + + ++ + +T TAI DA+ + T +S Sbjct: 139 DRMALIVFGSKAYLQSPLTEDTGTIVELLDQTEV--GMAGPHTAIGDAIGLSIRTFEAS- 195 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQ 412 ++ ++LL+DG +T + A+ G+ I TIA + Sbjct: 196 ----------EIEQRLLILLSDGADTASRMSPLNAAEIARGAGVEIFTIAVGDPDGTGEN 245 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L + A + S+F A L++I+ Sbjct: 246 RVDVAALQDIANRTSGSYFFAADQAALDEIYARI 279 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 61/215 (28%), Gaps = 20/215 (9%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA--LASVIRSIKKIDNV 291 S + Y +D+S ++ ++ Sbjct: 62 FSLASPIIYDQKTSSKRKGRDLVFALDTSGSMAESGFNPENVQNRKFDALKELLRSFITK 121 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 +G + F S+ + + + F STAI + + A + Sbjct: 122 RYNDNVGVSIFGTYAYPAIPLSYDMGSV--AFLLDFFDVGIAGDSTAIGEGLAMALKIL- 178 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 K +K I+L+TDG KAK Q ++I TI + Sbjct: 179 ----------KKGEAKEKVIILITDGYQNSGAVSVKEAVQKAKKQHVKIYTIGIG---DR 225 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + FEA + L I+++ Sbjct: 226 SAFDANLLQLIAKNTDAKMFEAKNVKMLQDIYKEI 260 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 39/396 (9%), Positives = 101/396 (25%), Gaps = 34/396 (8%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ + M +L +G + +D + L+ A A ++ +V + + + Sbjct: 11 RRRQRGAVIIMITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVAIQKEYDY 70 Query: 75 VSSRAKNSFTFPKQKIEEYLIR----------NFENNLKKNFTDREVRDIVRDTAVEMNP 124 + +R + TF + NF ++ E + + P Sbjct: 71 LRARQEGLVTFTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVEFSERPDPFVPVLTP 130 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 +V +S + + F + MG+ + + S + + + Sbjct: 131 GAQYVRVTVS-----DVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTDLLPMMVCAE 185 Query: 185 FSRSMLDYQRDSEGQPLN--------------CFGQPADRTVKSYSSQNGKVGIRDEKLS 230 ++ + + D T + + ++ Sbjct: 186 DLGEDNFGYPLNKMMAMKISSQQNTPIGPGNFQLIRLGDNTGAADIRRAMAGESAGDQFC 245 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 N + PG +++ + + +D V + + Sbjct: 246 FGYDGPNATEQVETEPGNTVGPVAQGLNTRMGRWQGPVNSDDHPQDWDTCVGEPLDVDSD 305 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 N T + + + + + + A D+ Sbjct: 306 GNITDKN-----GSPITIPDGTADYHAYMDTNYPQYYHNGWYNSVEGHSCPAAADGSDSF 360 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ++ M+ E + G N N Sbjct: 361 DRNSNSSAMGMEKRREFSIVVADCGTGNNNGQNSLP 396 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 75.0 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 64/182 (35%), Gaps = 20/182 (10%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +K + ++ + I +D N R F+D S + + Sbjct: 124 GSMKNTDPNNERFSAALNLIDNMDKNN---RFSMYKFDDTAEKIIPMSQVTKQSREEVSG 180 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NE 384 +N G+T + DA++ AY+ I SS + +++L+DG +T D ++ Sbjct: 181 KLKDMQNPKGNTNMRDALEKAYEEIKSS---------ETKDKNAMVIMLSDGGDTYDLSK 231 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + K + I I TI S L A S +++ +L +F Sbjct: 232 KFDETLKPFKEKNISIYTIGMSNGNNF-----SMLKEIAKESGGNYYNVKEIKDLKNVFN 286 Query: 443 DR 444 Sbjct: 287 KI 288 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 75.0 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 67/216 (31%), Gaps = 24/216 (11%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P++ + + L S + +K + + + Sbjct: 66 PFVYDAAGNQH------KKGRDLVLAIDASGSMAQSGFDEKDRFKTKYETTLDLSADFIK 119 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 MG F + ++ + + + T STAI DA+ A T+ Sbjct: 120 HRFDDNMGVVIFGTFAYTASPLTYDLEAMESMLKMTTV--GIAGESTAIGDALMQAMRTL 177 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 K I+LLTDG + A KAK +GI+I TI + Sbjct: 178 S-----------YGEAQSKAIILLTDGYHNAGRSSPKAAVAKAKEKGIKIYTIGVGKSSD 226 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A S + A S +L +++++ Sbjct: 227 Y---DAALLDTIAKESGGKSYAAASAAQLKEVYKEI 259 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 68/213 (31%), Gaps = 42/213 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I +G T F + + L+ +++ Sbjct: 108 NRIEAAKDVAAEFIS-------GRPNDNIGLTIFAGEAFTQCPMTTDHASLLT-LLQDVR 159 Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D G TAI + A + S + ++LLTDG N + Sbjct: 160 TDMATRGLINDGTAIGMGLANAVSRLKDS-----------KTKSRVVILLTDGANNAGDI 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSVN----------------KTQQEKARYFLSNCA--SPN 426 + AKS GIR+ TIA + + L A + Sbjct: 209 SPLTAAQMAKSLGIRVYTIAVGTSKVAPYPIEVGGRVQYISRPADIDTKTLREIAAVTEG 268 Query: 427 SFFEANSTHELNKIFRDRIG-NEIFERVIRITK 458 +F+ AN+T +L +I+ D + V R K Sbjct: 269 NFYSANNTAQLKQIYHDIDQLEKTKMSVTRYAK 301 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 63/198 (31%), Gaps = 41/198 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I +G F + + L+ +++ Sbjct: 108 NRIEAAKQVASEFI-------IGRPNDNIGLAIFAGESFTQCPMTTDHASLL-NLLQNVR 159 Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D G TAI + A + S K ++LLTDG N + + Sbjct: 160 TDIAARGLIEDGTAIGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNRGDI 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PN 426 AKS GIR+ TI NK E LS AS Sbjct: 209 SPSTAAEIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNVPVEIDTKTLSEIASITEG 268 Query: 427 SFFEANSTHELNKIFRDR 444 F+ A +T+EL KI+++ Sbjct: 269 DFYRATNTNELRKIYKEI 286 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 57/185 (30%), Gaps = 10/185 (5%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +R ++ + + R F++R+ V K + Sbjct: 4 TKLRQTKDALFTILHDLRP---QDRFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHM 60 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + G T IN A+Q A + N+ H + IV LTDG+ T + I N Sbjct: 61 SPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHTLKILN 116 Query: 392 KAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + + I TI + + + L NC E F D I Sbjct: 117 NTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRTP 176 Query: 449 IFERV 453 + + Sbjct: 177 LLSDI 181 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 65/232 (28%), Gaps = 10/232 (4%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D + + Y ++ V ++ +++ Sbjct: 228 YDVEREQNIGDIQVLNGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTI 287 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + R F++R+ V K + + G T IN A+Q Sbjct: 288 LNDLRP---QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQ 344 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 A + + ++ I+ LTDG+ T + I + K I I Sbjct: 345 AAIKLLNNYVAQNDIEDRSVS----LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIF 400 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 T+ + + + L NC E + F D I + + Sbjct: 401 TVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEIRTPLLSDI 452 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 65/232 (28%), Gaps = 10/232 (4%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D + + Y ++ V ++ +++ Sbjct: 262 YDVEREQNIGDIQVLNGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTI 321 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + R F++R+ V K + + G T IN A+Q Sbjct: 322 LNDLRP---QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQ 378 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 A + + ++ I+ LTDG+ T + I + K I I Sbjct: 379 AAIKLLNNYVAQNDIEDRSVS----LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIF 434 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 T+ + + + L NC E + F D I + + Sbjct: 435 TVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEIRTPLLSDI 486 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 65/232 (28%), Gaps = 10/232 (4%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D + + Y ++ V ++ +++ Sbjct: 262 YDVEREQNIGDIQVLNGYFVHYFAPKNLPPLPKNVVFVLDISASMVGAKLQQTREALVTI 321 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + R F++R+ V K + + G T IN A+Q Sbjct: 322 LNDLRP---QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQ 378 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 A + + ++ I+ LTDG+ T + I + K I I Sbjct: 379 AAIKLLNNYVAQNDIEDRSVS----LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIF 434 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 T+ + + + L NC E + F D I + + Sbjct: 435 TVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEIRTPLLSDI 486 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 61/167 (36%), Gaps = 9/167 (5%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R+G F D F+ + + +T ST + DA+ Sbjct: 136 LADFAKTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIG 193 Query: 345 TAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A S + ++++A +K +++LTDG +T E I AK++G+RI Sbjct: 194 LATKVFEQSEKQSAAVQDSSIDANVKEKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIH 253 Query: 402 TIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 IA + + A S FEA + EL K + Sbjct: 254 VIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQI 300 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 59/195 (30%), Gaps = 39/195 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + S + R+G F+ + + L I + + Sbjct: 116 DRLEAAKRVAMNFVDS-------RISDRIGLVIFSGESFTQCPITTDHGVLKNQIAQVKS 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI + T+ + + +S K I+LLTDG N + + Sbjct: 169 GMLQD--GTAIGMGLATSVERLRTSKAKS-----------KVIILLTDGVNNTGLIDPLT 215 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK-----------------ARYFLSNCA--SPNSFF 429 AK+ IR+ TI + + + +F Sbjct: 216 ALEIAKAFKIRVYTIGVGTIGKAPFPMTMPDGSIQMQMQDVQLDEPLMKKISVETGGKYF 275 Query: 430 EANSTHELNKIFRDR 444 A S EL I+ + Sbjct: 276 RATSNKELENIYGEI 290 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 66/223 (29%), Gaps = 39/223 (17%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + +D + + +V+ +A I +T R+G F + Sbjct: 91 CLDTSGSMQALDFELDGKPVTRLTVVKKVVADFI-------KERETDRIGLVVFGQEAFT 143 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + L+ + K STAI +A+ A + K+ Sbjct: 144 QSPLTMDKGLLLSLVDKMEI--GMAGDSTAIGNAIAVAGKRL-----------KDLKAKS 190 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ---------------- 412 K +++LTDG + + A + GI+I TI Sbjct: 191 KIMIILTDGRSNTGDITPEEAAGAAAALGIKIYTIGVGGTGPAPFKVNTFFGPRIVNQSV 250 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + L A +F A + EL I+ + I V Sbjct: 251 DLDEKTLKEIAAIGKGKYFRATDSKELANIY-EIINKAEKTEV 292 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 59/197 (29%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ ++ I R+G + + + +++++ Sbjct: 49 NRLDALKRVASTFIED-------RINDRIGLVVYAGESYTRTPITSDKTVILQSLKTVEY 101 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D T I + TA + I S + I+LLTDG N + Sbjct: 102 DDSIIADGTGIGVGLATAINRIKDSKAKS-----------RVIILLTDGVNNAGTIDPRM 150 Query: 389 ICNKAKSQGIRIMTIAFSVNK-------------------TQQEKARYFLSNCA--SPNS 427 + AK GI++ TI N Q E + A + Sbjct: 151 AADIAKQYGIKVYTIGIGTNGMALFPYAKDQETGKFLFRNMQVEIDEKLMKEIAEMTDGK 210 Query: 428 FFEANSTHELNKIFRDR 444 +F A +L I+ + Sbjct: 211 YFRATDDKKLKAIYAEI 227 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 30/375 (8%), Positives = 87/375 (23%), Gaps = 6/375 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A + + A+ L Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDL-TGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + + T + + S V Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCM 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSQTGIVNWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFICKAGTQTNPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +++ + T + + D V +F+ Sbjct: 255 GNKAATTNAYNTRFGIYANPYKDPSYGTPDFTGYAYDATTWPSQSNAYADFVSKRQTFTS 314 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + T + + + VL+ Sbjct: 315 Y-----QGDLITGINTGGTYNPNYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLM 369 Query: 375 TDGENTQDNEEGIAI 389 D + + + + Sbjct: 370 LDPMGSGGSAGPVHL 384 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 74.3 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 64/199 (32%), Gaps = 35/199 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + + + + +++S+ D G F S S ++ Sbjct: 99 SMQATDYQPTRLESSKRSAEILLKSLDPKDY------AGIITFESGATSAAYLSPDKDRV 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 IR + + G+TAI D + D ++ KK ++LL+DG N Sbjct: 153 IRKLQAIE----PKEGATAIGDGLALGIDM-----------AESMPNRKKVVILLSDGVN 197 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA--SP 425 A+ +GI++ TI + L A + Sbjct: 198 NAGVIHPEQAAGFAREKGIQVFTIGMGSDSPVVLGYDWFGNPQYATLDEAMLQQIAASTN 257 Query: 426 NSFFEANSTHELNKIFRDR 444 +F++ L++I+ + Sbjct: 258 GQYFKSVDDRTLSEIYSNL 276 >gi|21228580|ref|NP_634502.1| putative chloride channel [Methanosarcina mazei Go1] gi|20907073|gb|AAM32174.1| putative chloride channel [Methanosarcina mazei Go1] Length = 1004 Score = 74.3 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 87/273 (31%), Gaps = 30/273 (10%) Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + N + + V + + G +P N + +Y +LS+ Sbjct: 237 TGLNKDNFTIEIGSKKVNDVTVSDVGEGKYKLSFNPPKQDSNGNYDLNVYVKYKKVTLSD 296 Query: 256 EHFV-------DSSSLRHVIKKKHLVRDALASVIRSIKKI-----DNVNDTVRMGATFFN 303 ++++ +V+ S I S K D + G F+ Sbjct: 297 SELNAVRYGEDNANANANVMLVIDRSGSMSGSPISSAKNSANLFIDYMEAEDMAGVVSFS 356 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + ++ + +K G TAI M+ + +++ + Sbjct: 357 SSARYDYHLATLTPEV-KNSIKQKINSIYASGVTAIGSGMRYGLNDLLNYGDPNNP---- 411 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 IVLL+DG + K+ I++ T+ + L N A Sbjct: 412 -----WAIVLLSDGYQNSGE-NPNNVIPSIKASNIQVYTVGLG-----PAVDQKLLGNIA 460 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + ++ + + +L +I+ D +G I + + Sbjct: 461 DQTGGKYYYSPTDSQLQEIYNDIVGKIIGWKTV 493 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 63/199 (31%), Gaps = 35/199 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + + + + +I S+K D G F + S K+ Sbjct: 100 SMQAQDYTPSRLEAAKSSAEILINSLKSKDY------AGIVTFESGATTAAYLSPYKEKV 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I + + GSTAI D + D + KK I+LL+DG N Sbjct: 154 IEKLR----NVAPKEGSTAIGDGLSLGIDM-----------ASSIPNKKKVIILLSDGVN 198 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SP 425 AK+ I++ TI N E L A + Sbjct: 199 NAGYISPDEAIQYAKANNIQVYTIGMGSNGNVLLGYDWFGNPQYAELDEATLQAIANDTG 258 Query: 426 NSFFEANSTHELNKIFRDR 444 +F++ L++I+++ Sbjct: 259 GKYFKSIDDKTLDEIYKNI 277 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 49/161 (30%), Gaps = 28/161 (17%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F D + + + + + TAI +A+ A Sbjct: 131 LILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALAVKRFD------- 181 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN---------- 408 E+ + +VLLTDG N + A +G++I +I Sbjct: 182 ----RVDESNRILVLLTDGSNNSGSISPEQAAAIAAKRGVKIYSIGVGAEVMERRTLFGK 237 Query: 409 ---KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L+ A + +F A + EL I+++ Sbjct: 238 ERVNPSMDLDETQLTALAQTTGGLYFRARNAQELESIYQEI 278 >gi|301767170|ref|XP_002919035.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Ailuropoda melanoleuca] Length = 891 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 39/310 (12%), Positives = 93/310 (30%), Gaps = 22/310 (7%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + I T + + ++ + + + + P Sbjct: 184 IEADIYEPQGISTLDAEASFITNDFLGSALTKSFSG-KKGRVSFKPSMDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + E + + +++ G S +D S H Sbjct: 238 TCTDSLLHGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPKSVVFVIDVSGSMHG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDN-VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K +DAL ++ +K+ D T + D ++ +T Sbjct: 298 -RKMEQTKDALLKILDDMKEEDYLNIILFSGDVTIWRDSLVQATP-------ENIQEART 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ G T IND + A + + + E+ ++ +++LTDG+ Sbjct: 350 FVKNIHDQGMTNINDGLMRAINMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 406 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTGV 475 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 64/213 (30%), Gaps = 40/213 (18%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D ++ + V++ + I R+G F + + + Sbjct: 103 DFKLDGAIVNRLDAVKNVVKDFIM-------KRSGDRIGMVVFGSEAFTQMPLTRDYDTI 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + STAI DAM + + ++LLTDG++ Sbjct: 156 AFVLSRLKIGA--AGPSTAIGDAMGISLKRLEDVKSKSN-----------IVILLTDGKS 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNC 422 A + A+ +G+++ TI + L Sbjct: 203 NSGEITPGAAADIARERGVKVYTIGVGQRGKAPFLVNDPLFGQRYVYQMVDMDHEALKEI 262 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A + +FF A T L KI+ D I + V Sbjct: 263 ADKTGGAFFAAADTDSLKKIY-DMIDSLEKTEV 294 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 57/197 (28%), Gaps = 41/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I + ++ F R ++ L I Sbjct: 106 TRLEASKKTMIDFI-------KKRNFDKISLVAFALRASVLSPSTFDYTLLEEEIKNIKI 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + GST+I + TA D + S D +K I+LLTDGEN + Sbjct: 159 DE---EGSTSIGLGIATAVDMLRSVKGDN----------EKIIILLTDGENNSGEIDPKL 205 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-------------------EKARYFLSNCAS--PNS 427 A + I+I TI L + AS Sbjct: 206 ASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFSLNEEALIDIASTTGGK 265 Query: 428 FFEANSTHELNKIFRDR 444 +F A + L+ ++ Sbjct: 266 YFNAQNASALDNVYNTI 282 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 70/217 (32%), Gaps = 27/217 (12%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + S + +D L+ + + + + ++ + I N Sbjct: 75 QSNSAWNNKETEGIDIMLAMDISASMLTNDVTPNRLIVAKNVASDFI-------NGRPND 127 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTII 351 +G T F + + LI ++ D G TAI + A + Sbjct: 128 NIGLTIFAGEAFTQCPLTIDHATLI-NLLNNVRADLVVKGLIQDGTAIGMGLANAVGRLK 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--NK 409 +SN K ++LLTDG N + + AK IR+ TI + Sbjct: 187 ASNAKS-----------KIVILLTDGSNNVGSISPMTAATIAKKFNIRVYTIGLGTEQSG 235 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + F+ A S EL +I+ D Sbjct: 236 NYNDIDYTTLKQIALTTNGEFYRAQSQTELLQIYNDI 272 >gi|281338026|gb|EFB13610.1| hypothetical protein PANDA_007565 [Ailuropoda melanoleuca] Length = 854 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 39/310 (12%), Positives = 93/310 (30%), Gaps = 22/310 (7%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + I T + + ++ + + + + P Sbjct: 149 IEADIYEPQGISTLDAEASFITNDFLGSALTKSFSG-KKGRVSFKPSMDQQRSC-----P 202 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + E + + +++ G S +D S H Sbjct: 203 TCTDSLLHGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPKSVVFVIDVSGSMHG 262 Query: 268 IKKKHLVRDALASVIRSIKKIDN-VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K +DAL ++ +K+ D T + D ++ +T Sbjct: 263 -RKMEQTKDALLKILDDMKEEDYLNIILFSGDVTIWRDSLVQATP-------ENIQEART 314 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ G T IND + A + + + E+ ++ +++LTDG+ Sbjct: 315 FVKNIHDQGMTNINDGLMRAINMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRP 370 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 371 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYE 430 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 431 EVANPLLTGV 440 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 61/212 (28%), Gaps = 39/212 (18%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + +V+D +A I +G F R + + L Sbjct: 103 DFTLNGKRTNRLEVVKDVMAKFISQ-------RPNDSIGLVAFAGRPYVVCPPTLDHNWL 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + TAI A+ T + + + I+LLTDG N Sbjct: 156 TLRLHSLSIGMIED--GTAIGSAIGTGVNRLREK-----------KSPSQIIILLTDGIN 202 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNC 422 + AKS +++ TI + LS Sbjct: 203 NAGKVPPLIAAEAAKSFKVKVYTIGAGTRGEAPIPITDAFGRRQLVRARVDIDDKTLSKV 262 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 A + +F A T L K++ + E R Sbjct: 263 AQITGARYFRATDTESLEKVYAEINSMETTSR 294 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 59/182 (32%), Gaps = 16/182 (8%) Query: 274 VRDALASVIRSIKKID-------NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 A + I + R+G F D F+ + + +T Sbjct: 123 FTSASGANISRLDATKEVLAEFAKTRQGDRLGLILFGDAAFVQTPFTADQKVWLALLNQT 182 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ST + DA+ A S + K +K ++LTDG +T E Sbjct: 183 DVA--MAGQSTHLGDAIGLAIKVFEQSEPSQAASSKPR---QKVAIVLTDGNDTGSFVEP 237 Query: 387 IAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 I AK++G+RI IA + + A S F+A + EL + Sbjct: 238 IDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELASAYD 297 Query: 443 DR 444 D Sbjct: 298 DI 299 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 63/194 (32%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL-NLLKDMK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + A + S K I+LLTDG N + + + Sbjct: 160 CGLIEDG-TAIGMGIANAVTRLKDSKAKS-----------KVIILLTDGVNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|149034209|gb|EDL88979.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Rattus norvegicus] Length = 706 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 59/192 (30%), Gaps = 12/192 (6%) Query: 271 KHLVRDALASVIRS-----IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 I+ IK + +++ + F+ + Sbjct: 279 IDKSGSMAGKKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATEENLNRAV 338 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 +A G T IN A+ +A + + SN+ E+ K+ I+LLTDGE T Sbjct: 339 DYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPTVGETN 394 Query: 386 GIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I + + + F + + L N +E + + + F Sbjct: 395 PKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQLQDFY 454 Query: 443 DRIGNEIFERVI 454 + N + V Sbjct: 455 QEVANPLLSSVT 466 >gi|332185631|ref|ZP_08387379.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] gi|332014609|gb|EGI56666.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] Length = 420 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 63/460 (13%), Positives = 119/460 (25%), Gaps = 65/460 (14%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 ++ AL +PV+ GM VD R A S + + Sbjct: 1 MMFALALPVLTCSIGMGVDYAR--------------AAKAQSKLNAIADAAALLAVSKNA 46 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 Y R+F + + + T V++ R Sbjct: 47 MRADDATAAYFARSFFSLQSAALVKSDGITLSNVTVQAPTDGNGRRTAVVNYRAT----S 102 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 ++F R +G+ + I K+E + ++D S SM S + Sbjct: 103 ENVFARILGMSTLTISGKSETANAIAPD-----IDFYMLLDVSASMALPTTSSGLNKVAQ 157 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMV---SCNKSLYYMLYPGPLDPSLSEEHFVD 260 + + K + S L ++ + + Sbjct: 158 SNTSRCVFACHTGEKRFRGYDAHGKQTDLYGVALSYGLPLRIDAEGDAVNQLTATARSMA 217 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S + R A +R D + S + Sbjct: 218 SKNGSDYRIAITTFRGARGFSVRQPLTNDLTAAGHKAANLKPPYYASIGCPTSACKSSEV 277 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + T +DAM I N + + + ++TDG Sbjct: 278 GWNDRD----------TGSSDAMDQINAMIPQPGSGV-----NGQDPQAVVFMVTDGMRN 322 Query: 381 QDNEE--------GIAICNKAKSQGIRIMTIAFSVNKTQQEKA--------------RYF 418 + + + A C+ K +GIRI + + + Sbjct: 323 EKSPKGARPEVAFDTAKCDMIKHRGIRIAVLYTEYLRDAVKGTTNLERSVEPYLYQVEPA 382 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L +CASP + + + +++ N + RITK Sbjct: 383 LQSCASPGLYTKVTTDGDISAALNTLFQNAVAT--TRITK 420 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 74/202 (36%), Gaps = 27/202 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + K + A + I + R+G +NDR++ S + R +K Sbjct: 55 NSMNKTDPGKTAAEVMSMFIDMSEATR--TRIGFVAYNDRIVQAQSPASMAEARNREQLK 112 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT----- 380 G + + ++ + I + + +++LL+DG Sbjct: 113 RTIQGLRYSGYSDLGLGLRRGAEMIEKAKDPARKP---------FLILLSDGGTDLRQNA 163 Query: 381 ------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEAN 432 N++ + +KAK+QG I TI + + + Q + L A+ + F Sbjct: 164 GGRSVAASNKDVETVISKAKAQGYPIYTIGLNNDGSVQ---KEQLKKIAEATGGTSFVTQ 220 Query: 433 STHELNKIFRDRIGNEIFERVI 454 ST +L +IF I +++ Sbjct: 221 STDDLPEIFNQIFAKHIQSQLV 242 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 60/193 (31%), Gaps = 35/193 (18%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I + V+ L I R+G F ++ + + + + Sbjct: 112 SINRLQAVKRVLDDFISR-------RQGDRLGLILFGTEPYVQAPLTFDLETVRTLMREA 164 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TAI DA+ A + + +D ++ +VLLTDG NT Sbjct: 165 GL--GMAGRATAIGDAVGLATKRLRNRPQD-----------QRVVVLLTDGANTAGEITP 211 Query: 387 IAICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCA--SPNSFFEA 431 A + IR+ TI ++ L+ A + +F A Sbjct: 212 DKATEIAAAASIRLYTIGIGAESMVQRGLLGSRRVNPSRDLDENLLTRMAQQTGGEYFRA 271 Query: 432 NSTHELNKIFRDR 444 S EL I+ Sbjct: 272 RSLPELELIYESI 284 >gi|70733679|ref|YP_257319.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68347978|gb|AAY95584.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 582 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 49/396 (12%), Positives = 109/396 (27%), Gaps = 33/396 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + +V + S A +S Q + + + R V + Sbjct: 16 AAGLLLAVAGCGVSSKPESAAGSSTQGALQAAPQ--AQYEVQHADATMAKRAVHPMRLSA 73 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + V P + + +A T S + Sbjct: 74 PMPAPISSRDSLV-----AGYRDEPREQYQKL-PDNPIHSVAEAPVSTFSADVDTGAYAN 127 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 ++ +++ S+ + +N F + +P+ Sbjct: 128 VRRLLNQ-GSLPPEGAVRLEELVNYFPYDYALPTDGSP-FGVTTELAPSPWNPHTRLLRI 185 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + + VD S + LV+ L ++ ++ D V+ Sbjct: 186 GIKASDRAVAELAPANLVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQDRVS------ 239 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + + G + ++T GSTA +Q AY D+ Sbjct: 240 LVVYAGESRVVLEPTSGR---DKAKIRTAIDQLTAGGSTAGASGIQLAYQMAQQGFIDQG 296 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 I+L TDG+ + A+ + + G+ + T+ F V+ + Sbjct: 297 INR---------ILLATDGDFNVGVSDFDSLKAMAAEKRKSGVSLTTLGFGVDNYNEHLM 347 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ ++ E K+ D++ + + Sbjct: 348 EQLAD--AGDGNYAYIDNLREARKVLVDQLSSTLAV 381 >gi|301625572|ref|XP_002941978.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like, partial [Xenopus (Silurana) tropicalis] Length = 476 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 52/156 (33%), Gaps = 7/156 (4%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+D+V + K F + G T IN A+ A + +++ +++ Sbjct: 31 FDDKVDKWQNTLVKAVPDNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPK 90 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYF 418 + I+ L+DGE T I N K + + F + + Sbjct: 91 ISTS----IILFLSDGEPTSGVTNHNEIINNVKKANERQTTLYCLGFGNDVDFNFLEKMA 146 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 L N +E + + F + + N + V Sbjct: 147 LENGGLARRIYEDSDAALQLQGFYNEVANPLLLNVQ 182 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 74.3 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 55/190 (28%), Gaps = 31/190 (16%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + I R+G F +S + L + Sbjct: 121 SPNRLEAAKRMARQFI-------KGRLQDRIGLIVFAGEAVSLCPLTTDYELL-YGFLDE 172 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + + + + K +L++DG+NT N Sbjct: 173 VTPSLIPTPGTAIGSALAVAVNRMRDTAGES-----------KVAILISDGDNTSGNLGP 221 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------LSNCA--SPNSFFEANST 434 A + G+++ TI+ K+ + L N A +F A Sbjct: 222 TTSAQLANAFGVKVYTISVGKPKSASKADTTASAGALMDEGELQNIAGIGNGKYFRATDN 281 Query: 435 HELNKIFRDR 444 L +F+ Sbjct: 282 TALESVFKQI 291 >gi|170743045|ref|YP_001771700.1| hypothetical protein M446_4937 [Methylobacterium sp. 4-46] gi|168197319|gb|ACA19266.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 440 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 54/162 (33%), Gaps = 25/162 (15%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ + TG +I L +P+++ G V+ R L++A A + A+ L Sbjct: 3 RAIRRFLADRTGAVALIIGLSLPLLVAGSGAAVEYARIHKRRAELQKAVDVAALGAAGEL 62 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + +VS A R + + + R ++ Sbjct: 63 SVAGSDVSVEAMARRLAFDSA-------------------RATDPGITRVSAAVVGRGTS 103 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 V ++ SLF R + + S I A AE+ + Sbjct: 104 VTVAINETVQ------SLFGRLLTLPSMEIGASATAESSGST 139 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 53/166 (31%), Gaps = 28/166 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F S + L I G TAI A+ +A + + Sbjct: 146 DRIGMVVFAGEAYSLAPLTNDYKLLTDLIQDISFNMMEAKG-TAIGSAIASATNRM---- 200 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 K + A K ++LL+DGE+ N + + A + I+I TIA + Sbjct: 201 -------KESESASKVLILLSDGESNAGNVDPLFAAQLASALDIKIYTIAVGKDGMVPYG 253 Query: 413 ------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A FF A+ LN IF Sbjct: 254 TDFFGRPQMVESYLDETNLREIAKIGNGEFFRASDGGTLNNIFDRI 299 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 58/200 (29%), Gaps = 35/200 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + V+ ++ + R+G F D + + Sbjct: 101 DMSDGSDYVDRLTAVKKVVSDF-------AIKREGDRLGVVLFADHAYLQTPLTLDRTTV 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + TAI + + A T I S ++ ++LL+DG N Sbjct: 154 ADQVNQLVLRLI--GDKTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--S 424 T + I AK I TI T Q+ L A + Sbjct: 201 TSGVIDPIEAAKIAKKYDATIYTIGVGAGEMMVKEFFMTRKVNTAQDLDEKALMQIAQIT 260 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A EL I+ Sbjct: 261 GGQYFRARDAKELATIYDTI 280 >gi|222080976|ref|YP_002540339.1| hypothetical protein Arad_7191 [Agrobacterium radiobacter K84] gi|221725655|gb|ACM28744.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 649 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 33/327 (10%), Positives = 80/327 (24%), Gaps = 23/327 (7%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + Y + G I+T +PV++G + ++ ++ A A ++ Sbjct: 8 LLHYLRLFCDDRKGGIAIMTVFCLPVVIGFAALSIEYGYGLLVRDQNQRTADLASYAGAL 67 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + E ++ +V TA + R Sbjct: 68 AYSNANSEDQMTDAALRVAKLNGVDA------------------ANVVVSLTASPKDSRV 109 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 A V +++ L L P+ + I + + AE+ + + S Sbjct: 110 QAVHVDVTTTNTLFLAPILGVDSKLNIATEAYSSLGAAESGCIIALDKSASGVTL----S 165 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + P + + S+ + + S +V + S + Sbjct: 166 GGVQANASSCYVASNSNLVAPCGTKITAKSASYYEGSSQPCPWSTNIVRADGSPAPVTKQ 225 Query: 247 GPLDPSLSEEH-FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 DP S + + + + + + Sbjct: 226 YTSDPLEGNAGVDKLSGRFDTIREATWPSKVSTGNGTDIEFGGSSSPSVTAAAIDAVGCS 285 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDEN 332 + K T + + Sbjct: 286 YDPANYNQYWTTKWDITCSGSKISMGS 312 >gi|291407509|ref|XP_002720066.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Oryctolagus cuniculus] Length = 1320 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 47/351 (13%), Positives = 96/351 (27%), Gaps = 24/351 (6%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + L L R G ++ K S Sbjct: 132 ISTSLAAGTAMTFALAYEELLQRQQGQYQLVVSLKPGQLVTRLSVEVTVSERTGIAFVHV 191 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + ++ P+ R + + + + S + + + Y Sbjct: 192 PPLRTSRLRTKDNTRQADSPPSTRIERGETCARVTFCPTLRDQAAFSGSGVMADFVVQYD 251 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKK------------KHLVRDALASVIRSIKKIDNVNDT 294 ++ + + D + + + + + ++ KK NV + Sbjct: 252 VLMEDVMGDVQIYDGYFIHYFAPRGLPPVEKNVVFVIDVSGSMFGTKMQQTKKAMNVILS 311 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYD 348 +FN SD W I+ K + G T IN A+ A Sbjct: 312 DLQANDYFNIISFSDTVSVWRAGGSIQATSQNVHSAKNYLDHMEAAGWTDINAALLEAAS 371 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + SN++ I+ LTDGE T I + + + + ++AF Sbjct: 372 VLNHSNQEPGRSPGVGRTP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAF 429 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + R L N +E + + I + V R+ Sbjct: 430 GDDADFPLLRRLSLENRGVARRIYEDTDAALQLEGLYEEISMPLLADV-RL 479 >gi|48374067|ref|NP_001001537.1| inter-alpha-trypsin inhibitor heavy chain H4 [Sus scrofa] gi|3024051|sp|P79263|ITIH4_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Major acute phase protein; Short=MAP; Flags: Precursor gi|1836014|gb|AAB46821.1| IHRP [Sus scrofa] gi|4096979|gb|AAD00024.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Sus scrofa] gi|1588326|prf||2208343A inter-alpha-trypsin inhibitor Length = 921 Score = 73.9 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 83/254 (32%), Gaps = 23/254 (9%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSC-----NKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + +G +R + ++Y + +D+S Sbjct: 223 SPEQQETVLDGNFIVRYDVNRTVTGGSIQIENGYFVHYFAPEVWSAIPKNVIFVIDTSGS 282 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 +K R+AL ++ + D + F+ + + + Sbjct: 283 M-RGRKIQQTREALIKILGDLGSRD------QFNLVSFSGEAPRRRAVAASAENVEEA-- 333 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K++A + + G T INDAM A + +N +E+ ++ +I+LLTDG+ T Sbjct: 334 KSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVT----FIILLTDGDPTVGET 389 Query: 385 EGIAICNKAKSQGI----RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 I + I + + F + + L N +E + + + Sbjct: 390 NPSKIQKNVREA-IDGQHSLFCLGFGFDVPYAFLEKMALENGGLARRIYEDSDSALQLED 448 Query: 441 FRDRIGNEIFERVI 454 F + N + V Sbjct: 449 FYQEVANPLLRLVA 462 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 73.9 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 74/233 (31%), Gaps = 15/233 (6%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 R + + Y + N+ + + D S + + ++ + I Sbjct: 65 IARPQAGTSYSLEVNEGIAIQMLVDVSSSMDMSVKNFDGKS----TTRMEVAKEMVERFI 120 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + +G F + ++G L++ + + TA DA Sbjct: 121 AGDGEDLQGRPHDLIGLITFARYADTRSPLTFGHDALLQIVRHLTIQERPNEDGTAYGDA 180 Query: 343 MQTAYDTIISSN---EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + A + + + + K I+LLTDGEN + I AK+ + Sbjct: 181 LALAAARLKNPQELRHGKRPDAQAEAIESKVIILLTDGENNSGSHLPIEAAGLAKAWDCK 240 Query: 400 IMTIAFSVNKTQQE------KARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 I I+ + + A L + + + F +A+ L ++ + Sbjct: 241 IYAISLGESLDAENPLDALSPAERVLEHISIETGGVFRQAHDFESLLSVYEEI 293 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 73.9 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 63/194 (32%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL-NLLKDMK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + A + S K I+LLTDG N + + + Sbjct: 160 CGLIEDG-TAIGMGIANAVTRLKDSKAKS-----------KVIILLTDGVNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 73.9 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ I Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 GLIED--GTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 73.9 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 59/197 (29%), Gaps = 41/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I + ++ F R ++ L I K Sbjct: 67 TRLEASKKTMIDFI-------KKRNFDKISLVSFALRASVLSPATFDYTSLEEEIKKIEI 119 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + GST+I + TA D + S ED +K I+LLTDGEN + Sbjct: 120 DE---EGSTSIGLGIATAVDMLRSVKEDN----------EKIIILLTDGENNSGEIDPKL 166 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQE-------------------KARYFLSNCA--SPNS 427 A + I+I TI L + A + Sbjct: 167 ASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAATTGGK 226 Query: 428 FFEANSTHELNKIFRDR 444 +F A + L+ ++ Sbjct: 227 YFNAKNASALDNVYNTI 243 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 73.9 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 83/286 (29%), Gaps = 41/286 (14%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG---KVGIRDEKLSPYMVSCNKSLYYM 243 + D + P T+ SS++ + ++ +P + + Sbjct: 205 PEVFDNLNVRSTKTAEINTYPEFSTIPQSSSKDDFAILIHLKAPSANPDQGTGKLANESS 264 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + +D S K L++ A+ V++ + D R+ F+ Sbjct: 265 AGSSRNRAPVDLVTVLDVSGSMAG-TKLALLKRAMGFVVQHLGPSD------RLSVIAFS 317 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 H + ++ G T I DA++ A I + Sbjct: 318 SSARRLFHLQRISHHGRQQALQA-INSLGASGGTNIADALKKAMKVIEDR---------S 367 Query: 364 NLEAKKYIVLLTDGENTQD-------------NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + I+LL+DG++T + + +I N A+ T+ Sbjct: 368 YKNSVCSIILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDAR------HTVPLHAFGF 421 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L + A S +F + F IG + + Sbjct: 422 GADHDSDSLHSIAQASGGTFSFIEDEGVMQDAFAQCIGGLLSIVIQ 467 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 68/198 (34%), Gaps = 41/198 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I +G T F + + L+ +++ Sbjct: 108 NRMEAAKDVAAEFIS-------GRPNDNIGLTIFAGEAFTQCPMTTDHQSLL-NLLQNVR 159 Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D + G TA+ + A + S K ++LLTDG N + + Sbjct: 160 TDLSARGLIEDGTAVGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNRGDL 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PN 426 + N AKS GIR+ TI NK E LS AS Sbjct: 209 SPMTSANIAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQYVNIPVEIDTKTLSGIASVTHG 268 Query: 427 SFFEANSTHELNKIFRDR 444 +F+ A + +EL +I++D Sbjct: 269 NFYRATNNNELKQIYKDI 286 >gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis] Length = 707 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 46/313 (14%), Positives = 90/313 (28%), Gaps = 39/313 (12%) Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 ++ F+ G+ + + KE V +D +S L Sbjct: 178 INSFMVKGGLSTKELANAVTTTQA----EKEAWVHFYPTVDQQKSCDSCDDKGMNGDLIV 233 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 N + D + SP +++ + +D S Sbjct: 234 VYD-----------VNRNTSLGDIQRSPGYF-----VHHFAPSNLQRIPKNVVFIIDQSG 277 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 H +K R AL ++ + + G F+ + K Sbjct: 278 SMHG-RKIEQTRTALIHILNDL------AEDDFFGLLTFDSNIFQWKRELVQATKANLES 330 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 KTFA + G+T IN A+ + + + ++LLTDG+ T Sbjct: 331 AKTFARNIRANGATNINAAVLKGSSMLNAHPREGSAS---------ILILLTDGDPTTGE 381 Query: 384 EEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 AI + ++ + + F + + + L N +E + K Sbjct: 382 TNPEAIQSNVRNAIAEKFPLYCLGFGFDVNFEFLEKMSLQNNGVARRIYEDSDADLQLKG 441 Query: 441 FRDRIGNEIFERV 453 F + + + V Sbjct: 442 FYEEVATPLLTNV 454 >gi|226531069|ref|NP_001152771.1| inter alpha-trypsin inhibitor, heavy chain 4 isoform 2 [Mus musculus] Length = 941 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|151357765|emb|CAO78005.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 903 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|148692824|gb|EDL24771.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_d [Mus musculus] Length = 943 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 307 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 366 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 367 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 422 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 423 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 468 >gi|148692823|gb|EDL24770.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_c [Mus musculus] Length = 927 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 307 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 366 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 367 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 422 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 423 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 468 >gi|148692822|gb|EDL24769.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Mus musculus] Length = 904 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 307 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 366 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 367 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 422 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 423 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 468 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 83/286 (29%), Gaps = 41/286 (14%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG---KVGIRDEKLSPYMVSCNKSLYYM 243 + D + P T+ SS++ + ++ +P + + Sbjct: 205 PEVFDNLNVRSTKTAEINTYPEFSTIPQSSSKDDFAILIHLKAPSANPDQGTGKLANESS 264 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + +D S K L++ A+ V++ + D R+ F+ Sbjct: 265 AGSSRNRAPVDLVTVLDVSGSMAG-TKLALLKRAMGFVVQHLGPSD------RLSVIAFS 317 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 H + ++ G T I DA++ A I + Sbjct: 318 SSARRLFHLQRISHHGRQQALQA-INSLGASGGTNIADALKKAMKVIEDR---------S 367 Query: 364 NLEAKKYIVLLTDGENTQD-------------NEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + I+LL+DG++T + + +I N A+ T+ Sbjct: 368 YKNSVCSIILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDAR------HTVPLHAFGF 421 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L + A S +F + F IG + + Sbjct: 422 GADHDSDSLHSIAQASGGTFSFIEDEGVMQDAFAQCIGGLLSIVIQ 467 >gi|12836422|dbj|BAB23649.1| unnamed protein product [Mus musculus] Length = 902 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|26340740|dbj|BAC34032.1| unnamed protein product [Mus musculus] Length = 941 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|226531047|ref|NP_061216.2| inter alpha-trypsin inhibitor, heavy chain 4 isoform 1 [Mus musculus] gi|26340986|dbj|BAC34155.1| unnamed protein product [Mus musculus] gi|148692825|gb|EDL24772.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_e [Mus musculus] gi|151357764|emb|CAO78004.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|16741341|gb|AAH16500.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] gi|62204734|gb|AAH92258.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 941 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|2739028|gb|AAC25786.1| PK-120 precursor [Mus musculus] Length = 942 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 73/216 (33%), Gaps = 18/216 (8%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ + + + ++ + S + K V +A V + Sbjct: 105 SMAVEDFTLPIATNELTNRAKNDTDSSATKSSTNDTGKGEKVNRLVA-VKHVLNAFVKSR 163 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + R+G F D F+ + + ++ STA DA+ A Sbjct: 164 EHDRLGLILFGDAPYLQAPFTDDIATWQALLNESDI--GMAGQSTAFGDAIGLAISVFQQ 221 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--T 410 S + +++LTDG +T + A ++ I+I TIA Sbjct: 222 S-----------DTQNRVLIVLTDGNDTASKVPPVEAAKVAAARDIKIYTIAIGDPSAVG 270 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 +++ L A + F+A ++ EL K++ + Sbjct: 271 EEKVDLEVLQAMAEITQGKSFQALNSEELLKVYAEI 306 >gi|63100270|gb|AAH94457.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 7/166 (4%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + F+ + +A G T IN+A+ A + + Sbjct: 305 SPQDQFNLIEFSGEANQWKQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLD 364 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN+ E+ K+ I+LLTDG+ T I N + + + F + Sbjct: 365 RSNQAELLPSKSVS----LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFD 420 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L N +E + + + F + N + V Sbjct: 421 VNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVANPLLSSVA 466 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 57/196 (29%), Gaps = 40/196 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ A I R+G + + + ++ + + Sbjct: 112 NRLSALKEVAAEFI-------RKRPNDRIGLVAYAGESYTKTPITSDKSIVLGALRE--I 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + T+ + + K + K I+LLTDG N E Sbjct: 163 TYGQLNDGTAIGMGLATSVNRL-----------KESTAISKVIILLTDGVNNAGFIEPQT 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNC--ASPNSF 428 + A GI+ TI N L++ A+ + Sbjct: 212 AADLALEYGIKTYTIGLGTNGNALSPIGYNPDGSFRYGMRQVEIDEELLTDIATATGGEY 271 Query: 429 FEANSTHELNKIFRDR 444 F A +L I+ + Sbjct: 272 FRATDNEKLEAIYEEI 287 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ I Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 GLITD--GTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPMT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 73.5 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 64/202 (31%), Gaps = 34/202 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ +I+++ D R+ F+ + R Sbjct: 220 SKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIFPLR-RMSDNGREAAGLAI 272 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NE 384 G T I + ++ + ++ I+LL+DG++T + N Sbjct: 273 NSLTSSGGTNIVEGLKKGVRVL---------EERSEQNPVASIILLSDGKDTYNCDNVNR 323 Query: 385 EGIAIC-----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + C + + I + T F + + + S +F S + Sbjct: 324 RQTSHCASSNPRQGRQAIIPVHTFGFGSDHD-----STAMHAISDESGGTFSFIESVATV 378 Query: 438 NKIFRDRIGNEIF--ERVIRIT 457 F IG + + +R+T Sbjct: 379 QDAFAMCIGGLLSVVAQELRLT 400 >gi|94309590|ref|YP_582800.1| hypothetical protein Rmet_0645 [Cupriavidus metallidurans CH34] gi|93353442|gb|ABF07531.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] Length = 434 Score = 73.5 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 43/404 (10%), Positives = 95/404 (23%), Gaps = 20/404 (4%) Query: 2 VFDTKFI-FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 + T+ ++ + G II L++ V++G G+ +D+ + + L+ A + Sbjct: 1 MSTTRSRGMSLPRIHERQRGAVAIIVGLMIVVLVGFIGLALDLGKLYVSKSELQNRADSC 60 Query: 61 IITASVPLIQSLE----EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 + A+ L + E + + + N + ++D + Sbjct: 61 ALAAARDLTGATPLTVSEAAGLTAAARNLVLFQGNLEQQPNITSAESVTYSDSLANPFLD 120 Query: 117 DTAVEMNPRKSAYQ--VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 +V Y V + + + I + + A A T S Sbjct: 121 KNSVTYALNTIKYVKCDVSRGNIANWFAQVLNAIPGIHIGANTVGAFAVATTTSAQTTCA 180 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 V I + ++ + Y N Sbjct: 181 IPVYICRPDTATPAVPGGYTIGQWLTAK----IGTSGGGVYGGGNFGWADLSGGKGNTPA 236 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + P + +S + + + V Sbjct: 237 LDTQLQGAGQCNLPALNTQIGTTGNKASLSDAWNSRFGITTNKTTGVTDFTGFAYTSTTW 296 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRT---IVKTFAIDENEMGSTAINDAMQTAYDT-- 349 + S + +T + GS A Q D Sbjct: 297 TAQNNAYNGTSTDSKGASQPNFLAARKTYQPYQSDSVTGLSTKGSAASQATYQAGADRRL 356 Query: 350 ----IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 I+ + VL+ D N + + + Sbjct: 357 VLAPIVDCSTFSTPGTHQANVTGWACVLMVDPMQQGGNIDTVHL 400 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 73.5 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 57/164 (34%), Gaps = 9/164 (5%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R+G F D F+ + + +T ST + DA+ Sbjct: 141 LSDFAKTRKGDRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVA--MAGQSTHLGDAIG 198 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A S + ++ +K ++LTDG +T E I AK++ +RI IA Sbjct: 199 LAIKVFEQSEKSRTDVEES---KEKVAIVLTDGNDTGSFVEPIDAAKVAKAKDVRIHVIA 255 Query: 405 FSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + A S FEA + EL K + + Sbjct: 256 MGDPQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEI 299 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 63/194 (32%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL-NLIKDVK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TA+ + A + S K I+LLTDG N + + + Sbjct: 160 CGLIEDG-TAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 55/190 (28%), Gaps = 24/190 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 258 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLT-RMSDAGRQLALQAV 310 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ + E I+LL+DG +T Sbjct: 311 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS---------IILLSDGRDTYTTNHPDP 361 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIG 446 Q I + + F + + + S +F S + IG Sbjct: 362 SYKVMLPQ-ISVHSFGFGSDHD-----ASVMHSVSEVSGGTFSFIESESVIQDALAQCIG 415 Query: 447 NEIFERVIRI 456 + V + Sbjct: 416 GLLSVAVQEL 425 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 55/190 (28%), Gaps = 24/190 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 232 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLT-RMSDAGRQLALQAV 284 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ + E I+LL+DG +T Sbjct: 285 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS---------IILLSDGRDTYTTNHPDP 335 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIG 446 Q I + + F + + + S +F S + IG Sbjct: 336 SYKVMLPQ-ISVHSFGFGSDHD-----ASVMHSVSEVSGGTFSFIESESVIQDALAQCIG 389 Query: 447 NEIFERVIRI 456 + V + Sbjct: 390 GLLSVAVQEL 399 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 55/190 (28%), Gaps = 24/190 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 257 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLT-RMSDAGRQLALQAV 309 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ + E I+LL+DG +T Sbjct: 310 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS---------IILLSDGRDTYTTNHPDP 360 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIG 446 Q I + + F + + + S +F S + IG Sbjct: 361 SYKVMLPQ-ISVHSFGFGSDHD-----ASVMHSVSEVSGGTFSFIESESVIQDALAQCIG 414 Query: 447 NEIFERVIRI 456 + V + Sbjct: 415 GLLSVAVQEL 424 >gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Macaca mulatta] Length = 1313 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 43/348 (12%), Positives = 89/348 (25%), Gaps = 23/348 (6%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + L L R G ++ + S Sbjct: 132 ISTSLAAGTEVTFSLAYEELLQRHQGHYQLVVSLRPGRLVKRLSVEVTVSERTGISYVHI 191 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + P+ R + + + S S + + + Y Sbjct: 192 PPLRTSRLRTNAHASEVDSPPSTRIERGETCVRITYSPTLQDQSAISGSGIMADFLVQYD 251 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKK------------KHLVRDALASVIRSIKKIDNV--- 291 ++ + + D + + + + + + KK NV Sbjct: 252 VVMENIIGDVQIYDGYFIHYFAPRGLPPMEKNVVFVIDVSGSMFGTKMEQTKKAMNVILS 311 Query: 292 --NDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTFAIDENEMGSTAINDAMQTAYD 348 F+D + + + K + G T IN A+ A Sbjct: 312 DLRANDYFNIISFSDTINVWKAGGSIQATIQNVHSAKDYLHRMEADGWTDINSALLAAAS 371 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + SN++ I+ LTDGE T I + + + + T+AF Sbjct: 372 VLNHSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAF 429 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + R L N +E + + I + V Sbjct: 430 GDDADFTLLRRLSLENRGIARRIYEDTDAALQLEGLYEEISMPLLADV 477 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 58/204 (28%), Gaps = 20/204 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V + + D A N + ++ + K Sbjct: 349 FSSIVKVWNVDKVQVDYEVGEDPWPLYDSPEAPQKN-KTNQVLPPAYKATDENKEKAKKV 407 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 N G T I A++ + + E++ + IV LTDGE T Sbjct: 408 VEKLNAYGGTDIKSALEVGLKLVKKNKENKE------DAHQPIIVFLTDGEPTMGETNTE 461 Query: 388 AICNKAKSQG-----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I + I +++F ++ + L N +EA + F Sbjct: 462 KITSAISEMNSGETRAPIFSLSFGDGADREFLQKISLKNLGFARHIYEAADASLQLQEFY 521 Query: 443 DRIGNEI--------FERVIRITK 458 +I + + V ++TK Sbjct: 522 KQISSPLLNNVNFKYVSNVTKLTK 545 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 53/200 (26%), Gaps = 31/200 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 332 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA-V 384 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I I+LL+DG++T Sbjct: 385 YSLTSNGGTNIAEGLRKGSKVIEDRQ---------AKNPVCSIILLSDGQDTYTVSPTAG 435 Query: 389 ICNKAKS------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + A Q + + F + S +F + Sbjct: 436 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEAA 492 Query: 437 LNKIFRDRIGNEIFERVIRI 456 + F IG + + Sbjct: 493 IQDAFAQCIGGLLSVVAQDL 512 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 62/213 (29%), Gaps = 40/213 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 + + + + + D + + + + + +G Sbjct: 473 SSSGTWLNTQSRKDGGWNACPSQSRRLTSYTNRTSTPTGQSSSFNSY-----IDNLIAVG 527 Query: 336 STAINDAMQTAYDTIISSN--EDEVHRMKNNLEAKKYIVLLTDGENTQD----------- 382 T + M + + + N ++IV +TDG+ + Sbjct: 528 GTYHDIGMLWGARFLSPKGIFASDNNSAPNGFNISRHIVFMTDGDMSAYQQVYGAYGYQQ 587 Query: 383 -------------------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 N +CN K++GI I I F + L NCA Sbjct: 588 LDARVAPGNTSDTDLTAIHNTRLQMLCNAIKAKGITIWVIGFRNQSEGN--IQTPLQNCA 645 Query: 424 SPNSFFE-ANSTHELNKIFRDRIGNEIFERVIR 455 + ++ + A L++ F+D N RV + Sbjct: 646 TSSNHWTMAYDATSLSQKFKDIAKNIGGLRVSQ 678 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 27/277 (9%), Positives = 66/277 (23%), Gaps = 15/277 (5%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + +L ++ G+ + A + + + G +D+ R ++QA A + + Sbjct: 11 FMSRLARNQKGNVMAMVAAAIIPLAALIGGGLDMGRAYMARARMQQACDAAALAGRRAMT 70 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRN-----FENNLKKNFTDREVRDIVRDTAVEMNP 124 S +++ + F + + + T + T V Sbjct: 71 TSSMTQANKDEAKKFFDFNFPQGTFQAATFTPVIRSKPGETTTVQVTASTTMPTTVMKIF 130 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW--LIQTKAEAETVSRSYHKEHGVSIQWV 182 R + ++ + + L S I T + K+ Sbjct: 131 RYETLPLSVTCEARFDIGNTDVMLVLDTTGSMAYAISDGKGGSTTRLAALKQAVKDFYDT 190 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + + P TV + + + L Sbjct: 191 L--------GAGSNATGRIRYGFMPYSSTVNVGYQLPTNYLVGGISGETWDYQTRRLLTT 242 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 S S+ + +A Sbjct: 243 YGTATNETGSWIYTSGSVSAPTTYSSTSGGSASQCVA 279 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 73.1 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ I Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 GLITD--GTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPMT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 48/372 (12%), Positives = 101/372 (27%), Gaps = 30/372 (8%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G+ + TA+LM VMLG+ VDV + L+++ A + + L++ + + Sbjct: 22 RRGNIIVFTAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQA 81 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 +A + E K E D A E N +++ + Sbjct: 82 KATEYLVRNPVGSSMTFVNEEEVPAKIAQFVAEHGDDFEVEAGEWNASTRSFETTNTLPS 141 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 L ++ + I K + S I V+DFS SM D Sbjct: 142 TLSVSMEYPTM---PTFFGKILGKDSFSIRASSVAMYQPRDIMVVLDFSGSMNDDSTFE- 197 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK-SLYYMLYPGPLDPSLSEE 256 A + ++ + +P S + + Sbjct: 198 ----------AFGKLGRSWVESNLQQCWADIGNPTYGSLTFEPKWANCKGAVPTDGSKPQ 247 Query: 257 HFVDSSSLRHVIKKKHLVRDALASV--------------IRSIKKIDNVNDTVRMGATFF 302 +V+ + + + + + + + + + Sbjct: 248 IYVEYRNTSVYVTSTLNLENVVLQFSGGTRQTFSGLSAKTGTFQGSSTNSGKQITK-VWV 306 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + G +++ ++D I E + K Sbjct: 307 KSGNNLSGEGTNYGEPFDFSSSNMNNMVKKAFGLNSVSYPYNGSWDAYIDYCEQSSNSNK 366 Query: 363 NNLEAKKYIVLL 374 N KY L Sbjct: 367 NAGYRYKYGYLN 378 >gi|294102191|ref|YP_003554049.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] gi|293617171|gb|ADE57325.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] Length = 329 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 43/154 (27%), Gaps = 1/154 (0%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +K +K G + A M V+LG G + +D R L+ A + + L Sbjct: 6 RKELKRSRGAVLVWVAASMVVLLGAGALSLDYGRLVVARWRLQTAVDAGSLAGAWELGNK 65 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-PRKSAYQ 130 + R ++ + + V + Sbjct: 66 SASQALREASAAQVAGSVASDNKSEGAYAVDFPDADTCHVEGQETIAMTFARILGVNEST 125 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 V+ + L + LR GI+S + Sbjct: 126 VLAEAAARLSGAGSAKGLRPFGIQSQDFVFGDQY 159 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 54/194 (27%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D AS I N +G F+ + + L+ + Sbjct: 109 NRLEAAKDVAASFI-------NGRPNDNIGLVVFSAESFTQCPLTTDHTVLL--NLFKDI 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + A I S K I+LLTDG N + Sbjct: 160 QSGMIQDGTAIGLGLANAVSRIKDS-----------HAKSKVIILLTDGSNNAGEIAPVT 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AK+ G+R+ TI E L AS +F Sbjct: 209 AAEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFR 268 Query: 431 ANSTHELNKIFRDR 444 A L +I+ + Sbjct: 269 ATDNASLKEIYSEI 282 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 25/223 (11%), Positives = 66/223 (29%), Gaps = 21/223 (9%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ Y P + D +A + Sbjct: 63 SMAYASDETPDPYVNPASAPPGW-TYGDPVPPNSRWLDLVASVNAFNGFLVD------SP 115 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 ++ ++ D + ++ + + + G T++ ++ + Sbjct: 116 QYEKLCLATYSSTASRDCDLTHTYAEISNELDA--ISYQFDGGGTSVGYGLEHGLAVLTD 173 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + + +VL+TDG + ++ ++ G+ + TI FS + Q Sbjct: 174 ATHARKFAV-------RVMVLMTDGHHNTGKS-PESMMYHLQNHGVTLFTITFSDDAD-Q 224 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + C N F A +L F+ I ++ + + Sbjct: 225 SRMSNLANACGGEN--FHATDASQLQNAFQK-IAKKLPSLMTQ 264 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 60/164 (36%), Gaps = 7/164 (4%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R+G F D F+ + + +T ST + DA+ Sbjct: 136 LADFAKTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVA--MAGQSTHLGDAIG 193 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A S + + N++ +K +++LTDG +T E I AK++G+RI IA Sbjct: 194 LAIKVFEQSAAVQDSSVDANVK-EKVVIVLTDGNDTGSFVEPIDAAKVAKAKGVRIHVIA 252 Query: 405 FSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + A S FEA + EL K + Sbjct: 253 MGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQI 296 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 68/197 (34%), Gaps = 28/197 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A S + + V++G F + + + +I Sbjct: 106 SRLDAAKAAARSFVERMPAG------VKVGLVSFAAGAVLESGLTADHQGVIER-----I 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TAI + + + + +V ++LL+DG N Sbjct: 155 DLLERRANTAIGEGLLESLKAFPTGANHQVAVPAT-------VILLSDGRNRIG-IAPQE 206 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK-----ARYFLSNCA--SPNSFFEANSTHELNKIF 441 +AK +G+R+ TI + L A + +F A+S L +I+ Sbjct: 207 AAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEAELRGIAEVTGGRYFAADSADRLQEIY 266 Query: 442 RDRIGNEIFERVIRITK 458 R+ G++I ++ R T+ Sbjct: 267 REL-GSQIGWKLER-TQ 281 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 46/436 (10%), Positives = 112/436 (25%), Gaps = 38/436 (8%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT---ASVPLIQSLEEVS 76 G +T + + + G + + + Q A+I + Sbjct: 14 GSILTLTIISL-----LAGC-----QGQQQNTSAQSDEQAAVIEQKNTQREAQTQTNTSA 63 Query: 77 SRAKNSFTFPKQKIE-EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S + + +I + L VR + P ++ Sbjct: 64 EVQTKSKSSQENEIRFNHPRIENVAGLNHESLAPAQMQQVRHRIGAVYPPMPMPPILPPK 123 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSMLDYQR 194 + + ++ + S + S + Sbjct: 124 PMPPQFENEQARENYLKNEQNPVKQVMLEPVSTFSIDVDTGSYSNSRRMIKMGKRPPADA 183 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 E +N F + N + + + + + Sbjct: 184 VREEAFINYFDYHYSAPKSLETPFNVHTEVAPAPWNNQRQLLKIGIKGFDIEKAELKAAN 243 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +D S + K L++ +L + + + + D+V + + Sbjct: 244 LVFLLDVSGSMNAPDKLPLLKSSLTMLTKQLDENDSVA------IVVYAGAAGLVLPATK 297 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G ++ + + GST ++ AY + + E ++L Sbjct: 298 GNE---YQVISNALNNLSAGGSTNGAQGIELAYQIASQNFKKEGINR---------VILA 345 Query: 375 TDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 TDG+ A+ ++ GI + T+ F + L+N Sbjct: 346 TDGDFNVGMSSVDALKKLIANKRKTGIALTTLGFGQG-NYNDGLMEQLANI-GNGQHAYI 403 Query: 432 NSTHELNKIFRDRIGN 447 ++ +E K+ D + + Sbjct: 404 DTINEARKVLVDELSS 419 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 55/195 (28%), Gaps = 39/195 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I +G T F + + L+ + Sbjct: 108 NRLEAAKDVAAKFIND-------RPNDNIGITLFAGESFTQCPLTIDHTALL--NLFGNI 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + A + S K I+LLTDG N + + Sbjct: 159 QTGVIEDGTAIGMGVSNAVARLKDSQAKS-----------KVIILLTDGSNNAGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-----------------EKARYFLSNCA--SPNSFF 429 AK+ GIR+ T+ T + L + A + +F Sbjct: 208 SAEIAKTYGIRVYTVGIGTRGTAPYPIQTMTGAIQRIQVEVDIDEPTLKDIARTTGGVYF 267 Query: 430 EANSTHELNKIFRDR 444 A L I+++ Sbjct: 268 RATDNTSLQDIYQEI 282 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 61/182 (33%), Gaps = 16/182 (8%) Query: 274 VRDALASVIRSIKKID-------NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 A + I + R+G F D F+ + + +T Sbjct: 123 FTSASGANISRLDATKEVLAEFAKTRQGDRLGLILFGDAAFVQTPFTADQKVWLALLNQT 182 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 ST + DA+ A S ++ K +K ++LTDG +T E Sbjct: 183 DVA--MAGQSTHLGDAIGLAIKVFEQSESNQAASSKPR---QKVAIVLTDGNDTGSFVEP 237 Query: 387 IAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 I AK++G+RI IA + + A S F+A + EL + + Sbjct: 238 IDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELARAYD 297 Query: 443 DR 444 D Sbjct: 298 DI 299 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 73.1 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ I Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLLDMIHNIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 GLIED--GTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPMT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|89899605|ref|YP_522076.1| hypothetical protein Rfer_0795 [Rhodoferax ferrireducens T118] gi|89344342|gb|ABD68545.1| conserved hypothetical protein [Rhodoferax ferrireducens T118] Length = 424 Score = 73.1 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 22/262 (8%), Positives = 62/262 (23%), Gaps = 3/262 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G I+ L+M V++G G+ +D + L+ +A + AS L + Sbjct: 15 RARQRGAVAIVVGLMMAVLVGFIGLALDGGHLYLTKTELQNSADACALAASYELTGAPSI 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + + + + + + + S Sbjct: 75 APASFARAEAAGQAVGQMNKVDFQNSAIASSDIVVSFGTDLSAGNAAIKWVNAGAALPSS 134 Query: 135 SRYDLLL---NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + N + F++ + + + A G+ + S Sbjct: 135 KYVRCTITRSNIMPWFMQVLMPSLDTLTVSSLATATLAPAQNNCGIPMAICSKGSAPSYG 194 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 G + G + P Sbjct: 195 MTPGQWVSGFFGAGGGVTGSFNWIDFTPPAGGTSELAALLTGNGVCTLNVPTPVGEPGAL 254 Query: 252 SLSEEHFVDSSSLRHVIKKKHL 273 + ++ + ++ Sbjct: 255 GAAAAKAWNTRFGLYQTGSTNV 276 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 73.1 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 35/192 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V+ L+ I + R+G F D + + + + Sbjct: 38 VDRLTAVKQVLSEFIAQ-------REGDRIGLILFADHAYLQTPLTLDRQTVTEQLNQAV 90 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA+ + + A T I S ++ ++LL+DG NT + + Sbjct: 91 LKLI--GTQTAMGEGIGLATKTFIDSAA-----------PQRVMILLSDGSNTAGVLDPL 137 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCAS--PNSFFEAN 432 N AK I T+ T Q+ L AS +F A Sbjct: 138 EAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRAR 197 Query: 433 STHELNKIFRDR 444 + +L I+ Sbjct: 198 NQQDLQSIYDTI 209 >gi|301133566|gb|ADK63405.1| C3HC4 type zinc finger protein [Brassica rapa] Length = 677 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 56/196 (28%), Gaps = 26/196 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 260 TKLALLKRAMGFVIQNLGPFD------RLSVISFSSTSRRSFPLRLMTETGKQEALQA-V 312 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + + +I IVLL+DG++T Sbjct: 313 NSFVSNGGTNIAEGLTKGAKVLIDRRFKNSVSS---------IVLLSDGQDTYTMTSPTG 363 Query: 389 ICNKAKSQGIR------IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + K + + I + + + A S +F S + Sbjct: 364 --SNTKGADYKTLLPKEVNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVIQDA 421 Query: 441 FRDRIGNEIFERVIRI 456 F IG + V + Sbjct: 422 FAQCIGGLLSVMVQEL 437 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 58/204 (28%), Gaps = 20/204 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V + + D A N + ++ + K Sbjct: 290 FSSIVKVWNVDKVQVDYEVGEDPWPLYDSPEAPQKN-KTNQVLPPAYKATDENKEKAKKV 348 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 N G T I A++ + + E++ + IV LTDGE T Sbjct: 349 VEKLNAYGGTDIKSALEVGLKLVKKNKENKE------DAHQPIIVFLTDGEPTMGETNTE 402 Query: 388 AICNKAKSQG-----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I + I +++F ++ + L N +EA + F Sbjct: 403 KITSAISEMNSGETRAPIFSLSFGDGADREFLQKISLKNLGFARHIYEAADASLQLQEFY 462 Query: 443 DRIGNEI--------FERVIRITK 458 +I + + V ++TK Sbjct: 463 KQISSPLLNNVNFKYVSNVTKLTK 486 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 64/217 (29%), Gaps = 44/217 (20%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + ++ +A + R+G F+ + + L + Sbjct: 106 DPKDRITVAKEVIAEFV-------KGRKDDRIGLVVFSGHAFTQVPLTLDYDFL--QNLL 156 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 TAI A+ + + + +S K ++LLTDG N + + E Sbjct: 157 GQVQTVRRPDGTAIGLALAHSVNGLRNS-----------TTKSKVVILLTDGSNNRGDIE 205 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQ------------------QEKARYFLSNCA--SP 425 A++ +R+ TI + L + A + Sbjct: 206 PAQAAEIARALDVRVYTILVGKPGNGEYPVHDPWRDETYLIPAPTAEDEVALRDIAEQTG 265 Query: 426 NSFFEANSTHELNKIFRDRI----GNEIFERVIRITK 458 FF A L ++ E+++R T+ Sbjct: 266 GIFFRAGDEQGLRDVYDTIDKMERSQVASEKLVRYTE 302 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 39/188 (20%), Positives = 64/188 (34%), Gaps = 16/188 (8%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +S+ I + ++ LA R+G F D F+ + Sbjct: 124 TSASGAKISRLDATKEVLADF-------AKTRQGDRLGLILFGDAAFVQTPFTADQKVWL 176 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +T ST + DA+ A S + K +K ++LTDG +T Sbjct: 177 ALLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSEPSQAAFSKPR---QKVAIVLTDGNDT 231 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHE 436 E I AK++G+RI IA + + A S F+A + E Sbjct: 232 GSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDE 291 Query: 437 LNKIFRDR 444 L + D Sbjct: 292 LASAYDDI 299 >gi|294670381|ref|ZP_06735263.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307845|gb|EFE49088.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 553 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 36/290 (12%), Positives = 81/290 (27%), Gaps = 25/290 (8%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T S +++ ++ + + +N F Sbjct: 105 STFSIDVDTGSYANVRRFLNN-GRLPPKDTVRIEEIVNYFPYNYPLPT-GGHPFAIHTQT 162 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D + P + VD S + +K LV+ L + Sbjct: 163 IDSPWQHEAKLIKIGIQAQDLAKKELPPANLVFLVDISGSMNSPEKLPLVKKTLRILTEQ 222 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ D V + + G + + GSTA A++ Sbjct: 223 LRPQDKVT------LITYASGEELVLPPTSGR---NKDEILRAINKLQAGGSTAGESALK 273 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG---IAICNKAKSQGIRIM 401 AY+ + + N I+L TDG+ ++ + + GI + Sbjct: 274 MAYE----QAQKAYVKNGINR-----ILLATDGDFNVGVSSTDALKSMVAEKRKSGISLT 324 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 T+ F ++ A ++ ++ E K+ + ++ + + Sbjct: 325 TLGFGTGNYNEDMMEQIAD--AGDGNYSYIDNEKEAKKVLQHQLTSTLAT 372 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 54/194 (27%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D AS I N +G F+ + + L+ + Sbjct: 109 NRLEAAKDVAASFI-------NGRPNDNIGLVVFSAESFTQCPLTTDHTVLL--NLFKDI 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + A I S K I+LLTDG N + Sbjct: 160 QSGMIQDGTAIGLGLANAVSRIKDS-----------HAKSKVIILLTDGSNNAGEIAPVT 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AK+ G+R+ TI E L AS +F Sbjct: 209 AAEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFR 268 Query: 431 ANSTHELNKIFRDR 444 A L +I+ + Sbjct: 269 ATDNASLKEIYSEI 282 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 29/71 (40%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 S G ++ + M ++G+ G+ +D+ + L+ A A ++ + L + Sbjct: 9 HSQRGVTMVLFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGAKVLNDMHDVG 68 Query: 76 SSRAKNSFTFP 86 + A TF Sbjct: 69 QATAAALTTFN 79 >gi|218531748|ref|YP_002422564.1| hypothetical protein Mchl_3818 [Methylobacterium chloromethanicum CM4] gi|218524051|gb|ACK84636.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 473 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 65/236 (27%), Gaps = 5/236 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +L + G +I AL ++G+ G +D R++ L+ A ++ Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGG 74 Query: 66 VPL---IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 L + S E + + + + + + + V Sbjct: 75 NALKLVVSSSESIVGLTTQTIQAEAKAGADAPVSIQVTVASDKTSVEARAEQVIKLTFGA 134 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS-YHKEHGVSIQW 181 ++ + +R ++ L + + A+ + + VS Sbjct: 135 FVGMASIPISARARASVVGRMRLCMLALDPAAAGAFNLERNAQVTAYDCALYSNSVSRSG 194 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 ++ ++ Q S + +R+ P N Sbjct: 195 MVGRDGALARAQTICSAGGFKDDRAN-FTPSPQTSCPVIEDPLRNRPAPPVGNCVN 249 >gi|254562692|ref|YP_003069787.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] gi|254269970|emb|CAX25948.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] Length = 473 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 65/236 (27%), Gaps = 5/236 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +L + G +I AL ++G+ G +D R++ L+ A ++ Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGG 74 Query: 66 VPL---IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 L + S E + + + + + + + V Sbjct: 75 NALKLVVSSSESIVGLTTQTIQAEAKAGADAPVSIQVTVASDKTSVEARAEQVIKLTFGA 134 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS-YHKEHGVSIQW 181 ++ + +R ++ L + + A+ + + VS Sbjct: 135 FVGMASIPISARARASVVGRMRLCMLALDPAAAGAFNLERNAQVTAYDCALYSNSVSRSG 194 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 ++ ++ Q S + +R+ P N Sbjct: 195 MVGRDGALARAQTICSAGGFKDDRAN-FTPSPQTSCPVIEDPLRNRPAPPVGNCVN 249 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 58/176 (32%), Gaps = 28/176 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G FF D ++ + + + ++ TAI DA+ A S Sbjct: 131 DRLGLIFFADTAYLQAPLTYDRETVSQLLNESLI--GLVGEQTAIGDAIGLAIKRFKSKE 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E K ++LLTDG+NT N A + G+ + TI + Sbjct: 189 ESN-----------KVLILLTDGQNTAGNITPEQANELAINNGVTLYTIGVGADQMLVQS 237 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 QE L+ A + +F A L +I+ E R R Sbjct: 238 IFGSRQVNPSQELDEGMLTTLAESTGGRYFRARDAQSLTEIYSKLDELEPIARESR 293 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 63/200 (31%), Gaps = 31/200 (15%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 H + ++ A++I R+G F +R + S+ V + RT+ Sbjct: 117 HAASRIDALKRVGAALI-------KRRTGDRIGLVIFAERAYAAAPLSFDVDAVSRTL-- 167 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 STAI + + A + S + IVLL+DG N + Sbjct: 168 AEIPLGLVGHSTAIGEGLGLALKRLTES-----------KAPSRVIVLLSDGANDAGTTD 216 Query: 386 GIAICNKAKSQGIRIMTIAFSV------NKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + A + G++I TI V N L A F +T +L Sbjct: 217 PTGVAELANNLGVKIYTIGLGVVDTQTFNGLGDPVDFLALQRLAEIGGGEAFRVRTTEDL 276 Query: 438 ---NKIFRDRIGNEIFERVI 454 + + E V Sbjct: 277 AYASAAIERLVAGEAAASVS 296 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 54/189 (28%), Gaps = 26/189 (13%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + + + + + F + + R+ Sbjct: 104 DPNRLEAAKKSAKNFVNQL------PSKFNVALVNFAGTASIIVPPTTDRATVQRS---- 153 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 STA + + T+ + D H IVLL+DG+ T Sbjct: 154 -IDGLELAESTATGEGIFTSLQALTQVPPDPEHPNDPAPAR---IVLLSDGKRTVGRTAQ 209 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEANSTH 435 AK + I TI F + E R L + A + + A S Sbjct: 210 EG-AQAAKEKNTPIYTITFGTDSGFIEMDGIRQRVPPDRAELRSVAEITGGEAYTAESAG 268 Query: 436 ELNKIFRDR 444 EL +++D Sbjct: 269 ELEDVYKDI 277 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 54/169 (31%), Gaps = 28/169 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ + + ++ TAI DA+ A Sbjct: 132 DRLGLILFADTAYLQAPLTYDRDTVSTLLSESVI--GLVGEQTAIGDAIGLAVKRFDEKE 189 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E ++LLTDG+NT A ++ +++ TI + Sbjct: 190 ESNN-----------VLILLTDGQNTAGFITPEQAKELAVNKKVKVYTIGVGADKMLIQS 238 Query: 409 -------KTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNE 448 QE LS+ AS +F A +EL I+ E Sbjct: 239 FFGSRQVNPSQELDEDMLSDLASSTGGQYFRARDVNELEAIYAKLDALE 287 >gi|253687077|ref|YP_003016267.1| hypothetical protein PC1_0676 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753655|gb|ACT11731.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 543 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 54/475 (11%), Positives = 122/475 (25%), Gaps = 59/475 (12%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + I+ G L +L +VD + +K+A A + Sbjct: 22 REHLSAFIQQEKGAGTGFYVLGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQ 81 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + EE S N + + ++ L N + K +D V + + Sbjct: 82 ATINGEEYSQEETNKLAY--EYVKNNLGMNKALSEKLVASDVSVAEGRNSATRKTYTVTV 139 Query: 128 AYQV------------VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 A++ + S +++ P + L ++VSRS+ + Sbjct: 140 AFETKPSLMSLGARKQEVYSTSEVINRPTEIALVMPVTGDMSDADIRSLKSVSRSFVERM 199 Query: 176 GVSIQWVIDFSR-SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 S D S++ Y + + + S + + + + Sbjct: 200 LSSADGKRDNLWLSLVPYSQSVNVYDAEDANRIRRWSTPSALNPPELRSLFASGVVSSLA 259 Query: 235 SCNKSLYYMLYPGPLDP-SLSEEHFVDSSS-------LRHVIKKKHLVRDALASVIRSIK 286 E F D RH + + + Sbjct: 260 DRRFPDRRANLLCMYRGLGREENFFWDEPPVGQFRVYYRHDLPQNGSPGAPPITWRGPNP 319 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + ++ T F P+ + + + + +T AM A Sbjct: 320 DLYPWDNNSDAVDTRFIVADRGCPNAALMPLTNEESKLNQRIAEFTPRFNTNYAIAMSWA 379 Query: 347 YDTIISSNEDEVH----------RMKNNLEAKKYIVLL---------------------- 374 + + + N + +K IV++ Sbjct: 380 GAALSPNMRGSDGWGDTTLPLDFNLDGNGDGQKVIVMMANTIGNWFDTDSYNFNRNEFRG 439 Query: 375 ---TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 TD + + +C+ +++ I+ + ++ R A+P Sbjct: 440 STGTDPARSFAAQRFQDLCSSFRARNIKFYFVGI-RPGDPEDFGRNLFDREATPG 493 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 65/214 (30%), Gaps = 28/214 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 V+ + + + +D+S K + Sbjct: 56 SVALASPVKSKIITNTHKKGYNIVIDLDTSGSMAEFNKIDAAKAVSLDF-------AKKR 108 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 +G F + ++ + + + G TAI DA+ + + Sbjct: 109 KNDALGLVVFGNIAYIASPLTFDKKTFEDILKRIYVS--IAGGKTAIYDALFLSSNLF-- 164 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N +K I+LLTDG + + K K + I++ +IA + Sbjct: 165 ----------KNANGEKIIILLTDGMDNMSITPLDVVIKKLKKEHIKVYSIAIGGDAD-- 212 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L + + F+ A+S +L KI+ D Sbjct: 213 ---LSVLKKISKETNGKFYIASSLEDLKKIYSDI 243 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 62/197 (31%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVK 325 + ++ A I +G T F + + L+ R + Sbjct: 108 NRLEAAKNVAAEFIS-------GRPNDNIGLTIFAGEAFTQCPMTTDHTSLLNLLRNVRT 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TA+ + A + S K ++LLTDG N + Sbjct: 161 DIAARGLISDGTAVGMGLANAVSRLKDS-----------KTKSKVVILLTDGSNNMGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNS 427 + AKS IR+ TI NK E LS+ A + + Sbjct: 210 PMTSAQIAKSLDIRVYTIGVGTNKVAPYPMSVGGGTQYINIPVEIDSKTLSDIAAVTEGN 269 Query: 428 FFEANSTHELNKIFRDR 444 F+ A + +L +I++D Sbjct: 270 FYRATNNQQLKQIYKDI 286 >gi|163852924|ref|YP_001640967.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] gi|163664529|gb|ABY31896.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] Length = 473 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 50/168 (29%), Gaps = 3/168 (1%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ +L + G +I AL ++G+ G +D R++ L+ A ++ Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGG 74 Query: 66 VPL---IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 L + S E + + + + + + + V Sbjct: 75 NALKLVVSSSESIVGLTTQTIQAEAKAGADAPVSIQVTVASDKTSVEARAEQVIKLTFGA 134 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 ++ + +R ++ L + + A+ + Sbjct: 135 FVGMASIPISARARASVVGRMRLCMLALDPAAAGAFNLERNAQVTAYD 182 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 57/164 (34%), Gaps = 9/164 (5%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R+G F D F+ + + +T ST + DA+ Sbjct: 141 LSDFAKTRKGDRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVA--MAGQSTHLGDAIG 198 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A S + ++ +K ++LTDG +T E I AK++ +RI IA Sbjct: 199 LAIKVFEQSEKSRTDVEES---KEKVAIVLTDGNDTGSFVEPIDAAKVAKAKDVRIHVIA 255 Query: 405 FSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + A S FEA + EL K + + Sbjct: 256 MGDPQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEI 299 >gi|119961201|ref|YP_948618.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] gi|119948060|gb|ABM06971.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] Length = 354 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 52/154 (33%), Gaps = 4/154 (2%) Query: 12 KKLIKS--CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI--ITASVP 67 ++L G +TA+LM +LG+ VDV L+ A ++ I + Sbjct: 2 RRLKHDDRERGAVAPMTAMLMVALLGMTAFAVDVAMMYSEHAQLQNGADSSAIGIAQACA 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ + + + + + ++ +V R+T+ + + Sbjct: 62 QNAASADCAAPTSAATSLAGLNALDGVSNAPQASVNLGTGTVDVTTQSRNTSGDNHFTLV 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 + + ++ + + F + + Sbjct: 122 FARALGVETANIQASAQAKFGGFSATDAIPLTFS 155 >gi|255066322|ref|ZP_05318177.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] gi|255049532|gb|EET44996.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] Length = 538 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 39/332 (11%), Positives = 91/332 (27%), Gaps = 25/332 (7%) Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 +P +A + +L L + + + + T S +++ Sbjct: 44 SPPNAALSTAAVAEENLPLAENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVRRF 103 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + +N F D P + Sbjct: 104 LTN-GEQPPKDAVRIEEIVNYFPYNYPLPTD-NRPFAVHTETIDSPWQPEAKLIKIGIQA 161 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P + VD S K LV+ L + + ++ D V + Sbjct: 162 QDTAKKDLPPANLVFLVDVSGSMDEENKLPLVQKTLRILTQQLRPQDKVT------LITY 215 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + G + + + G+T A+Q AY+ + Sbjct: 216 ASGEDLVLPPTSG---ADKETILSAIDKLRAGGATDGESALQMAYE----QAQKAFVPNG 268 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 N I+L TDG+ + + + + G+ + T+ F + ++ Sbjct: 269 INR-----ILLATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGNYNEDMMEQIA 323 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ ++ E K+ + ++ + + Sbjct: 324 D--AGDGNYSYIDNEKEAKKVLQQQLTSTLAT 353 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 63/194 (32%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNVGITLFAGESFTQCPLTVDHAVLL-NLIKDVK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TA+ + A + S K I+LLTDG N + + + Sbjct: 160 CGLIEDG-TAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNRGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|213961827|ref|ZP_03390093.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] gi|213955616|gb|EEB66932.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] Length = 607 Score = 72.7 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 48/368 (13%), Positives = 111/368 (30%), Gaps = 24/368 (6%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV-LSSRYDLLLNPL 144 ++ I L + + T E T+ + R + ++ LN Sbjct: 82 GQKIINVVLEEDNVALQEVVVTSYETAKRRDVTSSSIRIRGAGSVSRNYRTKNSYTLNSS 141 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 +++ + TK T S + +++ ++++ + E Sbjct: 142 LETYKAIDEGGFKKTTKDPVTTFSADVDRASYSNVRRMLNYGQKPHKDAVRIEELINYFD 201 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 A + + +P L P + +D+S Sbjct: 202 YDYAPPAEGGKTPLKATTTLSSCPWNPDNYLLRIGLQAKKIDFTKAPPSNIVFLIDTSGS 261 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 K L++ + ++ +++ D R+ + + + + Sbjct: 262 MDEPNKMPLLKASFKLLLDNLRPED------RIAIVVYASQTGIALP---STPAKEKEKI 312 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-- 382 D GSTA +QTAY+ E NN I+L TDG+ Sbjct: 313 SKVIDDLVASGSTAGGAGLQTAYEV----AEKNFLPKGNNR-----IILATDGDFNVGIS 363 Query: 383 -NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +E + + ++ GI I + + + + + A + ++ ++ E K+ Sbjct: 364 SRDELQRLVEEKRNNGIYISVLGYGMGNYRDDMAETIANK--GNGNYAYIDNFTEAKKVL 421 Query: 442 RDRIGNEI 449 + G + Sbjct: 422 VNEFGGTL 429 >gi|118590977|ref|ZP_01548377.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] gi|118436499|gb|EAV43140.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] Length = 608 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 107/410 (26%), Gaps = 29/410 (7%) Query: 23 FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNS 82 ++ + + + + G+ + W L + A S VS+ Sbjct: 9 AVLISSASLMTIAIAGIASQTIDWQDLGKKLTDTTEAAGRMTSSGKPDGDASVSTAELPK 68 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 + E + + D A S+ + L Sbjct: 69 QEETHTVVAEIARPVATPQPAPAPALPQKQRSRSDGAGGGLMTFSSGAGGAVLNSGIQLE 128 Query: 143 PLSLFL-------RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 P ++ R ++ ++ + + S + S L Sbjct: 129 PPAMPAVQLEDRERFASAEANPLRRTSADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDA 188 Query: 196 SE-GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + +N F K + V + D + + + P PS + Sbjct: 189 VRVEEMVNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQN 248 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +D+S K L++ + ++ S++ D V S Sbjct: 249 LVFLIDTSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIV---------TYAGSSGVLLE 299 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 +T + GSTA ++ ++ AY + D I+L Sbjct: 300 PTKVADKTRILEKINALTSGGSTAGHEGLKGAYALAETMTGDGEQTR---------IILA 350 Query: 375 TDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 TDG+ + ++ + G + + F E + N Sbjct: 351 TDGDFNVGLSDPDSLKRYVAEQRENGTALSVLGFGRGNYNDELMQTLAQN 400 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 59/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLFQGIQC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 DIIED--GTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPVRVGGTTQYINTPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 66/222 (29%), Gaps = 14/222 (6%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 ++ ++ G II L + V++G G+ +D+ + + L+ + + Sbjct: 8 RSRLTRRAR-----QRGAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALA 62 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ + + + S A T L + + +N + N Sbjct: 63 AARDVTGATPLLVSEAAGLATGTSNAA---LFQGKAVEMFENLNVSYSDTPDSTFYTKNN 119 Query: 124 PRKSAYQVV-LSSRYD-----LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 S +V + + + L + IK+ + A A T S V Sbjct: 120 VPFSLDKVKYVKCTAERKGIAHWFIHMLNLLPGIDIKASTVNAMAVATTTSAQAACAIPV 179 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + + + Y R + P ++ + Sbjct: 180 YVCSPPTANPAKTAYNRGDWIKSRVDPSDPYGPGSFGWADLS 221 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 51/196 (26%), Gaps = 32/196 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 198 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSLLA-V 250 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I I+LL+DG++T Sbjct: 251 NSLTANGGTNIAEGLRKGSKVIEERQS---------KNPVCSIILLSDGQDTYTVSPTAG 301 Query: 389 ICNKAKS-------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + A Q + + F + S +F + Sbjct: 302 VHKGATEYCALLPSTTTNGSQQVPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEA 358 Query: 436 ELNKIFRDRIGNEIFE 451 + F IG + Sbjct: 359 TIQDAFAQCIGGLLSV 374 >gi|74011920|ref|XP_548489.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Canis familiaris] Length = 897 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 76/250 (30%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + + +++ G + +D S H Sbjct: 243 PTCTDSLLNGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPKNVVFVIDVSGSMH 302 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K +DAL ++ +K D + + + + + + + Sbjct: 303 G-RKMEQTKDALLKILGDMKGEDYL-NFILFSGDVITWKDDLVQATPENIEEAR-----I 355 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ G T IND + + + E+ ++ I++LTDG+ Sbjct: 356 FVKNIHDRGLTNINDGLLRGISMLNRAREEHRVPERSTS----IIIMLTDGDANVGESRP 411 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 412 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYE 471 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 472 EVANPLLTGV 481 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 23/267 (8%), Positives = 65/267 (24%), Gaps = 6/267 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ + + +I A+ + +LG G VD S ++ A +A++ + Sbjct: 4 RFLRDRSANVAVIFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAGVLE----- 58 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +++ + + + + T + Sbjct: 59 PTDAAKIARASAAFTANFQPSWSAATASFTVNGGELGGSANSTVPTQFLGVLGIKTMTIA 118 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV-IDFSRSMLD 191 S L + + + A+ + + + S + LD Sbjct: 119 AQSAAVATAKRKVCILLTASPDAQAFLVNSGAKLTGPTCEIHVASTAGVAGMINSGTTLD 178 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + G P ++ +S S Y Sbjct: 179 VGKVCIKGSATQNGGPYPVVSTGCAAIANPFVGALPPVSAPGCSFTNQTYSAGNVTLSPG 238 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDAL 278 + + S + ++++ Sbjct: 239 TYCGSTNFNGSGTLTLRPGLYVIKSGA 265 >gi|170748502|ref|YP_001754762.1| hypothetical protein Mrad2831_2084 [Methylobacterium radiotolerans JCM 2831] gi|170655024|gb|ACB24079.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 463 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 37/124 (29%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + + +G + L V+LG+ G +D R + L+ AA ++ A L Sbjct: 3 RAGSRAVTDRSGSVALTFGLSAVVLLGLTGGGIDYARLAARRSQLQNAADAGVLAAGNYL 62 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 ++ ++ + + ++ V V +T Sbjct: 63 KLAVATSAAAKSIVVDTVHAQAAPRPESPYALRVEVADDKTSVATTVDETVKLAFGGFVG 122 Query: 129 YQVV 132 V Sbjct: 123 VPAV 126 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 62/188 (32%), Gaps = 27/188 (14%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + + +V+ + I MG F ++ H L Sbjct: 101 FDPVNPAASRFDVVKSIVKDFISQ-------RTNDNMGLVVFGSYSFIASPLTYDKHILS 153 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R + + TA+ +A+ + + K + K +LLTDG +T Sbjct: 154 RIVSQLEV--GMAGKYTALYEALAQGVNLL-----------KMSKAKSKVAILLTDGYST 200 Query: 381 QDNEE--GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 ++ + + AK +G+++ I R L A + F A++ + Sbjct: 201 AGADKIPLDVVLDMAKKEGVKVYPIGIGGPDEYN---RAVLLKIAKETGGVAFGASNASQ 257 Query: 437 LNKIFRDR 444 L ++++ Sbjct: 258 LKEVYKKI 265 >gi|153842534|ref|ZP_01993517.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] gi|149745366|gb|EDM56617.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] Length = 223 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 56/184 (30%), Gaps = 31/184 (16%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ V+ R+G F D + +I+ I +T Sbjct: 17 SAVKKVLSDFVAKRKG---DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVI--GLVGQ 71 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TAI D + T + S ++ ++LL+DG NT + I AK Sbjct: 72 RTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDPIEAAEIAKK 120 Query: 396 QGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I T+ T + L+ A + +F A +L KI Sbjct: 121 YNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKI 180 Query: 441 FRDR 444 + Sbjct: 181 YDTI 184 >gi|332882611|ref|ZP_08450223.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332679411|gb|EGJ52396.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 547 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 37/343 (10%), Positives = 87/343 (25%), Gaps = 28/343 (8%) Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE----TVSRS 170 +S+ L P + K + T + Sbjct: 44 EDVEITSNEAALMEPSANATSKMILSSPPPPQQRNAETYKEIKENSFVAVAQQPVTTFSA 103 Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV-GIRDEKL 229 + I + + +N F + +S + Sbjct: 104 DVDRAAYANVRRIIGYGQIPPKDAVRIEEMVNYFDYDYPAPEEGSASPLRVSPELAPAPW 163 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 +P + L P + +D S K L++ + ++ ++ D Sbjct: 164 NPNHLLLRIGLQAKKIDLAKAPPSNIVFLIDVSGSMDEENKLPLLQSSFKMLLGQLRPDD 223 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V + + + + + G T+ +Q AY+ Sbjct: 224 KVA------IVTYANGTKVALP---STSVKDKEKIIKVLDNLYASGGTSGGKGIQLAYE- 273 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFS 406 + + NN I+L TDG+ + K + GI + + F Sbjct: 274 ---QAQKSFIKNGNNR-----IILATDGDFNIGINNTTDLEKFIEKQRESGIYMSVLGFG 325 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + + A ++ ++ E K+ + + + Sbjct: 326 MGNYRDDMAETIADK--GNGNYAYIDNITEAKKVLVNELSGTL 366 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 59/196 (30%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++D + I+ R+G + + + ++ + Sbjct: 112 NRLEALKDVASEFIQ-------GRPNDRVGIVLYAGESYTKTPITSDKSIVLGALNDVKF 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + E G TAI + T+ + + S K I+LLTDG N + Sbjct: 165 SEVLENG-TAIGMGLATSVNRLKDSKALS-----------KVIILLTDGVNNSGTIDPKL 212 Query: 389 ICNKAKSQGIRIMTIA------------------FSVNKTQQEKARYFLSNCA--SPNSF 428 A GI+ TI F + E L A + + Sbjct: 213 ASELAVEYGIKTYTIGIGSNGMALSPIGIKSNGQFQYGNQKVEIDEDLLKQIATVTGGQY 272 Query: 429 FEANSTHELNKIFRDR 444 F A + +L I+ + Sbjct: 273 FRATNNQKLEAIYEEI 288 >gi|290985353|ref|XP_002675390.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284088986|gb|EFC42646.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 923 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 63/190 (33%), Gaps = 26/190 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V+ L+ ++ +K+ D R+ F+ +V ++ + + + K + Sbjct: 707 KLDMVKSTLSFMVDQLKEKD------RVAIVEFDTQVKTNLDLT-KMDIEGKKKAKQVSS 759 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T ++ A+ T+ + S +++ ++L TDG + I Sbjct: 760 AISPGSCTNLSGALFTSLKLLASRQQEKNEVTS--------VILFTDGLANRGLISTNEI 811 Query: 390 CNKAKS--------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + I T F + + + ++ K F Sbjct: 812 LQNMQDLMDELLSTSNVTIHTFGFGQDTDANMLTSIAQK---GNGLYDYLETADDIPKAF 868 Query: 442 RDRIGNEIFE 451 + IGN + Sbjct: 869 GNVIGNLVSV 878 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ I Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 GLIED--GTAVGMGVANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 72.3 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ I Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 GLIED--GTAVGMGVANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|194221225|ref|XP_001492399.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Equus caballus] Length = 883 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/309 (12%), Positives = 90/309 (29%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + I T + + ++ + + + + P Sbjct: 184 IEADIFEPQGISTLDAEASFVTNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPKNVVFVIDVSGSM-Y 296 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K +DAL ++ +K+ D +N F+ V + + F Sbjct: 297 GRKIQQTKDALLKILEDVKEDDYLN------FILFSGDVTTWKDNLVQATPENIQQAREF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ++ + G T INDA+ + + E+ ++ I++LTDG+ + Sbjct: 351 VMNIHSQGMTNINDALLRGISMLNKAREENAVPERSTS----IIIMLTDGDANVGESKPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + + + F N L N +E + + + F + Sbjct: 407 KIQENVHNAIRGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTSV 475 >gi|187919333|ref|YP_001888364.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] gi|187717771|gb|ACD18994.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] Length = 451 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/473 (9%), Positives = 111/473 (23%), Gaps = 60/473 (12%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G I A+ + +LG+ G+ VD + L A A+I A + + Sbjct: 11 FRRRQRGSVSIFVAVSLIALLGILGLAVDSGFGYMIKARLDAATDGAVIAAGEAVTRGNN 70 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + Y +++ + Sbjct: 71 QTEQTNNAQQAATAFFAANYPAGFLGSSVSAGTPSIVFNAG-----------------TV 113 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + F + +G + + ++A + + ++ S + Sbjct: 114 TIGMTAQASVPVTFSKVLGFNVLNVSSSSQAIRKTLD--MVFVIDNTKSLNTSG--VPAA 169 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 S + V G V K + + + Sbjct: 170 VRSNAVAFLNNFDVTNDRVALMHFAYGTVVDVPFKGNTRGFDRATMTTDINKYTFDGSTN 229 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S E ++ + + + + + + N Sbjct: 230 SPEALWNARNQLNTVI-TQPSSLRVIVFFSDGAPNSFSSFFTTNQSKCNNISTTLASGDG 288 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV- 372 + + TA H + + Sbjct: 289 PSGSASGLFSINALSQKLASPCYQNDTSGFVTAM-----PKWYNAHNVNEQTFPIWPVTS 343 Query: 373 --LLTDGENTQ------DNEEGIAICNKAKSQGIRIMTIAFSVN-----KTQQEKARYFL 419 ++++G T A+ A+++G + T+ + E + L Sbjct: 344 PRVVSNGNATYVNVNRVSRNLLEAMAAAARAEGTYVFTLGYGNELTKPAGPDNELGQDVL 403 Query: 420 SNCASP--------------NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A+ + A + +L F +++ +++RI+K Sbjct: 404 KCMANTADSLSRCYNPKQPVGVYCYAATPADLKPCF-----SQLASQILRISK 451 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/343 (13%), Positives = 88/343 (25%), Gaps = 60/343 (17%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS----EGQPLNCFGQP 207 G + + + I L Sbjct: 322 GRGYRSSDGRCRYGWLVYESRGDVFFDIVEERTGESETFQNWTYKPVTYNLASLYDDNVI 381 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY-PGPLDPSLSEEHFVDSSSLRH 266 + + S+ N E+ + + + + Sbjct: 382 SLPIGSNGSNTNVTWDGCIEEANTVATDTFDPFPQDAHDLKINLTPSNVNEYWKPVLRNA 441 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K++ + L + ++ + S P+ ++ + + RT ++T Sbjct: 442 TWKREDSSGNVLGHITQTGNENRPG---------------YSCPAAAFKLTDISRTDLET 486 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK--NNLEAKKYIVLLTDG------- 377 + +T + M I + N ++IV +TDG Sbjct: 487 YVDGLTPRSNTYHDFGMIWGARFISPNGIFAASNATAPNGDAISRHIVFMTDGLLVPNQE 546 Query: 378 --------------ENTQDNEEGIA--------ICNKAKSQGIRIMTIAFSVNKTQQEKA 415 N + C A+ + I + IAF TQ Sbjct: 547 IYSMYGIEWWDRRITNDGSGGQARDRHATRFQVACRAARQENISVWVIAFGTTLTQN--- 603 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L +CA+P F+AN T L F I EI +R+T+ Sbjct: 604 ---LIDCATPGRAFQANDTAALETRFEQ-IAQEIAA--LRLTQ 640 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 43/166 (25%), Gaps = 3/166 (1%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + K T + +I+A + ++ + G VD R+ E L+ A + A + Sbjct: 1 MAKDVTANTIVISAASLVPLMAMVGGGVDASRYYMAETRLQAACDAGALAARRSMADDNF 60 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE---MNPRKSAYQ 130 + R F + + +V T + Sbjct: 61 SRADRITGEKFFDENYPDGTFGLEDLERSFTATQSGQVNGEASGTLPTAIMAPFGYDEFS 120 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + ++ D+ ++ + S + Sbjct: 121 LSVTCEADVNISNTDVLFVVDVTGSMNCAPDNPGGGSCGNTEDPGA 166 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 59/198 (29%), Gaps = 39/198 (19%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +V+ I + R+G F R + LI+ + + Sbjct: 117 RASRVDVVKQVTQKFIEA-------RPNDRIGMIAFAARPYLVSPLTLDHGWLIQNLDRV 169 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI A+ + +I + + +VLLTDG+N Sbjct: 170 KLGLVED--GTAIGSAIASCTTRLIERKD----------SKSRIVVLLTDGDNNAGKVSP 217 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFLSNCA--SPN 426 + A + G+++ TI + L A + Sbjct: 218 LTAAEAASALGVKVYTIGAGTKGFAPMPVGRDVFGRKVYQNVKVDVDEDTLKKIADMTKA 277 Query: 427 SFFEANSTHELNKIFRDR 444 F+ A T L +I+ + Sbjct: 278 KFYRATDTKSLTQIYEEI 295 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 55/204 (26%), Gaps = 24/204 (11%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + + + I +G F + S Sbjct: 99 DINMKYGEERLTRMDVAKIVVEKFIGGDGDELVGRPDDLIGLITFARYADTIAPLSLAHE 158 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 LI + TA D + + K I+LLTDG Sbjct: 159 ALISIVQDVTINTRPNEDGTAYGD-------ATALAAAQLDLLQGDQDIKSKIIILLTDG 211 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---------------QEKARYFLSNC 422 EN N + + AK GI+I TI+ T L Sbjct: 212 ENNCGNHLPLQAASLAKEWGIKIYTISIQNKPTPERKKTDQGTFFVPPTPSAGDQVLKKM 271 Query: 423 A--SPNSFFEANSTHELNKIFRDR 444 A + F A+ L ++++ Sbjct: 272 AESTGGVFRLAHDYDSLKSVYKEI 295 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 60/181 (33%), Gaps = 24/181 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V++ L+ + + R+G FF F+ + + + Sbjct: 118 VTRLDSVKEVLSEFLAE-------REGDRVGLVFFGSAAFIQMPFTEDLEICQELMDEAQ 170 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 T + DA+ + S K ++LLTDG +T Sbjct: 171 VR--MAGPQTMLGDAIGLSISIFDQS-----------ELEDKVLILLTDGNDTGSLVAPE 217 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 A+ +GI I T+A +Q L + +S ++ A + EL I+ + Sbjct: 218 KAAQIARDKGIVIHTVAVGDPAAAGEQALDEATLRSISSLTKGKYYWAGNREELAGIYDE 277 Query: 444 R 444 Sbjct: 278 I 278 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 53/200 (26%), Gaps = 31/200 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 296 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA-V 348 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I I+LL+DG++T Sbjct: 349 YSLTSNGGTNIAEGLRKGSKVIEDRQ---------AKNPVCSIILLSDGQDTYTVSPTAG 399 Query: 389 ICNKAKS------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + A Q + + F + S +F + Sbjct: 400 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEAA 456 Query: 437 LNKIFRDRIGNEIFERVIRI 456 + F IG + + Sbjct: 457 IQDAFAQCIGGLLSVVAQDL 476 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 56/166 (33%), Gaps = 22/166 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ S S+ ++ + + ST I D + A + ++ Sbjct: 144 DRIGLVIFADQADVAASLSFDTAAVVHALEEAQI--GLVGRSTGIGDGLGLALKRLDAAT 201 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VN 408 +K ++LL+DG N + A+ GIR+ TIA + Sbjct: 202 AR-----------EKVVILLSDGANNAGQTTPHDVAGLARDLGIRVHTIALGPRDLSDAD 250 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 L + + S FF +T +L D I R Sbjct: 251 GDPDVVDTEALRDVSATSGGRFFRVRTTDDL-AAVADAIDEVEGGR 295 >gi|152994336|ref|YP_001339171.1| hypothetical protein Mmwyl1_0295 [Marinomonas sp. MWYL1] gi|150835260|gb|ABR69236.1| conserved hypothetical protein [Marinomonas sp. MWYL1] Length = 528 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/476 (9%), Positives = 113/476 (23%), Gaps = 62/476 (13%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV-PLIQSL 72 + G L++ + VD R LK+A A + L Sbjct: 26 FYQDEAGAVLPFAVLIIVGGVIAVSFAVDTTRMVNSSAQLKRATDVAALAIGNIQLTNGN 85 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ K + + + + ++ + D V ++ + + Sbjct: 86 DDDVDLQKIASGYVLSNLGMDSGLINQIEAEQVTVTKGEVDGSPTYKVSVSLIAQSDLLK 145 Query: 133 LSSRYDLLLNPLSLFLR------SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + + + R + + + A + + + Sbjct: 146 AGGQEQVIFSTVEVVSRPTEVALILPNSGTEDRGELAALRKVSKEFARNLLGDESAGVSP 205 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN----------GKVGIRDEKLSPYMVSC 236 + Q +N + + + +++ + G P Sbjct: 206 TQKVWLSLVPFSQSVNVYDESDPDRINRWAAFGELNPPELRSLFRTGKARSLADPRFPDR 265 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVN 292 +L + + + + A I + D Sbjct: 266 KANLLCVHRGLSAGQNFFWDQPPSGQFEIYYRHDLPENGSPGAPPISWVGPNPDFPDTQA 325 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + R + + + ++ + + AM A + Sbjct: 326 EDTRWIVADKGCPDAALLPLTDNLAEID-----ARLDKMSTRFNVNYAIAMSWAGAALSP 380 Query: 353 SNEDEVH--------RMKNNLEAKKYIVLL---------------TDGENTQD-----NE 384 + + K IV+L DGE T D + Sbjct: 381 NMRGGAGWGNNELPLDFSLDGNNVKVIVMLVNTIGDWFDTDAYNFNDGETTNDSMAFARQ 440 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQ-------QEKARYFLSNCAS-PNSFFEAN 432 +C S+ I+ I + + A L CAS +F A+ Sbjct: 441 RFSDLCRDFNSKNIKFFFIGVRPGDPEDWGRTLFADVAGPGLRECASGDGNFHFAD 496 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/308 (14%), Positives = 87/308 (28%), Gaps = 26/308 (8%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 S R G+ K T + I ++ YQ + L Sbjct: 191 PQGSETYRDYGVNPVEDPAKDRLSTFAIDVDTASYA-IARRKIMDGALPPYQAVRAEEFL 249 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N F + + + ++ P + + VD+ Sbjct: 250 NYFDYGY--ASPAAGPFAVHLAAAPSPFTSGHHLVRVAVQGKRVPVKERTPVHLVYLVDT 307 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S K L + +L + ++K D V + V + + + Sbjct: 308 SGSMQSPDKIELAKKSLKMLTDTLKPGDTVA------LCTYAGSVREVLAPTGI---ESK 358 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + D GSTA++ + AY R + IVL +DG+ Sbjct: 359 GKILAALADLTAGGSTAMSSGIDLAYSL--------AERTLVKGHVNRVIVL-SDGDANV 409 Query: 382 DNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 I K +GI + T+ F + + ++ +S + Sbjct: 410 GPTSHDEILKTIKRARDKGITLSTVGFGQGNYKDLMMEQLANQ--GDGNYAYIDSEAQAR 467 Query: 439 KIFRDRIG 446 ++F +++G Sbjct: 468 RVFSEQVG 475 >gi|126723497|ref|NP_001075477.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Oryctolagus cuniculus] gi|75056157|sp|Q9GLY5|ITIH3_RABIT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|11041718|dbj|BAB17302.1| inter-alpha-trypsin inhibitor heavy chain3 [Oryctolagus cuniculus] Length = 903 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 71/220 (32%), Gaps = 14/220 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +++ G + +D S +K +DAL ++ +++ D++N Sbjct: 267 GYFVHFFAPQGLPVVPKNVVFVIDVSGSM-YGRKLEQTKDALLKILEDMREEDHLN---- 321 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ V + + F + GST +ND + + ++ E+ Sbjct: 322 --FILFSSDVTTWKEHLVQATPENLQEARAFVKSIQDQGSTNLNDGLLRGISMLNTAREE 379 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 ++ +++LTDG+ I ++ + + F N Sbjct: 380 HRVPERSTS----IVIMLTDGDANSGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNF 435 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 L N +E + + F + + N + V Sbjct: 436 LESLALENDGFARRIYEDSDANLQLHGFYEEVANPLLTSV 475 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 53/200 (26%), Gaps = 31/200 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 287 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA-V 339 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I I+LL+DG++T Sbjct: 340 YSLTSNGGTNIAEGLRKGSKVIEDRQ---------AKNPVCSIILLSDGQDTYTVSPTAG 390 Query: 389 ICNKAKS------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + A Q + + F + S +F + Sbjct: 391 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEAA 447 Query: 437 LNKIFRDRIGNEIFERVIRI 456 + F IG + + Sbjct: 448 IQDAFAQCIGGLLSVVAQDL 467 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 67/187 (35%), Gaps = 27/187 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V++ ++ I MG F ++ V+ L + + + Sbjct: 109 SRFDVVKEIVSDFISQ-------RKNDNMGLVVFGAYSFIASPLTYDVNILNKILSQLQI 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGI 387 TA+N ++ + + K + K +LLTDG +T Q + + Sbjct: 162 --GMAGKYTALNTSLAQGANLL-----------KQSKSKTKIAILLTDGYSTPQVDTITL 208 Query: 388 AIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 I + K +GI++ I + + L A S F A+S EL ++++ Sbjct: 209 DIALDMIKKEGIKVYPIGIGMPHEYNTE---ALLKIANESGGVAFGASSAAELQEVYKKI 265 Query: 445 IGNEIFE 451 E + Sbjct: 266 DSLEKSK 272 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 52/194 (26%), Gaps = 30/194 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 308 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSLLA-V 360 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I I+LL+DG++T Sbjct: 361 NSLTSNGGTNIAEGLRKGSKVIEERQ---------AKNPVCSIILLSDGQDTYTVSPTAG 411 Query: 389 ICNKA-----------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + A +Q I + F + S +F + + Sbjct: 412 VHKGAPEYCALLPSTNGNQQIPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEAAI 468 Query: 438 NKIFRDRIGNEIFE 451 F IG + Sbjct: 469 QDAFAQCIGGLLSV 482 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 64/194 (32%), Gaps = 35/194 (18%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + + + +V+ L I+ R+G F D ++ + + + Sbjct: 117 NGNEVDRLVMVKTVLGDFIQR-------RVGDRIGLILFADTAYLQAPLTFDRTTVEQLL 169 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 +T STAI DA+ A K +VLLTDG+NT N Sbjct: 170 SETVI--GLVGDSTAIGDAIGLAAKRFSDK-----------PNVNKVLVLLTDGQNTAGN 216 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNCA--SPNSF 428 + A Q I+I I + + L+ A + + Sbjct: 217 ITPDQALSLAVDQNIKIYPIGVGADAMTVNSLFGQRQVNPSADLDEGLLTRLAKDTGGQY 276 Query: 429 FEANSTHELNKIFR 442 F A T EL +I+R Sbjct: 277 FRARDTQELEQIYR 290 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 51/161 (31%), Gaps = 28/161 (17%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F D + + + + + TAI +A+ ++ Sbjct: 131 LILFADHAYLQAPLTQDRRSVAQFLKEAQI--GLVGKQTAIGEAIALGVKRFDMVDKSN- 187 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN---------- 408 + +VLLTDG N + A +G++I I + Sbjct: 188 ----------RILVLLTDGSNNSGSISPEQAAAIAAKRGVKIYAIGVGADVMERRSIFGT 237 Query: 409 ---KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + LS+ A + +F A S+ +L +I+++ Sbjct: 238 ERVNPSMDLDEAQLSSLAKITGGLYFRARSSQDLQQIYQEI 278 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 59/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLFQGIKC 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA+ + A + S K I+LLTDG N + + + Sbjct: 161 DIIED--GTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPVPVGGTVQYINTPVEIDEKTLTQIAGITDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 52/154 (33%), Gaps = 21/154 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D+ S S+ + + + ST I D + A + ++ Sbjct: 139 DRIGLVIFADQADVAASLSFDTASVAHALDEAQI--GLVGRSTGIGDGLGLALKRLDATP 196 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VN 408 +K +VLL+DG N + A+ GIR+ TIA Sbjct: 197 AR-----------EKVVVLLSDGANNAGQTTPHDVAALARELGIRVHTIALGPRDLSDAE 245 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 L + A S FF +T +L + Sbjct: 246 GDPDVVDTEALRDVATTSGGRFFRVRTTDDLAAV 279 >gi|170739508|ref|YP_001768163.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193782|gb|ACA15729.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 342 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 64/214 (29%), Gaps = 20/214 (9%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L+ L + +D+ + A+ +V+ R G Sbjct: 86 PLHRDQPTRDLLLLVDLSGSMDTKDFTDASGRTVDRLTAVKAVLDDFLSRRKG---DRAG 142 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F D + F+ + + T TA+ DA+ Sbjct: 143 VVVFGDAPFALVPFTTDLDLCREMLRDTVV--GMAGPRTALGDAIGLGIALFD------- 193 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKAR 416 + K I+ LTDG +T AK +GI I T+A + + Sbjct: 194 ----RSTVKAKTIIALTDGNDTASQVPPTEAAGVAKDKGIVIHTVAIGDPSTVGEDKLDE 249 Query: 417 YFLSNCAS--PNSFFEANSTHELNKIFRDRIGNE 448 L + AS FF A EL +I+ G E Sbjct: 250 TALKDVASATGGGFFRALDRDELARIYGRLDGTE 283 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 62/194 (31%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNVGITLFAGESFTQCPLTVDHAVLL-NLIKDVK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TA+ + A + S K I+LLTDG N + + Sbjct: 160 CGLIEDG-TAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNRGEISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/329 (13%), Positives = 96/329 (29%), Gaps = 42/329 (12%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +V++S L + M L S + Sbjct: 113 EVIVSMEATGNLQKTTTVESIMNKD-MLCTGVVGLVGEPFSIETTSDYEKATLTYVIDKN 171 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE--KLSPYMVSCNKSLYYMLYPG 247 + + + + D V+ + + + + +K ++ + Sbjct: 172 KLGDTEFDNLMFLWYDEKKDEFVELDTILDEENSTVSINTPHFSKYMLVDKKEWFDAWKR 231 Query: 248 PLDP------SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 L+ D S K + A+ + I S+ D Sbjct: 232 ASLYFQDEYEPLAAAICYDCSGSMSGNDPKGYRKLAIDNFIDSMTLTDKTA------LIS 285 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F D FS +L ++ G T + +++ A + + + Sbjct: 286 FEDEAKLVSEFSDNKEELK------GLVNPYFGGGTNVRASVEMAIEQLNTVQH------ 333 Query: 362 KNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++I+LL+DG+ + N + KA I+I TI Q+ Sbjct: 334 ----WYTRHIILLSDGDVNININLANNTVDDLIKKAVDNNIKIHTIGLGSGADNQK---- 385 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRDR 444 L +CA + +F A + +L+ I++D Sbjct: 386 -LKDCAEYTGGQYFTAETAEKLDAIYKDL 413 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 56/188 (29%), Gaps = 11/188 (5%) Query: 271 KHLVRDALASVIRSIKKIDNV-----NDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIV 324 + + I+ K +V + F+D V + Sbjct: 299 IDVSGSMFGTKIKQTKSAMHVILNDLHRDDSFNIITFSDVVHVWRPGQSIPATAQNKKSA 358 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K + G T IN A+ A ++ I+ LTDGE T Sbjct: 359 KDYVNKIEADGWTDINAALMAAASIFNQTSHKPEKETSTKKIP--LIIFLTDGEATSGVL 416 Query: 385 EGIAICNKA-KSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I + A K+ G I + +AF + R L N +E + K F Sbjct: 417 ATSRILSNAQKAMGGTISLFCLAFGEDADYNLMRRLSLENRGIARRIYEYSDATLQLKGF 476 Query: 442 RDRIGNEI 449 D I + + Sbjct: 477 YDEIASPL 484 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 40/203 (19%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + + +D ++ + + T R+G F++R + S + + + Sbjct: 99 ADDFRPNRLEVAKDKISDFVA-------LRPTDRIGLIMFSERAFTLLPLSTDLKLIKQM 151 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + T I DA+ A R +L K I+LLTDG + Sbjct: 152 VGEINVGGML-GSGTNIGDALGLAVA-----------RGAQSLAKNKVIILLTDGVSNVG 199 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKT-------------------QQEKARYFLSNCA 423 I +AK QGI++ TI L A Sbjct: 200 FLTPIQAAEEAKKQGIKVYTIGIGGRGDAKIPYGKNIFGRQRYQNIPGGSIDFKTLKEIA 259 Query: 424 --SPNSFFEANSTHELNKIFRDR 444 + FEA L ++ + Sbjct: 260 DKTNGQTFEAQDEKALAEVLSEI 282 >gi|110331845|gb|ABG67028.1| inter-alpha (globulin) inhibitor H3 [Bos taurus] Length = 889 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 39/309 (12%), Positives = 90/309 (29%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I T + + ++ + + + + Sbjct: 182 ITVDIFEPQGISTLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSCPTCTDS 240 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + + ++ +++ G S +D S H Sbjct: 241 LLKGDFIITYDVNRESPANVQIVNGYF-----VHFFAPQGLPVVPKSVVFVIDVSGSMHG 295 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K +DAL ++ +K+ D +N F+ V + F Sbjct: 296 -RKMEQTKDALLKILEDVKQDDYLN------FILFSGDVTTWKDSLVPATPENIQEASKF 348 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +D + G T INDA+ + + E+ ++ I++LTDG+ Sbjct: 349 VMDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTS----IIIMLTDGDANVGESRPE 404 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + + F + Sbjct: 405 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGLARRIYEDSDANLQLQGFYEE 464 Query: 445 IGNEIFERV 453 + N + V Sbjct: 465 VANPLLTGV 473 >gi|156120445|ref|NP_001095368.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] gi|160332333|sp|P56652|ITIH3_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|154425846|gb|AAI51420.1| ITIH3 protein [Bos taurus] gi|296474789|gb|DAA16904.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] Length = 891 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 39/309 (12%), Positives = 90/309 (29%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I T + + ++ + + + + Sbjct: 184 ITVDIFEPQGISTLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSCPTCTDS 242 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + + ++ +++ G S +D S H Sbjct: 243 LLKGDFIITYDVNRESPANVQIVNGYF-----VHFFAPQGLPVVPKSVVFVIDVSGSMHG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K +DAL ++ +K+ D +N F+ V + F Sbjct: 298 -RKMEQTKDALLKILEDVKQDDYLN------FILFSGDVTTWKDSLVPATPENIQEASKF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +D + G T INDA+ + + E+ ++ I++LTDG+ Sbjct: 351 VMDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTS----IIIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGLARRIYEDSDANLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 51/175 (29%), Gaps = 33/175 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G + + + ++ + + TAI + TA + Sbjct: 126 KDRPNDRIGLVIYAGESYTKTPVTTDKGIILNAL--SELTYGQIEDGTAIGMGLATAVNR 183 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K + + I+LLTDG N + A GIR+ TI N Sbjct: 184 L-----------KESKAKSRVIILLTDGVNNTGFIDPQTAAELAAEYGIRVYTIGIGSNG 232 Query: 410 TQQEK------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T + A + +F A +L +I+ + Sbjct: 233 TALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYDEI 287 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 71.6 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 57/180 (31%), Gaps = 24/180 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + I + V+ + I + R+G F + F+ + + + Sbjct: 117 NKIDRLEAVKLVVDDFIER-------REGDRIGLILFGTKAYLQVPFTQDLETARFLLDE 169 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T + DA+ A T S + ++LLTDG +T Sbjct: 170 AQI--GMAGAQTMLGDAIGLAIQTFEDS-----------KTENRVLILLTDGNDTGSQVP 216 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEK--ARYFLSNCA--SPNSFFEANSTHELNKIF 441 A + I I TIA +T+ + L A + FF + L +I+ Sbjct: 217 PDQAAKFAAQRNIVIYTIAIGNPETEGTEKIDEETLQLIADQTGGQFFRGSDRQGLIQIY 276 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 64/198 (32%), Gaps = 41/198 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A I +G T F + + L+ +++ Sbjct: 108 NRIEAAKAVAAEFIA-------GRPNDNIGLTIFAGEAFTQCPMTTDHASLL-NLLQNVR 159 Query: 329 IDENEMG----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D G TA+ + A + S K ++LLTDG N + Sbjct: 160 TDIAARGLIQDGTAVGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNMGDL 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPN 426 + AKS GIR+ TI NK + E LS+ A + Sbjct: 209 SPMTSAQIAKSLGIRVYTIGVGTNKVARYPMPVTGGIQYVNIPVEIDTKTLSDIAATTDG 268 Query: 427 SFFEANSTHELNKIFRDR 444 +F+ A + EL +I+ D Sbjct: 269 NFYRATNNRELKQIYNDI 286 >gi|108756796|ref|YP_635538.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108460676|gb|ABF85861.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 700 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 33/288 (11%), Positives = 87/288 (30%), Gaps = 25/288 (8%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 I T+ E + ++ S+ + Q + +N F + + Sbjct: 240 TINTEEERFSTFSVDTDSASYTLTRAYLERGSLPNEQAVRVEEFVNTFDYGY--AHQGSA 297 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + +V + + + P +D S ++ + LV+ Sbjct: 298 PFSVQVEGFPSPVRKGYHVVHVGVKAREVSRPQRKPSHLVFVIDVSGSMNLENRLGLVKR 357 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 AL ++ + + D V+ + + + I++ + GS Sbjct: 358 ALHLLVNELDERDQVS------IVVYGSTARLVLEPT---SAVHAHIIRAAIDSLHTEGS 408 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS- 395 T ++ Y +++ + ++L +DG + +I + ++ Sbjct: 409 TNAQAGLEMGYSL--AASHLVEGGINR-------VILCSDGVANTGLTDANSIWERIRAR 459 Query: 396 --QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +GI + T+ F + ++ + E ++IF Sbjct: 460 AAKGITLSTVGFGMGNYNDVLMERLSQ--VGEGNYAYVDRIEEAHRIF 505 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 60/186 (32%), Gaps = 31/186 (16%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 +G F + + L+ + + + TAI + TA + + Sbjct: 121 NRPNDNIGLVMFAGESFTQCPLTTDHATLLNRLSEVEIGYLED--GTAIGLGIATACNRL 178 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 K + K IVLLTDG N + + A+S GIRI T+A Sbjct: 179 -----------KESHAKSKIIVLLTDGTNNAGSIAPSMAASLAESLGIRIYTVAVGTRGE 227 Query: 411 QQ----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 E L A + S+F A LN+I+ + E + Sbjct: 228 APYPHATAFGTVIDNVKVEIDEASLKEIAQTTGGSYFRATDNESLNQIYDEIDSLEKSKL 287 Query: 453 VIRITK 458 + + K Sbjct: 288 MTQNFK 293 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 55/172 (31%), Gaps = 32/172 (18%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLI---RTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 +G T F + + L+ + + A TA+ + A + Sbjct: 117 NDNIGLTIFAGEAFTQCPMTTDHASLLNLLQGVRTDIASRGLIADGTAVGMGLANAVSRL 176 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 K + K I+LLTDG N + + AKS GIR+ TI N Sbjct: 177 -----------KESKAKSKVIILLTDGSNNMGDISPLTAAQIAKSLGIRVYTIGVGTNTV 225 Query: 411 QQ----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L + A + F+ A + EL +I+ D Sbjct: 226 APYPVTVGGTTQYVNVPAEIDTKTLKDIAQSTDGGFYRATNNAELKEIYNDI 277 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 22/188 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + + I R+ F + + H ++R ++ Sbjct: 146 DRLTVAKQVIDEFI-------RRRPHDRIALVGFGAHASTIAPLTLD-HAVLRNLIVQVR 197 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + TAI + + + + S K IVLLTDG + D + Sbjct: 198 LGVVDGQETAIGAGLGVSLNRLKESQAAT-----------KIIVLLTDGVHNADGMDPDT 246 Query: 389 ICNKAKSQGIRIMTIAFS-VNKTQQEKARYFLS--NCASPNSFFEANSTHELNKIFRDRI 445 + A +G+ I T+ + L A+ + A T L F+D + Sbjct: 247 VAQTAAERGVVIYTVLMGQQTGDRSSVDAGQLERLAGATDGYAYLAEDTQTLETSFQDLL 306 Query: 446 GNEIFERV 453 + Sbjct: 307 DKLEKSSI 314 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 58/205 (28%), Gaps = 41/205 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ A IR R+G + + + ++ ++ + Sbjct: 111 NRLSALKEVAADFIRQ-------RPNDRIGLVAYAGEAFTKTPITSDKSIVLNSLRE--I 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + T+ + + K + K I+LLTDG N E Sbjct: 162 TYGQLNDGTAIGMGLATSVNRL-----------KESKAISKIIILLTDGVNNSGFIEPQT 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 + A GI+ TI N L A + + Sbjct: 211 AADLAVEYGIKSYTIGLGTNGNALSPIAYNADGSYRYGMRQVEIDEKLLEGIAETTGGKY 270 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 F A +L + D I V Sbjct: 271 FRATDNEKL-EAIYDEINKLEKTEV 294 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 53/200 (26%), Gaps = 31/200 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 296 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA-V 348 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I I+LL+DG++T Sbjct: 349 YSLTSNGGTNIAEGLRKGSKVIEDRQ---------AKNPVCSIILLSDGQDTYTVSPTAG 399 Query: 389 ICNKAKS------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + A Q + + F + S +F + Sbjct: 400 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEAA 456 Query: 437 LNKIFRDRIGNEIFERVIRI 456 + F IG + + Sbjct: 457 IQDAFAQCIGGLLSVVAQDL 476 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 33/286 (11%), Positives = 74/286 (25%), Gaps = 36/286 (12%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S + ++ + + P S + + K + N P Sbjct: 193 SNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNMSNSPL 252 Query: 248 PLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + V + + K L++ A+ VI+++ D R+ F+ Sbjct: 253 NSHNPRAPVDLVTVLDISGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSST 306 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + ++ G T I + ++ + E Sbjct: 307 ARRLFPLRRMTDAGRQQALQA-VNSLVANGGTNIAEGLRKGAKVMEDRKE---------R 356 Query: 366 EAKKYIVLLTDGENTQ-----DNEEGIAICNKA--------KSQGIRIMTIAFSVNKTQQ 412 I+LL+DG++T + ++ G I Sbjct: 357 NPVSSIILLSDGQDTYTVNGSSGNQPQPNYQLLLPLSMHGSQNTG---FQIPVHSFGFGT 413 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + + S +F + + F IG + V + Sbjct: 414 DHDASSMHTISEISGGTFSFIETESVIQDAFAQCIGGLLSVVVQEL 459 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 62/194 (31%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D I N +G T F + + L+ ++K Sbjct: 108 NRLEAAKDVATEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL-NLLKDMK 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + A + S K I+LLTDG N + + + Sbjct: 160 CGFIEDG-TAIGMGLANAVTRLKDSKAKS-----------KVIILLTDGVNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFE 430 AKS GIR+ TI N E L+ A ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 44/394 (11%), Positives = 107/394 (27%), Gaps = 27/394 (6%) Query: 46 WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 + + +LK + +++ ++ Q+ + RA+ + E L R + + Sbjct: 6 ANTVKTSLKISVLASMVALAISGCQNTDTGEQRAEQDKR-VEIAKTEELQRQKKEQEQVV 64 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 T + ++ V M +N + + + + Sbjct: 65 VTGARIMSEAKERNVPMYDMAMPAPQSPGIWQQPEVNREQY--QHLDESGIFLAKEQPVS 122 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 T S +++ ++ + + +N F + + Sbjct: 123 TFSIDVDTGSYSNVRRML-NDGYLPPEDAVRLEEFVNYFNYDYQTPDSTEQPFAVNTHVF 181 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + P + + +D S + K LV+ +L + Sbjct: 182 SAPWNSNAYLMEIGIKGFEPEQQELPPSNLVYLIDVSGSMNSEDKLGLVKKSLKLL---- 237 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +D R+ + + G R ++ + GST ++ Sbjct: 238 --AQESSDQDRISIVVYAGASGVVLEPTKG---NDRMAIEQALDRLSAGGSTNGGAGIEL 292 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 AY E + N ++L TDG+ + + + GI T Sbjct: 293 AYKL----AEQAFIKDGINR-----VILATDGDFNVGTINREQLIDLVERKRESGISFTT 343 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + F + ++ +S E Sbjct: 344 LGFGSGNYNEHLMEQLADK--GNGNYGYIDSLQE 375 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 55/196 (28%), Gaps = 40/196 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A I R+G + + + ++ ++ + Sbjct: 111 DRLTALKKVAADFI-------KKRPNDRIGLVAYAAESYTKTPITSDKSIVLSSLRQITH 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI + TA + + S K I+LLTDG N E Sbjct: 164 GQLED--GTAIGMGLATAVNRLKDS-----------KSKSKVIILLTDGVNNSGFIEPKT 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 + A I+ TI N L A + + Sbjct: 211 AADLAVEYKIKTYTIGLGTNGNALTPIAFNPDRTYRYGMRQVEIDEKLLEEIATVTGGQY 270 Query: 429 FEANSTHELNKIFRDR 444 F A +L+ I+ + Sbjct: 271 FRATDNEKLSAIYNEI 286 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D ++ + + + + ++ TAI DA+ + Sbjct: 130 DRLGLILFADTAYLQTPMTYDRNTVKQMLNESVL--GLVGERTAIGDAIALSVKRFRDDE 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 + + +VLLTDG+NT N A++ + I IA Sbjct: 188 KSN-----------RVLVLLTDGQNTAGNLPPEQALELAQAYDVTIYPIAVGAEEVVVDS 236 Query: 409 -------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 ++ + + A + +F A ST+EL +I++ Sbjct: 237 FFGQRRVNPSRDLDVPLMQSIAKQTGGKYFRARSTNELEEIYQ 279 >gi|1915956|emb|CAA72309.1| inter-alpha-inhibitor heavy-chain 1 [Sus scrofa] Length = 779 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 72/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + S + NG + + + + Y + V + Sbjct: 114 STCSTTLLNGDFKVTYDVNREKLCDLLVANNYFAHFFAPQNLTKLNKNVVFVIDISSSME 173 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F V S +++ Sbjct: 174 GQKVKQTKEALLKILSDLKPG---DYFDLVLFGSAVQSWRGSLVQASTANLDAARSYVRQ 230 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + GST +N + + + + + + N +++LTDGE T+ + I Sbjct: 231 FSLAGSTNLNGGLLRGIEIL-NKAQGSLPEFSNRAS---ILIMLTDGEPTEGVTDRSQIL 286 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + + F + L N +E + + + F D++ N Sbjct: 287 KNVRDAIRGRFPLYNLGFGHDVDWNFLEVMALENNGRAQRIYEDHDAAQQLQGFYDQVAN 346 Query: 448 EIFERV 453 + + V Sbjct: 347 PLLKDV 352 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 71.6 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 10/192 (5%) Query: 258 FVDSSSLRHVIKKKHLVR-DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + S ++ L +V + + R+G F D F+ Sbjct: 108 DLSGSMAEKDFTSIDGIKISRLDAVKKVLNDFAKTRKGDRLGLILFGDAAFVQTPFTADH 167 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + +T ST + DA+ E++ + + KK ++LTD Sbjct: 168 EVWLDLLNQTRVE--MAGKSTHLGDAIGLTIKRFE---ENDNSQPLSTTSRKKVAIILTD 222 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNC--ASPNSFFEAN 432 G +T + AK +GIRI IA +Q ++ AS F+A Sbjct: 223 GNDTDSYVPPMDAAKVAKVKGIRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQAL 282 Query: 433 STHELNKIFRDR 444 + EL + + Sbjct: 283 NQDELINAYAEI 294 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 60/200 (30%), Gaps = 42/200 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-TF 327 + ++ +A I + R+G F + + +++ + + + Sbjct: 109 NRLEAAKETIAKFISARTS-------DRIGLVVFAGESFTMVPPTLDYQMILQRVNEISS 161 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 A TA+ AM A + S + ++ +TDGEN + Sbjct: 162 ASSAKIKDGTALGVAMANAAGRLKDSQARS-----------RVMIFMTDGENNSGTIDPE 210 Query: 388 AICNKAKSQGIRIMTIAFSVN---------------------KTQQEKARYFLSNCASP- 425 AK GI++ +I + + L AS Sbjct: 211 TGLEIAKGYGIKVYSIGIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNEDLLGRMASDT 270 Query: 426 -NSFFEANSTHELNKIFRDR 444 ++ A + L K+F D Sbjct: 271 GGKYYRATTEGALQKVFSDI 290 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 64/193 (33%), Gaps = 23/193 (11%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + + A A+ I + R+G F + + Sbjct: 154 DTSGSMAAQDFDPNRITAAKVAAANFIA-------NRLSDRIGIVVFASDAMLQSPLTLD 206 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 L+ + G TAI DA+ + + + + K I+LLT Sbjct: 207 YESLLDFLADVRIGMVRTDG-TAIGDAIAVSSVHL-----------ERSPARSKVIILLT 254 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA--RYFLSNCA--SPNSFFEA 431 DGE+ + A GI++ TIA ++ L A + ++ A Sbjct: 255 DGESNSGVISPLDAAKTAALYGIKVYTIATISKNSRDSLDFKPDDLEQIAKLTGGKYYRA 314 Query: 432 NSTHELNKIFRDR 444 + EL KI+ + Sbjct: 315 YNEAELTKIYAEI 327 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/286 (11%), Positives = 72/286 (25%), Gaps = 36/286 (12%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S + ++ + + P S + + K + N P Sbjct: 198 SNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNMSNSPL 257 Query: 248 PLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + V + + K L++ A+ VI+++ D R+ F+ Sbjct: 258 NSHNPRAPVDLVTVLDISGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSST 311 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + ++ G T I + ++ + E Sbjct: 312 ARRLFPLRRMTDAGRQQALQA-VNSLVANGGTNIAEGLRKGAKVMEDRKE---------R 361 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI-------------MTIAFSVNKTQQ 412 I+LL+DG++T ++ I Sbjct: 362 NPVSSIILLSDGQDTYTVNGSSG---NXPQPNYQLLLPLSMHGSQNTGFQIPVHSFGFGT 418 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + + S +F + + F IG + V + Sbjct: 419 DHDASSMHTISEISGGTFSFIETESVIQDAFAQCIGGLLSVVVQEL 464 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 55/188 (29%), Gaps = 27/188 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V++ + +I + D R+ FN + K Sbjct: 1620 KIENVKNTILQLIDMLNDND------RLSIITFNSHAQQLCGLRKVNKDNKENLQKIT-K 1672 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---- 385 G T I +QTA+ + S + I LL+DG++ + Sbjct: 1673 SIYANGGTNITSGLQTAFSILQSRKQRNSVSS---------IFLLSDGQDNNSDSRIRNL 1723 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + + + I + F + + A SF+ ++++ F D Sbjct: 1724 LQTTYQQLQEECFTIHSFGFGNDHDGP-----LMQRIAQIKDGSFYYVERNDQVDEFFID 1778 Query: 444 RIGNEIFE 451 +G Sbjct: 1779 ALGGLFSV 1786 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 56/182 (30%), Gaps = 24/182 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTV-------RMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 DA + V R+G F F+ + + ++ Sbjct: 111 FVDATGKQTDRLSAAKQVLKQFVAGREGDRLGLIVFGSAAYLQAPFTDDRETWLALLDES 170 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 STA+ DA+ + S + +++LTDG +T Sbjct: 171 IVN--MAGPSTALGDAIGLSIAHFRES-----------KTKNRVLIVLTDGNDTGSKVPP 217 Query: 387 IAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + AK++G+ I T+A ++ L + A + F A L + ++ Sbjct: 218 LDAAQVAKAEGVTIYTVAVGDPETVGEEALDLEVLDSIAQTTGGVSFNAADLKALQETYQ 277 Query: 443 DR 444 Sbjct: 278 RI 279 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 71.2 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ + Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL--NLFQGI 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA+ + A + S K I+LLTDG N + + + Sbjct: 159 KCGIIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFE 430 AKS GIR+ TI N E L+ A + ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAATTEGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 71.2 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 60/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D A I N +G T F + + L+ + Sbjct: 108 NRLEAAKDVAAEFI-------NGRPNDNIGITLFAGESFTQCPLTVDHAVLL--NLFQGI 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA+ + A + S K I+LLTDG N + + + Sbjct: 159 KCGIIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNNKGDISPLT 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFE 430 AKS GIR+ TI N E L+ A + ++F Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAATTEGNYFR 267 Query: 431 ANSTHELNKIFRDR 444 A S +L +++ + Sbjct: 268 ATSNSKLKEVYEEI 281 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 71.2 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 58/171 (33%), Gaps = 30/171 (17%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D R+G F D+ ++ + R + + TAI DA+ Sbjct: 124 RRDGDRLGLILFADQAYLQAPLTFDRFAVARFLDEAVL--GLVGQQTAIGDAIALGVKRF 181 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--- 407 + + +VLLTDGEN + A+ G+++ TI Sbjct: 182 NDLEQSS-----------RVLVLLTDGENNAGRFTPAQAVSLARQSGVKLYTIGIGSAEI 230 Query: 408 -------------NKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDR 444 + + + F+ + ++ +F A ST EL I+++ Sbjct: 231 RRRGLLGTRTVNPSSDLDQAEKSFIQLSESTGGRYFRARSTEELESIYQEL 281 >gi|330806846|ref|YP_004351308.1| lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327374954|gb|AEA66304.1| Putative lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 557 Score = 71.2 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 37/308 (12%), Positives = 88/308 (28%), Gaps = 26/308 (8%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + T S +++ +++ S+ + +N F Sbjct: 95 LPDNPIHSVAETPVSTFSVDVDTGSYANVRRLLNQ-GSLPPEGAVRLEEMVNYFPYSYAL 153 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + +P+ + + VD S + Sbjct: 154 PTDGSP-FGVTTEVAPSPWNPHTRLLRIGIKASDRAVADLAPANLVFLVDVSGSMDRREG 212 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ L ++ ++ D V+ + + G + ++ Sbjct: 213 LPLVKSTLKLLVDQLRDQDRVS------LVVYAGESRVVLKPTSGR---DKVTIRNAIDQ 263 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GI 387 + GSTA ++ AY + N I+L TDG+ + Sbjct: 264 LDAGGSTAGASGIELAYQM----ARESFIDKGINR-----ILLATDGDFNVGVSDFDSLK 314 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + G+ + T+ F V+ + A ++ ++ E K+ D++ + Sbjct: 315 QMAVDQRKSGVSLTTLGFGVDNYNEHLMEQL--ADAGDGNYAYIDNLLEARKVLVDQLSS 372 Query: 448 EIFERVIR 455 + V R Sbjct: 373 TLAV-VAR 379 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 71.2 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 57/183 (31%), Gaps = 39/183 (21%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 R+G ++ ++ + LIR I + + + TAI + TA Sbjct: 123 FSKERQADRLGLVSYSGEALTRVPLTTDREVLIREINALESGELED--GTAIGIGLATAI 180 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE------GIAICNKAKSQGIRIM 401 + I S K I+L+TDG + + A S+GI++ Sbjct: 181 NHIKDSKAKS-----------KVIILMTDGVESINPTNDLMYISPQTAAEMATSRGIKVY 229 Query: 402 TIAFSVNKTQQ------------------EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 TI + L N A + +F A L KI+ Sbjct: 230 TIGIGTRGLAPFPTAYDMYGNYIFDMMPVDIDEKLLQNIADLTGGLYFRATDNQSLQKIY 289 Query: 442 RDR 444 ++ Sbjct: 290 QEI 292 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/322 (10%), Positives = 76/322 (23%), Gaps = 11/322 (3%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G II L + V++G G+ +D+ + + L+ + + A+ + + + S A Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEA 78 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 T + + V ++ Sbjct: 79 AGLTTGTRNAALFQGKAVEMFENLNVSYSDTPDNTFYTKDKVPYSLDKIKYVKCTAERTG 138 Query: 140 ---LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + L M I+ + A A T S V I + Y R Sbjct: 139 IAQWFIQVLNTLPGMNIQPSTVNAMAVATTTSAQTACAIPVYICTPQTANPVRTAYNRGD 198 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP----- 251 + + + G + G Sbjct: 199 WIKSKDEKDPYGPGSFGWADLTPPGGGASELADLLAGSGQCDLSVVGSKVGQPGSISSLI 258 Query: 252 -SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI--KKIDNVNDTVRMGATFFNDRVIS 308 + + + + S + SV + + D ++ T + + Sbjct: 259 PAWNTRFGIYTGSYKGPQDGTPDFTGYAYSVTKWNPPNGGNAYADFIQKRGTNEPYQTDT 318 Query: 309 DPSFSWGVHKLIRTIVKTFAID 330 + S + + + A Sbjct: 319 NSGLSTQGSASTKAVHQAGADR 340 >gi|15929704|gb|AAH15276.1| Inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 234 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIVFVIDVSGSMS 293 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + KT Sbjct: 294 G-RKIQQTREALLKILDDVKEDDYLN------FILFSTDVTTWKDHLVQATPANLKEAKT 346 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ T IND + + + + ED ++ I++LTDG+ Sbjct: 347 FVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTS----IIIMLTDGDANTGESRP 402 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 403 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 462 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 463 EVANPLLTNV 472 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/258 (15%), Positives = 80/258 (31%), Gaps = 18/258 (6%) Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 L + + + + + S Y L L++ Sbjct: 5 SSLVPYKTCTTDALDKEETYYYVLSTDAATMYRVKYSDGNWYYIDDSKEGLGEKLTDNKE 64 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + + + ++ A + + + ++ + FN F Sbjct: 65 RLYYTTNDANDRFYYLKQAATNFTTQLAQSSPNSE---IALVTFNKTATEQFDFKNV--G 119 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G T N+ + AY + +N K+Y+VLLTDG Sbjct: 120 KDSAYITETINAMETSGGTHQNEGLDRAYKILN--------NDQNTSNLKRYVVLLTDGC 171 Query: 379 NT--QDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCASPNSFFEANST 434 ++ +I NK KS +++T+ ++ T + A+ +L A N + AN Sbjct: 172 PNGVTYDQITTSI-NKIKSTNTKLITVGVGLDETNTGLKAAKDYLQANADDNMAYNANDA 230 Query: 435 HELNKIFRDRIGNEIFER 452 LN IF +G Sbjct: 231 SHLNTIFTQILGQTTNSN 248 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 58/190 (30%), Gaps = 26/190 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K +V+ LA + +K D R+ F+ + + F+ +++ R+ K Sbjct: 64 SKIEMVKSTLAFMFDQLKPTD------RIALVEFDSNISTSLQFT-NMNESGRSKAKQVV 116 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + T ++ A+ I ++L TDG + Sbjct: 117 SNIRAGSCTNLSGALFEGLRLIGQRTNANEVTS---------LLLFTDGLANEGITNTNE 167 Query: 389 ICNKAK-------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I K + + T F + A ++ +T ++ K F Sbjct: 168 IVKKMTTMIHEEIRTNLTVFTFGFGTDTDANMLTSISQ---AGNGLYYFLQTTDDIPKAF 224 Query: 442 RDRIGNEIFE 451 + IG I Sbjct: 225 GNVIGGLISV 234 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 52/180 (28%), Gaps = 38/180 (21%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + + H + + + +TA+ DA+ + Sbjct: 143 DRVGLILFGSQAFILSPLTLDRHAARQ--LLDGVVPSIAGPATALGDAIALGVSKLRERP 200 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 E + ++++ DG+N + A++ G RI I + Sbjct: 201 EGS-----------RVMIVIADGDNNAGSFAPKEAARLARATGTRIYVIGVGSKQPSIPI 249 Query: 410 ---------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L A + +F A T L EI R+ ++ K Sbjct: 250 LEEGSVRYRDDLTMDEGTLQEIADLTGGGYFRATDTRAL---------EEISSRIGQLEK 300 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 63/173 (36%), Gaps = 17/173 (9%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 D L +V + + + D R+G F F+ + + +T Sbjct: 130 DRLTAVKQVLTEFSQRRDGDRLGLIVFGSAAYLQAPFTADKDTWLTLLQETEIA--MAGA 187 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 ST+I DA+ + T S+ D + +++LTDG +T + A + Sbjct: 188 STSIGDAIGLSISTFEHSDTDN-----------RVLIVLTDGNDTGSRVPPVDAARVANA 236 Query: 396 QGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + ++I TIA + L + + ++FEA L + + D Sbjct: 237 RDVKIYTIAIGDPETIGEDAMDVDTLKQVSDITGGAYFEALDRQALERAYLDI 289 >gi|159110717|ref|NP_032433.2| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Mus musculus] Length = 889 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 237 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIVFVIDVSGSMS 296 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + KT Sbjct: 297 G-RKIQQTREALLKILDDVKEDDYLN------FILFSTDVTTWKDHLVQATPANLKEAKT 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ T IND + + + + ED ++ I++LTDG+ Sbjct: 350 FVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTS----IIIMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 406 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTNV 475 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 51/175 (29%), Gaps = 33/175 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G + + + ++ + + TAI + TA + Sbjct: 133 KDRPNDRIGLVIYAGESYTKTPVTTDKGIILNAL--SELTYGQIEDGTAIGMGLATAVNR 190 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K + + I+LLTDG N + A GIR+ TI N Sbjct: 191 L-----------KESKAKSRVIILLTDGVNNTGFIDPQTAAELAAEYGIRVYTIGIGSNG 239 Query: 410 TQQEK------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T + A + +F A +L +I+ + Sbjct: 240 TALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYNEI 294 >gi|8393899|ref|NP_059047.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Rattus norvegicus] gi|3024056|sp|Q63416|ITIH3_RAT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|602886|emb|CAA58233.1| pre-alpha-inhibitor, heavy chain 3 [Rattus norvegicus] Length = 887 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/250 (14%), Positives = 74/250 (29%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 237 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIAFVIDVSGSMS 296 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + + Sbjct: 297 G-RKIQQTREALLKILDDMKEEDYLN------FILFSTGVTTWKDHLVKATPANLEEARA 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + + T IND + + + + ED + ++ +V+LTDG+ Sbjct: 350 FVKNIRDRSMTNINDGLLRGIEMLNKAREDHLVPERSTS----ILVMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + F + Sbjct: 406 EKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFARRIYEDSDASLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTNV 475 >gi|329744607|ref|NP_001193278.1| inter-alpha-trypsin inhibitor heavy chain H3 [Sus scrofa] Length = 889 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 38/309 (12%), Positives = 91/309 (29%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + I T + + ++ + + + + Sbjct: 184 ITADIFEPQGISTLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSCPTCTDS 242 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + + ++ +++ G + +D S Sbjct: 243 LLKGDFIITYDVNRESPANVQIVNGYF-----VHFFAPQGLPVVPKNVVFVIDVSGSM-Y 296 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K RDAL ++ IK+ D +N F+ V + + F Sbjct: 297 GRKMEQTRDALLKILDDIKEDDYLN------FVLFSGDVTTWKDSLVQATPENIQKAREF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + G T IND + T + + E+ ++ I++LTDG+ + Sbjct: 351 VRNIRDQGMTNINDGLLTGISMLNKAREEHKVPERSTS----IIIMLTDGDANMGVSKPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTSV 475 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 63/196 (32%), Gaps = 34/196 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ A+ VI+ + ID R+ F+ + + K Sbjct: 77 DKIEKVKTAMLFVIKKLSPID------RLSVVTFSADANRLCPLRQITENSQKDLEKL-I 129 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 N G+T I +QT + + + I+L++DGE Sbjct: 130 NGLNADGATNITAGLQTGLKVLSDRSLSGGRVVG--------IMLMSDGEQNAGG----- 176 Query: 389 ICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRD 443 + A+ + + T F + L A +F + T+ L+K F Sbjct: 177 --DAAQVPVGNVPVYTFGFGI-----NHEPRVLKAIAHNSIGGTFSDVQDTNNLSKAFSQ 229 Query: 444 RIGNEIFERV--IRIT 457 + + V +++T Sbjct: 230 CLAGLLTVVVQDLKLT 245 >gi|148692826|gb|EDL24773.1| inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 235 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIVFVIDVSGSMS 294 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + KT Sbjct: 295 G-RKIQQTREALLKILDDVKEDDYLN------FILFSTDVTTWKDHLVQATPANLKEAKT 347 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ T IND + + + + ED ++ I++LTDG+ Sbjct: 348 FVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTS----IIIMLTDGDANTGESRP 403 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 404 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 463 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 464 EVANPLLTNV 473 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 68/237 (28%), Gaps = 36/237 (15%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 MV+ N + + +D S K L++ A+ VI+++ D Sbjct: 232 MVTGNANSLNNQISRYPRAPVDLVTVLDISGSMAG-TKLALLKRAMGFVIQNLGSND--- 287 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+ F+ + + ++ G T I + ++ + Sbjct: 288 ---RLSVIAFSSTARRLFPLTKMSDAGRQRALQA-VNSVVANGGTNIAEGLRKGVKVMED 343 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDG---------ENTQDNEEGIAI--CNKAKSQGIRIM 401 + I+LL+DG + +++ C K I + Sbjct: 344 R---------RDKNPVASIILLSDGRDTYTMNQADPNYKLLLPLSMHGCES-KRFQIPVH 393 Query: 402 TIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + F + + + + S +F S + IG + V + Sbjct: 394 SFGFGSDHD-----ASLMHSVSETSGGTFSFIESESVIQDALAQCIGGLLSVAVQEL 445 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 70.8 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 68/237 (28%), Gaps = 36/237 (15%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 MV+ N + + +D S K L++ A+ VI+++ D Sbjct: 229 MVTGNANSLNNQISRYPRAPVDLVTVLDISGSMAG-TKLALLKRAMGFVIQNLGSND--- 284 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+ F+ + + ++ G T I + ++ + Sbjct: 285 ---RLSVIAFSSTARRLFPLTKMSDAGRQRALQA-VNSVVANGGTNIAEGLRKGVKVMED 340 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDG---------ENTQDNEEGIAI--CNKAKSQGIRIM 401 + I+LL+DG + +++ C K I + Sbjct: 341 R---------RDKNPVASIILLSDGRDTYTMNQADPNYKLLLPLSMHGCES-KRFQIPVH 390 Query: 402 TIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + F + + + + S +F S + IG + V + Sbjct: 391 SFGFGSDHD-----ASLMHSVSETSGGTFSFIESESVIQDALAQCIGGLLSVAVQEL 442 >gi|695636|emb|CAA49843.1| inter-alpha-inhibitor H3 chain [Mus musculus] Length = 886 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 234 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIVFVIDVSGSMS 293 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + KT Sbjct: 294 G-RKIQQTREALLKILDDVKEDDYLN------FILFSTDVTTWKDHLVQATPANLKEAKT 346 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ T IND + + + + ED ++ I++LTDG+ Sbjct: 347 FVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTS----IIIMLTDGDANTGESRP 402 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 403 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 462 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 463 EVANPLLTNV 472 >gi|149034207|gb|EDL88977.1| inter-alpha trypsin inhibitor, heavy chain 3 [Rattus norvegicus] Length = 886 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 35/250 (14%), Positives = 74/250 (29%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 237 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIAFVIDVSGSMS 296 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + + Sbjct: 297 G-RKIQQTREALLKILDDMKEEDYLN------FILFSTGVTTWKDHLVKATPANLEEARA 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + + T IND + + + + ED + ++ +V+LTDG+ Sbjct: 350 FVKNIRDRSMTNINDGLLRGIEMLNKAREDHLVPERSTS----ILVMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + F + Sbjct: 406 EKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFARRIYEDSDASLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTNV 475 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 68/237 (28%), Gaps = 36/237 (15%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 MV+ N + + +D S K L++ A+ VI+++ D Sbjct: 232 MVTGNANSLNNQISRYPRAPVDLVTVLDISGSMAG-TKLALLKRAMGFVIQNLGSND--- 287 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+ F+ + + ++ G T I + ++ + Sbjct: 288 ---RLSVIAFSSTARRLFPLTKMSDAGRQRALQA-VNSVVANGGTNIAEGLRKGVKVMED 343 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDG---------ENTQDNEEGIAI--CNKAKSQGIRIM 401 + I+LL+DG + +++ C K I + Sbjct: 344 Q---------RDKNPVASIILLSDGRATYTMNQADPNYKLLLPLSMHGCES-KRFQIPVH 393 Query: 402 TIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + F + + + + S +F S + IG + V + Sbjct: 394 SFGFGSDHD-----ASLMHSVSETSGGTFSFIESESVIQDALAQCIGGLLSVAVQEL 445 >gi|3024063|sp|P97280|ITIH3_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|1694692|dbj|BAA13940.1| inter-alpha-trypsin inhibitor heavy chain 3 [Mesocricetus auratus] Length = 886 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 72/250 (28%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E V +++ G + +D S Sbjct: 234 PTCTDSLLNGDFTIVYDVNRESPGNVQVVNGYFVHFFAPQGLPVVPKNIVFVIDISGSMA 293 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R AL ++ +K+ D +N F+ V + +T Sbjct: 294 G-RKIQQTRVALLKILDDMKQDDYLN------FILFSTGVTTWKDSLVQATPANLEEART 346 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F ++ G T IND + + + E ++ I++LTDG+ Sbjct: 347 FVRSISDQGMTNINDGLLRGIRMLTDAREQHTVPERSTS----IIIMLTDGDANTGESRP 402 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I + + + F N L N +E + + + F + Sbjct: 403 EKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIYEDSDANLQLQGFYE 462 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 463 EVANPLLTNV 472 >gi|161784288|sp|Q61704|ITIH3_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor Length = 889 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 237 PTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNIVFVIDVSGSMS 296 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R+AL ++ +K+ D +N F+ V + KT Sbjct: 297 G-RKIQQTREALLKILDDVKEDDYLN------FILFSTDVTTWKDHLVQATPANLKEAKT 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + ++ T IND + + + + ED ++ I++LTDG+ Sbjct: 350 FVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERSTS----IIIMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + + F + Sbjct: 406 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIYEDSDANLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTNV 475 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 39/273 (14%), Positives = 77/273 (28%), Gaps = 29/273 (10%) Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGI--RDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + P + V S +G + S + + Sbjct: 206 SNKTAEISSYPESQAVSQSSCLDGFDILIHVKAPTSSSNDVTGSLVNESSMRSSRRVPID 265 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +D S K L++ A+ VI+ ++ D R+ F+ Sbjct: 266 LVTVLDVSGSMAG-TKLALLKQAMGFVIQHLRPSD------RLSVIAFSSTARRLFPLQR 318 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 H + ++ G T I DA++ A I N + I+LL Sbjct: 319 MSHHGRQQALQA-ISSLGAGGGTNIADALKKAVKVIEDRN---------YKNSVCSIILL 368 Query: 375 TDGENTQDNEEGIAICNKAKSQGI------RIMTIAFSVNKTQQEKARYFLSNC--ASPN 426 +DG++T + + + + + + + L + AS Sbjct: 369 SDGQDTFNISSNFQGTSAGRRSLVPPSILNELHMVPLHTFGFGADHDSDTLHSISEASGG 428 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI--RIT 457 +F + F IG + V R++ Sbjct: 429 TFSFIEDEGVMQDAFAQCIGGLLSVAVQEMRLS 461 >gi|172087820|ref|YP_001816750.1| hypothetical protein VF_A1192 [Vibrio fischeri ES114] gi|171902402|gb|ACB55718.1| conserved hypothetical protein containing von Willebrand factor type A domain [Vibrio fischeri ES114] Length = 350 Score = 70.8 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 66/192 (34%), Gaps = 10/192 (5%) Query: 258 FVDSSSLRHVIKKKHLVR-DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 + S + ++ L +V + + R+G F D F+ Sbjct: 108 DLSGSMAEKDFTSTNGLKISRLDAVKKVLNDFVKTRKGDRLGLILFGDAAFVQTPFTADH 167 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + +T E ST + DA+ SN + + +K ++LTD Sbjct: 168 KVWLDLLNQTHV--EMAGKSTHLGDAIGLTIKRFEDSNNS---QPLSTTSREKVAIILTD 222 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNC--ASPNSFFEAN 432 G +T + AK +G+RI IA +Q ++ AS F+A Sbjct: 223 GNDTDSYVPPMDAAKVAKVKGVRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQAL 282 Query: 433 STHELNKIFRDR 444 + EL + + Sbjct: 283 NQDELINAYAEI 294 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 56/196 (28%), Gaps = 38/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A I + R+G + + + + + Sbjct: 112 NRLTALKKVAAEFIE-------GRPSDRIGLVVYAGESFTKTPITTDKSIIQNALKDIKY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +G TAI + TA + + S K I+LLTDG N E Sbjct: 165 KHGELIGGTAIGMGLATAVNRLKDSKAKS-----------KVIILLTDGVNNAGFIEPQI 213 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A GI+ TI N L A + + Sbjct: 214 ASELAVEYGIKTYTIGIGTNGMASTPVALNPDGTILFRNMQVEIDEKLLQQIAKTTGGKY 273 Query: 429 FEANSTHELNKIFRDR 444 F A +T +L +I+ + Sbjct: 274 FRATNTKKLAEIYDEI 289 >gi|7441761|pir||JC5576 inter-alpha-trypsin inhibitor heavy chain 3 - golden hamster Length = 889 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 72/250 (28%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E V +++ G + +D S Sbjct: 237 PTCTDSLLNGDFTIVYDVNRESPGNVQVVNGYFVHFFAPQGLPVVPKNIVFVIDISGSMA 296 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K R AL ++ +K+ D +N F+ V + +T Sbjct: 297 G-RKIQQTRVALLKILDDMKQDDYLN------FILFSTGVTTWKDSLVQATPANLEEART 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F ++ G T IND + + + E ++ I++LTDG+ Sbjct: 350 FVRSISDQGMTNINDGLLRGIRMLTDAREQHTVPERSTS----IIIMLTDGDANTGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I + + + F N L N +E + + + F + Sbjct: 406 EKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIYEDSDANLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTNV 475 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 65/209 (31%), Gaps = 34/209 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + +V + + + + R+G F R + ++ + L Sbjct: 103 DYEIDQKRVSRLDIVLEVVKTFLD-------KRTNDRIGLVAFAGRAYTVCPLTFDHNWL 155 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R I + A + TAI DA+ A + E + +++LLTDG N Sbjct: 156 KRKIDQLQAGTIED--GTAIGDALGLALSRLEGKKE-----SGERKKIGSFLILLTDGAN 208 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIA------------------FSVNKTQQEKARYFLSN 421 N I A + + TI E L N Sbjct: 209 NCGNLTPIEAARLAAHAAVPVFTIGAGINGEVTMPVMDEERRKIGSQTVVSEVDEGLLRN 268 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGNE 448 A + +F A ++ + F+ + Sbjct: 269 IAQLTGGEYFRATDSNAIVSAFQAIDAQK 297 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 61/197 (30%), Gaps = 41/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +A I + ++ F R ++ L + I Sbjct: 106 TRLEASKKTMADFI-------KKRNFDKISLVAFALRASVLSPSTFDYTSLEKEIGNIKI 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + GST+I + TA D + S + +A+K I+LLTDGEN + Sbjct: 159 DE---EGSTSIGLGIATAVDMLRSVKD----------DAEKVIILLTDGENNSGEIDPKL 205 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQE-------------------KARYFLSNCAS--PNS 427 A + I+I TI L AS Sbjct: 206 ASEIASNFNIKIYTIGIGDAAGSHAWVTYTDPNYGKRRIRADFTLNEKALIEIASITGGK 265 Query: 428 FFEANSTHELNKIFRDR 444 +F A ++ L+ ++ Sbjct: 266 YFNAKTSSALDNVYNTI 282 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 53/194 (27%), Gaps = 30/194 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 302 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSLLA-V 354 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I +A++ I I+LL+DG++T Sbjct: 355 NSLTSNGGTNIAEALRKGSKVIEERQ---------AKNPVCSIILLSDGQDTYTVSPTAG 405 Query: 389 ICNKA-----------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + A +Q + + F + S +F + + Sbjct: 406 VHKGAPEYCALLPSTNGNQQVPVHVFGFGADHDSVSLHSISQ---TSGGTFSFIETEAAI 462 Query: 438 NKIFRDRIGNEIFE 451 F IG + Sbjct: 463 QDAFAQCIGGLLSV 476 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 52/175 (29%), Gaps = 33/175 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G + + + ++ ++ TAI + TA + Sbjct: 126 KDRPNDRIGLVVYAGESYTKTPVTTDKGIILSSL--AELTYGQVEDGTAIGMGLATAVNR 183 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K + + I+LLTDG N + + A GI++ T+ N Sbjct: 184 L-----------KESKAKSRVIILLTDGVNNTGVIDPLIAAELAAEYGIKVYTVGIGTNG 232 Query: 410 TQQEK------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A + +F A + +L +I+ + Sbjct: 233 MALSPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRATNNQKLQQIYDEI 287 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 12/72 (16%), Positives = 31/72 (43%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + ++ + G+ II A M +++ ++S + L++A+ A ++A Sbjct: 15 RQFLRRFANNERGNVMIIFAAAALPMAIGVAGALEISQYSQLKSQLQEASDRAALSAMAA 74 Query: 68 LIQSLEEVSSRA 79 L + + +A Sbjct: 75 LREGPRAMRQQA 86 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 70.4 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 52/426 (12%), Positives = 113/426 (26%), Gaps = 66/426 (15%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + E ++ T +V+D+ + + V D+ Sbjct: 30 MGQVYASDSTPTPAVAFDETSVSDPSTVDTWNQVVKDSTENIGRIWTDKTV---LTEDIQ 86 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L S G +L+ A + T + + I V+D S SM + Sbjct: 87 LEGGSRPSVKKGDSDFLVGLSALSSTSNTVKTVSRPLDIVLVVDTSGSMENSPHMGTRPE 146 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLS-----------PYMVSCNKSLYYMLYPGPL 249 + +++ + +D + Sbjct: 147 SERYRYEEIYAGNLSETEDQEYYTKDGGKITSEGQKILFWWEFTHWELNGQTVEPKISAE 206 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFFNDR 305 D + + F + L I K +++A+ + + ID++ R+ F Sbjct: 207 DSNPNHIQFYERRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASD 266 Query: 306 VISDPSFSWGVHKLIRTIVKTF--------AIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + + R+ + T D ++ I A + + Sbjct: 267 ESDNIGNDFINNNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQR-A 325 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKT 410 A+K +V TDG+ T +++ ++ A K I +I + Sbjct: 326 FQLGGTREGAQKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDAN 385 Query: 411 QQEKARYF------LSNCAS--------------------------PNSFFEANSTHELN 438 + + +S + + A ELN Sbjct: 386 PNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELN 445 Query: 439 KIFRDR 444 IF + Sbjct: 446 NIFNEI 451 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 57/171 (33%), Gaps = 23/171 (13%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + + V++ + +NDR D + ++ G Sbjct: 204 QAAIRIFTDLLATTPVDEQIG--LASYNDRASEDVQLTENFAEVN-----NAMDRLRTGG 256 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T+I+ MQ + R + ++ ++++TDG + + E + + Sbjct: 257 FTSISRGMQAGQEI--------ALRGRPPEFVERTMIVMTDGRHNRG-PEPRVVATDLAA 307 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 G+ I TI F + + A F A + +L I+R+ Sbjct: 308 DGVTIHTITFGAGADFGR-----MQDVARIGGGRHFHATNGDQLRDIYREI 353 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 53/194 (27%), Gaps = 9/194 (4%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A++M + L V +D+ + L+ S + + Sbjct: 33 RHGAMLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRS---------STDAAANAAATTL 83 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 Q+ ++ N N D + + D V +A + + Sbjct: 84 ADTLDRNLAIQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNAGEAPFNGVR 143 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + + + E + + I V+D S SM + + Sbjct: 144 VNGQRTAGSLSGPVPLFFGNVTGTSIFEPEAFATATYVERDITLVVDRSGSMAGSRFNDL 203 Query: 198 GQPLNCFGQPADRT 211 + F T Sbjct: 204 QAAIRIFTDLLATT 217 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 6/70 (8%) Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGN 447 C+ AKS+GI I TIAF Q L CAS + +F+A +L F+ IG Sbjct: 1 TCDTAKSKGIEIYTIAFMAPAGGQA----LLHYCASDDSHYFQAEKMEDLLAAFK-AIGA 55 Query: 448 EIFERVIRIT 457 + ++ R+T Sbjct: 56 KASSQLTRLT 65 >gi|194289639|ref|YP_002005546.1| hypothetical protein RALTA_A1531 [Cupriavidus taiwanensis LMG 19424] gi|193223474|emb|CAQ69479.1| conserved hypothetical protein, vWA domain (Von Willebrand factor, type A); putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 359 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 55/193 (28%), Gaps = 20/193 (10%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D+ R A+ V+ R+G F D F+ Sbjct: 103 SQSMDTRDFRDPAGALIPRVQAVRQVVSGFVARRPG---DRIGLIVFGDAPYPLAPFTLD 159 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + STA+ DA+ S +K +++LT Sbjct: 160 HRLV--QTLIADLLPGMAGPSTALGDAIGLGIKMFEHS-----------EAPEKVLIVLT 206 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEA 431 DG +T AK + + + TI +++ L A + +F Sbjct: 207 DGNDTASRMPPERAGGIAKERKVVVHTIGIGDPNASGEEKVDLDVLQKLAAQTGGRYFFG 266 Query: 432 NSTHELNKIFRDR 444 L I+ Sbjct: 267 ADQAGLETIYATL 279 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 57/181 (31%), Gaps = 24/181 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V++ L + K R+G F D F+ + + + Sbjct: 140 VSRLDAVKEVLDGFLAKRKG-------DRVGLVVFGDAAYLQAPFTTDLQLSQELLGECE 192 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA DA+ + E K I+ LTDG +T+ + Sbjct: 193 V--GMAGPRTAFGDAIGLGVNLFDEDTERA-----------KTIIALTDGNDTKSKVPPV 239 Query: 388 AICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 A + I+I T+A + + L + AS +F A L I+ + Sbjct: 240 EAARVATQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFFAADREHLAGIYDE 299 Query: 444 R 444 Sbjct: 300 L 300 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 52/426 (12%), Positives = 113/426 (26%), Gaps = 66/426 (15%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + E ++ T +V+D+ + + V D+ Sbjct: 30 MGQVYASDSTPTPAVAFDETSVSDPSTVDTWNQVVKDSTENIGRIWTDKTV---LTEDIQ 86 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L S G +L+ A + T + + I V+D S SM + Sbjct: 87 LEGGSRPSVKKGDSDFLVGLSALSSTSNTVKTVSRPLDIVLVVDTSGSMENSPHMGTRPE 146 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLS-----------PYMVSCNKSLYYMLYPGPL 249 + +++ + +D + Sbjct: 147 SERYRYEEIYAGNLSETEDQEYYTKDGGKITSEGQKILFWWEFTHWELNGQTVEPKISAE 206 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFFNDR 305 D + + F + L I K +++A+ + + ID++ R+ F Sbjct: 207 DSNPNHIQFYERRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASD 266 Query: 306 VISDPSFSWGVHKLIRTIVKTF--------AIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + + R+ + T D ++ I A + + Sbjct: 267 ESDNIGNDFINNNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQR-A 325 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKT 410 A+K +V TDG+ T +++ ++ A K I +I + Sbjct: 326 FQLGGTREGAQKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDAN 385 Query: 411 QQEKARYF------LSNCAS--------------------------PNSFFEANSTHELN 438 + + +S + + A ELN Sbjct: 386 PNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELN 445 Query: 439 KIFRDR 444 IF + Sbjct: 446 NIFNEI 451 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 52/426 (12%), Positives = 113/426 (26%), Gaps = 66/426 (15%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + E ++ T +V+D+ + + V D+ Sbjct: 32 MGQVYASDSTPTPAVAFDETSVSDPSTVDTWNQVVKDSTENIGRIWTDKTV---LTEDIQ 88 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 L S G +L+ A + T + + I V+D S SM + Sbjct: 89 LEGGSRPSVKKGDSDFLVGLSALSSTSNTVKTVSRPLDIVLVVDTSGSMENSPHMGTRPE 148 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLS-----------PYMVSCNKSLYYMLYPGPL 249 + +++ + +D + Sbjct: 149 SERYRYEEIYAGNLSETEDQEYYTKDGGKITSEGQKILFWWEFTHWELNGQTVEPKISAE 208 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----KKIDNVNDTVRMGATFFNDR 305 D + + F + L I K +++A+ + + ID++ R+ F Sbjct: 209 DSNPNHIQFYERRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASD 268 Query: 306 VISDPSFSWGVHKLIRTIVKTF--------AIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + + R+ + T D ++ I A + + Sbjct: 269 ESDNIGNDFINNNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQR-A 327 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKT 410 A+K +V TDG+ T +++ ++ A K I +I + Sbjct: 328 FQLGGTREGAQKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDAN 387 Query: 411 QQEKARYF------LSNCAS--------------------------PNSFFEANSTHELN 438 + + +S + + A ELN Sbjct: 388 PNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELN 447 Query: 439 KIFRDR 444 IF + Sbjct: 448 NIFNEI 453 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 68/190 (35%), Gaps = 35/190 (18%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + +V+ L I K R+G F D ++ + + + + ++ Sbjct: 110 VDRLEMVKAVLGDFIERRKG-------DRLGLILFADTAFLQTPITYDRNTVQQMLDESV 162 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A R K + + +VLLTDG+NT N Sbjct: 163 L--GLVGERTAIGDAIALAVK-----------RFKGKQQTNRVLVLLTDGQNTAGNLSPE 209 Query: 388 AICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSFFEAN 432 AK+ +RI IA ++ + N A + +F A Sbjct: 210 QALELAKAYDVRIYPIAVGAEEVVVDSVFGRRKVNPSRDLDVPLMQNLADETGGEYFRAR 269 Query: 433 STHELNKIFR 442 ST EL +I++ Sbjct: 270 STEELERIYQ 279 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 60/178 (33%), Gaps = 38/178 (21%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ + + + + ++ D TAI +A+QTA + SN Sbjct: 129 DRIAFIPYAAQAYPQLPLTSDYSAA-KIFLEAINTDMLSSQGTAIGEAIQTAINYFEDSN 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 + K +++L+DGE+ Q + + + K +GIRI TI + Sbjct: 188 QSS-----------KILIILSDGEDHQ--QGATEMIQEVKEKGIRIFTIGLGTTQGTTIP 234 Query: 412 -------------------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + + L A +F+ ++T ++ + + N Sbjct: 235 IKENGQTFPKRDKDGEVVITKLNQALLEEIAQEGDGKYFDGSNTQQVIDNLQKVLNNI 292 >gi|301604540|ref|XP_002931918.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Xenopus (Silurana) tropicalis] Length = 929 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 49/373 (13%), Positives = 99/373 (26%), Gaps = 22/373 (5%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 N + K + + + V L +LL L ++ ++ IK Sbjct: 195 GNELGSDKILGNGDNGDNGMELFKASASIPGRNKAVFLLMYEELLQRRLGMYEHTISIKP 254 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML----DYQRDSEGQPLNCFGQPADRT 211 + + E + + + + RS + N F + Sbjct: 255 QQLVGRLSVEVNIFENSGITALEVPPLRNKQRSKNTDTISPPPSTAINQTNNFARITFNP 314 Query: 212 VKSYSSQNGKVGIRDEKLSPY-------MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + ++ + GI + + Y + Y ++ V Sbjct: 315 SVVHQAKLSQNGILGDFVIRYDVQRELSVGDLQVLNGYFVHYFAPTNLPPLPKNVVFVID 374 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTI 323 + ++ ++ +K + F+ RV V Sbjct: 375 TSASMLGNKMKQTKEALFTILKDLRP---QDHFNIIGFSKRVKVWQQNQMVKVSPNNIRD 431 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K F G T IN+ + + + + ++ ++ LTDG T Sbjct: 432 AKKFIYSLYPTGETNINEGIHVGAQLLNNYLASN----GKHEKSVSLMIFLTDGRATIGE 487 Query: 384 EEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 E I K+ + +I F + + L NC E K Sbjct: 488 IESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDAASQLKG 547 Query: 441 FRDRIGNEIFERV 453 F D IG + + Sbjct: 548 FYDEIGTPLLSDI 560 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 50/379 (13%), Positives = 110/379 (29%), Gaps = 38/379 (10%) Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD-LLLNPLSLFLRSMGI 153 + D+ D K + +++ Y + +N + + ++ + Sbjct: 1194 GSQVNGYTVETTQTVSGGDVQSDGKSTKIGEKDSATFTITNTYTPIDINSVIEYNKTATL 1253 Query: 154 KSWLIQTKAEAETVSRSYHKE--HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 W +T T S + I V+D S SM + + Sbjct: 1254 LDWNQRTYKIDLTASSKTTQSMKTPYDIVLVLDQSGSMSQKFVEYNKINGSSMFWRKT-- 1311 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + ++ S + + Y ++ + + K Sbjct: 1312 --------YYIKTQNGIYQQLSWSWDNTWSYTDSYSGKTVTVDPNTTDVYVAQKSNQTKI 1363 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI----SDPSFSWGVHKLIRTIVKTF 327 ++ A + + + N N R+G F++ ++ S++ + + Sbjct: 1364 DALKSAATTFV---NNVANKNSDCRVGIVTFSNDGYIKPITNNSYTLAKVGTSKGDIINT 1420 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE------NTQ 381 G T + A + ++ + ++ KK +V LTDG N Sbjct: 1421 IDGLKTGGDTYPAKGLDKANEIFSENSSNSWETVEQTDGRKKMVVFLTDGVPAPANTNNF 1480 Query: 382 DNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQ--------EKARYFLSNCASP-NSFF 429 D A N AK QG+ + ++ ++ + AS + Sbjct: 1481 DENLAGAGTNSAKILHDQGVATYALGIFGAANSDGTMDNASVQRIDKYMQSIASSHEKYM 1540 Query: 430 EANSTHELNKIFRDRIGNE 448 A+S L+ +F N Sbjct: 1541 TADSVDNLSSLFESITNNI 1559 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 76/225 (33%), Gaps = 29/225 (12%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P + VD S+ + + +A+ I + ++G + Sbjct: 32 NASSPSASKVDAVLVVDVSNSMNTSDPGKIGNEAMKMFIDMLS-----TQNDKVGIVAYT 86 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D V + + + + +KTF N T + ++ A + Sbjct: 87 DVVQREKALLNITSEADKQELKTFIDGLNRGAYTDTSVGVKEALRILQD---------GK 137 Query: 364 NLEAKKYIVLLTDGENT----------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 IV+L DG N Q +++ +AK+ G+ I TI + + + Sbjct: 138 TAGHAPMIVMLADGNNDFNKTTGRTESQSDQDMAQAVAEAKNSGVPIYTIGLNADG---K 194 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + L++ A + F +S +L I + + + +V+ + Sbjct: 195 LNKNKLADIAQQTGGKSFITSSADDLPNILSEIFASNLKLKVVPL 239 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 59/194 (30%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D S I N +G F + + L+ + Sbjct: 108 NRLEAAKDVAISFI-------NNRPNDNIGMVTFAGESFTQCPLTTDHTVLLNMVQDLQM 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ++ TAI + TA + + S + ++LLTDG N + Sbjct: 161 GVLDD--GTAIGMGLATAVNRLKDSKAKS-----------RVVILLTDGSNNMGDITPRM 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFE 430 + A++ GIR+ T+ + L A S +F Sbjct: 208 AADIARTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFR 267 Query: 431 ANSTHELNKIFRDR 444 A LN+I+++ Sbjct: 268 AVDNETLNEIYKEI 281 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 62/203 (30%), Gaps = 37/203 (18%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + R+A + + + R+G F + + L R + Sbjct: 67 DFQPTRFEAAREAAGAFVE-------GRVSDRVGLIVFAAEAYTQAPLTLDYSFLQRMLE 119 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + TA+ A+ TA + + S + K +LLTDG N + Sbjct: 120 DVEVGAVED--GTAVGTALATAVNRLKDSEAES-----------KVAILLTDGRNNRGQI 166 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-----------------NCASPNS 427 + A++ G+R+ I ++ + + ++ Sbjct: 167 DPRTAAEVARTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAGVDAEMLRSVSVSTGGQ 226 Query: 428 FFEANSTHELNKIFRDRIGNEIF 450 +F A + L +I+ + E Sbjct: 227 YFSATNRDALERIYAEIDTMEAT 249 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 60/197 (30%), Gaps = 33/197 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L+++A++ VI+++ D R+ F+ + ++ Sbjct: 255 TKIALLKNAMSFVIQTLGPND------RLSVIAFSSTARRLFPLRRMTLAGRQQALQA-V 307 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ I I+LL+DG++T Sbjct: 308 SSLVASGGTNIADGLKKGAKVIEDR---------RLKNPVCSIILLSDGQDTYTLPSDRN 358 Query: 389 ICNKAK----------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 + + + ++I T F + + A S +F ++ Sbjct: 359 LLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAA-----MHAIAEISSGTFSFIDAEGS 413 Query: 437 LNKIFRDRIGNEIFERV 453 + F IG + V Sbjct: 414 IQDGFAQCIGGLLSVVV 430 >gi|73985485|ref|XP_533794.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H1 precursor (ITI heavy chain H1) (Inter-alpha-inhibitor heavy chain 1) (Inter-alpha-trypsin inhibitor complex component III) (Serum-derived hyaluronan-associated protein) (SHAP) [Canis familiaris] Length = 910 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 28/249 (11%), Positives = 70/249 (28%), Gaps = 10/249 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 244 PTCSTSLLNGDFKVTYDVNRDQLCDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISTSME 303 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F V S + F Sbjct: 304 GQKVKQTKEALLKILGDMRPG---DYFDLVLFGSEVQSWKGSLVQASPANLRAAQDFVKH 360 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T +N + + + + + + +++LTDGE T+ + I Sbjct: 361 FFLAGATNLNGGLLRGIEILNQAQK----NLPKLSNHASVLIMLTDGEPTEGVTDRSQIL 416 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + + F D++ N Sbjct: 417 KNVRNAIRGKFPLYNLGFGDNVDFNFLDVMSMENNGRAQRIYEDHDAAQQLQGFYDQVAN 476 Query: 448 EIFERVIRI 456 + V + Sbjct: 477 PLLVDVELL 485 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 58/178 (32%), Gaps = 38/178 (21%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ + + + + ++ + TAI +A+Q A + S+ Sbjct: 129 DRVAFIPYAAQAYPQLPLTSDYSSA-KIFLEGINTNMLSSQGTAIGEAIQMAINYFEESS 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV------- 407 + K +++L+DGE+ Q + + +AK +GIR+ TI Sbjct: 188 Q-----------TSKILIILSDGEDHQQGVDT--VIQEAKDKGIRLFTIGLGTAQGATIP 234 Query: 408 -----------NKTQQ----EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + Q + + L A +F +T E+ + + Sbjct: 235 VSENGQIVAKRDNNGQVVITKLNQALLEEIAQEGGGKYFNGANTKEVLDALQKALDTI 292 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 70.4 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 57/190 (30%), Gaps = 24/190 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + K Sbjct: 137 KIQNVKKTLEYLLELLGDND------RLCLILFNSYSTRLCHLM-KTNNSNKPAFKEIIN 189 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M+ A+ + + + I LL+DG++ + Sbjct: 190 KIQATGGTDINSGMELAFRVL---------KERKYQNPVSSIFLLSDGQDGSADLRVRQS 240 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIG 446 + + I + F + ++ S +F+ +++++ F D +G Sbjct: 241 LERHLPQECFTIHSFGFGSDHDGP-----LMNKICSLKDGNFYYVEKINQVDEFFVDALG 295 Query: 447 NEIFERVIRI 456 I Sbjct: 296 GLFSVVAQEI 305 >gi|242034231|ref|XP_002464510.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] gi|241918364|gb|EER91508.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] Length = 647 Score = 70.4 bits (170), Expect = 7e-10, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 56/207 (27%), Gaps = 40/207 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI ++ D R+ F+ + + +++ + Sbjct: 191 TKLELLKQAMGFVIDNLGPRD------RLCVVSFSSGANRLMRLA-RMSDAGKSLARRAV 243 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I +A++ A I +VLL+DG++T Sbjct: 244 QSLAAGGGTNIGEALRRAAKVIDERMHRNAVAS---------VVLLSDGQDTYTVPRRGG 294 Query: 389 ICNKAKSQGI-------------------RIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 + + T F + A+ +F Sbjct: 295 Y--GGRDANYDALVPPSFAFTGAGGRPAAPVHTFGFGTDHDAAAMHTIAE---ATGGTFS 349 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRI 456 + F IG + V + Sbjct: 350 FIEDEAAIQDAFAQCIGGLLSVTVQEL 376 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 49/164 (29%), Gaps = 26/164 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + + + L I + G TAI A+ A + Sbjct: 139 DRIGLVVFRGKGYTQCPLTIDHDVLAMLIDHISPQVIQDEG-TAIGSAILIATNRF---- 193 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI--AFSVNKTQ- 411 K + +K I+L+TDGEN + A GIRI + F + Sbjct: 194 -------KGSTSLQKVIILITDGENNTGDVGPATAATLAAQNGIRIYVVNAGFKSGGSAG 246 Query: 412 ---------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F A L+ + Sbjct: 247 NLSAESSAHAAMDEASLRGIARTTGGGYFRAEDPSVLDNTIKTI 290 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 54/194 (27%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D ++ I N +G F + + L+ + Sbjct: 109 NRLEASKDVASAFI-------NGRPNDNIGLVVFAAESFTQCPLTTDHTVLL--NLFKDV 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + A I S K I+LLTDG N Q + Sbjct: 160 QPGIIQDGTAIGLGLANAVSRIKDSQAKS-----------KVIILLTDGVNNQGEIAPVT 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFE 430 AK+ G+R+ TI E L A + +F Sbjct: 209 AAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDVEIDEPTLKQIAATTGGQYFR 268 Query: 431 ANSTHELNKIFRDR 444 A L +I+ + Sbjct: 269 ATDNASLKEIYSEI 282 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 53/407 (13%), Positives = 106/407 (26%), Gaps = 45/407 (11%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 Y N+ N T + SS Sbjct: 229 VTTTASSYSNETTGNDQSYNSTRYNSLSACNTAKPADVAWANYGSATQSSTTTTNSAGQQ 288 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + ++ + + + + ++ S Y + N Sbjct: 289 IVTVTVTQPQRKTTYTCQMSGGRYRIYYYYTTRNYYTYTYNTSNPVYTTVTSQVFSNFAY 348 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + + V SY + + + S S + S+ Sbjct: 349 KQINVDVSSYKKFQTVTVQNGTNGANVSYTWKGCIEERDTEAASSFSYSTVDGMSPSTAL 408 Query: 266 HVIKKKHLVRDALASVIRSI-------KKIDNVNDTVRMGATFFNDRVISDPSF-SWGVH 317 + + D + V T ++ + + + + Sbjct: 409 DLDIDRVPDSDPATKWAPMWPELGYYRTASSRSSTPVSTLETTSGSQLSAACPYKAQLLQ 468 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK---NNLEAKKYIVLL 374 + ++ +A + GST + M + + N E ++I+ + Sbjct: 469 TMSQSAFYAYADALSANGSTYHDLGMLWGLRLSSPDGPWQAMVNETPENGGEVSRHIIFM 528 Query: 375 TDGENTQD----------------------------NEEGIAICNKAKSQGIRIMTIAFS 406 TDG+ + A+C+ AK++G R+ IAF+ Sbjct: 529 TDGQMDTNYKVMSTYGIEWHDRRITDDGVTDQDARHTLRFRALCDAAKAKGFRVWVIAFA 588 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + LS CAS +S F A + ELN F++ N RV Sbjct: 589 SDLNDD------LSYCASASSTFPATNATELNTAFQEIAKNVAELRV 629 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 28/273 (10%), Positives = 70/273 (25%), Gaps = 9/273 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 + A +PV++ + G +D+ R + L+ A + + + + +++A+ Sbjct: 1 MLPLAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAGYDDAAKAQA 60 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE----MNPRKSAYQVVLSSRY 137 + F E+ L N + + + + T VE + + + Sbjct: 61 AAFFNANFNEDDLGATETNFATSSADGGSLVEGIATTDVEMVLMNLFGVISVPINVECSA 120 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAET---VSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + + + + S R+ K ++ S + + Y Sbjct: 121 TMDIGNTDVTMVLDTTGSMSQTLSGTTTKRIDALRTAMKNFYDTVSAATTGSNARVRYSF 180 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 +N D + + + Y + Sbjct: 181 VPYSSSVNVGQLIYDLDPDYLVDTW--AIQSRTPVFNTVTEQILTGYDTPVTTTASSYSN 238 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 E D S A + + Sbjct: 239 ETTGNDQSYNSTRYNSLSACNTAKPADVAWANY 271 >gi|207029558|ref|NP_001125590.1| inter-alpha-trypsin inhibitor heavy chain H3 [Pongo abelii] Length = 879 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/309 (11%), Positives = 85/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + +P P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRPC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +KK D +N F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMKKEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 55/190 (28%), Gaps = 24/190 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + K Sbjct: 149 KIQNVKKTLEYLLELLGDND------RLCLILFNSYATRLCHLM-KTNNSNKPAFKEIIN 201 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M+ A+ + + LL+DG++ + Sbjct: 202 KIYSTGGTDINSGMELAFRVLKDR---------KYQNPVSSVFLLSDGQDGSADLRVRQS 252 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIG 446 + + I + F + ++ S +F+ +++++ F D +G Sbjct: 253 LERHLPQECFTIHSFGFGSDHDGP-----LMNKICSLKDGNFYYVEKINQVDEFFVDALG 307 Query: 447 NEIFERVIRI 456 I Sbjct: 308 GLFSVVAQEI 317 >gi|75070765|sp|Q5RB37|ITIH3_PONAB RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|55728562|emb|CAH91023.1| hypothetical protein [Pongo abelii] Length = 876 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/309 (11%), Positives = 85/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + +P P Sbjct: 181 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRPC-----P 234 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 235 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 294 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +KK D +N F+ V + +TF Sbjct: 295 -RKLEQTKEALLRILEDMKKEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 347 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 348 VKSMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRPE 403 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 404 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 463 Query: 445 IGNEIFERV 453 + N + V Sbjct: 464 VANPLLTGV 472 >gi|328953619|ref|YP_004370953.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] gi|328453943|gb|AEB09772.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] Length = 333 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 75/246 (30%), Gaps = 11/246 (4%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL++ +G I AL + ++ G + D+ + L++ A + + L+ Sbjct: 11 KLVRDASGAVAISFALTLAILCGFVALAFDIGHLVMVKAELQRTADAGALAGATGLVPYT 70 Query: 73 E----------EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + S+A+ + + ++ + + + + + Sbjct: 71 GPVTSQTPDWLQGQSKAQTLINNTANLADNQVFTTTDSAVTYGYWLLNPPEGYVQSLPTV 130 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 P SAY + R L N F +G+ S A A + ++ V Sbjct: 131 RPTTSAYLPQPAIRVTLSRNVDLYFAPLLGVSS-PKTVNATATAILPETYRTKNTPPIAV 189 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + ++ + + +G+ +++ + S ++Y Sbjct: 190 ARDIVYDIIGDSVVINVDEQTITPRSNAGSAGWFNLSGENSAPSVRINEALTSPTSAIYL 249 Query: 243 MLYPGP 248 + Sbjct: 250 LPGTKA 255 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 54/194 (27%), Gaps = 38/194 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +D ++ I N +G F + + L+ + Sbjct: 109 NRLEASKDVASAFI-------NGRPNDNIGLVVFAAESFTQCPLTTDHTVLL--NLFKDV 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + A I S K I+LLTDG N Q + Sbjct: 160 QPGIIQDGTAIGLGLANAVSRIKDSQAKS-----------KVIILLTDGVNNQGEIAPVT 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFE 430 AK+ G+R+ TI E L A + +F Sbjct: 209 AAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDVEIDEPTLKQIAATTGGQYFR 268 Query: 431 ANSTHELNKIFRDR 444 A L +I+ + Sbjct: 269 ATDNASLKEIYSEI 282 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 53/175 (30%), Gaps = 33/175 (18%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G + + + KLI + TAI + TA + Sbjct: 126 KDRPNDRIGLVIYAGESYTKTPVT--TDKLIILNALSEITYGQIEDGTAIGMGLATAVNR 183 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K + + I+LLTDG N + A GI++ T+ N Sbjct: 184 L-----------KESKAKSRVIILLTDGVNNTGFIDPQTAAELAAEYGIKVYTVGIGTNG 232 Query: 410 ------------------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 Q + + A + +F A + +L +I+ + Sbjct: 233 MALSPYALNADGSIIYRMQQVDIDEPLMKKIAQVTKGRYFRATNNQKLQQIYDEI 287 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 37/191 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A +V ++ +G T F + + LI ++++ D G Sbjct: 111 EAAKNVASEFIADRPNDN---IGLTIFAGEAFTQCPMTTDHVSLI-NLLQSVRTDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + S K ++LLTDG N + + Sbjct: 167 LISDGTAVGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNMGDISPMTSAQ 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANS 433 A+S GIR+ TI NK E L + A + +F+ A + Sbjct: 216 IARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGNFYRATN 275 Query: 434 THELNKIFRDR 444 +L +I++D Sbjct: 276 NRQLKQIYKDI 286 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 62/197 (31%), Gaps = 39/197 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVK 325 + ++ A I +G T F + + L+ R + Sbjct: 108 NRLEAAKNVAAEFIS-------GRPNDNIGLTIFAGESFTQCPMTTDHASLLNLLRNVRT 160 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A TA+ + A + S K ++L+TDG N + Sbjct: 161 DIAARGLISDGTAVGMGLANAVSRLKDSKAKS-----------KVVILITDGSNNMGDIS 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNS 427 + A+S GIR+ TI NK E LS+ A + + Sbjct: 210 PMTSAQIAQSLGIRVYTIGVGTNKVAPYPMNVGGTTQYVNIPVEIDSKTLSDIAAVTEGN 269 Query: 428 FFEANSTHELNKIFRDR 444 F+ A + EL +I+ D Sbjct: 270 FYRATNNKELKQIYNDI 286 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 59/198 (29%), Gaps = 24/198 (12%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + +V+ I ++G F + + Sbjct: 97 SGSMSTQDMAGNANRLQVVQKVAGDFI-------RRRHGDQVGLILFGTQPYLQAPLTTD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ + + + + TAI DA+ A + + + + ++LLT Sbjct: 150 LNTVGQFLDEAMI--GVAGTQTAIGDAIGLAIKRLRDATNVSGRKGET------VLILLT 201 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------KTQQEKARYFLSNCA--SPN 426 DG N A + G+RI TI + + L A + Sbjct: 202 DGSNDAGAMPPDEAAKMAAAAGLRIYTIGVGSDQTDPFGMGGANDLDEDTLKLIAKTTGG 261 Query: 427 SFFEANSTHELNKIFRDR 444 +F A L +++ Sbjct: 262 EYFRATDVENLQQVYTRI 279 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 70.0 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 53/382 (13%), Positives = 101/382 (26%), Gaps = 39/382 (10%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 L + G +ITALL+PV++G G+ +D+ + ++ A A+ + L Sbjct: 4 NLGRHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQG 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +++ + + N D I NP S Sbjct: 64 LAMTTA--------------------NSFITSNGFDPNDATITYTQDTVNNPAGSP---E 100 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++ + L L S + K +T + + + S + Sbjct: 101 INCSMTNQVPTFFLGLFGYPNISITVSAKGILQTGGAGGPFNYAIFSDQNLPISGNQKIT 160 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 L K + G G+R S + S ++ + Sbjct: 161 GSVHTNHQLT-----ISGNTKISGAAEGATGVRLSG-SNTIGSVQADTTANIHVSGSNTI 214 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S+ ++ + A +V S K + ND + + + Sbjct: 215 GSQSGGATQIAMPDFTSQIEAAAQAQGTVYYSAKILSGTNDLDGNIWVQGDITMSGMTTS 274 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G I + + +A I S + I+ Sbjct: 275 GTGAILANGNITISGNTTI----GGNGQVCLYSANGNITVSGSFNNGSNGSE------II 324 Query: 373 LLTDGENTQDNEEGIAICNKAK 394 +G T I C AK Sbjct: 325 YAPNGSVTISGNSTINGCIVAK 346 >gi|166796269|gb|AAI59125.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 973 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 49/373 (13%), Positives = 99/373 (26%), Gaps = 22/373 (5%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 N + K + + + V L +LL L ++ ++ IK Sbjct: 163 GNELGSDKILGNGDNGDNGMELFKASASIPGRNKAVFLLMYEELLQRRLGMYEHTISIKP 222 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML----DYQRDSEGQPLNCFGQPADRT 211 + + E + + + + RS + N F + Sbjct: 223 QQLVGRLSVEVNIFENSGITALEVPPLRNKQRSKNTDTISPPPSTAINQTNNFARITFNP 282 Query: 212 VKSYSSQNGKVGIRDEKLSPY-------MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + ++ + GI + + Y + Y ++ V Sbjct: 283 SVVHQAKLSQNGILGDFVIRYDVQRELSVGDLQVLNGYFVHYFAPTNLPPLPKNVVFVID 342 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTI 323 + ++ ++ +K + F+ RV V Sbjct: 343 TSASMLGNKMKQTKEALFTILKDLRP---QDHFNIIGFSKRVKVWQQNQMVKVSPNNIRD 399 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K F G T IN+ + + + + ++ ++ LTDG T Sbjct: 400 AKKFIYSLYPTGETNINEGIHVGAQLLNNYLASN----GKHEKSVSLMIFLTDGRATIGE 455 Query: 384 EEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 E I K+ + +I F + + L NC E K Sbjct: 456 IESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDAASQLKG 515 Query: 441 FRDRIGNEIFERV 453 F D IG + + Sbjct: 516 FYDEIGTPLLSDI 528 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 54/173 (31%), Gaps = 31/173 (17%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N +G F + + L+ + ++ TAI + TA + Sbjct: 122 NNRPNDNIGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDD--GTAIGMGLATAVNR 179 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + S + ++LLTDG N + + A++ GIR+ T+ Sbjct: 180 LKDSKAKS-----------RVVILLTDGSNNMGDITPRMAADIARTFGIRVYTVGVGTRG 228 Query: 410 TQQ----------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A S +F A LN+I+++ Sbjct: 229 EAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEI 281 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 58/168 (34%), Gaps = 34/168 (20%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG----STAINDAMQTAYDTIISSN 354 T F + + L+ ++ D G TA+ + A + Sbjct: 131 LTIFAGESFTQCPLTIDHGSLL-NLLNNVRTDIAARGLIQDGTAVGMGLANAVSRL---- 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 K + K ++LLTDG N + + A+S GIR+ TI NK Sbjct: 186 -------KGSKAKSKVVILLTDGSNNMGDISPLTAAQIARSLGIRVYTIGVGTNKVAPYP 238 Query: 413 --------------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E LS+ A + +F+ A + EL +I+RD Sbjct: 239 MPVAGGVQYVNMPVEIDTKTLSDIAAITEGNFYRATNNRELKQIYRDI 286 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 60/193 (31%), Gaps = 24/193 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ ++ D R+ F+ H + ++ Sbjct: 275 TKMALLKQAMGFVIQHLRPSD------RLSVIAFSSTARRLFPLQRMSHHGRQQALQA-I 327 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I + + I+LL+DG++T + Sbjct: 328 NSLGAGGGTNIADALKKAVKVIADR---------SYKNSVCSIILLSDGQDTYNISSNFQ 378 Query: 389 ICNKAKSQGI------RIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKI 440 + + + + + + L + AS +F + Sbjct: 379 GTSAGRRSLVPSANPNELHMVPLHTFGFGADHDSDTLHSISEASGGTFSFIEDEGVMQDA 438 Query: 441 FRDRIGNEIFERV 453 F IG + V Sbjct: 439 FAQCIGGLLSVAV 451 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 75/218 (34%), Gaps = 30/218 (13%) Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 +D+S + + ++ + D R+G +N V++ Sbjct: 45 GYDAVFVLDTSYSMRD-TDPEGISAEVINMFMDLSDAD----RTRVGFVAYNHNVVASKP 99 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + ++ ++ N G T + ++ + + + ++ Sbjct: 100 LTSIAVAAQKSQIQQDIRTLNRSGYTDLGLGLRRGSELLAAGASQGRQP---------FM 150 Query: 372 VLLTDGENTQD-----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +LL+DGE N + ++ A+++G + TI + + T R L Sbjct: 151 ILLSDGETDFGASSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDGTVN---RQELE 207 Query: 421 NCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 AS + F +S +L +I ++I +++ I Sbjct: 208 RIASQTGGASFITSSAEDLPEILNRIFASQIRSKLVPI 245 >gi|116284252|gb|AAI24051.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 954 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 49/373 (13%), Positives = 99/373 (26%), Gaps = 22/373 (5%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 N + K + + + V L +LL L ++ ++ IK Sbjct: 144 GNELGSDKILGNGDNGDNGMELFKASASIPGRNKAVFLLMYEELLQRRLGMYEHTISIKP 203 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML----DYQRDSEGQPLNCFGQPADRT 211 + + E + + + + RS + N F + Sbjct: 204 QQLVGRLSVEVNIFENSGITALEVPPLRNKQRSKNTDTISPPPSTAINQTNNFARITFNP 263 Query: 212 VKSYSSQNGKVGIRDEKLSPY-------MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + ++ + GI + + Y + Y ++ V Sbjct: 264 SVVHQAKLSQNGILGDFVIRYDVQRELSVGDLQVLNGYFVHYFAPTNLPPLPKNVVFVID 323 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTI 323 + ++ ++ +K + F+ RV V Sbjct: 324 TSASMLGNKMKQTKEALFTILKDLRP---QDHFNIIGFSKRVKVWQQNQMVKVSPNNIRD 380 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 K F G T IN+ + + + + ++ ++ LTDG T Sbjct: 381 AKKFIYSLYPTGETNINEGIHVGAQLLNNYLASN----GKHEKSVSLMIFLTDGRATIGE 436 Query: 384 EEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 E I K+ + +I F + + L NC E K Sbjct: 437 IESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDAASQLKG 496 Query: 441 FRDRIGNEIFERV 453 F D IG + + Sbjct: 497 FYDEIGTPLLSDI 509 >gi|229587743|ref|YP_002869862.1| hypothetical protein PFLU0165 [Pseudomonas fluorescens SBW25] gi|229359609|emb|CAY46451.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25] Length = 551 Score = 70.0 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 46/382 (12%), Positives = 105/382 (27%), Gaps = 27/382 (7%) Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + + T L + E + V+ + ++ S+ Sbjct: 22 AAQGFAVTMVVALAGCGLSSSPELAKPAEPVAELQTASPQGALVKRMAMPAPMRMQESAA 81 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 D P + + T S +++ ++ S+ Sbjct: 82 MDYRSEPREQYANL-PDNPVHRVAETPVSTFSVDVDTGSYANVRRFLNQ-GSLPPEGAVR 139 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + +N F + +P + P + Sbjct: 140 LEEMVNYFPYHYALPTDGSP-FGVTTEVAATPWNPRTQLLRIGIKASDRPVAELAPANLV 198 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 VD S + LV+ L ++ +++ D V+ + + G Sbjct: 199 FLVDVSGSMDRREGLPLVKSTLKLLVDQLREQDRVS------LVVYAGESRVVLKPTSGR 252 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 K+ ++ GSTA ++ AY D I+L TD Sbjct: 253 DKVK---IRNAIDQLTAGGSTAGASGIELAYQMAREGFIDNGINR---------ILLATD 300 Query: 377 GENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 G+ + + + + G+ + T+ F V+ + A ++ ++ Sbjct: 301 GDFNVGISDFDSLKQMAVEQRKSGVSLTTLGFGVDNYNEHLMEQL--ADAGDGNYAYIDN 358 Query: 434 THELNKIFRDRIGNEIFERVIR 455 E +K+ D++ + + V R Sbjct: 359 LREAHKVLVDQLSSTLAV-VAR 379 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 67/243 (27%), Gaps = 35/243 (14%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +R ++ + + + VD S+ K +A V Sbjct: 58 ILRLFTFVMVVIILARPQTSNSWKNTQVEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVA 117 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH---KLIRTIVKTFAIDENEMGSTAI 339 ++ +G T F + + L I A TA+ Sbjct: 118 SEFIAGRPNDN---IGLTIFAAEAFTQCPMTTDHAVLLNLFHGIKTDMAQRGMIQDGTAV 174 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + A + S K I+LLTDG N + + AKS GIR Sbjct: 175 GMGIANAVSRLKDSKAKS-----------KVIILLTDGTNNAGDISPLTAAEIAKSFGIR 223 Query: 400 IMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFEANSTHELNKIF 441 + TI N E L+ A F+ A +L ++ Sbjct: 224 VYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDNKKLEDVY 283 Query: 442 RDR 444 +D Sbjct: 284 KDI 286 >gi|294055720|ref|YP_003549378.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293615053|gb|ADE55208.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 326 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 55/173 (31%), Gaps = 17/173 (9%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 D L++V + + + R+G F F+ ++ + +T Sbjct: 120 DRLSAVKGVLDEFLTRREGDRVGLIVFGSAAFVQVPFTQDLNACRILLEETTVR--MAGP 177 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T DA+ + + ++ I+ LTDG +T A Sbjct: 178 RTVFGDALGLGITLF-----------ERSEVEERVIIALTDGNDTGSRVPPAEAAKIAND 226 Query: 396 QGIRIMTIAFSVN--KTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 +++ + + L+ AS +F AN+ EL I+ + Sbjct: 227 NTVKVHVVGVGDPTTTGEDVLDEEALNAVASTTGGRYFHANNREELEDIYTEL 279 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 54/183 (29%), Gaps = 33/183 (18%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+G F + + + + + + TAI DA+ T+ + + Sbjct: 94 NRVNDRIGLVVFAGAAYTQAPLTLDYGVVREVLKQIRTRVLED--GTAIGDALATSLNRL 151 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S + +VL+TDG+N + + A+S I I TI Sbjct: 152 RDS-----------EAKSRVVVLITDGDNNAGKISPLDAASMAESLKIPIYTILVGKGGK 200 Query: 411 QQEK------------------ARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIF 450 L + AS ++ A L + + + + Sbjct: 201 VPFPQGQDLFGNTVWRDTEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLER 260 Query: 451 ERV 453 ++ Sbjct: 261 SKL 263 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 72/286 (25%), Gaps = 24/286 (8%) Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + T S +++ ++ + LN F Sbjct: 125 EVPVSTFSVDVDTGSYANVRRML-RDGYRPPADSVRVEEMLNYFDYGHPAPASREVPFKV 183 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 + +P + P + VD+S K L++ A A Sbjct: 184 TTELAPAPWNPARQLLMVGIKGYDVDKRELPPANLVLLVDTSGSMDDPAKLPLLKRAFAQ 243 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ D V+ + + G + + GST Sbjct: 244 LVPQLRAKDRVS------IVAYAGHAGLVLPPTPGNRH---GEILAALEGLHAAGSTNGG 294 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQG 397 + ++ AY N I+L TDG+ D + + + G Sbjct: 295 EGLRLAYAM----ARQGHVEGGVNR-----ILLATDGDFNVGITDRNALLTLVADQRRSG 345 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I + T+ F A A ++ E + Sbjct: 346 IALSTLGFGSGNYNDAMAERL--ADAGNGQHLYIDTLDEARRALVQ 389 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 60/187 (32%), Gaps = 31/187 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D+S+ + +DA +I + D V F+ + ++ Sbjct: 104 DTSASMND-DAFTAAQDAAYGLINGLSPEDKVG------LITFDKTARVIEPLAQDHARV 156 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + TA+ + A + K IVL+TDG N Sbjct: 157 QES-----IQKLSRSVGTALYQGLSLAAQEVAKGQN------------TKAIVLMTDGFN 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 T N KA+ G + T+ F ++ L A + +F A + +L Sbjct: 200 TSRNTTLEEAVAKAQEVGASVFTVGFG-----KKVDTQGLQKIANETGGEYFSAPTNAQL 254 Query: 438 NKIFRDR 444 ++F D Sbjct: 255 RRVFADI 261 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 37/191 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A +V ++ +G T F + + LI ++++ D G Sbjct: 111 EAAKNVASEFIADRPNDN---IGLTIFAGEAFTQCPMTTDHVSLI-NLLQSVRTDIAARG 166 Query: 336 ----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 TA+ + A + S K ++LLTDG N + + Sbjct: 167 LISDGTAVGMGLANAVSRLKDSKAKS-----------KVVILLTDGSNNMGDISPMTSAQ 215 Query: 392 KAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--SPNSFFEANS 433 A+S GIR+ TI NK E L + A + +F+ A + Sbjct: 216 IARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGNFYRATN 275 Query: 434 THELNKIFRDR 444 +L +I++D Sbjct: 276 NRQLKQIYKDI 286 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 64/462 (13%), Positives = 137/462 (29%), Gaps = 90/462 (19%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 T + L + G I ALL ++L + +D+ + A + Sbjct: 1 MYTGIKNRIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSL 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + S+ + E R F K+ + ++ V+ Sbjct: 61 FAVTTFRKYVADGMSKNQARKR------AETDARKFLTARTKSLDGTTEKFSIKINIVDR 114 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V +S +++ + F A++ S + ++ Sbjct: 115 EAKVVKANVNISGKHESYMTHAMGF----------DNIDYTADSESTISFGQGKYEFIFL 164 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM + + Q + + Sbjct: 165 VDVSPSMGIGASNRDRQIMQRAIGCQFACHE----------------------------- 195 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 V + + +V+DAL S++ +++ V +R G F Sbjct: 196 -----------PWYSSVSRAKSAGARLRIDVVKDALKSLVTQLEEATEV--DLRTGLYSF 242 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++ + + G+ K R K E G T + + S + + Sbjct: 243 SNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTN--------FHGVFSDFNGVLRSL 294 Query: 362 KNNLEAKKYIVLLTDGENT--------------QDNEEGIA------ICNKAKSQGI-RI 400 K + K++I++++DG N N C++ K + + Sbjct: 295 KPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFKKGEVRTV 354 Query: 401 MTIAFSVN-KTQQEKARYFLSNCA-SPNSFFEANSTHELNKI 440 T+ + + + CA S + F+ ANS E++K Sbjct: 355 HTMLVEPDRAHYVRASTSSMRACATSADFFYSANSAAEIDKA 396 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 60/194 (30%), Gaps = 35/194 (18%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + ++A+ ++I + D G F+ + S + ++ Sbjct: 105 DYQPDRITAAKEAIGTLINQLDLKDY------AGIITFDSGASTAAYLSPD----KQRVI 154 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + + STAI D + A D K+ K ++LL+DGE+ Sbjct: 155 EKLGMIAASDDSTAIGDGLALAVDM-----------SKSIPNRKSVVILLSDGESNAGYV 203 Query: 385 EGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA--SPNSFFE 430 AK G+++ T+A N L A + F+ Sbjct: 204 SPETAAEFAKESGVQVFTVAMGSSEKVLVGYDWANNPQYATVDEETLEYIADSTGGGFYS 263 Query: 431 ANSTHELNKIFRDR 444 + L I+ Sbjct: 264 SVDEKTLGNIYSQL 277 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 66/224 (29%), Gaps = 56/224 (25%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ +I+++ D R+ F+ + R Sbjct: 300 SKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIFPLR-RMSDNGREAAGLAI 352 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 G T I + ++ + ++ I+LL+DG++T + Sbjct: 353 NSLTSSGGTNIVEGLKKGVRVL---------EERSEQNPVASIILLSDGKDTYNCDNVNR 403 Query: 383 ------------------NEEGIAIC-------NKAKSQGIRIMTIAFSVNKTQQEKARY 417 N +IC ++ + I + T F + Sbjct: 404 RQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHD-----ST 458 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF--ERVIRIT 457 + + S +F S + F IG + + +R+T Sbjct: 459 AMHAISDESGGTFSFIESVATVQDAFAMCIGGLLSVVAQELRLT 502 >gi|332860822|ref|XP_001152090.2| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Pan troglodytes] Length = 1312 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 47/352 (13%), Positives = 94/352 (26%), Gaps = 24/352 (6%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + L L R G ++ + S Sbjct: 132 ISTSLAAGTEVTFSLAYEELLQRHQGQYQLVVSLRPGQLVKRLSIEVTVSERTGISYVHI 191 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + P+ R + + + S S + + + Y Sbjct: 192 PPLRTSRLRTNAHASEVDSPPSTRIERGETCVRITYCPTLQDQSSISGSGIMADFLVQYD 251 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKK------------KHLVRDALASVIRSIKKIDNVNDT 294 ++ + + D + + + + + + K NV + Sbjct: 252 VVMEDIIGDVQIYDDYFIHYFAPRGLPPMEKNVVFVIDVSSSMFGTKMEQTKMAMNVILS 311 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYD 348 +FN SD W I+ + K + G T IN A+ A Sbjct: 312 DLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAAS 371 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + SN++ I+ LTDGE T I + + + + ++AF Sbjct: 372 VLNHSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAF 429 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + R L N +E K + I + V R+ Sbjct: 430 GDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLADV-RLN 480 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 55/183 (30%), Gaps = 33/183 (18%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+G F + + L + + + TAI DA+ T+ + + Sbjct: 128 NRVNDRIGLVVFAGAAYTQAPLTLDYGVLKEVVKQLRTRVLED--GTAIGDALATSLNRL 185 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S + +VL+TDG+N + N A++ + I TI Sbjct: 186 RDS-----------EAKSRVVVLITDGDNNSGKISPMDSANMAQALKVPIYTILVGKGGK 234 Query: 411 QQEK------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 + + A + ++ A +L + + + + Sbjct: 235 VPFPQGTDLFGNTVWRDTEIPINPELMQDIADRTGGEYYRATDPEQLREGLQKVLDSLER 294 Query: 451 ERV 453 ++ Sbjct: 295 SKL 297 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 37/296 (12%), Positives = 78/296 (26%), Gaps = 27/296 (9%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + S + + +N F + I Sbjct: 122 STFGLDVDTGSYSNVRRLLNDGRLPPADAVRAEAFINYFDYGYPAPATPAVPFSLTTEIA 181 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 +P + P+++ +D+S K L++ AL ++ + Sbjct: 182 PAPWNPQRQLLLVGIQGYRVAPQDIPAVNLVLLIDTSGSMADRAKLPLLKSALRQLVTQM 241 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + D R+ + + G + GST ++ Sbjct: 242 RAQD------RVAIVAYAGSAGLVLPSTPGDRHAQ---ILAAIDGLQASGSTNGGAGLEL 292 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 AY + IVL +DG+ + + + + +GI + T Sbjct: 293 AY---------AEAAKGLVKDGVNRIVLASDGDFNVGRTDLAQLKDYVGSQRKRGIALTT 343 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + A A S+ +S L + R +E+ ++ I K Sbjct: 344 LGLGSGNYNDAMAMQL--ANAGDGSYHYIDS---LLQA-RKVFASELSATLLTIAK 393 >gi|187927679|ref|YP_001898166.1| hypothetical protein Rpic_0583 [Ralstonia pickettii 12J] gi|187724569|gb|ACD25734.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 414 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 32/332 (9%), Positives = 82/332 (24%), Gaps = 13/332 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G + L++ V+LG+ G+++D+ + L+ A + ++A+ L + + + Sbjct: 7 QRERGAVLPMVGLMLAVLLGMAGLVIDLSGLFVAKTELQSAVDSCALSAAQELDGASDAL 66 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + T Y + + V + S Sbjct: 67 TRATNAGVTAGNANRVVYQASSASLANADVTFSTALNGTYSAAGVASSSSSYVKCKHSKS 126 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-------------SRSYHKEHGVSIQWV 182 L S + + + T+ A++ + + + Sbjct: 127 GITARLIKFVGGANSYAVAALAVATRVHAQSTCPIPVALIPKTGGTAPNYGFQVGEWVSM 186 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + S + + + + + V N + + Sbjct: 187 LYSGGSTTPGEMGWYNLDGSTNANETKTEMDQGYCNSKIGDQLGTPGAKVSVDDNWNARF 246 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 +Y + S + F S + K R+ + Sbjct: 247 GIYKNKSNASQLQPDFTGYSYTSTNWTNTVPQNAYSGTPASGSDKSAANFKNKRLAYANY 306 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D S ++ K A + Sbjct: 307 DDTGTSVSGGDKITGLNVKGGYKNLATSGSSG 338 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 66/185 (35%), Gaps = 20/185 (10%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 I + V+ L R+G F D F+ + +R + Sbjct: 132 SDEYISRLDAVKRVLHEFAEQ-------RQGDRLGLILFGDAAYLQAPFTADLASWLRLL 184 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 ++ ST + DA+ A + S + + K ++LLTDG +T + Sbjct: 185 DESRVA--MAGQSTHVGDALGLAIKVMSS-------DEIKSSQKNKVVLLLTDGNDTDSS 235 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + A +GIR+ IA +Q + A + F+A ST ELNK Sbjct: 236 VPPLEAAKIAAKKGIRVHVIAIGDPQTVGEQAMDMEVIEGVAALTGGKAFKAISTQELNK 295 Query: 440 IFRDR 444 +++ Sbjct: 296 VYQTI 300 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 49/170 (28%), Gaps = 32/170 (18%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 G F + + L + TAI + TA + I + Sbjct: 124 NDNFGLVVFAGESFTQCPITSNHASL--VNLFKQVDFGIIQDGTAIGLGLATAINRIKDA 181 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 K I+LLTDG N + I+ A+S GIR+ TI + Sbjct: 182 -----------EGKSKVIILLTDGTNNTGDIAPISAAQIAQSYGIRVYTIGVGTQGIAEV 230 Query: 414 K-----------------ARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 L AS +F A + L +I+ + Sbjct: 231 PMLDQFGNIHYTEAEVVIDETTLQQIASTTGGKYFRATNVSSLKQIYSEI 280 >gi|194391096|dbj|BAG60666.1| unnamed protein product [Homo sapiens] Length = 698 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/309 (11%), Positives = 86/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D +N T+ F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMKEEDYLNFTL------FSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 56/171 (32%), Gaps = 23/171 (13%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + + V++ + +NDR D + ++ G Sbjct: 168 QAAIRIFTDLLATTPVDEQIG--LASYNDRASEDVQLTENFAEVN-----NAMDRLRTGG 220 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T+I+ MQ + + ++ ++++TDG + + E + + Sbjct: 221 FTSISRGMQAGQEIALRGRP--------PEFVERTMIVMTDGRHNRG-PEPRVVATDLAA 271 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 G+ I TI F + + A F A + +L I+R+ Sbjct: 272 DGVTIHTITFGAGADFGR-----MQDVARIGGGRHFHATNGDQLRDIYREI 317 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 52/190 (27%), Gaps = 9/190 (4%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 ++ A++M + L V +D+ + L+ S + + Sbjct: 1 MLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRS---------STDAAANAAATTLADTL 51 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 Q+ ++ N N D + + D V +A + + Sbjct: 52 DRNLAIQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNAGEAPFNGVRVNGQ 111 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + + + + E + + I V+D S SM + + + Sbjct: 112 RTTGSLSGPVPLFFGNVTGTSIFEPEAFATATYVERDITLVVDRSGSMAGSRFNDLQAAI 171 Query: 202 NCFGQPADRT 211 F T Sbjct: 172 RIFTDLLATT 181 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/203 (12%), Positives = 55/203 (27%), Gaps = 34/203 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + ++ Sbjct: 280 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLRRMSDTGRQHALQA-V 332 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-----DN 383 G T I + ++ + I+LL+DG++T Sbjct: 333 NALVANGGTNIAEGLRKGAKVMEDR---------REKNPVASIILLSDGQDTYTVSGNGG 383 Query: 384 EEGIAICNKAK--------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + + G I + + + + S +F + Sbjct: 384 NQPQPNYQLLLPVSIHGGDNAG---FQIPVHAFGFGADHDASSMHSISEISGGTFSFIET 440 Query: 434 THELNKIFRDRIGNEIFERVIRI 456 + F IG + V + Sbjct: 441 EAVIQDAFAQCIGGLLSVVVQEL 463 >gi|119585666|gb|EAW65262.1| inter-alpha (globulin) inhibitor H3, isoform CRA_a [Homo sapiens] Length = 670 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D +N F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|170742540|ref|YP_001771195.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] gi|168196814|gb|ACA18761.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] Length = 303 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/273 (11%), Positives = 70/273 (25%), Gaps = 27/273 (9%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + G I+ A +MP +L G +D+ R Y ++ A A++ Sbjct: 8 RVRRAVPGWRSDARGTIGIMFAGMMPAVLLAIGCGIDLQRALAYRGKVQAALDGAVMAVV 67 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 F +Q + +F L + I R T + Sbjct: 68 GNTS-----------FDADFGRQAFKT--SASFAYALDGAAPGSDPLTITRLTFTQNPDG 114 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS---RSYHKEHGVSIQWV 182 + + + + +G++S + ++ A+ + S +S Q Sbjct: 115 TVTAEATAT--------MRTTVVSIIGVRSLDLSFRSTAKGTTTLKISTITFKVLSAQGA 166 Query: 183 IDFSRSMLDYQRDSEGQPLN---CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 D + + + + S N + + Sbjct: 167 FDKDIYFFTRDGNGNLLSESLVLQYDYNYPAGGTATKSYNPPPPQTFTQPVSNYKTYGFK 226 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + P ++ + SS + Sbjct: 227 MVVYKDPSYRGLHVNPSVYYSDSSNSSQWIRIQ 259 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 66/224 (29%), Gaps = 56/224 (25%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ +I+++ D R+ F+ + R Sbjct: 218 SKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIFPLR-RMSDNGREAAGLAI 270 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 G T I + ++ + ++ I+LL+DG++T + Sbjct: 271 NSLXSSGGTNIVEGLKKGVRVL---------EERSEQNPVASIILLSDGKDTYNCDNVNR 321 Query: 383 ------------------NEEGIAIC-------NKAKSQGIRIMTIAFSVNKTQQEKARY 417 N +IC ++ + I + T F + Sbjct: 322 RQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHD-----ST 376 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF--ERVIRIT 457 + + S +F S + F IG + + +R+T Sbjct: 377 AMHAISDESGGTFSFIESVAXVQDAFAMCIGGLLSVVAQELRLT 420 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 55/190 (28%), Gaps = 35/190 (18%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH---KLIRTIVKTFAIDEN 332 +A V ++ +G T F + + L I A Sbjct: 111 EAAKQVASEFIAGRPNDN---IGLTIFAAEAFTQCPMTTDHAVLLNLFHGIKTDMAQRGM 167 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 TA+ + A + S K I+LLTDG N + + Sbjct: 168 IQDGTAVGMGIANAVSRLKDSKAKS-----------KVIILLTDGTNNAGDISPLTAAEI 216 Query: 393 AKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS--PNSFFEANST 434 AKS GIR+ TI N E L+ A F+ A Sbjct: 217 AKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDN 276 Query: 435 HELNKIFRDR 444 +L +++D Sbjct: 277 KKLEDVYKDI 286 >gi|148253705|ref|YP_001238290.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] gi|146405878|gb|ABQ34384.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] Length = 409 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 26/275 (9%), Positives = 72/275 (26%), Gaps = 2/275 (0%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F ++ + G +I A+ +L G VD + S + L+++ A++ V Sbjct: 10 FRNLCRRFGWASDGQISMIFAIASIPILVSVGAAVDFAKSSDVKAQLQKSIDAAVLAGVV 69 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 ++ + + + T + + Sbjct: 70 RPNDQQISTAAAVFSGAYRGRFDTAATASFASNTDGSLTGTATTSVKTSFLNVMGTSALG 129 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + L + ++ +S L+ + A+ S H S + + + Sbjct: 130 VTASATAKAGAQAQSSVCILLVSTVNAQSLLVNSGAQLNAPSCEIHVLSTQSPAAMFNAT 189 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + N P T + S + + + N ++ Sbjct: 190 LNAKRICIKGSNITKNGGVNPPVETGCAAISDPFVGKLPAVTVG--SCTTNNKVFDPGAV 247 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + S + ++++ + Sbjct: 248 TLNPGVYCGATNFNGSGTLTLNPGLYIIKGGAMTF 282 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 23/232 (9%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--- 271 + ++G Y C + + + + Sbjct: 451 NTVEHGCEYQCVSTPGSYHCICPEGHVLQDNGKSCGTCRTSNIDLAPLIDGSKSVRPQNF 510 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + R+G ++ RV ++ S ++ I K E Sbjct: 511 ELVKQFVNQVVDQLDVSAKGT---RVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 565 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + ++ TDG +QD+ Sbjct: 566 YMEKGTMTGLALKHMVENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAK 619 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KAK GI + + + L AS FF + +++I Sbjct: 620 KAKEAGITMYAVGVG------KAVEDELREIASDPVEKHFFYSADFTAISQI 665 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 49/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+GA ++ +V + S I T Sbjct: 45 MIDIIHELDIGLAATRIGAVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 102 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 103 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 153 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 154 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 187 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 65/187 (34%), Gaps = 26/187 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + V+ + + DT R+G F D + + L Sbjct: 102 DFAIDGQRVNRLTAVKKVVHDFV-------KRRDTDRIGLVVFGDYAFTQAPLTLDKGLL 154 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + I TAI DA+ A I K ++LL+DGEN Sbjct: 155 LNLIENLRI--GMAGRKTAIGDALGVAGKRIKD-----------IPAMSKVVILLSDGEN 201 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 T + + GI+I TI ++ L+ A ++ A++T +L Sbjct: 202 TAGDMTPQGAAEALAALGIKIYTIGMGT----EQAGSKELAQIAAIGQGKYYHASNTEQL 257 Query: 438 NKIFRDR 444 + I+++ Sbjct: 258 DSIYKEI 264 >gi|255587116|ref|XP_002534143.1| protein binding protein, putative [Ricinus communis] gi|223525789|gb|EEF28236.1| protein binding protein, putative [Ricinus communis] Length = 728 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 32/301 (10%), Positives = 76/301 (25%), Gaps = 34/301 (11%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + +Q S + + + P + + + + K Sbjct: 186 DDDESLDLQPAFSDRSSGNKTPDHNSQKSIEIKTYPEVPSASRSCAYDNFTVLVHLKAPA 245 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKID 289 + N + P + V + + K L++ A+ VI+++ D Sbjct: 246 TVTMQNPRINQASLPQLSQSPRAPVDLVTVLDISGSMAGTKLALLKRAMGFVIQNLGSND 305 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F+ + ++ G T I + ++ Sbjct: 306 ------RLSVIAFSSTARRLFPLRRMSDTGRQQALQA-VNSLVAHGGTNIAEGLRKGAKV 358 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK------------AKSQG 397 + I+LL+DG++T + N+ Sbjct: 359 MEDR---------REKNPVASIILLSDGQDTY--TVSSSGANQPQPNYHLLLPLSIHGGD 407 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + + + S +F + + F IG + V Sbjct: 408 TSGFQIPVHAFGFGADHDASSMHSISEVSGGTFSFIETEAVIQDAFAQCIGGLLSVVVQE 467 Query: 456 I 456 + Sbjct: 468 L 468 >gi|288563|emb|CAA47439.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 885 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 180 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 233 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 234 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 293 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D +N F+ V + +TF Sbjct: 294 -RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 346 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 347 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 402 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 403 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 462 Query: 445 IGNEIFERV 453 + N + V Sbjct: 463 VANPLLTGV 471 >gi|188583113|ref|YP_001926558.1| hypothetical protein Mpop_3896 [Methylobacterium populi BJ001] gi|179346611|gb|ACB82023.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 473 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 14/161 (8%), Positives = 44/161 (27%), Gaps = 3/161 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---I 69 +L G +I L ++G+ G ++ R L+ A ++ L + Sbjct: 22 RLRHESDGAVAVIFGLAASTLIGLVGGGIEYARVLSARTQLQSAVDAGVMAGGNALKLVV 81 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S + + + + + + + + V ++ Sbjct: 82 SSTDSIVGLTTQTIQTEAKAPADVPVTVQVTVAPDKTSVEARAEQVIKLTFGAFVGMASM 141 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + ++ ++ L + + A+ + Sbjct: 142 PISARAKASVVGRMRLCMLALDPAAAGAFNMERNAQVTAYD 182 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 63/190 (33%), Gaps = 33/190 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + ++ I+ R G F R I + + Sbjct: 109 TRLERAKREIIDLMGMIQS-------DRAGLVAFAGRAILQCPLTLDHSAFNLFLNALEP 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +G T + A++TA + E +K I+L+TDGENT + I Sbjct: 162 DYL-PVGGTDLGGAIETALNGFEKEVES-----------EKAIILITDGENTTGDS--IE 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + KA QG++I I + S F + S KI R+ Sbjct: 208 MAKKAADQGVKIFCIGVGSPEGAPVPD--------SAGGFKKDRSG----KIIISRVDEP 255 Query: 449 IFERVIRITK 458 +++ +T+ Sbjct: 256 ALKKIAAMTQ 265 >gi|218528581|ref|YP_002419397.1| hypothetical protein Mchl_0537 [Methylobacterium chloromethanicum CM4] gi|218520884|gb|ACK81469.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 480 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 26/262 (9%), Positives = 72/262 (27%), Gaps = 4/262 (1%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + + G +I AL ++G+ G +D R + ++QA ++ L Sbjct: 27 ASTFHQDRGGTVTVIVALAATTLMGLVGGAIDYARLVSAQRHIQQATDAGVMAGGNALKL 86 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP---RKS 127 + +S + + +I++ + V +V T S Sbjct: 87 VVSNTASVIGLTTQTIQDEIKDSAKNPVTIQVDVASDKTSVTAVVEQTVHLSFGPFVGMS 146 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 +V ++ ++ L + + A+ + S + Sbjct: 147 ESKVSAKAKASVVGKMRLCMLALDPAAAGAFNLEKSAQVTAYDCALYSNSSNSGGMVGRN 206 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + + + + + ++++P + SC M Sbjct: 207 NSMARAQTICSAGGFKDDRANFTPNPQTNC-PVITDPLADRVAPPVGSCVILPNLMQTLD 265 Query: 248 PLDPSLSEEHFVDSSSLRHVIK 269 + ++ V ++ Sbjct: 266 AVTGKSKGKNPVPTNYTLDPGT 287 >gi|189524901|ref|XP_693183.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Danio rerio] Length = 963 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 51/165 (30%), Gaps = 10/165 (6%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 ++ G F+D + S T K + N T IN + A D + Sbjct: 367 HEDDYFGLVTFDDVIESWRPSLSKATPENVTEAKEYVQTINARSMTDINKGILYAVDMLT 426 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 S M I+LLTDG+ + ++ I ++ + + + F + Sbjct: 427 SEKSASFPNMS-------MIILLTDGQPSSGEQDLSKIQENVRNAINGSMSLFCLGFGYD 479 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 N +EA+ + F + + + V Sbjct: 480 LDYILLDTLAKQNDGLARRVYEASDAALQLQGFYEEVATPLLIEV 524 >gi|119585667|gb|EAW65263.1| inter-alpha (globulin) inhibitor H3, isoform CRA_b [Homo sapiens] Length = 890 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D +N F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|329849361|ref|ZP_08264207.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841272|gb|EGF90842.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 505 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/289 (13%), Positives = 75/289 (25%), Gaps = 29/289 (10%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + + + + LN F + + Sbjct: 66 STFAVDVDTGAYANVRRLLNDGQRPPADAVRTEELLNYFRYDYPLPADKKQPFSITTEVT 125 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 +P L P P+ + +D S K LV+ AL V + Sbjct: 126 TTPWNPNSRLLRVGLRAYDVPRSERPAANLVFLIDVSGSMDEKDKLPLVQHALRLVADDM 185 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + D V+ + + V+ GSTA + + Sbjct: 186 RPRDRVS------IVVYAGAAGIVLEPT-----ANPAQVRRALGQLKAGGSTAGGEGIAL 234 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 A ++ N ++L TDG+ + AI + K GI + Sbjct: 235 A----YATARAAYIDGGINR-----VILATDGDFNVGISDPEAIKDLVRKNKDDGITLTA 285 Query: 403 IAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 + F + A ++ +S E K+ D + + + Sbjct: 286 LGFGTGNY----NEALMEGIADVGNGNYAYIDSASEARKVLDDELSSTL 330 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 67/202 (33%), Gaps = 30/202 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + I R+G F + + + L + Sbjct: 117 KSRLDAAKTVALQFIE-------NRRRDRIGLVLFKGKSFTQCPLTLDHDVLSMLVRAAS 169 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA A+ A + + +S ++ ++LLTDGE+ + + Sbjct: 170 VDAV-PESGTATGSAILIAVNRLRAS-----------ESPERVLILLTDGEHNAGEVDPV 217 Query: 388 AICNKAKSQGIRIM--TIAFSVNKTQQEKARYF--LSNCAS---PNSFFEANSTHELNKI 440 A +G+RI T++ +++ ++ LS S F AN + LN+ Sbjct: 218 TAAGIAAGEGVRIYMATVSVPGSRSGEDMLASARDLSGEVSRITGGRSFRANDANSLNRT 277 Query: 441 FRDRIGNEIF----ERVIRITK 458 F + E +R ++ T+ Sbjct: 278 FSEIDQLEKSRFTGQRTLQRTQ 299 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/380 (12%), Positives = 111/380 (29%), Gaps = 72/380 (18%) Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + Q +S ++ + + ++ + ++ + + + I V+ Sbjct: 56 AATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVL 115 Query: 184 DFSRSMLDYQR--------DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 D S SM + P++ + D + ++ G Sbjct: 116 DVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAG 175 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN----- 290 ++ Y GP + F S + ++DA+ + ++ + Sbjct: 176 QKYTVTYNWIGGPSASVSPDVQFYKSKQSE--ETRLDALKDAVTYFLDQVEDQNQRINDP 233 Query: 291 -------------------VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 NDT +N + S +W L + + Sbjct: 234 GKKVQVALIKYAGKNSDKIGNDTYNEDGYNYN-YSQTVHSLAWTPEDLQK--EQAAVNSL 290 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE------- 384 G+T + +Q A + S A+K V +DG T + Sbjct: 291 KAGGATRADFGLQHAVKQLNSGRPG----------AQKLTVFYSDGSPTSSDGFEAKIAN 340 Query: 385 EGIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCAS---------------PNS 427 I + K+ ++++I + + + A F++ +S + Sbjct: 341 NAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGT 400 Query: 428 FFEANST-HELNKIFRDRIG 446 ++ A S +L IF++ I Sbjct: 401 YYYAVSARTDLQTIFKEIIS 420 >gi|149728587|ref|XP_001492576.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H1 [Equus caballus] Length = 908 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 68/223 (30%), Gaps = 14/223 (6%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V+ N ++ + + + +D S+ K ++AL ++ ++ D Sbjct: 271 VTNNHFAHFFAPQNLTNLNKNLVFVIDISTSMQGQ-KVQQTKEALLKILGDMRPGDY--- 326 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F V S + F GST +N + + + + Sbjct: 327 ---FDLVLFGSGVQSWKGSLVPASAANLQAARDFVQRFTLEGSTNLNGGLLQGIEILNKA 383 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKT 410 + +++LTDGE T+ + I ++ + + F N Sbjct: 384 Q----GSLPEVSNHASILIMLTDGEPTEGVTDRSQILKNVRNAIRGKFPLYNLGFGQNVD 439 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F D++ + V Sbjct: 440 FNFLEVMSMENNGRAQRIYEDHDATQQLQGFYDQVATPLLVDV 482 >gi|84685162|ref|ZP_01013061.1| hypothetical protein 1099457000257_RB2654_09854 [Maritimibacter alkaliphilus HTCC2654] gi|84666894|gb|EAQ13365.1| hypothetical protein RB2654_09854 [Rhodobacterales bacterium HTCC2654] Length = 496 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 53/184 (28%), Gaps = 11/184 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + ++ + + G ++ AL++ ++ G+ + D R + + ++ A Sbjct: 1 MMRSRLASH----WRDEGGAVLVVFALMLAILSGIVALSYDFGRAAATQSEMQSFADNVA 56 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPK--QKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + A+ L + ++ + Q E + F A Sbjct: 57 LAAAGELDGGADALTRAQAAAEQLIADSQTYGEGPGALGAEDFVLTFYAVRPDASGEIDA 116 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 V ++ R + F S + A + + + ++ Sbjct: 117 TTTPEAAKYVSVRVADRAVTPI-----FGAVYAALSGNDAGRDRAGAHAVAGFTRYACNV 171 Query: 180 QWVI 183 ++ Sbjct: 172 TPLM 175 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 68.9 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 57/190 (30%), Gaps = 24/190 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ L ++ + D R+ FN + + + K Sbjct: 104 KIQNVKKTLEYLLELLGDND------RLCLILFNSKATKLCHLM-RTNNSNKPAFKEIIN 156 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN M+ A+ + + LL+DG++ + + Sbjct: 157 KIEANGGTDINSGMELAFRVLKDR---------KYHNPVSSVFLLSDGQDGSADLKVRQS 207 Query: 390 CNK-AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIG 446 + + I + F + ++ S +F+ +++++ F D +G Sbjct: 208 LERHLPQECFTIHSFGFGSDHDGP-----LMNKICSLKDGNFYYVEKINQVDEFFVDALG 262 Query: 447 NEIFERVIRI 456 I Sbjct: 263 GLFSVVAQEI 272 >gi|38348336|ref|NP_940912.1| inter-alpha-trypsin inhibitor heavy chain H5-like protein precursor [Homo sapiens] gi|74762375|sp|Q6UXX5|ITH5L_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5-like protein; Short=Inter-alpha inhibitor H5-like protein; Flags: Precursor gi|37181448|gb|AAQ88537.1| ITI-like protein [Homo sapiens] gi|57208182|emb|CAI42356.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|57209935|emb|CAI42344.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|119613592|gb|EAW93186.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_a [Homo sapiens] gi|162318154|gb|AAI57043.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] gi|162318494|gb|AAI56206.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] Length = 1313 Score = 68.9 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 46/352 (13%), Positives = 94/352 (26%), Gaps = 24/352 (6%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + L L R G ++ + S Sbjct: 132 ISTSLAAGTEVTFSLAYEELLQRHQGQYQLVVSLRPGQLVKRLSIEVTVSERTGISYVHI 191 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + P+ R + + + S S + + + Y Sbjct: 192 PPLRTGRLRTNAHASEVDSPPSTRIERGETCVRITYCPTLQDQSSISGSGIMADFLVQYD 251 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKK------------KHLVRDALASVIRSIKKIDNVNDT 294 ++ + + D + + + + + + K NV + Sbjct: 252 VVMEDIIGDVQIYDDYFIHYFAPRGLPPMEKNVVFVIDVSSSMFGTKMEQTKTAMNVILS 311 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIV------KTFAIDENEMGSTAINDAMQTAYD 348 +FN SD W I+ + K + G T +N A+ A Sbjct: 312 DLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAAS 371 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + SN++ I+ LTDGE T I + + + + ++AF Sbjct: 372 VLNHSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAF 429 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + R L N +E K + I + V R+ Sbjct: 430 GDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLADV-RLN 480 >gi|189518186|ref|XP_001331201.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Danio rerio] Length = 969 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 73/219 (33%), Gaps = 14/219 (6%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 ++Y + +D+S+ K ++ ++ I ++ ++ Sbjct: 301 FVHYFAPRDLPVVPKNVVFVIDTSASMLGTK----MKQTKQALFTIINELRPNDNF---N 353 Query: 299 ATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F++R+ V + K F + G T IN +QT + Sbjct: 354 FVTFSNRIRVWQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSD---YL 410 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEK 414 + +++ + I+ LTDG T + I + K + + TI + + Sbjct: 411 SSKDESHHHSVSLIIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLL 470 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L NC + E + K F D IG + + Sbjct: 471 ERMSLDNCGTMRRIPEDADASLMLKGFYDEIGTPLLSDI 509 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 54/183 (29%), Gaps = 33/183 (18%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+G F + + + + + + TAI DA+ T+ + + Sbjct: 76 NRVNDRIGLVVFAGAAYTQAPLTLDYGVVREVLKQIRTRVLED--GTAIGDALATSLNRL 133 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S + +VL+TDG+N + + A+S I I TI Sbjct: 134 RDS-----------EAKSRVVVLITDGDNNAGKISPLDAASMAESLKIPIYTILVGKGGK 182 Query: 411 QQEK------------------ARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIF 450 L + AS ++ A L + + + + Sbjct: 183 VPFPQGQDLFGNTVWRDTEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLER 242 Query: 451 ERV 453 ++ Sbjct: 243 SKL 245 >gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Meleagris gallopavo] Length = 881 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 50/166 (30%), Gaps = 7/166 (4%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K +D++ + F V + + F + G T + M Sbjct: 307 LKILDDIKEDDHFNFILFGSDVHTWKETLIKATPENLDEARKFVRGIDTKGLTNLYGGMM 366 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 D + +++E + ++ I++LTDG+ I K + Sbjct: 367 KGIDMLNAAHEGNLVPKRSAS----IIIMLTDGQPNVGISNTQDIQTHVKKAIEGKYTLY 422 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + F + L N + + + + F D + N Sbjct: 423 NLGFGYGVDYNFLEKMALENKGLARRIYPDSDSALQLQGFYDEVSN 468 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 63/196 (32%), Gaps = 34/196 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ A+ VI+ + ID R+ F+ + ++ Sbjct: 77 DKIAKVKTAMLFVIKKLSPID------RLSVVKFSADASRLCPLRQITEDSQKD-LENLI 129 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 N G+T I +QT + + + I+L++DGE Sbjct: 130 NGLNADGATNITAGLQTGLKVLNDRSLSSGRVVG--------IILMSDGEQNAGG----- 176 Query: 389 ICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRD 443 + A+ + + T F + L A+ +F + +T L+ F Sbjct: 177 --DAAQVPIGNVPVYTFGFGI-----NHEPRVLKAIANNSMGGTFSDVQNTDNLSLAFSQ 229 Query: 444 RIGNEIFERV--IRIT 457 + + V +++T Sbjct: 230 CLAGLLTVVVQDLKLT 245 >gi|109039132|ref|XP_001085463.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Macaca mulatta] Length = 891 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/307 (11%), Positives = 83/307 (27%), Gaps = 20/307 (6%) Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + I + + ++ + + + + P Sbjct: 186 VDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----PTC 239 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + E + +++ G + +D S + Sbjct: 240 TDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPKGLPVVPKNVAFVIDISGSMAG-R 298 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++AL ++ +K+ D +N F+ V + KTF Sbjct: 299 KLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEAKTFVK 352 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T IND + + + E+ ++ +++LTDG+ I Sbjct: 353 SMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRPEKI 408 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ + + F N L N +E + + F + + Sbjct: 409 QENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVA 468 Query: 447 NEIFERV 453 N + V Sbjct: 469 NPLLTGV 475 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 48/380 (12%), Positives = 111/380 (29%), Gaps = 72/380 (18%) Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + Q +S ++ + + ++ + ++ + + + I V+ Sbjct: 82 AATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVL 141 Query: 184 DFSRSMLDYQR--------DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 D S SM + P++ + D + ++ G Sbjct: 142 DVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAG 201 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN----- 290 ++ Y GP + F S + ++DA+ + ++ + Sbjct: 202 QKYTVTYNWIGGPSASVSPDVQFYKSKQSE--ETRLDALKDAVTYFLDQVEDQNQRINDP 259 Query: 291 -------------------VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 NDT +N + S +W L + + Sbjct: 260 GKKVQVALIKYAGKNSDKIGNDTYNEDGYNYN-YSQTVHSLAWTPEDLQK--EQAAVNSL 316 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE------- 384 G+T + +Q A + S A+K V +DG T + Sbjct: 317 KAGGATRADFGLQHAVKQLNSGRPG----------AQKLTVFYSDGSPTSSDGFEAKIAN 366 Query: 385 EGIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCAS---------------PNS 427 I + K+ ++++I + + + A F++ +S + Sbjct: 367 NAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGT 426 Query: 428 FFEANST-HELNKIFRDRIG 446 ++ A S +L IF++ I Sbjct: 427 YYYAVSARTDLQTIFKEIIS 446 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 54/196 (27%), Gaps = 25/196 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + + + + +G F+ + Sbjct: 96 SNSMAATDISPSRLAAAKQGAQAFVDQL------PPKINLGLVSFSGTAAVLVPPTTDRD 149 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + K +TAI + + I + IVLL+DG Sbjct: 150 AV-----KAGINGLQLGPATAIGEGIYAGLSAINTV--SSQFVNSGQAVPPAAIVLLSDG 202 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPN 426 E T+ AK I + TIA+ + LS A + Sbjct: 203 ETTRGRPNN-QAAQAAKDAHIPVSTIAYGTPNGTLDVGGQLIPVPVNEPALSQIAEQTGG 261 Query: 427 SFFEANSTHELNKIFR 442 S A S EL I++ Sbjct: 262 SHHRATSGDELTSIYK 277 >gi|123228966|emb|CAI21016.2| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H5 (ITIH5) [Danio rerio] Length = 906 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 73/219 (33%), Gaps = 14/219 (6%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 ++Y + +D+S+ K ++ ++ I ++ ++ Sbjct: 238 FVHYFAPRDLPVVPKNVVFVIDTSASMLGTK----MKQTKQALFTIINELRPNDNF---N 290 Query: 299 ATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F++R+ V + K F + G T IN +QT + Sbjct: 291 FVTFSNRIRVWQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSD---YL 347 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEK 414 + +++ + I+ LTDG T + I + K + + TI + + Sbjct: 348 SSKDESHHHSVSLIIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLL 407 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L NC + E + K F D IG + + Sbjct: 408 ERMSLDNCGTMRRIPEDADASLMLKGFYDEIGTPLLSDI 446 >gi|47522638|ref|NP_999089.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Sus scrofa] gi|3024032|sp|Q29052|ITIH1_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|565283|dbj|BAA07632.1| inter-alpha-trypsin inhibitor heavy-chain H1 [Sus scrofa] Length = 902 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 73/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + S + NG + + + + Y + V + Sbjct: 237 STCSTTLLNGDFKVTYDVNREKLCDLLVANNYFAHFFAPQNLTKLNKNVVFVIDISSSME 296 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F V S +++ Sbjct: 297 GQKVKQTKEALLKILSDLKPG---DYFDLVLFGSAVQSWRGSLVQASTANLDAARSYVRQ 353 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + GST +N + + + + + + N +++LTDGE T+ + I Sbjct: 354 FSLAGSTNLNGGLLRGIEIL-NKAQGSLPEFSNRAS---ILIMLTDGEPTEGVTDRSQIL 409 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + + F + L N +E + + + + F D++ N Sbjct: 410 KNVRDAIRGRFPLYNLGFGHDVEWNFLEVRALENNGRAQRIYEDHDSAQQLQGFYDQVAN 469 Query: 448 EIFERV 453 + + V Sbjct: 470 PLLKDV 475 >gi|291295619|ref|YP_003507017.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470578|gb|ADD27997.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 412 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 49/164 (29%), Gaps = 17/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ ++D V + R + GSTA++ + + Sbjct: 85 DRVAVVIYDDAVEVLVPSTPVADG--RAAIANLIRTIRTGGSTALHAGWLEGATQVAAYQ 142 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QGIRIMTIAFSVNKTQ 411 E +VLL+DG + I + + +G+ T+ ++ + Sbjct: 143 EAGRLNR---------VVLLSDGLANRGETNPGVIAEQVRELARRGVSTSTLGVGLDYNE 193 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A +++ S +L +IF + R Sbjct: 194 DLMTTM---ADAGEGNYYFIESPADLPRIFAQELAGLAGTLGTR 234 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 62/196 (31%), Gaps = 35/196 (17%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + + +++D ++ I K ++G F D + + + + Sbjct: 103 NGQTVDRFTMIQDVVSDFIERRKG-------DKLGLILFADHAYLQAPLTQDRRSVAQFL 155 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + TAI +A+ ++ + +VLLTDG N + Sbjct: 156 QEAQI--GLVGKQTAIGEAIALGVKRFDMVDKSN-----------RILVLLTDGSNNSGS 202 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARYFLSNCA--SPNSF 428 A +G++I I + + L + A + + Sbjct: 203 ISPEQAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSMDLDEAQLISLAKTTGGLY 262 Query: 429 FEANSTHELNKIFRDR 444 F A S+ +L I+++ Sbjct: 263 FRARSSQDLQLIYQEI 278 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 48/399 (12%), Positives = 113/399 (28%), Gaps = 39/399 (9%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 S+ S + S + + + + + T+ E Sbjct: 582 ATSLKKNDSDGDGLSDGYEVKKGTNPLVPDTDNDGIYDG-SEVVMGTDPLAKNPLTSAEK 640 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 +V + + D + L ++ + + + + V Sbjct: 641 YAVSEDGKVFVKALSDANILIAPLQVKRSDNVFINSLKGIVGKAIEITAGGFDIKRAEIV 700 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +++ + L+ ++ + K + Sbjct: 701 VNYDEAELNGVDENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKTEHFSTFLLGDKNM 760 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + +D ++ SS + A I++I +++N R+G F Sbjct: 761 PVDLSKVDIVFVLDNSGSMSSNDPNYYRIE----ATKKFIQNIDELNN-----RVGLVDF 811 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRM 361 + V + + + + G T I ++ A + Sbjct: 812 DSSVYVRSNLTS-----DKNKLLQALNAMRWTGGSTNIGGGLKAALELFDQEQS------ 860 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 KK IVLL+DG + + K + I + TIA ++ R L + Sbjct: 861 ------KKIIVLLSDGYHNTGIHPNDVLPELIKQE-IVVNTIALG-----KDCDRELLHD 908 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A + +F ++T L+ ++ + +I ++TK Sbjct: 909 IADKTKGDYFYVDNTGGLS---QEDVDKQIELIYEKLTK 944 >gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2138 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 57/189 (30%), Gaps = 28/189 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L+++ L ++ ++ D R+ F+ K +K Sbjct: 1463 LDLLKETLLFLVDLLQTGD------RICLIQFSTNAQRLTPLLSIESKDNIKSIKNEINR 1516 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I MQ A+D + + + + LL+DG N + + Sbjct: 1517 LVAKGGTNICQGMQLAFDVL---------KQRRYKNPITSVFLLSDGLNDGAENKIRDLL 1567 Query: 391 NKA------KSQGIRIMTIAFSVNKTQQ--EKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + I T F + +K + +F+ H +++ F Sbjct: 1568 KQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKISQLM-----DGNFYYIGDIHRIDECFI 1622 Query: 443 DRIGNEIFE 451 D +G Sbjct: 1623 DALGGLFSV 1631 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 68.9 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 60/182 (32%), Gaps = 14/182 (7%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + S + + S++ I+ + F D ++ Sbjct: 9 DTSGSLQYWSGGGWKNGFDDEKVFVNSLLSHIRVSYKST---YVSVVLFGTSATIDINYI 65 Query: 314 WGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + H + F+ G T ++DA Q AYD I A + Sbjct: 66 FNPHPNNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIFGKYSGHKRPTHQVKTA---V 122 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 LLTDG+ N + I + K +GI I TI + L + ASPN++F Sbjct: 123 FLLTDGQWNW-NGDPWPIAKRLKDRGIEIFTIGVTNGVNVNT-----LRSLASPNNYFHY 176 Query: 432 NS 433 N Sbjct: 177 ND 178 >gi|256420242|ref|YP_003120895.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035150|gb|ACU58694.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 639 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 42/361 (11%), Positives = 94/361 (26%), Gaps = 27/361 (7%) Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + + + + ++ N + +N + N + Sbjct: 116 DPVNTSLQEDVVVNAMIVPEAPKTVAGSPVVNAYMKSASPAFYGSRAPQFNTEDYSPVNE 175 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 ++ + S +M + +N F Sbjct: 176 NRFH---TVASDPLSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKYSNP 232 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + + + + +L P + +D S KK Sbjct: 233 T-GNTPVAVRTDMAICPWNTAHQLVRIALKGKDVAKDNLPPSNLVFLIDVSGSMSDAKKL 291 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ A ++ ++ +D R+ + + G H +T + Sbjct: 292 PLVKQAFKLLVNQLRPVD------RVAIVVYAGAAGLVLPSTSGDH---KTAILDALDKL 342 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIA 388 GSTA + +Q AY + I+ TDG+ + E Sbjct: 343 EAGGSTAGGEGVQLAYK--------TATEYLLKSGNNRVII-ATDGDFNVGPSSDGELQR 393 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 I K + +GI + + F + + K ++ ++ E + F G Sbjct: 394 IIEKKREKGIFLSVLGFGMGNYKDNKLELLADK--GNGNYAYIDNFEEARRTFATEFGGT 451 Query: 449 I 449 + Sbjct: 452 L 452 >gi|256426121|ref|YP_003126774.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256041029|gb|ACU64573.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 462 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 65/187 (34%), Gaps = 25/187 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K R A +I + D+++ ++DRV + +K Sbjct: 96 DKIKYARQAAKFLIDQLNSTDHLS------IVNYDDRVEVTSP---SQSVKNKEALKAAI 146 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ++ GST ++ M Y + S+ ++ ++LLTDG Q + + Sbjct: 147 DKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNR---------VLLLTDGLANQGITDPLE 197 Query: 389 ICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + A K GI + T + + + N +++ +S ++ +IF Sbjct: 198 LKRLAENKYKEDGIALSTFGVGADYNE-DLLTMLAEN--GRANYYFIDSPDKIPQIFAGE 254 Query: 445 IGNEIFE 451 + + Sbjct: 255 LKGLLSV 261 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 57/178 (32%), Gaps = 21/178 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + L I K R+G F + L I+ T Sbjct: 108 TRLGVSKKLLRKFIDKRKS-------DRLGLVVFAGAAYLQAPLTGDRESLNE-ILGTIE 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI DA+ + + +S K IVL+TDG + + + Sbjct: 160 EETVAEQGTAIGDAIILSTYRLRASQARS-----------KVIVLITDGVSNTGKIDPVT 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS--NCASPNSFFEANSTHELNKIFRDR 444 + A+ G++I ++ E L + ++ FF A E+ + Sbjct: 209 ATDLAEHIGVKIYSVGIGKEDGSYEINFEILRELSASTGGKFFRAEDPEEMKAVLTSI 266 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 63/203 (31%), Gaps = 37/203 (18%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + R+A + + + R+G F + + L R + Sbjct: 95 DFQPTRFEAAREAAGAFVE-------GRVSDRVGLIVFAAEAYTQAPLTLDYSFLQRMLE 147 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + TA+ A+ TA + + S + K +LLTDG N + Sbjct: 148 DVEVGAVED--GTAVGTALATAVNRLKDSEAES-----------KVAILLTDGRNNRGQI 194 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQE---------------KARYFLSNC--ASPNS 427 + A++ G+R+ I ++ + L + ++ Sbjct: 195 DPRTAAEVAQTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAGVDAEMLRSVSTSTGGQ 254 Query: 428 FFEANSTHELNKIFRDRIGNEIF 450 +F A + L +I+ + E Sbjct: 255 YFSATNRDALERIYAEIDTMEAT 277 >gi|226951529|ref|ZP_03821993.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226837721|gb|EEH70104.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 536 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 78/295 (26%), Gaps = 24/295 (8%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 +A + + + + +N F + + Sbjct: 82 VTDQAVSTFSIDVDTGSYTNTRRFLNDGRLPPVDAVRIEEMINYFDYQYPQP-NGVHPFS 140 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 D + P + VD S K LV+ L Sbjct: 141 VTTETVDSPWKENAKLIRIGIQAKDLALQQLPPANLVFLVDVSGSMSAADKLPLVKQTLR 200 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + ++ D V + + G + + G+TA Sbjct: 201 ILTEQLRAQDKVT------IITYASGEKLVLEPTSG---EQKEKILAVINGLRAGGATAG 251 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQ 396 A+Q AY E + N I+L TDG+ + + + + Sbjct: 252 EQAIQLAYK----QAEKAFVKNGINR-----ILLATDGDFNVGITDFSTLKGMVAEKRKS 302 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 GI + T+ F ++ A ++ ++ +E K+ + ++ + + Sbjct: 303 GISLTTLGFGTGNYNEQLMEQLAD--AGDGNYSYIDNKNEAKKVVQRQLSSTLAT 355 >gi|114587338|ref|XP_516521.2| PREDICTED: inter-alpha (globulin) inhibitor H3 isoform 2 [Pan troglodytes] Length = 865 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 73/250 (29%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 212 PTCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMA 271 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K ++AL ++ +K+ D +N F+ V + +T Sbjct: 272 G-RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEART 324 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + G T IND + + + E+ ++ +++LTDG+ Sbjct: 325 FVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRP 380 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + F + Sbjct: 381 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYE 440 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 441 EVANPLLTGV 450 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 54/198 (27%), Gaps = 40/198 (20%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +++ ++ I R+G + + + ++ + + Sbjct: 107 SPNRLVALKEVASNFI-------KDRPNDRIGLVVYAGESYTKTPITSDKRLVLEALKEI 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + TAI + T+ + + S K I+LLTDG N E Sbjct: 160 KYGALTD--GTAIGMGLATSVNRLKDSKALS-----------KVIILLTDGVNNAGFIEP 206 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPN 426 A I+ TI N L A + Sbjct: 207 ATAAELAVEYDIKTYTIGLGTNGNALSPIAYNPDGSFRYGMAEVQIDEALLEQIATLTGG 266 Query: 427 SFFEANSTHELNKIFRDR 444 +F A +L I+ + Sbjct: 267 KYFRATDNKKLEAIYNEI 284 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 58/166 (34%), Gaps = 16/166 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G F+ D + ++ G TAI DAM A + +I+ Sbjct: 657 NTEVGVVSFSSTSTVDRQPVSLNISGNKDLLHNAIDSMVADGGTAIGDAMADANNLLING 716 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +AKK +++LTDG T + +G + A IRI +I ++ Sbjct: 717 RP----------DAKKIMIVLTDGVATAGSDRDGSDAISTANLNNIRIYSIGLGSSEY-- 764 Query: 413 EKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 L AS S++ A S EL ++ V I Sbjct: 765 -IDEPMLKRIASETGGSYYNAPSGSELQTVYNTISKEISDFDVTEI 809 >gi|291399431|ref|XP_002716111.1| PREDICTED: matrilin 1, cartilage matrix protein [Oryctolagus cuniculus] Length = 497 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 44/440 (10%), Positives = 109/440 (24%), Gaps = 56/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGVVNYASAVRQEFPLRAHGSKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R T ++ + + + VV R + Sbjct: 104 LLQAVRRIRPLATGTMTGLAIQFAITKALSDAEGGRVTSPGISKVVIVVTDGRPQDSVRD 163 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + S S+++ + Sbjct: 164 VSERARASGVELFAIGVGGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSKKFREAFCV 223 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C + + Sbjct: 224 VSDLC------ATGDHDCEQVCVSSPGSYTCACRDGFTLNSDGKTCNVCSGGGGSSATDL 277 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ + ++ ++ D + ++ V + G Sbjct: 278 VFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRLAQVG---LVQYSSSVRQEFPL--GR 332 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 333 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 386 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 387 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 437 Query: 434 THELNKIFRDRIGNEIFERV 453 + + IG ++ +R+ Sbjct: 438 FKTITQ-----IGKKLQKRI 452 >gi|114587336|ref|XP_001172570.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1 [Pan troglodytes] Length = 890 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 73/250 (29%), Gaps = 14/250 (5%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 P + E + +++ G + +D S Sbjct: 237 PTCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMA 296 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K ++AL ++ +K+ D +N F+ V + +T Sbjct: 297 G-RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEART 349 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F + G T IND + + + E+ ++ +++LTDG+ Sbjct: 350 FVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRP 405 Query: 387 IAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ + + F N L N +E + + F + Sbjct: 406 EKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYE 465 Query: 444 RIGNEIFERV 453 + N + V Sbjct: 466 EVANPLLTGV 475 >gi|39652254|emb|CAC79611.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 886 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 180 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 233 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 234 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 293 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +++ D +N F+ V + +TF Sbjct: 294 -RKLEQTKEALLRILEDMQEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 346 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 347 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 402 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 403 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 462 Query: 445 IGNEIFERV 453 + N + V Sbjct: 463 VANPLLTGV 471 >gi|296225412|ref|XP_002758286.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Callithrix jacchus] Length = 860 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/309 (11%), Positives = 85/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + I + + ++ + + + + P Sbjct: 184 IEADIFEPQGISMLDAEASFITNDFLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G S +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKSVAFVIDVSGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D +N F+ V + K F Sbjct: 298 -RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEAKMF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 ++ G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSIDDRGMTNINDGLLRGISMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIRGKFPLYNLGFGNNLNYNFLENMALENQGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|133925809|ref|NP_002208.3| inter-alpha-trypsin inhibitor heavy chain H3 preproprotein [Homo sapiens] gi|166203665|sp|Q06033|ITIH3_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|77748471|gb|AAI07605.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|77748473|gb|AAI07606.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|78070482|gb|AAI07815.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] Length = 890 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +++ D +N F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMQEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|149440247|ref|XP_001521494.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3, partial [Ornithorhynchus anatinus] Length = 390 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 73/261 (27%), Gaps = 14/261 (5%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + V+ S S + T + E + + Sbjct: 137 DLLGVLTKSFSGKKGHVSFKPTLDQQRSCLNCTTSMMNGDFTITYDVVRESPANVQIVNG 196 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +++ + +D S +K ++AL ++ +K+ D +N Sbjct: 197 YFVHFFAPKDLPVVPKNVVFVIDVSGSM-YGRKLVQTKEALLKILEDMKEEDYLN----- 250 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F+ + + K F + + G T IND + + + E Sbjct: 251 -FILFSSEITTWKDTLIKATPENLKKAKEFVKNIKDEGLTNINDGLMRGIKMLNEARETN 309 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEK 414 V ++ I++LTDGE I ++ + + F + Sbjct: 310 VVPKRSTS----LIIMLTDGEANVGEIRADKIQENVRNAIGGKFPLYNLGFGYDLNYNLL 365 Query: 415 ARYFLSNCASPNSFFEANSTH 435 + L N +E + Sbjct: 366 EKMALENHGLARRIYEDSDAD 386 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 58/181 (32%), Gaps = 38/181 (20%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF--AIDENEMGSTAINDAMQTAYD 348 + R+G + F R + + L+ + + A D TAI D + A Sbjct: 152 KRHSDRIGLSAFGARSFTQCPLTMDYGSLLEILKASDDLARDTLVNNRTAIGDGLMNALA 211 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + K + + ++LLTDG + + AKS G+++ T+ Sbjct: 212 RL-----------KMSDAKSRVVILLTDGRDNASVVPPVRAAEVAKSLGVKVYTVGVGKK 260 Query: 409 KT-----------------------QQEKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 ++ L AS F+ A + EL KI+ + Sbjct: 261 SGKILAFQQNPWTGEISWGERDITPEEGIDEDVLKAIASKTGGRFYRAENKAELEKIYSE 320 Query: 444 R 444 Sbjct: 321 I 321 >gi|332216203|ref|XP_003257234.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Nomascus leucogenys] Length = 890 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSQ-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLDGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +K+ D +N F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMKEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRVPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|62897073|dbj|BAD96477.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] gi|62898698|dbj|BAD97203.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] Length = 890 Score = 68.5 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/309 (11%), Positives = 84/309 (27%), Gaps = 20/309 (6%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + I + + ++ + + + + P Sbjct: 184 IEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSG-KKGHVSFKPSLDQQRSC-----P 237 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + E + +++ G + +D S Sbjct: 238 TCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAG 297 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K ++AL ++ +++ D +N F+ V + +TF Sbjct: 298 -RKLEQTKEALLRILEDMQEEDYLN------FILFSGDVSTWKEHLVQATPENLQEARTF 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G T IND + + + E+ ++ +++LTDG+ Sbjct: 351 VKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGESRPE 406 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I ++ + + F N L N +E + + F + Sbjct: 407 KIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEE 466 Query: 445 IGNEIFERV 453 + N + V Sbjct: 467 VANPLLTGV 475 >gi|317419026|emb|CBN81064.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Dicentrarchus labrax] Length = 836 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 85/276 (30%), Gaps = 21/276 (7%) Query: 182 VIDFSRSMLDYQRDSEGQP-LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V D + + + +C D N VG+ D ++ + Sbjct: 154 VTDNKAHISFSPTLEQQRKCASCDETLIDGDFIIKYDVNRVVGLGDIQIVNGYF-----V 208 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 ++ P + +D S +K R+AL ++++ + + D Sbjct: 209 HFFAPPDLPRVPKNVVFVIDRSGSM--RRKMVQTREALLAILKDLHEEDY------FALI 260 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+DR+ S K + N G T IN A+ T + ++ ++ Sbjct: 261 QFDDRIDSWQKSLTKATKENVDQAMIYVQQINYRGGTDINQAVLTGVEMLLKDRREKKLP 320 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARY 417 ++ I+LLTDG I +S + + + F + Sbjct: 321 ERSVD----MIILLTDGMPNSGESHLPRIQENVRSAIRGNMSLFCLGFGNDVDYSFLDVM 376 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 N FE + + F D + + + V Sbjct: 377 SKQNKGLARRIFEGSDATLQLQGFYDEVSSPLLSDV 412 >gi|312958282|ref|ZP_07772803.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] gi|311287346|gb|EFQ65906.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] Length = 546 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 42/325 (12%), Positives = 92/325 (28%), Gaps = 27/325 (8%) Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 S+ D P + + T S +++ ++ S+ Sbjct: 74 SATLDYRSEPREQYANL-PDNPVHRVAETPVSTFSVDVDTGSYANVRRFLNQ-GSLPPEG 131 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + +N F + +P + Sbjct: 132 AVRLKEMVNYFPYHYALPTDGSP-FGVTTEVAATPWNPRTQLLRIGIKASDRAVAELAPA 190 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + VD S + LV+ L ++ +++ D V+ + + Sbjct: 191 NLVFLVDVSGSMDRREGLPLVQSTLKLLVDQLREQDRVS------LVVYAGESRVVLPPT 244 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + ++T GSTA +Q AY D+ I+L Sbjct: 245 SGR---DKAKIRTAIDQLTAGGSTAGASGIQLAYQMAREGFIDKGINR---------ILL 292 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 TDG+ + + + + G+ + T+ F V+ + A ++ Sbjct: 293 ATDGDFNVGISDFDSLKQMAAEQRKSGVSLTTLGFGVDNYNEHLMEQL--ADAGDGNYAY 350 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 ++ E K+ D++ + + V R Sbjct: 351 IDTLREARKVLVDQLSSTL-TVVAR 374 >gi|319780570|ref|YP_004140046.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166458|gb|ADV09996.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 359 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/254 (11%), Positives = 60/254 (23%), Gaps = 3/254 (1%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + I+S +G F I L M ++ G VD + A AI++ + Sbjct: 2 RGLAGSFIESRSGSFAPILILAMIPLITAIGFSVDYTSAVQTRSTEQAALDAAILSITTM 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 S + A + + +F+ + T + T R Sbjct: 62 DTASTKPQRQVAMQASYMANGGQGTATLNSFDVSANGTATAQASASFAMPTVFMQIARIP 121 Query: 128 AYQVVLSSRY-DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + V ++S + F + W + + K Sbjct: 122 SVAVGVTSAVNKAPALVEATFKVTGVSGYWNKKMTLYGTMFGAAAGKPLMTIDYAYGKTG 181 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + G+ + + + ++ YY + Sbjct: 182 DPKGYGTTTVSVLTTDSAGKTVTTVAQKQVCKLVDSSTPAGAVIQ--TDGFQTKYYCVDT 239 Query: 247 GPLDPSLSEEHFVD 260 + V Sbjct: 240 MYPADGAGAQINVS 253 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/286 (11%), Positives = 74/286 (25%), Gaps = 9/286 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G I + + V L V + VD+ + L+ A +A + A+ L S ++ Sbjct: 10 KKQGGLVVIFVTIALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQA 69 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + A+ + T + + N + D V ++ Sbjct: 70 AVSAEVTSTL--NAMAGASGNHEFNFSTAVVSVEYSNDPQSFAGTTTFGADDDVYVRVNV 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE-HGVSIQWVIDFSRSMLDYQR 194 L+ F++ G+ + T + + + S + ++ Sbjct: 128 S---SLDMDEFFIQMFGLAKDVSATAVAGPSSGLQVVNNIVPIGVCIGDGTSDNDINEDG 184 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM---VSCNKSLYYMLYPGPLDP 251 + ++ + G + Y G D Sbjct: 185 YDDATGEAITNVFGYEVGTVHALKVGDKKLSQMGNGNYHLLDFGSGGKTVKEALAGYYDY 244 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 S++ V + V + + +D V Sbjct: 245 SVAIGDSVTTKPGGTVGPTADGLNTRFGDYGGGLSSSAYPSDYVTT 290 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 51/178 (28%), Gaps = 24/178 (13%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A VI R+G F + S + + I G Sbjct: 119 AAKHVITQFVDSRP---DDRIGIVGFAGKTKSFCPLTLDHALVNSIIRDFHPRMIQADG- 174 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TAI A+ A + E K I+L+TDG + + A Sbjct: 175 TAIGSAIAAAATRLDDRKE----------TKSKIIILVTDGASNSGQISPLVAAENAAKL 224 Query: 397 GIRIMTIAFSVNKT--------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 GI+I TIA + Q E L A + F A + NK F Sbjct: 225 GIKIYTIAVGTEEGTLANGMVVQSEFDEPTLRKIAQLTGGEHFRATNMASFNKAFTSI 282 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 60/176 (34%), Gaps = 26/176 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +V+ + ++++ +K D ++ F+DR + Sbjct: 133 RMDMVKSSALNLLKQFRKQDLIS------VVAFSDRAEVVIPPTRVPDLAK---DDHRIS 183 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I +Q + + S + + ++LLTDG D+E I + Sbjct: 184 MLQVGGGTEIYQGLQLGIEQLRS----------IDPRFMRQLILLTDGHTYGDDEACIEL 233 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +A GI+I T+ E L A S + S +LNK F Sbjct: 234 AEEAAQDGIQINTMGIG-----HEWNDELLDKIATISGANSIFVTSPKDLNKFFEQ 284 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 23/232 (9%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--- 271 + ++G Y C + + + + Sbjct: 669 NTVEHGCEYQCVSTPGSYHCICPEGHVLQDDGKSCGTCRTSNIDLVLLIDGSKSVRPQNF 728 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + R+G ++ RV ++ S ++ I K E Sbjct: 729 ELVKQFVNQVVDQLDVSAKGT---RVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 783 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + ++ TDG +QD+ Sbjct: 784 YMEKGTMTGLALKHMVENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAK 837 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KAK GI + + + L AS FF + +++I Sbjct: 838 KAKEAGITMYAVGVG------KAVEDELREIASDPVEKHFFYSADFTAISQI 883 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+G ++ +V + S I T Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 115 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 116 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 166 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 167 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 200 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 23/232 (9%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--- 271 + ++G Y C + + + + Sbjct: 369 NTVEHGCEYQCVSTPGSYHCICPEGHVLQDDGKSCGTCRTSNIDLVLLIDGSKSVRPQNF 428 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + R+G ++ RV ++ S ++ I K E Sbjct: 429 ELVKQFVNQVVDQLDVSAKGT---RVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 483 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + ++ TDG +QD+ Sbjct: 484 YMEKGTMTGLALKHMVENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAK 537 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KAK GI + + + L AS FF + +++I Sbjct: 538 KAKEAGITMYAVGVG------KAVEDELREIASDPVEKHFFYSADFTAISQI 583 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+G ++ +V + S I T Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 102 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 103 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 153 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 154 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 187 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 23/232 (9%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--- 271 + ++G Y C + + + + Sbjct: 669 NTVEHGCEYQCVSTPGSYHCICPEGHVLQDDGKSCGTCRTSNIDLVLLIDGSKSVRPQNF 728 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + R+G ++ RV ++ S ++ I K E Sbjct: 729 ELVKQFVNQVVDQLDVSAKGT---RVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 783 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + ++ TDG +QD+ Sbjct: 784 YMEKGTMTGLALKHMVENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAK 837 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KAK GI + + + L AS FF + +++I Sbjct: 838 KAKEAGITMYAVGVG------KAVEDELREIASDPVEKHFFYSADFTAISQI 883 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+G ++ +V + S I T Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 115 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 116 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 166 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 167 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 200 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 23/232 (9%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--- 271 + ++G Y C + + + + Sbjct: 546 NTVEHGCEYQCVSTPGSYHCICPEGHVLQDDGKSCGTCRTSNIDLVLLIDGSKSVRPQNF 605 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + R+G ++ RV ++ S ++ I K E Sbjct: 606 ELVKQFVNQVVDQLDVSAKGT---RVGLVQYSSRVRTEFPLS--MYHSKDEIKKAVMNVE 660 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + ++ TDG +QD+ Sbjct: 661 YMEKGTMTGLALKHMVENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAK 714 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KAK GI + + + L AS FF + +++I Sbjct: 715 KAKEAGITMYAVGVG------KAVEDELREIASDPVEKHFFYSADFTAISQI 760 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+G ++ +V + S I T Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 115 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 116 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 166 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 167 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 200 >gi|38505728|ref|NP_942348.1| hypothetical protein slr7060 [Synechocystis sp. PCC 6803] gi|38423752|dbj|BAD01962.1| slr7060 [Synechocystis sp. PCC 6803] Length = 588 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 62/214 (28%), Gaps = 29/214 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 PSL+ +D S K R A+ I + D+++ T F Sbjct: 34 PPAMDQPRPSLNLGFVIDRSGSMEGHNKITYARQAVCYAIDQLSPGDHLS------VTIF 87 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 +D+V + + + K N G T ++ + + E++R Sbjct: 88 DDQVQTLIPSTLVK---DKAQFKRLVQGINPGGCTDLHGGWLQGGIQVSQNLSAELNR-- 142 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNK---AKSQGIRIMTIAFSVNKTQQEKARYFL 419 I+LL+DG + I +G T+ + L Sbjct: 143 --------IILLSDGLANRGETNPDIIATDVHGLAQRGASTTTLGLGDDYN-----EDLL 189 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 A +++ +L IF + Sbjct: 190 EAMARSGDGNYYYVADAEQLPTIFERELQGLAAT 223 >gi|291295701|ref|YP_003507099.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470660|gb|ADD28079.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 298 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 57/170 (33%), Gaps = 18/170 (10%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +A+ + + +D + +++G F+ + R ++ + + Sbjct: 102 APNRMVATQMAAKALVDKLPRHIKVGVVTFSGYGTLLLPPT-----TDRKAIRQAIDNLD 156 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G + + A + + + + IVL + G + N + + I ++ Sbjct: 157 LGGGFSFTYGLLAALEALPQTPPEGSRPG--------VIVLFSHGHDVSGN-DPLKIADQ 207 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 A +GI++ I L A + ++ S +L+K Sbjct: 208 ALERGIQVHAIGVGT--HGHNFDEEMLKKVADRTGGRYYPIFSASDLSKA 255 >gi|77456411|ref|YP_345916.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380414|gb|ABA71927.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 563 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/299 (12%), Positives = 87/299 (29%), Gaps = 25/299 (8%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 T+ T S +++ +++ + + +N F Sbjct: 98 IHSVTETPVSTFSADVDTGAYANVRRLLNQ-GRLPPEGAVRLEEMVNYFPYDYALPSDGS 156 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + +P+ + + VD S + LV+ Sbjct: 157 P-FGVTTELAASPWNPHTRLLRIGIKASDRAVAELAPANLVFLVDVSGSMDRREGLPLVK 215 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 L ++ +++ D V+ + + G + ++T G Sbjct: 216 STLKLLVDQLREQDRVS------LVVYAGESSVVLEPTSGRE---KAKIRTAIERLTAGG 266 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNK 392 STA ++ AY + N I+L TDG+ + + Sbjct: 267 STAGASGIELAYQM----AQQAFIPKGINR-----ILLATDGDFNVGTSDFDSLKQMAVD 317 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + GI + T+ F V+ + A ++ ++ E K+ D++G+ + Sbjct: 318 KRKTGISLTTLGFGVDNYNEHLMEQLAD--AGDGNYAYIDNLREARKVLVDQLGSTLAV 374 >gi|113867618|ref|YP_726107.1| von Willebrand factor type A domain-containing protein [Ralstonia eutropha H16] gi|113526394|emb|CAJ92739.1| von Willebrand factor (vWF) type A domain [Ralstonia eutropha H16] Length = 345 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 51/174 (29%), Gaps = 20/174 (11%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 A+ V+ R+G F D F+ + + T + Sbjct: 122 VQAVRQVVSGFVARRPG---DRIGLIVFGDAPYPLAPFTLDHQLV--QTLITGLLPGMAG 176 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 STA+ DA+ S +K +++LTDG +T AK Sbjct: 177 PSTALGDAIGLGIKMFEHS-----------EAPEKVLIVLTDGNDTASRMPPERAGGIAK 225 Query: 395 SQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFRDR 444 + + + TI E+ L A + +F L I+ Sbjct: 226 ERKVVVHTIGIGDPNASGEEKVDLGVLQRLAAQTGGRYFFGADQAGLETIYATL 279 >gi|74315933|ref|NP_001028276.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|72679321|gb|AAI00122.1| Inter-alpha (globulin) inhibitor H3 [Danio rerio] Length = 892 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 42/345 (12%), Positives = 90/345 (26%), Gaps = 22/345 (6%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI-DF 185 ++ V +S+ L L R +G LI + S + + Sbjct: 124 TSVTVAANSKVTFELTYEELLKRRLGKYKLLINAQPMQPVADFKIDIHIHESAGISLLEV 183 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY---MVSCNKSLYY 242 + + + + Q L+ M + Sbjct: 184 KGGLNTKDLANAVTTTRAQEDAWVKFYPTRDQQKDCDDCTKNGLNGNLVIMYDVERVKQS 243 Query: 243 MLYPGPLDPSLSEEHFVDS-SSLRHVIKKKHLVRDALASVIRSIKKI-----DNVNDTVR 296 + + D ++V+ + I + ++ Sbjct: 244 GDFKVANGYFVHYFAPTDVQRIPKNVVFIIDQSGSMQGNKIEQTRMAMLRILSDLAKDDY 303 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G F+ + + KTF G+T IN A+ A + I ++ Sbjct: 304 FGLITFSSHIQAWKPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQE 363 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 ++LLTDG+ T + I K+ + + F N + Sbjct: 364 GSAS---------ILILLTDGDPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGFNVRFEF 414 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L N + +E + + F + + + V + Sbjct: 415 LEKMSLENNGAARRIYEDSDADLQLQGFYEEVAIPLLTNVQLNYQ 459 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 57/200 (28%), Gaps = 28/200 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + + Sbjct: 295 TKLALLKRAMGFVIQNLGPSD------RLSVIAFSSTARRLFPLRCMTEAGRQEALLS-V 347 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ I+ I+LL+DG++T Sbjct: 348 NSLVSNGGTNIAEGLRKGAKVIVDRKW---------KNPVASIILLSDGQDTYTVTSPSG 398 Query: 389 I---CNK-------AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 + + G + I + + + + S +F + Sbjct: 399 MNPRADYKSLLPISIHRNGGTGLKIPVHSFGFGADHDAASMHSISEISGGTFSFIEAEGV 458 Query: 437 LNKIFRDRIGNEIFERVIRI 456 + F IG + V + Sbjct: 459 IQDAFAQCIGGLLSVVVQEL 478 >gi|325964113|ref|YP_004242019.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] gi|323470200|gb|ADX73885.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] Length = 352 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 16/171 (9%), Positives = 43/171 (25%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G I+ A+LM +LG + VDV + ++ A + + + + Sbjct: 22 ERGAATILVAVLMVALLGCAALAVDVGAMYAEKAQIQNGADATSLAIAEECANGVNCAVA 81 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 A + + F T R + + + + +V+ Sbjct: 82 MAAPANRLADANANDGATGVFSVTQPSPSTIRVETNAREAGSGDNHFSLFFARVLGIDTS 141 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + + + + + + + Sbjct: 142 QITAAAEASWGPPSSGSTLPWTVSECVFRKYLTPTQLTELDTTGSFTGDPT 192 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 60/184 (32%), Gaps = 21/184 (11%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 K +V++ +A I R+G F D ++ L Sbjct: 100 FDPKDPYKNKFDVVKEVVADFI-------KKRKNDRIGMVTFADVAFIASPLTFEKDFLT 152 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 TAINDA+ AY+ + S K I+LLTDG + Sbjct: 153 NITEMQKL--GMAGKRTAINDALVQAYNLMSKSKAKS-----------KIIILLTDGRDN 199 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + + + +++ TI + + L+ A + A S L+KI Sbjct: 200 MSKIPLSDVKHMIEKRDVKLYTIGIGGPRDYDAQYLKTLAK-AGKGQAYAARSAAMLSKI 258 Query: 441 FRDR 444 + + Sbjct: 259 YDEI 262 >gi|120436991|ref|YP_862677.1| von Willebrand factor type A domain-containing protein [Gramella forsetii KT0803] gi|117579141|emb|CAL67610.1| secreted protein containing von Willebrand factor type A domain [Gramella forsetii KT0803] Length = 592 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 47/347 (13%), Positives = 104/347 (29%), Gaps = 39/347 (11%) Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + SS ++++ S ++ G + A R ++ V Sbjct: 81 NSVMNITMLADTSSLDEVIITGYSAEMKIRGTSNIQTSVTANESYNKREDNQFKLVKASP 140 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL--SPYMVSCNKS 239 + FS + + + +N Q VK N + P+ + + Sbjct: 141 LSTFSIDVDKAGYSNIRRMINNGIQIPKDAVKIEEMINYFNYDYKQPTGKHPFSIQTEYA 200 Query: 240 LYYMLY--------------PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 P P+ + +D S K L++ A + ++ Sbjct: 201 QTPWNSDTKLVKIGLQGKTIPLENVPASNLVFLLDVSGSMGQQNKLPLLKSAFKLLTNNL 260 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ++ D ++ + + G +T ++ + GSTA + ++ Sbjct: 261 REQDKIS------IVVYAGSSGVVLEPTSG---DQKTKIEEALDKLSAGGSTAGGEGIEL 311 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMT 402 AY +D + NN ++L TDG+ A+ + K + GI + Sbjct: 312 AYKI----AKDNFIKNGNNR-----VILATDGDFNVGLSSDKAMEDLIKEKRESGIFLTA 362 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + F + + K + +S E ++ G + Sbjct: 363 LGFGMGNYKDSKLESLAQT--GNGNHAYIDSMQEAQRVLETEFGGTL 407 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 69/232 (29%), Gaps = 23/232 (9%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK--- 271 + ++G Y C + + + + Sbjct: 410 NTVEHGCEYQCVSTPGSYHCICPEGHVLQDDGKSCGTCRTSNIDLVLLIDGSKSVRPQNF 469 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+ + V+ + R+G ++ RV ++ S ++ I K E Sbjct: 470 ELVKQFVNQVVDQLDVSAKGT---RVGLVQYSSRVRTEFPLS--MYHSKDEIKKAEMNVE 524 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A++ + S E KN + ++ TDG +QD+ Sbjct: 525 YMEKGTMTGLALKHMVENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAK 578 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KAK GI + + + L AS FF + +++I Sbjct: 579 KAKEAGITMYAVGVG------KAVEDELREIASDPVEKHFFYSADFTAISQI 624 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+G ++ +V + S I T Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 102 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 103 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 153 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 154 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 187 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 48/380 (12%), Positives = 111/380 (29%), Gaps = 72/380 (18%) Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + Q +S ++ + + ++ + ++ + + + I V+ Sbjct: 16 AATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVL 75 Query: 184 DFSRSMLDYQR--------DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 D S SM + P++ + D + ++ G Sbjct: 76 DVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAG 135 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN----- 290 ++ Y GP + F S + ++DA+ + ++ + Sbjct: 136 QKYTVTYNWIGGPSASVSPDVQFYKSKQSE--ETRLDALKDAVTYFLDQVEDQNQRINDP 193 Query: 291 -------------------VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 NDT +N + S +W L + + Sbjct: 194 GKKVQVALIKYAGKNSDKIGNDTYNEDGYNYN-YSQTVHSLAWTPEDLQK--EQAAVNSL 250 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE------- 384 G+T + +Q A + S A+K V +DG T + Sbjct: 251 KAGGATRADFGLQHAVKQLNSGRPG----------AQKLTVFYSDGSPTSSDGFEAKIAN 300 Query: 385 EGIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCAS---------------PNS 427 I + K+ ++++I + + + A F++ +S + Sbjct: 301 NAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGT 360 Query: 428 FFEANST-HELNKIFRDRIG 446 ++ A S +L IF++ I Sbjct: 361 YYYAVSARTDLQTIFKEIIS 380 >gi|115496418|ref|NP_001068821.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus] gi|122142424|sp|Q0VCM5|ITIH1_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|111308523|gb|AAI20097.1| Inter-alpha (globulin) inhibitor H1 [Bos taurus] Length = 906 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/250 (11%), Positives = 70/250 (28%), Gaps = 12/250 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S + NG + + + Y + V + Sbjct: 240 PTCSTTLLNGDFKVTYDVNRDDACDLLVANNYFAHFFAPQNLKKLNKNVVFVIDISSSME 299 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++ + + + F V S + F Sbjct: 300 GQKLKQTKEALHKILGDMRPG---DYFDLVLFGSAVQSWKGSLVQASPANLEAARNFVQQ 356 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + + + +++LTDGE T+ + I Sbjct: 357 FSLAGATNLNGGLLRGIEILNKAQQS----LPELSNHASILIMLTDGEPTEGVMDRTQIL 412 Query: 391 NKAKSQGIR----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + GI+ + + F + L N +E + + + F +++ Sbjct: 413 KNVRD-GIKGRFPLYNLGFGHDVDLNFLEVMSLENNGRVQRIYEDHDATQQLQGFYEQVA 471 Query: 447 NEIFERVIRI 456 N + V + Sbjct: 472 NPLLRDVELL 481 >gi|296474790|gb|DAA16905.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus] Length = 906 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/250 (11%), Positives = 70/250 (28%), Gaps = 12/250 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S + NG + + + Y + V + Sbjct: 240 PTCSTTLLNGDFKVTYDVNRDDACDLLVANNYFAHFFAPQNLKKLNKNVVFVIDISSSME 299 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ ++ + + + F V S + F Sbjct: 300 GQKLKQTKEALHKILGDMRPG---DYFDLVLFGSAVQSWKGSLVQASPANLEAARNFVQQ 356 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + + + +++LTDGE T+ + I Sbjct: 357 FSLAGATNLNGGLLRGIEILNKAQQS----LPELSNHASILIMLTDGEPTEGVMDRTQIL 412 Query: 391 NKAKSQGIR----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + GI+ + + F + L N +E + + + F +++ Sbjct: 413 KNVRD-GIKGRFPLYNLGFGHDVDLNFLEVMSLENNGRVQRIYEDHDATQQLQGFYEQVA 471 Query: 447 NEIFERVIRI 456 N + V + Sbjct: 472 NPLLRDVELL 481 >gi|3024046|sp|P97278|ITIH1_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|1694688|dbj|BAA13938.1| inter-alpha-trypsin inhibitor heavy chain 1 [Mesocricetus auratus] Length = 914 Score = 68.1 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 27/246 (10%), Positives = 70/246 (28%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + Y + + + + Sbjct: 248 PTCSTSWLNGDFKVTYDVNRDKLCDLLVANNYFAHFFAPKNLTNMSKNLVFVIDISGSME 307 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 308 GQKVKQTKEALLKILGDVKPG---DSFDLVLFGSRVQSWKGSLVPATQANLQAAQDFVRR 364 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 365 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 420 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + + N +E + + + F +++ N Sbjct: 421 KNVRNAIRGRFPLYNLGFGHDLDFNFLEVMSMENSGWAQRIYEDHDATQQLQGFYNQVAN 480 Query: 448 EIFERV 453 + V Sbjct: 481 PLLTDV 486 >gi|307941757|ref|ZP_07657112.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307775365|gb|EFO34571.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 358 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 47/342 (13%), Positives = 91/342 (26%), Gaps = 39/342 (11%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 IITA + V++ G+ VD R + + + TA + A+V + E +R Sbjct: 2 IITAFVFFVLIVAIGVGVDYSRALTLKTRVLGSLDTAALAAAVEFSKLGSEQDARKAAKK 61 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 F Q + L L D + D + Sbjct: 62 AFDAQVSQLNLHGAKLKKLNIVTDDE----------------------TMKVSVDAVFEL 99 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + ++ G K+ + T+++A +E + I ID + SM R + L+ Sbjct: 100 PTTLMQIAGFKTLEVATRSDAVGGG----QEVILDIVMCIDATGSMGATLRSVQRNALS- 154 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 F +K Q + +R Y Y + L F + Sbjct: 155 FEANLKNRLKELGRQVDIIRVRPIYYWDYDYDGWSRSYGLKKSTFLKLPDQRTQFKNFVD 214 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 L + +++F F Sbjct: 215 SESAYGGGDWPEAGLECF------------NEGLRSSWFKTTNTRQSVFPVVALWTDAPA 262 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 T M ++ + ++ + ++N Sbjct: 263 DSPNNWRNINSSGTPYPSNMPRSWGGLEAAWNNGSLISQDNK 304 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 66/197 (33%), Gaps = 20/197 (10%) Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 ++ +D K DA+ V+ + R+G F + Sbjct: 99 AIDLSASMDYRDFPGPDGKPVSRFDAVQRVVDQFVA---NREGDRVGLIVFGAKAYLQLP 155 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 F+ ++ + TA+ D++ A SS D + + Sbjct: 156 FTRDLNTARALVDLMQV--GMAGPQTALGDSIGLAIRAFESSEVD-----------DRVL 202 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-VNKTQQEKAR-YFLSNCA--SPNS 427 +LLTDG +T I A+ GI I TI T +++ L++ A S Sbjct: 203 ILLTDGNDTASKMTPINAAEIAQLNGIEIYTIGIGDAEATGEDRIDFETLASIAERSGGQ 262 Query: 428 FFEANSTHELNKIFRDR 444 FF+A L +++ Sbjct: 263 FFDAQDETALRQVYDRI 279 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 58/174 (33%), Gaps = 17/174 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + LA V ++ D RM F + + ++ + +T Sbjct: 119 KQRLAGVRDVVRGFVEGRDGDRMALIVFGSAAYLQAPLTADLDAILALLDRTQV--GMAG 176 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 TA+ D++ A T SS D ++ ++LL+DG +T + + A Sbjct: 177 PHTALGDSIGLAIRTFESSEID-----------QRLLILLSDGSDTASRMDPVNAAEIAA 225 Query: 395 SQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + I T+ + L A + ++F A L ++ Sbjct: 226 GRDVEIFTVGVGDPDATGENRVDLDTLRAIADRTGGAYFFAADEAALTAVYESI 279 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 42/344 (12%), Positives = 89/344 (25%), Gaps = 27/344 (7%) Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + P +A + L + + + T S Sbjct: 50 SNAAAVKEYAGAPAYAALEKTARIAPAPSLPQNTEKYGKIESNPVQAVARNPVSTFSIDV 109 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + ++ + L + L + D Sbjct: 110 DTGSYANARRFLND--NRLPPAHAVRIEELINYFDYGYAPPSDGKPFAVYTETIDSPWQA 167 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ P + VD S K LV+ L + + ++ D V Sbjct: 168 DAKLIKIAIKAKEIRSSALPPANLVFLVDVSGSMQAQDKLPLVKKTLRILTKRLRAEDKV 227 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + G + + GSTA A+Q AY Sbjct: 228 T------LITYASNEKLVLPPTSGK---DKDTILQAINQLEAGGSTAGEQALQMAY---- 274 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTIAFSV 407 + + + N I+L TDG+ N + K + GI + T+ F Sbjct: 275 AQAQKAYIKNGINR-----ILLATDGDFNVGITDFNTLKDTVAEK-RKAGISLTTLGFGT 328 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ A ++ ++ E K+ + ++ + + Sbjct: 329 GNYNEQLMEQL--ADAGDGNYSYIDNETEAKKVLQRQLSSTLAT 370 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 67.7 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 70/202 (34%), Gaps = 33/202 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A IR++ V +G F+ + +L + + Sbjct: 113 SRLEAAKAAAREFIRAMPPG------VEVGLVAFSSYATLLQPPTTDRERLEQAVDLLDL 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI D + A + + + + +VLL+DG N + + Sbjct: 167 -----AHRTAIGDGLVAALRVL-PLEDSDAPGGMS-------VVLLSDGRNNYG-IDPLE 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKT----------QQEKARYFLSNCA--SPNSFFEANSTHE 436 +A++QG+R+ T+ +++ + L A + +++ A+S E Sbjct: 213 AARQAEAQGVRVYTVGVGLSENTYVFANGYYIRAGLDEETLQEIAALTGGAYYRASSADE 272 Query: 437 LNKIFRDRI-GNEIFERVIRIT 457 L +++ + R +T Sbjct: 273 LRAVYQTLARAVRLEVRPTEVT 294 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 62/207 (29%), Gaps = 41/207 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ + R Sbjct: 277 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSSARRLFHL-QRMSHYGRQQALQAI 329 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I + + I+LL+DG++T + Sbjct: 330 NSLGAGGGTNIADALKKATKVIEDR---------SYKNSVCSIILLSDGQDTYN------ 374 Query: 389 ICNKAKSQGIRIMT--------------IAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 IC+ + G + + + + L + A S +F Sbjct: 375 ICSNVRG-GSKDYSSLVPPSILSDTRRMLPIHAFGFGADHDSDSLHSIAEASGGTFSFIE 433 Query: 433 STHELNKIFRDRIGNEIFERVI--RIT 457 + F IG + V R+ Sbjct: 434 DEGVMQDAFAQCIGGLLSVVVQEMRLN 460 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 62/207 (29%), Gaps = 41/207 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ + R Sbjct: 277 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSSARRLFHL-QRMSHYGRQQALQAI 329 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I + + I+LL+DG++T + Sbjct: 330 NSLGAGGGTNIADALKKATKVIEDR---------SYKNSVCSIILLSDGQDTYN------ 374 Query: 389 ICNKAKSQGIRIMT--------------IAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 IC+ + G + + + + L + A S +F Sbjct: 375 ICSNVRG-GSKDYSSLVPPSILSDTRRMLPIHAFGFGADHDSDSLHSIAEASGGTFSFIE 433 Query: 433 STHELNKIFRDRIGNEIFERVI--RIT 457 + F IG + V R+ Sbjct: 434 DEGVMQDAFAQCIGGLLSVVVQEMRLN 460 >gi|220924567|ref|YP_002499869.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] gi|219949174|gb|ACL59566.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] Length = 407 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 31/230 (13%), Positives = 68/230 (29%), Gaps = 12/230 (5%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + G +I +++PV+LG +++D + L+ AA +A + A Sbjct: 2 RALLARFRADADGTMTMIVGIVLPVLLGTAAVVLDGANLHLSQLRLQNAADSAALGAVQV 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L S VS + + L ++ D + + Sbjct: 62 LPDSATAVSR--------GVSLVGQNLPPSYGTAAAATDVVVGTYDPGAKAFTAGGAQPN 113 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 A +V L F G +S A+AE+V+ + + + + +D Sbjct: 114 AVKVTARRSAALGNAIPVYFAWIFGYRSLE----AKAESVAVAAGGGNPAACLYALDPVN 169 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +D + + + + + + + N Sbjct: 170 PGIDARGSVTVNATCGMQLSSYISTSGNAGNYNGQICQSVGDTGATGNFN 219 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 58/196 (29%), Gaps = 34/196 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV+ L V+ + + D R+ F+ V D + + + Sbjct: 67 KLALVKKTLEFVVSQLNEKD------RLCLITFDTSVYLDFKLTPMTP-MNKYQTLKIIK 119 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D + T + + +I ++E + + + ++L TDG + Sbjct: 120 DISPGSMTNLCGGLMKGLCEVIDRADEEKNEVAS-------VLLFTDGFANKG-GLTNIY 171 Query: 390 C---NKAK---------SQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTH 435 C AK + I T F N L A ++ + Sbjct: 172 CSSSQTAKYTIGIVGPKTADASIYTFGFGSNHN-----AQMLKEISDAGSGMYYYIENVD 226 Query: 436 ELNKIFRDRIGNEIFE 451 + + F +G + Sbjct: 227 MIAEAFGQCLGGLLST 242 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 49/156 (31%), Gaps = 14/156 (8%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G F + L + E G TAI DA+ + + Sbjct: 123 KRNNDRLGLVVFAGAAYLQAPLTGDRESLSEILETIEEETVTEQG-TAIGDAIILSTYRL 181 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +S K IVL+TDG + + + + A+ G +I ++ Sbjct: 182 RNSKARS-----------KVIVLITDGVSNTGKIDPVTATDLAEQIGAKIYSVGIGKEDG 230 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L + + FF A E+ + Sbjct: 231 SYEINFEILQELSANTGGRFFRAEDPEEMKAVLSSI 266 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 49/156 (31%), Gaps = 14/156 (8%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G F + L + E G TAI DA+ + + Sbjct: 131 KRNNDRLGLVVFAGAAYLQAPLTGDRESLSEILETIEEETVTEQG-TAIGDAIILSTYRL 189 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +S K IVL+TDG + + + + A+ G +I ++ Sbjct: 190 RNSKARS-----------KVIVLITDGVSNTGKIDPVTATDLAEQIGAKIYSVGIGKEDG 238 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L + + FF A E+ + Sbjct: 239 SYEINFEILQELSANTGGRFFRAEDPEEMKAVLSSI 274 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 55/204 (26%), Gaps = 47/204 (23%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + I R+G + + + ++I+ Sbjct: 112 NRLEALKKVAINFIE-------GRPNDRIGLVIYAGESYTKTPLT-----TDKSIIFNAI 159 Query: 329 IDENEM----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D G TAI + T+ + + S + K I+LLTDGEN Sbjct: 160 NDLEYSQNIEGGTAIGMGLATSVNKLKDSKAES-----------KVIILLTDGENNAGFI 208 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--S 424 + A I+ TI N L A + Sbjct: 209 DPKTATQLATEYDIKTYTIGVGSNGMALSPVGIKANGQFEYRNIEVKIDEALLKTIAESN 268 Query: 425 PNSFFEANSTHELNKIFRDRIGNE 448 +F A + I+ + E Sbjct: 269 GGKYFRATDNQKFEAIYEEIDALE 292 >gi|301767172|ref|XP_002919036.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like [Ailuropoda melanoleuca] Length = 910 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/249 (11%), Positives = 69/249 (27%), Gaps = 10/249 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 244 PTCSTSLLNGDFKVTYDVNRDQLCDLLVANNHFAHFFAPQNLTNLNKNVVFVIDISTSME 303 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + I F V S + F Sbjct: 304 GQKVKQTKEALLKILGDIRPG---DYFDLVLFGSEVQSWRGSLVQASPANLRAAQDFVRR 360 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T +N + + + + + +++LTDGE T+ + I Sbjct: 361 FFLAGATNLNGGLLRGIEILNQAQ----GSLPELSNHASILIMLTDGEPTEGVTDRSQIL 416 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + + N +E + + + F D++ N Sbjct: 417 KNVRNAIRGRFPLYNLGFGHDVDLNFLEVMSMENNGRAQRIYEDHDATQQLQGFYDQVAN 476 Query: 448 EIFERVIRI 456 + V + Sbjct: 477 PLLVDVELL 485 >gi|281338027|gb|EFB13611.1| hypothetical protein PANDA_007566 [Ailuropoda melanoleuca] Length = 868 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/249 (11%), Positives = 69/249 (27%), Gaps = 10/249 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 199 PTCSTSLLNGDFKVTYDVNRDQLCDLLVANNHFAHFFAPQNLTNLNKNVVFVIDISTSME 258 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + I F V S + F Sbjct: 259 GQKVKQTKEALLKILGDIRPG---DYFDLVLFGSEVQSWRGSLVQASPANLRAAQDFVRR 315 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T +N + + + + + +++LTDGE T+ + I Sbjct: 316 FFLAGATNLNGGLLRGIEILNQAQ----GSLPELSNHASILIMLTDGEPTEGVTDRSQIL 371 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + + N +E + + + F D++ N Sbjct: 372 KNVRNAIRGRFPLYNLGFGHDVDLNFLEVMSMENNGRAQRIYEDHDATQQLQGFYDQVAN 431 Query: 448 EIFERVIRI 456 + V + Sbjct: 432 PLLVDVELL 440 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 67.7 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/291 (9%), Positives = 81/291 (27%), Gaps = 5/291 (1%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L G+ I+ + + ++G+ + +D + L+ +A + A+ L + Sbjct: 9 LRSKQNGNVLIVFTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAATELDEGAT 68 Query: 74 EVSSRAKNSF--TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +RA + + + + + N ++ + ++ Sbjct: 69 HEQARAAVVALIQLNIAHNDHHELASAIDFSIVNNGLDQMTAQLNVEFSQLPDPFIQDNN 128 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + L+ L L + S+ Q A A + S + ++ + + Sbjct: 129 ESAKYVKVSLSQLELDNFLADVFSFNKQVSATALSGPSSDISNCYQDLVPMVVCADTANA 188 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + L + + + +S G + +L + + Sbjct: 189 GVNGANEYGLKEYSLNLMKIGSNSNSPIGPGNFQLLRLGSNTGAADIRAAMAGDTDLGTV 248 Query: 252 SLSEEHFVDSSSLRHVIKKKHL---VRDALASVIRSIKKIDNVNDTVRMGA 299 + S + + + + D++ DT Sbjct: 249 CFTPGISNQSVPTEPGNTVGPVAQGLNTRMGEWHGPVNSTDHLRDTNICEG 299 >gi|291227856|ref|XP_002733898.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like, partial [Saccoglossus kowalevskii] Length = 627 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 42/346 (12%), Positives = 85/346 (24%), Gaps = 24/346 (6%) Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 N K + V SR L + R +G+ I + Sbjct: 145 STNTFKVSVNVAAGSRLSFTLTYQEMLQRRLGVYEHRISIRPGQIVNRLKIDVYITEPQG 204 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 ++ ++ DS + + + RDE+ + + Sbjct: 205 LEFIYAPPIVTNVLDSWNIDQSADSPATIDRLSDTRAHVYYYPTRDEQKNQSSEGISGDF 264 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND------- 293 + + V+ + VI +D Sbjct: 265 VIRYDVSHDLSAGQIQVLNGYFVHYFAPTGLPPVQKNVLFVIDVSGSMDGAKMGQTKEAL 324 Query: 294 ----TVRMGATFFN--DRVISDPSFSWGVHKLIRTIV----KTFAIDENEMGSTAINDAM 343 FN + + L K F + G T N + Sbjct: 325 RVILDDMRSFDRFNILTFSYEVSFWKENMMILATQENILEAKNFVNNLRASGGTNFNGGL 384 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRI 400 + + +D + ++ +++LTDG+ T + I AK + Sbjct: 385 VEGVEMLRRVTDDAENTERSAF----LVIMLTDGQPTSGETQLTKIQENAKTYIDGQYSL 440 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + F + + + L N +E + K F D + Sbjct: 441 FCLGFGGDVNFKFLQKISLENQGIARRIYEDADGYLQLKGFYDEVA 486 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 71/241 (29%), Gaps = 16/241 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 KLI+S G + A+L+ +M G G+ VD L+ AA A + + L Sbjct: 5 SKLIRSEKGMVIVYVAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTR 64 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK-------KNFTDREVRDIVRDTAVEMNP 124 + + + + + ++ + N +N + + + T P Sbjct: 65 PTDPTQLPTLQWEVARFQAQQMITENSSDNTALKDAMVDVGYWNTNSNILQPTTLPTPVP 124 Query: 125 RKSAYQVVLSSRYD---LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-Q 180 V + + + F++ G + ++A + Y V Sbjct: 125 GVDIPAVRVIASRSDGNNGGPVKNFFMQIFGKDYSQVSSRAAVAMLGFPYTVPPAVPAEL 184 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + S+ M D P P S G + + + N Sbjct: 185 FPLALSKCMTDQYFSQVPMP-----DPPPEIRISSPYIPGGDTCYSGQWTSFKADTNDVR 239 Query: 241 Y 241 Sbjct: 240 T 240 >gi|146307722|ref|YP_001188187.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145575923|gb|ABP85455.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 566 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 38/328 (11%), Positives = 94/328 (28%), Gaps = 25/328 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V + + ++ ++ T S +++ ++ Sbjct: 76 SSVADAALPGYRDVPREQYQNYPDNPVFAVSETPVSTFSIDVDTGSYANVRRFLN--GGQ 133 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 L + + L + A + + + +P ++ Sbjct: 134 LPPKDAVRLEELVNYFPYAYPLPQGDAPFGVSTELAVTPWNPQTRLLRIAIKASDRSVEE 193 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + VD S H + +V+ L ++ ++ D V+ + Sbjct: 194 LPPANLVFLVDVSGSMHRREGLPMVQGTLKLLVDQLRPQDRVS------LVTYAGATQVV 247 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + ++ GSTA +Q AY +S + Sbjct: 248 L---DSTPGSDKAKIRAAIDQLTAGGSTAGESGIQLAYQ--QASKHLIEGGINR------ 296 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + ++ A + G+ + T+ F V+ + A Sbjct: 297 -ILLATDGDFNVGISDFDSLKQLAADKRKTGVSLTTLGFGVDNYNERLMEQL--ADAGNG 353 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ ++ E K+ D++ + + Sbjct: 354 NYAYIDNLREARKVLVDQLASTLATVAS 381 >gi|50119743|ref|YP_048910.1| hypothetical protein ECA0798 [Pectobacterium atrosepticum SCRI1043] gi|49610269|emb|CAG73712.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 543 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 53/474 (11%), Positives = 116/474 (24%), Gaps = 55/474 (11%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + G AL +L +VD + +K+A A + Sbjct: 21 LRERLSAFVHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGH 80 Query: 67 PLIQSLEEVSSRAKNSFTF----PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + EE S N + + + L + R T Sbjct: 81 QATINGEEYSQEDTNKLAYEYVKNNLGMNKALSEKLVASDVSVTEGRNSATRKTYTVTAA 140 Query: 123 NPRKSAYQV------VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 K + + S +++ P + L ++VSRS+ + Sbjct: 141 FETKPSLLSLGAKKQEVYSTSEVINRPTEIALVMPVTGDMSEGDIRSLKSVSRSFVERML 200 Query: 177 VSIQWVIDFSR-SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 S D S++ Y + + + S + + + + Sbjct: 201 SSADGKRDNLWLSLVPYSQSVNVYDAEDANRIRRWSTPSALNPPELRSLFASGVVSSLAD 260 Query: 236 CNKSLYYMLYPGPLDP-SLSEEHFVDSSSLR--HVIKKKHLVRDALA-----SVIRSIKK 287 E+ F D + + L ++ S Sbjct: 261 RRFPDRRANLLCMYRGLGREEDFFWDEPPAGQFSIYYRHDLPQNGSPGAPPISWRGPNPD 320 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + ++ T F P+ + + + + + M A Sbjct: 321 LYPWDNNSNAVDTRFIVADRGCPNAALMPLTNEESRLNQRIEQFSARFNVNYAIGMSWAG 380 Query: 348 DTIISSNEDEVH----------RMKNNLEAKKYIVLL---------TDGENTQDNE---- 384 + + + N + +K IV++ TDG N + N+ Sbjct: 381 AALSPNMRGSDGWGDAKLPLDFNLDGNGDGQKVIVMMANTIGNWFDTDGYNFKRNQFSGS 440 Query: 385 ------------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +C+ ++ I+ + ++ R A+P Sbjct: 441 TGSDVAREFAQQRFRDLCSSFHARNIKFYFVGI-RPGDPEDFGRNLFDTEATPG 493 >gi|33989|emb|CAA45188.1| inter-alpha-trypsin inhibitor heavy chain ITIH1 [Homo sapiens] Length = 911 Score = 67.3 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVTRDEIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 70/204 (34%), Gaps = 26/204 (12%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + A VI + D R+G +N V++ + ++ Sbjct: 53 DTSYSMRDTDPEGIAAEVISMFMDLS-DADRTRVGFVAYNHHVVASKPLTSIGVAAQKSQ 111 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD- 382 ++ N G T + ++ + + + +++LL+DGE Sbjct: 112 IQQEIRMLNRSGYTDLGLGLRKGSELLAAGASQGRQP---------FMILLSDGETDFGV 162 Query: 383 ----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFE 430 N + ++ A+++G + TI + + T R L AS + F Sbjct: 163 SSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDGTVN---RQELERIASQTGGASFI 219 Query: 431 ANSTHELNKIFRDRIGNEIFERVI 454 +S +L +I ++I +++ Sbjct: 220 TSSAEDLPEILNRIFASQIRSKLV 243 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 64/186 (34%), Gaps = 27/186 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V+D +++ I +G F ++ + L + + + + Sbjct: 109 TRFDVVKDIVSNFI-------KERQNDNIGLVVFGAYSFIASPLTYDENILNKIVSQLYI 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGI 387 TA+ ++ + + K + K +LLTDG +T + ++ Sbjct: 162 --GMAGKYTALFTSLAQGVNLL-----------KMSESKSKVGILLTDGFSTPEVDKIPF 208 Query: 388 AIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + A + I+I I + L A + F A S EL +++++ Sbjct: 209 DVALDMAIKEKIKIYPIGIGMPHEYN---IEVLRKIAEKTGGKAFGAASATELKEVYKEI 265 Query: 445 IGNEIF 450 E Sbjct: 266 DALEKS 271 >gi|260437096|ref|ZP_05790912.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] gi|292810406|gb|EFF69611.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] Length = 623 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 33/241 (13%), Positives = 70/241 (29%), Gaps = 39/241 (16%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + ++ C+ S+ + K Sbjct: 83 WNKEIKYPNSNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRK---- 138 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + S + + R G F + + ++ Sbjct: 139 --------LASKNYVKAQGNDDRTGIVLFTSEANTVCELTN-----SEYVLMNAIDKIYS 185 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T N+A++ + + ++ D +K I+L++DGE +E ++ + A Sbjct: 186 NGGTNFNNAIKESIRILTNTRNDS----------EKRILLVSDGE----SELSSSVIDLA 231 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR-IGNEIF 450 I+I T+ L N A + +F+A + EL I+ + I +I Sbjct: 232 IENNIKINTVYIGGQNN-----NELLKNVAERTGGKYFKAVTADELINIYSEIMIDQKID 286 Query: 451 E 451 Sbjct: 287 S 287 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 66/181 (36%), Gaps = 11/181 (6%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 DA V++ + R+G F D F+ + + +T Sbjct: 127 DAAKEVLKQFAAT---REGDRLGLILFGDAAFLQAPFTADLETWQTLLQETDVA--MAGQ 181 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 ST + DA+ A +S+ + +N+ + +K ++LTDG +T A Sbjct: 182 STHLGDAIGLAIKVFNNSDRHG-QQDQNSAKREKVAIILTDGNDTGSFVSPRDAARVAAV 240 Query: 396 QGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 G+R+ TIA +Q L A + F+A +L + + IG + Sbjct: 241 NGVRLHTIAMGDPATVGEQALDLDTLQQLATLTGGQLFQALDEAQLTRAY-QVIGELEPQ 299 Query: 452 R 452 R Sbjct: 300 R 300 >gi|221042216|dbj|BAH12785.1| unnamed protein product [Homo sapiens] Length = 677 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|221042196|dbj|BAH12775.1| unnamed protein product [Homo sapiens] Length = 645 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|189054356|dbj|BAG36876.1| unnamed protein product [Homo sapiens] Length = 911 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|119585665|gb|EAW65261.1| inter-alpha (globulin) inhibitor H1, isoform CRA_c [Homo sapiens] Length = 911 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|119585664|gb|EAW65260.1| inter-alpha (globulin) inhibitor H1, isoform CRA_b [Homo sapiens] Length = 893 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|156119625|ref|NP_002206.2| inter-alpha-trypsin inhibitor heavy chain H1 isoform a [Homo sapiens] gi|2851501|sp|P19827|ITIH1_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; AltName: Full=Inter-alpha-trypsin inhibitor complex component III; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|825630|emb|CAA49279.1| inter-alpha-trypsin inhibitor heavy chain H1 [Homo sapiens] gi|62550746|gb|AAH69464.1| Inter-alpha (globulin) inhibitor H1 [Homo sapiens] gi|119585663|gb|EAW65259.1| inter-alpha (globulin) inhibitor H1, isoform CRA_a [Homo sapiens] gi|158258937|dbj|BAF85439.1| unnamed protein product [Homo sapiens] Length = 911 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 99 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 483 >gi|825681|emb|CAA34346.1| inter-alpha-trypsin inhibitor C-terminal [Homo sapiens] Length = 837 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ I +++ + + + + V Sbjct: 25 DFAVTADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 84 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q + + + + I + Sbjct: 85 TFQLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 144 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 145 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 194 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 195 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 251 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 252 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQ 311 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 312 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 367 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F ++ + V Sbjct: 368 NFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDV 409 >gi|330719552|gb|EGG98147.1| BatA [gamma proteobacterium IMCC2047] Length = 166 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 34/90 (37%), Gaps = 15/90 (16%) Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEK 414 + ++LLTDG NT E + A+ Q I+I TI N + Sbjct: 13 SRLLILLTDGANTAGEIEPLKAAELAQQQQIKIHTIGVGANEMLVPGLFSSRRVNPSADL 72 Query: 415 ARYFLSNCAS--PNSFFEANSTHELNKIFR 442 L AS +F A T +L +I+ Sbjct: 73 DEDTLKKIASQTGGQYFRAQDTEQLQQIYA 102 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 57/184 (30%), Gaps = 29/184 (15%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S I K + + A A +I ++ D+V F+ F + Sbjct: 421 SNLGNINKLEIAKSAAAKMIDHLESSDSVG------VIAFDHNFYWASKFGKLKS---KN 471 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 V G TAI + A +T+ S K IVLLTDG + Sbjct: 472 EVIENISGIQIGGGTAIIPPLTEAVNTLRKS-----------KAKDKVIVLLTDGYGEEG 520 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 E + AK I+I TI + LS A + F+ L + Sbjct: 521 GYEYP--ASIAKRNNIKITTIGVGSSINAP-----ILSWMAAYTSGRFYYVKDASNLIDV 573 Query: 441 FRDR 444 F Sbjct: 574 FLKE 577 >gi|119613593|gb|EAW93187.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_b [Homo sapiens] Length = 523 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 70/229 (30%), Gaps = 14/229 (6%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + ++Y G + +D SS K + A+ ++ ++ D Sbjct: 152 YIYDDYFIHYFAPRGLPPMEKNVVFVIDVSSSM-FGTKMEQTKTAMNVILSDLQANDY-- 208 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 F+D V + + K + G T +N A+ A + Sbjct: 209 ----FNIISFSDTVNVWKAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLN 264 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 SN++ I+ LTDGE T I + + + + ++AF + Sbjct: 265 HSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDD 322 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 R L N +E K + I + V R+ Sbjct: 323 ADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLADV-RLN 370 >gi|149188658|ref|ZP_01866950.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] gi|148837568|gb|EDL54513.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] Length = 346 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 39/188 (20%), Positives = 68/188 (36%), Gaps = 21/188 (11%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 SSS + + V+ L + + + R+G F D F+ + Sbjct: 116 SSSDGQAVSRLDAVKSVLHEFVAT-------REGDRLGLILFGDAAYLQTPFTADHDVWL 168 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +T ST + DA+ A S +++ + +K +V+LTDG +T Sbjct: 169 ALLDQTEVA--MAGQSTHLGDAIGLAIKVFEQS--------ESSKDKEKVVVVLTDGNDT 218 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHE 436 E A ++G+RI IA +Q + N A S F+A Sbjct: 219 GSFVEPKDAAIVAAAKGVRIHVIAMGDPATIGEQALDMATIDNIASQSGGQAFQALDQEA 278 Query: 437 LNKIFRDR 444 L + +R Sbjct: 279 LQQAYRTI 286 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 52/167 (31%), Gaps = 19/167 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 +D ++D V + ++ GSTA+ A+ + Sbjct: 98 SDGDLFSLVVYDDSVETLVPAQPVSDIGD---IEARIRRIRPGGSTALFGAVSQGAAEVR 154 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN---KAKSQGIRIMTIAFSVN 408 ++ +VLL+DG + +GI + T+ + Sbjct: 155 KHSDAPYVNR---------VVLLSDGLANVGPSRPADLARLGAALLKEGISVTTVGVGTD 205 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + S + + S+ +L +IF +G+ + V R Sbjct: 206 FNEDLMTQLAER---SDGNHYFVESSRDLPRIFAAELGDVLSV-VAR 248 >gi|307941490|ref|ZP_07656845.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307775098|gb|EFO34304.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 611 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 45/401 (11%), Positives = 98/401 (24%), Gaps = 24/401 (5%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + A A +V E L + D+ + Sbjct: 35 QVAVDETTQQADTTQTVEEPQVEVGELEITEAEPNPSENPLAAPLISEQVDMLADQAKQP 94 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 A K + T S Sbjct: 95 ATGIVASRQRATKMLAGENFPPVPVYPYQVNTERFPEAESNPLKQVKTDPVSTFSVDVDT 154 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 ++ + + + +N F + + V + + + Sbjct: 155 ASYAFVRSELVN-GGKPNPDAVRAEEMINYFQYDYKVPDSREAPFSTNVSVVETPWNSDT 213 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + L P P + +D+S K L++ A ++ ++++ D + Sbjct: 214 KLLHIGLKGYTVPLDDLPPQNLVFLIDTSGSMSDENKLPLLQQAFRLLLSTLREDDTIAI 273 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + ++ + GSTA + ++ AY + Sbjct: 274 V---------TYAGNAGVLLEPTALSDKSKIAEAIAALTSGGSTAGHAGLKEAYRLAETM 324 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKT 410 +D + K I+L TDG+ + + + GI + + F Sbjct: 325 QDD---------DTKSRIILATDGDFNVGLSSADDMKRFVKEKRDSGITLSVLGFGRGNY 375 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 E + N ++ E K+ D+I + I Sbjct: 376 NDELMQALAQN--GNGVAAYIDTLSEARKVLVDQIVSSIST 414 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 44/418 (10%), Positives = 110/418 (26%), Gaps = 36/418 (8%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A ++ + F + + + ++ T + + + Sbjct: 2 MAWSARGGGGAKIDLLKGAASDFVETILNDSEEDRVSISIIPYSTKVNAGEDLLEQYTVT 61 Query: 123 NPRKSAYQVVLSSR--YDLLLNPLSLFLRSMGIKSWLIQTKAEAET------VSRSYHKE 174 ++ V ++ DL ++P + R+ + + T Sbjct: 62 QEHDYSHCVDFNADDYTDLAISPRTELQRTGHFQFQQMSTSDPRSGQWVCRHDGGFSITP 121 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS--PY 232 S+ + ++ S D + + S G D P+ Sbjct: 122 LSKSVSDLKRQIAALTPEGSTSIDMGAKWGLALLDPSARGPVSALIASGQVDASFRERPH 181 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 S+ ++ + +S + + + + ++ + Sbjct: 182 PHDAENSMKVLVLMTDGKNDNEYRLQPEYASGDAKVIRTT---STASHLFYTVDAPETNG 238 Query: 293 DTVRMGA----TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 F+ + ++ +A E + + + Sbjct: 239 TNDGSWPNNERYFYATHPFEEERMWDEHTLRDNPRLERYANQLEEK-----HLSWPEVWA 293 Query: 349 TIISSN-EDEVHRMKNNLEAKKYIVLLTDGEN-------TQDNEEGIAICNKAKSQGIRI 400 + ++ + N L + +N + + IC A+ GI I Sbjct: 294 EMSPEYYGYNLYGRQGNSWWSWNSRLQSFWQNMHSTIGTDEKDRRLRNICAAAQRAGIVI 353 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 I V+ + + L CAS + + E+ F D I I ++R+TK Sbjct: 354 YAIGMDVDS---QNSLDLLKECASTEAHYFDVDGLEIQTAF-DMIAASIS--MLRLTK 405 >gi|13473814|ref|NP_105382.1| hypothetical protein mll4535 [Mesorhizobium loti MAFF303099] gi|14024565|dbj|BAB51168.1| mll4535 [Mesorhizobium loti MAFF303099] Length = 373 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/244 (9%), Positives = 59/244 (24%), Gaps = 5/244 (2%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +S +G + L++ ++ G VD +QA A+++ + S Sbjct: 22 FARSRSGSMMPLFFLMLVPIISAVGFSVDYTSAIQTRSNQQQALDAALLSITTMDTTSTL 81 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD-TAVEMNPRKSAYQVV 132 A + + +F + T R + + Sbjct: 82 AQRQTALQDSFIANGGLGTATLNSFVAGTTTTPATGQASASFSMPTIFMKIARIDSVPIA 141 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSW----LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 ++S ++ + + W + + + + D Sbjct: 142 VASAVSKPPALVNATFKIAKVSGWWNKTMTLYGTQFGATAAKPLMSIEYTYNGFGDPKGY 201 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 V + +S G+ + + S S + P Sbjct: 202 GTTNVYTITNNGGADIKTLVQSQVCATASVTTFTGLPADAILQTSGSRKYSTTCVNTMYP 261 Query: 249 LDPS 252 + + Sbjct: 262 ANGA 265 >gi|218710404|ref|YP_002418025.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] gi|218323423|emb|CAV19600.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] Length = 422 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 29/287 (10%), Positives = 72/287 (25%), Gaps = 10/287 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ + V L V + VDV + L+ A +A + A+ L S ++ Sbjct: 10 RKQQGLVVVLVTAALLVFLAVSALAVDVNHMLVNKTRLQNAVDSAALAAATILDNSKDKD 69 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + A+ + + + + D T V + Sbjct: 70 AVDAEIGTAL--NAMAASTGNHEIDFTTASINIDYSNDPQDFTGTATFGDDDDVYVRVRV 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE-HGVSIQWVIDFSRS-MLDYQ 193 L+ F++ G+ + + + + + + S + + Sbjct: 128 D---SLDMDEFFIQMFGLVKEVSASAVAGPSSGQEVVNNVVPIGVCIGDGTSDNDVDPED 184 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM---VSCNKSLYYMLYPGPLD 250 + ++ + G + + Y G D Sbjct: 185 GYHDVTGDAITSVFGYEVGTVHALKVGDSSLSEMGNGNYHLLDFGSGGKAIKEGLGGSYD 244 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + + + V + + D +D V Sbjct: 245 QPVKIGEDITTKPGGTVGPTGDGLNTRFGDYGGGLTASDYPSDYVTT 291 >gi|194289206|ref|YP_002005113.1| lipoprotein, von willebrand factor type a domain [Cupriavidus taiwanensis LMG 19424] gi|193223041|emb|CAQ69046.1| putative lipoprotein, Von Willebrand factor type A domain [Cupriavidus taiwanensis LMG 19424] Length = 570 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 93/325 (28%), Gaps = 25/325 (7%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSM 189 L+ + L R I I+ A+A + S + S + + + Sbjct: 72 AALALPHPYPLPAPEDRERYGTIDENGIRLVAQAPVSTFSIDVDTGSYSNVRRLLNAGRV 131 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + LN F + + P V + Sbjct: 132 PPADAVRVEELLNYFPYDYAQPTDGRP-FAVHTALAPAPWHPSNVLLRIGIKGKDMASGA 190 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P+ + VD S + K L++ +L ++ ++ D R+ + Sbjct: 191 LPAANLVFLVDVSGSMNTPDKLPLLKSSLKLLVNQLRAQD------RITLVTYASGTRVA 244 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + G + + GSTA + AY + + N Sbjct: 245 LPPTPGS---DKGAIVAAIDQLVAGGSTAGASGIALAYQ----AAQQSYIAGGINR---- 293 Query: 370 YIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 ++L TDG+ + ++ + G+ + T+ F ++ A Sbjct: 294 -VLLATDGDFNVGVTDFRQLKSMVEDKRKSGVSLSTLGFGTGNYNEQLMEQL--ADAGDG 350 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 ++ ++ E NK+ I + + Sbjct: 351 AYSYIDNLMEGNKVLVSEISSTLAT 375 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 57/197 (28%), Gaps = 41/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++D ++ R+G + + + +I I + Sbjct: 108 DRITALKDIAVKFVQ-------KRPNDRIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNS 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI + + A + ++ S K ++L+TDG + N Sbjct: 161 AGLEP--GTAIGEGLSVAVNHLVKSKAKS-----------KVVILMTDGVSNIQNAIPPQ 207 Query: 389 IC-NKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNS 427 + AK+ I++ I N L A + Sbjct: 208 VAAELAKNNNIKVYAIGIGTNGYALMPTSQDIFGDLVFTETEVTIDENTLREIAQTTGGK 267 Query: 428 FFEANSTHELNKIFRDR 444 +F A S L +++ + Sbjct: 268 YFRATSNSSLEEVYDEI 284 >gi|118353826|ref|XP_001010178.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291945|gb|EAR89933.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 547 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 52/166 (31%), Gaps = 18/166 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F+ V ++K + + G T IN M+ A+ + Sbjct: 158 NDRLCLIAFDSCVSRRCHLMKTNSSNKPNLIK-IINEIHCHGGTNINSGMELAFRVL--- 213 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK-AKSQGIRIMTIAFSVNKTQQ 412 + + I LL+DG++ + K + I + F + Sbjct: 214 ------KERKYYNPVSSIFLLSDGQDGGADLRVRQSLEKHLSQECFTIHSFGFGSDHDGP 267 Query: 413 EKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ S +F+ +++++ F D +G I Sbjct: 268 -----LMNKICSLKDGNFYYVEKINQVDEFFVDALGGLFSVIAQEI 308 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 54/178 (30%), Gaps = 30/178 (16%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G F + + + L + + + G TAI A+ A + + +S Sbjct: 138 NDRIGLVVFRGKGYTQCPLTLDHEVLAMLLDRLSPGVIQDDG-TAIGTAILIAVNRLKAS 196 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI--AFSVNKTQ 411 K ++L+TDGEN + A G+RI I F V + + Sbjct: 197 -----------ESLHKVLILVTDGENNAGEVGPGTAASIAARSGVRIYVINAGFKVVEDR 245 Query: 412 QEK----------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L A + +F F I + R T Sbjct: 246 IDPPEESGRYIQKDEESLQGIARTTGGGYFRVEDP----AAFDQTIRSIDRLEKKRFT 299 >gi|60682855|ref|YP_212999.1| hypothetical protein BF3393 [Bacteroides fragilis NCTC 9343] gi|60494289|emb|CAH09084.1| conserved exported hypothetical protein [Bacteroides fragilis NCTC 9343] Length = 610 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 48/442 (10%), Positives = 126/442 (28%), Gaps = 40/442 (9%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 A+++ +++ V + + + A + I+ + + N Sbjct: 8 AMVLALLMAVISLATVSAQAITVSGTVTDAKDGTPLVGCSVQIKGTTKGTVTNMNGQYTI 67 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM---------------NPRKSAYQV 131 + K E L+ + ++ + V+ A E+ + + Sbjct: 68 QSKKGETLLFQYIGYKQEKRVVKSSTLDVKMKADELVLEECVVVGYGHELRATKSMSTAY 127 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSML 190 + ++ I+ + ++A + S + S + Sbjct: 128 MAVCPASGIMYNAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDAASYSNMRRFINKGKLP 187 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F + + + L P Sbjct: 188 PVDAIRTEELVNYFSYDYPKPT-GSDPVKITMEAGTCPWNADHRLVRIGLKAKEIPTDNL 246 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D S + LV+ +L ++ +++ D V + Sbjct: 247 PASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVA------IVTYAGNAGVKL 300 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G + ++ + GSTA + + AY + NN Sbjct: 301 EATPGS---DKQKIREAIDELEASGSTAGGEGIMLAYKI----AQKNFISGGNNR----- 348 Query: 371 IVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++E + + + GI + + + + + K + + Sbjct: 349 IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEK--GNGN 406 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 ++ E N++ + G + Sbjct: 407 HAYIDNLQEANRVLVNEFGATL 428 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 57/187 (30%), Gaps = 29/187 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S I K + + A A +I ++ D+V F+ F Sbjct: 418 MGGSNLRNINKLEIAKSAAAKMIDHLESSDSVG------VIAFDHNFYWASKFGKLKS-- 469 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + V G TAI + A + + K + K IVLLTDG Sbjct: 470 -KNEVIENISTIQVGGGTAIIPPLTEAVNLL-----------KKSKAKDKVIVLLTDGYG 517 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + E + AK I+I TI + LS A + F+ L Sbjct: 518 EEGGYEYP--ASIAKRNNIKITTIGVGSSINAP-----ILSWMAAYTSGRFYYVKDASNL 570 Query: 438 NKIFRDR 444 +F Sbjct: 571 IDVFLKE 577 >gi|294651171|ref|ZP_06728503.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292822924|gb|EFF81795.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 446 Score = 67.3 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/295 (12%), Positives = 78/295 (26%), Gaps = 24/295 (8%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 +A + + + + +N F + + Sbjct: 82 VTDQAVSTFSIDVDTGSYTNTRRFLNDGRLPPVDAVRIEEMINYFDYQYPQP-NGVHPFS 140 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 D + P + VD S K LV+ L Sbjct: 141 VTTETVDSPWKENAKLIRIGIQAKDLALQQLPPANLVFLVDVSGSMSAADKLPLVKQTLR 200 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + ++ D V + + G + + GSTA Sbjct: 201 ILTEQLRAQDKVT------IITYASGEKLVLEPTSG---EQKEKILAVINGLRARGSTAG 251 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQ 396 A+Q AY E + N I+L TDG+ + + + + Sbjct: 252 EQAIQLAYK----QAEKAFVKNGINR-----ILLATDGDFNVGITDFNTLKGMVAEKRKS 302 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 GI + T+ F ++ A ++ ++ +E K+ + ++ + + Sbjct: 303 GISLTTLGFGTGNYNEQLMEQL--ADAGDGNYSYIDNKNEAKKVVQRQLSSTLAT 355 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 54/194 (27%), Gaps = 25/194 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP-SFSWGVH 317 +D S + + ++A I + ++G ++ + + Sbjct: 75 IDKSGSMQTDNRINAAKNAAKGFIDLMDMTK-----HQVGIVGYSSVAETSSLPLTTDTA 129 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 K F G T A+ A + S + IV++TDG Sbjct: 130 AA-----KQFIDPIVASGGTETGYAIDQAITLLSSHRPEAQP----------VIVIMTDG 174 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCA-SPNSFFEANST 434 E AK GI TIA + + L A + + + Sbjct: 175 EANSSQAALER-AQAAKDAGIVFYTIALLGPNDNPDTSAPNELLKQMATTNSHHHFVLGS 233 Query: 435 HELNKIFRDRIGNE 448 L +I+ + Sbjct: 234 TGLAEIYAAIVAEI 247 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 54/187 (28%), Gaps = 28/187 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + + + + A+++ + +D T +G F Sbjct: 760 SMSQTVGDVTRLDVAKQAVSA---AANLLDP--QTGSLGVVMFGSEAEVALPLGPLP--- 811 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G T I +Q A+ + +S+ D H IV++TDG Sbjct: 812 DAAGIAAALGHLQPGGGTNIYPGLQLAFQALRASDADARH-----------IVVMTDGM- 859 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 D + + +++GI + ++A + A F L Sbjct: 860 -SDEADFPGLLAAIRAEGITVSSVAIGSTS-----ETSIAEDIALLGGGRFHNTRDFGAL 913 Query: 438 NKIFRDR 444 I Sbjct: 914 PSILAQE 920 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 65/192 (33%), Gaps = 22/192 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ + + R+G F + + + + ++ + Sbjct: 120 SRLDAVKSVAREFVTR-------HSNDRIGVVVFKGKGYTLSPLTLDHR-VTGMLIDNVS 171 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TA+ A+ A + + +S D +K I+L +DG + + + Sbjct: 172 PDVIRDEGTAVGTAVLIAVNRLRASQSD-----------QKVIILFSDGVSNAGEIDPVT 220 Query: 389 ICNKAKSQGIRIMT-IAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRI 445 + A +QGIRI T A S + L A +F A ++ L + F Sbjct: 221 AASFAAAQGIRIYTAGAGSASSASSALDEGELRRVALTAGGRYFRAGTSASLAEAFESID 280 Query: 446 GNEIFERVIRIT 457 E E +T Sbjct: 281 RLEKSELTSPVT 292 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 36/296 (12%), Positives = 77/296 (26%), Gaps = 27/296 (9%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + S + + +N F + I Sbjct: 122 STFGLDVDTGSYSNVRRLLNDGRLPPADAVRAEAFINYFDYGYPAPATPAVPFSLTTEIA 181 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + P+++ +D+S K L++ AL ++ + Sbjct: 182 PAPWNAQRQLLLVGIQGYRVAPQDIPAVNLVLLIDTSGSMADRAKLPLLKSALRQLVTQM 241 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + D R+ + + G + GST ++ Sbjct: 242 RAQD------RVAIVAYAGSAGLVLPSTPGDRHAQ---ILAAIDGLQASGSTNGGAGLEL 292 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 AY + IVL +DG+ + + + + +GI + T Sbjct: 293 AY---------AEAAKGLVKDGVNRIVLASDGDFNVGRTDLAQLKDYVGSQRKRGIALTT 343 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + A A S+ +S L + R +E+ ++ I K Sbjct: 344 LGLGSGNYNDAMAMQL--ANAGDGSYHYIDS---LLQA-RKVFASELSATLLTIAK 393 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 58/176 (32%), Gaps = 28/176 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R ++ + + + TAI DA+ A + Sbjct: 135 DRLGLILFGTRAYLQTPLTFDGATVAAMLRDSVV--GLAGRETAIGDAIGLAVKRLREQP 192 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--- 411 E ++ ++LLTDG+NT + + A G+R+ TI + Sbjct: 193 EG-----------QRVLILLTDGDNTAGALDPLEAAELAAQAGVRVYTIGIGGGELGVRS 241 Query: 412 ----------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + L A + F A+S +L ++ + E ER R Sbjct: 242 LFGMRLLRQASDFDPATLERIAEITGGRAFTADSRQQLEAVYDELDRLEPSEREQR 297 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 58/188 (30%), Gaps = 22/188 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ V++ + ID R+ F D ++K Sbjct: 78 KIEKMKTAMKFVVKKLSSID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKL-ID 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I+D +QT + + ++L++DG+ + Sbjct: 131 ALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRGEP----- 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + + T F + ++ +F N + L+ F + + Sbjct: 178 AANVKIGNVPVYTFGFGADYDPTVLNAVARNSM--GGTFSVVNDVNLLSMAFSQCLAGLL 235 Query: 450 FERVIRIT 457 V +T Sbjct: 236 TVVVQDLT 243 >gi|118096699|ref|XP_414253.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Gallus gallus] Length = 886 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 51/172 (29%), Gaps = 7/172 (4%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K +D++ + F V + F + G T + + Sbjct: 307 LKILDDIKEDDHFNFILFGSDVHIWKETLIKATPENLDEARKFVRSIDTEGMTNLYGGIM 366 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIM 401 D + +++E + ++ I++LTDG+ I K + Sbjct: 367 KGIDMLNAAHEGNLVPKRSAS----IIIMLTDGQPNVGISNTQDIQTHVKKAIEGKYTLY 422 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F + L N + + + + F D + N + V Sbjct: 423 NLGFGYGVDYNFLEKMALENKGLARRIYPDSDSALQLQGFYDEVSNPMLLDV 474 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 10/62 (16%), Positives = 23/62 (37%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + G+ +I AL P ++ + V++ L+ A TA + + Sbjct: 6 RCRRWLAAFGRDERGNIALIFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTAALAGA 65 Query: 66 VP 67 Sbjct: 66 NE 67 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 57/200 (28%), Gaps = 38/200 (19%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + + + I + R+G F + + L+ Sbjct: 102 ANDFSPTRLEAAKKVASEFIND-------RQSDRIGLVIFAGESFTQCPLTTDHRVLL-- 152 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + TAI + + + + S + ++LLTDG N Sbjct: 153 NLLSEVKFGMIEDGTAIGLGLANSVNRLKDSQSKS-----------RVVILLTDGSNNAG 201 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCAS-- 424 + A S GIR+ TI T + L+ AS Sbjct: 202 QIAPLTAAELAASYGIRVYTIGIGSRGTSVARVMTPYGMQSMNVSGDFDERTLTEIASKT 261 Query: 425 PNSFFEANSTHELNKIFRDR 444 S+F A L+ I+ + Sbjct: 262 GGSYFRATDNTSLSGIYDEI 281 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 50/396 (12%), Positives = 108/396 (27%), Gaps = 24/396 (6%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + A + + E ++ + + + D Sbjct: 39 EIAAKSEPKIQVAKPEAPVAKEERKAEPDSEAFHAPGAASGAMQSVDQAADAAAPMSTMG 98 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG- 176 AV + R + + D+L P R + +++ A + S + Sbjct: 99 GAVGLAARSRMETIPAPAPADMLPPPAENRERFGNADANPVKSVAAEPVSTFSVDVDTAS 158 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S + M + + +N F R + V + + Sbjct: 159 YSFVRRSLMAGEMPNPDAVRVEEMVNYFPYDWPRPTTAAEPFKATVTVTPTPWNAGTRLM 218 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + ++ P + +D S K L+++A ++ ++ D V+ Sbjct: 219 HVAIKGYEVVQKEAPRANLVFLIDVSGSMDEPDKLPLLKNAFRLLVDRLRPDDTVS---- 274 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + + +T + + GSTA + AY E Sbjct: 275 --IVTYAGNAGTVLEPT---AVKDKTKILSAIDTLQPGGSTAGAAGIDAAYQL----AEK 325 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 R N I+L TDG+ +EE + + GI + + F Sbjct: 326 AFVRDGVNR-----ILLATDGDFNVGPSSDEELKRMVETKRRSGIFLSVLGFGRGNYNDA 380 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + N ++ E K + G+ + Sbjct: 381 LMQTIAQN--GNGVAAYIDTLAEAQKTLVEEAGSSL 414 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 64/208 (30%), Gaps = 38/208 (18%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S+ + + + + +IRS+ + D G F S S +++ Sbjct: 98 SMSASDYSPTRVEAAKGSSEILIRSLSESDT------AGVVIFESGASSAAYLSSDKNRV 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + G TA+ D + A D + + +VLL+DG + Sbjct: 152 VSRLEQVSVKT----GKTALGDGLALAVDMVTAIPAGTY-----------IVVLLSDGVS 196 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----------QQEKARYFLSNCA--SPNS 427 AK+ G+ + TI L + A + Sbjct: 197 NSGMITPQEAAEYAKNSGVVVYTIGVGSESPVEVSSDGVQQYASLDEETLRSIAEITGGE 256 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIR 455 +F + L + I N I +IR Sbjct: 257 YFRSVDEKTLVQ-----IQNTIQTSIIR 279 >gi|149175890|ref|ZP_01854508.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] gi|148845337|gb|EDL59682.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] Length = 368 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 60/221 (27%), Gaps = 7/221 (3%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + TA+++ +LG+ VD L++AA A++ A LI Sbjct: 17 HRRGAIAVFTAIMLVPLLGMVAFAVDYGYLLKKRADLQRAADAAVLAAVRDLIPDANGTQ 76 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 +K T + L + + + V ++ R Sbjct: 77 DLSKVRATLRQYADLNIGEIEGFQVLDSDIQIGRYNPESVYDNFTILDWGTFDTVRVTLR 136 Query: 137 YDLLLNPL--SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 +D N F R +GI + + A S + + I Sbjct: 137 FDTQANSPVSLFFARLLGINESDLNATSTAILQKGSLLAPGVGVMPFTIPK-----AQWD 191 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 +E + + G G D S S Sbjct: 192 STEPGEVWTIYGDGRLEDNIGAEIPGNWGTCDIGKSYNSTS 232 >gi|126336625|ref|XP_001380258.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 819 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 63/188 (33%), Gaps = 13/188 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K + AL ++ +K D+ F+ V KTF Sbjct: 276 RKIKKTKAALIKILDDLKPEDH------FNMITFSGHVTRWKPELVLALDEHLKEAKTFL 329 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +G T +N A+ A + SN+ + + I+LLTDG++T+ + Sbjct: 330 SNTPALGVTNVNGAVLAAVSMLDESNKKKELPEGSVS----MIILLTDGDSTEGETKLQK 385 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I K+ + + F + R L N FE + F + Sbjct: 386 IHENVKAAIRGQYHLFCLGFGFDINYVFLERLALDNGGMARHIFEGLDAELQLQDFYQEV 445 Query: 446 GNEIFERV 453 N + +V Sbjct: 446 ANPLLTQV 453 >gi|163816348|ref|ZP_02207714.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] gi|158448345|gb|EDP25340.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] Length = 612 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 54/437 (12%), Positives = 118/437 (27%), Gaps = 49/437 (11%) Query: 23 FIITALLMPVMLGVGGM----LVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 ++ ++M G+ G VD + E++ Sbjct: 15 AVMAGIMMVG--GLTGCGNDDAVDGRGMRLEKQ-----------------NGDKTEITGI 55 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ ++ ++ L N + + ++ + + + +S Y Sbjct: 56 AEKYREKESKQKKKSLQDNGLRGIFNSTSNDVMYAEEACDVETVAGDTAMVTDTSNSMYS 115 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + + + T + + S S S+ Sbjct: 116 EVAYDTREYDSMTENGF--VSTVDRPLSTFAADRDTASYSNVRSYIESGSLPPDGAVRIE 173 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + LN F + + + D + + + Sbjct: 174 EMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKLMMVGINTDEIDFGDKKPSNLVFL 233 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S + K LV+ + A + ++ + D V+ D G Sbjct: 234 IDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIV---------TYAGEDTVVLSGTPG 284 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + G T DA+ TAY+ E NN ++L TDG+ Sbjct: 285 SEQYTISEALSNMTAEGCTNGGDAIITAYEL----AEKNFINGGNNR-----VILATDGD 335 Query: 379 NTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + + K I + + F + + K N S+ +S Sbjct: 336 LNVGLTSESDLVDLITEEKKENNIFLSVLGFGTDNLKDNKLEALADN--GDGSYAFIDSA 393 Query: 435 HELNKIFRDRIGNEIFE 451 +E K+ D +G + Sbjct: 394 YEAKKVLVDEMGGTLNT 410 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 64/206 (31%), Gaps = 29/206 (14%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S + K ++ I + ++G ++ Sbjct: 74 IDRSGSMNDENKMQSAINSAKGFIDLMDLSK-----HKVGIVDYSSANNISSFPL----S 124 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + VK + G TA DA++ A + +++ D IVLLTDG+ Sbjct: 125 TDKEAVKNYVNGLRANGGTATGDAIKKARELLVNHRPDAQP----------VIVLLTDGD 174 Query: 379 NTQDNEEGIAIC----NKAKSQGIRIMTIA-FSVNKTQQEKARY-FLSNCASPNSFFEAN 432 T+ N N+AK +GI TIA + N L A+ + Sbjct: 175 ATEPNGNAYNYALTNSNEAKQEGIVFYTIALLNTNANPDTSGPNLLLKQMATTSHHHHFV 234 Query: 433 STH----ELNKIFRDRIGNEIFERVI 454 ++ IG V+ Sbjct: 235 LGSVGLGDIYAAIVQEIGLASAYDVV 260 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 62/192 (32%), Gaps = 28/192 (14%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + V+ + S R+G F + + ++ V + R + + Sbjct: 111 TVTRLDAVKAVAKEFVTS-------RAGDRVGLILFAEFAYTAAPLTFDVAAVSRIVDEA 163 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 STAI + A + K + + ++LL+DG +T N Sbjct: 164 TI--GISGRSTAIAGGLGLALKRL-----------KRSDAQSRVVILLSDGSDTSGNVLP 210 Query: 387 IAICNKAKSQGIRIMTIAFS------VNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 A+ G+ + TIA +T+ L + A S F +T +L Sbjct: 211 RDSARLAEQLGVTVHTIALGPEDMETAPQTRDAVDTATLRDIAELSGGRTFRVRNTDDLR 270 Query: 439 KIFRDRIGNEIF 450 + + E Sbjct: 271 AVTAEIDRLEAS 282 >gi|149184581|ref|ZP_01862899.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] gi|148831901|gb|EDL50334.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] Length = 528 Score = 66.9 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 35/366 (9%), Positives = 88/366 (24%), Gaps = 32/366 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F K+L +G+ ++ A+ PV+ G G+ VD+ ++ ++ ++ A + + Sbjct: 3 FTKLMKRLQADTSGNALMLVAMGAPVLFGSAGLGVDMAQYYMWKREIQYAVDQGALAGAW 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEE---------YLIRNFENNLKKNFTDREVRDIVRD 117 + + + F E L + V + Sbjct: 63 SRGNGDMGLEYKTRAKQEFYINLSETKDYLLTHSIELQTFDGTPDSAVYMHATVSAQLPF 122 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 T V +N V + ++ + + + + Sbjct: 123 TKVMIN-EGMTIAVQARATWETQKQFTACLYSLDPSSTRTMWFNGGPTVDAACGVGARSN 181 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG-----IRDEKLSPY 232 + ++ S G A + + + V + +P Sbjct: 182 ADNAIVTNGGSGAQNINWVVAGGTINDGAGAFVNAEVVENYDNMVDPWEGLTPPDNATPR 241 Query: 233 MVSCNK---------------SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 V+C + Y D + + Sbjct: 242 TVTCGSADANWQADEAQLDAITFKYYRGKNKNDAKSAGAISYSGPGSESAYDVTYATNVG 301 Query: 278 LASVIRSIKKID-NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 V + + A D++ + + + + + + D Sbjct: 302 AMFTSEPNDYNQAPVAENLSQIAGSGPDKIFIERITTSYFRYYNK-VDLSGSNDLLPGTY 360 Query: 337 TAINDA 342 T + A Sbjct: 361 TDFDIA 366 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 64/195 (32%), Gaps = 34/195 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ A+ +I+ + ID R+ F+ + + + ++ Sbjct: 78 KISKVKTAMLFMIKKLSSID------RLSIVTFSGDARRLCPLRQ-ISETSQRELENLIN 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I ++T + + + I+L++DGE Sbjct: 131 GLKAEGATNITAGLKTGLNVLNDRRLSGGRVVG--------IMLMSDGEQNAGG------ 176 Query: 390 CNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDR 444 + A+ + + T F + L A +F + +T L+K F Sbjct: 177 -DAAQVPVGNVPVHTFGFGI-----NHEPRVLKAIAQNSVGGTFSDVQNTDNLSKAFSQC 230 Query: 445 IGNEIFERV--IRIT 457 + + V +++T Sbjct: 231 LAGLLTVVVQDLKLT 245 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 57/163 (34%), Gaps = 20/163 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + + + STA DA+ A + + + Sbjct: 133 LISFAGSATVLVAPTTDRSAVSQ-----GIDGLKLAQSTATGDAIVAA---LSAIDSFGK 184 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-----VNKTQQE 413 + +VL+TDG+ T + AK GI I TI+F V+ + Sbjct: 185 VVGGADGPPPARVVLMTDGKETVGTRKATDAAGDAKEAGIPISTISFGTERGSVDINGKA 244 Query: 414 K----ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 + + A S FF+A S EL +++ D +G +I Sbjct: 245 QEVPVDDESMKEIAKISGGEFFKAASAEELRRVY-DTLGEQIG 286 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 39/371 (10%), Positives = 86/371 (23%), Gaps = 20/371 (5%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI----- 69 + G + + + + G + VD+ + + A A + + L Sbjct: 9 RRGRRGTILPLLGVCLIGLFGFVALAVDLGMLAVSRTQSQNGADVAALVGTRTLNNRDGV 68 Query: 70 --QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 +L + A+ S T ++ + + Sbjct: 69 AYNNLPAAVTAAQASVTSNPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGG 128 Query: 128 AYQVV-----LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 A + R L ++ + F+R G+ S + A A V R + + Sbjct: 129 AMSAPGGNSWTAMRVTLGVSQPTYFMRVFGVNSMP--SGAVATAVYRPRDIAFVLDMTGS 186 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC--NKSL 240 + FS + + + + + L+ + Sbjct: 187 MAFSSTFNSGNAQGSSSDYQSMNPDPLVPKAGHYTTVQSRIVAADNLANSSNEALPRNNF 246 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR---SIKKIDNVNDTVRM 297 GP + + V D A+++ ++ D+ Sbjct: 247 TITTPGGPPIVRSYYYDPSNWGTPSTVASPVTTKGDGSANLLPAFHRWSPPESGADSDNY 306 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNED 356 A +N + T G T D + I + D Sbjct: 307 IAPTYNFAGYNAFHKGNETTPQGPTPAPDTYGTMTDASGLTYTGDRWRRRDGRIDKTTTD 366 Query: 357 EVHRMKNNLEA 367 Sbjct: 367 WSSTNNRAAYH 377 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 30/296 (10%), Positives = 60/296 (20%), Gaps = 25/296 (8%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L D ++ K VL + Sbjct: 570 LRAGRVVYYTSIPDDVNTATGTTAQKLDKGFWKLYIDYVL-GTGSYTSADNLYGVADSWS 628 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 S T ++ T ++ S + F A + Sbjct: 629 ASAASITTSDMTTYKFTWETGAATRPYMRYTDSPLRPRLHLWFGPLSMMDFLASATGS-N 687 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + + K V + + L + +H L K+ Sbjct: 688 WTAGTVYEAQCWQLKAGVNAVLDDVKNNHPNDYSGLVYFATSQHNGVRIPLSQNYKRIQN 747 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ +I D ++ T + N Sbjct: 748 ALFYPRDLVDTITNGDVTSEYRPYTNTSLGGYNQAIIP--------------------NS 787 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNN---LEAKKYIVLLTDGENTQDNEEG 386 G T + A++ + S + A+K ++ TDG Sbjct: 788 SGGTDPATGLGYAFNLLSPSTITAAKSVGTGNGRRGAQKIVIFETDGVPNSYRVPT 843 >gi|322419943|ref|YP_004199166.1| hypothetical protein GM18_2437 [Geobacter sp. M18] gi|320126330|gb|ADW13890.1| Protein of unknown function DUF2134, membrane [Geobacter sp. M18] Length = 351 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 35/308 (11%), Positives = 73/308 (23%), Gaps = 12/308 (3%) Query: 9 FYSKKLIK-SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +++ K + G + AL++ V++ + VD+ + L+ AA + Sbjct: 2 DAFRRIRKLNEKGFAIVYIALMLVVLVAFVSLAVDMGYMFVAKGQLQNAADAGALAGVAK 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L ++ S + + N N+ + + R + + K Sbjct: 62 LSDTVTARQSAKLFAERNKAAGESVKVALNETNSADGDIVVGYWDKVTRTMSATVPTGKV 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 A V + +R + + I Sbjct: 122 ANAVKVVARRTTETGTGISAENKQVDLFFGQVLNWGQMSAKAEAIACRPAKPSAPI--VL 179 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + + A + + S S L Sbjct: 180 CQDLCSSTTFPFKVYFNQTIATDPSGALNPLYTVGWTEFSPTSKATDLGRNSTVAELIRD 239 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P + + ++RDA I + D + + F Sbjct: 240 PEHEIPLGLCGQTLWTNNGLGVAVDVLRDAY------IDQKDPGTNIWTVLVPIF---EF 290 Query: 308 SDPSFSWG 315 S SWG Sbjct: 291 CPSSVSWG 298 >gi|313151177|ref|NP_001100761.2| inter-alpha-trypsin inhibitor heavy chain H1 [Rattus norvegicus] gi|149034204|gb|EDL88974.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform CRA_a [Rattus norvegicus] gi|187469475|gb|AAI66831.1| Itih1 protein [Rattus norvegicus] Length = 904 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 72/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + Y + + + + Sbjct: 239 PTCSTSLLNGDFKVTYDVNRDKLCDLLVANNYFTHFFAPQNLTNMSKNLVFVIDISGSME 298 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + V++ F +V S + F Sbjct: 299 GQKVKQTKEALLKILGDMKPVDNFD---LVLFGSQVQSWKGSLVPASHANLQAAQDFVRR 355 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 356 FSLAGATNLNGGLLRGIEILNRAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 411 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + + N +E + + + F +++ + Sbjct: 412 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSIENNGWAQRIYEDHDATQQLQGFYNQVAH 471 Query: 448 EIFERV 453 + V Sbjct: 472 PLLTDV 477 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 52/154 (33%), Gaps = 17/154 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + F+ + ++ STA+ DA+ A +S Sbjct: 139 DRLGLIVFGNAAYLQAPFTDDRETWQTLLEESEVA--MAGQSTALGDAIGLAISIFQAS- 195 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQ 412 + +++LTDG +T A + + I T+A ++ Sbjct: 196 ----------DTTNRVLIVLTDGNDTGSRVPPRDAATIAAANDVTIYTVAVGDPATIGEE 245 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L+ A + + F+A T L K + + Sbjct: 246 ALDLETLNAVAETTGGASFQALDTQALEKAYDEI 279 >gi|224065911|ref|XP_002191398.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Taeniopygia guttata] Length = 809 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 71/220 (32%), Gaps = 17/220 (7%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 ++Y + +D S +K RDAL +++ +++ D+ + Sbjct: 229 GYFVHYFAPQEMPVFPKNVIFVIDRSGSMTG-RKIEQTRDALLKILQDLRQEDHFS---- 283 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FN++V+ S + G T I+ A+ A + + Sbjct: 284 --FITFNNKVVEWKSSLLPATEENVASAAALVQTLTARGGTDISGALLAAVGVLEKAEGL 341 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 + I+LLTDG+ T + I K + + + F + + + Sbjct: 342 PERSIS-------MIILLTDGQPTSGEKNVEVIQEKVQEAINGKYALFCLGFGFDVSYKF 394 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + LSN +E + F + I ++ Sbjct: 395 LEKMALSNGGIARRIYENADAALQLQGFYQEVATPILMQI 434 >gi|149034205|gb|EDL88975.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 899 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 72/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + Y + + + + Sbjct: 239 PTCSTSLLNGDFKVTYDVNRDKLCDLLVANNYFTHFFAPQNLTNMSKNLVFVIDISGSME 298 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + V++ F +V S + F Sbjct: 299 GQKVKQTKEALLKILGDMKPVDNFD---LVLFGSQVQSWKGSLVPASHANLQAAQDFVRR 355 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 356 FSLAGATNLNGGLLRGIEILNRAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 411 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + + N +E + + + F +++ + Sbjct: 412 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSIENNGWAQRIYEDHDATQQLQGFYNQVAH 471 Query: 448 EIFERV 453 + V Sbjct: 472 PLLTDV 477 >gi|218778177|ref|YP_002429495.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759561|gb|ACL02027.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 558 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 38/282 (13%), Positives = 87/282 (30%), Gaps = 25/282 (8%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T S +++ + + +M + +N F + K + + + Sbjct: 109 STFSIDVDTASYSNVRRFLSY-GNMPPVDAVRIEEMINYFHYDYPQP-KGQDPFSITMEM 166 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + L + +D S + K LV+ ++ +++ Sbjct: 167 SQCPWNRDNMLVHVGLQGRCLDYKDVKPSNLVFLLDVSGSMNSENKLPLVKRSMEMLVKE 226 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D V+ + + + T GSTA + ++ Sbjct: 227 LGAGDRVS------IVTYAGSAGLVLP---STSARNKRKIITALDRLEAGGSTAGGEGIE 277 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIM 401 AY + E+ + N ++L TDG+ + + GI + Sbjct: 278 LAYRV---AWENLIPEGNNR------VILCTDGDFNVGVSSTPELVRMIEEKRRAGIYLT 328 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 F + + EK + A +F+ +S E +K+F Sbjct: 329 ICGFGMGNYKDEKMEAI--SNAGNGNFYYIDSRREAHKVFVQ 368 >gi|167041757|gb|ABZ06500.1| putative NHL repeat protein [uncultured marine microorganism HF4000_010L19] Length = 1148 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 46/368 (12%), Positives = 107/368 (29%), Gaps = 21/368 (5%) Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 Y + I +N + YQ LS + + +N S L Sbjct: 146 YYPHAITVGPSYVYVGYNNSYISLYERSSLNHVSTPYQSSLSYSWGMSVNSASNKLVIAD 205 Query: 153 I-KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA--D 209 K+ +++ ++S S S + Y L+ + Sbjct: 206 YVKNRVVELNVSGSSLSYSQATSASYSSSNGYFRRPTDAAYDSSGNIYALDLYNNRIQKF 265 Query: 210 RTVKSYSSQNGKVGIRDEKLSPY--MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + +Y ++ G + PY + + ++Y + ++ Sbjct: 266 NSSLTYQAKTGSWSVSAGFRYPYGMHIDSSDNIYVTDFYNYAVRKYDTSLNETATYGGGA 325 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + + ++ + N + T N RV + + T Sbjct: 326 GTRLDAAKKVIKKIVSNTDLTSGANFGLMEWGTRHNIRVK--------ISDTGAKTIYTN 377 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T + AM A + S + + Y+++++DG + + Sbjct: 378 VDGVYASGGTDLAKAMSIARNYFTSGQVANWNLSCSVN----YLIVISDG-YWSGHTTVL 432 Query: 388 AICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 +I + K+ I+ + F++ + + + A++ EL D I Sbjct: 433 SIAEQIKNVYNIKTFAVGFALGGANSNYSTLATKGGTTSPLY--ASNQSELLAKLTDAIK 490 Query: 447 NEIFERVI 454 I ++ Sbjct: 491 QAISGKLT 498 >gi|73542340|ref|YP_296860.1| hypothetical protein Reut_A2655 [Ralstonia eutropha JMP134] gi|72119753|gb|AAZ62016.1| putative membrane protein [Ralstonia eutropha JMP134] Length = 412 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/326 (10%), Positives = 82/326 (25%), Gaps = 13/326 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G I L + V+LG+ G+++D+ + L+ A + + A+ L + + + Sbjct: 7 KKERGVILPIVGLTLAVLLGMAGLVIDLGAMFVAKTELQSAVDSCALAAAQELDGAADAL 66 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + T +Y + + T + + A L++ Sbjct: 67 TRATSAGLTAGNANKVQYQKASASLIDTDVTFSDSLTGAFSSTFTPVANARYAKCGHLTT 126 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 L + G S + A V + + + Sbjct: 127 GILAYLIQM-----VGGPTSNAVAAIGVATRTHAQSTCPIPVGLLPRTGGTAPDYGFAVG 181 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 L + A + + +G + K + CN + L ++ + Sbjct: 182 DWVTVLYDGTKTAGPGEMGWYNLDGSTNANETKNEMSVGYCNSKVNDTLRTPGAKVAVDD 241 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + ++ G + +N + Sbjct: 242 QWNSRFGIYKN--------NGDMQTMRPDFTGYSYTTQNWPNGKSAYNGTSGGSDPTATN 293 Query: 316 VHKLIRTIVKTFAIDENEMGSTAIND 341 T + I Sbjct: 294 FKTKRLTFASYGDTSTDVQAGDNITG 319 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 58/188 (30%), Gaps = 22/188 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ V++ + ID R+ F D ++K Sbjct: 78 KIEKMKTAMKFVVKKLSSID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKL-ID 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I+D +QT + + ++L++DG+ + Sbjct: 131 ALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRGEP----- 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + + T F + ++ +F N + L+ F + + Sbjct: 178 AANVKIGNVPVYTFGFGADYDPTVLNAVARNSM--GGTFSVVNDVNLLSMAFSQCLAGLL 235 Query: 450 FERVIRIT 457 V +T Sbjct: 236 TVVVQDLT 243 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 59/197 (29%), Gaps = 40/197 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ I+ R+G + + + ++ + Sbjct: 112 NRLDATKNVAEEFIQD-------RPGDRIGLVVYAGESFTKTPITSDKAIVLDALEDIEY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + E G TAI + TA + I S+ + K I+LLTDG N + Sbjct: 165 NNVLENG-TAIGSGLATAVNRIKDSDAES-----------KVIILLTDGVNNAGFIDPST 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK-------------------ARYFLSNC--ASPNS 427 A GI++ TI N L A+ Sbjct: 213 ASELAVEFGIKVYTIGVGSNGMALSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGGK 272 Query: 428 FFEANSTHELNKIFRDR 444 +F A + +L +I+ + Sbjct: 273 YFRATNNEKLEEIYAEI 289 >gi|130491439|ref|NP_001076115.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus cuniculus] gi|6579183|dbj|BAA88322.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus cuniculus] Length = 906 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 63/185 (34%), Gaps = 13/185 (7%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++AL ++ I+ D F RV S + + F + Sbjct: 307 KEALLKILGDIRPEDY------FDLVLFGSRVQSWRGSLVPASEANLQAARDFVQRFSLA 360 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T +N + + + ++ + + +++LTDGE T+ + I + Sbjct: 361 GATNLNGGLLRGIEILNNAQ----GNLPAVSKHAAILIMLTDGEPTEGVTDRPQILKNIR 416 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 S + ++ F + + N +E + + + F +++ N + Sbjct: 417 SAIGGRFPLYSLGFGHDLDFNFLKSLSMENNGWAQRIYEDHDAAQQLQGFYNQVANPLLV 476 Query: 452 RVIRI 456 V + Sbjct: 477 DVELL 481 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 57/178 (32%), Gaps = 24/178 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ V+ L + + R+G F D F+ L +++ Sbjct: 118 TRRIDGVKSLLLDFLAQ-------RASDRVGLIAFGDAAYLQAPFTEDKGAL--SLLLKE 168 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA+ DA+ A + S+ D K ++LLTDG +T + Sbjct: 169 MDVRMAGAGTALGDAIGVAVNHFSHSDTDN-----------KVLLLLTDGNDTSSEFPPL 217 Query: 388 AICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 A QGI I IA + L A + FEA +++ Sbjct: 218 EAARYAAQQGIVIYPIAIGDPANVGEDSLDIEMLQQIADITYGQVFEAQDGEAFTQVY 275 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 64/216 (29%), Gaps = 23/216 (10%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK---HLVRDALASVIRSIKK 287 Y C + + + + LV+ + V+ + Sbjct: 2 SYHCICPEGHVLQDDGKSCGTCRTSNIDLVLLIDGSKSVRPQNFELVKQFVNQVVDQLDV 61 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 R+G ++ V ++ S ++ I K E T A++ Sbjct: 62 SAKGT---RVGLVQYSSCVRTEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMV 116 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + S E KN + ++ TDG +QD+ KAK GI + + Sbjct: 117 ENSFSEAEGARPAEKNIP---RVGLVFTDGR-SQDDI--QEWAKKAKEAGITMYAVGVG- 169 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 + L AS FF + +++I Sbjct: 170 -----KAVEDELREIASDPVEKHFFYSADFTAISQI 200 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 58/188 (30%), Gaps = 22/188 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ V++ + ID R+ F D ++K Sbjct: 78 KIEKMKTAMKFVVKKLSSID------RLSIVTFLDTANRICPLQQVTEDSQPQLLKL-ID 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I+D +QT + + ++L++DG+ + Sbjct: 131 ALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRGEP----- 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + + T F + ++ +F N + L+ F + + Sbjct: 178 AANVKIGNVPVYTFGFGADYDPTVLNAVARNSM--GGTFSVVNDVNLLSMAFSQCLAGLL 235 Query: 450 FERVIRIT 457 V +T Sbjct: 236 TVVVQDLT 243 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 49/355 (13%), Positives = 103/355 (29%), Gaps = 34/355 (9%) Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 T E I T + + + ++ L++ P ++ L + Sbjct: 238 EFVATHTATEPGVIRLRTELVPDRNDTFTANNVAGATTLVMPPPNILLIEDSGDEGAVLA 297 Query: 161 KAEAETV-----SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 A S + + + D + + + + K Sbjct: 298 DALRRADMVIERSSASELPANLDLLTRFDGFVLVDVPATQLSLEQMVALREVVRSEGKGL 357 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI---KKKH 272 + G + ++ L P P +S +D S+ K Sbjct: 358 TVIGGNQSFTLGGYAETPLADALPLLMTPPPRPQRAPVSILFIIDRSASMSATFGISKFD 417 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + ++A + +++ D R+G F+ I F + ++ + Sbjct: 418 MAKEAAILSLTTLQPGD------RVGVLAFDTETIWTVPFRTVGEGVSLVELQDQIATMS 471 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-EEGIAICN 391 G T I A+ + + H VLLTDG + +N + Sbjct: 472 LGGGTNIERALSVGLPALANEPYSTRHA-----------VLLTDGRSYSNNYPRYQQLVE 520 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKI-FRD 443 A++ I + TIA + L+ AS ++ +L +I F++ Sbjct: 521 TARAAQITLSTIAIGSDSD-----TELLNQLASWGNGRYYFVADATDLPRITFQE 570 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 43/148 (29%), Gaps = 18/148 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F D ++ + + + +TAI + + + S Sbjct: 138 VVLFGDFAFIATPVTYEKEIVSE--MIGYLSHGMAGQNTAIGEGIAMGVRALRDSKAKS- 194 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K I+LLTDGE+ + + IR+ TI Sbjct: 195 ----------KVIILLTDGEHNSGSISPKEAVAMVGKEHIRLYTIGIGQKGEFDN---AL 241 Query: 419 LSNCA--SPNSFFEANSTHELNKIFRDR 444 L A FF A + EL ++ + Sbjct: 242 LKQLAHDGHGKFFAAANEKELQSVYDEI 269 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 41/377 (10%), Positives = 94/377 (24%), Gaps = 27/377 (7%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L G + A + ++ G +D R + L+ A + + + Sbjct: 26 LAGDRRGSTLALMAAGLIPVIAALGAGIDAGRLYLVKSQLQAGVDAAALAGARAFAVTDG 85 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP-------RK 126 ++R K + + + L +F ++ TA + P Sbjct: 86 SPAARDKQASAYFYGNFASDYMGVSNLQLTPDFKTVGGINVTTITARAIVPMTFMRIFGF 145 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ +L PL + + S + KE ++ Sbjct: 146 QPRTMQAVAKAELQPRPLEVMVVLDDTGSMKANLSGGRTRMVA--LKEAANDFVDILHQG 203 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S + L D TV G + E + + + L + Sbjct: 204 AS--------SRRDLAMGFIGYDVTVNVGHLLPGNAVKKVEGFNSFAGASAYGLSVPSHA 255 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + D++ + A V + + + Sbjct: 256 SGNYLAWKGCVMADTTVKDVDKNRLTSETGAWDLVRSLPGEGSHPPVEPFFVPPMYVPNA 315 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 S+ T+ N + ++A+ + + Sbjct: 316 KSNSGADNPSS--------TYYGIYNTEPGNNLYRIDKSAFGEM--GADYLARTELYRQY 365 Query: 367 AKKYIVLLTDGENTQDN 383 Y V L +G+ T+ + Sbjct: 366 FYDYYVGLNNGDATKAD 382 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/351 (11%), Positives = 81/351 (23%), Gaps = 58/351 (16%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S + + + S Y + N + + Sbjct: 292 SLPGEGSHPPVEPFFVPPMYVPNAKSNSGADNPSSTYYGIYNTEPGNNLYRIDKSAFGEM 351 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYM-----------LYPGPLDPSLSEEHFVDSSS 263 + + + + Y V N Y + ++D Sbjct: 352 GADYLARTELYRQYFYDYYVGLNNGDATKADDVVRRLDGSYYTPSAANRAGNDWYIDWKR 411 Query: 264 LRHVIK---KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + K + + S+ + +G K Sbjct: 412 IPKYYDGAVWKDPASATVNPLGGSVDSGSKNTTPMPSPNWQ---CPEEAMPPRYGRMKSE 468 Query: 321 R-TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + T + + Y ++ + ++ +V +TDG N Sbjct: 469 YLSHIANQHGAIYPANGTIHHAGLLWGYRLLVRDDVFTRANPTTER-PRRALVFMTDGVN 527 Query: 380 ---------------------------TQDNEEGI------AICNKAK--SQGIRIMTIA 404 Q N E C + + I IA Sbjct: 528 EIGESQNGYLDRTFTWYGRWSDSRIAANQSNSETQMLRRFEKTCANIQREANPPEIYIIA 587 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 N + + +NCA P + +ST EL + F+D + +I+ Sbjct: 588 LVANSSAID---TAFNNCA-PGRVYRTSSTDELRRAFQDVAAELVDLHLIQ 634 >gi|224072363|ref|XP_002303700.1| predicted protein [Populus trichocarpa] gi|222841132|gb|EEE78679.1| predicted protein [Populus trichocarpa] Length = 587 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 56/182 (30%), Gaps = 26/182 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A+ +I+++ D R+ F+ + R + + Sbjct: 176 LKRAVNFIIQNLGPSD------RLSIVTFSSSARRMLPLR-RMSGSGREDATSVVDSISA 228 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAIC 390 +G T I ++ + + I+LL+DG +TQ + Sbjct: 229 IGGTNIVAGLKKGVQVLEERRQHNSVAT---------IILLSDGCDTQSHNAQNRLDYLK 279 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNE 448 ++K + T F + + AS +F S + L F IG Sbjct: 280 EESKQPTFPVYTFGFGSDHDSAA-----MHAISDASRGTFSFIESINILQDAFARCIGGL 334 Query: 449 IF 450 I Sbjct: 335 IS 336 >gi|220925466|ref|YP_002500768.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] gi|219950073|gb|ACL60465.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] Length = 359 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 34/364 (9%), Positives = 84/364 (23%), Gaps = 26/364 (7%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + G +I A M + G+ G +D L+ + + + Sbjct: 6 RHFHSCDRGAVALIFAAAMIPLFGLAGAALDYANARRVRDVLQSISDATALLVADAD--- 62 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ Q I R+ + + ++N Sbjct: 63 --TPTVAERSFKLAENQLISRLGDRSGSGGYTIKGEWLDGSSYKLTISTKINTILIHLLS 120 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S ++++ +++ R T + + Y++ I S Sbjct: 121 GKSKQFEISA--VTVANRIPPRYETKPPTLSLLSPEAADYNR-----IYMYCFSSDPKRQ 173 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + D + + A +P N++ YML Sbjct: 174 AETDGGRRGITPIADNATPPSNYGD------------YAPPTCGDNEAPSYMLRNVRDAR 221 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + S+ + + ++ N + + Sbjct: 222 TNPTKWDAKYQSVYEYYTDVVIDTGTRRQTMNMKGYKVYSSNYKEPINMDKNPILETIIC 281 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 S K TA + ED ++ + Sbjct: 282 GSTTSCKPQSEGGILPNNHVTHDPQTATTS--CEEGKFMYYGWEDRPPNAGSDRDYDDIR 339 Query: 372 VLLT 375 +L++ Sbjct: 340 LLVS 343 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 46/337 (13%), Positives = 108/337 (32%), Gaps = 33/337 (9%) Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 D + ++ + + + L + + E +T + Y Sbjct: 503 SDSKVSVELTIDIPGKSVDSLSIDKVPEDDVFLPSEMPGFIGNGFDFYVDCEFDTATMKY 562 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + + D + P TV + ++ + + ++ Sbjct: 563 TFDESLLDTPDFEPAVYYCDIVNQKMVLLDDQVYDPVTCTVTAKTTHFSRYILLNKTAFE 622 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + + + + G ++ +DSS K+L +DA + + ID Sbjct: 623 KIWDKDFAGTSVDNSG-KTVAMDIALVIDSSGSMTWNDPKNLRKDAAKEFVDKLSSID-- 679 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 F+ + + + RT++ + D + G T++ + + + Sbjct: 680 ----EAAIIDFDSSSKINRNLTSN-----RTLLYSAIDDIDSSGGTSLTAGVSKGLEALS 730 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 SN+ KK ++LLTDG+ D A + G+ I TI N Sbjct: 731 KSND------------KKIMILLTDGKGPYDKSLTTQ----AINAGVTIYTIGLGTN--- 771 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 + + L++ A + ++ A ++ F + G Sbjct: 772 NDIDQPLLNSIATETGGKYYHAKKDIDIQGSFDNVSG 808 >gi|260778728|ref|ZP_05887620.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] gi|260604892|gb|EEX31187.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] Length = 463 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 49/451 (10%), Positives = 116/451 (25%), Gaps = 30/451 (6%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I+ G +I+ + P ++ G +D R + L A A I A+ + + + Sbjct: 10 IRKSKGLVALISVIAAPFLILATGTAIDSGRAFLVKAKLFAAVDAAGIAAARAVAEGEDA 69 Query: 75 V--SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD---TAVEMNPRKSAY 129 ++ + P + +F + + T S+ Sbjct: 70 ARDAAIKFYNANLPTDYHDSTTASPTVTFGYDSFGNISIDLSASAEVSTTFLGVFGHSSL 129 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ +++ P+ L L S + + + + K+ + D + Sbjct: 130 EISATAQTVR--RPVDLVLVVDNTTSLRLGSIGDVTDDVVARSKDFITNFNESFDRIALV 187 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + + + + + +L + Sbjct: 188 KYAYGAEVPVAFQSSRGHSRSDITTEIDAFDFGSLSSLQYTNSSEGIYLALDALRNVTDP 247 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 F D + + V + + SI+ D + T R I+ Sbjct: 248 ANLKVIVFFTDGAPN-TFASEFDFVGTST-NYTGSIRSSDESSGTPRGLWYH---DAIAT 302 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 G D + A T ++ + S+ D N Sbjct: 303 QLPGSGYEGSDIDDYIAELPDYYTA-----HSASATEFNVLNPSHPDRPVTQYNPSTHSA 357 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-----KTQQEKARYFLSNCAS 424 + + N I A+ + I + T+ + E+ L A+ Sbjct: 358 NDLYIR--VNRVARNLLEDIATAARGEDIYVFTLGLGSSLTSSTGPDNEQGEDLLMRMAN 415 Query: 425 PNSFFEANS------THELNKIFRDRIGNEI 449 + +L ++ + E Sbjct: 416 DPRMLDDADLAGDFRADQLQGVYCHAVDEEA 446 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 55/184 (29%), Gaps = 32/184 (17%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G F+ ++ + ++ I + I Sbjct: 122 GRKSDRVGLVAFSGEALTQVPLTTDYPVVLAAID-------------NLQVGQLEDGTAI 168 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ +R++N+ + +VLLTDGEN + + A + GIRI TI + Sbjct: 169 GTAIATAANRLRNSPGRSRVMVLLTDGENNRGAIDPRTAAQAAGTFGIRIYTIGVGTDGM 228 Query: 411 QQEK-----------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 L+ A + +F A L I+ E Sbjct: 229 AAVPVGRGLFGLRYENRPVKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQIDRLERSA 288 Query: 452 RVIR 455 R Sbjct: 289 VEAR 292 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 59/196 (30%), Gaps = 39/196 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ ++ + + R+G + + + + ++ + Sbjct: 112 DRLSALKRVASAFVDD-------RLSDRIGLVVYAGESYTLTPITSDKGIVKGSLREISY 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E G TAI + T+ + + S K I+LLTDG N + Sbjct: 165 QGLIEDG-TAIGMGLATSVNRLKDS-----------RAKSKVIILLTDGVNNSGFIDPKI 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNC--ASPNSF 428 A GI+ TI N T + L A+ + Sbjct: 213 ATELAVEFGIKTYTIGLGSNGTARAPVGILPNGSFQYAMTKVEIDEALLQEIATATGGIY 272 Query: 429 FEANSTHELNKIFRDR 444 F A +L +I+ + Sbjct: 273 FRATDNKKLEEIYEEI 288 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 66.5 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 50/178 (28%), Gaps = 22/178 (12%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + I + T R+ F+ + + N G+T I Sbjct: 122 TAMLFIIKKLGPTDRLSIVTFSGGANRLCPLRQ-TTGKSQEEFENLINGLNADGATNITA 180 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 +QTA + + + + I+L++DGE ++ G+ + Sbjct: 181 GLQTALKVLKGRSFNGERVVG--------IMLMSDGEQNAGSDAT----------GVSVG 222 Query: 402 TIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + L A +F + + L K F + + V + Sbjct: 223 NVPIHTFGFGINHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQCLAGLLTVVVQDL 280 >gi|227819319|ref|YP_002823290.1| hypothetical protein NGR_b10840 [Sinorhizobium fredii NGR234] gi|227338318|gb|ACP22537.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 533 Score = 66.2 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 27/251 (10%), Positives = 62/251 (24%), Gaps = 16/251 (6%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + G+ +T + MP++LG +++DV R L+ A + + Sbjct: 3 LRKAINRFWDDNRGYVIALTLISMPLLLGFSLLVIDVGRTGNLHTDLQNAVDAMALAGAR 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKI---------------EEYLIRNFENNLKKNFTDREV 111 L + ++ Y N + +E+ Sbjct: 63 ELDGRDDAITRADAAIEALANSAAFGGGGNGMSLGSHITVAYDAGNAAGSTVAVTYLKEI 122 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 D + + ++F +G I AEA V RS Sbjct: 123 PADDDDPITASMETTDPNEASYAWVIANDQAMTTIFPVPVGFNRDTINVAAEAVAVYRSS 182 Query: 172 H-KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 + I + + + + L+ + + + Sbjct: 183 ACDVTPIYICNPFEAPGNSTEAGNQTAADALHENFAAGNLYGRQIELHSTSSSAPGPGNF 242 Query: 231 PYMVSCNKSLY 241 ++ + Sbjct: 243 GFLATYGNGAN 253 >gi|15488640|gb|AAH13465.1| inter-alpha trypsin inhibitor, heavy chain 1 [Mus musculus] Length = 909 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 74/246 (30%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG+ + + + + Y + + + + Sbjct: 244 PTCSTSLLNGEFKVTYDVNRDKLCDLLVANNYFTHFFAPKNLTNMSKNLVFVIDISGSME 303 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR ++++ ++ + V++ F +V S V + F Sbjct: 304 GQKVRQTKEALLKILEDMRPVDNFD---LVLFGSKVQSWKGSLVPVSNANLQAAQDFVRR 360 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 361 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 416 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + N +E + + + F +++ N Sbjct: 417 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWAQRIYEDHDATQQLQGFYNQVAN 476 Query: 448 EIFERV 453 + V Sbjct: 477 PLLTDV 482 >gi|20381126|gb|AAH28814.1| Itih1 protein [Mus musculus] Length = 911 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 74/246 (30%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG+ + + + + Y + + + + Sbjct: 246 PTCSTSLLNGEFKVTYDVNRDKLCDLLVANNYFTHFFAPKNLTNMSKNLVFVIDISGSME 305 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR ++++ ++ + V++ F +V S V + F Sbjct: 306 GQKVRQTKEALLKILEDMRPVDNFD---LVLFGSKVQSWKGSLVPVSNANLQAAQDFVRR 362 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 363 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 418 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + N +E + + + F +++ N Sbjct: 419 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWAQRIYEDHDATQQLQGFYNQVAN 478 Query: 448 EIFERV 453 + V Sbjct: 479 PLLTDV 484 >gi|53714874|ref|YP_100866.1| putative outer membrane protein [Bacteroides fragilis YCH46] gi|52217739|dbj|BAD50332.1| putative outer membrane protein [Bacteroides fragilis YCH46] Length = 610 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 48/442 (10%), Positives = 127/442 (28%), Gaps = 40/442 (9%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 A+++ +++ V + + + A + I+ + + N Sbjct: 8 AMVLALLMAVISLATVSAQAITVSGTVTDAKDGTPLVGCSVQIKGTTKGTVTNMNGQYTI 67 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM---------------NPRKSAYQV 131 + K E L+ + ++ + V+ A E+ + + Sbjct: 68 QSKKGETLLFQYIGYKQEKRVVKSSTLDVKMKADELVLEECVVVGYGHELRATKSMSTAY 127 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSML 190 + ++ I+ + ++A + S + S + Sbjct: 128 MAVCPASGIMYNAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDAASYSNMRRFINKGKLP 187 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F + + + L P Sbjct: 188 PVDAIRTEELVNYFSYDYPKPT-GSDPVKITMEAGTCPWNADHRLVRIGLKAKEIPTDNL 246 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D S + LV+ +L ++ +++ D V + Sbjct: 247 PASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVA------IVTYAGNAGEKL 300 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + G + ++ + GSTA + + AY + NN Sbjct: 301 ASTPGS---DKQKIREAIDELEASGSTAGGEGIMLAYKI----AQKNFISGGNNR----- 348 Query: 371 IVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++E + + + GI + + + + + K + + Sbjct: 349 IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLSEK--GNGN 406 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 ++ E N++ + G + Sbjct: 407 HAYIDNLQEANRVLVNEFGATL 428 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 77/208 (37%), Gaps = 29/208 (13%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + ++ + + + + ++ D +G F+ S + Sbjct: 95 SLSMAANDIDPNRLEAAKKSAQGFVETL------PDRFNVGLVAFSSTATVVSSPTHDHQ 148 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +I + + TAI + + + ++I S +ED IVLL+DG Sbjct: 149 AVIGS-----IENLQLGPGTAIGEGVFASLESISSFDEDADVDP-----PPSAIVLLSDG 198 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---------QQEKARYFLSNCASP--N 426 ENT + A+ A Q + + TIAF + + L AS Sbjct: 199 ENTSGRDISQAVAMAA-EQEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGLASDTGG 257 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 F+EA S EL++++ D IG+ + ++ Sbjct: 258 HFYEAESETELDEVYED-IGSSLGTELV 284 >gi|3024048|sp|Q61702|ITIH1_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|695632|emb|CAA49841.1| inter-alpha-inhibitor H1 chain [Mus musculus] Length = 907 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 74/246 (30%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG+ + + + + Y + + + + Sbjct: 242 PTCSTSLLNGEFKVTYDVNRDKLCDLLVANNYFTHFFAPKNLTNMSKNLVFVIDISGSME 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR ++++ ++ + V++ F +V S V + F Sbjct: 302 GQKVRQTKEALLKILEDMRPVDNFD---LVLFGSKVQSWKGSLVPVSNANLQAAQDFVRR 358 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 359 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 414 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + N +E + + + F +++ N Sbjct: 415 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWAQRIYEDHDATQQLQGFYNQVAN 474 Query: 448 EIFERV 453 + V Sbjct: 475 PLLTDV 480 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 46/338 (13%), Positives = 92/338 (27%), Gaps = 21/338 (6%) Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + ++ + G + + V+ I+ ++ + Sbjct: 440 AGLKAGQSAWNSTVTIPLAGRVRRQLYLSGSTIRSDIETVADLSVTWDGCIEERQTFFNT 499 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS--PYMVSCNKSLYYMLYPGPLD 250 + LN P+D K + +L + G Sbjct: 500 DGNPSDDWLNYPSSPSDAIDMDIDRVPDSDPATRWKPLLPNAVWGPKGTLVGNTWSGDYT 559 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-----KIDNVNDTVRMGATFFNDR 305 + SS + A + ++++ V N Sbjct: 560 TDPVKAGENTSSGDTGRNLSNNSCVTASRKLTNYNGVGGNPSAQDLSNYVGTLVPHNNT- 618 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 D WG + T + G I + D ++ + Sbjct: 619 -YHDIGLLWGARLMSPTGIFASENA-TTGGGAQIQRHLIFMTDGATATTVNNYASYGLEW 676 Query: 366 EAKKYIV-------LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 ++ I D N +N A+C K++ I + I + + T + Sbjct: 677 WDRRQIAPAGPNDANYDDNLNAVNNARSNALCTAIKNKNITLWVIYYGSSDTAT---KTR 733 Query: 419 LSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L+NCA SP+ F+EA +T L FR+ R+ + Sbjct: 734 LTNCATSPSYFYEARNTTLLIGKFREIADRISNLRLTQ 771 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/340 (9%), Positives = 93/340 (27%), Gaps = 5/340 (1%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +F +L + G+ ITA + +G+ G D+ R + L+ A + Sbjct: 7 RALFILMRLYHNQAGNILAITAAAIIPTIGLVGGAFDMARIYAVKTRLQSACDAGALAGR 66 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP- 124 + ++ N+ + N +++++ + +A Sbjct: 67 RIMGSGRWTDNNGRPNTTALATFDLNFAQNSFGAENRTRSYSESDGTVSGTASADVPMTL 126 Query: 125 ----RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 +V ++ + + + S + + + + + Sbjct: 127 MRVLNVPTKRVEVTCEGQMRIPNTDVMFVLDNSGSMNEVIPGDTTGLKKMAGLQLAIRCF 186 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + +++ D G + G + + + N + KL P + Sbjct: 187 YEALARQNITDVSAAQCGTSADPTGDLSSQVQLRFGFVNYDNMVNVGKLLPNNYLADSWT 246 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y + + S++ +S + + N + A Sbjct: 247 YRSRTATGIQTVYAWTAGTQSATTWGPWSSTPNNFANPSSYSSTYSVVSGGNTSTTTLAD 306 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + P+ + L + +E T+ + Sbjct: 307 GLAYVKKAAPTSAAACSALNSYGSYDQLVGISESNGTSSD 346 >gi|253565978|ref|ZP_04843432.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251945082|gb|EES85520.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 610 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 49/446 (10%), Positives = 125/446 (28%), Gaps = 44/446 (9%) Query: 23 FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNS 82 ++ AL ++ V + + + A + I+ + + N Sbjct: 8 AMVFAL----LMAVISLATVSAQAITVSGTVTDAKDGTPLVGCSVQIKGTTKGTVTNMNG 63 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM---------------NPRKS 127 + K E L+ + ++ + V+ A E+ + Sbjct: 64 QYTIQSKKGETLLFQYIGYKQEKRVVKSSTLDVKMKADELVLEECVVVGYGHELRATKSM 123 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFS 186 + + ++ I+ + ++A + S + S Sbjct: 124 STAYMAVCPASGIMYNAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDAASYSNMRRFINK 183 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + +N F + + + L P Sbjct: 184 GKLPPVDAIRTEELVNYFSYDYPKPT-GSDPVKITMEAGTCPWNADHRLVRIGLKAKEIP 242 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P+ + +D S + LV+ +L ++ +++ D V + Sbjct: 243 TDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVA------IVTYAGNA 296 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + G + ++ + GSTA + + AY + NN Sbjct: 297 GVKLEATPGS---DKQKIREAIDELEASGSTAGGEGIMLAYKI----AQKNFISGGNNR- 348 Query: 367 AKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 349 ----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEK-- 402 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 + ++ E N++ + G + Sbjct: 403 GNGNHAYIDNLQEANRVLVNEFGATL 428 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 56/191 (29%), Gaps = 25/191 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ H + ++ Sbjct: 276 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQA-V 328 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A I + I+LL+DG++T + I Sbjct: 329 NSLGASGGTNIADALKKAAKVIEDRSH---------QNPVCSIILLSDGQDTYNIPSNIR 379 Query: 389 ICN----KAKSQGIRIMT---IAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 I T + + L + A S +F + Sbjct: 380 GARPDYSSLVPSSILNHTFRLVPVHGFGFGVDHDSDALHSIAEASGGTFSFIEDEGVIQD 439 Query: 440 IFRDRIGNEIF 450 F IG + Sbjct: 440 AFAQCIGGLLS 450 >gi|301164324|emb|CBW23882.1| conserved exported hypothetical protein [Bacteroides fragilis 638R] Length = 610 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 49/446 (10%), Positives = 125/446 (28%), Gaps = 44/446 (9%) Query: 23 FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNS 82 ++ AL ++ V + + + A + I+ + + N Sbjct: 8 AMVFAL----LMAVISLATVSAQAITVSGTVTDAKDGTPLVGCSVQIKGTTKGTVTNMNG 63 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM---------------NPRKS 127 + K E L+ + ++ + V+ A E+ + Sbjct: 64 QYTIQSKKGETLLFQYIGYKQEKRVVKSSTLDVKMKADELVLEECVVVGYGHELRATKSM 123 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFS 186 + + ++ I+ + ++A + S + S Sbjct: 124 STAYMAVCPASGIMYNAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDAASYSNMRRFINK 183 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + +N F + + + L P Sbjct: 184 GKLPPVDAIRTEELVNYFSYDYPKPT-GSDPVKITMEAGTCPWNADHRLVRIGLKAKEIP 242 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P+ + +D S + LV+ +L ++ +++ D V + Sbjct: 243 TDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVA------IVTYAGNA 296 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + G + ++ + GSTA + + AY + NN Sbjct: 297 GVKLEATPGS---NKQKIREAIDELEASGSTAGGEGIMLAYKI----AQKNFISGGNNR- 348 Query: 367 AKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+L TDG+ ++E + + + GI + + + + + K + Sbjct: 349 ----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEK-- 402 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 + ++ E N++ + G + Sbjct: 403 GNGNHAYIDNLQEANRVLVNEFGATL 428 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 51/398 (12%), Positives = 116/398 (29%), Gaps = 37/398 (9%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 S+ S + S + + + + + T+ E Sbjct: 582 ATSLKKNDSDGDGLSDGYEVKKGTNPLVPDTDNDGIYDGA-EVVMGTDPLTKNPLTSAEK 640 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 +V + + D + L ++ + + + + V Sbjct: 641 YAVSEDGKVFVRALSDANILIAPLQVKRSDNVFINSLKGIVGKAIEITAGGFDIKKAEIV 700 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +++ + L+ ++ + K + Sbjct: 701 VNYDEAELNGVEENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKTEHFSTFLLGDKNM 760 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + +D ++ SS + A I++I +++N R+G F Sbjct: 761 PVDLSKVDIVFVLDNSGSMSSNDPNYYRIE----ATKKFIQNIDELNN-----RVGLVDF 811 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + V + + KL++ + GST I ++ A Sbjct: 812 DSSVSVRSNLTSDKSKLLQALNAMR----WTGGSTNIGGGLKAALGLFDQEQS------- 860 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 KK IVLL+DG + + K + I + TIA ++ R L + Sbjct: 861 -----KKIIVLLSDGYHNTGIHPNDVLPELIKQE-IVVNTIALG-----KDCDRELLHDI 909 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A + +F ++T L+ ++ + +I ++TK Sbjct: 910 ADKTKGGYFYVDNTGGLS---QEDVDKQIELIYEKLTK 944 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 68/209 (32%), Gaps = 27/209 (12%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + +DSS L + A S + + T + G +++ + Sbjct: 65 WTTSVPIDVVFALDSSGSMGWNDPSGLRKTAAKSFVDKLNST-----TDQAGVVSWDNNI 119 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + + G T +N + A + + + Sbjct: 120 DFTQTLTNNFSLVKS-----KIDAVDSSGGTDLNVGLNAAISLLDTGKQ---------AN 165 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--S 424 + I+ L++G+ T + + A ++G + TI +++ A L + A + Sbjct: 166 SSWVIIFLSNGQGTYSH----STAVVAANKGYTVYTIGLAISPG--STAESNLKDIANTT 219 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERV 453 ++ + + L+ +F D + + Sbjct: 220 GGKYYSSPNATNLDAVFNDIYKEVVTSTI 248 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 58/188 (30%), Gaps = 22/188 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ V++ + ID R+ F D ++K Sbjct: 43 KIEKMKTAMKFVVKKLSSID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKL-ID 95 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+T I+D +QT + + ++L++DG+ + Sbjct: 96 ALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRGEP----- 142 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + + T F + ++ +F N + L+ F + + Sbjct: 143 AANVKIGNVPVYTFGFGADYDPTVLNAVARNSM--GGTFSVVNDVNLLSMAFSQCLAGLL 200 Query: 450 FERVIRIT 457 V +T Sbjct: 201 TVVVQDLT 208 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 56/166 (33%), Gaps = 31/166 (18%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ + + + L + + E G TAI A+ A + + +S Sbjct: 143 DRIGLVVFSGKGYTPCPLTLDHLTLGTVLDNISSEVIQEEG-TAIGTAILIAVNRLRAS- 200 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K I+LLTDG+N + + + A GI+I TIA + + Sbjct: 201 ----------ESRQKAIILLTDGQNNAGDIDPLTAAGFALQDGIKIYTIAATAQDARP-- 248 Query: 415 ARYFLSNCASPNSFFEANSTHELNK--IFRDRIGNEIFERVIRITK 458 S L ++ + R+T+ Sbjct: 249 ---------------FVRSAESLPAGGALSGLPAEDVLVSISRLTQ 279 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 60/185 (32%), Gaps = 29/185 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + I K + + A A ++ ++ D +G F+ F + + Sbjct: 420 DTEDAGIPKLEIAKSASAKMVEHLESSD------GVGVIAFDHNYYWAYKFG---KLVRK 470 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V G TAI + A T+ S K +VLLTDG Q Sbjct: 471 EDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKN-----------KLVVLLTDGMGEQ 519 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 E ++AK I+I TI LS A + F+ ++ EL Sbjct: 520 SGYEIP--ADEAKRNNIKITTIGVGKFVN-----ASVLSWIAAYTSGRFYLVSNPSELVD 572 Query: 440 IFRDR 444 +F Sbjct: 573 VFLKE 577 >gi|161529149|ref|YP_001582975.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] gi|160340450|gb|ABX13537.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] Length = 316 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 63/199 (31%), Gaps = 35/199 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + + ++A+ ++I + +G F + + K Sbjct: 99 SMAATDYEPTRLDAAKNAINNLILKMGP------QHNVGVVLFESGATTVSYLTPDKEKS 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 V + E +G+TAI D + D + + K ++LL+DG + Sbjct: 153 ----VNAISSIEQGLGATAIGDGLALGVDM-----------ASSIPDKKGVVILLSDGVH 197 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA--SP 425 AK ++I TI E L A + Sbjct: 198 NSGLVTPEEATEYAKINNVQIHTIGLGSIEPVFLRDDIYGEPQYAELDEETLVIIAQQTS 257 Query: 426 NSFFEANSTHELNKIFRDR 444 +++++ LN+IF + Sbjct: 258 GNYYKSLDEQTLNEIFVNL 276 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 23/274 (8%), Positives = 74/274 (27%), Gaps = 2/274 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS-VPLIQSLE 73 ++ G ++ ++++ ++LGV +D+ + L+ A TA + + V Sbjct: 19 RRTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDV 78 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + A + +F + +++ V + + V + Sbjct: 79 DQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRV 138 Query: 134 SSR-YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + ++F + + + ++ A + + Sbjct: 139 AVTDMGISQYLSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRP 198 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + ++ G + + +L G + Sbjct: 199 PGYDPNVDMDPSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNSGAALVRDALSGAYNGC 258 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + V + + L IK Sbjct: 259 AAVGNTVTTKPGNSAGPVAQGLNVRLNDFSGPIK 292 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 55/180 (30%), Gaps = 24/180 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L R+G F + F+ + + + ++ Sbjct: 120 NRLSAAKQVLKDFAAQRDG-------DRLGLIVFGNAAYLQAPFTDDIATWLTLLEESEV 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STA+ DA+ A +S + +V+LTDG +T + Sbjct: 173 A--MAGPSTALGDAIGLAISLFQAS-----------ETRNRVLVVLTDGNDTGSRVPPVD 219 Query: 389 ICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDR 444 + A + + I T+A ++ L AS + F A T L + Sbjct: 220 AASIAAANDVTIYTVAVGDPSTIGEEALDLETLDAIASSTRGASFLALDTLALKDAYEQI 279 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 81/247 (32%), Gaps = 25/247 (10%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 +N K + + +V+ + + +D+S+ + +A Sbjct: 3 RNNKWLLCLMAAALLIVTWSGGTLPQAMAASQGSKIDAVLVMDASNSMKNSDPDRIGSEA 62 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + I + ++G + DR+ + + + +T +K F + T Sbjct: 63 MKMFIDMLSTT-----GDKVGVVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYT 117 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----------EGI 387 I+ + A + E IV+L DG N D Sbjct: 118 DISVGLDEAVKVLKQGMEPAHAP---------MIVVLADGNNDLDPNTGKTSQEASDHLN 168 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +AK GI I TI + + ++A L+N + F +S +L +I + + Sbjct: 169 QAVQEAKGSGIPIYTIGLNADGKLNKEALAELAN-QTGGKSFTTSSADDLPQILSEIFAS 227 Query: 448 EIFERVI 454 +++ Sbjct: 228 HQQLKIV 234 >gi|148692830|gb|EDL24777.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_d [Mus musculus] Length = 651 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 73/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG+ + + + + Y + + + + Sbjct: 242 PTCSTSLLNGEFKVTYDVNRDKLCDLLVANNYFTHFFAPKNLTNMSKNLVFVIDISGSME 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR ++++ ++ + V++ F +V S + F Sbjct: 302 GQKVRQTKEALLKILEDMRPVDNFD---LVLFGSKVQSWKGSLVPASNANLQAAQDFVRR 358 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 359 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 414 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + N +E + + + F +++ N Sbjct: 415 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWAQRIYEDHDATQQLQGFYNQVAN 474 Query: 448 EIFERV 453 + V Sbjct: 475 PLLTDV 480 >gi|148692829|gb|EDL24776.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_c [Mus musculus] Length = 658 Score = 66.2 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 73/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG+ + + + + Y + + + + Sbjct: 249 PTCSTSLLNGEFKVTYDVNRDKLCDLLVANNYFTHFFAPKNLTNMSKNLVFVIDISGSME 308 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR ++++ ++ + V++ F +V S + F Sbjct: 309 GQKVRQTKEALLKILEDMRPVDNFD---LVLFGSKVQSWKGSLVPASNANLQAAQDFVRR 365 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 366 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 421 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + N +E + + + F +++ N Sbjct: 422 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWAQRIYEDHDATQQLQGFYNQVAN 481 Query: 448 EIFERV 453 + V Sbjct: 482 PLLTDV 487 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 73/225 (32%), Gaps = 29/225 (12%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P + VD S+ + + +A+ I + ++G + Sbjct: 32 NAASPSASKVDAVLVVDVSNSMNTSDPGKIGNEAMKMFIDMLS-----TQNDKVGIVAYT 86 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D V + + + + +KTF N T + ++ A + Sbjct: 87 DVVQREKALLNISSEADKQELKTFIDGLNRGAYTDTSVGVKEAIRILQD---------GK 137 Query: 364 NLEAKKYIVLLTDGENTQDNE----------EGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 IV+L DG N + + +AK G+ I TI + + + Sbjct: 138 TAGHAPMIVMLADGNNDFNKTTGRTESQSAQDMAQAVAEAKKSGVPIYTIGLNADG---K 194 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + L++ A + F +S +L I + + + +V+ + Sbjct: 195 LNKNKLADIAQQTGGKSFITSSADDLPNILSEIFASNLKLKVVPL 239 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 54/196 (27%), Gaps = 40/196 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ A I R+G + + + ++ + + Sbjct: 111 NRLASLKKVAADFI-------KKRPNDRIGLVVYAGESYTKTPITSDKGIVLNALKEITY 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI + T+ + + K + K I+LLTDG N E Sbjct: 164 GSLED--GTAIGMGLATSVNRL-----------KESKALSKVIILLTDGINNSGFIEPQT 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA--SPNSF 428 A I+ TI N L A + ++ Sbjct: 211 AAELAVEYDIKTYTIGLGTNGNALSPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAY 270 Query: 429 FEANSTHELNKIFRDR 444 F A + L I+ + Sbjct: 271 FRATNNESLASIYDEI 286 >gi|300113557|ref|YP_003760132.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] gi|299539494|gb|ADJ27811.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] Length = 345 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 59/181 (32%), Gaps = 24/181 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V++ L ++ + R+G F D FS + R + + Sbjct: 117 VDRLSAVKEVLGEFLQR-------REGDRVGLVVFGDAAYLQAPFSTDLQLSRRLLDECE 169 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA DA+ + S K I+ LTDG +T+ + Sbjct: 170 V--GMAGPRTAFGDAIGLGVNLFSES-----------EAPAKTIIALTDGNDTKSQVPSV 216 Query: 388 AICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 A + IRI T+A + + + L A + S+F A L I+ Sbjct: 217 EAARVAARREIRIHTVAIGDPTTAGEDKLDQQALREVAAETGGSYFFAADRASLAGIYDQ 276 Query: 444 R 444 Sbjct: 277 L 277 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/294 (9%), Positives = 75/294 (25%), Gaps = 10/294 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G I L M V++G G + +DV + L+ +A ++A+ + ++ Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAIDLGGDQA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + L + E D + + +V + Sbjct: 72 EAIVAGNEAINNNL---TLDGFGSMTIDNTDIHYEFSDSLPFDSSTNTATSPYVRVRIE- 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 D+ +++F M +S + + + T + + + Sbjct: 128 DVDVADYLVAIFNIDMSARSSAVAGPSSSITTTCNVVPLSICEGSESSTTLSGYSEGSLH 187 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + ++ + + D ++ Sbjct: 188 VLKASSTKDSAIGSGNFMPMALKDADGNT--IPGASSYGDALAGSFDTCLTVIEDEMITS 245 Query: 256 EHFVDSSSLRHVIKKK----HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 E R + + ++ + N+ V + +D Sbjct: 246 EPGNMVGPTRGIDTRFGIYEGTFKNDENKYPADKDTYYDSNNLVEVQTVQIDDG 299 >gi|55962354|emb|CAI11851.1| novel protein (zgc:56119) [Danio rerio] gi|56207241|emb|CAI21014.1| novel protein (zgc:56119) [Danio rerio] Length = 946 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 56/169 (33%), Gaps = 7/169 (4%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D+++ FN V + K + + G T IN+A+ A Sbjct: 333 DDLSIDDYFSIIDFNHNVRCWSEDLVQASSIQVDEAKKYIQNIKPNGGTNINEALLRAIQ 392 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAF 405 +I ++ + ++ I+L++DG+ T + I K + + ++ Sbjct: 393 MLIKASHHGLIDPRSVS----MIILVSDGDPTVGEIKLSTIQKNVKLRMKEEFSLFSLGI 448 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + R + N + + E K F ++ + + + Sbjct: 449 GFDVDFDFLERIAMDNRGIAQRIYANQNAAEQLKTFYSQVSSPLLRTIT 497 >gi|124249351|ref|NP_032432.2| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Mus musculus] gi|74146297|dbj|BAE28922.1| unnamed protein product [Mus musculus] gi|148692827|gb|EDL24774.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_a [Mus musculus] Length = 907 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 73/246 (29%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG+ + + + + Y + + + + Sbjct: 242 PTCSTSLLNGEFKVTYDVNRDKLCDLLVANNYFTHFFAPKNLTNMSKNLVFVIDISGSME 301 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR ++++ ++ + V++ F +V S + F Sbjct: 302 GQKVRQTKEALLKILEDMRPVDNFD---LVLFGSKVQSWKGSLVPASNANLQAAQDFVRR 358 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T +N + + + + +++LTDGE T+ + I Sbjct: 359 FSLAGATNLNGGLLRGIEILNKAQ----GSHPELSSPASILIMLTDGEPTEGETDRSQIL 414 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F + N +E + + + F +++ N Sbjct: 415 KNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWAQRIYEDHDATQQLQGFYNQVAN 474 Query: 448 EIFERV 453 + V Sbjct: 475 PLLTDV 480 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 46/445 (10%), Positives = 113/445 (25%), Gaps = 58/445 (13%) Query: 19 TGHFFIITALLMPV-------MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 G I+ A L M+ + + V+ L ++ V + + Sbjct: 194 GGDAEILVAELALPSQVRVGQMVRITAL-VESSVAQRASVRL--IGDQGVLAEEVLDLPA 250 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 Q++ + + ++ N ++ + + + Q Sbjct: 251 GSTQVPFEVQVTARGMQQLRVVVQGEVDGRVQNNEAAAIIQAYGPRQILLVAANSADAQP 310 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + S+ M + VI + Sbjct: 311 LASALEAANFQTAIRVPAEMPNDLIGLS------------------EYAAVIVVNTPARA 352 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + Q+ G + + + L Sbjct: 353 LPAGAMEALEISVRDLGRGLLMIGGEQSFGAGGYRDTPVEAALPVYMDVRDREQRPDLAL 412 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + ++K + ++AL IR + D R+G F+ + + Sbjct: 413 VFVIDRSGSMAEPAGNVQKLDIAKEALVQAIRMLYGED------RVGIVTFDSQAYTTMP 466 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + G + V G T I + + K++ Sbjct: 467 ITQG---VGEEEVLQAIASVTADGGTNIGAGLSAGQRMLT-----------GVEAKIKHM 512 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +LLTDG +++ A+ ++QGI + +A E+ + A ++ Sbjct: 513 ILLTDGWGEGNDQL--AVVEAMRAQGITLSVVA--AGSDTAEELKTL--ATAGGGRYY-- 564 Query: 432 NSTHELNKIFRDRIGNEIFERVIRI 456 + + + +E ++ Sbjct: 565 --AAAIMQAVPQILVDETITVAGKL 587 >gi|157375507|ref|YP_001474107.1| hypothetical protein Ssed_2370 [Shewanella sediminis HAW-EB3] gi|157317881|gb|ABV36979.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 461 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 49/153 (32%), Gaps = 2/153 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ + + ++ V + +D + L+ A + + A+ L Sbjct: 22 RKQGGAILVMFTIGLFSLIAVAALALDGGHLLLNKGRLQNAVDASALYAAKELQDGASLY 81 Query: 76 SSRAKNSFTFPK--QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 +R + + Q E + + + ++ +V + E S V Sbjct: 82 EAREAATTLLLQNLQYQENGELNSSIDLSAPDYNSTQVAANLFIEFSEWPDPFSPILVEG 141 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 S + + + L I ++ +A A Sbjct: 142 SEYVRIRIENVGLTNFLAQIMNFDKSVRASAIA 174 >gi|261878614|ref|NP_001159906.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform b [Homo sapiens] Length = 769 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 68/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 103 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 162 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 163 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 219 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 220 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 275 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + + F ++ Sbjct: 276 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAK 335 Query: 448 EIFERV 453 + V Sbjct: 336 PLLVDV 341 >gi|221042234|dbj|BAH12794.1| unnamed protein product [Homo sapiens] Length = 769 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 68/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 103 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 162 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 163 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 219 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 220 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 275 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + + F ++ Sbjct: 276 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQVAK 335 Query: 448 EIFERV 453 + V Sbjct: 336 PLLVDV 341 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 66/194 (34%), Gaps = 34/194 (17%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + + +A ++ R+G F R + + + + Sbjct: 117 MQPVDRLTAAKAVIADFLQRRSG-------DRVGLLVFGQRAYMLTPLTLDLSAVREQLR 169 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 T A TA+ DA+ A + + E ++ +VLLTDG NT Sbjct: 170 DTVA--GLAGRETALGDAIGLAVKRLRTQPEG-----------QRVLVLLTDGVNTTGVL 216 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKT------------QQEKARYFLSNCA--SPNSFFE 430 + + A ++ +R+ TIAF + E L A + FF Sbjct: 217 QPLKAAELAAAEQVRVYTIAFGGDGGGFSLFGVQVPVQGDEVDEATLRKVAEITGGRFFR 276 Query: 431 ANSTHELNKIFRDR 444 A+ ++L I+ + Sbjct: 277 AHDANQLAGIYAEL 290 >gi|317493250|ref|ZP_07951672.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918643|gb|EFV39980.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] Length = 544 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 92/291 (31%), Gaps = 28/291 (9%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + + ++ + ++ + LN F + V S + I Sbjct: 80 STFSADVDTGSYTVMRSLLNRGALPPSDSIRLEEWLNYFHYNYPKPVD-NSPFSVATEIA 138 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + ++ P + +D S K LV+++L ++ + Sbjct: 139 PTPWNAHSKLLRIAIKATDINATALPPANLVFLIDVSGSMSDEDKLPLVKNSLKLLVNKM 198 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + D ++ ++ + + G ++ + + + GSTA + Sbjct: 199 RDQDKIS------IVIYSGETKTVLPPTSGK---DKSDILSAINQLSAGGSTAGGSGIDL 249 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMT 402 AY E + N I+L TDG+ + + K K + GI + T Sbjct: 250 AYQM----AEKGFIKNGINR-----IILATDGDFNVGITDTQQLEEKIKKKSKNGINLTT 300 Query: 403 IAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 + F + + A ++ +S E K+ +++ + + Sbjct: 301 LGFGQGNYNDS----LMMHIADVGNGNYAYIDSMQEAQKVLVEQLSSTMSV 347 >gi|209546481|ref|YP_002278399.1| hypothetical protein Rleg2_4401 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537725|gb|ACI57659.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 534 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 50/176 (28%), Gaps = 15/176 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ G+ +T + MP++LG +++DV R S L+ A + + Sbjct: 2 LSRVVRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKI---------------EEYLIRNFENNLKKNFTDREV 111 L + +S Y N + F + + Sbjct: 62 ELDGRDDAISRAQTAIEKIANSAAFSGGGAGMSLGSHISVTYDAGNDAGSTVTVFFLKNI 121 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + + A + + ++F +G I A+A V Sbjct: 122 PANDDTSIPDSMKTTVASEASYAWVIAKPQAMQTIFPVPVGFTRSTINIAADAVAV 177 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 41/322 (12%), Positives = 93/322 (28%), Gaps = 26/322 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V+ + Y +N + + ++ + + + G S + + + Sbjct: 219 VITTKGYTNTINYDEEYAKL--SENQFKKATLNPLSTFSIDVDKAGYSNVRRMINNGDNI 276 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 Y + +N F + + + + L Y Sbjct: 277 PYDAVKIEEMVNYFDYDYPQPTD-EHPFSISTDVAKTPWNTQTQLVRIGLQGKEYLNEEL 335 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D S K L+ A ++ + + D V+ + Sbjct: 336 PASNLTFLIDVSGSMEDHNKLPLLISAFKLLVHQLIEKDKVS------IVVYAGAAGVVL 389 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G + + GSTA ++ AY E + NN Sbjct: 390 PPTNG---DQKEKIINALQKLEAGGSTAGGQGIKLAYKL----AEKNFKKNGNNR----- 437 Query: 371 IVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++L TDG+ + + K ++ G+ + + F + + K + Sbjct: 438 VILATDGDFNVGASSDTAMEKLIEKKRASGVFLSVLGFGMGNYKDSKLETLADK--GNGN 495 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 ++ E K+F D G + Sbjct: 496 HAYIDTMQEAQKVFGDEFGGTL 517 >gi|260467412|ref|ZP_05813583.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259028808|gb|EEW30113.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 354 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/286 (9%), Positives = 67/286 (23%), Gaps = 7/286 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + +S +G + L++ ++ G VD +QA A+++ + Sbjct: 2 RRVASGFARSRSGSMMPLFFLMLVPIISAVGFSVDYTSAVQTRSNQQQALDAALLSITTM 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV--EMNPR 125 S A + +F + ++ Sbjct: 62 DTTSTLAQRQAALQDSFIANGGQGTATLNSFVAGTTTAAATARATASFAMPTIFMKIARI 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + V+S+ + F + W + K + Sbjct: 122 DTVPVAVVSAVSKPPSLVNATFKVTGVSGYWNKTMTLYGTQFGAAVAKPLMTIDYTYGNT 181 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + + G+ V++ G L +Y Sbjct: 182 KDPKGYGTTNVSVLTTDSAGKTVTTLVQNQVCTVGNNPPTGVTLKTDASGTKYYCVDTMY 241 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 P + VD S++ + + + ++ + N Sbjct: 242 PANSAGAA-----VDVSTMGGLYLQMDVPSGNPKKLMSNDPATSNR 282 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 41/376 (10%), Positives = 95/376 (25%), Gaps = 28/376 (7%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ + + + P + + N ++ Sbjct: 19 AAGLCDDLDSDAPPVEYAARSAPVLQKAAASPQAQAIPDLSNRLGV---NMPVPAGANPQ 75 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + R + + + S + A T S +I+ ++ Sbjct: 76 WALNKSVARMGIRGTVEVQNRERYAHSDANPVHRVS-DAPVSTFSIDVDTGSYSNIRRML 134 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + + LN F + D ++ Sbjct: 135 TRENRLPPADAVRVEEILNYFAYGYPLP-QDGKPFAVHTQTVDSPWQADAKLIRIAIQAA 193 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P + +D+S K LV+ + +++ D R+ ++ Sbjct: 194 DLAPEKRPPANLVFLIDTSGSMDDPDKLPLVKKTVCHFAEALRADD------RISLITYS 247 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + G+TA +A++ AY + + Sbjct: 248 GSTAEILPPT---AGDQKETIIAALKPLRAHGATAGGEALRMAY----DAAAKNYRKDGI 300 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLS 420 N I+L TDG+ + + N + GI + T+ + E Sbjct: 301 NR-----ILLATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQL-- 353 Query: 421 NCASPNSFFEANSTHE 436 A ++ +S E Sbjct: 354 ADAGDGNYSYIDSEAE 369 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 23/274 (8%), Positives = 74/274 (27%), Gaps = 2/274 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS-VPLIQSLE 73 ++ G ++ ++++ ++LGV +D+ + L+ A TA + + V Sbjct: 6 RRTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDV 65 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + A + +F + +++ V + + V + Sbjct: 66 DQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRV 125 Query: 134 SSR-YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + ++F + + + ++ A + + Sbjct: 126 AVTDMGISQYLSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRP 185 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + ++ G + + +L G + Sbjct: 186 PGYDPNVDMDPSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNSGAALVRDALSGAYNGC 245 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + V + + L IK Sbjct: 246 AAVGNTVTTKPGNSAGPVAQGLNVRLNDFSGPIK 279 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 60/185 (32%), Gaps = 29/185 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + I K + + A A ++ ++ D +G F+ F + + Sbjct: 420 DTEDAGIPKLEIAKSASAKMVEHLESSD------GVGVIAFDHNYYWAYKFG---KLVRK 470 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V G TAI + A T+ S K +VLLTDG Q Sbjct: 471 EDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKN-----------KLVVLLTDGMGEQ 519 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 E ++AK I+I TI LS A + F+ ++ EL Sbjct: 520 SGYEIP--ADEAKRNNIKITTIGVGKFVN-----ASVLSWIADYTSGRFYLVSNPSELVD 572 Query: 440 IFRDR 444 +F Sbjct: 573 VFLKE 577 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 65.8 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 55/153 (35%), Gaps = 21/153 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F DR ++ V + R + + +TAI+D + A + +S Sbjct: 125 DRIGLVIFGDRAYFAQPLTFDVDAVARAVDEAQI--GISGRATAISDGLGLAMKRLAAS- 181 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VN 408 + +VL++DG +T N + + A GIRI TIA Sbjct: 182 ----------EAPTRVVVLMSDGVDTSGNVQAVDAARLAAGHGIRIHTIALGPEDLENQP 231 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 +++ L A S + F +L Sbjct: 232 RSRDAVDTKTLREVAELSGGTAFRVRGMADLEA 264 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 36/302 (11%), Positives = 84/302 (27%), Gaps = 11/302 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS-- 65 Y +L ++ G ++ + M + + + +DV + + L+ A T + + Sbjct: 13 RQYPNRL-RNQKGLTLVVMTMSMVAFITIAALSIDVSHFVVNKTRLQNAVDTIALAGATV 71 Query: 66 --VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + + +E + ++ N E D Sbjct: 72 ANRTNEKGDTDTAIIESYKKVIESPGNDEIELTATDDGNGLNLLSIEYSDSPNSGFSTTF 131 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 P V ++ LN FL GI+ + + + + Sbjct: 132 PSSPDMVYVRVEVSEIGLN--EFFLDLFGIEKTVSASAVAGPVFVDGTSNILPIGMCEGS 189 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S + + E + + P +Y + G + + Sbjct: 190 ELGPSGYEIAKAYEIKSGSQSDDPDGLGAGNYHLLDIGSGKSAVSDALVGKNTVVVTVED 249 Query: 244 LYPGPLDPSLSEEHFVDSS----SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 ++ +DS +L V LV ++ I + D+ + T Sbjct: 250 KIDSETGNAVGATEDIDSRFEGVTLDGVYYPPDLVITEPSTAITTDNIADHNDSTAPDKW 309 Query: 300 TF 301 T+ Sbjct: 310 TY 311 >gi|21322624|emb|CAC27151.2| inter-alpha-trypsin inhibitor heavy chain 2 [Platichthys flesus] Length = 304 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 7/168 (4%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D++ FN V + T K + + G T IN+A+ A Sbjct: 19 DDLTIDDHFIIVDFNHNVRCWNEELVHGSSIQVTDAKKYIQNIKPNGGTNINEALMRAVQ 78 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QGIRIMTIAF 405 ++ ++ + ++ I+L++DG+ T + I K + + ++ Sbjct: 79 MLVRASNQGMIDPRSVS----MIILVSDGDPTVGEIKLSTIQKNVKRVMREEFSLFSLGI 134 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + R + N + + E + F ++ + + R+ Sbjct: 135 GFDVDYDFLERIAMENRGMAQRIYANHDAAEQLRAFYSQVSSPLLRRI 182 >gi|310818002|ref|YP_003950360.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309391074|gb|ADO68533.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 568 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 48/418 (11%), Positives = 116/418 (27%), Gaps = 44/418 (10%) Query: 46 WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 S L +AA +A++ ++ +L T P E ++ + + Sbjct: 1 MSLTIQKLARAALSAVLLLTLTASCALAPPPRPKAEPGTDPDDYALEEMVVVGRADSSET 60 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 A + + ++ + ++ + + + Sbjct: 61 RAPPRPPPKKEKAAEFSRVAVVNGKPFADMYFKHYGVNPTIDTEEENVSTFSVDVDSASY 120 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 ++R+Y + + + + +N F +V Sbjct: 121 ALARAYLSRN------------HLPAEEAIRVEEFVNAFRYDYQDP--GAEPFGVQVEAF 166 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + L +D S ++ + LV+ +LA ++ + Sbjct: 167 PSPNRQGYHVLHVGLQGQKVSAAERLPAHLVFTIDVSGSMNMENRLELVKRSLAMLVEKL 226 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 D + + D + + + R+ + + GST + +Q Sbjct: 227 DSRDTLA------IVVYGDTARTVLEPT---RIMDRSRILEAINALHPEGSTNVQAGLQV 277 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QGIRIMT 402 AY S + ++L +DG + +I K+ QG+R+ T Sbjct: 278 AYAIAASQVREGATSR---------VILCSDGVANNGITQADSIFQSVKAYAQQGVRLTT 328 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF-------RDRIGNEIFERV 453 + F + E + ++ E +IF I ++ +V Sbjct: 329 VGFGMGNYNDELMERLSH--VGDGQYAYVDALPEARRIFIEQFTGTLQLIARDVKVQV 384 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 57/175 (32%), Gaps = 24/175 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ I + + + + +I S V +L ++K I Sbjct: 602 LKTWTGEFIDKLGVQEYGAQ---VALVKYATSIIKVSELSSDVDELKEKLMKVPFIQ--- 655 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ A + + K I+L+TDG+ T I K Sbjct: 656 -GKTNTGGALERAQQML----------AEGRPSVPKIILLITDGDATDKERLDAQI-EKL 703 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGN 447 K I I TI + E L+ A+ + +E ++KI +G Sbjct: 704 KKSNILIYTIGVGDLIDRNE-----LNRIATDEDFVYETRDFDSISKIKSSLLGR 753 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 51/381 (13%), Positives = 106/381 (27%), Gaps = 26/381 (6%) Query: 47 SYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF 106 Q+ +A + ++ S + K T+ + I++ ++ + + Sbjct: 569 EDVRSK--QSCDSASMDMAIVFDGSDSVKADNFKKLKTWTGEFIDKLGVQEYGAQVALVK 626 Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + + ++ ++ +V L + + + T Sbjct: 627 YATSIIKVSELSSDVDELKEKLMKVPFIQGKTNTGGALERAQQMLAEGRPSVPKIILLIT 686 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + KE + + S ++ + N + A Y +++ + Sbjct: 687 DGDATDKERLDAQIEKLKKSNILIYTIGVGDLIDRNELNRIATDEDFVYETRDFDSISKI 746 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + K D S+ + VDSSS A V Sbjct: 747 KS--SLLGRVCKKAKPKTSGVCGDISVDLQFIVDSSSSVTRKN----FGFAKNFVANVSS 800 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D + V++G ++ V SD + G V+ + G T A+ Sbjct: 801 VFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGDTHTGTAL--- 857 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + E K ++ +TDG QD A + +G+RI I Sbjct: 858 --------RYISTNNRWREEVPKILIFVTDGTP-QDRAIVPAAARSLRDKGVRIFAIGVG 908 Query: 407 VNKTQQEKARYFLSNCASPNS 427 L AS Sbjct: 909 ------NAVESELKEIASEPY 923 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/368 (10%), Positives = 94/368 (25%), Gaps = 22/368 (5%) Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 +Y+ ++ + +Q + S ++ + F Sbjct: 212 DYMKNWVKSIAASFKVGENFARFSVVQYTKTAKTVVDFQTLDLSSISQKIDSMIYFQGRN 271 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS----MLDYQRDSEGQPLNCFGQP 207 G A + E G ++ S + ++ F Sbjct: 272 GRGGKTFTGNALERAHTLLKESEPGRKRIVLLLTDGSSDDEYGPVAKAIRDDKVDIFAVG 331 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 R K+ + R + + + + + + +D + Sbjct: 332 VGRARKNELVEITGDEQRVWQTRTFNNIGQFNQKLLAEVCSASEDVCPDQELDLVFMLDS 391 Query: 268 IKKKHLVRD--ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 A +I K ++ + I + Sbjct: 392 SGSIGTANFEIARDWLISVTKSFQIGPVATQVSVIQYTSEPIPEFDL-NDFKTAAEVEDG 450 Query: 326 TFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + + T A+ D I + N +A K +VL++DG++ Sbjct: 451 IKRMRLQNGSTRTDKARALDFVSDVIFTKN------FGARSQAPKVLVLISDGQDNHPE- 503 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIF 441 + K Q + + I + + ++ A+ AN+ +NK F Sbjct: 504 RVLEAAKKLHGQDVSVFVIGIG---NESQMNIKMMNQIATEPITKHIKYANTVEGINK-F 559 Query: 442 RDRIGNEI 449 ++ + +I Sbjct: 560 KNALTGQI 567 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 21/112 (18%), Positives = 38/112 (33%), Gaps = 14/112 (12%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T I +A+ Y + +S + ++ ++++TDG+ E Sbjct: 69 NGDTCIGEALDYFYRNMFTSQ------AGQRSDVEQRVIVMTDGKRNCPAEIAKP-AELI 121 Query: 394 KSQGIRIMTIAF----SVNKTQQEKARYFLSNCAS---PNSFFEANSTHELN 438 ++Q I I + R L AS FE N+ +L Sbjct: 122 RAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQLI 173 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 46/138 (33%), Gaps = 13/138 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K+ +++ + S+ S K +N + T V+ + + F Sbjct: 209 KRFDYMKNWVKSIAASFKVGENFARFSVVQYTKTAKTVVDFQTLDLSSISQKIDSMIYFQ 268 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T +A++ A+ + S K+ ++LLTDG ++E Sbjct: 269 GRNGRGGKTFTGNALERAHTLLKESEPGR----------KRIVLLLTDG---SSDDEYGP 315 Query: 389 ICNKAKSQGIRIMTIAFS 406 + + + I + Sbjct: 316 VAKAIRDDKVDIFAVGVG 333 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 54/211 (25%), Gaps = 42/211 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + Sbjct: 300 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRKQSLLA-V 352 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 G T I + ++ I I+LL+DG++T Sbjct: 353 NSLTSNGGTNIAEGLRKGSKVIEERQ---------AKNPVCSIILLSDGQDTYTVSPSTG 403 Query: 382 ----DNEEGIAICNKAKS------------QGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 A KA + Q + + F + S Sbjct: 404 AHKPYTVSPTAGAQKASAEYCALLPSTNGSQQVPVHVFGFGADHDAVSLHSISQ---TSG 460 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 +F + + F IG + + Sbjct: 461 GTFSFIETEATIQDAFAQCIGGLLSVVAQDL 491 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 65.8 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 59/187 (31%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ + D R+ F+ R +KT Sbjct: 59 LETVKRAAQKLVDRLLPSD------RLAVIVFDHVAKVLIP---NQPVTDRDKIKTRISH 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 MG TAI++ +Q +I++ + + I LLTDGEN +N + + Sbjct: 110 LAAMGGTAIDEGLQLGLTELIAAKAGAISQ----------IFLLTDGENEHGNNSRCLQL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 +A + I + T+ F + + L A S ++ F Sbjct: 160 AEEAAKENITLNTLGFGYHWN-----QDVLEQIADAAGGSLMFIEYPQDVLIGFERLFNQ 214 Query: 448 EIFERVI 454 I Sbjct: 215 IISVGFT 221 >gi|284052693|ref|ZP_06382903.1| von Willebrand factor type A domain-containing protein [Arthrospira platensis str. Paraca] gi|291571888|dbj|BAI94160.1| von Willebrand factor type A [Arthrospira platensis NIES-39] Length = 541 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 84/294 (28%), Gaps = 25/294 (8%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 P + + ++ + T S +++ I+ + +N Sbjct: 70 PNTETYDLIDENNFQLVAANPLSTFSIDVDTASYSNVRRFINQR-QRPPIDAVRIEELIN 128 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 F + + + + +P + L P + +D S Sbjct: 129 YFSYDYPQP-QGEEPFSVTTEVSSAPWNPQHQLVHIGLQGKTLAIEELPPSNLVFLLDVS 187 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + L+++ ++ + + D V V + + G + Sbjct: 188 GSMNQPNRLPLLKEGFKLLVDQLSEQDTVAIAV------YAGAAGVVLPPTPGNE---KQ 238 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + GSTA + ++ AY+ + + ++L TDG+ Sbjct: 239 KIIAAIDGLQAQGSTAGGEGIKLAYELATRMLSEGKNNR---------VILATDGDFNVG 289 Query: 383 NEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + +GI + + F + + K ++ ++ ++ Sbjct: 290 VSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNH--GNGNYAYIDN 341 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 54/200 (27%), Gaps = 38/200 (19%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + + A I R+G F + + L+ Sbjct: 105 AQDFSPTRLEAAKKVAAEFIND-------RPNDRIGLVIFGGESFTQCPLTTDHKVLL-- 155 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + T TAI + + + + S + ++LLTDG N Sbjct: 156 NLLTEVKFGMIEDGTAIGLGLANSVNRLKDS-----------KSKSRVVILLTDGSNNAG 204 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------EKARYFLSNCA--S 424 + A S IR+ TI T + L+ A + Sbjct: 205 QIAPLTAAELAASYDIRVYTIGIGSRGTSTARIMTPYGLQTMQVSGDFDERTLTEIAAIT 264 Query: 425 PNSFFEANSTHELNKIFRDR 444 +F A L+ I+ + Sbjct: 265 KGQYFRATDNTSLSAIYDEI 284 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 59/199 (29%), Gaps = 41/199 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + + ++G F + + + + + + Sbjct: 111 NRLSFAKQVLGKLFDGL-------QNDKVGLVVFAGNAYTQIPITTDLSAA-KQFLADIS 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI A++ A + N E K I++LTDGEN + N I Sbjct: 163 PNMVTAQGTAIGAAIELASKSFSD-----------NKEIGKTIIVLTDGENHEGNA--IE 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY-FLSN--------------C-----ASPNSF 428 +A GIR+ I +L + C A +F Sbjct: 210 AAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPVVTKFDEKMCRDIASAGEGTF 269 Query: 429 FEANSTHELNKIFRDRIGN 447 F S L + ++ Sbjct: 270 FSGQSASALVRAIESQLDK 288 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 59/199 (29%), Gaps = 41/199 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + + ++G F + + + + + + Sbjct: 111 NRLSFAKQVLGKLFDGL-------QNDKVGLVVFAGNAYTQIPITTDLSAA-KQFLADIS 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI A++ A + N E K I++LTDGEN + N I Sbjct: 163 PNMVTAQGTAIGAAIELASKSFSD-----------NKEIGKTIIVLTDGENHEGNA--IE 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY-FLSN--------------C-----ASPNSF 428 +A GIR+ I +L + C A +F Sbjct: 210 AAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPVVTKFDEKMCRDIASAGEGTF 269 Query: 429 FEANSTHELNKIFRDRIGN 447 F S L + ++ Sbjct: 270 FSGQSASALVRAIESQLDK 288 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/368 (7%), Positives = 85/368 (23%), Gaps = 6/368 (1%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A + + A+ L +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDL-TGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + + T + + S V + ++ Sbjct: 60 GITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCMTSQTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFICKAGTQTNPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + + D V +F+ + Sbjct: 240 NAYNTRFGIYANPYKDPSYGTPDFTGYAYDATTWPSQSNAYADFVSKRQTFTSY-----Q 294 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T + + + VL+ D + Sbjct: 295 GDLITGINTGGTYNPNYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPMGSG 354 Query: 382 DNEEGIAI 389 + + + Sbjct: 355 GSAGPVHL 362 >gi|22758319|gb|AAN05523.1| unknown protein [Oryza sativa Japonica Group] Length = 731 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 61/208 (29%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A++ VI+++ D R+ F+ + ++ Sbjct: 275 TKLALLKRAMSFVIQALGPGD------RLSVVTFSSSARRLFPLRKMTESGRQRALQ-RV 327 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A + IVLL+DG +T Sbjct: 328 SSLVADGGTNIADALRKAARVMEDR---------RERNPVCSIVLLSDGRDTYTVPVPRG 378 Query: 389 IC------NKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 + A ++ +++ F + + + A S +F Sbjct: 379 GGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGADHDSPA-----MHSIAEMSGGTF 433 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ + F IG + + Sbjct: 434 SFIDAAGSIQDAFAQCIGGLLSVVAQEL 461 >gi|239814531|ref|YP_002943441.1| hypothetical protein Vapar_1524 [Variovorax paradoxus S110] gi|239801108|gb|ACS18175.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 409 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 27/290 (9%), Positives = 73/290 (25%), Gaps = 4/290 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 +G I AL++ +LG G+ +D R + L+ A + ++A+ L + + + Sbjct: 8 HRQSGAVIITVALVLLFLLGFMGIALDFGRLFIVKTELQTALDSCALSAAQELDGAGDAL 67 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + T + + + T + + A + S Sbjct: 68 TRATSAGKTAADLNKINFQGEATGLAQTEVVFSDSLIGTYSHTFTPIANARYAKCLHTKS 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + ++ A +R+ + + + V + + Sbjct: 128 GIAPWILHALSAFSG-NSTYGASKSVAAVAVATRAPSQTNCLVPVGVCQRTATPGWGFAR 186 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 E ++ V+S + S L + + Sbjct: 187 GEW---IEGVTNSNDDVESGQFRWVDFSGSGGGAREIKDLLTSSGQCGLPGMATNVGKAG 243 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ A ++ + ++D Sbjct: 244 KTNGAVAAWNTRFGIYQGSLSAATAIPDQTGYAWYADSVAATNPGRYDDS 293 >gi|332254494|ref|XP_003276364.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Nomascus leucogenys] Length = 1313 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 57/194 (29%), Gaps = 15/194 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTF 327 K + A+ ++ +K D F+D V + + K + Sbjct: 297 TKIEQTKKAMNVILSDLKANDY------FNIISFSDTVNVWKAGGSIQATIQNVHSAKDY 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN A+ A + SN++ I+ L DGE T Sbjct: 351 LHCMEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIP--LIIFLMDGEPTAGVTTPS 408 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 +C + + ++AF + R L N +E + + Sbjct: 409 VILSNVCQAL-GHRVSLFSLAFGDDADFTLLRRLSLENRGIAQRIYEDTDAALQLEGLYE 467 Query: 444 RIGNEIFERVIRIT 457 I + V R+ Sbjct: 468 EISMPLLADV-RLN 480 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 59/196 (30%), Gaps = 35/196 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ + R Sbjct: 272 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHL-QRMSHSGRQQALQTV 324 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 G T I DA++ A I + I+LL+DG++T + Sbjct: 325 NSLVASGGTNIADALKKAAKVIEDRSH---------QNPVCSIILLSDGQDTYNIPSNIR 375 Query: 383 -------NEEGIAICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + +I N ++ G + + F V+ AS +F Sbjct: 376 GARPEYSSLVPSSILN--RTFGLVPVHGFGFGVDHDSDALHSIAE---ASGGTFSFIEDE 430 Query: 435 HELNKIFRDRIGNEIF 450 + F IG + Sbjct: 431 GVIQDAFAQCIGGLLS 446 >gi|115482404|ref|NP_001064795.1| Os10g0464500 [Oryza sativa Japonica Group] gi|110289213|gb|AAP54178.2| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113639404|dbj|BAF26709.1| Os10g0464500 [Oryza sativa Japonica Group] Length = 719 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 61/208 (29%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A++ VI+++ D R+ F+ + ++ Sbjct: 275 TKLALLKRAMSFVIQALGPGD------RLSVVTFSSSARRLFPLRKMTESGRQRALQ-RV 327 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A + IVLL+DG +T Sbjct: 328 SSLVADGGTNIADALRKAARVMEDR---------RERNPVCSIVLLSDGRDTYTVPVPRG 378 Query: 389 IC------NKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 + A ++ +++ F + + + A S +F Sbjct: 379 GGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGADHDSPA-----MHSIAEMSGGTF 433 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ + F IG + + Sbjct: 434 SFIDAAGSIQDAFAQCIGGLLSVVAQEL 461 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 51/165 (30%), Gaps = 37/165 (22%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++G F + + + + D + TAI A+ A + Sbjct: 128 NDKIGLIVFAGDAYVQLPITTDYSSA-KLFLSNISTDIVPVQGTAIGSAIDLAARSFTPE 186 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 E K I+++TDGEN QD+ A +A +GI I TI + + Sbjct: 187 -----------TETSKAIIVITDGENHQDDAV--AAAKQAHEKGIVIHTIGMGLEQGAPI 233 Query: 413 --------------------EKARYFLSNC--ASPNSFFEANSTH 435 + L + A + A++T Sbjct: 234 PEKGKPGQFMQDAQGNVVISKLDEQTLQDIAKAGEGLYIRASNTE 278 >gi|295093271|emb|CBK82362.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Coprococcus sp. ART55/1] Length = 612 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 53/437 (12%), Positives = 116/437 (26%), Gaps = 49/437 (11%) Query: 23 FIITALLMPVMLGVGGM----LVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 ++ ++M G+ G VD + E++ Sbjct: 15 AVMAGVMMVG--GLTGCGNDDAVDGRGMRLEKQ-----------------NGDKTEITGI 55 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ ++ ++ L N + + + ++ + + + +S Y Sbjct: 56 AEKYREKESKQKKKSLQDNGLSGIFNSTSNDVMYAEEACDVATVAGDTAMVTDTSNSMYT 115 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + + + + + S S + Sbjct: 116 EIAYDTREYDSVAENGFVSTADRPLSTFAA--DRDTASYSNVRSYIESGCLPPDGAVRIE 173 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + LN F R + + D + + + Sbjct: 174 EMLNYFTYDYRRKPEDGEKFSIYTEYSDCPWNKATKLMMVGINTDEIDFGDKKPSNLVFL 233 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S + K LV+ + A + ++ + D V+ D G Sbjct: 234 IDTSGSMYEDNKLPLVQQSFAMLAENLDENDKVSIV---------TYAGEDTVVLSGTSG 284 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G T DA+ TAY+ E NN ++L TDG+ Sbjct: 285 SEQYTINEALSSMTAEGCTNGGDAIITAYEL----AEKNFIEGGNNR-----VILATDGD 335 Query: 379 NTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + + K I + + F + + K N S+ +S Sbjct: 336 LNVGLTSESDLVDLITEEKKENNIFLSVLGFGTDNLKDNKLEALADN--GDGSYAFIDSA 393 Query: 435 HELNKIFRDRIGNEIFE 451 +E K+ D +G + Sbjct: 394 YEAKKVLVDEMGGTLNT 410 >gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Callithrix jacchus] Length = 946 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/254 (11%), Positives = 68/254 (26%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + V + D Y ++ D + Sbjct: 260 ICPNCRETAVDGELVVLYDVSREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN + + + K K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVVDFNHNIRTWRNDLISATKTQVADAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEANN----MGLLDPNSVSLIILVSDGDPTVGELKLS 432 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T + F + Sbjct: 433 KIQKNVKENIRDNISLFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLRKFYN- 491 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 492 ---QVSTPLLRNVQ 502 >gi|332817002|ref|XP_003309880.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan troglodytes] Length = 769 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 103 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 162 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 163 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 219 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 220 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 275 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + F ++ Sbjct: 276 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 335 Query: 448 EIFERV 453 + V Sbjct: 336 PLLVDV 341 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/372 (10%), Positives = 97/372 (26%), Gaps = 25/372 (6%) Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 + E +++ ++ + + + L + L Sbjct: 28 QAGAPAMLYEAAPAAARDSVMYERAPVAMQSNLAAKRSGVTLYGQLHAKYLPAAEARPLR 87 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 + + + T S +++ ++ L + + + Sbjct: 88 TDTERYQKQPENPVKAVAQEPVSTFSIDVDTGSYANVRRFLNN--GRLPPKDAVRIEEIV 145 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + + + D + P + VD S Sbjct: 146 NYFPYSYPLPQDGRPFAVHTQTVDSPWQSEAKLIKIGIQAQDTAKKDLPPANLVFLVDVS 205 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 K LV+ L + ++ D V + + G + Sbjct: 206 GSMTDPDKLPLVKKTLRILTEQLRPQDKVT------LITYASGEQLVLPPTSGK---DKD 256 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + G+T+ A++ AY+ E + N I+L TDG+ Sbjct: 257 TILRALNALHAGGATSGERALRMAYE----QAEKAYVKNGINR-----IILATDGDFNVG 307 Query: 383 NEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + + GI + T+ + + A ++ +S E K Sbjct: 308 VSDTETLKSLVAEKRKSGISLSTLGYGTGNYNEAMMEQIAD--AGDGNYSYIDSEKEARK 365 Query: 440 IFRDRIGNEIFE 451 + R ++ + + Sbjct: 366 VLRHQLTSTLAT 377 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 62/179 (34%), Gaps = 26/179 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +V++ +++ I S MG F ++ + L + + Sbjct: 109 TRFDVVKEIVSNFISS-------RKNDNMGIVVFGAYSFIASPLTYDSNILKGVVSNLYI 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA+ +++ + + K + K +LLTDG NT D+E Sbjct: 162 --GMAGKFTALFESLAQGVNLL-----------KTSKSKTKIAILLTDGYNTPDSEFPFD 208 Query: 389 IC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A QG+++ I + + L A + F A++ EL ++ Sbjct: 209 AAIDFANKQGVKVYPIGIGKSDEYN---QKMLEKIAEQTGGVAFGASNASELAIVYAKI 264 >gi|114587328|ref|XP_001172432.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 2 [Pan troglodytes] Length = 893 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 245 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + F ++ Sbjct: 418 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 477 Query: 448 EIFERV 453 + V Sbjct: 478 PLLVDV 483 >gi|114587324|ref|XP_001172464.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 5 [Pan troglodytes] Length = 911 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 245 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + F ++ Sbjct: 418 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 477 Query: 448 EIFERV 453 + V Sbjct: 478 PLLVDV 483 >gi|114587330|ref|XP_001172443.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 3 [Pan troglodytes] Length = 898 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 245 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + F ++ Sbjct: 418 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 477 Query: 448 EIFERV 453 + V Sbjct: 478 PLLVDV 483 >gi|114587326|ref|XP_001172455.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 4 [Pan troglodytes] Length = 911 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 245 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + F ++ Sbjct: 418 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 477 Query: 448 EIFERV 453 + V Sbjct: 478 PLLVDV 483 >gi|114587332|ref|XP_001172399.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 1 [Pan troglodytes] Length = 774 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 245 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + F N + N +E + + F ++ Sbjct: 418 KNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 477 Query: 448 EIFERV 453 + V Sbjct: 478 PLLVDV 483 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 63/177 (35%), Gaps = 19/177 (10%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + D L +V + +K++ ++ R+G F F+ + + +T Sbjct: 116 VTVDRLDAVKQVLKELAANRESDRLGLIVFGAAAYLQTPFTDDHQVWQQLLDETEI--GM 173 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ST DA+ + S+ D + +++LTDG +T + Sbjct: 174 AGPSTVFGDAIGLSIKLFSDSDSDN-----------RVLIMLTDGNDTGSTVPPVDAAKV 222 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYF-----LSNCASPNSFFEANSTHELNKIFRDR 444 A + G+RI TIA T E A +S A F A E+ + + Sbjct: 223 AAANGVRIYTIAIGDPATVGEDALDMDTITRVSKIA-DGRTFRALDQDEMRQAYITI 278 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 40/112 (35%), Gaps = 14/112 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R + + + +G TAI +A+ D Sbjct: 128 DRVGLIAFAGRAFTYCPLTSDYSAFRLFLNDLNVN-IIPVGGTAIAEAIYKGID------ 180 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N K ++++TDGEN + + +KAK +GI I T+ Sbjct: 181 -----AFGENENNHKAMIIITDGENH--ETDPLKAASKAKEKGIVIYTVGVG 225 >gi|125532269|gb|EAY78834.1| hypothetical protein OsI_33939 [Oryza sativa Indica Group] gi|125575070|gb|EAZ16354.1| hypothetical protein OsJ_31816 [Oryza sativa Japonica Group] Length = 654 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 61/208 (29%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A++ VI+++ D R+ F+ + ++ Sbjct: 198 TKLALLKRAMSFVIQALGPGD------RLSVVTFSSSARRLFPLRKMTESGRQRALQ-RV 250 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I DA++ A + IVLL+DG +T Sbjct: 251 SSLVADGGTNIADALRKAARVMEDR---------RERNPVCSIVLLSDGRDTYTVPVPRG 301 Query: 389 IC------NKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 + A ++ +++ F + + + A S +F Sbjct: 302 GGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGADHDSPA-----MHSIAEMSGGTF 356 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ + F IG + + Sbjct: 357 SFIDAAGSIQDAFAQCIGGLLSVVAQEL 384 >gi|35465|emb|CAA32821.1| lambda HuHITI-13 [Homo sapiens] Length = 545 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 43/132 (32%), Gaps = 7/132 (5%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 KTF + G T IND + + + E+ ++ +++LTDG+ Sbjct: 4 KTFVKSMEDKGMTGINDGLLRGISMLNKAREEHRIPERSTS----IVIMLTDGDANVGES 59 Query: 385 EGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I ++ + + F N L N +E + + F Sbjct: 60 RPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGF 119 Query: 442 RDRIGNEIFERV 453 + + N + V Sbjct: 120 YEEVANPLLTGV 131 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 60/183 (32%), Gaps = 17/183 (9%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 H I + V++ L I R+G F D F+ + +++ Sbjct: 124 HKISRLDAVKEVLTDFI-------KTRQGDRLGLILFGDAAFVQTPFTADHDVWLDLLMQ 176 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T ST + DA+ + + + +K ++L+DG +T Sbjct: 177 TRVN--MAGKSTHLGDAIGLTIKRFNEATK----NQTSEKTREKVAIILSDGNDTGSYVP 230 Query: 386 GIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 I AK +RI IA +Q ++ A S ++A + EL + Sbjct: 231 PIDAAMVAKVNAVRIYMIAIGDPKSVGEQSLDMQTINKIASVSGGQAYQALNQQELLNAY 290 Query: 442 RDR 444 Sbjct: 291 AKI 293 >gi|27754463|gb|AAO22679.1| unknown protein [Arabidopsis thaliana] Length = 641 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 58/199 (29%), Gaps = 41/199 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L+++A++ VI+++ + D R+ F+ + ++ Sbjct: 219 KMELMKNAMSFVIQNLGETD------RLSVISFSSMARRLFPLRLMSETGKQAAMQA-VN 271 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--------- 380 G T I + ++ I + ++LL+DG++ Sbjct: 272 SLVADGGTNIAEGLKIGARVI-EGRRWKNPVSG--------MMLLSDGQDNFTFSHAGVR 322 Query: 381 ---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTH 435 + C I I T F + + S +F + Sbjct: 323 LRTDYESLLPSSCR------IPIHTFGFGSDHD-----AELMHTISEVSSGTFSFIETET 371 Query: 436 ELNKIFRDRIGNEIFERVI 454 + F IG + ++ Sbjct: 372 VIQDAFAQCIGGLLSVVIL 390 >gi|15223093|ref|NP_172283.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|8778841|gb|AAF79840.1|AC026875_20 T6D22.13 [Arabidopsis thaliana] gi|332190114|gb|AEE28235.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 641 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 58/199 (29%), Gaps = 41/199 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L+++A++ VI+++ + D R+ F+ + ++ Sbjct: 219 KMELMKNAMSFVIQNLGETD------RLSVISFSSMARRLFPLRLMSETGKQAAMQA-VN 271 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--------- 380 G T I + ++ I + ++LL+DG++ Sbjct: 272 SLVADGGTNIAEGLKIGARVI-EGRRWKNPVSG--------MMLLSDGQDNFTFSHAGVR 322 Query: 381 ---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTH 435 + C I I T F + + S +F + Sbjct: 323 LRTDYESLLPSSCR------IPIHTFGFGSDHD-----AELMHTISEVSSGTFSFIETET 371 Query: 436 ELNKIFRDRIGNEIFERVI 454 + F IG + ++ Sbjct: 372 VIQDAFAQCIGGLLSVVIL 390 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/336 (8%), Positives = 88/336 (26%), Gaps = 3/336 (0%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G ++ ++++ ++LG+ +D+ + L+ A +A + +V + ++ Sbjct: 11 RTQKGITLVLISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVVVDENGNVS 70 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ T + + T R A P Y + + Sbjct: 71 AAETAAKATLSSISASDGNAELVFTDSNTAVTFSTDRATFVSAASFTPPASGEYDIYVRV 130 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + L K+ A +++ + Y Sbjct: 131 AVTEIGLTQYLSDVFGINKNVSASAVAGRSAAIAYTCNISPIAMCADTSSPNTTWGYVPQ 190 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 P +K Q+G +G + L S ++ + G + ++ Sbjct: 191 GGYDPTVDKIPSTLYQMKPADHQSGGIGPGNFHLLDLGTSGKNAVR-DAFAGATNNCITI 249 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + +D+ + + + ++ + V + + Sbjct: 250 GNNIDTETGKGTGPVAQGINTRFGKFNGPTEEGEVVRSDKYTEEP--SPGLDESTYQDSN 307 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++ + + + Sbjct: 308 FYYADYVSRLANCESGGACNNSYYDADGSSGRRILR 343 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 56/200 (28%), Gaps = 28/200 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ S + ++ Sbjct: 279 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFSLRRMSDAGRQHALQA-V 331 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-----DN 383 G T I + ++ + + I+LL+DG++T Sbjct: 332 NSLVANGGTNIAEGLRKGAKVM---------EERREKNPVASIILLSDGQDTYTVSGSSG 382 Query: 384 EEGIAICNKAKSQGIRI-----MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 + I I + + + + S +F + Sbjct: 383 NQPQPNYRLLLPLSIHGGDNAGFQIPVHAFGFGADHDASSMHSISEISGGTFSFIETEAV 442 Query: 437 LNKIFRDRIGNEIFERVIRI 456 + F IG + V + Sbjct: 443 IQDAFAQCIGGLLSVVVQEL 462 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/302 (11%), Positives = 76/302 (25%), Gaps = 43/302 (14%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + D + +EG + +V + +S + L Sbjct: 177 IFDDDEDLDPQHETAEGNTSTQDAGDVNSVRTVEVFTYTEVSAVPKSVSYDNFTILIHLK 236 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVI---------------KKKHLVRDALASVIRSIK 286 L G + + + SS K L++ A+ VI+++ Sbjct: 237 APLTSGRQNRNWNHAESPQSSQDSRAPVDLVTVLDVSGSMSGTKLALLKRAMGFVIQNLG 296 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D R+ F+ ++ G T I + ++ Sbjct: 297 PSD------RLSVIAFSSTARRHFPLRRMTETGKLEALQA-VNSLVSSGGTNIAEGLRKG 349 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ---DNEEGIAICNK-------AKSQ 396 + ++ I+LL+DG++T + Sbjct: 350 FKVVVDRKW---------KNPVCSIILLSDGQDTYTISGTSMTRPQADYKSLLPTSIHRN 400 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 G I + + + + S +F + + F IG + V Sbjct: 401 GSSGFRIPVHAFGFGSDHDAASMHSISEISGGTFSFIEAEGVIQDAFAQCIGGLLSVVVQ 460 Query: 455 RI 456 + Sbjct: 461 EL 462 >gi|125532271|gb|EAY78836.1| hypothetical protein OsI_33941 [Oryza sativa Indica Group] Length = 645 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 59/208 (28%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI ++ D R+ F+ S + K Sbjct: 189 NKLALLKQAMGFVIDNLGPAD------RLCVISFSSGASRLMRLSRMTD-AGKAHAKRAV 241 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 + G T I A++ A + A + ++LL+DG++T Sbjct: 242 GSLSARGGTNIGAALRKAAKVLDDR---------LYRNAVESVILLSDGQDTYTVPPRGG 292 Query: 382 -----------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 + + + T F ++ + A + +F Sbjct: 293 YDRDANYDALVPPSLVRSDAGGGGGRAPPVHTFGFG-----KDHDAAAMHTIAEVTGGTF 347 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 + + F IG + V + Sbjct: 348 SFIENEAAIQDGFAQCIGGLLSVAVQEL 375 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 65.4 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 64/187 (34%), Gaps = 28/187 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 SS K L R+A + D +G F+D + Sbjct: 417 SMSSDARGADKMALAREAAIRATTMMNAQDY------IGVIAFDDTPWDVVAPQSVTKL- 469 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ G T I A+Q Y+ + + N +K+++LLTDG+ Sbjct: 470 --DEIQQQISRIQADGGTDIFPALQLGYERVKAMN-----------TQRKHVILLTDGQ- 515 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + +++ + + ++ I + T+A + R L A ++ AN + Sbjct: 516 SALDDDYEGLLQQMTAENITVSTVALGDDSD-----RGLLEMIAELGKGRYYFANDAESI 570 Query: 438 NKIFRDR 444 KIF Sbjct: 571 PKIFSKE 577 >gi|297727663|ref|NP_001176195.1| Os10g0464900 [Oryza sativa Japonica Group] gi|22758314|gb|AAN05518.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432565|gb|AAP54180.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|255679473|dbj|BAH94923.1| Os10g0464900 [Oryza sativa Japonica Group] Length = 646 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 59/208 (28%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI ++ D R+ F+ S + K Sbjct: 189 NKLALLKQAMGFVIDNLGPGD------RLCVISFSSGASRLMRLSRMTD-AGKAHAKRAV 241 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 + G T I A++ A + A + ++LL+DG++T Sbjct: 242 GSLSARGGTNIGAALRKAAKVLDDR---------LYRNAVESVILLSDGQDTYTVPPRGG 292 Query: 382 -----------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 A + + T F ++ + A + +F Sbjct: 293 YDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFG-----KDHDAAAMHTIAEVTGGTF 347 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 + + F IG + V + Sbjct: 348 SFIENEAAIQDGFAQCIGGLLSVAVQEL 375 >gi|197101601|ref|NP_001126282.1| inter-alpha-trypsin inhibitor heavy chain H1 [Pongo abelii] gi|55730948|emb|CAH92192.1| hypothetical protein [Pongo abelii] Length = 911 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/402 (8%), Positives = 97/402 (24%), Gaps = 23/402 (5%) Query: 58 QTAIITASVPLIQSLEE--VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 AI I +++ + + + + V Sbjct: 99 DFAITADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKV 158 Query: 116 RDTAV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 E +++ Q ++ + + + I + Sbjct: 159 TFQLTYEEVLKRNHMQYEIAIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA 218 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + +L S+ Q S S NG + + + Sbjct: 219 QTIKKSFSG-KKGHVLFRPTVSQQQSC---------PTCSTSLLNGHFKVTYDVSRDKIC 268 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + + V + V+ ++++ + + Sbjct: 269 DLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPG--- 325 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F RV S + + F + +T +N + + + Sbjct: 326 DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLQGIEILNQVQ 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 E + +++LTDG+ T+ + I ++ + + F N Sbjct: 386 ES----LPELSNHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDF 441 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + F ++ + V Sbjct: 442 NFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAKPLLVDV 483 >gi|319956579|ref|YP_004167842.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418983|gb|ADV46093.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 560 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 54/177 (30%), Gaps = 25/177 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + R L I R+ F D S+ L +TI+ Sbjct: 97 RFDIARKVLLDFIDR-------RPKDRIALEVFADYAYLAAPMSYEKKGL-KTILAALEP 148 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 TA+ +A+ + ++LLTDG +T N A Sbjct: 149 GVVGGRDTALYEALFLGARLFKKEEGRSN----------RVMILLTDGIDTVGNIPLEAA 198 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + K IR+ T+ + R L A + F++A L I+R Sbjct: 199 IRELKRAHIRVYTVGVG-----DDFRRGVLEKIARSTGGRFYDARYPEALANIYRRI 250 >gi|327265885|ref|XP_003217738.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 636 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 73/220 (33%), Gaps = 14/220 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +++ + +D S +K R+AL ++ IK+ D +N + Sbjct: 277 GYFVHFFAPKNISHLPKNIVFIIDVSISMSG-RKLQQTREALLKILEDIKEDDYLNFVLF 335 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + + + R + + G T +N + + + ++++ Sbjct: 336 GDDVH-KWKDTLIKATPENLDEASRYVQQIDI-----AGWTNLNGGLMAGIEMLNEAHKN 389 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 ++ I++LTDG T+ + I + ++ + + F + Sbjct: 390 RSLPERSAS----LIIMLTDGRPTKGERDTQVILSNVRNAIQGKYPLYNLGFGYDLDYGS 445 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + N +E + + + F D + N + V Sbjct: 446 LEKMAAENNGLARRIYEDSDSALQLQGFYDEVANPLLTEV 485 >gi|167951278|ref|ZP_02538352.1| von Willebrand factor, type A [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 269 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/287 (14%), Positives = 79/287 (27%), Gaps = 28/287 (9%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP- 231 + I + + +N F + + + +P Sbjct: 2 DSASYANLRRILNEGRLPPMDAVRVEEMINYFNYEDMAAEQRDTPFGITTEVAANPWNPE 61 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 S + P P VD S H K L++ +L + RS+ D V Sbjct: 62 DKASAYRHQSLAAQGSPKCPPPIWVFLVDVSGSMHSPDKLPLLKRSLRLLSRSLDADDRV 121 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + + G R ++ + GST ++ AY Sbjct: 122 S------LVVYAGASGVVLEPTPG---NKRATIEQALQQLSAGGSTNGGAGIRLAY---- 168 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 + + N ++L TDG+ A+ + K Q GI + T+ F Sbjct: 169 AKAREAFIEGGINR-----VILATDGDFNVGTVNHQALIDLIKQQRQAGIALTTLGFGGG 223 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 ++ +S L + R +G + R Sbjct: 224 NYNDHLMEQLADQ--GDGNYAYIDS---LMEA-RQGVGERAGCQPCR 264 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 56/156 (35%), Gaps = 17/156 (10%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D+ R+G F + F+ L + + + TAI DA+ A + Sbjct: 134 KRDSDRIGLIAFGENAYLQAPFTQDKQILSQLLQQMDVR--MAGAGTAIGDAIGVAVNHF 191 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S + K ++LLTDG +T + + A QG+ I IA K Sbjct: 192 EQSEVEN-----------KVLLLLTDGNDTSSEFPPLDAAHYAGEQGVVIYPIAIGDPKN 240 Query: 411 --QQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + L A + FEA+ L ++++ Sbjct: 241 VGEDSLDIATLERIADLTQGRVFEADDGQSLIEVYK 276 >gi|125975554|ref|YP_001039464.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715779|gb|ABN54271.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 536 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/301 (12%), Positives = 84/301 (27%), Gaps = 24/301 (7%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 ++ + T + S S + +N F Sbjct: 83 ENIFLDTYNNPLSTFSVDVDTASYSNIRRFLNSSQKPPVDAVRIEEMINYFTYDYPDA-D 141 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + I +P L P + +D S K L Sbjct: 142 GDEPFSITTEIGQCPWNPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDEPNKLPL 201 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A ++ + + D V+ V + + + ++ Sbjct: 202 LKSAFKLLVDELDEDDRVSIVV---------YAGAAGLVLDSTPGNEKDKILDALMNLEA 252 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAIC 390 GSTA + ++ AYD + + NN ++L TDG+ E + + Sbjct: 253 GGSTAGAEGIKLAYDV----AKKNFIKSGNNR-----VILATDGDFNVGISSEAELVRLI 303 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K + +GI + + F + K ++ ++ E K+ + +G + Sbjct: 304 EKKRDEGIFLTVLGFGTGNYKDSKMESLADK--GNGNYAYIDNIAEARKVLVNEMGATLN 361 Query: 451 E 451 Sbjct: 362 T 362 >gi|109088171|ref|XP_001107718.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Macaca mulatta] Length = 946 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 69/254 (27%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + V + D K Y ++ D + Sbjct: 260 ICPNCRETVVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN V + + K + K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVIDFNQNVRTWRNDLISATKTQVSDAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLS 432 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T K F + Sbjct: 433 KIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN- 491 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 492 ---QVSTPLLRNVQ 502 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 65.0 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 49/180 (27%), Gaps = 32/180 (17%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ R+G F+ + + L I D + TAI A+ Sbjct: 120 IRDFMKERKYDRIGLVIFSGLAFTQCPLTTDKDSLAEFINNINIGD-TGLDGTAIGSAIM 178 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 T+ + + S + I+L+TDG N + + A+S I+I + Sbjct: 179 TSVNRLKDS-----------RAKSRIIILVTDGNNNMGEIDPLTASKIARSYDIKIYAVG 227 Query: 405 FSVNKTQ------------------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F A I + Sbjct: 228 VGSLDGAIYEVDDPFLGKREIKYRKDAINESVLKEVAYNTSGGYFRAQDVKSFENIMKQI 287 >gi|281416565|ref|ZP_06247585.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281407967|gb|EFB38225.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939671|gb|ADU73705.1| Protein of unknown function DUF3520 [Clostridium thermocellum DSM 1313] Length = 538 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 39/301 (12%), Positives = 84/301 (27%), Gaps = 24/301 (7%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 ++ + T + S S + +N F Sbjct: 85 ENIFLDTYNNPLSTFSVDVDTASYSNIRRFLNSSQKPPVDAVRIEEMINYFTYDYPDA-D 143 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + I +P L P + +D S K L Sbjct: 144 GDEPFSITTEIGQCPWNPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDEPNKLPL 203 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A ++ + + D V+ V + + + ++ Sbjct: 204 LKSAFKLLVDELDEDDRVSIVV---------YAGAAGLVLDSTPGNEKDKILDALMNLEA 254 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAIC 390 GSTA + ++ AYD + + NN ++L TDG+ E + + Sbjct: 255 GGSTAGAEGIKLAYDV----AKKNFIKSGNNR-----VILATDGDFNVGISSEAELVRLI 305 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K + +GI + + F + K ++ ++ E K+ + +G + Sbjct: 306 EKKRDEGIFLTVLGFGTGNYKDSKMESLADK--GNGNYAYIDNIAEARKVLVNEMGATLN 363 Query: 451 E 451 Sbjct: 364 T 364 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 57/196 (29%), Gaps = 25/196 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + + + + +G F S Sbjct: 96 SNSMAATDIQPTRLEAAKQGAQAFVDQL------PPRINLGLVSFAGSAAVLVPASTDRE 149 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +TA+ + + + I ++ E + IVLL+DG Sbjct: 150 SVRS-----GIRGLQLGPATAVGEGIFASLQAITTAGE--RMSDEGQPPPPAAIVLLSDG 202 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPN 426 E T+ A+ + + TIA+ + + L A + Sbjct: 203 ETTRGRPNT-QAATAARDAEVPVDTIAYGTSDGTLDVGGQQIPVPVNEDALRELAEQTGG 261 Query: 427 SFFEANSTHELNKIFR 442 S+ A + EL ++R Sbjct: 262 SYHRATTGDELQSVYR 277 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 51/380 (13%), Positives = 96/380 (25%), Gaps = 34/380 (8%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + P + + E + Y+ V Sbjct: 41 DASPPPPPPPPPPPPSPAYAAQQAVVVSGSRIASEAAVAPDTSGQPAEAAGREYRYV--- 97 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSMLDYQR 194 +++ R G + ++ A + S + + M Sbjct: 98 -MPVIVPQPEDRERYDGEEVSPVKIAAVEPLSTFSVDVDTGAYANARRFLSQGQMPPKAA 156 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + +N F DR + L P + Sbjct: 157 VRTEEFINYFRYDYDRPQDRSQPFTVNFDAARTPWNEDTRLIRIGLAGYDIERSERPPAN 216 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +D S K LV+ ALA + ++ D V+ + + Sbjct: 217 LVFLMDVSGSMGRPDKLPLVKTALAGLAGELQPQDKVS------IVVYAGAAGLVLEPTN 270 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 K ++ GSTA +Q AY ED N ++L Sbjct: 271 DTRK-----IRAALNQLQAGGSTAGGAGIQLAYQI----AEDNFIEGGVNR-----VILA 316 Query: 375 TDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 TDG+ + I + K + GI + T+ F + A ++ Sbjct: 317 TDGDFNVGVSSRDALIEMIEKKRDSGITLTTLGFGTGNY----NEAMMEQIANHGNGNYA 372 Query: 430 EANSTHELNKIFRDRIGNEI 449 +S E K+ D + + + Sbjct: 373 YIDSALEAKKVLGDEMSSTL 392 >gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis] Length = 849 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 83/295 (28%), Gaps = 24/295 (8%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL-NCFGQPADRTVKSYSSQNGKVG 223 + + + + V D + + + +C G D N + Sbjct: 185 AQATFLTNDLLPLVKKTVTDKKARISFSPTIEQQRKCPDCPGTLIDGDFIIKYDVNREND 244 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + D +++ +++ + +D S K R+A+ ++ Sbjct: 245 LGDIQIANGYF-----VHFFAPKDLPRLPKNVVFVIDMSGSMSG-TKMQQTREAMLKILE 298 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT--FAIDENEMGSTAIND 341 + D+ + F + I + + G T IN Sbjct: 299 DLDPEDHFGIILFDHRIQF--------WNTSLSKATKENIDEAMVYVKAIQSYGGTDINA 350 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GI 398 + A D + + + K+ I+LLTDG+ I K+ + Sbjct: 351 PVLKAVDMLKEDRKAKRLPEKSID----MIILLTDGDPNSGESRIPVIQENVKAAIGGQM 406 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + ++ F + N +E + + F D + + + V Sbjct: 407 SLFSLGFGNDVKYPFLDVMSRENNGLARRIYEGSDAALQLQGFYDEVSSPLLLDV 461 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 65.0 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 80/269 (29%), Gaps = 23/269 (8%) Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 E + + + + G G + N Y Y D E Sbjct: 432 VENLKIEATPVNGTKELHLWVEDGGLYGPYSSSNGEAYETTNAGGTYTAYVVA-DFPYGE 490 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + F + + + K + + S + V + R + + Sbjct: 491 QEFYLNVYIAKIDAAKIAAKTFNGFLKSSDQVG--VAYFGGDVPGGYTPRYDVSQTLTND 548 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + D G T + ++ A ++++ + L L + Sbjct: 549 TLSANNS-----IDDLWAYGGTPMGGGIKVARQELVANTAPGNIPVMIVLSDGNP-TLTS 602 Query: 376 DGENTQDNEEGIAI--CNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNS 427 DG ++ AI K + I I TI F + L A+ P+ Sbjct: 603 DGTASETLAIQEAIEEAETTKQTTIGGEQILIYTIGFGNDAN-----ETLLKQIATSPDY 657 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ A ++ EL+ I+R I E+ E+ + Sbjct: 658 YYFAATSEELSSIYRQ-IAKELKEKAAKN 685 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 44/419 (10%), Positives = 95/419 (22%), Gaps = 73/419 (17%) Query: 45 RWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 R + L QA + PL + + + Sbjct: 97 RAHSSKAELLQAVRR-----IQPLSTGTMTGLAIQFAITKALSDAEGGRPRSPDISKVVI 151 Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 TD +D VRD + + R D Sbjct: 152 VVTDGRPQDSVRDVSARARAGGIELFAIGVGRVD-------------------------- 185 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + S S+++ + + + + Sbjct: 186 -KATLQQIASEPQDEHVDYVESYSVIEKLSKKFQEAFCLVSDLC------ATGDHDCEQV 238 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKHLVRDA 277 Y +C + + + S + LV+ Sbjct: 239 CVSSPGSYTCACREGFTLNSDGKTCNVCNGGGGSSATDLVFLIDGSKSVRPENFELVKKF 298 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + ++ ++ D + ++ V + G + I T Sbjct: 299 INQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGT 353 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A++ D + + +K ++ TDG + + KAK G Sbjct: 354 MTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTDGRSQDYIND---AAKKAKDLG 404 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERV 453 ++ + L AS +F +N+ IG ++ +R+ Sbjct: 405 FKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKTINQ-----IGKKLQKRI 452 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 56/168 (33%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + + + T A+ Sbjct: 69 IESLDVGPNATRVGLVNYASSVKQEFPL---RAHSSKAELLQAVRRIQPLSTGTMTGLAI 125 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A +S E R + K ++++TDG QD+ ++ +A++ GI + I Sbjct: 126 QFAITKALSDAEGGRPRS---PDISKVVIVVTDGRP-QDSVRDVSA--RARAGGIELFAI 179 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L+K F++ Sbjct: 180 GVGR------VDKATLQQIASEPQDEHVDYVESYSVIEKLSKKFQEAF 221 >gi|224065915|ref|XP_002191423.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain H3, partial [Taeniopygia guttata] Length = 869 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 69/220 (31%), Gaps = 14/220 (6%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +++ S + +D+S ++ ++AL ++ IK+ D Sbjct: 254 GYFVHFFAPTNLPKLSKNIIFVLDTSGSMSG-REIEQTKEALLKILDDIKEDD------F 306 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + + + F + G T ++ + D + +++E+ Sbjct: 307 FNIILFDSEISTWKETLIKATPENLDEARKFVQHISAQGLTNLHGGLMRGIDILNAAHEE 366 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQE 413 + ++ I++LTDG+ I N K + + F Sbjct: 367 NLVPKRSAS----IIIMLTDGQPNVGLSNTHEIENAVKKAIDGRYTLYNLGFGSGVDYGF 422 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L N + + + F D + N + V Sbjct: 423 LERMALENKGLARRIYPDSDAALQLQGFYDEVSNPMLIDV 462 >gi|327265809|ref|XP_003217700.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 885 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 70/222 (31%), Gaps = 16/222 (7%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN-VNDTV 295 +++ + +D S K ++A+ ++ +K+ D+ Sbjct: 266 GYFVHFFAPKNLAHLPKNVAFVIDVSGSMWG-SKIRQAKEAMIKIVEDLKEDDHFNIILF 324 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + D +I K F + E G T N + + + ++++ Sbjct: 325 ESEVRKWKDGIIKATP-------ENVQEAKYFIGNITESGLTNFNGGLMAGIEMLNNAHK 377 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQ 412 ++ ++ ++L+DGE + I AK+ + ++ F N Sbjct: 378 LKIVPERSASLT----IMLSDGEANVGETDQFRIQENAKNASQGKYPLYSLGFGYNLDYG 433 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 R N ++ + + F D + N + V Sbjct: 434 FLERLSKVNNGVARRIYDDSDAALQLQGFYDEVANPLLTDVA 475 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 76/241 (31%), Gaps = 39/241 (16%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI----KKKHLVRDALASVIRSIKKIDN 290 + ++ +D S + + + +I N Sbjct: 94 GLARPQKSNETNTQYTEGINMIFAIDVSESMKITDIHPSRFDAAKQICTDII-------N 146 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+G F+ ++ + L + + + G TAI A+ TA + + Sbjct: 147 KRSNDRIGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSG-TAIGTALGTAINRL 205 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 KN ++ IVL++DGENT + I + I+I I + T Sbjct: 206 -----------KNAETKERIIVLISDGENTSGLMDPITAADLCLEYNIKIYCIGLGKDGT 254 Query: 411 -------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + L N + + F+ A L+ + + I ++++ Sbjct: 255 HQFKDDNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSLDDVIAN-IDQLEKGKIVQ 313 Query: 456 I 456 + Sbjct: 314 L 314 >gi|119606784|gb|EAW86378.1| inter-alpha (globulin) inhibitor H2, isoform CRA_b [Homo sapiens] gi|124376332|gb|AAI32686.1| Inter-alpha (globulin) inhibitor H2 [Homo sapiens] gi|158256194|dbj|BAF84068.1| unnamed protein product [Homo sapiens] Length = 946 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 30/254 (11%), Positives = 70/254 (27%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + V + D K Y ++ D + Sbjct: 260 ICPSCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN + + + K K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVIDFNQNIRTWRNDLISATKTQVADAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLS 432 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T K F + Sbjct: 433 KIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN- 491 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 492 ---QVSTPLLRNVQ 502 >gi|315181058|gb|ADT87972.1| hypothetical protein vfu_A02859 [Vibrio furnissii NCTC 11218] Length = 406 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 30/363 (8%), Positives = 80/363 (22%), Gaps = 11/363 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + I+ G +I + M ++ V +D+ L+ + A + A++ L + Sbjct: 8 RGIRKQRGLVVVIVTIAMLALIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + T K + + + + Sbjct: 68 TEAQADTIARSTLTKMSTAAGNAELTLDVSDVVNVEVQFSNDPTVFPDSGYSSSPDGDRY 127 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + L F R++G+ L + + + +++ D + + Sbjct: 128 VRVVINQLDLESFFFARALGVTKRLTASAVAGPSPGGNACNIVPMAVC-EGDDAGTNGYD 186 Query: 193 QRDSEGQPLNCFGQPADRTVK-SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 ++ P + L+ C Sbjct: 187 SGVVYALKISSQSDPTMGSGNFQLLDFGSGASTVRTSLAGGYAGCIDLDAGETVTTKPGN 246 Query: 252 SLSEEH---------FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ + V++A ++ S + Sbjct: 247 TVGPVGQGLNTRFGVYSGGGLSSSDYPPDLYVKEASPTLTDSDVGSTPAWGYDDYQSDLA 306 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 SD + R + + G T ++ Sbjct: 307 GGVCPSDSNCRSNGDAERRILAIPVVDCSSASGGTTSFPVTAIGCFFLLQKAPTSNGGHT 366 Query: 363 NNL 365 + Sbjct: 367 SIY 369 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/412 (10%), Positives = 106/412 (25%), Gaps = 49/412 (11%) Query: 56 AAQTAI--ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A A L+ + + + + + + + RE D Sbjct: 325 ATDEAAGTDAIGSTLVDLGSDPENVTITVTATAAHDADRTVEATHDIRIAEEAVVRERPD 384 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRY--DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 VE+ S V + D ++ + + E E Sbjct: 385 PDEYRNVEVVNESSGVSVAVGGDGLDDADVSITDETPTTDDPYRAGPMVRIENERPIDDA 444 Query: 172 HKEHGVS-IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 E + D + S++ + S+ + + + + ++ Sbjct: 445 TVEIPIDENALEADANLSIVTWDPTSDEPWTPVETEIDRDAGVATAEVDHFSFFSVFRIE 504 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEH--------FVDSSSLRHVIKKKHLV--RDALAS 280 + + ++ + +S S+ + + + + Sbjct: 505 EWEDETSDTITLDGNETDGEIGNGSGIETADFVFVNDESGSMSGSPTHYAELAGKRFVGA 564 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + S R G + D + + ++ + + G T Sbjct: 565 LTDS----------ERAGRVGYASGANLDQPLTTDHDAVNSSLER-----LSASGGTNTR 609 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 ++ + + + ++LL+DG++ D + + A G+ I Sbjct: 610 AGLRVGLNHLEEEGWENRSA---------VMILLSDGKSGSD---PLPVAEDAAEAGVEI 657 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 T+ N + E L A + F+ +L F N+ Sbjct: 658 STVGLGNNINENE-----LREIAAITGGDFYHVEREEDLPDTFERVAENQTG 704 >gi|115623666|ref|XP_789748.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960627|ref|XP_001186460.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 846 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/297 (14%), Positives = 85/297 (28%), Gaps = 24/297 (8%) Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + A E ++ ++D DS+G+ L Sbjct: 250 EVNASWAVADKSDEAKLNNMTLVDLKSRSAKVLFDSKGEGLLTTSNGIMGDFIIKYDVMH 309 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 ++ ++Y G + +D S +K + A + Sbjct: 310 DAKAGHLQIVNGYF-----VHYFSPVGLPKTRKNVVFVIDVSGSM-RGRKMDQTKRAFTT 363 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ ID +N + + + K D + G T + Sbjct: 364 ILDDVRPIDRINIVLFESNVRVWRSNQMVEATGDNIAAA-----KNHVNDISAGGGTNLY 418 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-EEGIAICN---KAKSQ 396 D + A D ++ E I++LTDG+ T + I Sbjct: 419 DGLTNAVDLLMEHGNGEAMP---------LIIMLTDGQPTSGSVTSTSEIIKRITNLIDG 469 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + ++ F + LSN A +E +S K F D + N + + Sbjct: 470 RLSLFSVGFGNGVDFSFLEKLSLSNQALARKVYEDSSASLQMKGFYDEVANPLLFNI 526 >gi|163760702|ref|ZP_02167782.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] gi|162282024|gb|EDQ32315.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] Length = 668 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 84/305 (27%), Gaps = 24/305 (7%) Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 R G S +++ AE + S + ++ M D + + +N F Sbjct: 200 RVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPRTVRIEEMVNYFNYD 259 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + V + + + + P + VD S Sbjct: 260 YPAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQANLVLLVDVSGSMQE 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 K L++ A +I+ ++ D V+ + + + Sbjct: 320 TDKLPLLKSAFRLLIQKLEPEDTVSIV---------TYAGDAGTVLEPTPASDKAKILDA 370 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNE 384 D GSTA ++ AY E N ++L TDG+ D++ Sbjct: 371 LDDLRPGGSTAGAAGIEEAYRL----AEKARVNGGVNR-----VLLATDGDFNVGASDDD 421 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++ + + G+ + F + + N ++ E K Sbjct: 422 ALKSLIEEKRESGVFLSIFGFGQGNYNDQLMQTLAQN--GNGVAAYIDTLAEAEKTLAQE 479 Query: 445 IGNEI 449 + Sbjct: 480 ATASL 484 >gi|332217052|ref|XP_003257667.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Nomascus leucogenys] Length = 946 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 71/254 (27%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + V + D K Y ++ D + Sbjct: 260 ICPSCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + R FN V + + K K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDRFSVIDFNQNVRTWRNDLISATKTQVADAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + ++ + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEASNLGLLDPNSVS----LIILVSDGDPTVGELKLS 432 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T K F + Sbjct: 433 KIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN- 491 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 492 ---QVSTPLLRNVQ 502 >gi|242074986|ref|XP_002447429.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] gi|241938612|gb|EES11757.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] Length = 519 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 57/188 (30%), Gaps = 22/188 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ V++ + ID R+ F D ++ Sbjct: 78 KIDKMKTAMKFVVKKLSSID------RLSIVTFMDTATRICPLRQVTDASQPELLGL-ID 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G+T I D +QT + N + ++L++DG+ + Sbjct: 131 ALNPGGNTNITDGLQTGLKVLADRNLSSGRVVG--------VMLMSDGQQNRGGN----- 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + T F + ++ +F N +L+ F + + Sbjct: 178 AADVKIGNAPVYTFGFGADYDPTVLNAVARNSM--GGTFSVVNDVDKLSMAFSQCLAGLL 235 Query: 450 FERVIRIT 457 V +T Sbjct: 236 TVVVQDLT 243 >gi|257456195|ref|ZP_05621392.1| BatA protein [Treponema vincentii ATCC 35580] gi|257446281|gb|EEV21327.1| BatA protein [Treponema vincentii ATCC 35580] Length = 332 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 41/166 (24%), Gaps = 33/166 (19%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + H + + + + TAI + +A + + H Sbjct: 138 GSSAAVLIPPTIDRHTFLTRLDQLQVGEL--GDGTAIGMGLASAVLHLTQYSTLPSH--- 192 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-------- 414 I+L TDG+N A + K + I I + K Sbjct: 193 --------IILFTDGDNNTGEIHPRAAADIIKHKKIGFYIIGLGKSGYAPVKYIDPIQKK 244 Query: 415 ----------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 L A +F A S L IF I Sbjct: 245 EISGTLNTVFNETELQKIAGYGNGRYFSAKSPELLTDIFNRFIQKI 290 >gi|209524446|ref|ZP_03272995.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209495237|gb|EDZ95543.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 541 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 85/294 (28%), Gaps = 25/294 (8%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 P + + ++ + T S +++ I+ + +N Sbjct: 70 PNTETYDLIAENNFQLVAANPLSTFSIDVDTASYSNVRRFINQR-QRPPIDAVRIEELIN 128 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 F + ++ + + SP + L P + +D S Sbjct: 129 YFSYDYPQP-QAEEPFSITTEVSSAPWSPQHQLVHIGLQGKTLAIEELPPSNLVFLLDVS 187 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + L+++ ++ + + D V V + + G + Sbjct: 188 GSMNQPNRLPLLKEGFKLLVDQLTEQDTVAIAV------YAGAAGVVLPPTPGNE---KQ 238 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + GSTA + ++ AY+ + + ++L TDG+ Sbjct: 239 KIIAAIDGLQAQGSTAGGEGIKLAYELATRMLSEGKNNR---------VILATDGDFNVG 289 Query: 383 NEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + +GI + + F + + K ++ ++ ++ Sbjct: 290 VSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNH--GNGNYAYIDN 341 >gi|226874935|ref|NP_034712.2| inter-alpha-trypsin inhibitor heavy chain H2 [Mus musculus] gi|148676057|gb|EDL08004.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 950 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K K + G T Sbjct: 332 QTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKTQIADAKRYIEKIQPSGGTN 391 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ N ++ I+L++DG+ T + I K Sbjct: 392 INEALLRAIFILNEASN----MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQ 447 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 448 DNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 503 Query: 456 ITK 458 + Sbjct: 504 NVQ 506 >gi|21707832|gb|AAH34341.1| Inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K K + G T Sbjct: 328 QTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKTQIADAKRYIEKIQPSGGTN 387 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ N ++ I+L++DG+ T + I K Sbjct: 388 INEALLRAIFILNEASN----MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQ 443 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 444 DNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 499 Query: 456 ITK 458 + Sbjct: 500 NVQ 502 >gi|74227570|dbj|BAE21837.1| unnamed protein product [Mus musculus] Length = 950 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K K + G T Sbjct: 332 QTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKTQIADAKRYIEKIQPSGGTN 391 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ N ++ I+L++DG+ T + I K Sbjct: 392 INEALLRAIFILNEASN----MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQ 447 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 448 DNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 503 Query: 456 ITK 458 + Sbjct: 504 NVQ 506 >gi|3024068|sp|Q61703|ITIH2_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|695634|emb|CAA49842.1| inter-alpha-inhibitor H2 chain [Mus musculus] gi|122889675|emb|CAM13914.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K K + G T Sbjct: 328 QTVEAMKTILDDLRTDDQFSVVDFNHNVRTWRNDLVSATKTQIADAKRYIEKIQPSGGTN 387 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ N ++ I+L++DG+ T + I K Sbjct: 388 INEALLRAIFILNEASN----MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQ 443 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 444 DNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 499 Query: 456 ITK 458 + Sbjct: 500 NVQ 502 >gi|242042273|ref|XP_002468531.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] gi|241922385|gb|EER95529.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] Length = 650 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 61/210 (29%), Gaps = 43/210 (20%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI ++ D R+ F+DR + + K+ Sbjct: 200 KLALLKQAMGFVIDNLGPHD------RLSVVSFSDRARRVTRL-LRMSGDGKAAAKSAVE 252 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---- 385 G T I + ++TA + ++LL+DG++T Sbjct: 253 SLVARGGTNIAEGLRTAARVLEERRH---------RNTVSSVILLSDGQDTYTAPRWSRG 303 Query: 386 --------------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 + + + T F + + A + +F Sbjct: 304 PGAGATPNYEALVPPSFMATSTRDWSAPVHTFGFG-----NDHDAAAMHVIAESTAGTFS 358 Query: 430 EANSTHELNKIFRDRIGNEIFERV--IRIT 457 + + F IG + V R+T Sbjct: 359 YIGNEAVIQDAFAQCIGGLLTVVVQDARVT 388 >gi|118575253|ref|YP_874996.1| hypothetical protein CENSYa_0043 [Cenarchaeum symbiosum A] gi|118193774|gb|ABK76692.1| conserved hypothetical protein [Cenarchaeum symbiosum A] Length = 311 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 62/209 (29%), Gaps = 37/209 (17%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + + + A A ++ + D R+G F I+ + Sbjct: 92 DGSGSMAADDYAPTRLDAAKAAAAQLVGRLAPGD------RVGVILFGSSAITISYLTSD 145 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + G+TA+ D + + + E I+LL+ Sbjct: 146 RAEAAGRIGEIV----QGDGATALGDGLALGVEMAAAGPEKST------------IILLS 189 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------------KTQQEKARYFLSNCA 423 DG + A IR+ TI + E L A Sbjct: 190 DGVHNSGRTVPGEALELAIQGNIRVHTIGMGSDEPVRVGDDIFGEPRYAELDEDTLREIA 249 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEIF 450 + ++ + L+ IF + + +I Sbjct: 250 DRTGGMYYTSVDNPTLDGIF-EALSEDIA 277 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 57/189 (30%), Gaps = 29/189 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L +DA + + + D +G F+ + R + Sbjct: 479 KIELAKDAAQAAVELLGPKD------AIGVIAFDGDSYTVSEL---RSTSDRGAISDAIS 529 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + AM AY+ ++ K+++L+TDG ++ + + Sbjct: 530 TIEASGGTNMYPAMADAYEALL-----------GATAKLKHVILMTDGVSSPGD--FQGV 576 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + I + T+A + L A ++ + + ++F Sbjct: 577 AGDMSASRITLSTVALGQGSS-----EDLLEELAQIGGGRYYFCDDPQSVPQVFAKETVE 631 Query: 448 EIFERVIRI 456 + + Sbjct: 632 ASKSAINEL 640 >gi|332216197|ref|XP_003257231.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 2 [Nomascus leucogenys] Length = 769 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 27/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 103 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 162 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 163 GQKVKQTKEALLKILGDMPPG---DYFDLVLFGTRVQSWKGSLVQASEANLRAAQDFVRG 219 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 220 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHAAILIMLTDGDPTEGVTDRSQIL 275 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +S + + F N + N +E + + F ++ Sbjct: 276 KNIRSAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 335 Query: 448 EIFERV 453 + V Sbjct: 336 PLLVDV 341 >gi|332216195|ref|XP_003257230.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 1 [Nomascus leucogenys] Length = 911 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 27/246 (10%), Positives = 67/246 (27%), Gaps = 10/246 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V + Sbjct: 245 PTCSTSLLNGHFKVTYDVSRDKICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMR 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLKILGDMPPG---DYFDLVLFGTRVQSWKGSLVQASEANLRAAQDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + + + E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIEILNQVQES----LPELSNHAAILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +S + + F N + N +E + + F ++ Sbjct: 418 KNIRSAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQVAK 477 Query: 448 EIFERV 453 + V Sbjct: 478 PLLVDV 483 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 64.6 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 58/191 (30%), Gaps = 24/191 (12%) Query: 269 KKKHLVRDALASVIRSIKKID-------NVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 DA I + + R+G F + F+ + Sbjct: 106 MDAADFVDASGEQIDRLSAAKGVLNEFVAGREGDRLGLIVFGNAAYLQAPFTDDHETWLA 165 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + ++ STA+ D++ A S + +++LTDG +T Sbjct: 166 LLDESIVN--MAGPSTALGDSIGLAIAHFRQS-----------KTENRVLIVLTDGNDTG 212 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + AK +G+ I T+A ++ L A + F A+ L Sbjct: 213 SRVPPLDAAEVAKVEGVTIYTVAVGDPTTVGEEALDMETLETVARLTGGDSFVASDLVAL 272 Query: 438 NKIFRDRIGNE 448 + ++ E Sbjct: 273 RETYQRIDALE 283 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 64.6 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 60/185 (32%), Gaps = 23/185 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ A AS++ + D R+ F+ + + +K Sbjct: 56 LAMVKQAAASLVDRLLPSD------RLSVIAFDHKAKVLVP---NQTVWDKEAIKAQIAT 106 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ M+ I + + + + I LLTDGEN DN+ + + Sbjct: 107 LEPGGGTAIDEGMKLGLKEIAAGKQGTISQ----------IFLLTDGENEHGDNQRCLEL 156 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 A I + + F V+ Q + + A F + F+ + Sbjct: 157 AKLAAEYNITLNALGFGVHWNQDVLEQIA--DAAGGRLVF-IEYAEQAIACFQSLFSHIS 213 Query: 450 FERVI 454 Sbjct: 214 SVDYT 218 >gi|47522678|ref|NP_999068.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Sus scrofa] gi|3024050|sp|O02668|ITIH2_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1915954|emb|CAA72308.1| inter-alpha-inhibitor heavy-chain H2 [Sus scrofa] Length = 935 Score = 64.6 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN + + + K KT+ G T Sbjct: 317 QTVEAMKTILDDLRAEDQFSLVDFNHNIRTWRNDLVSATKTQVADAKTYIEKIQPSGGTN 376 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 377 INEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELQLSKIQKNVKQNIQ 432 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + + ++ + R N + T K F + ++ ++R Sbjct: 433 DNVSLFSLGIGFDVDYDFLKRLSNDNRGMAQRIYGNQDTASQLKKFYN----QVSTPLLR 488 Query: 456 ITK 458 + Sbjct: 489 NVQ 491 >gi|134300085|ref|YP_001113581.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134052785|gb|ABO50756.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 416 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 54/193 (27%), Gaps = 28/193 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A+A + + D + + +K Sbjct: 58 KLDYTKKAVAFAVGHLSPQDYCSVVAFDDMVTMVASSHQV---------ANKDALKMAVE 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 GST ++ M + ++++ ++LLTDG + A+ Sbjct: 109 SIYPGGSTNLSGGMLLGVREVKLAHKENQINR---------VLLLTDGMANVGVTDHSAL 159 Query: 390 CNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 K + G+ + T ++ L A +F+ ++ IF Sbjct: 160 VEKSREMAAGGVNLSTFGLG-----EDFEEDLLQAMVEAGGGNFYYIEKPDQIPGIFEQE 214 Query: 445 IGNEIFERVIRIT 457 + + ++ Sbjct: 215 LTGLLSIVAQNLS 227 >gi|130498817|ref|NP_001076116.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] gi|11041696|dbj|BAB17301.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] Length = 946 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/252 (11%), Positives = 68/252 (26%), Gaps = 23/252 (9%) Query: 219 NGKVGIRDEKLSPYMVSCNK---------SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + M N+ Y ++ D + Sbjct: 262 SNCSETAVDGQLVVMYDVNREEKAGELEVFNGYFVHFFAPDNLEPIPKNILFVIDVSGSM 321 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 ++ + ++ + + FN + + + K K + Sbjct: 322 WGVKMKQTVEAMKTILDDL---RAEDHFSVVDFNHNIRTWRNDLVSATKTQIADAKRYIE 378 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T IN+A+ A + +N + + I+L++DG+ T + I Sbjct: 379 KIQPNGGTNINEALLRAIFILNEANN----MGLLDPNSVSLIILVSDGDPTVGELKLSKI 434 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K + + ++ + R N + T K F + Sbjct: 435 QKNVKQNIQDNVSLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN--- 491 Query: 447 NEIFERVIRITK 458 ++ ++R + Sbjct: 492 -QVSTPLLRNVQ 502 >gi|115377250|ref|ZP_01464460.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365726|gb|EAU64751.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 520 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 86/307 (28%), Gaps = 32/307 (10%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 I T+ E + ++ + + + +N F Sbjct: 52 TIDTEEENVSTFSVDVDSASYALARAYLSRNHLPAEEAIRVEEFVNAFRYDYQDP--GAE 109 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 +V + L +D S ++ + LV+ Sbjct: 110 PFGVQVEAFPSPNRQGYHVLHVGLQGQKVSAAERLPAHLVFTIDVSGSMNMENRLELVKR 169 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 +LA ++ + D + + D + + + R+ + + GS Sbjct: 170 SLAMLVEKLDSRDTLA------IVVYGDTARTVLEPT---RIMDRSRILEAINALHPEGS 220 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS- 395 T + +Q AY S + ++L +DG + +I K+ Sbjct: 221 TNVQAGLQVAYAIAASQVREGATSR---------VILCSDGVANNGITQADSIFQSVKAY 271 Query: 396 --QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF-------RDRIG 446 QG+R+ T+ F + E + ++ E +IF I Sbjct: 272 AQQGVRLTTVGFGMGNYNDELMERLSH--VGDGQYAYVDALPEARRIFIEQFTGTLQLIA 329 Query: 447 NEIFERV 453 ++ +V Sbjct: 330 RDVKVQV 336 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 71/215 (33%), Gaps = 34/215 (15%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P + +DSS L + A + + R+G +N I Sbjct: 59 PCASPVDVVLSIDSSGSMTTSDPGDLRKSAAKEFVTGLDLS-----MDRVGVVSWNTSAI 113 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S + + I T G+T ++ +++A D + Sbjct: 114 SW-PLTNNTKDIESAIDSTG-----ADGNTCLDTGLKSAIDLLSE------------CSG 155 Query: 368 KKYIVLLTDGENTQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K IVLLTDG +T + ++A+S+GI + TI + + S Sbjct: 156 SKVIVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILVFTIGLGPDADARNLTEIAHS-- 213 Query: 423 ASPNSFFEANSTHELNKIFRDR---IGNEIFERVI 454 + F+ A + L I++ I + + V Sbjct: 214 -TGGEFYSAPDANALAGIYKRIRSSITGIVAKDVT 247 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 17/157 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + R+G F D F+ ++ + TA DA+ + Sbjct: 136 REHDRLGLILFADSAYVQAPFTEDINVWQS--LLEDVELGYAGFKTAFGDAIGLSIAVFE 193 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ ++LLTDG++T + A G++I TIA T+ Sbjct: 194 QEQSR-----------QRVMILLTDGDDTSSKMPPVKAAEIAAKYGVKIYTIAIGDPSTK 242 Query: 412 QEKARYF--LSNC--ASPNSFFEANSTHELNKIFRDR 444 L A+ F A +L++ + Sbjct: 243 GRYKMDLPTLEKVSAATGGQMFHAMDRKQLDQAYATI 279 >gi|109000862|ref|XP_001094970.1| PREDICTED: cartilage matrix protein [Macaca mulatta] Length = 495 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/440 (10%), Positives = 114/440 (25%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 43 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 102 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + +S VV R + Sbjct: 103 LLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRSRSPDISKVVIVVTDGRPQDSVQD 162 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + S S+++ + Sbjct: 163 VSARARASGVELFAI-GVGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 221 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C + + + Sbjct: 222 VSDLC------ATGDHDCEQVCISSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDL 275 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ + ++ ++ D + ++ V + G Sbjct: 276 VFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 330 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 331 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 384 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 385 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 435 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 436 FKTINQ-----IGKKLQKKI 450 >gi|55958063|emb|CAI12958.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 935 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/254 (11%), Positives = 69/254 (27%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + V + D K Y ++ D + Sbjct: 249 ICPNCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 308 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN + + + K K + Sbjct: 309 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVIDFNQNIRTWRNDLISATKTQVADAKRY 365 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 366 IEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLS 421 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T K F + Sbjct: 422 KIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN- 480 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 481 ---QVSTPLLRNVQ 491 >gi|70778918|ref|NP_002207.2| inter-alpha-trypsin inhibitor heavy chain H2 [Homo sapiens] gi|229462889|sp|P19823|ITIH2_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; AltName: Full=Inter-alpha-trypsin inhibitor complex component II; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|55958062|emb|CAI12957.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 946 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/254 (11%), Positives = 69/254 (27%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + V + D K Y ++ D + Sbjct: 260 ICPNCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN + + + K K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVIDFNQNIRTWRNDLISATKTQVADAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLS 432 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T K F + Sbjct: 433 KIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN- 491 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 492 ---QVSTPLLRNVQ 502 >gi|256005895|ref|ZP_05430841.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|255990131|gb|EEU00267.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] Length = 524 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/301 (12%), Positives = 84/301 (27%), Gaps = 24/301 (7%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 ++ + T + S S + +N F Sbjct: 71 ENIFLDTYNNPLSTFSVDVDTASYSNIRRFLNSSQKPPVDAVRIEEMINYFTYDYPDA-D 129 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + I +P L P + +D S K L Sbjct: 130 GDEPFSITTEIGQCPWNPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDEPNKLPL 189 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A ++ + + D V+ V + + + ++ Sbjct: 190 LKSAFKLLVDELDEDDRVSIVV---------YAGAAGLVLDSTPGNEKDKILDALMNLEA 240 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAIC 390 GSTA + ++ AYD + + NN ++L TDG+ E + + Sbjct: 241 GGSTAGAEGIKLAYDV----AKKNFIKSGNNR-----VILATDGDFNVGISSEAELVRLI 291 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 K + +GI + + F + K ++ ++ E K+ + +G + Sbjct: 292 EKKRDEGIFLTVLGFGTGNYKDSKMESLADK--GNGNYAYIDNIAEARKVLVNEMGATLN 349 Query: 451 E 451 Sbjct: 350 T 350 >gi|225387166|ref|ZP_03756930.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] gi|225046714|gb|EEG56960.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] Length = 556 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 104/356 (29%), Gaps = 25/356 (7%) Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 + T+ EV+ A E +S + + L+ + + ++ Sbjct: 25 GAGGGKTASATEAEVKAEAGSYASETMAAQSQWDGAVMEAEGPPLSHNTEEYNYIAENAF 84 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 L A T + +++ I + + + LN F + Sbjct: 85 LAVANAPLSTFAADVDTASYANLRRKILE-GNEVPADAVRIEEMLNYFTYDYPEPTEDEP 143 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + I D + L + +D S K LV+ Sbjct: 144 -FSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPSNLVFLIDVSGSMESADKLGLVKR 202 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A + +++ D V+ + GV + + T + GS Sbjct: 203 AFLLLTENLRPEDTVS------IVTYASSDTVVL---DGVSGEEKAAIMTAIENLTAGGS 253 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKA 393 T + ++TAY E+ + NN ++L TDG+ + + K Sbjct: 254 TDGSKGIETAYRL----AEEHFQKDGNNR-----VILATDGDLNLGLTSEGDLTRLIQKK 304 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K G+ + + F + K N + +S E ++ + +G + Sbjct: 305 KESGVFLSVMGFGTGNIKDNKMEALADN--GNGQYAYVDSLMEAKRVLVEELGGTL 358 >gi|330829742|ref|YP_004392694.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804878|gb|AEB50077.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 330 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 56/176 (31%), Gaps = 33/176 (18%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + IK + R+G F D + + L+ + + +TA+ +A Sbjct: 109 QQIKALIAKRAGDRVGIIVFADHAYLLSPLTQEIPALLT--LSDELDFDLVGRTTALGEA 166 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + A ++L+TDG NT N + + A +QGIRI T Sbjct: 167 ILLARQHGDPGRPTA-------------LLLVTDGRNTAGNADPLQEAKLAAAQGIRIYT 213 Query: 403 IAFSVNKT----------------QQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + + E L A +F A + +L+ I Sbjct: 214 LGVGADPDTFIQPYDEAGSGQADPSSELDEPLLKELAQTGQGRYFRARTQSDLDTI 269 >gi|119628048|gb|EAX07643.1| matrilin 1, cartilage matrix protein, isoform CRA_b [Homo sapiens] Length = 519 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/440 (10%), Positives = 117/440 (26%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + +S VV R + Sbjct: 104 LLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSVQD 163 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + + S S+++ + Sbjct: 164 VSARARASGVELFAIGVGSV-DKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 222 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C++ + + Sbjct: 223 VSDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 276 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ ++ ++ ++ D + ++ V + G Sbjct: 277 VFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 331 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 332 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 385 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 386 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 436 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 437 FKTINQ-----IGKKLQKKI 451 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 44/418 (10%), Positives = 116/418 (27%), Gaps = 34/418 (8%) Query: 23 FIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNS 82 +++ + + + G+ +++ + + S + Sbjct: 94 ALLSGIAASSIAVIAGLG--LMQHYQP----------GYLATPPMEAHLADTHSDIEEIV 141 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 T + ++ + R + + R + + P + + + L Sbjct: 142 VTGMRAELSQAEERQHKAKAIADRQRRMAEAQMAAKPMAAAPTAAVHADAYAPADILQAT 201 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDYQRDSEGQPL 201 R + + + + S + S + + + + Sbjct: 202 TREYRDRFNQVDDNPVIATRDNPFSTFSIDVDTAAYSFTRRLLNQGQLPPKDAVRIEEMV 261 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N F + + + + D P + + P P + +D Sbjct: 262 NYFDYSYPLPSSAQTPFTTNITVLDSPWKPGNKLLHIGIQGYQLPAGHIPQSNLVFLLDV 321 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S K LV+ ++ ++ ++K D V + + + + Sbjct: 322 SGSMDEPSKLPLVKQSMELLLSTLKPEDTVA------IVVYAGAAGTVLEPTKVRE---K 372 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT- 380 + + + GSTA + + AY E + N I+L TDG+ Sbjct: 373 SKILAALHNLQAGGSTAGGEGLALAYQL----AEANFNPKGVNR-----IILATDGDFNV 423 Query: 381 --QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 +E + +++GI + + F Q + N ++ + S + Sbjct: 424 GQTGDEPLQDFVERKRAKGIYLSVLGFGQGNYQDALMQTLAQNGNGTAAYIDTLSEAQ 481 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 58/201 (28%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVAPNRLTAAQEAAKQFADQLTPG------INLGLIAYAGTATVLVSPTTNREST 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I K D TA + + TA I + + +VL++DG+ Sbjct: 162 KTAIDKLQLADR-----TATGEGIFTALQAIATVG---AVIGGGDEPPPARVVLMSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T + AK QG+ I T++F E L A Sbjct: 214 TVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADL 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S F A+S +L ++F + Sbjct: 274 SGGEAFTASSLEQLKQVFTNL 294 >gi|255598079|ref|XP_002536925.1| conserved hypothetical protein [Ricinus communis] gi|223518102|gb|EEF25458.1| conserved hypothetical protein [Ricinus communis] Length = 451 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 49/479 (10%), Positives = 120/479 (25%), Gaps = 62/479 (12%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G ++ + + +L V G+ L A A + A+ + Sbjct: 3 RSFPHAQRGGISLMVIVSLTTLLAVVGLAFSAGLSYLVRSKLNAATDAAGLAAARAISNG 62 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + A + + +N N + + S + Sbjct: 63 TTQADQIANAKAAGQRFFHANFPSNYLMSNATLND-------------ISVTFSGSEVTI 109 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +S+ L F + + K + + V+D S S+ Sbjct: 110 GVSASASLPAALFGGF------------GTSALAPAVVTETKRKDLDMIVVMDTSGSLSP 157 Query: 192 YQRDSEGQPLNCFGQ-PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + + Q A R + D S+ + Sbjct: 158 SAANVRSSAITFLNQFNATRDRVGLVHFAFGAIVDDAIRQTARGFDRASMTNHIKAYAFS 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 S + + ++ + + + VI + + Sbjct: 218 GSTASAEGMYTARQQINSVPTANLNRSNMRVIVFFSDGAPNSFGAYLNWKPGVACADPGT 277 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 ++ + +++G T + + ++ N A + Sbjct: 278 IYTDDDGAGTPAGLYKLDQQYDDLGGTCTPADLPSKAASLPDWYNAHNPAKTPNDPALRE 337 Query: 371 IVLLTDG------------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-----KTQQE 413 ++T N A+ K++ +GI + T+ + E Sbjct: 338 FPVVTTSPRVVTNAITYANVNRAARNLVEAMAAKSRDEGIYVFTLGLGSSLKTGTGVDGE 397 Query: 414 KARYFLSNCASP--------------NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 K L A+ + A + ++L F +++ ++RI+K Sbjct: 398 KGEDTLKCMANAVDAPARCYNPAKPVGVYCFAATQNDLTPCF-----SKLASAILRISK 451 >gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis] Length = 1039 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/330 (11%), Positives = 88/330 (26%), Gaps = 42/330 (12%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML---------DYQRDSEGQPLNC 203 + I + ++ +VID S SM R + + + Sbjct: 309 LGDIQIANGYFVHFFAPKDLPRLPKNVVFVIDMSGSMSGTKMQQEAHRAARSLQKRSTDG 368 Query: 204 FGQPADRTVKSY----------SSQNGKVGIRDEKLSPYM-----VSCNKSLYYMLYPGP 248 + + +G I+ + ++ +++ Sbjct: 369 GTARISFSPTIEQQRKCPDCPGTLIDGDFIIKYDVNRENDLGDIQIANGYFVHFFAPKDL 428 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + +D S K R+A+ ++ + D+ + F Sbjct: 429 PRLPKNVVFVIDMSGSMSG-TKMQQTREAMLKILEDLDPEDHFGIILFDHRIQF------ 481 Query: 309 DPSFSWGVHKLIRTIVKT--FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + I + + G T IN + A D + + + K+ Sbjct: 482 --WNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSID- 538 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+LLTDG+ I K+ + + ++ F + N Sbjct: 539 ---MIILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFLDVMSRENNG 595 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 +E + + F D + + + V Sbjct: 596 LARRIYEGSDAALQLQGFYDEVSSPLLLDV 625 >gi|332808244|ref|XP_001147912.2| PREDICTED: cartilage matrix protein [Pan troglodytes] Length = 717 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/440 (10%), Positives = 114/440 (25%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + S VV R + Sbjct: 104 LLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGHSRSPDISKVVIVVTDGRPHDSVQD 163 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + S S+++ + Sbjct: 164 VSARARASGVELFAI-GFGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 222 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C++ + + Sbjct: 223 VSDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 276 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ + ++ ++ D + ++ V + G Sbjct: 277 VFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 331 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 332 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 385 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 386 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 436 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 437 FKTINQ-----IGKKLQKKI 451 >gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1] Length = 471 Score = 64.2 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 47/169 (27%), Gaps = 17/169 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + N+ DT R F+ V + + T ++ + Sbjct: 86 MRNLKDTDRFALVTFDSDVKTVFDLRPMTTAHKEACL-ADVQKLRAGSCTNLSGGLFRGV 144 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK-----SQGIRIMT 402 + + A I+L+TDG + + +C + + I T Sbjct: 145 ELMQQR--------GATKGAVSSILLMTDGIANEGVRDKDDMCRALRGLMGPAPDYTIYT 196 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + + + ++ S + + F D +G + Sbjct: 197 FGYGKDHNENMLRQLSE---TGNGMYYFIESNDIIPESFGDCLGGLLSV 242 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ +K+ D R+ F+ + +K Sbjct: 59 LETVKQAAVQLVEKLKEGD------RLSVVAFDHQAQVIVP---NQMINDSASIKGKINK 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAI 389 G TAI+ ++ + + ++ + + +LTDGEN + + + + Sbjct: 110 LRASGGTAIDKGLKLGIEELNKGRKESISQA----------FILTDGENEHGDNDLCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 A I + ++ F + + L A + + + F Sbjct: 160 AKLATDYNITLNSLGFGDDWN-----QDVLEKIADAGGGNLSYIQQPEQAIEEFSKLFNR 214 Query: 448 EIFERVI 454 + Sbjct: 215 IKSVGIT 221 >gi|307726382|ref|YP_003909595.1| hypothetical protein BC1003_4370 [Burkholderia sp. CCGE1003] gi|307586907|gb|ADN60304.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1003] Length = 350 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/287 (11%), Positives = 74/287 (25%), Gaps = 9/287 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G ++TAL + V++G + VD+ R + L+ AA A + + L + Sbjct: 11 LSRQRGAVAVMTALCLTVLIGTTALAVDLGRAWVVRNELQNAADAAALAGAGSLGPNYAS 70 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + A+ + + + + Sbjct: 71 PNWTQAQTKAQSAITLNRTEGTSLLTAQVQTGYWNVTGTPAGMQALPVPSPGAYDKPAVQ 130 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 L + I T + T + + S+ Y Sbjct: 131 VTVSRAAGQNGGPLSLVLAPVLGISTMPISATAVAVISAPGYAGPGALFPTAISLCLYTA 190 Query: 195 DSEGQPLNCFGQPAD---------RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 PA T + G + + + Sbjct: 191 YWNASTGQPVNDPATGLPQVFKIGSTYQYPGCAAGVGSGQWTSFQYNVQDVTSIRNLIAN 250 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 PL ++ ++ ++ S + D L V+ S+ V Sbjct: 251 GNPLPMNIGDKTWIQSGVKNTIYNSVPSPADVLLPVVNSLDTGSPVP 297 >gi|260824347|ref|XP_002607129.1| hypothetical protein BRAFLDRAFT_118666 [Branchiostoma floridae] gi|229292475|gb|EEN63139.1| hypothetical protein BRAFLDRAFT_118666 [Branchiostoma floridae] Length = 642 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 44/155 (28%), Gaps = 9/155 (5%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F+ R S ++ T+ G T INDA+ A + Sbjct: 326 DFFNIVTFSSRTKRWKSELQVANQENIRAADTYVTSMAAHGGTNINDAILEASVLLDPEL 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 + IVLLTDG+ T I A+ + + F + + Sbjct: 386 RS------RHDSHASMIVLLTDGQPTGGVTNTNHIIANARDSLAGNHALFCLGFGYDVSF 439 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + R L N + + F D + Sbjct: 440 EFLERLALQNGGFARRIYPDDDGELQLTSFFDEVA 474 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 86/292 (29%), Gaps = 24/292 (8%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 ++ AE + + S + + +N F + Sbjct: 148 VKVSAEPLSTFSIDVDKASYSNMRRFINKGEVPPADAIRVEELVNYFKYDYPKPE-GDDP 206 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + L P+ + +D S K LV+ + Sbjct: 207 VAFGIEVGKCPWDGKHRLVKIGLKAREIDTDNLPATNFVFLIDVSGSMDWDGKLDLVKSS 266 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + ++ +++ ID R+ + + G ++ + GST Sbjct: 267 MKLLVNNLRPID------RVAIVVYAGAAGQVLPSTPGSE---KSKILESLNGLTAGGST 317 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAK 394 A + + AY ++ + NN I+L TDG+ N+ + + Sbjct: 318 AGGEGIVLAYKI----AKENLIEGGNNR-----IILCTDGDFNVGVSSNDGLEKLIENER 368 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 G+ + + + + + +K + A + ++ E NK+ G Sbjct: 369 KSGVFLSILGYGMGNYKDDKMQTLAQ--AGNGNHAYIDNMQEANKVLVSEFG 418 >gi|194227183|ref|XP_001916967.1| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Equus caballus] Length = 946 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 61/187 (32%), Gaps = 17/187 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +A+ +++ ++ D + R + + V + I K Sbjct: 330 VEAMKTILDDLRTEDQFSVVDFNHNVR-TWRNDLVSATTTQVADAKKYIEK-----IQPS 383 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 384 GGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLSKIQKNVK 439 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + ++ + R N + + T K F + ++ Sbjct: 440 QNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAHRIYGNQDTSSQLKKFYN----QVST 495 Query: 452 RVIRITK 458 ++R + Sbjct: 496 PLLRNIQ 502 >gi|115450663|ref|NP_001048932.1| Os03g0142500 [Oryza sativa Japonica Group] gi|108706121|gb|ABF93916.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] gi|113547403|dbj|BAF10846.1| Os03g0142500 [Oryza sativa Japonica Group] gi|125584872|gb|EAZ25536.1| hypothetical protein OsJ_09360 [Oryza sativa Japonica Group] gi|215712380|dbj|BAG94507.1| unnamed protein product [Oryza sativa Japonica Group] Length = 694 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 62/196 (31%), Gaps = 33/196 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A++ VI+++ D R+ F+ + ++ Sbjct: 262 KLSLLKRAMSFVIQTLGPND------RLSVVAFSSTAQRLFPLRRMTLTGRQQALQA-IS 314 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I DA++ + I+LL+DG++T G A Sbjct: 315 SLVASGGTNIADALKKGAKVVKDR---------RRKNPVSSIILLSDGQDTHSFLSGEAD 365 Query: 390 CNKA----------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 N + S ++I T F + + A S +F ++ + Sbjct: 366 INYSILVPPSILPGTSHHVQIHTFGFGTDHDSAA-----MHAIAETSNGTFSFIDAEGSI 420 Query: 438 NKIFRDRIGNEIFERV 453 F +G + V Sbjct: 421 QDAFAQCMGGLLSVVV 436 >gi|113867117|ref|YP_725606.1| von Willebrand factor type A (vWA) domain-containing protein [Ralstonia eutropha H16] gi|113525893|emb|CAJ92238.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ralstonia eutropha H16] Length = 566 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/301 (12%), Positives = 88/301 (29%), Gaps = 25/301 (8%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + +A T S +++ ++ + M + LN F + Sbjct: 93 NGVRLVAQAPVSTFSIDVDTGSYSNMRRML-NAGRMPPADAVRVEELLNYFPYDYAQPAD 151 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + P V + P + VD S + K L Sbjct: 152 GRP-FAVHAALAPAPWHPANVLLRIGIKGKDMASAALPPANLVFLVDVSGSMNSPDKLLL 210 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ +L ++ ++ D R+ + + G +T + Sbjct: 211 LKSSLKLLVNKLRPQD------RITLVTYASGTRVALPPTPG---SDKTAISAAIDQLVA 261 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---EGIAIC 390 GSTA + AY + + N ++L TDG+ + ++ Sbjct: 262 GGSTAGASGIALAYQ----AAQQSFIAGGINR-----VLLATDGDFNVGVTDFRQLKSMV 312 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + G+ + T+ F ++ A ++ ++ E NK+ I + + Sbjct: 313 EEKRKSGVSLSTLGFGTGNYNEQLMEQLAD--AGDGAYSYIDNLMEGNKVLVSEISSTLA 370 Query: 451 E 451 Sbjct: 371 T 371 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 65/201 (32%), Gaps = 30/201 (14%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + ++ + R A I R+G F+ + + Sbjct: 105 DVSDSMNTPDFGGKSRFAGARTAAMRFID-------NRPADRIGLVVFSGGSFTRCPLTL 157 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 L R + +T A + TAI A+ TA + + +S +K +VL+ Sbjct: 158 DHEVLGR-LAETVAPGFFDEPGTAIGTAILTATNRLKAS-----------SSKEKALVLI 205 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTI-------AFSVNKTQ--QEKARYFLSNCA-- 423 TDGEN A + GIRI T+ AF K R L A Sbjct: 206 TDGENNAGEVTPETAARLAANYGIRIYTVFAGKEARAFENTSNTALNRKGRSELETVARI 265 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S F A L K FRD Sbjct: 266 SGGRMFSAGDVFGLMKSFRDI 286 >gi|260892924|ref|YP_003239021.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] gi|260865065|gb|ACX52171.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] Length = 298 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 21/65 (32%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 KS G ++ A + +LG+ ++VD L A A + L Sbjct: 3 WKSERGTVAVLVAAALTFLLGLAALVVDGGGLLLARERLVNAVDAAALAGVQFLPGDPSG 62 Query: 75 VSSRA 79 A Sbjct: 63 AVQTA 67 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 68/180 (37%), Gaps = 29/180 (16%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D V TV +G FN+ S + + T G TA +A+ TA + Sbjct: 120 DEVPRTVNLGVISFNNTATVLQSPTRNRSDV-----LTAIDRLAVSGGTATGEAIATATE 174 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + + +N IVL++DG +T + I +A+ I I T+AF + Sbjct: 175 MLRN------QPGENGRRPPSAIVLISDGTSTNGR-DPIEAAAEARRLRIPIYTVAFGTD 227 Query: 409 KTQQE--------------KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L+ A + F A+S L+ +F +R+G+++ R Sbjct: 228 QGTITVPGRDGVERTERVPPDPTALAQIAEMTGGETFTADSADRLDTVF-ERLGSQLGTR 286 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/386 (9%), Positives = 94/386 (24%), Gaps = 36/386 (9%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + +S + + I + + + + + + V Sbjct: 114 DSDSDKDGLSDQFEIEQFGTDPLIVDTDKDGLSDKEEYDLGLDPLTVSALENLVTSTGIP 173 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +V L+ L + L E + + Sbjct: 174 GELRVSLTGDAYLAKHVKIFELGLPNNLQPPGLVGKPYEIHLDDEKVQSMFKAHVSFPYD 233 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S+ T + + + +L + + Sbjct: 234 ESVDPNNLGIFTFDEVKELWVPADTSERQIVIDEASKVVSLELEHFSKYGLFDK--QAWE 291 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK--------KIDNVNDTVRMG 298 + DS+++R + L +D + + R Sbjct: 292 SYYSNIPNVCEPADSNNVRPLELAFVLDSSGSMRWSDPDNIRIVGSKHLVDRLKEVDRGA 351 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ S + + ++K+ + G T I D + A + ++ Sbjct: 352 VIDFDSTAQLLQSLTDN-----KAVIKSALDLIDASGGTDIGDGVSKALEEFANARSAS- 405 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +VLLTDG + ++ + + IR++ I + Sbjct: 406 ---------DWAVVLLTDGSGSYNHALTTELVQ----KNIRVLGITMGSGAN-----QSL 447 Query: 419 LSNCASP--NSFFEANSTHELNKIFR 442 + + + N+ EL ++F Sbjct: 448 IRGISDSTYGIYQHVNTADELIEVFE 473 >gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens] Length = 946 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 73/254 (28%), Gaps = 14/254 (5%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + V + D K Y ++ D + Sbjct: 260 ICPSCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN + + + + + K I K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVIDFNQNIRTWRNDLFQLQKHRLQIAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLS 432 Query: 388 AICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I K I + ++ + R N + T K F + Sbjct: 433 KIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN- 491 Query: 445 IGNEIFERVIRITK 458 ++ ++R + Sbjct: 492 ---QVSTPLLRNVQ 502 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 68/201 (33%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + + A D++ + + +G F + S + Sbjct: 99 SMNATDVAPSRIRAAQSAAKKF------ADDLTEGINLGLISFAGTPSTLVSPTPDHTAT 152 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + K D+ TA + + A D I + +IVLL+DG+ Sbjct: 153 KKAVDKLVLADK-----TATGEGIFAALDQIRT---LNAVLGGPEAAPPAHIVLLSDGKQ 204 Query: 380 TQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T +E KAK +GI + TI+F E L A Sbjct: 205 TVPDEPTDPRGAFTAARKAKEEGIPVSTISFGTAYGTVELDGDRVPVPVDDPSLKQIANL 264 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S +FF A+S ELN+++ Sbjct: 265 SGGNFFTASSLDELNEVYEKL 285 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 25/142 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARI--GIAGKNTALGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 + +VL+TDG N + I A +G++I I + + Sbjct: 189 --------MRPATSRALVLVTDGANNAGQIDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 Query: 413 ----------EKARYFLSNCAS 424 + L A+ Sbjct: 241 QSVLGLNPSLDLDEPTLKEIAT 262 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 58/201 (28%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVAPNRLTAAQEAAKQFADQLTPG------INLGLIAYAGTATVLVSPTTNREST 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I K D TA + + TA I + + +VL++DG+ Sbjct: 162 KTAIDKLQLADR-----TATGEGIFTALQAIATVG---AVIGGGDEPPPARVVLMSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T + AK QG+ I T++F E L A Sbjct: 214 TVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADL 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S F A+S +L ++F + Sbjct: 274 SGGEAFTASSLEQLKQVFTNL 294 >gi|11498366|ref|NP_069594.1| hypothetical protein AF0760 [Archaeoglobus fulgidus DSM 4304] gi|2649856|gb|AAB90485.1| predicted coding region AF_0760 [Archaeoglobus fulgidus DSM 4304] Length = 959 Score = 63.9 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/269 (13%), Positives = 75/269 (27%), Gaps = 62/269 (23%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH------------------VIKKKHLV 274 N +Y + S+ + + + V+K+ Sbjct: 555 YYDGNYYGWYYAWNNYGYDFYSDFFVPNPTPGTYKIIIVPLTKESIVIEPKVVMKRMDAA 614 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + A + + + D T + +R+ + + + V Sbjct: 615 KLAAITFNNMLGEGD---FVGLATFTTYAERISVNQTP-LKYMTKDKLRVNNEIEGLYAK 670 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNN--LEAKKYIVLLTDGENTQ----------- 381 +T DA+ + DE + ++LLTDGE T Sbjct: 671 LATDHADALYWGVK-VFPIWPDETQNNCTECINNTRPLMILLTDGETTTCDKNEDYFNNT 729 Query: 382 ------------DNEEGIAICNKAKSQ------GIRIMTIAFSVNKTQQEKARYFLSNCA 423 ++ + + + K I I TI F + + FL + A Sbjct: 730 CKNKCVRDNGHYGAQQALCVADYIKRNIKVNGFNIPICTIGFGTDIGSD--GQAFLRDIA 787 Query: 424 SPN------SFFEANSTHELNKIFRDRIG 446 SP +F A ++ EL + ++ Sbjct: 788 SPRPDNGEACYFFATTSEELIEAYKTIFN 816 >gi|261820223|ref|YP_003258329.1| hypothetical protein Pecwa_0904 [Pectobacterium wasabiae WPP163] gi|261604236|gb|ACX86722.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 539 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 108/469 (23%), Gaps = 51/469 (10%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV- 66 I G AL +L +VD + +K+A A + Sbjct: 22 RERLSAFIHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQ 81 Query: 67 ------PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 Q + K + + ++ + + A Sbjct: 82 ATINGEEYSQEDINTLAYEYVKSNLGMNKALSEKLVAGDVSVTEGRNSATRKTYTVTVAF 141 Query: 121 EMNPRKSAYQVVLS---SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 P + S +++ P + L ++VS S+ + Sbjct: 142 ATKPSLLSLGARAQEVYSTSEVINRPTEVALVMPVTGDMSAGDIRSLKSVSHSFVERLLG 201 Query: 178 SIQWVIDFSR-SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S D S++ Y + + + + + + Sbjct: 202 SADTKRDNLWLSLVPYSQSVNVYDAEDANRIRRWAAPGALNPPELRSLFASGVVSSLADR 261 Query: 237 NKSLYYMLYPGPLDP-SLSEEHFVDSSSLRHV--IKKKHLVRDALASVIR-SIKKIDNVN 292 E F D + + L ++ S + + Sbjct: 262 RFPDRRANLLCMYRGLGREENFFWDEPPVGQFRVYYRHDLPQNGSPGAPPISWRGPNPDF 321 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D T + P+ + + + + + M A + Sbjct: 322 DDNDAVDTRWIVADKGCPNAALMPLTNEASKLNQRIDQFSARFNVNYAIGMSWAGAALSP 381 Query: 353 SNEDEVH----------RMKNNLEAKKYIVLL---------TDGEN-------------- 379 + + N + +K IV++ TD N Sbjct: 382 NMRGSDGWGDTTLPLDFNLDGNGDGQKVIVMMANTIGDWFDTDSYNFNRNEFRGGTGNDL 441 Query: 380 --TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 T + +C+ +++ I+ + ++ R A+P Sbjct: 442 ARTFAAQRFRDLCSSFRARNIKFYFVGI-RPGDPEDFGRNLFDREATPG 489 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 109 TRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 60/187 (32%), Gaps = 20/187 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ LA + R+G F D F+ + + +T Sbjct: 120 TRLDAVKQVLAQFSQHRDG-------DRLGLILFGDAAYLQAPFTADHETWLALLDETQV 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK-------NNLEAKKYIVLLTDGENTQ 381 ST + DA+ A + +K +++LTDG +T Sbjct: 173 --GMAGQSTHLGDAIGLAIKVFNDQADAAKQAATQQGSAVVTRPVKEKVVIVLTDGNDTD 230 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCAS--PNSFFEANSTHEL 437 I A S+ IRI IA +Q + AS F+A S EL Sbjct: 231 SLVPPIDAAKVAASRDIRIHMIAMGDPRTVGEQALDMEVIEQVASLTGGQSFQALSPAEL 290 Query: 438 NKIFRDR 444 ++++ Sbjct: 291 TRVYKKI 297 >gi|327189769|gb|EGE56913.1| hypothetical protein RHECNPAF_550036 [Rhizobium etli CNPAF512] Length = 533 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 49/176 (27%), Gaps = 15/176 (8%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ G+ +T + MP++LG +++DV R S L+ A + + Sbjct: 2 LSRVVRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV------ 120 L + ++ + D V Sbjct: 62 ELDGRDDAITRARTAIEKIANSAAFSGGGTGMSLGSHISVVYNAGNDAGSTVTVLFLKNI 121 Query: 121 ---EMNPRKSAYQVVLSSRYDLLL------NPLSLFLRSMGIKSWLIQTKAEAETV 167 + S+ Q ++S ++F +G I A+A V Sbjct: 122 PANDDTAIPSSMQTTVASEASYAWVIAKPQAMQTIFPIPVGFTRDTINIAADAVAV 177 >gi|315650876|ref|ZP_07903919.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] gi|315486855|gb|EFU77194.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] Length = 526 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 48/313 (15%), Positives = 92/313 (29%), Gaps = 26/313 (8%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 Y + + + + + + T + +I+ I + Sbjct: 54 YSPNVGYNTEEYNIINESGFQSVSTSPLSTFAADVDTASYANIRRFI-TDGELPPADAVR 112 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + LN F + + I +P L + Sbjct: 113 IEEMLNYFYYDYPQPKDDEP-FSVTTEISSCPWNPDTKLMQIGLQ-AKNTDTTTKPSNLV 170 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 +D S+ K LV++A + +K+ D ++ +D G Sbjct: 171 FLIDVSASMDEPDKLPLVKNAFLLLCDELKENDTISIV---------TYAGTDSVVLEGA 221 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + D GSTA +D ++TAY E +VL TD Sbjct: 222 KGSDKKSIMSAIEDLTAGGSTAGSDGIKTAYKI---------AEKYFKTEGNNRVVLATD 272 Query: 377 GENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 G+ E I + K K I + + F + + K + N ++ +S Sbjct: 273 GDLNVGITSEGELIKLIKKEKESNIFLSVLGFGTDNIKDNKMQSLADN--GDGNYSYIDS 330 Query: 434 THELNKIFRDRIG 446 E K+ D +G Sbjct: 331 RFEAKKVLSDELG 343 >gi|91773456|ref|YP_566148.1| hypothetical protein Mbur_1491 [Methanococcoides burtonii DSM 6242] gi|91712471|gb|ABE52398.1| hypothetical protein with von Willebrand factor type A domain [Methanococcoides burtonii DSM 6242] Length = 1258 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 38/330 (11%), Positives = 93/330 (28%), Gaps = 32/330 (9%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS---MLDYQRDS 196 N ++F + I W+ + ++ G+ I + S S + + + D Sbjct: 704 WENESTIFDLGVPIGYWIKIATDPNSDNAIVGVEDDGMDIHVSMWNSTSWTPLTEIEDDC 763 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS-PYMVSCNKSLYYMLYPGPLDPSLSE 255 F + + G + L + L ++ Sbjct: 764 YEYSKRIFDVAFEGSSGDAMVVWGDSTNTPKYLVWNGSNWGAEGSSSSLASSGYTRWVNL 823 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID----NVNDTVRMGATFFNDRVISDP- 310 E VDS + + + S + + + + + + Sbjct: 824 ESNVDSDEISLITSDGSSDINVQKWNGDSWNVPYEVEISSSRYYEVSDIVYPWKDTTYSS 883 Query: 311 ----------SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + + + + G TAI++ + A + + + + Sbjct: 884 TELNWKEWRATVTSSLSNNSLVHLSNSIDTITADGLTAIDEGLYEANNELSAITGNST-- 941 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +VL++DG + + I +AK I T+ N+ + + ++ Sbjct: 942 ----------VVLMSDGLDNAGHHSLIEEALRAKEHNTVIYTVGLGNNEDEVDPILCEIA 991 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIF 450 N + ++ A ++ L IF Sbjct: 992 NI-TGGKYYFAPNSTVLEDIFIGIASEITN 1020 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 50/153 (32%), Gaps = 21/153 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F +R ++ + + R I + STAI D + A + S Sbjct: 145 DRIGLVLFANRAYVAAPLTFDLAAVGRAIEEASI--GITGRSTAIADGLGLALKRVTESG 202 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 + IVLL+DG++ + + A G+RI TIA + Sbjct: 203 AAS-----------RVIVLLSDGQDNAHQIDARQVAGLAARHGVRIHTIALGPDDLETRP 251 Query: 410 -TQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 + L AS + +L Sbjct: 252 AARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 284 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 60/190 (31%), Gaps = 29/190 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + + + V+ A+ + + D R+ F F++ + Sbjct: 107 DFNLDDRAVTRLEAVK-AVGADFARRRAGD------RLALVVFGSEAYFASPFTFDTESV 159 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R I + +T+I+D + A + +S + ++LL+DG N Sbjct: 160 ARRIEEATI--GISGRATSISDGLGLALKRLSTSTA-----------TSRVVILLSDGIN 206 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-------ARYFLSNCA--SPNSFFE 430 + A G+R+ TIA + L + S F Sbjct: 207 NAGATNPRGVAELAARYGVRVHTIALGPKDLTTAEVGERGVVDAATLRAISQISGGESFR 266 Query: 431 ANSTHELNKI 440 +T +L + Sbjct: 267 VRTTEDLVAV 276 >gi|265766730|ref|ZP_06094559.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263253107|gb|EEZ24583.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 608 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 34/287 (11%), Positives = 85/287 (29%), Gaps = 24/287 (8%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + S + + +N F + Sbjct: 161 STFSIDVDAASYSNMRRFLNKGELPPTNAIRTEELINYFSYNY-ARPTGNDPVRITTEVG 219 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + L P P+ + +D S + ++ LV+ +L ++ ++ Sbjct: 220 TCPWNEQHRLVRIGLKAKEIPTENLPASNLVFLIDVSGSMYGPERLDLVKSSLKLLVNNL 279 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ++ D V ++ + + G + ++ + GSTA + + Sbjct: 280 REKDKVA------IVIYSGAAGEKLASTPGS---DKQKIREAIDELEASGSTAGGEGIML 330 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMT 402 AY + NN I+L TDG+ ++E + + + GI + Sbjct: 331 AYKI----AQKNFISGGNNR-----IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTV 381 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + + + K + + ++ E N++ + G + Sbjct: 382 LGYGMGNYKDNKMQTLAEK--GNGNHAYIDNLQEANRVLVNEFGATL 426 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 32/304 (10%), Positives = 76/304 (25%), Gaps = 41/304 (13%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + + + S S+++ + + + + Sbjct: 178 GVGRVDKHTLRQIASEPLDDHVDYVESYSVIEKLSKKFQEAF-----CVVSDLCATGDHD 232 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKH 272 + Y SC + S + Sbjct: 233 CQQICNSSP-GSYTCSCRDGFTLNSDGKTCSACNGAAGGSATDLVFLIDGSKSVRPENFE 291 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LV+ + ++ S+ D + ++ V + G +K + I Sbjct: 292 LVKRFINQIVDSLDVSDKLAQVG---LVQYSSSVRQEFPL--GRYKTKKDIKAAVKKMSY 346 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A++ D + + +K ++ TDG + + K Sbjct: 347 MEKGTMTGAALKYLIDNTFTISSGARPGA------QKVGIVFTDGRSQDYIND---AAKK 397 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AK G ++ + L AS +F +N+ IG ++ Sbjct: 398 AKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKTINQ-----IGKKL 446 Query: 450 FERV 453 +++ Sbjct: 447 QKKI 450 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 53/167 (31%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ +D +T R+G + V + K + T A+Q Sbjct: 67 IESLDVGPNTTRVGVINYASAVKHEFPLKAHRSKASLLQAVRKIEPLST--GTMTGLAIQ 124 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + E K +++TDG + +AK GI + I Sbjct: 125 FAINKAFSEVEGSRLKF---PEISKVAIVVTDGRPQDG---VKDVSARAKQSGIELFAIG 178 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 ++ L AS + S +L+K F++ Sbjct: 179 VGR------VDKHTLRQIASEPLDDHVDYVESYSVIEKLSKKFQEAF 219 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 48/169 (28%), Gaps = 21/169 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + I+ TAI DA+ + S Sbjct: 139 DRIGLVVFGDAAYPQAPLTLDHDSVR--ILLDQMQIGMAGPRTAIGDAIGLTVKLMADS- 195 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K ++LLTDG +T AK + + TI T E Sbjct: 196 ----------HAQEKVLILLTDGNDTSSAIPPERAAEIAKQHKLVVHTIGIGDPGTTGED 245 Query: 415 ARYF--LSNCAS--PNSFFEA----NSTHELNKIFRDRIGNEIFERVIR 455 L+ AS F A E+ +I + R Sbjct: 246 RVDLEALARIASITGGRAFRALGQQRDLAEVYATLDKLTPEKIKREIYR 294 >gi|291295702|ref|YP_003507100.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470661|gb|ADD28080.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 318 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 60/190 (31%), Gaps = 29/190 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R AL + IR + + +R+ F + +L+ + Sbjct: 105 SRFEAARAALRTFIREL------PEGLRLALVTFARDAHLVVPLTTDRGRLLEAVDFLQL 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + +E L I+LLTDG + + + Sbjct: 159 NL-----GTAIGDAILESIQALPPLSERAEDPDPRRLAT---IILLTDGR-SLGGVDPVV 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------ARYFLSNCA--SPNSFFEANST 434 +A Q IR+ TI + L A +F +S Sbjct: 210 AAQEAARQQIRVHTIGIGRTTSGPVPGLPEVYAQAALFDEETLKEVARVGDGQYFYVDSA 269 Query: 435 HELNKIFRDR 444 +L + +RD Sbjct: 270 EKLKEAYRDL 279 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 46/440 (10%), Positives = 115/440 (26%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + Sbjct: 43 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGLVNYASAVKQEFPLRAHGSKAA 102 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + E ++ VV R + Sbjct: 103 LLQAVRRIQPLSTGTMTGLAIQFAITRAFSEGEGGRARSPDISKVVIVVTDGRPQDSVRD 162 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ GI+ + I + + S S+++ + Sbjct: 163 VSARSRASGIELFAI-GVGRVDKATLREIASEPQDEHVDYVESYSVIEKLSKKFQEAFCV 221 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + + Y +C++ + + Sbjct: 222 VSDLC------ATGDHDCEQVCVSSPASYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 275 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ + ++ ++ D + ++ V + G Sbjct: 276 VFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 330 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 331 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 384 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 385 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 435 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 436 FKTINQ-----IGKKLQKKI 450 >gi|290769918|gb|ADD61688.1| putative protein [uncultured organism] Length = 570 Score = 63.9 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 52/391 (13%), Positives = 99/391 (25%), Gaps = 35/391 (8%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A ++ + + T E+ + + Sbjct: 14 MAVAVCGCGNKKEADETQTGKTDVTVTGEDTVPSGDVAEEDNEMQVDGIMQESATNMATA 73 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLF----LRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + Y L N M +++ + T A + + Sbjct: 74 DCESGFYDDYLQCPDTEEWNTNEYSYTEENPWMNVQTSPLSTFAADVDTASYTQIRSAIE 133 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 D SM+ + LN F D + Sbjct: 134 N--GYDIDPSMVRI-----EEMLNYFHYDYPLPKDDE-KFAVYTEYMDCPWNEDTKLALV 185 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 S+ P+ + +D S K LV+ AL + ++ + D V+ Sbjct: 186 SMNTQAIDFKSAPASNLVFLIDVSGSMFDDNKLPLVQQALTMLAENLTEKDRVSIV---- 241 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 SD GV + + GST + ++TAY + Sbjct: 242 -----TYAGSDEVVLQGVSGDDYHEISSAIEGLEAYGSTNGSAGIETAYAL----AKKYF 292 Query: 359 HRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + NN ++L TDG+ + + + K G+ + T + K Sbjct: 293 IKGGNNR-----VILCTDGDLNVGLTSEGQLEKLITEKKDSGVFLSTFGVGYGNYKDNKL 347 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ +S E K D +G Sbjct: 348 ELLADK--GNGNYAYIDSMFEAKKALVDELG 376 >gi|86740090|ref|YP_480490.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86566952|gb|ABD10761.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 319 Score = 63.5 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 60/196 (30%), Gaps = 25/196 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + ++ + + + +G F Sbjct: 96 SNSMAATDIAPTRLAAAKQGASAFVDQL------PPRINLGLVSFAGSATVLVP-----A 144 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V+ +TA+ + + + I ++ + IVLL+DG Sbjct: 145 SADRESVRAGIRGLQLGPATAVGEGIFASLQAITTAGK--RFSDTGQSAPPAAIVLLSDG 202 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPN 426 E T+ AI A+ I + TIA+ + L + A + Sbjct: 203 ETTRGRPNNQAI-EAARQARIPVDTIAYGTADGTLDVGGQEVPVPVNEQALRDIAEQTGG 261 Query: 427 SFFEANSTHELNKIFR 442 S+ A S EL ++R Sbjct: 262 SYHRATSGDELRSVYR 277 >gi|218659662|ref|ZP_03515592.1| hypothetical protein RetlI_08405 [Rhizobium etli IE4771] Length = 81 Score = 63.5 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 29/75 (38%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ G+ +T + MP++LG +++DV R S L+ A + + Sbjct: 2 LSRIVRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 PLIQSLEEVSSRAKN 81 L + ++ Sbjct: 62 ELDGRDDAITRAQTA 76 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 63.5 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 59/210 (28%), Gaps = 39/210 (18%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + + L I T R+G F + H L + Sbjct: 153 PRGKRLTRLDTAKLVLQDFISR-------RRTDRLGVVVFGKAAYVLSPPTLDYHLLTQM 205 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + + +TAI DA+ TA + + K ++LLTDG++ Sbjct: 206 VSQMTLNVID-GSATAIGDALGTAVARLR-----------RSDAQSKVVILLTDGDSNAG 253 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA------------------RYFLSNCA- 423 + A S G ++ TI + + + L A Sbjct: 254 AISPEYATHLATSLGAKVYTIQIGTDDEVEVEDGIDLFGQPRYVRHRFPVNPALLQEIAQ 313 Query: 424 -SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + + A L D + R Sbjct: 314 KTGGASYVATDAKALADSMHDVLDRLEKTR 343 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 63.5 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 59/148 (39%), Gaps = 25/148 (16%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +M +++ + S + +K ++ G T I+ + A D + + Sbjct: 584 KMMIVSYDNEAYLEQSLTS-----RSGTLKNSIAAISDGGGTNISAGLNLALDNLEAEKG 638 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + ++L++DG++ E+ A ++A GI + T+ F E Sbjct: 639 ------------SRAVILMSDGQDGGSEEDMQAATDRAAKLGISVYTVGFG------ECD 680 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIF 441 ++ A + F +A+++ EL+ I+ Sbjct: 681 DAYMQAIAEVTGGKFVKASASTELSDIY 708 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 63.5 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 57/165 (34%), Gaps = 20/165 (12%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + DR+ + + + +T +K F + T ++ + A + + Sbjct: 80 VSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDMSVGLDEAVKVLKQGMDPAHA 139 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNE----------EGIAICNKAKSQGIRIMTIAFSVNK 409 IV+L DG N D + +AK GI I TI + + Sbjct: 140 P---------MIVVLADGNNDLDPNTGRTSKEASEQLAQAVKEAKGSGIPIYTIGLNADG 190 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ L+N + F +S +L +I + + +++ Sbjct: 191 KLNKETLAELAN-QTGGKSFTTSSADDLPQILSEIFASHQQLKIV 234 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 67/197 (34%), Gaps = 21/197 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWG 315 + + + A+ ++ ++ V V +T D D Sbjct: 68 SMNQDDAPGLRIDAAKAAVTDLLGTLPAPTQVGLMVYGTSTGSTDAERAAGCQDIKTLAP 127 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 V L + + G T I +A++ A + + + IVL++ Sbjct: 128 VGTLNAATLTSQVAGITASGYTPIGNALRAAAQALPNEGP-------------RSIVLVS 174 Query: 376 DGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 DGE+T + + QG+ + T+ F V+ T +++ A+ ++ +A + Sbjct: 175 DGEDTCAPPAPCDVARELHEQGVDLTVHTVGFKVDATARDQLSCVAQ--ATGGTYSDAGN 232 Query: 434 THELNKIFRDRIGNEIF 450 L + ++ I Sbjct: 233 ATGLTDALQAKVEVAIS 249 >gi|153006798|ref|YP_001381123.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] gi|152030371|gb|ABS28139.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] Length = 381 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 18/160 (11%), Positives = 51/160 (31%), Gaps = 2/160 (1%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + + G +I A+++ V+ G + ++V L+ A+ + + L Sbjct: 1 MPARGKRRERGAVAVIVAIVLLVLGGFMALSLNVGHLLSVRGELQNASDAGALGGAWDLD 60 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD-REVRDIVRDTAVEMNPRKSA 128 +++ ++ + + + +Y + + + + + NP + Sbjct: 61 CTVDGIAKARVAALDYATRHSTDYQPLLPSDTMVETGYWSVDKERFWPISDPGSNPELVS 120 Query: 129 YQVVLSSRYD-LLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + R + N +F + A A V Sbjct: 121 LTNAVRVRSTRVDANAAPVFFPVFLGGNNTANVGAGAIAV 160 >gi|51893456|ref|YP_076147.1| hypothetical protein STH2318 [Symbiobacterium thermophilum IAM 14863] gi|51857145|dbj|BAD41303.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 414 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 59/185 (31%), Gaps = 24/185 (12%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + AL ++ + + D R+ ++D+V + + V+ Sbjct: 64 KQALRFLVDQMAEED------RLAIVTYDDQVHVPFP---SQPVVQKDAVRLLVDGITAG 114 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI---AICN 391 G+T ++ + T I ++L+TDG + Sbjct: 115 GTTNLSGGLATGMQQIRPHAGPGRVSR---------VLLMTDGLANVGVTDPDVLAGWAR 165 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + +G+ + T+ ++ A +F + ++ +IF++ + + Sbjct: 166 AWREKGLAVSTMGVG-PHFSEDLLVALAE--AGGGNFHYIANPDQIPRIFQEELHGLLQV 222 Query: 452 RVIRI 456 V + Sbjct: 223 AVQGL 227 >gi|86360183|ref|YP_472072.1| hypothetical protein RHE_PC00139 [Rhizobium etli CFN 42] gi|86284285|gb|ABC93345.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 671 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 53/401 (13%), Positives = 105/401 (26%), Gaps = 26/401 (6%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ--KIEEYLIRNFENNLKKNFTDREV 111 AA +A I + RA+ E + V Sbjct: 103 SSAADSAQIMQDKEPQAAKSAAELRAEFDAGEIAALKSKSEDSAAALGMAKRAAPAAPGV 162 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + A M + R L L+P + + T S Sbjct: 163 VAQGQLLAEPMAAVAPSPVPPTDGRAQLQLDPSRERFANAAVNPIKSVAADPVSTFSADV 222 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 S + +M D Q + +N F K+ V + + Sbjct: 223 D-SASYSFVRRSLTAGAMPDPQSVRVEEMINYFPYDWPGPEKADQPFKATVTVMPTPWNH 281 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + ++ P + +D S K L++ A ++ +K D V Sbjct: 282 DTQLMHVAIKGYDIAPATAPHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVNKLKADDTV 341 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + + + + ++ + + GST + + AYD Sbjct: 342 S------IVTYAGNAGTVLEPT---RVAEKSKILSAIDRLEAGGSTGGAEGIAAAYDL-- 390 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVN 408 + + + L TDG+ +E+ I + + GI + + F Sbjct: 391 ------AKKAFVKDGVNRVM-LATDGDFNVGPSIDEDLKRIIEEKRKDGIFLTVLGFGRG 443 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + N S ++ E K + G+ + Sbjct: 444 NLNDSLMQTLAQN--GNGSAAYIDTLAEAQKTLVEEAGSTL 482 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/252 (13%), Positives = 69/252 (27%), Gaps = 27/252 (10%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 +T + M ++G M VD S + A+ A A + L + + Sbjct: 1 MTVVAMVPLMGALAMAVDFTEMSREKQAVSNALDAANFATARRLTEGATD---------D 51 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + ++ N N T + T ++ Y P+ Sbjct: 52 QLRAYALDFFNANLNKINPANTT-------LTVTLPSNTTGGGLLKMTARLDYKPYFYPV 104 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 F + +G + S K + + V+D S SM S ++ Sbjct: 105 --FGQLVGKSETDANQRISFNITSEVRLKNT-LEVALVLDNSGSMTKTGTGSGQTRIDLL 161 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKL--------SPYMVSCNKSLYYMLYPGPLDPSLSEE 256 A + V + + Q + D + + V + L P +E Sbjct: 162 KTAAKQLVDTLAQQAAMIKQVDRPVQFGLVPFAASVNVGPGNGNAPWMDTEGLSPVSNEN 221 Query: 257 HFVDSSSLRHVI 268 + + Sbjct: 222 FDWSTLNAADKY 233 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 27/248 (10%), Positives = 56/248 (22%), Gaps = 19/248 (7%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 D+ + + + + S S N +G+ Sbjct: 297 EYDYIDTYGPFASWQGCVEARPYPYNVNDAAPSGGSANTGIGV---------GDPATMFV 347 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHV------IKKKHLVRDALASVIRSIKKIDNVNDTV 295 M P D ++ + + K Sbjct: 348 PMFAPDEPGNHWRLTQDPDEAAPTTYGAINSWWNDDPTSSTGQSRLRNMSKYFQPRPINA 407 Query: 296 RM----GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 ++ + T +K G T + + M + + Sbjct: 408 PALAAGNGPNYSCSTNPITPLTDVSVTTGLTAIKAAIDLMKPDGGTNVPEGMAWGWRVVS 467 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S R + K +++LTDG NT ++ + A S+ + Sbjct: 468 SGEPFTQGRPETERGNDKVVIVLTDGANTYYTPSSLSHSDPADSKSTYASFGYLNPGYNG 527 Query: 412 QEKARYFL 419 R F+ Sbjct: 528 TSVGRLFM 535 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 41/321 (12%), Positives = 85/321 (26%), Gaps = 30/321 (9%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + I + + + D G P + +Y + N Sbjct: 322 NVNDAAPSGGSANTGIGVGDPATMFVPMFAPDEPGNHWRLTQDPDEAAPTTYGAINSWWN 381 Query: 224 IRDEKLSPYMVSCNKSLYYMLYP-GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 + N S Y+ P + S+ V L ++ Sbjct: 382 DDPTSSTGQSRLRNMSKYFQPRPINAPALAAGNGPNYSCSTNPITPLTDVSVTTGLTAIK 441 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +I + T + RV+S I + +T + Sbjct: 442 AAIDLMKPDGGTNVPEGMAWGWRVVSSGEPFTQGRPETERGNDKVVIVLTDGANTYYTPS 501 Query: 343 ------------MQTAYDTIISSNEDEVHRMKNNLEAKKYIVL-LTDGENTQD-NEEGIA 388 ++ + + ++G T NE+ Sbjct: 502 SLSHSDPADSKSTYASFGYLNPGYNGTSVGRLFMGTSSAIGQFDYSNGNYTNALNEQMAT 561 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCASPNSF-------------FEANS 433 +CN AK+ I +MT+A ++ T+ + L +C+S + F F + Sbjct: 562 LCNNAKAANIMVMTVALDLSTTKTADQQAIEALKSCSSNSRFRKDPTDASKPAKLFWNAT 621 Query: 434 THELNKIFRDRIGNEIFERVI 454 L+ F++ R++ Sbjct: 622 GASLSNDFKEIGNELSNLRIV 642 >gi|125559999|gb|EAZ05447.1| hypothetical protein OsI_27661 [Oryza sativa Indica Group] Length = 704 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 59/201 (29%), Gaps = 38/201 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI + D R+ F+ + + + K+ Sbjct: 247 KLTLLKRAMGFVIDKLGPGD------RLAVVSFSYNAQRVIRLT-RMSDDGKASAKSAVE 299 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I + A + A ++LL+DG++T + G Sbjct: 300 SLAAGGGTNILKGLVEAAKVFD---------GRRYRNAVASVILLSDGQDTYNVNGGWG- 349 Query: 390 CNKAK--------------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + +K + + + T F + + A + +F + Sbjct: 350 ASNSKNYSVLVPPSFKRSGDRRLSVHTFGFGTDHDAAA-----MHAIAEETGGTFSFIEN 404 Query: 434 THELNKIFRDRIGNEIFERVI 454 + F IG + V Sbjct: 405 QAVVQDAFAQCIGGLLSVTVQ 425 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 56/161 (34%), Gaps = 21/161 (13%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + + + + T + + G T+I ++ + I SN+ Sbjct: 349 MVTFDSSAEIQNNLTKIIDE-NTYQKITANLPQKPSGGTSICGGLKAGFQAISQSNQSTS 407 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARY 417 I+LLTDGE+ Q ++ C + K G I TIA ++ Sbjct: 408 GSE---------IILLTDGEDNQ-----MSSCFEEVKQSGAIIHTIALG---PSADRELE 450 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNE--IFERVIRI 456 LSN F+ + L F I ++ +++ Sbjct: 451 TLSNMTRGRRFYAHKDINGLIDAFSRISSRSGNISQQAVQL 491 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 54/176 (30%), Gaps = 29/176 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + + A I + + +G F+ + + V Sbjct: 429 KLEMAKSAA------IATAEVLTRNDSIGVYAFDSEAHVVVPMTRLTSS---SAVAGQIA 479 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T ++ A A + + K++++LTDG+ + A+ Sbjct: 480 GLTSGGGTNLHPAFTEARNALQ-----------RTKAKIKHMIILTDGQ--TSGQGYEAL 526 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 ++ +++G+ I T+A L AS + + +IF Sbjct: 527 ASQCRAEGVTISTVAIGDGAHVG-----LLQAIASLGGGKSYTTLDAANIVRIFTQ 577 >gi|260771476|ref|ZP_05880401.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] gi|260613602|gb|EEX38796.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] Length = 407 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 34/337 (10%), Positives = 77/337 (22%), Gaps = 12/337 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + I+ G +I + M V++ V +D+ L+ + A + A++ L + Sbjct: 8 RGIRKQRGLVVVIVTIAMLVLIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + T K + + + + Sbjct: 68 TEAQADTIARSTLTKMSTAAGNAELTLDVSNVVNVEVQFSNDPTVFPDSGYSSSPDGDRY 127 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + L F R MG+ L + + + +++ D + + Sbjct: 128 VRVVINQLDLESFFFARVMGVTKRLTASAVAGPSPGGNACNIVPMAVC-EGDDAGTNGYD 186 Query: 193 QRDSEGQPLNCFGQPADRTVK-SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + PA + E L+ C Sbjct: 187 AGVVYALKIADQNDPAMGSGNFQLLDFGSGASTVREALAGGYAGCVDLDAGETVTTKPGN 246 Query: 252 SLSEEH------FVDSSSLRHVIKKKHL---VRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ F D S V+ ++ + Sbjct: 247 TVGPVGQGLNTRFGDYSGGGLSATDYPPDLYVKQTTPTLTDADVGSTPAWGYQDYKDDLN 306 Query: 303 NDRVISD-PSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ D + R + G T Sbjct: 307 DNVCPGDATNCKSNGDAERRILAIPVVDCSGASGGTN 343 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 107 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 156 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 210 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 211 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 268 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 269 GQLSDVYKSI 278 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 32/85 (37%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G I L M V++G G + +DV + L+ +A ++A+ + + Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAKAIDSGSDHA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFEN 100 ++ + I + + Sbjct: 72 AAILAGNAAINNNLILDGFGSMTID 96 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 50/153 (32%), Gaps = 21/153 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F +R ++ + + R I + STAI D + A + S Sbjct: 132 DRIGLVLFANRAYVAAPLTFDLAAVGRAIEEASI--GITGRSTAIADGLGLALKRVTESA 189 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 + IVLL+DG++ + + A G+RI TIA + Sbjct: 190 AAS-----------RVIVLLSDGQDNAHQIDARQVAGLAARHGVRIHTIALGPDDLETRP 238 Query: 410 -TQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 + L AS + +L Sbjct: 239 AARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 45/441 (10%), Positives = 126/441 (28%), Gaps = 40/441 (9%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 AL++ +++ V + + + A + +I + + + Sbjct: 8 ALMLILLMTVISWGAVSAQPFIVKGIVTDAESGEAVAGCSVVINGTQNGTITGTDGTYSI 67 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR---------------KSAYQV 131 + L+ +F + R V A + Sbjct: 68 RANKGATLVFSFIGYRTQKIKVTSARLDVALETDTQQLEECVVIGYGVETQKMICGATPI 127 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ + + +++ + T S +++ ++ + Sbjct: 128 AMNHSLLYNVEMSAEEYKAIQENGFKKVWDNPLSTFSIDVDVASYSNMRRFLNK-GELPP 186 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + +N F + I + L P P Sbjct: 187 ADAIRTEELINYFSYNYAQPT-GNDPVRITSEIGTCPWNEQHRLVRIGLKAKEIPTENLP 245 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + +D S + ++ LV+ +L ++ +++ D V ++ + Sbjct: 246 ASNLIFLIDVSGSMYGPERLDLVKSSLKLLVNNLRDKDKVA------IVVYSGAAGEKLA 299 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + G + ++ + GSTA + ++ AY NN I Sbjct: 300 STPGS---DKQKIREAIDELEAGGSTAGGEGIKLAYKI----ARKNFITGGNNR-----I 347 Query: 372 VLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +L TDG+ ++E + + + G+ + + + + + K + + Sbjct: 348 ILCTDGDFNMGVSSDQELKKLIEQKRKSGVFLTVLGYGMGNYKDSKMQTLAEK--GNGNH 405 Query: 429 FEANSTHELNKIFRDRIGNEI 449 ++ E N++ + G + Sbjct: 406 AYIDNLQEANRVLVNEFGATL 426 >gi|239980776|ref|ZP_04703300.1| hypothetical protein SalbJ_15127 [Streptomyces albus J1074] Length = 628 Score = 63.5 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 60/178 (33%), Gaps = 15/178 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + RDA+ +V+ S+ V +D + V L R +K Sbjct: 41 TRIEAARDAVGTVVDSLPDGYPTGLRVYGADR---TSGCTDTRLARPVEPLDRDAMKKAV 97 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I +++ A + + ++ ++L++DGE+ + Sbjct: 98 AGVEPKGDTPIGLSLRKAVADL--------PEPEPGAVGRRTVLLISDGEDNCGSPPPCE 149 Query: 389 ICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + G+ RI I F V +E+ A ++++A L + + Sbjct: 150 AAEELAESGLDLRIDAIGFQVEGKAREELTCVAE--AGHGAYYDAPDAEALARQLQRA 205 >gi|149437043|ref|XP_001515962.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 948 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 58/179 (32%), Gaps = 7/179 (3%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V S L+ T K + G T Sbjct: 331 QTVEAMKTILDDLRAEDQFSVIDFNHNVRSWKDNLVPATDLMTTDAKKYIEKIQPNGGTN 390 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ + + I+L++DG+ T + I K Sbjct: 391 INEALLRAIFILREASN----LGMLDPNSVSLIILVSDGDPTVGELKPTVIQKNVKKNMR 446 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 I + ++ + R N + T K F +++ + + V Sbjct: 447 DNISLFSLGIGFDVDYDFLERLSRENHGMAQRIYGNQDTSSQLKQFYNQVSTPLLQNVQ 505 >gi|297618081|ref|YP_003703240.1| hypothetical protein Slip_1922 [Syntrophothermus lipocalidus DSM 12680] gi|297145918|gb|ADI02675.1| protein of unknown function DUF2134, membrane [Syntrophothermus lipocalidus DSM 12680] Length = 310 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 43/126 (34%), Gaps = 2/126 (1%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +KLI G ++ AL + +LG ++ D + + A +A++ + L + Sbjct: 5 RKLIGGEGGTAAVLFALSLTALLGFTALVTDSGLLFLNQSRVANAVDSAVLAGAQELPGN 64 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ A + + + + R + A + + + Sbjct: 65 PD--AALAVARTYAEANGVTDAEAVFSVSEDGRQIVGEATRRVGLYFARVLGFTQGEVKA 122 Query: 132 VLSSRY 137 ++R Sbjct: 123 RAAARV 128 >gi|296481520|gb|DAA23635.1| inter-alpha globulin inhibitor H2 polypeptide [Bos taurus] Length = 946 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 17/187 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +A+ +++ ++ D+ + FN V + + K K + Sbjct: 330 VEAMKTILDDLRTEDHFS------VVDFNHNVRTWRNDLVSATKTQVADAKNYIEKIQPS 383 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 384 GGTNINEALLRAIFILNEANN----LGMLDPNSVSLIILVSDGDPTVGELKLSKIQKNVK 439 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + ++ + R N + T K F + ++ Sbjct: 440 QNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLKKFYN----QVST 495 Query: 452 RVIRITK 458 ++R + Sbjct: 496 PLLRNVQ 502 >gi|148238273|ref|NP_001091485.1| inter-alpha-trypsin inhibitor heavy chain H2 [Bos taurus] gi|146186952|gb|AAI40657.1| ITIH2 protein [Bos taurus] Length = 946 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 17/187 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +A+ +++ ++ D+ + FN V + + K K + Sbjct: 330 VEAMKTILDDLRTEDHFS------VVDFNHNVRTWRNDLVSATKTQVADAKNYIEKIQPS 383 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 384 GGTNINEALLRAIFILNEANN----LGMLDPNSVSLIILVSDGDPTVGELKLSKIQKNVK 439 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + ++ + R N + T K F + ++ Sbjct: 440 QNIRDNISLFSLGIGFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLKKFYN----QVST 495 Query: 452 RVIRITK 458 ++R + Sbjct: 496 PLLRNVQ 502 >gi|222612967|gb|EEE51099.1| hypothetical protein OsJ_31818 [Oryza sativa Japonica Group] Length = 600 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 59/208 (28%), Gaps = 41/208 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI ++ D R+ F+ S + K Sbjct: 173 NKLALLKQAMGFVIDNLGPGD------RLCVISFSSGASRLMRLSRMTD-AGKAHAKRAV 225 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 + G T I A++ A + A + ++LL+DG++T Sbjct: 226 GSLSARGGTNIGAALRKAAKVLDDR---------LYRNAVESVILLSDGQDTYTVPPRGG 276 Query: 382 -----------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 A + + T F ++ + A + +F Sbjct: 277 YDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFG-----KDHDAAAMHTIAEVTGGTF 331 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRI 456 + + F IG + V + Sbjct: 332 SFIENEAAIQDGFAQCIGGLLSVAVQEL 359 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 63.5 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 62/171 (36%), Gaps = 20/171 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ +V+R + N R+G F F+ + + +T Sbjct: 110 TAVKTVLRQLA---NERAGDRLGLIVFGSSAYLQSPFTEDHRTWLLLLNETRIR--MAGP 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 STA+ DA+ A + + ++LLTDG +T + A + Sbjct: 165 STALGDAVGLAIKLFKDA-----------ETEHRVLLLLTDGNDTGSLVPPVDAARVAAT 213 Query: 396 QGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + IRI IA ++ L+ A + FEA S+ +L +F+ Sbjct: 214 EDIRIYPIAVGDPTAVGEEAIDLDTLARMAEVTGGQAFEALSSEDLIAVFK 264 >gi|3024062|sp|P97279|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus] Length = 946 Score = 63.1 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 60/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K T K + G T Sbjct: 328 QTVEAMKTILDDLRTEDQFSVVDFNHNVRTWRNDLVSATKTQITDAKRYIEKIQPSGGTN 387 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ N ++ IVL++DG+ T + I K Sbjct: 388 INEALLRAIFILNEASN----LGMLNPDSVSLIVLVSDGDPTVGELKLSKIQKNVKQNIQ 443 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 444 DNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNRDTSSQLKKFYN----QVSTPLLR 499 Query: 456 ITK 458 + Sbjct: 500 NVQ 502 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 63.1 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 59/201 (29%), Gaps = 40/201 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A+ + + + +G F D + + + Sbjct: 112 TRFDAAKHAIRTFVE-------GREHDPLGLVIFGDEAALVTPPTLDYTSFLSRMDAVRV 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +A+ M A + S+ + +K +V+++DGEN + Sbjct: 165 MKL--GRGSALGLGMAVATVHLEKSSAE-----------RKVMVIVSDGENNAGEITPES 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNC--ASPNSF 428 A S GIRI + + + L ++ Sbjct: 212 AARVAASLGIRIYAVGVGGEGSVATEFTDPETGKSYRGTYEGKIDMELLKAVTESTRGQA 271 Query: 429 FEANSTHELNKIFRDRIGNEI 449 F A S L+++FR+ E Sbjct: 272 FLAGSPGALSQVFREIDALES 292 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 63.1 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 63/198 (31%), Gaps = 27/198 (13%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S + + R A +V+ R+G F + + H ++ Sbjct: 112 SQTDTQSRFEIARQAARNVVEQ-------RSNDRIGLVVFRGEAYTLSPLTRD-HTVLSL 163 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ + + TAI A+ A + + +S + ++LLTDGEN Sbjct: 164 LLDNLSSRIIQDDGTAIGSALLVALNRLQAS-----------ESELQMVILLTDGENNAG 212 Query: 383 NEEGIAICNKAKSQGIRIMT--IAFSVNKTQQEKARYF----LSNCA--SPNSFFEANST 434 + A +G+R +AF K + L A + S+F N+ Sbjct: 213 EVSPLTAAALAARRGVRFYVLNVAFESVKDENAPRSALYAAELQEVARRTGGSYFTVNNK 272 Query: 435 HELNKIFRDRIGNEIFER 452 EL + Sbjct: 273 TELETTIASIAARAKNGQ 290 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 63.1 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 47/155 (30%), Gaps = 18/155 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F D + + I+ TAI DA+ + S Sbjct: 131 DRIGLVVFGDAAYPQAPLTLDHDSVR--ILLDQMQIGMAGPRTAIGDAIGLTVKLMADS- 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K ++LLTDG +T AK + + T+ T E Sbjct: 188 ----------HAQEKVLILLTDGNDTSSAIPPERAAEIAKQHKLVVHTVGIGDPGTTGED 237 Query: 415 ARYF--LSNCAS--PNSFFEANSTH-ELNKIFRDR 444 L+ AS F A +L +++ Sbjct: 238 RVDLEALARIASITGGRAFRALGQEKDLTEVYATL 272 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 109 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 107 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 156 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 210 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 211 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 268 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 269 GQLSDVYKSI 278 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 64/194 (32%), Gaps = 28/194 (14%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + ++ + ++V++ + + I S R+G F S+ Sbjct: 99 YGFNEKKYKQSRLNVVKEVVQNFINS-------RVKDRIGLVVFGTTAGIASPLSFDKEA 151 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + STA+ D + ++ + KN+ K I+LL+DGE Sbjct: 152 QKNIVGN--INVGVLGKSTALIDGIVSSIQLL-----------KNSKSKSKIIILLSDGE 198 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 ++ AK I+I TI + + A + FE + + Sbjct: 199 DSASKIPLEFALKLAKKYNIKIYTITIDKSY------SDMMKVIANKNGAKNFEVQNKED 252 Query: 437 LNKIFRDRIGNEIF 450 L K+++ E Sbjct: 253 LVKVYKSIDSLEKS 266 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 65/200 (32%), Gaps = 31/200 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + K L ++A + +++ D R+G F+ I F ++T Sbjct: 413 VSKFDLAKEAAILALTALQAGD------RIGVLAFDTDTIWVIPFQAVGEGAAVAELQTR 466 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-EEG 386 G T I A+ + + H VLLTDG + +N Sbjct: 467 IATMAIGGGTNIERALAVGLPALAAEPHSVRHA-----------VLLTDGRSYSNNYPRY 515 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR-- 442 + A++ I + TIA + L A ++ +L +I Sbjct: 516 QQLVETARAAQITLSTIAIGTDAD-----TDLLEQLARWGNGRYYFVPDAADLPRITLQE 570 Query: 443 -DRIGNEIFER---VIRITK 458 + G+E+ +R+ + Sbjct: 571 SEIAGSELTVEQPSPVRLNQ 590 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 61/166 (36%), Gaps = 22/166 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G F V++ S+ K + + + TA+ D++ T+ + + +S Sbjct: 127 NDNIGLVVFGTSVLTASPLSFD--KNSQKEIIKYIDIGIVGEQTAMFDSLATSINILKNS 184 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 I+LLTDGE+ I AK I+I TI E Sbjct: 185 KAKSN-----------IIILLTDGEDNASKIPPQIILKLAKKYKIKIYTIGIG------E 227 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 R LS + + F ANS +L +++ + I + + T Sbjct: 228 SNRQMLSTISQETGAKSFLANSKDDLVEVY-NTINKLEKSDIDKNT 272 >gi|293605449|ref|ZP_06687831.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] gi|292816177|gb|EFF75276.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] Length = 343 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 52/180 (28%), Gaps = 24/180 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ +A I R+G F + L ++ A Sbjct: 118 SRWDAVKAVVADFIAQ-------RADDRLGLIVFGTGAYPQAPLTRDHASLK--LLLDEA 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TA+ DA+ + ++ E K ++LLTDG +T Sbjct: 169 AVGMAGPNTAVGDAIGLGIRMLDAA-----------EEQDKVLILLTDGNDTGSAVPPAR 217 Query: 389 ICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 A + + TI + L A + FF A L +++ Sbjct: 218 AAALAAQHHVTVHTIGIGDPAATGEDRVDFDILREVARTAGGQFFPARDLATLREVYATL 277 >gi|220933243|ref|YP_002512142.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994553|gb|ACL71155.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 325 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 45/153 (29%), Gaps = 15/153 (9%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R + + + + + + A TAI D + A + +S Sbjct: 141 DRFALIPMAEEAATLVPLTGDRELVASQLARLRAGML--GDDTAIGDGIALALRQLQASG 198 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQE 413 + ++L +DGE+ A++ G+ + T+ Sbjct: 199 AERRPA----------LILFSDGESNAGLLRPSEALALARAAGVALYTVEITGGQALAPV 248 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L++ A + F + +L + Sbjct: 249 EGEPSLADMAETTGGRHFHVTRSADLEAVIATI 281 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 63/222 (28%), Gaps = 30/222 (13%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + + + +DSS L + A S + + + Sbjct: 62 TTVTIEVTGAGSTSTSAVPMDVVFAIDSSGSMQSNDPSGLRKTAAKSFVDKMDSSRDTAG 121 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++D + + + + GST +N ++ A D + ++ Sbjct: 122 -----VVSWDDSIDFSLPLTNDFPLVKTN-----IDSVDSSGSTNLNVGLEEAIDILDAN 171 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 E I+ LTDG+ T + +A +G I +I Sbjct: 172 PRTENSVE--------VIIFLTDGQGTY----LHSTAQEAADKGYVIYSIGLGG------ 213 Query: 414 KARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERV 453 L + A+ +++ + L IF D + Sbjct: 214 VNPTPLQDMATTTGGAYYSSPDATSLQAIFDDIFSEVTTSTI 255 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 30/280 (10%), Positives = 64/280 (22%), Gaps = 33/280 (11%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 E + + S S+++ + A Sbjct: 682 GVGRVEMSTLRQMASDPLDDHVDYVESYSVIEKLTKKFQEAFCVSDLCATGDHD------ 735 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH---LVRD 276 + Y +C + + V + LV+ Sbjct: 736 -CEQVCISSPGSYKCACKEGFTLTDDGRSCRACSNAATDVVFLIDGSKSVRPENFELVKK 794 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + +I + DN ++ V + G + + + + Sbjct: 795 WINQIIDKLDVSDNKAHVG---LVQYSSAVKQEFPL--GRYNNKKDLKEAVKKMAYMERG 849 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A++ D + + K ++ TDG + + KAK Q Sbjct: 850 TMTGQALRYLTDNSFGPGQGARPGVT------KVGIVFTDGRSQDYIGD---AAKKAKDQ 900 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G ++ + L AS +F Sbjct: 901 GFKMYAVGVG------NAVEDELKEIASEPTAEHYFYTAD 934 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 55/167 (32%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ +D + R+G + RV ++ S K T + T A+Q Sbjct: 571 IEGLDVGPNATRVGVVNYASRVKNEVSLKTHRTKAGLIKAVTKIEPLST--GTMTGLAIQ 628 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + + K +++TDG + + + +A+ GI I I Sbjct: 629 FAMNVAFSEAEGAR---LRSPDISKVAIVVTDG---RPQDNVKDVAQRARDAGIEIFAIG 682 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L K F++ Sbjct: 683 VGR------VEMSTLRQMASDPLDDHVDYVESYSVIEKLTKKFQEAF 723 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 52/153 (33%), Gaps = 21/153 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F +R ++ + + R I + STAI D + A ++ S Sbjct: 132 DRIGLVLFANRAYVAAPLTFDLAAVGRAIEEASI--GITGRSTAIADGLGLALKSVTES- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 A + IVLL+DG++ + + A G+RI TIA + Sbjct: 189 ----------SAASRVIVLLSDGQDNAHQIDARQVAGLAARHGVRIHTIALGPDDLETRP 238 Query: 410 -TQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 + L AS + +L Sbjct: 239 AARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 109 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 58/201 (28%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVAPNRLVAAQEAAKQFADQLTPG------INLGLIAYAGTATVLVSPTTNREAT 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I K D TA + + TA + + + IVL++DG+ Sbjct: 162 KAAIDKLQLADR-----TATGEGIFTA---LQAVATVGAVIGGGDEPPPARIVLMSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T + AK QG+ I T++F E L A Sbjct: 214 TVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLKKIADL 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S F A+S +L ++F + Sbjct: 274 SGGDAFTASSLEQLKQVFTNL 294 >gi|301788660|ref|XP_002929747.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Ailuropoda melanoleuca] Length = 946 Score = 63.1 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K K + G T Sbjct: 328 QTVEAMKTILDDLRAEDQFSVIDFNHNVRTWRNDLVSATKTQIVDAKKYIEKIQPSGGTN 387 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + +N + E+ I+L++DG+ T + I K Sbjct: 388 INEALLRAIFILNEANN----LGMLDPESVSLIILVSDGDPTVGELKLSKIQKNVKQNIR 443 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 444 DNIALFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 499 Query: 456 ITK 458 + Sbjct: 500 NVQ 502 >gi|301777181|ref|XP_002924014.1| PREDICTED: cartilage matrix protein-like [Ailuropoda melanoleuca] Length = 495 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 38/377 (10%), Positives = 86/377 (22%), Gaps = 51/377 (13%) Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + I+ N R V + V N + L Sbjct: 115 TGTMTGLAIQFAITKAFSNTEGGRARSPDISKVVIVVTDGRPQDSVRDVSARARANGIEL 174 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 F +G + + S S+++ + Sbjct: 175 FAIGVGR----------VDKATLRQIASEPQDEHVDYVESYSVIEKLSKKFQEAFCVVSD 224 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS---- 262 + + + Y +C + + + Sbjct: 225 LC------ATGDHDCEQVCLSSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFL 278 Query: 263 ---SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + LV+ + ++ ++ D + ++ V + G Sbjct: 279 IDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFHT 333 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + I T A++ D + + +K ++ TDG + Sbjct: 334 KKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTDGRS 387 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 KAK G ++ + L AS +F Sbjct: 388 QDYINN---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKT 438 Query: 437 LNKIFRDRIGNEIFERV 453 +N+ IG + +++ Sbjct: 439 INQ-----IGKRLQKKI 450 >gi|291452634|ref|ZP_06592024.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] gi|291355583|gb|EFE82485.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] Length = 658 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 60/178 (33%), Gaps = 15/178 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + RDA+ +V+ S+ V +D + V L R +K Sbjct: 71 TRIEAARDAVGTVVDSLPDGYPTGLRVYGADR---TSGCTDTRLARPVEPLDRDAMKKAV 127 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I +++ A + + ++ ++L++DGE+ + Sbjct: 128 AGVEPKGDTPIGLSLRKAVADL--------PEPEPGAVGRRTVLLISDGEDNCGSPPPCE 179 Query: 389 ICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + G+ RI I F V +E+ A ++++A L + + Sbjct: 180 AAEELAESGLDLRIDAIGFQVEGKAREELTCVAE--AGHGAYYDAPDAEALARQLQRA 235 >gi|126153367|gb|AAI31711.1| MATN1 protein [Homo sapiens] Length = 480 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 45/440 (10%), Positives = 117/440 (26%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 28 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 87 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + +S VV R + Sbjct: 88 LLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSVQD 147 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + + S S+++ + Sbjct: 148 VSARARASGVELFAIGVGSV-DKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 206 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C++ + + Sbjct: 207 VSDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 260 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ ++ ++ ++ D + ++ V + G Sbjct: 261 VFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 315 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 316 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 369 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 370 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 420 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 421 FKTINQ-----IGKKLQKKI 435 >gi|4505111|ref|NP_002370.1| cartilage matrix protein precursor [Homo sapiens] gi|115556|sp|P21941|MATN1_HUMAN RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1732121|gb|AAB38702.1| cartilage matrix protein [Homo sapiens] gi|56205026|emb|CAI19322.1| matrilin 1, cartilage matrix protein [Homo sapiens] gi|182887817|gb|AAI60064.1| Matrilin 1, cartilage matrix protein [synthetic construct] gi|189066540|dbj|BAG35790.1| unnamed protein product [Homo sapiens] Length = 496 Score = 63.1 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 45/440 (10%), Positives = 117/440 (26%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + +S VV R + Sbjct: 104 LLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSVQD 163 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + + S S+++ + Sbjct: 164 VSARARASGVELFAIGVGSV-DKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 222 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C++ + + Sbjct: 223 VSDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 276 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ ++ ++ ++ D + ++ V + G Sbjct: 277 VFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 331 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 332 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 385 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 386 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 436 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 437 FKTINQ-----IGKKLQKKI 451 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 109 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTPDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|159900556|ref|YP_001546803.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893595|gb|ABX06675.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 423 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 58/180 (32%), Gaps = 25/180 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + V++A +V+ + + D + FNDR + +K Sbjct: 63 RLYQVKEACNNVVNQLNRQDYFS------VVSFNDRAEVVVPC---QRPNDKDQIKRAIG 113 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + M I R + + +VLLTDG D + I Sbjct: 114 MIEAKGGTEMATGMMMGLQEIS--------RPMMSRGISR-MVLLTDGRTYGDESRCVEI 164 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF--EANSTHELNKIFRDRIGN 447 +A+S+GI I + L AS + + ++ +F D I Sbjct: 165 ARRAQSKGIGITALGIGDEWN-----EDLLETIASAENSRTEYITNAQQIVNVFSDEIKR 219 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/283 (12%), Positives = 76/283 (26%), Gaps = 24/283 (8%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 S + S+ + +N F + + + Sbjct: 153 VDTGSYSNVRRFLNAGSLPPADAVRVEELINYFRYDDPAPTNG-QPFAVRTELATTPWNK 211 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + + P + VD S K L++ +L ++R ++ D Sbjct: 212 DSLLLRVGIAGRDIATADLPPANLVFLVDVSGSMDAPDKLPLLQSSLKLLVRQLRAQD-- 269 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+ + + G + + GSTA ++ AY Sbjct: 270 ----RITLVTYAGNTSVVLPPTPG---DQQGRIVEAIDALQSGGSTAGASGIELAYK--- 319 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVN 408 + + R N I+L TDG+ + + + GI + T+ F Sbjct: 320 -AAQQGYLRGGINR-----ILLATDGDFNVGVTNFDQLKGMVAEKRRSGIALSTLGFGTG 373 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ ++ E K+ +G + Sbjct: 374 NYNDNLMEQLAD--AGDGAYAYIDTALEARKVLTHELGATLAT 414 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 54/166 (32%), Gaps = 22/166 (13%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + DR+ + + + +T +K F + T ++ + A + + Sbjct: 80 VSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDMSVGLDEAVKVLKQGMDPAHA 139 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNE----------EGIAICNKAKSQGIRIMTIAFSVNK 409 IV+L DG N D + +AK GI I TI + + Sbjct: 140 P---------MIVVLADGNNDLDPNTGRTSKEASEQLNQAVKEAKGSGIPIYTIGLNADG 190 Query: 410 T-QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +E + F +S +L +I + + +++ Sbjct: 191 KLNKETLAELAKQ--TGGKSFTTSSADDLPQILSEIFASHQQLKIV 234 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 46/416 (11%), Positives = 111/416 (26%), Gaps = 43/416 (10%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S ++ + S + + ++ V+ A + Sbjct: 69 SEATPAAESTPSENVSEGSGNPAAESAPAEQSTEQSVEQLVKQSEEQKAGSADEPVMELA 128 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 ++ + + + ++ + + + ++ D + + Sbjct: 129 TPSEAQPATTSATPTQKPTGTENPTTVERSVQSDDDDADTVANQNEAKDDETKDNADKTV 188 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 S L + S + G KL + S Sbjct: 189 HLGIASYRGMLKSASAGLSTPEHTKSIEYQGNGAYTLKLDVTGKDASTSTTDTTPIDIAL 248 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR----SIKKIDNVNDTVRMGATFFNDRV 306 D R K ++ A+ S + + I++ N+ V++ + +++ Sbjct: 249 VLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNNKVKVALVKYANQI 308 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR------ 360 + + G T + T ++T+ + + ++ Sbjct: 309 GTATG-ADGCRISNSRQSDTGNCTQIVQELTTDAGLLKTSVNGLQAAGATYADAAMEVAQ 367 Query: 361 ---MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAF----- 405 AKKY++ TDGE + + N A K+ G + +I Sbjct: 368 QALAGGRAGAKKYVIFFTDGEPNHWSGFDDDVANAAIKKSQELKNAGTTVYSIGIFDGAN 427 Query: 406 -----------------SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + L + AS + ++ A+S +L +IF D Sbjct: 428 PSASVSSASNANKFMHGISSNYPNATGYRSLGDRASGDYYYSASSATQLAQIFNDI 483 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/320 (11%), Positives = 91/320 (28%), Gaps = 24/320 (7%) Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFS 186 + + L + + + ++ +E + S + S Sbjct: 63 MMAMSAPAPMPTSLPSQLDRDKYEDVDTNPVKLVSEDPVSTFSIDVDTASYSNVRSFLND 122 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 ++ + +N F V + + + + Sbjct: 123 GNLPPKDAVRIEELINYFDYDYPIPASKDVPFATHVNVVPAPWAEGKQLMHVGIKGYDLD 182 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P L+ VD S + K L + AL +I + + D+++ + Sbjct: 183 RTEQPPLNLTLLVDVSGSMNHEDKLPLAKKALKLLIDKMDEDDHIS------VVVYAGAA 236 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + G ++ + + + GSTA + ++ AY + + Sbjct: 237 GTVLEPTKGSE---KSKIFAALDNLSAGGSTAGGEGLRLAYSL--------AEQNYDAAS 285 Query: 367 AKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + + LLTDG+ +E + + G+ + + F + A Sbjct: 286 VNRVM-LLTDGDFNVGVTSDERLEDFVARKRESGVYLSVLGFGRGNYNDAMMQKIAQ--A 342 Query: 424 SPNSFFEANSTHELNKIFRD 443 ++ +E K+ D Sbjct: 343 GNGMASYIDTLNEARKVLND 362 >gi|288942712|ref|YP_003444952.1| Pyrrolo-quinoline quinone [Allochromatium vinosum DSM 180] gi|288898084|gb|ADC63920.1| Pyrrolo-quinoline quinone [Allochromatium vinosum DSM 180] Length = 1059 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/280 (11%), Positives = 71/280 (25%), Gaps = 19/280 (6%) Query: 178 SIQWVIDFSRSMLDYQRDSE-GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP----Y 232 ++ +++D S SM DY R ++ + + D + Sbjct: 49 NVLFILDESGSMDDYNRMTDLKNSMKTLLDNPMMDHVQAAIMGYTTANTDIYYNNRGWNT 108 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + N SL L+ + + + + + Sbjct: 109 YANVNGSLAIKLHSDFKTVGEGSNRSAMKAIVDGLSPRASTPTVWALKLGMEWFDSRATQ 168 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 S + + + D + + T+Y Sbjct: 169 IEGTPSDRTRATSPTYSSPMSDPAYWCVPNHL-VLLTDGEPNSNDPNYYGL-TSYKGTTC 226 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K A+ G N + ++ +Q + TIA + Sbjct: 227 VKDSTSIEEKGRCAAEIAAW----GYNN--DLRTDSVWED--NQNVTTHTIALGDSLGSN 278 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 + + FL N A ++EA++ L F + + Sbjct: 279 D--KAFLKNIAIKGGGGYYEADNASTLLNAFETIVDDAKS 316 >gi|183222779|ref|YP_001840775.1| putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912810|ref|YP_001964365.1| hypothetical protein LBF_3320 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777486|gb|ABZ95787.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781201|gb|ABZ99499.1| Hypothetical protein; putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 550 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 63/182 (34%), Gaps = 24/182 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V DA +++ + + D V+ ++ V + K+ V Sbjct: 58 KMEAVIDASCALVNWLTRHDAVS------IVAYSADVQLIQPVTHLTEKVS---VTDKIR 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + ST ++ +A ++ S A K ++LLTDG T ++ A+ Sbjct: 109 NIQVATSTNLSGGWLSALKSLNQS---------KIPNAYKRVLLLTDGNPTSGIKDKEAL 159 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A GI TI + ++ A +F+ ++ + IF + G Sbjct: 160 VTIAADHLSMGISTTTIGVGNDFNEEMLVEI---AKAGGGNFYYIDNPENASDIFFEEFG 216 Query: 447 NE 448 + Sbjct: 217 DI 218 >gi|319954909|ref|YP_004166176.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319423569|gb|ADV50678.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 703 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 87/276 (31%), Gaps = 24/276 (8%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S + + S + + +N F + + + + + Sbjct: 270 YSNIRRLINAGSQIPIDAVKIEEMINYFDYDYPQPL-GEHPFSINLEAAKTPWNAATKIV 328 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 L Y P+ + +D S + K L++ A ++ +++ D V+ Sbjct: 329 RIGLQGKEYLNEDLPASNLTFLIDVSGSMNNQNKLPLLKSAFKLLVHQLREKDRVS---- 384 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + G + + + + GSTA + ++ AY E Sbjct: 385 --IVVYAGAAGVVLEPTGG---NDKEKIIKALDNLSAGGSTAGGEGIELAYAL----AEK 435 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 KNN +++ TDG+ +++ + + + GI + + F + + Sbjct: 436 NFKPNKNNR-----VIMATDGDFNVGASSDKDMETLIEEKRKSGIFLSVLGFGMGNYKDS 490 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + ++ E KIF + G + Sbjct: 491 KLEKLADK--GNGNHAYIDTMQEAQKIFGEEFGGTL 524 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 109 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 56/178 (31%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +KK D + VR GA + D ++ Sbjct: 855 DYDEYNIMKDFMTDLVKKADVGKNQVRFGALKYADDPEVLFYL--DTLGTKWEVISVLQN 912 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ G+T +A+ + E + + ++++TDGE+ ++ Sbjct: 913 DQPMGGNTYTAEALAFSDHMFT-----EARGSRLQRGVPQVLIVITDGESHDADKLNAT- 966 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI I+ + L+ S + +F + L IF D + Sbjct: 967 AKALRDKGILILAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSAS 1019 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 46/142 (32%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + A + ++ D + VR+GA F+ + + ++ Sbjct: 1047 PDDFKKMKAFLASVVQDFDVTVNRVRIGAAQFSHNYQPEFPLGTFTGEEEISLQIEKIQQ 1106 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I A++ + N + +++LTDG Q +E Sbjct: 1107 IF--GYTHIGAALRRVGRYFRP-----DMGSRINAGTPQVLLVLTDG---QSQDEVAQAA 1156 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1157 EDLRRKGINIYSVGIGDVDDQQ 1178 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 41/122 (33%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 714 MSQSEISDAIDRMAHIGETTLTG------HALTFVSQYFSPAKGARPNVRKFLILITDGE 767 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 ++ + +GI I ++ F N TQ L + P F + Sbjct: 768 AQDIVKDP---AVALRQEGIIIYSVGVFGSNVTQ-------LEEISGRPEMVFYVENFDI 817 Query: 437 LN 438 L Sbjct: 818 LQ 819 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 58/177 (32%), Gaps = 21/177 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S+ +D + +R+G F++ S S G+ ++ V + + + A A Sbjct: 289 SVSALDIKENCMRVGLVAFSNETKVINSLSRGI---NKSEVLQYIQNLSPQAGNAYTGA- 344 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A I + + N + VL+T + + QG+ I T+ Sbjct: 345 --AIRKIRKEVFGAQNGSRKNQGVPQIAVLVTH---RPSEDNVTKAAVNLRRQGVTIFTM 399 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFERVI 454 Q L AS + + S +L F ++ N+I + Sbjct: 400 GIEGASDTQ------LEKIASHPAEQYVSKLKSFSDLAAHNQTFLKKLRNQITHTLS 450 >gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio] Length = 868 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/282 (11%), Positives = 79/282 (28%), Gaps = 20/282 (7%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQP-LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + V D + + + C G D N I D ++ Sbjct: 182 PLVDKTVTDKKAHVSFSPTLDQQRKCTECDGTLIDGDFFITYDVNRPHDIGDIQIVNGYF 241 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 +++ +D+S K ++AL +++ + + Sbjct: 242 -----VHFFAPANLPRVPKMVVFVIDNSYSM-YGNKMAQTKEALGTILGEL------PED 289 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 F+ + + + + + +G T ++DA + + ++ Sbjct: 290 DYFAIIVFSTTFVVWRPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQ 349 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 + ++LLTDG+ Q I + I + +AF + Sbjct: 350 RNGTAP----KNMVLMMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDADY 405 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 N +E + K F + + + + V Sbjct: 406 GFLDTLSKQNNGIVRRIYEDSDAPLQLKGFYEEVSSPLLSEV 447 >gi|194386850|dbj|BAG59791.1| unnamed protein product [Homo sapiens] Length = 543 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/246 (12%), Positives = 67/246 (27%), Gaps = 14/246 (5%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 V + D K Y ++ D + ++ Sbjct: 30 VDYEDLVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSGSMWGVKMK 89 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 + ++ + + FN + + + K K + G Sbjct: 90 QTVEAMKTILDDL---RAEDHFSVIDFNQNIRTWRNDLISATKTQVADAKRYIEKIQPSG 146 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 147 GTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLSKIQKNVKE 202 Query: 396 Q---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 I + ++ + R N + T K F + ++ Sbjct: 203 NIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN----QVSTP 258 Query: 453 VIRITK 458 ++R + Sbjct: 259 LLRNVQ 264 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ +L ++ + + D R+ F+ + Sbjct: 202 KIESVQQSLVQLLNFLSEKD------RLCLIVFDGGAKRHTPLKTLTEGNKKYFKGA-IA 254 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + GST I A+ I I LL+DG+++ Sbjct: 255 AISAGGSTNIAAGTDIAFQQIQQRKMKNQVTS---------IFLLSDGQDS-------GA 298 Query: 390 CNKAKSQGIRIM-TIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIG 446 + + Q RI + + +S SF+ + L++ F D +G Sbjct: 299 AERIQKQKDRISDVVTIHSFGYGNDHDADLMSKICKVGQGSFYYIENVKLLDEFFADALG 358 Query: 447 NEIFE 451 Sbjct: 359 RLSSA 363 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 56/198 (28%), Gaps = 30/198 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ A+ +I+ + +D R+ FN + Sbjct: 112 DVSGSMRDFGRLDKLKSAMRFIIKKLAPMD------RLSVVTFNGGATRECPLRAMSEDA 165 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + ++ G T I ++ + ++L++DGE Sbjct: 166 VP-VLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTGARTAG--------VILMSDGEQ 216 Query: 380 TQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE---ANSTH 435 + + ++ Q + T++F N L A + + Sbjct: 217 NSGD------ATRVRNPQNYPVYTLSFGSNADMN-----LLQKLAGGGGTYNPVLDSGGM 265 Query: 436 ELNKIFRDRIGNEIFERV 453 + +F + + V Sbjct: 266 SMLDVFSQLMAGLLTVVV 283 >gi|296207278|ref|XP_002750612.1| PREDICTED: cartilage matrix protein [Callithrix jacchus] Length = 496 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/248 (10%), Positives = 66/248 (26%), Gaps = 35/248 (14%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-------SLRHVI 268 + + + Y +C + + + S Sbjct: 229 TGDHDCEQVCISSPGSYTCACREGFTLNTDGKTCNVCSGGGGSSATDLVFLIDGSKSVRP 288 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + LV+ + ++ ++ D + ++ V + G + I Sbjct: 289 ENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFHTKKDIKAAVR 343 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T A++ D + + +K ++ TDG + + Sbjct: 344 NMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTDGRSQDYIND--- 394 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRI 445 KAK G ++ + L AS +F +N+ I Sbjct: 395 AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKTINQ-----I 443 Query: 446 GNEIFERV 453 G ++ +++ Sbjct: 444 GKKLQKKI 451 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 57/168 (33%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + + + + T A+ Sbjct: 68 IESLDVGPNATRVGVVNYASTVKQEFPL---RAHVSKAALLQAVRRIQPLSTGTMTGLAI 124 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A I + D + + K ++++TDG QD+ ++ +A++ G+ + I Sbjct: 125 QFA---ITKAFSDAEGGRSRSPDISKVVIVVTDGRP-QDSVRDVSA--RARASGVELFAI 178 Query: 404 AFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + L AS S +L++ F++ Sbjct: 179 GVGR------VDKATLQQIASEPQEEHVDYVESYNVIEKLSRKFQEAF 220 >gi|119628047|gb|EAX07642.1| matrilin 1, cartilage matrix protein, isoform CRA_a [Homo sapiens] Length = 496 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 45/440 (10%), Positives = 117/440 (26%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + +S VV R + Sbjct: 104 LLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSVQD 163 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + + S S+++ + Sbjct: 164 VSARARASGVELFAIGVGSV-DKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 222 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C++ + + Sbjct: 223 VSDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 276 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ ++ ++ ++ D + ++ V + G Sbjct: 277 VFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 331 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I T A++ D + + +K ++ TD Sbjct: 332 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 385 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 386 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 436 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 437 FKTINQ-----IGKKLQKKI 451 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/370 (10%), Positives = 96/370 (25%), Gaps = 33/370 (8%) Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 N + + + + + S ++ + S+ + Sbjct: 31 QDISNANEPSSKIIQSPEVVEAAPVILVESNMSYSKMAAPTSIRAPSPMPSIGTSISNSE 90 Query: 155 SWLIQTKAEAETVSRSYHKEHGVS-------IQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 ++ + + + S+ + +N F Sbjct: 91 NYAEIERNAVHATHEQAFATLSIDTDTGSYANVRRFLNNGSLPPTDAVRVEELINYFNYD 150 Query: 208 -ADRTVKSYSSQNGKVGIRDEKLSPYM--VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + + + V +L P+ + VD S Sbjct: 151 FKNAKKQGNAPFLVTTEMVKSPWHATNRIVKVGIKAEDVLAAKQNQPAANLVFLVDVSGS 210 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + K L + +L + + ++ D + + + G + Sbjct: 211 MNSDDKLQLAKASLKMLTKQLRAQDTIT------LITYAGNTEVVLPATSG---NQTQKI 261 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + + GST A++ AY E+ + N I++LTDG+ Sbjct: 262 LNAIDNLSANGSTNGEAAIKLAYQ----QAEENFKKQGINR-----ILMLTDGDFNVGVS 312 Query: 385 EGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + +GI + T+ F N ++ +S E K+ Sbjct: 313 NVKDMLDIIRNNRDKGISLSTLGFGQGNYNDHMMEQVADN--GNGNYSYIDSLSEAKKVL 370 Query: 442 RDRIGNEIFE 451 D + + Sbjct: 371 IDEMSSTFNT 380 >gi|319777804|ref|YP_004134234.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171523|gb|ADV15060.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 343 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/246 (9%), Positives = 57/246 (23%), Gaps = 4/246 (1%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 + ++ ++ G VD R +QA A+++ + S Sbjct: 1 MAPLFVFMLAPIITAIGFSVDYTRAVQTRSNEQQALDAAVLSITGMDTTSTLAQRQTMLQ 60 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL-SSRYDLL 140 + + +F + T + + T R + V + S+ Sbjct: 61 DTFIANHGLGTPTLNSFVVSANGTATAQAMASYSMPTVFMQIARINTVPVAVGSAASKTP 120 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH---KEHGVSIQWVIDFSRSMLDYQRDSE 197 + F S W T + + D Sbjct: 121 ALVQTTFKVSKVSGWWNKTMTLYGTTFGATVAKPLMSIEYTYNGFGDPKGYGTTNVYTIT 180 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 V + + + G+ + + + S + P + + + Sbjct: 181 NNGGADIKTLVQSQVCTTAGVSTFTGLTADAILQTSGTKKYSTTCLNTMYPANSAGASID 240 Query: 258 FVDSSS 263 + Sbjct: 241 VNQMAG 246 >gi|117618496|ref|YP_856674.1| von Willebrand factor type A domain-containing protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559903|gb|ABK36851.1| von Willebrand factor type A domain protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 337 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 57/184 (30%), Gaps = 37/184 (20%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ I + R+ F D + + L+ + E Sbjct: 110 SAVRQQIDRLIAARPG---DRIALIVFADHAYLLSPLTQETNALL--GLTRELDFELVGR 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 +TA+ +A+ A H+ A ++L+TDG NT + + + +A + Sbjct: 165 TTALGEAILLARQ----------HKAPERSTA---LLLVTDGRNTAGSADPLREAKQAAA 211 Query: 396 QGIRIMTIAFSVNKT-----------------QQEKARYFLSNCA--SPNSFFEANSTHE 436 GIR+ T+ + E L A +F A + + Sbjct: 212 AGIRLYTLGVGADPDTFAEAMTPAQTPAQSDPSAELDEALLQQLAEVGHGRYFRARTQGD 271 Query: 437 LNKI 440 L I Sbjct: 272 LEAI 275 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 63/158 (39%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + N K+ ++++TDG++ ++ ++ + +GI I + Sbjct: 899 HANALFT-----EEHGSRINQNVKQMLIVITDGKSHDHDQLNDT-ASELRDKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 953 VG------KANQKELEGMAGNKNNAIYVDNFDKLKDVF 984 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 47/151 (31%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAILNIKQITGGTYTGRALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG + + + +++ I I + Sbjct: 533 KNGMKD------RMSKVPCYLIVLTDGM---SADRVVEPAKRLRAEQITIHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + L A + L I Sbjct: 581 ---ANKIELQEIAGKEERVSFGQNFDALKSI 608 >gi|330502932|ref|YP_004379801.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328917218|gb|AEB58049.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 566 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/304 (11%), Positives = 84/304 (27%), Gaps = 29/304 (9%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + T S +++ + + + +N F + Sbjct: 106 AETPVSTFSIDVDTGSYANVRRFL-NDGQLPPKDAVRLEELVNYFPYAYPLP-QGDVPFG 163 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + +P ++ P + VD S H + +V+ L Sbjct: 164 VSTELAVTPWNPQTRLLRIAIKASDRSVEELPPANLVFLVDVSGSMHRREGLPMVQGTLK 223 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ ++ D V+ + + ++ GSTA Sbjct: 224 LLVDQLRPQDRVS------LVTYAGDSQV---LLDSAPGSDKAKIRAAIDQLTAGGSTAG 274 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ--- 396 +Q AY +S + I+L TDG+ + ++ A + Sbjct: 275 ESGIQLAYQ--QASKHLIDGGINR-------ILLATDGDFNVGISDFDSLKQLAADKRKS 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE----LNKIFRDRIGNEIFER 452 G+ + T+ F V+ + A ++ ++ E L + + Sbjct: 326 GVSLTTLGFGVDNYNERLMEQL--ADAGNGNYAYIDNLREARKVLVDQLSSTLATVASDV 383 Query: 453 VIRI 456 +++ Sbjct: 384 KLQL 387 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 57/196 (29%), Gaps = 25/196 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + + + + +G F S Sbjct: 96 SNSMAATDITPTRLEAAKQGAQAFVDQL------PPRINLGLVSFAGSAAVLVPASTDRE 149 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +TA+ + + + I ++ + + IVLL+DG Sbjct: 150 SVRS-----GIRGLQLGPATAVGEGIYASLQAIATAGQ--RLSDEGQPPPPAAIVLLSDG 202 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPN 426 E T+ A+ I + TIA+ + + L A + Sbjct: 203 ETTRGRPNT-QAATAARDAEIPVDTIAYGTSDGTLDVGGQQIPVPVNEEALRELADQTGG 261 Query: 427 SFFEANSTHELNKIFR 442 S+ A S EL ++R Sbjct: 262 SYHRATSGDELQSVYR 277 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 61/206 (29%), Gaps = 26/206 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + +D + I + + +G F+ + Sbjct: 96 SNSMAATDVAPNRLAAAKDGADAFIDQL------PPRINLGLVSFSGSAALLVPPTTDRQ 149 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + STAI + + I ++ E IVLL+DG Sbjct: 150 SVRS-----GIHGLQLGPSTAIGEGIFAGLQAITTAGEQLA--ADGGTPPPAAIVLLSDG 202 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNC--ASPN 426 E + A A+ G+ + TIA+ + L A+ Sbjct: 203 ETQRGRPNAQAA-QAARDAGVPVDTIAYGTADGSLDVGGQEIPVPVNEDALREIARATDG 261 Query: 427 SFFEANSTHELNKIFRDRIGNEIFER 452 S+ A S EL ++ + G+ I R Sbjct: 262 SYHRAASGDELRSVYENL-GSSIGYR 286 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 54/149 (36%), Gaps = 15/149 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + +D +D VR+G +ND + H L +++ G T Sbjct: 256 NFLYSVVSGLDVSSDRVRVGLAQYNDDIYPAFQL--NQHPLKSMVLEQIQNLPYRTGGTN 313 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A++ + ++ + + ++L+TDGE N+E ++ K GI Sbjct: 314 TGNALEFIRTSYLTEGSGSRAK----DRVPQIVILVTDGE---SNDEVQEAADQLKEDGI 366 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + +V + L AS Sbjct: 367 VVYVVGINV------QDVQELQKIASEPF 389 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 16/165 (9%), Positives = 45/165 (27%), Gaps = 17/165 (10%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + + D++++G F+ + + K+ + T A Sbjct: 1049 SMVNQSNIGTDSIQIGLLQFSSIPKEEFRLNRYSSKVDIYNAIFAVQQMRD--GTRTGKA 1106 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + ++Y++++TDG + + + I I Sbjct: 1107 LNFTL-------PFFESSKGGRPSVQQYLIVITDGVAQDNVILP---AKALRDKNIIIFA 1156 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 I + K L + + L + ++ + Sbjct: 1157 IGVG-----EAKKSQLLEITNDEDRVYYDVDFEVLQNLEKEILSK 1196 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 33/93 (35%), Gaps = 10/93 (10%) Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 +Y++++TDG+++ ++ G+ I I +E L A Sbjct: 940 NVARYLIVITDGKSSDP---VADAAEGLRASGVIIYAIGV------REANIDELKEIA-K 989 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + F L I ++ + + + +K Sbjct: 990 DKIFFVYEFDLLKDIQKEVVRDICSSEACKNSK 1022 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + Sbjct: 109 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVPPTTDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|126340361|ref|XP_001365240.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain2 [Monodelphis domestica] Length = 951 Score = 62.7 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 60/187 (32%), Gaps = 17/187 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +A+ +++ ++ D + N R + K T K + Sbjct: 331 VEAMKTILDDLRAEDQFSVVDFNHNVR-NWRDDLVLA-----SKAQITDAKKYIEKIQPN 384 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T IN+A+ A + ++ + + I+L++DG+ T + I K Sbjct: 385 GGTNINEALLRAIFILNEASN----LGMLDPNSVSLIILVSDGDPTVGELKLSQIQKNVK 440 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + ++ + R N + T K F + ++ Sbjct: 441 QSMQDNISLFSLGIGFDVDYDFLERLSQENHGVAQRIYGNQDTSLQLKQFYN----QVST 496 Query: 452 RVIRITK 458 ++R + Sbjct: 497 PLLRNVQ 503 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 55/447 (12%), Positives = 116/447 (25%), Gaps = 57/447 (12%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 + L+ + SV S + K L + T Sbjct: 403 KTVLETDIDR--LPDSVKFGFSASTGADHQVQLIRGLKVYSVNKLNQLDLVKSVNKDTYP 460 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + + E+ ++ +++N ++ S G + + Sbjct: 461 DADSHTFNVGDEVPYTFVVRNSGTTTLSKVVVNDPNITDVSCGTGTLASNQQTTCSGTLT 520 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 G + + + D + P A + + + D Sbjct: 521 LTEGMVGSEGHFTNTATATGTDTDGNIVNSPEASVTIKAIKPLGAPEKHKRIKKNADNTY 580 Query: 230 -SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI--- 285 V+ S + P+D +L + + K ++ A+ + + Sbjct: 581 TVNVDVTGAASSSTITTTQPVDFTLVLDVSGSMRENMGSVTKLQALQSAVNNFLDEAAKI 640 Query: 286 -KKIDNVNDTVRMGATFF--------NDRVISDPSFSWGVH------KLIRTIVKTFAID 330 K + ++ VR+G F ++ D ++ +K Sbjct: 641 NKGAQSGSEPVRVGLVKFAGNATKKIGNKTYQDKWNTYNYSQIVKKLTADTDGLKNEVNK 700 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T + Q A+ + + EAKK ++ TDG+ T + + Sbjct: 701 LTAGGATRADYGFQHAFTVMSEA----------RTEAKKVVIFFTDGKPTSEKTFDGKVA 750 Query: 391 NKA-------KSQGIRIMTIAFSVNKTQQEK---ARYFLSNCAS---------------- 424 N A K G + +I F+ +S Sbjct: 751 NDAVEYAKQLKDSGAIVYSIGVFDGANPASTATSENKFMHAVSSNYPNAANYEDLSEGSN 810 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFE 451 + A LN IF + +E Sbjct: 811 AGYYKTATDASGLNSIFEEIRKSETTT 837 >gi|325964121|ref|YP_004242027.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] gi|323470208|gb|ADX73893.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] Length = 345 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 34/112 (30%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G +I A+LM +LG + VD+ L+ A + I + + V Sbjct: 11 EGGAISVIVAILMVALLGFVAIAVDIGVIYSERAQLQNGADASAIAVAQKCARDATGVDC 70 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ + + + + T R+V V + Sbjct: 71 STTSALASGLANRNALDGMSKVHTIDLDKTTRKVSVTTSAKEVGGADNSVSL 122 >gi|218670888|ref|ZP_03520559.1| hypothetical protein RetlG_04158 [Rhizobium etli GR56] Length = 133 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 43/111 (38%), Gaps = 1/111 (0%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + +LI G II L+ ML G +D +R L+ AA +A++ A+ Sbjct: 2 SFLSRLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAKY 61 Query: 69 IQSLEEVSSRAK-NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + E + N+F + E + E ++ +V D V T Sbjct: 62 KSGMPEATIAKTINAFLSANGEFETAVAGKPEVASDESELCLDVADAVPTT 112 >gi|73949158|ref|XP_535195.2| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Canis familiaris] Length = 946 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 60/183 (32%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + + T K + G T Sbjct: 328 QTVEAMKTILDDLRAEDQFSVIDFNHNVRTWRNDLVSATRTQVTDAKKYIEKIQPSGGTN 387 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + +N + ++ I+L++DG+ T + I K Sbjct: 388 INEALLRAIFILNEANNLGLLDPESVS----LIILVSDGDPTVGELKLSKIQKNVKQHIR 443 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 444 DNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 499 Query: 456 ITK 458 + Sbjct: 500 NVQ 502 >gi|241554201|ref|YP_002979414.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240863507|gb|ACS61169.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 706 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 45/397 (11%), Positives = 103/397 (25%), Gaps = 29/397 (7%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-- 118 + + L ++ S T + + T R Sbjct: 142 SVAVAQALQDKEAPALAKPDASQTSEYDANAALTNKPEGSAAALGATKRAAPAAPGIVPQ 201 Query: 119 ---AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 A M + R + L+P + T S Sbjct: 202 RQFAEPMAAIAPSPVPPAEGRMQMQLDPNRERFANAAANPIKSVATDPVSTFSADVDSAS 261 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 ++ + +M D + +N F + V + + Sbjct: 262 YAFVRRSL-TGGAMPDPLSVRVEEMINYFPYDWPGPNNADQPFKATVTVMPTPWNRDTEL 320 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + ++ P + +D S K L++ A ++ +K D V+ Sbjct: 321 MHVAIKGYDIAPATTPRANLVFLIDVSGSMDEPDKLPLLKSAFRLMVNRLKADDTVS--- 377 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + + + + ++ + + GST + ++ AYD Sbjct: 378 ---IVTYAGNAGTVLAPT---RVAEKSKILSAIDRLEPGGSTGGAEGIEAAYDL------ 425 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + ++L TDG+ + + I + + GI + + F Sbjct: 426 ---AKQGFVKDGVNRVMLATDGDFNVGPSSDGDLKRIIEEKRKDGIFLTVLGFGRGNLND 482 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + N S ++ E K + G+ + Sbjct: 483 SLMQTLAQN--GNGSAAYIDTLAEAQKTLVEEAGSTL 517 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D +D R+ + V + K + + T Sbjct: 227 LSEMVDSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLST--GTMTG 284 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++TA D + + N K K +++TDG + A++ GI I Sbjct: 285 MAIKTAMDQVFTENAGARPLKKG---IGKVAIIVTDGRPQDK---VEEVSAAARASGIEI 338 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + L AS + F + +L FR+ + E + Sbjct: 339 YAVGV------DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEEARADLT 392 Query: 455 RI 456 Sbjct: 393 EN 394 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 59/190 (31%), Gaps = 25/190 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++A V+ ++ + D ++ V S + R F Sbjct: 310 KIEQAKEAAKFVLNNLNEGD------LFNIIAYDSDVESFEPELQKLDDKTREKALGFVD 363 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + GST I+ A+ A + Y++ LTDG T + I Sbjct: 364 NLYAGGSTNIDGALAKAMGMLKDDKRPS------------YMLFLTDGLPTHGEQNEAKI 411 Query: 390 CNKAKSQG-IRIMTIAFSVNKTQQEKARYFL-SNCASPNSFFEANSTHELN---KIFRDR 444 + AK + +R I+F V + L C + + ++ ++ Sbjct: 412 VDNAKQKNDVRARVISFGVGYDVNSRLLDRLSRECFGQSEY--VRPNEDIETHVAKLYNK 469 Query: 445 IGNEIFERVI 454 I + V Sbjct: 470 ISAPVMTNVA 479 >gi|180654|gb|AAA63904.1| cartilage matrix protein [Homo sapiens] Length = 340 Score = 62.3 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/304 (9%), Positives = 76/304 (25%), Gaps = 41/304 (13%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + + S S+++ + + + Sbjct: 23 GVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCVVSDLC------ATGDH 76 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKH 272 + Y +C++ + + S + Sbjct: 77 DCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENLE 136 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LV+ ++ ++ ++ D + ++ V + G + I Sbjct: 137 LVKKFISQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSY 191 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A++ D + + +K ++ TDG + + K Sbjct: 192 MEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTDGRSQDYIND---AAKK 242 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AK G ++ + L AS +F +N+ IG ++ Sbjct: 243 AKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKTINQ-----IGKKL 291 Query: 450 FERV 453 +++ Sbjct: 292 QKKI 295 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 59/203 (29%), Gaps = 30/203 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I V + +G F+ Sbjct: 294 DKSGSMSSSNRLNRMNQAAKYFLMQI-----VENGSWVGMVHFDSTASIRSDLIQITGSN 348 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I ++ A++ + IVLLTDGE+ Sbjct: 349 ERDKLLGSLPT-TASGGTSICSGIRRAFEVVRKLYSHTDGSE---------IVLLTDGED 398 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 C ++ K G I IA + + A+ F A E Sbjct: 399 N-----TAGACVDEVKQSGAIIHFIALGPSADKAVIEMST----ATGGVHFYATDEAENN 449 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F +I ++ +++ Sbjct: 450 GLIDAFGALASGNTDISQQSLQL 472 >gi|222478562|ref|YP_002564799.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451464|gb|ACM55729.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 491 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 61/234 (26%), Gaps = 36/234 (15%) Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 S +G+ + D N + S+ + +D+S + Sbjct: 186 DSDGPISGEGELADAIDVTVWYDENCNNILDADAEEAGDSVCVQLVIDTSGSMGG-SRIA 244 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + + +I + N T FN+ + + + + + + Sbjct: 245 NTKSGAKQLAETILDANPDNQVG---VTRFNNGASTPQQLTDDLDDV-----EAAIDGLS 296 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T + + + D + +V+ DG+ D Sbjct: 297 ASGGTNAQAGVDAGQAELENCPHDN-----------RVMVVFGDGDINTDGS-------A 338 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 AK G I I L + AS F A + +IF Sbjct: 339 AKVAGTEIFAIGVGGASFSD------LEDLASDPADEHVFFAIDDGAIEQIFGQ 386 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 50/155 (32%), Gaps = 22/155 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F FS+ V + R I +T+I+D + A + +S Sbjct: 135 DRVALVVFGSEAYFAAPFSFDVEAIARQIESAQI--GVSGRATSISDGLGIALKRMENS- 191 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-------V 407 A + ++LL+DG N + A G+R+ TIA Sbjct: 192 ----------EAASRVVILLSDGVNNAGATNPRGVAELAAQMGVRVHTIALGPKDLSSAD 241 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L + S F +T +L + Sbjct: 242 PGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|281353337|gb|EFB28921.1| hypothetical protein PANDA_013248 [Ailuropoda melanoleuca] Length = 471 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 38/377 (10%), Positives = 86/377 (22%), Gaps = 51/377 (13%) Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + I+ N R V + V N + L Sbjct: 100 TGTMTGLAIQFAITKAFSNTEGGRARSPDISKVVIVVTDGRPQDSVRDVSARARANGIEL 159 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 F +G + + S S+++ + Sbjct: 160 FAIGVGR----------VDKATLRQIASEPQDEHVDYVESYSVIEKLSKKFQEAFCVVSD 209 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS---- 262 + + + Y +C + + + Sbjct: 210 LC------ATGDHDCEQVCLSSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFL 263 Query: 263 ---SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + LV+ + ++ ++ D + ++ V + G Sbjct: 264 IDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFHT 318 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + I T A++ D + + +K ++ TDG + Sbjct: 319 KKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTDGRS 372 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 KAK G ++ + L AS +F Sbjct: 373 QDYINN---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKT 423 Query: 437 LNKIFRDRIGNEIFERV 453 +N+ IG + +++ Sbjct: 424 INQ-----IGKRLQKKI 435 >gi|167647386|ref|YP_001685049.1| von Willebrand factor type A [Caulobacter sp. K31] gi|167349816|gb|ABZ72551.1| von Willebrand factor type A [Caulobacter sp. K31] Length = 592 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 44/373 (11%), Positives = 97/373 (26%), Gaps = 29/373 (7%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + P + V A + ++ + Sbjct: 43 AALGYAAPAPDGYDSVVVTATKRTSREQRLTSR-------ARPVIATPGLTPPPPPPPPS 95 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VS 178 P ++ P+ + G + ++ AE + S + + Sbjct: 96 PPPPPAAYSFAAPSPVVAPNFAPPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVDTAAYA 155 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + + + +N F R V + S + Sbjct: 156 NVRRFLNEGAAPPHDALRVEELINYFDYGYARPTAQEPPFKPTVTVVPSPWSQDRQLMHI 215 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + P P L+ +D+S + L + AL +I ++ D V+ Sbjct: 216 GVQGYATPRAGQPPLNLVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRPQDRVS------ 269 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + + S + G KL ++ GSTA ++ AY + + + Sbjct: 270 MVAYAGSAGAVLSPTDGKSKLK---MRCALTALRSGGSTAGGQGLELAYALARQNLDPKA 326 Query: 359 HRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 ++L+TDG+ D + G+ + F Sbjct: 327 VNR---------VILMTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTMM 377 Query: 416 RYFLSNCASPNSF 428 + N ++ Sbjct: 378 QALAQNGNGTAAY 390 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 52/167 (31%), Gaps = 24/167 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + V++ L + K R+G F D F+ + + + Sbjct: 117 VSRLDAVKEVLDGFLAKRKG-------DRVGLVVFGDAAYLQAPFTTDLQLSQELLGECE 169 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA DA+ + E K I+ LTDG +T+ + Sbjct: 170 V--GMAGPRTAFGDAIGLGVNLFDEDTERA-----------KTIIALTDGNDTKSKVPPV 216 Query: 388 AICNKAKSQGIRIMTIAFSVNK--TQQEKARYFLSNCAS--PNSFFE 430 A + I+I T+A + + L + AS +F Sbjct: 217 EAARVATQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFF 263 >gi|197250621|ref|YP_002147271.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197214324|gb|ACH51721.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 598 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 46/400 (11%), Positives = 106/400 (26%), Gaps = 35/400 (8%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + + + + + + A Sbjct: 32 AELKQEQAIQKENSAQAGDDTVQKRQAEAAQQAAKKAAEYKANAEAKAASLADPKAGSLA 91 Query: 120 VEMNPRKSAYQVVLSSRY-DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GV 177 P+ ++S+ + R I+ ++A + S + Sbjct: 92 TAEAPQHEMRTRAVASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSY 151 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN----------GKVGIRDE 227 + + + + LN F P K +++ K + Sbjct: 152 ANVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPS 211 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + + P + +D+S ++ L+R AL ++ ++ Sbjct: 212 PWNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRA 271 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 DN+ G T T +K + + GST ++ AY Sbjct: 272 QDNITIVTYAGGTHVALAS---------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAY 322 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIA 404 + + + I+L TDG+ + I K + +GI + T+ Sbjct: 323 E--QAEKGFIKGGVNR-------ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 373 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++ +S E K+ +D Sbjct: 374 VGDDNFNEAMMVRI--ADVGNGNYSYIDSLSEAQKVLKDE 411 >gi|242034233|ref|XP_002464511.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] gi|241918365|gb|EER91509.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] Length = 584 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 60/191 (31%), Gaps = 29/191 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI ++ D R+ F++ + + + + K Sbjct: 163 SKLALLKQAMGFVIDNLGPAD------RLSIVSFSNDASREIRLT-RMSGDGKASAKEAV 215 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 GST I+ + A + + A ++LL+DG++ Q Sbjct: 216 ESLVADGSTNISRGLLVASEVLADR---------RYRNAVTSVILLSDGQDNQSGVGRNH 266 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + A S+ I T F + + A + +F + + Sbjct: 267 QNLVPPLFRDADSRPGSIHTFGFGSDHDAAA-----MHAIAEVARGTFSFVENLAVIQDS 321 Query: 441 FRDRIGNEIFE 451 F IG + Sbjct: 322 FAQCIGGLLSV 332 >gi|329851995|ref|ZP_08266676.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] gi|328839844|gb|EGF89417.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] Length = 399 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 45/172 (26%), Gaps = 2/172 (1%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + G+ +I+AL +L V +D +L+ AA + + Sbjct: 2 RRQFVQWAHCQGGNVAVISALCAAPLLYVITATIDHSAMVKDRFSLQAAADAGALMGAAK 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L +E+ A + + + + + Sbjct: 62 LALGSDELVFGAAEAAAHQQIGDLRNPVTFEVVVDRSTGAVTVTGRSQHAPLIGFMSDGP 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + + LL L+ G I +A H V++ Sbjct: 122 T-PISARATAEGLLKTPLCILQI-GSSEMKIDDQATVRAGGCLVHSNKDVTV 171 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 52/174 (29%), Gaps = 43/174 (24%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK--NNLEAKKYIVLLT 375 + + G T + M A I N ++++ +T Sbjct: 458 SARNADFVSRINALSPKGGTMHDIGMIWAGRLISPDGIFAADNASAPNGDPISRHVIFMT 517 Query: 376 DGENTQD----------------------------------NEEGIAICNKAKSQGIRIM 401 DGE N AIC +++ + I Sbjct: 518 DGEMGASPSNTTAYGNYDMDGRMAGFAASGSWTENQLAAIHNLRLEAICKAIRNKNVTIW 577 Query: 402 TIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVI 454 +IAF + + + CA+ + A ++ EL+ FRD G+ R++ Sbjct: 578 SIAFGLPHSAYTQG------CATGTSRALTAANSSELDSRFRDIAGSIAELRLV 625 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/321 (9%), Positives = 77/321 (23%), Gaps = 16/321 (4%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI---------QSLEEV 75 I A + ++GV G VD R + L+QA A + A L + + Sbjct: 4 IGAASIIPLVGVVGGGVDASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIPANIQD 63 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + FP + + D V T +++ + ++ Sbjct: 64 KADNFFDTNFPSGMYGTTNVGYTLSAGTATQMDGAATASVPTTLMKVF-NVPQIDIAVNC 122 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR-SYHKEHGVSIQWVIDFSRSMLDYQR 194 +L L + + L S ++ ++ + + + + Sbjct: 123 SAELDLPNIDVVLVLDMSGSMNSNGTTGSKRITALKNAVFSFYDVVMAAKPAGTRVRIGI 182 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQ-----NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 ++ + + + + + K + Sbjct: 183 VPYNGAVSVGDELLTLSTTTGIDYLADSWDYQTREPKWKQVSNNDGVEEGDILSETNVTE 242 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +S + + + + ++ + + ++D D Sbjct: 243 LLPRQPSQLGSGNSAHYHWNANKNAKKDKDCYDYAGQSYTVGSEIWEINSVTWDDEYWGD 302 Query: 310 PSFSWGVHKLIRTIVKTFAID 330 + I K Sbjct: 303 KWKKAEKAACVANITKKTIAG 323 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 43/186 (23%), Gaps = 6/186 (3%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 F K S G+F I TA M ++ VD++ S+ L+ + A + Sbjct: 2 FALKAFWSSERGNFAITTAFAMLPIMIGLAGAVDLIGTSHDASQLQNSLDAAGLAIGTKF 61 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + F + T + R S Sbjct: 62 SPDMAAGDVQQLGLQFFAANMSAADQQEYLGSVSAFAATASGSPSAYFISLSSSISRPSF 121 Query: 129 YQVVL------SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 S+ + + L S + + + Sbjct: 122 ISASAPWQAYRSASVKIKPGAQACVLALDPHASAAVNLQGSTNVSMDNCVIAANSDASDS 181 Query: 183 IDFSRS 188 ++ S Sbjct: 182 VNRGGS 187 >gi|261878616|ref|NP_001159907.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo sapiens] gi|261878618|ref|NP_001159908.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo sapiens] Length = 623 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 56/188 (29%), Gaps = 10/188 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++++ + + F RV S + + F Sbjct: 15 MRGQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFV 71 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T +N + + + E + +++LTDG+ T+ + Sbjct: 72 RGFSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQ 127 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I ++ + + F N + N +E + + + F ++ Sbjct: 128 ILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQV 187 Query: 446 GNEIFERV 453 + V Sbjct: 188 AKPLLVDV 195 >gi|221044458|dbj|BAH13906.1| unnamed protein product [Homo sapiens] Length = 623 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 56/188 (29%), Gaps = 10/188 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++++ + + F RV S + + F Sbjct: 15 MRGQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFV 71 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T +N + + + E + +++LTDG+ T+ + Sbjct: 72 RGFSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQ 127 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I ++ + + F N + N +E + + + F ++ Sbjct: 128 ILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQV 187 Query: 446 GNEIFERV 453 + V Sbjct: 188 AKPLLVDV 195 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 32/159 (20%), Positives = 61/159 (38%), Gaps = 15/159 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D + I++ + + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDNPEILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + N K+ ++++TDGE + D E K + +GI I + Sbjct: 899 HANALFT-----EEHGSRINQNVKQMLIVITDGE-SDDRVELNDTAAKLRDKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFR 442 + + L A N+ ++ +L I+ Sbjct: 953 VG------KADQKELEGMAGNKNNTIYVDNFDKLKDIYA 985 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 51/151 (33%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G F+D++ + S + + + + K + G T +A+ Sbjct: 475 GPDKVRVGVVQFSDKMRVEFSITDYSNDID--LRKAILNIQQLTGDTHTGEALDFIL--- 529 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + + Y+++LTDG++ E + +++ I + I Sbjct: 530 ---PRIKNGIKERMSQVPCYLIVLTDGKSQYSVVEP---AKRVRAEQITVHAIGIG---- 579 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 E + L A + L I Sbjct: 580 --EANKKELQEIAGKEERVSFGQNFDALKSI 608 >gi|218517234|ref|ZP_03514074.1| hypothetical protein Retl8_28685 [Rhizobium etli 8C-3] Length = 176 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 50/191 (26%), Gaps = 33/191 (17%) Query: 31 PVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 ML G D +R ++ A+I A + + + + + ++ Sbjct: 1 MPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQI-NNTGDTDALKLKVTDWFHAQV 59 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 E + N + + F++ Sbjct: 60 ENSYTLGEIDIDTTNH---------------------------NITATASGTVPTTFMKI 92 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 I + + + + + SY +++ VID S SML + + Sbjct: 93 ANIDTVPVSVASAVKGPATSY-----LNVYIVIDTSPSMLLAATTAGQATMYSGIGCQFA 147 Query: 211 TVKSYSSQNGK 221 S + G Sbjct: 148 CHTGDSHKVGN 158 >gi|227818616|ref|YP_002822587.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] gi|36958872|gb|AAQ87297.1| Hypothetical protein RNGR00523 [Sinorhizobium fredii NGR234] gi|227337615|gb|ACP21834.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] Length = 140 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 36/106 (33%), Gaps = 4/106 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + ++ +S +G + AL+ +L + +++ R Y + + AA A + Sbjct: 3 LSLFLRRAWRSQSGATAVEFALVCFPLLLLVLGVIEFGRAFYVRNDMSYAADVA----AR 58 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 ++ + + + + L+ T + V Sbjct: 59 EVLIGKIARDAPDSEAQAKLASAVRDSFDSGDPARLEIAVTKQTVD 104 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 36/289 (12%), Positives = 78/289 (26%), Gaps = 24/289 (8%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + S + S+ + +N F + + Sbjct: 46 STFSIDVDTGSYSNVRRFLNAGSLPPADAVRVEELINYFRYDDPAPTNG-QPFAVRTELA 104 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + P + VD S K L++ +L ++R + Sbjct: 105 TTPWNKDSLLLRVGIAGRDIATADLPPANLVFLVDVSGSMDAPDKLPLLQSSLKLLVRQL 164 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + D R+ + + + G + + GSTA ++ Sbjct: 165 RAQD------RITLVTYAGNISVVLPPTPG---DQQGRIVEAIDALQSGGSTAGASGIEL 215 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMT 402 AY + + R N I+L TDG+ + + + GI + T Sbjct: 216 AYK----AAQQGYLRGGINR-----ILLATDGDFNVGVTNFDQLKGMVAEKRRSGIALST 266 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + F A ++ ++ E K+ +G + Sbjct: 267 LGFGTGNYNDNLMEQL--ADAGDGAYAYIDTALEARKVLTHELGATLAT 313 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 50/155 (32%), Gaps = 22/155 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F F++ + R I + +T I+DA+ A + S+ Sbjct: 135 DRVALIVFGSEAYYAAPFTFDTEAIARRIEEATI--GISGRATNISDALGLALKRMAGSD 192 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 D + ++LL+DG N + A G+R+ TIA + + Sbjct: 193 ADT-----------RVVILLSDGANNAGATNPRGVAQLAAQMGVRVHTIAMGPKSVDEAE 241 Query: 415 -------ARYFLSNC--ASPNSFFEANSTHELNKI 440 L S F +T +L + Sbjct: 242 EGERGVVDAETLDAISKVSGGETFRVRTTEDLIAV 276 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 54/172 (31%), Gaps = 23/172 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G F V T + + T I +A++ A Sbjct: 86 RAGVWTFGKWVNMLVPHRDVTDPWRATAQAKASEINSVGLFTNIGEALEKA--------- 136 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENT------QDNEEGIAICNK----AKSQGIRIMTIAF 405 E +K I+LLTDG Q+ E I ++ K G+ + TIA Sbjct: 137 -TFEGADGGAEFRKSIILLTDGMVDIDKSPEQNKREWRRIADEVIPRLKEAGVTVHTIAL 195 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 S N + L A+ A+S +L +IF + +T Sbjct: 196 SANADTNLLNKISL---ATGGMAEVAHSADDLMRIFLKAFDVAAPAEQVPLT 244 >gi|115375477|ref|ZP_01462737.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|310821370|ref|YP_003953728.1| von willebrand factor, type a [Stigmatella aurantiaca DW4/3-1] gi|115367520|gb|EAU66495.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|309394442|gb|ADO71901.1| Von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 562 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 50/430 (11%), Positives = 111/430 (25%), Gaps = 50/430 (11%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 G+ + S +++ AA + + P S F Sbjct: 15 AFLGLSLMSSCMSAKAPSMETAAHDEATSRAQPQSPSAALSGKAEAEVSQFTPPPPPPAA 74 Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 A R ++ R + Sbjct: 75 PAPAAKRKMDEGKRSAGPTHANLLAPLEETRPQQGN-----TFEAHQPHAFTAAREDALS 129 Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 ++ + + T++R Y + S+ + + +N F Sbjct: 130 TFAVDVDTASYTLARRYLNQ------------GSLPPPEAVRVEEFVNYFKY--QYAPPD 175 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + + P + + VD+S H K L Sbjct: 176 KGAFTVHLEGAPSPFDPSRHFLRVGVQGKVVSRSQRKPAHLVFLVDTSGSMHSQDKLPLA 235 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++A+ +R++ + D V + + + T Sbjct: 236 KEAMKVAVRNLNENDTVA------IVTYAGSTQDVLPPTPATEVQR---IHTAIDLLQSG 286 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAICNKA 393 G TA+ M+ AY + +++LTDG+ N ++ + Sbjct: 287 GGTAMGSGMELAYRHAVKKASGNAISR---------VIVLTDGDANIGPNLSAESMLSGI 337 Query: 394 K---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR-------D 443 + ++G+ + TI F + + + + F +S E K+F + Sbjct: 338 EKYVAEGVTLSTIGFGMGNYRDDLMERLADK--GNGNCFYVDSYQEAKKVFEAQLTGTLE 395 Query: 444 RIGNEIFERV 453 I ++ +V Sbjct: 396 VIAKDVKIQV 405 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 49/352 (13%), Positives = 99/352 (28%), Gaps = 30/352 (8%) Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 +V + + Y+ P + + S + T+ Sbjct: 85 VSQSPAATSDAVVVTGNRPALQSSVQMKRSAAPAYEAREAPNTEKYNGESVSSVMRVTET 144 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 T S +++ ++ + + LN F + Sbjct: 145 PVSTFSVDVDTGAYANVRRML-NDGTTPTEAAVRTEELLNYFRYDYPLPQDRSKPFSITT 203 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 + + L P P+ + VD S + K LV+ AL+ + Sbjct: 204 DVAQTPWNAQTRLMRVGLRAYDVPRSERPAANLVFLVDVSGSMNDPDKLPLVKTALSMLS 263 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +++ D V+ V GA + VK + GSTA Sbjct: 264 DNLRPDDKVSIVVYAGAAGMVLAPTHEGK-----------YVKQALECLSAGGSTAGGQG 312 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIR 399 M AY ++ E + N ++L TDG+ A+ + + G+ Sbjct: 313 MALAY----ATAEANFIKGGINR-----VILATDGDFNVGISSIGEVEALVKQNRESGVT 363 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 + + F + A ++ +S E K+ D + + + Sbjct: 364 LTALGFGTGNY----NEALMEKMADVGNGNYAYIDSAMEARKVLDDELSSTL 411 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 62.3 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 39/366 (10%), Positives = 99/366 (27%), Gaps = 31/366 (8%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN---PLSLFLRSMG 152 N + + +V + M + S + + S + + Sbjct: 28 SNSRDTSPATPVIAQPTPVVAAPTMMMEAAAMSKMARYSPIMPMPMPRQQDNSEKYQHLA 87 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 T S +++ ++ + + +N F + Sbjct: 88 ANPIHQTATDAVATFSIDTDTGSYANVRRFLNN-GQLPPTDAVRIEELINYFNYDFSQAK 146 Query: 213 KSYSS---QNGKVGIRDEKLSPYMVSCN-KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + ++ + + + + ++ K+ + P + VD S Sbjct: 147 RLANAPFLVSTETVAAPWRTANRIIKVAIKADDPTITKQSTLPPANLVFLVDVSGSMSDN 206 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K LV+ +L + + ++ D ++ + R + + Sbjct: 207 DKLPLVKSSLKMLTKQLRPQDTIS------IVTYAGRTQVTLPAT---RGSDTDKILAAI 257 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + GST A++ AY + + N I+++TDG+ + Sbjct: 258 DSLDASGSTNGEAAIKLAYQ----QAKIHYKKDGINR-----ILMMTDGDFNVGVSDVDE 308 Query: 389 ICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + G+ + T F N ++ +S E K D + Sbjct: 309 MLDIIRRERDSGVSLSTFGFGEGNLNDHMMEQVADN--GNGNYSYIDSLSEAKKALVDEM 366 Query: 446 GNEIFE 451 Sbjct: 367 SATFNT 372 >gi|145299122|ref|YP_001141963.1| von Willebrand factor type A domain-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851894|gb|ABO90215.1| von Willebrand factor type A domain protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 331 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 54/195 (27%), Gaps = 34/195 (17%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ I + + R+ F D + L+ + Sbjct: 110 SAVRQQISRLIETRPG---DRIALVVFADHAYLLSPLTQETKALLTLTRELDFD--LVGR 164 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 +TA+ +A+ A ++L+TDG NT + + ++ +A + Sbjct: 165 TTALGEAILLARQHADPKRPTA-------------LLLVTDGRNTAGSADPLSEARRAAA 211 Query: 396 QGIRIMTIAFSVNKT-------------QQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 G+ + T+ + E L A +F A + +L Sbjct: 212 SGMTLYTLGVGADPDTFVEALQPAQSDPSAELDEALLQQLAKVGQGRYFRARTQGDL-DA 270 Query: 441 FRDRIGNEIFERVIR 455 + R Sbjct: 271 INQTLDTLEPAATAR 285 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/351 (10%), Positives = 70/351 (19%), Gaps = 32/351 (9%) Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 + + + V LS + + + Sbjct: 337 GTDPLVADSNDNGVSDGEESYTTTATNETLGVTLSLTGNGDIGNGTTIAPQD--DPRFNT 394 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 ++ + S + P +S Sbjct: 395 SRVGNMSASPVVELNSEQEFSSANVTLGYNETGVENESQDLAVFTYDPEAGIFVPLNSTV 454 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + + + + D V + Sbjct: 455 DATNNTATAETTHFSTFAVFDISNWATTYNATEPVRQTDDDGLRPVDVTLVMDTSGSMSS 514 Query: 280 SVIRSIKKIDN----VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 SV + D R F+ + T + G Sbjct: 515 SVKLRNTAGQRFVAGLLDVDRAAVVDFDSSAYVAQDLTSDFGAANST-----LDNLGSGG 569 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T I + TA + +N + ++LLTDG I+ A + Sbjct: 570 GTDIGSGLSTA---------NSQFASNSNDSRAQVMILLTDGRGNGG----ISEAQTAAN 616 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 Q + T+ F R L + A + F EL +F Sbjct: 617 QNTTVYTVGF------DNANRDKLRDIANITDGEFNYVTDRSELPNVFSRI 661 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 50/153 (32%), Gaps = 21/153 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F +R ++ + + R I + STAI D + A + S Sbjct: 132 DRIGLALFANRAYVAAPLTFDLAAVGRAIEEASI--GITGRSTAIADGLGLALKRVTESG 189 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 + IVLL+DG++ + + A G+RI TIA + Sbjct: 190 AAS-----------RVIVLLSDGQDNAHQIDARQVAGLAARHGVRIHTIALGPDDLETRP 238 Query: 410 -TQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 + L AS + +L Sbjct: 239 AARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|86360582|ref|YP_472470.1| hypothetical protein RHE_PE00308 [Rhizobium etli CFN 42] gi|86284684|gb|ABC93743.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 533 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 41/176 (23%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G+ +T + MP++LG +++DV R S L+ A + + L Sbjct: 8 RFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDGRD 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ + T D V A Sbjct: 68 DAITRAQTAIEKIANSAAFSGGGTGMSLGSNSSVTYEAGNDAGNTVTVLFLKSIPADDDT 127 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 P + K +QT V S Sbjct: 128 PIPASMETTEPSEASYAWVIAKPQAMQTIFPIPVGFNRDTINIAADAVAVYHASAC 183 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 55/192 (28%), Gaps = 32/192 (16%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + A + + + F + R +V T Sbjct: 105 DPSRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASSQVVVPPT-----TDRAVVST 153 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 154 AIANLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL 210 Query: 387 IAICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEAN 432 AK G + + TIA+ AS F A Sbjct: 211 ----EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAE 266 Query: 433 STHELNKIFRDR 444 S +L+ +++ Sbjct: 267 SLGQLSDVYKSI 278 >gi|162452621|ref|YP_001614988.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] gi|161163203|emb|CAN94508.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 61/191 (31%), Gaps = 29/191 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + A + + + D V+ F+ R + + I+ + Sbjct: 121 TRLTNAVQAATTAVSRLNDGDVVS------VVTFDTRTSVVVPPTTVGPETRGRILAS-V 173 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-- 386 + G T I+ ++ + ++ +++L+DG+ + Sbjct: 174 RGISLGGDTCISCGIEEGLSLLGQTSAGVSR-----------MLVLSDGDANHGVRDVPG 222 Query: 387 -IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 A+ +A+ +G+ I TI V+ LS A S + + L +IF Sbjct: 223 FRAMAQRARDRGVAITTIGVDVDYN-----EKILSAIALDSNGRHYFVENDAALARIFEA 277 Query: 444 RIGNEIFERVI 454 ++ V Sbjct: 278 E-AEQLTTSVA 287 >gi|297567412|ref|YP_003686384.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851861|gb|ADH64876.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 319 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 52/163 (31%), Gaps = 23/163 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ + + +L+ ++ TAI + + T+ + Sbjct: 127 RIGLVTFSRAATEVVAPTTNRQRLLDSVELIGLEF-----GTAIGEGILTSLQALPP--- 178 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK- 414 E + + I+LLTDG + + + A Q +RI TI Sbjct: 179 LEQRKDAKDPSELATIILLTDGR-SISGIDPLEAARIAAEQKVRIHTIGVGRVTEGPVPG 237 Query: 415 -----------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F NS +L + ++ Sbjct: 238 LESVYQWAAYFDEDVLKQIAAITGGKYFFVNSAGKLRETYQQL 280 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 62/181 (34%), Gaps = 24/181 (13%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +++ V+ L S ++ R+G F D + L +++ Sbjct: 53 VRRIDGVKLLLESFLQQRTG-------DRVGLIAFGDDAYLQAPVTEDFDTL--SLLLEQ 103 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TA+ DA+ A + S K ++LLTDG++T + Sbjct: 104 MDVRMAGAGTALGDAIGVAVNHFEHS-----------ESNNKVLLLLTDGKDTTSQFPPV 152 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCAS--PNSFFEANSTHELNKIFRD 443 + A +G+ I IA E+A L+ AS FEA L +++ Sbjct: 153 DAAHFAGERGVTIYPIAIGDATNVGEEAIDLEMLARIASYTGGQVFEALDGDALAAVYQT 212 Query: 444 R 444 Sbjct: 213 L 213 >gi|85710455|ref|ZP_01041519.1| putative secreted protein [Erythrobacter sp. NAP1] gi|85687633|gb|EAQ27638.1| putative secreted protein [Erythrobacter sp. NAP1] Length = 576 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 54/407 (13%), Positives = 113/407 (27%), Gaps = 40/407 (9%) Query: 59 TAII-----TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A + + + P Q + S + P + V Sbjct: 14 AAALGLFVASCAQPSEQIVVTGSPAPTSGTKAPPPPPPPPPPPPPPPSQALADNTVVVTA 73 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL-----RSMGIKSWLIQTKAEAETVS 168 R A + + V S+ + P+ + + R G + ++ + + Sbjct: 74 SRRAGAPSRSRTGAVSAEVASAPRGYSIPPVVVPVDPGRERYDGEEVSPVKLVSAEPVST 133 Query: 169 RSYHKEH-GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 S + + + +N F R + + Sbjct: 134 FSVDVDTGAYANTRRFLRQGVTPPRDAVRTEEMINYFRYDYARPTTRDVPFTTNIDVAKT 193 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + L P + +D S K LV+ AL+ + + + Sbjct: 194 PWNEDTYLMRIGLRGYDIERDERPPANLVFLMDVSGSMGSPDKLPLVQTALSGLAGELGE 253 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D V+ + + K ++ + + GSTA +Q AY Sbjct: 254 QDRVS------IVVYAGAAGLVLEPTNDTAK-----IRAALMSLSAGGSTAGGAGIQLAY 302 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIA 404 + ED N ++L TDG+ D + + + K + +GI + T+ Sbjct: 303 NI----AEDNFIEGGVNR-----VILATDGDFNVGVSDRDALVEMVEKNRDRGITLTTLG 353 Query: 405 FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 F + A ++ +S E K+ D + + + Sbjct: 354 FGTGNF----NEAMMEQIANKGNGNYAYIDSALEAKKVLSDEMSSTL 396 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D +D R+ + V + K + + T Sbjct: 227 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLST--GTMTG 284 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++TA + + + N K K +++TDG + A++ GI I Sbjct: 285 MAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDK---VEEVSAAARASGIEI 338 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + L AS + F + +L FR+ + E + Sbjct: 339 YAVGV------DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEEARADLT 392 Query: 455 RI 456 Sbjct: 393 EN 394 >gi|171912901|ref|ZP_02928371.1| hypothetical protein VspiD_17015 [Verrucomicrobium spinosum DSM 4136] Length = 339 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 47/169 (27%), Gaps = 29/169 (17%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G F + + L+ + + I + Sbjct: 133 KGRPNDRIGIVAFGGAPYNPCPPTLDHDWLLNNMDR-------------IQTGIMEDGTA 179 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 I S R+ K I+L+TDG N A + GIRI I+ Sbjct: 180 IGSGIAAAARRLDQLEVKSKVILLMTDGANNSGKLSPQDAARLAATLGIRIHAISIGTPG 239 Query: 410 --------------TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +QE L A SFF A L +IF+ Sbjct: 240 MHPIYMPNGPPINSGRQEFDPETLQEVANIGSGSFFRAEDLSTLERIFK 288 >gi|218460583|ref|ZP_03500674.1| hypothetical protein RetlK5_14236 [Rhizobium etli Kim 5] Length = 309 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 92/335 (27%), Gaps = 71/335 (21%) Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A+I A + ++E+ + K + +++ N Sbjct: 3 LDAALIAAVKQI-DNVEDADTLKKKVSDWFHAQVDNSYSLGEIEIDTANH---------- 51 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + + F++ I+S + + + + SY Sbjct: 52 -----------------NITATASGTVPTTFMKIANIESVDVSVASAVKGPATSY----- 89 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 +++ VID S SML S + + GK Sbjct: 90 LNVYIVIDTSPSMLLAATTSGQATMYAGIGCQFACHTGDAHTIGKKK------------- 136 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + SS + + + + DA+ V+ I + D ++ ++ Sbjct: 137 ------------------YANNYEYSSEKKIKLRADVAGDAVREVLDMIDESDANHERIK 178 Query: 297 MGATFFNDRVISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +G D + + + +T + ++ T + + Sbjct: 179 VGLYSLGDTLTEVLTPTLSTDTARTRLADASYGLTSSTSKAATYFDVSLATLKQKVGAGG 238 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + K ++LLTDG +Q + Sbjct: 239 DGS-----TSGSPLKLVLLLTDGVQSQREWVTDGV 268 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 46/172 (26%), Gaps = 21/172 (12%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 +N+N ++D V + R F GST I+ ++ Sbjct: 313 ENLNQDDLFNLILYDDTVEMFKPELLRCNAENRAEALRFIEGVRPGGSTDIDQGLRAGLK 372 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAF 405 I + Y++ LTDG T + I A++ ++ Sbjct: 373 LIADESRPN------------YVIFLTDGLPTSGETNELKIAEAARAANPLKAKLFVFGV 420 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI---FRDRIGNEIFERVI 454 + + R N F L F +RI + Sbjct: 421 GYDVNARLLDRLSGEN---GGVSFYVKPDDNLEVAVSRFYERISTPALTDIT 469 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 60/173 (34%), Gaps = 29/173 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K L R A V+ +++ ID+V F++ R +K Sbjct: 408 RKIELARLAAIGVVENLRPIDSVG------VLIFDNSFQWAVPI---RKAEDRATIKKLI 458 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I A+ AY I+ K+IVLLTDG + + + + Sbjct: 459 SGITPDGGTQIAPALTEAYQRILPQTAM-----------YKHIVLLTDGISEEGDSMTLT 507 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNK 439 +A++ + I T+ + R FL AS + N L + Sbjct: 508 --KEAQANHVTISTVGLGQDVN-----RAFLEKVASNADGKAYFLNDPSGLEQ 553 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/326 (11%), Positives = 87/326 (26%), Gaps = 23/326 (7%) Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + V T + ++ + + L + L E + Sbjct: 1435 YAVGIGAGVSSTTLLDIAGYNSRVLQATDFASLEVIGRELQEFICNAAYCGDPGTPEFGS 1494 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDS--EGQPLNCFGQPA--DRTVKSYSSQNGKV 222 S + + + + D + ++L + + + NG Sbjct: 1495 RSGDFFEGSV--VTFQCDPTYTLLGSATTTCFGNGTWSDPVPICVSPDPCNNSPCLNGGT 1552 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 R + + C + + + S + L D L + Sbjct: 1553 CQRVGLTTEFTCLCPEGYHGPICQFYAACSNRTLNLDVVFLLDGSGSVGSANFDLLKTFT 1612 Query: 283 RSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I D + R+G ++D+ S+ ++ A + G T+ Sbjct: 1613 TRIATNFDVSTNLTRVGVVQYSDQTNSEFVL--NTFSTEAEVLAAIAAISYQNGGTSTGA 1670 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ + S + N +++LTDG ++ D A++ GI I Sbjct: 1671 ALDYVRQNVFISASGDRPDAAN------ILIVLTDGVSSDDVSFPAM---AARNAGITIY 1721 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNS 427 ++ L A + Sbjct: 1722 SVGIGDGVDYNT-----LQQIAGDPN 1742 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 42/389 (10%), Positives = 84/389 (21%), Gaps = 41/389 (10%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A I A ++ + + N L D+ Sbjct: 1654 AAIAAISYQNGGTSTGAALDYVRQNVFISASGD--RPDAANILIVLTDGVSSDDVSFPAM 1711 Query: 120 VEMNPRKSAYQVVLSSRYDLL-----LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 N + Y V + D + L++ G S E V + + Sbjct: 1712 AARNAGITIYSVGIGDGVDYNTLQQIAGDPNKVLQATGFSSLDDIGGQLEELVCDATYCG 1771 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + CF + Sbjct: 1772 DP-----GAPINGFRNGNFFEGGTVTWGCFNGFNLVGALTAVCLGNGSWSEPVP------ 1820 Query: 235 SCNKSLYYMLYPGPLDPSLSEE--HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 C + +D S ++ ++ + D Sbjct: 1821 ECMAPTTPPPPGCDELSFGGWDLVFLLDGSGSVGSNNFLNV-KNFTKLITDLFPVGD--- 1876 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++G F+D + + + I+ G T +A+ + Sbjct: 1877 NATKVGLVQFSDTIQKEFDLRD--YDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFN 1934 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + +++LTDGE+ A+ QGI I I Sbjct: 1935 TINGNRG------SHPDMLIVLTDGESFDPVTFAS---QSARDQGITIFAIGVGTGVDYA 1985 Query: 413 EKARYFLSNCASPNS-FFEANSTHELNKI 440 L A + +L + Sbjct: 1986 T-----LEEIAGDPQKVQQVTDFADLTSV 2009 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 43/431 (9%), Positives = 97/431 (22%), Gaps = 63/431 (14%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + + + I A ++ N+ + L Sbjct: 2466 YLNAFSTK---QAVLDA--INAVTYQQGGTNTGAAITYTMQEIFAS--ANGARANYPDVL 2518 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 D+ N Y V + +N +L + L Sbjct: 2519 IVVTDGESSDDVAVPALSARNAGTLIYAVGV----GNGVNQATLLQIAGNAGQVLQAADF 2574 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN-------------CFGQPAD 209 T +++ + D + + S Sbjct: 2575 AGLTTVVQSLQQNLCDAAYCGDPGAPVNGNRVGSYFVGNTVTFSCSAGYIIQGSATATCQ 2634 Query: 210 RTVKSYSSQNGKVGIRDEKLSP--YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + S+ V +P +C + Y S + S+ Sbjct: 2635 SSGQWTSAVPTCVASDPCPANPCQNGATCTQVGSTTQYTCTCPQGYSGNNCEVFSACTDR 2694 Query: 268 I-----------------KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 +L++ +++ + N + + FN + Sbjct: 2695 ALDVDLVVLLDGSGSVGSDNFNLLKAFTQNIVGNFDIAVNNTRVGVVQYSDFN-----NI 2749 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 F+ + ++ + G T A+ + ++ N + Sbjct: 2750 EFNLNAYATEAEVLAAIGAISYQRGGTFTGAAIDFVRQDVFTTA------GGNRADKPDI 2803 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFF 429 +++LTDGE++ + GI I + L A P Sbjct: 2804 LLVLTDGESSDSVAGP---AQNTLNAGITIYAVGIGSGVN-----ADTLQEIAGDPGRVL 2855 Query: 430 EANSTHELNKI 440 + L I Sbjct: 2856 QVADFQGLAAI 2866 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/239 (8%), Positives = 58/239 (24%), Gaps = 16/239 (6%) Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 S + + ++ P F D +Q + + + Sbjct: 2055 TLAGSSSITCQPDGQWTDVVPTCLFIDNCDPNPCLNGAQCFQTA---DSYRCTCAEGYEG 2111 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 +Y + +D S L++ + + R+G Sbjct: 2112 TNCEIYTALNAQTFDLVFLLDGSGSVG-ASSFDLMKSFTNRITTNFDVSPTST---RVGV 2167 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ + F + ++ + G+T +A+ + Sbjct: 2168 VQYSSQGSVATEFRLDSYSNKDDVIAAVNGIVYQNGNTYTGEALNYVRQNSFAVAN---- 2223 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +V++TDG++ D +G+ + + Sbjct: 2224 --GGRADVANILVVITDGQSVDDVTGP---AQDLLREGVTVYALGIGDGIQYSTLEAIA 2277 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 53/168 (31%), Gaps = 20/168 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L++ L ++ K D +D R+G ++ V ++ + ++ + Sbjct: 370 FDLLKTFLQNIT---MKFDVSSDITRIGVVQYSTDVNTEFELK--TYATEAEVIHAISNI 424 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + GST I + + + N +V +TDG + D Sbjct: 425 TRQRGSTFIGAGINFVRTNSFTVAAGDRPLAPN------ILVTITDGISADDVAGP---A 475 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHEL 437 A+ QGI +I L + A F S EL Sbjct: 476 QAARDQGILTYSIGIGEEIQWPT-----LLSIAGARHRVFNVTSFSEL 518 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 38/347 (10%), Positives = 93/347 (26%), Gaps = 30/347 (8%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + S + + SFT + + D + + + Sbjct: 577 VARHFTISSSRNDNMSYRSFTAATNVGAVQYSDTVRSEFFLSSFDTDFEVVRALDGISYL 636 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + L +P++ G L+ V+ ++ V+ Sbjct: 637 AGGTFTGFALDFVQQSAFSPVAGA--RDGYPDILVV-------VTDGVSQDDVVAPAESA 687 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + Q A + N + P + S Sbjct: 688 RKEGIAVFAVGIGSAVDYATLLQIAGIDGRILQINNFVDLASASQTLPDYICNYASCQGR 747 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P + L V S++ LV+D ++ R+ + ++G ++ Sbjct: 748 DVPLDIVFLLDGSGSVGSAN-------FDLVKDFTRTLARNFDIA---ANMTQIGVVQYS 797 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D V + + ++ + + G T A+ T ++ + + + N Sbjct: 798 DTVNREFGLGD--FHNRQDVLNAISAVSYQQGGTLTGAAIDFVRQTSFTTGDGDRPDVPN 855 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++++TDG + + + A+ +GI + Sbjct: 856 ------MLIVVTDGVSGDSVQGP---ADAARREGITTFGVGIGNGID 893 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 48/149 (32%), Gaps = 14/149 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 LV+D + V+ + DT +G ++D V ++ S KL Sbjct: 1034 TTNFELVKDFTSEVVLNFNIS---ADTTNVGVVQYSDTVRNEFFLSSYDTKLPLIDAINQ 1090 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T A+ + N +V+LTDG Q ++ + Sbjct: 1091 ISYLT--GGTLTGFAIDYVRQS-----SFSRPAGARNTFPD-VLVVLTDG---QSQDDVV 1139 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + A+SQGI I + + Sbjct: 1140 SSAAAARSQGITIFAVGIGSEVDFTTLLQ 1168 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 12/124 (9%), Positives = 36/124 (29%), Gaps = 11/124 (8%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + + R+G ++ + ++ ++ ++ GST + Sbjct: 166 FEVSSSQTRVGVVQYSTGINTEFDL--NSFATEAEVINAIRGLSHQRGSTFTGAGITFTR 223 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + N ++++TDG + + A++ I +I Sbjct: 224 LESFTGASGDRPDAPN------VLIVITDGISADSVDAP---AEAARADNITTYSIGIGD 274 Query: 408 NKTQ 411 Sbjct: 275 EINY 278 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 36/142 (25%), Gaps = 15/142 (10%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 A ++D V S+ S G T A+ + S Sbjct: 604 AVQYSDTVRSEFFLSSFDTDFEVVRALDGISYL--AGGTFTGFALDFVQQSAFS------ 655 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +V++TDG + D A+ +GI + + Sbjct: 656 PVAGARDGYPDILVVVTDGVSQDDVVAP---AESARKEGIAVFAVGIGSAVDYA----TL 708 Query: 419 LSNCASPNSFFEANSTHELNKI 440 L + N+ +L Sbjct: 709 LQIAGIDGRILQINNFVDLASA 730 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 62/179 (34%), Gaps = 17/179 (9%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L ++ + D R+G F F+ V + Sbjct: 124 KAASNVSRLVALKSLLSSFSQQRDGDRLGLIVFGSGAYLQVPFTEDVRL--WQTLLEQMD 181 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + +TAI DA+ + SN +++ ++L+TDG +T + + Sbjct: 182 TQMAGPATAIGDAVGLSIRAFERSN-----------TSQRILLLVTDGSDTSSRLDPVDA 230 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFRDR 444 A ++GI I T+ T + F L+ A + FE NS+ + +I Sbjct: 231 ARVAAAEGIEIFTLGMGSVDTVGDDQVDFNTLNKIAKITNGRAFEGNSSTAIAEILAQI 289 >gi|167549689|ref|ZP_02343448.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205325282|gb|EDZ13121.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 593 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/318 (11%), Positives = 91/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 170 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 171 NYFPAPQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQTSE 228 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L++ AL ++ ++ DN+ G T Sbjct: 229 LPPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 285 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + + Sbjct: 286 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGVNR------ 331 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+ TDG+ + I K + +GI + T+ + + Sbjct: 332 -ILFTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 388 Query: 427 SFFEANSTHELNKIFRDR 444 ++ +S E K+ +D Sbjct: 389 NYSYIDSLSEAQKVVKDE 406 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 50/155 (32%), Gaps = 22/155 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F FS+ V + R I +T+I+D + A + +S Sbjct: 135 DRVALVVFGSEAYFAAPFSFDVEAIARQIEGAQI--GISGRATSISDGLGLALKRMENS- 191 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-------V 407 A + ++LL+DG N + A G+R+ TIA Sbjct: 192 ----------EAASRVVILLSDGVNNAGATNPRGVAELAAQMGVRVHTIALGPKDLSTAD 241 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L + S F +T +L + Sbjct: 242 PGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 65/193 (33%), Gaps = 19/193 (9%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 SS + L+++ + ++ K D + F DR Sbjct: 257 VFDASSSINANNPNNYQLMKNFMKDIVDRFNKTGP--DGTQFAVVTFADRATKQFGLKDY 314 Query: 316 VHKLIRTIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 K +K + G TAI D ++ A + E +K ++L Sbjct: 315 SSKAD---IKGAIDKVSPSIIGQTAIGDGLENARLEVFP-----NRNGGGREEVQKVVIL 366 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEAN 432 LTDG+N + + +G+ I+ I + L N AS + F + Sbjct: 367 LTDGQNNGHKS-PEHESSLLRKEGVVIVAIGVGTG-----FLKSELINIASSEEYVFTTS 420 Query: 433 STHELNKIFRDRI 445 S +L+KI D + Sbjct: 421 SFDKLSKIMEDVV 433 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 62/194 (31%), Gaps = 16/194 (8%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + + K V++ + S N + F+ V + Sbjct: 59 FDDSSSIQYDNKENYQMMKDFVKELVDSFTTVGVNGRNGSQFG---VVQFSQGVKTAFPL 115 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T I ++ + S E N + +K ++ Sbjct: 116 -NKFKTKEDIKKGIQDMVPRNGGQTEIGTGLKHVRENSFSGAEG-----GGNPDKQKIVI 169 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 L+TDG++ +K K++G+ ++ I + L A+ ++ Sbjct: 170 LMTDGKSNAGAP-PQHEAHKLKAEGVTVIAIGIGQG-----FVKTELEQIATMKNYVLTT 223 Query: 433 ST-HELNKIFRDRI 445 ++ EL+ + + I Sbjct: 224 NSFSELSTLLKLVI 237 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 50/168 (29%), Gaps = 20/168 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 I +D T R+ + V + K + + T A+ Sbjct: 80 MIDTLDVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIQPLSA--GTMTGLAI 137 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A D + + N K ++++TDG + ++ + A++ GI I + Sbjct: 138 QAAMDEVFTEEMGTRPA---NFNIPKVVIIVTDG---RPQDQVENVAANARTAGIEIYAV 191 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS F + +L FR+ Sbjct: 192 GVGR------ADMQSLRIMASEPLDEHVFYVETYGVIEKLTSKFRETF 233 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 55/192 (28%), Gaps = 32/192 (16%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + A + + + F + R +V T Sbjct: 107 DPSRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASSQVVVPPT-----TDRAVVST 155 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 156 AIANLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL 212 Query: 387 IAICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEAN 432 AK G + + TIA+ AS F A Sbjct: 213 ----EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAE 268 Query: 433 STHELNKIFRDR 444 S +L+ +++ Sbjct: 269 SLGQLSDVYKSI 280 >gi|221044450|dbj|BAH13902.1| unnamed protein product [Homo sapiens] Length = 623 Score = 61.9 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 56/188 (29%), Gaps = 10/188 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++++ + + F RV S + + F Sbjct: 15 MRGQKVKQIKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFV 71 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T +N + + + E + +++LTDG+ T+ + Sbjct: 72 RGFSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQ 127 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I ++ + + F N + N +E + + + F ++ Sbjct: 128 ILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDHDATQQLQGFYSQV 187 Query: 446 GNEIFERV 453 + V Sbjct: 188 AKPLLVDV 195 >gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus] Length = 1031 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 59/189 (31%), Gaps = 12/189 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTF 327 K + A+ +++ ++ D+ F+D V + + K + Sbjct: 263 TKLQQTKKAMDTILSDLQASDS------FNIITFSDTVNIWKAEGSIQATVQNIHSAKNY 316 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN A+ A + SN++ I+ LTDGE T Sbjct: 317 VSRMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP--LIMFLTDGEPTAGETTPS 374 Query: 388 AICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + + + + + ++AF + R L N +E + Sbjct: 375 VILSNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAE 434 Query: 445 IGNEIFERV 453 I + V Sbjct: 435 ISRPLLADV 443 >gi|332817004|ref|XP_003339088.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan troglodytes] gi|332817006|ref|XP_516520.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 6 [Pan troglodytes] Length = 623 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 55/188 (29%), Gaps = 10/188 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++++ + + F RV S + + F Sbjct: 15 MRGQKVKQTKEALLKILGDMQPG---DYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFV 71 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T +N + + + E + +++LTDG+ T+ + Sbjct: 72 RGFSLDEATNLNGGLLRGIEILNQVQES----LPELSNHASILIMLTDGDPTEGVTDRSQ 127 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I ++ + + F N + N +E + + F ++ Sbjct: 128 ILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQV 187 Query: 446 GNEIFERV 453 + V Sbjct: 188 AKPLLVDV 195 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 62/226 (27%), Gaps = 39/226 (17%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 M + + + V + + + + I R+ Sbjct: 91 DMSGSMEAYDVPRNINDARTLIAAVKNKEVENRIEVAKKEIRRFIEQ-------RPNDRI 143 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F D+ S + L+ + + T I + + + + Sbjct: 144 GLIGFADQAYSFAPPTLDHAWLLAHLEQ--LEPGMIGQQTGIAAPLASGVNRL------- 194 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSV--------N 408 K + ++ +VL TDG N DN K + I T+ + Sbjct: 195 ----KKSDAPRRVLVLFTDGRNNVDNRLTPEQAAALGKEFDVVIHTVGIGSRNAFVLVTD 250 Query: 409 KTQQEK--------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 +++ L + A + ++F A + ++ + Sbjct: 251 PFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGMKQVMDEI 296 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 53/154 (34%), Gaps = 21/154 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F DR + + V + + I +TAI D + A + Sbjct: 131 DRVGLVVFGDRAYVAAAPTHDVGAVAQVIGTLQI--GVSGKATAIADGLGLAIRRLR--- 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VN 408 + I+LL+DG++T + +A A+ G+R+ TIA Sbjct: 186 --------GREAESRVIILLSDGQDTTGAVDPVAAAQAAQELGMRVYTIALGPADLASSP 237 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + L A + F ST +L + Sbjct: 238 DARDAVDSETLRRIAEVAGGETFRVRSTEDLEAV 271 >gi|332254526|ref|XP_003276380.1| PREDICTED: cartilage matrix protein [Nomascus leucogenys] Length = 496 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/407 (9%), Positives = 105/407 (25%), Gaps = 49/407 (12%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ + + + + + +R + T ++ + + Sbjct: 77 ATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAE 136 Query: 124 PRKSAYQ-------VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +S VV R + +S R+ G++ + I + + Sbjct: 137 GGRSRSPDISKVVIVVTDGRPQDSVQDVSARARASGVELFAI-GVGRVDKATLRQIASEP 195 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S S+++ + + + + Y +C Sbjct: 196 QDEHVDYVESYSVIEKLSRKFQEAFCVVSDLC------ATGDHDCEQVCISSPGSYTCAC 249 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKHLVRDALASVIRSIKKID 289 ++ + + S + LV+ + ++ ++ D Sbjct: 250 HEGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSD 309 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + ++ V + G + I T A++ D Sbjct: 310 KLAQVG---LVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDN 364 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + +K ++ TDG + + KAK G ++ + Sbjct: 365 SFTVSSGARPGA------QKVGIVFTDGRSQDYIND---AAKKAKDLGFKMFAVGVG--- 412 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERV 453 L AS +F +N+ IG ++ +++ Sbjct: 413 ---NAVEDELREIASEPVAEHYFYTADFKTINQ-----IGKKLQKKI 451 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 56/168 (33%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + S + + + T A+ Sbjct: 68 IQSLDVGPNATRVGMVNYASTVKQEFSL---RAHVSKAALLQAVRRIQPLSTGTMTGLAI 124 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A I + D + + K ++++TDG ++ A +A++ G+ + I Sbjct: 125 QFA---ITKAFSDAEGGRSRSPDISKVVIVVTDGRPQDSVQDVSA---RARASGVELFAI 178 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L++ F++ Sbjct: 179 GVGR------VDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAF 220 >gi|115623672|ref|XP_785426.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960633|ref|XP_001186588.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 1028 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 52/424 (12%), Positives = 127/424 (29%), Gaps = 37/424 (8%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 LK + A ++ +S+ + + + K Sbjct: 88 VRSVLKNLDEKAAEATFTLILPEDAFISAFSMTINDVVYDSQVKPKAVAQKEYNKAKARG 147 Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE--T 166 + + + P + ++V ++ + + + + + K + + + + Sbjct: 148 QTAGQVK-----QSKPGDTKFKVSVTVAAETSVLFNLTYQQLLIRKGGMFENRISIQPGQ 202 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV---- 222 V + E ++ + S ++ LN + ++ + K Sbjct: 203 VVPNVSVEVYITEPQGLKEVTSTWLISDKNDQAKLNSMTTLDIKPRTAHILFDSKEEGLL 262 Query: 223 -----GIRDEKLSPY-MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 GI + + Y ++ K+ + + G S + ++ ++VI + Sbjct: 263 TTSRRGIMGDFIVTYDVIHTKKAGHLEIVDGYFVHYFSPDGLPNTR--KNVIFVIDVSGS 320 Query: 277 ALASVIRSIKKI-----DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF--AI 329 R K+ D+V R+ F+ V I Sbjct: 321 MYGQKTRQTKRAFTTILDDVRPIDRINIILFSSYAHVWRE-DQMVEATSDNIAAAKRHVN 379 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T I D++ A + ++ + + I++LTDG+ + Sbjct: 380 GLSVGGGTNIYDSLMKAVEILLEHDTGDAMP---------LIIMLTDGQVGNAAAIVRDV 430 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + + + +I F + LSN A +E +S K F D + N + Sbjct: 431 TSVIGGR-LSLFSIGFGNGVDFPFLEKLSLSNQALARKVYEDSSASLQMKGFYDEVANPL 489 Query: 450 FERV 453 + Sbjct: 490 LFNI 493 >gi|198435715|ref|XP_002125840.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 3908 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 30/260 (11%), Positives = 77/260 (29%), Gaps = 23/260 (8%) Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 L+ + + + S Y V N + P + + Sbjct: 3368 SLSLTYTCTNGSNVGSNCTFQCDEDATLLGSTYRVCLNDGTWSRSNPVCRACKTAVTDVI 3427 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + + R A + ++ + D R ++D ++ + Sbjct: 3428 FIVDGSWSVGEIN-FRKAKDFLKALVEPFEVGWDNSRFAVVQYSDDPRTEFLMNEHFTVT 3486 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTA-YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ + G+T A+ + Y + +N K ++LTDG Sbjct: 3487 D--VLNAIDAIPYKGGNTNTGKALAFSLYTALSPAN-------GARPYVNKVALVLTDGR 3537 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS---FFEANSTH 435 + + + + G++++T+ + + L + ASP + + Sbjct: 3538 SQDEVGNP---ARELRQAGVKVLTVGVG------DADKNELKSIASPPYDSSVYHVSDYD 3588 Query: 436 ELNKIFRDRIGNEIFERVIR 455 +++I V+R Sbjct: 3589 SISEIKAHLAAKLCEGEVLR 3608 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 59/173 (34%), Gaps = 19/173 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + A ++ + + V MG ++D ++ + H +++ Sbjct: 413 PQNFKKIQAFLVSLVDAFSIGFNNVLMGYAQYSDDARTEFNL--NEHVTKDDLIRAINQV 470 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + G+TA A+ + +S K +++TDGE+ D+ Sbjct: 471 QYKGGNTATGGALDYIRTNLFTS------EGGTRRGVLKTAIVITDGESILDDV--TEPA 522 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 K G+ + +I R L + AS + F ++ ++ I Sbjct: 523 RMLKEIGVEVFSIGV------AAALRSELEDIASSPASDHVFSVDNFDDIKNI 569 >gi|198433657|ref|XP_002122417.1| PREDICTED: similar to PK-120 [Ciona intestinalis] Length = 864 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 66/226 (29%), Gaps = 16/226 (7%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +++ +D S K ++AL +++ + +ID + Sbjct: 282 GYFVHFFAPTNLPVIPKKVVFVIDVSGSMSGH-KIVQTKEALRTILDDLNEID------Q 334 Query: 297 MGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 F+ V+ K G T N A + + + Sbjct: 335 FNIITFSSTTNVWHPNEMVDVNPTNIRNAKKHVRSMYARGGTNFNAAALDGIQLLETISS 394 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQ 412 + + ++ ++LLTDG+ T AI + + I + F + + Sbjct: 395 NRTNTLEEAS----MMILLTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQHLDHE 450 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI-RIT 457 + N + K F D + + + VI R T Sbjct: 451 FLDQIASENKGLSRKIYNDADAALQLKDFYDEVASPLLAHVIMRYT 496 >gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] Length = 1321 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 59/189 (31%), Gaps = 12/189 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTF 327 K + A+ +++ ++ D+ F+D V + + K + Sbjct: 297 TKLQQTKKAMDTILSDLQASDS------FNIITFSDTVNIWKAEGSIQATVQNIHSAKNY 350 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T IN A+ A + SN++ I+ LTDGE T Sbjct: 351 VSRMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP--LIMFLTDGEPTAGETTPS 408 Query: 388 AICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + + + + + ++AF + R L N +E + Sbjct: 409 VILSNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAE 468 Query: 445 IGNEIFERV 453 I + V Sbjct: 469 ISRPLLADV 477 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 59/203 (29%), Gaps = 30/203 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I V + +G F+ Sbjct: 258 DKSGSMSSSNRLNRMNQAAKYFLMQI-----VENGSWVGMVHFDSTASIRSDLIQITGSN 312 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I ++ A++ + IVLLTDGE+ Sbjct: 313 ERDKLLGSLPT-TASGGTSICSGIRRAFEVVRKLYSHTDGSE---------IVLLTDGED 362 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 C ++ K G I IA + + A+ F A E Sbjct: 363 N-----TAGACVDEVKQSGAIIHFIALGPSADKAVIEMST----ATGGVHFYATDEAENN 413 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F +I ++ +++ Sbjct: 414 GLIDAFGALASGNTDISQQSLQL 436 >gi|119720657|ref|YP_921152.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] gi|119525777|gb|ABL79149.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] Length = 327 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 27/188 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + R A ++ + V +G F+DR++ + ++ Sbjct: 121 KIEVARRAATLLVERM------PGGVDVGLLAFSDRIVLSLPPTGDRRRV-----LDAIE 169 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T +Q A + +V ++DG D + Sbjct: 170 SLKPGGGTMYTYPLQAALSWLKPYKLFNAST---------LVVFVSDGLP-ADAATYRTL 219 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 ++ +S GI + T+ ++ L A + A S EL K F+ Sbjct: 220 LSEFRSLGIPVYTVYIGPGG---DEGERELKLIAGSTGGEEYTAGSAEELLKAFKTL-AE 275 Query: 448 EIFERVIR 455 + ++R Sbjct: 276 KASSILVR 283 >gi|331003698|ref|ZP_08327192.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] gi|330412081|gb|EGG91476.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] Length = 528 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 45/315 (14%), Positives = 92/315 (29%), Gaps = 25/315 (7%) Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 Y+ +N + ++ S+ + T + +I+ I S + Sbjct: 52 VSYEPNVNYNTEEYNAIKESSFHAVATSPLSTFAADVDTASYANIRRFI-NSGELPPADS 110 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + LN F + + I +P + + Sbjct: 111 VRIEEMLNYFRYDYPQPKDGEP-FSVSTEISACPWNPDTKLMMIGMQAKKVEESEKKPSN 169 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +D S K LV++A + +K+ D ++ D Sbjct: 170 LVFLIDVSGSMDEPDKLPLVKNAFLLLCEELKENDTISIV---------TYAGYDQVVLE 220 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + + D GSTA +D ++TAY + + ++L Sbjct: 221 GASGSDSKEIMSAIEDLEAAGSTAGSDGIKTAYKI---------AKKYFKSDGNNRVILA 271 Query: 375 TDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 TDG+ + + K K G+ + + F + K N ++ Sbjct: 272 TDGDLNVGITSEGKLTRLIKKEKESGVFLSVLGFGTENIKDNKMEALADN--GNGNYSYI 329 Query: 432 NSTHELNKIFRDRIG 446 +S E K+ + +G Sbjct: 330 DSRFEAKKVLSEELG 344 >gi|218458530|ref|ZP_03498621.1| hypothetical protein RetlK5_03343 [Rhizobium etli Kim 5] Length = 185 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 47/183 (25%), Gaps = 2/183 (1%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ G+ +T + MP++LG +++DV R S L+ A + + Sbjct: 2 LSRIVRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE-NNLKKNFTDREVRDIVRDTAVEMNPR 125 L + ++ + D T + + Sbjct: 62 ELDGRDDAITRAQTAIEKISNSAAFSAGGTGMSLGSHISVTYDAGNDAGSTVTVLFLKDI 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + S + + + + + + +R V Sbjct: 122 PANDDTPIPSSMETTVASEASYAWVIAKPQAMQTIFPIPVGFTRDT-INIAADAVAVYHA 180 Query: 186 SRS 188 S Sbjct: 181 SAC 183 >gi|193786838|dbj|BAG52161.1| unnamed protein product [Homo sapiens] Length = 496 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 46/440 (10%), Positives = 118/440 (26%), Gaps = 57/440 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-------VVLSSRYDLLLNP 143 +R + T ++ + + +S VV R + Sbjct: 104 LLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSVQD 163 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 +S R+ G++ + I + + + S S+++ + Sbjct: 164 VSARARASGVELFAIGVGSV-DKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCV 222 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS- 262 + + + Y +C++ + + Sbjct: 223 VSDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDL 276 Query: 263 ------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 S + LV+ ++ ++ ++ D + ++ V + G Sbjct: 277 VFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GR 331 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + I ST A++ D + + +K ++ TD Sbjct: 332 FHTKKDIKAAVRNMSYMEKSTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTD 385 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 G + + KAK G ++ + L AS +F Sbjct: 386 GRSQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTAD 436 Query: 434 THELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 437 FKTINQ-----IGKKLQKKI 451 >gi|224057976|ref|XP_002299418.1| predicted protein [Populus trichocarpa] gi|222846676|gb|EEE84223.1| predicted protein [Populus trichocarpa] Length = 595 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 57/198 (28%), Gaps = 39/198 (19%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A+ +I+++ D R+ F+ + R + + Sbjct: 174 LKRAVNFIIQNLGPSD------RLSIVTFSSSARRILPLRT-MSGSGREDAISVVNSLSA 226 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T I ++ + + I+LL+DG +TQ + + Sbjct: 227 TGGTNIVAGLRKGVRVLEERRQHNSVAS---------IILLSDGCDTQSHSTHNRL-EYL 276 Query: 394 K-------SQG-------IRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHEL 437 K + G I T F ++ + S +F S L Sbjct: 277 KLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAA-----MHAISDVSGGTFSFIESIDIL 331 Query: 438 NKIFRDRIGNEIFERVIR 455 F IG + V R Sbjct: 332 QDAFARCIGG-LTSIVAR 348 >gi|77464595|ref|YP_354099.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389013|gb|ABA80198.1| Putative membrane protein with von Willebrand (VWA) domain [Rhodobacter sphaeroides 2.4.1] Length = 651 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 44/393 (11%), Positives = 103/393 (26%), Gaps = 27/393 (6%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + P + + + A+ + E + + Sbjct: 102 LAEPPQTAPDAPEADARLAAAPDAGGGAETAGAPVPAEPRARSAEGAAPQTFAADEAMPM 161 Query: 124 PRKSAYQVVLS---SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSI 179 A + LS + P ++ AE + S + +I Sbjct: 162 AAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTFSIDVDTASYAI 221 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + + +N F + + +P + + Sbjct: 222 LRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSVTRTPWNPETQLVHVA 281 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 L + P L+ +D+S K L++ + ++ ++ D V Sbjct: 282 LQGRMPAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVA------I 335 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + + + R+ + + + GSTA + + AY T Sbjct: 336 VTYAGSAGEVLAPT---AANQRSTILSALDRLDAGGSTAGEEGLALAYRTASEMAGAGEV 392 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKAR 416 +VL TDG+ + + + GI + + F + Sbjct: 393 TR---------VVLATDGDFNLGISDPEELARLVAHERDTGIYLSVLGFGRGNLDDATMQ 443 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 N +S +E K+ D++ + Sbjct: 444 ALAQN--GNGQAAYIDSLNEAQKVLVDQLSGAL 474 >gi|119358219|ref|YP_912863.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355568|gb|ABL66439.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 337 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 57/193 (29%), Gaps = 34/193 (17%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I R F + + ++ + + TA + A+ A Sbjct: 121 ISKGLRDGRQALLLFAGSPLVQCPMTTDHAAF-EALLGMVSTELVSDQGTAFDSALNLAM 179 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + ++ +K IVLL+DGEN N A+ + K G+ + TI Sbjct: 180 RLFERTEP---PGDVKEVQGEKVIVLLSDGENHSGNFR--AVADALKQSGVSVFTIVLGK 234 Query: 408 NKTQQEK--------------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 + + A S +FF+A+ + DR+ Sbjct: 235 PLPAAIPLGQSSGVKKDAAGKIVKTRSSPETMRRLAGDSGGTFFDASE----DDAVYDRV 290 Query: 446 GNEIFERV--IRI 456 I V R+ Sbjct: 291 AERISTLVSTSRL 303 >gi|73950493|ref|XP_544451.2| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Canis familiaris] Length = 562 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/248 (10%), Positives = 65/248 (26%), Gaps = 35/248 (14%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-------SLRHVI 268 + + + Y +C + + + S Sbjct: 184 TGDHDCEQVCLSSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRP 243 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + LV+ + ++ ++ D + ++ V + G + I Sbjct: 244 ENFELVKKFINQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFHTKKDIKAAVR 298 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T A++ D + + +K ++ TDG + + Sbjct: 299 NMSYMEKGTMTGAALKYLIDNSFTVSSGARPGA------QKVGIVFTDGRSQDYIND--- 349 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRI 445 KAK G ++ + L AS +F +N+ I Sbjct: 350 AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFKTINQ-----I 398 Query: 446 GNEIFERV 453 G + +++ Sbjct: 399 GKRLQKKI 406 >gi|218515577|ref|ZP_03512417.1| hypothetical protein Retl8_18742 [Rhizobium etli 8C-3] Length = 54 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 14/50 (28%), Positives = 26/50 (52%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYE 50 M+ + ++++ G+F I+TA+L PV+LG GM + V + Sbjct: 1 MMSTSFLHPCLRRMLSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSK 50 >gi|326500988|dbj|BAJ98725.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326527981|dbj|BAJ89042.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 54/203 (26%), Gaps = 37/203 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI ++ D R+ F+ + + + + Sbjct: 226 KLALLKQAMRFVIDNLGPDD------RLSVVSFSSEARRLTRLA-RMSDAGKALSVNAVE 278 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN---------- 379 G T I + ++TA + +VLL+DG++ Sbjct: 279 SLVARGGTNIAEGLRTAAKVLDERQHRNAVSS---------VVLLSDGQDTYTMMRRRGP 329 Query: 380 ------TQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 + + I T F + A+ +F Sbjct: 330 SGVHAGNYEELVPPSFARTGADGDWSAPIHTFGFGNDHDAAAMHVIAE---ATGGTFSFI 386 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + + F IG + V Sbjct: 387 ENEAVIQDAFAQCIGGLLSVVVQ 409 >gi|163788218|ref|ZP_02182664.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] gi|159876538|gb|EDP70596.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] Length = 688 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 43/363 (11%), Positives = 110/363 (30%), Gaps = 26/363 (7%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 I++ + E + + V + T++ + + V+ + + + + + Sbjct: 170 IQKGYNQLIEEIKQDDIEHVNVYKSAKATSMFGDSARDGCIVITTKSGNYRIQYDESYAQ 229 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + T S K +++ +I+ ++ + +N F Sbjct: 230 ITENDFERTNL-SPLSTFSIDVDKASYSNVRRMINN-GQVIASDAVKIEEMINYFNYNYP 287 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + + D L Y P+ + +D S Sbjct: 288 QPTDDHP-FSINTEVTDTPWHNKTQLVRIGLQGKSYADKDLPASNLTFLIDVSGSMSSHN 346 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A ++ +++ D V+ + + G + + + Sbjct: 347 KLPLLKSAFKLLVNQLREKDKVS------IVVYAGAAGVVLEPTSG---NNKEKIISALN 397 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEG 386 + GSTA ++ AY E + NN ++L TDG+ + + Sbjct: 398 NLQSGGSTAGGAGIKLAYKL----AEKNFKKKGNNR-----VILATDGDFNVGASSDNDM 448 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + + G+ + + F + K + ++ E K+F G Sbjct: 449 KTLIEEKRKSGVFLSVLGFGYGNYKDSKLETLADK--GNGNHAYIDNMQEAQKVFGKEFG 506 Query: 447 NEI 449 + Sbjct: 507 GTL 509 >gi|225024147|ref|ZP_03713339.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] gi|224943172|gb|EEG24381.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] Length = 573 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 37/295 (12%), Positives = 80/295 (27%), Gaps = 24/295 (8%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + T S +I+ + + + + +N F + Sbjct: 119 AEQPVSTFSIDVDTGSYANIRRFLTQTGRLPPADAVRIEEIINYFDYGYAKPTDG-KPFA 177 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 D + P + VD S + K +V+ L Sbjct: 178 VHTETVDSPFRSGAKLIRIGIQAKEVSQAALPPANLVFLVDVSGSMYSRDKLPMVKYTLC 237 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ + D R+ + D + G R + GSTA Sbjct: 238 TLAHQTRAQD------RITLVTYADGNKVVLPPTPG---NQRQKILAALDSLKAGGSTAG 288 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQ 396 +A+Q + + I+L TDG+ + ++ + + Sbjct: 289 ENAIQ---------QAYQAAQRAYIRNGINRILLATDGDFNVGITDFNTLRSMVAEKRKS 339 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 GI + T+ F + A ++ +S E K+ ++ + + Sbjct: 340 GISLTTLGFGSGNYNERLMEQL--ADAGDGNYSYIDSPEEAQKVLHRQLSSTLAT 392 >gi|224004848|ref|XP_002296075.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209586107|gb|ACI64792.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 868 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 49/173 (28%), Gaps = 28/173 (16%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R F++ + + V++ + + G T I A+ A + Sbjct: 171 DDRFALISFSEDAVIEVPM-QKVNERNKQQALHAIDRLSVKGRTNIASAVSLAAQVVNGV 229 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT----------I 403 E + + LLTDG E I + I I Sbjct: 230 AE---------PNKVRSVFLLTDGNANTGYTEAIDLVKL-----TSIFVEANRNPHTPPI 275 Query: 404 AFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + E + L A S SF+ +++ F D IG + + Sbjct: 276 SLHTFGYGPEPDQKLLRGMAMATSGGSFYSVRDNSQVSSAFGDAIGGILSLAL 328 >gi|21221175|ref|NP_626954.1| lipoprotein [Streptomyces coelicolor A3(2)] gi|6969217|emb|CAB75310.1| putative lipoprotein [Streptomyces coelicolor A3(2)] Length = 532 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 38/304 (12%), Positives = 82/304 (26%), Gaps = 26/304 (8%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 S I + + + D + +N F Q DR Sbjct: 85 NGDSREIAPDPDHLSTFALDVDTASYGYARRTLSEGRLPDPATVRPEEFVNSFRQDYDRP 144 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G R + + + P + +D S + Sbjct: 145 -DGDGFSVTVDGARTD-DEDWSLVRVGLATRPAERQSERPPAALTFVIDISGSMGEPGRL 202 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 L ++AL ++ ++ D+V F+D + + R V Sbjct: 203 DLAQEALGTMTDRLRDDDSVA------LVTFSDEAETVLPMTRLGDH--RGRVHDAIDGL 254 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 ST + M+T Y+T + + +VL++D + + I Sbjct: 255 EPTDSTNLGAGMETGYETAVEGRREGATNR---------VVLVSDALANTGDTDADTILE 305 Query: 392 KA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + GI + + + R ++T + ++F +++ Sbjct: 306 RIATERREHGITLFGVGVGSDYGDALMERLADK---GDGHTTYVSTTEDAREVFSEQLPR 362 Query: 448 EIFE 451 I Sbjct: 363 HIEL 366 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/375 (10%), Positives = 92/375 (24%), Gaps = 26/375 (6%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L S+ + + L ++ V + + Sbjct: 27 LSLSVITACAPQSKITPASDKASTTTLETAEQSIQADAAAPVVVMATPAMAESRQLSKMT 86 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 ++ + P + + + T +A + Sbjct: 87 TNARIMPPPSQGYMAPKQQENYAEIEPNAVNATSEQAFATLSIDTDTGSYANVRRFLNQG 146 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSS-QNGKVGIRDEKLSPYMVSCNKSLYYML-- 244 + + +N F K ++ + + P + Sbjct: 147 QLPPKDAVRVEELINYFNYDFTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLL 206 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P + VD S K L + +L + + ++ D++ + Sbjct: 207 TAKQKQPPANLVFLVDVSGSMDTEDKLQLAKSSLKMLTKQLRAQDSIT------LITYAG 260 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + G + + GST A++ AY + H K+ Sbjct: 261 NTKVVLPSTPG---NQTQKILNAIDNLTASGSTNGEAAIKLAYQ------QATEHFKKDG 311 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSN 421 + I++LTDG+ + + +GI + T+ F N Sbjct: 312 INR---ILMLTDGDFNVGVSSVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQVADN 368 Query: 422 CASPNSFFEANSTHE 436 ++ +S E Sbjct: 369 --GNGNYSYIDSLSE 381 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 29/281 (10%), Positives = 77/281 (27%), Gaps = 36/281 (12%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS-QNGKVGIRDEKLSPYMVSCNKSLYY 242 + + + + L Q A ++ + + Y +C++ Sbjct: 197 PQDEHVDYVESYNVIEKLAKKFQEAFCVSDLCATGDHYCEQVCVSSPGSYTCACHEGFTL 256 Query: 243 MLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + + S + LV+ + ++ ++ D + Sbjct: 257 NSDGKTCNVCRGGGNGSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRLAQVG 316 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 ++ + + G + I T A++ D + + Sbjct: 317 ---LVQYSSSIRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSS 371 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +K ++ TDG + + KAK G ++ + Sbjct: 372 GARPGA------QKVGIVFTDGRSQDYIND---AARKAKDLGFKMFAVGVG------NAV 416 Query: 416 RYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFERV 453 L AS + +F +N+ IG ++ +++ Sbjct: 417 EEELREIASEPVADHYFYTADFKTINQ-----IGKKLQKKI 452 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 55/167 (32%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 IK +D + R+G + V + K + T A+Q Sbjct: 70 IKSLDVGPNATRVGLVNYASTVKPEFPLRAHTSKASLLQAVHRIQPLST--GTMTGLALQ 127 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A I + D + + K ++++TDG QD+ ++ +A++ GI + I Sbjct: 128 FA---ITKALSDAEGGRSRSSDISKVVIVVTDGRP-QDSVRDVS--ERARASGIELFAIG 181 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L K F++ Sbjct: 182 VGR------VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQEAF 222 >gi|332710564|ref|ZP_08430509.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] gi|332350619|gb|EGJ30214.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] Length = 579 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 48/396 (12%), Positives = 122/396 (30%), Gaps = 27/396 (6%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 +++ + + + ++ +++ + P + + + +NF N R + Sbjct: 22 ESSSLPVAKDV--ERDQRQRQSELMESAPLGRSADSVNQNFRNRGLTPRASRRLSHSPNA 79 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG- 176 + A + + + S I + + + + Sbjct: 80 RFKSVPSSSIASKPKPLLTAEQAIPEASNRENYSAIDENAFKRVKHNPLSTFAIDVDTAS 139 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S + + + +N F S + I +P Sbjct: 140 YSNLRRFLNNGQLPPTNAIRIEELINYFNYDYPEPE-SDRPFSITTEISQAPWNPTHQLV 198 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + P + +D S + K L++DA ++ +++ D V+ Sbjct: 199 HIGIQGEKMAIEDLPPSNLVFLLDVSGSMNTPNKLPLLKDAFRMLVNELREEDQVS---- 254 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + G + + T + N GSTA ++ AY +D Sbjct: 255 --IVVYAGAAGVVLPPTPGNE---KDKILTAIENLNAGGSTAGGAGIKLAYKL----AQD 305 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + NN ++L TDG+ + E + + + +++G+ + + F Q Sbjct: 306 NFIKSGNNR-----VILATDGDFNVGVSSDTELVKLIEQKRNKGVFLTVLGFGSGNLQDS 360 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 K + ++ ++ E K+ + IG + Sbjct: 361 KMEKIANK--GNGNYAYIDNELEAKKVLVNEIGATL 394 >gi|226306560|ref|YP_002766520.1| hypothetical protein RER_30730 [Rhodococcus erythropolis PR4] gi|226185677|dbj|BAH33781.1| conserved hypothetical membrane protein [Rhodococcus erythropolis PR4] Length = 326 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 70/203 (34%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S + + +G F S + K Sbjct: 109 TRLAAAQEAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNRDAT-----KVAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ T+ +I + ++ IVLL+DG EN+ D Sbjct: 158 DNLKLSERTATGEAIFTSLQSIDT---LSAVLGGSDQAPPARIVLLSDGKQTVPENSDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 G +AK +G+ I TI+F + E L A S SFF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTAS 274 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL +++ D + +I R Sbjct: 275 SLEELRQVY-DTLEEQIGFETTR 296 >gi|42407700|dbj|BAD08848.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408122|dbj|BAD09262.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|125602049|gb|EAZ41374.1| hypothetical protein OsJ_25891 [Oryza sativa Japonica Group] Length = 704 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 57/199 (28%), Gaps = 34/199 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI + D R+ F+ + + + K+ Sbjct: 247 KLTLLKRAMGFVIDKLGPGD------RLAVVSFSYNAQRVIRLT-RMSDDGKASAKSAME 299 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I + A + A ++LL+DG++T + G Sbjct: 300 SLAAGGGTNILKGLVEAAKVFD---------GRRYRNAVASVILLSDGQDTYNVNGGWG- 349 Query: 390 CNKAK--------------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + +K + + + T F + + +F + Sbjct: 350 ASNSKNYSVLVPPSFKRSGDRRLSVHTFGFGTDHDAVAMNAIAEE---TGGTFSFIENQA 406 Query: 436 ELNKIFRDRIGNEIFERVI 454 + F IG + V Sbjct: 407 VVQDAFAQCIGGLLSVTVQ 425 >gi|229493542|ref|ZP_04387327.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229319503|gb|EEN85339.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 326 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 70/203 (34%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S + + +G F S + K Sbjct: 109 TRLAAAQEAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNRDAT-----KVAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ T+ +I + ++ IVLL+DG EN+ D Sbjct: 158 DNLKLSERTATGEAIFTSLQSIDT---LSAVLGGSDQAPPARIVLLSDGKQTVPENSDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 G +AK +G+ I TI+F + E L A S SFF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTAS 274 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL +++ D + +I R Sbjct: 275 SLEELRQVY-DTLEEQIGFETTR 296 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 59/196 (30%), Gaps = 25/196 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + + + + +G F S Sbjct: 96 SNSMAATDIAPNRLEAAKQGAEAFVDQL------PPRINLGLVSFAGSATVLVPASTDRE 149 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + +TAI + + + I ++ + IVLL+DG Sbjct: 150 SVR-----AGIRGLQLGPATAIGEGIFASLQAINTAGK--RFSDAGQSPPPAAIVLLSDG 202 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPN 426 E T+ A + A+ + + TIA+ + + L+ A + Sbjct: 203 ETTRGRPNTQAT-DAARQAHVPVDTIAYGTSDGTLDVGGQEVPVPVNEQALNEIADQTEG 261 Query: 427 SFFEANSTHELNKIFR 442 S+ A + EL +++ Sbjct: 262 SYHRAATGDELRSVYK 277 >gi|229588184|ref|YP_002870303.1| hypothetical protein PFLU0636 [Pseudomonas fluorescens SBW25] gi|229360050|emb|CAY46904.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 651 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 46/181 (25%), Gaps = 1/181 (0%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L G ++ A + V L ++VD R + L+ A T+ + A+ Sbjct: 4 RLRSRQRGAIGLMAAGTLAVALVFMLLVVDSGRLYMEKRKLQSIADTSALEAAGRGGLCS 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ + + + L + A +VV Sbjct: 64 PTTTANDYAKENATRNGFT-VVAGDNSRGLVVTCGLLTTNANSVRVFTPDATKNDAVRVV 122 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + S S + A A + + S + Sbjct: 123 ATRSVMTSIATGIWSMFSGAPVSSQMMLSATAVAAYAPPVAQLTIRSSLAAVNSANSPLL 182 Query: 193 Q 193 Sbjct: 183 N 183 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 68/198 (34%), Gaps = 17/198 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + ++ + + + K ++ +D + IK V TV + Sbjct: 37 NNTAILLDASGSMAKRIDGVSKYNMAKDEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKV 96 Query: 309 DPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 S +G + + G T I A++ A + + Sbjct: 97 QSCESIRGVYGFQRFDKQSFLNSLNGIGPTGWTPIAKALEDAKASFN----------GVH 146 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K + LLTDGE T + I + + Q I++ I F N+ + A Sbjct: 147 KLGSKSVYLLTDGEETCG-GDPIKTAKELRKQHIKVNVIGFDFNEGFNGQLHAI--AGAG 203 Query: 425 PNSFFEANSTHELNKIFR 442 ++EA+S ++N+IF+ Sbjct: 204 GGKYYEAHSQKDMNRIFK 221 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 68/219 (31%), Gaps = 39/219 (17%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + + +++ + +I+S+ D G F + S Sbjct: 95 DDSGSMAANDYQPTRLEAAKESASVLIKSLDPKDY------AGVVIFESGATTAAYLSPD 148 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 +++ A E + G TA+ D + A D + KK +VLL+ Sbjct: 149 ----KDRVMEKTAAIEQKNGQTALGDGLALAVDM-----------ADSIPNQKKVVVLLS 193 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV------------NKTQQEKARYFLSNCA 423 DG A +++ T+ L + A Sbjct: 194 DGVGNAGVISPEDATAFAAQNKVQVFTVGLGSKSPVLLGTDPTGTPQYATLDEAALQSIA 253 Query: 424 --SPNSFFEANSTHELNKIF----RDRIGNEIFERVIRI 456 + +++ + L++I+ ++ + + + I Sbjct: 254 EKTGGTYYTSVDEQTLHQIYAGLNKEIVREKEETSIQPI 292 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +++ + + + ++ L + I R G F D F+ + Sbjct: 114 TTADGSTLTRLNAAKNVLKTFIA-------KRSGDRFGLILFGDAAFIQTPFTADQQVWL 166 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + A ST + DA+ + + ++ +++LTDG +T Sbjct: 167 S--LLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE---------QQVMIVLTDGNDT 215 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCA--SPNSFFEANSTHE 436 E + A ++GI+I TIA +Q + + + F A E Sbjct: 216 GSFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAIDQAE 275 Query: 437 LNKIFR 442 L+K ++ Sbjct: 276 LDKAYQ 281 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 53/144 (36%), Gaps = 13/144 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + D A+ + + + R+G F R + + + + + + Sbjct: 7 SKNAVPWDWNAAAKAVLSDFLDRREGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVV 66 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + + + ++ +VLLTDG NT + Sbjct: 67 --GLAGRETAIGDAIALSVKRLREQKQG-----------QRVVVLLTDGVNTAGVLNPLK 113 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 AK++G+R+ TIAF + Sbjct: 114 AAELAKAEGVRVHTIAFGGSGGDS 137 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 73/218 (33%), Gaps = 25/218 (11%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 Y P++ +DSS + + +A+ + + N ++D Sbjct: 28 YASTSSPNVDVVFVLDSSGSMKQSDPEEIRTEAIKMFLDMGQVQGNKAG-----LVAYSD 82 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 +I + + + + ++ A + T I + + S ++ Sbjct: 83 TIIKEHNLDAINSESDKERIEEMASNIPLGQKTDIGRGLLEGAKVLDSGHDSNNRP---- 138 Query: 365 LEAKKYIVLLTDGENTQDNE------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 I+LL+DG+N + + + KS+G + TI + + + Sbjct: 139 -----LIILLSDGKNDSQRSASESLKDLNSAISTCKSKGYPVYTIGLNYDG---TVDKAQ 190 Query: 419 LSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVI 454 L+ AS + N +L I +D G+ +V Sbjct: 191 LTQIASETKGKNYITNKASDLTDILKDIYGDSANVKVQ 228 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 22/144 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A+ ++ + R+G F + ++ ++ Sbjct: 115 SRLSRSKYAILDFVQQ-------HGRGRVGLVAFAGQAFLQCPLTFDYDAFRDALLAIDE 167 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +G T I A+ AY + ++ K +VL+TDGE+ + I Sbjct: 168 QTI-PVGGTDIGRALDEAYRAMEKNDR------------HKILVLITDGEDLEKAG--IK 212 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 +GI + TI Sbjct: 213 TAQALAEKGIVVYTIGVGTAAGSP 236 >gi|145552898|ref|XP_001462124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429962|emb|CAK94751.1| unnamed protein product [Paramecium tetraurelia] Length = 533 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 47/161 (29%), Gaps = 20/161 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F+ +V + + G T I + M+ A + Sbjct: 157 DRLCLIMFDCKVYRLTRLMRVTQE-NVQKFRVAISSLQARGGTDIGNGMKMALSIL---- 211 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA--ICNKAKSQGIRIMTIAFSVNKTQQ 412 + + I LL+DG + E I + I T F + Sbjct: 212 -----KHRKYKNPVSAIFLLSDGVDEGAEERVRDDLIQYNIRDS-FTIKTFGFG-----R 260 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 + +S A F+ + +++ F + +G + Sbjct: 261 DCCPKIMSEIAHYKEGQFYFVPNLTNIDECFAEALGGLVSV 301 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 59/183 (32%), Gaps = 25/183 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S + LV++A + R++ D +G F++ Sbjct: 534 IDKSGSMSSGDRLDLVKEAARATARTLDPSD------EIGVIAFDNSPQVLVRL---QPA 584 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 R + + + G T A++ AY + S K+++LL+DGE Sbjct: 585 ANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKALV-----------KHVILLSDGE 633 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 A+ + I + ++ + R +F + ++ Sbjct: 634 --SPENGINALLGDMRQSDITVSSVGVGDGAGKDFLIRVAER---GRGRYFYSEDGTDVP 688 Query: 439 KIF 441 +IF Sbjct: 689 RIF 691 >gi|323650180|gb|ADX97176.1| inter-alpha (globulin) inhibitor h4 isoform 1 [Perca flavescens] Length = 354 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 53/165 (32%), Gaps = 12/165 (7%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + G F+ + +K KTFA + + G+T IN A+ + + Sbjct: 113 EDDHFGLITFDSSIFHWKRELVQANKKNLESAKTFARNIQDRGTTDINAAVLEGARMLNA 172 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNK 409 + I+LLTDG+ T I + + + + + F + Sbjct: 173 HPREGSAS---------IIILLTDGDPTSGETNLERIQSNVRRDIADKFPLYCLGFGHDV 223 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + L N +E + K F D + + V Sbjct: 224 NFEFLEKMSLQNNGVARRIYEDSDADLQLKGFYDEVATPLLTDVT 268 >gi|238913524|ref|ZP_04657361.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 596 Score = 61.2 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 46/399 (11%), Positives = 106/399 (26%), Gaps = 35/399 (8%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + + + + + + + + TA Sbjct: 32 AELKQEQAIQKENSAQAGDDTVQKRQAEAAQQAAKKAAEYKAKAEAKALADAKAGSLATA 91 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVS 178 + S + + R I+ ++A + S + + Sbjct: 92 EAPQHEMRT-RASASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYA 150 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN----------GKVGIRDEK 228 + + + LN F P K +++ K + Sbjct: 151 NVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSP 210 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + P + +D+S ++ L+R AL ++ ++ Sbjct: 211 WNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQ 270 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ G T T +K + + GST ++ AY+ Sbjct: 271 DNITIVTYAGGTHVALAS---------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE 321 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAF 405 + + I+L TDG+ + I K + +GI + T+ Sbjct: 322 --QAEKGFIKGGVNR-------ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGV 372 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++ +S E K+ +D Sbjct: 373 GDDNFNEAMMVRI--ADVGNGNYSYIDSLSEAQKVLKDE 409 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 42/185 (22%), Positives = 62/185 (33%), Gaps = 29/185 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + I K + + A A +I ++ D +G F+ FS + + Sbjct: 420 DTEDAGISKLEIAKSASAKMIEHLESSD------GVGVIAFDHNYYWAYEFS---KLVRK 470 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V G TAI + A +K + K IVLLTDG Q Sbjct: 471 KDVIESISSIEVGGGTAIIPPLSEAVK-----------TLKKSKAKSKLIVLLTDGMGEQ 519 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNK 439 E N+AK I+I TI LS AS F+ ++ +EL Sbjct: 520 GGYEIP--ANEAKRNNIKITTIGVGKFVNLP-----VLSWIASFTSGRFYLVSNPYELVD 572 Query: 440 IFRDR 444 +F Sbjct: 573 VFLKE 577 >gi|168229682|ref|ZP_02654740.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468558|ref|ZP_03074542.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194454922|gb|EDX43761.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205335570|gb|EDZ22334.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 596 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 46/399 (11%), Positives = 106/399 (26%), Gaps = 35/399 (8%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + + + + + + + + TA Sbjct: 32 AELKQEQAIQKENSAQAGDDTVQKRQAEAAQQAAKKAAEYKAKAEAKALADAKAGSLATA 91 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVS 178 + S + + R I+ ++A + S + + Sbjct: 92 EAPQHEMRT-RASASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYA 150 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN----------GKVGIRDEK 228 + + + LN F P K +++ K + Sbjct: 151 NVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSP 210 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + P + +D+S ++ L+R AL ++ ++ Sbjct: 211 WNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQ 270 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ G T T +K + + GST ++ AY+ Sbjct: 271 DNITIVTYAGGTHVALAS---------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE 321 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAF 405 + + I+L TDG+ + I K + +GI + T+ Sbjct: 322 --QAEKGFIKGGVNR-------ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGV 372 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++ +S E K+ +D Sbjct: 373 GDDNFNEAMMVRI--ADVGNGNYSYIDSLSEAQKVLKDE 409 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 52/154 (33%), Gaps = 21/154 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F DR + V + R I +TAI D + A + Sbjct: 131 DRVGLVVFGDRAYVAAPQTHDVASVARLIDGLQI--GVSGKATAIADGLGLAIRRLR--- 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VN 408 + I+LL+DG++T + +A A+ G+R+ TIA Sbjct: 186 --------ERDAKSRVILLLSDGQDTTGMVDPVAAAQTARDLGMRVYTIALGPADLSDDP 237 Query: 409 KTQQEKARYFLSNC--ASPNSFFEANSTHELNKI 440 + L A+ F +T +L + Sbjct: 238 GARDAVDADTLRRIAQAAGGETFRVRTTDDLQAV 271 >gi|62180885|ref|YP_217302.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128518|gb|AAX66221.1| putative von Willebrand factor, vWF type A domain [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715363|gb|EFZ06934.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 593 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 37/318 (11%), Positives = 91/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 170 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 171 NYFPAPQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQTRD 228 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L+R AL ++ ++ DN+ G T Sbjct: 229 LPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 285 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + Sbjct: 286 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGANR------ 331 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + I K + +GI + T+ + + Sbjct: 332 -ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 388 Query: 427 SFFEANSTHELNKIFRDR 444 ++ +S E K+ +D Sbjct: 389 NYSYIDSLSEAQKVLKDE 406 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 60/194 (30%), Gaps = 29/194 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV+ L ++ + D R+ F++ K + +K+ Sbjct: 105 KIQLVQTTLNFMVERLSPAD------RICLISFSNDATKISRLVQMSPKGKKQ-LKSMIP 157 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I ++ + R + + I+LL+DG++ Sbjct: 158 RLVASGGTNIVGGLEYGLQAL---------RQRRTINQLSSIILLSDGQDNNGTTVL--- 205 Query: 390 CNKAKS--QGIRI---MTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFR 442 +AK+ I I ++ L+ A P +F+ + F Sbjct: 206 -QRAKATMDSIVIRDDYSV--HTFGYGHGHDSTLLNALAEPKNGAFYYVKDEETIATAFA 262 Query: 443 DRIGNEIFERVIRI 456 + +G + +I Sbjct: 263 NCLGELMSVVADQI 276 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 53/177 (29%), Gaps = 21/177 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDN----VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + +DA + I A D + + Sbjct: 19 TRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGNSEKM 78 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G T I ++++ A + ++ I+L++DG T Sbjct: 79 IAHMDGLQPRGFTPIGESLRKAAAELPKEG-------------QRSIILVSDGVATCTPP 125 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + K QGI I T+ F+V Q++ + A+ ++ A+ L K Sbjct: 126 PVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIAD--ATGGTYANASDADSLAK 180 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 62/216 (28%), Gaps = 41/216 (18%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 V + + + R AL + D + + Sbjct: 105 DVVWTPGAPSVSRLEAARRALKLFLAGGAAPD-GTAFDPRPGDAVGLVAFAAVPETVCPA 163 Query: 318 KLIRTIVKTFAIDENEMGS----TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 L +++ A G T I D++ A + ++++ + ++L Sbjct: 164 TLNHSVLFKVADALQPKGGADAGTNIGDSLAEAVIRLDAADQKS-----------RVLIL 212 Query: 374 LTDGENT--------------QDNEEGIAICNKAKSQGIRIMTI------AFSVNKTQQE 413 L+DGE+ + A + G+R+ TI Sbjct: 213 LSDGEHNILKEDVRDAQRPGIDRTLKPREAAQLAANLGVRVYTIDAGGDPPLGAPPDAVA 272 Query: 414 K---ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + R L + A + F+A S EL +R+ Sbjct: 273 QRFAGRKALKDVAEMTGGKSFQATSGAELLSAYREI 308 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 67/208 (32%), Gaps = 17/208 (8%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ + + + K ++ ++ + IK V TV + Sbjct: 33 SFKNNTAILLDASGSMAKRIDGVSKYNMAKEEIVRFADQIKSKSQVRMTVFGSEGNNKNS 92 Query: 306 VISDPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 S +G + R G T I A++ A + Sbjct: 93 GKVQSCESIRGVYGFQRFDRQSFLNSLNGIGPTGWTPIAKALEDAKASFT---------- 142 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + K + LLTDGE T + + + + Q I++ I F + + Sbjct: 143 GLHKLGSKSVFLLTDGEETCG-GDPVKTAKELRKQHIKVNVIGFDFKEGFNGQLHEIAK- 200 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEI 449 A ++EA+S ++N+IF + Sbjct: 201 -AGGGKYYEAHSQKDMNRIFTMAASSLA 227 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 71/203 (34%), Gaps = 31/203 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + ++A + + D +G F S L + Sbjct: 104 DPDRLTAAKEAGRRFVDGL------PDEFNVGLVAFAGSAAVLVPPSTDREAL-HDGIGR 156 Query: 327 FAIDENEMGSTAINDAMQT---AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A + TAI +A+ T A ++ ++ + + I++L+DG NT Sbjct: 157 LAEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPPPAR--------IIILSDGANTSG- 207 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSFFEAN 432 + + ++A + + + TI+F ++ L A + F EA+ Sbjct: 208 MDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGGGFHEAS 267 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 +T EL ++ D IG + R R Sbjct: 268 TTAELKDVYED-IGTSVGYRTER 289 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 71/203 (34%), Gaps = 31/203 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + ++A + + D +G F S L + Sbjct: 104 DPDRLTAAKEAGRRFVDGL------PDEFNVGLVAFAGSAAVLVPPSTDREAL-HDGIGR 156 Query: 327 FAIDENEMGSTAINDAMQT---AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 A + TAI +A+ T A ++ ++ + + I++L+DG NT Sbjct: 157 LAEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPPPAR--------IIILSDGANTSG- 207 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSFFEAN 432 + + ++A + + + TI+F ++ L A + F EA+ Sbjct: 208 MDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGGGFHEAS 267 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 +T EL ++ D IG + R R Sbjct: 268 TTAELKDVYED-IGTSVGYRTER 289 >gi|256787646|ref|ZP_05526077.1| lipoprotein [Streptomyces lividans TK24] gi|289771539|ref|ZP_06530917.1| lipoprotein [Streptomyces lividans TK24] gi|289701738|gb|EFD69167.1| lipoprotein [Streptomyces lividans TK24] Length = 532 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 38/304 (12%), Positives = 82/304 (26%), Gaps = 26/304 (8%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 S I + + + D + +N F Q DR Sbjct: 85 NGDSREIAPDPDHLSTFALDVDTASYGYARRTLSEGRLPDPATVRPEEFVNSFRQDYDRP 144 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G R + + + P + +D S + Sbjct: 145 -DGDGFAVTVDGARTD-DEDWSLVRVGLATRPAERQSERPPAALTFVIDISGSMGEPGRL 202 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 L ++AL ++ ++ D+V F+D + + R V Sbjct: 203 DLAQEALGTMTDRLRDDDSVA------LVTFSDEAETVLPMTRLGDH--RGRVHDAIDGL 254 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 ST + M+T Y+T + + +VL++D + + I Sbjct: 255 EPTDSTNLGAGMETGYETAVEGRREGATNR---------VVLVSDALANTGDTDADTILE 305 Query: 392 KA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + GI + + + R ++T + ++F +++ Sbjct: 306 RIATERREHGITLFGVGVGSDYGDALMERLADK---GDGHTTYVSTTEDAREVFSEQLPR 362 Query: 448 EIFE 451 I Sbjct: 363 HIEL 366 >gi|22125371|ref|NP_668794.1| hypothetical protein y1474 [Yersinia pestis KIM 10] gi|45442407|ref|NP_993946.1| hypothetical protein YP_2631 [Yersinia pestis biovar Microtus str. 91001] gi|149365130|ref|ZP_01887165.1| putative membrane protein [Yersinia pestis CA88-4125] gi|218930054|ref|YP_002347929.1| hypothetical protein YPO3007 [Yersinia pestis CO92] gi|21958254|gb|AAM85045.1|AE013750_5 hypothetical [Yersinia pestis KIM 10] gi|45437272|gb|AAS62823.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|115348665|emb|CAL21610.1| putative membrane protein [Yersinia pestis CO92] gi|149291543|gb|EDM41617.1| putative membrane protein [Yersinia pestis CA88-4125] Length = 509 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 35/316 (11%), Positives = 85/316 (26%), Gaps = 29/316 (9%) Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 +N LR + + ++ A + + +I + + Sbjct: 22 MNNRGTTLRVLHRGTIVLLKPAILIKTDMLIKPDILIKPATLIKQAARIKQM-TLLLLLF 80 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 L A + + ++ S SL ++ +D Sbjct: 81 LFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPINLALVID 140 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S+ + R+A + + D ++ +++ + Sbjct: 141 RSTSMSGE-RIEKAREAAILAVNMLNITDTLS------VVAYDNHAEVIIPATKVTD--K 191 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ + + G TA+ + + N E I+L++DG+ Sbjct: 192 PALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIILISDGQAN 242 Query: 381 QDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 + A +GI I TI + ++ A S + ++ Sbjct: 243 TGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNHTFVANSA 297 Query: 436 ELNKIFRDRIGNEIFE 451 +L K F + + Sbjct: 298 DLEKAFTKEFQDVMSV 313 >gi|284990593|ref|YP_003409147.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284063838|gb|ADB74776.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 318 Score = 61.2 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 39/142 (27%), Positives = 58/142 (40%), Gaps = 15/142 (10%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R V T + STAI +A+ T+ I N IVLL+DG N Sbjct: 146 DRGQVSTAIDNLELAESTAIGEAVFTSLTAI--ENFQSSLDADGEEVPPARIVLLSDGYN 203 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSF 428 T + A+ + A GI + TIAF + + R L A + S+ Sbjct: 204 TVGRPDTQAV-SAALDAGIPVSTIAFGTDYGTLDLDGERVPVPVDRATLEEIADQTGGSY 262 Query: 429 FEANSTHELNKIFRDRIGNEIF 450 EA S EL ++++D G++I Sbjct: 263 SEAASAAELEQVYQDL-GSQIG 283 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 59/167 (35%), Gaps = 13/167 (7%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +KK D + V+ GA ++D + I+ D GST Sbjct: 830 MIDLVKKADVGKNQVQFGALKYSDFPEVLFNL--NEFSSKSEIISFIQNDHPRGGSTYTA 887 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ + S +HR + ++++TDGE + D A + +GI + Sbjct: 888 KALAHSAHLFSESLGSRMHRG-----VPQVLIVITDGE-SHDAHLLNATARALRDKGILV 941 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + LS S + +F + L IF + + Sbjct: 942 LAVGIEG-----ANHEELLSMAGSTDRYFFVENFEGLKGIFENVSAS 983 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 39/121 (32%), Gaps = 18/121 (14%) Query: 320 IRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +T + + +G T I A+ D S KK++VLLTDG Sbjct: 679 TKTKISDAIDGLSLIGRGTLIGGALTFVSDYFSVSK-------GARPNVKKFLVLLTDG- 730 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHEL 437 + + + G+ I ++ ++ Q L + + F + L Sbjct: 731 --KSQDAVKEAAVALRQDGVIIYSVGVFGSEYSQ------LEEISGRSDMVFYVENFDIL 782 Query: 438 N 438 Sbjct: 783 K 783 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 45/141 (31%), Gaps = 14/141 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + ++ + D V +G F+ + S K+ Sbjct: 1014 FKKMKDFLVTIVNDFDIRPGKVHVGLAQFSHEYRPEFSLIPFRDKIEVKNQIGRIQQIF- 1072 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKN--NLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T I A + + + N ++ +++LTDG Q +E Sbjct: 1073 -GNTLIGAA-------LRNVGSYFWPDFGSRINAGVQQVLLVLTDG---QSQDEVAQAAE 1121 Query: 392 KAKSQGIRIMTIAFSVNKTQQ 412 +++GI I ++ QQ Sbjct: 1122 DLRNKGIDIYSLGVGQVNDQQ 1142 >gi|297665730|ref|XP_002811194.1| PREDICTED: cartilage matrix protein-like [Pongo abelii] Length = 495 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/438 (9%), Positives = 107/438 (24%), Gaps = 54/438 (12%) Query: 39 MLVDVVRWSY------YEHALKQAAQTAIIT--ASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +VD R + L Q ++ + A+ + + + + + Sbjct: 44 FVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHISKAA 103 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR-----KSAYQVVLSSRYDLLLNPLS 145 +R + T ++ + + + V++ + N Sbjct: 104 LLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDGRPQDNVQD 163 Query: 146 LFLRSMGIKSWLIQ-TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 + R+ L + + S S+++ + Sbjct: 164 VSARARASGVELFAIGVGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFCVV 223 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + + + Y +C++ + Sbjct: 224 SDLC------ATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNGQWGGHRLALERVW 277 Query: 265 RHVIKKKHLVRD------ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 R R+ ++ ++ D + ++ V + G Sbjct: 278 RGGDGASATPRNHSXXXXXXNQIVDTLDVSDKLAQVG---LVQYSSSVRQEFPL--GRFH 332 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + I T A++ D + + R +K ++ TDG Sbjct: 333 TKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPRA------QKVGIVFTDGR 386 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTH 435 + + KAK G ++ + L AS +F Sbjct: 387 SQDYIND---AAKKAKDLGFKMFAVGVG------NAVEDELREIASEPVAEHYFYTADFK 437 Query: 436 ELNKIFRDRIGNEIFERV 453 +N+ IG ++ +++ Sbjct: 438 TINQ-----IGKKLQKKI 450 >gi|119896366|ref|YP_931579.1| hypothetical protein azo0074 [Azoarcus sp. BH72] gi|119668779|emb|CAL92692.1| conseved hypothetical exported protein [Azoarcus sp. BH72] Length = 563 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/304 (12%), Positives = 86/304 (28%), Gaps = 29/304 (9%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 +A T S +++ +++ L + + L + + + Sbjct: 104 AEAPVSTFSIDVDTGSYSNLRRMLN--AGQLPPRDAVRVEELVNYFPYRYSLPQGDAPFA 161 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 I +P + + P + VD S + K L+++AL Sbjct: 162 VDTEIAPTPWNPRSLLLRVGIQAADPAKQALPPANLVFLVDVSGSMNSPDKLPLLQNALK 221 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + ++ D R+ + + + + G+TA Sbjct: 222 LFVAQLRPQD------RVALVTYASGTRVVLEPT---AGDRKAAITAAIDGLVPGGATAG 272 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQ 396 + AY + H + I+L TDG+ + ++ + Sbjct: 273 AAGIDLAYRM--AEQGFVEHGINR-------ILLATDGDFNVGITRFETLKDRVAERRKS 323 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE----LNKIFRDRIGNEIFER 452 GI + T+ F + A ++ +S E L F + + Sbjct: 324 GIALSTLGFGGGNYNDQLMEQL--ADAGDGAYRYIDSLAEAQKVLVDEFTSTLATVASDV 381 Query: 453 VIRI 456 I++ Sbjct: 382 KIQL 385 >gi|89094518|ref|ZP_01167457.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] gi|89081254|gb|EAR60487.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] Length = 445 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 53/173 (30%), Gaps = 24/173 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I I+ ++ + ++ RV + + G+TA+ + Sbjct: 95 IMAINRLSQNDIVSVVSYDSRVNVVVPAT---KVSDTNTIARAINRIQANGNTALFAGVS 151 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----CNKAKSQGIRI 400 + + + ++LL+DG + + AK G+ + Sbjct: 152 KGANELRKFLDLNKVNR---------VILLSDGLANIGPSTPNELGKLGLSLAKE-GMSV 201 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 TI + ++ A S + + +L ++F+ G+ + Sbjct: 202 TTIGLGLGYN-----EDLMTQLAGFSDGNHAFVENADDLARVFQYEFGDVLSV 249 >gi|126463435|ref|YP_001044549.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126105099|gb|ABN77777.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 651 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/393 (11%), Positives = 104/393 (26%), Gaps = 27/393 (6%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + P + + + A+ + E + + Sbjct: 102 LAEPPQTAPDAPEADARLTAAPEAGGGAETAGAPVPAEPRARSAEGAAPQTFAADEAMPM 161 Query: 124 PRKSAYQVVLS---SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSI 179 A + LS + P ++ AE + S + +I Sbjct: 162 AAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTFSIDVDTASYAI 221 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + + +N F + I +P + + Sbjct: 222 LRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSITRTPWNPETRLVHVA 281 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 L + P L+ +D+S K L++ + ++ ++ D V Sbjct: 282 LQGRMPAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVA------I 335 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + + + R+ + + + GSTA ++ + AY T Sbjct: 336 VTYAGSAGEVLAPT---AANQRSTILSALDRLDAGGSTAGDEGLALAYRTASEMAGAGEV 392 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKAR 416 +VL TDG+ + + + G+ + + F + Sbjct: 393 TR---------VVLATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQ 443 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 N +S +E K+ D++ + Sbjct: 444 ALAQN--GNGQAAYIDSLNEAQKVLVDQLSGAL 474 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 49/148 (33%), Gaps = 21/148 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + ++ + R+G + + + ++ Sbjct: 110 SRLEKSKQIVRQILGGLGS-------DRVGIIAYAGSAFPQLPITTDYAAA-KMFLQALN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI DA+ A N+ + + +++DGE+ + N + Sbjct: 162 TDMISSQGTAIGDAIDLATTYYDDDNQ-----------TNRVLFIISDGEDHEGNV--ES 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 I ++A +GIRI TI K + Sbjct: 209 IADEAAEKGIRIYTIGVGTEKGGPIPIK 236 >gi|332216199|ref|XP_003257232.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 3 [Nomascus leucogenys] gi|332216201|ref|XP_003257233.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 4 [Nomascus leucogenys] Length = 623 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 55/188 (29%), Gaps = 10/188 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++++ + + F RV S + + F Sbjct: 15 MRGQKVKQTKEALLKILGDMPPG---DYFDLVLFGTRVQSWKGSLVQASEANLRAAQDFV 71 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T +N + + + E + +++LTDG+ T+ + Sbjct: 72 RGFSLDEATNLNGGLLRGIEILNQVQES----LPELSNHAAILIMLTDGDPTEGVTDRSQ 127 Query: 389 ICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I +S + + F N + N +E + + F ++ Sbjct: 128 ILKNIRSAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDATQQLQGFYSQV 187 Query: 446 GNEIFERV 453 + V Sbjct: 188 AKPLLVDV 195 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 53/177 (29%), Gaps = 21/177 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDN----VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + +DA + I A D + + Sbjct: 108 TRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGNSEKM 167 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G T I ++++ A + ++ I+L++DG T Sbjct: 168 IAHMDGLQPRGFTPIGESLRKAAAELPKEG-------------QRSIILVSDGVATCTPP 214 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + K QGI I T+ F+V Q++ + A+ ++ A+ L K Sbjct: 215 PVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIAD--ATGGTYANASDADSLAK 269 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 60.8 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 53/190 (27%), Gaps = 32/190 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + F + + + Sbjct: 109 SRLSAAKTAAKDFLGDL------PPRFNVSLVKFAASAQVVVAPTTDRAAVSTA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + + STAI + + ++ + + +D H + IVLL+DG Sbjct: 158 TNLQVLPSTAIGEGIYSSLNALKLVPDDPKHP---GQKPPAAIVLLSDGATNVGRPSL-- 212 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQ-----------QEKARYFLSNCASPNSFFEANST 434 AK G + + TIA+ AS F A S Sbjct: 213 --EAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESL 270 Query: 435 HELNKIFRDR 444 +L+ +++ Sbjct: 271 GQLSDVYKSI 280 >gi|188990634|ref|YP_001902644.1| putative secreted protein [Xanthomonas campestris pv. campestris str. B100] gi|167732394|emb|CAP50588.1| putative secreted protein [Xanthomonas campestris pv. campestris] Length = 597 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 42/390 (10%), Positives = 107/390 (27%), Gaps = 31/390 (7%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ + ++E + + + N++ + T A + Sbjct: 41 ADSAANLPLPETSMVAPQVESSVRQRAQGNVQASKTASRPMLPAPPVLSTPAAPVPAAAL 100 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-------QWVID 184 L P+ + +++ + + + + Sbjct: 101 AEPMHMHERLMPMPPSPPTENRETYQTLSDNPIVQAAEQPVSTFSIDVDTGSYSNVRRFL 160 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + ++ + +N F + + + + + Sbjct: 161 NAGTLPPVDAVRVEELINYFRYDDPAPTDGTP-FAVRTELAPTPWNTDTLLLRIGVAGRD 219 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P P + VD S K L++ +L ++R ++K D R+ + Sbjct: 220 VPTAALPPANLVFLVDVSGSMGAPDKLPLLQSSLKLLVRQLRKQD------RITLVTYAG 273 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + G +T + G TA ++ AY + + R N Sbjct: 274 STAVVLPPTSG---AQQTRIVEAIDSLQSGGGTAGASGIELAYK----AAQQAYLRGGIN 326 Query: 365 LEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I+L TDG+ + + + + G+ + T+ F Sbjct: 327 R-----ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLAD- 380 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ +S E K+ +G+ + Sbjct: 381 -AGDGAYAYIDSALEARKVLTHELGSTLAT 409 >gi|198245970|ref|YP_002216383.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940486|gb|ACH77819.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624134|gb|EGE30479.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 593 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 41/298 (13%), Positives = 90/298 (30%), Gaps = 34/298 (11%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 ++A T S +++ ++ + + + LN F P K +++ Sbjct: 130 SRAPLATFSLDVDTGSYANVRRFLNQ-GQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKP 188 Query: 220 ----------GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 K + + + P + +D+S + Sbjct: 189 IAACIPMPFAVKYELAPSPWNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAE 248 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + L+R AL ++ ++ DN+ G T T +K Sbjct: 249 RLPLIRSALKLLVNDLRAQDNITIVTYAGGTHVALAS---------TAGNNTTAIKAAID 299 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + GST ++ AY+ + + I+L TDG+ + I Sbjct: 300 NLDTYGSTGGEAGLRLAYE--QAEKGFIKGGVNR-------ILLTTDGDFNLGITDPKDI 350 Query: 390 ---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 K + +GI + T+ + + ++ +S E K+ +D Sbjct: 351 EALVKKEREKGITLSTLGVGDDNFNEAMMVRI--ADVGNGNYSYIDSLSEAQKVLKDE 406 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 50/153 (32%), Gaps = 21/153 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F DR ++ V + I STAI+D + A ++ S Sbjct: 129 DRIGLVIFGDRAYVAQPPTFDVGSVAHAIEAAQI--GISGRSTAISDGLGLATRRLLQS- 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK----- 409 K +VLL+DG +T + A S GIR+ TIA Sbjct: 186 ----------DATSKVVVLLSDGVDTSGKVQAGDAARLAASHGIRVHTIALGPEDLENQP 235 Query: 410 ---TQQEKARYFLSNCASPNSFFEANSTHELNK 439 + A A + F + +L Sbjct: 236 ESRDAVDAAALHAMAEAGGGTSFRVRNMEDLET 268 >gi|118384937|ref|XP_001025607.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89307374|gb|EAS05362.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 670 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 27/231 (11%), Positives = 70/231 (30%), Gaps = 27/231 (11%) Query: 227 EKLSPYMVSCNKSL--YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 Y N ++ + + S + + +V+ ++ ++ + Sbjct: 22 VPPQNYSSRTNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNT 81 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ F+D ++ +T+ + + G T + + Sbjct: 82 LGSEDYLS------IVTFSDSANVLFDL-LPMNDSNKTMAIEKIENLSTEGGTELWKGLN 134 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ----GIRI 400 +A + ++++ + I LLTDG+ T + + K K I Sbjct: 135 SALNILLNN---------KTPNTNQSIFLLTDGQPTDSGIDTNLV--KFKQAYPKLNCTI 183 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 T FS + + + + F + F + + N + Sbjct: 184 NTFGFSSSSNSELMNKIAMEY---NGMFSFIPDASFIATAFANALANTLTV 231 >gi|323491534|ref|ZP_08096713.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] gi|323314110|gb|EGA67195.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] Length = 411 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 31/342 (9%), Positives = 84/342 (24%), Gaps = 23/342 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ + M ++GV + +DV + L+ + A + ++ E Sbjct: 12 RRQKGLVLVMVTVAMLALVGVAALAIDVSHAVLNKARLQNSVDAAA--LAAAIVMDSEGT 69 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ A + + + + T + D V ++ Sbjct: 70 NAEATTAANTTLTNLASATGNSEMDFSSSTVTVQYSNDPTIFPQTSGYDADLDTYVRVTV 129 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + F G+ L+ + + +++ + Y Sbjct: 130 SA---YSLDNFFAYLFGVDKQLVSSAVAGPSPGIDVVNVVPMAVCEGDSGTGGSTGYDAG 186 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS---PYMVSCNKSLYYMLYPGPLDPS 252 +D +Y + G + + Y S + PG Sbjct: 187 ELYALKIADQNQSDMGSGNYQLLDFGSGADTVRQALAGGYEGSVGVGDTVLTKPGSTVGP 246 Query: 253 LSEEHFVDSSSLRH-------VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF--- 302 + + + ++ + + + I + + + Sbjct: 247 VGQGLNTRFGNYSGGGLSATDYPSDIYVREPNQKATLDNNGDIKYNDASAGGQPFGYEEY 306 Query: 303 -----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + D G R +V G T+ Sbjct: 307 EAELPDCSGDPDCRMDSGGQHDRRILVVPIVDCSGASGGTST 348 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 53/171 (30%), Gaps = 20/171 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + I +D T R+ + V + K + + T Sbjct: 56 LSEMIDTLDVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIEPLSA--GTMTG 113 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+QTA + + + N + ++++TDG + ++ + A++ GI I Sbjct: 114 LAIQTAMEEVFTEEMGTRPAAFNIP---RVVIVVTDG---RPQDQVQDVAASARTAGIEI 167 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 T+ L AS F + +L FR+ Sbjct: 168 YTVGVGR------ADMQALRIMASEPLDEHVFYVETYGVIEKLTSRFRETF 212 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/114 (20%), Positives = 38/114 (33%), Gaps = 14/114 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + + + D TAI A+ + + Sbjct: 133 VVVFAGNAYMQLPITTDLAMAKKMVD-DANPDMLSNQGTAIASAIDLSLGSFSD------ 185 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K I+L TDGEN + + KAKSQG+++ TIA + Sbjct: 186 -----RHDVGKAIILFTDGENHEGDAL--EAAKKAKSQGVKVYTIAVGSEEGAP 232 >gi|226366409|ref|YP_002784192.1| hypothetical protein ROP_70000 [Rhodococcus opacus B4] gi|226244899|dbj|BAH55247.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 328 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 68/203 (33%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA S + + +G F S + K Sbjct: 111 TRLAAAQDAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNREAT-----KVAI 159 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ T+ +I + ++ IVLL+DG EN D Sbjct: 160 DNLQLSERTATGEAIFTSLQSIDT---LAAVLGGSDQAPPARIVLLSDGKQTVPENPDDP 216 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 G +AK + + I TI+F + + E L A S SFF A+ Sbjct: 217 RGGFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTAS 276 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL ++ D + +I R Sbjct: 277 SLEELRDVY-DTLEEQIGFETTR 298 >gi|16765642|ref|NP_461257.1| hypothetical protein STM2315 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167992650|ref|ZP_02573747.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262795|ref|ZP_03162869.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|16420855|gb|AAL21216.1| putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197241050|gb|EDY23670.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205329241|gb|EDZ16005.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261247522|emb|CBG25349.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267994407|gb|ACY89292.1| hypothetical protein STM14_2853 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301158873|emb|CBW18386.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312913305|dbj|BAJ37279.1| hypothetical protein STMDT12_C23360 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321222984|gb|EFX48055.1| hypothetical protein SEE_04302 [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323130645|gb|ADX18075.1| Putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332989248|gb|AEF08231.1| hypothetical protein STMUK_2345 [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 593 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 37/318 (11%), Positives = 92/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 170 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 171 NYFPAPQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQTRD 228 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L+R AL ++ ++ DN+ G T Sbjct: 229 LPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 285 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + + Sbjct: 286 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGVNR------ 331 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + I K + +GI + T+ + + Sbjct: 332 -ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 388 Query: 427 SFFEANSTHELNKIFRDR 444 ++ +S E K+ +D Sbjct: 389 NYSYIDSLSEAQKVLKDE 406 >gi|291297006|ref|YP_003508404.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471965|gb|ADD29384.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 313 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 68/213 (31%), Gaps = 33/213 (15%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + + +RS+ D +++G F + + Sbjct: 92 DISLSMRAQDIQPTRFEAAKQEAKNFVRSL------PDGIKVGLVSFAGYATLEAEPTTD 145 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ TAI D + +++ + +DE + +VLL+ Sbjct: 146 HQRV-----IDQIELLQMARRTAIGDGLL---ESLRAIPKDENGKPLGPST----VVLLS 193 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR-----------YFLSNC-- 422 DG + + + A+ G+ + TI + + L Sbjct: 194 DGRTNSG-VDPMEVAPFARDMGVVVHTIGLGRRSNPGDPDQYWGGYWMQFDEETLRAIAE 252 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A+ ++ A S L + +R+ G + + R Sbjct: 253 ATGGQYYAAGSAEALRQAYRNL-GRMVGWKPQR 284 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 38/251 (15%), Positives = 70/251 (27%), Gaps = 47/251 (18%) Query: 214 SYSSQNGKVGIRDEKLSPY---------MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 ++ G P C + + G + +DSSS Sbjct: 67 WNGNRFDWWGTNGSSYFPQEGGGFGEDGNYGCYYNYTHYNNTGICQGVMDVVFLIDSSSS 126 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 R + V + + + F+ V ++ + + Sbjct: 127 ITD----ENYRKEIDFVKSILDYYYLHPNYTLVSILEFSTDVRVLQELTYDACDVRKA-- 180 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-- 382 + G T I A++ A+ + +S D + IVL+TDG T Sbjct: 181 ---IDSDRMSGLTNIAKAIEEAHRILKNSRSDIPDQ----------IVLITDGFQTVHSS 227 Query: 383 ----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFE 430 N I AK+ I+I TI ++ L AS + +F Sbjct: 228 INCNDHPHDCNAYAIEKARAAKADDIQIYTIGVGAASYYEDD----LRQIASSPSDQYFS 283 Query: 431 -ANSTHELNKI 440 + + + Sbjct: 284 LVDDYSSIQTV 294 >gi|220913390|ref|YP_002488699.1| hypothetical protein Achl_2645 [Arthrobacter chlorophenolicus A6] gi|219860268|gb|ACL40610.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 319 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 46/161 (28%), Gaps = 18/161 (11%) Query: 12 KKLIKS--CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++L G +I A+L+ +LG + VDV L+ A + I + Sbjct: 2 RRLGADNTEKGAVSVIVAILLVTLLGFVAIAVDVGAIYSERAQLQSGADASAIALAQKCA 61 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + N + V ++ R + Sbjct: 62 RDTANADCSTT----------------STLAGSLANQNSLDGMSNVYSIQLDKTARTVSV 105 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + F +++GI S + KA A + S Sbjct: 106 TTSAKETGSPDNSVSLFFAKAIGIPSKEVGAKASATWGNPS 146 >gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus] Length = 1029 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 67/224 (29%), Gaps = 13/224 (5%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V ++Y G + +D S K + A+ ++ ++ D+ Sbjct: 227 VYGGYFIHYFAPRGLQPVEKNVVFVIDVSGSM-FGTKLQQTKKAMDKILSDLQTSDS--- 282 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIV-KTFAIDENEMGSTAINDAMQTAYDTIIS 352 F+D V + + K + G T IN A+ A + Sbjct: 283 ---FNIITFSDTVNIWKAEGSIQATVQNIHNAKNYVSRMEANGWTDINAALLAAASVLNH 339 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNK 409 SN++ I+ LTDGE T I + + + + + ++AF + Sbjct: 340 SNQEPGKGRGVGQIP--LIMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAFGDDA 397 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L N +E + I + V Sbjct: 398 DFSLLRRLSLENQGEARRIYEDADAALQLEGLYAEISRPLLADV 441 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 60.8 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 52/148 (35%), Gaps = 24/148 (16%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + K+ D+ A F V + + + + G T I Sbjct: 56 VDAAKEFDDRTKDSYFSAVGFASGVKLIQAPTQSLSTFNTAVNTVS----PLNGGTNIFR 111 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 ++ Y + K + ++L+TDG Q I CN KS+GI ++ Sbjct: 112 GLRGCYQQL-----------KTKPMTDRVLILVTDGFGGQ----PINYCNFIKSKGILLV 156 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFF 429 T+ + Q FL NCA+ F+ Sbjct: 157 TVGIGTSINQN-----FLKNCATSEEFY 179 >gi|326505554|dbj|BAJ95448.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326515132|dbj|BAK03479.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 63/198 (31%), Gaps = 35/198 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI+++ D R+ F+ ++ R Sbjct: 268 KLALLKKAMCFVIQTLGPND------RLSVIAFSSTARRLFPLRQ-MNVNGRMQAIQAVN 320 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEE 385 + G T I+D ++ I + I+LL+DG++T ++E Sbjct: 321 SLVDGGGTNISDGLKKGAKVI---------EHRRLKNPVCSIILLSDGQDTYSVPTFDDE 371 Query: 386 GIAICNKA--------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 + ++I T F ++ + A S +F ++ Sbjct: 372 LQTNHSALVPPSILPGTGNHVQIHTFGFGMDHDSAA-----MHAIAETSSGTFSFIDAEG 426 Query: 436 ELNKIFRDRIGNEIFERV 453 + F IG + V Sbjct: 427 SIQNGFAQCIGGLLSVVV 444 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 69/200 (34%), Gaps = 31/200 (15%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + K + ++A S++ D R+G F+ + F L ++ Sbjct: 415 VSKFDMAKEAAQLATESLQPED------RIGLLAFDTETLWVVPFQLISGGLSVAQIQEQ 468 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 G T I A++ + ++ VLLTDG + ++ Sbjct: 469 IASLPSGGGTRIERALEVGL-----------PALAEQPTKVRHAVLLTDGRSFMNDNALY 517 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL--NKIFR 442 + A+SQ I + TIA ++ L A ++ A+ ++ + Sbjct: 518 QRLVETARSQQITLSTIAIGLDSDTA-----LLKQLAAWGGGRYYYADQPADIPRLTLLE 572 Query: 443 DRIG--NEIFERVIR--ITK 458 +I + E+ +R +T+ Sbjct: 573 SKIAGSDPAVEQALRAELTQ 592 >gi|161612960|ref|YP_001586925.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362324|gb|ABX66092.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 593 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/318 (11%), Positives = 92/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 170 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 171 NYFPAPQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQTRD 228 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L+R AL ++ ++ DN+ G T Sbjct: 229 LPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 285 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + + Sbjct: 286 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGVNR------ 331 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + I K + +GI + T+ + + Sbjct: 332 -ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 388 Query: 427 SFFEANSTHELNKIFRDR 444 ++ +S E K+ +D Sbjct: 389 NYSYIDSLSEAQKVLKDE 406 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 54/210 (25%), Gaps = 52/210 (24%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K LV+ A+ VI S+ D R+ F+ + +++ Sbjct: 179 RWDKLALVKQAMGFVIGSLGPHD------RLSVVSFSSGARRVTRL-LRMSHTGKSLATE 231 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I + ++TA + ++LL+DG + Sbjct: 232 AVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSS---------VILLSDGHDNYSMPR- 281 Query: 387 IAICNKAKSQG-------------------------IRIMTIAFSVNKTQQEKARYFLSN 421 +++G I T F + Sbjct: 282 -------RARGGVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGNDHDAAAMHVVAE-- 332 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A+ +F + + F IG + Sbjct: 333 -ATGGTFSFIENEAVIQDAFAQCIGGLLTV 361 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 41/124 (33%), Gaps = 20/124 (16%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R+ + G T I A+ AY + + KK+++LLTDG+ Sbjct: 531 NRSRIAGEIARIQPGGGTEIFSALDAAYQDMTVTQAR-----------KKHVILLTDGKA 579 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + + + ++ I + T+ + L A F + L Sbjct: 580 STGGIR--DLVSAMIAESITVTTVGLG-----NDLDEQLLKMIADVGGGRFHAVPDPNNL 632 Query: 438 NKIF 441 +IF Sbjct: 633 PRIF 636 >gi|145500364|ref|XP_001436165.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124403304|emb|CAK68768.1| unnamed protein product [Paramecium tetraurelia] Length = 604 Score = 60.4 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 64/229 (27%), Gaps = 24/229 (10%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 P + + + + + V S +K +V+ L ++ + Sbjct: 159 PQVPQQIEGDAQNQFNMDQQQHSKVGVDLLCVIDRSGSMSGEKIEMVKQTLNILLNFLGP 218 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D R+ F+D + + +T G T I Q A Sbjct: 219 KD------RLCLIQFDDTCQRLTNLRRVTDE-NKTYYSDIISKIYANGGTVIGLGTQMAL 271 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 I I +L+DG++ K Q + I + F Sbjct: 272 KQIKYRKSVNNVTA---------IFVLSDGQDEAAISSLQKQLAYYK-QTLTIHSFGFGS 321 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + ++ + SF+ N+ L++ F D +G V Sbjct: 322 DHD-----AKLMTKISNLGKGSFYFVNNISLLDEFFVDALGALTSMVVT 365 >gi|326489689|dbj|BAK01825.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 691 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 54/203 (26%), Gaps = 37/203 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI ++ D R+ F+ + + + + Sbjct: 210 KLALLKQAMRFVIDNLGPDD------RLSVVSFSSEARRLTRLA-RMSDAGKALSVNAVE 262 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN---------- 379 G T I + ++TA + +VLL+DG++ Sbjct: 263 SLVARGGTNIAEGLRTAAKVLDERQHRNAVSS---------VVLLSDGQDTYTMMRRRGP 313 Query: 380 ------TQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 + + I T F + A+ +F Sbjct: 314 SGVHAGNYEELVPPSFARTGADGDWSAPIHTFGFGNDHDAAAMHVIAE---ATGGTFSFI 370 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + + F IG + V Sbjct: 371 ENEAVIQDAFAQCIGGLLSVVVQ 393 >gi|52082207|ref|YP_080998.1| YwmC protein [Bacillus licheniformis ATCC 14580] gi|52787598|ref|YP_093427.1| YwmC [Bacillus licheniformis ATCC 14580] gi|319648080|ref|ZP_08002297.1| YwmC protein [Bacillus sp. BT1B_CT2] gi|52005418|gb|AAU25360.1| YwmC [Bacillus licheniformis ATCC 14580] gi|52350100|gb|AAU42734.1| YwmC [Bacillus licheniformis ATCC 14580] gi|317389715|gb|EFV70525.1| YwmC protein [Bacillus sp. BT1B_CT2] Length = 228 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 61/201 (30%), Gaps = 16/201 (7%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR- 305 + ++ + + + K L ++ S ++ + + N Sbjct: 35 DAPNVAIMLDASGSMAKKIGGVSKYELAKNEAFSFGSKLENANVLMRVFGSEGNNKNSGK 94 Query: 306 --VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + +G + G T I +A+Q A + + + Sbjct: 95 VQSCNAIRGVYGFQTYDEQSFRNSLNGIGPTGWTPIANALQDAKNAL----------DQL 144 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + K + LLTDGE T + + + + + I F + A Sbjct: 145 DNNGKNVVYLLTDGEETCG-GNPVKVATELRKSNAVVNVIGFDYEGDFHGQLTSI--AAA 201 Query: 424 SPNSFFEANSTHELNKIFRDR 444 +F+A + +++ +IF Sbjct: 202 GGGEYFQAKTKNDIKRIFTQE 222 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +++ + + + ++ L + I R G F D F+ + Sbjct: 114 TTADGSTLTRLNAAKNVLKTFIA-------KRSGDRFGLILFGDAAFIQTPFTADQQVWL 166 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + A ST + DA+ + + ++ +++LTDG +T Sbjct: 167 S--LLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE---------QQVMIVLTDGNDT 215 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCA--SPNSFFEANSTHE 436 E + A ++GI+I TIA +Q + + + F A E Sbjct: 216 GSFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAIDQAE 275 Query: 437 LNKIFR 442 L+K ++ Sbjct: 276 LDKAYQ 281 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 53/161 (32%), Gaps = 17/161 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++ T R+G ++ +V + S +K I T Sbjct: 55 MIDIINSLEVGLSTTRVGVVQYSSQVQTVFSLKTFSNKSDMEKAINEIIPL--AQGTMTG 112 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + + E KN + +++TDG + + + +A+ GI I Sbjct: 113 LAIQYAMNVAFTEEEGARPLSKNIP---RVAIIVTDG---RPQDRVTEVAVQAREAGIEI 166 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELN 438 + Q L AS + F S + Sbjct: 167 YAVGV------QRADVSSLRAMASHPLDDHVFHVESFDLIQ 201 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 56/212 (26%), Gaps = 21/212 (9%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 Y C + + + + + VI Sbjct: 433 CVSTPGSYYCICPEGQELQADGKTCNKCSTGYIDLVFVIDGSKSVRPQNFELVKEFVINI 492 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + +G ++ RV ++ S + I + T A++ Sbjct: 493 VDSSAISAQGTHIGLVQYSSRVRTEFPLSQYTNGQD--IKTAVKNIQYMEKGTMTGLALK 550 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + S E K ++ TDG +QD+ KAK GI + + Sbjct: 551 HMVEQSFSEAEGAR------KNVPKIGLVFTDGR-SQDDI--SEWAKKAKEAGITMYAVG 601 Query: 405 FSVNKTQQEKARYFLSNCASPN----SFFEAN 432 + L+ AS SF+ A+ Sbjct: 602 VG------KAVEDELNEIASDPVNKHSFYTAD 627 >gi|308094406|ref|ZP_05889083.2| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308095541|ref|ZP_05906689.2| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308125336|ref|ZP_05774598.2| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|308126258|ref|ZP_05908794.2| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308089078|gb|EFO38773.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091462|gb|EFO41157.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109129|gb|EFO46669.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308114017|gb|EFO51557.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 400 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 22/261 (8%), Positives = 68/261 (26%), Gaps = 2/261 (0%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS-VPLIQSLEEVSSRAKNSFTFP 86 +++ ++LGV +D+ + L+ A TA + + V + + A + Sbjct: 1 MVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIATLSS 60 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR-YDLLLNPLS 145 +F + +++ V + + V ++ + + Sbjct: 61 IASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTDMGISQYLSA 120 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 +F + + + ++ A + + ++ Sbjct: 121 VFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRPPGYDPNVDMDPSL 180 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + ++ G + + +L G + + + V + Sbjct: 181 VHELKVGDQNNTDMGPGNFQLLDFGQATGNSGAALVRDALSGAYNGCAAVGNTVTTKPGN 240 Query: 266 HVIKKKHLVRDALASVIRSIK 286 + L IK Sbjct: 241 SAGPVAQGLNVRLNDFSGPIK 261 >gi|124002443|ref|ZP_01687296.1| OmpA family protein [Microscilla marina ATCC 23134] gi|123992272|gb|EAY31640.1| OmpA family protein [Microscilla marina ATCC 23134] Length = 756 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 19/118 (16%), Positives = 44/118 (37%), Gaps = 22/118 (18%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-------EG 386 GSTA+ +++ ++ + K ++L TDGE + Sbjct: 473 GGSTALYAGADEGLESLKNAQNN------------KVMLLFTDGEENSSLQYFGKRAFRA 520 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + KA+ +GIR+ TIA+ + + + + + E+ +++ + Sbjct: 521 SEVVKKAREKGIRVFTIAYGTGVNNKTLNALSML---TDGKTYFIENPDEIKQVYEEL 575 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 33/357 (9%), Positives = 85/357 (23%), Gaps = 27/357 (7%) Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 N + ++ V + + S +L + + G Sbjct: 1236 EAVVNDSLVGKDNVQFGAVVYGTNPAEQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKAL 1295 Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + SY G+S ++ + + R + + Sbjct: 1296 EYTRIRFGTSYGGRPGISHILILVTDGATTEADRPNLPIVSKALKDDGIIVFAVGVGKAV 1355 Query: 221 KVGIRDEKLSPY------MVSCNKSLYYMLYPGPLD----PSLSEEHFVDSSSLRHVIKK 270 ++ P +++ + D E+ + Sbjct: 1356 PQELQQIAGYPDRWFLVQNYKGLDNIHDNITQVVCDESKPACSHEQLDLVFLIDGSASIT 1415 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 A + + + VR+G ++ + + Sbjct: 1416 SSNFTSAKTFMKEIVDSFTISENRVRIGVAQYSANPKKEFFLNEYYSSSDMKKQIDSISQ 1475 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T + + +Y++++TDG + E Sbjct: 1476 LKATTYTG---------KGLRFVKQFFDPANGGRKNVPQYLIVMTDGMSNDSVNED---A 1523 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +S G++I +I + + + SP + +E + L+ I R + Sbjct: 1524 AALRSSGVKIFSIGIGLRNSFELVMIA-----GSPKNVYEVETFQALDSIKRQIVAQ 1575 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 63/171 (36%), Gaps = 21/171 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAI 339 +I I + VR G ++D ++ + + ++K G T Sbjct: 862 MIELISMFQVGANRVRFGVVQYSDVRRTEF---FISEHNTQKMLKDAISQIEQLGGGTLT 918 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 +A+ + +++ +D H++ + +V++TDGE + + ++ GI Sbjct: 919 GEALTSMKQLFVNAAKDRPHKVPQS------LVVITDGE---SQDRVTEAAAEIRNDGIT 969 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEI 449 I I + + + A F N+ L K+ ++ + E+ Sbjct: 970 IFAIGV------KNAVEEEIRDIAGSNEKMFFVNNFDSL-KVIKNDLAREL 1013 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 62/189 (32%), Gaps = 24/189 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + D D +++G ++D ++ + ++K + + G T Sbjct: 54 NFLYTLVNGFDVGLDKIQIGLIQYSDNARTEFFL--NSYSNKEDVLKYIQNLKYKGGGTK 111 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +++ S + V++TDG+ QD+ AI K+ GI Sbjct: 112 TGLSLEFMLTQHFSEAAGSRAAEG----VPQIAVVITDGQA-QDSIREPAI--AVKNAGI 164 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIG------NEI 449 + I ++ L+ AS + + L I ++ I E Sbjct: 165 ILYAIGI------KDAVLSELNEIASDPDDKHVYSVADFNALQSISQNMIQVLCTTVEEA 218 Query: 450 FERVIRITK 458 + +I + Sbjct: 219 ARQTGQIAQ 227 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 50/161 (31%), Gaps = 19/161 (11%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +K+ + D V++G F+ + +K I + T + +A Sbjct: 1050 SMVKQAEIGPDRVQIGLIQFSSETKEEFPL--NRYKRKDEIQSAIRGIQQLSQGTLMGEA 1107 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ +S + K+Y++++TDGE + G+ I Sbjct: 1108 LKYTLPYFSASK-------GGRVNTKQYLIVITDGEAQDAVGNP---AKAIRDHGVIIYA 1157 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I QQ L + +S L F + Sbjct: 1158 IGV-----QQANNTQLLEIAGKQEQVYYEDSFDSL--AFLN 1191 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/346 (8%), Positives = 94/346 (27%), Gaps = 23/346 (6%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 N N + + ++ + + L + + + G+ Sbjct: 77 YSDNARTEFFLNSYSNKEDVLKYIQNLKYKGGGTKTGLSLEFMLTQHFSEAAGSRAAEGV 136 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + S + ++ +D+ LN D Sbjct: 137 PQIAVVITDGQAQDSIREPAIAVKNAGIILYAIG-----IKDAVLSELNEIASDPDDKHV 191 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + + + + + + + + + + Sbjct: 192 YSVADFNALQSISQNMIQVLCTTVEEAARQTGQIAQVCRTANQADIVLLVESTTRMGDAT 251 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A + + +D + +R+G +ND + + I+++ + Sbjct: 252 FEKAKNFLYDLVSNLDVGINKIRIGLVTYNDETNPEFLL--NSYSSKTEILESIQNMKYV 309 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ T + + ++++T+G+++ E + Sbjct: 310 EGYTYTGRALEYVNTTYFTQAAGSRFE----ESVAQILIIVTEGDSSDTLTEP---AKEL 362 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 KS+GI + + ++ +Q L +S F++ + + Sbjct: 363 KSRGISVYVVGTNIKYDRQ------LQEASSKPDEKFFYQLDDFDD 402 >gi|147898495|ref|NP_001088330.1| inter-alpha (globulin) inhibitor H2 [Xenopus laevis] gi|71051796|gb|AAH98981.1| LOC495168 protein [Xenopus laevis] Length = 935 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/247 (12%), Positives = 75/247 (30%), Gaps = 10/247 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S ++ NG + ++ + Y L+ D + + Sbjct: 258 PTCSTTAVNGNLLVKYDVKREESSHLQVFNGYFLHFFAPDNLPTLPKNILFVIDVSGSMW 317 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ + ++ ++ + N + G FN + + + + Sbjct: 318 GLKMKQTVDAMKSILEDL---NSDDQFGIIDFNHNIRCWKDELVYASSVEKGDASKYVQR 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T INDA+ A + ++ + + IVL++DG+ T + I Sbjct: 375 IQPNGGTNINDALLRAIFILKEASN----KGLLEQNSVSLIVLVSDGDPTVGELKLPKIQ 430 Query: 391 NKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + I + ++ + R N + T K F +++ Sbjct: 431 KNVRTNIQDDIALHSLGIGFDVDYDFLERLAQENHGMAQRIYGNQDTAAQLKEFYNKVST 490 Query: 448 EIFERVI 454 + + ++ Sbjct: 491 PLLKNIV 497 >gi|114799760|ref|YP_759488.1| hypothetical protein HNE_0760 [Hyphomonas neptunium ATCC 15444] gi|114739934|gb|ABI78059.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 576 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 49/173 (28%), Gaps = 9/173 (5%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + + G+ I+ AL+ P+ + M +D+ + ++ A ITA+ Sbjct: 6 RLSLAALRRAREQGGNVAILFALIAPIATLMMAMAIDLGMVNLQRRNMQSMTDLAAITAA 65 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN---------NLKKNFTDREVRDIVR 116 L ++ V + + + E + + K + Sbjct: 66 GDLHKAETRVLTLLSENGFGDVLLLAEEDRPKRLDPRYPSRAWVEVVKGHYTADAMLPPA 125 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 ++ +V + + + T A +V Sbjct: 126 LRFEAGGEPFNSVRVTVVQSGRYFIMDGIQAPPLITTSGVAYSTAEAAFSVGS 178 >gi|108760959|ref|YP_630756.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464839|gb|ABF90024.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 476 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 68/215 (31%), Gaps = 29/215 (13%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 PG ++ +D S K + A +I + D R+ + Sbjct: 88 PGAQRSPVNLALVIDRSGSMSGY-KLAQAKQAARHLIGLLNDQD------RLAIIHYGSD 140 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V S PS R + + + G T I + + ++ Sbjct: 141 VKSLPSLE--ATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQR---------T 189 Query: 366 EAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++L++DG+ T+ +EE + + ++ G+ + I + + Sbjct: 190 YGVNRLILMSDGQPTEGLTADEELTRMARELRATGLTLSAIGVGTDFN-----EDLMQAF 244 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A ++ +L+ +F+ + + V R Sbjct: 245 AEYGAGAYGFLEDAAQLSTLFQKDL-QQAGTTVAR 278 >gi|109071570|ref|XP_001110086.1| PREDICTED: collagen alpha-1(XXI) chain-like isoform 2 [Macaca mulatta] Length = 959 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 56/172 (32%), Gaps = 23/172 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEI 449 I E L A S F ++K R+ I ++ Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISK-IREVIKQKL 211 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR ++ Sbjct: 37 YDLVFILDTSSSVG-KEDFEKVRQWVANLVDTFEI---GPDKTRVGVVRYSDRPTTEFDL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G ++ I + + G+T DA++ S + KK + Sbjct: 93 --GRYQTREQIKEAAKNIKYYGGNTNTGDALRYINTYSFSEEAGGRP---TDSAIKKVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A GIRI + E + L AS F Sbjct: 148 LLTDGRSQDHVLDP---ATAAHKAGIRIFAVGVG------EALKEELDEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDYNAIDKI 209 >gi|260426558|ref|ZP_05780537.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] gi|260421050|gb|EEX14301.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] Length = 1088 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 59/417 (14%), Positives = 104/417 (24%), Gaps = 40/417 (9%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 L + + R E + A A I S +I + S + + + E Sbjct: 277 LPAAVISLFGTRTDMAEWQITAAVDGAAIGGSQGVISVV--ESWTLQFNADGSLASVSEA 334 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN-PLSLFLRSMG 152 L D V +S +D + + Sbjct: 335 SPSMNVTGLTTGAGDLTVALDFGTIGGTDGLTTGVGITSVSDTHDGTSQCNAADCAQRWN 394 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + +T + + V P P Sbjct: 395 AGTGRFETTQQTLVNGGLSGWDTLVRNYPF--------------TTAPALPAEAPPATCF 440 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 ++ S + VG + L +++ + Sbjct: 441 RAVSFVDDVVGSDQIMMIVDRSGSMSWSSNSGQAEVCLNGLDDDNDGTVDEGDCADSRIE 500 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 VR A + + + + +G FN+ K + Sbjct: 501 FVRAAGRAFVDL-----QTSQGIDLGLLEFNEGNTLLRPIDTLNAG-NAQDYKDAIDALS 554 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G TAI DA + E + LLTDG NT + +A + Sbjct: 555 PGGDTAIGDAFDASTGEFTRVAEVGR---------VRTAYLLTDGFNTAG-GDPVAAAER 604 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGN 447 G+RI I + R L++ AS +EA + + L IF + G+ Sbjct: 605 LDDIGVRIHAI-----PAGNDVDREELTDIASGTGGQVYEARNVNALTGIFAELAGS 656 >gi|262091909|gb|ACY25458.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 621 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 42/382 (10%), Positives = 105/382 (27%), Gaps = 25/382 (6%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ + + ++ Q+ E E + + + + + + E Sbjct: 75 AQEQAAAVPQGAREEDSAEQEVAQQQEAEESAQQEAAMAEEQEEEQAEAVAEEEEQEEAA 134 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + S ++P + +W I ++ + T S + S+ Sbjct: 135 EEEEEAEEEPSAAAPSVSPSETNFKDYEESAWAITSEDDTATFSLDADRASW-SVAVNYA 193 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPAD-RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 S ++ + +N + + + + + + Sbjct: 194 DSGYFIEPASVRAEEWINAHDYEYELPDGPYEFGVTVDLIPHPLQREDFHLL---RIGLQ 250 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 ++ D+S + + R AL ++ S+ V D R+G F+ Sbjct: 251 APVIERTHPVNVIFVADASGSMAEGNRIDIARAALQALWASL-----VPDLDRVGMIQFS 305 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 I + ++ T + + + + N Sbjct: 306 VDPIPASFV--PHTRPDSEFLQASIDRLLPYYGTNVQAGIDLGVQ-LANDARQAWPDSDN 362 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFL 419 Y+VL++DG D + AI A IR++TI + Sbjct: 363 ------YVVLISDGVANVDATDPFAILRSAGEDDESNPIRLITIGVGIGHYNDVLLEQLA 416 Query: 420 SNCASPNSFFEANSTHELNKIF 441 ++ +S + + F Sbjct: 417 QY--GNGWYYYIDSPEQAWETF 436 >gi|168465984|ref|ZP_02699854.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195631159|gb|EDX49719.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 593 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/318 (11%), Positives = 92/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 170 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 171 NYFPAPQPVADEEDNTKPIAARIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQTRD 228 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L+R AL ++ ++ DN+ G T Sbjct: 229 LPPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 285 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + + Sbjct: 286 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGVNR------ 331 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + I K + +GI + T+ + + Sbjct: 332 -ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 388 Query: 427 SFFEANSTHELNKIFRDR 444 ++ +S E K+ +D Sbjct: 389 NYSYIDSLSEAQKVLKDE 406 >gi|54038464|gb|AAH84380.1| LOC495168 protein [Xenopus laevis] Length = 554 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/247 (12%), Positives = 75/247 (30%), Gaps = 10/247 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S ++ NG + ++ + Y L+ D + + Sbjct: 258 PTCSTTAVNGNLLVKYDVKREESSHLQVFNGYFLHFFAPDNLPTLPKNILFVIDVSGSMW 317 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ + ++ ++ + N + G FN + + + + Sbjct: 318 GLKMKQTVDAMKSILEDL---NSDDQFGIIDFNHNIRCWKDELVYASSVEKGDASKYVQR 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T INDA+ A + ++ + + IVL++DG+ T + I Sbjct: 375 IQPNGGTNINDALLRAIFILKEASN----KGLLEQNSVSLIVLVSDGDPTVGELKLPKIQ 430 Query: 391 NKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + I + ++ + R N + T K F +++ Sbjct: 431 KNVRTNIQDDIALHSLGIGFDVDYDFLERLAQENHGMAQRIYGNQDTAAQLKEFYNKVST 490 Query: 448 EIFERVI 454 + + ++ Sbjct: 491 PLLKNIV 497 >gi|193213113|ref|YP_001999066.1| hypothetical protein Cpar_1468 [Chlorobaculum parvum NCIB 8327] gi|193086590|gb|ACF11866.1| conserved hypothetical protein [Chlorobaculum parvum NCIB 8327] Length = 352 Score = 60.4 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 41/155 (26%), Gaps = 1/155 (0%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G I+ A+ + V+LG + +D+ R + + L+ AA A + + L+ S Sbjct: 13 QRGAVTIMFAIFLVVLLGFAALALDLGRMNLTKVQLQSAADAAALGGAGSLVNSSLSTYD 72 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + D R P + + Sbjct: 73 WDAAEQKGLVLAQHNIVNGEQIQQATIEAGYWNSSDGFRHHGTSGVPVTGDVPA-VRATV 131 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 L + L+ I + + + Sbjct: 132 ALTSTQNNGPLKLFFAPFLGINESNIPASAIAAIY 166 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 81/278 (29%), Gaps = 29/278 (10%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +S+ + N + + S + + + ++ Sbjct: 236 SIMFMQSLHSVTEFCTEKTHNVEAPNLQNKMCNGKSTWDVIMNSTDFQNTSPMTEMNPPT 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I + +G Sbjct: 296 QPTFSLLKSKQRVVCLVLDKSGSMSSEDRLFRMNQAAELFLIQIIE-----KGSLVGMVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + T + + G T+I ++ + II S + Sbjct: 351 FDSVAEIRNNLTKITDD-NVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ + + C + K G+ I TIA K LS Sbjct: 410 ---------IILLTDGEDNEIHS-----CIEEVKQSGVIIHTIALG---PSAAKELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + + F+ + L F + I R IT+ Sbjct: 453 DMTGGHRFYANKDINGLTNAF-----SRISSRSGNITQ 485 >gi|110633696|ref|YP_673904.1| hypothetical protein Meso_1343 [Mesorhizobium sp. BNC1] gi|110284680|gb|ABG62739.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 571 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 46/174 (26%), Gaps = 5/174 (2%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + + ++ +G+ ++ LL PVML VD + A + Sbjct: 1 MMARIAGRAVQALRDRSGNVAVMAGLLFPVMLLGAVFGVDQGSLYLERREAQALTDLAAV 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFP-----KQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 TA + ++ + ++ + + + + Sbjct: 61 TAVANISRANAAAALTMADNRQGNVQLIDRATLSTPMAAGTGAQMLVEPGRYSADPSTES 120 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + A V ++ R L + F I + I S Sbjct: 121 PWRFTPGAEPANAVRVTFRKKGKLYFGAGFFEPPVITTTGIAAARAEAAFSIGS 174 >gi|170076505|ref|YP_001733144.1| hypothetical protein SYNPCC7002_G0035 [Synechococcus sp. PCC 7002] gi|169887367|gb|ACB01075.1| conserved hypothetical protein (von Willebrand factor type A domain) [Synechococcus sp. PCC 7002] Length = 420 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 56/192 (29%), Gaps = 29/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A+A I + D R+ T F+ +V + + + + Sbjct: 59 NKLDYAKQAIAYAIEQLLPSD------RLSLTLFDTQVETKIPSTL---ATDKQRLLETI 109 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 STA++D N + ++LL+DG Sbjct: 110 KLIRSGSSTALHDGWVQG---------GIQVGQYLNNDHLNRVILLSDGLANVGETNPDV 160 Query: 389 ICNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 I + GI + ++ L A +FF S +L +IF Sbjct: 161 IASDVHGLMKTGISTSALGVG-----RDYDEDLLEAIARSGDGNFFHIASPEDLPQIFET 215 Query: 444 RIGNEIFERVIR 455 + + + R Sbjct: 216 EL-QGLATTIGR 226 >gi|24375866|ref|NP_719909.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24350833|gb|AAN57353.1|AE015872_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 451 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 62/224 (27%), Gaps = 28/224 (12%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 SL ++ +D S + R+A I +K D V+ Sbjct: 51 NNYLKISLTGFKQDETEKSPINLSLVIDRSGSMSG-DRIEKAREAAIMAINMLKDDDIVS 109 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 ++D + K ++K GSTA+ + + Sbjct: 110 ------VIAYSDNAYLIIPAT--KVKNKNEMIKIINDTIKPGGSTALFAGVSKGITEVNK 161 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK---AKSQGIRIMTIAFSVNK 409 + I+LL+DG+ + + A QGI + TI Sbjct: 162 FIKKNQVNR---------IILLSDGQANIGPSTTKELADLGQVAGKQGIAVTTIGLGNGY 212 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ A S + ++ +L F + + Sbjct: 213 N-----EDLMTALAGFSDGNHAYVENSADLETAFVREFNDVMSV 251 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 28/230 (12%), Positives = 78/230 (33%), Gaps = 13/230 (5%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 ++ G V + + N+ +Y + + + V++ + Sbjct: 565 DTTVVGNVASTEVTHFSKYILLNRKVYQNSFEWQDVWNTTGFSGVEAILVIDDSGSMVSN 624 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + IDN+ + ++G F + + ++ + T Sbjct: 625 DRYNQRLTVAQNLIDNLPENSKVGVVKFTSSTTKLTTSLTSDKETAKSYLTT--SYFRSS 682 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T++ A+ +++ + + K +++L+DG T ++ A Sbjct: 683 GGTSMYTAINSSFSMFEA----------TDDNILKMMIVLSDG-ATSYTYLHSSVVTTAN 731 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + G++I T+ + + + +F+ A+ +L I++D Sbjct: 732 NNGVKIYTVGLGSSSSSYFTQYLKPLANNTGGAFYLASDASQLEDIYKDI 781 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 39/185 (21%), Positives = 58/185 (31%), Gaps = 29/185 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + I K + + A A +I ++ D V ++ + + Sbjct: 420 DTEDAGIPKLEIAKSASAKMIEHLESSDGVGVIAFDHNYYWAYKFGKIS---------KK 470 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V G TAI + A +K + K IVLLTDG Q Sbjct: 471 EDVIESISSIEVGGGTAIIPPLSEAVK-----------TLKKSKAKSKLIVLLTDGMGEQ 519 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNK 439 E N+AK I+I TI LS AS F+ ++ EL Sbjct: 520 GGYEIP--ANEAKRNNIKITTIGVGKYVN-----ATVLSWIASFTSGRFYLVSNPSELVD 572 Query: 440 IFRDR 444 +F Sbjct: 573 VFLKE 577 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 60.4 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 61/240 (25%), Gaps = 59/240 (24%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + + A +I + VR+G F + Sbjct: 97 SMAATDVAPTRIGASKQAARDLIVGLPAS------VRLGMVSFAATATVVLPPTDN---- 146 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS--------------------------- 352 R + TA + A + Sbjct: 147 -RQDMLDAIDRFQLQLGTATGSGLIQALAVLFPDDGIDLEAILFSGESLAPGPGGRSLTE 205 Query: 353 --------SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 E E + ++LL+DG T + + A +G+R+ T+ Sbjct: 206 AAAADAVRKREQERPAAQPGSYRHGAVILLSDGRRTTG-PDPLDAARMAAQRGVRVYTVG 264 Query: 405 FSVNKTQQE----------KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 F + L A + +F+A S +L++++R ER Sbjct: 265 FGTPQGGAAAESGLSYYMQLDEPALRAVAAITNGEYFQAGSAADLSQVYRQLSARFALER 324 >gi|323529406|ref|YP_004231558.1| hypothetical protein BC1001_5117 [Burkholderia sp. CCGE1001] gi|323386408|gb|ADX58498.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 353 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 52/177 (29%), Gaps = 5/177 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + G ++TAL + ++G+ + VD+ R + L+ AA A + + Sbjct: 2 QARKSHRAARERQRGAVAVMTALCLTALVGITALAVDLGRAWVVRNELQNAADAAALAGA 61 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L + + + + + + + A+ + Sbjct: 62 GSLGPNYKSPNWTQAAAKAQSAITLNKTEGVALVTAQVQTGYWNVKGTPAGMQALPVPAP 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSM-----GIKSWLIQTKAEAETVSRSYHKEHGV 177 + + + L + GI + I A A + Y + Sbjct: 122 GAYDRPAVQVTVSRAAGQNGGPLSLVLAPVLGITTMPISATAVAVISAPGYAGPGAL 178 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 48/419 (11%), Positives = 117/419 (27%), Gaps = 45/419 (10%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + E + A+++ + + E + E + + + Sbjct: 66 NDVSEAAPVAESTPSENVSEGSENPAAEGAPAEQSTEQPVEQLVKQSEEQKAGSADEPVI 125 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV--SRSYHKEHGVSIQWVIDFSR 187 ++ S + + G ++ ++ + + ++ + + Sbjct: 126 ELATPSEAQPATASATPTQKPTGTENPTTVERSVQSDDDDADTVANQNEAKDDETKNNAD 185 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + S L + S + G KL+ + S Sbjct: 186 KTVRLGIASYRGMLKSASSGLSTPEHTKSIEYQGNGAYILKLNVIGKDASTSTTDTTPID 245 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR----SIKKIDNVNDTVRMGATFFN 303 D R K ++ A+ S + + I++ ND V++ + Sbjct: 246 IALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNDKVKVALVKYA 305 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR--- 360 +++ + + G T + T ++T+ + + ++ Sbjct: 306 NQIGTATG-ADGCRISNSRQSDTGNCTQIVQELTTDAGLLKTSVNGLQAAGATYADAAME 364 Query: 361 ------MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAF-- 405 AKKY++ TDGE + + N A K+ G + +I Sbjct: 365 VAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDGDVANAAIKKSQELKNAGTTVYSIGIFD 424 Query: 406 --------------------SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + L + AS + ++ A+S +L +IF D Sbjct: 425 GANPSASVSSASNANKFMHGISSNYPNATGYWNLGDRASGDYYYSASSATQLAQIFNDI 483 >gi|88800880|ref|ZP_01116434.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] gi|88776393|gb|EAR07614.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] Length = 555 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 42/375 (11%), Positives = 99/375 (26%), Gaps = 34/375 (9%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 L++ E R I D+A ++ S +V + Y L + + + + Sbjct: 23 TTDGALTDPAVLQEPVHTEETRAIETDSADQVFLAASKSRVEVQESYVLPSSTPIIPMPN 82 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS-----------MLDYQRDSEGQ 199 + + R + + +D + + Sbjct: 83 PPVSENRENYPKTPISPIRQVATDPVSTFSTDVDTASYTNARRFLNQGMRPPADSIRVEE 142 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 +N F + + +P SL P L+ + Sbjct: 143 FINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLL 202 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + K L++ + ++ ++ D R+ + + + G Sbjct: 203 DVSGSMNSPDKLPLMQRSFNLLVSQLRPQD------RVAIAVYAGQSGVVLEPTSG---D 253 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + G T + + AYD + N I + TDG+ Sbjct: 254 QKAQINQAINQLRAGGGTHGSAGIHLAYDL----AQANYLPDGINR-----IFIGTDGDF 304 Query: 380 TQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + G+ + + F ++ + + +S E Sbjct: 305 NVGTTSLTELKALIERKREAGVFLSVLGFGTGNYNDALMEELSNH--GNGTAYYLDSYQE 362 Query: 437 LNKIFRDRIGNEIFE 451 K+F ++ + Sbjct: 363 ARKLFATQLAATLQT 377 >gi|162448748|ref|YP_001611115.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] gi|161159330|emb|CAN90635.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 51/179 (28%), Gaps = 25/179 (13%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A ++ I + T R+ ++D + G R + Sbjct: 146 DPIAYVRAGLVEMIDALQP---TDRISLVRYSDAAEVVLEQAEGS---DREALTEAFEGL 199 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 GST + + + TAY + ++ L+DG T + + Sbjct: 200 TARGSTNLYEGLFTAYALAEQHLDPAWQNR---------VIFLSDGVATAGLTSPQRLVS 250 Query: 392 KAK---SQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRI 445 A +GI + I + +F+ + ++F + + Sbjct: 251 LAAGYAEKGIGLTAIGVGAEFDV-----DAMRGISEVGAGNFYFLEDPKAVEEVFAEEV 304 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 62/158 (39%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D + V+ GA ++D+ + I++ + + G+T A++ Sbjct: 841 VKKADVGRNRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDG++ ++ ++ + +GI I + Sbjct: 899 HANALFT-----EKHGSRIKQNVKQVLIVITDGKSHDHDQLNDT-ASELRDKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 953 VG------KANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 48/151 (31%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 533 KNGTKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + L A + L I Sbjct: 581 ---ANKIELQEIAGKEERVSFGQNFDALKSI 608 >gi|125575071|gb|EAZ16355.1| hypothetical protein OsJ_31817 [Oryza sativa Japonica Group] Length = 579 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 58/198 (29%), Gaps = 31/198 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K LV+ A+ VI ++ D R+ F+ + ++ + K Sbjct: 158 RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEASRRTRL-LRMSEVGKATAKRAV 210 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T I D ++ A + ++LL+DG+++ Sbjct: 211 ESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSS---------VILLSDGKDSYVVPRRGN 261 Query: 389 ICNKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + Q I T F + S + +F + Sbjct: 262 GMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMNTIAES---TGGTFSFVENEAA 318 Query: 437 LNKIFRDRIGNEIFERVI 454 + F IG + V Sbjct: 319 IQDSFAQCIGGLLSVAVQ 336 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 59/221 (26%), Gaps = 38/221 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + + Sbjct: 100 SMRATDVAPSRLVAAQEAAKQFADQLTPG------INLGLIAYAGTATVLVQPTTNREAT 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 K TA + + TA I + + IVL++DG+ Sbjct: 154 -----KNGLDKLQLADRTATGEGIFTALQAIATVG---AVIGGGDEPPPARIVLMSDGKE 205 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T + AK QG+ I T++F E L A Sbjct: 206 TVPSNPDNPKGAFTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLEKIAQL 265 Query: 424 SPNSFFEANSTHELNKIF--------RDRIGNEIFERVIRI 456 S F A+S +L +F + I + +R+ Sbjct: 266 SGGDAFTASSLEQLKAVFTSLQQQIGYETIKGDASVGWLRL 306 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 63/191 (32%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 45 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 100 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + K+ + Sbjct: 101 --GLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFSRQAGGRP---GDRAFKQVAI 155 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + R L AS F Sbjct: 156 LLTDG---RSQDLVLDAAATAHRAGIRIFAVGVGA------ALREELEEIASEPKSAHVF 206 Query: 430 EANSTHELNKI 440 + ++KI Sbjct: 207 HVSDFDAIDKI 217 >gi|134100328|ref|YP_001105989.1| hypothetical protein SACE_3793 [Saccharopolyspora erythraea NRRL 2338] gi|133912951|emb|CAM03064.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 327 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 55/201 (27%), Gaps = 31/201 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + +G F + + + Sbjct: 110 NRLEAAKVAAKEFADQLTPG------INLGLVSFAGTATVLVMPTTDRASVKQA-----I 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 + +TA D + A I S + IVL+ DG T E Sbjct: 159 DNLKLSEATATGDGINAAMSAIDS---FGKMVGGPSGAPPARIVLMADGGQTIPRELDAP 215 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQ---------QEKARYFLSNCA--SPNSFFEAN 432 +AK I I TI+F E + A S F +A Sbjct: 216 RGAYTKAQEAKKANIPISTISFGTKHGSIEIEGEQEFVEVDDEAMQEIARLSGGEFHKAA 275 Query: 433 STHELNKIFRDRIGNEIFERV 453 S +L +++ G +I + Sbjct: 276 SAEQLREVYATL-GEQIGYEI 295 >gi|111024162|ref|YP_707134.1| hypothetical protein RHA1_ro07212 [Rhodococcus jostii RHA1] gi|110823692|gb|ABG98976.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 68/203 (33%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA S + + +G F S + K Sbjct: 109 TRLAAAQDAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNREAS-----KVAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + TA +A+ T+ +I + ++ IVLL+DG EN D Sbjct: 158 DNLQLSERTATGEAIFTSLQSIDT---LAAVLGGSDQAPPARIVLLSDGKQTVPENPDDP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 G +AK + + I TI+F + + E L A S SFF A+ Sbjct: 215 RGGFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTAS 274 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL ++ D + +I R Sbjct: 275 SLEELRDVY-DTLEEQIGFETTR 296 >gi|218672134|ref|ZP_03521803.1| hypothetical protein RetlG_11055 [Rhizobium etli GR56] Length = 125 Score = 60.0 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 39/126 (30%), Gaps = 29/126 (23%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI--- 400 +N A+K + ++DG + C K +G++I Sbjct: 1 GDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVL 60 Query: 401 MTIAFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 T + Q + + CASP +FE + T + + I Sbjct: 61 YTTYLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTEGITDAMKALFLKVI-- 118 Query: 452 RVIRIT 457 R RIT Sbjct: 119 RAPRIT 124 >gi|311070192|ref|YP_003975115.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] gi|310870709|gb|ADP34184.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] Length = 227 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 42/129 (32%), Gaps = 13/129 (10%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G + G T I A+ A + + + + + + L Sbjct: 104 YGFQGFDEQSFRNSLNTIGPTGWTPIAKALNEAKSSF----------DQLDKKGENVVYL 153 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 LTDGE T I + I + I F + + + +F A+S Sbjct: 154 LTDGEETCG-GNPIKTAKELHKHNITVNVIGFDFKEGYKGQLNAIAK--VGGGEYFPASS 210 Query: 434 THELNKIFR 442 ++ +IF+ Sbjct: 211 QSDIKQIFK 219 >gi|149922178|ref|ZP_01910616.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] gi|149816918|gb|EDM76403.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] Length = 689 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 44/408 (10%), Positives = 101/408 (24%), Gaps = 35/408 (8%) Query: 60 AIITASVPLIQSLEE-VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A+ S + + + + +A + + + + + + + + D Sbjct: 147 ALQAVSDEIDATKADHPADQADPARAYVNVLLTDGAWTGQDGTTTLSPASQNPANTTADL 206 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLN-PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + V ++ + + I + +A V Sbjct: 207 FDNQDISTYVVAVAGDPAAEMAADETAAAGGTDAAIDGNTPEELEQALANVVQNIISSVV 266 Query: 178 S---------IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 + + ++D S SML+ + + G ++ + + G I Sbjct: 267 APECIGGLPRVMILLDASSSMLNIGGGTMAGGMGETGWDQ--AREALAGEQGLFDIDVGI 324 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + V+ + + + + Sbjct: 325 GAAEDVTQLGLAVFGHNQPAPGEQNILVQ-YAPCAKDNFAWALDPNSSCVEP-----GCT 378 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D A ++D P F + T + +Q D Sbjct: 379 DPWGG--PPIAWTYSDGQQDPPGFDIPTTSHMPQCAG---NTFCSGSGTYTHLGLQLIKD 433 Query: 349 ---TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTI 403 + + N E + +L+TDG+ N + + + GI I Sbjct: 434 NQVQYQADGLMDGAEFPTNDETIYFNILITDGQYNGYSTNAQVQGELEEMYNDGITTYVI 493 Query: 404 AFSVNKTQQEKARYFLSNCA-----SPNSFFEANSTHELNKIFRDRIG 446 F L N A ++++AN+ EL G Sbjct: 494 GFGDGVDTPAAMAQ-LQNMAQWGSGDSENYYDANNQAELEAALTSIFG 540 >gi|149200035|ref|ZP_01877061.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] gi|149136908|gb|EDM25335.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] Length = 307 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 54/162 (33%), Gaps = 16/162 (9%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAIN 340 ++++K+ + G T F I+ + + G T I Sbjct: 114 MQAVKEFTEYREGDAFGLTVFGTEYINWVPVTKDTSAIALATPFLAPDRMSKWFGGTNIA 173 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ + ++ + + I+L++DG + N + + + + ++ I Sbjct: 174 KALRGSQQQLLQQEDG-----------DRMIILVSDGVSGSPN-DTVDMAQELRNNKIVA 221 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I E L+ + F N+ L++ FR Sbjct: 222 YCIYIGSGNGSPEMN--ALAAI-TGGQVFGVNNPKALDETFR 260 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 15/118 (12%), Positives = 41/118 (34%), Gaps = 12/118 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + + + ++ + G T + A++ + + +++ Sbjct: 133 DRVGLVVFAGDAFIQSPLTSDYSAV-KLFLRAVDPEVMPQGGTNVGAALRLSRQVLENAD 191 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 ++ +VLLTDGE+ + K G++++ + + Sbjct: 192 RGSK---------ERVVVLLTDGEDLVGDV--AEATEALKDSGVQVLAVGVGSESGEP 238 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 31/325 (9%), Positives = 86/325 (26%), Gaps = 31/325 (9%) Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA 208 + + + S + + V D S SM + + Sbjct: 44 KLEPNGDGTYKLSLSVTGTASSTSESSKADVVIVFDISNSMDEETNTYVEYATGRY---G 100 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV- 267 + + + + + + + + + G + + + + R+ Sbjct: 101 SVSSDAPTGSSTRRRLYRRSTNNWGYYQYTEITNDTTSGTVYYLGDNYQYHEYTGKRYSQ 160 Query: 268 IKKKHLVRDALASVIRSI---KKIDNVN--------DTVRMGATFFNDRVISDPSFSWGV 316 + + + A ++I + + + DT AT ++ + S G Sbjct: 161 KTRLDVAKSATNTMIDQLLANNATNPGSVRISLVSFDTFASDATAWSTSSENLHSIVNGY 220 Query: 317 HKLIRTIVK------TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + + T Q + N N +Y Sbjct: 221 KTPQSSHLGGHRGGTNWEDALQKADGTQPRADAQKHVIFVSDGNPTFRISSINGNPDDQY 280 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRI-------MTIAFSVNKTQQEKARYFLSNCA 423 + G++ + + AK +I T+ A Sbjct: 281 NDVHGHGDDDYYHSHPNYNYDAAKDDAKKIVDGGAAFYTVG--TFGDAARMQNLATEAGA 338 Query: 424 SPNSFFEANSTHELNKIFRDRIGNE 448 S +++++A+ L F++ + + Sbjct: 339 S-DNYYKADDEAALKAAFKNIVASI 362 >gi|171846231|ref|NP_034899.2| cartilage matrix protein precursor [Mus musculus] gi|28503027|gb|AAH47140.1| Matrilin 1, cartilage matrix protein [Mus musculus] gi|66350790|emb|CAC79633.1| cartilage matrix protein [Mus musculus] gi|123295072|emb|CAM17812.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] gi|187466486|emb|CAQ51562.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 500 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 30/282 (10%), Positives = 76/282 (26%), Gaps = 37/282 (13%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKS--YSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + + + L Q A V + + + Y +C++ Sbjct: 199 PQDEHVDYVESYNVIEKLAKKFQEAFCVVSDLCATGDHDCEQLCVSSPGSYTCACHEGFT 258 Query: 242 YMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + S + LV+ + ++ ++ D + Sbjct: 259 LNSDGKTCNVCRGGGSGSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRLAQV 318 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ++ + + G + I T A++ D + + Sbjct: 319 G---LVQYSSSIRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVS 373 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K ++ TDG + + KAK G ++ + Sbjct: 374 SGARPGA------QKVGIVFTDGRSQDYIND---AARKAKDLGFKMFAVGVG------NA 418 Query: 415 ARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFERV 453 L AS + +F +N+ IG ++ +++ Sbjct: 419 VEEELREIASEPVADHYFYTADFKTINQ-----IGKKLQKQI 455 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 55/168 (32%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + + + T A+ Sbjct: 72 IESLDVGPNATRVGLVNYASTVKPEFPL---RAHGSKASLLQAVRRIQPLSTGTMTGLAL 128 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A I + D + + K ++++TDG QD+ ++ +A++ GI + I Sbjct: 129 QFA---ITKALSDAEGGRARSPDISKVVIVVTDGRP-QDSVRDVS--ERARASGIELFAI 182 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L K F++ Sbjct: 183 GVGR------VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQEAF 224 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 64/188 (34%), Gaps = 23/188 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + VI + RDA ++ + + + M F+D V F+ Sbjct: 84 DTSESVITQFQTERDAAKRFVKQM-----LREQDEMDLISFSDTVDEIVPFTN-----DA 133 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + ++ +T++ DA+ A + + D R K +V++TDG NT Sbjct: 134 GRMNAGIGNLHKGDATSLYDAIYLASQRLTEAKRDATRR--------KILVIVTDGGNTT 185 Query: 382 DNEEGIAICNKAKSQGIRIMTI---AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 A+ G I I + + + L A + +F H+ Sbjct: 186 KGMRYQQAVEAAERAGAAIYPIIMVPIEADAGRNTGGEHALIQMAQDTGGKYFYVLDKHD 245 Query: 437 LNKIFRDR 444 L+K F Sbjct: 246 LDKAFAHL 253 >gi|126457247|ref|YP_001076991.1| hypothetical protein BURPS1106A_A2962 [Burkholderia pseudomallei 1106a] gi|242311800|ref|ZP_04810817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254192526|ref|ZP_04898965.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|126231015|gb|ABN94428.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|169649284|gb|EDS81977.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|242135039|gb|EES21442.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 418 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 96/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +++ + T + ++D V +F+ Sbjct: 255 GDKAATTNAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|326674126|ref|XP_003200076.1| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 940 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 48/162 (29%), Gaps = 8/162 (4%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + + ++ S T A+ I Sbjct: 212 NCNFAIVQYGSSIRTELSLLDNEDGARSLQKVKQIKQIY--NLTKTASAINHVLTDIF-- 267 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + + ++K I++L+DG+ D + NK + +G+ +I + + Sbjct: 268 ----IPENGSKNNSEKIIIVLSDGKILGDPMTLDEVLNKTQMKGVTRYSIGVGDGILKNK 323 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 A ++ A P ++ +S LN I I + Sbjct: 324 DAIKEMTQIADPGKYYSVSSYGALNDILSSLERGIIGTEGTQ 365 >gi|260881326|ref|ZP_05404133.2| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] gi|260849116|gb|EEX69123.1| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] Length = 428 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 31/267 (11%), Positives = 72/267 (26%), Gaps = 12/267 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G ++TA L+P ++ GM VD L+ AA A + + L + + Sbjct: 5 RRGQQGAILVLTAFLLPFIIAFTGMAVDFGSAYVRRSQLQNAADAAALAGAYHLDDNQAD 64 Query: 75 VSSRAKNSFTF---PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT-------AVEMNP 124 + + N+ + +D + T + Sbjct: 65 DVVLKYLKTNLDPHFTSYSYQTGDDFPDKFETLNYHTDKQKDELDVTLRSSVEASFLKLF 124 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 V +++ + SL M + + K+ + G++I+ I Sbjct: 125 DIDTIPVYATAKAKVSKEKESLPTDDMFNYAITVANKSYETGNPSIFMVSSGMNIKGNIL 184 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + S+L + + + + +P + Y Sbjct: 185 TNGSILFWNDRVNTLDGKIYSAVPLNKQVWSNKAWDNKPLAFA--TPDGKTFTDIHDYNK 242 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKK 271 G D + ++ + Sbjct: 243 NNGIADIRIDSNSGIEKMIRNYRDMAI 269 >gi|19552242|ref|NP_600244.1| hypothetical protein NCgl0978 [Corynebacterium glutamicum ATCC 13032] Length = 594 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 58/216 (26%), Gaps = 35/216 (16%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-------- 306 + ++ + + A +I I +D +G T++ Sbjct: 5 DNSGSMTAQDAGGQTRIDAAKQASTQLINDI------SDRTDVGLTYYGGNTGETEADVE 58 Query: 307 --ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 D + G + + G T I A+ + Sbjct: 59 MGCQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN--------- 109 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNC 422 IVL++DG + + + GI I TI +V+ +E+ Sbjct: 110 ------IVLVSDGIANCTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECI--AG 161 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ +A+ L + +TK Sbjct: 162 VGGGTYADASDAQSLTDALTRAASRQYNSYTSDVTK 197 >gi|124003889|ref|ZP_01688737.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123990944|gb|EAY30411.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 704 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 31/278 (11%), Positives = 72/278 (25%), Gaps = 30/278 (10%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK--------SYSS 217 + S L + +N F + Sbjct: 248 STFSIDVDNASYSNVRRFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHP 307 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + +P+ L + VD+S K L++ + Sbjct: 308 FSVNTEYGTCPWNPHHKLLQIGLQGENLQTKNASPANLVFLVDASGSMDSEDKLPLLKRS 367 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 +++ + ++ + + H R + T + GST Sbjct: 368 FKVLLKQLTDS-----RTKIAIVAYAGASGLVLPATSVSH---REKILTALENIESGGST 419 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAK 394 A + ++ AY + NN ++L TDG+ +EE + + + + Sbjct: 420 AGGEGIELAYKI----AQQAFIAGGNNR-----VILATDGDFNVGLSSDEELMQLISNKR 470 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 G+ + + F A +++ + Sbjct: 471 KSGVYLTCLGFGTGNLNDSMMEKL--TNAGNGNYYYID 506 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 60/219 (27%), Gaps = 26/219 (11%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + Q L P+ R K + + P + + Sbjct: 39 LRTFASAQMEQRLVKNISPSRRRFKLGLILLALILLCTALARPQY----GFRWVDVQRRG 94 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D + + S + R A+ + ++ R+G F Sbjct: 95 IDILFAIDTSRSMLSQDLKPNRLERARYAVMDFVATLGG-------DRVGLIPFAGSSYL 147 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + +K T I + A T+ S Sbjct: 148 MCPLTLDYQAFTDS-LKALDTKIIPRRGTNIAKVIALAEKTVADS------------SNH 194 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 K +++LTDGEN Q + + AK G+ I TI Sbjct: 195 KILIILTDGENLQGDVL--KAADLAKKNGLTIYTIGVGT 231 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 66/176 (37%), Gaps = 20/176 (11%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + VR +++++ + + D R+G ++DR ++ G +K I + Sbjct: 51 KEDFEKVRQWVSNLVETFEI---GPDKTRVGVVRYSDRPTTEFDL--GKYKTREEIKEAA 105 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G+T DA++ S E ++ KK +LLTDG + + Sbjct: 106 RKIRYYGGNTNTGDALRYIN---TYSFSKEAGGRLSDRTVKKVAILLTDGRSQDYVLDP- 161 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 N A+ GIRI + E + L AS F + + ++KI Sbjct: 162 --ANAARQAGIRIFAVGVG------EALKEELDEIASEPKSAHVFHVSDYNAIDKI 209 >gi|116254826|ref|YP_770662.1| hypothetical protein pRL100386 [Rhizobium leguminosarum bv. viciae 3841] gi|115259474|emb|CAK10612.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 644 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 46/407 (11%), Positives = 107/407 (26%), Gaps = 29/407 (7%) Query: 46 WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 + + P E + + T + L + Sbjct: 75 AEQSVAVAQAPQDKEAPAVAKPDSSQTSEFDANDIAALTNKPEGAAAALGAAKRSAPAAP 134 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + + A+ +P A R + L+P + Sbjct: 135 GIAPQRQLAEPMAAIAPSPAPPA-----EGRVQIQLDPDRERFANAAANPIKSVATDPVS 189 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 T S ++ + +M D + +N F + V + Sbjct: 190 TFSADVDSASYAFVRRSL-TGGAMPDPLSVRVEEMINYFPYDWPGPDNAEQPFKATVTVM 248 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + ++ P + +D S K L++ A ++ + Sbjct: 249 PTPWNHDTELMHVAIKGYDIAPATTPHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVNRL 308 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 K D V+ + + + + ++ + + GST + ++ Sbjct: 309 KPDDTVS------IVTYAGNAGTVLTPT---RVAEKSKILSAIDRLEAGGSTGGAEGIEA 359 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMT 402 AYD + + ++L TDG+ +E+ I + + GI + Sbjct: 360 AYDL---------AKQGFVKDGVNRVMLATDGDFNVGPSSDEDLKRIIEERRKDGIFLTV 410 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + F + N S ++ E K + G+ + Sbjct: 411 LGFGRGNLNDSLMQTLAQN--GNGSAAYIDTLAEAQKTLVEEAGSTL 455 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 53/190 (27%), Gaps = 34/190 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK V+ +L ++ + + D R+ F+ + K Sbjct: 243 KKIASVQQSLVQLLDFLSEKD------RLCLITFDGSAQRLTPLKTLTQD-NKNYFKKAI 295 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + A++ I I LL+DG++ Sbjct: 296 YSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTS---------IFLLSDGQD-------QG 339 Query: 389 ICNKAKSQG------IRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKIF 441 + Q + I + + + ++ C SF+ L++ F Sbjct: 340 AAEYIQRQKDVVEDIVTIHSFGYGSDHDAALMSKI----CKVGQGSFYYIEDVKLLDEFF 395 Query: 442 RDRIGNEIFE 451 D +G Sbjct: 396 ADALGRLSSA 405 >gi|51597046|ref|YP_071237.1| hypothetical protein YPTB2727 [Yersinia pseudotuberculosis IP 32953] gi|51590328|emb|CAH21965.1| putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 472 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 70/263 (26%), Gaps = 28/263 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + + ++ S SL + Sbjct: 37 TLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPI 96 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S+ + R+A + + D ++ +++ + Sbjct: 97 NLALVIDRSTSMSGE-RIEKAREAAILAVNMLNTTDTLS------VVAYDNHAEVIIPAT 149 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + G TA+ + + N E I+L Sbjct: 150 KVTD--KPALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIIL 198 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++DG+ + A +GI I TI + ++ A S + Sbjct: 199 ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNH 253 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ +L K F + + Sbjct: 254 TFVANSADLEKAFTKEFQDVMSV 276 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 48/390 (12%), Positives = 109/390 (27%), Gaps = 22/390 (5%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + ++ + + T Q +Y + + R Sbjct: 1305 SASDYTATYDQATRTVTVQFRADYELEKDATYSISFDVKPSEEAYNKFEKDGGYYRNEDG 1364 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +L+ + S ++ + +K+ + + R Sbjct: 1365 GYILADGEEPSEENQYKGDEKTDDPSVPEDSQTSSGKPGFPANKKATLQYTYDGGTGRFE 1424 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P+ + + + + + +P V+ Y + + Sbjct: 1425 YPHPVLQIPEPVLPDEYNKRIEPNDDGTYSLTLDVTGIEGNPATVTTK---YPVDLVFVI 1481 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D SLS ++ +D + ++ + +D + + I D + + + S Sbjct: 1482 DKSLSMDYDIDGNEIKWWDDETESRKDIVNDALEEIIP-DLCSQQYDIQIAGYQFSGSST 1540 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 W V + ST + A+ A D + + + +N KK Sbjct: 1541 RVLDWSRE---EQQVLSGLKIARTSSSTEPSQALADALDMLKTGS----PAHRNQSNVKK 1593 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---RYFLSNCASPN 426 Y++ +TDGE T+ + G I TI S + + L S Sbjct: 1594 YLIFMTDGEPTEPEDWSYNAVRNHAVPGASIYTIGVSSDASTNLMEGIRSTAL----SNG 1649 Query: 427 SF----FEANSTHELNKIFRDRIGNEIFER 452 + F+ S + F I Sbjct: 1650 MYAPATFKGTSAQLIRDAFTQIKDEIISTS 1679 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 35/290 (12%), Positives = 72/290 (24%), Gaps = 30/290 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + + +K + + + + + + Sbjct: 990 DAYFSFGEDGTTKVRFPHPVIPAPATNHPEYRKYIKDNGNGTYTLTLDVKSDVGSVTTGQ 1049 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 K +D S S + S + ++ +V AL + D Sbjct: 1050 KDPTPTAVMFVIDKSGSMDQSFGSGNSDA---RREVVNSALELFFNQLSDGDYNIQFG-- 1104 Query: 298 GATFFNDRVISDPSFSWGVHK------LIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 F+D G + GST + +++A + Sbjct: 1105 -GYKFSDSGERVNFNDQGWETEYWETDTSNALSHLKLTSRETDGSTYPSQTLRSAISAL- 1162 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-----GIAICNKAK--SQGIRIMTIA 404 ++ K+Y++ LTDGE Q++ + K G I Sbjct: 1163 -------ENVELGENGKRYLIFLTDGEPGQNSYSFSEKEAENCYSAIKNLDSGTTFYAIQ 1215 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFE 451 + + + + + F NS ELN F Sbjct: 1216 VANSDSHGFMESMVSNANSVDGVTAQKFVGNSADELNAAFSQMAAEISGS 1265 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 12/127 (9%), Positives = 34/127 (26%), Gaps = 13/127 (10%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIR 321 + I + + + R ++ ++ ++ +G + + Sbjct: 629 YRTSYGSHGRPSTEGKEGFIDELFEQSPNS---RFSIVTYSTDASTELGWTEYGRNGSGQ 685 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 +K D G T + A + + ++ L+DG+ T Sbjct: 686 QTIKKAIGDLQANGGTNYEAGLYQAVEVLKERGNSSNIP---------VVIFLSDGKPTY 736 Query: 382 DNEEGIA 388 + Sbjct: 737 YYSDVDE 743 >gi|77747911|ref|NP_638263.2| hypothetical protein XCC2915 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761138|ref|YP_242283.2| hypothetical protein XC_1194 [Xanthomonas campestris pv. campestris str. 8004] Length = 597 Score = 60.0 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 43/390 (11%), Positives = 108/390 (27%), Gaps = 31/390 (7%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ + ++E L + + N++ + T A + Sbjct: 41 ADSAANLPLPETSMVAPQVESSLRQRAQGNVQASKTASRPMLPAPPVLSTPAAPVPAAAL 100 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-------QWVID 184 L P+ + +++ + + + + Sbjct: 101 AEPMHMHERLMPMPPSPPTENRETYQTLSDNPIVQAAEQPVSTFSIDVDTGSYSNVRRFL 160 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + ++ + +N F + + + + + Sbjct: 161 NAGTLPPVDAVRVEELINYFRYDDPAPTDGTP-FAVRTELAPTPWNTDTLLLRIGVAGRE 219 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P P+ + VD S K L++ +L ++R ++K D R+ + Sbjct: 220 VPTAALPAANLVFLVDVSGSMGAPDKLPLLQSSLKLLVRQLRKQD------RITLVTYAG 273 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + G +T + G TA ++ AY + + R N Sbjct: 274 STAVVLPPTSG---AQQTRIVEAIDSLQSGGGTAGASGIELAYK----AAQQAYLRGGIN 326 Query: 365 LEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I+L TDG+ + + + + G+ + T+ F Sbjct: 327 R-----ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLAD- 380 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ +S E K+ +G+ + Sbjct: 381 -AGDGAYAYIDSALEARKVLTHELGSTLAT 409 >gi|21114118|gb|AAM42187.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572853|gb|AAY48263.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 618 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 43/390 (11%), Positives = 108/390 (27%), Gaps = 31/390 (7%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + ++ + ++E L + + N++ + T A + Sbjct: 62 ADSAANLPLPETSMVAPQVESSLRQRAQGNVQASKTASRPMLPAPPVLSTPAAPVPAAAL 121 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-------QWVID 184 L P+ + +++ + + + + Sbjct: 122 AEPMHMHERLMPMPPSPPTENRETYQTLSDNPIVQAAEQPVSTFSIDVDTGSYSNVRRFL 181 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + ++ + +N F + + + + + Sbjct: 182 NAGTLPPVDAVRVEELINYFRYDDPAPTDGTP-FAVRTELAPTPWNTDTLLLRIGVAGRE 240 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 P P+ + VD S K L++ +L ++R ++K D R+ + Sbjct: 241 VPTAALPAANLVFLVDVSGSMGAPDKLPLLQSSLKLLVRQLRKQD------RITLVTYAG 294 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + G +T + G TA ++ AY + + R N Sbjct: 295 STAVVLPPTSG---AQQTRIVEAIDSLQSGGGTAGASGIELAYK----AAQQAYLRGGIN 347 Query: 365 LEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 I+L TDG+ + + + + G+ + T+ F Sbjct: 348 R-----ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLAD- 401 Query: 422 CASPNSFFEANSTHELNKIFRDRIGNEIFE 451 A ++ +S E K+ +G+ + Sbjct: 402 -AGDGAYAYIDSALEARKVLTHELGSTLAT 430 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 68/199 (34%), Gaps = 35/199 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + + +R+G T F+D + ++ + RTI Sbjct: 106 DRITAAKRAASRFLDQLPPG------IRVGVTTFSDVPDGTQTPTYDHDLIRRTIEAQI- 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G TA DA+Q A DT+ +N +VLL+DG T + + Sbjct: 159 ----ADGGTATGDALQVALDTL-------ERLEQNGERTPAAMVLLSDGATTTGR-DPVM 206 Query: 389 ICNKAKSQGIRIMTIAFSVNK-------------TQQEKARYFLSNC--ASPNSFFEANS 433 + A I I T+A L AS F+A Sbjct: 207 VARAAGEARIPIYTVALGTRDATVPNPGPTGPPLLPVAPDPETLQAIADASGGRAFQAQD 266 Query: 434 THELNKIFRDRIGNEIFER 452 EL+ I+ + +G+ + R Sbjct: 267 DQELSSIY-ETLGSRLGTR 284 >gi|218515243|ref|ZP_03512083.1| hypothetical protein Retl8_16920 [Rhizobium etli 8C-3] Length = 94 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 34/88 (38%), Gaps = 11/88 (12%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ----------AAQTAIIT 63 I +G+F I+TALLM ++G GM VD L A ++ + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 A++ + +S ++ ++ Sbjct: 68 AAMAM-NGNGTISLGKTDARDIFMSQVS 94 >gi|167842450|ref|ZP_02469134.1| hypothetical protein Bpse38_37650 [Burkholderia thailandensis MSMB43] Length = 418 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 32/300 (10%), Positives = 74/300 (24%), Gaps = 1/300 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMLTNSNVTFSNSLSNPFQPKSSIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + S + + + + +V Sbjct: 135 TSRTGIVNWFIQTLNMVPGVSVANASVSATAVATVGAAQTTCAIPVFVCKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + S Y L Sbjct: 195 GATYNIGDWLAVKTGSPPSFGAGNFGWSALDGSNSAASIKSELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 DS++ + T + ++D V +F+ Sbjct: 255 GNKVADSAAYNTRFGIYANPYKDPSYGTPDFTGYAYDATTWPAQSNAYSDFVSKRLTFTS 314 >gi|256397309|ref|YP_003118873.1| hypothetical protein Caci_8209 [Catenulispora acidiphila DSM 44928] gi|256363535|gb|ACU77032.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 177 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 42/165 (25%), Gaps = 2/165 (1%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ ++ G + + PV++ + +VDV A A + Sbjct: 13 RRSLRRALRDDRGSLAMAVVIWAPVVVLLMAFVVDVGLLISDRTQASDYADQAARRVAQD 72 Query: 68 LIQS--LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + Q K ++ + + ++ + V NP Sbjct: 73 IDQGWLKTHNVRGPNGEDPGIKVNVDPDTGDCVPDAEQYLLDNQITDTTITSCQVTGNPT 132 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + V L + LF+ + A V Sbjct: 133 EVDLYVNPRITVTLQMQYKPLFVGFALKGDSTVTGTGSATPVIPK 177 >gi|148698183|gb|EDL30130.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 456 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 30/282 (10%), Positives = 76/282 (26%), Gaps = 37/282 (13%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKS--YSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + + + L Q A V + + + Y +C++ Sbjct: 155 PQDEHVDYVESYNVIEKLAKKFQEAFCVVSDLCATGDHDCEQLCVSSPGSYTCACHEGFT 214 Query: 242 YMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + S + LV+ + ++ ++ D + Sbjct: 215 LNSDGKTCNVCRGGGSGSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRLAQV 274 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ++ + + G + I T A++ D + + Sbjct: 275 G---LVQYSSSIRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVS 329 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K ++ TDG + + KAK G ++ + Sbjct: 330 SGARPGA------QKVGIVFTDGRSQDYIND---AARKAKDLGFKMFAVGVG------NA 374 Query: 415 ARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFERV 453 L AS + +F +N+ IG ++ +++ Sbjct: 375 VEEELREIASEPVADHYFYTADFKTINQ-----IGKKLQKQI 411 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 55/168 (32%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + + + T A+ Sbjct: 28 IESLDVGPNATRVGLVNYASTVKPEFPL---RAHGSKASLLQAVRRIQPLSTGTMTGLAL 84 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A I + D + + K ++++TDG QD+ ++ +A++ GI + I Sbjct: 85 QFA---ITKALSDAEGGRARSPDISKVVIVVTDGRP-QDSVRDVS--ERARASGIELFAI 138 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L K F++ Sbjct: 139 GVGR------VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQEAF 180 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 50/137 (36%), Gaps = 8/137 (5%) Query: 278 LASVIRSIKKIDNVN-DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + S I + ++ D VR+G F ++ S + + GS Sbjct: 66 VRSFISNFAGTMPLSPDDVRVGLVTFGTSAVTRWDLSDSRAQNADLLAAAAKKLPYAAGS 125 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + + A + + S + A K I+++TDG + + ++ K +++ Sbjct: 126 TYTHLGLAKAEEILFS------FQKGGRDNAPKMILVMTDG-ASSRRSQTLSAAEKLRNR 178 Query: 397 GIRIMTIAFSVNKTQQE 413 G+ I+ + E Sbjct: 179 GVIIVVLGVGTGVNSAE 195 >gi|21323788|dbj|BAB98414.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 60.0 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 58/216 (26%), Gaps = 35/216 (16%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-------- 306 + ++ + + A +I I +D +G T++ Sbjct: 45 DNSGSMTAQDAGGQTRIDAAKQASTQLINDI------SDRTDVGLTYYGGNTGETEADVE 98 Query: 307 --ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 D + G + + G T I A+ + Sbjct: 99 MGCQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN--------- 149 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNC 422 IVL++DG + + + GI I TI +V+ +E+ Sbjct: 150 ------IVLVSDGIANCTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECI--AG 201 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ +A+ L + +TK Sbjct: 202 VGGGTYADASDAQSLTDALTRAASRQYNSYTSDVTK 237 >gi|291008772|ref|ZP_06566745.1| hypothetical protein SeryN2_29978 [Saccharopolyspora erythraea NRRL 2338] Length = 324 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 55/201 (27%), Gaps = 31/201 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + +G F + + + Sbjct: 107 NRLEAAKVAAKEFADQLTPG------INLGLVSFAGTATVLVMPTTDRASVKQA-----I 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 + +TA D + A I S + IVL+ DG T E Sbjct: 156 DNLKLSEATATGDGINAAMSAIDS---FGKMVGGPSGAPPARIVLMADGGQTIPRELDAP 212 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQ---------QEKARYFLSNCA--SPNSFFEAN 432 +AK I I TI+F E + A S F +A Sbjct: 213 RGAYTKAQEAKKANIPISTISFGTKHGSIEIEGEQEFVEVDDEAMQEIARLSGGEFHKAA 272 Query: 433 STHELNKIFRDRIGNEIFERV 453 S +L +++ G +I + Sbjct: 273 SAEQLREVYATL-GEQIGYEI 292 >gi|239906053|ref|YP_002952792.1| hypothetical protein DMR_14150 [Desulfovibrio magneticus RS-1] gi|239795917|dbj|BAH74906.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 391 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/392 (10%), Positives = 101/392 (25%), Gaps = 19/392 (4%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 +G + +AL+M V+ G+ + VD Y L+ AA A + + L+++ ++ + Sbjct: 13 ESGSVVVFSALIMIVLAGLATLAVDYGFLQYKRSQLQTAADAAALAGAADLLRNGDDFDA 72 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + F ++ + E + T + +V Sbjct: 73 VRATAVDFGQRNLGEQDTVASAVTTGDVELLKGETPAAGATPDTV-------RVTAGRTA 125 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE-HGVSIQWVIDFSRSMLDYQRDS 196 +G + + A A K S + S ++ Sbjct: 126 QRGNPVDMFLGPVLGWNTQDLTATASASLFCSDQTKCLKPFSPPAKFTWDDSCDTNKKFK 185 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + +V G +G ++ P + + E Sbjct: 186 NNDAFDPGSSCELSSVNVLGYDAGDLGTPIVLKLSDSHDTVVPGHFQAVDYPPANTGNPE 245 Query: 257 HF---VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +VR+ VI + + + D + + Sbjct: 246 SGAAVYRLNIAGCTASNNTVVREGDELVIEPGNMVGP-TNQGLADLIATDPGASWDSATN 304 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +K+ + + + ++I V N + + Sbjct: 305 SITGSAYADPMKSPRVALIPFY--DPSRPQNSGRNSIYVYQLGAVFIENTNAKGEVVGRF 362 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + G + C+ A + + ++ Sbjct: 363 M-RGMAVEPK-RTTGACDTA---SVSLYSVGL 389 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 65/190 (34%), Gaps = 22/190 (11%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I + V+ ++ + + T R+G F S L+ I + Sbjct: 117 ITRLQAVQRSVRAFVAQ-------RKTDRIGLVIFASSAWPFAPISEDKQALLARINQ-- 167 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 TAI DA+ A + SS + +A + +LLTDG +T Sbjct: 168 LAPGMIGQQTAIGDALGVAVKLLDSS---------LDRDASRLAILLTDGNDTASQLSPA 218 Query: 388 AICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 A S +++ TIAF + + + L A + +A+++ + Sbjct: 219 LAAQLAASHHVQVHTIAFGDINSSGEDKVDTALLKQIAQLTGGEALQASTSGKALDSVWQ 278 Query: 444 RIGNEIFERV 453 +I +V Sbjct: 279 QIDAMTPSQV 288 >gi|62389907|ref|YP_225309.1| hypothetical protein cg1159 [Corynebacterium glutamicum ATCC 13032] gi|41325243|emb|CAF19723.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 58/216 (26%), Gaps = 35/216 (16%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV-------- 306 + ++ + + A +I I +D +G T++ Sbjct: 45 DNSGSMTAQDAGGQTRIDAAKQASTQLINDI------SDRTDVGLTYYGGNTGETEADVE 98 Query: 307 --ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 D + G + + G T I A+ + Sbjct: 99 MGCQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN--------- 149 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNC 422 IVL++DG + + + GI I TI +V+ +E+ Sbjct: 150 ------IVLVSDGIANCTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECI--AG 201 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ +A+ L + +TK Sbjct: 202 VGGGTYADASDAQSLTDALTRAASRQYNSYTSDVTK 237 >gi|282863310|ref|ZP_06272369.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561645|gb|EFB67188.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 624 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 60/171 (35%), Gaps = 15/171 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R A+ +V+ ++ V +D V L R VK+ Sbjct: 48 TRMESARRAVGAVVDALPDGYPTGLRVYGADRPQG---CADTRLVRPVRPLDRAAVKSAV 104 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I +++ A + + ++ + IVL++DGE+T Sbjct: 105 AGVRPTGDTPIGLSLRKAAEDL--------PAPRDGAARTRTIVLVSDGEDTCGTPPPCE 156 Query: 389 ICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + + G+RI T+ F V +E+ A +++A L Sbjct: 157 VAARLAGQGAGLRIDTVGFQVKGAAREQLECVAE--AGNGRYYDAPDADAL 205 >gi|116623283|ref|YP_825439.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226445|gb|ABJ85154.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 299 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 58/168 (34%), Gaps = 21/168 (12%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVKTFAIDENE--MGSTAINDAMQ 344 D V F+ R+ + + +L + G T + DA+ Sbjct: 108 PNKDQV--FLMQFDFRIFMRQPLTNSLRQLSDSLPYVDTPTFNQLRAQSGGGTLLYDAVV 165 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + N +K ++LLTDGE+ + A+ I +I Sbjct: 166 TASQEV-----------MLNRTGRKALILLTDGEDYGSDASVGDAIEAAQRADTLIYSIL 214 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 F+ + + + SFFE + ++++IF I E+ + Sbjct: 215 FADQGDGRRPLQRMSKE--TGGSFFEVSKKQDIDQIF-TAIQEELRSQ 259 >gi|257093736|ref|YP_003167377.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046260|gb|ACV35448.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 452 Score = 59.6 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 56/176 (31%), Gaps = 21/176 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D + T F+DRV + R + + GST ++ Q Sbjct: 75 ADQLEPTDIASLVVFDDRVQTLVPP---RPVGDRQALHLALSRVHSGGSTNLHGGWQAGA 131 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT----QDNEEGIAICNKAKSQGIRIMTI 403 D ++ + ++LL+DG D A+C +A +G+ T Sbjct: 132 DGLLPAAGQAALAR---------VILLSDGNANVGEITDPAGIAALCAQAAERGVSTSTY 182 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG--NEIFERVIRIT 457 + + + + ++ +L + F + + R +R++ Sbjct: 183 GLGSHFNEDLMVEMAKR---GGGNHYYGDTAADLFEPFAAEFDFISALCARHVRLS 235 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 65/206 (31%), Gaps = 25/206 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + S+ F+ S +V+ L ++ S ++G ++D Sbjct: 34 SCRTAPSDLVFILDGSWSVDDINFEIVKKWLVNITMSFNIGQK---FTQVGVVQYSDDPF 90 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLE 366 ++K E G+T A++ A D + + S Sbjct: 91 LHIPLGKHFSSSD--LIKAMESIEYMGGNTNTGRAIKFANDKLFALSERGPNGIA----- 143 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--- 423 K V+LTDG + +E +A A+ +GI + I E L A Sbjct: 144 --KIAVVLTDG---KSQDEVLAAAEAARKKGIILFAIGVGS-----ETEEAQLRAIANKP 193 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 S F + K R+ I ++ Sbjct: 194 SSTYVFSVKDYKAIAK-IREVIRQKL 218 >gi|221640506|ref|YP_002526768.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] gi|221161287|gb|ACM02267.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] Length = 651 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 43/393 (10%), Positives = 103/393 (26%), Gaps = 27/393 (6%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + P + + + A+ + E + + Sbjct: 102 LAEPPQTAPDAPEADARLTAAPEAGGGAETAGAPVPAEPRARSAEGAAPQTFAADEAMPM 161 Query: 124 PRKSAYQVVLS---SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSI 179 A + LS + P ++ AE + S + +I Sbjct: 162 AAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTFSIDVDTASYAI 221 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + + +N F + + +P + + Sbjct: 222 LRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSVTRTPWNPETRLVHVA 281 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 L + P L+ +D+S K L++ + ++ ++ D V Sbjct: 282 LQGRMPAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVA------I 335 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + + + R+ + + + GSTA + + AY T Sbjct: 336 VTYAGSAGEVLAPT---AANQRSTILSALDRLDAGGSTAGEEGLALAYRTASEMAGAGEV 392 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKAR 416 +VL TDG+ + + + G+ + + F + Sbjct: 393 TR---------VVLATDGDFNLGISDPEDLARLVAHERDTGVYLSVLGFGRGNLDDATMQ 443 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 N +S +E K+ D++ + Sbjct: 444 ALAQN--GNGQAAYIDSLNEAQKVLVDQLSGAL 474 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 61/204 (29%), Gaps = 46/204 (22%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 R + +++ +ND D + + T Sbjct: 586 NRYWAGYDNSYYDDWEYISEHYAGNDKNYNDYATIDLNLTEDFSDYN-----TEVKALVP 640 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------- 382 G T + + + + + D +A K +V+L+DG+ Sbjct: 641 DGGTPMRKGLYYSIKHLRDNGRD---------DAVKAVVVLSDGDYNYYGDPLARGSGGT 691 Query: 383 NEEGIAICNK-----------------AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-- 423 + + K AK I+I +IA++ + + + L A Sbjct: 692 KWDWSDMQEKYYTFSDLNSSEQDMRIFAKDNDIKIFSIAYADGIS--SEGKAVLQALAEG 749 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + ++ A S +L +I+ D G Sbjct: 750 TGGKYYYAPSGEDLEEIYEDIAGE 773 >gi|119606782|gb|EAW86376.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] gi|119606783|gb|EAW86377.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] Length = 947 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/256 (11%), Positives = 70/256 (27%), Gaps = 16/256 (6%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + V + D K Y ++ D + Sbjct: 260 ICPSCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSG 319 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + ++ + + FN + + + K K + Sbjct: 320 SMWGVKMKQTVEAMKTILDDL---RAEDHFSVIDFNQNIRTWRNDLISATKTQVADAKRY 376 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEE 385 G T IN+A+ A + +N + + I+L++DG+ T + Sbjct: 377 IEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGKCELK 432 Query: 386 GIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I K I + ++ + R N + T K F Sbjct: 433 LSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFY 492 Query: 443 DRIGNEIFERVIRITK 458 + ++ ++R + Sbjct: 493 N----QVSTPLLRNVQ 504 >gi|164688691|ref|ZP_02212719.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] gi|164602167|gb|EDQ95632.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] Length = 1508 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 43/389 (11%), Positives = 108/389 (27%), Gaps = 35/389 (8%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + I + + ++ + +N + + S + + + Sbjct: 244 YNGSNIGTDNQIKVNGDAEITAYYKKADKVNFTVKYYVNGIEQSDSFTKSVKKGIQTVSI 303 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 +I + + + + ++ + D + P Sbjct: 304 PEKYNQYTFNKMIINDVEQEKKTTFDVKSDTTIAYYY-TDLNAPDAVEPDGDLNGPAQPE 362 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS-S 263 S ++ + V+ +D S S + VD + Sbjct: 363 YPNQGAVKTSKTASDTDFPKTGLTRVELGVTGVPLKQGTDVVLVIDVSGSMDWDVDGKQT 422 Query: 264 LRHVIKKKHLVRDALASVIRSI--KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + K+ + +D+ + + D R+ F+ ++ + + Sbjct: 423 TDNTKKRITIAKDSAKQFVNQLFANNEDGSKSNNRVSVVIFSSSGYTNGILCSLKNVDNK 482 Query: 322 TIVKTFAIDE--NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 V N G T ++AM A + + + K ++ ++DG Sbjct: 483 QTVIDAIDGISNNPTGGTDYDNAMTMAEQVLETVKD---------TTRNKAVLFMSDGAP 533 Query: 380 TQDNEEGIAI------------CNKAKSQ-GIRIMTIAFSVNKTQ-----QEKARYFLSN 421 ++ K+ G I T++F + +Q +++ R L + Sbjct: 534 ENGYNGKTGYDIYPDAFKAHEKSSEIKNNYGATIYTVSFGLKGSQYKELTEDRCRQILRD 593 Query: 422 -CASPNS-FFEANSTHELNKIFRDRIGNE 448 AS + + ANS +L F + Sbjct: 594 YMASNENCYKNANSKEDLENAFTNIATAI 622 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 58/203 (28%), Gaps = 46/203 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +I ++ R+G + + + + Sbjct: 110 NRIEKSKQLVTQIINNLGS-------DRIGIIAYAGSAYPQLPITTDYSSA-KLFLSQMN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI +A++ A + + + +++DGE+ Sbjct: 162 TDMLSSQGTAIGEAIELAKTYYNDEEQ-----------TNRVLFIISDGEDHVGES--SN 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK-----------------------ARYFLSNCASP 425 I +A +GIRI TI ++ L AS Sbjct: 209 IAEQANDEGIRIFTIGVGKSEGGPIPLKRNGIVQSYKKDQNGETVITRLDDTTLKEIASG 268 Query: 426 --NSFFEANSTHELNKIFRDRIG 446 + + ++T + + ++ + Sbjct: 269 ANGEYIDGSNTATVVETVQNLLN 291 >gi|320450208|ref|YP_004202304.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150377|gb|ADW21755.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 414 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 17/168 (10%), Positives = 50/168 (29%), Gaps = 18/168 (10%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 ++N+ R+ ++ +V + + + GST ++ Sbjct: 81 AVENLRPHDRVAVVIYDHQVEVVVPSTL---AENKEEILRRLRPVRPRGSTNLHAGWLEG 137 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTI 403 + + + + +++L+DG I + + +G+ T+ Sbjct: 138 STQVAAHLDAKRLNR---------VIVLSDGLANTGETNPNVIAEQVRGLSQRGVSTSTL 188 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 ++ + A +++ S +L IF + Sbjct: 189 GVGLDYNEDLMMAMAE---AGQGNYYFIESPDDLPGIFAQELSGLATT 233 >gi|254296485|ref|ZP_04963941.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157806387|gb|EDO83557.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 418 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 96/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +++ + T + ++D V +F+ Sbjct: 255 GDKAATTNAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 48/381 (12%), Positives = 82/381 (21%), Gaps = 28/381 (7%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + E + D V D + + V Sbjct: 328 TELELATTEATVTVVARDESSGEEVSKTITLDAPGFADEVYDIELTDPESGAEISVTGEG 387 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + S E + D S + Sbjct: 388 IVKSDFVVDPVPAEENRSFYAGPFIHIRNFSDFESATVEMPLDD----DVDPSDGNLSVY 443 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 Q + V + S + V P Sbjct: 444 KWDQHDEKPWHAVETDVHVENGTAVATVDSFSYFSVFWVDNWNDAITDTVNLAEHPEYVA 503 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV-----NDTVRMGATFFNDRVISDP 310 S + + I+ K + R F Sbjct: 504 NETEGSIEPIDLAFVIDESGSMGGARIQDAKASAKRFVGGLYEDDRAALVSFAGGATLGQ 563 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S + + + N G T +Q A D + S+ E + Sbjct: 564 SLTTDHGAVNAS-----IDQLNAGGGTNTGAGLQKAVDELTSNGEGDTQE---------- 608 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 I+LL DG + + I A I I TI QE A+ F++ Sbjct: 609 IILLADGGTGLG-PDPVTIAQTADEHRITINTIGMGTGIDAQELTSI---ADATGGEFYQ 664 Query: 431 ANSTHELNKIFRDRIGNEIFE 451 + + EL ++F N I Sbjct: 665 VSDSSELPEVFDRVEQNRISL 685 >gi|213964310|ref|ZP_03392536.1| BatB protein [Capnocytophaga sputigena Capno] gi|213953052|gb|EEB64408.1| BatB protein [Capnocytophaga sputigena Capno] Length = 345 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 49/158 (31%), Gaps = 19/158 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + ++ D TAI +A++ A + Sbjct: 129 DRVGIVAYAASAYPQLALTTDHSAA-KMFLQGMNTDMLSSQGTAIQEAIRMASNYFDDKT 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + +L+DGE+ I ++A+ +G+ I TI K Sbjct: 188 P-----------TARLLFILSDGEDH--EMGATEIASEAQEKGVHIYTIGIGTEKGAPIP 234 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + ++ ++ + + +I Sbjct: 235 IKE-----GGGQTYKRDSNGEVVITKLNSELLQQIASN 267 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 54/201 (26%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 +S + ++A + + +G F + Sbjct: 112 SMASNDVPPNRLAAAKEAGKQFADQLTPA------INLGLVEFAANATLLVPPT-----T 160 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R VK+ TA + + TA I + V + IVL +DG Sbjct: 161 NRGAVKSGIDSLQPAPKTATGEGIFTALQAI--ATVGSVMGGGEGPPPAR-IVLESDGAE 217 Query: 380 TQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA-- 423 + AK QG++I TI+F + L Sbjct: 218 NVPLDPNAPQGAFTAARAAKGQGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEI 277 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + F A+S L ++ Sbjct: 278 TDGEAFHADSLDSLKNVYTTL 298 >gi|207857733|ref|YP_002244384.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206709536|emb|CAR33881.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 596 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 45/399 (11%), Positives = 105/399 (26%), Gaps = 35/399 (8%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + + + + + + + + TA Sbjct: 32 AELKQEQAIQKENSAQAGDDTVQKRQAEAAQQAAKKAAEYKANTEAKALADAKAASLATA 91 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVS 178 + N + R I+ ++A + S + + Sbjct: 92 EAPQYEMRTRAAASKAFAAQGGNVMGTA-RYEHYDENPIKQVSQAPLATFSLDVDTGSYA 150 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN----------GKVGIRDEK 228 + + + LN F P K +++ K + Sbjct: 151 NVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSP 210 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + P + +D+S ++ L++ AL ++ ++ Sbjct: 211 WNAQRTLLKVDVQARDMQVRDLPPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQ 270 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 DN+ G T T +K + + GST ++ AY+ Sbjct: 271 DNITIVTYAGGTHVALAS---------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE 321 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAF 405 + + I+L TDG+ + I K + +GI + T+ Sbjct: 322 --QAEKGFIKGGVNR-------ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGV 372 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++ +S E K+ +D Sbjct: 373 GDDNFNEAMMVRIAD--VGNGNYSYIDSLSEAQKVLKDE 409 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 55/185 (29%), Gaps = 37/185 (20%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISS 353 R+G F+ + + + + + T + A++ Sbjct: 128 DRLGLVAFSGVAFTQCPLTLDYQAIQMFLDQLTVDLLPLRFQGTDLGAAIEMGMTAFDPK 187 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 K I+L+TDGE+ E G+ KA +GIRI + Sbjct: 188 -----------SSTDKVILLITDGEDN--EEAGLKAAEKASDEGIRIFVLGIGDPAGGPV 234 Query: 413 -------------------EKARYFLSNCA--SPNSFFEANSTH-ELNKIFRDRIGNEIF 450 + L A + + + + +L++++ + I + Sbjct: 235 PSLDGSGFEKDAGGKIILSKPDESTLQAIANETGGDYIRSEAGDFDLDQLYFNGIKKKTE 294 Query: 451 ERVIR 455 +++ Sbjct: 295 AEILK 299 >gi|115438797|ref|NP_001043678.1| Os01g0640200 [Oryza sativa Japonica Group] gi|20805117|dbj|BAB92788.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113533209|dbj|BAF05592.1| Os01g0640200 [Oryza sativa Japonica Group] gi|125527021|gb|EAY75135.1| hypothetical protein OsI_03030 [Oryza sativa Indica Group] gi|125571342|gb|EAZ12857.1| hypothetical protein OsJ_02777 [Oryza sativa Japonica Group] Length = 589 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 58/198 (29%), Gaps = 37/198 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ AL VIR + +D R+ F + + +K Sbjct: 83 DRIDKVKTALQFVIRKLSDLD------RLCIVTFCTNATRLCPLRFVTAAA-QAELKALV 135 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGI 387 G T + ++T + + + ++L++DG +N + + Sbjct: 136 DGLKAYGDTNMKGGLETGMSVVDGRSLAAGRAVS--------VMLMSDGYQNHGGDARDV 187 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRD- 443 + + + + T +F + L A +F + L F Sbjct: 188 HL------KNVPVYTFSFGASHDSN-----LLEAIARKSLGGTFNYVADSANLTGPFSQL 236 Query: 444 ------RIGNEIFERVIR 455 I ++ V R Sbjct: 237 LGGLLTIIAQDLELTVTR 254 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 60/172 (34%), Gaps = 18/172 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ I +D + R+G ++ +V + S + I T Sbjct: 57 MMDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRADMERAINSIIPL--AQGTMTG 114 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + ++ E K + +++TDG + + + +A++ GI I Sbjct: 115 LAIQYAMNVAFTTQEGARPLHKRIP---RIAIVVTDG---RPQDRVTEVATQARNAGIEI 168 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + Q L ASP F S EL + F + +++ Sbjct: 169 YAVGI------QRADMNSLRAMASPPLEEHVFLVESF-ELIQQFAKQFQDKL 213 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 51/156 (32%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ ++ + R+G ++ RV ++ + I K Sbjct: 361 FELVKQFVNRIVDLLEVSPHGT---RVGLVQYSSRVRTEFPL--NKYHSADEIKKAVMDV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S E N + ++ TDG +QD+ Sbjct: 416 EYMEKGTMTGLALKHMVEHSFSELEGARPLSYNIP---RIGLVFTDGR-SQDDI--SEWA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK GI + + + L AS Sbjct: 470 RRAKESGIVMFAVGVG------KAVEEELRAIASEP 499 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 44/166 (26%), Gaps = 20/166 (12%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D ++ F+ G R +K E G T I + Y Sbjct: 241 DRLDANDDFSLVTFSSGADVRI--DDGPVGPRRAAIKATIDGIREGGGTNIGQGLALGY- 297 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAF 405 + R A K ++LL+DG + A G++ T Sbjct: 298 -------AQAARPGIPEGAVKVVLLLSDGRANAGITSSERLSRLALDAFQGGVQTSTFGL 350 Query: 406 SVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEI 449 + +S AS + ++ ++ + + Sbjct: 351 GADYDGA-----LMSAIASDGAGGYYYLRDPDQIAPALATELDRRL 391 >gi|22758317|gb|AAN05521.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432564|gb|AAP54179.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 606 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 58/198 (29%), Gaps = 31/198 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K LV+ A+ VI ++ D R+ F+ + ++ + K Sbjct: 158 RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEASRRTRL-LRMSEVGKATAKRAV 210 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T I D ++ A + ++LL+DG+++ Sbjct: 211 ESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSS---------VILLSDGKDSYVVPRRGN 261 Query: 389 ICNKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + Q I T F + S + +F + Sbjct: 262 GMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMNTIAES---TGGTFSFVENEAA 318 Query: 437 LNKIFRDRIGNEIFERVI 454 + F IG + V Sbjct: 319 IQDSFAQCIGGLLSVAVQ 336 >gi|328712316|ref|XP_001943179.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Acyrthosiphon pisum] Length = 830 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 56/177 (31%), Gaps = 7/177 (3%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVI-SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I +I N ++ N+ + F + ++ K F +T + Sbjct: 327 IWTINATQNTSNYWDDRGRNLNNFETMGENHFIFSANEQNIQYAKKFIQALEPDSTTNME 386 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS---QG 397 DA+ A + + ++ + K IV LTDGE + A+ Sbjct: 387 DALNKA---LSIAKLGKMRFKDSAKTPKPIIVFLTDGEMNEGITNPQALMKYVSDINVDN 443 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 I ++ F + + L+N +EA+ F I + + V Sbjct: 444 YPIYSLGFGKGADIEFLKKLSLNNTGFARVIYEASDASLQLHNFYKEISSPVLSNVT 500 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 37/307 (12%), Positives = 83/307 (27%), Gaps = 45/307 (14%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR--DEK 228 ID+S + + S LN + T + + Sbjct: 1284 SVVGSAYQCNCPIDYSGIHCEIWQCSTMICLNNGTATYNNTSDGCECECISEETPEGNIT 1343 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSL----------------SEEHFVDSSSLRHVIKKKH 272 + P + E ++ Sbjct: 1344 YVGDHCEYRFPGVCIPSPCGNGTCVELTQTLHDCACDFGFQGENCDHVCNTTISGRLDIQ 1403 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL---------IRTI 323 +V D S+ + K + + A ++ + + Sbjct: 1404 IVIDTSGSLTSAPNKDQVLMNFTNNLANMYDTINQVKIGLTSFSESSVLEMPLDFYNQLE 1463 Query: 324 VKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ + GS T I ++TA + + +S ++L+TDG + + Sbjct: 1464 LQDGVSNMTWQGSFTNITSGVETALNDMDTS-----------DAVDDVMILITDGFQSTN 1512 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + ++AK+ G+R++ + F + Y ++N + A + EL Sbjct: 1513 TTLMFQMIDQAKADGVRLIALGFFGDFAFYSPNLYLMTN-----EVYHAANYAELL-AID 1566 Query: 443 DRIGNEI 449 + I I Sbjct: 1567 NTIFETI 1573 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 72/213 (33%), Gaps = 21/213 (9%) Query: 265 RHVIKKKHLVRDAL-ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 R I K L+++ V+ + ++ MG F + +I + Sbjct: 119 RKTITKMDLLKEVTKGFVLGNKQEGLTGRPQDMMGLVTFARGAQVLAPLTLDHQAIIDQL 178 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK----KYIVLLTDGEN 379 K + E TAI A+ + I ++ A ++L+TDG Sbjct: 179 SKLQYTTDLEQDGTAIGYAIYKTANLIAATRHYAEELEGAGKPAYTIKNSIMILVTDGLQ 238 Query: 380 TQD---------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ--EKARYFLSNCA--SPN 426 + N E + AK G+++ I + R + + Sbjct: 239 APNPLDQGKEFRNVELLDAAVYAKKLGVKVYIINVEPRIASEEFSAHRLLMKKITELTGG 298 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVIRI-TK 458 F+ +++ L+ I+ + +++ + + + T+ Sbjct: 299 RFYMVDNSLNLSSIYSEI--DQLEKSPLPLETQ 329 >gi|253701795|ref|YP_003022984.1| von Willebrand factor A [Geobacter sp. M21] gi|251776645|gb|ACT19226.1| von Willebrand factor type A [Geobacter sp. M21] Length = 368 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 57/206 (27%), Gaps = 16/206 (7%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G ++ A +M + + + VD R + L A A A+ L +E Sbjct: 9 KGQILLVVAAVMFFGIFLAALAVDAGRAYGVKAKLHAAVDAASYEAAKALAHGEDEDDME 68 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 K + +Y NF F + + + + + A V ++ Sbjct: 69 EKA-----SEAARDYFRANFPAAY---FGAQCSGPELELSERKSGKKMRALTVSATATLP 120 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 ++F +G S + ++ A + V+ S + + + Sbjct: 121 ------NVFAGILGWDSIDLPAQSRAVRT--DADVVLVLESSDVLRESFPQVKQRVANFS 172 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + Sbjct: 173 DRFSQHYDRMSLVTFAAGADPVISIC 198 >gi|170767616|ref|ZP_02902069.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] gi|170123950|gb|EDS92881.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] Length = 586 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 34/352 (9%), Positives = 101/352 (28%), Gaps = 30/352 (8%) Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 L + + + + AY+ V ++ + N + + + Sbjct: 70 LAETTQNETQQYTDQQALQGQLQAAPAYESVAKAKATRISNLGTARYQQFDDNPVKQVAQ 129 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF------GQPADRTVKSY 215 T S +++ ++ + + +N F Sbjct: 130 NPLATFSLDVDTGSYANVRRFLNQ-GQLPPPDAVRVEEMVNYFPSDWVINDKQSIPASKP 188 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + + + + P+ + +D+S + ++ L++ Sbjct: 189 IPFAMRYELAPAPWNEQRTLLKVDILAQDLKSEALPASNLVFLIDTSGSMYSDERLPLIQ 248 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +L +++ +++ DN++ + + G H + + N G Sbjct: 249 SSLKLLVKELREQDNIS------IVTYAGDSRIALPSTSGNH---KDEINAAIDSLNARG 299 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNK 392 ST ++ AY ++ + I+L TDG+ ++ + + K Sbjct: 300 STNGGAGLEMAYQ--QAAKGFIKGGVNR-------ILLATDGDFNVGIDDPKSIESMVKK 350 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + G+ + T+ + + ++ ++ E K+ Sbjct: 351 QRESGVTLSTLGVGRDNYNEAMMVRIAD--VGNGNYSYIDTLSEAQKVLNSE 400 >gi|168817956|ref|ZP_02829956.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205345018|gb|EDZ31782.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320086747|emb|CBY96519.1| Inter-alpha-trypsin inhibitor heavy chain H3 Inter-alpha-inhibitor heavy chain 3; ITI heavy chain H3; ITI-HC3; Flags: Precursor [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 604 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 96/337 (28%), Gaps = 25/337 (7%) Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 ++ + V+ ++RY+ + + ++ + + R Sbjct: 103 QYEMRTRASASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRR 162 Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 + + + + + + + +P + + + + Sbjct: 163 FLNQGQLPLPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWN 220 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 + P + +D+S ++ L++ AL ++ ++ DN Sbjct: 221 AQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDN 280 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + G T T +K + + GST ++ AY+ Sbjct: 281 ITIVTYAGGTHVALAS---------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE-- 329 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSV 407 + + I+L TDG+ + I K + +GI + T+ Sbjct: 330 QAEKGFIKGGVNR-------ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGD 382 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++ +S E K+ +D Sbjct: 383 DNFNEAMMVRIAD--VGNGNYSYIDSLSEAQKVLKDE 417 >gi|153950207|ref|YP_001400285.1| von Willebrand factor type A domain-containing protein [Yersinia pseudotuberculosis IP 31758] gi|152961702|gb|ABS49163.1| von Willebrand factor type A domain protein [Yersinia pseudotuberculosis IP 31758] Length = 460 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 70/263 (26%), Gaps = 28/263 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + + ++ S SL + Sbjct: 25 TLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPI 84 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S+ + R+A + + D ++ +++ + Sbjct: 85 NLALVIDRSTSMSGE-RIEKAREAAILAVNMLNTTDTLS------VVAYDNHAEVIIPAT 137 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + G TA+ + + N E I+L Sbjct: 138 KVTD--KPALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIIL 186 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++DG+ + A +GI I TI + ++ A S + Sbjct: 187 ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNH 241 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ +L K F + + Sbjct: 242 TFVANSADLEKAFTKEFQDVMSV 264 >gi|319952790|ref|YP_004164057.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421450|gb|ADV48559.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 348 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 54/178 (30%), Gaps = 38/178 (21%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + +++ D TAIN+A++ A Sbjct: 129 DRIGIIAYAAQAYPQLPITTDYSAA-KMFLQSMNTDMLSSQGTAINEAIELASTYYDDET 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + +++DGE+ + KA +GIRI TI K Sbjct: 188 Q-----------TNRVLFIISDGEDHAEGT-TEDAVEKATEEGIRIFTIGVGKEKGAPIP 235 Query: 415 ARY-----------------------FLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + L A + ++T E + ++++ Sbjct: 236 IKRNGIVESLKKDMNGEVVITKLNEVILKEIAAEGNGEYINGDNTSEAVEYIKEQLNQ 293 >gi|158334872|ref|YP_001516044.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305113|gb|ABW26730.1| von Willebrand factor type A domain protein [Acaryochloris marina MBIC11017] Length = 419 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 51/187 (27%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A S+I + D R+ F+ +K Sbjct: 63 LTTVKEAAQSLIDRLNPGD------RIAVVAFDHHAKVLVP---NQLVEDPEQIKALIQR 113 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI+D M+ + + + + + LLTDGEN DN+ Sbjct: 114 LEPKGGTAIDDGMKLGIEELAVGKQGTISQA----------FLLTDGENEHGDNQRCQQF 163 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 A I + T+ F + L A S +F Sbjct: 164 AELAAGYNITLNTLGFGSHWN-----EDVLEGIADSGGGSLSFIEKPENAVDVFNSLFTR 218 Query: 448 EIFERVI 454 + Sbjct: 219 IETVSLT 225 >gi|108808190|ref|YP_652106.1| hypothetical protein YPA_2196 [Yersinia pestis Antiqua] gi|108811539|ref|YP_647306.1| hypothetical protein YPN_1376 [Yersinia pestis Nepal516] gi|145599390|ref|YP_001163466.1| hypothetical protein YPDSF_2114 [Yersinia pestis Pestoides F] gi|165926883|ref|ZP_02222715.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165935923|ref|ZP_02224493.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011266|ref|ZP_02232164.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166212622|ref|ZP_02238657.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398885|ref|ZP_02304409.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422701|ref|ZP_02314454.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424397|ref|ZP_02316150.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467552|ref|ZP_02332256.1| hypothetical protein YpesF_06584 [Yersinia pestis FV-1] gi|170023658|ref|YP_001720163.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896129|ref|YP_001873241.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|270489996|ref|ZP_06207070.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294504758|ref|YP_003568820.1| membrane protein [Yersinia pestis Z176003] gi|108775187|gb|ABG17706.1| membrane protein [Yersinia pestis Nepal516] gi|108780103|gb|ABG14161.1| putative membrane protein [Yersinia pestis Antiqua] gi|145211086|gb|ABP40493.1| membrane protein [Yersinia pestis Pestoides F] gi|165916068|gb|EDR34675.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921234|gb|EDR38458.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989944|gb|EDR42245.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206553|gb|EDR51033.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958408|gb|EDR55429.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051389|gb|EDR62797.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167056279|gb|EDR66048.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169750192|gb|ACA67710.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699155|gb|ACC89784.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|262362820|gb|ACY59541.1| membrane protein [Yersinia pestis D106004] gi|262366744|gb|ACY63301.1| membrane protein [Yersinia pestis D182038] gi|270338500|gb|EFA49277.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294355217|gb|ADE65558.1| membrane protein [Yersinia pestis Z176003] Length = 472 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 70/263 (26%), Gaps = 28/263 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + + ++ S SL + Sbjct: 37 TLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPI 96 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S+ + R+A + + D ++ +++ + Sbjct: 97 NLALVIDRSTSMSGE-RIEKAREAAILAVNMLNITDTLS------VVAYDNHAEVIIPAT 149 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + G TA+ + + N E I+L Sbjct: 150 KVTD--KPALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIIL 198 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++DG+ + A +GI I TI + ++ A S + Sbjct: 199 ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNH 253 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ +L K F + + Sbjct: 254 TFVANSADLEKAFTKEFQDVMSV 276 >gi|219850594|ref|YP_002465027.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544853|gb|ACL26591.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 958 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 64/463 (13%), Positives = 120/463 (25%), Gaps = 76/463 (16%) Query: 38 GMLVDVVRWSYYE------HALKQA-AQTAIITASVPLIQS--LEEVSSRAKNSFTFPKQ 88 G +DV R + L + ASV + + + Q Sbjct: 169 GRAIDVARLAASRGIPIDIVDLALVETDAEALVASVEAPNGVRDGQEALIVATVESTVAQ 228 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL-- 146 + LI + ++ V Y+V + + D + Sbjct: 229 RATVRLIDDVGVVAERELDLGPGATRVEFVVPIKGSGFQRYRVQVEAAQDGRVQNNEAAA 288 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 +R G L+ + A+ + + +I + S L Sbjct: 289 LIRVQGPPRVLLVARNAADARPLATALTAADIVAEIIAPEAAPRSLADLSAYDALVLVNT 348 Query: 207 PADR-------TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL---------- 249 PA + Y G+ + + V Sbjct: 349 PARALPVGLMQAIPGYVRDLGRGLLMIGGEESFGVGGYGRTAVEEALPVYMDVRNRELRP 408 Query: 250 --------DPSLSEEHFVDSSSLRH------VIKKKHLVRDALASVIRSIKKIDNVNDTV 295 D S S + ++ R +K + +DA+A + D V Sbjct: 409 DLAIVFVIDKSGSMDACHCANPDRGGPITSSSERKIDIAKDAVAQATALLSPQDTVG--- 465 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 F+ V G T I + A + + + Sbjct: 466 ---VVTFDGAAF---PTFVATRGATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQVDA 519 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K+++LLTDG + ++ I + + QGI + +A Sbjct: 520 RI-----------KHMILLTDGWGSGGDQL--DIAARLREQGITLTVVAAGSGS------ 560 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRD----RIGNEIFER 452 +L A ++ A ++ +IF IGN I E+ Sbjct: 561 ATYLQQLAAEGGGRYYPAADMADVPQIFVQETITAIGNYIVEQ 603 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D +D R+ + V + K + + T Sbjct: 92 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLST--GTMTG 149 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++TA + + + N K K +++TDG + A++ GI I Sbjct: 150 MAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDK---VEEVSAAARASGIEI 203 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + L AS + F + +L FR+ + E + Sbjct: 204 YAVGV------DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEEARADLT 257 Query: 455 RI 456 Sbjct: 258 EN 259 >gi|125532270|gb|EAY78835.1| hypothetical protein OsI_33940 [Oryza sativa Indica Group] Length = 606 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 58/198 (29%), Gaps = 31/198 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K LV+ A+ VI ++ D R+ F+ + ++ + K Sbjct: 158 RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEASRRTRL-LRMSEVGKATAKRAV 210 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +T I D ++ A + ++LL+DG+++ Sbjct: 211 ESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSS---------VILLSDGKDSYVVPRRGN 261 Query: 389 ICNKA------------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + + I T F + S + +F + Sbjct: 262 GMSYMDLVPPSFASSGGRGRLAPIHTFGFGADHDAAAMNTIAES---TGGTFSFVENEAA 318 Query: 437 LNKIFRDRIGNEIFERVI 454 + F IG + V Sbjct: 319 IQDSFAQCIGGLLSVAVQ 336 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 59.6 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 59/192 (30%), Gaps = 33/192 (17%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ L++ + K R G F++ ++ ++ ++ ++ K Sbjct: 43 APKNRLSAAVSVAKDFVKRRPNDRFGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVN--E 100 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 E +TAI + A + KN+ K I+L+TDG + + + Sbjct: 101 EASATAIGMGLAKAVARL-----------KNSTAKSKVIILITDGVSNTGEIDPLTAAGM 149 Query: 393 AKSQGIRIMTIAFS-----------------VNKTQQEKARYFLSNCA---SPNSFFEAN 432 AK GI++ I T + L+ A A Sbjct: 150 AKELGIKVYPIGVGSKGLVPFPYSDPIFGTRYINTYIDLDMETLNKIAETTGTGKAALAT 209 Query: 433 STHELNKIFRDR 444 L I + Sbjct: 210 DAKGLADIMNEI 221 >gi|312139646|ref|YP_004006982.1| integral membrane protein [Rhodococcus equi 103S] gi|311888985|emb|CBH48298.1| putative integral membrane protein [Rhodococcus equi 103S] Length = 326 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 65/203 (32%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S + + +G F S + + K Sbjct: 109 SRLAAAQEAAKSFADGLTPG------INLGLVAFAGTASVLVSPTPNRDET-----KAAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 + TA +A+ T+ +I + + IVLL+DG+ T Sbjct: 158 DNLTLSERTATGEAIFTSLQSIDT---LAAVLGGSEQAPPARIVLLSDGKQTVPESPDDP 214 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 +AK +G+ I TI+F E L A S +FF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTAS 274 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL ++ D + +I R Sbjct: 275 SLEELRDVY-DTLEEQIGFENAR 296 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 27/191 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++DA+ VI ++ D + F+D V + ++ + + Sbjct: 58 KLAALKDAVKRVIETLTPQD------IVAIVLFDDTVQTLVPATF---ATDKATLIAQVD 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGI 387 E G TA++ M + N + + ++LLTDG+ D + Sbjct: 109 AIEEAGGTAMSGGMAAGIVELR-----------KNHDPGRVGAMLLLTDGQTWGDEDRCR 157 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI-- 445 A+ + G+RI + + A+ ++ F+ + Sbjct: 158 ALAQELARDGVRITALGLGAEWNEALLDDIAE---ATGGISDYIADPAQITTFFQHAVRT 214 Query: 446 GNEIFERVIRI 456 R R+ Sbjct: 215 AQGTVARDARL 225 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 24/332 (7%), Positives = 79/332 (23%), Gaps = 9/332 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F ++ K+ G ++ + M V++GV + +D ++ L+ A TA + + Sbjct: 4 FRRSGRR--KAQQGLTLVVMTVSMAVIVGVAALSIDANHLMVSKNRLQNALDTAALAGAT 61 Query: 67 PLIQSLEEVSSRAKNSFTF--PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ EE ++ + + ++ + + + + Sbjct: 62 VANRTYEEDDAKEAIVEAYNKVTSAAGNDELVLAASDDGTSLKSLTIEYSDNANSGFSSN 121 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S+ + + +G + + + + + D Sbjct: 122 FPSSADYIYVRLQVSNVELSEYLAGLLGYSKSINSSTVAGPVFADRTSNILPIGMC-EGD 180 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + G + N + S + ++ Sbjct: 181 LAGDPAYPAGYEIGTTYEIKSGSQSDDPDGLGAGNYHLLDVGSGKSAVSDALVGKNTVVV 240 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + E + + + + + + N Sbjct: 241 TVDES---IDSETGNAVGATDSIDSRFEGKTLDGVYYPPDLITTEPSPNITTDNLAEHNA 297 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 +D ++ + + G+ Sbjct: 298 S-TADDKLTYNKYLANTKACLDDKDCLSGGGA 328 >gi|325676908|ref|ZP_08156581.1| von Willebrand factor [Rhodococcus equi ATCC 33707] gi|325552456|gb|EGD22145.1| von Willebrand factor [Rhodococcus equi ATCC 33707] Length = 326 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 65/203 (32%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S + + +G F S + + K Sbjct: 109 SRLAAAQEAAKSFADGLTPG------INLGLVAFAGTASVLVSPTPNRDET-----KAAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 + TA +A+ T+ +I + + IVLL+DG+ T Sbjct: 158 DNLTLSERTATGEAIFTSLQSIDT---LAAVLGGSEQAPPARIVLLSDGKQTVPESPDDP 214 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 +AK +G+ I TI+F E L A S +FF A+ Sbjct: 215 RGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTAS 274 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL ++ D + +I R Sbjct: 275 SLEELRDVY-DTLEEQIGFENAR 296 >gi|86131263|ref|ZP_01049862.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] gi|85818674|gb|EAQ39834.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] Length = 344 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 55/186 (29%), Gaps = 29/186 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +I S+ R+G + + + + Sbjct: 110 NRIEKSKQLVTQIINSLGS-------DRIGIIAYAGSAYPQLPITTDYSSA-KLFLSQMN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI +A++ A + + + +++DGE+ + Sbjct: 162 TDMLSSQGTAIGEAIELAKTYYNDEEQ-----------TNRVLFIISDGEDHVGEA--SS 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE----LNKIFRDR 444 + +A +GIRI TI + L S+ + + L++ Sbjct: 209 LAEQANKEGIRIFTIGVGKTEGGPIP----LKRNGIVQSYKKDQNGETVITRLDEATLKA 264 Query: 445 IGNEIF 450 I + Sbjct: 265 IAEQAN 270 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 58/172 (33%), Gaps = 17/172 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D +D V++G ++D + L ++ MG T+ A++ Sbjct: 266 DVRSDQVQVGLVQYSDNIYPAFPLKQ--SSLKSAVLDRIRNLPYSMGGTSTGSALEFIRA 323 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++ + + +VL+TDGE++ E + ++ K G+ + + ++ Sbjct: 324 NSLTEMSGSRAKDG----VPQIVVLVTDGESSD---EVQDVADQLKRDGVFVFVVGINI- 375 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L AS F + + + + V R K Sbjct: 376 -----QDVQELQKIASEPFEEFLFTTENFSILQALSGTLLQALCSTVERQMK 422 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 54/162 (33%), Gaps = 21/162 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR G ++D++IS + + + + + E G T A+ Sbjct: 882 GPDRVRFGVVQYSDKIISQFFLTQ--YASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVF 939 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ + +Y++++TDG++T + G+ I I Sbjct: 940 QNTARV---------DVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGV----- 982 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 ++ L AS FF L I ++ I + Sbjct: 983 -RDANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDICSSE 1022 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 48/169 (28%), Gaps = 17/169 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + D +++G F+ + + + + + T A+ Sbjct: 1056 MVNQSNIGADEIQIGLLQFSSNPREEFRL--NRYSSKVDMCRAILSVQQMSDGTHTGKAL 1113 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +Y++++TDG +QDN + + I I I Sbjct: 1114 NFTL-------PFFDSSRGGRPRVHQYLIVITDGV-SQDNVAPP--AKALRDRNIIIFAI 1163 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L + F+ + L + ++ + + Sbjct: 1164 GVG-----NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 1207 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 58/172 (33%), Gaps = 17/172 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D +D V++G ++D + L ++ MG T+ A++ Sbjct: 266 DVRSDQVQVGLVQYSDNIYPAFPLKQ--SSLKSAVLDRIRNLPYSMGGTSTGSALEFIRA 323 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++ + + +VL+TDGE++ E + ++ K G+ + + ++ Sbjct: 324 NSLTEMSGSRAKDG----VPQIVVLVTDGESSD---EVQDVADQLKRDGVFVFVVGINI- 375 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L AS F + + + + V R K Sbjct: 376 -----QDVQELQKIASEPFEEFLFTTENFSILQALSGTLLQALCSTVERQMK 422 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 54/162 (33%), Gaps = 21/162 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR G ++D++IS + + + + + E G T A+ Sbjct: 882 GPDRVRFGVVQYSDKIISQFFLTQ--YASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVF 939 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ + +Y++++TDG++T + G+ I I Sbjct: 940 QNTARV---------DVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGV----- 982 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 ++ L AS FF L I ++ I + Sbjct: 983 -RDANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDICSSE 1022 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 48/169 (28%), Gaps = 17/169 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + D +++G F+ + + + + + T A+ Sbjct: 1056 MVNQSNIGADEIQIGLLQFSSNPREEFRL--NRYSSKVDMCRAILSVQQMSDGTHTGKAL 1113 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +Y++++TDG +QDN + + I I I Sbjct: 1114 NFTL-------PFFDSSRGGRPRVHQYLIVITDGV-SQDNVAPP--AKALRDRNIIIFAI 1163 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L + F+ + L + ++ + + Sbjct: 1164 GVG-----NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 1207 >gi|281342668|gb|EFB18252.1| hypothetical protein PANDA_020017 [Ailuropoda melanoleuca] Length = 885 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 60/185 (32%), Gaps = 13/185 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ + FN V + + K K + G T Sbjct: 265 QTVEAMKTILDDLRAEDQFSVIDFNHNVRTWRNDLVSATKTQIVDAKKYIEKIQPSGGTN 324 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQ 396 IN+A+ A + +N + E+ I+L++DG+ T N + I K Sbjct: 325 INEALLRAIFILNEANN----LGMLDPESVSLIILVSDGDPTVGNCELKLSKIQKNVKQN 380 Query: 397 ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I + ++ + R N + T K F + ++ + Sbjct: 381 IRDNIALFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYN----QVSTPL 436 Query: 454 IRITK 458 +R + Sbjct: 437 LRNVQ 441 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 T+ + IC A++QG+ I T+AF Q L +CAS S + +++ Sbjct: 4 TEADARLSDICAAARAQGVVIYTVAFEAPSGGQS----ALQDCASSPSHHFDVNGTDISS 59 Query: 440 IFRDRIGNEIFERVIRITK 458 F I ++I R +++T+ Sbjct: 60 AF-SAIASDI--RALKLTQ 75 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 76/261 (29%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +S+ + N + + S + ++ + ++ Sbjct: 237 SIMFMQSLDSVTEFCTAETHNTEAPNLQNKMCNSRSTWDIIMNSEDFQNASPMTGTDPPP 296 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + + A + I + ++ Sbjct: 297 HPTFSLLKSKQRVVCLVLDKSGSMSSEDRLFQMNQAAELFLIQIIEKGSLTG-----MVT 351 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + + + G T+I ++ + II SN++ Sbjct: 352 FESSATIQNYLTEITDH-NAYEKILANLPQAAGGGTSICSGLRAGFQAIIHSNQNTSGSE 410 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ I++C + K G I TIA K LS Sbjct: 411 ---------IVLLTDGEDD-----NISLCFEEVKKSGSVIHTIALG---PSAAKELEILS 453 Query: 421 NCASPNSFFEANSTHELNKIF 441 N + F+ + L F Sbjct: 454 NMTGGHRFYANKDINGLIDAF 474 >gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex] Length = 891 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 54/441 (12%), Positives = 112/441 (25%), Gaps = 73/441 (16%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + A A +V++ A+N+ F E + F N + R Sbjct: 129 QSAVDAGQSAA---------QVTANARNANQFTVSVNIEPEQKIFFNLTYEELLSRRKGI 179 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + V V ++ L +N +++ + ++ Sbjct: 180 YEQAIHVTPGSVVPKMSVRVNIFETLPINKITVPQLRGNDVEQDHNSTENEIAQIIRTNE 239 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 VS+ + + + + +GQ + + K + G Sbjct: 240 TTVVSVVYEPAEAEQIKMSKDGLQGQFVVQYDVDRSSIEKKGGEIHVVDG---------- 289 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK------ 287 +++ + +D+S + + A+ S++ ++K Sbjct: 290 ----YFVHFFVPADLPTLPKHVIFVLDTSGSMAG-TRIEQTKQAMNSILDQLRKDEDIFS 344 Query: 288 ----IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT----------------- 326 V + D + Sbjct: 345 VVEFSSGVTEWDLRKPYKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYDDILAYPV 404 Query: 327 ----------FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 F + ST INDA+ A +++ + I+ LTD Sbjct: 405 TEQSVKRAKEFVAAMDVTSSTNINDALLLALK--------NSQSVQSRVRLTPIIIFLTD 456 Query: 377 GENTQDNEEGIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 GE T + I + + I +AF Q + N +EA Sbjct: 457 GEPTASVTDTTEILKNVRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQNRGFARKIYEAA 516 Query: 433 STHELNKIFRDRIGNEIFERV 453 K F D + + + V Sbjct: 517 DATLQLKGFFDEVASPLLSNV 537 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 65/189 (34%), Gaps = 25/189 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS-DPSFSWGVHK 318 D+S + +++A+ + ++ V FN V + + Sbjct: 536 DTSGSMLQDNRIGALKEAVGVFLGTLPPGSKVA------VIEFNSFVNPLVFGPANEIFT 589 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 VK+ G T+ DA+ A + I N ++ ++ LTDGE Sbjct: 590 TRFDDVKSQVNRFRANGGTSYYDAVDRALELI------------ANQTGRRAVLALTDGE 637 Query: 379 NTQDN-EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 +T ++ KA++ G+ + T+ + L A + +F A Sbjct: 638 DTSSRLAGLDSVILKARNLGLPVHTLGVGREDEIEVGE---LQRLARETRGRYFPARDAT 694 Query: 436 ELNKIFRDR 444 +L IF + Sbjct: 695 KLRVIFAEL 703 >gi|240140254|ref|YP_002964732.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] gi|240010229|gb|ACS41455.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] Length = 441 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/218 (10%), Positives = 59/218 (27%), Gaps = 5/218 (2%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSLEEVSSRAK 80 +I AL ++G+ G +D R++ L+ A ++ L + S E + Sbjct: 1 MIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALKLVVSSSESIVGLTT 60 Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + + + + + V ++ + +R ++ Sbjct: 61 QTIQAEAKAGADAPVSIQVTVASDKTSVEARAEQVIKLTFGAFVGMASIPISARARASVV 120 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRS-YHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 L + + A+ + + VS ++ ++ Q Sbjct: 121 GRMRLCMLALDPAAAGAFNLERNAQVTAYDCALYSNSVSRSGMVGRDGALARAQTICSAG 180 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 S + +R+ P N Sbjct: 181 GFKDDRAN-FTPSPQTSCPVIEDPLRNRPAPPVGNCVN 217 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 50/148 (33%), Gaps = 21/148 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ ++ S+ R+G + + + ++ Sbjct: 110 NRLEKSKQLVSQILSSLGS-------DRVGIIAYAGGAYPQLPITTDFSAA-KMFLQALN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI+DA++ A + + + +++DGE+ + N Sbjct: 162 TDMISSQGTAISDAIELATTYYDDDQQ-----------TNRVLFIISDGEDHEGNV--ED 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 I +A +GIRI TI K + Sbjct: 209 IAEQAAEKGIRIFTIGVGTEKGGPIPIK 236 >gi|149921110|ref|ZP_01909568.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] gi|149817997|gb|EDM77456.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] Length = 532 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 65/183 (35%), Gaps = 29/183 (15%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT--FAIDENE 333 DA A +I ++ D V+ +N + + + + ++ + Sbjct: 157 DAAAGMISRLRDGDTVS------VVSYNTKAHTIVPVTTLDARNRDRVISDLRVGVASRP 210 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---C 390 G+T ++ ++ T+ ++LL+DGE + + I Sbjct: 211 SGNTCVSCGVEAGLQTLQGRRPGIDR-----------MLLLSDGEANRGVRDEPGIRRLA 259 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 +A+++G+ I +I V+ +S A + + + + L+ IF + + Sbjct: 260 REARNRGVSISSIGVDVDYN-----EVLMSAIAREANGRHYFSETGSNLDAIFDQELDSL 314 Query: 449 IFE 451 I Sbjct: 315 IQA 317 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/286 (15%), Positives = 81/286 (28%), Gaps = 24/286 (8%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 T S +++ ID + +N F R + Sbjct: 197 TADQPVSTFSIDVDTAAYANVRRFIDN-GRAPPRDAVRVEEMINYFDYGYTRPTSAARPF 255 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 S + L P L+ VD S + K L + ++ Sbjct: 256 AVTATTTASPWSEGRRIVHVGLQGYELPENQRRPLNLTFLVDVSGSMNSPDKLDLAKQSM 315 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I ++ D R+ ++ + + + + G KL ++ G TA Sbjct: 316 NLIIDRLRPQD------RVAVAYYAEGAGTTLAPTAGTQKLK---LRCAVASLRASGGTA 366 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKS 395 M AY + R K N I++ TDG+ D++ + Sbjct: 367 GATGMTNAY----DQAQASFGRNKVNR-----ILMFTDGDFNVGVTDDKRLEDYVADKRR 417 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 GI + F Q + + A + + E ++F Sbjct: 418 TGIYLSVYGFGRGNYQDARMQAIAQ--AGNGTAAYVDDLKEARRLF 461 >gi|118350692|ref|XP_001008625.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89290392|gb|EAR88380.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 648 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV++ L +I + +D R+ FN+ +F+ + + + Sbjct: 238 KIQLVKETLVKIINLMSSMD------RICIVCFNESGDRPLTFTRVTDE-NKQTLLNLIQ 290 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I++ + A I + I+LL+DG++T+ A Sbjct: 291 QIYAGGGTNISEGINHALKAIQNRKFKNNVTS---------ILLLSDGQDTKAYTRVKAY 341 Query: 390 CNK--AKSQ-GIRIMTIAFSVNKTQQEKARYF-LSNCASPNSFFEANSTHELNKIFRDRI 445 +K K I TI F + + L N +F + L+ F + Sbjct: 342 IDKYQIKDAFNIE--TIGFGEDHDPKLLRTLSDLRN----GTFNFMQDVNYLDTAFINIF 395 Query: 446 GNEIFE 451 I Sbjct: 396 AGMIST 401 >gi|58425974|gb|AAW75011.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 365 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 86/304 (28%), Gaps = 25/304 (8%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 I I A+ + S + S + S+ + +N F Sbjct: 63 HIDDNAIVQAAQQPISTFSIDVDTGSYSNVRRFLSAGSLPPVDAVRVEELINYFRYDHPA 122 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + + + + P + VD S K Sbjct: 123 PTND-KPFAVRTELATTPWNKDSLLLRVGIAGRDIATADLPPANLVFQVDVSGSMDAPDK 181 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L+R +L ++R ++K D R+ + + G + + Sbjct: 182 LPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVLPPTPG---DQQGRIVEAIDS 232 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GI 387 GSTA ++ AY + + R N I+L TDG+ + Sbjct: 233 LQSGGSTAGASGIELAYK----AAQQGYLRGGINR-----ILLATDGDFNVGVTDFDALK 283 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + ++ + G+ + T+ F A ++ ++ E K+ +G Sbjct: 284 GMVSEKRRSGVALSTLGFGTGNYNDNLMEQSAD--AGDGAYAYIDTPLEARKVLTHELGA 341 Query: 448 EIFE 451 + Sbjct: 342 TLAT 345 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D +D R+ + V + K + + T Sbjct: 92 LSEMVDSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLST--GTMTG 149 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++TA + + + N K K +++TDG + A++ GI I Sbjct: 150 MAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDK---VEEVSAAARASGIEI 203 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + L AS + F + +L FR+ + E + Sbjct: 204 YAVGV------DRAEVRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEEARADLT 257 Query: 455 RI 456 Sbjct: 258 EN 259 >gi|23016188|ref|ZP_00055947.1| hypothetical protein Magn03010637 [Magnetospirillum magnetotacticum MS-1] Length = 408 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 45/380 (11%), Positives = 92/380 (24%), Gaps = 12/380 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++L G I+ + M M+G + + W + L+ A A + + Sbjct: 3 LKTLIRRLCTGTEGAVAIMVGIGMTAMIGFLALGTETGLWYAAKRNLQSVADAAALGGAF 62 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK-------NFTDREVRDIVRDTA 119 L A + + + N EV T Sbjct: 63 ELGSGSNSSVISAAAIQDAGRNGFQATGGATIAVHTPPASGKYAGNPQMVEVSVSQPTTL 122 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + QV + + L + + A + S Sbjct: 123 LFSALFLKSLQVNARAVAKTGSVGDACILALDNVINDAAYFTGSATVTLHNCGIAANSSS 182 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQ----PADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + S S + A + ++ + S Sbjct: 183 SNAVVVSGSATVTTAFVQSVGGYDISGSGVLHAGTIITHAAATADPYAGLSAPSAASCAS 242 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 K+ + L PG L+ + V + +VI A A+V + +N+ Sbjct: 243 VPKTKDFTLSPGTYCGGLTFQGNVFLNPGVYVISGGTFKITAQATVFGTGVTFVLINNAK 302 Query: 296 RMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + + + W GV + N T I A+ + S Sbjct: 303 LDVTGGSAITITAPTAGIWAGVAFFSDRKNELQVNKLNGNSVTNITGAIYMPSQELDFSG 362 Query: 355 EDEVHRMKNNLEAKKYIVLL 374 + ++ + Sbjct: 363 GNGTDASGCTRLIADFVKFI 382 >gi|72679950|gb|AAI00088.1| LOC498793 protein [Rattus norvegicus] Length = 568 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 44/131 (33%), Gaps = 11/131 (8%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T IN+A+ A + ++ N ++ I+L++DG+ T + I Sbjct: 2 IQPSGGTNINEALLRAIFILNEASN----MGLLNPDSVSLIILVSDGDPTVGELKLSKIQ 57 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 K I + ++ + R N + T K F + Sbjct: 58 KNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNRDTSSQLKKFYN---- 113 Query: 448 EIFERVIRITK 458 ++ ++R + Sbjct: 114 QVSTPLLRNVQ 124 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 52/158 (32%), Gaps = 17/158 (10%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + D R+ FN R + + R +K + G T+I++ ++ + Sbjct: 71 DKLRDQDRLSIVVFNHRAEV---LLSNQNVVDRDHIKQQINRLSANGGTSIDEGLRLGIE 127 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + D + + LLTDGEN DN + A + + T+ F Sbjct: 128 ELAKGRRDTISQA----------FLLTDGENEHGDNNRCLKFAQLAADYNLTVNTLGFGN 177 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 N Q + + A S + F + Sbjct: 178 NWNQHILEKI---SDAGLGSLSHIEHPDQAVDKFNSLL 212 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/274 (15%), Positives = 83/274 (30%), Gaps = 26/274 (9%) Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + + I + +S+ + N + +Y S + D+ + + Sbjct: 227 DLNQNSKNSIMYMQSLPSVVEFCDKNTHNSEAPNMQNKMCNYKSTWEIIMESDDFRNSSV 286 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 V+ + + + +D S + + +R A + I +I Sbjct: 287 VNSLVPPFETTFELLQTQDRAVSLVLDVSGSMNTNNRITNLRTAAEVFLIQIIEI----- 341 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G F R + G T I ++ + I ++ Sbjct: 342 GSRVGIVTFESSAYEKSPLLQITSVATRQRLVQNLPT-TAGGGTKICAGIEKGLEIITNA 400 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQ 412 IVLLTDGE++ +++C K K G I TIA Sbjct: 401 I---------GTTYGSEIVLLTDGEDS-----TMSLCREKVKESGAIIHTIALG---PSA 443 Query: 413 EKARYFLSNCASPNSFF--EANSTHELNKIFRDR 444 K SN + + + +L + F + Sbjct: 444 AKELEEFSNITGGLQLYAVDVDVPSKLVEAFSEI 477 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 65/173 (37%), Gaps = 14/173 (8%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 D L +V ++ + R+G F D F+ + + + Sbjct: 120 DRLTAVKSVLETFAAKREGDRLGLILFGDSAYLQSPFTADHEAWLALLDQAQV--GMAGE 177 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 ST + DA+ T I + E N +K ++LTDG +T I A++ Sbjct: 178 STHLGDAVGLTIKTYIDNPE--------NQTVEKVAIILTDGNDTDSLVPPIDAAKVAQA 229 Query: 396 QGIRIMTIAFSVNKTQQEKARYF--LSNCA--SPNSFFEANSTHELNKIFRDR 444 GIR+ +A T ++A F + A + F A S +L+ +++ Sbjct: 230 YGIRLYIVAMGSPNTTGDQAIDFSTIETMATVTGGQAFLAMSQEDLDAVYQTI 282 >gi|301064778|ref|ZP_07205158.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441153|gb|EFK05538.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 625 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 45/170 (26%), Gaps = 27/170 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ G T I A+ Sbjct: 129 DRVGLLPFAGTAFLMCPLTLDYDAFRNSLEALDTN-IIPQGGTDIASAI----------- 176 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 E NN K +VL++DGE+ + + AK + + I T+ + Sbjct: 177 -YEAEAAFNNDANHKILVLVSDGEDLEGEAL--SAAQAAKERDLTIYTVGVGTPSGELIP 233 Query: 415 ARYFLSNCASPNSFFEANS------THELNKIFRDRIGNEIFERVIRITK 458 + A L++ +I R + + Sbjct: 234 LVQGGKDGA------FVKDEKGQPVKSRLDETMLQKIAEATGGRYEPLGQ 277 >gi|148555259|ref|YP_001262841.1| hypothetical protein Swit_2344 [Sphingomonas wittichii RW1] gi|148500449|gb|ABQ68703.1| hypothetical protein Swit_2344 [Sphingomonas wittichii RW1] Length = 625 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/332 (11%), Positives = 91/332 (27%), Gaps = 3/332 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L ++ G +I AL + V++G+G V++ R + A ++ A A + ++ + Sbjct: 16 LARATRGAVAVIGALSLTVLVGMGAFAVEISRGYAADTANQRIADMAALAGALAYNVNSN 75 Query: 74 EVSSRAKNSFTFPKQKI---EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 A Q + + ++ K V V + AY Sbjct: 76 PSEMTATAKAVVVAQGLPASAATVALVTDSATSKQLVQVSVTTSVPIALGRVFSSALAYD 135 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V + + + S + VS + + V + + Sbjct: 136 VTATGSATTTATTTTAPPCIAALSSTPTYGITLSGGVSITSPGCAVNTNAGVTVPWGTTI 195 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 ++ + G+ ++ G+ + + K + + G D Sbjct: 196 TAKQVNAGKGIDNPGKGITTSPTANDIVQNKASAATDWMKDDSTLKGLLCKVNQLSGYSD 255 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P ++ + ++ L + + + ++ TF Sbjct: 256 PDYADGNRSCTTPLVTPATQTSASTEDYILNYSPAANVAPYWNSGTKTYTFPAKSYDLRD 315 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 G + T MG T + Sbjct: 316 IVMSGGITAVFEGPLTLKARSVSMGGTGMTIG 347 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 55/171 (32%), Gaps = 2/171 (1%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + S G I+ AL++ V++G+ + VD+ R + L+ AA A + + Sbjct: 3 RLRSIKR--LHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQNAADAAALAGA 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 LI + + + + + + ++ + + Sbjct: 61 GSLIDTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQEQDVNVAIQPGYWNLITPSFT 120 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + V + ++ +++ L + I E + + Sbjct: 121 SNTGLVTHTGDGNIPAVQVTITLSHLKFFFAPILGIPEGTVQATAIAAVSP 171 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/187 (22%), Positives = 72/187 (38%), Gaps = 10/187 (5%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 SS +KK + +A V+ K R+G F D F+ + Sbjct: 120 SSDSGTVKKISRL-EATKEVLADFVKTRKG---DRLGLILFGDAAFVQTPFTADQSVWLE 175 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +T ST + DA+ A S+ED+ +N +K ++LTDG +T Sbjct: 176 LLNQTDVA--MAGQSTHLGDAIGLAIKVFEQSSEDKASAEENAKPREKVAIVLTDGNDTG 233 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 E I A ++G+RI IA +Q ++ A S F+A + EL Sbjct: 234 SYVEPIDAAKVAAAKGVRIHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDEL 293 Query: 438 NKIFRDR 444 + + + Sbjct: 294 EQAYDEI 300 >gi|297714304|ref|XP_002833596.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like, partial [Pongo abelii] Length = 384 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 57/183 (31%), Gaps = 11/183 (6%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V +D++ FN V + + K K + G T Sbjct: 39 QTVEAMKTILDDLRAEDHFSVIDFNQNVRTWRNDLISATKTQVADAKRYIEKIQPSGGTN 98 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-- 396 IN+A+ A + ++ + + I+L++DG+ T + I K Sbjct: 99 INEALLRAIFILNEASNLGLLDPNSVS----LIILVSDGDPTVGELKLSKIQKNVKENIQ 154 Query: 397 -GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + ++ + R N + T K F + ++ ++R Sbjct: 155 DNISLFSLGMGFDVDYDFLKRLSSENRGIAQRIYGNQDTSSQLKKFYN----QVSTPLLR 210 Query: 456 ITK 458 + Sbjct: 211 NVQ 213 >gi|218509026|ref|ZP_03506904.1| hypothetical protein RetlB5_16575 [Rhizobium etli Brasil 5] Length = 125 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 38/124 (30%), Gaps = 29/124 (23%) Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEG---------------IAICNKAKSQGIRI---MT 402 + A+K + ++DG + C K +G++I T Sbjct: 3 ARRTPSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYT 62 Query: 403 IAFSVNKTQ---------QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + Q + + CASP +FE + T + + I R Sbjct: 63 TYLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDGITDAMKALFLKVI--RA 120 Query: 454 IRIT 457 RIT Sbjct: 121 PRIT 124 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 52/171 (30%), Gaps = 20/171 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D +D R+ + V + K + + T Sbjct: 227 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLST--GTMTG 284 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++TA + + + N K K +++TDG + A++ GI I Sbjct: 285 MAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDK---VEEVSAAARASGIEI 338 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS + F + +L FR+ + Sbjct: 339 YAVGV------DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETL 383 >gi|327286711|ref|XP_003228073.1| PREDICTED: cartilage matrix protein-like [Anolis carolinensis] Length = 565 Score = 59.2 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 68/248 (27%), Gaps = 38/248 (15%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE-----EHFVDSSSLRHVIKK 270 + + + + Y +C + + +D S + Sbjct: 300 TGDHDCAQVCVSAGASYSCACREGFTLNSDGKTCNACSGGAPTDLVFLIDGSKSV-RPEN 358 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ S+ D ++ V + G +K + I Sbjct: 359 FELVKKFINQIVDSLDVSDRNAHVG---LVQYSSSVRQEFPL--GQYKDKKDIKAAVRRM 413 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+Q D+ + + + K ++ TDG + + Sbjct: 414 SYMEKGTMTGQALQYLVDSSFAISSGARPGV------PKVAIVFTDGRSQDYINDS---A 464 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR--I 445 KAK G ++ + L AS +F FR I Sbjct: 465 KKAKELGYKMFAVGVG------NAVEDELKEIASEPVAEHYFYTAD-------FRTINQI 511 Query: 446 GNEIFERV 453 G ++ ++ Sbjct: 512 GKKLQNKI 519 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 56/167 (33%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ +D + R+G + V S+ S K + T A+Q Sbjct: 139 IESLDVGPNATRVGVVNYASAVKSEFSLKTHRTKASLLQAVRRIEPLST--GTMTGLAIQ 196 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E R+ E K +++TDG + + + +A++ GI I I Sbjct: 197 FAINRAFSEGEGARVRV---PEINKVAIIVTDG---RPQDAVKDVAARARALGIEIFAIG 250 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + L ASP S +L K F++ Sbjct: 251 VGR------VDMHTLRLIASPPLEEHVDYVESYSVIEKLTKKFQEAF 291 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 59.2 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 62/202 (30%), Gaps = 28/202 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + + A + + + + +G F+ + + Sbjct: 311 DKSGSMNGFNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTAYIKSNLIQIISSK 365 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I IVLLTDGE+ Sbjct: 366 ERNNLLESLPT-TANGGTSICAGIKSAFQVIGEIYPQIDGSE---------IVLLTDGED 415 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--- 436 + K G I IA + Q + + + F A+ + Sbjct: 416 NTAKNCIGEV----KQSGAIIHLIALGPSADQAVIEMSTI----TGGNHFFASDEAQNNG 467 Query: 437 LNKIFRDRI--GNEIFERVIRI 456 L F ++ ++ +++ Sbjct: 468 LIDAFGALASGNTDLSQQPLQL 489 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 50/154 (32%), Gaps = 24/154 (15%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 V R+G FN + + V + + T + G+T I DA++ + Sbjct: 668 VKAGDRVGIVGFNTKAEIVVDITSDVEE-----IITKVMSLKPGGATDIGDAIRVGTELF 722 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA----ICNKAKSQGIRIMTIAFS 406 R + +++LLTDG T+ + A G+ I TI Sbjct: 723 ---------RRCGRPDRDWHMILLTDGVPTKGEPDPETKALSEATAASRMGVTISTIGIK 773 Query: 407 VNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 + E+ + + A S EL Sbjct: 774 L----PEEGIRLIEHIAGISGGRSHHITDPEELT 803 >gi|254182581|ref|ZP_04889175.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184213116|gb|EDU10159.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 418 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 95/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +++ + T + ++D V +F+ Sbjct: 255 GNKAATTNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLAFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|148680070|gb|EDL12017.1| mCG120740 [Mus musculus] Length = 752 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 56/194 (28%), Gaps = 28/194 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + L+ A + I + + +G FN + Sbjct: 314 DVSGSMASYDRLDLMNRAAKHFLSQIIE-----NRSWVGMVHFNHLANIKSELIQMNSNI 368 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + G T+I ++ A+ + + I+LL+DGE+ Sbjct: 369 ERNQLLQTLPT-SADGGTSICSGIKAAFQVFKNGGYETDGTE---------ILLLSDGED 418 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + + C ++ K G + IA + L + A+ + Sbjct: 419 STAKD-----CIDEVKDSGAIVHFIALGPSADLAVTNMSIL----TGGKHMYASDEAQNN 469 Query: 437 -LNKIFRDRIGNEI 449 L F + Sbjct: 470 GLIDAFVALASENV 483 >gi|238790016|ref|ZP_04633794.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] gi|238721829|gb|EEQ13491.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] Length = 448 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/249 (12%), Positives = 69/249 (27%), Gaps = 28/249 (11%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 A + + ++ S SL ++ +D S+ Sbjct: 27 AKAITPAVTVKSELSAPVMLANSEDKNYLKISLTGFNLDSKRRSPINLALVIDRSTSMTG 86 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + R+A + + D ++ +++ + ++ Sbjct: 87 -DRIEKAREAAILAVNMLDASDTLS------VVAYDNNAEVIIPAT--KVNNKPALIAKI 137 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-- 385 + MG TA+ + + + N E I+LL+DG+ Sbjct: 138 QQHIHPMGMTALFAGVSKGIGQVDKN---------LNPEQVNRIILLSDGQANTGPTSIS 188 Query: 386 -GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + A +GI I TI + ++ A S + ++ +L F Sbjct: 189 ELSDLARMAAKKGIAITTIGLGEDYN-----EDLMTAIAGYSDGNHSFVANSADLESAFT 243 Query: 443 DRIGNEIFE 451 + + Sbjct: 244 KEFQDVMSV 252 >gi|1705570|sp|P51942|MATN1_MOUSE RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1163179|gb|AAB06521.1| cartilage matrix protein precursor [Mus musculus] Length = 500 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/282 (10%), Positives = 76/282 (26%), Gaps = 37/282 (13%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKS--YSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + + + + L Q A V + + + Y +C++ Sbjct: 199 PQDEHVDYVESYNVIEKLAKKFQEAFCVVSDLCATGDHDCEQLCVSSPGSYTCACHEGFT 258 Query: 242 YMLYPGPLDPSLSEEHFVDSS-------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + S + LV+ + ++ ++ D + Sbjct: 259 LNSDGKTCNVCRGGGSGSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRLAQV 318 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 ++ + + G + I T A++ D + + Sbjct: 319 G---LVQYSSSIRQEFPL--GRFHSKKDIKARVRNMSYMEKGTMTGAALKYLIDNSFTVS 373 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K ++ TDG + + KAK G ++ + Sbjct: 374 SGARPGA------QKVGIVFTDGRSQDYIND---AARKAKDLGFKMFAVGVG------NA 418 Query: 415 ARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIFERV 453 L AS + +F +N+ IG ++ +++ Sbjct: 419 VEEELREIASEPVADHYFYTADFKTINQ-----IGKKLQKQI 455 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 55/168 (32%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + + + T A+ Sbjct: 72 IESLDVGPNATRVGLVNYASTVKPEFPL---RAHGSKASLLQAVRRIQPLSTGTMTGLAL 128 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A I + D + + K ++++TDG QD+ ++ +A++ GI + I Sbjct: 129 QFA---ITKALSDAEGGRARSPDISKVVIVVTDGRP-QDSVRDVS--ERARASGIELFAI 182 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L K F++ Sbjct: 183 GLGR------VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQEAF 224 >gi|257389158|ref|YP_003178931.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257171465|gb|ACV49224.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 393 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 52/163 (31%), Gaps = 14/163 (8%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D + F+ + L R + G T + + ++ A +T+ S Sbjct: 70 DEDYVSIVAFDTEATVILPAT-RWSDLDRQTAMDHVEELTAGGGTDMYNGLKAAKETLSS 128 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-GIAICNKAKSQGIRIMTIAFSVNKTQ 411 S + K ++LL+DG++ + + + GIRI + + + Sbjct: 129 SA--------TGPDTVKRLLLLSDGKDNERTPDEFEGLAEAIDDAGIRIQSAGIGTDYNE 180 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A ++ + ++ F + + + V Sbjct: 181 ATIRTLGT---AGRGTWTHLEAPGDIEDFFGEAV-EQAGSVVA 219 >gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 684 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/277 (12%), Positives = 73/277 (26%), Gaps = 23/277 (8%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL-S 230 + S + + GQP N F D V Q+ + + Sbjct: 220 DNRTATASSEEETNSENPNFAPPAANGQPANAFTLDQDIVVYWRHQQDLPGSVDLVAYKA 279 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P L + + + K + D ++ + ++ D Sbjct: 280 PGKDRGTFMLSITPGDDLPPITTGSDWVFVLDISGSMSAKLATLGDGVSQALGKLRGGD- 338 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R F+DR S + G T + + A + + Sbjct: 339 -----RFRIVLFDDRAEELTSGFVDATPNNIRQYTKKVMQLQSRGGTNLFGGLSLALNPL 393 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + IVL+TDG + + ++ +R+ T F + + Sbjct: 394 DADRPTG-------------IVLVTDGVANVGKTQQKDFIDLLENHDVRLFT--FVMGNS 438 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++N AS +++ ++ + Sbjct: 439 ANRPMLTAMTN-ASNGFAISVSNSDDIAGQILNATAK 474 >gi|84623314|ref|YP_450686.1| hypothetical protein XOO_1657 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|122879130|ref|YP_200396.6| hypothetical protein XOO1757 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188577378|ref|YP_001914307.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|84367254|dbj|BAE68412.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188521830|gb|ACD59775.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 350 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 86/304 (28%), Gaps = 25/304 (8%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 I I A+ + S + S + S+ + +N F Sbjct: 48 HIDDNAIVQAAQQPISTFSIDVDTGSYSNVRRFLSAGSLPPVDAVRVEELINYFRYDHPA 107 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + + + + P + VD S K Sbjct: 108 PTND-KPFAVRTELATTPWNKDSLLLRVGIAGRDIATADLPPANLVFQVDVSGSMDAPDK 166 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L+R +L ++R ++K D R+ + + G + + Sbjct: 167 LPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVLPPTPG---DQQGRIVEAIDS 217 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GI 387 GSTA ++ AY + + R N I+L TDG+ + Sbjct: 218 LQSGGSTAGASGIELAYK----AAQQGYLRGGINR-----ILLATDGDFNVGVTDFDALK 268 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + ++ + G+ + T+ F A ++ ++ E K+ +G Sbjct: 269 GMVSEKRRSGVALSTLGFGTGNYNDNLMEQS--ADAGDGAYAYIDTPLEARKVLTHELGA 326 Query: 448 EIFE 451 + Sbjct: 327 TLAT 330 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 60/170 (35%), Gaps = 17/170 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + +G ++ D +F GV + K I + T ++ A Sbjct: 110 VDAYSVSEEATHVGVITYSTEATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNLTFMDKA 169 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIR 399 ++ A + + K+ + LTDG T D + + K K +GI Sbjct: 170 LELANSVLFTEARGMRP------NKKQVCLFLTDGIQTFDQGPYTKPSIVSQKLKDRGID 223 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIGNE 448 + T+ + E L + +S + + + A + EL ++ + + Sbjct: 224 VYTVGVGDDVDLFE-----LLSISSGDKYTYSAKNFDELQAKVQEILQEQ 268 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 821 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDD--FGTKLEVISVLQN 878 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ GST +A+ + E + N + ++++TDGE+ ++ Sbjct: 879 DQAMGGSTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 932 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 933 AKALRDKGILVLAVGIDG-----ANPMELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 985 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 70/212 (33%), Gaps = 21/212 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L+ V+ Sbjct: 411 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAP 470 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 471 HK---VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSL 525 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 526 LQKAKK------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGI---- 572 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 +E + L A + ++ L I Sbjct: 573 --KEANQTQLREIAGEEKRVYYVHNFDALKDI 602 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++ D + VR+GA F+D + G + I + Sbjct: 1016 FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQI 1073 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 1074 FGNTHIGAALREVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 1125 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1126 RHRGIDIYSVGIGDVDDQQ 1144 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 250 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 306 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 307 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 360 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 361 TIFTLGIEGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 414 Query: 453 VI 454 V Sbjct: 415 VS 416 >gi|114589213|ref|XP_516745.2| PREDICTED: hypothetical protein [Pan troglodytes] Length = 1859 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 1053 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDD--FGTKLEVISVLQN 1110 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ GST +A+ + E + N + ++++TDGE+ ++ Sbjct: 1111 DQAMGGSTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 1164 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 1165 AKALRDKGILVLAVGIDG-----ANPMELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 1217 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 77/230 (33%), Gaps = 22/230 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L+ V+ Sbjct: 631 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAP 690 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 691 HK---VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSL 745 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 746 LQKAKK------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGI---- 792 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +E + L A + ++ L R+++ EI + K Sbjct: 793 --KEANQTQLREIAGEEKRVYYVHNFDALKD-IRNQVVQEICTEEADLNK 839 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++ D + VR+GA F+D + G + I + Sbjct: 1248 FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQI 1305 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 1306 FGNTHIGAALREVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 1357 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1358 RHRGIDIYSVGIGDVDDQQ 1376 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 470 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 526 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 527 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 580 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 581 TIFTLGIEGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 634 Query: 453 VI 454 V Sbjct: 635 VS 636 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 57/191 (29%), Gaps = 22/191 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + +DAL ++ + D R+G FN+ + I Sbjct: 226 SKMAVAKDALVALTEQLHPDD------RVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIR 279 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D G T I D M A D + + + + +++TD + A Sbjct: 280 EDIEAGGGTNIADGMAEAADMLGEYADSDPTEAETRQ------IVITDAMPNTGQTDDQA 333 Query: 389 ICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + ++ GI + V+ + ++ +S + F + Sbjct: 334 LQDRLAGYAEDGIHTSFVGVGVDFNPELVDEITA---VRGANYRSVHSAED----FETYL 386 Query: 446 GNEIFERVIRI 456 G E V + Sbjct: 387 GEEFEYMVTPL 397 >gi|110832907|ref|YP_691766.1| von Willebrand factor type A domain-containing protein [Alcanivorax borkumensis SK2] gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain [Alcanivorax borkumensis SK2] Length = 698 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/413 (9%), Positives = 97/413 (23%), Gaps = 46/413 (11%) Query: 47 SYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF 106 + +Q + + + + + + + N Sbjct: 104 VLEKQQARQVYEQEKAAGREAGLTEKDSFKAFDVKVQPVRAGQDTRVRLVYMQPAFVDNG 163 Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL--------------RSMG 152 R V + + E + R+ LN S + + +G Sbjct: 164 IGRYVYPLEEGSVDEKKLAFWTANEKVEERFSFTLNLRSSYPVDALRLPKHPQAKIQQLG 223 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + W + + + + G P + F D V Sbjct: 224 SQQWQVTLDNRNASTTIKGKGRSE---------KGDNNFAPPATNGHPPSAFTLDQDIVV 274 Query: 213 KSYSSQNGKVGIRDEKL-SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 Q+ + +P L + + + K Sbjct: 275 YWRHQQDLPGSVDLVAYKAPGKDRGTFMLSITPGDDLPPITTGSDWVFVLDISGSMNAKL 334 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + D + + ++ D R F+DR S + Sbjct: 335 ATLGDGVRQALGKLRGND------RFRIVLFDDRAEELTSGFVDATPNNIRQYTQKIMQL 388 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T + + A + + IVL+TDG + Sbjct: 389 QSRGGTNLFGGLSLALTPLDADRPTG-------------IVLVTDGVANVGKTRQKDFID 435 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 ++ +R+ T F + + +++ AS +++ ++ + Sbjct: 436 LLENHDVRLFT--FVMGNSANRPMLTAMTD-ASNGFAISVSNSDDIAGQILNA 485 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDGE+ ++ +++GI I + Sbjct: 899 HANALFT-----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 953 VG------KANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 50/159 (31%), Gaps = 18/159 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYVTDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 533 KNGMKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNE 448 + L A + L I + I Sbjct: 581 ---ANKTELQEIAGKEERVSFGQNFDALKSIKNEVIREI 616 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDGE+ ++ +++GI I + Sbjct: 899 HANALFT-----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 953 VG------KANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 50/159 (31%), Gaps = 18/159 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYVTDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 533 KNGMKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNE 448 + L A + L I + I Sbjct: 581 ---ANKTELQEIAGKEERVSFGQNFDALKSIKNEVIREI 616 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDGE+ ++ +++GI I + Sbjct: 899 HANALFT-----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 953 VG------KANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 48/151 (31%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 533 KNGMKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + L A + L I Sbjct: 581 ---ANKIELQEIAGKEERVSFGQNFDALKSI 608 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 841 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDGE+ ++ +++GI I + Sbjct: 899 HANALFT-----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 953 VG------KANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 48/151 (31%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 533 KNGMKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + L A + L I Sbjct: 581 ---ANKIELQEIAGKEERVSFGQNFDALKSI 608 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 786 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 843 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDGE+ ++ +++GI I + Sbjct: 844 HANALFT-----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVG 897 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A N+ ++ +L +F Sbjct: 898 VG------KANQKELEGMAGNKNNTIYVDNFDKLKDVF 929 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 48/151 (31%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 420 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 477 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 478 KNGMKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 525 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + L A + L I Sbjct: 526 ---ANKIELQEIAGKEERVSFGQNFDALKSI 553 >gi|226194158|ref|ZP_03789758.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|225933851|gb|EEH29838.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 418 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 96/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +++ + T + ++D V +F+ Sbjct: 255 GDKAATTNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 64/201 (31%), Gaps = 31/201 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S N+ + +G F + + + +I+ Sbjct: 109 TRLQAAQEAATSF------ARNMTPGINLGLISFAGTATVLVNPTTDRNGVIKA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 + STA + + A + S + IVL++DG+ T + Sbjct: 158 ENLKLAQSTATGEGIFAA---LQSVESFSSLVGGADGPPPARIVLMSDGKQTVPEDLYAA 214 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQ---------QEKARYFLSNCA--SPNSFFEAN 432 AK G+ I +I+F L A S F++A Sbjct: 215 RGGYTAAQAAKQAGVPISSISFGTTHGSVTIDDKPQPVSVDDESLREIARLSGGDFYKAA 274 Query: 433 STHELNKIFRDRIGNEIFERV 453 S EL K++ D G +I + Sbjct: 275 SAEELKKVYADL-GEQIGYEL 294 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 79/261 (30%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ N + ++ S + ++ + ++ Sbjct: 291 SIMFMQNLDSVTEFCTANTHNTEAPNLQNKMCNHRSTWDVIMDSEDFQNASSMAGTNPPP 350 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + ++ +D S + + A A + I + +++ +G Sbjct: 351 HPIFSLLRAKQRVVCLVLDKSGSMDSEDRLLRMNQAAALYLIQIIERESL-----VGMVT 405 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + + T + + G T+I ++ + I SN++ Sbjct: 406 FESTAKIQNNLTKITDD-DTYQKITANLPQVAGGGTSICSGLKAGFQAITYSNQNTSGSE 464 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ + C + K G I TIA K LS Sbjct: 465 ---------IVLLTDGEDNGIHS-----CFEEVKQSGAIIHTIALG---PSAAKELEILS 507 Query: 421 NCASPNSFFEANSTHELNKIF 441 + F+ + L F Sbjct: 508 SMTGGYRFYANKDINGLIDAF 528 >gi|254187133|ref|ZP_04893648.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254263081|ref|ZP_04953946.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|157934816|gb|EDO90486.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254214083|gb|EET03468.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 418 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 96/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +++ + T + ++D V +F+ Sbjct: 255 GDKAATTNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLAFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 47/127 (37%), Gaps = 8/127 (6%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI-DENEMGSTAI 339 V+ I + ++ V + F D +F + + F GSTA Sbjct: 91 VLDFIDLVPISSEEVHLSVVTFADSPQDVFTFKQPQATNKQLAKEAFKYLRYRRGGSTAT 150 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + + A + K +VL+TDGE+ + + + I ++A+++GI Sbjct: 151 DKGLIRARRYLT------RPVYGTRANVPKVLVLMTDGESDR-HYDTIQAADQARAEGIS 203 Query: 400 IMTIAFS 406 + + Sbjct: 204 VFVVGVG 210 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 65/193 (33%), Gaps = 19/193 (9%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 SS + L++D + ++ K D + F DR Sbjct: 248 VFDASSSINANNPNNYGLMKDFMKDIVDRFNKTGP--DGTQFAVVTFADRATKQFGLKDY 305 Query: 316 VHKLIRTIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + +K G TAI D ++ A + E +K ++L Sbjct: 306 SS---KAEIKGAIDKVTPSIIGQTAIGDGLENARLEVFP-----NRNGGGREEVQKVVIL 357 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEAN 432 LTDG+N + + +G+ I+ I + L N AS + F + Sbjct: 358 LTDGQNNGHKS-PEHESSLLRKEGVVIVAIGVGTG-----FLKSELINIASSEEYVFTTS 411 Query: 433 STHELNKIFRDRI 445 S ++L+KI + + Sbjct: 412 SFNKLSKIMENVV 424 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 63/194 (32%), Gaps = 16/194 (8%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + + K+ V++ + S N + F+ V + Sbjct: 50 FDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGVNGRNGSQFG---VVQFSQGVKTAFPL 106 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T I +Q + S E N + +K ++ Sbjct: 107 -NKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAEG-----GGNPDKQKIVI 160 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 L+TDG++ +K K++G+ ++ I + L A+ ++ Sbjct: 161 LMTDGKSNAGAP-PQHEAHKLKAEGVTVIAIGIGQG-----FVKTELEQIATMKNYVLTT 214 Query: 433 ST-HELNKIFRDRI 445 ++ EL+ + + I Sbjct: 215 NSFSELSTLLKLVI 228 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 57/201 (28%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVSPSRLAAAQEASKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I K D TA + + TA I + + IVL +DG+ Sbjct: 162 KTAIDKLQLADR-----TATGEGIFTALQAIATVG---AVIGGGDEPPPARIVLFSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR---------YFLSNCA-- 423 T + AK QG+ I TI+F E L A Sbjct: 214 TVPSNPDNPKGAFTAARTAKDQGVPISTISFGTPYGYVEINEQRQPVPVDDQMLKKIADL 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S F A+S +L +++ + Sbjct: 274 SEGEAFTASSLEQLREVYANL 294 >gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Acyrthosiphon pisum] Length = 884 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 59/173 (34%), Gaps = 8/173 (4%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQ 344 ++ ++ V T N + + + + + K F D ST + DA+ Sbjct: 412 TSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDALN 471 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN---KAKSQGIRIM 401 A+ +I+ + + N K IV LTDGE T E + + I Sbjct: 472 KAH--LIAKLGETRFKDGANT-PKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIY 528 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ F + L+N +EA+ + F I + + V Sbjct: 529 SLGFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEISSPVLSNVT 581 >gi|189536038|ref|XP_693697.3| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Danio rerio] Length = 1157 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 54/428 (12%), Positives = 122/428 (28%), Gaps = 32/428 (7%) Query: 51 HALKQAAQTA---IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 + L A + A +++S + + + + + + + Sbjct: 71 NQLHIAKEAAFEVDLSSSAFISNFTITSNHKVYVAQVRRRTDARKIYDNAKKQGKTAGLV 130 Query: 108 DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + R++ + P +R L+ L R +G + + Sbjct: 131 ATKEREMEKFRVAVNVPPG--------ARVSFSLSYEELLSRRLGRYELSLGLRPGQPVQ 182 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 + S + + + S + P+ + ++ + + + Sbjct: 183 NLSLEVSISERTGISFIRALPLRTSRLLSNTVQADAEAPPSTKVKQNAYCAHVRYTPSIQ 242 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK------------KHLVR 275 + + + + Y L + + D + + + + Sbjct: 243 QQRNVSPKGLSADFIIQYDVELKDPMGDIQVDDGYFVHYFAPRGLPVVPKDVIFVIDISG 302 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID----- 330 + + I+ K + +FN SD +W + +R + Sbjct: 303 SMIGTKIKQTKAAMVSILSDLREGDYFNLITFSDDVHTWKKDRTVRATRQNVRDAKEFVR 362 Query: 331 -ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK-KYIVLLTDGENTQDNEEGIA 388 G T IN A+ +A + S + + I+ LTDGE T E Sbjct: 363 KIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDV 422 Query: 389 ICNKA-KSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 I + A KS G + + +AF + R L N +E + K F D + Sbjct: 423 ILHNAQKSLGLVSLFGLAFGDDADFPMLRRLALENRGVARMVYEDDDAAIQLKGFYDEVA 482 Query: 447 NEIFERVI 454 + + Sbjct: 483 TPLLSDIQ 490 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 65/193 (33%), Gaps = 19/193 (9%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 SS + L++D + ++ K D + F DR Sbjct: 245 VFDASSSINANNPNNYGLMKDFMKDIVDRFNKTGP--DGTQFAVVTFADRATKQFGLKDY 302 Query: 316 VHKLIRTIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + +K G TAI D ++ A + E +K ++L Sbjct: 303 SS---KAEIKGAIDKVTPSIIGQTAIGDGLENARLEVFP-----NRNGGGREEVQKVVIL 354 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEAN 432 LTDG+N + + +G+ I+ I + L N AS + F + Sbjct: 355 LTDGQNNGHKS-PEHESSLLRKEGVVIVAIGVGTG-----FLKSELINIASSEEYVFTTS 408 Query: 433 STHELNKIFRDRI 445 S ++L+KI + + Sbjct: 409 SFNKLSKIMENVV 421 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 63/194 (32%), Gaps = 16/194 (8%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + + K+ V++ + S N + F+ V + Sbjct: 47 FDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGVNGRNGSQFG---VVQFSQGVKTAFPL 103 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T I +Q + S E N + +K ++ Sbjct: 104 -NKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAEG-----GGNPDKQKIVI 157 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 L+TDG++ +K K++G+ ++ I + L A+ ++ Sbjct: 158 LMTDGKSNAGAP-PQHEAHKLKAEGVTVIAIGIGQG-----FVKTELEQIATMKNYVLTT 211 Query: 433 ST-HELNKIFRDRI 445 ++ EL+ + + I Sbjct: 212 NSFSELSTLLKLVI 225 >gi|72008858|ref|XP_787086.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115969503|ref|XP_001184155.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 429 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 45/338 (13%), Positives = 95/338 (28%), Gaps = 14/338 (4%) Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 V D+ RDT V + +Y +N + + + T+ + Sbjct: 18 SVPDVPRDTPVIFVLIYDQLVNKVRGQYQQRINV-NPRQVVDNLGVDVQVTEPQRIKDIS 76 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPADRTVKSYSSQNGKVGIRDE 227 + V +D SR ++ Q + D Sbjct: 77 VAFQGKPVPSLSDVDLSRDLVRESNSRYKFRYMPTPDEQSDFSEKGIDGDVVLTYNLMDA 136 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + N + P LD + FV S K ++AL +++ ++ Sbjct: 137 PTGSHTQVQNDYFVHFFSPIGLDVISKQIVFVIDVSASMYGTKLSQTKEALKTMLDNLNP 196 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D N + R + + + + + + T +N+A+ A Sbjct: 197 TDYFNIITFSDGVQY-WRENNRLAPAQRRYMDDAM---AYVDSLRDDSETNLNEAIVKAG 252 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIA 404 + + S ++LLTDG + + I + A+ + + + Sbjct: 253 ELLDSEARYNRP----GDSVYSMMILLTDGRPSVGTTDQQEILDNAREVIAGKHSLNILG 308 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 F + N + +E + E + F Sbjct: 309 FGRLVDFDLLVKLAYENNGTAKMIYEGTTAAEQLREFY 346 >gi|299532595|ref|ZP_07045984.1| hypothetical protein CTS44_17423 [Comamonas testosteroni S44] gi|298719398|gb|EFI60366.1| hypothetical protein CTS44_17423 [Comamonas testosteroni S44] Length = 412 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/320 (11%), Positives = 74/320 (23%), Gaps = 11/320 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G I AL + +LG G +D R + L+ A + + A+ L + Sbjct: 8 LRRQRGAIIITVALALLFLLGFMGFALDFGRLFVVKTELQTATDSCALAAAQEL----DG 63 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S + + + +D D + + P SA Sbjct: 64 ASDARTRAVNAGLAAGNSNRVHFQGASAGIAVSDISFSDALNGSYSTTFPVTSAKYARCE 123 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 LL L+++G S A + + ++ Sbjct: 124 HARTGLLP---WLLQALGAFSGNASYGANQSVKALAVATRAPSQTNCLMPIGICQKSPSS 180 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 S + + N+ + G + + Sbjct: 181 PFGYSRGEWLS----GVTNSNDEVEASGAFKWLDYTGSGGGANEIKNLLAGNGACNLPGT 236 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + D S S + D+ D +D + + Sbjct: 237 ATNVTDPSKNGKTNGAVDAWNTRFGIYKGSYNQADSPPDETGYAWYAADDSGKNPGRYDA 296 Query: 315 GVHKLIRTIVKTFAIDENEM 334 T ++N Sbjct: 297 LFSTKRSTHAPYEGDNKNPD 316 >gi|126662670|ref|ZP_01733669.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] gi|126626049|gb|EAZ96738.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] Length = 347 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 67/188 (35%), Gaps = 26/188 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ +I + R+G + + + + +++ Sbjct: 110 NRLEKTKQLVSQIINQL-------GNDRVGIVGYAGSAYPILPMTTDYS-IAKMYLQSMN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA NDA++ A D + + K I+L++DGE+ + Sbjct: 162 TNMVSSQGTAFNDAIKLAVD------------YFDVKDTSKLIILVSDGEDHGEGA--SE 207 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-LNKIFRDRIGN 447 + AK +G+RI+TI K + +S + + + + K++ D + N Sbjct: 208 AIDLAKEKGVRILTIGVGTEKGALIPLKDNKGTISS---YKKDQNGENVITKLYPDVLKN 264 Query: 448 EIFERVIR 455 + + Sbjct: 265 IATKTKSK 272 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 56/160 (35%), Gaps = 7/160 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +K ++S G I+ AL +PV+LG + VD+ R L+ AA A + + L S Sbjct: 10 QKRLQSERGGAAILFALTLPVLLGFAALAVDLARIHLTRVELQNAADAAALGGARSLSDS 69 Query: 72 LEEV---SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-MNPRKS 127 S+ + ++ + + ++ + +R + Sbjct: 70 GGNPYNWSAAGSAALDIARRNVANGAGIQDALIETGYWNIQDPSEGLRAPGTPGVPAAGD 129 Query: 128 AYQVVLSSRYDLLLNPLS---LFLRSMGIKSWLIQTKAEA 164 V ++ LN F +GI +Q + A Sbjct: 130 VAAVQVTITISRTLNNGPLRLFFAPVLGIAEQDVQGSSVA 169 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 60/204 (29%), Gaps = 45/204 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ +I + R+G + + + + ++ Sbjct: 110 SRLAKAKRIVSEIIAQLGS-------DRIGIIAYAGQAYPQLPITTDY-GAAKMFLQGLN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAIN A+ A + + + + +++DGE+ Sbjct: 162 TNMLSSQGTAINQALDLASTYYDD-----------DEQTNRVLFIISDGEDHS-EGSTEG 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-----------------------EKARYFLSNCA-- 423 KA QGI+I TI K + + L + A Sbjct: 210 AVEKAVDQGIKIFTIGVGTEKGAPIPIKRNGVLQSLKKDLQGEVVITKLNKSVLEDIADE 269 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 F + +T++ I ++ + Sbjct: 270 GNGQFIDGTNTNDAVTIIKEELNK 293 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 63/191 (32%), Gaps = 21/191 (10%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR----VISD 309 + + + + A+ S++ + V + T + D Sbjct: 40 DSSGSMTARDAGGSGTRMDAAKRAVGSMVDGLPAGAQVGLAIYGAGTGSSGAEKVAGCKD 99 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 V + + +K G T I A++TA + ++ Sbjct: 100 VRVVQPVGPVNKPALKRAVTATKASGYTPIGQALRTAAAQLPKEG-------------QR 146 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 IVL++DGE+T + + + QG+ + TI F V+ + + N + + Sbjct: 147 SIVLVSDGEDTCAPPQPCEVAKELSKQGVDLHVHTIGFRVDAKARAQLACIAQN--TGGT 204 Query: 428 FFEANSTHELN 438 + +A+ L Sbjct: 205 YHDASDADSLL 215 >gi|218672104|ref|ZP_03521773.1| von Willebrand factor type A [Rhizobium etli GR56] Length = 366 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 80/266 (30%), Gaps = 23/266 (8%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 +M D Q + +N F R K+ V + + + ++ Sbjct: 10 GAMPDPQSVRVEEMINYFPYDWPRPDKADQPFKATVTVMPTPWNHDTELMHVAIKGYDIA 69 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + +D S K L++ A ++ +K D V+ + Sbjct: 70 PATAPHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVNRLKPDDTVS------IVTYAGNA 123 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + ++ + + GST + ++ AYD + Sbjct: 124 GTVLEPT---RVAEKSKILSAIDRLEAGGSTGGAEGIEAAYDL--------AKKAFVKDG 172 Query: 367 AKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + + L TDG+ +E+ I + +GI + + F + N Sbjct: 173 VNRVM-LATDGDFNVGPSSDEDLKRIIEDKREEGIFLTVLGFGRGNLNDSLMQTLAQN-- 229 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 S ++ E K + G+ + Sbjct: 230 GNGSAAYIDTLAEAQKTLVEEAGSTL 255 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 70/216 (32%), Gaps = 29/216 (13%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 PG ++ +D S K + A ++ +K D R+ + Sbjct: 88 PGARRSPVNLALIIDRSGSMSGY-KLEQAKQAARHLVTLLKDDD------RLAIVHYGSD 140 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V S P R + + + G T I+ + + ++ D Sbjct: 141 VKSLPGLQ--ATPANRERMIQYIEGIWDEGGTNISAGLLAGQAQVETARSDYRVNR---- 194 Query: 366 EAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++L++DG+ T+ + + + +++GI + +I + + Sbjct: 195 -----LILISDGQPTEGSTDEGSLKQVVKDIRTRGITVSSIGVGTDFN-----EDLMQAF 244 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 A S+ +L +F+ + + +V R Sbjct: 245 AEYGAGSYGFLEDAGKLATLFQKDL-QQASTQVARN 279 >gi|300933821|ref|ZP_07149077.1| hypothetical protein CresD4_07088 [Corynebacterium resistens DSM 45100] Length = 676 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 60/182 (32%), Gaps = 25/182 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDN----VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + A +I ++ N V A D V + + + Sbjct: 72 TRLDAAKKAATGLIDALPDSANMGMVVYGQQESNAPNNRAAGCKDVETISPVGPINKGEL 131 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K + G T I +++ A + + E ++ IVL++DG +T Sbjct: 132 KDRISNFKAKGYTPIGNSLLKAAEEL-------------GKEGERSIVLVSDGHDTCAPP 178 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + K +G I T+ F + KAR L A S + A + EL+ Sbjct: 179 PVCEVAKKLAGEGYNLTIHTVGFHAD----RKARKELECIAKTSGGQYLSAENASELSNS 234 Query: 441 FR 442 + Sbjct: 235 MK 236 >gi|237507530|ref|ZP_04520245.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] gi|234999735|gb|EEP49159.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] Length = 418 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 95/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +++ + T + ++D V +F+ Sbjct: 255 GNKAATTNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLAFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 63/185 (34%), Gaps = 22/185 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A S+I S+ D R+ F+ R + T++++ Sbjct: 59 LETVKKAALSLIESLGVND------RLSVIAFDHRAKVILP--SQSREDDLTLIRSKIQQ 110 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ + ++ +I LLTDGEN +N+ + + Sbjct: 111 LQAGGGTAIDEGIKLGIQESSTGSKG----------YVSHIFLLTDGENEHGNNQRCLKL 160 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 A GI + T F + Q + + A + + L + F Sbjct: 161 AEVAAEYGITLNTFGFGDHWNQDILEKIA--DIAGGSLSYIERPEQALIE-FTRLFNRLQ 217 Query: 450 FERVI 454 R+ Sbjct: 218 SVRLT 222 >gi|134281810|ref|ZP_01768517.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246872|gb|EBA46959.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 418 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 95/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 +++ + T + ++D V +F+ Sbjct: 255 GNKAAATNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 44/123 (35%), Gaps = 14/123 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + +T +++ D TAI+ A+ A Sbjct: 129 DRVGLVGYAGSAFPQVPITTDYAS-TKTFLQSMNTDMVSSQGTAISQAIDLAKSYYND-- 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + K +++L++GE+ + ++ A ++GI+I TI + Sbjct: 186 ---------DDQTNKVLIILSEGEDH--DSNVESMAETAAAEGIKIYTIGVGTERGDPIP 234 Query: 415 ARY 417 + Sbjct: 235 IKK 237 >gi|268611865|ref|ZP_06145592.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 550 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/275 (13%), Positives = 76/275 (27%), Gaps = 25/275 (9%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T S +++ +I+ +++ + +N F + + V I Sbjct: 103 STFSADVDTASYTNVRRLIENR-NIVPEDAVRIEEFINYFDYDYPQPEDGSA-FGRYVEI 160 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D + + P + +DSS + K LV+ A + + Sbjct: 161 ADCPWNRDHKLMMVGIQGKELQQQETPPSNLVFLIDSSGSMNSYDKLPLVQSAFSMLAEQ 220 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + K D ++ S G + GST ++ Sbjct: 221 LDKNDRISIV---------TYAGSSAVLLDGEKGSNTDEILEQLYSITASGSTNGEGGIK 271 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIM 401 TAY+ + ++L TDG+ + + GI + Sbjct: 272 TAYEL--AEEHFIKGGNNR-------VILATDGDLNVGASSEEELTRLIETKRDNGIYLS 322 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + F + + N +F +S E Sbjct: 323 VLGFGEGNYKDARMEALADN--GNGNFSYIDSEDE 355 >gi|332559488|ref|ZP_08413810.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] gi|332277200|gb|EGJ22515.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] Length = 651 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 40/384 (10%), Positives = 99/384 (25%), Gaps = 24/384 (6%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + E + A + + + + + P Sbjct: 111 DAPEADARLAAVPEAGGGAETAGAPVPAEPRARSAEGAAPQTFAADEAMPMAVPPAPDFA 170 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRS 188 ++ P ++ AE + S + +I + Sbjct: 171 LSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTFSIDVDTASYAILRSSLRAGQ 230 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + + +N F + + +P + +L + Sbjct: 231 LPPREAVRIEEMINYFPYDYPAPESGTPPFRPSLSVTRTPWNPETRLVHVALQGRMPAIE 290 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P L+ +D+S K L++ + ++ ++ D V + Sbjct: 291 DRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVA------IVTYAGSAGE 344 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + R+ + + + GSTA + + AY T Sbjct: 345 VLAPT---AANQRSTILSALDRLDAGGSTAGEEGLALAYRTASEMAGAGEVTR------- 394 Query: 369 KYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 +VL TDG+ + + + G+ + + F + N Sbjct: 395 --VVLATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQALAQN--GN 450 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 +S +E K+ D++ + Sbjct: 451 GQAAYIDSLNEAQKVLVDQLSGAL 474 >gi|91775988|ref|YP_545744.1| membrane protein-like protein [Methylobacillus flagellatus KT] gi|91709975|gb|ABE49903.1| membrane protein-like protein [Methylobacillus flagellatus KT] Length = 542 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 41/145 (28%), Gaps = 1/145 (0%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G + L + + + + VD R + L+ A A I A + + Sbjct: 7 KKQQGAIGLFGVLTLLMAVLFVAVAVDSGRLWMEKRKLQNIADMAAIAAGGQVG-GCAQN 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 +S N D N +SA +V+ + Sbjct: 66 NSSEAYKAAAQAAAAANGYQGNLLAAPNAVQLGGYHTDSDGIRTFAANNERSAVRVLATQ 125 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQT 160 L +F + + +++ + Sbjct: 126 EVPSSLFAGGIFNQRIVLRTEAVGA 150 >gi|126442905|ref|YP_001064078.1| hypothetical protein BURPS668_A3087 [Burkholderia pseudomallei 668] gi|126222396|gb|ABN85901.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 418 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 96/378 (25%), Gaps = 12/378 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I+ AL++ V++G G+ +D+ + L+ +A A A+ + Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAIN 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S T + + N T + + S V + Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + ++N L + + + + + + ++ Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVA 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + + G + + + + Y L Sbjct: 195 GATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTP 254 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +++ + T + ++D V +F+ Sbjct: 255 GDKAATTNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLAFA- 313 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYI 371 + N GS + A + + E + + + A Sbjct: 314 -------SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDC 366 Query: 372 VLLTDGENTQDNEEGIAI 389 VL+ D + + + + Sbjct: 367 VLMLDPMGSGGSATPVHL 384 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 50/399 (12%), Positives = 111/399 (27%), Gaps = 42/399 (10%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + E S S K K+ + RN + + + +D + Sbjct: 204 DNISVGEEFSVSIDIKSNYATKAKLTLFSGRNKVGEQQVQIQKGKNSFVFKDKQSSGGFK 263 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 V + + N S F M + L+ + ++ + + + I Sbjct: 264 GYRVLVEAEGDTNKVNNEFSTFTNVMDKPNILLINGVKGDSEALEGILSNSGANIKKIAP 323 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSY-SSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S S E + + D + + V L + + +L Sbjct: 324 SASPSTLNELLEYKSIVLNDVHRDDLSNGFMDNIEAYVKDYGGGLITFGGEDSYALGGYK 383 Query: 245 YPGPLDPSLSE------------------EHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + S+ + K L ++A +K Sbjct: 384 DTSLEKVLPVYMDKRGKNEVPAISINLIIDKSGSMSAEGGGVSKLTLAKEAA------MK 437 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 ++N+ + + F+D + +K G T+I A++ Sbjct: 438 ALENLREVDEISVIAFDDTYDEVVPL---QKVGDKEAIKELISGIQIRGGTSIYPALEQG 494 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 Y+ + K+ +LLTDG++ + + I + T+A Sbjct: 495 YNM-----------QMQSSAKIKHTILLTDGQDGYGLDNYATLLQNFIDNNITLSTVAVG 543 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L++ S++ T ++ +IF + Sbjct: 544 EGANAGLLNQ--LASIGKGRSYYTDIYT-DIPRIFAKEV 579 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 55/169 (32%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +K +D D R+G + V ++ S +K I + + T Sbjct: 75 LITMLKFLDIGPDNTRVGLLQYGSTVKNEFSLK--TYKRKPDIERAVKRMMHLATGTMTG 132 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E N + +++TDG I KA++ GI I Sbjct: 133 LAIQYAMNIAFSEAEGARP---LNQYVPRIAMIVTDGRPQDP---VAEIAAKARNSGILI 186 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHE---LNKIFRD 443 I L S F + + L +F++ Sbjct: 187 FAIGVGR------VDMSTLKTIGSQPHSEHVFLVANFSQIETLTSVFQN 229 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 60/204 (29%), Gaps = 18/204 (8%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + +D S +V+ + ++ S++ Sbjct: 550 YSLGEDGKSCKGEKKCGEGPVDLVFVIDGSKSLG-EDNFEIVKQFVKGILDSLEISQKAA 608 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+G ++ V ++ + + + + + K + + + A++ ++ S Sbjct: 609 ---RVGLIQYSTHVRTEFTMAQ--YSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFS 663 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + R + ++ TDG +E KAK GI I I ++ Sbjct: 664 EAQGARARPMRVP---RVAIVFTDGRA---QDEVSEYAEKAKQSGITIYAIGIGKAIDEE 717 Query: 413 EKARYFLSNCASPNSFFEANSTHE 436 L AS + Sbjct: 718 ------LQEIASAPQEKHVIYAED 735 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 + + +D D R+ + V + L + +K T Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIESLLK---SHLTKDTIKQAITRIEPLAAGTMT 143 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A++ A D + + K K +++TDG + ++ + A++ GI Sbjct: 144 GMAIKKAMDEAFTEKSGARPKSK---NISKVAIIVTDG---RPQDQVEEVSAAARASGIE 197 Query: 400 IMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 I + L AS + F + +L FR+ + Sbjct: 198 IYAVGV------DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETL 243 >gi|296269770|ref|YP_003652402.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092557|gb|ADG88509.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 315 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 59/180 (32%), Gaps = 25/180 (13%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F S + + T+I +A+ A D+I Sbjct: 123 PERFNVGVVAFARTAAVVISPTTDHAAVT-----NAIAGLTTRPGTSIGEAVFNALDSIR 177 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S + + IVLL+DG+NT AI A + + + TIA+ Sbjct: 178 SFDREAATDP-----PPAAIVLLSDGDNTSGRPVSEAI-EAAANAKVPVSTIAYGTPDGY 231 Query: 412 QEKA---------RYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERV--IRITK 458 + L + + A S EL ++++ IG + + IT+ Sbjct: 232 VMIDNRPVQVPVNKAALQELSEGTGGRAYTAESASELREVYQQ-IGTSLGYTIEHQEITQ 290 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 69/206 (33%), Gaps = 24/206 (11%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 Y L PG P +D+S + A ++I + R+ Sbjct: 102 YQLNPGSAKPVKDIVLVIDNSGSMNETDPNQDRYTAAKNLINRMD------RDNRVSVIM 155 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ F+ ++ + + G T I+ A++ I S + Sbjct: 156 FDHATTLLQPFTRVNNQETKDEIIAEIDGLATTDGGTDISLALEDTMSHIQESRDAGRSA 215 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +++L+DG + D++ +A + K Q I + TI S+ L Sbjct: 216 ---------MVIMLSDGFSETDHDRVLA---EYKQQQIAVNTIGLSLVNPD---GAQLLQ 260 Query: 421 NCA--SPNSFFEANSTHELNKIFRDR 444 A + +++ +L+ +F+ Sbjct: 261 TIAAETGGQYYDVQHAEDLSFVFQKI 286 >gi|194211145|ref|XP_001494734.2| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 1022 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 59/203 (29%), Gaps = 30/203 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + + +G F+ Sbjct: 312 DKSGSMAGSNRLNRMNQAAKHFLMQTIE-----NGSWVGMVHFDSTAYIKSKLIQITSSN 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ + + IVLLTDGE+ Sbjct: 367 ERNKLLESLPT-AASGGTSICRGIKSAFQVLTGTYPQIDGSE---------IVLLTDGED 416 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 C ++ + G I IA + Q + + A+ Sbjct: 417 N-----TAGSCVDEVRQSGAIIHFIALGPSADQAVIEMSTI----TGGKHKYASDEAANN 467 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + ++ ++ +++ Sbjct: 468 GLIDAFAALVSGNADLSQQSLQL 490 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/319 (13%), Positives = 95/319 (29%), Gaps = 35/319 (10%) Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + YD + + + ++ + + RSY +E + Sbjct: 108 TREYDYQEEHRFVSAKDFPLSTFAADCDTASYSNIRSYIEEGML------------PPAG 155 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + +N F ++ D + L + Sbjct: 156 AVRVEEMINYFDYDYVSGPEAGKKFAVYTEYADCPWNKDTKLMMVGLNTAAIDMSEKKAS 215 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D+S + K L + A + ++ + D ++ SD Sbjct: 216 NLVFLIDTSGSMYEENKLPLAQKAFKMLAENLDENDRISIV---------TYAGSDTVVL 266 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 GV + GST + + TAY+ E + + NN ++L Sbjct: 267 NGVAGSEAYTICEALDSLEASGSTNGSAGLITAYEI----AEQQFIKDGNNR-----VIL 317 Query: 374 LTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 TDG+ + + + + K GI + + F + + K + ++ Sbjct: 318 ATDGDLNVGLTSESDLVGLITEEKDSGIFLSVLGFGSDNLKDNKLEALADH--GNGNYSY 375 Query: 431 ANSTHELNKIFRDRIGNEI 449 +S +E K+ D +G + Sbjct: 376 LDSVYEAKKVLVDEMGGTL 394 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 821 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDD--FGTKLEVISVLQN 878 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ GST +A+ + E + N + ++++TDGE+ ++ Sbjct: 879 DQAMGGSTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 932 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 933 AKALRDKGILVLAVGIDG-----ANPVELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 985 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 69/212 (32%), Gaps = 21/212 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L+ V+ Sbjct: 411 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAP 470 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 471 HK---VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSL 525 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 526 LQKAKK------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGI---- 572 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 +E + L A + + L I Sbjct: 573 --KEANQTQLREIAGEEKRVYYVHDFDALKDI 602 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++ D + VR+GA F+D + G + I + Sbjct: 1016 FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQI 1073 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 1074 FGNTHIGAALREVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 1125 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1126 RHRGIDIYSVGIGDVDDQQ 1144 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 250 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 306 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 307 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 360 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 361 TIFTLGIEGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 414 Query: 453 VI 454 V Sbjct: 415 VS 416 >gi|302796872|ref|XP_002980197.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] gi|300151813|gb|EFJ18457.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] Length = 542 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 57/190 (30%), Gaps = 30/190 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV+ A+ VIR++++ D R+ F+D + + Sbjct: 97 KLELVKTAMEFVIRNLRQQD------RLAIVSFSDEPKVHLGLKRMTYDGREAALSA-VE 149 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T I ++ +D + N I+LL+DG + Sbjct: 150 KLRTLGGTEIRPGLKAGFDLLSRR---------RNRNPVSSIMLLSDGMDN---AITFKR 197 Query: 390 CNKAK--------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 C S+ + + T F + + A+ SF + F Sbjct: 198 CKVLPVDSYLEDCSERVPVHTFGFGSDHDPEAMLSIAE---ATGGSFCYVQEESTVQHAF 254 Query: 442 RDRIGNEIFE 451 IG + Sbjct: 255 AQCIGGLLSV 264 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 48/146 (32%), Gaps = 21/146 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+D + ++ G T I ++ A + S++ Sbjct: 455 LVVFDDTANWVLQLQPLPSMV---EIERALGSFGIGGGTNIRPGIEQAALALASTDAKI- 510 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K+++LLTDG + + IA + ++ GI I T+A ++ Sbjct: 511 ----------KHVLLLTDGIAESNYSDLIA---QMRASGITISTVAVGLDANPNLVDV-- 555 Query: 419 LSNCASPNSFFEANSTHELNKIFRDR 444 A + S E+ +IF Sbjct: 556 --ANAGGGRSYRVTSIDEVPRIFLQE 579 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 46/148 (31%), Gaps = 21/148 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +I ++ R+G + + + + Sbjct: 111 SRIDKAKQLVTQIINNLGS-------DRVGIIAYAGSAYPQLPITTDYSSA-KMFLNAMN 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI DA++ A + + +V+++DGE+ + Sbjct: 163 TDMLSSQGTAIRDAIELAKTYYNDEEQ-----------TNRVLVIISDGEDHAGEV--AS 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 I A QGIRI TI + + Sbjct: 210 IAESATEQGIRIFTIGVGSEAGDRIPIK 237 >gi|302796876|ref|XP_002980199.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] gi|300151815|gb|EFJ18459.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] Length = 550 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 57/190 (30%), Gaps = 30/190 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K LV+ A+ VIR++++ D R+ F+D H + Sbjct: 99 KLELVKTAMEFVIRNLRQQD------RLAIVSFSDEPKVHLGLKRMTHDGRAAALSA-VE 151 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +G T I ++ +D + N I+LL+DG + Sbjct: 152 KLRSLGGTEIRPGLKAGFDLLSRR---------KNRNPVSSIMLLSDGMDN---AITFKR 199 Query: 390 CNKAK--------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 C S+ + + T F + + A+ SF + F Sbjct: 200 CKVLPVDSYLEDCSERVPVHTFGFGSDHDPEAMLSIAE---ATGGSFCYVQEESTVQHAF 256 Query: 442 RDRIGNEIFE 451 IG + Sbjct: 257 AQCIGGLLSV 266 >gi|84498148|ref|ZP_00996945.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381648|gb|EAP97531.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 533 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 41/392 (10%), Positives = 92/392 (23%), Gaps = 38/392 (9%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 + A+I A V L+ ++ FEN D Sbjct: 2 NSLPAALIAAVVVLVTGCSASGEGDTSA-----------AHDYFENYPTAQENDSASGTS 50 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 A + L + S I T+ + Sbjct: 51 SSSAVGGQASSGVAQPFPAAPNVPGPLEDNTF---VDAGTSGFIDTRERPRSTFAVDVDG 107 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + + + + +N F + + R Sbjct: 108 GSFRVARSLLHDGHLPPPESVRPEEWVNSF-DSGFPAPRKDDLELQSDQARASSEDDGTR 166 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 L ++ VD+S + ++ LV+ +LA + +++ D + Sbjct: 167 LVRIGLQGREVDVREWQPVALTMVVDTSGSMDIRERLGLVKSSLALLAENLRPDDTIAIV 226 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + GST + + YD + Sbjct: 227 ---------TYQTDATPLLEPTPVRDTDTILAAIDRLEAGGSTNLEAGLLLGYDQAREAY 277 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQ 411 + ++L +DG +G + + GI ++T+ + + Sbjct: 278 KQGATN---------VVLLASDGVANVGVTDGGRLATAIRDNGRRGIHLVTVGYGMGNYS 328 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + ++ E K+F + Sbjct: 329 DHLMEQLADQ--GDGFYEYIDTFEEARKLFVE 358 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 40/121 (33%), Gaps = 15/121 (12%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + V G T I +++ AY+ + +K+I+LLTDG++ Sbjct: 465 DKEKVIEKINGLTSGGGTNIFPSLELAYEQLTP-----------LELQRKHIILLTDGQS 513 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + I + K I + T+A F++ N + + Sbjct: 514 ATSPDYLTTI-QEGKENNITLSTVAIGEGSDSVLLEELSDE---GGGRFYDVNDSSTIPS 569 Query: 440 I 440 I Sbjct: 570 I 570 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/159 (22%), Positives = 60/159 (37%), Gaps = 6/159 (3%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+G F D F+ + + +T ST + DA+ A Sbjct: 144 KTRKGDRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVA--MAGQSTHLGDAIGLAIKV 201 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN- 408 S+ED+ +N +K ++LTDG +T E I A ++ +RI IA Sbjct: 202 FEQSSEDKASAEENAKPREKVAIVLTDGNDTGSYVEPIDAAKVAAAKDVRIHMIAMGDPR 261 Query: 409 -KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 +Q ++ A S F+A + EL + + + Sbjct: 262 TVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEI 300 >gi|219520386|gb|AAI43866.1| COL21A1 protein [Homo sapiens] Length = 957 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LKAIANKPSSTYVFYVEDYIAISKI 203 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 68/235 (28%), Gaps = 49/235 (20%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + + A +I + VR+G F I + ++ Sbjct: 97 SMAASDVAPTRIGAAQQAARDLIVGLPPG------VRLGIVSFAATAIVVLPPTDNRLRM 150 Query: 320 IRTIVKTFAIDENEMGSTAIND-------------AMQTAYDTIISSN------------ 354 + I + + GS I + +++ Sbjct: 151 LDAIDRFELQNGTATGSGLIQSLAVLFPDDGIDLEGILFGGESLAPGTGGRSLTEAAAAD 210 Query: 355 -----EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + E ++LL+DG T + + A +G+R+ T+ F + Sbjct: 211 AVRKRDLEQPGAAPGSYRHGAVILLSDGRRTTG-PDPLDAARMAAQRGLRVYTVGFGAVQ 269 Query: 410 TQQEKARYF----------LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L A + +F+A S +L +++R G ER Sbjct: 270 DAGTEGSSLSYEMQVDEPTLRQIATLTDGEYFQAGSAADLTRVYRQLSGRFALER 324 >gi|198438279|ref|XP_002126427.1| PREDICTED: similar to LOC779593 protein [Ciona intestinalis] Length = 1012 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 69/253 (27%), Gaps = 42/253 (16%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 ++ P VD+S K V+ ALA +RS+ D+ Sbjct: 311 GFFAHHFAPPSLAAFPKLVVFVVDTSGSMFGY-KLKQVKQALADSLRSLNNEDH------ 363 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVK-TFAIDENEMGSTAINDAMQTAYDTIIS--- 352 F D S T+ + G T + A+QTA+ + Sbjct: 364 FNIVVFGDTAEPWISGVLSTASTRSINDAITYVDAVSARGGTNMLVALQTAFAIMEPYLP 423 Query: 353 -----------SNEDEVHRM-----------------KNNLEAKKYIVLLTDGENTQDNE 384 + K IV LTDG T+D+ Sbjct: 424 SLPENETMVEDTTPFPTPVPLQPETNHFIRKRATETQTELSNYAKMIVFLTDGRPTKDDV 483 Query: 385 EGIAICNKA-KSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I ++ K G + + TI F + + N FE+ + F Sbjct: 484 GTDDIASRIEKINGGRVNLHTIGFGSLVDMRFLEKLAALNGGVSRRVFESLDAATQIRHF 543 Query: 442 RDRIGNEIFERVI 454 D + + V Sbjct: 544 FDEVSAPVLTDVT 556 >gi|296331311|ref|ZP_06873783.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676297|ref|YP_003867969.1| hypothetical protein BSUW23_18125 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151426|gb|EFG92303.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414541|gb|ADM39660.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 227 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 40/130 (30%), Gaps = 13/130 (10%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G G T I A+ A + + + + +K + L Sbjct: 104 YGFQSFNEQSFLNSLNAIGPTGWTPIAKALNEAKSSF----------DQLDTKGEKVVYL 153 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 LTDGE T I + I + I F + + + +F A + Sbjct: 154 LTDGEETCG-GNPIKTAKELHKDNITVNVIGFDYKEGYKGQLNAIAK--VGGGEYFPAYT 210 Query: 434 THELNKIFRD 443 ++ KIF Sbjct: 211 QKDVEKIFTQ 220 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 19/169 (11%) Query: 289 DNVNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D V +N V++ + R V TAI ++ Sbjct: 122 DAATKFVAALPAQYNVSVVTLSGHPNTLVPPTTDRAPVNQGIKTLELADGTAIASSIDVG 181 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + + +VLL+DG T + +A +KAK Q + I TIAF Sbjct: 182 LEALKQAPAGDDGKQAPG-----LMVLLSDGSETGG-GDPVASADKAKQQNVPIYTIAFG 235 Query: 407 VNKTQQE---------KARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 + L AS +A S +L+ +++ Sbjct: 236 TQNGYVDLDGQRFNVAPDTDMLKRIADASSGKALDAASASQLDDVYKTL 284 >gi|170741047|ref|YP_001769702.1| hypothetical protein M446_2843 [Methylobacterium sp. 4-46] gi|168195321|gb|ACA17268.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 439 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 43/137 (31%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K + G +I L +P+++ G ++ R L++A TA + A+ L Sbjct: 2 RSLKTFLADRAGAAALILGLCLPMLVAGSGAALEYARIHKRRTELQKAVDTAALAAAGEL 61 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + R T + +N +I + Sbjct: 62 TIAGSDTYVRNLAVRTAYEAAGGTDPAVTRATARVQNRRGWVQVEIRETVRSLFGRLLTM 121 Query: 129 YQVVLSSRYDLLLNPLS 145 + LS++ L+ + Sbjct: 122 PTMELSAQATGELSGST 138 >gi|296198464|ref|XP_002746714.1| PREDICTED: collagen alpha-1(XXI) chain isoform 2 [Callithrix jacchus] Length = 954 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I K D +++G ++D + + G + ++ G+T Sbjct: 60 LINITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLMAAVESILYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|296198462|ref|XP_002746713.1| PREDICTED: collagen alpha-1(XXI) chain isoform 1 [Callithrix jacchus] Length = 957 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I K D +++G ++D + + G + ++ G+T Sbjct: 60 LINITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLMAAVESILYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|291486255|dbj|BAI87330.1| hypothetical protein BSNT_05611 [Bacillus subtilis subsp. natto BEST195] Length = 227 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 41/130 (31%), Gaps = 13/130 (10%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G G T I A+ A + + + + +K + L Sbjct: 104 YGFQSFNEQSFLNSLNTIGPTGWTPIAKALNEAKSSF----------DQLDAKGEKVVYL 153 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 LTDGE T I + + + I + I F + + + +F + Sbjct: 154 LTDGEETCG-GNPIKTAKELQKENITVNVIGFDYKEGYKGQLNAIAK--VGGGEYFPVYT 210 Query: 434 THELNKIFRD 443 ++ KIF Sbjct: 211 QKDVEKIFTQ 220 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/307 (12%), Positives = 83/307 (27%), Gaps = 45/307 (14%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR--DEK 228 ID+S + + S LN + T + + Sbjct: 1125 SVVGSAYQCNCPIDYSGIHCEIWQCSTMICLNNGTATYNNTSDGCECECISEETPEGNIT 1184 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSL----------------SEEHFVDSSSLRHVIKKKH 272 + P + E ++ Sbjct: 1185 YVGDHCEYRFPGVCIPSPCGNGTCVELTQTLHDCACDFGFQGENCDHVCNTTISGRLDIQ 1244 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL---------IRTI 323 +V D S+ + K + + A ++ + + Sbjct: 1245 IVIDTSGSLTSAPNKDQVLMNFTNNLANMYDTINQVKIGLTSFSESSVLEMPLDFYNQLE 1304 Query: 324 VKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ + GS T I ++TA + + +S ++L+TDG + + Sbjct: 1305 LQDGVSNMTWQGSFTNITSGVETALNDMDTS-----------DAVDDVMILITDGFQSTN 1353 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + ++AK++G+R++ + F Y ++N + A + EL Sbjct: 1354 TTLMFQMIDQAKAEGVRLIALGFFGEFAFYSPNLYLMTN-----EVYHAANYAELL-AID 1407 Query: 443 DRIGNEI 449 + I I Sbjct: 1408 NTIFETI 1414 >gi|242034241|ref|XP_002464515.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] gi|241918369|gb|EER91513.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] Length = 704 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 57/208 (27%), Gaps = 37/208 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI +++ D R+ F+ + + Sbjct: 250 KLALLKRAMRFVIENLEPSD------RLSVVAFSSSACRLFPLRKMTA-FGQQQSQQAVD 302 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I + ++ A + I+LL+DG ++ + Sbjct: 303 SLVADGGTNIAEGLRKAARVVEDRQ---------ARNPVCSIILLSDGVDSHNLPPRDGS 353 Query: 390 C---NKA----------KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN-S 433 + A + I ++ + + A S +F + Sbjct: 354 APEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHD-HDHDSRAMHAVAQMSSGTFSFIDMV 412 Query: 434 THELNKIFRDRIGNEIFERV----IRIT 457 + IG + V R++ Sbjct: 413 GSSIQDALAQCIGGLLSVSVVAQETRLS 440 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 59/196 (30%), Gaps = 26/196 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + LVR+ +I + D +G +++ V D + R + T Sbjct: 990 RIELVRETCHFLIDQLTADDY------LGIVSYSNTVREDVPLLRMTPEARR-LAHTMIS 1042 Query: 330 DENEMGSTAINDAMQTAYD---TIISSNEDEVHRMKNNLEAKKYI---VLLTDGENTQDN 383 G TA+ ++ S + ++ + + L TDG+ T Sbjct: 1043 SLTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCFLFTDGQATTGP 1102 Query: 384 EEGIAICNKA------KSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTH 435 I + Q I + T F + + L A ++ + Sbjct: 1103 CTVNEIMGQMTSLQSPADQNITVHTFGFGDDHSV-----ELLQGVAEAQSGVYYYISCAD 1157 Query: 436 ELNKIFRDRIGNEIFE 451 ++ F D +G + Sbjct: 1158 DIPSGFGDALGGLLAV 1173 >gi|317122034|ref|YP_004102037.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] gi|315592014|gb|ADU51310.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] Length = 207 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 28/104 (26%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G L++PV+L G+ +D R + A A + A + + Sbjct: 47 QQGAVAAAFVLVLPVLLAAVGLGLDAGRLVVVRAHAQAVADLAGLAAVQEIDEDAFARGE 106 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 A + D V +V + + Sbjct: 107 PALREAAAAATARQWAEDGLRRAFGDAVAEDATVDVVVVNASPA 150 >gi|290960274|ref|YP_003491456.1| lipoprotein [Streptomyces scabiei 87.22] gi|260649800|emb|CBG72916.1| putative lipoprotein [Streptomyces scabiei 87.22] Length = 537 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/285 (12%), Positives = 78/285 (27%), Gaps = 28/285 (9%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T + + +D + D + +N F Q +R G Sbjct: 102 STFALDVDTASYGYARRTLD-DGGLPDPSTVRPEEFVNSFRQDYERP-DGDGFSVTVDGA 159 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 R L P + +D S + L +DAL ++ Sbjct: 160 RTGP--GNWSLVRVGLATRAAGDRQRPPAALTFVIDVSGSMAEPGRLDLAQDALRTMTNR 217 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ D+V F+D + + R ++ ST + ++ Sbjct: 218 LRDDDSVA------VVTFSDEAETVLPMT--RLDGNRGEIREAVAGLEPTDSTNLAAGVE 269 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRI 400 T Y+T + +VLL+D + + I + + GI + Sbjct: 270 TGYETAVEGLRKGATNR---------VVLLSDALANTGSTDADTILERIAGERREHGITL 320 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + ++ E K+F + + Sbjct: 321 FGVGVGSDYGDALMEQLADK---GDGHTTYVSTEEEAEKVFCEEL 362 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 49/144 (34%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ +I + R+G + + ++ Sbjct: 92 NRLEKSKQIISKIIDRLGS-------DRVGIIVYAGNSYPLLPITTDHAAAN-MFLQNAN 143 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAIN+A++ A N+ + +++++L+DGE+ Q+ Sbjct: 144 PDMVSSQGTAINEALELAKTY-----------YNNDEQTNRFLIILSDGEDHQEE--TKQ 190 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 + + G++I TI + Sbjct: 191 VAQNLANNGVKIYTIGVGTARGGP 214 >gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Acyrthosiphon pisum] Length = 919 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 59/173 (34%), Gaps = 8/173 (4%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQ 344 ++ ++ V T N + + + + + K F D ST + DA+ Sbjct: 412 TSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDALN 471 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN---KAKSQGIRIM 401 A+ +I+ + + N K IV LTDGE T E + + I Sbjct: 472 KAH--LIAKLGETRFKDGANT-PKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIY 528 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ F + L+N +EA+ + F I + + V Sbjct: 529 SLGFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEISSPVLSNVT 581 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 58/176 (32%), Gaps = 31/176 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R+A S + T R+G ++D+V+ + V K+ + Sbjct: 528 LDEAREAARSFLDQCDFT-----TTRVGLISYSDQVVLQTDLTDNVRKV-----EAGLAR 577 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T + A++ + KY+V+LTDG + + Sbjct: 578 LEADGTTNLAGALELGRRKL----------ATVPTGHVKYLVVLTDGYPDDPDNALLEAA 627 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDR 444 AK GI I+ I E + +L AS S F EL + F Sbjct: 628 -HAKGSGIEIVAIGTG------EADQAYLDRIASTQAGSIFARK--GELVRAFGHI 674 >gi|229838599|ref|ZP_04458758.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229895667|ref|ZP_04510838.1| putative membrane protein [Yersinia pestis Pestoides A] gi|229899165|ref|ZP_04514308.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229901807|ref|ZP_04516929.1| putative membrane protein [Yersinia pestis Nepal516] gi|229681736|gb|EEO77830.1| putative membrane protein [Yersinia pestis Nepal516] gi|229687567|gb|EEO79640.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229694965|gb|EEO85012.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229701473|gb|EEO89501.1| putative membrane protein [Yersinia pestis Pestoides A] Length = 437 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 70/263 (26%), Gaps = 28/263 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + + ++ S SL + Sbjct: 2 TLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPI 61 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S+ + R+A + + D ++ +++ + Sbjct: 62 NLALVIDRSTSMSGE-RIEKAREAAILAVNMLNITDTLS------VVAYDNHAEVIIPAT 114 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + G TA+ + + N E I+L Sbjct: 115 KVTD--KPALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIIL 163 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++DG+ + A +GI I TI + ++ A S + Sbjct: 164 ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNH 218 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ +L K F + + Sbjct: 219 TFVANSADLEKAFTKEFQDVMSV 241 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 50/204 (24%), Gaps = 40/204 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A+ ++ G + + A Sbjct: 112 SRLEAAKQAVRVIV-------PEAGGTAFGLVALASEAALMVPPTLDREAFFARLNSLQA 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + +AI + TA +ISS KK IVL+TDGEN + Sbjct: 165 GEL--GDGSAIGMGVSTAAYHLISSAA-----------PKKSIVLITDGENNAGSVHPGT 211 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCAS--PNSF 428 A GI + + + + L A + Sbjct: 212 AAQLAFENGITLYVLGVGTRGSVPLEYVDPATGKTYSGYLDSRFDESPLQEIALTAGGRY 271 Query: 429 FEANSTHELNKIFRDRIGNEIFER 452 F S EL G E + Sbjct: 272 FGVESMGELTAAVSAVTGREQTVQ 295 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 48/156 (30%), Gaps = 19/156 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + ++ D TAI +A++ A + + Sbjct: 129 DRVGIVAYAASAYPQLALTTDHSAA-KMFLQDMNTDMLSSQGTAIQEAIRMASNYFDENT 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + +LTDGE+ I +A+ +G+ I TI K Sbjct: 188 P-----------TARLLFILTDGEDH--EMGATEIATEAQEKGVHIYTIGIGTEKGAPIP 234 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + ++ + + + +I Sbjct: 235 IKE-----GGGQTYKRDRNGEVVITKLNRELLQQIA 265 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 55/173 (31%), Gaps = 24/173 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L +++ + ++ D V F+D+ + V + Sbjct: 422 TKIELAKESAMRTVELLRSKDTVG------VVAFDDQPWWVVPP---QKLGNKEEVLSSI 472 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I A+ +A + ++ +++I+L+TDG+ + N Sbjct: 473 QSIPSAGGTNIYPAVSSALEEML-----------KIKSQRRHIILMTDGQ-SAMNSGYQD 520 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + I + ++A + A+ ++ L +F Sbjct: 521 LTDTMVENKITMSSVAVGTDADTHLLQSLAE---AAKGRYYFVEDETTLPAVF 570 >gi|163786709|ref|ZP_02181157.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878569|gb|EDP72625.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 345 Score = 58.5 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 62/204 (30%), Gaps = 46/204 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +I S+ R+G + + + + ++ Sbjct: 110 NRLDKSKQLVTQIINSLAS-------DRVGIIAYAGKAFPQLPITTDYASA-KMFLQNMN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI++A++ A + + +++++DGE+ Sbjct: 162 TDMLSSQGTAISEAIELAKTYYDDEEQ-----------TNRVLIIISDGEDHGGEAV--D 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEK-----------------------ARYFLSNCA-- 423 I +A +GIRI+T+ K L A Sbjct: 209 IAEEANEEGIRILTVGVGDVKGGPIPIKRNGVVLNYKKDNKGETVITRLDETTLKEIAEE 268 Query: 424 SPNSFFEANSTHELNKIFRDRIGN 447 + + ++T+E+ +D + Sbjct: 269 TNGIYINGSNTNEVVDAIKDVLDK 292 >gi|330508298|ref|YP_004384726.1| hypothetical protein MCON_2454 [Methanosaeta concilii GP-6] gi|328929106|gb|AEB68908.1| conserved hypothetical protein, extracellular or membrane bound [Methanosaeta concilii GP-6] Length = 726 Score = 58.5 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 53/188 (28%), Gaps = 24/188 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++A V+ + +D R F S + + K F Sbjct: 292 KMDQAKEAARYVLDHLNPLD------RFAIVSFATTTRSFSPSLEPAAQADKG--KDFLD 343 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 MGST IN AM A ++ LTDG T+ AI Sbjct: 344 RLEAMGSTDINRAMIEAVGLAEEVRPTT-------------LIFLTDGLPTEGVTVTGAI 390 Query: 390 CNK-AKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + A+ +RI + + + + N + E F ++ Sbjct: 391 LDNVAREAPDNVRIFSFGVGDDVDTDLLDQISMDNGGASTYVRPGEEIDEEVSAFYRKVK 450 Query: 447 NEIFERVI 454 + + Sbjct: 451 MPVLSDLS 458 >gi|320014437|gb|ADV98008.1| putative membrane protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 437 Score = 58.5 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 70/263 (26%), Gaps = 28/263 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + + ++ S SL + Sbjct: 2 TLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPI 61 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S+ + R+A + + D ++ +++ + Sbjct: 62 NLALVIDRSTSMSGE-RIEKAREAAILAVNMLNITDTLS------VVAYDNHAEVIIPAT 114 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + G TA+ + + N E I+L Sbjct: 115 KVTD--KPALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIIL 163 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++DG+ + A +GI I TI + ++ A S + Sbjct: 164 ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNH 218 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ +L K F + + Sbjct: 219 TFVANSADLEKAFTKEFQDVMSV 241 >gi|73990557|ref|XP_853279.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1634 Score = 58.5 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 59/169 (34%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +KK D + VR GA + D ++ D+ G+T Sbjct: 828 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYLDDL--STKWEVISVLQKDQPMGGNTY 885 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE+ ++ + +GI Sbjct: 886 TAEALGFSDHMFTEARGSRLHKG-----VPQVLIVITDGESHDADKLNDT-AKALRDKGI 939 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + +F + L IF D + Sbjct: 940 LVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSAS 983 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 56/170 (32%), Gaps = 15/170 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + ++ D + VR+GA F+ + G + I Sbjct: 1011 PDDFKKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPL--GTFIGKKEISFQIENI 1068 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T I A++ D + N + +++LTDG Q +E Sbjct: 1069 QQIFGYTHIGAALREVGDYFRP-----DMGSRINAGTPQVLLVLTDG---QSQDEVAQAA 1120 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + +GI I ++ + + + + ++ EL K+ Sbjct: 1121 EELRHKGIDIYSVGIGDVDD-----QQLIQITGTADKKLTVHNFDELTKV 1165 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 56/180 (31%), Gaps = 21/180 (11%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ VR+GA + D F + Sbjct: 441 GSGSTQATDFHEMKTFLSEVVGMFNIAPQK---VRVGAVQYADSWD--LEFEINKYTNKH 495 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G + Sbjct: 496 DLGKAIENIRQMGGNTNTGAALNFTLGLLQKAKK------QRGNRVPCHLVVLTNGMSKD 549 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 E N+ + + IR+ I +E + L A + + + L I Sbjct: 550 SILEP---ANRLREELIRVYAIGV------KEANQTQLREIAGEDKRVYYVHDFDALKDI 600 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 55/177 (31%), Gaps = 21/177 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S+ +D + +R+G +++ S S + ++ V + + A A Sbjct: 253 SVSALDIKENCMRVGLVTYSNETKVINSLS---RGVNKSEVLQNIQNLSPRAGKAYTGA- 308 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A I + + N + VL+T + + +G+ I T+ Sbjct: 309 --AIRKIRKEVFSARNGSRKNQGVPQIAVLVTH---RPSEDNVTKAAVNLRREGVTIFTM 363 Query: 404 AFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK---IFRDRIGNEIFERVI 454 Q L AS + + EL F ++ N+I V Sbjct: 364 GIEGASDSQ------LEKIASHPAEQHVSKLKTFSELAAHNQTFLKKLRNQITLTVS 414 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + +G T + + ++ +++++L+TDGE Sbjct: 678 MSQNEISNAIDRMAHIGETTLTG------SALTFVSQYFSPAKGARPNVRRFLILITDGE 731 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 ++ + +GI I ++ F N TQ L + P F + Sbjct: 732 AQDIVKDP---AVALRQEGIIIYSVGVFGSNVTQ-------LEEISGRPEMVFYVENFDI 781 Query: 437 LN 438 L Sbjct: 782 LQ 783 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 58.5 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 49/155 (31%), Gaps = 25/155 (16%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++ F+ ++ + G T I M A +TI Sbjct: 450 SRQDQLSIIAFDSVADVILPLQPLPDLAT---IEDALSRLSAGGGTNIRSGMALAAETIA 506 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +++ H ++LLTDG E + ++QG+ + T+A +N Sbjct: 507 TADARIRH-----------VILLTDGV---SETEYADLVANLRAQGVTVSTVAIGLNTDP 552 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A ++ L +I + Sbjct: 553 E------LERVAQIGGGKYYVVRQAEALPQILLEE 581 >gi|226310161|ref|YP_002770055.1| hypothetical protein BBR47_05740 [Brevibacillus brevis NBRC 100599] gi|226093109|dbj|BAH41551.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 477 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 65/214 (30%), Gaps = 20/214 (9%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + + + ++ + + ++A+ + S+ + NV V + Sbjct: 168 FKESYNVEIILDASGSMAAKSNGKTRMDAAKEAIQAFAESLPEQANVALRVYGHKGSGKE 227 Query: 305 R----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 +G+ + + G T I ++Q A + Sbjct: 228 SDKTLSCGSSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLSK-------- 279 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYF 418 + I L++DG T + + + + I I I F V+ Q++ + Sbjct: 280 -LPGDKNTNMIFLVSDGIETC-DGDPVEAAKQLAQSEITPIINVIGFGVDGPGQQQLKEV 337 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 A+ + EL F G EI + Sbjct: 338 AK--AAGGRYVLIQDQKELQDEFNR--GKEIANK 367 >gi|148657117|ref|YP_001277322.1| peptidase M23B [Roseiflexus sp. RS-1] gi|148569227|gb|ABQ91372.1| peptidase M23B [Roseiflexus sp. RS-1] Length = 982 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/404 (9%), Positives = 85/404 (21%), Gaps = 26/404 (6%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + + + + + + + Sbjct: 293 ATIINMTPTRYTMEVQAPPQTGSGAYDLTVSIKGATATEASAVSYNGASNANVILTLDRS 352 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + + ++ + L G+ + +S D Sbjct: 353 GSMSTDNKMPAAHNAARQFVDLMQVGDGVGVVGFDDRVTTAFPLTVITDPPPLSSLIFTD 412 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S L + N + SL Sbjct: 413 TMESGTGKWIPDPPWGLTSVAYRGSAAWTDSPAGNYANNANSVLAIADPIVLPASLTTPA 472 Query: 245 YPGPLDPSLSEEHFV-----------DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + SL A+ + + + Sbjct: 473 LSFWHRYDIENYFDYGRVEVSTDNGATWQSLAAYTGVNTTWSRAVIDLSPYRGQTIRLRF 532 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + A D D R GST+I +Q + + +S Sbjct: 533 RLTTNAYLTRDGWYIDDVTVGPKWVDARADAIAAIGTLTPRGSTSIGGGLQRSQQLLSAS 592 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + IVLL+DG+ + + ++ I + TI + QQ Sbjct: 593 APGRT----------RAIVLLSDGQENTAPYV-SDVLPQIRASQITVHTIGLGTDADQQL 641 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + ++ A +L + I + R IT Sbjct: 642 MLSIAAQ---TGGTYNYAPRPDQL-AGIYNTISGAVSNRQTLIT 681 >gi|12052774|emb|CAB66559.1| hypothetical protein [Homo sapiens] Length = 957 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLFD---------KSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona intestinalis] Length = 1586 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 51/396 (12%), Positives = 112/396 (28%), Gaps = 34/396 (8%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRN--FENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + E + + F+ + R + + A++ Sbjct: 770 ARAEAIYTTARERDLTVGHVAARDQSSSVFQTSTNLAPRKRVIFQLTYQEALQRKRGIYQ 829 Query: 129 YQVVLSSRYDLLLNPLSLFLR-SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y V + + +++ + S+ + + +T G++ S Sbjct: 830 YGVSFRMLQPVSMFSITVSISESVPLSTVNALGLETEQTSVPGPVPLQGITTVRNSPVSA 889 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + ++ ++ FG + + ++ + ++ + +++ Sbjct: 890 VITYTPTSNQQHLISPFGLN--------GKFVIEYDVFRDRTTEMVIDQSYFAHFITSNL 941 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 P +D S I K VR A+ +++ + + D FN V Sbjct: 942 PPMSKRV-VFLIDVSGSMFGI-KIDQVRQAMNTILHGLAETD------FFSVIAFNSSVS 993 Query: 308 SDPS------FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + G I + + G T I A++ A S+ Sbjct: 994 RWSPSGTAAVLASGTTANINSAMNFLNTTVVTRGGTDILQAVEAAIQLFDSAA------T 1047 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYF 418 A ++VLLTDG T AI + ++ I TI F + Sbjct: 1048 GGTNTASDFMVLLTDGRPTDGTVSSTAIISAIRNLNRGRFGINTIGFGTLVDMNLLRKIA 1107 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 N + F +++ F + I I Sbjct: 1108 AQNSGTSIQIFIDLNSYAQISNFYEEISQPILSNTT 1143 >gi|113476847|ref|YP_722908.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167895|gb|ABG52435.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 477 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 50/145 (34%), Gaps = 25/145 (17%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ +F +T +K + G T ++ ++T + +SN Sbjct: 87 VEFSSNSQVLTNFD-----ADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSNTPN-- 139 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 I+L TDG+ +I + + GI ++T+ Sbjct: 140 -----------ILLFTDGQPNDPRAS-KSIAREIREAGINLVTVG--TGDANSNYLTSLT 185 Query: 420 SNCASPNSFFEANSTHELNKIFRDR 444 N P+ F ANS E+++ FR Sbjct: 186 EN---PDLVFFANSG-EIDQAFRAA 206 >gi|218458154|ref|ZP_03498245.1| hypothetical protein RetlK5_01327 [Rhizobium etli Kim 5] Length = 156 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 18/145 (12%), Positives = 38/145 (26%), Gaps = 29/145 (20%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---------- 385 + A I + + +N + +K + + DG Sbjct: 9 YNDQQTSFDEALKGIEGEITGNIGKGTSNADRQKIVFFVADGVADSYKPSGCTSPKGANG 68 Query: 386 -------GIAICNKAKSQGIRI---MTIAFSVNKTQ---------QEKARYFLSNCASPN 426 C K K +GI++ T + + K + CA+P Sbjct: 69 GRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETKIAAKMEECATPG 128 Query: 427 SFFEANSTHELNKIFRDRIGNEIFE 451 +F + T + + + Sbjct: 129 FYFAVSPTEGIEAAMKALFLKIVSA 153 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 61/190 (32%), Gaps = 25/190 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVN----DTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + A I ++ + A D+ D + + + Sbjct: 86 TRMDAAKKAANDTIDTLADSAQTAVIAYGSEESNAPDNRDKGCQDITTLASLGNNKPEDL 145 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + G T I +A++ A + + SS K+ I+L++DG +T Sbjct: 146 EDKINGLEPKGYTPIGNAIKKAAEELGSSG-------------KRNIILVSDGIDTCAPP 192 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKI 440 + GI I T+ F V+ +KA+ L S ++ A+ T L + Sbjct: 193 PVCDVAEDIAGDGIDLAIHTVGFKVD----DKAQKELECISEVSGGTYTSADDTEALTEA 248 Query: 441 FRDRIGNEIF 450 D Sbjct: 249 LTDAAQRVAG 258 >gi|153954292|ref|YP_001395057.1| hypothetical protein CKL_1667 [Clostridium kluyveri DSM 555] gi|219854893|ref|YP_002472015.1| hypothetical protein CKR_1550 [Clostridium kluyveri NBRC 12016] gi|146347173|gb|EDK33709.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219568617|dbj|BAH06601.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 580 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 61/164 (37%), Gaps = 20/164 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D ++ + + + +K+ A D + T + A + S Sbjct: 77 LVAYSDSIVREHNLDSINSSEDKDRIKSMASDISLGQRTDTGRGLLEAVKLMESG----- 131 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CNKA----KSQGIRIMTIAFSVNKTQQ 412 H+ NN I+LL+DG+N + + ++ A KS+G + TI + N Sbjct: 132 HKSGNNP----VIILLSDGKNDPERSQDESLNDLKNALQICKSKGYPVYTIGLNYNG--- 184 Query: 413 EKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L + +S + N+ +L I D + +V Sbjct: 185 TVDKTQLGDISSSTGGKDYITNTASDLPAILTDIYADNSKLKVQ 228 >gi|119613594|gb|EAW93188.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_c [Homo sapiens] Length = 1350 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 41/125 (32%), Gaps = 6/125 (4%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T +N A+ A + SN++ I+ LTDGE T I + + Sbjct: 396 WTDVNSALLAAASVLNHSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQ 453 Query: 396 Q---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + ++AF + R L N +E K + I + Sbjct: 454 ALGHRVSLFSLAFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEISMPLLAD 513 Query: 453 VIRIT 457 V R+ Sbjct: 514 V-RLN 517 >gi|332210152|ref|XP_003254169.1| PREDICTED: collagen alpha-1(XXI) chain [Nomascus leucogenys] Length = 957 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|297678422|ref|XP_002817079.1| PREDICTED: collagen alpha-1(XXI) chain-like [Pongo abelii] Length = 612 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 822 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLED--FGTKLEVISVLQN 879 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ G+T +A+ + E + N + ++++TDGE+ ++ Sbjct: 880 DQAMGGNTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 933 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 934 AKALRDKGILVLAVGI-----AGANPAELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 986 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 70/212 (33%), Gaps = 21/212 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L++V+ Sbjct: 412 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSAVVGMFDIAP 471 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 472 HK---VRVGAVQYADSWD--LEFEINKYSNRQDLGKAIENIRQMGGNTNTGAALNFTLSL 526 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 527 LQKAKK------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGV---- 573 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 +E + L A + + L I Sbjct: 574 --KEANQTQLREIAGEEKRVYYVHDFDALKDI 603 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++ D + VR+GA F+D + G + I + Sbjct: 1017 FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQI 1074 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 1075 FGNTHIGAALRKVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 1126 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1127 RHRGIDIYSVGIGDVDDQQ 1145 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 251 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 307 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 308 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 361 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 362 TIFTLGIEGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 415 Query: 453 VI 454 V Sbjct: 416 VS 417 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 46/155 (29%), Gaps = 19/155 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + A+ + + R+G F + Sbjct: 99 SNSMLAEDFEPNRLERTKYAIGKLFEGL-------QQDRVGLVVFAGEPKVQLPITSDYR 151 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + TAI A++ A E + I+L+TDG Sbjct: 152 MARAFARRID-PSLVSVQGTAIGKALEQALLAFSGDTE---------QSHGRVIILITDG 201 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 EN D+ IA+ +A G++I TI + Sbjct: 202 ENHDDDA--IAVAERAAQMGVKIFTIGIGTPEGAP 234 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 46/400 (11%), Positives = 109/400 (27%), Gaps = 33/400 (8%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 L+ A +A+ L + + + R + + Sbjct: 141 LQDKQPQAAKSAA-ELRADFDAGEIATLKNKSEDSAAALGMAKRAAPAAPGVVAQGQLLA 199 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 + + + P Q+ L + N + ++S+ + + Sbjct: 200 EPMAVAPSPVPPADGHMQIQLDPSRERFANAAANPIKSVATDPV---------STFSADV 250 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S +M D Q + +N F K+ V + + Sbjct: 251 DSASYSFVRRSLTGGAMPDPQSVRVEEMINYFPYDWAGPEKADQPFKATVTVMPTPWNHD 310 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + ++ P + +D S K L++ A ++ +K D V+ Sbjct: 311 TELMHVAIKGYDIAPATAPHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLKADDTVS 370 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + + + ++ + + GST + ++ AY+ Sbjct: 371 ------IVTYAGNAGTVLEPT---RVAEKSKILSAIDRLEAGGSTGGAEGIEAAYNL--- 418 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + L TDG+ +E+ I + + GI + + F Sbjct: 419 -----AKQAFVKDGVNRVM-LATDGDFNVGPSSDEDLKRIIEEKRKDGIFLTVLGFGRGN 472 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + N S ++ E K + G+ + Sbjct: 473 LNDSLMQTLAQN--GNGSAAYIDTLAEAQKTLVEEAGSTL 510 >gi|13471293|ref|NP_102862.1| hypothetical protein mll1222 [Mesorhizobium loti MAFF303099] gi|14022037|dbj|BAB48648.1| mll1222 [Mesorhizobium loti MAFF303099] Length = 638 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/375 (10%), Positives = 95/375 (25%), Gaps = 24/375 (6%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + + E + T ++ + + R Sbjct: 78 DATNVLVAPASPPKSEAAQAGGTAQQTANERAVVAEPAPPAPTGEF--TLNSTAPSTSRL 135 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG- 176 T V K + D + R K+ + E + S + Sbjct: 136 TRVPAAESKLMAPQPTMAPADQIAPQEENRNRVQDFKTNPVHAALEDPVSTFSIDVDTAS 195 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 S S+ + +N F + N V + + Sbjct: 196 YSFVRRSLKEGSVPQADTVRVEEMINYFPYDWKGPDSVSTPFNSTVSVMPTPWNAQTKLM 255 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + ++ P + +D S K L++ A ++ ++ D ++ Sbjct: 256 HVAIKGFDIKPTEQPKANLVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLRADDTISIV-- 313 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + + + GSTA ++ AY + Sbjct: 314 -------TYAGEAGTVLMPTRAAEKDKILNAIDNLTPGGSTAGEAGIKEAYKL----AQQ 362 Query: 357 EVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + N ++L TDG+ D+++ + + + G+ + F + + Sbjct: 363 SFVKDGVNR-----VMLATDGDFNVGQSDDDDLKRLIEQERKSGVFLSVFGFGHDNLNDQ 417 Query: 414 KARYFLSNCASPNSF 428 + N ++ Sbjct: 418 MMQTIAQNGNGTAAY 432 >gi|301059316|ref|ZP_07200243.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300446545|gb|EFK10383.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 527 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 57/186 (30%), Gaps = 26/186 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A +R + + D ++ V + +++ Sbjct: 167 KLEKAKQAAIEALRRLGQKD------MFSVIVYDHNVKTIVP---AQSARNVEWIESRIR 217 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+TA+ + + + +N + I+LL+DG + Sbjct: 218 GIGPGGNTALFGGVSQGASEV--------RKNLSNKYVHR-IILLSDGLANVGPSSPEDL 268 Query: 390 ----CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 K I + TI + + + N S + + S+ +L KIF + Sbjct: 269 GRLGAALIKES-ISVTTIGVGTDYNE-DLMARLSQN--SDGNTYFVESSRDLPKIFAAEL 324 Query: 446 GNEIFE 451 G+ + Sbjct: 325 GDVLNV 330 >gi|295688686|ref|YP_003592379.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] gi|295430589|gb|ADG09761.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] Length = 583 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 44/302 (14%), Positives = 89/302 (29%), Gaps = 28/302 (9%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 G S ++ AE + S + + ++ + +N F R Sbjct: 117 GAASNPVKRVAEEPVSTFSIDVDTAAYANTRRFLNDGAVPPRDAIRVEELVNYFDYGYPR 176 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + V I S + L P P L+ VD+S + Sbjct: 177 PTSAQTPFRATVAIAPSPWSSQRQILHIGLQGYAAPRSEAPPLNLVFLVDTSGSMMGPDR 236 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L + AL +I ++ D R+ + + + + G KL ++ Sbjct: 237 LPLAQKALNVLIDQLRPQD------RVAMVAYAGSAGAVLAPTDGRSKLK---MRCALGA 287 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 GSTA ++ AY + + + ++L+TDG+ + + Sbjct: 288 LQAGGSTAGGRGLELAYGLAKQNFDKKAVNR---------VILITDGDFNVGIADPSRLK 338 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE----LNKIFRD 443 + Q G+ + F + N ++ +E L F Sbjct: 339 DFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQN--GNGVAAYVDTLNEARKLLRDDFES 396 Query: 444 RI 445 + Sbjct: 397 SL 398 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 849 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDD--FGTKLEVISVLQN 906 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ G+T +A+ + E + N + ++++TDGE+ ++ Sbjct: 907 DQAMGGNTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 960 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 961 AKALRDKGILVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 1013 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 69/212 (32%), Gaps = 21/212 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L+ V+ Sbjct: 439 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAP 498 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 499 HK---VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSL 553 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 554 LQKAKK------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGV---- 600 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 +E + L A + + L I Sbjct: 601 --KEANQTQLREIAGEEKRVYYVHDFDALKDI 630 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++ D + VR+GA F+D + G + I + Sbjct: 1044 FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GAFIGEKEISFQIENIKQI 1101 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 1102 FGNTHIGAALRKVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 1153 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1154 RHRGIDIYSVGIGDVDDQQ 1172 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 278 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 334 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 335 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 388 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 389 TIFTLGIEGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 442 Query: 453 VI 454 V Sbjct: 443 VS 444 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 43/124 (34%), Gaps = 18/124 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 708 MSQSDISNAIDQMAHIGQTTLTG------SALSFVSQYFSPTKGARPNVRKFLILITDGE 761 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 +E + +G+ I ++ F N TQ L + P F + Sbjct: 762 AQDIVKEPAI---ALRQEGVIIYSVGVFGSNVTQ-------LEEISGRPEMVFYVENFDI 811 Query: 437 LNKI 440 L ++ Sbjct: 812 LQRV 815 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 53/149 (35%), Gaps = 15/149 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + +D +D V++G ++D + L +++ MGST Sbjct: 256 NFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQ--SSLKSVVLEQIRNLPYNMGSTN 313 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + + ++L+TDGE N+E ++ K G+ Sbjct: 314 TGSALEFIRANYLTEMSGSRAKDG----VPQIVILVTDGE---SNDEVQDAADQLKRDGV 366 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + ++ + L AS Sbjct: 367 FVYVVGINI------QDVQELQKIASEPF 389 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 21/250 (8%), Positives = 61/250 (24%), Gaps = 13/250 (5%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + A + + + + S + D + +N + + + Sbjct: 951 ALSKMALVFRNTARTNVAQYLIVITDGQSSDPVADAAQGLRDTGINIYAIGVRDA--NTT 1008 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 R + + + + S++ + Sbjct: 1009 ELEEIANNRVFFTDDFHFLKSIHQEVVRDICSFENCRSQKADIIFLIDGSESISSEDFEK 1068 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 V R + + + D +++G F+ + + + + + Sbjct: 1069 IKDFVKRMVNQSNIGADKIQIGLLQFSSTPREEFTLKNNYSSKDE-MCRAISNVTQINSG 1127 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ +Y++++TDG++ D + + Sbjct: 1128 TETGKALNFTL-------PFFDISQGGRPGVHQYLIVITDGDSHDDIVSP---AKALRDR 1177 Query: 397 GIRIMTIAFS 406 I I I Sbjct: 1178 NIIIFAIGVG 1187 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 13/73 (17%), Positives = 26/73 (35%), Gaps = 9/73 (12%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R +Y++++TDG+++ + GI I I ++ L Sbjct: 960 RNTARTNVAQYLIVITDGQSSDP---VADAAQGLRDTGINIYAIGV------RDANTTEL 1010 Query: 420 SNCASPNSFFEAN 432 A+ FF + Sbjct: 1011 EEIANNRVFFTDD 1023 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 53/149 (35%), Gaps = 15/149 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + +D +D V++G ++D + L +++ MGST Sbjct: 256 NFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQ--SSLKSVVLEQIRNLPYNMGSTN 313 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + + ++L+TDGE N+E ++ K G+ Sbjct: 314 TGSALEFIRANYLTEMSGSRAKDG----VPQIVILVTDGE---SNDEVQDAADQLKRDGV 366 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + ++ + L AS Sbjct: 367 FVYVVGINI------QDVQELQKIASEPF 389 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 21/250 (8%), Positives = 61/250 (24%), Gaps = 13/250 (5%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + A + + + + S + D + +N + + + Sbjct: 748 ALSKMALVFRNTARTNVAQYLIVITDGQSSDPVADAAQGLRDTGINIYAIGVRDA--NTT 805 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 R + + + + S++ + Sbjct: 806 ELEEIANNRVFFTDDFHFLKSIHQEVVRDICSFENCRSQKADIIFLIDGSESISSEDFEK 865 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 V R + + + D +++G F+ + + + + + Sbjct: 866 IKDFVKRMVNQSNIGADKIQIGLLQFSSTPREEFTLKNNYSSKDE-MCRAISNVTQINSG 924 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ +Y++++TDG++ D + + Sbjct: 925 TETGKALNFTL-------PFFDISQGGRPGVHQYLIVITDGDSHDDIVSP---AKALRDR 974 Query: 397 GIRIMTIAFS 406 I I I Sbjct: 975 NIIIFAIGVG 984 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 13/73 (17%), Positives = 26/73 (35%), Gaps = 9/73 (12%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R +Y++++TDG+++ + GI I I ++ L Sbjct: 757 RNTARTNVAQYLIVITDGQSSDP---VADAAQGLRDTGINIYAIGV------RDANTTEL 807 Query: 420 SNCASPNSFFEAN 432 A+ FF + Sbjct: 808 EEIANNRVFFTDD 820 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 53/184 (28%), Gaps = 26/184 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R A +++ + D T F+ G R ++ Sbjct: 519 IDSARRAAQALVDRLAPAD------DFSLTTFSSDAEVVI--EDGPVGPRRAAIRRAIEG 570 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E G T I + Y + R +A + ++L++DG T + Sbjct: 571 LREGGGTNIGAGLSLGY--------AQASRPGIPEDAVRVVLLVSDGRATSGLTHSERLA 622 Query: 391 NKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRI 445 A +GI+ + + Q +S AS + ++ ++ + Sbjct: 623 WLALDAFQRGIQTSALGLGDDFDGQ-----LMSAIASDGAGGYYYLRHPEQIAPALSTEL 677 Query: 446 GNEI 449 + Sbjct: 678 DKRL 681 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 41/130 (31%), Gaps = 13/130 (10%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G G T I A+ A + + + + +K + L Sbjct: 104 YGFQSFNEQSFLNSLNTIGPTGWTPIAKALNEAKSSF----------DQLDAKGEKVVYL 153 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 LTDGE T I + + I + I F + + + +F A + Sbjct: 154 LTDGEETCG-GNPIKTAKELQKDNITVNVIGFDYKEGYKGQLNAIAK--VGGGEYFPAYT 210 Query: 434 THELNKIFRD 443 ++ KIF Sbjct: 211 QKDVEKIFTQ 220 >gi|189066649|dbj|BAG36196.1| unnamed protein product [Homo sapiens] Length = 957 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 63/176 (35%), Gaps = 20/176 (11%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 R + ++ ++ D + VR+G ++ I + + + Sbjct: 840 RSFVRDLVSNLMSGD---NDVRVGVAEYSSTYTQIVFPFSSSQSAIDSSLSSMI---QTA 893 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-ICNKA 393 G+TA ++ A D I S+ + ++L+TDGE + +E+ I + Sbjct: 894 GATATGTSLGEAADDIGSTARSSAA---------RVLILMTDGETSDGDEQNIDPSVDAL 944 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ G+ I I + + L S + F + +L+ IG + Sbjct: 945 RALGVSITAIGVGNSAS----ESELLQIAGSSDHVFNNIAFVDLSSFINQIIGQIL 996 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 64/201 (31%), Gaps = 17/201 (8%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 G S S+ F+ SS + L++D ++++RS+ Sbjct: 429 GDQGVPGLPGPEGGECIVSASDVAFILDSSDSIDDRDYQLLKDFTSALVRSLTVSSTNAR 488 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F+ + FS+ LI I + TA +A++ A I S Sbjct: 489 VAIEL---FSSEPQIETGFSYDESYLISVINSLPHLKL----GTATGEALRMARQDIFSD 541 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 N+ + +++TDG + +D + + G+++ + T + Sbjct: 542 NDALFRSFSVPA----FAIVITDGNSLEDASYVAEQARRLREHGVQVFALGVGSQITVSQ 597 Query: 414 KARYFLSNCASP-NSFFEANS 433 L + A F Sbjct: 598 -----LVDIAGDNARVFGVAD 613 >gi|163857470|ref|YP_001631768.1| putative lipoprotein [Bordetella petrii DSM 12804] gi|163261198|emb|CAP43500.1| putative lipoprotein [Bordetella petrii] Length = 582 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/409 (10%), Positives = 100/409 (24%), Gaps = 37/409 (9%) Query: 56 AAQTAIITASVPLIQSLEEV--SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A +A A + P E T + Sbjct: 27 AQDSAADAARALTGAGKAGNPGTVPPAVPSAPPAPPAAEADAGAPRARPGAALTRQYAPQ 86 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 ++ + + + N + ++ + + + + Sbjct: 87 AYSAQPAAVSLLPAPSGYYAPPQAEERENYARY------RDNPVVAAQEQPVSTFGADVD 140 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + + + +N F + + +P Sbjct: 141 TGSYTNVRRLLNEGRLPPPDAVRAEEFINYFDYGYATPDSRQQPFSIITEVSAAPWNPQR 200 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + P+ + VD+S K L++ AL ++ ++ D V Sbjct: 201 QLLKIGIQGYRVAPQDIPAANLVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAI 260 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 S + + + + GST + AY + Sbjct: 261 VTYA----------GQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLDLAYA--QA 308 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNK 409 + + I+L +DG+ + + +K + GI + T+ Sbjct: 309 AKGFVKGGVNR-------ILLASDGDFNVGATDLEDLKDKIARQRQGGIALTTLGVGGGN 361 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A A S+ +S L + R + ++ ++ I + Sbjct: 362 FNDALAMQL--ADAGNGSYHYLDS---LREA-RKVLAAQMSSTLLTIAR 404 >gi|119624855|gb|EAX04450.1| collagen, type XXI, alpha 1, isoform CRA_c [Homo sapiens] Length = 552 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|119624854|gb|EAX04449.1| collagen, type XXI, alpha 1, isoform CRA_b [Homo sapiens] Length = 567 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|119624853|gb|EAX04448.1| collagen, type XXI, alpha 1, isoform CRA_a [Homo sapiens] Length = 429 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|114607957|ref|XP_001157649.1| PREDICTED: collagen alpha-1(XXI) chain isoform 3 [Pan troglodytes] Length = 957 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|114607959|ref|XP_001157475.1| PREDICTED: collagen, type XXI, alpha 1 isoform 1 [Pan troglodytes] gi|114607961|ref|XP_518554.2| PREDICTED: collagen, type XXI, alpha 1 isoform 4 [Pan troglodytes] gi|114607963|ref|XP_001157591.1| PREDICTED: collagen, type XXI, alpha 1 isoform 2 [Pan troglodytes] Length = 954 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|18780273|ref|NP_110447.2| collagen alpha-1(XXI) chain precursor [Homo sapiens] gi|74752071|sp|Q96P44|COLA1_HUMAN RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|15593270|gb|AAL02227.1|AF414088_1 collagen XXI [Homo sapiens] gi|19310967|gb|AAL86699.1|AF438327_1 alpha 1 type XXI collagen precursor [Homo sapiens] gi|55665071|emb|CAH73913.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202573|emb|CAI22496.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202937|emb|CAI22395.1| collagen type XXI alpha 1 [Homo sapiens] gi|116496597|gb|AAI26109.1| Collagen, type XXI, alpha 1 [Homo sapiens] gi|215434893|gb|ACJ66843.1| alpha 1 type XXI collagen precursor [Homo sapiens] Length = 957 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|17974510|gb|AAL50033.1|AF330693_1 alpha 1 chain-like collagen COLA1L precursor [Homo sapiens] gi|55665070|emb|CAH73912.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202574|emb|CAI22497.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202938|emb|CAI22396.1| collagen type XXI alpha 1 [Homo sapiens] Length = 954 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 52/163 (31%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G + + G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + ++ A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDDVKDAAQAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|313235671|emb|CBY11123.1| unnamed protein product [Oikopleura dioica] Length = 362 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/269 (12%), Positives = 78/269 (28%), Gaps = 17/269 (6%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR-DEKLSPYMVSCNKSL 240 I FSR + D + P N ++ + + P + Sbjct: 25 SILFSRHLDDVTEYCDNNPSNQKTFHNRDAKTRHNEVCDTKSVWEVIRDHPDYLIGAGEQ 84 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVI-------KKKHLVRDALASVIRSIKKIDNVND 293 + S+ F + + + + + ++ V Sbjct: 85 PVDDFDSVNADLWSKTQFTVLRQNNTKTVLAIDISTSMGIGKRWVNVKKAATQYLEAVPH 144 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G F+ + +K T+I A++ + + + +S Sbjct: 145 DAHVGIVLFHRVAYVKVPLKQIENADSSDFLKNRLEKVELGVGTSIASALKVSMNVLQTS 204 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + N I+LL+DG+ + D + + + + TIAF + +Q+ Sbjct: 205 GPKDSRDSGGN------IILLSDGQESHDPRIDDELIQELIDNKVTVNTIAFGTDASQKL 258 Query: 414 KARYFLSNCASPNSFFEANSTHE-LNKIF 441 + + + F + + + L F Sbjct: 259 EEVSSRTK--GSSYFSDPENPNSILQDAF 285 >gi|154089854|emb|CAO81739.1| collagen type VI alpha 6 [Homo sapiens] Length = 631 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 64 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDD--FGTKLEVISVLQN 121 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ GST +A+ + E + N + ++++TDGE+ ++ Sbjct: 122 DQAMGGSTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 175 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 176 AKALRDKGILVLAVGIDG-----ANPVELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 228 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 10/139 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++ D + VR+GA F+D + G + I + Sbjct: 259 FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQIENIKQI 316 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 317 FGNTHIGAALREVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 368 Query: 394 KSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 369 RHRGIDIYSVGIGDVDDQQ 387 >gi|326504154|dbj|BAK02863.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 651 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 54/205 (26%), Gaps = 40/205 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI + D R+ F+ R + + + + Sbjct: 184 SKLALLKQAMRFVIDILGPDD------RLSVVSFSSRARRVTRLT-RMSDAGKALCVRAV 236 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 T I + ++TA + +VLL+DG++ Sbjct: 237 ESLTARTGTNIAEGLRTAAKVLDERRHRNGVSC---------VVLLSDGQDNYTPMRQAF 287 Query: 382 DNEEGIAIC----------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFF 429 A + + T F + + A+ +F Sbjct: 288 GRGLPNYAALLPPSFARTGTGAGDRATPVHTFGFG-----NDHDATAMHAVSEATGGTFS 342 Query: 430 EANSTHELNKIFRDRIGNEIFERVI 454 + + F +G + V Sbjct: 343 FIENEAVIQDAFAQCVGGLLSVVVQ 367 >gi|125527010|gb|EAY75124.1| hypothetical protein OsI_03018 [Oryza sativa Indica Group] Length = 589 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 57/196 (29%), Gaps = 37/196 (18%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + AL VIR + +D R+ F+ + + +K Sbjct: 85 IDKAKTALQFVIRKLSDLD------RLCIVTFSTNATRLCPLRFVTAAA-QAELKALVDG 137 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAI 389 G T + ++T+ + R + ++L++DG +N + + + Sbjct: 138 LKADGMTNMKAGLETSMSVV-DGRRLAAGRAVS-------VMLMSDGYQNDGGDARDVHL 189 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRD--- 443 + + + T F + L A +F + L F Sbjct: 190 ------KNVPVYTFGFGASHDSN-----LLEAIARKSLGGTFNYVADSANLTGPFSQLLG 238 Query: 444 ----RIGNEIFERVIR 455 I ++ V R Sbjct: 239 GLLTIIAQDLELTVTR 254 >gi|78484443|ref|YP_390368.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362729|gb|ABB40694.1| Hypothetical protein; predicted integral membrane protein with a von Willebrand factor type A domain [Thiomicrospira crunogena XCL-2] Length = 363 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 46/162 (28%), Gaps = 23/162 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R G + D + + R ++K A +A+ A S + Sbjct: 161 RFGFILYADDAYTLMPLTSD-ATTARLMLKRLKPYLAGRTDEATGEALGLALQQAEKSTD 219 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE-- 413 + +VL++DG N A+ I I TI N + Sbjct: 220 STE---------NRIVVLISDGSTRDSRLPIAEAINYAQGLNIPIYTIGVGANSKDADKR 270 Query: 414 ---------KARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +++ S +L K+ + Sbjct: 271 EFRGLLYEALESSSLKQIADQTQGRYYQIGSGQDLQKVLQAI 312 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 62/201 (30%), Gaps = 31/201 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA S + V +G F + + + Sbjct: 109 TRLRAAQDAARSFAEGLTPG------VNLGLISFAGTATVLAAPTTEREGV-----VHAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-----ENTQDN 383 + STA + + A + + IVL+TDG ++ Sbjct: 158 ENLKLAQSTATGEGIFAA---LQAIESFSAVIGGAEGPPPARIVLMTDGKQTVPQDEYAP 214 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQ---------QEKARYFLSNCA--SPNSFFEAN 432 AK +GI I TI+F + E + A S F++A Sbjct: 215 RGAFTAAGVAKQKGIPITTISFGTSYGSVEIDGTRVPVEVDDASMREIARLSGGDFYKAA 274 Query: 433 STHELNKIFRDRIGNEIFERV 453 + EL +++ D +G +I + Sbjct: 275 TAEELKQVY-DSLGEQIGYEI 294 >gi|326530406|dbj|BAJ97629.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 657 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 54/205 (26%), Gaps = 40/205 (19%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI + D R+ F+ R + + + + Sbjct: 190 SKLALLKQAMRFVIDILGPDD------RLSVVSFSSRARRVTRLT-RMSDAGKALCVRAV 242 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 T I + ++TA + +VLL+DG++ Sbjct: 243 ESLTARTGTNIAEGLRTAAKVLDERRHRNGVSC---------VVLLSDGQDNYTPMRQAF 293 Query: 382 DNEEGIAIC----------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFF 429 A + + T F + + A+ +F Sbjct: 294 GRGLPNYAALLPPSFARTGTGAGDRATPVHTFGFG-----NDHDATAMHAVSEATGGTFS 348 Query: 430 EANSTHELNKIFRDRIGNEIFERVI 454 + + F +G + V Sbjct: 349 FIENEAVIQDAFAQCVGGLLSVVVQ 373 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 58/157 (36%), Gaps = 18/157 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + F+D V SF+ + ++ + + +TA+ DA+ A + Sbjct: 109 REQDEFDLMDFSDTVREVVSFTN-----DKKRIENGLNELRKGDATAVYDAVYLASQRLG 163 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM---TIAFSVN 408 +N R + +VL+TDG+NT +A+ G+ + + + Sbjct: 164 ETNAGGGRR--------RVLVLITDGDNTVHGVGYDQAVEQAQRAGVMVYALIVVPIEAD 215 Query: 409 KTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRD 443 + + L A+ +++ N +L K++ Sbjct: 216 AGRNTGGEHALIQMATDTGGNYYYVNDPRDLAKVYAK 252 >gi|282892468|ref|ZP_06300802.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497750|gb|EFB40114.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 373 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 51/167 (30%), Gaps = 15/167 (8%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 +G F + +I + K + + T+I A+ + I S Sbjct: 155 HNDLVGLIGFARTAQVLSPLTLDHQAIIDQLNKFSIVKHQDEDGTSIGYAIFKTANLIAS 214 Query: 353 SNEDEVHRMKNNLEAKK--YIVLLTDGENTQDNEE---------GIAICNKAKSQGIRIM 401 + + + K ++++TDG + + AK QG+R+ Sbjct: 215 TKHFAEELKEASPYTIKNSIMLIVTDGFQDPNPLDREDQYRSIELEDAAKYAKEQGVRVY 274 Query: 402 TIAFSVNKTQQEKARY--FLSNCA--SPNSFFEANSTHELNKIFRDR 444 I +E + + F+ + EL I+ D Sbjct: 275 IINVEPRIASEEFGSERRVMQKVTEITGGKFYLLDHIEELKNIYADI 321 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 58.1 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 58/224 (25%), Gaps = 26/224 (11%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH--- 272 + + I + +C + M + V + Sbjct: 244 TGDHDCEHICISTPGSFKCACREGFTLMNDSRSCSACSNAATDVVFLIDGSKSVRPENFE 303 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LV+ + +I + + ++ V + G H R++ + Sbjct: 304 LVKKWINLIIDKLDVSETNTHVG---LVQYSSTVKQEFPL--GRHNSKRSLKEAVKRMNY 358 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+ D N+ + K ++ TDG + + K Sbjct: 359 MERGTMTGHALSFLVDNSFGPNQGARPGV------PKVGIVFTDGRSQDYIGD---AAKK 409 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 AK+ G ++ + L AS + +F Sbjct: 410 AKALGFKMYAVGVG------NAVEDELREIASEPIADHYFYTAD 447 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + D R+G + RV ++ S K + + T A+Q Sbjct: 83 IDGLSVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST--GTMTGLAIQ 140 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + + K +++TDG QDN I +A+ GI I I Sbjct: 141 FAMNVAFSEAEGGR----KSPDISKVAIIVTDGRP-QDNIR--DIAARAREAGIEIFAIG 193 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L K F++ Sbjct: 194 VGR------VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQEAF 234 >gi|332667371|ref|YP_004450159.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332336185|gb|AEE53286.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 425 Score = 57.7 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 60/187 (32%), Gaps = 25/187 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A ++ ++ D V+ ++D V + L + ++ Sbjct: 59 DKIAYAKKAAQFIVDNLSPEDRVS------IVQYDDIVEVLSPSAP---VLNKQELRQRI 109 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-- 386 T ++ M Y+ + + + ++LL+DG + Sbjct: 110 ALMEARNMTNLSGGMLAGYEQVERTKQARFVNR---------VLLLSDGLANHGITDPTV 160 Query: 387 --IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + K ++ GI + T F V E LS +++ S ++ IF + Sbjct: 161 LQQMVQEKFRNAGIAVST--FGVGADFNELLMTSLSE-YGGANYYFIESPDKIPGIFAEE 217 Query: 445 IGNEIFE 451 + + Sbjct: 218 LRGLLAV 224 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 57.7 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 59/224 (26%), Gaps = 26/224 (11%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH--- 272 + + I + +C + M + V + Sbjct: 226 TGDHDCEHICISTPGSFKCACREGFTLMNDSRSCSACSNAATDVVFLIDGSKSVRPENFE 285 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LV+ + +I + + ++ V + G H R++ + + Sbjct: 286 LVKKWINLIIDKLDVSETNTHVG---LVQYSSTVKQEFPL--GRHNSKRSLKEAVKRMDY 340 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+ D N+ + K ++ TDG + + K Sbjct: 341 MERGTMTGHALSFLVDNSFGPNQGARPGV------PKVGIVFTDGRSQDYIGD---AAKK 391 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS 433 AK+ G ++ + L AS + +F Sbjct: 392 AKALGFKMYAVGVG------NAVEDELREIASEPIADHYFYTAD 429 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + D R+G + RV ++ S K + + T A+Q Sbjct: 65 IDGLSVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST--GTMTGLAIQ 122 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + + K +++TDG QDN I +A+ GI I I Sbjct: 123 FAMNVAFSEAEGGR----KSPDISKVAIIVTDGRP-QDNIR--DIAARAREAGIEIFAIG 175 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L K F++ Sbjct: 176 VGR------VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQEAF 216 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 52/173 (30%), Gaps = 29/173 (16%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G F + + + + + T I A++ A + Sbjct: 127 TDRIGIIAFAGTATLQCPLTDDYEAVRIVLNGLNSNTVE-IPGTDIGSALRLAENAFPEG 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT--- 410 K +VL++DGE+ Q + K++GIR+ T+ + Sbjct: 186 ------------SKSKTLVLISDGEDLQHSALRE--ARILKTKGIRVYTMGVGSPEGTII 231 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + L A + ++ E ++ RI R Sbjct: 232 RHPETGEEVKSKLDEATLQEIARITEGEYYRVTPGGEEIQLILKRIYESESTR 284 >gi|325914146|ref|ZP_08176499.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] gi|325539649|gb|EGD11292.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] Length = 525 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 38/297 (12%), Positives = 79/297 (26%), Gaps = 24/297 (8%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +Q + S S S+ + +N F Sbjct: 63 VQAADNPVSTFSIDVDTGSYSNVRRYLTSGSLPPVDAVRVEEMINYFRYDDPAPRDG-QP 121 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + + V + P+ + VD S K L++ + Sbjct: 122 FAVRTELAPTPWNHDSVLLRIGITGRAVAASAMPAANLVFLVDVSGSMGAPDKLPLLQSS 181 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L + R ++ D R+ + + G + + G T Sbjct: 182 LKLLTRQLRAQD------RITLVTYAGNTAVVLPPTPG---NQQARIVEAIDSLQSGGGT 232 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAK 394 A ++ AY + + R N I+L TDG+ + + + + Sbjct: 233 AGASGIELAYK----AAQQSYLRDGINR-----ILLATDGDFNVGVTDFDTLKGMVAEKR 283 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 G+ + T+ F A ++ +S E K+ +G + Sbjct: 284 RSGVALSTLGFGTGNYNDTLMEQLAD--AGDGAYAYIDSPLEARKVLTHELGATLET 338 >gi|162419860|ref|YP_001607152.1| von Willebrand factor type A domain-containing protein [Yersinia pestis Angola] gi|162352675|gb|ABX86623.1| von Willebrand factor type A domain protein [Yersinia pestis Angola] Length = 472 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/263 (11%), Positives = 69/263 (26%), Gaps = 28/263 (10%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 L A + + ++ S SL + Sbjct: 37 TLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPI 96 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D S+ + R+ + + D ++ +++ + Sbjct: 97 NLALVIDRSTSMSGE-RIEKAREEAILAVNMLNITDTLS------VVAYDNHAEVIIPAT 149 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ + + G TA+ + + N E I+L Sbjct: 150 KVTD--KPALIASIQQHIHPRGMTALFAGVSMGIGQVDK---------HLNREQVNRIIL 198 Query: 374 LTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 ++DG+ + A +GI I TI + ++ A S + Sbjct: 199 ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYN-----EDLMTAIAGYSDGNH 253 Query: 429 FEANSTHELNKIFRDRIGNEIFE 451 ++ +L K F + + Sbjct: 254 TFVANSADLEKAFTKEFQDVMSV 276 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 49/386 (12%), Positives = 111/386 (28%), Gaps = 14/386 (3%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + ++ + + T Q +Y + + R Sbjct: 1354 SASDYTATYDQATRTVTVQFRADYELEKDATYSISFDVKPSEEAYNKFETDGGYYRNEEG 1413 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +L+ + S ++ + +K+ + + R Sbjct: 1414 GYILADGEEPSEENQYKGDEKTDDPSVPEDSQTSSGKPGFPANKKATLQYTYDGGTGRFE 1473 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P+ + + + + + +P V+ Y + + Sbjct: 1474 YPHPVLQIPEPVLPDEYNKRIEPNDDGTYSLTLDVTGIEGNPATVTTK---YPVDLVFVI 1530 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D SLS ++ +D ++ + +D + + I D + + + S Sbjct: 1531 DKSLSMDYDIDGDEIKWWEDETESRKDIVNDALDEIIP-DLCSQQYDIQIAGYQFSGSST 1589 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 W V N +T + A+ A D + + ++ +N KK Sbjct: 1590 RVLDWSRE---EQQVLNNLKISNTSYNTEPSQALADALDMLKTGSQ----AHQNQSNVKK 1642 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA---RYFLSNCASPN 426 Y++ +TDGE T+ E +K G I TI S + + N + Sbjct: 1643 YLIFMTDGEPTESEELSYYAISKNPVPGASIYTIGVSSDASTDLMEGIRSTAEGNGMTAP 1702 Query: 427 SFFEANSTHELNKIFRDRIGNEIFER 452 + F+ S + F I Sbjct: 1703 ATFKGTSAQLIKDAFTQIKDEIISTS 1728 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/300 (13%), Positives = 78/300 (26%), Gaps = 28/300 (9%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 S + + + +K + + + Sbjct: 1028 SSGQRGVPTNADAYFSFGEDGTTKVRFPHPVIPAPATNHPEYKKYIKDNGNGTYTLTLDV 1087 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + + K +D S S + S + ++ +V AL + Sbjct: 1088 KSDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDA---RREVVNSALELFFNQLS 1144 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D F+D WG + + + + T+ T Sbjct: 1145 DGDYNIQFG---GYKFSDSGERVNFNDWGWQ--DKYWETDTSNALSHLKLTSWETDGSTY 1199 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-----EEGIAICNKAK--SQGIR 399 + S + ++ K+Y++ LTDGE Q++ EE + K G Sbjct: 1200 PSQTLRSAISALENVELGENGKRYLIFLTDGEPGQNSYSFSKEEAENCYSAIKNLDSGTT 1259 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSF--------FEANSTHELNKIFRDRIGNEIFE 451 I + F+ + S +F F NS ELN F Sbjct: 1260 FYAIQV-----ANSDSHGFMESMVSNANFVDGVTAQKFVGNSADELNAAFSQMAAEISGS 1314 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 10/108 (9%), Positives = 30/108 (27%), Gaps = 11/108 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I + + + + + + +G++ + +K + G T Sbjct: 695 FIDDLFEQSPNSRFSIVTYSTYAS--TELDWTEYGMNGSGQQTIKEAIGELQANGGTNYE 752 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + A + + ++ L+DG+ T + Sbjct: 753 AGLYQAMEVLKERGNSSNIP---------VVIFLSDGKPTYYYSDVDE 791 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/271 (12%), Positives = 75/271 (27%), Gaps = 24/271 (8%) Query: 155 SWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + I+ E + S + S + + +N F Sbjct: 230 TNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQKAAVRLEEMVNYFPYDYPLPSA 289 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + + + + + + + + P + +D S K L Sbjct: 290 ATAPFKPTITVIPAPWNQAKRLVHIGIKALPLAHPPK--ANLVFLLDVSGSMGSPDKLPL 347 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 V+ ++ ++ ++ D V+ + + + + + N Sbjct: 348 VKQSMELLLSGLQPTDTVS------IVVYAGAAGTVLEPTP---VAEQQKILAALDRLNA 398 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---C 390 GSTA ++ AY E R N I+L TDG+ + + Sbjct: 399 GGSTAGAQGIELAYQL----AEANYQRDAVNR-----IILATDGDFNVGIADPEQLKGYV 449 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++ GI + + F + N Sbjct: 450 ERKRANGIELSILGFGSGNYNDALMQQLAQN 480 >gi|296133928|ref|YP_003641175.1| hypothetical protein TherJR_2435 [Thermincola sp. JR] gi|296032506|gb|ADG83274.1| hypothetical protein TherJR_2435 [Thermincola potens JR] Length = 621 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 42/113 (37%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ +K+ G + ++ +M+ G+ +D+ R ++ L++ A A + Sbjct: 8 LERFLKNEKGTVTVYLVIVFMIMVIFIGLFIDLARIKTAQNQLRRVANAAACSVLADYHT 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 S ++ Q +Y+ N + +NF + R V + Sbjct: 68 STKQDFGLFTYKGANYDQDFAKYVKANLTFSADQNFNLLDYRYEGSKADVSNS 120 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 71/206 (34%), Gaps = 29/206 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + +++ S I S+ +G F+ S + + Sbjct: 97 SMAATDVAPDRLTSAKESAQSFIESL------PPRFNVGLVAFSSVATVVASPTQDHQAV 150 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +I TAI + + + I + IVLL+DGEN Sbjct: 151 ADSIANLTISS-----GTAIGEGVFASLQAIR-----SFDEKATDDPPPAAIVLLSDGEN 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---------QQEKARYFLSNCA--SPNSF 428 T A ++A++ G+ + TIAF + + L A + F Sbjct: 201 TSGRPVAAAA-DEARAAGVPVSTIAFGTGVSIIEIEGHYVPANIDKETLKELAMTTGGRF 259 Query: 429 FEANSTHELNKIFRDRIGNEIFERVI 454 +EA ST EL ++ D IG+ + + Sbjct: 260 YEAESTGELKDVYAD-IGSSLGTETV 284 >gi|124004754|ref|ZP_01689598.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] gi|123989877|gb|EAY29406.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] Length = 354 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 65/220 (29%), Gaps = 37/220 (16%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + ++ L+++I ++K R+G F+ ++ Sbjct: 114 SLSMKATDIPPSRLEKIKYELSNIINTLKS-------DRIGLVIFSSSAFMHCPLTYDKG 166 Query: 318 KLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 L T ++ + E K E K +VL +D Sbjct: 167 ALNLFTQILNTNLMPIGNAGTDFYAPLEL---VLKKYQEANKSNRKQQNEYAKVVVLFSD 223 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR------------------YF 418 GE D AI ++ K IR+ T+ + + Sbjct: 224 GEEFGD--RYTAIVDQYKQNNIRVFTVGVGSLQGGKIPTSLGFKKDKKGKVVLSKLSTTS 281 Query: 419 LSNCA--SPNSFFEANSTH----ELNKIFRDRIGNEIFER 452 L A + FFE + T EL ++ G ++ R Sbjct: 282 LQTIAEQTNGRFFEVSETKNEIPELINTIQEIKGQKLDVR 321 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 52/426 (12%), Positives = 110/426 (25%), Gaps = 71/426 (16%) Query: 41 VDVVRWSYYEHALKQ-AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 ++ E +L+ ++ A++ A + + F + R Sbjct: 81 INYASAVRNELSLQGPHSKAALLQAVRRIQPLSTGTMTGLAIQFAISRAFSAAEGGRGSA 140 Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 N KK V + P+ V +R + +F +G Sbjct: 141 PNFKK----------VAIVVTDGRPQDGVQDVSARARAAG----IEIFAIGVGRVDMGTL 186 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + +E + S S+++ + + + Sbjct: 187 RQMASE----------PLDEHVDYVESYSVIEKLTHKFQEAFCVVSDLC------ATGDH 230 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE-------EHFVDSSSLRHVIKKKH 272 + Y +C +D S + Sbjct: 231 DCEQVCVSTPGAYRCACRDGFSLNNDGKTCTACNGGLGSALDLVFLIDGSKSV-RPENFE 289 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LV+ + ++ S++ D ++G ++ V + G K + I Sbjct: 290 LVKKFINQIVDSLEVSDK---QAQVGLVQYSSSVRQEFPL--GQFKSKKDIKAAVKKMSY 344 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A++ D+ S+ + K ++ TDG + + K Sbjct: 345 MEKGTMTGQALKYLVDSSFSAINGARPGV------PKVGIVFTDGRSQDYISD---AAKK 395 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR--IGN 447 AK G R+ + L AS +F FR IG Sbjct: 396 AKDSGFRMFAVGVG------NAVEDELREIASEPVAEHYFYTAD-------FRTISKIGK 442 Query: 448 EIFERV 453 ++ ++ Sbjct: 443 KLQMKI 448 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 52/167 (31%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ +D + R+G + V ++ S K + T A+Q Sbjct: 66 IEALDVGPNATRVGVINYASAVRNELSLQGPHSKAALLQAVRRIQPLST--GTMTGLAIQ 123 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A S+ E + KK +++TDG ++ A +A++ GI I I Sbjct: 124 FAISRAFSAAEGGR---GSAPNFKKVAIVVTDGRPQDGVQDVSA---RARAAGIEIFAIG 177 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS S +L F++ Sbjct: 178 VGRVDMGT------LRQMASEPLDEHVDYVESYSVIEKLTHKFQEAF 218 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 66/176 (37%), Gaps = 20/176 (11%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + VR +++++ + + D R+G ++DR ++ G +K I + Sbjct: 51 KEDFEKVRQWVSNLVETFEI---GPDKTRVGVVRYSDRPTTEFDL--GKYKTCEEIKEAA 105 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G+T DA++ S E ++ KK +LLTDG + + Sbjct: 106 RKIRYYGGNTNTGDALRYIN---TYSFSKEAGGRLSDRTVKKVAILLTDGRSQDYVLDP- 161 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 N A+ GIRI + E + L AS F + + ++KI Sbjct: 162 --ANAARQAGIRIFAVGVG------EALKEELDEIASEPKSAHVFHVSDYNAIDKI 209 >gi|251779520|ref|ZP_04822440.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083835|gb|EES49725.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 815 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 42/386 (10%), Positives = 92/386 (23%), Gaps = 85/386 (22%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + +K+ + + K + + I ++ + E + Sbjct: 38 MTLINVKADNSVGDKPSFEIQIDEAKPNPAIVGEDILIKGKIIPKPFEMEMERQKKEIVL 97 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 T S + + GKV + Y + N S Y +++ +H Sbjct: 98 VLDTSGSMNEKVGKVCTNNRGW--YCKTHNSSDLYHRESLFYHNWINDYCEEHGKVGQHY 155 Query: 268 I-----KKKHLVRDALASVIRSIKKID-------NVNDTVRMGATFFNDRVISDP----- 310 K ++ A + I +K + N + + +N S Sbjct: 156 ASYSKSTKMEELKKAANNFIDKMKDVPDLKICIVNYSSEATINPCGYNGDKNSASVEEDR 215 Query: 311 --------SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI------------ 350 S + + +G T + ++ A + Sbjct: 216 HHTIPNYKSLGTKFLNSNDNTLHSMINGLKALGGTNTGEGLRKAEYMLEQGDKDAKKTIV 275 Query: 351 --ISSNEDEVHRMKNNLEAKKYIVLLTDGENT---------------------------- 380 KN+ +KY +L + Sbjct: 276 FMSDGLPTYYSVYKNHQNVQKYYWVLKYSWDNGYHWEKEYYWKEEDYWDYYTSTDDTFPN 335 Query: 381 ---QDNEEGIAICNK--------AKSQGIRIMTIAFSVNKTQQEKARYFLSNC----ASP 425 N + C K KS + ++ + + + A + Sbjct: 336 YAGTGNSDDQGCCKKYAKKIGEIIKSNNSNVFSLGYGLG-NKNSDANKIMKEIHDSMGGA 394 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 F A T +++IF I Sbjct: 395 EKDFFATDTGAIDEIFNQIADKIIES 420 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 38/411 (9%), Positives = 101/411 (24%), Gaps = 30/411 (7%) Query: 45 RWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 + ++A + ++A + + + + +E + + Sbjct: 50 YADNNKSVKQEAKPASELSAVA---SQPVPAARAVEMNRDRVAGREKEAGKVRSDARQDR 106 Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 T + P ++ + F + Sbjct: 107 LATLPTESRRLGIEQPNAAPGFMPQLDGIAGHGEGPGVGGDKFAYVENNPFRAV-ADEPL 165 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T S I+ + + + +N F V Sbjct: 166 STFSIDVDTASYSKIRSYLIDYHQLPPQGAVRVEELINYFTYDYATPTDQ-KPFAANVEA 224 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + P+ + +D S + +K L++ + ++ Sbjct: 225 AACPWNAEHRLVRIGIKGKEIANAERPASNLVFLLDVSGSMNNARKLPLLKQGMKLLVDQ 284 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + D V V GA + ++ + GST ++ Sbjct: 285 LGENDKVAIVVYAGAAGM---------VLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIE 335 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----CNKAKSQGIRI 400 AY ++ + ++L TDG+ + +KAK G+ + Sbjct: 336 LAYQA--ATENFIKGGVNR-------VILCTDGDFNVGVTSTSDLVTMAADKAK-SGVFL 385 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + F ++ ++ E K+ +++ + Sbjct: 386 SVMGFGTGNHNDAMMEELSGKA--NGNYAFIDTITEAKKVLVEQMSGTLTT 434 >gi|332879903|ref|ZP_08447588.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682114|gb|EGJ55026.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 345 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 48/156 (30%), Gaps = 19/156 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + +++ + TAI +A++ A + Sbjct: 129 DRVGIVAYAASAYPQLALTTDHSAA-KMFLQSMNTNMLSSQGTAIQEAIRMATNYFDDK- 186 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + +++DGE+ I +A+ +GI I TI K Sbjct: 187 ----------STTSRLLFIISDGEDH--EMGATEIAAEAQEKGIHIYTIGVGTEKGSPIP 234 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 R S+ + + + +I Sbjct: 235 MREL-----GEQSYKRDRNGEVVITRLNKELLQQIA 265 >gi|254414094|ref|ZP_05027862.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] gi|196179230|gb|EDX74226.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] Length = 601 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 59/182 (32%), Gaps = 28/182 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK R A ++ + D V+ T ++D V + + + + Sbjct: 58 KKIAYARQAACYAVQQLLGSDRVS------VTIYDDIVETLIPSTLATE---KNYITRQI 108 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA++D + N E ++LL+DG + A Sbjct: 109 ERIHPRNMTALHDGWVEGGKQVSQ---------YLNPEGLNRVILLSDGLANKGQTNADA 159 Query: 389 ICNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 I + QG+ T+ + L A +++ ++ +L +IF+ Sbjct: 160 IASDVYGLAQQGVSTTTMGVGDDYN-----EDLLEVMANSGDGNYYYIDTPEQLPEIFQT 214 Query: 444 RI 445 + Sbjct: 215 EL 216 >gi|124008506|ref|ZP_01693199.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123986014|gb|EAY25864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 425 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 63/195 (32%), Gaps = 30/195 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + V+ A+ VI ++K D ++ ++D + + + + Sbjct: 60 DKLNYVKKAVDFVIDNLKSDDVLS------IVQYDDEIDVV---ASSAKVTNKKALHEKV 110 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T ++ M Y + S+ + ++LL+DG Sbjct: 111 KGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNR---------VLLLSDGLANAGITAPEQ 161 Query: 389 ICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + A + GI + T + ++N + +++ + ++ +IF Sbjct: 162 LQQIAQKKFREAGIALSTFGVGSDFN-----EVLMTNLSEYGGANYYFIDMPDKIPQIFA 216 Query: 443 DRIGNEIFERVIRIT 457 + + V + T Sbjct: 217 QELEGLLSV-VAQNT 230 >gi|220910752|ref|YP_002486062.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219867524|gb|ACL47861.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 411 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 56/181 (30%), Gaps = 28/181 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + R A + + D R+ T F+DRV + + + Sbjct: 59 KMEVARQAACFAVEQLLPSD------RLSVTIFDDRVECPVPSTLVR---DKATIIRTIQ 109 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + GSTA++D + N ++LL+DG AI Sbjct: 110 GIHSRGSTALHDGWVQGGIQVSQ---------HLNPAHLNRVILLSDGLANVGETNPDAI 160 Query: 390 CNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 +G+ T+ ++ L A SF+ T +L IF+ Sbjct: 161 AQHVHGLAQRGVSTSTMGIG-----EDYGEDLLEAMARSGAGSFYHIERTEQLAAIFQAE 215 Query: 445 I 445 + Sbjct: 216 L 216 >gi|219363061|ref|NP_001136879.1| hypothetical protein LOC100217033 [Zea mays] gi|194697454|gb|ACF82811.1| unknown [Zea mays] Length = 459 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 56/203 (27%), Gaps = 38/203 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI ++ D R+ F+ + ++ Sbjct: 5 KLALLKRAMRFVIENLDPSD------RLSVVAFSSSAWRLFPLQRMTASGQQQSLQA-VN 57 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN---------- 379 G T I + ++ A + I+LL+DG + Sbjct: 58 SLAADGGTNIAEGLRKAARVVEDRQ---------ARNPVCSIMLLSDGVDSHNLPPRDGS 108 Query: 380 --TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 +I + I F ++ + + A S +F + Sbjct: 109 APDYGPLVPRSIL-PGSEHHVPIHAFGFGMDHDSRA-----MHAVAQMSSGTFSFIDMAG 162 Query: 436 ELNKIFRDRIGNEIF--ERVIRI 456 + F IG + + R+ Sbjct: 163 SIQDAFAQCIGGLLSVVAQETRL 185 >gi|149197810|ref|ZP_01874859.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] gi|149139031|gb|EDM27435.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] Length = 833 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 44/131 (33%), Gaps = 20/131 (15%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + V + G T + AM D + K+++ Sbjct: 462 TDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDML-----------GIASAKIKHMI 510 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFE 430 +L+DG++ + I ++ G+ I T++ A ++ A + Sbjct: 511 VLSDGQSQGGD--FEGISSELAQMGVTISTVSLGQGA-----AVDLMAAIAQIGNGRAYV 563 Query: 431 ANSTHELNKIF 441 N+ E+ +IF Sbjct: 564 TNNAEEMPRIF 574 >gi|119599629|gb|EAW79223.1| hCG1743181 [Homo sapiens] Length = 1211 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 58/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D + VR GA + D ++ Sbjct: 799 DYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDD--FGTKLEVISVLQN 856 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ GST +A+ + E + N + ++++TDGE+ ++ Sbjct: 857 DQAMGGSTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGESHDADKLNAT- 910 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 911 AKALRDKGILVLAVGIDG-----ANPVELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 963 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 62/188 (32%), Gaps = 21/188 (11%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 S+ + + S H ++ L+ V+ + VR+GA + D F Sbjct: 380 SDIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWD--LEFE 434 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + K G+T A+ + + + + + ++V+ Sbjct: 435 INKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKK------QRGNKVPCHLVV 488 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEAN 432 LT+G + E N+ + + IR+ I +E + L A + + Sbjct: 489 LTNGMSKDSILEP---ANRLREEHIRVYAIGI------KEANQTQLREIAGEEKRVYYVH 539 Query: 433 STHELNKI 440 L I Sbjct: 540 DFDALKDI 547 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 35/265 (13%), Positives = 77/265 (29%), Gaps = 15/265 (5%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + + A+ + ++ G+ + + + G F Sbjct: 890 PQVLIVITDGESHDADKLNATAKALRDKGI-LVLAVGIDGANPVELLAMAGSSDKYFFVE 948 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 +K S K++ + L + +D S+ Sbjct: 949 TFGGLKGIFSDVTASVCNSSKVASTSRTQVIILPLPPQGAGEIDKVDLVFLMDGSTSIQP 1008 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + ++ D + VR+GA F+D + G + I Sbjct: 1009 ND----FKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPL--GTFIGEKEISFQI 1062 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G+T I A++ + N + +++LTDG Q +E Sbjct: 1063 ENIKQIFGNTHIGAALREVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVA 1114 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 + +GI I ++ QQ Sbjct: 1115 QAAEALRHRGIDIYSVGIGDVDDQQ 1139 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 203 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 259 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 260 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 313 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 314 TIFTLGIEGTSDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 367 Query: 453 VI 454 V Sbjct: 368 VS 369 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/165 (20%), Positives = 57/165 (34%), Gaps = 21/165 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ V + R I + STAI+D + A + Sbjct: 134 DRVGLVIFAETAYFAAPLTFDVEAVGRLIDQATI--GISGRSTAISDGLGLALKRLA--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS------VN 408 + + +VLL+DG N + + A+ GIR+ TIA Sbjct: 189 --------RSDARSRVVVLLSDGVNNAGAVQPRDAGSLAERLGIRVHTIALGPADLETDP 240 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFE 451 K++ L A S F +T +L ++ R E Sbjct: 241 KSRDAVDTATLRAIAETSGGETFRVRTTDDLRQVARAIDALETSA 285 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 47/124 (37%), Gaps = 21/124 (16%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ + G T I + A D++I K+++LLTDG Sbjct: 477 TDKSSIAEKVAGIQGSGGTNIYGGLAEAIDSLIKVKAKN-----------KHVILLTDGW 525 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 + N + + +KA+ GI I T+ + + L + A +F+ + + Sbjct: 526 SNVGNYD--ELISKARRHGITISTV------SAAGGSAQLLRSIAEKGGGTFYNTRDSAD 577 Query: 437 LNKI 440 + +I Sbjct: 578 IPQI 581 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 49/144 (34%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ +I + R+G + + ++ Sbjct: 110 NRLEKAKQIISKIIDKLGS-------DRVGVIIYAGNSYPLLPITTDHAAAN-MFLQNAN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAIN+A++ A N+ + +++V+++DGE+ Q+ Sbjct: 162 PDMVSSQGTAINEALELAKTY-----------YNNDEQTNRFLVIISDGEDHQEE--TKQ 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 + + G++I TI K Sbjct: 209 VAQNLSNDGVKIYTIGVGTEKGGP 232 >gi|126310280|ref|XP_001371684.1| PREDICTED: similar to collagen XXI [Monodelphis domestica] Length = 957 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 52/172 (30%), Gaps = 23/172 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ D +++G ++D + + G+T Sbjct: 60 LVNISNNFDIGPKFIQVGVVQYSDYPVLEIPLGSHHSGENLMEAMESIQYL--GGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDHLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDNRITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEI 449 I E L A S F ++K R+ I ++ Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISK-IREVIKQKL 211 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 57/183 (31%), Gaps = 27/183 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L+ +I + +MG F + + +KT Sbjct: 110 SRLDYAKMLLSQLIDRLTD-------DKMGLIVFAGDAFIQMPITSDKVSA-KMFLKTIQ 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D + TAI A+ A + + + + I LLTD EN +DN Sbjct: 162 PDLIQRQGTAIGSAIDLAVKSFNDTKQSG----------GRAIFLLTDAENHEDNAV--E 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 A+ + I + + + + + S+ + + + + +G + Sbjct: 210 AAKMARDKNITVNVVGIGTPEGSPIPVKGTM-------SYIKDKDGNVVVSKLNEELGKQ 262 Query: 449 IFE 451 I Sbjct: 263 IAA 265 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 52/171 (30%), Gaps = 20/171 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D +D R+ + V + K + + T Sbjct: 92 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLST--GTMTG 149 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++TA + + + N K K +++TDG + A++ GI I Sbjct: 150 MAIKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDK---VEEVSAAARASGIEI 203 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS + F + +L FR+ + Sbjct: 204 YAVGV------DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETL 248 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 48/156 (30%), Gaps = 19/156 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + ++ D TAI +A++ A + + Sbjct: 129 DRVGIVAYAASAYPQLALTTDHSAA-KMFLQGMNTDMLSSQGTAIQEAIRMASNYFDENT 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + +LTDGE+ I +A+ +G+ I TI K Sbjct: 188 P-----------TARLLFILTDGEDH--EMGATEIATEAQEKGVHIYTIGIGTEKGAPIP 234 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + ++ + + + +I Sbjct: 235 IKD-----GGEQTYKRDRNGEVVITKLNRELLQQIA 265 >gi|115379114|ref|ZP_01466238.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|310823566|ref|YP_003955924.1| Batb protein [Stigmatella aurantiaca DW4/3-1] gi|115363897|gb|EAU63008.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|309396638|gb|ADO74097.1| BatB protein [Stigmatella aurantiaca DW4/3-1] Length = 352 Score = 57.7 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/157 (10%), Positives = 51/157 (32%), Gaps = 19/157 (12%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + L +++ +K R+G F + Sbjct: 111 DASKSMLARDVQPSRLERAKLELNTLLDELKG-------DRVGLVVFAGDAFIQSPLTSD 163 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + ++ ++ G + I A++ A + +++ ++ +VLL+ Sbjct: 164 YSAV-KLFLRAVDPEQMPQGGSNIGAALKLANQVLSNADRGAK---------ERAVVLLS 213 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 DGE+ K G++++ + + Sbjct: 214 DGEDLFGEVGEAT--EALKDGGVQVLAVGVGSESGEP 248 >gi|239908812|ref|YP_002955554.1| hypothetical protein DMR_41770 [Desulfovibrio magneticus RS-1] gi|239798679|dbj|BAH77668.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 439 Score = 57.3 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 50/163 (30%), Gaps = 22/163 (13%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + +T R+ ++ + + V + I ++ G T ++ + Sbjct: 74 DKLKNTDRVSLIAYDSSIETRVP---SVKVEDKAIFHRAIEGIDDGGCTNLHGGWLKGAE 130 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAF 405 I + I+LL+DG+ + + I C + G+ T Sbjct: 131 QISP---------YIDPSTISRIILLSDGQANEGLTDEAEIFKQCRELADAGVTTSTYGL 181 Query: 406 SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 N + A + + + +L F++ + Sbjct: 182 GSNFN-----ETLMIGMAKNGQGNSYYGRTADDLMDPFQEELS 219 >gi|156096226|ref|XP_001614147.1| sporozoite surface protein 2 [Plasmodium vivax SaI-1] gi|148803021|gb|EDL44420.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 57.3 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 48/165 (29%), Gaps = 15/165 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + S+ VD S V L +I S+ DT+ + F + Sbjct: 39 CNESVDLYLLVDGSGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTE 95 Query: 309 DPSFSWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G R + G+T + A+ + N Sbjct: 96 LIRLGSGQSIDKRQALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NRE 147 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +A + ++L+TDG + + NK K + + + I Sbjct: 148 KAIQLVILMTDGVPNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 191 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 57.3 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 45/167 (26%), Gaps = 41/167 (24%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH------RMKNNLEAKKYI 371 + + TAI + T+ D I ++ A I Sbjct: 144 TTDSQKLLDALQNLTTSRGTAIGQGILTSIDAIADADPSVAPTGSAVSGNGTGPYAADVI 203 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK----------------- 414 V+LTDG NTQ + +A ++ +R+ TI F Sbjct: 204 VVLTDGANTQG-VDPQTAAKQAAARRLRVYTIGFGTTTPAPMVCGSSQVGGFGGFGGFGG 262 Query: 415 ---------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L + A + +++ A + +L Sbjct: 263 FGGGGRLGDRSPLVIDEQALRDVAATTGGTYYRAQNAGQLQDALGTL 309 >gi|147677783|ref|YP_001211998.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] gi|146273880|dbj|BAF59629.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] Length = 312 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 14/134 (10%), Positives = 34/134 (25%), Gaps = 4/134 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + G ++ M + G ++ D+ + L A + + Sbjct: 7 LKNLLGNFLNNQNGLAAVMLCAGMAALFGFAALVTDIGLLAAKRQQLINTMDAAALAGAQ 66 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L + + + + + N + N T V +T Sbjct: 67 ELPDNPAQAVQ---VARDYAGKNGFAPDSLNI-SISGDNRTISVAGREVVNTIFARVLGI 122 Query: 127 SAYQVVLSSRYDLL 140 + V S + Sbjct: 123 YSKTVSAGSSASVQ 136 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 47/424 (11%), Positives = 100/424 (23%), Gaps = 44/424 (10%) Query: 56 AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A A + + + ++ P+ + L + Sbjct: 155 ALDRAA-------SEIAQTKADHPEDQVDTPRAYVNVVLTDGSWTSQDGTSPLSPASQNP 207 Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 TA + + V++ D + + + G + I E + Sbjct: 208 SITAGALYDDQDIPTYVVAVAGDPQAEAAADEVAAAGGTTAAIDGNTPEELGTALNEIVT 267 Query: 176 GV--------------SIQWVIDFSRSMLDYQRDS--------EGQPLNCFGQPADRTVK 213 + ++D S SML+ + Sbjct: 268 DIIESVVAPECVGGLPRAMIILDASSSMLNTDGGTMFGAMGETGWDQAGGALVGEMGLFD 327 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + L+ + + + + + K Sbjct: 328 IDVGTGTAEDLTHLGLTVFGYDTPNLAGAESLEEDPGYLPAYDAGEPRVLVGYGPCSKDN 387 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 AL + A ++D + P F + Sbjct: 388 FAWALDPNSSCTEPGCTDPWGGPPIAWDYDDGQANPPEFDAPTFSHMPQCAG---NGFCS 444 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMK----NNLEAKKYIVLLTDGENTQD--NEEGI 387 T + +Q D + + ++ E + +L+TDG+ N + Sbjct: 445 GSGTYTHLGLQLVKDYQQAYSGSTMNNDMAPYPTADETLYFNILITDGQYNGYSTNAQVQ 504 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-----SPNSFFEANSTHELNKIFR 442 + + GI I F L N A S N++++AN+ EL + Sbjct: 505 GELEEMYNSGITTYVIGFGDGVDTAAAQAQ-LQNMAQWGSGSQNNYYDANNQTELEQALT 563 Query: 443 DRIG 446 G Sbjct: 564 TIFG 567 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 41/113 (36%), Gaps = 18/113 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TA+ + +A + + +A + ++++TDG N Q + N A + Sbjct: 2 TALYSGINSATTEFKNY----------STDASRIMIVVTDGYNNQSGASSATVINNAIEE 51 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 + I + L N ++ ++ N +LN IF + I Sbjct: 52 NVIIYCVGVG------SVNSTVLKNISESTGGCYYYINQFSQLNGIFENIISE 98 >gi|114567231|ref|YP_754385.1| chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338166|gb|ABI69014.1| conserved putative chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 951 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 47/138 (34%), Gaps = 21/138 (15%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE- 378 + ++ G T+I A+ AY + + K+I+LLTDG+ Sbjct: 469 DKDAIQDDIATIRADGGTSIYPALALAYTALKDA-----------HTKFKHIILLTDGQS 517 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 T + ++ + GI + T+A L A ++ ++ Sbjct: 518 ATTGDYYFLS--RRMARAGITMSTVAVGEGAD-----TLLLEQLAAWGQGRYYFSDEISN 570 Query: 437 LNKIFRDRIGNEIFERVI 454 + +IF I ++ Sbjct: 571 IPRIFTKETMKAIKSYLV 588 >gi|149188837|ref|ZP_01867127.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] gi|148837257|gb|EDL54204.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] Length = 504 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 28/282 (9%), Positives = 61/282 (21%), Gaps = 22/282 (7%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 A + +LG+ V+ R+ + L+ A S + ++ N+ Sbjct: 1 MAFTLIPVLGMTFFAVEGTRYIQETNRLRDA---------AQAAASAVTIEDQSANANEM 51 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 K I +Y+ + + Y V ++ ++ Sbjct: 52 AKDYIRDYVRDINSETVVATRFYQAPDPENDV------DEFIQYTVEATTNHNSWFASNL 105 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + I +V DFS SM + P Sbjct: 106 ----IPVFGETQDLKGVAVAKKYPFNLGDKNIDIVFVSDFSGSMSWQWGGNSSDPCTATN 161 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMV--SCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 + K+ D + P C L + + + Sbjct: 162 CKIADLKVAVKEIADKLLCSDIQTDPATNEDYCADDDQPELTSKLDNRVAVVPFNIRTRE 221 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + D + Sbjct: 222 SNGSNV-FTVTQLRYRDDVDEDDSPRTYEDVNWNKWREYTSG 262 >gi|325919326|ref|ZP_08181363.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] gi|325550197|gb|EGD21014.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] Length = 520 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 38/297 (12%), Positives = 82/297 (27%), Gaps = 24/297 (8%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 +QT A + S + ++ + +N F Sbjct: 57 VQTAANPVSTFSIDVDTGSYSNVRRFLSAGTLPPVDAVRVEELINYFRYDDPAPADG-KP 115 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + + + + P + VD S K L++ + Sbjct: 116 FAVRTELAPTPWNNDSLLLRVGVAGRAIATADLPPANLVFLVDVSGSMESPDKLPLLQSS 175 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L ++R ++ D R+ + + G + + GST Sbjct: 176 LKLLVRQLRAKD------RITLVTYAGNTAVVLPPTPG---DQQGRIIEAIDTVQSGGST 226 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---EGIAICNKAK 394 A ++ AY + + R N I+L TDG+ + + + + Sbjct: 227 AGASGIELAYK----AAQQGYLRGGINR-----ILLATDGDFNVGVTDFDQLKGMVAEKR 277 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 G+ + T+ F A ++ +S E K+ +G + Sbjct: 278 RSGVALSTLGFGTGNYNDTLMEQL--ADAGDGAYAYIDSPLEARKVLTHELGATLAT 332 >gi|296225410|ref|XP_002758285.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Callithrix jacchus] Length = 844 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 63/232 (27%), Gaps = 10/232 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S S NG + + + + + + + V K Sbjct: 245 PTCSTSLLNGDFKVTYDVSREEICDLLVANNHFAHFFAPQNLTNMNKNVVFVIDISGSMK 304 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++++ + + F RV S + + F Sbjct: 305 GQKVKQTKEALLQILGDMRPG---DYFDLVLFGSRVQSWRGSLVQASQANLQAARDFVRG 361 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T +N + +T+ E + +++LTDG+ T+ + I Sbjct: 362 FSLDEATNLNGGLLRGIETLNKVRES----LPELSNHASILIMLTDGDPTEGVTDRSQIL 417 Query: 391 NKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 +S + + F N + N +E + + Sbjct: 418 KNVRSAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYEDRDAAQQLQ 469 >gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 418 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 54/167 (32%), Gaps = 17/167 (10%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + D R+ FN R + + R +K + G T+I++ ++ + Sbjct: 71 DKLRDQDRLSIVVFNHRAEV---LLSNQNVVDRDHIKQQINRLSANGGTSIDEGLRLGIE 127 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + +D + + LLTDGEN DN + A + + T+ F Sbjct: 128 ELAKGRKDTISQA----------FLLTDGENEHGDNNRCLKFAQLAADYNLTVNTLGFGN 177 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 N Q + + A S + F + + Sbjct: 178 NWNQHILEKI---SDAGLGSLSHIEHPDQAMDKFDSLLTRMQTVGLT 221 >gi|126334040|ref|XP_001370580.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1156 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 56/169 (33%), Gaps = 17/169 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ + +V+ K D + ++D + F++ K T E Sbjct: 176 KNFVMTVMDQFKGTDT-----QFSLMQYSDDFKT--HFTFNNFKNDPTSKNLVGPIEQLN 228 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + +AKK ++++TDG+ D+ + +A+ Sbjct: 229 GKTHTASGIRKVVRELFQEWNGAR------KDAKKILIVITDGQIQGDSLNYRDVIPEAE 282 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 +G+ I + L AS F+ N+ L I Sbjct: 283 KEGVIRYAIGVGYAFNTPSARQE-LRTIASQPAQEHVFQVNNFDALKNI 330 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 60/189 (31%), Gaps = 22/189 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ ++ + + K R+G F + S L I + Sbjct: 118 TRLQAVQQSVKKFVAARKS-------DRIGLVIFANSAWPFAPVSEDKQALETRI--SQL 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + S+ + EA K +LLTDG +T Sbjct: 169 TPGMAGQQTAIGDALGVTVKLLDSTGD---------KEASKLAILLTDGNDTASQLTPRL 219 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCA--SPNSFFEANSTHELNKIFRDR 444 A S +++ TIAF + + L + A + + A ++ Sbjct: 220 AAQLAVSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLARMTGGRSWTAENSGASLDAVWKE 279 Query: 445 IGNEIFERV 453 I +V Sbjct: 280 IDAITPVQV 288 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 + + +D D R+ + V + L + +K T Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIEFLLK---SHLTKDTIKQAITRIEPLAAGTMT 143 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A++ A D + + K K +++TDG + ++ + A++ GI Sbjct: 144 GMAIKKAMDEAFTEKSGARPKSK---NISKVAIIVTDG---RPQDQVEEVSAAARASGIE 197 Query: 400 IMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 I + L AS + F + +L FR+ + Sbjct: 198 IYAVGV------DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETL 243 >gi|326385750|ref|ZP_08207379.1| hypothetical protein Y88_2807 [Novosphingobium nitrogenifigens DSM 19370] gi|326209729|gb|EGD60517.1| hypothetical protein Y88_2807 [Novosphingobium nitrogenifigens DSM 19370] Length = 543 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 50/521 (9%), Positives = 124/521 (23%), Gaps = 81/521 (15%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + K+ + +ITAL + L G +D R + + A A A+ Sbjct: 14 RSRALLTDIAKADDANVTLITALAIIPALFGLGFTIDYARAEMLQSRI-NAVADAAALAA 72 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + S ++ +Y + T + + + R Sbjct: 73 TDTTYISKTSTVAQAASTQIFTTQVTDYADFVYTPASDLTVTITDGGTLNLGRTALVKWR 132 Query: 126 KSAYQVVL-------------SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 S+ + S+ Y ++ +L S L+ + + + + Sbjct: 133 GSSTNLFSGILGVATLPIGGSSTAYASFFPNINFYLVMDKSPSMLLPSTSSGISAIQKVT 192 Query: 173 -------KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 ++ SR++L T + S Sbjct: 193 GCAFACHQQLPGQNYVKDSSSRAVLIDANYYSSSSSGYNAYYLLSTTANSSGSYNLYNSS 252 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK---------------- 269 K+ S S + + + Sbjct: 253 GVKIGTVSSSKIPSGSSSPASLSYKDNNNNSQTLSVLYADGYWLTHNYAKVYSDLSSIPL 312 Query: 270 KKHLVRDALASVIRSIKKIDN-VNDTVRMGATFFNDR-------VISDPSFSWGVHKLIR 321 + A +I ++ + T ++ F+ V + + Sbjct: 313 RVGAETLAAQDLIPFAQQKETDNKVTYQLQLYSFDWTHSGKSSPVTQYTNMTDVKSLTTA 372 Query: 322 TIVKTFAIDENEMGSTAIND-------AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + + + + + ++ ++++ Sbjct: 373 SVPDLDGTQDFWYSNNNPTSSKNINDMGTEFLNMMTSMNTAMPNPGNGSTGNPQEVLMII 432 Query: 375 TDGENTQD----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR-------- 416 TDG + + IA C K++ I+I + Sbjct: 433 TDGVVDETISGSRRTRELSSTDIAACTTIKNRKIKIAILYTEYAAATLTGDSWSQTNTAP 492 Query: 417 ------YFLSNCASPNS-----FFEANSTHELNKIFRDRIG 446 L +CAS S + +S ++ + Sbjct: 493 YLPNVLPALKSCASVGSDGNPLVYTVSSDQSISTALQALFA 533 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 59/169 (34%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D ++ D+ G+T Sbjct: 826 DFMINLVKKADVGKNHVRFGALKYADDPEVLFYLDNL--DTKWEVISVLQNDQPLGGNTY 883 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE+ ++ + +GI Sbjct: 884 TAEALGFSDHMFTEARGSRLHKG-----VPQVLIVITDGESHDADKLNAT-AKALRDKGI 937 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + +F + L IF D + Sbjct: 938 LVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSAS 981 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 60/198 (30%), Gaps = 24/198 (12%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S + +++ + SI +D +R+G +++ + S + ++ Sbjct: 233 STNSSWEDFDYLKE---FLEESISALDIKEHCMRVGLVAYSNETKVISTLS---RGVNKS 286 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 V A A A I H + N + VL+T + Sbjct: 287 EVLQDIQSLAPQAGKAYTGA---ALRKIRKEVFSAQHGSRKNQGVPQIAVLVTH---SPS 340 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK 439 + + QG+ + TI Q L AS + + S +L Sbjct: 341 QDNVTKAAVNLRRQGVIVFTIGVEGASDTQ------LEKIASHPAEQYVSQLRSFSDLAA 394 Query: 440 ---IFRDRIGNEIFERVI 454 F ++ N+I V Sbjct: 395 HNQTFLKKLRNQITHTVS 412 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R + I+ D N+ VR+GA F+ + G+ I Sbjct: 1009 PDDFRKMKEFLASVIQDFDISNNRVRIGAAQFSHTYQPEFPL--GMFIGKEEISFQIENI 1066 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T I A++ +H + +++LTDG Q +E Sbjct: 1067 KQIFGYTHIGAALRQVGHYFRPDMGSRIHAG-----TPQVLLVLTDG---QSQDEVAQAA 1118 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +GI I ++ QQ Sbjct: 1119 EELRHKGIDIYSVGIGDVDDQQ 1140 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 47/150 (31%), Gaps = 21/150 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V++G F+D + + + +G T + Sbjct: 651 RADRVQIGVVQFSDVNKEEFQL---NRYTSQEEISDAIDRMAHIGET------TLMGSAL 701 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNK 409 ++ +K+++L+TDGE ++ + +GI I ++ F N Sbjct: 702 TFVSQYFSPAKGARPNVRKFLILITDGEAQDIVKDP---AVALREEGIIIYSVGVFGSNV 758 Query: 410 TQQEKARYFLSNCAS-PNSFFEANSTHELN 438 TQ L + P F + L Sbjct: 759 TQ-------LEEISGRPEMVFYVENFDILK 781 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 59/169 (34%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D ++ D+ G+T Sbjct: 826 DFMINLVKKADVGKNHVRFGALKYADDPEVLFYLDNL--DTKWEVISVLQNDQPLGGNTY 883 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE+ ++ + +GI Sbjct: 884 TAEALGFSDHMFTEARGSRLHKG-----VPQVLIVITDGESHDADKLNAT-AKALRDKGI 937 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + +F + L IF D + Sbjct: 938 LVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSAS 981 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 60/198 (30%), Gaps = 24/198 (12%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S + +++ + SI +D +R+G +++ + S + ++ Sbjct: 233 STNSSWEDFDYLKE---FLEESISALDIKEHCMRVGLVAYSNETKVISTLS---RGVNKS 286 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 V A A A I H + N + VL+T + Sbjct: 287 EVLQDIQSLAPQAGKAYTGA---ALRKIRKEVFSAQHGSRKNQGVPQIAVLVTH---SPS 340 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK 439 + + QG+ + TI Q L AS + + S +L Sbjct: 341 QDNVTKAAVNLRRQGVIVFTIGVEGASDTQ------LEKIASHPAEQYVSQLRSFSDLAA 394 Query: 440 ---IFRDRIGNEIFERVI 454 F ++ N+I V Sbjct: 395 HNQTFLKKLRNQITHTVS 412 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R + I+ D N+ VR+GA F+ + G+ I Sbjct: 1009 PDDFRKMKEFLASVIQDFDISNNRVRIGAAQFSHTYQPEFPL--GMFIGKEEISFQIENI 1066 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T I A++ +H + +++LTDG Q +E Sbjct: 1067 KQIFGYTHIGAALRQVGHYFRPDMGSRIHAG-----TPQVLLVLTDG---QSQDEVAQAA 1118 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +GI I ++ QQ Sbjct: 1119 EELRHKGIDIYSVGIGDVDDQQ 1140 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 47/150 (31%), Gaps = 21/150 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V++G F+D + + + +G T + Sbjct: 651 RADRVQIGVVQFSDVNKEEFQL---NRYTSQEEISDAIDRMAHIGET------TLMGSAL 701 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNK 409 ++ +K+++L+TDGE ++ + +GI I ++ F N Sbjct: 702 TFVSQYFSPAKGARPNVRKFLILITDGEAQDIVKDP---AVALREEGIIIYSVGVFGSNV 758 Query: 410 TQQEKARYFLSNCAS-PNSFFEANSTHELN 438 TQ L + P F + L Sbjct: 759 TQ-------LEEISGRPEMVFYVENFDILK 781 >gi|293356227|ref|XP_574569.3| PREDICTED: rCG40584-like [Rattus norvegicus] Length = 1182 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 18/186 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + +V+ +++ R F+ + +F+ + + Sbjct: 163 TYTDFEKMLAFVKAVMSQLQQSST-----RFSLMQFSHTFRTHFTFNDFISTSSPLRLLD 217 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F GST A++ + ++ + +A K ++++TDG D + Sbjct: 218 FVNQLR--GSTRTASAIKHVITELFTTQKGAR------KDATKILIVITDGRKEGDRLDY 269 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 + A++ GI I Q ++R L + AS F + L ++ Sbjct: 270 GDVIPMAEAAGIIRYAIGVGQ-AFYQAQSRQELKDIASSPSREYVFSVENFDALKD-IQN 327 Query: 444 RIGNEI 449 ++ +I Sbjct: 328 QLKEKI 333 >gi|149923979|ref|ZP_01912364.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149815157|gb|EDM74708.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 785 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 81/306 (26%), Gaps = 33/306 (10%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP-ADRTVK 213 S++ + T S S++ + M D + +N F + Sbjct: 254 SFVATGEDRKSTFSIDVDTASYASVRQSLRNGW-MPDPGSVRTEEMINYFDYGYVAPSGG 312 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYM---LYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + + +P + + + +D S K Sbjct: 313 AGAPFAVHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLLDVSGSMSSRGK 372 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L++ ++ + D+V+ + + G + + Sbjct: 373 LPLIKHGFTQLVEQLGAEDHVS------IVVYAGAAGVVLPPTSG---DQKETILGALDR 423 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + + AY+ + + ++L TDG+ + A+ Sbjct: 424 LEAGGGTNGSAGIVEAYEL--AQANFVDGGVNR-------VILGTDGDFNVGLSDHDALV 474 Query: 391 NKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 + G+ + + + + A ++ + E K+ + I Sbjct: 475 ELIEQKRESGVFLSVLGVGGHYDD-----ELMEQLADHGNGNYAFLDGKREAEKVLVEEI 529 Query: 446 GNEIFE 451 G + Sbjct: 530 GGTLTT 535 >gi|293344414|ref|XP_001080404.2| PREDICTED: rCG40584-like [Rattus norvegicus] gi|149067645|gb|EDM17197.1| rCG40584, isoform CRA_b [Rattus norvegicus] Length = 1182 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 18/186 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + +V+ +++ R F+ + +F+ + + Sbjct: 163 TYTDFEKMLAFVKAVMSQLQQSST-----RFSLMQFSHTFRTHFTFNDFISTSSPLRLLD 217 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F GST A++ + ++ + +A K ++++TDG D + Sbjct: 218 FVNQLR--GSTRTASAIKHVITELFTTQKGAR------KDATKILIVITDGRKEGDRLDY 269 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 + A++ GI I Q ++R L + AS F + L ++ Sbjct: 270 GDVIPMAEAAGIIRYAIGVGQ-AFYQAQSRQELKDIASSPSREYVFSVENFDALKD-IQN 327 Query: 444 RIGNEI 449 ++ +I Sbjct: 328 QLKEKI 333 >gi|149067644|gb|EDM17196.1| rCG40584, isoform CRA_a [Rattus norvegicus] Length = 1084 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 18/186 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + +V+ +++ R F+ + +F+ + + Sbjct: 163 TYTDFEKMLAFVKAVMSQLQQSST-----RFSLMQFSHTFRTHFTFNDFISTSSPLRLLD 217 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 F GST A++ + ++ + +A K ++++TDG D + Sbjct: 218 FVNQLR--GSTRTASAIKHVITELFTTQKGAR------KDATKILIVITDGRKEGDRLDY 269 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 + A++ GI I Q ++R L + AS F + L ++ Sbjct: 270 GDVIPMAEAAGIIRYAIGVGQ-AFYQAQSRQELKDIASSPSREYVFSVENFDALKD-IQN 327 Query: 444 RIGNEI 449 ++ +I Sbjct: 328 QLKEKI 333 >gi|108763557|ref|YP_631764.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108467437|gb|ABF92622.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 592 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 40/415 (9%), Positives = 94/415 (22%), Gaps = 41/415 (9%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + + + + + R Sbjct: 41 AAQSVARDDDAAHAPEREEYVADRASAEHSVAAPAPAAPPASALAGPVARAPAPQAAKKV 100 Query: 120 VEMNPR-----KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 + + L G ++ ++ Sbjct: 101 SLGKAELHRREPRPMKPSADALAGAPLESKPQDAAPAGGNTFEAWKANAFVETAKDPLST 160 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + + Y+ + +P Sbjct: 161 FAADVDTASYTVSRRYLVNGQLPPASAVRVEEFVNYFKFRYAPPETGAFAVHLEGAPSPF 220 Query: 235 SCNKSLYYMLYPGPLDPSLSEE-----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + G + + VD+S H K L R+A+ ++++ + D Sbjct: 221 DAKRHFLRVGVQGKVVSRSQRKPAHLVFLVDTSGSMHSEDKLPLAREAIKVAVKNLNEND 280 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V + + + G TA+ M+ AY Sbjct: 281 TVA------IVTYAGNTRDVLPPTP---ATDAKSIHAALDSLTAGGGTAMGSGMELAYRH 331 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAK---SQGIRIMTIAF 405 + V LTDG+ N A+ + ++G+ + T+ F Sbjct: 332 AVKKASGSVVSRVVV---------LTDGDANIGRNVSANAMLDSIHKYTAEGVTLTTVGF 382 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR-------DRIGNEIFERV 453 + + + + F +S E K+F + I ++ +V Sbjct: 383 GMGNYRDDLMEKLADK--GNGNCFYVDSLREAKKVFETQLTGTLEVIAKDVKFQV 435 >gi|72093926|ref|XP_787298.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] gi|115954441|ref|XP_001184332.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] Length = 504 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 59/407 (14%), Positives = 117/407 (28%), Gaps = 41/407 (10%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 LK A A SL E + + S T + + N Sbjct: 116 VRSKLKN---LAWKAAEAKFTISLPEDAFISAFSMTIDDIVYHSEVKPKDVAQREYNEAK 172 Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN---------PLSLFLRSMGI--KSWL 157 + + + + K V + +R +L N LF + + I + Sbjct: 173 AKGQTAGQVKQSSPDENKFTVSVTVEARTSVLFNLTYEQLLKRKNGLFEQRISIQPGQMV 232 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + EA KE S S L+ + + + A + ++ Sbjct: 233 PDIRVEARLTEPQGLKEVNASWAVADKTDESKLNDMTLVDLKSRSAKVLFASKGEGLLTT 292 Query: 218 QNGKVGIRDEKLSPYM--------VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 NG +G K + ++Y G + +D S + Sbjct: 293 SNGIMGDFIIKYDVMHDVKAGHLQIVNGYFVHYFSPVGLPKTRKNVVFVIDVSGSM-RGR 351 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A +++ ++ ID +N + + S + R Sbjct: 352 KIDQTKRAFTTILDDVRPIDRINIVLFESDVRVWRSNQMVEATSDNIAAAKRH-----VN 406 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + D ++ A D ++ E I++LTDG+ T + + + Sbjct: 407 RIRAGGGTNLYDGLRNAVDLLMEHGNGEAMP---------LIIMLTDGQPTSGSVKSTSE 457 Query: 390 ----CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + + +++F + LSN A +E + Sbjct: 458 IIQRITNLIDGRLSLFSVSFGNGVDFSFLEKLSLSNQALARKVYEDS 504 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 58/183 (31%), Gaps = 5/183 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS-VPLIQSLE 73 ++ G ++ ++++ ++LGV +D+ + L+ A TA + + V Sbjct: 6 RRTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDV 65 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + A + +F + +++ V + + V + Sbjct: 66 DQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRV 125 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + LS + + A A + + + I D + ++ D Sbjct: 126 AVTDMGISQYLSAVFGIV----KNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAW 181 Query: 194 RDS 196 Sbjct: 182 GYR 184 >gi|296119163|ref|ZP_06837733.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] gi|295967789|gb|EFG81044.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] Length = 585 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 49/142 (34%), Gaps = 18/142 (12%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + I + D G T + A++ A + + + Sbjct: 15 DAGCRDVTTLAPVRSGQLDDI-RGEVDDIEASGYTPMGSALRQAAEELPAEG-------- 65 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLS 420 ++ IVL++DG +T + + QGI I T+ F V+ + + Sbjct: 66 -----ERSIVLVSDGIDTCAPPPVCEVAEELHDQGIDLIINTVGFLVDDEARSELECIAD 120 Query: 421 NCASPNSFFEANSTHELNKIFR 442 A + +A+ L + + Sbjct: 121 --AGGGQYLDADDAESLAESMK 140 >gi|300776964|ref|ZP_07086822.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502474|gb|EFK33614.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 800 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 32/290 (11%), Positives = 80/290 (27%), Gaps = 24/290 (8%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 T+ + + S + + ++D + +N F + + + Sbjct: 346 TRNQPLSTFSIDVDNASYSNVRRMINNGQVVDKNAVRIEEMVNYFKYDYPQPKN-ENPFS 404 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 D +P L P P+ + +D S K L++ + Sbjct: 405 INTEYSDAPWNPKHKLLKIGLQGKNLPMDKLPASNLVFLIDVSGSMSDENKLPLLKSSFK 464 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ ++ D V + + + + GSTA Sbjct: 465 VLLNQLRPKDKVG------IVVYAGSAGMVLPPT---SAGEKDKIIEALDRLQAGGSTAG 515 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQ 396 ++ AY ++ + NN +++ TDG+ + + + Sbjct: 516 GAGIELAYKL----AQENFVKEGNNR-----VIIATDGDFNVGTSSISDLKTLIEDRRKS 566 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 G+ + + F + + ++ ++ E NK Sbjct: 567 GVFLTCLGFGMGNYKDNTLETLADK--GNGNYAYIDNMQEANKFLGKEFA 614 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 56/198 (28%), Gaps = 37/198 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + H + ++A I K R+G F S + L + + Sbjct: 127 RLHYAKKSIADFIDKRKS-------DRLGLVVFGAEAYSVCPPTNDHEYLQNRLKEISTE 179 Query: 330 DENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + T I A+ + S KK I+L+TDG +T ++ Sbjct: 180 YLGDYNRQTNITAAISGGLARLRKS-----------KAPKKIIILVTDGSHTANSNLTPR 228 Query: 389 I-CNKAKSQGIRIMTIAFS---------------VNKTQQEKARYFLSNCA--SPNSFFE 430 + A I TI +N + + L A + +F Sbjct: 229 MAAKAAAKSDAVIYTIGVGNEVAWNVENFFGSSRLNASNSDFDEELLKEIAEKTGGLYFS 288 Query: 431 ANSTHELNKIFRDRIGNE 448 ++ + + E Sbjct: 289 VREAEQMKDVLKKIDALE 306 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 52/351 (14%), Positives = 94/351 (26%), Gaps = 73/351 (20%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S + V+D S Q R + + + G Sbjct: 30 ANARSSTGMDGPTMTGKVVDPDTSNYWKFWAGGYNGKEVTTQNVGRIWTDKTVKAVESGD 89 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK-KKHLVRDALASVIR 283 D + +S P + L +D + ++ A + I Sbjct: 90 SDFLTTLSAISSTSDTTISGKPLDIVMVLDASGSMDDPMGTGDNTKRIDALKTAANTFID 149 Query: 284 ----SIKKIDNVNDTVRMGATFF---------NDRVISDPSFSWGVHKL----------I 320 + I + + R+ F + D +++ + Sbjct: 150 AIAAQNQSITDASKQHRVAIVKFAGKKKTDKVGNDTYRDGRYTYNYSQTMKNLTSCKGKD 209 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +K + N GST + ++ A + I+S +AKK +V TDG T Sbjct: 210 ADSLKDTVGNINPAGSTQADYGLELAENITINSGR---------ADAKKIVVFFTDGSPT 260 Query: 381 QDNEEGIAICNKA-------KSQGIRIMTIAFSVNKT--------QQEKARYFLSNCAS- 424 + ++ + A K+ G I TI K F+ +S Sbjct: 261 SSSGFQASVADSAIASAKSLKANGADIYTIGIFSGANPSADPTAEGTSKVNKFMHAVSSN 320 Query: 425 ------------------------PNSFFEANSTHELNKIFRDRIGNEIFE 451 + + A S EL KIF + G+ I Sbjct: 321 YPGATSSISFWGEWVIDYGTRAENSDYYKSATSASELEKIFEEISGSIIQT 371 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 67/212 (31%), Gaps = 40/212 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + A + + + +G F S + + K Sbjct: 109 SRLEVAQQAGKEFVDGL------TQGINLGFVTFAGTASVMQSPTTNREAV-----KAAI 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN----- 383 + TA + + TA +I + IVL++DG+ T + Sbjct: 158 DNIKLAERTATGEGILTALQSIET---LATVLGGAETPPPARIVLMSDGKQTVPDDKDVD 214 Query: 384 --EEGIAICNKAKSQGIRIMTIAFSV--------NKTQQEKARYF--------LSNCA-- 423 AKS+GI + TI+F ++ Q ++ L A Sbjct: 215 NPRHAFTAARLAKSKGIPVSTISFGTEWGSVEIPDQDGQGGSQRVKVPVDNESLREIAKL 274 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 S F+ A+S EL ++ D + +I R Sbjct: 275 SGGEFYTASSLEELTAVY-DTLEEQIGYETTR 305 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 38/261 (14%), Positives = 75/261 (28%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +S+ + N + + S + + + ++ Sbjct: 236 SIMFMQSLHSVTEFCTEKTHNVEAPNLQNKMCNGKSTWDVIMNSTDFQNTSPMTEMNPPT 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I + +G Sbjct: 296 QPTFSLLKSKQRVVCLVLDKSGSMSSEDRLFRMNQAAELFLIQIIE-----KGSLVGMVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + T + + G T+I ++ + II S + Sbjct: 351 FDSVAEIRNNLTKITDD-NVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ + + C + K G+ I TIA K LS Sbjct: 410 ---------IILLTDGEDNEIHS-----CIEEVKQSGVIIHTIALG---PSAAKELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIF 441 + + F+ + L F Sbjct: 453 DMTGGHRFYANKDINGLTNAF 473 >gi|113971716|ref|YP_735509.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113886400|gb|ABI40452.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 335 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 63/186 (33%), Gaps = 22/186 (11%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +++ + + + ++ L + I R G F D F+ + Sbjct: 114 TTADGSTLTRLNAAKNVLKTFIA-------KRSGDRFGLILFGDAAFIQTPFTADQQVWL 166 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + A ST + DA+ + + ++ +++LTDG +T Sbjct: 167 S--LLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE---------QQVMIVLTDGNDT 215 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCA--SPNSFFEANSTHE 436 E + A ++GI+I IA +Q + + + F A E Sbjct: 216 GSFVEPVDAAKIAAARGIKIYIIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAIDQAE 275 Query: 437 LNKIFR 442 L+K ++ Sbjct: 276 LDKAYQ 281 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 46/166 (27%), Gaps = 21/166 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N R+ FND + + + G T + D ++ A + Sbjct: 382 NPQDRVRVVLFNDYASNLTGGFLPATQKNIAEIIRKLDLVLPNGGTHLMDGVRFALSGLD 441 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + I L+TDG + + K + IR+ T + Sbjct: 442 ADRTSA-------------IWLVTDGVTNVGETKQRKFVDLLKQKDIRVFTFIMGNGANR 488 Query: 412 QEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L AS +++ ++ +++ +R Sbjct: 489 P-----LLKAITKASNGFAINVSNSDDIIGQLEKA-ASKVTHEALR 528 >gi|116626306|ref|YP_828462.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116229468|gb|ABJ88177.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 310 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 53/158 (33%), Gaps = 17/158 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA-YDTII 351 + + F+ + KL G TA+ DA+ A D + Sbjct: 121 NQDKAMLVSFDTKAELVSDLIGDTEKLD-----HAIRSLRPGGGTALYDAIFFACRDKLS 175 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++ IV+++DG++ Q A+ + + +I+ +++K + Sbjct: 176 QDQPK--------HKFRRAIVIVSDGDDNQSQYTRDQALEMAQKADVVLYSISTNISKIE 227 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + + L A + F +L + F + Sbjct: 228 SDGDK-VLKYYAAETGGKAFFPFKVEDLEQSFENIANE 264 >gi|116748933|ref|YP_845620.1| hypothetical protein Sfum_1496 [Syntrophobacter fumaroxidans MPOB] gi|116697997|gb|ABK17185.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 427 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 66/233 (28%), Gaps = 1/233 (0%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 +SK + G II AL + ++LG G VD+ + L+ AA + + L Sbjct: 7 FSKSRLSERDGATVIIVALALVMLLGFGAFAVDIGYLYVVRNELQNAADAGALAGAAALY 66 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + N + E L N D + T P S Sbjct: 67 NNDGTAVQPTANVIGQEAAMRNTAVRTAVEVTLNGNSGDVQRGHYSFATGTFT-PNASLL 125 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V L + L+ + F+ ++ + + A + + ++ + V + Sbjct: 126 PVSLWNVSTEELDANTDFINAVKVTTHRSAPAAPSFFSRIFGYDSFALAAEAVAYIGFAG 185 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 Y + + C D S + N S Sbjct: 186 TLYPLNVDKPIAMCQESLLDAEGNYSCSIGRMINSGSGAGHNTAAWTNYSQPC 238 >gi|298246130|ref|ZP_06969936.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297553611|gb|EFH87476.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 412 Score = 57.3 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 58/176 (32%), Gaps = 20/176 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V++A+ VI ++ D ++ F+D + +K Sbjct: 60 KLRNVKEAVKMVIDRLEPSDYIS------VVIFDDSAQVIIP---SMPANDPVGMKAAID 110 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T ++ M + + + + ++LLTDG D + + Sbjct: 111 RIQDAGGTTMSLGMIQSLGELR---RWNIPNAVSR------MILLTDGVTYGDTDRCRQL 161 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A + GI I + + + + +P F S ++ IF ++ Sbjct: 162 ARDAAAAGISIYPLGIGADWDENLLDDVGQLSGGTPAEF--IRSPNDAMTIFEQQV 215 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 62/185 (33%), Gaps = 22/185 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A S+I S+ D R+ F+ R L ++++ Sbjct: 59 LETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQSRQDDLT--LIRSKIQQ 110 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ S ++ +I LLTDGEN DN+ + + Sbjct: 111 LRAGGGTAIDEGIKLGIQESSSGSKG----------YVSHIFLLTDGENEHGDNQRCLKL 160 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 A GI + T F + Q + + A + + L + F Sbjct: 161 AAVAAEYGITLNTFGFGDHWNQDILEKIA--DIAGGSLSYIERPEQALIE-FTRLFNRLQ 217 Query: 450 FERVI 454 R+ Sbjct: 218 SVRLT 222 >gi|323700441|ref|ZP_08112353.1| hypothetical protein DND132_3035 [Desulfovibrio sp. ND132] gi|323460373|gb|EGB16238.1| hypothetical protein DND132_3035 [Desulfovibrio desulfuricans ND132] Length = 389 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 52/157 (33%), Gaps = 9/157 (5%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + ++ + +G ++ +L M ++G+ + VD+ R AL+ A+ Sbjct: 2 RTHFRR-PDNESGFATVMVSLCMAALMGLTALAVDLGRAYLKRSALQT--------AADA 52 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + A + + Y RN + + + V + + Sbjct: 53 GALAGANSLLAAGKDLDKLRLIVTNYTTRNLTDADGPAKALTDADIVFLRDGVPDEEQPN 112 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 +V ++ + F +++G + I + A Sbjct: 113 QVEVTVTLSGERENAFPLYFGKAVGKPAMDIVVTSRA 149 >gi|290991702|ref|XP_002678474.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284092086|gb|EFC45730.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 467 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 60/173 (34%), Gaps = 20/173 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I+N+ + R+G F+D+ + + V L + +K ++ E GST MQ Sbjct: 128 IENLREFERLGIVLFDDKAETLLPLTI-VQDLDKKSLKETVLNIKEKGSTNFEAGMQRGI 186 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMTIAFS 406 D S + ++ I+ LTD + I K + G I + Sbjct: 187 DLFSSLDSSDLSNSNR-------IIYLTDACPNVGGTATLDILTKDANSGPYSIFSTFVG 239 Query: 407 VNKTQQEKARYFL---SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + K L C ++F ST E F+ + + V I Sbjct: 240 IGLDFNSKIVDELTRVRGC----NYFSVKSTEE----FKKILNEDFNYIVTPI 284 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 59/169 (34%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D ++ D+ G+T Sbjct: 829 DFMIGLVKKADVGKNRVRFGALKYADDPEVLFYLGD--FDTKLEVISVLQNDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 ++A+ + E + N + ++++TDGE+ ++ + +GI Sbjct: 887 TSEALGFSDHMFT-----EAQGSRLNKGVPQVLIVITDGESHDADKLNAT-AKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + +F + L IF D + Sbjct: 941 LVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVTAS 984 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 69/212 (32%), Gaps = 21/212 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L+ V+ Sbjct: 410 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKSFLSEVVGMFNIAP 469 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 470 HK---VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSL 524 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 525 LQKAKQ------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGV---- 571 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 +E + L A + + L I Sbjct: 572 --KEANKTQLREIAGEEKRVYYVHDFDALKDI 601 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 48/132 (36%), Gaps = 10/132 (7%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ D + VR+GA F+ + G + I + G+T I Sbjct: 1022 LVSVVQDFDVSLNRVRIGAAQFSHNYRQEFPL--GTFIGEKEISFQIENIQQLGGNTHIG 1079 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 DA++ + N + +++LTDG Q +E + +GI I Sbjct: 1080 DALRQVGHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEALRHRGIDI 1131 Query: 401 MTIAFSVNKTQQ 412 ++ QQ Sbjct: 1132 YSVGIGDVDDQQ 1143 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 21/177 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S+ +D + +R+G +++ S S G+ ++ + + + A A Sbjct: 254 SVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEILRNIQNLSPQTGNAYTGA- 309 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + + + N + VL+T + + +G+ I T+ Sbjct: 310 --AIKKLRKEVFSARNGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVAIFTL 364 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFERVI 454 + Q L AS + + + +L F ++ N+I V Sbjct: 365 GIEGARDSQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVS 415 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 40/123 (32%), Gaps = 20/123 (16%) Query: 319 LIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + ++ + +G +T A+ + +K+++L+TDG Sbjct: 679 MSQSDISNAIDQMVHIGETTLTGSALSFVSQYFSPTK-------GARPNVRKFLILITDG 731 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTH 435 E + + +GI I ++ F N TQ L + P F + Sbjct: 732 EA---QDVVKEPAVALRQEGIIIYSVGVFGSNVTQ-------LEEISGRPEMVFYVENFD 781 Query: 436 ELN 438 L Sbjct: 782 ILQ 784 >gi|290981305|ref|XP_002673371.1| predicted protein [Naegleria gruberi] gi|284086954|gb|EFC40627.1| predicted protein [Naegleria gruberi] Length = 353 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 60/173 (34%), Gaps = 20/173 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 IDN+ + R+G F+D+ + + V L + +K + E GST MQ Sbjct: 20 IDNLREFERLGIVLFDDKAETFLPLTI-VQDLDKKSLKETVLKIKEKGSTNFEAGMQRGI 78 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMT--IA 404 D + ++L I+ LTD + + I K + G I + I Sbjct: 79 DLFST-------LDSSDLSNSNRIIYLTDACPNVGGTDTLDILTKDANSGPYNIFSTFIG 131 Query: 405 FSVNKTQQEKAR-YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ + C ++F ST + F + + V I Sbjct: 132 IGLDFNSDIVEELTQVRGC----NYFSVKSTED----FTKILNQDFNYIVTPI 176 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 57.3 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 59/185 (31%), Gaps = 27/185 (14%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 V++A ++ +++ D R+ F+ + + +K Sbjct: 62 TVKEAAIQLVDQLREGD------RLSVIAFDHKAKVIVP---NQDVTDKAHIKAQIDRLE 112 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICN 391 G T I+D ++ + SS + + +LTDGEN DN + I Sbjct: 113 AAGGTCIDDGIKLGLQELASSPGKRAAQ----------VFMLTDGENEHGDNGRCLEIAA 162 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEI 449 A G+ + ++ F N + L A S + ++ F + Sbjct: 163 VAAEHGVTLNSLGFGENWN-----QDVLEKIADAANGSLAYIETPNQALTEFERLLQRAQ 217 Query: 450 FERVI 454 + Sbjct: 218 SVGLT 222 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 60/185 (32%), Gaps = 23/185 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +I ++ D R+ F+ R +K Sbjct: 59 LETVKQAAVGLIERLQPDD------RLSIVAFDHRAKV---LVRNQPMGNLDQIKRKINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ +I + +D V + + LLTDGEN +NE I + Sbjct: 110 LGADGGTAIDEGLKLGVKELIKAKQDTVSQ----------VFLLTDGENEHGNNESCIKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 A + I ++ F N Q + + A+ + + + F Sbjct: 160 AELAAENNLTINSLGFGANWNQDILEKIA--DIATGSLSY-IEEPEQALSEFARLFNRMQ 216 Query: 450 FERVI 454 + Sbjct: 217 SVGLT 221 >gi|4039081|gb|AAC97485.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 48/165 (29%), Gaps = 15/165 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + S+ VD S V L +I S+ DT+ + F + Sbjct: 39 CNESVDLYLLVDGSGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTE 95 Query: 309 DPSFSWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G R + G+T + A+ + N Sbjct: 96 LIRLGSGQSIDKRQALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NRE 147 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +A + ++L+TDG + + NK K + + + I Sbjct: 148 KAIQLVILMTDGVPNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 191 >gi|4039079|gb|AAC97484.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 48/165 (29%), Gaps = 15/165 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + S+ VD S V L +I S+ DT+ + F + Sbjct: 39 CNESVDLYLLVDGSGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTE 95 Query: 309 DPSFSWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G R + G+T + A+ + N Sbjct: 96 LIRLGSGQSIDKRQALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NRE 147 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +A + ++L+TDG + + NK K + + + I Sbjct: 148 KAIQLVILMTDGVPNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 191 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 69/206 (33%), Gaps = 24/206 (11%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 Y L PG P +D+S + A ++I + R+ Sbjct: 102 YQLNPGSAKPVKDIVLVIDNSGSMNETDPNQDRYTAAKNLINRMD------RDNRVSVMV 155 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISSNEDEVHR 360 F+ F+ ++ + + G T I+ A++ I S + Sbjct: 156 FDHATTLLQPFTRVKNQETKDEIIAEIDGLATNDGGTDISLALEDTMSHIQESRDAGRSA 215 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +++L+DG + D++ +A + K Q I + TI S+ L Sbjct: 216 ---------MVIMLSDGFSETDHDRVLA---EYKQQQIAVNTIGLSLVNPD---GAQLLQ 260 Query: 421 NCA--SPNSFFEANSTHELNKIFRDR 444 A + +++ +L+ +F+ Sbjct: 261 TIAAETGGQYYDVQHAEDLSFVFQKI 286 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 51/172 (29%), Gaps = 33/172 (19%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+G F R + L + + + TA+ DA+ + + Sbjct: 128 RKQDRIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDTNSVED--GTALGDAILAGVNRLR 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT- 410 + ++L+TDG N E AK+ GIR+ I + Sbjct: 186 -----------QRPAEGRALILITDGRNNAG-AEPQLAAQAAKALGIRVHAIGIGSRGSA 233 Query: 411 ----------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + +FEA L+++F + Sbjct: 234 VIPVPSPLGGTIYRRLDADLDAATLKGVAEITGGRYFEAGDATVLSRVFAEI 285 >gi|32475885|ref|NP_868879.1| hypothetical protein RB9427 [Rhodopirellula baltica SH 1] gi|32446428|emb|CAD76256.1| hypothetical protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 484 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 45/170 (26%), Gaps = 24/170 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ID ++D + ++ V + R+ +K STA+ + Sbjct: 113 IDRLSDDDIVSVVLYDSNVTVLVPAT---KATDRSSIKQKIRGIQAGSSTALFAGVSKGA 169 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC----NKAKSQGIRIMTI 403 + R E ++LL+DG + + + K I + T+ Sbjct: 170 AEV---------RKFLADEQVNRVILLSDGLANVGPKSPQELEGLGRSLMKEA-ISVSTL 219 Query: 404 AFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFE 451 + AS + L +F + Sbjct: 220 GLGSGYN-----EDLMVALASVGGGNHAFIEDADSLVSVFNQEFDGLLSV 264 >gi|115622698|ref|XP_001202504.1| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] gi|115631776|ref|XP_785188.2| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] Length = 338 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 46/141 (32%), Gaps = 13/141 (9%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 K G T + D ++ A D ++ E I++LTD Sbjct: 5 TSDNIAAAKRHVNRIRAGGGTNLYDGLRNAVDLLMEHGNGEAMP---------LIIMLTD 55 Query: 377 GENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 G+ T + + + + + +++F + LSN A +E + Sbjct: 56 GQPTSGSVKSTSEIIQRITNLIDGRLSLFSVSFGNGVDFSFLEKLSLSNQALARKVYEDS 115 Query: 433 STHELNKIFRDRIGNEIFERV 453 S K F D + N + + Sbjct: 116 SASLQMKGFYDEVANPLLFNI 136 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL ++ D G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VISVLQNDHPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ A + +GI Sbjct: 887 TAEALAFSNHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNATAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANSWELLAMAGSGDKYYFVETFGGLKGIFSDVSAS 984 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGSTQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYSNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G Sbjct: 497 DLGKAIENIRQMGGNTNTGAALNFTLKLLQ------RAKKERGSKVPCHLVVLTNGM--- 547 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + + +K + + IR+ I +E + L A + + L I Sbjct: 548 SRDSVLGPAHKLREENIRVHAIGV------KEANQMQLREIAGEEKRVYYVHDFDALRNI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D + +R+G +++ S S G +T V D + A Sbjct: 251 LGESISALDIKENCMRVGLVTYSNETRVISSLSTG---NNKTEVLQRIQDLSPQVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T + + +G+ I Sbjct: 308 GA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVTI 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN 426 T+ L AS Sbjct: 362 FTMGIEG------ANPDELEKIASHP 381 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 1012 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 1071 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 1072 IF--GYTHIGDALRKVKYYFQPDT-----GSRINAGTPQVLLVLTDG---RSQDEVAQAA 1121 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 1122 EELRHKGVDIYSVGIGDVDDQE 1143 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQSDIANAIDRMTHIGETTLTG------SALTFVSQYFSPDKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVRDPAL---ALRKEGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 59/173 (34%), Gaps = 20/173 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I I +D + R+G ++ +V + S R ++ T Sbjct: 57 MIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFF---TRAEMERAINSIVPLAQGTMT 113 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+Q A + + E K + +++TDG + + + A++ GI Sbjct: 114 GLAIQYAMNVAFTVQEGARPPHKKIP---RIAIIVTDG---RPQDRVSEVAAHARNAGIE 167 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 I + Q L ASP F S EL + F + +++ Sbjct: 168 IYAVGI------QRADMNSLRAMASPPLEEHVFLVESF-ELIQQFGKQFQDKL 213 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 59/173 (34%), Gaps = 20/173 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I I +D + R+G ++ +V + S R ++ T Sbjct: 57 MIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFF---TRAEMERAINSIVPLAQGTMT 113 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+Q A + + E K + +++TDG + + + A++ GI Sbjct: 114 GLAIQYAMNVAFTVQEGARPPHKKIP---RIAIIVTDG---RPQDRVSEVAAHARNAGIE 167 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 I + Q L ASP F S EL + F + +++ Sbjct: 168 IYAVGI------QRADMNSLRAMASPPLEEHVFLVESF-ELIQQFGKQFQDKL 213 Score = 45.4 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 57/201 (28%), Gaps = 22/201 (10%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA-------LASVIRSI- 285 VS S + + G + + + ++ + + + I Sbjct: 313 VSSEGSFHCVCPEGQQLQADGKTCSKCGAGHVDLVMVIDGSKSVRPQNFELVKQFVNRIV 372 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ++ D +G ++ RV ++ + I E T A++ Sbjct: 373 DLLEVSPDGTHVGLVQYSSRVRTEFPL--NKYHSAEEIKAAVMKMEYMEKGTMTGLALKH 430 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + S E + ++ TDG +QD+ +AK GI + + Sbjct: 431 MVEHSFSELEGARPLS---HNVPRIGLVFTDGR-SQDDI--SEWAQRAKESGIVMFAVGV 484 Query: 406 SVNKTQQEKARYFLSNCASPN 426 + L AS Sbjct: 485 G------KAVEEELRAIASEP 499 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 56/161 (34%), Gaps = 18/161 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +N+ R G FN S + L+ + + G T ++ M+ Sbjct: 88 DYAENMLPGDRGGIISFNTEADMLQEMSDNRYDLLDALSAL----PDPSGGTDLSQGMRA 143 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + + + K+ +VL+TDG +T + E +A+ GI I T+ Sbjct: 144 ANEQFVQTKGAN----------KQIMVLITDGADTINLAEVYNQVREARMNGITIFTLGL 193 Query: 406 SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T L + A + + + + + + +D Sbjct: 194 GSLATG--LDEALLQDIADQTRGQYRQVPNATVIESVLQDI 232 >gi|159900699|ref|YP_001546946.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893738|gb|ABX06818.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 828 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 51/152 (33%), Gaps = 19/152 (12%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D++ +G F+D +F+ + + ++ + G T I A++ Sbjct: 417 TDSLGADDTIGVLAFDDTNDWTVTFTKVGQGVQLSEIQNNIAGLSAGGGTDIYAALEVGM 476 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + H VLLTDG + ++ ++QGI + TIA Sbjct: 477 GGLAQQTGKVRHA-----------VLLTDGR-SGGESSYESLIAPLRAQGITLSTIAIGG 524 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + L + A + A+ +L Sbjct: 525 DAD-----TVLLESLAKLGAGRYHFASRPDDL 551 >gi|73973418|ref|XP_532180.2| PREDICTED: similar to alpha 1 type XXI collagen precursor [Canis familiaris] Length = 961 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 54/163 (33%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G H + +V G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSHDSGKNLVAAMESIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 44/122 (36%), Gaps = 14/122 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + ++ D TAIN+A+Q A Sbjct: 128 DRVGIIAYAGSAFPQLPITTDYASA-KMFLQNMNTDMLSSQGTAINEAIQLAKTYYND-- 184 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + + +++DGE+ + + I +A +GIRI TI K + Sbjct: 185 ---------DEQTNRVLFIISDGEDHEGDSV--NIAEEASEEGIRIFTIGVGTTKGGRIP 233 Query: 415 AR 416 + Sbjct: 234 IK 235 >gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 786 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 39/277 (14%), Positives = 75/277 (27%), Gaps = 21/277 (7%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 + + + I+ + T+ S + S +D R + Sbjct: 201 MATARYTNTPIEKVSLEATIESSIAIKSVYSPTHAVDVKRPDEKHATVKFEASNYLPTTD 260 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + + G +D + FV S Sbjct: 261 FRLLYDVGDAPLAASVLSYRPDNSDEGFFLMLASPNHSQGEVDLTKKTVIFVVDRSGSMQ 320 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 KK R+A+ V+ ++ + D ++ V S R + Sbjct: 321 GKKIEQAREAMRYVLNNLHEGDT------FNIVAYDSTVESFKPELQKFDDATRKSALAY 374 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 GST I+ A+ +A+ + + YI+ LTDG T Sbjct: 375 VDGLYAGGSTNISGALDSAFAMLT------------GSDRPNYILFLTDGLPTAGETNEG 422 Query: 388 AICNKAKSQGI---RIMTIAFSVNKTQQEKARYFLSN 421 I AK + + R++ + + R N Sbjct: 423 KIVELAKQKNVHRARMINFGVGYDVNSRLLDRMSREN 459 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 57/172 (33%), Gaps = 25/172 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + ++A S++ D R+G F+ F L ++ Sbjct: 411 SKFTMAKEAAIMATESLRAED------RIGVLAFDVSTRWVVDFQPVGTGLSLADIQRRI 464 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGI 387 G T I +A+QT + H VLLTDG T D + Sbjct: 465 STLPLGGGTDIYNALQTGLPELARQPGRVRHA-----------VLLTDGRSFTDDRQAYQ 513 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 A+ +A+S+ I + TIA + L A ++ A ++ Sbjct: 514 ALIEEARSRNITLSTIAIGTDAD-----IDLLQTLARWGAGRYYFAAEPGDI 560 >gi|219850571|ref|YP_002465004.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544830|gb|ACL26568.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 59/196 (30%), Gaps = 36/196 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V+ A ++ + D FNDR S R +K Sbjct: 60 RLQQVKQAAMQILDLLG------DNESFALVTFNDRAEVVVSSQLARA---RAEIKRQIS 110 Query: 330 DENEMGSTAINDAMQTAYDTII-SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T + + + + +HR ++LLTDG D + Sbjct: 111 AIEAAGGTEMATGLALGVQELQRAMMPRAIHR----------LLLLTDGRTYGDESRCVE 160 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFR--- 442 I +A+++GI I + L A + + + S ++ KIF Sbjct: 161 IARRAQARGIGITALGIGSEWN-----EDLLETIAARENSRTHY-ITSAADITKIFTAEV 214 Query: 443 ----DRIGNEIFERVI 454 ++ R+ Sbjct: 215 ERMHSIFAQDVQVRLA 230 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 17/150 (11%), Positives = 39/150 (26%), Gaps = 11/150 (7%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS- 71 LI S G ++ AL+M V + +++D+ L A + L Sbjct: 2 SLINSQKGTVIVVVALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDDS 61 Query: 72 -LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 E + S + + + + V Q Sbjct: 62 QQAETVALDYASRNGLDNNVTVEITDDDHQIGLSGSKQVGMNFAVIFGID---------Q 112 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 V +++ + ++ ++ Sbjct: 113 VEVAASSKARVGHVTAVTGAVPFGVVSQNF 142 >gi|326628521|gb|EGE34864.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 596 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 36/318 (11%), Positives = 91/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 114 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 173 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 174 NYFPALQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQVRD 231 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L++ AL ++ ++ DN+ G T Sbjct: 232 LPPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 288 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + + Sbjct: 289 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGVNR------ 334 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + I K + +GI + T+ + + Sbjct: 335 -ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 391 Query: 427 SFFEANSTHELNKIFRDR 444 ++ S E K+ +D Sbjct: 392 NYSYIYSLSEAQKVLKDE 409 >gi|194215132|ref|XP_001499245.2| PREDICTED: similar to Collagen alpha-1(XXII) chain [Equus caballus] Length = 1632 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 65/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + V+ +A+++ + + D R+G ++D+ + Sbjct: 46 YDLVFLLDTSSSVG-KEDFEKVQQWVANLVDTF---EVGADHTRVGVVRYSDQPTTAFEL 101 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G + + G+T DA++ S + K+ + Sbjct: 102 --GHFRSREAVKAAARRLAYHGGNTNTGDALRYITRHSFSPQAGGRP---GDRAFKQVAI 156 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + E + L AS F Sbjct: 157 LLTDG---RSQDLVLDAAATAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHIF 207 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 208 HVSDFNAIDKI 218 >gi|120601101|ref|YP_965501.1| hypothetical protein Dvul_0050 [Desulfovibrio vulgaris DP4] gi|120561330|gb|ABM27074.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 389 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 51/160 (31%), Gaps = 9/160 (5%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K+L+ G+ +I AL +LG+G M VD+ +++ A+ Sbjct: 2 SLLKRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQK--------AADIA 53 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR-DIVRDTAVEMNPRKS 127 + + + + + + N + R+ + A+ + + Sbjct: 54 ALAGAQALINSSGNTDMARTQAITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMN 113 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 V + + + +F G + A AE V Sbjct: 114 RIDVHIRRNAEAGNSVGLIFANLFGDSYSDLTVMARAEAV 153 >gi|46581745|ref|YP_012553.1| hypothetical protein DVU3344 [Desulfovibrio vulgaris str. Hildenborough] gi|46451168|gb|AAS97813.1| hypothetical protein DVU_3344 [Desulfovibrio vulgaris str. Hildenborough] gi|311235373|gb|ADP88227.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 389 Score = 56.9 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 51/160 (31%), Gaps = 9/160 (5%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K+L+ G+ +I AL +LG+G M VD+ +++ A+ Sbjct: 2 SLLKRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQK--------AADIA 53 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR-DIVRDTAVEMNPRKS 127 + + + + + + N + R+ + A+ + + Sbjct: 54 ALAGAQALINSSGNTDMARTQAITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMN 113 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 V + + + +F G + A AE V Sbjct: 114 RIDVHIRRNAEAGNSVGLIFANLFGDSYSDLTVMARAEAV 153 >gi|300856050|ref|YP_003781034.1| hypothetical protein CLJU_c28840 [Clostridium ljungdahlii DSM 13528] gi|300436165|gb|ADK15932.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528] Length = 297 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 40/298 (13%), Positives = 89/298 (29%), Gaps = 41/298 (13%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 + G+ II+ LL+ +LG ++D+ + L A + + A++ L + Sbjct: 5 NDRGNVAIISCLLITALLGFTAYVLDIGMIYIEKTKLTNAIDSGALAAALELPDNEVRA- 63 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + +YL +N + + + + Q+ Sbjct: 64 ----------RTAAVDYLQKNNVD--------------PSLALITVGADHKSIQI----- 94 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + + N LF + +GI S I+ K +A + Sbjct: 95 -EEVKNVKHLFAQIIGINSSNIKAKTKAVVAPAKSVTGGIRPFAVEVYK-------FSYG 146 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP---LDPSL 253 + L S G G + Y S S+ + P Sbjct: 147 DLVTLKEDAGDGYSGNYGAVSLGGSGGSVFRANALYGYSGTISVGDYIDTEPGNMAGACN 206 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 +++++S + +R + ++ S+ + V A F+ + V ++ Sbjct: 207 DIKNYINSEHSTFDNFPRDSIRLWIMPLVDSLAVSGQKSVLVVGFAAFYVENVTNNSG 264 >gi|170681089|ref|YP_001744470.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218700745|ref|YP_002408374.1| hypothetical protein ECIAI39_2418 [Escherichia coli IAI39] gi|170518807|gb|ACB16985.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218370731|emb|CAR18544.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 588 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 41/429 (9%), Positives = 120/429 (27%), Gaps = 30/429 (6%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 II L+ L + G + + ++ A I +E R+ + Sbjct: 6 IIMLLMSS--LILSGCGPESENKESLQQQPSTPTDQQVLAAQHAAI---KEAEQRSVAAK 60 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 K + + + K ++ + S ++RY + Sbjct: 61 ATADAKAKALAQQEAQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDN 120 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + + ++ + + R + + + + + D + Sbjct: 121 PVKQVAQNPLATFSLDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSN 180 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 +P + + + + + P+ + +D+S Sbjct: 181 NKEPVPASKP--IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSG 238 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 ++ L++ +L +++ +++ DN+ + G H + Sbjct: 239 SMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIALPSISGSH---KAE 289 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + GST ++ AY ++ + I+L TDG+ Sbjct: 290 INAAIDSLDADGSTNGGAGLELAYQ--QAAKGFIKGGINR-------ILLATDGDFNVGI 340 Query: 384 EEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 ++ ++ K + G+ + T + + ++ ++ E K+ Sbjct: 341 DDPKSIESMVKKQRESGVSLSTFGVGDSNYNEAMMVRI--ADVGNGNYSYIDTLAEAQKV 398 Query: 441 FRDRIGNEI 449 + + Sbjct: 399 LNSEMRQTL 407 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 32/270 (11%), Positives = 72/270 (26%), Gaps = 51/270 (18%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + G + + ++ D S ++ +++A Sbjct: 163 KKNDNGSYTVNVDVKGAVNSTTVTTTQPIDFTLVLDVSGSMDDPMSKTDRTRRLDALKEA 222 Query: 278 LASVIRSIKKIDN--VNDTVRMGATFF--------NDRVISDPSFSWGVH------KLIR 321 + + + + ++ V +G F D + +++ Sbjct: 223 VKAFLDEAANTNTEAGSELVHVGLVKFAGDKTDKIGDDMYRSGGYTYNYSQIVSNLTADM 282 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 +K G+T ++ A + +AKK ++ DG T Sbjct: 283 NGLKNKVSKLKAAGATRADNGFNRAVKVM--------GSASARTDAKKVVIFFADGSPTS 334 Query: 382 DNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKAR---YFLSNCAS------- 424 + + NKA K G + +I + + F+ +S Sbjct: 335 SSGFEGKVANKAVEAAKELKDGGAAVYSIGIFASANPSSLSSNENQFMHAVSSNFPKATK 394 Query: 425 ----------PNSFFEANSTHELNKIFRDR 444 + A + ELN IF + Sbjct: 395 YNQLGEGNIEAGYYKSATNASELNTIFDEI 424 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 24/206 (11%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 Y L PG P +D+S A ++I + R+ Sbjct: 102 YQLNPGSAKPVKDVVLVIDNSGSMKDTDPNQDRYTAAKNLINRMD------RDNRVSVIV 155 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ F+ ++ I+ + G T I+ A++ I S + Sbjct: 156 FDHATTLLQPFTRVKNQEIKDEIMAEIDGLATTDGGTDISLALEDTMSHIQESQDAGRSA 215 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +++L+DG + D++ +A K Q I + TI S+ L Sbjct: 216 ---------MVIMLSDGFSETDHDRVLA---DYKQQQIAVNTIGLSLVYKD---GANLLQ 260 Query: 421 NCA--SPNSFFEANSTHELNKIFRDR 444 A + +++ + +L+ +F+ Sbjct: 261 TIAAETGGQYYDVQNAADLSFVFQKI 286 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 57/193 (29%), Gaps = 30/193 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ +I+ + ID R+ F+ + ++ Sbjct: 78 KIEKLKAAILFMIKKLSSID------RLSIVTFSRDARRLCPLRQITENSQKD-LENLIN 130 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G+ I +QT + +++ + +N D+ Sbjct: 131 GLHAYGAANITAGLQTGLKVLNDRRFTGGRVAT-------IMLVSSSEQNNGDD------ 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIG 446 ++ + + T F L A +F + + LNK F + Sbjct: 178 ADQILVGNVPVHTFGFGAY-----HEPGVLKAIAHNSIGGTFSDVQNMDNLNKAFSQCLA 232 Query: 447 NEIFERV--IRIT 457 + V +++T Sbjct: 233 GLLTIVVQDLKLT 245 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL ++ D G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VISVLQNDHPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ A + +GI Sbjct: 887 TAEALAFSNHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNATAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANSWELLAMAGSGDKYYFVETFGGLKGIFSDVSAS 984 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGSTQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYSNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G Sbjct: 497 DLGKAIENIRQMGGNTNTGAALNFTLKLLQ------RAKKERGSKVPCHLVVLTNGM--- 547 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + + +K + + IR+ I +E + L A + + L I Sbjct: 548 SRDSVLGPAHKLREENIRVHAIGV------KEANQMQLREIAGEEKRVYYVHDFDALRNI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D + +R+G +++ S S G +T V D + A Sbjct: 251 LGESISALDIKENCMRVGLVTYSNETRVISSLSTG---NNKTEVLQLIQDLSPQVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T + + +G+ I Sbjct: 308 GA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVTI 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN 426 T+ L AS Sbjct: 362 FTMGIEG------ANPDELEKIASHP 381 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 1012 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 1071 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 1072 IF--GYTHIGDALRKVKYYFQPDT-----GSRINAGTPQVLLVLTDG---RSQDEVAQAA 1121 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 1122 EELRHKGVDIYSVGIGDVDDQE 1143 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQSDIANAIDRMTHIGETTLTG------SALTFVSQYFSPDKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVRDPAL---ALRKEGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|205353430|ref|YP_002227231.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205273211|emb|CAR38174.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 499 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 36/318 (11%), Positives = 91/318 (28%), Gaps = 25/318 (7%) Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V+ ++RY+ + + ++ + + R + + + + + Sbjct: 17 NVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEML 76 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + +P + + + + + Sbjct: 77 NYFPALQPVADKQDNTKPIAACIPMPFAVKY--ELAPSPWNAQRTLLKVDVQARDMQVRD 134 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D+S ++ L++ AL ++ ++ DN+ G T Sbjct: 135 LPPANLVFLIDTSGSMQPAERLPLIQSALKLLVNDLRAQDNITIVTYAGGTHVALAS--- 191 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 T +K + + GST ++ AY+ + + Sbjct: 192 ------TAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE--QAEKGFIKGGVNR------ 237 Query: 370 YIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L TDG+ + I K + +GI + T+ + + Sbjct: 238 -ILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIAD--VGNG 294 Query: 427 SFFEANSTHELNKIFRDR 444 ++ S E K+ +D Sbjct: 295 NYSYIYSLSEAQKVLKDE 312 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 41/342 (11%), Positives = 90/342 (26%), Gaps = 38/342 (11%) Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH-----KEHGVSIQWVIDFSRS 188 +S + + L + G A+T + S KE D + Sbjct: 64 TSGTESSTDGTDEILNTEGSPENGSGNGKTADTGAVSDAEVTPEKEAATEGITEQDVPEA 123 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + L+ + L+ + +++ M+ Sbjct: 124 ENVGAGNPVAVHLSRGAASEAAPEHQKYIKKNAENDYTLTLNVKGMYDSETTKPMIDVLL 183 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + ++ + + +++ + I A Sbjct: 184 IVDKSGSMNWKMDTDKVGKPSRMDVLKQVVTGTGGLTDSIFGNTQIDAQMAVVTYSGSND 243 Query: 309 DPSFSWGVH------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + V + G T ++T + Sbjct: 244 FLDQRYDDAEIIQEWTKQKDTVNNAVNNIQAKGGTNCEAGLRTGATAL----------EG 293 Query: 363 NNLEAKKYIVLLTDGENT----------------QDNEEGIAICNKAKSQGIR-IMTIAF 405 + AKK+++ L+DG+ T AI K G+ TI Sbjct: 294 SRENAKKFVIFLSDGDATFYYGDDGYTKGPGSGSSPTAREKAIAQVQKITGLEGFYTIGM 353 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + + + + ++ AS F+ AN+T L K F++ +G Sbjct: 354 TSSSSSEFLTNLANNSKASEKRFYPANNTEALEKAFQEIVGE 395 >gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex] Length = 829 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 28/133 (21%), Positives = 44/133 (33%), Gaps = 12/133 (9%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K F + ST INDA+ A +++ + I+ LTDGE T Sbjct: 346 KEFVAAMDVTSSTNINDALLLALK--------NSQSVQSRVRLTPIIIFLTDGEPTASVT 397 Query: 385 EGIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + I + + I +AF Q + N +EA K Sbjct: 398 DTTEILKNVRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKG 457 Query: 441 FRDRIGNEIFERV 453 F D + + + V Sbjct: 458 FFDEVASPLLSNV 470 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 41/121 (33%), Gaps = 15/121 (12%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G T I +++ AY + +K+I+LLTDG++ Sbjct: 463 DPKKAADKILSITPGGGTEIYRSLEQAYTELED-----------LELQRKHIILLTDGQS 511 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + N + A+ K I + T++ + + + F++ + Sbjct: 512 STSN-DYDALIENGKDHNITLSTVSIGQDADRNLLEQL---AGTGSGRFYDVTDATTIPA 567 Query: 440 I 440 I Sbjct: 568 I 568 >gi|118088945|ref|XP_419902.2| PREDICTED: similar to collagen XXI [Gallus gallus] Length = 964 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 62/172 (36%), Gaps = 23/172 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + D +++G ++D + + G H+ ++K G+T Sbjct: 67 LVNITRNFDIGPKFIQVGVVQYSDYPVLEIPL--GTHESTENLIKEMESIHYLGGNTKTG 124 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q AYD + K++ K V+LTDG + +E + +A+ I + Sbjct: 125 RAIQFAYDHLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDVAAEARKNKITL 172 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEI 449 I E L A S F +++ ++ I ++ Sbjct: 173 FAIGVGS-----EIEEDELKAIANKPSSTYVFYVEDYIAISR-IKEVIKQKL 218 >gi|118085865|ref|XP_418677.2| PREDICTED: similar to collagen, type XXVIII [Gallus gallus] Length = 1144 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 56/194 (28%), Gaps = 22/194 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + I ++ + T R+G F+ +V S + ++ Sbjct: 788 PDNFNSTKTFMKTVIDEVSANHATTRIGIINFSHKVELVSSLE--TYTTKESLKSAVDKM 845 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAI 389 T A++ A ++ K +++TDG+ + +D + Sbjct: 846 LYLGEGTYTASAIKKAISLFQAARPAVR----------KVALVVTDGQADNRDKVHLDLV 895 Query: 390 CNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIF 441 +A + I I I + + + A+ F++ L Sbjct: 896 VKEAHAANIEIFVIGIVQKTDPHYHNFLKE-MHLIATDPDEEHFYQIEDFKTLSALADKL 954 Query: 442 RDRIGNEIFERVIR 455 +I + E R Sbjct: 955 ITKICENVSEIYSR 968 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 45/138 (32%), Gaps = 14/138 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 L ++ + S+ +I ++ V V++ F+ V D F+ + Sbjct: 63 FDLQKNFVLSLTDNIFQMKPVKSQNYNVKLAGMQFSSTVSIDHPFTAWKNVQNFKEKIRA 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + T A+ A + + K L+TDG + ++ Sbjct: 123 LVYI--GQGTYSYYAISNATQLFKTEGRERSI---------KVAFLMTDGVDHPNSPNVQ 171 Query: 388 AICNKAKSQGIRIMTIAF 405 I A+S GI TI Sbjct: 172 GIATAARSLGIHFFTIGL 189 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 65/201 (32%), Gaps = 31/201 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + D +G F L I + Sbjct: 106 DRLTAAKEAARRFVDGL------PDEFNVGLVAFAGSAAVLVPPDTDREALDEGIDRL-V 158 Query: 329 IDENEMGSTAINDAMQT---AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + TAI +A+ T A + + + IVLL+DG NT + Sbjct: 159 EGATGVQGTAIGEAINTSLGAVKALDGEAAKDPPPAR--------IVLLSDGANTSG-MD 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSFFEANST 434 + A + + + TIAF ++ L A + F EA+S Sbjct: 210 PMEAATDAVAMDVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDEVARETGGQFHEADSA 269 Query: 435 HELNKIFRDRIGNEIFERVIR 455 EL ++ D IG+ + R R Sbjct: 270 KELRAVYDD-IGSSVGYRTKR 289 >gi|255526265|ref|ZP_05393182.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186259|ref|ZP_06854663.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510045|gb|EET86368.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049060|gb|EFG88490.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 580 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 59/162 (36%), Gaps = 16/162 (9%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D V+ + + + +K A++ T + A + + Sbjct: 77 LVAYSDNVVREHNLDTINSNDDKERIKNMALNIPLGQKTDTGAGILEAVNLM-------- 128 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN------KAKSQGIRIMTIAFSVNKTQQ 412 ++ K I+LL+DG+N + ++ + K +G + TI + + T Sbjct: 129 -NSGHDKNHKPVIILLSDGKNDPQRKTEDSLKDLKSSISTCKDKGYPVYTIGLNYDGTVD 187 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +SN ++ ++ +L KI D + +V Sbjct: 188 KTQLEEMSNETKGKNYIT-STAADLPKILTDIYADNSKLKVQ 228 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 56.9 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 73/261 (27%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F S+ + N + + S + + + ++ Sbjct: 237 SIMFMPSLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPT 296 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + + A + I+ I+ + +G Sbjct: 297 HPTFSLLKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYL--IQVIEKGS---LVGMVT 351 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + + G T+I ++ + II S++ Sbjct: 352 FDSVAEIQNHLTRITDDNVYQKITAKLPQV-ANGGTSICRGLKAGFQAIIHSDQSTSGSE 410 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ + N C K G I TIA K LS Sbjct: 411 ---------IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALG---PSAAKELETLS 453 Query: 421 NCASPNSFFEANSTHELNKIF 441 N FF L F Sbjct: 454 NMTGGYRFFANKDITGLTNAF 474 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 32/144 (22%), Positives = 52/144 (36%), Gaps = 19/144 (13%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + T + + G T+I + ++ + I SN+ + Sbjct: 348 MVTFDSYAQIQNNLIKITED-NTYQKITANLPQEASGGTSICNGLKAGFQAISQSNQSTL 406 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARY 417 I+LLTDGE+ Q I++C + + G I TIA EK Sbjct: 407 GSE---------IILLTDGEDNQ-----ISLCFEEVRQSGAIIHTIALG---PSAEKELE 449 Query: 418 FLSNCASPNSFFEANSTHELNKIF 441 LSN + F+ N + L F Sbjct: 450 TLSNMTRGHRFYAHNDINGLIDAF 473 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 38/95 (40%), Gaps = 17/95 (17%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T N A+Q A + + + + K +VLL+DG++ + + Sbjct: 111 SGKTNTNGALQEARNIF----------VGSRQDVHKVVVLLSDGQSNTG-GDPTTTAEEL 159 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 + G+ I TIA + + L++ A+ + Sbjct: 160 RQDGVEIFTIAIGLF------NKDELNSIATSDHH 188 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + +D+S+ +++ + S++ D F+ Sbjct: 88 KTIKKSCPIDIAFLLDASASMGRRTW-GKIKNYVKSIVDMGDISDQGTHVGI---ITFST 143 Query: 305 RVISDPSFS-WGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + D F + K+ +K + + G T I+ A+ A ++ + Sbjct: 144 DPVIDIPFDKYKGVKMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFT------QEAG 197 Query: 363 NNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +++K VL++DG T+D I N+ K +G+++ T+ + E L Sbjct: 198 MREDSQKVAVLMSDGIQTKDRGPFTPTIIAANRLKMKGVQVYTVGIGASIDVLE-----L 252 Query: 420 SNCASPN 426 AS + Sbjct: 253 MWIASED 259 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 84/265 (31%), Gaps = 24/265 (9%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 S + I F +S+ + + N + +Y S + D+ + + Sbjct: 233 STRESIMFMQSLDSVTKFCTAKTHNTEAPNLQNKMCNYRSTWDVIMDSDDFQNASPMPGT 292 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + + +D S + + A + I + +++ + Sbjct: 293 NLPPHPTFSLLKSKQRVVCLVLDKSGSMDSDDRLLRMNQAAELYLIQIIEKESL-----V 347 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G F+ + + + + G T+I + ++ + I+ SN+ Sbjct: 348 GMVTFDSSAEIQNNLTKITDD-NAYQNIIAKLPQFAGGGTSICNGLKAGFQAIVYSNQST 406 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKAR 416 I+LLTDGE+ Q ++ C +AK+ G I +IA K Sbjct: 407 SGSE---------IILLTDGEDNQ-----MSSCFEEAKASGAIIHSIALG---PSAAKEL 449 Query: 417 YFLSNCASPNSFFEANSTHELNKIF 441 LSN F + L F Sbjct: 450 ETLSNMTGGLRFSANKDINGLIDAF 474 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 63/158 (39%), Gaps = 13/158 (8%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +KK D +D V+ GA ++ + + I++ D++ +T Sbjct: 823 MISLVKKSDVSHDRVQFGAVKYSAEPETFFYL--NRYTTKSAIIRAIQNDKSIGETTYTA 880 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ + E H + + + ++++TDG+ + D E + K ++ GI I Sbjct: 881 KALRHSEGLFS-----EEHGSRKHRGVPQVLIVITDGD-SHDAAELDEVSKKLRANGIII 934 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 I + + L+ S + +F N+ L Sbjct: 935 YAIGIE-----RARPDELLTMAGSEDKYFYVNTFEGLK 967 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 53/162 (32%), Gaps = 20/162 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + K D D + +GA F+ R + S + I++ ST A+Q Sbjct: 640 VNKSDIGLDRMHVGAIQFSSRNKEEFRLSQ--YATKSDIIRAIGRMSLMGQSTLTGGALQ 697 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 D + KK ++L+TDGE D + + +GI + ++ Sbjct: 698 FVLD-------YFRPIKGSRPYVKKILILITDGEAQDDVKTP---AEALRQEGIIVYSVG 747 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 R L + P F ++ K + I Sbjct: 748 V------FNANRTQLVEISGKPEMVFYVEDF-DILKHLENEI 782 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 61/192 (31%), Gaps = 21/192 (10%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +D S+ +++ L V+R V++G ++DR Sbjct: 984 ECGIPADLVFLIDGSNSISD-SDFTKMKNFLQDVVRPFDTG----HNVQVGIAQYSDRYR 1038 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + S + + I + E T I A+ + + N Sbjct: 1039 KEFSL-NMFSHKLELETQIGRIRQMEGLQTYIGAALDRVRNFFTPEG-----GSRVNENI 1092 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPN 426 ++ ++++TDG + + + +G+ I I Q LS A S + Sbjct: 1093 QQILLVITDGRSHDK---VVKAAEDLRKKGVDIYAIGVGRIDHLQ------LSQIAGSSD 1143 Query: 427 SFFEANSTHELN 438 + ++ EL Sbjct: 1144 RKYTVDNFSELK 1155 >gi|297286916|ref|XP_002808380.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-5(VI) chain-like [Macaca mulatta] Length = 2604 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/263 (13%), Positives = 79/263 (30%), Gaps = 29/263 (11%) Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 D + R G +L + + + Sbjct: 736 TDGVARDYVRDPARILRGKNVTIFSVGVYNANRSQLEEISGDGSLVFHVENFDHLKALER 795 Query: 254 SEEHFVDSSSLRHVIKKKHLV----------RDALASVIRSI----KKIDNVNDTVRMGA 299 V + I +V + +I KK D + V+ GA Sbjct: 796 KLVFRVCALHDCKRITGLDVVFVLDHSGSIKKQYQDHMINLTIHLVKKADVGRNGVQFGA 855 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++D+ + I++ + + GST A++ A E H Sbjct: 856 LKYSDQPNILFYL--NTYSNRSAIIENLRMRRDTGGSTYTAKALKHANALFT-----EEH 908 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + K+ ++++TDG++ ++ ++ + +GI I+ + + + L Sbjct: 909 GSRIKENVKQMLIVITDGKSHDHDQLNDT-ASELRDKGITILAVGVG------KANQKEL 961 Query: 420 SNCASP-NSFFEANSTHELNKIF 441 A N+ ++ +L +F Sbjct: 962 EGMAGNKNNTIYVDNFDKLKDVF 984 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 45/157 (28%), Gaps = 21/157 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 + KI D ++G F+D + R + + T A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDTTKEEFQL---NRYFTRQEISDAIDRMSLINKGTLTGKAL 711 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 H L AKK+++L+TDG + I + + + I ++ Sbjct: 712 NFVGQYFT-------HSKGARLGAKKFLILITDGVARDYVRDPARI---LRGKNVTIFSV 761 Query: 404 AFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNK 439 R L + S F + L Sbjct: 762 GVY------NANRSQLEEISGDGSLVFHVENFDHLKA 792 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 57/174 (32%), Gaps = 30/174 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +++ + + D + F+++ + R + T Sbjct: 474 KIEAAKESARATAEVLSPSD------LITVVAFDNQPTTIVRL---QRASNRMRIATDIA 524 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I A++ AY+ + +N K++++L+DG+ D + Sbjct: 525 RLQAGGGTNIYPALREAYEILQGANAKV-----------KHVIVLSDGQAPYDGI--ADL 571 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 C + +S I + + + R L+ + + L +IF Sbjct: 572 CQEMRSARITVSAVGIG------DADRNLLNLITDNGDGRLYMTDDLAALPRIF 619 >gi|125586596|gb|EAZ27260.1| hypothetical protein OsJ_11197 [Oryza sativa Japonica Group] Length = 540 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 51/200 (25%), Gaps = 34/200 (17%) Query: 271 KHLVRDALASVIRSIKKI-----DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + +K + D R+ F++ + R + Sbjct: 75 LDVSTSMAGNKLDRMKAALLFVIRKLADVDRLSIVTFSNDAARLCPLRFVAGDAARADLG 134 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G+T I ++ + + R N ++L++DG+ + + Sbjct: 135 ALVDGLAADGNTNIRAGLEIGLA-VAAGRRLTAGRAVN-------VMLMSDGQQNRGD-- 184 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFR 442 + G+ + T + L A F L F Sbjct: 185 ----ATRLDPGGVPVHTFGLGADHDPA-----VLQAIAGKSREGMFHYVADGVNLTAPFS 235 Query: 443 D-------RIGNEIFERVIR 455 I ++ V R Sbjct: 236 QLLGGLLTIIAQDLELTVTR 255 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 71/230 (30%), Gaps = 23/230 (10%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++Y G + +D S +K R+AL ++ + D Sbjct: 254 IENGYFVHYFAPEGLTTMPKNVVFVIDKSGSMSG-RKIQQTREALIKILDDLSPRD---- 308 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+ ++FA + +G T INDA+ A + S Sbjct: 309 --QFNLITFSSEATQWSPSLVPASAENVNKARSFAAAIHALGGTNINDAVLMAVQLLDRS 366 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDN---EEGIAIC--NKAKSQGIRIMT----IA 404 N +E ++ I+LLTDG+ T C + + + Sbjct: 367 NREERLPTRSVS----LIILLTDGDPTVGEGPASNSKTRCTGENVREAVSGQYSSLPGLG 422 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 F V+ + L +E + + + F + N + V Sbjct: 423 FDVSY---APEKLALDTGGLARRIYEDSDSAMQLQDFYQEVANPLLTAVT 469 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 31/70 (44%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G II A++MP+M+ +D+ +Y + L++ A A I + + + Sbjct: 11 RAGRGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQ 70 Query: 73 EEVSSRAKNS 82 + A + Sbjct: 71 SLATGNAVKN 80 >gi|29788808|gb|AAP03354.1| hypothetical protein [Oryza sativa Japonica Group] gi|108708692|gb|ABF96487.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 540 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 51/200 (25%), Gaps = 34/200 (17%) Query: 271 KHLVRDALASVIRSIKKI-----DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + +K + D R+ F++ + R + Sbjct: 75 LDVSTSMAGNKLDRMKAALLFVIRKLADVDRLSIVTFSNDAARLCPLRFVAGDAARADLG 134 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G+T I ++ + + R N ++L++DG+ + + Sbjct: 135 ALVDGLAADGNTNIRAGLEIGLA-VAAGRRLTAGRAVN-------VMLMSDGQQNRGD-- 184 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFR 442 + G+ + T + L A F L F Sbjct: 185 ----ATRLDPGGVPVHTFGLGADHDPA-----VLQAIAGKSREGMFHYVADGVNLTAPFS 235 Query: 443 D-------RIGNEIFERVIR 455 I ++ V R Sbjct: 236 QLLGGLLTIIAQDLELTVTR 255 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 47/158 (29%), Gaps = 21/158 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++G + + + + ++T T I A+ A + Sbjct: 128 DDKIGLIVYAGEAYTQLPITSDYVSA-KMFLETINPSMITTQGTDIKQAIDLAMKSFTP- 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 N + K I ++TDGE+ + + A +GI++ + + Sbjct: 186 ----------NQDVSKAIFVITDGEDNEGGVV--EMAKAAAEKGIKVYVLGVGSPQGAPI 233 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + N+ + + + + EI Sbjct: 234 PMP-------GSSQYITDNTGNVVVSKLNEAMCREIAA 264 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 58/172 (33%), Gaps = 17/172 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D +D V++G ++D + L ++ MG T+ A++ Sbjct: 266 DVRSDQVQVGLVQYSDNIYPAFPLKQ--SSLKSAVLDRIRNLPYSMGGTSTGSALEFIRA 323 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++ + + +VL+TDGE++ E + ++ K G+ + + ++ Sbjct: 324 NSLTEMSGSRAKDG----VPQIVVLVTDGESSD---EVQDVADQLKRDGVFVFVVGINI- 375 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L A+ F + + + + V R K Sbjct: 376 -----QDVQELQKIANEPFEEFLFTTENFSILQALSGTLLQALCSTVERQMK 422 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 21/162 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR G ++D++IS + + + + + G T A+ Sbjct: 882 GPDRVRFGVVQYSDKIISQFFLTQ--YASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVF 939 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ + +Y++++TDG++T + G+ I I Sbjct: 940 QNTARI---------DVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGV----- 982 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 ++ L AS FF L I ++ I + Sbjct: 983 -RDANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDICSSE 1022 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 48/169 (28%), Gaps = 17/169 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + D +++G F+ + + + + + T A+ Sbjct: 1056 MVNQSNIGADEIQIGLLQFSSNPQEEFRL--NRYSSKVDMCRAILSVQQMSDGTHTGKAL 1113 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +Y++++TDG +QDN + + I I I Sbjct: 1114 NFTL-------PFFDSSRGGRPRVHQYLIVITDGV-SQDNVAPP--AKALRDRNIIIFAI 1163 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L + F+ + L + ++ + + Sbjct: 1164 GVG-----NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 1207 >gi|167590268|ref|ZP_02382656.1| putative transmembrane protein [Burkholderia ubonensis Bu] Length = 377 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 27/314 (8%), Positives = 74/314 (23%), Gaps = 16/314 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + K G I AL M ++LG + +D+ + L+ + + Sbjct: 8 RSGKKQEGAVAITVALCMVILLGFAALAIDIGNLLIARNELQNS------------ADAA 55 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + T + F+ + V+ T+V+ + + Y Sbjct: 56 AMAGAGCLIRRTACSNTSASQPDWPTADATASAFSTSATTNQVQGTSVQTSTVATGYWNT 115 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + Y L P + + I+ + + + + + Sbjct: 116 TGTPYGLESLPFTPGANDLPAVQVTIRKDGSNANGAVPIFLGRIFGARILKASAVATAVL 175 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 P F + Y++ K++P + G Sbjct: 176 STPGSVGPGGLFPLAISQC--MYTNYWDSSSQSP-KIAPNSGVVPGFSWPNQIAGQPYIF 232 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + ++ + + T+ + + ++ + Sbjct: 233 QIGSSYHYGTCSSGQWTTFDVNDNSAGYAKTMLTNGNPNTLTIGASPGTW-IQTGTENTL 291 Query: 313 SWGVHKLIRTIVKT 326 Sbjct: 292 FNNTATCSAAGNGQ 305 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 58/172 (33%), Gaps = 17/172 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D +D V++G ++D + L ++ MG T+ A++ Sbjct: 266 DVRSDQVQVGLVQYSDNIYPAFPLKQ--SSLKSAVLDRIRNLPYSMGGTSTGSALEFIRA 323 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++ + + +VL+TDGE++ E + ++ K G+ + + ++ Sbjct: 324 NSLTEMSGSRAKDG----VPQIVVLVTDGESSD---EVQDVADQLKRDGVFVFVVGINI- 375 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L A+ F + + + + V R K Sbjct: 376 -----QDVQELQKIANEPFEEFLFTTENFSILQALSGTLLQALCSTVERQMK 422 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 63/191 (32%), Gaps = 31/191 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R A++ + D FND F+ + + ++ Sbjct: 111 KLDKSRQAVSQFFKLANPED------EFFLVQFNDSASLIQPFTRNLEE-----IQNHLA 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TA+ DA+ A + K +K ++L++DG + I Sbjct: 160 FTQSKGRTALLDAVYLALHEM-----------KKAKNPRKALLLISDGGDNSSRYTEPEI 208 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ------EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 N K ++I I + + L+ A + ++ ++ +EL + Sbjct: 209 KNLVKEADVQIYAIGIYESAAGRGRTPEESSGPALLTEIAEQTGGRQYQVDNLNELPDV- 267 Query: 442 RDRIGNEIFER 452 +IG E+ + Sbjct: 268 AAKIGVELRNQ 278 >gi|297471458|ref|XP_002685231.1| PREDICTED: collagen type VI alpha 4-like [Bos taurus] gi|296490811|gb|DAA32924.1| collagen type VI alpha 4-like [Bos taurus] Length = 780 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 51/168 (30%), Gaps = 21/168 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V+ G ++D V + S + + G T + +A+ + Sbjct: 202 GPDRVQFGVVQYSDEVSPQFTLSQHSSVAGLEVAVDSIQ--QKGGGTKMGEALGSMIQVF 259 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S V Y++++TDG++ + G+ I + Sbjct: 260 ADSARSNVP---------WYLIVVTDGQSMDP---VADAAEALRGHGVTIYAVGV----- 302 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ L A + F + L I ++ + + + K Sbjct: 303 -RDANIAELQEIA-EDRMFFVHDFESLKTIQQEVVQDICSLETCKNRK 348 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 36/123 (29%), Gaps = 12/123 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + D +++G F+ + + + + + + G T A+ Sbjct: 376 MVNQSNIGADEIQIGLLQFSSDPQEEFRL--NRYSSKVDVHRAISDVKQINGGTYTGKAL 433 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +Y++++TDG + + + + I I I Sbjct: 434 NFTL-------PFFGSSRGGRPSVHQYLIVVTDGVSRDNVALP---AKALRDRNIIIFAI 483 Query: 404 AFS 406 Sbjct: 484 GVG 486 >gi|220913381|ref|YP_002488690.1| hypothetical protein Achl_2636 [Arthrobacter chlorophenolicus A6] gi|219860259|gb|ACL40601.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 310 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 12/105 (11%), Positives = 28/105 (26%) Query: 30 MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 M V++G G + VD + L+ A + + + + + A + Sbjct: 1 MVVLIGAGALAVDTGQIYAERAQLQNGADASALAGADLCSANGGCTQAAATSVADALANS 60 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ V +D + L+ Sbjct: 61 NSNDSKSTVQSVDLSVPGQVTVTTSTKDGTTNAGFLTKMFASALN 105 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 31/70 (44%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G II A++MP+M+ +D+ +Y + L++ A A I + + + Sbjct: 17 RAGRGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQ 76 Query: 73 EEVSSRAKNS 82 + A + Sbjct: 77 SLATGNAVKN 86 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 60/221 (27%), Gaps = 38/221 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVQPNRMAAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNRDST 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + K D TA + + TA I + + IVL +DG+ Sbjct: 162 KAALDKLQFADR-----TATGEGIFTALQAIATVG---AVIGGGDKPPPARIVLFSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T AK QG+ I TI+F E L A Sbjct: 214 TMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQL 273 Query: 424 SPNSFFEANSTHELNKIF--------RDRIGNEIFERVIRI 456 S + + A S EL ++ + I + +R+ Sbjct: 274 SGGNAYNAASLQELKAVYATLQQQIGYETIKGDASVGWVRL 314 >gi|170744040|ref|YP_001772695.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168198314|gb|ACA20261.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 654 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 31/293 (10%), Positives = 80/293 (27%), Gaps = 23/293 (7%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 T+ + I + + +N F + Sbjct: 180 TREAPVSTVSLGVDTASYGIVRDALNRNHLPPPAAVRTEELINYFPYAYPAPASPDAPFR 239 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + + + + P + VD+S + LV+ +LA Sbjct: 240 VTASVFPSPWAEGRKLLHIGIRGYAVAPAERPPANLVFLVDTSGSMAAPNRLPLVKQSLA 299 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ ++ D R+ + V + + + GSTA Sbjct: 300 MLLTTLDARD------RVALVAYAGEVGTVLEPTPAGEAGR---ILAAIETLQAHGSTAG 350 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNEEGIAICNKAKSQ 396 + ++ AY ++ + + ++L TDG+ +E + + + Sbjct: 351 GEGIRQAYAL--AARHFDPKAVNR-------VILATDGDFNVGITGRDELTGFVARERRK 401 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 GI + + F + + + ++ E K+ + + + Sbjct: 402 GIFLSVLGFGMGNLNDALMQALAKD--GNGVAAHIDTAQEARKVLVEEATSTL 452 >gi|260061450|ref|YP_003194530.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] gi|88785582|gb|EAR16751.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] Length = 348 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 46/122 (37%), Gaps = 13/122 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + +++ D TAI++A++ A Sbjct: 129 DRVGIIAYAAQAFPQLPITTDY-GAAKMFLQSMNTDMLSSQGTAIHEAIELAATYFDDEE 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + L++DGE+ +++ AI +A QGIRI TI + Sbjct: 188 Q-----------TNRILFLVSDGEDHAEDQVMDAI-EQATDQGIRIFTIGVGSARGAPIP 235 Query: 415 AR 416 + Sbjct: 236 IK 237 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 56/182 (30%), Gaps = 22/182 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + T Sbjct: 195 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMEGAIRALVPLAQGTM 252 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 253 TGLAIQYAMNVAFSVAEGARPPEARVP---RVAVIVTDG---RPQDRVAEVAAQARARGI 306 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + L ASP F S + G + R+ Sbjct: 307 EIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLI-----QEFGQQFQGRLCP 355 Query: 456 IT 457 I Sbjct: 356 IN 357 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 51/156 (32%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 501 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 555 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG + D Sbjct: 556 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGRSQDDVSV---WA 609 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 610 ARAKEEGIVMYAVGVG------KAVEEELREIASEP 639 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 56/177 (31%), Gaps = 30/177 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST------------AINDA 342 VR+G F + + + I + +GS I+ Sbjct: 126 VRVGVVSFAGTAAVVQAPTTSRDDVFAAIERFQLQRGTAIGSGIVLSLATIFPDAGIDIQ 185 Query: 343 MQTAYDTI-----ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 T T+ + E + A ++LLTDG+ T + I A +G Sbjct: 186 QITGQRTMPRMLGDPEKKAEFTPVPPGSYASAAMILLTDGQRTTG-PDPIDAAKMAADRG 244 Query: 398 IRIMTIAFSVNKTQ----------QEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 IR+ T+ + + L A + +F A + +L K+++ Sbjct: 245 IRVYTVGVGTTQGEIIGFEGWSMRVRLDEDTLRQIAQMTTGEYFYAGTAEDLKKVYQ 301 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 54/164 (32%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ + +D + R+G ++ +V S G + + T Sbjct: 70 QFLVGLLHSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSNREDMERAIRAVVPLAQGTM 127 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + + +V++TDG + + + +A+++GI Sbjct: 128 TGLAIQYAMNVAFSEAEGARP---SEERVPRVLVIVTDG---RPQDRVAEVAAQARARGI 181 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + Sbjct: 182 EIYAVGVQRADVGS------LRAMASPPLDQHVFLVESFDLIQE 219 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 17/157 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 417 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 471 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 472 EYMERGTMTGLALRHMVEHSFSEVQGARPRDLNVP---RVGLVFTDGR-SQDDISV--WA 525 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +AK +GI + + + L AS S Sbjct: 526 ARAKEEGIVMYAVGVG------KAVEEELREIASEPS 556 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 54/164 (32%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ + +D + R+G ++ +V S G + + T Sbjct: 57 QFLVGLLHSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSNREDMERAIRAVVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + + +V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSEAEGARP---SEERVPRVLVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + Sbjct: 169 EIYAVGVQRADVGS------LRAMASPPLDQHVFLVESFDLIQE 206 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 17/157 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 404 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 458 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 459 EYMERGTMTGLALRHMVEHSFSEVQGARPRDLNVP---RVGLVFTDGR-SQDDISV--WA 512 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +AK +GI + + + L AS S Sbjct: 513 ARAKEEGIVMYAVGVG------KAVEEELREIASEPS 543 >gi|326504464|dbj|BAJ91064.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 720 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 22/199 (11%), Positives = 57/199 (28%), Gaps = 27/199 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ A+ VI + +D R+ F++ Sbjct: 72 DVSKSMRKNDRLGKMKTAMHFVINKLGHMD------RLSIVKFSEEAERLCPLLSVTPP- 124 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDG 377 + + + T I D ++ + + + I LL+DG Sbjct: 125 NKARLNHLVDGLQVIDPTNIRDGLEAGLSVL----------AGRRITGGRVASIFLLSDG 174 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + + + + + + T F + + + +F + + Sbjct: 175 DENRGHATTVDV------SDVPVYTFGFGTDYDPKVLDEIARRS--KGGTFNFVDDEENM 226 Query: 438 NKIFRDRIGNEIFERVIRI 456 + F +G + V + Sbjct: 227 TEPFSQILGGLLSIVVQDL 245 >gi|1353722|gb|AAB01767.1| unknown [Naegleria fowleri] Length = 357 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 38/138 (27%), Gaps = 19/138 (13%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + K A + + T ++ + I I+L TDG Sbjct: 2 KQKAKQVAKNIHAGTCTNLSGGLFEGLRLIKQRTTCNEITS---------ILLFTDGLAN 52 Query: 381 QDNEEGIAICNKAK-------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + I +K + I T F + A ++ N+ Sbjct: 53 EGITNTSEIVSKMNTTIHEEIRKQITCFTFGFGSDTDANMLTSIAQ---AGNGLYYFLNN 109 Query: 434 THELNKIFRDRIGNEIFE 451 ++ K F + IG + Sbjct: 110 VDDIPKAFGNVIGGLVSV 127 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 42/412 (10%), Positives = 107/412 (25%), Gaps = 41/412 (9%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN---FTDREVRDIVRDTAVE 121 + L ++ + + I R F + ++++ + D + + + Sbjct: 209 TSGLNATVGTYFAGTGMASDLTVDDINSSQTRAFVSAVERSIVHYGDITMTFLANLRRAD 268 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI-------KSWLIQTKAEAETVSRSYHKE 174 + Y ++ +++ + + E S H Sbjct: 269 AEGQGLTYVSAVAVEEMSVVHYNQGNPSGDPATAEDAEPPNVPLVAVYPKEGTLMSDHPY 328 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + + + F A R + P + Sbjct: 329 VVLDWADEPKRAAAAAFLDYLLSDEAQQVFLDNAFRGHDGRTGDLLTEENGVLPAEPVEL 388 Query: 235 SCNKSLYYML---------YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 S + P + + + S + L ++A + + Sbjct: 389 SLPSANVLNAMLENWAELRKPANVLLVIDTSGSMQESVPGTGSTRLELAKEAAITSLDEF 448 Query: 286 KKIDNVN-DTVRMGATFFNDRVISDPSFSWGVHKLI----RTIVKTFAIDENEMGSTAIN 340 D V + R + + G T + Sbjct: 449 SDSDRVGLWMFSTDLEDNGQDWRELVPLGPLGASVNGTPRREELAERISNLPPGGGTGLY 508 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAK----S 395 D A+ + + + +V LTDG+N + + + Sbjct: 509 DTALAAHTLVAEHSRPDAINA---------VVFLTDGKNEDLNGISLEKLLDSITPEPGQ 559 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 QG+RI TI++ + + + A+ + ++A+ ++++F I N Sbjct: 560 QGVRIFTISYGEDADLKTMTQIAE---ATNAAAYDASDPQSIDEVFEAVISN 608 >gi|301784735|ref|XP_002927783.1| PREDICTED: collagen alpha-6(VI) chain-like [Ailuropoda melanoleuca] Length = 2267 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 57/169 (33%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +KK D + VR GA + D ++ D+ G+T Sbjct: 829 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYLGDL--GSKWEVISVLQKDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + E + + ++++TDGE+ ++ + +GI Sbjct: 887 TAEALGFSDHMFT-----EARGSRLQKGVPQVLIVITDGESHDADKLNAT-AKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + +F + L IF D + Sbjct: 941 LVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSAS 984 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 44/353 (12%), Positives = 92/353 (26%), Gaps = 26/353 (7%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ N R V ++ S ++ + S S Sbjct: 269 LVTYSNETKVINSLSRGVNKSEVLQNIQNLSPWSGKAYTGAAIRKIRKEVFSARNGSRKN 328 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + S + V+++ +M K Sbjct: 329 QGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKIASHPAEQHVSK 388 Query: 214 SYSSQNGKVGIRDE-----KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + + + VS L G +D ++ + + S Sbjct: 389 LKTFSDLAAHNQTFLKKLRNQIMHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQA 448 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 H ++ L+ V+ VR+GA + D F + + K Sbjct: 449 TDFHEMKTFLSEVVGMFNIAPQK---VRVGAVQYADSWD--LEFEINKYTNKHDLGKAIE 503 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G+T A+ + + + + ++V+LT+G + E Sbjct: 504 NIRQMGGNTNTGAALNFTLGLLQKAKK------QRGNRVPCHLVVLTNGMSKDSILEP-- 555 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 N+ + + IR+ I +E + L A + + L I Sbjct: 556 -ANRLREELIRVYAIGV------KEANQTQLREIAGEEKRVYYVHDFDALKDI 601 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 53/170 (31%), Gaps = 15/170 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R + ++ D + VR+GA F+ + G + I Sbjct: 1012 PDDFRKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPL--GTFVGKKEISFQIENI 1069 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T I A++ + N + +++LTDG++ + Sbjct: 1070 QQIFGYTHIGAALRQVGHYFRP-----DMGSRINAGTPQVLLVLTDGQSQDEVAR---AA 1121 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + +G+ I ++ + + + ++ EL K+ Sbjct: 1122 EDLRHKGVDIYSVGIGDVDD-----QQLIQITGTAGKKLTVHNFDELTKV 1166 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 49/150 (32%), Gaps = 21/150 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V++G F+D + + + + +G T + + Sbjct: 654 GADRVQIGVVQFSDVNKEEFQL---NRYMSQNEISNAIDRMTHIGETTLTG------SAL 704 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNK 409 ++ +++++L+TDGE ++ + +GI I ++ F N Sbjct: 705 TFVSQYFSPAKGARPNVRRFLILITDGEAQDIVKDP---AVALRQEGIIIYSVGVFGSNV 761 Query: 410 TQQEKARYFLSNCAS-PNSFFEANSTHELN 438 TQ L + P F + L Sbjct: 762 TQ-------LEEISGRPEMVFYVENFDILQ 784 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 43/143 (30%), Gaps = 15/143 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S+ +D +R+G +++ S S + ++ V + + A A Sbjct: 254 SVSALDIKEHCMRVGLVTYSNETKVINSLS---RGVNKSEVLQNIQNLSPWSGKAYTGA- 309 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A I + + N + VL+T + + +G+ I T+ Sbjct: 310 --AIRKIRKEVFSARNGSRKNQGVPQIAVLVTH---RPSEDNVTKAAVNLRREGVTIFTM 364 Query: 404 AFSVNKTQQEKARYFLSNCASPN 426 Q L AS Sbjct: 365 GIEGASDSQ------LEKIASHP 381 >gi|281345782|gb|EFB21366.1| hypothetical protein PANDA_017603 [Ailuropoda melanoleuca] Length = 2245 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 57/169 (33%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ +KK D + VR GA + D ++ D+ G+T Sbjct: 809 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYLGDL--GSKWEVISVLQKDQPMGGNTY 866 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + E + + ++++TDGE+ ++ + +GI Sbjct: 867 TAEALGFSDHMFT-----EARGSRLQKGVPQVLIVITDGESHDADKLNAT-AKALRDKGI 920 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + +F + L IF D + Sbjct: 921 LVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSAS 964 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 44/353 (12%), Positives = 92/353 (26%), Gaps = 26/353 (7%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ N R V ++ S ++ + S S Sbjct: 249 LVTYSNETKVINSLSRGVNKSEVLQNIQNLSPWSGKAYTGAAIRKIRKEVFSARNGSRKN 308 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + S + V+++ +M K Sbjct: 309 QGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKIASHPAEQHVSK 368 Query: 214 SYSSQNGKVGIRDE-----KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + + + VS L G +D ++ + + S Sbjct: 369 LKTFSDLAAHNQTFLKKLRNQIMHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQA 428 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 H ++ L+ V+ VR+GA + D F + + K Sbjct: 429 TDFHEMKTFLSEVVGMFNIAPQK---VRVGAVQYADSWD--LEFEINKYTNKHDLGKAIE 483 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G+T A+ + + + + ++V+LT+G + E Sbjct: 484 NIRQMGGNTNTGAALNFTLGLLQKAKK------QRGNRVPCHLVVLTNGMSKDSILEP-- 535 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 N+ + + IR+ I +E + L A + + L I Sbjct: 536 -ANRLREELIRVYAIGV------KEANQTQLREIAGEEKRVYYVHDFDALKDI 581 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 53/170 (31%), Gaps = 15/170 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 R + ++ D + VR+GA F+ + G + I Sbjct: 992 PDDFRKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPL--GTFVGKKEISFQIENI 1049 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T I A++ + N + +++LTDG++ + Sbjct: 1050 QQIFGYTHIGAALRQVGHYFRP-----DMGSRINAGTPQVLLVLTDGQSQDEVAR---AA 1101 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + +G+ I ++ + + + ++ EL K+ Sbjct: 1102 EDLRHKGVDIYSVGIGDVDD-----QQLIQITGTAGKKLTVHNFDELTKV 1146 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 49/150 (32%), Gaps = 21/150 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V++G F+D + + + + +G T + + Sbjct: 634 GADRVQIGVVQFSDVNKEEFQL---NRYMSQNEISNAIDRMTHIGETTLTG------SAL 684 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA-FSVNK 409 ++ +++++L+TDGE ++ + +GI I ++ F N Sbjct: 685 TFVSQYFSPAKGARPNVRRFLILITDGEAQDIVKDP---AVALRQEGIIIYSVGVFGSNV 741 Query: 410 TQQEKARYFLSNCAS-PNSFFEANSTHELN 438 TQ L + P F + L Sbjct: 742 TQ-------LEEISGRPEMVFYVENFDILQ 764 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 43/143 (30%), Gaps = 15/143 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S+ +D +R+G +++ S S + ++ V + + A A Sbjct: 234 SVSALDIKEHCMRVGLVTYSNETKVINSLS---RGVNKSEVLQNIQNLSPWSGKAYTGA- 289 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A I + + N + VL+T + + +G+ I T+ Sbjct: 290 --AIRKIRKEVFSARNGSRKNQGVPQIAVLVTH---RPSEDNVTKAAVNLRREGVTIFTM 344 Query: 404 AFSVNKTQQEKARYFLSNCASPN 426 Q L AS Sbjct: 345 GIEGASDSQ------LEKIASHP 361 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 73/256 (28%), Gaps = 14/256 (5%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 S + + +V + I K S + + + +Y L Sbjct: 21 PSAAATSSLSTGSTGVEKLTFSVDVFVVDKNGRLITGLKPSNFKIVNTVTNFYELIDVST 80 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +S+ + L + S ++N+ G T F S Sbjct: 81 SNLISKPGGYSAMLLLDQTGSISTTDPYNLRIEASKIFLNNLGTDDYTGLTSFTSSYTSV 140 Query: 310 PSFSWGVHKLIRTIVKT-FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + K+ + N G T + S+ + + + A Sbjct: 141 VKLHSGFTNKTEQMKKSLDTLALNVSGGTPLY----------TSTIQSVTYTAQKGPTAN 190 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 K +++ TDGEN KA Q I + T+ S + A+ + +F Sbjct: 191 KAVIVFTDGENNVTTNTLEDATAKAIQQKIPLFTVGLSTDVNVNVLAQMANE---TGGAF 247 Query: 429 FEANSTHELNKIFRDR 444 F A +L F Sbjct: 248 FYAKDAGQLISTFGTL 263 >gi|110556625|ref|NP_997091.3| calcium-activated chloride channel regulator 4 [Mus musculus] gi|148680067|gb|EDL12014.1| mCG119588 [Mus musculus] gi|148922513|gb|AAI46305.1| Chloride channel calcium activated 6 [synthetic construct] gi|151556758|gb|AAI48748.1| Chloride channel calcium activated 6 [synthetic construct] Length = 925 Score = 56.5 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 55/194 (28%), Gaps = 28/194 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I + + +G F+ + + Sbjct: 314 DVSGSMTSYDRLNRMNQAAKYFLSQIIE-----NRSWVGMVHFSSQATIVHELIQINSDI 368 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + G T+I ++ A+ + I+LL+DGE+ Sbjct: 369 ERNQLLQTLPT-SANGGTSICSGIKAAFQVFKNGEYQTDGTE---------ILLLSDGED 418 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + + C ++ K G + IA + L + + A + Sbjct: 419 STAKD-----CIDEVKDSGSIVHFIALGPSADLAVTNMSIL----TGGNHKLATDEAQNN 469 Query: 437 -LNKIFRDRIGNEI 449 L F Sbjct: 470 GLIDAFGALASENA 483 >gi|327193756|gb|EGE60633.1| hypothetical protein RHECNPAF_136001 [Rhizobium etli CNPAF512] Length = 433 Score = 56.5 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 35/264 (13%), Positives = 77/264 (29%), Gaps = 23/264 (8%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 M D Q + +N F + V + + + ++ Sbjct: 1 MPDPQSVRVEEMINYFPYDWPGPETADQPFKATVTVMPTPWNHDTKLMHVAIKGYDIAPA 60 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 P + +D S K L++ A ++ +K D V+ + + Sbjct: 61 TAPHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVNRLKADDTVS------IVTYAGNAGT 114 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + ++ + + GST + ++ AYD + Sbjct: 115 VLEPT---RVAEKSKILSAIDKLEAGGSTGGAEGIEAAYDL--------AKKAFVKDGVN 163 Query: 369 KYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + + L TDG+ +E+ I + + GI + + F + N Sbjct: 164 RVM-LATDGDFNVGPSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQN--GN 220 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 S ++ E K + G+ + Sbjct: 221 GSAAYIDTLAEAQKTLVEEAGSTL 244 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 56.5 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 73/261 (27%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F S+ + N + + S + + + ++ Sbjct: 237 SIMFMPSLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPT 296 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + + A + I+ I+ + +G Sbjct: 297 HPTFSLLKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYL--IQVIEKGS---LVGMVT 351 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + + G T+I ++ + II S++ Sbjct: 352 FDSVAEIQNHLTRITDDNVYQKITAKLPQV-ANGGTSICRGLKAGFQAIIHSDQSTSGSE 410 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ + N C K G I TIA K LS Sbjct: 411 ---------IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALG---PSAAKELETLS 453 Query: 421 NCASPNSFFEANSTHELNKIF 441 N FF L F Sbjct: 454 NMTGGYRFFANKDITGLTNAF 474 >gi|327543487|gb|EGF29906.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 464 Score = 56.5 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 45/170 (26%), Gaps = 24/170 (14%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ID ++D + ++ V + R+ +K STA+ + Sbjct: 93 IDRLSDDDIVSVVLYDSNVTVLVPAT---KATDRSSIKQKIRGIQAGSSTALFAGVSKGA 149 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC----NKAKSQGIRIMTI 403 + R E ++LL+DG + + + K I + T+ Sbjct: 150 AEV---------RKFLADEQVNRVILLSDGLANVGPKSPQELEGLGRSLMKEA-ISVSTL 199 Query: 404 AFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFE 451 + AS + L +F + Sbjct: 200 GLGSGYN-----EDLMVALASVGGGNHAFIEDADSLVAVFNQEFDGLLSV 244 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 56.5 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 61/221 (27%), Gaps = 38/221 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVQPNRMAAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNRDST 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + K D TA + + TA I + + IVL +DG+ Sbjct: 162 KRALDKLQFADR-----TATGEGIFTALQAIATVG---AVIGGGDAPPPARIVLFSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T AK QG+ I TI+F E L A Sbjct: 214 TMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQL 273 Query: 424 SPNSFFEANSTHELNKIF--------RDRIGNEIFERVIRI 456 S + + A + EL ++ + I + +R+ Sbjct: 274 SGGNAYNAATLQELKSVYATLQQQIGYETIKGDASVGWVRL 314 >gi|290986713|ref|XP_002676068.1| predicted protein [Naegleria gruberi] gi|284089668|gb|EFC43324.1| predicted protein [Naegleria gruberi] Length = 413 Score = 56.5 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 60/175 (34%), Gaps = 24/175 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 IDN+ D R+G F+D+ + + V L + +K + E GST M+ Sbjct: 82 IDNLKDFERLGIVLFDDKAETFLPLTI-VQDLEKKSLKERVMKITEKGSTNFEAGMKRGI 140 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-----GIRIMT 402 D + + ++ I+ LTD + + + K GI Sbjct: 141 DLFSTMDSSDLSNSNR-------IIYLTDACPNVGGTDSLDVLT--KDANSGPYGILSTF 191 Query: 403 IAFSVNKTQQEKAR-YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 I ++ + + C ++F S+ E F+ + + V I Sbjct: 192 IGIGLDFNSEIVEELTKVRGC----NYFSVKSSEE----FKKILNEDFNYIVTPI 238 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 50/144 (34%), Gaps = 19/144 (13%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + T + + G T+I ++ + I SN+ Sbjct: 347 LVTFDSFAKIQSKLIKIIDD-NTYQKITANLPQEADGGTSICRGLKAGFQAISQSNQSTF 405 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARY 417 I+LLTDGE+ Q I++C + K G I TIA ++ Sbjct: 406 GSE---------IILLTDGEDYQ-----ISLCFGEVKQSGTVIHTIALG---PSADEELE 448 Query: 418 FLSNCASPNSFFEANSTHELNKIF 441 LSN + F+ + + L F Sbjct: 449 TLSNMTGGHRFYAHKNINGLIDAF 472 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/340 (11%), Positives = 102/340 (30%), Gaps = 34/340 (10%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + L + + + + S + + Q Sbjct: 627 DPVSTVLEYVPLGYADGTTGANGAFVMDGHTMATCPISHQRGWYYVNSASKDVYAQNEPY 686 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI-- 268 T K Y + P + + Y ++ ++ Sbjct: 687 TAKEYRGAQDGYYAGETHWLPNPGNSSAKFYIDNSGTLKCRFQFVQNSCSPVYMKADEAS 746 Query: 269 KKKHLVRDALASVIRSIKKIDNVND--TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K ++++++A ++ + + R F++ + +++ ++ + + Sbjct: 747 TKSEVLQNSIARFAATLGYLSPGSQIAMTRFSVDTFSNAECALLNWTNDTGEVTAAMNQE 806 Query: 327 FAIDENEMG------------------STAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + E G ST +++ + + + Sbjct: 807 YGNPLAEGGRANQTLDGLRVYNYGITGSTHTYRGIESYIENMTNGASGGYVPNAPQGNNS 866 Query: 369 KYIVLLTDGENTQDNEE-GIAICNKAKSQGIRIMTI-----AFSVNKTQQEKA--RYFLS 420 +Y+++ TDG++ N + + + K+ G I+T+ + + + S Sbjct: 867 RYLIIFTDGKDNSGNLQKSMDDTDALKNNGYTIITVLMQSAGMTSEDVEHSTTFLKRLAS 926 Query: 421 NCASPNSFFEA---NSTHELNKIFRDRIGNEIFERVIRIT 457 + AS +F N L K+F+D I +EI + + T Sbjct: 927 SNASGEKYFYTAMYNDPEGLVKVFQD-IAHEIAKPLQGYT 965 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL +V D G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VVSVLQNDHPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE+ D E+ + +GI Sbjct: 887 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGESN-DAEKLNTTAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 984 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGSTQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYSNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G Sbjct: 497 DLGKAIENIRQMGGNTNTGAALNFTLKLLQ------RAKKERGSKVPCHLVVLTNGM--- 547 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + + +K + + IR+ I +E + L A + + L I Sbjct: 548 SRDSVLGPAHKLREENIRVHAIGV------KEANQTQLREIAGEEKRVYYVHEFDALRNI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D + +R+G +++ S S G +T V D + A Sbjct: 251 LGESISALDIKENCMRVGLVTYSNETRVISSLSTG---NNKTEVLQRIQDLSPQVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T + + +G+ I Sbjct: 308 GA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVTI 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN 426 T+ L AS Sbjct: 362 FTMGIEG------ANPDELEKIASHP 381 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 1012 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 1071 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 1072 IF--GYTHIGDALRKVKYYFQP-----DMGSRINAGTPQVLLVLTDG---RSQDEVAQAA 1121 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 1122 EELRHKGVDIYSVGIGDVDDQE 1143 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQSDIANAIDRMTHIGETTLTG------SALTFVSQYFSPDKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVRDPAI---ALRKEGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 55/170 (32%), Gaps = 26/170 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + AL + ++ D G T F+D+ S Sbjct: 388 DVSGSMEDEGRLEQAKAALRIFVEQLQDDD------GFGLTIFSDQATVLTPISPI--GS 439 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 RT V G T + D + AY + ++ + + +V+LTDG + Sbjct: 440 RRTEVLNRIAGLTPRGGTRLLDTVVEAYQELTATPPGQRI---------RAVVVLTDGLD 490 Query: 380 TQDNEEGIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASP 425 + + + + I++ TIAF + L AS Sbjct: 491 NRSQRSAEDVLDLLRQDREGYSIKVFTIAFGGDADVH-----LLKEIASA 535 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 73/261 (27%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F S+ + N + + S + + + ++ Sbjct: 237 SIMFMPSLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPT 296 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + + A + I+ I+ + +G Sbjct: 297 HPTFSLLKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYL--IQVIEKGS---LVGMVT 351 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + + G T+I ++ + II S++ Sbjct: 352 FDSVAEIQNHLTRITDDNVYQKITAKLPQV-ANGGTSICRGLKAGFQAIIHSDQSTSGSE 410 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ + N C K G I TIA K LS Sbjct: 411 ---------IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALG---PSAAKELETLS 453 Query: 421 NCASPNSFFEANSTHELNKIF 441 N FF L F Sbjct: 454 NMTGGYRFFANKDITGLTNAF 474 >gi|307719357|ref|YP_003874889.1| batA protein [Spirochaeta thermophila DSM 6192] gi|306533082|gb|ADN02616.1| putative batA protein [Spirochaeta thermophila DSM 6192] Length = 332 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 51/177 (28%), Gaps = 34/177 (19%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + + V + + TA+ + T+ + N Sbjct: 134 LVLFGKEALLEVPPTIDVEYFLERLEAVRLFSL--GDGTALGMGVGTSLLHLSRVNASF- 190 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN---------- 408 + +V+LTDG+NT A+ GI + T+ + Sbjct: 191 ----------RAVVILTDGKNTTGEILPETAAEMARELGIPLFTVGVGSDRPVSLDVIDP 240 Query: 409 --------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 ++ L A S FF + L++IF IG V R Sbjct: 241 STGTRYAGVLEEGYDEETLRRIAEISGGQFFSGYTPTSLHRIF-QYIGATATADVRR 296 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 73/261 (27%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F S+ + N + + S + + + ++ Sbjct: 237 SIMFMPSLHSVTEFCTEKTHNTEAPNLQNKMCNGKSTWDVIMNSVDFQNTSPMTEMNPPT 296 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + + A + I+ I+ + +G Sbjct: 297 HPTFSLLKSKQRVVCLVLDKSGSMSAEDRLFQMNQAAELYL--IQVIEKGS---LVGMVT 351 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + + G T+I ++ + II S++ Sbjct: 352 FDSVAEIQNHLTRITDDNVYQKITAKLPQV-ANGGTSICRGLKAGFQAIIHSDQSTSGSE 410 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ + N C K G I TIA K LS Sbjct: 411 ---------IILLTDGEDNEINS-----CFEDVKRSGAIIHTIALG---PSAAKELETLS 453 Query: 421 NCASPNSFFEANSTHELNKIF 441 N FF L F Sbjct: 454 NMTGGYRFFANKDITGLTNAF 474 >gi|254464606|ref|ZP_05078017.1| TadE-like protein [Rhodobacterales bacterium Y4I] gi|206685514|gb|EDZ45996.1| TadE-like protein [Rhodobacterales bacterium Y4I] Length = 178 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 6/59 (10%), Positives = 23/59 (38%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + ++ + G+ + A+++P + + V++ + L++A + Sbjct: 1 MIRRAAHALRRFRQQQDGNATVEFAIVIPAFIMILMSTVELGMINLRHSQLERALDQTV 59 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 39/279 (13%), Positives = 85/279 (30%), Gaps = 27/279 (9%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 +S+ V + M + N F S GI S + + Sbjct: 78 ISLNNVSERRDKMRLGMTLATNPKDNSFVACGPLWSYECGSSYYSTGICSRVDSNFNFTH 137 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + Y +D + + + + +I ++K ++ Sbjct: 138 SIAPAYQRCETYMDIVIVLDGSNS----------IYPWYEVQDFLINVLQKFYIGPGQIQ 187 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + +RV+++ + + ID+ T + A Sbjct: 188 VGVVQYGERVVNEFRL-DDFRTVDEVVAAAKNIDQRGGEETRTALGINVARTQ------- 239 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + +AKK ++++TDGE + D+ + A +++ I + IA ++ Sbjct: 240 -AFKHGGRPDAKKVMIVITDGE-SHDSPDLKAAVEESEKDNITLYGIAVLGYYNRRGINP 297 Query: 417 YFL----SNCASPN--SFFEANSTHELNKIFRDRIGNEI 449 A+ FF L D +G +I Sbjct: 298 EAFLREIKFIATDPDEHFFSVTDESALKD-IVDALGEKI 335 >gi|301604858|ref|XP_002932077.1| PREDICTED: collagen alpha-1(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 1025 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 68/199 (34%), Gaps = 30/199 (15%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIR 321 + V+D I + D + A ++D VI S + + L Sbjct: 82 KPFKTLVTQVKDFTKKFIDKLTSRYYRCDRNLVWNAGALHYSDEVIMISSLTRDMKTLRD 141 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + I T + A++ + ++ ++ + KY++++TDG + Sbjct: 142 DVETVEYI----GKGTHTDCAIKRGIEEVL--------IGGSHQKENKYLIVVTDGHPLE 189 Query: 382 DNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANST 434 +E N+AK GI++ ++A LS AS S F A S Sbjct: 190 GYKEPCGGLEDAANEAKHLGIKVFSVAI-----SPNHLEPRLSVIASDASHRRNFTATSA 244 Query: 435 HELNKIFRDRIGNEIFERV 453 L + I N I + Sbjct: 245 AGLTD---EEIDNTIDTII 260 Score = 43.4 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 47/145 (32%), Gaps = 18/145 (12%) Query: 319 LIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 T+++ + G+T + A++ + +R + K +++ +DG Sbjct: 891 SNYTVLEVPVDNMQFINGATNVVSALRAVTEL---------YREDSLAGVSKKLLVFSDG 941 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS------PNSFFEA 431 NTQD ++ + A++ GI I +A + A F Sbjct: 942 -NTQDEKDMLKAVQDARAAGIEIYVLAVGSRLNYPNLQVMLTGSAADITAPFPEERLFRV 1000 Query: 432 NS-THELNKIFRDRIGNEIFERVIR 455 L + I I + + Sbjct: 1001 PDYPSLLQGVRYQSISRRISLKSSQ 1025 >gi|233142080|gb|ACQ91095.1| complement factor B-like protein [Ruditapes decussatus] Length = 697 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 52/178 (29%), Gaps = 17/178 (9%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + + + R F++ + + G + + + +N G T Sbjct: 224 DIFGVSNETSGGKNGTRFALLTFSNEADIVFNLNDGTARSKEEVKRRIDEIQNTGGGTNF 283 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ---DNEEG--IAICNKAK 394 A+ I + A + + LLTD E T + N K Sbjct: 284 RAALLKVVGGIF--FNVIKKESQRLNHATRAVFLLTDAEETSTLEKDRLPRIRQAANDLK 341 Query: 395 SQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNE 448 ++G I I Q L+ AS F + +L K+ D I + Sbjct: 342 NEGHFEIFCIGVG-----QNIDETTLAEIASTPHIEHVFTLSKFDDLEKV-GDIIAEK 393 >gi|114321541|ref|YP_743224.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] gi|114227935|gb|ABI57734.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] Length = 972 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 42/155 (27%), Gaps = 10/155 (6%) Query: 292 NDTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 MG F+ + +K G TA+ D + D Sbjct: 362 PGRTAMGVVRFSASASVVHPMIAIPDPGTAEKDQLKDAIDSLPASGLTAMFDGLILGLDE 421 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + N+ +A + LL+DG + + + I IAF Sbjct: 422 LQD------YSAANDTDAGQVAFLLSDGGDNSSAATEPQTVQAYQDANVPI--IAFGYGS 473 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 L++ F + E+ + F Sbjct: 474 FAPTGVLRRLADNTGGEFFASPTTLAEIQEAFLAA 508 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 63/203 (31%), Gaps = 30/203 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + + + +G F+ + + Sbjct: 311 DKSGSMSGFNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTANIKSNLIQIISSK 365 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I M++A+ I IVLLTDGE+ Sbjct: 366 ERNNLLESLPK-AANGGTSICAGMRSAFQVIREVYPQIDGSE---------IVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + C ++ G I IA + Q + + + F A+ + Sbjct: 416 NSAKD-----CIDEVTQSGAIIHLIALGPSADQAVIEMSAM----TGGNHFFASDEAQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F ++ ++ +++ Sbjct: 467 GLIDAFGALASGNTDLSQQPLQL 489 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 65/202 (32%), Gaps = 45/202 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ +I + R+G + + + + ++ Sbjct: 110 NRLEKAKRLVSEIINQLAS-------DRIGIIAYAGQAFPQLPITTDY-GAAKMFLQNMN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAIN+A++ A + + + +++DGE+ + A Sbjct: 162 TNMLTSQGTAINEAIELATTYYDDEEQ-----------TNRVLFIISDGEDHSEGTTLKA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-----------------------EKARYFLSNCAS- 424 + + A +GI+I TI +K + L + AS Sbjct: 211 V-DDAIEEGIQIFTIGVGKSKGAPIPIKRNGVVESLKKDRQGEVVITKLNETILQDIASE 269 Query: 425 -PNSFFEANSTHELNKIFRDRI 445 + + ++T + + ++ + Sbjct: 270 GEGEYIDGSNTSDAVEQIKEIL 291 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 55/159 (34%), Gaps = 13/159 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D VR GA ++ ++ D + G T A+ Sbjct: 842 VKKADVAKDRVRFGALKYSYDPTILFYL--DEFDTRSKVISLLQNDSPKGGDTYTAKALA 899 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E + N + + ++++TDGE+ N+ + +GI I+ + Sbjct: 900 FSEHMFT-----EARGSRINQKVPQVLIVITDGESHDANQLNAT-AKALRDKGILILAVG 953 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 L+ S + +F + L IF++ Sbjct: 954 I-----AGANTEELLAMAGSTDKYFFVETFGGLKGIFQN 987 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 37/123 (30%), Gaps = 15/123 (12%) Query: 319 LIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + + G+T A+ + + + + ++++LTDG Sbjct: 500 SNKNDLGKAIDNIWQLGGNTNTGAALDFTLGLLQ------RAKTQRGNKVPCHLIVLTDG 553 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + E K K + I + I ++ L + + + L Sbjct: 554 MSDDNVLEP---AKKLKDENINVYAIGV-----KEANRTQLLEIAGTEKRVYYVYNFDSL 605 Query: 438 NKI 440 I Sbjct: 606 KDI 608 >gi|301767086|ref|XP_002918971.1| PREDICTED: collagen alpha-1(XXI) chain-like [Ailuropoda melanoleuca] Length = 957 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G H +V G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSHDSGENLVAAMESIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFYVEDYIAISKI 203 >gi|254292588|ref|YP_003058611.1| hypothetical protein Hbal_0212 [Hirschia baltica ATCC 49814] gi|254041119|gb|ACT57914.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 447 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/174 (9%), Positives = 49/174 (28%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 K KS G+ ++ AL + +++ V +D+ + S L+ A + A+ + S Sbjct: 5 KFCKSTQGNVSVMAALFIGLLIAVIAGTIDISQKSSLNRELQSVVDAAALAAAREMAVSS 64 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + ++ + A + + Sbjct: 65 ADQTRVQSVASSYVDANWTGEQATTHAVLDVSKGIITVSSTAPKTIASILKKDQKETNFY 124 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + + I + A + + ++ + Sbjct: 125 AEAVAEVSGGGNVCLIGLSDHEQGTINLQQRARITAENCQVYSNSDDRYSMIVR 178 >gi|304320960|ref|YP_003854603.1| hypothetical protein PB2503_06977 [Parvularcula bermudensis HTCC2503] gi|303299862|gb|ADM09461.1| hypothetical protein PB2503_06977 [Parvularcula bermudensis HTCC2503] Length = 433 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 70/213 (32%), Gaps = 12/213 (5%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L G ++ A+++ +G+G ++VD R ++ A+ A++ Q Sbjct: 17 LQRLRGDQRGSIAVLMAVMLVPSVGLGALIVDGSRMRTAHLEIQIVAEA----AALAAAQ 72 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +L V + + + + ++ N + TD E +A + Sbjct: 73 NLPSVDDAREAATDYAEANLDPTKYGNVVRS-----TDVEFGTYDDSNGTFSVGGTTAVR 127 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI---QWVIDFSR 187 V D +LF +G S + A A + + + + + +D Sbjct: 128 VTAGRTEDRSNAFSTLFGGVIGRPSVDLTASAIAVAETSGGNPICILVLGFGYYGLDMDG 187 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + D Q + + SY + Sbjct: 188 DINVDIPDCGIQVNSDDDDAMNSKDDSYVNAAY 220 >gi|46205147|ref|ZP_00048963.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Magnetospirillum magnetotacticum MS-1] Length = 125 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 32/92 (34%), Gaps = 10/92 (10%) Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS--VNKTQQEKARYF-----LSNC 422 +VLL+DG N + A+ G+R+ TIA + + L Sbjct: 1 MVVLLSDGANNAGQTAPKDVAQLARELGVRLYTIALGPIDMADNPDNEQDVVDVETLRAM 60 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 A S F +T +L + + I R Sbjct: 61 AEVSGGKAFRVKTTDDLENV-ANAIDELEGGR 91 >gi|239616825|ref|YP_002940147.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239505656|gb|ACR79143.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 730 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 53/179 (29%), Gaps = 24/179 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + AL V++ + + D R FN+ V + RT Sbjct: 290 KIEKAKLALLQVLQMLHEGD------RFSIITFNNEVNNLTERLLPFS--DRTEWYPAVK 341 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I+DA+ + + ++ + K ++ LTDG T+ + I Sbjct: 342 QIMAGGMTNIHDALLEGIEVL---------GTQSTDDRYKVVLFLTDGAPTEGITDIGTI 392 Query: 390 C----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 AK + + + + + E+++ + Sbjct: 393 IRDSTKLAKVRDVHLFVFGVGYDVNAELLDELAEK---GGGKVKYIVENEEIDEKVLEL 448 >gi|292619294|ref|XP_692164.4| PREDICTED: integrin alpha-M-like [Danio rerio] Length = 806 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/338 (12%), Positives = 94/338 (27%), Gaps = 32/338 (9%) Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET--VSRSYHKEHGVSI 179 + ++ Y ++ S + S + + + V++ + ++ Sbjct: 11 KTFTGPSQSQNVAFGYKVIQKDTSSLIVSDPLIQINQDKRGQIYNCEVAKGTCTQLNITG 70 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 V + +M + + K+ ++ G+ S Sbjct: 71 TTVPSEAVNMSLGFSMVQHPQSSKLAICGPTIPKNCTTATNYRGMCFIGNSGD----FGP 126 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN---VNDTVR 296 + +D S + +++ + +VIR + D + Sbjct: 127 PIPKSKYRDCLGQIDIAFLLDGSGSIGIY-DFTVMKGFVTNVIRRFIERDAQFAIAQYSN 185 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +N + T A G T A+Q + + + Sbjct: 186 DCDIHYN---------FNDLKLDDGTWESKVANIPYHEGGTFTASAIQKLVNYLFT---- 232 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS-VNKTQQEKA 415 AKK +V++TDGE + D ++A+ I I + Sbjct: 233 --PNGGTRPSAKKILVVITDGE-SHDRNLLKDAASQAEKNSIVRFAIGVGKAFDYYNARE 289 Query: 416 RYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEIF 450 L+ AS + F+ + L I + GN I Sbjct: 290 E--LNTIASDPDTDYVFKVTDFNALKNILQKLEGNIIA 325 >gi|113931618|ref|NP_001039260.1| inter-alpha (globulin) inhibitor H2 [Xenopus (Silurana) tropicalis] gi|72679293|gb|AAI00666.1| hypothetical protein MGC107982 [Xenopus (Silurana) tropicalis] Length = 942 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/248 (11%), Positives = 73/248 (29%), Gaps = 12/248 (4%) Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S ++ NG + ++ + Y L+ + + Sbjct: 258 PTCSTTAVNGNLLVKYDVKREKSSHLQVFNGYFLHFIAPENLSPLPKNILFVIDVSGSMW 317 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ + ++ ++ + + FN + + + + Sbjct: 318 GLKMKQTVEAMKSILEDLRP---DDQFSIIDFNHNIRCWKDELVYASSVEKQDASKYVQS 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T IN+A+ A + ++ + + IVL++DG+ T + I Sbjct: 375 IQPNGGTNINEALLRAIFILKEASN----KGMLEPNSVSLIVLVSDGDPTVGEIKLSKIQ 430 Query: 391 NKAKSQGIR----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ IR + ++ + R N + T K F ++ Sbjct: 431 KNVRT-NIRDDFALHSLGIGFDVDYDFLERLAQENHGMAQRIYGKQDTAAQLKEFYKKVS 489 Query: 447 NEIFERVI 454 + + +I Sbjct: 490 TPLLKNII 497 >gi|291087628|ref|ZP_06346959.2| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291074491|gb|EFE11855.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 473 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 17/147 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+D + S V + R G + +A++ A + Sbjct: 91 RVGLVSFSDTATVNSMLSPVVEQAARAAE-----GLTAGGKSNQAEAIRAAVKLLDMKTP 145 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +K + L+TDG+ T + + +A+ G+ + I + +A Sbjct: 146 G-----------EKMLFLITDGQ-TPFRSQTDSAAAEARQAGVTVYCIGIAAPDGVNREA 193 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFR 442 ++ S + E E F Sbjct: 194 LRSWASGPSDSHIIEIRELGEAQTAFE 220 >gi|160878421|ref|YP_001557389.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160427087|gb|ABX40650.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 551 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 42/362 (11%), Positives = 97/362 (26%), Gaps = 25/362 (6%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 +N + +V + + L ++ + + Sbjct: 35 NSTTGQNGSKSSSDINFGSQVALQDTTSLSKTESDGLYVSYKLDGDNSNNMDFSTEEYNA 94 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + + T S +I+ ++ +D + LN F Sbjct: 95 VIEQGYQSTKNHPLSTFSADVDTASYSNIRRML-KEGRRVDTGAVRIEEMLNYFNYDYKL 153 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 S + D +P + + +D S K Sbjct: 154 PE-GDSPFGITTELSDCPWNPDTKLFLAGIQTEKIDFSKSAPSNLVFLIDVSGSMMDEDK 212 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ A + ++ + D R+ + G + ++ + Sbjct: 213 LPLVQRAFLLLTENLTEKD------RISIVTYAGNDTVVL---SGAKGNQKEKIQNAITE 263 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 GST + ++TAY + NN ++L TDG+ + Sbjct: 264 LEAGGSTFGSKGIETAYQL----AMENYIEGGNNR-----VILATDGDLNVGVTSESELT 314 Query: 391 NKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 N + G+ + + F + K + ++ +S E K+ + +G Sbjct: 315 NLIEEKRKSGVALSVLGFGTGNIKDNKMEALADH--GNGNYAYIDSLMEARKVLVEEMGA 372 Query: 448 EI 449 + Sbjct: 373 TL 374 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 42/415 (10%), Positives = 86/415 (20%), Gaps = 30/415 (7%) Query: 56 AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 ++ A + L+ + F + R F + V DI Sbjct: 195 TSEVANLGPGEALVVEIAYQQQVRYQDGEFSLRFPTAITPRYFPKGQVPDLEQASVNDIQ 254 Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 + + + ++ L D + I +T Sbjct: 255 GLNVLNESTQSDEQKLYLDVVLDAGMA----------ISRLETPYHQMRQTQLGGTKIGL 304 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 ++ D + +E F Q + + + V Sbjct: 305 NLTANLRPDRDFLLKWRPLLAEQPSAVMFAQLGKTHEFKTHEFKNEESLASSQAHNDQVV 364 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK--------KHLVRDALASVIRSIKK 287 + + S + + I K Sbjct: 365 SEANHPAEAQASDKEAKDSYALVMLMPPQDKARVRLPRELTLVIDTSGSMTGDSIAQAKS 424 Query: 288 IDNVN-----DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 F+ V S + F G T + A Sbjct: 425 AILNALAGLGSQDTFNVIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPA 484 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + A + E K +V +TDG + I Q R+ T Sbjct: 485 LLRALS--QPESGVSSISSAVKPERLKQVVFITDGAVGNEASLFALIAANIGRQ--RLFT 540 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + R A ++ E++ + + ++ +T Sbjct: 541 VGIGAAPNGYFMERA---ARAGRGTYTYVGKISEVDAKIGELLEKIESPQISDVT 592 >gi|119494080|ref|ZP_01624623.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] gi|119452182|gb|EAW33385.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] Length = 608 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 53/169 (31%), Gaps = 18/169 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++D+ + + + + G T ++ D + Sbjct: 72 TADDIVSVVIYDDQPETILTPQT---VEDKAAICKQIGRIRAGGCTNLSGGWLMGCDCVK 128 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVN 408 S + ++LLTDG+ + I A++Q GI T+ F Sbjct: 129 SRQTSDRLNR---------VLLLTDGQANMGITDPKVITKTAQNQAETGIITTTLGFGSY 179 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + A+ +F+ S ++ ++FR + + +T Sbjct: 180 FNEDLLISM---ADAAGGNFYFIQSPDDVAQVFRIELESLTAVVAQNLT 225 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 45/155 (29%), Gaps = 19/155 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + A+ + + R+G F + Sbjct: 99 SNSMLAEDFEPNRLERTKYAINKLFDGL-------HQDRVGLIVFAGEPKVQLPITSDYR 151 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + K + TAI A+ A + E + ++L+TDG Sbjct: 152 MA-KAFAKRIDPSLVPVQGTAIGKALSQALMSFSGETE---------ENHSRVVILITDG 201 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 EN +D+ A A GIRI TI + Sbjct: 202 ENHEDDALAAAR--HAAEMGIRIYTIGIGTPEGAP 234 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 70/191 (36%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +++++ + + D R+G ++DR ++ Sbjct: 39 YDLVFLLDTSSSVG-KEDFEKVRQWVSNLVETFEI---GPDKTRVGVVRYSDRPSTEFDL 94 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G +K I + + G+T DA++ S E ++ KK + Sbjct: 95 --GKYKTREEIKEAARKIQYYGGNTNTGDALRY---ITTYSFSKEAGGRLSDRTVKKVAI 149 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A GIRI + E + L AS F Sbjct: 150 LLTDGRSQDFVLDP---ATAAHQAGIRIFAVGVG------EALKEELDEIASEPKSAHVF 200 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 201 HVSDYNAIDKI 211 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ +D + R+G ++ +V S S R ++ T Sbjct: 57 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLS---AFSRREDMERAIRALVPLAQGT 113 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + V++TDG + + + +A+++G Sbjct: 114 MTGLAIQYAMNVAFSVAEGARPPEARVP---RVAVIVTDG---RPQDRVAEVAAQARARG 167 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F +S + + Sbjct: 168 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVDSFDLIQE 206 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 322 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 376 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 377 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 430 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 431 ARAKEEGIVMYAVGVG------KAVEEELRKIASEP 460 >gi|303240541|ref|ZP_07327057.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302591943|gb|EFL61675.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 305 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 78/271 (28%), Gaps = 13/271 (4%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 L KS G F+ A+++ V++ + VDV ++ + L A + + L+ ++ Sbjct: 5 SLFKSKKGTTFVFFAIILTVIVAFAALSVDVGVIAFEKAKLSNTVDAAALAGAQELVTNV 64 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 N+ I + E+ + N ++R V+ T + Sbjct: 65 S-------NTNNVVNNYIAKNNSGLNESTIIVNESERSVKVTSSKTVENNFAKVFGNNSQ 117 Query: 133 -LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + + +S I V + + I + Sbjct: 118 DVIATAKAKVENISALNGCRPIAVIKQTFVYGNIYVLKEGASDGYNGNYAAIALGGNGGS 177 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 D+ + + D+ K + L + + Sbjct: 178 TYEDNLLNGYSGTIRVGDKIPTETGVIAQKTFTSIQTLVS-----RCNHTPPCTYNYYNK 232 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 + S F+ + +V KK++ A+ Sbjct: 233 NCSRIIFIPVVNTLYVNGKKYVDVVGFATFF 263 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 52/157 (33%), Gaps = 25/157 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A++ ++ + D R+ F+ R R +K Sbjct: 59 LETVKKAVSLLVDQLSSED------RLSIVVFDHRAKILVP---NQIISDRNQIKQQINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ + + +D + + LLTDGEN DN + Sbjct: 110 LTADGGTAIDEGLRLGIEELAKGKKDTISQA----------FLLTDGENEHGDNNRCLKF 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 A S + + T+ F N + L A Sbjct: 160 AQLAASYNLTLNTLGFGDNWN-----QDILEKIADAG 191 >gi|224048603|ref|XP_002193071.1| PREDICTED: collagen, type XXI, alpha 1 [Taeniopygia guttata] Length = 945 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 62/172 (36%), Gaps = 23/172 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + D +++G ++D + + G H+ +++ G+T Sbjct: 60 LVNITRNFDIGPKFIQVGVVQYSDYPVLEIPL--GTHESTENLIREMESIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A+D + K++ K V+LTDG + +E + +A+ I + Sbjct: 118 RAIQFAFDHLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDVAAEARKNKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEI 449 I E L A S F +++ ++ I ++ Sbjct: 166 FAIGVGS-----EIEEDELKAIANKPSSTYVFYVEDYIAISR-IKEVIKQKL 211 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 53/305 (17%), Positives = 104/305 (34%), Gaps = 62/305 (20%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 K+ I V+D S SM +P + T + + V I+D+ Sbjct: 89 KDKKKEIVLVMDTSTSMK-----CLVEPESYDIDNCVPTKEGHI-----VYIKDKSYLVN 138 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 S + + Y + + + + +K+ ++ A S I+ +KK +N Sbjct: 139 TTFLRGSRHKLFYIT--MGTTNYYIQGNKCYRQSSYNEKNRLKHAQESAIKFVKKFENDK 196 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + + +G F+ R I + + ++ +I G+T I +++A + Sbjct: 197 N-ISIGLVSFDTRAIEQKELTSSLSEVKSSINNLKVAY---NGATNIEAGLKSAQKIL-- 250 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENT---------------------------QDNEE 385 K N +A KY++L++DG T ++ Sbjct: 251 --------KKGNEDADKYVILMSDGFPTAFDYAGEKFEENFNEHEVQDNTFINFGYNDYR 302 Query: 386 GIAI------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 G A+ + K GI I FS ++ + A+ + EA +T LN Sbjct: 303 GYAMKHSINQADSLKKVGINSFIIGFSDGANSEKLNKI---AKAAGGEYEEARNTDALNG 359 Query: 440 IFRDR 444 + Sbjct: 360 AYNKI 364 Score = 43.8 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 52/170 (30%), Gaps = 28/170 (16%) Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + I +L+ + N YY+ + + + V+ Sbjct: 689 IEISSVQLASGNFTINGKKYYVKDNKVYEFN------------EQDRSRIDSVKKVANDF 736 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG--VHKLIRTIVKTFAIDENEMGSTAI 339 + K ++ + ++ + S + +K +T I Sbjct: 737 VDKFKD----DENTEIAIVRYSSKADVVLDNSNKVFLSSKDNETIKKRINSLKADVATNI 792 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D ++ +Y + K + +++KY++L+TDG T I Sbjct: 793 GDGIRKSYSIL----------DKCDKDSEKYMILMTDGVPTAYTCYANTI 832 >gi|47228042|emb|CAF97671.1| unnamed protein product [Tetraodon nigroviridis] Length = 1071 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 69/192 (35%), Gaps = 28/192 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + I +K I + D + A ++D VI + R+ +KT Sbjct: 27 IDQIKTFTTNFIDELKNIRHQCDRILTWNSGALHYSDEVILVGELMD--MQTQRSTLKTS 84 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +G T + A++ ++ ++ KYIV++TDG +E Sbjct: 85 ISGIEYIGKGTYTDCAIKRGLSELL--------IGGSHYHENKYIVVVTDGHPLTGYKEP 136 Query: 387 ----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELN- 438 N+A+ G+++ +A ++ LS A+ +++ F A ++ Sbjct: 137 CGGVQEAANEARQHGVKVFAVAI-----SPDQEDTRLSLIATDHNYRQNFTAADDSKITK 191 Query: 439 -KIFRDRIGNEI 449 I + Sbjct: 192 MDTIHTIINMIV 203 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL ++ D+ G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VISVLQNDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ A + +GI Sbjct: 887 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNATAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANTWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 984 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 58/195 (29%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGNTQPTDFHEMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYTNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G + Sbjct: 497 DLGKAIDNIRQMGGNTNTGAALNFTLTLLQ------RAKKQRGNKVPCHLVVLTNGMSQD 550 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 K + + IR+ I +E + L A + + L I Sbjct: 551 SVLGPAH---KLREENIRVHAIGV------KEANQTQLREIAGDEKRVYYVHEFDALRDI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 10/132 (7%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + ++ D + VR+G F+D S+ + + G T I Sbjct: 1022 LASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIF--GYTHIG 1079 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 DA++ + N + +++LTDG + +E + + +G+ I Sbjct: 1080 DALRKVKHYFRPDT-----GSRINAGTPQVLLVLTDG---RSQDEVAQAAEELRHKGVDI 1131 Query: 401 MTIAFSVNKTQQ 412 ++ QQ Sbjct: 1132 YSVGIGDVDDQQ 1143 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 44/144 (30%), Gaps = 15/144 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + S+ +D + +R+G +++ S S G + +T V D + A Sbjct: 251 LGESVSALDIKENCMRVGLVAYSNETRVISSLSMG---VNKTEVLQRIQDLSPHVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T ++ + +G+ + Sbjct: 308 GA---ALRKTRKEVFSAQRGSRKNQGVPQIAVLVTH---RASDDNVTKAAVNLRREGVTV 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS 424 T+ L AS Sbjct: 362 FTMGVEG------ANPEQLEKIAS 379 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 39/122 (31%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQNDIANAIDQMAHIGETTLTG------SALTFVSQYFSPEKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 ++ + G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVKDP---AVALRKDGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL ++ D+ G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VISVLQNDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ A + +GI Sbjct: 887 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNATAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANTWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 984 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 58/195 (29%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGNTQPTDFHEMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYTNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G + Sbjct: 497 DLGKAIDNIRQMGGNTNTGAALNFTLTLLQ------RAKKQRGNKVPCHLVVLTNGMSQD 550 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 K + + IR+ I +E + L A + + L I Sbjct: 551 SVLGPAH---KLREENIRVHAIGV------KEANQTQLREIAGDEKRVYYVHEFDALRDI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 10/132 (7%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + ++ D + VR+G F+D S+ + + G T I Sbjct: 1022 LASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIF--GYTHIG 1079 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 DA++ + N + +++LTDG + +E + + +G+ I Sbjct: 1080 DALRKVKHYFRPDT-----GSRINAGTPQVLLVLTDG---RSQDEVAQAAEELRHKGVDI 1131 Query: 401 MTIAFSVNKTQQ 412 ++ QQ Sbjct: 1132 YSVGIGDVDDQQ 1143 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 44/144 (30%), Gaps = 15/144 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + S+ +D + +R+G +++ S S G + +T V D + A Sbjct: 251 LGESVSALDIKENCMRVGLVAYSNETRVISSLSMG---VNKTEVLQRIQDLSPHVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T ++ + +G+ + Sbjct: 308 GA---ALRKTRKEVFSAQRGSRKNQGVPQIAVLVTH---RASDDNVTKAAVNLRREGVTV 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS 424 T+ L AS Sbjct: 362 FTMGVEG------ANPEQLEKIAS 379 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 39/122 (31%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQNDIANAIDQMAHIGETTLTG------SALTFVSQYFSPEKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 ++ + G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVKDP---AVALRKDGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 48/158 (30%), Gaps = 21/158 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++G + + + + ++T T I A+ A + Sbjct: 128 DDKIGLIVYAGEAYTQLPITSDYVSA-KIFLETINPSMITTQGTDIKQAIDLAMKSFT-- 184 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 +N + K I ++TDGE+ + + A +GI++ + + Sbjct: 185 ---------SNQDVSKAIFVITDGEDNEGGAV--EMAKAAAEKGIKVYVLGVGSPQGAPI 233 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + N+ + + + + EI Sbjct: 234 PMP-------GSSQYITDNTGNVVVSKLNESMCREIAA 264 >gi|324997883|ref|ZP_08118995.1| hypothetical protein PseP1_03919 [Pseudonocardia sp. P1] Length = 329 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 52/148 (35%), Gaps = 19/148 (12%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS-SNEDEVHRMKNNLEAKKYIVLL 374 R VK + STA +A+ TA +I + S + + +VLL Sbjct: 142 SPTTDRNAVKNGVDNLQLAESTATGEAIFTAMQSIDTFSRSLQGGPDQQGTPPPARVVLL 201 Query: 375 TDGENTQDNEEGI-------AICNKAKSQGIRIMTIAFSVNKTQ---------QEKARYF 418 +DG T +G + A+ +GI + TI+F + Sbjct: 202 SDGTQTVPGPDGENEPRGSFTAASDAQRRGIPVSTISFGTSYGSIELDGGRTPVAVDDAS 261 Query: 419 LSNCA--SPNSFFEANSTHELNKIFRDR 444 + A S FF A + EL ++ D Sbjct: 262 MERIASLSGGRFFTAATESELRAVYSDL 289 >gi|313902401|ref|ZP_07835804.1| Protein of unknown function DUF2134, membrane [Thermaerobacter subterraneus DSM 13965] gi|313467332|gb|EFR62843.1| Protein of unknown function DUF2134, membrane [Thermaerobacter subterraneus DSM 13965] Length = 308 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 21/39 (53%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK 54 +S G ++ A+ + V+LG+ G++VD R L+ Sbjct: 17 RSQQGGVAVLVAISLAVLLGMVGLVVDGGRLYAERTRLQ 55 >gi|238064101|ref|ZP_04608810.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885912|gb|EEP74740.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 626 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 71/244 (29%), Gaps = 25/244 (10%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + ++ G G +E+ Y + Y ++ + +D S Sbjct: 400 NPDLTVFNPAWGIQGTINEQGIQYPSAPVIQAALDRYQTRYRRPVAAYYCLDGSGSMGDN 459 Query: 269 KKKHLVRDALASVIRSIKKIDN---VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + A V + N + FN V W V ++ Sbjct: 460 DGWTGIEAAATQVFDPEQAAQNLLQTHPQDVTTVAIFNGGVTGGSP--WQVRGNDGDALR 517 Query: 326 T---FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 D G T + + A + + K+ +VL+TDG+ Sbjct: 518 DLARSVADYEPEGGTNMYACLLRATTELTGQQNGDR---------KRLVVLMTDGQ--SG 566 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 E+ + +S + ++ IAF + ++ A+ +F +L R Sbjct: 567 AEQRDDALDALRSADVPVVAIAFGRDADPRQLEEV---AKATNGTF---VREDDLVAALR 620 Query: 443 DRIG 446 G Sbjct: 621 QAAG 624 >gi|156742365|ref|YP_001432494.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233693|gb|ABU58476.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 412 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 58/178 (32%), Gaps = 25/178 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +++A VI ++ D V+ F+D V + ++ R +K Sbjct: 58 KLAALKEATRRVIDTLTPQDIVS------IVLFDDTVQTLVPATF---ATDRDALKAQVD 108 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 E G TA++ M + ++ ++LLTDG+ D + A+ Sbjct: 109 AIEEAGGTAMSGGMAAGIVELRKHHDPGRVSA---------MLLLTDGQTWGDEDRCRAL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRI 445 + +R+ + L + A+ ++ F+ + Sbjct: 160 AQELARDHVRVTALGLGAEWN-----EKLLDDIADATGGLSDYIADPSQITTFFQHAV 212 >gi|123232279|emb|CAM16354.1| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H family (plasma Kallikrein-sensitive glycoprotein) (ITIH) [Danio rerio] Length = 860 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 59/419 (14%), Positives = 118/419 (28%), Gaps = 17/419 (4%) Query: 51 HALKQAAQTA---IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 + L A + A +++S + + + + + + + Sbjct: 71 NQLHIAKEAAFEVDLSSSAFISNFTITSNHKVYVAQVRRRTDARKIYDNAKKQGKTAGLV 130 Query: 108 DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM---GIKSWLIQTKAEA 164 + R++ + P + LS L L G + + Sbjct: 131 ATKEREMEKFRVAVNVPPGARVSFSLSYEELLSRRLGRYELSLGLRPGQPVQNLSLEVSI 190 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + V ++ID Q C +++ + + K G+ Sbjct: 191 SERTGISFIRALVLFLFLIDLLADAEAPPSTKVKQNAYCAHVRYTPSIQQQRNVSPK-GL 249 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + Y V + + + VI + + + I+ Sbjct: 250 SADFIIQYDVELKDPMGDIQVDDGYFVHYFAPRGLPVVPKD-VIFVIDISGSMIGTKIKQ 308 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID------ENEMGSTA 338 K + +FN SD +W + +R + G T Sbjct: 309 TKAAMVSILSDLREGDYFNLITFSDDVHTWKKDRTVRATRQNVRDAKEFVRKIIAAGWTN 368 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAK-KYIVLLTDGENTQDNEEGIAICNKA-KSQ 396 IN A+ +A + S + + I+ LTDGE T E I + A KS Sbjct: 369 INAALLSAAKLLNPSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDVILHNAQKSL 428 Query: 397 G-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 G + + +AF + R L N +E + K F D + + + Sbjct: 429 GLVSLFGLAFGDDADFPMLRRLALENRGVARMVYEDDDAAIQLKGFYDEVATPLLSDIQ 487 >gi|187918047|ref|YP_001883610.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] gi|119860895|gb|AAX16690.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] Length = 341 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 58/192 (30%), Gaps = 32/192 (16%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 IK + +G F + R + + +D +A+ + Sbjct: 130 IKYFAYQRENDNIGLVAFAKEASLIVPLTIDRDFFSRKLDDIYIMDL--GNGSALGLGVS 187 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + K++ K+ +++LTDG D + N A+ ++I +I Sbjct: 188 IALSHL-----------KHSEAPKRSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSIG 236 Query: 405 FSVNKT-----------------QQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRI 445 + ++ L AS F+ + +D Sbjct: 237 IGSGEELSVGFKLRSGKFYQGTLREVYDPSMLLEIASKTGGLFYSVSDDFSFKLAIQDFS 296 Query: 446 GNEIFERVIRIT 457 E ER +RIT Sbjct: 297 KKENVERKVRIT 308 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 60/453 (13%), Positives = 115/453 (25%), Gaps = 78/453 (17%) Query: 54 KQAAQTAIITASVPLIQSLEEVSS-RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 A A + ++ P + + +++ + + T E R Sbjct: 77 SDVADIAGQDVAGQSATDDDDAQCLTDDGESQQPSNEQAQSQLQSQSQPEQPSETMPEQR 136 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 T ++ Q V + P + + + + V+ Sbjct: 137 QPCDKTEQPNEQKEQERQAVRQPTAENADVPANGAAPAQDGDRNDHAGENRTDAVANGDA 196 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 D + D + + + + + + Sbjct: 197 GATA-DDGKTGDADDADNDGNTAEDADDAEHIMRDRFTFNRKTVMRAARNVAVPQPDHTK 255 Query: 233 MVSCNKSLYYMLYPGP--------------------LDPSLSEEHFVDSS---SLRHVIK 269 ++ N Y L LD S S + +D S + K Sbjct: 256 SITYNNGGKYTLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCMDGSQRGCNKSNPK 315 Query: 270 KKHLVRDALASVIR----SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + +++A S I + I + N VR+ F S + L + Sbjct: 316 RLTALKEAATSFIDATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSS--- 372 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G+T + M A ++ + AKK ++ DG T N Sbjct: 373 --VSRLSANGATPADKGMAAAQTALLRARPG----------AKKVVIFFADGVPTTQNTF 420 Query: 386 GIAICN-------KAKSQGIRIMTIAFSVNKTQ---------QEKARYFLSNCASP---- 425 + N KS G I +I ++A F+ +S Sbjct: 421 STRVANDAVTTALAMKSAGTLIYSIGIFEGANPEQQSFGNRENDQANQFMHAVSSNYPNA 480 Query: 426 --------------NSFFEANSTHELNKIFRDR 444 + NS +L KIF D Sbjct: 481 TAYNKTNWGTGSNLGYYKATNSADDLTKIFDDI 513 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 62/198 (31%), Gaps = 28/198 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R A + ++ T+++G F + + S + R + Sbjct: 127 TRVDAARQAAIKFVDEMEP------TLQLGLVTFAGTAQTLIAPSSDHEVVKRALD-DAI 179 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 + TA + + TA I + + + IVL +DG+ T ++ Sbjct: 180 RPDKLAARTATGEGIYTALQQIET---LKGILGGASKAPPARIVLESDGKETVPDDLNAP 236 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQQE-----------KARYFLSNCA--SPNSFFE 430 +AK++ + I +I+F L A S FF Sbjct: 237 RGAFTAAKEAKAKEVPIYSISFGTASPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFT 296 Query: 431 ANSTHELNKIFRDRIGNE 448 A+S +L ++ Sbjct: 297 ASSLDQLTDVYSSLNAEI 314 >gi|115963085|ref|XP_001182555.1| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1245 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/290 (10%), Positives = 77/290 (26%), Gaps = 23/290 (7%) Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + +S + L + G + +T ++ Sbjct: 394 SQEYEVSPTRIVQKGKLMFYGTCSGNSAIGYETGCIVRITGADQASGSIMNGPLTYSNIV 453 Query: 188 SMLDYQRDS-----EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + D + + N + + + + + S + Sbjct: 454 NFCDDDVNGIGNLHNREAPNKHNRLCNSRSAWEVMREHEDFNDNSPPSNLN-DAQIEPSF 512 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L +D+S + V A + + + + +G F Sbjct: 513 DLVQASTGDECRVVLVLDTSGSMGTSNRIDKVNSAATAFVNLVDDG------ISIGIVTF 566 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + R ++ G T I ++ + +++ Sbjct: 567 TGSPTTRHALTQINTQADRDSLRD-IFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGG- 624 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 IVL+TDG+++ I + G+R+ T+A + + Sbjct: 625 -------IIVLMTDGQDSGIQNHIIR--QTLQDMGVRVNTVAIGEDAYGE 665 >gi|32477849|ref|NP_870843.1| hypothetical protein RB13068 [Rhodopirellula baltica SH 1] gi|32448406|emb|CAD77921.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 499 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 23/59 (38%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I+ +L+ + + G+L+D+ + ++ + A + L + + Sbjct: 108 RGGAVLILIVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGANQTT 166 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 63/203 (31%), Gaps = 30/203 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + + + +G F+ + + Sbjct: 312 DKSGSMSGFNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTANIKSNLIQIISSK 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I M++A+ I IVLLTDGE+ Sbjct: 367 ERNNLLESLPK-AANGGTSICAGMRSAFQVIREVYPQIDGSE---------IVLLTDGED 416 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + C ++ G I IA + Q + + + F A+ + Sbjct: 417 NSAKD-----CIDEVTQSGAIIHLIALGPSADQAVIEMSAM----TGGNHFFASDEAQNN 467 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F ++ ++ +++ Sbjct: 468 GLIDAFGALASGNTDLSQQPLQL 490 >gi|332885553|gb|EGK05799.1| hypothetical protein HMPREF9456_02063 [Dysgonomonas mossii DSM 22836] Length = 580 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 50/424 (11%), Positives = 117/424 (27%), Gaps = 44/424 (10%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 T +L+ + + + ++ + + + S S Sbjct: 6 TIILVLSVFLLFSCS-------SNKTKVQDSK-------TAENTTITPKTESETNQSSPL 51 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 I + + +++K+ D V D+ L N Sbjct: 52 SDAYIVQDEKQISLGDVRKSKMDISVADVSALEMTTSRSIS-----SLRMESSPYDNLQD 106 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + + T S +++ I S+ + + +N F Sbjct: 107 EEYSQFSENRFRLANEDPLSTFSVDVDAASYSNMRRFI-NRGSLPNKDAVRIEELINYFS 165 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + + + L G P+ + +D S Sbjct: 166 YNYTEPT-GNDPIKISTEVGNCPWNSQNRLVKIGLKARSMAGENLPASNFVFLIDVSGSM 224 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + LV+ +L + ++++ D R+ + + G + +K Sbjct: 225 YGATRLDLVKSSLKLLTNNLREKD------RVAIVVYAGSAGEVLPSTSG---ANKQKIK 275 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + N GST +Q AY + + NN I+L TDG+ Sbjct: 276 EALDNLNAGGSTTGGAGIQLAYKI----AKQNFIKGGNNR-----IILCTDGDFNVGVSS 326 Query: 386 GIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + G+ + + + + + K + A + ++ E NK+ Sbjct: 327 NDGLLALIEQERKSGVFLSILGYGMGNYKDSKMQTLAQ--AGNGNHAYIDNLQEANKVLV 384 Query: 443 DRIG 446 + G Sbjct: 385 NEFG 388 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 52/142 (36%), Gaps = 22/142 (15%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 FND D G + R + G TAI A++ AYD + + Sbjct: 475 FNDASNWDIPL--GRVRESRPELHDAIGRIKASGPTAIFPALRNAYDALANV-------- 524 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K+++LL+DG++ ++ + K + I + T+A + L N Sbjct: 525 ---RVRAKHVILLSDGQSDPED--FEGLVRKMSAAHITVSTVALG-----PDADAALLRN 574 Query: 422 CAS--PNSFFEANSTHELNKIF 441 AS + ++ +IF Sbjct: 575 LASWGGGRSYVVQDAQQIPEIF 596 >gi|119899150|ref|YP_934363.1| hypothetical protein azo2860 [Azoarcus sp. BH72] gi|119671563|emb|CAL95476.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 343 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 55/197 (27%), Gaps = 47/197 (23%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS- 352 R+G F + + ++ I + TAI M + T++ Sbjct: 126 DTRVGVVAFAATAALIQAPTRNHDDVLAAIDRVQLQR-----GTAIGSGMVLSLATLLPE 180 Query: 353 --------SNEDEVHRMKN------------NLEAKKYIVLLTDGENTQDNEEGIAICNK 392 + + K A IVLLTDGE T A Sbjct: 181 AGIDLRLLAGDGSPPADKPPPGEPTHAPVPPGSHAYGAIVLLTDGERTTGPPLDFAT-RL 239 Query: 393 AKSQGIRIMTIAFSVNKTQQE----------KARYFLSNCA--SPNSFFEANSTHELNKI 440 A G+R+ T+ + L + A + +F A S L I Sbjct: 240 AADHGVRVYTVGVGTAEGGVVGYEGWSMRVRLDEAALKSIADETRGEYFHAQSAEALRTI 299 Query: 441 FRDRIGNEIFERVIRIT 457 +R R+T Sbjct: 300 YRKL--------GTRLT 308 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 58/187 (31%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +++ + D V+ F+ R G+ L I + Sbjct: 58 LETVKKAAIELVKQLNPEDRVS------VIAFDHRAKVIVPN-QGIEDLNTVIEQ--IKA 108 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ ++ V + I LLTDGEN DNE + + Sbjct: 109 LRAAGGTAIDEGLKLGIKESALGKQERVSQ----------IFLLTDGENEHGDNERCLKL 158 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 A I + T+ F + L A S + + + F Sbjct: 159 AQVASDYNITLNTLGFG-----NHWNQDVLEKIADSAGGSLSYIENPEKALEEFSRLFSR 213 Query: 448 EIFERVI 454 + Sbjct: 214 AQSIGLT 220 >gi|116624980|ref|YP_827136.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228142|gb|ABJ86851.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 47/162 (29%), Gaps = 22/162 (13%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 N FN + FS + + TA+ DA+ A + Sbjct: 118 GNPEDMAALYTFNWEIREQQPFSRDLRAFD-----NRLKMMHGEAGTAMYDAVYLAAQRL 172 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---AFSV 407 + +K IV++TDG +T A+ I I + Sbjct: 173 EP------------RDGRKVIVVVTDGGDTVSRLSVQKALEAAQLADAVIYAIVVVPITN 220 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + + + L A + F EL+K F D I Sbjct: 221 DAGRNIGGEHALDFMAKGTGGRIFMPTLGAELDKAFADIITE 262 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 46/144 (31%), Gaps = 21/144 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A++ +I ++ R+G F + + + Sbjct: 110 NRLENAKRAISQLIDNL-------HNDRIGIIIFAGEAYVQLPITTDYSAA-KLFLNNIT 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI A+ + N K ++L+TDGEN +D+ + Sbjct: 162 TDIVPTQGTAIGAAIDMGMKSFNFVN-----------GTSKAMILMTDGENHEDDAV--S 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ 412 +A ++ + I I + Sbjct: 209 AAKRASAKDVAIHVIGVGSEEGAP 232 >gi|149559056|ref|XP_001512734.1| PREDICTED: similar to collagen, type XXI, alpha 1, partial [Ornithorhynchus anatinus] Length = 225 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 72/225 (32%), Gaps = 31/225 (13%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 L Y + + + P D + I KK ++ K Sbjct: 27 SLHNYTSTGDGEIRSSCRTAPTDLVFILDGSYSVGPENFEIVKKW--------LVNITKN 78 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D +++G ++D + + G H + +++ + G+T A+Q A Sbjct: 79 FDIGPKFIQVGVVQYSDYPVLEIPL--GSHDSLENLIQAMESIQYLGGNTRTGKAIQFAL 136 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D + K+ K V+LTDG + +E A+ I + I Sbjct: 137 DHLF---------AKSQRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITMFAIGVGS 184 Query: 408 NKTQQEKARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEI 449 E L A+ S+ F ++K R+ I ++ Sbjct: 185 ETEDAE-----LRAIANKPSYTYVFYVEDYIAISK-IREVIKQKL 223 >gi|89095762|ref|ZP_01168656.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89089508|gb|EAR68615.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 432 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 48/140 (34%), Gaps = 14/140 (10%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + K G T + +++ Y+ + + K + + + + Sbjct: 203 YDANTYDQAAFKAALEKFKPSGWTPLAASIKAGYEDL---------KKKAGEDTENILYI 253 Query: 374 LTDGENTQDNEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 ++DG T + + +++ I F V+ Q++ + A ++ Sbjct: 254 VSDGIETC-EGDPVKEAKALADSDLNMKVHIIGFDVDDAGQDQLKKTAE--AGNGKYYTV 310 Query: 432 NSTHELNKIFRDRIGNEIFE 451 NS EL + +G I Sbjct: 311 NSKLELTNTLNELMGEAISS 330 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 57/178 (32%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I ++K D + VR GA + D ++ Sbjct: 822 DYDEYNIMKDFMIDIVRKADVGMNRVRFGALKYADDPEVLFYLDN--FGTKPEVISVLQT 879 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ G+T +A+ + E + N + ++++TDG++ ++ Sbjct: 880 DQAMGGNTYTAEALGFSDHMFT-----EARGSRLNKGVPQVLIVITDGDSHDADKLNAT- 933 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L I D + Sbjct: 934 AKALRDKGILVLAVGI-----ADANPVELLAMAGSSDKYFFVETFGGLKGIISDVTAS 986 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 60/183 (32%), Gaps = 24/183 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + ++ ++ D N VR+GA F+D + G + I Sbjct: 1017 FKKMKEFMVSVVQDFDVSNKRVRIGAAQFSDAYRPEFPL--GTFIGAKEISIQIENITQI 1074 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I A++ + N + +++LTDG Q +E Sbjct: 1075 FGNTHIGAALRKVEHYFRP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQAAEAL 1126 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + +GI I ++ + + + ++ EL ++ +R+ Sbjct: 1127 RRRGIDIYSVGIGDVDD-----QQLMQITGTAEKKLTVHNFDEL---------KKVNKRI 1172 Query: 454 IRI 456 +R Sbjct: 1173 VRN 1175 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 49/147 (33%), Gaps = 18/147 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR+GA + D F + + + K G+T A+ + + Sbjct: 474 VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQLGGNTNTGAALNFTLSLLQKAK 531 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + ++V+LT+G + E N+ + + IR+ I +E Sbjct: 532 K------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGI------KEA 576 Query: 415 ARYFLSNCASPN-SFFEANSTHELNKI 440 + L A + + L I Sbjct: 577 NQTQLREIAGEEKRVYYVHDFDALKDI 603 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 44/124 (35%), Gaps = 18/124 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ + +K+++L+TDGE Sbjct: 681 MSQSDISNAIDQMAHIGQTTLTG------SALSFVSQYFSPTKGSRPSVRKFLILITDGE 734 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 +E + +G+ I ++ F N TQ L + P F + Sbjct: 735 AQDIVKEP---AVALRQEGVIIYSVGVFGSNVTQ-------LEEISGRPEMVFYVENFDI 784 Query: 437 LNKI 440 L +I Sbjct: 785 LQRI 788 Score = 39.6 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 251 GFLEESVSALDIKENCMRIGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 307 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 308 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 361 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 362 TIFTLGIKGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 415 Query: 453 VI 454 V Sbjct: 416 VS 417 >gi|220923701|ref|YP_002499003.1| hypothetical protein Mnod_3796 [Methylobacterium nodulans ORS 2060] gi|219948308|gb|ACL58700.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 439 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/237 (10%), Positives = 60/237 (25%), Gaps = 8/237 (3%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 S+ L + +G +I AL PV++G G+ + W + L+ AA + A V Sbjct: 15 SRALKRDVSGTAAVIAALAFPVVIGGMGLGAETGYWYLTQRKLQHAADLSAHAAGVRKRA 74 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY- 129 + A + L N+ + + E+ PR + Sbjct: 75 GDPKSQIDAAALNIALNSGMSSSLGNMLANSPPTSGIKAGDTSSLEVILTEVRPRLFSSV 134 Query: 130 ------QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++ + L S + + + Sbjct: 135 FSSEPVSIRARAVASIVAGSQACVLALSPAASGAVTLSGSTVVNLKGCDIASNSTAADAF 194 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 S S + + ++ + + + + Sbjct: 195 LMSGSSAALSAGCA-SSVGGAVTTIQLKLTQCATVKTSAPVVRDPYASVVEPAAFGT 250 >gi|29349873|ref|NP_813376.1| putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341784|gb|AAO79570.1| conserved hypothetical protein, putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] Length = 621 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 49/433 (11%), Positives = 122/433 (28%), Gaps = 45/433 (10%) Query: 27 ALLMPVMLGVG--GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 A+++ VML + G V + + + + + + ++ + Sbjct: 8 AMMLTVMLVITTLGGTV-WAQPLIVKGTVTDGSDGSPLVGCAVTVKGTSRGAVTNLQGQY 66 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-------MNPRKSAYQVVLSSRY 137 + E LI ++ K + V A + S +V ++ Sbjct: 67 RIEANKGETLIFSYIGFDKVEKVVFSAKMDVSLKASSDVLEECVVVGYGSQRKVAVTGAI 126 Query: 138 D----LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-------QWVIDFS 186 L S+ + ++ + + ++ + V S + + + S Sbjct: 127 STVNLASLRHPSVRMDAVNTEEYKSISENGFKQVGESPLSTFSIDVDAASYSNMRRMINS 186 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 ++ + +N F + + L P Sbjct: 187 GTLPVPDAIRTEELVNYFSYDY-AKPTGSDPVKITMEAGVCPWNADHRLVRIGLKAREIP 245 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + +D S + LV+ +L ++ ++++ D V + Sbjct: 246 TDKLPESNLVFLIDVSGSMWGPTRLDLVKSSLKLLVNNLREKDKVA------IVVYAGNA 299 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + G + ++ + GSTA +Q AY + NN Sbjct: 300 SVKLESTPGS---DKQKIRDAIDELTSGGSTAGGAGIQLAYKV----AKQNFLPKGNNR- 351 Query: 367 AKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+L +DG+ + K + G+ + + + + + K + Sbjct: 352 ----IILCSDGDFNVGVSSVEGLEQLIEKERKSGVFLSVLGYGMGNYKDNKGQALAEK-- 405 Query: 424 SPNSFFEANSTHE 436 + ++ E Sbjct: 406 GNGNHAYIDNLQE 418 >gi|268561232|ref|XP_002646396.1| Hypothetical protein CBG15365 [Caenorhabditis briggsae] gi|187027192|emb|CAP33692.1| hypothetical protein CBG_15365 [Caenorhabditis briggsae AF16] Length = 381 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 61/173 (35%), Gaps = 10/173 (5%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 A I + T R+G +N D ++ K + + E Sbjct: 55 SANLASIFSDAQIGINPSNPKTTRIGMVTYNSNATVDAHLNYDWPKNNDVLNFYSTMSEI 114 Query: 333 EMGSTAI-NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 ST+ +Q A + + S + N KK I++ + I + N Sbjct: 115 SEDSTSYVAHGLQAAQNLLYSESFGS-----NRSHYKKVIIVCASTFKGTGKNDPIPVAN 169 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRD 443 + K +G++I+TIA+ + + L+ +SP SF + + + + Sbjct: 170 RLKGRGVKILTIAYDQGDEKVVEE---LAKISSPGLSFKQDANMTLIENTLLE 219 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 81/229 (35%), Gaps = 20/229 (8%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 G + + + + K + ++A+ + + + NV+ Sbjct: 141 SNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDGKMKMDIAKEAIQQFVSDLPEAVNVS 200 Query: 293 DTVRMGATFFNDRVISDPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 V +++ + + + + K +T ++ +G T + +A++ + + Sbjct: 201 LRVYGHKGSNDEKDKTASCGAIENVYTLQKYNQTTLRQSLDGFQPVGWTPLAEAIKRSTE 260 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFS 406 T S+ K + +++DG T + K + I+ + I F Sbjct: 261 TFQSAK----------ANDKNIMYVVSDGVETCG-GNPVEEAQKVSNSNIKPIMNIIGFQ 309 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 V+ +++ + S + AN+ EL F++ G +I R ++ Sbjct: 310 VDHEAEKQLKEIAE--VSKGKYVLANNAKELQDQFKET-GKDITSRRLK 355 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL ++ D+ G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VISVLQNDQPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ A + +GI Sbjct: 887 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNATAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANTWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 984 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 58/195 (29%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGNTQPTDFHEMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYTNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G + Sbjct: 497 DLGKAIDNIRQMGGNTNTGAALNFTLTLLQ------RAKKQRGNKVPCHLVVLTNGMSQD 550 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 K + + IR+ I +E + L A + + L I Sbjct: 551 SVLGPAH---KLREENIRVHAIGV------KEANQTQLREIAGDEKRVYYVHEFDALRDI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 10/132 (7%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + ++ D + VR+G F+D S+ + + G T I Sbjct: 1022 LASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIF--GYTHIG 1079 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 DA++ + N + +++LTDG + +E + + +G+ I Sbjct: 1080 DALRKVKHYFRPDT-----GSRINAGTPQVLLVLTDG---RSQDEVAQAAEELRHKGVDI 1131 Query: 401 MTIAFSVNKTQQ 412 ++ QQ Sbjct: 1132 YSVGIGDVDDQQ 1143 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 44/144 (30%), Gaps = 15/144 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + S+ +D + +R+G +++ S S G + +T V D + A Sbjct: 251 LGESVSALDIKENCMRVGLVAYSNETRVISSLSMG---VNKTEVLQRIQDLSPHVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T ++ + +G+ + Sbjct: 308 GA---ALRKTRKEVFSAQRGSRKNQGVPQIAVLVTH---RASDDNVTKAAVNLRREGVTV 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS 424 T+ L AS Sbjct: 362 FTMGVEG------ANPEQLEKIAS 379 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 39/122 (31%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQNDIANAIDQMAHIGETTLTG------SALTFVSQYFSPEKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 ++ + G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVKDP---AVALRKDGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 62/190 (32%), Gaps = 31/190 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR A+ + S VR+G F + + + + R V+ D Sbjct: 104 MATVRTAVKDFLASA------PKDVRIGVVSFGNTAGPEIAPT-----TARAAVQAVVDD 152 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+TA+ + A + + IVLL+DG+NT + Sbjct: 153 LRADGNTALFSGVTQAVRML-------------GSTGDRSIVLLSDGKNTVGDRASGLAA 199 Query: 391 --NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR---I 445 + +R+ + F+ + E F A S +A + F+ + Sbjct: 200 AGKALTASQVRVEVVRFTTGENDPEALAAFAK--AGGGSVVQATDAEGVRTAFQTAAKVL 257 Query: 446 GNEIFERVIR 455 +++ + R Sbjct: 258 ESQVQFDITR 267 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/420 (9%), Positives = 103/420 (24%), Gaps = 45/420 (10%) Query: 38 GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN 97 G+ VDV + A+ VP E + K E + + Sbjct: 180 GIAVDV-----LPTPRQAVQDAAVEELKVPDKLYQAESFYVEVMVRSTFKTSGELRIYED 234 Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 ++ + + NP Y+ ++ ++ + Sbjct: 235 NREIGRERVDVTPGENRYAVKGLAKNPGLHRYR------AEIFMDGDEASANNAAFDFTR 288 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 ++ V + ++ + + + + S Sbjct: 289 VEGPPNVLIVEGTPGTSGNITAALESGMIGTEVIPPELLSLEAAKYAVYDSIIFNNVSGS 348 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 G + + + + + + K+ + Sbjct: 349 DVGGKQMELIEQAVRSFGIGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLG 408 Query: 278 LASVIRSIKKIDNVN----------------DTVRMGATFFNDRVISDPSFSWGVHKLIR 321 L VI +D +G F+D+ + Sbjct: 409 LILVIDRSGSMDGNKIELAKESAMRTVELMRAKDTVGVVAFDDQPWWVVPP---QKLGDK 465 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V + G T I A+ +A + ++ +++I+L+TDG+ + Sbjct: 466 EEVLSSIQSIPSAGGTNIYPAVSSALEEML-----------KIDAQRRHIILMTDGQ-SA 513 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 N + + I + ++A ++ A+ ++ L +F Sbjct: 514 MNSGYQDLTDTMVENKITMSSVAVGMDADTNLLQSL---ADAAKGRYYFVEDETTLPAVF 570 >gi|326433400|gb|EGD78970.1| hypothetical protein PTSG_11807 [Salpingoeca sp. ATCC 50818] Length = 2673 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 56/197 (28%), Gaps = 15/197 (7%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK---KKHLVRDALASVIRS-IKKID 289 + SLY + +L +D S +D + + + Sbjct: 638 YTSRDSLYVSVIADQCRQNLDLVFLLDGSGSIESTALGGAPGTFQDRVLAFVSQVTTYFT 697 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F + + + A G T + ++ Sbjct: 698 IGEHDTRVAVATFASGATVNIRLNDHFDG--DALRDAIADIPYPQGQTYTSLGLRAVRQD 755 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 I++ + + +V+LTDG N+Q + + + Q + + I + Sbjct: 756 ILTEANGMRPASEGVP---RVLVVLTDG-NSQPSYDPATEASILHDQNVNVFAIGVGSSI 811 Query: 410 TQQEKARYFLSNCASPN 426 + + L + AS Sbjct: 812 S-----QSQLEDIASDP 823 >gi|194291603|ref|YP_002007510.1| hypothetical protein RALTA_B0837 [Cupriavidus taiwanensis LMG 19424] gi|193225507|emb|CAQ71453.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 353 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/230 (12%), Positives = 57/230 (24%), Gaps = 58/230 (25%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A ++ + VR+G F + R + Sbjct: 106 NRISAAQQAARDLVVGLPAS------VRLGIVSFAGTAAVVLPPTDN-----RQDMLDAI 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISS----------------------------------N 354 TA + A + Sbjct: 155 ERFQLQRGTATGSGLFQALAVLFPEDGIDLEVILFGSRSDRAGRGTSLDEAAAADAARRR 214 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN------ 408 E + ++LL+DG T + + A +G+R+ T+ F Sbjct: 215 EQGQQAAQPGSYRHGAVILLSDGRRTTG-PDPLDAARMAAQRGVRVYTVGFGSQQVTSAP 273 Query: 409 ----KTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFER 452 + L AS ++ A S +L++++R ER Sbjct: 274 ESSLSYFMQLDEPALRAVASITGGEYYHAGSAADLSQVYRQLSARFALER 323 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 51/173 (29%), Gaps = 33/173 (19%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+G F R + L + + TA+ DA+ + + + Sbjct: 127 GRKDDRIGLVAFAGRPYPAAPLTSDHQWLQGVVDR--LDTGAVEDGTALGDAILSGVNRL 184 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + ++L+TDG N E AK+ GIR+ I + Sbjct: 185 R-----------RRPAESRALILITDGRNNAG-AEPQLAAQAAKALGIRVHAIGIGSRGS 232 Query: 411 -----------------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 E L A + +FEA L+++F + Sbjct: 233 AVIPVPSPLGGTIYRRLDAELDAATLKGVAELTGGRYFEAGDATVLSRVFAEI 285 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 18/195 (9%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIR 321 + + A V+ ++ + + +DR D + V L R Sbjct: 55 DMGGESRISAAKQAFNDVLDAVPEEVQLGIRTLGADYPGDDRKVGCKDTKQLYPVGPLDR 114 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 T KT G T I A+ A D + ++ + IVL+TDGE+T Sbjct: 115 TEAKTAVATLAPTGWTPIGPALLGAADDLD------------GGDSTRRIVLITDGEDTC 162 Query: 382 DNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + ++GI I T+ + +++ A+ ++ +T EL+ Sbjct: 163 GPLDPCEVARDIAARGIHLVIDTLGLVPDAKIRQQLTCIAE--ATGGTYTAVQNTDELSG 220 Query: 440 IFRDRIGNEIFERVI 454 + + V Sbjct: 221 RVKQLVDRAAEPVVT 235 >gi|118349478|ref|XP_001008020.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289787|gb|EAR87775.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 642 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 71/192 (36%), Gaps = 24/192 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V++ L ++ + D R+ N+ + + + + + +K Sbjct: 214 KILNVKNTLLYLLEMLNSND------RLSLVLSNNNPTTLFDLKY-LDEKNKQDLKRIIN 266 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + +T I +M A++ + + + I LL+DG ++ ++ Sbjct: 267 NISITQNTNITKSMIKAFNILQ---------FRQSQNKVSSIFLLSDGVDSSAEKQIQNY 317 Query: 390 C---NKAKSQGIRIMTIAFSVNKTQQEKARY-FLSNCASPNSFFEANSTHELNKIFRDRI 445 +++ I + + ++ + + L N +F+ + +++++ F D + Sbjct: 318 ISSQQSLQNKNFAIHSFGYGFDQDAEMINKICSLKN----GNFYYIQNMNQVDQYFADVL 373 Query: 446 GNEIFERVIRIT 457 G + IT Sbjct: 374 GGTLTAVAQDIT 385 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 55.8 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 53/187 (28%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A ++ + D R+ F+ R +K Sbjct: 59 LETVKKAANRLVDRLNPSD------RLSVVVFDHRAKVLVP---SQSVEDPEKIKNQINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ + + +D V + LLTDGEN DN + Sbjct: 110 LAADGGTAIDEGLRLGIEELAKGKKDTVSQA----------FLLTDGENEHGDNNRCLKF 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 A S + + T+ F N + L A + E F Sbjct: 160 AQLAASYNLTLNTLGFGDNWN-----QDVLEKIADAGLGTLSYIQKPEEAVDEFNRLFSR 214 Query: 448 EIFERVI 454 + Sbjct: 215 IQTVGLT 221 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 47/187 (25%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A ++ + D R+ FN VK Sbjct: 59 METVKEAANYLVDGLGPDD------RLSVITFNHHAEVILP---NQSVEDLQGVKNKINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G T I++ M+ I LLTDGEN DNE + + Sbjct: 110 LTASGGTCIDEGMKLGIKE----------AALGKENRVSQIFLLTDGENEHGDNERCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 A I + T+ F N + L A + F Sbjct: 160 AKVAAEYNITLNTLGFGSNWN-----QDILEQIADSAGGMLCYIEHPEQALTEFSRLFTR 214 Query: 448 EIFERVI 454 + Sbjct: 215 AQSVGLT 221 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 47/187 (25%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A ++ + D R+ FN VK Sbjct: 59 METVKEAANYLVDGLGPDD------RLSVITFNHHAEVILP---NQSVEDLQGVKNKINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G T I++ M+ I LLTDGEN DNE + + Sbjct: 110 LTASGGTCIDEGMKLGIKE----------AALGKENRVSQIFLLTDGENEHGDNERCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 A I + T+ F N + L A + F Sbjct: 160 AKVAAEYNITLNTLGFGSNWN-----QDILEQIADSAGGMLCYIEHPEQALTEFSRLFTR 214 Query: 448 EIFERVI 454 + Sbjct: 215 AQSVGLT 221 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL +V D G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VVSVLQNDHPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ + +GI Sbjct: 887 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNTTAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 984 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGSTQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYSNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G Sbjct: 497 DLGKAIENIRQMGGNTNTGAALNFTLKLLQ------RAKKERGSKVPCHLVVLTNGM--- 547 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + + +K + + IR+ I +E + L A + + L I Sbjct: 548 SRDSVLGPAHKLREENIRVHAIGV------KEANQTQLREIAGEEKRVYYVHEFDALRNI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D + +R+G +++ S S G +T V D + A Sbjct: 251 LGESISALDIKENCMRVGLVTYSNETRVISSLSTG---NNKTEVLQRIQDLSPQVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T + + +G+ I Sbjct: 308 GA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVTI 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN 426 T+ L AS Sbjct: 362 FTMGIEG------ANPDELEKIASHP 381 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 1012 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 1071 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 1072 IF--GYTHIGDALRKVKYYFQP-----DMGSRINAGTPQVLLVLTDG---RSQDEVAQAA 1121 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 1122 EELRHKGVDIYSVGIGDVDDQE 1143 Score = 40.4 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQSDIANAIDRMTHIGETTLTG------SALTFVSQYFSPDKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVRDPAI---ALRKEGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 52/192 (27%), Gaps = 34/192 (17%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + + A ++ VR+G F + + + Sbjct: 98 SMAATDIAPTRLEAAKAAARGFVQRQPAA------VRIGIVAFGATGLVTQQPTSDRASV 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTI----------ISSNEDEVHRMKNNLEAKK 369 + G TA+ +QTA I E Sbjct: 152 -----VAAIDRLSPQGGTALGGGLQTALGAIVGKPVVVPGSDPGGGPEPSGPDLGYHGSA 206 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----------QQEKARYFL 419 +VLLTDGENT + + + + A + G+++ I L Sbjct: 207 AVVLLTDGENTA-QPDPLQVADIASTAGVKVYPIGLGSPAGTVLQIDGFQIATRLDEPLL 265 Query: 420 SNCA--SPNSFF 429 A + +F Sbjct: 266 QQIADRTDGRYF 277 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 56/181 (30%), Gaps = 22/181 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S + + T Sbjct: 57 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRAVVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSVAEGARPPE---AHVPRVAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + L ASP F S + G + R+ Sbjct: 169 EIYAVGVQRADVGS------LRAMASPPLNEHVFLVESFDLI-----QEFGRQFQGRLCT 217 Query: 456 I 456 + Sbjct: 218 L 218 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 363 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 417 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QDN Sbjct: 418 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDNISV--WA 471 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 472 ARAKEEGIVMYAVGVG------KAVEEELREIASEP 501 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 56/181 (30%), Gaps = 22/181 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S + + T Sbjct: 57 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRAVVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSVAEGARPPE---AHVPRVAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIR 455 I + L ASP F S + G + R+ Sbjct: 169 EIYAVGVQRADVGS------LRAMASPPLNEHVFLVESFDLI-----QEFGRQFQGRLCT 217 Query: 456 I 456 + Sbjct: 218 L 218 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 363 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 417 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QDN Sbjct: 418 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDNISV--WA 471 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 472 ARAKEEGIVMYAVGVG------KAVEEELREIASEP 501 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 60/190 (31%), Gaps = 26/190 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V+ LA ++ + + D R F + + K Sbjct: 184 KIEMVKQTLALLLDFLNEND------RYQLITFESQAQRLTPLKRVTDG-NKQYFKQVIQ 236 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G T I A + A+ + I LL+DG++ Q N+ I Sbjct: 237 QINSGGGTTIGTATEIAFKQLQERKYRNNVTS---------IFLLSDGQDGQANQR---I 284 Query: 390 CNKAKSQG--IRIMTIAFSVNKTQQEKARYF-LSNCASPNSFFEANSTHELNKIFRDRIG 446 + K+ + T F + Q + L SF+ L++ F D +G Sbjct: 285 QEQIKTVNEVFTLHTFGFGEDHDAQMMTQLCNLK----SGSFYFVQDVTLLDEFFADALG 340 Query: 447 NEIFERVIRI 456 I ++ Sbjct: 341 GLISVVGEQL 350 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 47/183 (25%), Gaps = 36/183 (19%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ F + + + ++T T I A+ A + Sbjct: 127 NNDKVAMIVFAGEAFTQLPITSDYISA-KMFLETITPSLISTQGTDIRGAIDLAMKSFTP 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + IVL+TDGEN + +A +G+R+ + Sbjct: 186 -----------NEGVGRAIVLITDGENHEGGAV--EAAQEAAKKGVRVFVLGVGSPDGSP 232 Query: 413 EK-----------------ARYFLSNC-----ASPNSFFEANSTHELNKIFRDRIGNEIF 450 + C A + ++T+ + I Sbjct: 233 IPTGNGSEFRRDKDGNVIVTKLNEQMCQEIAKAGNGMYVRVDNTNNAERALNAEINKLAK 292 Query: 451 ERV 453 V Sbjct: 293 ADV 295 >gi|148657120|ref|YP_001277325.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569230|gb|ABQ91375.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 774 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 45/347 (12%), Positives = 95/347 (27%), Gaps = 38/347 (10%) Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 V + + + + L+ L P S FL S ++S Q A+A S Sbjct: 184 VDLMQAGDGIGIVGFNDGVVMPLNLTTILDSPPQSAFLFSDDMESGADQWNAQASWGLTS 243 Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 + + S S + + + I Sbjct: 244 TARNSAWAWT----DSPSGNYANNTNSALTIATPIAIPASMSNPALMFWHRYDIASPDRG 299 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 +S N + + S V + +R V + D Sbjct: 300 EVEISTNNGATWQRLRLFTGTATSWRREVIRL----DDYRGQTIRMRFRLVTDAFGVRDG 355 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 ++D +R + N G+T+I +Q++ + Sbjct: 356 WYIDDVTIGPEWDD---------------VRARAQAAIDTLNSRGATSIGGGLQSSQRML 400 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++N + + I+LL+DG+ + + ++ + TI + Sbjct: 401 DTANP----------DLPRVIILLSDGQENT-RPFVADVLPQIRAAQTTVHTIGLGRDAD 449 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 QQ + ++ A + +L + I + R +T Sbjct: 450 QQLMLSIAAQ---TGGTYNYAPTPEQL-SGIYNTISGAVSNRQTLVT 492 >gi|323699770|ref|ZP_08111682.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323459702|gb|EGB15567.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 2034 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 43/145 (29%), Gaps = 27/145 (18%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-E 378 R G+T +NDA+ + + +++ TDG + Sbjct: 467 TRDAALKGLAGIKANGATCLNDAVLLGLHMLAGAKRPA-------------LLVFTDGFD 513 Query: 379 NTQDNEEG------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFE 430 ++ + + K+ G+ + TI F L A S ++ Sbjct: 514 ANFNDTGPGSKATRREVLDAVKTGGVPVFTIGFGKGHDV-----STLDRIASLSGGRYYP 568 Query: 431 ANSTHELNKIFRDRIGNEIFERVIR 455 A+ L+K F N R Sbjct: 569 ASDPGALDKAFAVVNANLANTFTAR 593 >gi|220923693|ref|YP_002498995.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219948300|gb|ACL58692.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 130 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 36/131 (27%), Gaps = 3/131 (2%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ +G + A++ +ML +V++ R + L QAA L Sbjct: 2 SALRRFCACRSGSTAVEFAMVGMIMLVTMLGIVELGRGLNVRNQLSQAADFGARAV---L 58 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + S + + L + R V + Sbjct: 59 MNKTISDSGLEAVIRAAFQAASPDQLQVTVGAEVVNGLQFRTVSVSYPFVPLTWGFSTGT 118 Query: 129 YQVVLSSRYDL 139 + +S R + Sbjct: 119 INLSVSRRTPV 129 >gi|110598613|ref|ZP_01386880.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339782|gb|EAT58290.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 343 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 46/112 (41%), Gaps = 6/112 (5%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F + + + + ++ + D E T A++ A++ + S+ Sbjct: 128 GRRAIILFAAKPLVQCPLTADMDAF-EALLGMASPDLIEAQGTDFRSALELAHNVLEPSS 186 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 E R+ + + +K +VLL+DGE+ + A N+ K+ I + I Sbjct: 187 ES---RLASAAKGEKIMVLLSDGEDHAGD--LPAAANRIKNGRIHVFAIGVG 233 >gi|159184836|ref|NP_354603.2| hypothetical protein Atu1609 [Agrobacterium tumefaciens str. C58] gi|159140120|gb|AAK87388.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 579 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 86/287 (29%), Gaps = 13/287 (4%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + +K TG+ I L P+ +G+ + VD + + L+Q A + S Sbjct: 6 SLLPRFLKDKTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQTA--DLAAISAAA 63 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + E + + + ++ L+ E V + Sbjct: 64 NATDAEKAVQQYFALNGMDLGVKTDQGLLTAKGLQPFDPLNEFASSKGYAEVIKGHYEPD 123 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 V + ++ + + + + S + S+ Sbjct: 124 ATVPV---------GKRFVDNALPTNAIKVNIVEQGQIFFASAFTKPPKVSAVGTASSQK 174 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN--KSLYYMLYP 246 + + S L+ + ++ + K+ L+ + + ++L L Sbjct: 175 IAAFSVGSRLASLDEGILNSLLGGLLGTTVSLKLMDYQALLAADVNALKIVEALAIDLNL 234 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + + V+ K ++ A+ +++ +++K N ++ Sbjct: 235 TAGTYKDVLQTEISYGKFLDVLTKTTGLQPAVVNILNTLQKTANKSN 281 >gi|327260888|ref|XP_003215265.1| PREDICTED: collagen alpha-1(VI) chain-like [Anolis carolinensis] Length = 1026 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 66/195 (33%), Gaps = 30/195 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ I + + D M A ++D V + R+ +K Sbjct: 72 VDNIKQFTTQFIDKLNERYYRCDRNLMWNAGALHYSDEVQLISGLTSMRTG--RSGLKDQ 129 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +G T + A++ + ++ ++ + KY++++TDG + +E Sbjct: 130 VSKVVSIGKGTYTDCAIKRGIEELL--------IGGSHHKENKYMIVVTDGHPLEGYKEP 181 Query: 387 ----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTH---- 435 N+AK QGI++ ++A LS A+ ++ F A Sbjct: 182 CGGLEDAANEAKHQGIKVFSVAI-----SPNHLESRLSVIATDQAYRRNFTATGPSLRAR 236 Query: 436 ELNKIFRDRIGNEIF 450 +++ I Sbjct: 237 DIDNTIDTIIDMIKS 251 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 55/177 (31%), Gaps = 23/177 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + D VR+ ++ R + F + ++ K I++ +T +N A Sbjct: 862 LSAAKPTEDAVRVSVVQYSGRTQQKLEVPFEQNYTVIADSVDKMQFIND----ATDVNAA 917 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-ICNKAKSQGIRIM 401 + S+ KK +++ +DG + + I +A+ GI I Sbjct: 918 LNYVTSLFRRSSRSGA---------KKRMLIFSDGNSQGITQSAIERAVQEARQAGIEIY 968 Query: 402 TIAFSVNKTQQE-----KARYFLSNCASPNSF-FEANSTHELNK-IFRDRIGNEIFE 451 + + + + A F L + +F + +I Sbjct: 969 VLVVGTQANEPNVRVLVTGKTAEYDVAFGERHLFRVPDYESLLRGVFYQTVSRKISV 1025 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 40/380 (10%), Positives = 83/380 (21%), Gaps = 66/380 (17%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + ++ + + +A Sbjct: 9 AAVGAAAGRKGKRLVAIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V ++ G + + + + I V Sbjct: 69 LSTEGTNGVPDDPTLSAPAREKTVTANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLV 128 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM + QP ++ I D+ + + Sbjct: 129 LDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGT- 187 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 +K ND R G + + Sbjct: 188 -------------------------------------------EKTSVGNDFYREGWSSY 204 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N + ++ V L T + G+T+ + A A + + Sbjct: 205 N-YTQIVSNLTYDVSGLTST-----VNGLSASGATSADYAFNRAQAALT---------YQ 249 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKA 415 AKK ++ TDGE + + A K G I +I + + Sbjct: 250 PRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTS 309 Query: 416 RYFLSNCASPNSFFEANSTH 435 +S + + Sbjct: 310 SNLNKYMHGISSNYPDATAT 329 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 + + +D D R+ + V + L + +K T Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIEFLLK---SHLTKDTIKQAITRIEPLAAGTMT 143 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A++ A D + + K K +++TDG + ++ + A++ GI Sbjct: 144 GMAIKKAMDEAFTEKSGARPKSK---NISKVAIIVTDG---RPQDQVEEVSAAARASGIE 197 Query: 400 IMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 I + L AS + F + +L FR+ + Sbjct: 198 IYAVGV------DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETL 243 >gi|170692557|ref|ZP_02883719.1| putative transmembrane protein [Burkholderia graminis C4D1M] gi|170142213|gb|EDT10379.1| putative transmembrane protein [Burkholderia graminis C4D1M] Length = 373 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 27/370 (7%), Positives = 75/370 (20%), Gaps = 13/370 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI------ITASV 66 K + G + AL M +LG+ + +D+ L+ AA A + Sbjct: 5 KSRRKQEGAVAVTVALCMLFLLGIAALAIDIGNLLVARSQLQNAADAAAMAGAGCLMRRT 64 Query: 67 PLIQSLEEVSS---RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + + +F + + ++ Sbjct: 65 ECGNTTATQPDWITASAKASSFSTSTVTNKVQADYVKASTVATGYWNATGTPYGLESLPF 124 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 ++ + + + + +Q + + V + Sbjct: 125 TPGASDLPAVQVTIHKDGSNANGAVPVFFGSVLGVQILKASAVATAVLSTPGNVGPGGLF 184 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + S Y + P + T+ S + ++ Sbjct: 185 PLAMSKCLYDNYWNAATSSPKLSPNNNTLPGTSVAQQQGQPYFFQIGSSYHYGACESGQW 244 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 D S S + +S + + + Sbjct: 245 TTFNVNDNSASYARGLLTSGNSTTFSIGASPGTWIQ--TGTENTLFKGAGDCSAAGNGQC 302 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE--VHRM 361 + + ++ +Q + D + V Sbjct: 303 AWATVAVVNDPATSGFQPVVAFACVHILDAKNGSSPYVLVQMSNDMSHCRTPNSGGVGPN 362 Query: 362 KNNLEAKKYI 371 + + Sbjct: 363 YGAYTPPRLV 372 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 24/213 (11%), Positives = 58/213 (27%), Gaps = 42/213 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + ++ ++G F + Sbjct: 97 DISNSMLAQDVAPTRLDRSKRLVEDLVNRFT-------NDKIGIVVFAGDAFVQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + + T I A++ + + K I+++T Sbjct: 150 YISA-KMFLNNISPELIGSQGTDIGKAIELSEHSFSEKANF-----------GKAIIIIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QEKARYFLSN------------ 421 DGEN + + +A+ QGIR+ + + +L + Sbjct: 198 DGENHEKGA--EEMAREAQKQGIRVFILGIGSPQGAPIPMGNGSYLQDTSGQTVMTRLNE 255 Query: 422 --C-----ASPNSFFEANSTHELNKIFRDRIGN 447 C A + ++T I + IG Sbjct: 256 DMCRKIAEAGKGMYIHVDNTTAAETILDNEIGK 288 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 14/55 (25%), Positives = 29/55 (52%), Gaps = 3/55 (5%) Query: 7 FIFYSKKLI---KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 S++L ++ +G F I+ AL++P+M+ + G +D+ R + L+ A Sbjct: 9 VRVQSRRLRPGRRAESGAFAIMAALVLPIMIAMLGFAIDLSRVYNRKVELQSVAD 63 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 62/193 (32%), Gaps = 24/193 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + V+ AL +++ +T+ +G F+ + S + + Sbjct: 110 DTSGSMADNMDSVKYALRTMVSLFN------NTIDIGLVEFSHSIKSAIPPTPNRSYID- 162 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 G T + A+ TA + L + V +TDG Sbjct: 163 ----MVIDRMEAGGGTMYSFALSTALSWLRP---------YRELNVSAFTVFITDGLPG- 208 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 D ++ + ++ GI I T+ + ++ + S + F S L+ Sbjct: 209 DPQDYRPLLDEYNKLGIPIYTVFIGEDPRGIDETKLIASK--TGGEQFTVESIDRLSDT- 265 Query: 442 RDRIGNEIFERVI 454 + I ++I + Sbjct: 266 LNTIASKINTIIA 278 >gi|42526759|ref|NP_971857.1| batA protein, putative [Treponema denticola ATCC 35405] gi|41817074|gb|AAS11768.1| batA protein, putative [Treponema denticola ATCC 35405] Length = 332 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 42/176 (23%), Gaps = 34/176 (19%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + + TAI + + + Sbjct: 138 SSSAALILPPTIDHKVFLSRLDSLSIGEL--GDGTAIGMGLAVSSAYMT----------- 184 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------------ 410 YIVLLTDGEN ++ I I + Sbjct: 185 RTKLNSSYIVLLTDGENNTGEINPKTAAKVLVNKNIGFYVIGIGSSGYTTLEYTDRKTGK 244 Query: 411 ------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L A + A+S L IF + I ++ T+ Sbjct: 245 TYSGSIFSKFDELELKKIAQYGNGKYASASSPEILENIF-NTISKQVPAAQSNFTR 299 >gi|239926966|ref|ZP_04683919.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291435315|ref|ZP_06574705.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291338210|gb|EFE65166.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] Length = 527 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 62/203 (30%), Gaps = 24/203 (11%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 PG P + VD+S + L R +LA++ ++ D+V F+D Sbjct: 159 APGGERPPAALTFVVDTSGSMAEPGRLDLARKSLAAMTDRLRDDDSVA------LVTFSD 212 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + R V ST + ++ Y+T + + Sbjct: 213 EAETVLPMT--RLGGNRDEVHDAVDRLEPDRSTNLGAGVEAGYETAVEGLREGATNR--- 267 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLS 420 +VL++D + AI + ++ GI + + + R Sbjct: 268 ------VVLVSDALANTGETDADAILERISRSRRAHGITLFGVGVGSDYGDALMERLADR 321 Query: 421 NCASPNSFFEANSTHELNKIFRD 443 + + ++F + Sbjct: 322 ---GDGHTVYVSGEEDAREVFGE 341 >gi|221058683|ref|XP_002259987.1| sporozoite surface protein 2 [Plasmodium knowlesi strain H] gi|193810060|emb|CAQ41254.1| sporozoite surface protein 2, putative [Plasmodium knowlesi strain H] Length = 579 Score = 55.8 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 64/218 (29%), Gaps = 25/218 (11%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + VD S V L +I ++ D++ + + F Sbjct: 39 CNEKVDLYLLVDGSGSIGYANWITRVIPMLTGLIENLNLS---KDSINLYMSLFASHTTE 95 Query: 309 DPSFSWGVHKLIRTIVKTFAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G + + G+T+++ A+ + N Sbjct: 96 LIRLGSGPSMDKKQALNVVRDLRKGYEPYGNTSMSSALSEVEMHLKDRV--------NRP 147 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----EKARYFLS 420 A + ++L+TDG + + + K + +++ I Q R Sbjct: 148 NAIQLVILMTDGIPN-NKYRALELSRALKERNVKLAVIGIGQGINHQYNKLMAGCRPRER 206 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +C F+ + E + + I + V RI K Sbjct: 207 SC----KFYSSADWSEAISLIKPFIAK-VCTEVERIAK 239 >gi|10945609|gb|AAG24613.1|AF298217_1 sporozoite surface protein 2 [Plasmodium knowlesi] Length = 572 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 64/218 (29%), Gaps = 25/218 (11%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + VD S V L +I ++ D++ + + F Sbjct: 39 CNEKVDLYLLVDGSGSIGYANWITRVIPMLTGLIENLNLS---KDSINLYMSLFASHTTE 95 Query: 309 DPSFSWGVHKLIRTIVKTFAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G + + G+T+++ A+ + N Sbjct: 96 LIRLGSGPSMDKKQALNVVRDLRKGYEPYGNTSMSSALSEVEMHLKDRV--------NRP 147 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----EKARYFLS 420 A + ++L+TDG + + + K + +++ I Q R Sbjct: 148 NAIQLVILMTDGIPN-NKYRALELSRALKERNVKLAVIGIGQGINHQYNKLMAGCRPRER 206 Query: 421 NCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 +C F+ + E + + I + V RI K Sbjct: 207 SC----KFYSSADWSEAISLIKPFIAK-VCTEVERIAK 239 >gi|115373770|ref|ZP_01461063.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115369169|gb|EAU68111.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 420 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 65/190 (34%), Gaps = 27/190 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A I + + D R+ + V + + + ++ TF Sbjct: 62 KLDDAKRAAREFITRVSEED------RVALVHYGTDV-TVFPSTLATPETREQML-TFVN 113 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEG 386 + GST I+ ++ A + + + I+LL+DG+ T E+ Sbjct: 114 AIEDEGSTNISGGLEAAAQQLQKNADQFRVSR---------IILLSDGQPTAGLTREEQL 164 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 A+ +SQG+ + + + + + A F + +L ++F + Sbjct: 165 TALARNLRSQGMAVSALGVGEDFNEN-----LMQGIADQGGGFSGFLRSDQLAEVFTREL 219 Query: 446 GNEIFERVIR 455 + V R Sbjct: 220 -EQATSTVAR 228 >gi|253569758|ref|ZP_04847167.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840139|gb|EES68221.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 621 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 49/433 (11%), Positives = 122/433 (28%), Gaps = 45/433 (10%) Query: 27 ALLMPVMLGVG--GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 A+++ VML + G V + + + + + + ++ + Sbjct: 8 AMMLTVMLVITTLGGTV-WAQPLIVKGTVTDGSDGSPLVGCAVTVKGTSRGAVTNLQGQY 66 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-------MNPRKSAYQVVLSSRY 137 + E LI ++ K + V A + S +V ++ Sbjct: 67 RIEANKGETLIFSYIGFDKVEKVVFSAKMDVSLKASSDVLEECVVVGYGSQRKVAVTGAI 126 Query: 138 D----LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI-------QWVIDFS 186 L S+ + ++ + + ++ + V S + + + S Sbjct: 127 STVNLASLRHPSVRMDAVNTEEYKSISENGFKQVGESPLSTFSIDVDAASYSNMRRMINS 186 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 ++ + +N F + + L P Sbjct: 187 GTLPVADAIRTEELVNYFSYDY-AKPTGSDPVKITMEAGVCPWNADHRLVRIGLKAREIP 245 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + +D S + LV+ +L ++ ++++ D V + Sbjct: 246 TDKLPESNLVFLIDVSGSMWGPTRLDLVKSSLKLLVNNLREKDKVA------IVVYAGNA 299 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + G + ++ + GSTA +Q AY + NN Sbjct: 300 SVKLESTPGS---DKQKIRDAIDELTSGGSTAGGAGIQLAYKV----AKHNFLPKGNNR- 351 Query: 367 AKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 I+L +DG+ + K + G+ + + + + + K + Sbjct: 352 ----IILCSDGDFNVGVSSVEGLEQLIEKERKSGVFLSVLGYGMGNYKDNKGQALAEK-- 405 Query: 424 SPNSFFEANSTHE 436 + ++ E Sbjct: 406 GNGNHAYIDNLQE 418 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 83/229 (36%), Gaps = 20/229 (8%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 G + + + + K + ++A+ + + + NV+ Sbjct: 162 SNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDGKMKMDIAKEAIQQFVSDLPEAVNVS 221 Query: 293 DTVRMGATFFNDRVISDPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 V +++ + + + + K +T + +G T + +A++ + + Sbjct: 222 LRVYGHKGSNDEKDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAIKKSTE 281 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFS 406 T S+ E++ K I +++DG T + K + I+ + I F Sbjct: 282 TFQSAKEND----------KNIIYIVSDGVETCG-GNPVEEAQKVSNSNIKPIMNIIGFQ 330 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 V+ +++ + S + ANS EL F++ G +I R ++ Sbjct: 331 VDHEAEKQLKEIAE--VSKGKYVLANSAKELQDQFKET-GKDITSRRLK 376 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 83/229 (36%), Gaps = 20/229 (8%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 G + + + + K + ++A+ + + + NV+ Sbjct: 162 SNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDGKMKMDIAKEAIQQFVSDLPEAVNVS 221 Query: 293 DTVRMGATFFNDRVISDPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 V +++ + + + + K +T + +G T + +A++ + + Sbjct: 222 LRVYGHKGSNDEKDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAIKKSTE 281 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFS 406 T S+ E++ K I +++DG T + K + I+ + I F Sbjct: 282 TFQSAKEND----------KNIIYIVSDGVETCG-GNPVEEAQKVSNSNIKPIMNIIGFQ 330 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 V+ +++ + S + ANS EL F++ G +I R ++ Sbjct: 331 VDHEAEKQLKEIAE--VSKGKYVLANSAKELQDQFKET-GKDITSRRLK 376 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +++ + + D +G F L I + Sbjct: 104 DPDRLTAAKESARRFVDGL------PDEFNVGLVAFAGSAAVLVPPDTDREALDEGIDRL 157 Query: 327 FAIDENEMGSTAINDAMQT---AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + TAI +A+ T A + + + IVLL+DG NT Sbjct: 158 -VEGATGVQGTAIGEAINTSLGAVKALDGEAAKDPPPAR--------IVLLSDGANTSGM 208 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSFFEAN 432 + A + + + TIAF ++ L A + F EA Sbjct: 209 DPMEAAADAV-EMEVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDAVARETGGQFHEAV 267 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 S EL ++ D IG+ + R R Sbjct: 268 SVEELRAVYDD-IGSSVGYRTKR 289 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/380 (10%), Positives = 83/380 (21%), Gaps = 66/380 (17%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + ++ + + +A Sbjct: 9 AAVGAAAGRKGKRLVAIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V ++ G + + + + I V Sbjct: 69 LSTEGTNGVPDDPTLSAPAREKTVTANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLV 128 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM + QP ++ I D+ + + Sbjct: 129 LDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGT- 187 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 +K ND R G + + Sbjct: 188 -------------------------------------------EKTSVGNDFYREGWSSY 204 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N + ++ V L T + G+T+ + A A + + Sbjct: 205 N-YTQIVSNLTYDVSGLTST-----VNGLSASGATSADYAFNRAQAALT---------YQ 249 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKA 415 AKK ++ TDGE + + A K G I +I + + Sbjct: 250 PRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTS 309 Query: 416 RYFLSNCASPNSFFEANSTH 435 +S + + Sbjct: 310 SNLNKYMHGISSNYPDATAT 329 >gi|51891532|ref|YP_074223.1| hypothetical protein STH394 [Symbiobacterium thermophilum IAM 14863] gi|51855221|dbj|BAD39379.1| hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 252 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 9/56 (16%), Positives = 17/56 (30%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + G + + V + G +DV R L+ A + A + Sbjct: 3 RFRTEQRGSTGALFVFIWAVAVLAIGFALDVGRVFVLREQLRTAEEAAALAGVRQA 58 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 56/156 (35%), Gaps = 18/156 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 ++ F D V SF+ V K+ + +TA+ DA+ A + Sbjct: 122 RKQDQIDLMDFADDVDELVSFTSDVQKIDS-----GLGRIHHGDATALYDAVYLASQRL- 175 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT---IAFSVN 408 + ++ +VL+TDGENT + A +A+ G I + S + Sbjct: 176 -------GETPTSAGQRRVLVLITDGENTTHHGSYDAALEQAQRAGAMIYALIIVPVSAD 228 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + + L A + ++ H+L F+ Sbjct: 229 AGRNTGGEHALIQLARDTGGKYYYVEDKHDLAPAFQ 264 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 57/207 (27%), Gaps = 43/207 (20%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L+ ++ + ++G F + + + + + Sbjct: 110 SRLENAKQILSKLVDDMND-------DKVGLVVFAGDAYTQLPITVDYVSA-KMFLSNIS 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI A+ A + EA K I+L+TDGEN +D+ I Sbjct: 162 PELVPRQGTAIGSALDLAIKSF-----------GAKSEAGKAIILITDGENHEDDA--IG 208 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFL-----------------SNC-----ASPN 426 A I + I + C A Sbjct: 209 AAKLAAENNIIVNVIGMGKTDGAPIPVPGTMSFRKDKDGNVVVSKLNEQMCKEIALAGKG 268 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERV 453 + A++T+ K+ + + Sbjct: 269 VYVHADNTNGAYKVINKELDKLAKSDL 295 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL +V D G+T Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VVSVLQNDHPMGGNTY 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ + +GI Sbjct: 887 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNTTAKALRDKGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + L+ S + ++ + L IF D + Sbjct: 941 LVLAVGI-----AGANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 984 Score = 45.4 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 442 GSGSTQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYSNKP 496 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G Sbjct: 497 DLGKAIENIRQMGGNTNTGAALNFTLKLLQ------RAKKERGSKVPCHLVVLTNGM--- 547 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + + +K + + IR+ I +E + L A + + L I Sbjct: 548 SRDSVLGPAHKLREENIRVHAIGV------KEANQTQLREIAGEEKRVYYVHEFDALRNI 601 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 602 RNQVVQEICAEEACR 616 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D + +R+G +++ S S G +T V D + A Sbjct: 251 LGESISALDIKENCMRVGLVTYSNETRVISSLSTG---NNKTEVLQRIQDLSPQVGQAYT 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T + + +G+ I Sbjct: 308 GA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVTI 361 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN 426 T+ L AS Sbjct: 362 FTMGIEG------ANPDELEKIASHP 381 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 1012 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 1071 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 1072 IF--GYTHIGDALRKVKYYFQP-----DMGSRINAGTPQVLLVLTDG---RSQDEVAQAA 1121 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 1122 EELRHKGVDIYSVGIGDVDDQE 1143 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQSDIANAIDRMTHIGETTLTG------SALTFVSQYFSPDKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 733 AQDIVRDPAI---ALRKEGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 782 Query: 437 LN 438 L Sbjct: 783 LQ 784 >gi|114764813|ref|ZP_01443995.1| hypothetical protein 1100011001322_R2601_10474 [Pelagibaca bermudensis HTCC2601] gi|114542699|gb|EAU45722.1| hypothetical protein R2601_10474 [Roseovarius sp. HTCC2601] Length = 178 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 24/56 (42%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 + ++ KS G + AL MP+M+G+ ++V + L++A Sbjct: 1 MIARIAKTLRRFRKSEDGSAVVPFALWMPMMVGIALSTIEVGAMTLRHTQLERALD 56 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 59/159 (37%), Gaps = 17/159 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D + V+ GA +++ + ++ + A G T A++ Sbjct: 839 VKKSDVGPNRVQFGALRYSNEPDIIFYLNSNRSAIMEYLRSLSAK----GGDTYTAKALE 894 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDG+ + D+ +K +++GI I + Sbjct: 895 RANILFT-----EEHGSRIKQNVKQMLIIITDGK-SHDHIHLSDKASKLRAKGIIIYAVG 948 Query: 405 FSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFR 442 E + L A + ++ L +++ Sbjct: 949 VG------EANQEELETMAGNKHYTIHVSNFDSLKDVYQ 981 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 70/220 (31%), Gaps = 25/220 (11%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK-------KKHLVRDALASV 281 + ++ ++ + L + VD+ + + + Sbjct: 404 TNNFLKKLQNEIWSQISTKAEQKELDKTGCVDTKEVDIYFLIDGSGSIRSDHFEQIKKFM 463 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + I+ D D VR+GA ++D + + + T+ K + G T + Sbjct: 464 LEVIENFDIGPDKVRVGAVQYSDTREKEFDITD--YTTDETLRKAISNIRQLGGGTYTGE 521 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ I E ++++ Y+++LTDG + D E K +++ I + Sbjct: 522 ALDFILQIIKKGREQRINKV------PCYLIVLTDGMSMDDVLEP---AEKLRAENIAVH 572 Query: 402 TIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 I E R L A + L I Sbjct: 573 AIGIG------EANRTQLQQIAGKEERVSFGQNFDSLKNI 606 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 34/99 (34%), Gaps = 10/99 (10%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + S +K+++L+TDGE D + + +G+ I + Sbjct: 707 KALTSIEPYFTESKGARSMVRKFLILITDGEAQDDVRNPAKV---LRDKGVVIFAVGV-- 761 Query: 408 NKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 R L + +S F+ S +L +I I Sbjct: 762 ----FRANRTQLEEISGDSSLVFQVESFSDLQEIENKLI 796 >gi|194335402|ref|YP_002017196.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307879|gb|ACF42579.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 343 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 48/135 (35%), Gaps = 6/135 (4%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 ++ D L + I I RM F + + ++ + D Sbjct: 105 DVLPDRLGQAKQEITSISRAVTGGRMSILLFAASPLVQCPLTTDRDAFD-ALLGMASPDL 163 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 E T+ A + A + + ED RM + ++ +K +VLL+DGE+ + Sbjct: 164 IEEQGTSFRAAFELAGRLLEPTLED---RMASGVKGEKIVVLLSDGEDHTGEVR--SAVQ 218 Query: 392 KAKSQGIRIMTIAFS 406 + K + + I Sbjct: 219 QLKKANVHLFVIGVG 233 >gi|229491170|ref|ZP_04384998.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229321908|gb|EEN87701.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 614 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 55/148 (37%), Gaps = 20/148 (13%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + V R + G+T DA++ A + S+ + Sbjct: 92 FAGDCGNGGQLLVPVATDNRDQLNNATNQLTAGGTTPTPDALRAAAGDLPSTGD------ 145 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-GI--RIMTIAFSVNKTQQEKARYF 418 + I+L++DG++T + A+ + K+Q GI R+ + F+ + + Sbjct: 146 -------RTIILISDGQSTCGD--PCAVATELKTQLGIDFRVHAVGFNAPDVAESELSCI 196 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIG 446 A+ +F A +T EL+ + Sbjct: 197 --ANATGGRYFTATNTTELSDAISAAVT 222 >gi|311260227|ref|XP_003128387.1| PREDICTED: collagen alpha-1(XXI) chain-like [Sus scrofa] Length = 957 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 54/163 (33%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G H+ ++ G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GRHESGEKLLAAVGSIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFHVEDYIAISKI 203 >gi|293339632|gb|ADE44108.1| collagen type XXI alpha 1 [Sus scrofa] Length = 895 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 54/163 (33%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G H+ ++ G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GRHESGEKLLAAVGSIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARDSKITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGSETEDAE-----LRAIANKPSSTYVFHVEDYIAISKI 203 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/369 (9%), Positives = 97/369 (26%), Gaps = 22/369 (5%) Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 ++ + + E++ + F+ + + Sbjct: 83 IDDGLLDMADQPAPTAPEIAPVPQTEFSISTDAVMTEESAADLSAGGVQGMIAMDTTAAA 142 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 A+ N + S ++ P + + ++ ++ + S + Sbjct: 143 PQALSRNAQPLGSVAASSMSDLAIVAPQPTTKQYPNAEQNPLKIASDEPVSTFSIDVDAA 202 Query: 177 -VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 ++ + + +N F T + + + Sbjct: 203 SYAVVRSSLSRAQLPPSGAVRIEEMINYFSYDYLATDAGDVPFRATTSVFETPWNADTQL 262 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + + L P L+ +D+S + K L+ A ++ ++ D V+ Sbjct: 263 LHIGIQGTLPDVSEHPPLNLVFLIDTSGSMNQPDKLPLLISAFRLMLSELRPEDEVS--- 319 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + + + R + + GSTA ++ AY + +E Sbjct: 320 ---IITYAGSAGQVLAPTP---ASDRATILAALNRLSAGGSTAGQAGLRQAYAIAAAMSE 373 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQ 412 D ++L TDG+ + + + + G + + F Sbjct: 374 DGEIAR---------VILATDGDFNVGLNDPDTLKDYITTRRDSGTYLSVLGFGRGNLND 424 Query: 413 EKARYFLSN 421 + N Sbjct: 425 AVMQSLAQN 433 >gi|148656915|ref|YP_001277120.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569025|gb|ABQ91170.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 561 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 51/168 (30%), Gaps = 26/168 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + AL I ++ D G T F+D S Sbjct: 387 DVSGSMADEARLEQAKTALRIFIEQLQDDD------GFGLTIFSDSATVLTPVSPI--GP 438 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T + D AY + ++ + + +V+LTDG + Sbjct: 439 KRAEILNRIAGLTPRGGTRLLDTTVEAYQEMSATPPGQRI---------RAVVVLTDGLD 489 Query: 380 TQDNEEGIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCA 423 + + N + I++ T+AF + L A Sbjct: 490 NKSQRNAQDVLNLLRQDREGYSIKVFTVAFGGDADVN-----LLKEIA 532 >gi|147901111|ref|NP_001079801.1| matrilin 1, cartilage matrix protein [Xenopus laevis] gi|32450626|gb|AAH54272.1| MGC64509 protein [Xenopus laevis] Length = 490 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 46/432 (10%), Positives = 111/432 (25%), Gaps = 49/432 (11%) Query: 41 VDVVR------WSYYE---HALKQAAQTAIITASVPLIQSLEEV----SSRAKNSFTFPK 87 +D R + + + ++ V L+ V S + + Sbjct: 43 IDSSRSVRPSEFEQAKVFLSQVIESLDVGANATRVGLVNYASTVKNEFSLKTHKAKPALL 102 Query: 88 QKIEEY--LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 Q +++ L L + ++ VV R + +S Sbjct: 103 QAVKKVQPLSTGTMTGLAIQYAMNNAFTESEGARIKSPGINKVAIVVTDGRPQDTVKDIS 162 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 R G++ + I + + + S S+++ + Sbjct: 163 ARARESGLEIYAI-GVGRVDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQEAF---- 217 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + S + + Y +C + + + Sbjct: 218 -CVTADLCSTGDHDCEQICVSTP-GSYTCACRDGFTLNEDGKTCNACGASAVDLVFLIDG 275 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + + +D +G ++ V + G + + I Sbjct: 276 SKSVRPENFELVKQFINQIVDSMDVGERRAHVGLVQYSSSVRQEFPL--GRYSSKKDIKS 333 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T A+Q D + + + K ++ TDG + + Sbjct: 334 AVKKMSYMEKGTMTGQALQYLVDNSFAISSGGRPAV------PKVGIVFTDGRSQDYIND 387 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN----SFFEANSTHELNKIF 441 +AK G ++ + L AS SF+ A+ K Sbjct: 388 ---AAARAKELGYKMFAVGVG------NAVEEELRMIASEPQAEHSFYTAD-----FKAM 433 Query: 442 RDRIGNEIFERV 453 ++ IG ++ ++ Sbjct: 434 KE-IGKKLQMKI 444 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 64/201 (31%), Gaps = 20/201 (9%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + + A + + I+ +D + R+G + V ++ Sbjct: 31 NCRTRPTDILFIIDSSRSVRPSEFEQAKVFLSQVIESLDVGANATRVGLVNYASTVKNEF 90 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S HK +++ + T A+Q A + + +E + K Sbjct: 91 SLK--THKAKPALLQAVKKVQPLSTGTMTGLAIQYAMNNAFTESEGARIKS---PGINKV 145 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNS 427 +++TDG + + I +A+ G+ I I + L AS Sbjct: 146 AIVVTDG---RPQDTVKDISARARESGLEIYAIGVGR------VDKNTLRQIASEPLDEH 196 Query: 428 FFEANS---THELNKIFRDRI 445 S +L+K F++ Sbjct: 197 VDYVESYSLIEKLSKKFQEAF 217 >gi|290996921|ref|XP_002681030.1| predicted protein [Naegleria gruberi] gi|284094653|gb|EFC48286.1| predicted protein [Naegleria gruberi] Length = 353 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 59/173 (34%), Gaps = 20/173 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 IDN+ + R+G F+ + + + V L + +K + E GST MQ Sbjct: 20 IDNLREFERLGIVLFDHKAETLLPLTI-VQDLDKKSLKETVLKIEEQGSTNFEAGMQRGI 78 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMT--IA 404 D + ++L I+ LTD + + I K + G I + I Sbjct: 79 DLFST-------LDSSDLSNSNRIIYLTDACPNVGGTDTLDILTKDANSGPYNIFSTFIG 131 Query: 405 FSVNKTQQEKAR-YFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ + C ++F ST + F + + V I Sbjct: 132 IGLDFNSDIVEELTRVRGC----NYFSVRSTED----FTKILNQDFNYIVTPI 176 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 55/164 (33%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 57 QFLVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRAVVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + + +V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSEAEGARP---SEERVPRVLVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + Sbjct: 169 EIYAVGVQRADVGS------LRTMASPPLDQHVFLVESFDLIQE 206 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 17/157 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 404 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 458 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 459 EYMERGTMTGLALRHMVEHSFSEAQGARPRDLNVP---RVGLVFTDGR-SQDDISV--WA 512 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +AK +GI + + + L AS S Sbjct: 513 ARAKEEGIVMYAVGVG------KAVEEELREIASEPS 543 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 55/164 (33%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 57 QFLVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRAVVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + + +V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSEAEGARP---SEERVPRVLVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + Sbjct: 169 EIYAVGVQRADVGS------LRTMASPPLDQHVFLVESFDLIQE 206 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 17/157 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 404 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 458 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 459 EYMERGTMTGLALRHMVEHSFSEAQGARPRDLNVP---RVGLVFTDGR-SQDDISV--WA 512 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +AK +GI + + + L AS S Sbjct: 513 ARAKEEGIVMYAVGVG------KAVEEELREIASEPS 543 >gi|56696618|ref|YP_166979.1| hypothetical protein SPO1741 [Ruegeria pomeroyi DSS-3] gi|56678355|gb|AAV95021.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 177 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 7/56 (12%), Positives = 23/56 (41%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 + + ++ + G+ + ++ P ++GV V++ + L++A Sbjct: 1 MKRRLSNFLRRFRRGEDGNATVEFVIVFPAVIGVMLAGVELTFMTLNHAMLERAVD 56 >gi|308068884|ref|YP_003870489.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858163|gb|ADM69951.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 432 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 58/213 (27%), Gaps = 25/213 (11%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 L Y + P +D+S + A S+I + R+ Sbjct: 104 LGYQVNPEGRKSVQDLVLVIDNSGSMQQTDPDNERLTAAKSLIGQMD------GDKRVAI 157 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE--MGSTAINDAMQTAYDTIISSNEDE 357 F+ F+ + V + G T I A+ I + Sbjct: 158 VSFDSTAQLVQPFTPIRTDAEKQAVYSKIDSMQTIMSGGTEIRLALDETIKEIETQG--- 214 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 N E +++L+DG + ++ I + TI + + Sbjct: 215 ------NAEKGSLVIMLSDG---FSELDTQTALAPYIARQIPVNTIGLKL---AESDGIA 262 Query: 418 FLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 L N A + ++ + L + F Sbjct: 263 LLQNIADLTGGTYSNVANAQGLTQAFGKIYNKI 295 >gi|319941893|ref|ZP_08016214.1| hypothetical protein HMPREF9464_01433 [Sutterella wadsworthensis 3_1_45B] gi|319804546|gb|EFW01416.1| hypothetical protein HMPREF9464_01433 [Sutterella wadsworthensis 3_1_45B] Length = 570 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/207 (9%), Positives = 54/207 (26%), Gaps = 7/207 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + I G + L + + +G V V R A++ A T+ I + Sbjct: 16 LKTRVRHFITDRRGAAAMFLGLGILPAMALGTGAVLVNREFMNLSAMQSAVDTSSIALAK 75 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA----VEM 122 + + + + + + + ++ F ++ + + + Sbjct: 76 -IQSIGADSYLTTAVAQQWVDENSKLFAETLEVKSVLPTFAEKSISVSAQYASKPRLTGA 134 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + + LS + P SL + S + + + Sbjct: 135 IGSLFSKKKELSVQAIAQFFPSSLEVVFTIDNSADSPAGQQVASGMEKVLNKLFEGKS-- 192 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPAD 209 D + + + + + Sbjct: 193 YDKTIHVSVFAVGTHMNLGTKYADIIS 219 >gi|301606205|ref|XP_002932732.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 1058 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/253 (15%), Positives = 87/253 (34%), Gaps = 25/253 (9%) Query: 212 VKSYSSQNGKVGIRDEKLSPYM-----VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + R + LSP + L ++++ F+ S Sbjct: 12 HPVPHCKQDSRTERKDPLSPTYIGILAFGSTIFMRGRLEVAGCRTAVNDLVFIVDGSWSV 71 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K ++ L ++ S D ++G ++D + + ++ Sbjct: 72 GYKDFDTAKNWLLNITSSF---DIGPSYTQVGVVQYSDFPQLEIPLGHNTSY--QQLLSA 126 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G+T A++ A + + +++ + N+ K +++TDG+ +QDN Sbjct: 127 LKSIKYLGGNTNTGRAIKFATEEVFPTSK------RLNVSKNKIAIVITDGK-SQDNVVN 179 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS---THELNKIFRD 443 I+ + A++QGI + + T+ E N S + A + + R Sbjct: 180 IS--SSARAQGIILFAVGVGSEITKSELVAIA--NMPSTHYVLYAEDYTTIDRIKETMRQ 235 Query: 444 RIGNEIFERVIRI 456 +I E RI Sbjct: 236 KICEE-SVCPTRI 247 >gi|331697176|ref|YP_004333415.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326951865|gb|AEA25562.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 327 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 55/153 (35%), Gaps = 20/153 (13%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 RT VK STA +A+ A +I S + + IVL++ Sbjct: 142 SPTTDRTAVKQAVDGLKLSESTATGEAIFAALQSIDSFSRTVAASGTEGPPPAR-IVLMS 200 Query: 376 DGENTQDNEEGI-------AICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFL 419 DG+ T +G +A ++ I + TI+F + + + Sbjct: 201 DGKQTVPGPDGENDPRGSFTAAKQAAAEKIPVSTISFGTDYGTIDIEGGRTRVAVDDASM 260 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 A S FF A S +L +++ + G +I Sbjct: 261 QQIASLSGGQFFTAASESQLRQVYSEL-GEQIG 292 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 59 LVGLVRSLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERALRTLVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + + V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 + L ASP F S + + Sbjct: 171 YAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 206 >gi|282877522|ref|ZP_06286340.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300346|gb|EFA92697.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 345 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 59/202 (29%), Gaps = 42/202 (20%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 V + + + +++ + ++G F + + ++ Sbjct: 108 VPSRLAKSKLMVENLVDNFT-------NDKIGLVVFAGEAFVQLPITSDYVSA-KMFLQN 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T I A++ + + + + I+L+TDGE+ + Sbjct: 160 ADPSLITTQGTNIAQAIRLSMSSFTQQ-----------DKVGRAIILITDGEDHEGEAL- 207 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFL-----SNC-----ASP 425 +A+ +GI + + K ++ + + C A Sbjct: 208 -EAAKEARKKGINVYILGVGETKGAPIPTPDGGYMTDDRGQTVMTALNEKMCQEVAKAGE 266 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 ++ ++T + K D + Sbjct: 267 GTYIHVDNTSDAQKQLNDELAK 288 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 45/123 (36%), Gaps = 19/123 (15%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G T I +++ AY+ + +N +K+I+LLTDG+ Sbjct: 460 DKKDAVDKIGSVTPGGGTEIFTSLEKAYEEL-----------ENLKLQRKHIILLTDGQ- 507 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + + + ++ K I + T+A + + L A F++ + + Sbjct: 508 SARSTDYESMIETGKENNITLSTVALGSDADRN-----LLEELAGLGAGRFYDVTDSSVI 562 Query: 438 NKI 440 I Sbjct: 563 PSI 565 >gi|126340390|ref|XP_001364302.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Monodelphis domestica] Length = 1092 Score = 55.4 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D+ +P + Y VD S Sbjct: 208 VFGSATGLARYYPASPWVDKTRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 267 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 268 LT-LRLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 320 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 321 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 369 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T + + ++ + C + ++E S + Sbjct: 370 AQEIFAKYNKDKKVRVFTFSVGQHNYDRDPVK--WKACRNCGYYYEIPSIGAI 420 >gi|310641811|ref|YP_003946569.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246761|gb|ADO56328.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 429 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 57/212 (26%), Gaps = 25/212 (11%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 Y + P +D+S + A S+I + R+ Sbjct: 102 GYQVNPEGRKSVQDLVLVIDNSGSMQQTDPDNERLTAAKSLIGQMD------GDKRVAIV 155 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE--MGSTAINDAMQTAYDTIISSNEDEV 358 F F+ + V + G T I A+ I + Sbjct: 156 SFESTAQLVQPFTPIGTDAEKQAVYSKIDSMQTIMSGGTEIGLALDETIKEIETQG---- 211 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 N E +++L+DG + ++ I I TI + + + Sbjct: 212 -----NAEKGSLVIMLSDG---FSELDTQTALAPYIARQIPINTIGLKL---AESEGIAL 260 Query: 419 LSNCAS--PNSFFEANSTHELNKIFRDRIGNE 448 L N AS ++ + L + F Sbjct: 261 LQNIASLTGGTYSNVANAQGLTQAFGKIYNKI 292 >gi|220922039|ref|YP_002497340.1| hypothetical protein Mnod_2052 [Methylobacterium nodulans ORS 2060] gi|219946645|gb|ACL57037.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 418 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/274 (10%), Positives = 70/274 (25%), Gaps = 11/274 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++L++ +G ++ +LL+P+ LG+ + +D+ +H LK A+ + Sbjct: 2 RRLLRDRSGQITVLASLLLPIGLGIAALAIDLSTLQLVKHRLKV--------AADAASLA 53 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 V + + + + + +A QV Sbjct: 54 AVAVLPDTTTALDRALSIAADNAGTGAGTVTAASDVRFGSYNSAAKSFTPGATPANAVQV 113 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY---HKEHGVSIQWVIDFSRS 188 S + F +++G + I A A S +Y + S + + Sbjct: 114 TASRNQAHGNPVVLAFAKALGWSTPDISVSAVAVRFSPAYCFLVLDPSASDALSVSGTGR 173 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + + + + VG + Sbjct: 174 LSVPNCGVQVNSTSATAATVGNNSTAQARSFCIVGGYSGTSFSPKPITKCAAAPDPLADI 233 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +P+ + + Sbjct: 234 PEPAQPTAGCYYNGLNTGSGMTLPSNVTYCGKIT 267 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 89/286 (31%), Gaps = 25/286 (8%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 K + ++ + + I F +S+ + N + + S Sbjct: 213 DAKTGMQEAKCTFIPNKSQTARGSIMFMQSLDSVVEFCTEKTHNVEAPNLQNKMCNLRST 272 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR---HVIKKKHLVR 275 + + + ++ ++ + +D S I + L+ Sbjct: 273 WDVIKASADFQNASPMTGTEAPPLPTFSLLKSRQRVVCLVLDKSGSMRLGSPITRLTLMN 332 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + I + +++ +G F+ + ++ + T + G Sbjct: 333 QAAELYLIQIIEKESL-----VGLVTFDSTATIQTNLIRIINDSSYLAISTKL-PQYPNG 386 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T+I + ++ ++ I SS++ IVLLTDGE+ + + + K Sbjct: 387 GTSICNGLKKGFEAITSSDQSTSGSE---------IVLLTDGEDNRISSCFQEV----KH 433 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 G I TIA + LS+ F+ + L F Sbjct: 434 SGAIIHTIALG---PSAARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 53/171 (30%), Gaps = 25/171 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISSNEDE 357 + V S ++ + +K F T I ++ A + N Sbjct: 74 VVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAVKMLNQRNASG 133 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQD----------NEEGIAICNKAKSQGIRIMTIAFSV 407 IV+ TDG N +++ AI ++A+S+G I TI + Sbjct: 134 NSP---------LIVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEGYPIYTIGLND 184 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 N + +L + + F EL I + +V + Sbjct: 185 NG---KLNEAYLEKISVDTKAKAFATKDPAELPDILTEIFAAHSNLKVQSL 232 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 66/203 (32%), Gaps = 23/203 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 D + FV SS V++ + ++++ + D R+G + V Sbjct: 3 SSCDSKRLDLIFVIDSSRSVRPHDFEKVKEFIVTILQFLDVA---PDVTRVGLIQYGSTV 59 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ S + + + + + T A+Q A + S +E Sbjct: 60 KNEFSLK--TYGRKSEVERAVKVMKRLGTGTMTGLAIQYAVNIAFSESEGARP---LREN 114 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP- 425 + I+++TDG + KA++ GI I I L + S Sbjct: 115 VPRIIMIVTDGRPQDP---VAEVAAKARNSGILIFAIGVGQVDYNT------LKSIGSKP 165 Query: 426 --NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 166 HQDHVFLVANFSQIESLTSVFQN 188 >gi|110632964|ref|YP_673172.1| TadE-like [Mesorhizobium sp. BNC1] gi|110283948|gb|ABG62007.1| TadE-like protein [Chelativorans sp. BNC1] Length = 140 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 11/91 (12%), Positives = 37/91 (40%), Gaps = 4/91 (4%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ ++S +G + A++ +L + +V+ R + + L AA A + + Sbjct: 6 RLFRRFLESGSGAAAVEFAIVCMPLLLICLGIVEFGRAFFVRNDLSYAADVA----ARKV 61 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 + + + ++ + + + E + + Sbjct: 62 LIGQIPAGAPSSDAASGLETAVREAFVGDAS 92 >gi|315186712|gb|EFU20470.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 332 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 49/177 (27%), Gaps = 34/177 (19%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + + V + + TA+ + + + N Sbjct: 134 LVLFGKEAMLEVPPTIDVEYFLERLEAVRLFSL--GDGTALGMGVGISLLHLSRVNASF- 190 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN---------- 408 + +V+LTDG+NT AK I + T+ + Sbjct: 191 ----------RAVVILTDGKNTTGEILPETAAEMAKDLDIPVFTVGVGSDLPVSLDVIDP 240 Query: 409 --------KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 ++ L A S FF + L++IF IG V R Sbjct: 241 STGTRYAGVLEEGYDEETLRRMAEMSGGQFFSGYTPTSLHRIF-QYIGATATADVRR 296 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 49/155 (31%), Gaps = 21/155 (13%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 S + + A++ +I ++ R+G F + + Sbjct: 99 SNSMLSQDLSPNRLENAKRAISQLIDNL-------HDDRIGIIVFAGQAYVQLPITTDYS 151 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + T + TAI A+ + N K ++++TDG Sbjct: 152 AA-KLFLNTINTNMVPTQGTAIGAAIDLGMQSFDFKNGMS-----------KAMIVITDG 199 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 EN +D+ + N A+ + + + I + Sbjct: 200 ENHEDDAV--SAANHARDKDVTVNVIGVGSEEGAP 232 >gi|225174961|ref|ZP_03728958.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] gi|225169601|gb|EEG78398.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] Length = 357 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 9/43 (20%), Positives = 20/43 (46%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 + L K G+ +I A+ + +LG +++DV ++ Sbjct: 10 FRYLFKDERGNVLVIFAVALIALLGFAAIVIDVGGMYVERRSM 52 >gi|332970976|gb|EGK09950.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 441 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 76/248 (30%), Gaps = 22/248 (8%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPY---MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + V + G V + P + + ++ + ++ Sbjct: 89 NTIVSLIAEDYGPVKDAYDNFDPTFKISGDQPGTEVNGPEEKQHNVTILLDASGSMAARV 148 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVN--DTVRMGATFFNDRVISDPSFSWGVHK--LIR 321 +K + ++A+ S + + NV+ G+ D+ S V Sbjct: 149 SGGEKMQVAKEAVRSFTSQMPEGTNVSLIVYGHKGSNSKADQAESCKGIEEIVELGPYNE 208 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + +++ G T + AM A + + +A+ I +++DG T Sbjct: 209 STLQSKLDPIRATGWTPLAGAMNQAGQRL----------KETEGQAENVIYVVSDGLETC 258 Query: 382 DNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + I + I F V + + + A +F A S EL+ Sbjct: 259 G-GDPVKEAKSLNQSNIKATVNIIGFDVGNKEHQALKKVAE--AGGGKYFSATSKTELDL 315 Query: 440 IFRDRIGN 447 FR+ Sbjct: 316 YFRNEYAK 323 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 40/114 (35%), Gaps = 17/114 (14%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T A++ A + + K I+L+TDG+ T I K K Sbjct: 24 GKTNTGGALERAQQML----------AEGRPSVPKIILLITDGDATDKERLDAQI-EKLK 72 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGN 447 I I TI + E L+ A+ + +E ++KI +G Sbjct: 73 KSNILIYTIGVGDLIDRNE-----LNRIATDEDFVYETRDFDSISKIKSSLLGR 121 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 44/151 (29%), Gaps = 18/151 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A V D + V++G ++ V SD + G V+ + G Sbjct: 159 AKNFVANVSSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGD 218 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ + E K ++ +TDG QD A + + Sbjct: 219 THTGTAL-----------RYISTNNRWREEVPKILIFVTDGTP-QDRAIVPAAARSLRDK 266 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 G+RI I L AS Sbjct: 267 GVRIFAIGVG------NAVESELKEIASEPY 291 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 89/286 (31%), Gaps = 25/286 (8%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 K + ++ + + I F +S+ + N + + S Sbjct: 213 DAKTGMQEAKCTFIPNKSQTARGSIMFMQSLDSVVEFCTEKTHNVEAPNLQNKMCNLRST 272 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR---HVIKKKHLVR 275 + + + ++ ++ + +D S I + L+ Sbjct: 273 WDVIKASADFQNASPMTGTEAPPLPTFSLLKSRQRVVCLVLDKSGSMRLGSPITRLTLMN 332 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + I + +++ +G F+ + ++ + T + G Sbjct: 333 QAAELYLIQIIEKESL-----VGLVTFDSTATIQTNLIRIINDSSYLAISTKL-PQYPNG 386 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T+I + ++ ++ I SS++ IVLLTDGE+ + + + K Sbjct: 387 GTSICNGLKKGFEAITSSDQSTSGSE---------IVLLTDGEDNRISSCFQEV----KH 433 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 G I TIA + LS+ F+ + L F Sbjct: 434 SGAIIHTIALG---PSAARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|73981989|ref|XP_852918.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Canis familiaris] Length = 1147 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 75/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 225 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 284 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 285 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 344 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 345 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 404 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K + +R+ T FSV + + Sbjct: 405 V-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVFT--FSVGQHNYD 451 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 452 RGPIQWMACENKGYYYEIPSIGAI 475 >gi|259415853|ref|ZP_05739773.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259347292|gb|EEW59069.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 182 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 28/58 (48%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 M++ T +K ++ G+ I A+L P+ML + V++ +Y + L +A Sbjct: 1 MIYRTSIPALVQKFRRNEDGNATIEFAILFPLMLMMLFAAVELGMINYRQIMLDRAMD 58 >gi|41409532|ref|NP_962368.1| hypothetical protein MAP3434 [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570937|sp|Q73UD4|Y3434_MYCPA RecName: Full=UPF0353 protein MAP_3434 gi|41398363|gb|AAS05984.1| hypothetical protein MAP_3434 [Mycobacterium avium subsp. paratuberculosis K-10] Length = 330 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 62/203 (30%), Gaps = 34/203 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + A + + + +G F + I + K Sbjct: 113 NRLKAAEQAASQFASQLTPG------INLGLVGFAGTPYLLVPPTPQHQATIDALKKLDF 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 D STA +A+ TA I ++ + IVLL+DG + + Sbjct: 167 AD-----STATGEAIFTALHAISATA-----VAGGDTPPPARIVLLSDGGENKPSNPSDP 216 Query: 388 -----AICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEA 431 AK +G+ I TI F + E + A S + A Sbjct: 217 HDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVSTDQMKMVAKLSGGQSYTA 276 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + EL K + + I NEI R + Sbjct: 277 TNLGELQKSY-NAIENEIGYRTV 298 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 13/169 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +KK D + VR GA + D KL +V D G+T Sbjct: 809 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLE--VVSVLQNDHPMGGNTY 866 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + + +H+ + ++++TDGE + D E+ + +GI Sbjct: 867 TAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNTTAKALRDKGI 920 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + + L+ S + ++ + L IF D + Sbjct: 921 LVLAVGI-----AGANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSAS 964 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S H ++ L+ V+ + VR+GA + D + S + Sbjct: 422 GSGSTQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEIS--KYSNKP 476 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + K G+T A+ + + + + ++V+LT+G Sbjct: 477 DLGKAIENIRQMGGNTNTGAALNFTLKLLQ------RAKKERGSKVPCHLVVLTNGM--- 527 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + + +K + + IR+ I +E + L A + + L I Sbjct: 528 SRDSVLGPAHKLREENIRVHAIGV------KEANQTQLREIAGEEKRVYYVHEFDALRNI 581 Query: 441 FRDRIGNEIFERVIR 455 + E R Sbjct: 582 RNQVVQEICAEEACR 596 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + SI +D + +R+G +++ S S G +T V D + A Sbjct: 231 LGESISALDIKENCMRVGLVTYSNETRVISSLSTG---NNKTEVLQRIQDLSPQVGQAYT 287 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A A + N + VL+T + + +G+ I Sbjct: 288 GA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVTH---RASEDNVTKAAVNLRREGVTI 341 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN 426 T+ L AS Sbjct: 342 FTMGIEG------ANPDELEKIASHP 361 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 992 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 1051 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 1052 IF--GYTHIGDALRKVKYYFQP-----DMGSRINAGTPQVLLVLTDG---RSQDEVAQAA 1101 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 1102 EELRHKGVDIYSVGIGDVDDQE 1123 Score = 39.6 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 40/122 (32%), Gaps = 18/122 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 659 MSQSDIANAIDRMTHIGETTLTG------SALTFVSQYFSPDKGARPNVRKFLILITDGE 712 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 713 AQDIVRDPAI---ALRKEGVIIYSVGVFGSNVTQ-------LEEISGKPEMVFYVENFDI 762 Query: 437 LN 438 L Sbjct: 763 LQ 764 >gi|296474583|gb|DAA16698.1| collagen, type XXI, alpha 1 [Bos taurus] Length = 507 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G H+ ++ G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSHESGENLMAAMESIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K+ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSARFLTKIAVVLTDG---KSQDEVKDAAEAARDSRITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGS-----ETEEAELRAIANKPSSTYVFYVEDYIAISKI 203 >gi|218512349|ref|ZP_03509189.1| hypothetical protein Retl8_00989 [Rhizobium etli 8C-3] Length = 222 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 66/217 (30%), Gaps = 38/217 (17%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + +++ VID S SML S + + GK Sbjct: 3 SAVKGPATSYLNVYIVIDRSPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGK----- 57 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + D S+ +++ + + DA+ V+ I Sbjct: 58 --------------------------KTYANNYDYSTEKNIKLRADVAGDAVREVLDMID 91 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDAMQ 344 + D+ ++ +++G D + + + + M T + A+ Sbjct: 92 ESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFDVALA 151 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + + ++ K ++LLTDG +Q Sbjct: 152 ALQKIVGTGGDGT-----SSANPLKLVLLLTDGVQSQ 183 >gi|151556217|gb|AAI49226.1| COL21A1 protein [Bos taurus] Length = 518 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ K D +++G ++D + + G H+ ++ G+T Sbjct: 60 LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL--GSHESGENLMAAMESIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K+ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSARFLTKIAVVLTDG---KSQDEVKDAAEAARDSRITL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGS-----ETEEAELRAIANKPSSTYVFYVEDYIAISKI 203 >gi|310817544|ref|YP_003949902.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309390616|gb|ADO68075.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 470 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 65/190 (34%), Gaps = 27/190 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A I + + D R+ + V + + + ++ TF Sbjct: 112 KLDDAKRAAREFITRVSEED------RVALVHYGTDV-TVFPSTLATPETREQML-TFVN 163 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEG 386 + GST I+ ++ A + + + I+LL+DG+ T E+ Sbjct: 164 AIEDEGSTNISGGLEAAAQQLQKNADQFRVSR---------IILLSDGQPTAGLTREEQL 214 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 A+ +SQG+ + + + + + A F + +L ++F + Sbjct: 215 TALARNLRSQGMAVSALGVGEDFNEN-----LMQGIADQGGGFSGFLRSDQLAEVFTREL 269 Query: 446 GNEIFERVIR 455 + V R Sbjct: 270 -EQATSTVAR 278 >gi|218193011|gb|EEC75438.1| hypothetical protein OsI_11969 [Oryza sativa Indica Group] Length = 516 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 51/200 (25%), Gaps = 34/200 (17%) Query: 271 KHLVRDALASVIRSIKKI-----DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + +K ++D R+ F++ + R + Sbjct: 75 LDVSTSMAGDKLDRMKAALLFVIRKLDDVDRLSIVTFSNDAARLCPLRFVAGDAARADLG 134 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G+T I ++ + + R N ++L++DG+ + + Sbjct: 135 ALVDGLAADGNTNIRAGLEIGLA-VAAGRRLTAGRSVN-------VMLMSDGQQNRAD-- 184 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFR 442 + G+ + T + L A F L F Sbjct: 185 ----ATRLDPGGVPVHTFGLGADHDPA-----VLQAIAGKSREGMFHYVADGVNLTAPFS 235 Query: 443 D-------RIGNEIFERVIR 455 I ++ V R Sbjct: 236 QLLGGLLTIIAQDLELTVTR 255 >gi|153008054|ref|YP_001369269.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] gi|151559942|gb|ABS13440.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] Length = 187 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 46/136 (33%), Gaps = 4/136 (2%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++ ++ G + ALL+ L + +++ + A +T + L Sbjct: 17 LRRFARAQHGVAAVEFALLIVPFLIIIFATIEIGMSFVARQVISNATET----VARRLQT 72 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + ++ + + ++++ NL N E D V + K Sbjct: 73 GQIRGAQISEGTLRTELCQQMQFMVAQGCPNLSLNLGTYEGFDKVPIDQILDGEGKLTRT 132 Query: 131 VVLSSRYDLLLNPLSL 146 ++ + +N L++ Sbjct: 133 GIIGTSGTSTINQLNV 148 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 89/286 (31%), Gaps = 25/286 (8%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 K + ++ + + I F +S+ + N + + S Sbjct: 213 DAKTGMQEAKCTFIPNKSQTARGSIMFMQSLDSVVEFCTEKTHNVEAPNLQNKMCNLRST 272 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR---HVIKKKHLVR 275 + + + ++ ++ + +D S I + L+ Sbjct: 273 WDVIKASADFQNASPMTGTEAPPLPTFSLLKSRQRVVCLVLDKSGSMRLGSPITRLTLMN 332 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + I + +++ +G F+ + ++ + T + G Sbjct: 333 QAAELYLIQIIEKESL-----VGLVTFDSTATIQTNLIRIINDSSYLAISTKL-PQYPNG 386 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T+I + ++ ++ I SS++ IVLLTDGE+ + + + K Sbjct: 387 GTSICNGLKKGFEAITSSDQSTSGSE---------IVLLTDGEDNRISSCFQEV----KH 433 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 G I TIA + LS+ F+ + L F Sbjct: 434 SGAIIHTIALG---PSAARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|156742635|ref|YP_001432764.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233963|gb|ABU58746.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 777 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 45/344 (13%), Positives = 93/344 (27%), Gaps = 36/344 (10%) Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 V + + + + LS S FL S +++ + A S Sbjct: 184 VDLMQAGDGIGVVSFDHVIETPLSLTTIAEAPLPSAFLFSDSMETGASRWIAAPPWGLTS 243 Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 + + D +S + PA T + + Sbjct: 244 VARNSAQAWT---DSPAGNYANNTNSALEIATPIAIPASMTAPALMFWHRYDIENGFDRG 300 Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 +S N + + +L + S + VR V + D Sbjct: 301 EVEISANNGVTWQRLRSFTGTALGWRREMISL----DAYRGQTVRVRFRLVTNAFGVRDG 356 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 ++D +R + N G+T+I +Q + + Sbjct: 357 WYIDDVALGPAWDD---------------VRARAQAAIDTLNSRGATSIGGGLQRSQHLL 401 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S+N + IVLL+DG+ + + + TI + Sbjct: 402 TSANPAIP----------RAIVLLSDGQENTSPYV-ADVLPPIRDAQTTVHTIGVGQDAD 450 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 Q+ + ++ A + +L +I+ GN + + Sbjct: 451 QRLMLSIAAQ---TGGTYNYAPTPDQLARIYNTISGNVSNRQTL 491 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 54/436 (12%), Positives = 113/436 (25%), Gaps = 53/436 (12%) Query: 21 HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAK 80 + IT + + + G A ++T + + Sbjct: 3 NVAAITVVSL----ALAGCA------------TADNADEIVLTGAKAERSAEVPPPPPPP 46 Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 P + R R+ TA V ++ Sbjct: 47 PPPPPPPPPPSPSAQAFAAPSAIMVTGSRISREEADATAQPQTTSPDLRYVP-----SIV 101 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSMLDYQRDSEGQ 199 + + R G + E + S + S + M + Sbjct: 102 IPTVPDRERYDGKDVSEVAVTLEQPVSTFSVDVDTGAYSNARRMLTDGQMPPKGAVRTEE 161 Query: 200 PLNCFGQPADRTVK-SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 +N F R + + + L P P+ + Sbjct: 162 FVNYFRYDYPRPTSAQDAPFTVNMDVARTPWDADTRLVRIGLAGYEAPKAERPAANLVFL 221 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S K LV+ A+ +++ + D V+ + + Sbjct: 222 LDVSGSMSSADKLPLVKTAMKTLVGQLTPKDRVS------IVVYAGAAGLVLEPTS---- 271 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + GSTA ++ AY +S D ++R ++L TDG+ Sbjct: 272 -DSREIMAALDQLQAGGSTAGGAGLELAYKVAEASKVDGINR----------VILATDGD 320 Query: 379 NT---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 DN++ + + GI + + F + A ++ +S Sbjct: 321 FNVGLSDNDKLLEYVEDKRKNGIAMSVLGFGRG----NINEALMEQIADKGNGNYGYIDS 376 Query: 434 THELNKIFRDRIGNEI 449 E K+ +++G + Sbjct: 377 AIEARKVLGEQLGATL 392 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RVAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IMDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA++ GI I Sbjct: 138 LAIQYALNIAFSEAEGARPLK---ENVPRVIMIVTDG---RPQDSVAEVAAKARNTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|198424353|ref|XP_002120419.1| PREDICTED: similar to mCG120740 [Ciona intestinalis] Length = 1650 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 74/204 (36%), Gaps = 23/204 (11%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S+ + L+R A+++ I D V +G F R + + Sbjct: 380 DISTSMENYGRMGLMRQAVSNFI------DTVPMNTWVGIVVFASRANTLARLTEITSYD 433 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R I+KT ++ +G T+I + + + +S + + I++LTDG Sbjct: 434 ARNILKTRLVNTTVVG-TSIGSGIMKGLEVLETSGPRSLRGSGGS------IIILTDGLE 486 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST----- 434 + + I + + G+R+ TIA N + + S A+S Sbjct: 487 HNNPKINDTI-ERVREFGVRVSTIALGSNVAKDLEWLAS----VSNGRTHAASSGQFGID 541 Query: 435 HELNKIFRDRIGNEIFERVIRITK 458 EL + F E + +++ Sbjct: 542 AELQEAFASHHEQEDIGSISILSQ 565 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/420 (9%), Positives = 103/420 (24%), Gaps = 45/420 (10%) Query: 38 GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN 97 G+ VDV + A+ VP E + K E + + Sbjct: 180 GIAVDV-----LPTPKQAVQDAAVEELKVPDKLYQAESFYVEVMVRSTFKTSGELRIYED 234 Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 ++ + + NP Y+ ++ ++ + Sbjct: 235 SREIGRERVDVTPGENRYAVKGLAKNPGLHRYR------AEIFMDGDEASANNAAFDFTR 288 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 ++ V + ++ + + + + S Sbjct: 289 VEGPPNVLIVEGTPGTSGNITAALQSGMIGTEVIPPELLPLEAAKYAVYDSIIFNNVSGS 348 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 G + + + + + + K+ + Sbjct: 349 DVGGKQMELIEQAVRSFGIGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLG 408 Query: 278 LASVIRSIKKIDNVN----------------DTVRMGATFFNDRVISDPSFSWGVHKLIR 321 L VI +D +G F+D+ + Sbjct: 409 LILVIDRSGSMDGNKIELAKESAMRTVELMRAKDTVGVVAFDDQPWWVVPP---QKLGDK 465 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V + G T I A+ +A + ++ +++I+L+TDG+ + Sbjct: 466 EEVLSSIQSIPSAGGTNIYPAVSSALEEML-----------KIDAQRRHIILMTDGQ-SA 513 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 N + + I + ++A ++ A+ ++ L +F Sbjct: 514 MNSGYQDLTDTMVENKITMSSVAVGMDADTNLLQSL---ADAAKGRYYFVEDETTLPAVF 570 >gi|77456864|ref|YP_346369.1| hypothetical protein Pfl01_0636 [Pseudomonas fluorescens Pf0-1] gi|77380867|gb|ABA72380.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 659 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 43/153 (28%), Gaps = 3/153 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ AL + + L ++VD R L+Q A A + A+ + Sbjct: 12 RQRGAIGLMAALTLGMALVFILVVVDSGRLYLERRHLQQIADVAALEAATRGGNCGAGAT 71 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + A + + + I + L + A +VV+S Sbjct: 72 ANAYAQASVVRN---NFPIPSAGRTLAVACGTLNLDASNLRVFAVNAASTEAIRVVVSHT 128 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 L ++ I A A Sbjct: 129 VPQSFAGAIGGLFGGAGRNATINLSATAVAAVP 161 >gi|224285863|gb|ACN40645.1| unknown [Picea sitchensis] Length = 829 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 54/202 (26%), Gaps = 46/202 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+A VI ++ D R+ F+ S R + Sbjct: 373 TKLALLKRAMAFVISNLSPED------RLSVVVFSSTAKRVFSLKRMTPDGQRAANR-VV 425 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I + ++ + + I+LL+DG++T Sbjct: 426 ERLLCTGGTNIAEGLRKGAKVLEDRRQ---------RNPVASIMLLSDGQDTYS------ 470 Query: 389 ICNKAKSQGIRIM------------------TIAFSVNKTQQEKARYFLSNC--ASPNSF 428 S+G+ + I + + S +F Sbjct: 471 ----LSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGVDHDAATMHAISEVSGGTF 526 Query: 429 FEANSTHELNKIFRDRIGNEIF 450 + + F IG + Sbjct: 527 SFIQAESLVQDAFAQCIGGLLS 548 >gi|85701714|ref|NP_001028371.1| calcium activated chloride channel [Mus musculus] gi|74202052|dbj|BAE23018.1| unnamed protein product [Mus musculus] gi|148680069|gb|EDL12016.1| mCG120741 [Mus musculus] gi|187951335|gb|AAI39089.1| Expressed sequence AI747448 [Mus musculus] gi|187957592|gb|AAI39090.1| Expressed sequence AI747448 [Mus musculus] Length = 925 Score = 55.0 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 57/192 (29%), Gaps = 28/192 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I + + +G F+ + + Sbjct: 314 DVSGSMSSSDRLNRMNQAAKYFLSQIIE-----NRSWVGMVHFSSQATIVHELIQMNSDI 368 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + +G T+I ++TA+ + I+LL+DGE+ Sbjct: 369 ERNKLLQTLPT-SAIGGTSICSGIKTAFQVFKNGEYQTDGTE---------ILLLSDGED 418 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + + C ++ K G + IA + L + + A + Sbjct: 419 STAKD-----CIDEVKDSGSIVHFIALGPSADLAVTNMSIL----TGGNHKLATDEAQNN 469 Query: 437 -LNKIFRDRIGN 447 L F Sbjct: 470 GLIDAFGALASE 481 >gi|91788413|ref|YP_549365.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] gi|91697638|gb|ABE44467.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] Length = 533 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 35/249 (14%), Positives = 79/249 (31%), Gaps = 8/249 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 KS G II L + V+ +GG+++D+ + L+ AA A + + L ++ Sbjct: 8 RKSERGAVAIIFGLTVVVLFAMGGVVLDLGHLYIAKAELQNAADAAALAGAKDLNETTPG 67 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + T + + L A + + + Sbjct: 68 IDAAVATAQTISAKNKYN-FSTDVTLALANIEFGPSPDGPWSSVATARAAPQGMTFIKVD 126 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + + + +R G+ + A A R + + + V +++ Sbjct: 127 TGLKV---LGTYLMRVAGVDTVSTFGLAVA---GRFVNNVTPIGVCAVDPATKTAKYSYT 180 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 +G L + R +Y+ G + D PY+++ S P + + Sbjct: 181 QPDGTELTELVEFGFRRGVTYNLF-GLNPLADGPFDPYLLNPVDSPPSACNPANSSANFT 239 Query: 255 EEHFVDSSS 263 +S Sbjct: 240 APFLCSGTS 248 >gi|326433447|gb|EGD79017.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 1763 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 40/340 (11%), Positives = 81/340 (23%), Gaps = 35/340 (10%) Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 N + + L ++R I + + Sbjct: 162 TGNGDIINNAADMGTLAALAWRDRLDWVRHFQITPQNAAQFSFVRVAVVPESCGATLDFA 221 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK-- 238 +++D S S+L DS+ Q + D Y S Sbjct: 222 FLLDGSGSLLQSGWDSQIQFVKDVTAYFDVGRNRTHVGIVTFHGPRVSHLSYNESIPDCP 281 Query: 239 -SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + + S+ + +A + + ID + D Sbjct: 282 LGTVWEPDGEGCICEVEGSQCTGSACGTGYYE-----LEAGGCQGATDQNIDTLPDCEIR 336 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN--------------EMGSTAINDAM 343 + V G+T ++ + Sbjct: 337 QFFPHGCADCTCNPLGQKYQPSDSYTVDQLFSMTQDHDDLAATLDSLVFPDGATHMSAGL 396 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + DTI + + + +++LTDG++ E + + +++GI I I Sbjct: 397 EQIRDTIF---QLRNGMREYEQAIPRVLIVLTDGKSNPG-FEPHEVAEQLRNRGIIIYAI 452 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 + L AS +E L I Sbjct: 453 GVG------DYYLPELEAMASEPMDRHVYELADPSSLFTI 486 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 73/261 (27%), Gaps = 24/261 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +S+ + N + S S + D+ + + Sbjct: 237 SIMFVQSIDSVTEFCTAKTHNTEAPNLQNKMCSSRSTWDVIMDSDDFQNASPMKGTDPPP 296 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + + +D S + + A + I + +G Sbjct: 297 HPTFSLLKSKQRVVCLVLDKSGSMSSEDRLLRMNQAAELYLIQIIE-----KGSLVGMVT 351 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + + T + G T+I ++ + II SN+ Sbjct: 352 FESLATILNYLTDII-GDNAYEKITANLPREASGGTSICSGLRAGFQAIIHSNQSTSGSE 410 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I+LLTDGE+ Q + C + K G I TIA + LS Sbjct: 411 ---------IILLTDGEDDQISS-----CFEEVKQSGAVIHTIALG---PSAARELETLS 453 Query: 421 NCASPNSFFEANSTHELNKIF 441 N F+ + L F Sbjct: 454 NMTGGYRFYANKDINGLTDAF 474 >gi|90021002|ref|YP_526829.1| hypothetical protein Sde_1355 [Saccharophagus degradans 2-40] gi|89950602|gb|ABD80617.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 787 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 31/147 (21%), Positives = 56/147 (38%), Gaps = 24/147 (16%) Query: 321 RTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R++ + A + N +G T I +A++ A + ++++ E K+I+LLTDG Sbjct: 109 RSLGRAKASEINSVGLYTNIGEALEKAAYDLDAASD----------EYAKHIILLTDGMV 158 Query: 380 TQDNEEGIAICN----------KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 D + K K+ G I T+A S N + L + Sbjct: 159 DIDKQPDKNTQEWRRIVDEVLPKLKAAGYTIHTVALSDNADNNLLKKLSLQ---TDGIAS 215 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRI 456 A++ +L KIF + + Sbjct: 216 VAHTADDLMKIFLGTFDAAAPAEQVPL 242 >gi|163849338|ref|YP_001637382.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527332|ref|YP_002571803.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670627|gb|ABY36993.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451211|gb|ACM55477.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 418 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 56/180 (31%), Gaps = 29/180 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V+ A ++ + D FNDR S R +K Sbjct: 60 RLQQVKQAAMQILDLLG------DHESFALVTFNDRAEVVVSAQLARA---RAEIKRQIS 110 Query: 330 DENEMGSTAINDAMQTAYDTII-SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T + + + + VHR ++LLTDG D + Sbjct: 111 AIEAAGGTEMATGLALGVQELQRAMMPRAVHR----------LLLLTDGRTYGDEGRCVE 160 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRI 445 I +A+S+GI I + L A + + + S E+ KIF + Sbjct: 161 IARRAQSRGIGITALGIGSEWN-----EDLLETIAARENSRTHY-ITSAAEITKIFTAEV 214 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 60/218 (27%), Gaps = 38/218 (17%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + LD + + + + A+ ++ ++ Sbjct: 77 QYGIEWTERKTRGLDIVFVLDSSKSMLASDLRPTRLERAKLAILDLVEQLES-------D 129 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + ++ T A D G + + A++ A N Sbjct: 130 RIGLVAFAGSAFLQTPPTLDYGAFRESLDAT-APDMMSRGGSDLGVALREATKAFPVENN 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K +VLLTDGE+ I KA +G+++ I Sbjct: 189 Y------------KAVVLLTDGEDLGG--HAIDEAKKASKEGVKVFAIGLGTP------- 227 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + + + RD G I ++ Sbjct: 228 ---------EGDYLRQTNDAGIEEYMRDADGQPIRTKL 256 >gi|171058998|ref|YP_001791347.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] gi|170776443|gb|ACB34582.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] Length = 350 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 47/159 (29%), Gaps = 35/159 (22%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN----------------------EDE 357 R V TAI + + T+ +D+ Sbjct: 153 SRDDVFAAIDRFQLQRGTAIGSGIVLSLATLFPEAGIDLSDITGERRMPKGIGESDKQDD 212 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ------ 411 + ++LLTDG+ T + + A +G+++ T+ F + Sbjct: 213 FKPVAPGSYGSAAVILLTDGQRTTG-PDPMDAAKMAADRGVKVYTVGFGTTSGEIIGFEG 271 Query: 412 ----QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L N A + +F A S +L K++ Sbjct: 272 WSMRVRLDEATLKNIANLTQAEYFYAGSATDLQKVYDTL 310 >gi|14248663|gb|AAK57617.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|309792255|ref|ZP_07686727.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225796|gb|EFO79552.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 391 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 60/185 (32%), Gaps = 21/185 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A VI ++ D V+ F+D V + + RT + Sbjct: 36 KLQSMKAATRKVIETLTDQDVVS------IVIFDDTVQTLVPATL---ATDRTALLAAID 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +E G TA++ MQ + + + +++LLTDG+ D E I Sbjct: 87 TISEAGGTAMSLGMQAGQVELQK---------HSGPDRLSHMLLLTDGQTWGDEETCRNI 137 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +RI + +Q + + + +++ F+ I + Sbjct: 138 ARALGQADVRITALGLGAEWNEQLLDDLAEFSDGTSDYI---ADANQIGTFFQRAIRSAQ 194 Query: 450 FERVI 454 Sbjct: 195 GTAAT 199 >gi|281338501|gb|EFB14085.1| hypothetical protein PANDA_006133 [Ailuropoda melanoleuca] Length = 984 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 75/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 39 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 98 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 99 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 158 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 159 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 218 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K + +R+ T FSV + + Sbjct: 219 V-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVFT--FSVGQHNYD 265 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 266 RGPIQWMACENKGYYYEIPSIGAI 289 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 24/47 (51%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 +K IK G + +I AL + V+LG+ + VDV ++ + + Sbjct: 5 IRKWIKEEKGSYIVIFALFLTVLLGMISLAVDVGMMYLKKNRMYEIV 51 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 61/158 (38%), Gaps = 17/158 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D +D VR+GA ++D S +I + + G T A++ Sbjct: 830 VKKADVDSDRVRVGALKYSDYPEVLFYLSGNKSAVIEHLRRRR----YTSGHTYTARALE 885 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E + + K+ ++++TDG + D + +K +++GI I + Sbjct: 886 HANIMFT-----EEYGSRIQQNVKQMLIIITDGV-SHDRDNLSDTASKLRNKGINIYAVG 939 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A ++ F ++ L I+ Sbjct: 940 VG------QANQLELETMAGNKSNTFHVDNFSNLKDIY 971 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 37/124 (29%), Gaps = 18/124 (14%) Query: 319 LIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + T A+ + + KK+++L+TDG Sbjct: 675 FTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTK-------GARTKVKKFLILITDG 727 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHE 436 E + + +G+ I ++ R L + +S F+ + + Sbjct: 728 EAQDPVRDP---AKALRDKGVVIFSVGV-YGAN-----RTQLEEISGDSSLVFQVENFDD 778 Query: 437 LNKI 440 L + Sbjct: 779 LKTV 782 >gi|14248651|gb|AAK57611.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|323650214|gb|ADX97193.1| inter-alpha-trypsin inhibitor heavy chain h2 [Perca flavescens] Length = 550 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 45/136 (33%), Gaps = 7/136 (5%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D++ FN V + K + + G T IN+A+ A Sbjct: 291 DDLTIDDHFSIIDFNHNVRCWSEELVAGSSIQIADAKKYIQNIKPNGGTNINEALMRAVQ 350 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAF 405 ++ ++ + ++ I+L++DG+ T + I K + + ++ Sbjct: 351 MLLKASNQGLIDPRSVS----MIILVSDGDPTVGEIKLSTIQKNVKRVMREEFSLFSLGI 406 Query: 406 SVNKTQQEKARYFLSN 421 + R + N Sbjct: 407 GFDVDYDFLERIAMEN 422 >gi|162449478|ref|YP_001611845.1| hypothetical protein sce1208 [Sorangium cellulosum 'So ce 56'] gi|161160060|emb|CAN91365.1| unnamed protein product [Sorangium cellulosum 'So ce 56'] Length = 607 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 46/164 (28%), Gaps = 17/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F+ +V + + G+T + + + Sbjct: 73 DRFSVVTFDHQVEVPIPSTDATPGARLRA-EAALDTVIARGNTDLGGGWLRGCAEVGAHL 131 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQ 411 ++ ++LLTDG+ + ++A+SQ + TI + Sbjct: 132 PEDAIGR---------VLLLTDGQANHGITSPDELTSRARSQRLRRVTTSTIGLGEGFNE 182 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 R +F+ A EL IG + V R Sbjct: 183 FLLGRLSEE---GGGNFYFAARADELPGFVGREIGEVLSV-VAR 222 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 31/348 (8%), Positives = 85/348 (24%), Gaps = 17/348 (4%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + ++ + + + L + D+ + Y Sbjct: 917 GVATGAAIDFVRQNSYTR--GNGDRTSVPDLLVVVTSSASTDDVASAQETAEKEGITIYT 974 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V +++ + S +++ + + + Sbjct: 975 VGVTNSVSFAELTSTAGSFSRVLRANDFSDLSAIRQPLHETICQAAFCGDPGTPANGFQQ 1034 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + C + Q + + + + PL Sbjct: 1035 GTYFEGNTVTFGCNFGFLLSGTDNTQCQADGSWSNPLPVCIAITTPAPTQAPACNDFPLF 1094 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 +D S LV+ +V+++ + R+ ++D+ ++ Sbjct: 1095 NGTDLVFLLDGSGSVGSNN-FDLVKTFTKNVVQNFDISETAT---RVAVVQYSDQFSTEF 1150 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S + G T A+ ++ +S Sbjct: 1151 SL--NAFSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTS------ISGERDGYPDL 1202 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +V++TDG +T D + A++QG+ I + + + Sbjct: 1203 LVVVTDGLSTDDVSGP---ADTARAQGVTIYAVGVGSDIDFNTLEQIA 1247 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 47/409 (11%), Positives = 110/409 (26%), Gaps = 47/409 (11%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + + + + N +F ++ + IRN + T + + + ++ Sbjct: 1413 SLADTRVGVVQYSDSPTLEFNLNSFNTNELVDLAIRNIQYQQGGTNTGQAIDFVRVNSFS 1472 Query: 121 EMNPRKSAYQ----VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 N +S VV + + + R+ GI + + +T Sbjct: 1473 ANNGDRSDVPNVMIVVTDGQSSDDVVGPAQTARNAGISMYAVGIGNGVDTNELLQIAGQV 1532 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + DFS + E + V + + G Sbjct: 1533 DRVVQSADFSTLFSVGENLQELVCDAGWCGDPGVPVNGFRLGSYFEGHVVTFGCNAGFIL 1592 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRH-------------------------VIKKK 271 N + + + ++ Sbjct: 1593 NGASSTICQGDGSWSNAVPVCVAITTPSPTRAPVCTDLSFGGLDLVFLLDGSGSVTAVNF 1652 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 LV+D + V+ + R+G F+D + ++ S + +++ + + Sbjct: 1653 DLVKDFASGVVSEFQISTTET---RVGVVQFSDTLRTEFFMSS--FSTKQQVLQAISDID 1707 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T A+ A + S+ N +++++TDG + + Sbjct: 1708 YIQGNTLTGAAITFATASSFST------PAGNRANFPDFMIVVTDGLSQDSVVQP---AQ 1758 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 A+ QGI I + L P + +L Sbjct: 1759 SARDQGITIFAVGVGNEVDFA----TLLQITGVPEYILQVTDFSDLLAA 1803 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 45/407 (11%), Positives = 94/407 (23%), Gaps = 33/407 (8%) Query: 36 VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI 95 G V+ R + +A + NS + + Sbjct: 633 FTGFAVEFTRQIAFRT-----------SAGTRDNYPDILIVVTDGNSEDVVTSAVASAID 681 Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + ++ + ++ T + S L + G Sbjct: 682 QGILIYAVGVGSNVDFASLLELTGGVNSRVLQVSDFTGLSTAAQTLPTVLCAAAYCGDPG 741 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR-DSEGQPLNCFGQPADRTVKS 214 S + S + S P+ + Sbjct: 742 APTNGYRRGTFFVDSVVTFGCNDGYLLQGSSNTSCLGTGQWSNPVPVCLDRCDPNPCFHG 801 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + Y + L VD SS LV Sbjct: 802 AACSVQGSSFVCVCPPNYEGELCQFYTPCLNRSVE---FDLVFLVDKSSSVGPAN-FELV 857 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDENE 333 ++ + R+GA F D D ++ + ++ +N+ Sbjct: 858 KE---FMYDFTNTFSVGLSDTRIGAVQFADAQTKDFDMDTFATKEQTLAGIQNIVYTDNQ 914 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 +G A A+ N +V+ + + ++ + A Sbjct: 915 VGGVATGAAIDFVRQ-----NSYTRGNGDRTSVPDLLVVVTS----SASTDDVASAQETA 965 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + +GI I T+ + + + E S + AN +L+ I Sbjct: 966 EKEGITIYTVGVTNSVSFAELTSTA----GSFSRVLRANDFSDLSAI 1008 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 42/294 (14%), Positives = 80/294 (27%), Gaps = 29/294 (9%) Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 FL + A+T + V I +D + S +N F Sbjct: 2153 FLIVVTDGLSQDNVAVPAQTARNNGISIFAVGIGSEVDADTLLQIAGTPSRTLQINDFAG 2212 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + + S + +P S+ + LD S S Sbjct: 2213 LVNAEEQLASIVCNTITTTPPATTPTPAPSCSSITPVDLVFLLDGSSSITSPN------- 2265 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +V+D A V+R+ + ++D + ++ ++ Sbjct: 2266 ----FQIVKDFTADVVRTFNVSSAATNVG---LVQYSDTIRTEFFL--NSFDTKSGVLNA 2316 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G+T A+ + S + Y++++TDG + D Sbjct: 2317 IGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPD------YLIVVTDGLSQDDVVVP 2370 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 A++ GI I + L+ SPN + N L Sbjct: 2371 ---AQTARNDGISIFAVGIGSEIDFA----TLLNIAGSPNRILQINDFAGLANA 2417 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 48/157 (30%), Gaps = 14/157 (8%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D ++ R+G +++ ++ S ++ + + GST A+ Sbjct: 324 MANNFDISPNSTRVGVVQYSNFPGTEFSL--NAFTDKAAVLDAISKIDYNGGSTFTGAAI 381 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + +S + + ++++TDG D N A GI + Sbjct: 382 DFVRNNEFTS------VNGDRDDVPNILIVITDGNPNDDVSGPAISANNA---GITTYAV 432 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 N Q + P +A +L + Sbjct: 433 GIGSNVDQANLVQMTAGR---PGRVLQAADFTDLTTV 466 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 48/171 (28%), Gaps = 24/171 (14%) Query: 278 LASVIRSIKK-IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + D + + R+ ++ + + + + I S Sbjct: 34 VKDFTQQTTAKFDISDGSTRVAVAQYSSTPQVEFNLNTN-SDVDTLSNAIEQITYMNGDS 92 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A++ + SS + +V++TDG+ + + A+ Q Sbjct: 93 TFTGFAIEFVRQSAFSSFNGAR------DDKPDIMVVVTDGQ--SADSVTSSAAT-AREQ 143 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCAS--------PNSFFEANSTHELNK 439 G+ + + E L + A + A S + Sbjct: 144 GVTMFAVGVGTGVGLSE-----LQDIAGYTDRVLQLNDFVQLAQSADTIQS 189 Score = 36.5 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 56/195 (28%), Gaps = 19/195 (9%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +D SS +V+D A V+R+ + ++D Sbjct: 2501 SCSSITPVDLVFLLDGSSSITSPN-FQIVKDFTADVVRTFNVSSAATNVG---LVQYSDT 2556 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + ++ ++ G+T A+ + S + Sbjct: 2557 IRTEFFL--NSFDTKSEVLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPD-- 2612 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 Y++++TDG +E + A+ +GI I + L SP Sbjct: 2613 ----YLIVVTDGL---SQDEVLGPAQTARFEGINIFAVGIGNEIDFT----TLLHIAGSP 2661 Query: 426 NSFFEANSTHELNKI 440 N + N L Sbjct: 2662 NRVLQINDFAGLASA 2676 >gi|14248653|gb|AAK57612.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248655|gb|AAK57613.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248657|gb|AAK57614.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248659|gb|AAK57615.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248661|gb|AAK57616.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|211546|gb|AAA48695.1| cartilage matrix protein [Gallus gallus] Length = 416 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 35/289 (12%), Positives = 74/289 (25%), Gaps = 44/289 (15%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + S S+++ + + + I Y + Sbjct: 116 PLDDHVDYVESYSVIEKLTHKFQEAFCVVSDLC------ATGDHDCEQICISTPGSYKCA 169 Query: 236 CNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 C + + S + LV+ + ++ S++ + Sbjct: 170 CKEGFTLNNDGKTCSACSGGSGSALDLVFLIDGSKSVRPENFELVKKFINQIVESLEVSE 229 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 ++G ++ V + G K + I T A++ D+ Sbjct: 230 K---QAQVGLVQYSSSVRQEFPL--GQFKNKKDIKAAVKKMAYMEKGTMTGQALKYLVDS 284 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S + K ++ TDG + + KAK G R+ + Sbjct: 285 SFSIANGARPGV------PKVGIVFTDGRSQDYITD---AAKKAKDLGFRMFAVGVG--- 332 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR--IGNEIFERV 453 L AS +F FR IG ++ ++ Sbjct: 333 ---NAVEDELREIASEPVAEHYFYTAD-------FRTISNIGKKLQMKI 371 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 44/154 (28%), Gaps = 22/154 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDE 357 + V ++ S + + T A+Q A I + D Sbjct: 3 VINYASAVKNEFSLKT---HQTKAELLQAVQRIEPLSTGTMTGLAIQFA---ISRAFSDT 56 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + K +++TDG ++ A +A+ GI I I + Sbjct: 57 EGARLRSPNINKVAIVVTDGRPQDGVQDVSA---RARQAGIEIFAIGVGR------VDMH 107 Query: 418 FLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L F++ Sbjct: 108 TLRQIASEPLDDHVDYVESYSVIEKLTHKFQEAF 141 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 35/289 (12%), Positives = 74/289 (25%), Gaps = 44/289 (15%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + S S+++ + + + I Y + Sbjct: 193 PLDDHVDYVESYSVIEKLTHKFQEAFCVVSDLC------ATGDHDCEQICISTPGSYKCA 246 Query: 236 CNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 C + + S + LV+ + ++ S++ + Sbjct: 247 CKEGFTLNNDGKTCSACSGGSGSALDLVFLIDGSKSVRPENFELVKKFINQIVESLEVSE 306 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 ++G ++ V + G K + I T A++ D+ Sbjct: 307 K---QAQVGLVQYSSSVRQEFPL--GQFKNKKDIKAAVKKMAYMEKGTMTGQALKYLVDS 361 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S + K ++ TDG + + KAK G R+ + Sbjct: 362 SFSIANGARPGV------PKVGIVFTDGRSQDYITD---AAKKAKDLGFRMFAVGVG--- 409 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR--IGNEIFERV 453 L AS +F FR IG ++ ++ Sbjct: 410 ---NAVEDELREIASEPVAEHYFYTAD-------FRTISNIGKKLQMKI 448 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 53/168 (31%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D ++ R+G + V ++ S + + T A+ Sbjct: 66 IEGLDVGPNSTRVGVINYASAVKNEFSLKT---HQTKAELLQAVQRIEPLSTGTMTGLAI 122 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A I + D + K +++TDG ++ A +A+ GI I I Sbjct: 123 QFA---ISRAFSDTEGARLRSPNINKVAIVVTDGRPQDGVQDVSA---RARQAGIEIFAI 176 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS + S +L F++ Sbjct: 177 GVGR------VDMHTLRQIASEPLDDHVDYVESYSVIEKLTHKFQEAF 218 >gi|90407194|ref|ZP_01215382.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] gi|90311770|gb|EAS39867.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] Length = 1116 Score = 55.0 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 44/327 (13%), Positives = 87/327 (26%), Gaps = 73/327 (22%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 ++ + S + +N + K SPYM K+ Sbjct: 119 IVRGNKYSWAPLAWVYQTEIYDCKADQDDKNNTEYPINNKKSPYMKKNKKNKKNKKIFTS 178 Query: 249 LD-----PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 D + +++ + + + + ++ + ++I S +D G + FN Sbjct: 179 PDAVTLYTANYYYWYMNKRASNNTRSRVDIAKEVITTLISSTPSVD-------FGLSIFN 231 Query: 304 DRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTA------------INDAMQ----- 344 + K + + + ST + Sbjct: 232 SKGSGGRIIHRVKERDKPEKEQLVASINNLKIESSTNTPLLETVFESYLYYAGLNVKSGK 291 Query: 345 -----TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------------N 383 + + + + K Y+VL+TDG+ + N Sbjct: 292 TAKGSEPIRDMKAEDAGVYTSPFKTCQEKSYLVLMTDGDPSSPYALQKEIGSLIKRDNPN 351 Query: 384 EEGIAICN------------------KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-- 423 I N A+ Q I TI F + T A L A Sbjct: 352 SNLKGIANSYMPVLTQWMFNNDINPKLAEEQNITTYTIGFG-DITSDTDATDLLKKTAEL 410 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIF 450 +F A++ L K F+ I + + Sbjct: 411 GGGKYFPASNASALQKAFKKMIISILN 437 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 57/195 (29%), Gaps = 26/195 (13%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + S+ + F + RT +K Sbjct: 107 TPNRLDAAKSGAKDFVDSLPSA------FNVALVTFAGTANVKMPPT-----TDRTQLKA 155 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 STAI + + T+ D + + + IVLL+DG + Sbjct: 156 AIDAIRLAPSTAIGEGIYTSLDVLEKLAPQDPDHPDDPAPG--AIVLLSDGATNMGR-DS 212 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ---------QEKARYFLSNCA--SPNSFFEANSTH 435 +AK + + I TIA+ + LS A S + A+S Sbjct: 213 ADAATEAKKKNVPIYTIAYGTSTGYVVENGQRQTVAVNHAELSQVAKLSGGKKYSADSMK 272 Query: 436 ELNKIFRDRIGNEIF 450 L ++ I +I Sbjct: 273 NLQAVY-QTISRQIG 286 >gi|198436156|ref|XP_002124087.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1702 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 59/178 (33%), Gaps = 24/178 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + +I + ++G ++D + + I K E+ G TA Sbjct: 396 LRNTISSFPIGEEYTQIGLATYSDNPRIIFHL--NKYHKLDDIRKAVLEVEHTSGGTATG 453 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ + + + H AK+ +V+LTDG++ D I AK GI + Sbjct: 454 KAILYLTNNMFT------HENGVRPNAKRLVVVLTDGKSQDDVIVPSRI---AKESGIVM 504 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN--SFFEANSTHELNKIFRDRIGNEIFERVIRI 456 I + L AS ++ N L + I E+ + + Sbjct: 505 FAIGVGKVVMGE------LRAIASDPDRYVYKINDFSAL-----ESIRRELSHSIASL 551 >gi|150389538|ref|YP_001319587.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] gi|149949400|gb|ABR47928.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] Length = 551 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 55/186 (29%), Gaps = 28/186 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + A + + ++ ++ FN V + + L Sbjct: 191 MTQAKSAAKQFLNYVDF----SNGDQVEIIEFNSDVYIRIPYGSDIKSLNTA-----IDT 241 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 TA+ DA+ T R + K + TDGE + Sbjct: 242 MESNSQTALYDALYTGL-----------VRAYSQSGPKCILAF-TDGEENASIRSVSEVT 289 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 +++ I I I ++ L A + +F + + EL +I++ + Sbjct: 290 ELSRATSIPIFIIGVGSLIDEES-----LKEIAEQTGGEYFYSPTAVELEQIYKTVYDQQ 344 Query: 449 IFERVI 454 + V+ Sbjct: 345 KEQYVL 350 >gi|170743966|ref|YP_001772621.1| hypothetical protein M446_5903 [Methylobacterium sp. 4-46] gi|168198240|gb|ACA20187.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 417 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/332 (10%), Positives = 83/332 (25%), Gaps = 20/332 (6%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 G II M + +G + +D Y + L++ A A + A + Sbjct: 2 RQRFVAFAAHKQGGASIIGLFGMLIAVGFAAVAIDSGNLYYSKLKLQKIADAAALGAVMA 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L S +++ + + F V D T ++ Sbjct: 62 LPTSSSVMAAA--------LDLVSKNTPVGFGTVSTSADIQIGVYDPSSKTFTPSAIGQN 113 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 A QV L+ +G+ S + + A + + + Sbjct: 114 AVQVTTRRSSAYGNAVLTYVAGILGVSSVDMAASSVAVKYGGACVMVLEPASAGSLQTKG 173 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S L + + +++ G + + + + + Sbjct: 174 S----------SALQTNCPIQVNSSSATAARTGGSSSITASQICVVGNYSGTGFSPWPKI 223 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL--ASVIRSIKKIDNVNDTVRMGATFFNDR 305 + V + + + + S + N T++ G +F Sbjct: 224 NCPSLVDPLANVPEPAQPVCTVNNPSISSGVFPTNCTYSGTVSLSGNVTLQSGLYYFKSA 283 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 IS + + + + T Sbjct: 284 NISVTGSTSITGSGLTIFLDKDSTLNLSGSGT 315 >gi|115678877|ref|XP_794839.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1031 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/290 (10%), Positives = 77/290 (26%), Gaps = 23/290 (7%) Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + +S + L + G + +T ++ Sbjct: 180 SQEYEVSPTRIVQKGKLMFYGTCSGNSAIGYETGCIVRITGADQASGSIMNGPLTYSNIV 239 Query: 188 SMLDYQRDS-----EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + D + + N + + + + + S + Sbjct: 240 NFCDDDVNGIGNLHNREAPNKHNRLCNSRSAWEVMREHEDFNDNSPPSNLN-DAQIEPSF 298 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L +D+S + V A + + + + +G F Sbjct: 299 DLVQASTGDECRVVLVLDTSGSMGTSNRIDKVNSAATAFVNLVDDG------ISIGIVTF 352 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + R ++ G T I ++ + +++ Sbjct: 353 TGSPTTRHALTQINTQADRDSLRD-IFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGG- 410 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 IVL+TDG+++ I + G+R+ T+A + + Sbjct: 411 -------IIVLMTDGQDSGIQNHIIR--QTLQDMGVRVNTVAIGEDAYGE 451 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 65/200 (32%), Gaps = 28/200 (14%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + + ++A +R++ + +G F S Sbjct: 93 DISNSMAATDVSPDRFTVAKEAATEFVRNL------PEQFNVGLVSFARTATVVAPPSTN 146 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + D STAI +A+ T+ + S + IVLL+ Sbjct: 147 HQAAVDAIEQLTLTD-----STAIGEAVLTSLQAVRSLDAQAAEDPPPAR-----IVLLS 196 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLS--NCAS 424 DG NT A G+ + TIA+ + + L A+ Sbjct: 197 DGGNTSGRPIDEG-ARAATEAGVPVSTIAYGTPEGTIDLEGRSIPVPADTESLRGLADAT 255 Query: 425 PNSFFEANSTHELNKIFRDR 444 SF+ A S EL ++ D Sbjct: 256 SGSFYAAESDEELRDVYSDL 275 >gi|47217757|emb|CAG05979.1| unnamed protein product [Tetraodon nigroviridis] Length = 1380 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 60/173 (34%), Gaps = 20/173 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +R +A+++ S D R+ ++DR ++ + + ++ + + + Sbjct: 25 FEKIRQWVANLVDSFDVA---PDKTRVAVVRYSDRPTTEFNLA--RYRTLEDVKRAARNI 79 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T DA+ I + ++ +LLTDG + E Sbjct: 80 RYLGGNTMTGDAISYTTSNIFTERNGARPIA---RGIQRVAILLTDGRSQDYVLEPSKAA 136 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 KA GIR+ + E + L A+ F + ++KI Sbjct: 137 AKA---GIRMFAVGIG------EALKVELDEIAAEPKNAHVFHVTDFNAIDKI 180 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 61/190 (32%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + + + +DR + L RT KT Sbjct: 60 TRMAAAKQAFNEVLDATPEEVRLGIRTLGADYPGDDRKTGCKDTAQLYPVGPLDRTEAKT 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ A D + + K IVL++DGE+T + Sbjct: 120 AVATLSPTGWTPIGPALLKAADDLD------------GGDGSKRIVLISDGEDTCAPLDP 167 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N +++ A+ ++ T EL Sbjct: 168 CEVAREIAAKGIGLTIDTLGLVPNTKMRQQLSCIAE--ATGGTYTSVEHTDELTDKVNQL 225 Query: 445 IGNEIFERVI 454 + V Sbjct: 226 VDRAADPVVT 235 >gi|14248649|gb|AAK57610.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|81892746|sp|Q6Q473|CLCA4_MOUSE RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel regulator 6; Short=mClca6; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|50882459|gb|AAS86332.2| calcium activated chloride channel [Mus musculus] Length = 924 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 54/194 (27%), Gaps = 28/194 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I + + +G F+ + + Sbjct: 313 DVSGSMTSYDRLNRMNQAAKYFLSQIIE-----NRSWVGMVHFSSQATIVHELIQINSDI 367 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + G T+I ++ A+ + I+LL+DGE+ Sbjct: 368 ERNQLLQTLPT-SANGGTSICSGIKAAFQVFKNGEYQTDGTE---------ILLLSDGED 417 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + + C ++ K G + IA L + + A + Sbjct: 418 STAKD-----CIDEVKDSGSIVHFIALGPLADLAVTNMSIL----TGGNHKLATDEAQNN 468 Query: 437 -LNKIFRDRIGNEI 449 L F Sbjct: 469 GLIDAFGALASENA 482 >gi|332970076|gb|EGK09074.1| D-amino acid dehydrogenase large subunit [Desmospora sp. 8437] Length = 454 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 47/136 (34%), Gaps = 17/136 (12%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + G T + AM+ A + +A+ I +++DG T Sbjct: 218 KGKFGKALNSFEPTGWTPLAAAMEEARQDLKP---------YAGEDAENIIYVVSDGIET 268 Query: 381 QDNEEGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + I+ + I F V+ Q+ + A + AN+ ELN Sbjct: 269 CG-GDPVKAAKSLYNSDIQAVVNIIGFDVDDAGQQALKKVAE--AGGGEYKTANTREELN 325 Query: 439 KIFR---DRIGNEIFE 451 + F D I E+ + Sbjct: 326 QSFGIDWDEIEKEVSK 341 >gi|313902415|ref|ZP_07835818.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] gi|313467346|gb|EFR62857.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] Length = 170 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 20/57 (35%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G LL+PV+L G+++D R + A A + + + Sbjct: 17 QGGAVGAAFLLLLPVILAALGLVLDGSRLVLTRAHAQAVADFASLAGVQEVDEEALA 73 >gi|331658353|ref|ZP_08359315.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] gi|331056601|gb|EGI28610.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] Length = 574 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/394 (9%), Positives = 110/394 (27%), Gaps = 31/394 (7%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 Q + + +++E+ K +EV+ ++ + Sbjct: 21 SQQQQPSTPSDQQVLVAQQVAIKEVEQSAAAAKAAADAKALAQQEVQQYSDKQTLQGRLK 80 Query: 126 -KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + + NP + + + T S +++ ++ Sbjct: 81 EAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLDVDTGSYANVRRFLN 140 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNK 238 + + +N F D K + + + Sbjct: 141 Q-GLLPPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 199 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 200 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------ 253 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G H + + + GST ++ AY ++ Sbjct: 254 IVTYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIK 308 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + I+L TDG+ ++ + + K + G+ + T+ + + Sbjct: 309 GGINR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMM 361 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 362 VRIAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 393 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 59/180 (32%), Gaps = 23/180 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 V++ + ++++ + D D R+G + V ++ S + + + Sbjct: 34 DFEKVKEFILNILQFL---DIRPDVTRVGLIQYGSTVKNEFSLK--TFARKQDMERAVRR 88 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T A+Q A + S E + I+++TDG I Sbjct: 89 MMYLSTGTMTGLAIQFAVNIAFSETEGARPLS---QNVPRVIMIVTDGRPQDP---VAEI 142 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHE---LNKIFRD 443 KA++ GI I I L + S + F + + L +F++ Sbjct: 143 AAKARNSGILIFAIGVGR------VDMNTLKSIGSEPYEDHVFLVANFSQIETLTSVFQN 196 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 68/248 (27%), Gaps = 28/248 (11%) Query: 203 CFGQPADRTVKSYSSQNGKVGI---RDEKLSPYMVSCNKSLYYMLYPGPLDPSL----SE 255 ++ G Y C++ Sbjct: 519 MLRDDGKTCRNRDVCKSTNHGCEHICVNTGDSYACKCHEGFALQEDRKTCKGCTEGPVDL 578 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D S V +V++ + ++ S+ R+G ++ +V ++ + Sbjct: 579 VFVIDGSKSLGVNN-FEIVKEFVLGILDSLTISPKAA---RIGLLQYSTQVRTEFTLKQF 634 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + K + + + A++ + + E H + + V+ T Sbjct: 635 STATD--MKKAVSQMKYMGKGSMTGLALKQMTERSFTEAEGARHLS---AKVPRVCVVFT 689 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEAN 432 DG +E KAK +GI + I + L AS F A Sbjct: 690 DGRA---QDEVSEWAAKAKQRGITMYAIGIG------KAIEEELREIASDPPEKHLFYAE 740 Query: 433 STHELNKI 440 + +I Sbjct: 741 DFSAMGEI 748 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 47/165 (28%), Gaps = 21/165 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ + I+ + + + T + A++ + Sbjct: 108 DRIGLIAFSGVAFVQCPLTLDY-GAIQMFLDELKPELIPVAGTDLGAAIEAGISSFD--- 163 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K I+L+TDGE+ + KAK +G++I Sbjct: 164 --------FKSVTDKVIMLITDGEDNEGKGLI--AAQKAKEKGVKIFVFGMGDPSGGPIP 213 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIF-RDRIGNEIFERVIRITK 458 A F + + + + + + R T+ Sbjct: 214 ATD------GKGGFRKDRNGNVIMSKMDEESLEKIASATGGRYTR 252 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRSEDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 53/154 (34%), Gaps = 17/154 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIAS 497 >gi|86137907|ref|ZP_01056483.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] gi|85825499|gb|EAQ45698.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] Length = 181 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 21/51 (41%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 ++ K + A+LMP+ L V++ + L++A A+ Sbjct: 10 LRRFRKDTDATVTVEFAILMPLFLMFLFSTVELGMLILRQSQLERALDIAV 60 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ I+ +D + R+G ++ +V S G + + T Sbjct: 59 LVNIIRGLDIGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIHAIVPLAQGTMTG 116 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + + V++TDG + + + +A+++GI I Sbjct: 117 LAIQYAMNVAFSVAEGARP---SQARVPRVAVIVTDG---RPQDRVTEVAAQARNRGIEI 170 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 + L ASP F S + + Sbjct: 171 YAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQQ 206 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 52/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G ++ RV ++ G + + + Sbjct: 404 FELVKRFVNQIVDFLDVS---PEGTRVGLVQYSSRVRTEFPL--GRYGTADEVKQAVLAV 458 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R + ++ TDG +QD+ Sbjct: 459 EYMEKGTMTGLALRHLVEHSFSEAQGARPRA---QNVPRVGLVFTDGR-SQDDISV--WA 512 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 513 ARAKEEGIIMYAVGVG------KAVEEELREIASDP 542 >gi|306814616|ref|ZP_07448778.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] gi|305852010|gb|EFM52462.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] Length = 581 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/394 (9%), Positives = 110/394 (27%), Gaps = 31/394 (7%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 Q + + +++E+ K +EV+ ++ + Sbjct: 28 SQQQQPSTPSDQQVLVAQQVAIKEVEQSAAAAKAAADAKALAQQEVQQYSDKQTLQGRLK 87 Query: 126 -KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + + NP + + + T S +++ ++ Sbjct: 88 EAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLDVDTGSYANVRRFLN 147 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNK 238 + + +N F D K + + + Sbjct: 148 Q-GLLPPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 206 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 207 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------ 260 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G H + + + GST ++ AY ++ Sbjct: 261 IVTYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIK 315 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + I+L TDG+ ++ + + K + G+ + T+ + + Sbjct: 316 GGINR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMM 368 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 369 VRIAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 400 >gi|123490500|ref|XP_001325627.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908529|gb|EAY13404.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/372 (8%), Positives = 101/372 (27%), Gaps = 29/372 (7%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 + + + + ++V ++ + E N S + + + ++ Sbjct: 60 TYDYLFIVDRNEIKPKICHQYEAQDVTNVSKIQQYEGNGLNSFFLGSIGPNKTIEIHHKV 119 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 F+ ++ + + + + + ++ I + + + Sbjct: 120 SFMAMANENGFIYTFPLSNKGIYGPFTYQRPENFEFSIHVKTLSELKEIINSVRGTINVI 179 Query: 206 QP---ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 P T + + ++ + + + + + + + + Sbjct: 180 DPHNVIFATKEFPNDESITIETQIKDKDNNIAIWSDGYIAISTFTYFETKVHSNSEFYFI 239 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 +++A + + + R F + + Sbjct: 240 IDCSGSMSGSCIQNAKLCLNIFMHSL---PIGCRFSIIKFGSDYEVALHPCDYTDENVSE 296 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 +K + EMG T I ++ + + K + LLTDG+ Sbjct: 297 AMKQLNNIDAEMGGTDILSPLKYVME-LTPKQGFI-----------KQVFLLTDGQ---- 340 Query: 383 NEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS--THELN 438 + +C A+ RI +I + S ++ + + +LN Sbjct: 341 DSNTNELCALAQENRTNNRIFSIGIGSGADKDLIINVSQK---SGGNYVFVDDDESEKLN 397 Query: 439 KIFRDRIGNEIF 450 + + + + I Sbjct: 398 EKVIELLNSAIS 409 >gi|317122048|ref|YP_004102051.1| hypothetical protein Tmar_1211 [Thermaerobacter marianensis DSM 12885] gi|315592028|gb|ADU51324.1| Protein of unknown function DUF2134, membrane [Thermaerobacter marianensis DSM 12885] Length = 344 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 11/71 (15%), Positives = 27/71 (38%), Gaps = 2/71 (2%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV-- 75 G+ ++ A+ + ++ + G++VD R L+ AA A + + L + Sbjct: 55 QEGNVAVLVAICLAALMAMAGLVVDGGRLYTERARLQAAADVAALAGATELPEDPAAARQ 114 Query: 76 SSRAKNSFTFP 86 + + Sbjct: 115 VALDYLARNGV 125 >gi|161524898|ref|YP_001579910.1| hypothetical protein Bmul_1725 [Burkholderia multivorans ATCC 17616] gi|160342327|gb|ABX15413.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 626 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 48/169 (28%), Gaps = 6/169 (3%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + G ++ A+ + + + V G +DV + L++ A A I + Sbjct: 2 TRQRDAGWRGKSRQRGSVAVMAAIWVMIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIAS 60 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + +S A + + + + T + Sbjct: 61 VESMTDQCSQQNSPAMMAAQSNALANGFDYRASGQTLSIECGRWDT-----SATPYFNST 115 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 V +S + L F G + + A+ ++ Sbjct: 116 FTPLNAVSVSVTQQVPYIFLGRFFGKSGSTGATVAAFSTAKAINIDSFT 164 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 55/140 (39%), Gaps = 20/140 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + + G T+I + ++ ++TI SS++ IVLLTDGE+ Sbjct: 371 SSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE---------IVLLTDGED 421 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 Q + C + K G I TIA + + LS+ F+ + + L Sbjct: 422 DQISS-----CFEEVKHSGAVIHTIALGPD---AARELETLSDMTGGRRFYASEGINGLI 473 Query: 439 KIFRDRIGN--EIFERVIRI 456 F + ++ +++ Sbjct: 474 DAFSGISSKSGSLSQQALQL 493 >gi|163848202|ref|YP_001636246.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526110|ref|YP_002570581.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669491|gb|ABY35857.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449989|gb|ACM54255.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 415 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 69/243 (28%), Gaps = 26/243 (10%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + Q G+ I +++ L+ +D S K ++ Sbjct: 8 TCQWGRAPIPSSTTPQVGYLLVEAVAP--ATLAPSLPLNLCFVLDRSGSMQGA-KLDNLK 64 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A VI ++ D F+D V + + R+ + +E G Sbjct: 65 AATRRVIEQLQPTD------IAAIVIFDDTVQTLIPATP---VGDRSALLAAVDSISEAG 115 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TA++ MQ + + ++LLTDG+ D + Sbjct: 116 GTAMSLGMQAGQVELQK---------HLGSDRLSRMLLLTDGQTWGDEPLCRDLARSLGQ 166 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI--GNEIFERV 453 G++I+ +Q AS + ++ F+ + Sbjct: 167 AGVQIVAFGLGTEWNEQLLDDI---AAASNGYSDYIAAPEQIGTFFQQAVHEAQATVATE 223 Query: 454 IRI 456 R+ Sbjct: 224 TRL 226 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 499 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 279 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 333 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 334 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 387 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 388 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 417 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 499 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 400 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 454 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 455 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 508 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 509 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 538 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 499 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 320 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 375 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 428 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 429 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 458 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 402 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 456 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 457 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 510 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 511 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 540 >gi|194211147|ref|XP_001917810.1| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 909 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 64/203 (31%), Gaps = 30/203 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + A + + + + +G F+ + + + + Sbjct: 312 DKSGSMAGSNRLSRMNQAAKHFL-----LQTIENGSWVGMVHFDSIALVISNLTQIISSN 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + +G T+I +++A++ I IVLLTDGE+ Sbjct: 367 ERNKLFESLPT-EAVGGTSICAGIKSAFEVITDMYSQIDGSE---------IVLLTDGED 416 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 C ++ K G I IA + Q + + A+ Sbjct: 417 N-----TAGSCVDEVKQSGAIIHFIALGPSADQAVIEMSTI----TGGKHKYASDEAANN 467 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + ++ ++ +++ Sbjct: 468 GLIDAFAALVSGNADLSQQSLQL 490 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 55/140 (39%), Gaps = 20/140 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + + G T+I + ++ ++TI SS++ IVLLTDGE+ Sbjct: 371 SSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE---------IVLLTDGED 421 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 Q + C + K G I TIA + + LS+ F+ + + L Sbjct: 422 DQISS-----CFEEVKHSGAVIHTIALGPD---AARELETLSDMTGGRRFYASEGINGLI 473 Query: 439 KIFRDRIGN--EIFERVIRI 456 F + ++ +++ Sbjct: 474 DAFSGISSKSGSLSQQALQL 493 >gi|197117534|ref|YP_002137961.1| hypothetical protein Gbem_1146 [Geobacter bemidjiensis Bem] gi|197086894|gb|ACH38165.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 383 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/282 (8%), Positives = 64/282 (22%), Gaps = 10/282 (3%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL----------IQSLEEV 75 +++ V L + G+ +D+ + L+ +A+ A +T + L Sbjct: 6 VTIMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKQRLLFQAQQSPEKLAQ 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S ++ ++ + L D + S+ Sbjct: 66 VSADPLQSAARSTAVDNATGKHSASALVALMNDNGNALSENNDITVGFWNMSSRSYTPGG 125 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + A+ + V++ ++ +RS + Sbjct: 126 TPVNAMQVRARRTAESSSVGLGSLGTFVAKISGTASFGSTPVAVAALVPGTRSNIAICAA 185 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + + K + S + L Sbjct: 186 ACEPSCSYPDVCNIAERKMSHAPWDPRRENSSANRYLYTSLLHPVTITNTMSDLVCQEMP 245 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 V + K + L +++ + + Sbjct: 246 VQEVCGLPIFTAAMKTDAILRDLKAMMYDPNVDSSNKEYDNN 287 >gi|332970884|gb|EGK09861.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 440 Score = 54.6 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 75/224 (33%), Gaps = 18/224 (8%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + + S+ + ++ +K + +DA+ + + + Sbjct: 106 DTSYNVNSKKPGTDVDKPKFDQYNVSILLDASGSMAAQVSGGRKMDVAKDAVETFVSTFP 165 Query: 287 KIDNVN--DTVRMGATFFNDRVISDPSFS--WGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + NV+ G+ D+V+S + + ++ + G T + DA Sbjct: 166 EEANVSLIAYGHKGSNSQKDKVLSCSEIEEIYPLSTYDQSTFSNALKTVDATGWTPLADA 225 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI--RI 400 ++ + T+ ++ + + +++DG T + GI + Sbjct: 226 IKKSGKTLKANADKNSKN---------VVYIVSDGLETCG-GHPAKEAEALQKDGISATV 275 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I F VN +Q+ + A +F ANS +L F Sbjct: 276 NIIGFDVNNAEQQALKEVAE--AGGGTFTSANSKSDLESYFESE 317 >gi|301764709|ref|XP_002917776.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Ailuropoda melanoleuca] Length = 1091 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 75/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 52/164 (31%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + T Sbjct: 57 QFLVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFARREDMEHAIRALVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSVAEGARPPEARVP---RIAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + Sbjct: 169 EIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 206 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 363 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 417 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 418 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 471 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 472 ARAKEEGIVMYAVGVG------KAVEEELRQIASEP 501 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 52/164 (31%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + T Sbjct: 33 QFLVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFARREDMEHAIRALVPLAQGTM 90 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 91 TGLAIQYAMNVAFSVAEGARPPEARVP---RIAVIVTDG---RPQDRVAEVAAQARARGI 144 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + Sbjct: 145 EIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 182 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 380 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 434 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 435 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 488 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 489 ARAKEEGIVMYAVGVG------KAVEEELRQIASEP 518 >gi|71061058|dbj|BAE16255.1| calcium activated chloride channel [Rattus norvegicus] Length = 902 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 39/290 (13%), Positives = 84/290 (28%), Gaps = 26/290 (8%) Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + + + I F +++ + N + + S + + Sbjct: 224 KFFPDKIQTARASIMFMQNLNSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQ 283 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + ++ + +D S + + A + I Sbjct: 284 QAPPMRGTEAPPPPTFSLLKSRQRVICLVLDKSGSMDTEDRLIRMNQAAELYLTQI---- 339 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V +G F+ + T + + +G T+I ++ + Sbjct: 340 -VEKESMVGLVTFDSTAQIQNYLIKITNTGD-YKKITGNLPQQAVGGTSICRGLEAGFQA 397 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVN 408 I SS++ IVLLTDGE+ I+ C K G I TIA Sbjct: 398 ITSSDQSTSGSE---------IVLLTDGEDD-----LISSCFEVVKHSGAVIHTIALG-- 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFR--DRIGNEIFERVIRI 456 + + LS+ F+ + L F + ++ +++ Sbjct: 442 -PKAARELETLSDMTGGLRFYANKDVNSLMDAFSGISSASGNLSQQALQL 490 >gi|221067364|ref|ZP_03543469.1| conserved hypothetical protein [Comamonas testosteroni KF-1] gi|220712387|gb|EED67755.1| conserved hypothetical protein [Comamonas testosteroni KF-1] Length = 419 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 25/261 (9%), Positives = 60/261 (22%), Gaps = 4/261 (1%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL 94 G G+ +D R L+ A + + A+ L + + +S Sbjct: 43 GFMGIALDFGRLFIVRTELQTALDSCALAAARELNKQPDAISRAVSAGAAAGNLNGVNLQ 102 Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL-LNPLSLFLRSMGI 153 N+ + D RD + + S + L L F + Sbjct: 103 SANWSGQGQITAADISFRDASYLSTTSATAAVYSQCTHTQSNIGMWLLKALGAFSGNSAS 162 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 A S + + + + G L + + Sbjct: 163 FPATGNVGTYAVATRASAQSACPIPVALQAKTPGATRPDYGYTVGDWLTLLAKSNGGSTF 222 Query: 214 SYSSQNGKVGIRDEKLSPYMVSC---NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 +++ +G + +L + + + +S + + Sbjct: 223 GWANLDGSNSASETADELNGHCGSKLGDTLGTPGVQASIVEIWNYRFGIYKNSSGPSVGR 282 Query: 271 KHLVRDALASVIRSIKKIDNV 291 + S + Sbjct: 283 PDYTGYIYDTSTWSAGRSAYN 303 >gi|116671476|ref|YP_832409.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] gi|116611585|gb|ABK04309.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] Length = 353 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 45/157 (28%), Gaps = 13/157 (8%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G +I A+++ V+LG G + VDV L+ ++ Sbjct: 21 ERGAVSVIVAIMLVVLLGFGAVAVDVGMLYAERTQLRN-------------GADAAALAV 67 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 K + + P + + N + R ++ ++ RK + Sbjct: 68 AQKCAKSAPSSSDADCSNTSTLAASLANSNANDGRSNIKSIILDTTNRKVTVTAGAQEKG 127 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 F R +G+ S + + S Sbjct: 128 KSPNEVSLFFARVLGMNSAEVNAPSTVVWGSPEKGTS 164 >gi|194209576|ref|XP_001915257.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Equus caballus] Length = 1016 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 75/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 143 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 202 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 203 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 262 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 263 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 322 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K + +R+ T FSV + + Sbjct: 323 V-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVFT--FSVGQHNYD 369 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 370 RGPIQWMACENKGYYYEIPSIGAI 393 >gi|194433366|ref|ZP_03065646.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194418460|gb|EDX34549.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|320178755|gb|EFW53718.1| hypothetical protein SGB_04028 [Shigella boydii ATCC 9905] gi|332090753|gb|EGI95846.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 575 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 32/332 (9%), Positives = 94/332 (28%), Gaps = 30/332 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 142 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 256 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 257 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QAAKGFIKGG 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 312 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEAMMVR 364 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 365 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 394 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 54.6 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 59/193 (30%), Gaps = 24/193 (12%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +DSS V++ + ++++ + D +G + V + S Sbjct: 17 VFIIDSSRSVSHY-DFEKVKEFILTILQFLDIS---PDATHVGLIQYGSTVKQEFSLK-- 70 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + I + + T A+Q A + S E + I+++T Sbjct: 71 TFRRKQDIERAVKRMMHLGTGTMTGLALQYAVNIAFSETEGARP---LRQNVPRIIMIVT 127 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEAN 432 DG I KA++ GI I I L + S F Sbjct: 128 DGRPQDP---VAEIAAKARNSGILIFAIGVGR------VDMNTLKSIGSEPHEEHVFLVA 178 Query: 433 STHE---LNKIFR 442 + + L F+ Sbjct: 179 NFSQIETLTSAFQ 191 >gi|188592039|ref|YP_001796637.1| hypothetical protein RALTA_B0200 [Cupriavidus taiwanensis LMG 19424] gi|170938413|emb|CAP63400.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424] Length = 562 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 15/181 (8%), Positives = 49/181 (27%), Gaps = 11/181 (6%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ G ++ A+L+ + + +DV + L+ A ++A+ L ++ Sbjct: 10 RTRRNTRGAVSVMAAVLIATVAIAALVSIDVGHVFMRQRQLQNMVDLAAMSAAQQLKRAD 69 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR-----------EVRDIVRDTAVE 121 + A T + + + Sbjct: 70 SPANLNAAVLGTVSNIGAKNGYPSGIAMGCAEAAGGGADAMTACVGVWDPATAGPRHFSA 129 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + + + + L +F G + + + + V+ + + Sbjct: 130 VYNAATVSPNAVRVQATQTVPILFVFSGGSGRQLFAESIASGSPPVAAFSLGTGLLDVST 189 Query: 182 V 182 Sbjct: 190 A 190 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 65/211 (30%), Gaps = 28/211 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + R A + ++ T+++G F + + S Sbjct: 107 DISLSMVCDDVRPTRVDAARQAAIKFVDEMEP------TLQLGLVTFAGTAQTLIAPSSD 160 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + D+ TA + + TA I + + IVL + Sbjct: 161 HEIVKHALDEAIRPDKLAAR-TATGEGIYTALQQIET---LSSILGGKSKAPSARIVLES 216 Query: 376 DGENTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQE-----------KARYFL 419 DG+ T ++ +AK++ + I +I+F + L Sbjct: 217 DGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSISFGTTRPIPYVNIQGSRVPVPADDASL 276 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 A S FF A S +L+ ++ Sbjct: 277 QKVAELSGGKFFTAGSLDQLSDVYSSLNAEI 307 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 60/180 (33%), Gaps = 21/180 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIV 324 + ++A + RS+ + + V + D + V K + Sbjct: 94 TRLDAAKEASKNFSRSVSEESELGFMVYGTKVGNSPEEREAGCKDVTTLLPVGKGNAGKI 153 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 N G T + A++ A + + ++ IVL++DGE+T Sbjct: 154 SGEVDKVNASGHTPMGPALKQAAKELPNEG-------------ERSIVLVSDGEDTCAPP 200 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + QGI I T+ F V+ +++ + A + +A L + + Sbjct: 201 PVCDVAKDLHKQGIDLTINTVGFLVDPAARKELQCIAE--AGGGEYLDAQDAESLAESMK 258 >gi|325292999|ref|YP_004278863.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] gi|325060852|gb|ADY64543.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] Length = 579 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/287 (11%), Positives = 89/287 (31%), Gaps = 9/287 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + +K TG+ I L P+ +G+ + VD + + L+Q A + S Sbjct: 4 ISSLLPRFLKDSTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQTA--DLAAISA 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + E + + + ++ + L+ E V + Sbjct: 62 AANAADPEKAVQQYFALNGMNLGVKTPNGLLTVDGLQPFDPLNEFAKSNGYAEVVKGHYE 121 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + + + ++ ++ I + T + S + Sbjct: 122 PDATLPV-GQRFVENALPTNAMKVNIVEKGHIFFASAFTTPPKISAAGTASSQKIAAFSV 180 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S L + LN T S + + + + + ++V +L L Sbjct: 181 GSRL---ASLDEGILNSLLGGLLGTTVSLKVMDYQALVAADVNALHVVE---ALAVDLNL 234 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + + + K ++ A+A++++++ K N ++ Sbjct: 235 TAGTYKDVLKTEITYGKFLDALTKTTGLQPAVANILKTLAKTANKSN 281 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 67/187 (35%), Gaps = 23/187 (12%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 I + V+++++ + + + R+G F ++ S L I Sbjct: 114 PGGITRLQAVKNSVSKFVAA-------RQSDRIGLVIFANQAWPFAPVSEDKQALQTRI- 165 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 T TAI DA+ A + SS N +A K +LLTDG +T Sbjct: 166 -TQLSPGMVGEQTAIGDALGVAVKLLDSSA---------NQDASKLAILLTDGNDTASQL 215 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF--LSNCA---SPNSFFEANSTHELNK 439 A + +++ TIAF + + L A S+ ANS L+ Sbjct: 216 APPLAAQLAAAHHVQVHTIAFGDSNSAGSDHVDLTQLQEIARITGGKSWTAANSGASLDS 275 Query: 440 IFRDRIG 446 ++++ Sbjct: 276 VWQEIDA 282 >gi|219848163|ref|YP_002462596.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542422|gb|ACL24160.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 914 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 51/407 (12%), Positives = 105/407 (25%), Gaps = 35/407 (8%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 +++AA I P + E+ + L+ + + Sbjct: 194 IERAADALITAFVAPKMAQAEQTIPLTVTIESAVSGAATVQLLVDGRLQTEVPLDLVIGD 253 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + + TA P + V L + +D N ++ + L+ + V+ Sbjct: 254 NQLTITAPAGEPGFRQFGVRLIAPFDTQPANNQAVAFTFVSGPPRLLLLTTTPDQVTALV 313 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 I V+ + R V + + +RD Sbjct: 314 AAWQATGITVVVQEPSQVPADPRALRAFDTIALVDTPADEVPTALQRALTTYVRDNGGGL 373 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ +S + L + LV D S+ + Sbjct: 374 LVIGGPRSFGAGGWRRTLLEPILPVALDPPLREERPDLALVLVIDRSGSMRELVDDGRTQ 433 Query: 292 NDTVRMGATFFNDRVIS------------DPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 D R + + + ++ G T I Sbjct: 434 LDLAREAVYQASRGLTQRDQIALIAFDSIADTLLPLQPLPGLFTIEDALSRLVAGGGTNI 493 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + A +TI +S H ++LLTDG E + ++QGI Sbjct: 494 RSGIALAAETIATSQARIRH-----------VILLTDGV---SETEYADLVADLRAQGIT 539 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + IA ++ L A ++ +L ++ + Sbjct: 540 VSAIAIGLDTD------PALERVAQIGGGKYYLVQRVPDLPQVVLEE 580 >gi|300704937|ref|YP_003746540.1| hypothetical protein RCFBP_20766 [Ralstonia solanacearum CFBP2957] gi|299072601|emb|CBJ43951.1| conserved exported protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 340 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/337 (10%), Positives = 89/337 (26%), Gaps = 21/337 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G +++ +L+ + L +G M +D+ + L+ AA A + + L + + Sbjct: 6 RRQRGAVGVLSPILLIIFLAIGAMAIDIAHLFVVRNELQNAADAAALAGAAGLYPANPKP 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + K+ + + + P + V Sbjct: 66 NWSNGVAQGTSAIKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSND---VAGV 122 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + +P + G +W+ A + + S F ++ D Sbjct: 123 QVTVTRSPGNNGGPVSGWLTWVFNGGAASIQATAVAVIAAPGSANPGSLFPVALNKCLFD 182 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 PY++ N S Sbjct: 183 LYWNYTTGQPLN-----------------DPSTGQPYVIDINTSYPPSGGTCASGEWTGF 225 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + D+S+ ++++ + ++ I I + + + + G Sbjct: 226 DGPTDASTEKNLVSSGNPTNISIGENIN-ISTGVKTSVYNAIPPLPLTVTMPVVSPLTPG 284 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + GS + + TA +++ Sbjct: 285 ATSPVYAFAGFTITKIVTNGSHSYIEGHFTANQKVVN 321 >gi|89070480|ref|ZP_01157773.1| hypothetical protein OG2516_06379 [Oceanicola granulosus HTCC2516] gi|89043884|gb|EAR50075.1| hypothetical protein OG2516_06379 [Oceanicola granulosus HTCC2516] Length = 432 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 5/156 (3%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L++ G + L + L VG + +DV L+ A A V Sbjct: 4 PRRLLRREDGAATVFALFLFILCLTVGAVALDVSHAYAARTRLQAATDATAHAAIVAREY 63 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + Y ++ D + T V A Sbjct: 64 MTADEARAKARAIGEGHLPAGRYGDIFAPQHVTFGTWDGDT-----RTFVPDETASGAVL 118 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 V D + + +GI W I +A E+ Sbjct: 119 VRGMRGLDGGTPAPTFLFKLVGIDWWDIVVEALFES 154 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 50/152 (32%), Gaps = 21/152 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + +++ + ++G F + Sbjct: 98 DISNSMLAEDVAPSRLEKSKLLVENLMSRFSE-------DKIGLIVFAGEAFVQLPITSD 150 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + T I A+Q A ++ N +A K I+L+T Sbjct: 151 YVSA-KMFLDNINPSLIGTQGTDIGKALQLAANSFTP-----------NSKAGKAIILIT 198 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 DGE+ + A+ +A+S+GI++ + Sbjct: 199 DGEDNEGGA--EAMAKQARSKGIKVFILGIGS 228 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 59/189 (31%), Gaps = 28/189 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D+S + + + A + V + +G FN + Sbjct: 312 DTSGSMLSHNRLNRMNQAAKQFLMQT-----VENGSWVGMVRFNSTATILNKLIQIISSN 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I E + IVLLTDGE+ Sbjct: 367 ERNTLLEKLPT-RAQGGTSICSGIKSAFQVI---GELYSQLDGSE------IVLLTDGED 416 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + C ++ K G + IA + + +SN + S A+ E Sbjct: 417 N-----TASSCIDEVKQSGAIVHFIALGKDADKAVIE---MSNI-TGGSHLYASDEAENN 467 Query: 437 -LNKIFRDR 444 L F Sbjct: 468 GLIDAFGAL 476 >gi|327274818|ref|XP_003222173.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1097 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 56/184 (30%), Gaps = 23/184 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAI 339 V I + T R+G F+ +V + + +K +G T Sbjct: 737 VKTVIDAVTVNQATARVGIINFSHKVDVVSTLQQYP---NKESLKGAIDVMQYLGEGTYT 793 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGI 398 A+ A + S+ K +++TDG+ + + + + A + I Sbjct: 794 ATAISNATNIFKSARPGVR----------KVAIVITDGQADRRDPNTLEVVVKDAHASNI 843 Query: 399 RIMTIAFSVNKTQQ-EKARYFLSNCASPN---SFFEANSTHELNK----IFRDRIGNEIF 450 I I + R + AS ++ + L +F+ N+ Sbjct: 844 EIFVIGVVQKSDPNFDSFRKEMDLIASDPDIEHVYQIDDFITLQALENKVFKQICENDFS 903 Query: 451 ERVI 454 + Sbjct: 904 VYIT 907 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 42/143 (29%), Gaps = 14/143 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVN---DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + + + + +++ F+ V D F + Sbjct: 65 FDKQKEFVTLLSDKLFLMKPTRVLRYDIKLAIMQFSSSVRIDYPFDEWRSLPDFKLRVKE 124 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 T A+ A + + K +VL+ DG + + + Sbjct: 125 MTYI--GHGTYSYYAISNATQLFKTEGR---------KSSVKVVVLMADGIDHPKSPDVQ 173 Query: 388 AICNKAKSQGIRIMTIAFSVNKT 410 AI A++ GI +TI S Sbjct: 174 AISEAARTFGISFITIGLSNVAD 196 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 46/386 (11%), Positives = 99/386 (25%), Gaps = 33/386 (8%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + L L E + I++ + F L + +V + Sbjct: 185 TTALSTLLAEFYAATGKQRDLTVADIQKPESQEFIRKLGQGIKHYGYNTLVFSENMRKFG 244 Query: 125 RKSAYQVVLSSRYDLLLN----PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + N P + + + S S + Sbjct: 245 MGYISAFPMEEITLIDFNKNKNPPTPLVAIYPREGTFWHDNPFILMTSASDQERRAAEQF 304 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKS 239 + S +P N +D +Y G + + +V+ + Sbjct: 305 YDFLLSDESQRLAMSYGFRPANVNVPLSDPISLAYGVQPQGVQSVLPVPPAEVIVAAKNA 364 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + ++ K + L + + I D R+G Sbjct: 365 WSLNRKRADILLVVDVSGSMEGD-------KLEAAKAGLGTFLSRILPED------RVGL 411 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + R + G TA+ DA+ + + S Sbjct: 412 VTFSTESRLVVPPAPLSD--TRIRLDDAIAVMRAQGRTALYDALIDGKEALDS------- 462 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + + IVLL+DG + + + GI I +A+ + L Sbjct: 463 LPSTGDDRIRAIVLLSDGLDNSSRATLEQVRLAFEESGISIFPVAYGADAD-----TDAL 517 Query: 420 SNCASPNSFFEAN-STHELNKIFRDR 444 A+ + ++ +IF + Sbjct: 518 QQIATFSRTILVQGDAGDIGQIFENL 543 >gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] Length = 808 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 49/170 (28%), Gaps = 24/170 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K H + + I +K D R FN+ + Sbjct: 297 KLHTLASGVKKAIGQLKPED------RFRVVAFNNTAFDLNRGWVSATEANLRETFARLD 350 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G T + + A + + + ++L+TDG Q + A Sbjct: 351 QLNSNGGTNVYAGVHLALERLDADRVAT-------------LILVTDGVTNQGIVDPKAF 397 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELN 438 Q +R F + + + C AS S+ +++ ++ Sbjct: 398 YKLMHKQDLRFY--GFLLGNSSNWPLMQLM--CDASGGSYRAVSNSDDII 443 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 18/166 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ +IR + D R+ FND V S V R+I Sbjct: 101 SRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 154 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TA+ A++ A + + + +I+LLTDG++T Sbjct: 155 IDRLQARGGTALMPALEEAVKILDERQGGSRNHVG-------FILLLTDGDDTTGFRWTR 207 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + A ++ + T + + S ++ + Sbjct: 208 DAIHGAVAK-YPVHTFGLGASHDPEALLHIAQ---GSRGTYSFVDD 249 >gi|219847682|ref|YP_002462115.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541941|gb|ACL23679.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 44/155 (28%), Gaps = 15/155 (9%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D V + + R+ + E G TA++ MQ A + Sbjct: 83 VIFDDTVQTLIPATP---VGDRSALLAAVETITEAGGTAMSLGMQAAQTELQK------- 132 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + ++LLTDG+ D + G+RI + +Q Sbjct: 133 --HLGPDRISRMLLLTDGQTWGDEPICRDLARTLGQAGVRITALGLGTEWNEQLLDDI-- 188 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 AS ++ F+ + Sbjct: 189 -AAASDGYSDYIADPAQIETFFQQAVKEAQAVVAT 222 >gi|331007462|ref|ZP_08330636.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] gi|330418739|gb|EGG93231.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] Length = 699 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/272 (11%), Positives = 59/272 (21%), Gaps = 30/272 (11%) Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 + + + + ++ + L+ R + V + E Sbjct: 213 NATIQQLSPNEWSASLVNQANLEEAGNTQNVITLSQDIVVYWRHKEGLPGSVDLVTYKPE 272 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + L + S VR L + Sbjct: 273 ANQRGTFMLTLTPGDDLGKVTTGSDWIFVLDISGSMSGKYATLVEGVRQGLEKL------ 326 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 R FN+ + S V + V + G T + +Q Sbjct: 327 ----RQQDRFKVVLFNNGSVDLTSGFLTVSQANVANVLQQLDNYKVGGGTNLYAGLQKGL 382 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 I + I+L+TDG E +R+ T Sbjct: 383 TGIDADRPAG-------------IILVTDGVANVGQTEKKTFLKLLNDHDVRLFT----- 424 Query: 408 NKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 R L A S ++ ++ Sbjct: 425 FIMGNSANRPLLKEMADVSNGFAMSVSNADDI 456 >gi|14248631|gb|AAK57601.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 89/305 (29%), Gaps = 29/305 (9%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 P + + + S S ++ + S +P Sbjct: 264 PPPTPLVAIYPAEGTFWHDNPFIVMASTSADEQAAAERFYEFLLSEESQRAAMQYGFRPA 323 Query: 202 NCFGQPADRTVKSYSSQNGKVGIR-DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 N D ++ + V + +V+ S + + ++ Sbjct: 324 NPNVPLTDPISPAFGADPQGVQTVLAVPSAEVIVAVKDSWKLNRKRADILLVVDTSGSME 383 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 K +V+ + + + I D R+G F+ + + Sbjct: 384 GD-------KMTMVKAGIETFLMRILPED------RLGLITFDSQARLVVPMAPLSE--N 428 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R ++ + G TA+ DA+ A T+ + E + + IVLL+DG + Sbjct: 429 RIDLQIAVQEMRASGRTALFDALDLARQTLEALPPAE-------DDRIRAIVLLSDGADN 481 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 + + GI I +A+ + R L A + + ++ + Sbjct: 482 ASRLTLEEVRRQFDESGITIFPVAYGSDAD-----RQVLDAIAEFSRTIVVVGDSGDIAQ 536 Query: 440 IFRDR 444 IF + Sbjct: 537 IFENL 541 >gi|293415564|ref|ZP_06658207.1| yfbK protein [Escherichia coli B185] gi|291433212|gb|EFF06191.1| yfbK protein [Escherichia coli B185] Length = 575 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/332 (9%), Positives = 94/332 (28%), Gaps = 30/332 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 142 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 256 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 257 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QAAKGFIKGG 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ +I + K G+ + T + + Sbjct: 312 INR-------ILLATDGDFNVGIDDPKSIESMIKKQRESGVTLSTFGVGDDNYNEAMMVR 364 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 365 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 394 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 21/200 (10%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +D SS K A + ++ ++D N FN V Sbjct: 134 GRPTQVVIDIDGSSSMRRSDPKRERVRAAKRFVETLSRVDKRNQFG---VIEFNTTVEER 190 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 G+ + +GSTA+ ++ + D + S + ++ Sbjct: 191 APMGSGMKATS-----DAIQAVDAVGSTALYTSLIRSIDALEGSGK---------TGYRR 236 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY-FLSNCA--SPN 426 I++LTDG++T + + N+AK+ +RI ++ Q+ + + Sbjct: 237 AILVLTDGKDTASSHGVATVINRAKAAKVRIYVVSLGGAGDQKGLGYVGPMQRLTTETGG 296 Query: 427 SFFEANSTHELNKIFRDRIG 446 F + +L F D I Sbjct: 297 VFTHVDRADDLVARF-DAIA 315 >gi|147899676|ref|NP_001087858.1| collagen alpha-1(XXI) chain precursor [Xenopus laevis] gi|82234134|sp|Q641F3|COLA1_XENLA RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|51950065|gb|AAH82384.1| MGC81791 protein [Xenopus laevis] Length = 957 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 26/206 (12%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + ++ F+ S + +++ + ++ + ++G ++D Sbjct: 29 SSCRTAPNDLVFILDGSWSVGPENFEILKKWVVNITSNFNI---GPKFTQVGVVQYSDYP 85 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 I + G ++ I + + + G+T +A+Q A D + + + Sbjct: 86 ILEIPL--GSYESIDDLSRRTQSIQYLGGNTQTGNAIQFAIDNLFAR---------SLRP 134 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--- 423 K ++LTDG + ++ I +A+ I + I E L A Sbjct: 135 LTKIAIVLTDG---KSQDDVKHIAEEARKNKITLFAIGVGS-----EIEESELRAIANKP 186 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 S F +++ R+ + ++ Sbjct: 187 SSTYVFYVEDYIAISR-IREIMKQKL 211 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 + + +D D R+ + V + L + +K T Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIEFLLK---SHLTKDTIKQAITRIEPLAAGTMT 143 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A++ A D + + K K +++TDG + ++ + A++ GI Sbjct: 144 GMAIKKAMDEAFTEKSGARPKSK---NISKVAIIVTDG---RPQDQVEEVSAAARASGIE 197 Query: 400 IMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 I + L AS + F + +L FR+ + Sbjct: 198 IYAVGV------DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETL 243 >gi|82703475|ref|YP_413041.1| hypothetical protein Nmul_A2358 [Nitrosospira multiformis ATCC 25196] gi|82411540|gb|ABB75649.1| putative membrane protein [Nitrosospira multiformis ATCC 25196] Length = 437 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 17/88 (19%), Positives = 35/88 (39%), Gaps = 8/88 (9%) Query: 3 FDTKFIFYSKKLIKSC--------TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK 54 D S +L ++ G II AL + V++G G+ +D+ + + L+ Sbjct: 1 MDKAMKRVSDRLRRNRNHPGCRQEKGVVAIIVALSLVVLVGFAGLALDLGKLYVAKSELQ 60 Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNS 82 +A + A+ L + A+ + Sbjct: 61 NSADACALAAARELNGANTNQLVLAEAA 88 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A++ G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARASG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 320 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 375 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 428 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 429 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 458 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A++ G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARASG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 499 >gi|14248665|gb|AAK57618.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYKNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 3 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 59 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A++ G Sbjct: 60 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARASG 113 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 114 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 152 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 268 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 322 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 323 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 376 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 377 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 406 >gi|291399639|ref|XP_002716220.1| PREDICTED: collagen, type VI, alpha 6 [Oryctolagus cuniculus] Length = 2273 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 59/178 (33%), Gaps = 13/178 (7%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +I +KK D VR GA + D ++ Sbjct: 820 DYDEYNIMKDFMIGLVKKADVGKAQVRFGALKYADDPEVLFYLGDL--DTKMEVISMLQN 877 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D+ G+T +A+ + + +H+ + ++++TDGE + D E+ Sbjct: 878 DQPMGGNTYTAEALAFSDHMFTEARGSRLHKG-----VPQVLIVITDGE-SHDAEKLNGT 931 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + +GI ++ + L+ S + +F + L IF D + Sbjct: 932 TKALRDKGILVLAVGI-----AGANPVELLAMAGSSDKYFFVETFGGLQGIFSDVSAS 984 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 46/147 (31%), Gaps = 18/147 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR+GA + F G + + + G T A+ + + Sbjct: 472 VRVGAVQYASSWD--LEFEIGKYSNKHDLGRAIENIRQLGGDTNTGAALNFTLRLLQKAK 529 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + ++V+LT+G + E + + + IR+ I +E Sbjct: 530 Q------QRGNKVPCHLVVLTNGMSKDSILEP---AKRLREENIRVYAIGV------KEA 574 Query: 415 ARYFLSNCASPN-SFFEANSTHELNKI 440 + L A + + L I Sbjct: 575 NQTQLREIAGDEKRVYYVHDFDALKDI 601 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 53/171 (30%), Gaps = 15/171 (8%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V ++ D + V++GA F+ + G + + Sbjct: 1011 DPSDFKKMKEFVASVVQDFDVSLNRVQIGAAQFSHTYQPEFPL--GTFTDEKEVSFHIEN 1068 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T I A+ + N + +++LTDG Q +E Sbjct: 1069 IQQIFGYTHIGAALHQVGRYFQP-----DMGSRINTGTPQVLLVLTDG---QSQDEVAQA 1120 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + G+ I ++ + + + + ++ EL KI Sbjct: 1121 AEELRHNGVDIYSVGIGNVDH-----QQLIQITGTADKKLTVDNFDELKKI 1166 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 45/148 (30%), Gaps = 15/148 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D +R+G +++ S S G + R+ V + A Sbjct: 249 GFLQESVSALDIKESCMRVGLVAYSNETEVIGSLSEG---VNRSAVLHRIQSLSPGAGEA 305 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A I + N + +VL+T + + + QG+ Sbjct: 306 YTGA---ALRKIRKEVFGARGGSRKNQGVPQIVVLVTHRASEDNVTR---AAVNLRRQGV 359 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN 426 + T+ + L AS Sbjct: 360 TVFTLGVEG------ASATQLEKIASHP 381 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 42/124 (33%), Gaps = 18/124 (14%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + +G T + + ++ +K+++L+TDGE Sbjct: 679 MSQSDISNAIDRMAHIGETTLTGG------ALTFVSQYFSPAKGARPNVRKFLILITDGE 732 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHE 436 + + +G+ I ++ F N TQ L + P F + Sbjct: 733 A---QDVVKEPAVALRQEGVIIYSVGVFGSNVTQ-------LEEISGRPEMVFYVENFDI 782 Query: 437 LNKI 440 L +I Sbjct: 783 LQRI 786 >gi|197124353|ref|YP_002136304.1| von Willebrand factor A [Anaeromyxobacter sp. K] gi|196174202|gb|ACG75175.1| von Willebrand factor type A [Anaeromyxobacter sp. K] Length = 480 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 53/186 (28%), Gaps = 21/186 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ ++ + D G F+ V + + + + +K Sbjct: 60 ATQSIMKLVDHLAPGD------FCGVVVFSTEVETLAAPTEMTQD-RKDALKVALGRLRP 112 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-EEGIAICNK 392 +T + + + + +V ++L TDG + +C Sbjct: 113 RHNTNLAGGLLAG---LDHAKVTKVPDGMPVR-----VILFTDGLANEGPATSPEGLCAL 164 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 ++ + T + S + + L + ++ S + F +G + Sbjct: 165 LEAN---LGTASVSAFGYGDDADQELLRELSTLGRGNYAYVRSPEDALTAFARELGGLLS 221 Query: 451 ERVIRI 456 RI Sbjct: 222 TYAQRI 227 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 63/172 (36%), Gaps = 21/172 (12%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I ++ ID++ +G F V D S + + + ++ + G TAI D Sbjct: 388 IAALDFIDHLPSNANVGLIHFGTLVTVDHSLTNDIGAVRQS-----ISELKPEGQTAIYD 442 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAICNKAKSQGIRI 400 A+ +Y + + + +IVL++DG +T + +I KA I Sbjct: 443 ALAISYTQL------------RRAKGQTFIVLISDGADTASKGDNYDSIVAKATKANIPT 490 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 I + + + + + ++ S +L F + E+ + Sbjct: 491 YIIGLTSPEFDGQLLEDLQRD--TKAMIYQTPSKEQLG-GFYTEVAQEVSGQ 539 >gi|313225346|emb|CBY06820.1| unnamed protein product [Oikopleura dioica] Length = 369 Score = 54.2 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 54/167 (32%), Gaps = 20/167 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 R A + + + D R+ ++D + K ++ + Sbjct: 106 FRKAKDFMKSLVNPFEIGWDYTRVSVLQYSDDPRIEFYLKDYQDKTT--LLNAIDAITYK 163 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T +A++ I S + KK++VLLTDG++ D A Sbjct: 164 GGNTRTGEAIRYMMGQIFSV------EAGSRPYVKKHMVLLTDGQSQDDVGAP---ARAA 214 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 K+ IR I + L A+P ++ + + Sbjct: 215 KNFNIRTFAIGVG------DAIEDELKLVATPPFSDTLYHVEDYDGI 255 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 51/201 (25%), Gaps = 58/201 (28%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII---- 351 R+G F + KL TAI + T+ D I Sbjct: 127 RIGLVTFAGSAGLLVPPTDDTDKL-----LAALKSLTTSRGTAIGQGILTSIDAIAEVDP 181 Query: 352 --SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 +V A IV+LTDG NT + +A ++ +R+ TI F Sbjct: 182 SVPPTGADVPGGTGGEYAADVIVVLTDGANTVG-VDPRTAAGEAATRRLRVYTIGFGTTT 240 Query: 410 --------------------------------------------TQQEKARYFLSNCAS- 424 + L AS Sbjct: 241 PAPMVCDSSQIGDDAFGFGGGFGGGGGGGFGGGGGGFGGRNGERDPRAIDEDALKQVAST 300 Query: 425 -PNSFFEANSTHELNKIFRDR 444 +++ A + +EL D Sbjct: 301 TGGTYYRAQNANELQDALTDL 321 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 23/225 (10%), Positives = 61/225 (27%), Gaps = 41/225 (18%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + +++ + + R+G F + + Sbjct: 98 SNSMLADDIQPSRLQKSKYTISNFLERL-------GNDRVGLVVFAGQSFVQCPITSDKS 150 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 L + + + D T + A++ + + E+ + N K I++ +DG Sbjct: 151 AL-KLFMDIVSTDAIPTQGTNFSSAIRESIRAL-ERIEEGAEAEEKNRVRNKVILIFSDG 208 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------------------E 413 E+ + +A S+ IRI T+ + Sbjct: 209 EDH--EAGIDEVLEEAASKNIRIYTVGVGSAEPTPIPVLNKDGKRVDFKRDSQGSVVTTH 266 Query: 414 KARYFLSNCA--SPNSFFEA----NSTHELNKIFRDRIGNEIFER 452 L A + +++ + + E+ + Sbjct: 267 LQEALLRKIAEQTKGNYYRIAPQGSDFELIADDINKLEKQELSAK 311 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 60/190 (31%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + + + +DR + L RT KT Sbjct: 60 TRMAAAKQAFNEVLDATPEEVQLGIRTLGADYPGDDRKTGCKDTAQLYPVGPLDRTEAKT 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ A D + + K IVL++DGE+T + Sbjct: 120 AVATLSPTGWTPIGPALLKAADDLD------------GGDGSKRIVLISDGEDTCAPLDP 167 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N + + A+ ++ T EL Sbjct: 168 CEVAREIAAKGIGLTIDTLGLVPNTKMRRQLSCIAE--ATGGTYTSVEHTDELTDKVNQL 225 Query: 445 IGNEIFERVI 454 + V Sbjct: 226 VDRAADPVVT 235 >gi|163848654|ref|YP_001636698.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526590|ref|YP_002571061.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669943|gb|ABY36309.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450469|gb|ACM54735.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 947 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 61/193 (31%), Gaps = 34/193 (17%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 +K + +DA+ + D V F+ + + V Sbjct: 439 SSERKIDIAKDAIVQAAALLGPQDTVG------VVTFDGAASATFPAT---RGATVEQVM 489 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I + A + + + K+++LLTDG + ++ Sbjct: 490 DAVSGVEPRGPTNIRAGLLRAEEMLQQVDARI-----------KHMILLTDGWGSGGDQL 538 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 + + + QGI + +A +L A ++ A E+ +IF Sbjct: 539 --DLAARLREQGITLTVVAAGSGS------AAYLKQLAAEGGGRYYPAADMAEVPQIFVQ 590 Query: 444 ----RIGNEIFER 452 IGN I E+ Sbjct: 591 ETITAIGNYIVEQ 603 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + I K + ++A +++ D+V F+D ++ Sbjct: 422 YGITKLEMAKEAAIRSTEALRPTDSVG------VICFDDAASWVVGMRQ---ADDLAEIQ 472 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T + A+ AY + + K+I++LTDG++ + Sbjct: 473 DSIGTIRPGGGTNMYPALDLAYKALEEA-----------DTKLKHIIVLTDGQSATGD-- 519 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 I ++ GI + ++A ++ + LS A ++ + + KI Sbjct: 520 FDGIAHRMAEDGITLSSVAVGMDAD-----KNLLSRLAEIGNGRYYYTDEFSNIPKI 571 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 16/148 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++ L +++ + R+G F+ ++ + L Sbjct: 89 SMDANDVAPSRLEKIKYELKNIVDAFNS-------DRIGLIIFSSEAFVQCPLTYDQNAL 141 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++T ST A+ A++ + S ++ + K I+L++DGE+ Sbjct: 142 N-LFIETLNTGLVPGSSTDFGSALNMAHEKLTSEA------APSSQQKSKIIILISDGED 194 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D+ +K GIR+ ++ Sbjct: 195 FGDD--TEGAVSKINDSGIRLFSLGVGT 220 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 59/192 (30%), Gaps = 30/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + + V +G F S + RT+ Sbjct: 110 DRLTAAKEAAKKFVTEL------PNGVNLGIVSFAGTASLLVSPTP-----DRTLALNAV 158 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 TA + + Y +I S + I+L +DG+ T + Sbjct: 159 DKLELAQRTATGEGI---YTSIQSIKNIRDVLGGEDNAPPARIILESDGKQTVPTDLDDP 215 Query: 388 ----AICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 KAK +GI I TI+F L A S FF A+ Sbjct: 216 RGGFTAARKAKEEGIPISTISFGTTSGSVNIGGQNIPVPVDDASLKRIAELSGGQFFAAS 275 Query: 433 STHELNKIFRDR 444 S ++LN+ + Sbjct: 276 SLNDLNEAYGSL 287 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 44/122 (36%), Gaps = 13/122 (10%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T + + + + + E + + + +V+L+DG T + I + Sbjct: 47 GGGTNMAPGLFISREIL------ERPIFPSQIYLRPVVVVLSDGL-TSHPAKTSEIATQL 99 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 K + I+T+AF + + S F+ + +L F +G + + Sbjct: 100 K-KDADIVTVAFGDDADEPYLISLA----TSSEHFYHCRTGTDLRAFFAS-VGTTLSVSL 153 Query: 454 IR 455 R Sbjct: 154 QR 155 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 61/190 (32%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKT 326 + + A V+ + + + +DR D + + V L RT KT Sbjct: 59 SRMAAAKQAFNEVLDATPEEVRLGIRTLGANYPGDDRKTGCKDTAQLYPVSTLDRTEAKT 118 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ A D + IVL+TDGE+T + Sbjct: 119 AVATLSPTGWTPIGPALLKAADDLDGGTGSHR------------IVLITDGEDTCAPLDP 166 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++G+ I T+ N ++ A+ ++ +L Sbjct: 167 CEVAREIAAKGVGLTIDTLGLVPNSKLSKQLSCIAE--ATGGTYTSVEHKEDLTDKVNQL 224 Query: 445 IGNEIFERVI 454 + + V Sbjct: 225 VDRAADKVVT 234 >gi|221134029|ref|ZP_03560334.1| vault protein inter-alpha-trypsin [Glaciecola sp. HTCC2999] Length = 757 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 30/423 (7%), Positives = 97/423 (22%), Gaps = 37/423 (8%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 L A + I+ + + + + + + N + Sbjct: 170 LGAAVDELYMQIGTRTIEGVIQ---EKAQAKKTFNKAKAGGKKASLVEQKRPNLFTNTIA 226 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 +I + + Y + L + + S ++ H Sbjct: 227 NIGPGEHISITISYQNIAQYTDGEYLVRLPMTLSQRYAGELNSVDGSGESLDPVDPDYAH 286 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 K++ + + ++ + + + + + + + L Sbjct: 287 KQYAMDVVLTLNTQSADTRITSTTHHIESEAIDKGIYQVRLAEEDVANRDFVVSWSLGNT 346 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI---- 288 + + + ++ V+ S + Sbjct: 347 DDAVVGHYQEYIDGTTYGLIHVVPPVISHDPSSMASTVTPSIQQNTIFVLDSSGSMHGTA 406 Query: 289 ------------DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + + F+ + S ++ + F + G Sbjct: 407 LTQAIDAIREGVSYLTEHDTFNIVDFDSEARALWRQSQFADEVSKAEAMRFLRHVDSDGG 466 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + DA+ + ++ S+ ++ +TDG + E I + + Sbjct: 467 TNMQDALALSLTQLLDSSTGLTQ-----------VIFVTDGSINNERELLKQIAEQLGDK 515 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI-- 454 R+ T+ + ++ + E+ + Sbjct: 516 --RLFTVGIGAAPNSHFMEYAAML---GKGTYTYIDDLTEIQPKMAYLFSQLRSPMITDI 570 Query: 455 RIT 457 ++T Sbjct: 571 QLT 573 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 54.2 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 18/166 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ VIR + D R+ FND V S V R+I Sbjct: 98 SRLDVLKASMKFVIRKLADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 151 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TA+ A++ A + +R+ +I+LLTDG++T Sbjct: 152 IDRLQARGGTALMPALEEAVKILDERQGSSRNRVG-------FILLLTDGDDTTGFRWTR 204 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + A ++ + T + + S ++ + Sbjct: 205 DAIHGAVAK-YPVHTFGLGASHDPEALLHIAQ---GSRGTYSFVDD 246 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 63/189 (33%), Gaps = 23/189 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + A+ +V K ID D V +G FND V + + ++ Sbjct: 115 MKGDKIITAINAVK---KFIDQTIDYVLIGLITFNDHVRIAIPPTS-----DQELLYKKL 166 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGI 387 + G T + ++ AYD ++ E + I+ +TDG +QD Sbjct: 167 GEIKAFGGTIYSKPLEIAYDWLVPFAEFNLSPT---------IIFVTDGLPYSQDAPLYR 217 Query: 388 AICNKAKSQGIRIMTIAFSVNK---TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + K I I I + A+ L A + F+ T+ L +F Sbjct: 218 EVVYKCARYNITIYPIFIETPGMSIYETMMAQQRLREIANITKGQFYNVKQTNSLINLFE 277 Query: 443 DRIGNEIFE 451 + + Sbjct: 278 KLAEKTVSK 286 >gi|254820232|ref|ZP_05225233.1| hypothetical protein MintA_09906 [Mycobacterium intracellulare ATCC 13950] Length = 327 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 63/203 (31%), Gaps = 34/203 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + A + + + +G F + I + K Sbjct: 110 NRLKAAEQAASQFASQLTPG------INLGLVGFAGTPYLLVPPTPQHQATIDALKKLDF 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 D STA A+ TA I ++ + IVLL+DG + + Sbjct: 164 AD-----STATGQAIFTALHAIGATA-----VTGGDNPPPARIVLLSDGRENKPSNPSDP 213 Query: 388 -----AICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEA 431 AK +G+ I TI+F + E + A S + A Sbjct: 214 HDGVYTAARLAKDEGVPISTISFGTKGGEIEMDGQRVAVPVSTDQMKTIARLSGGQPYTA 273 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + ELNK + + I NEI R + Sbjct: 274 TNIGELNKSY-NAIENEIGYRTV 295 >gi|108758819|ref|YP_634592.1| hypothetical protein MXAN_6470 [Myxococcus xanthus DK 1622] gi|108462699|gb|ABF87884.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 914 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 56/183 (30%), Gaps = 27/183 (14%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A V+ ++ ++ ++ + G+ + A + G Sbjct: 421 AAEGVVAALTLLNPKDEVSVHMVDTAAHEIFPLSPVEAGLP------LDAVARGFSGGGG 474 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + +A++ I+ + + ++++L +D ++++ ++ + Q Sbjct: 475 IYVGEALRAGRTEIL-----------RSEKPTRHVLLFSDAADSEEPDDYQRTLAHLREQ 523 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + I V + L A + A L +IF V Sbjct: 524 EVTVSVIGLGVPS---DPDADLLREVAHRGGGRVYFAEDAMSLPRIFSQE-----TLTVA 575 Query: 455 RIT 457 R T Sbjct: 576 RAT 578 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/289 (12%), Positives = 73/289 (25%), Gaps = 44/289 (15%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + S S+++ + + + I Y + Sbjct: 193 PLDDHVDYVESYSVIEKLTHKFQEAFCVVSDLC------ATGDHDCEQICISTPGSYKCA 246 Query: 236 CNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 C + + S + LV+ + ++ S++ + Sbjct: 247 CKEGFTLNSDGKTCSACSGGSGSALDLVFLIDGSKSVRPENFELVKKFINQIVESLEVSE 306 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 ++G ++ V + G K + I T A++ D Sbjct: 307 K---QAQVGLVQYSSSVRQEFPL--GQFKNKKDIKAAVKKMAYMEKGTMTGQALKYLVDN 361 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S + K ++ TDG + + KAK G R+ + Sbjct: 362 SFSIANGARPGV------PKVGIVFTDGRSQDYITD---AAKKAKDLGFRMFAVGVG--- 409 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR--IGNEIFERV 453 L AS +F FR IG ++ ++ Sbjct: 410 ---NAVEDELREIASEPVAEHYFYTAD-------FRTISNIGKKLQMKI 448 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 56/167 (33%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ +D ++ R+G + V ++ S H+ +++ E T A+Q Sbjct: 66 IEGLDVGPNSTRVGVINYASAVKNEFSLK--THQTKAGLLQAVRRIEPLSTGTMTGLAIQ 123 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A S E + K +++TDG ++ A +A+ GI I I Sbjct: 124 FAISRAFSDAEGAR---LRSSNINKVAIVVTDGRPQDGVQDVSA---RARQAGIEIFAIG 177 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS + S +L F++ Sbjct: 178 VGR------VDMHTLRQIASEPLDDHVDYVESYSVIEKLTHKFQEAF 218 >gi|320353059|ref|YP_004194398.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121561|gb|ADW17107.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 336 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 58/160 (36%), Gaps = 13/160 (8%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N+T ++G ++ ++ + + GSTA A+Q A ++ Sbjct: 102 NNTTQVGIVKYSSSANMVEMLQDLTS--NKSDLIATINGLSASGSTATGTAIQVATAELL 159 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 SS H K V+L+DGE + +A +QGI + T+ + T Sbjct: 160 SSRAIAGHA--------KMEVVLSDGEYNVG-IDPKIAAAQAHAQGITVHTVGVQLYGTG 210 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + A F N+ ++L +F GN + Sbjct: 211 YTSMQQT--AVAGGGIFTNVNNLNDLVALFSGTGGNLVGL 248 >gi|147898761|ref|NP_001080437.1| collagen alpha-1(VI) chain precursor [Xenopus laevis] gi|82210072|sp|Q801S8|CO6A1_XENLA RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|28703819|gb|AAH47255.1| Col6a1 protein [Xenopus laevis] Length = 1045 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 68/199 (34%), Gaps = 30/199 (15%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIR 321 + V++ I + D + A ++D VI S + + L Sbjct: 81 KPFKTLVTQVKEFTKKFIDKLTSRYYRCDRNLVWNAGALHYSDEVILINSLTRDMKTLRD 140 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + I T + A++ + ++ ++ + KY++++TDG + Sbjct: 141 NVETVEYI----GKGTHTDCAIKRGIEEVL--------IGGSHQKENKYLIVVTDGHPLE 188 Query: 382 DNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANST 434 +E N+AK GI++ ++A LS AS S F A S Sbjct: 189 GYKEPCGGLEDAANEAKHLGIKVFSVAI-----SPNHLEPRLSVIASDASHRRNFTATSA 243 Query: 435 HELNKIFRDRIGNEIFERV 453 L D I N I + Sbjct: 244 VGLTD---DEIDNTIDTII 259 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 50/145 (34%), Gaps = 18/145 (12%) Query: 319 LIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 T+++ + G+T + A++ + +R + K +++ +DG Sbjct: 911 TNYTVLEVPVDNMQFINGATNVVSALRAVTEL---------YREDSLAGVNKKLLVFSDG 961 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF---LSNCASP---NSFFEA 431 NTQ+ + + + A+S GI I +A ++ A P F Sbjct: 962 -NTQEEKGLLKVVQDAQSAGIEIYVLAVGSRLNYPNLQVMLTGSAADIAGPFPEERLFRV 1020 Query: 432 NSTHELNKIFR-DRIGNEIFERVIR 455 L + R I I + + Sbjct: 1021 PDYTSLLQGVRYQSISRRIALKSSQ 1045 >gi|319761860|ref|YP_004125797.1| von willebrand factor type a [Alicycliphilus denitrificans BC] gi|330826288|ref|YP_004389591.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] gi|317116421|gb|ADU98909.1| von Willebrand factor type A [Alicycliphilus denitrificans BC] gi|329311660|gb|AEB86075.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] Length = 348 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 60/224 (26%), Gaps = 51/224 (22%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + I+ + VR+G F + L Sbjct: 97 SMRATDVQPDRLTAAQNAAKAFIQDL------PRHVRVGVVAFAGTAQLAQLPTQSHEDL 150 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------------------------- 354 ++ TA + + A T+ Sbjct: 151 LKA-----IDSFQLQRGTATGNGIMMALATLFPDAGIDIAALGGRQSMRVRPIDEVGRAD 205 Query: 355 --EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ- 411 + + I++LTDG+ T + + A +G+R+ T+ + + Sbjct: 206 PAKKPFTPVAPGSYRSAAIIMLTDGQRTTG-VDPLEAAQWAADRGVRVYTVGVGTVQGEL 264 Query: 412 ---------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F A + +L K++ Sbjct: 265 IGFEGWSMRVRLDEDTLKAVALRTNAEYFHAATAQDLRKVYETL 308 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 51/182 (28%), Gaps = 35/182 (19%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D ++G F + + + T I A+ A + Sbjct: 128 DNDKVGLIVFAGDAFVQLPITSDFISA-KMFLDNINPSLIGTQGTDIGQAINLAMHSFSP 186 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ- 411 +++ K IV++TDGE+ + + +KA+ GI++ + Sbjct: 187 TSKS-----------GKAIVVITDGEDNEG--RAEEMASKAQKAGIQVYILGVGSTSGAE 233 Query: 412 -QEKARYFLSN--------------C-----ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 L + C A + ++ ++ I + + Sbjct: 234 IPLGNGEMLKDNSGNVVRTHLNENMCKKIAAAGKGVYIHVDNNNDAQNILKGELSKLQKG 293 Query: 452 RV 453 + Sbjct: 294 EI 295 >gi|297565996|ref|YP_003684968.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296850445|gb|ADH63460.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 717 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 47/153 (30%), Gaps = 15/153 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D +G F+ + R ++ + G T I +A A + Sbjct: 338 RDQDYIGVVAFSSSARWVFRPRPMTPQGRREA-ESLLLSVRAGGGTEIGEAYAEALQALR 396 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 K +++LTDG + ++A++ IR +A + + Sbjct: 397 -----------GLKTEDKQVLVLTDGLVQDPTLPTLQAAHQAQANKIRTNAVALGSDADR 445 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 +F++ S +L + F + Sbjct: 446 AFLRELAKQ---GGGTFYDVPSPKDLPRFFLEE 475 >gi|156382663|ref|XP_001632672.1| predicted protein [Nematostella vectensis] gi|156219731|gb|EDO40609.1| predicted protein [Nematostella vectensis] Length = 1235 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/239 (12%), Positives = 72/239 (30%), Gaps = 27/239 (11%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH----------VIKKKHL 273 +P + +++ + + + + Sbjct: 252 CTIYPAAPLKECHAYDNRLRPWYTSAAYPSTKKLVIVLDTSSSMASRVELGTKRRTRLDV 311 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAID 330 + AL++++ ++ D V + D +S + R +K F Sbjct: 312 AKAALSTILSTLLPQDKVGVVLFNSKVTLAGSSGVDECYSTRLAPAGRFNVNYLKDFINR 371 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-I 389 G T +A + A+ + S+ + ++ +++ LTDG D E I Sbjct: 372 SRPGGGTQYQNAFKAAFTLLKSAKSGDGGGEQS------FLLFLTDGGPKDDALEVERLI 425 Query: 390 CNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 K + + IMTI ++ ++ N S + + ++ + D Sbjct: 426 AQNKKEMEESRERVTIMTIGLGKDEHMKDFLGRLSKNVGS--KYSQVDNEAHMYSAIHD 482 >gi|47208832|emb|CAF90336.1| unnamed protein product [Tetraodon nigroviridis] Length = 443 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/261 (11%), Positives = 71/261 (27%), Gaps = 14/261 (5%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S + P N Q + + + + VS + + Sbjct: 152 SAPVAPLPTIPSPPNPTVQLTTPPAPLATITPEVLPVETSAPATLSVSTFTTTVSAVETE 211 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 LS + + + + +D D R+ + V Sbjct: 212 TDSSCLSRPLDLVFIIDSSRSVRPSEFEKVKIFLADMVDTLDVGADATRVAVVNYASTVK 271 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ +K + T A++TA + R Sbjct: 272 TEFLLKDHFNKPNLKKAISRIEPLAT--GTMTGLAIKTAVSEAFTEQSGARPRP---RNI 326 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 K +++TDG + ++ + A++ G+ I + ++ +++ + Sbjct: 327 AKVAIIVTDG---RPQDQVEEVSAAARASGVEIYAVGV---DRADMRSLQLMASVPLEDH 380 Query: 428 FFEANS---THELNKIFRDRI 445 F + +L FR+ + Sbjct: 381 VFYVETYGVIEKLTSKFRETL 401 >gi|61557272|ref|NP_001013220.1| chloride channel calcium activated 2 [Rattus norvegicus] gi|38175219|dbj|BAD01114.1| Ca(2+)-activated chloride channel [Rattus norvegicus] Length = 903 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/290 (13%), Positives = 84/290 (28%), Gaps = 26/290 (8%) Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + + + I F +++ + N + + S + + Sbjct: 224 KFFPDKIQTARASIMFMQNLNSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQ 283 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + ++ + +D S + + A + I Sbjct: 284 QAPPMRGTEAPPPPTFSLLKSRQRVICLVLDKSGSMDTEDRLIRMNQAAELYLTQI---- 339 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V +G F+ + T + + +G T+I ++ + Sbjct: 340 -VEKESMVGLVTFDSTAQIQNYLIKITNTGD-YKKITGNLPQQAVGGTSICRGLEAGFQA 397 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVN 408 I SS++ IVLLTDGE+ I+ C K G I TIA Sbjct: 398 ITSSDQSTSGSE---------IVLLTDGEDD-----LISSCFEVVKHSGAVIHTIALG-- 441 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFR--DRIGNEIFERVIRI 456 + + LS+ F+ + L F + ++ +++ Sbjct: 442 -PKAARELETLSDMTGGLRFYANKDVNSLMDAFSGISSASGNLSQQALQL 490 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGLQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 320 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 375 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 428 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 429 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 458 >gi|327282764|ref|XP_003226112.1| PREDICTED: collagen alpha-1(XXI) chain-like [Anolis carolinensis] Length = 956 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 64/182 (35%), Gaps = 26/182 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ L ++ + +++G ++D + + G H +V+ Sbjct: 53 FEIVKRWLVNITSNFNI---GPKFIQVGVVQYSDYPVLEIPL--GFHDSNENLVRGMEYI 107 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T A+Q A D + K++ K V+LTDG + ++ + Sbjct: 108 QYLGGNTQTGKAIQFALDHLF---------AKSSRFLTKIAVVLTDG---KSQDDVKEVA 155 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGN 447 +A+ I + I E L A S F +++ R+ I Sbjct: 156 AEARKNRITLFAIGVGS-----ETEEDELRAIANKPSSTYVFYVEDYIAISR-IREVIKQ 209 Query: 448 EI 449 ++ Sbjct: 210 KL 211 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 60/187 (32%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +++ + D R+ F+ R G+ L I + Sbjct: 58 LETVKQAAIELVKQLNVED------RLSIIAFDHRAKVLVPN-QGIDNLNTIIEQ--INS 108 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ + +D V + I LLTDGEN DNE + + Sbjct: 109 LKPAGGTAIDEGLKLGIQESANGKKDRVSQ----------IFLLTDGENEHGDNERCLKL 158 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 + A I + T+ F + L + + + + + F Sbjct: 159 AHVASDYNITLNTLGFG-----NHWNQDVLEKISDSAGGTLCYIETPDKAIEEFSRLFNR 213 Query: 448 EIFERVI 454 + Sbjct: 214 AQSIGLT 220 >gi|325473816|gb|EGC77004.1| BatA protein [Treponema denticola F0402] Length = 282 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 42/176 (23%), Gaps = 34/176 (19%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + + TAI + + + Sbjct: 88 SSSAALILPPTIDHKVFLSRLDSLSIGEL--GDGTAIGMGLAVSSAYMT----------- 134 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------------ 410 YIVLLTDGEN ++ I I + Sbjct: 135 RTKLNSSYIVLLTDGENNTGEINPKTAAEVLVNKNIGFYVIGIGSSGYTTLEYTDRKTGK 194 Query: 411 ------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + L A + A+S L IF + I ++ T+ Sbjct: 195 TYSGSIFSKFDEVELKKIAQYGNGKYASASSPEILEDIF-NTISKQVPAAQSNFTR 249 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ + ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVVYAVGVG------KAVEAELREIASEP 499 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ + ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 279 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 333 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 334 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 387 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 388 ARAKEEGIVVYAVGVG------KAVEAELREIASEP 417 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ + ++ + R+G ++ +V S R ++ D T Sbjct: 3 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 59 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 60 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 113 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 114 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 152 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 268 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 322 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 323 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 376 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 377 ARAKEEGIVVYAVGVG------KAVEAELREIASEP 406 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ + ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 320 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 374 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 375 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 428 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 429 ARAKEEGIVVYAVGVG------KAVEAELREIASEP 458 >gi|298247107|ref|ZP_06970912.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297549766|gb|EFH83632.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 550 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 74/243 (30%), Gaps = 22/243 (9%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 +++ G ++ Y + Y + + +D S Sbjct: 322 VDKTTFNPDWGIKATLRQQAITYPAPDVINAALDNYQTVYRRPVHTIYCLDGSGSMGSNG 381 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGAT---FFNDRVISDPSFSWGVHKLIRTIVK- 325 ++++ + K + T T FN + + P SW V ++ Sbjct: 382 GWDQLKESSELLFDQTKARQYLLQTHPQDLTSVMVFNSDIAAGPDGSWTVEGNDPQKMRG 441 Query: 326 --TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T + +Q + + + + N K+ I+++TDG++ + N Sbjct: 442 LYDNIQAREPDGGTNMYACLQRSVELF---------KQQPNENRKRLIIVMTDGQSEKGN 492 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I S G+ ++++AF + + S + + R+ Sbjct: 493 GV-DQIIQSVASLGVPVISVAFGSDADVTQLNEI------STATHGSVTKKDNMVDAMRE 545 Query: 444 RIG 446 G Sbjct: 546 ATG 548 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLESRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|225435355|ref|XP_002285271.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 670 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/287 (11%), Positives = 70/287 (24%), Gaps = 65/287 (22%) Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + Q S ++ + P V +S N + K Sbjct: 193 DDDEVLDHQPESTERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPL 252 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI--KKKHLVRDALASVIRSIKKID 289 N + V + + K L++ A+ VI+S+ D Sbjct: 253 TSGRQNSGTNQTNMQPTSQSCRAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQSLGPCD 312 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F+ + ++ G T I + ++ Sbjct: 313 ------RLSVISFSSTARRLFPLRRMTDTGRQQALQA-VNSLVSNGGTNIAEGLRKGAKV 365 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 ++ I+LL+DG++T +C K++G+ Sbjct: 366 MLDRKW---------KNPVSSIILLSDGQDTY------TVC---KAEGV----------- 396 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + F IG + V + Sbjct: 397 ---------------------------IQDAFAQCIGGLLSVVVQEL 416 >gi|224065787|ref|XP_002190547.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Taeniopygia guttata] Length = 1068 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/280 (13%), Positives = 86/280 (30%), Gaps = 23/280 (8%) Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 + + + V+ + + + LN D +++ + Sbjct: 122 TDDDNFKTKVNYSYAAVQIPTDIYKGSTVILNELNWTQALEDVFIENRKEDPSLLWQVFG 181 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + + + L ++ +S + V +++ + + K Sbjct: 182 SATGVTRYYPATPWRAPNKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMK 241 Query: 288 I------DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAI 339 D ++D + FN++ F V + + K G+T Sbjct: 242 TSVYEMLDTLSDDDYVNVASFNEKAKPVSCFKHLVQANIRNKKVFKEDVQGMVAKGTTDY 301 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQG 397 + A+D + +SN K I++ TDG + + K K ++ Sbjct: 302 KAGFEYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYKWPNKT 350 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 +R+ T FSV + + CA+ +FE S + Sbjct: 351 VRVFT--FSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 388 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 45/129 (34%), Gaps = 11/129 (8%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 DSS K ++ A I + + N+ + F+ + + Sbjct: 87 DSSGSMADNYKLTNLKKAATDFITKMSTVKNLK----IAIVDFDTQATIINKLTDVSSST 142 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 T +K + G T + ++ A + +S+E N A K I+ ++DGE Sbjct: 143 NVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSE-------QNPLASKNIIFMSDGEP 195 Query: 380 TQDNEEGIA 388 T N + Sbjct: 196 TYYNWQTAN 204 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/310 (8%), Positives = 73/310 (23%), Gaps = 33/310 (10%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + + + P + L S K + + +++ + + Sbjct: 73 FKISIPPKEIVLVLDSSGSMADNYKLTNLKKAATDFITKMSTVKNL------KIAIVDFD 126 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + + + + + + ++ Y + +P S+ Sbjct: 127 TQATIINKLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASKN 186 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN--DRVISDPSFSW 314 S + + D FN + Sbjct: 187 IIFMSDGEPTYYNWQTANSGWI--------WGDYNGTYYTYYDWSFNRVNYGFDSNHMPS 238 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + + + S + Sbjct: 239 GYR-TGPYTGNYTGYKYQGSSYSNYYTDITQSNVASNVSQSGTGYSD------------- 284 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 TDG++ + I K +G + +I + ++ K + S+ ++ NS F + Sbjct: 285 TDGKSLNYAKIMGEI---IKGKGYNVFSIGYGLDSDGNTKMQQIHSSMSTNNSNFYETDS 341 Query: 435 HELNKIFRDR 444 ++ +F + Sbjct: 342 GAIDAVFSNI 351 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 55/146 (37%), Gaps = 14/146 (9%) Query: 296 RMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAIDENEMGSTA-INDAMQTAYDTI 350 R+G +N + SWG L + + ++ +T+ I + TA + + Sbjct: 69 RVGFVTYNSLATINADLNKFKSWG--DLSQGVNDSYNNMNLSSENTSFIGTGLITAGELL 126 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K I++ N + +++ N KS GI I+T+A + Sbjct: 127 Q-----VQGSAIGRVYYPKVIIVYASAFNGTGLLDPLSVANTLKSAGITIITVAVDTDNN 181 Query: 411 QQEKARYFLSNCASPNSFFEANSTHE 436 + L++ ASP S F + + Sbjct: 182 G--VIQKQLASIASPGSAFSLDPDDD 205 >gi|14248697|gb|AAK57634.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248699|gb|AAK57635.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|118401451|ref|XP_001033046.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89287392|gb|EAR85383.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 680 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 55/185 (29%), Gaps = 25/185 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ AL ++ ++ D+++ F+ I + + + + Sbjct: 73 LDIVQHALKMIVNTLTPDDDLS------LVVFSSMAIEVFD-TLRMDDANKILAIDKIEK 125 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 GST + +Q + + S ++ + +LTDG+ D+ + Sbjct: 126 LEASGSTNLQHGIQVGLNILSKSKSQNRNQA---------MYVLTDGQP--DDRNVMQFL 174 Query: 391 NKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K K I T F + + + + F + + Sbjct: 175 KKYKKDNPQLRCTISTFGFGSSCDSELLDEIAREY---NGMYSFIPDATLIATAFANALA 231 Query: 447 NEIFE 451 N + Sbjct: 232 NTLTV 236 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 57/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +K +D DT R+G + V ++ S +K I + + T Sbjct: 75 LITMLKFLDIGPDTTRVGLLQYGSTVKNEFSLK--TYKKKMDIERAVKRMMHLATGTMTG 132 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S +E N + +++TDG I KA++ GI I Sbjct: 133 LAIQYAMNIAFSESEGARP---LNQHVPRIAMIVTDGRPQDP---VAEIAAKARNSGILI 186 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE---LNKIFRD 443 + L S F + + L +F++ Sbjct: 187 FAVGVGR------VDMSTLKTIGSEPHTEHVFLVANFSQIETLTTVFQN 229 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/204 (10%), Positives = 57/204 (27%), Gaps = 18/204 (8%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + +D S +V+ + ++ S++ Sbjct: 550 YTLGEDGKSCKSERKCGEGPVDLVFVIDGSKSLG-ENNFEIVKQFVNGILDSLEISQKAA 608 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 ++ + ++ + + + + + K + + + A++ ++ S Sbjct: 609 HVG---LIQYSTHIRTEFTMAQ--YSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFS 663 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + R + ++ TDG +E KAK GI I I ++ Sbjct: 664 EVQGARPRAMGVP---RVAIVFTDGRA---QDEVSEYAKKAKQSGITIYAIGVGKAIDEE 717 Query: 413 EKARYFLSNCASPNSFFEANSTHE 436 L AS + Sbjct: 718 ------LQEIASAPQEKHVIYAED 735 >gi|47523446|ref|NP_999348.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Sus scrofa] gi|3341749|gb|AAC36289.1| voltage-dependent calcium channel alpha-2 delta subunit precursor [Sus scrofa] Length = 1091 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 75/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 58/202 (28%), Gaps = 44/202 (21%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ + + R+G F + + + T + Sbjct: 111 DRLTRAKQLISKLADKLD-------NDRVGLVVFAGNAYLQMPLTIDYSAA-KMYLTTVS 162 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAI A+Q A D K +++++DGE+ +E I+ Sbjct: 163 PDMIPTQGTAIGQAIQVANDAFNKKER-----------KHKSLIIISDGEDH--DEAAIS 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKA--------------------RYFLSNCASPNS- 427 A G+ I TI L + A+ Sbjct: 210 KARAAFEDGVVINTIGIGSPTGSPLPDPETGTYKKDKEGNTVISKLNEDALKSIAAAGKG 269 Query: 428 FFE--ANSTHELNKIFRDRIGN 447 +E N+T E+ +I + Sbjct: 270 IYEHLDNNTEEVVNSLTQKIDS 291 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 60/171 (35%), Gaps = 16/171 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ + VR+G + D + + ++++ K G T Sbjct: 482 ILEFLQTFRVGPNHVRIGVVKYADSPTLEFDL--HTYTDVKSLEKAITNIHQVGGGTETG 539 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ + + + K+Y+V++TDG +T ++ +K ++QG+ + Sbjct: 540 KALDFM------RPQFDRAVTTRGHKVKEYLVVITDGNSTDKVKDP---ADKLRAQGVVV 590 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + + L P F N+ L I D I + Sbjct: 591 YAIGV-----KDAVEKELLEISGEPQRTFYVNNFDALKPIKDDIITDICST 636 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 59/162 (36%), Gaps = 16/162 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + + R G F+ V S + + R +++ G+T DA+ Sbjct: 881 VNQSSVGPELTRFGVITFSTGVQSIFTLKQ--YSSKRDVLQAVGAVTAPGGNTNTGDALD 938 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + H + L+ + ++++TDG + ++ + QG+ + +I Sbjct: 939 YSLQYF-----GKEHGGRAALKVPQILMVITDGAAQEPSKLP-GPSEALRKQGVSVFSIG 992 Query: 405 FSVNKTQQEKARYFLSNCASPN--SFFEANSTHELNKIFRDR 444 + +R L A + F ++ L ++++ Sbjct: 993 V------KNASREQLDIMAGNDPSRVFFVDTFDALETLYKNI 1028 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 64/214 (29%), Gaps = 21/214 (9%) Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + S + S +L D +DSS + ++D + S+++ Sbjct: 626 KDDIITDICSTDGSDLSLLSTVCKDVPGDLIFLIDSSGSI-YPEDYQKMKDFMKSLVQKS 684 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 D V +G ++ + K + G T +A+ Sbjct: 685 NI---GKDQVHVGVLQYSTEQKLVFPLIQYY--TKDQLSKAIDDMQQIGGGTHTGEAIAV 739 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + N + K+ +V++TDGE+ D + +++G+ + +I Sbjct: 740 VSKYFDAQN-------GGRPDLKQRLVVVTDGESQDDVKLP---AEALRAKGVIVYSIGV 789 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 L + + L Sbjct: 790 -----VAANTSQLLEISGDADRMYAERDFDALKD 818 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 49/158 (31%), Gaps = 17/158 (10%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAIN 340 I I K D V +G F+ + + D E G T I Sbjct: 1070 IDLINKFQISRDLVHVGLAQFSSTFKDEFYL---NKFFDEQAISAHIKDMQQEEGGTLIG 1126 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ + +S H + + +VL+TDG+ +QD+ + G+ + Sbjct: 1127 LALNSIRKYFEAS-----HGSRKAEGISQNLVLITDGD-SQDDV--EEAARLLRGLGVEV 1178 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 I L +P + F + +L Sbjct: 1179 FAIGIG-----NVHDLELLQIAGTPENVFTVKNFDKLE 1211 >gi|126722583|ref|NP_001075745.1| voltage-dependent calcium channel subunit alpha-2/delta-1 preproprotein [Oryctolagus cuniculus] gi|116409|sp|P13806|CA2D1_RABIT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|164763|gb|AAA81562.1| dihydropryridine-sensitive calcium channel alpha-2 subunit [Oryctolagus cuniculus] Length = 1106 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/264 (13%), Positives = 75/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 171 PTDIYEGSTIVLNELNWTSALDDVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 230 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 231 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 290 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 291 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 350 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K + +R+ T FSV + + Sbjct: 351 V-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVFT--FSVGQHNYD 397 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 398 RGPIQWMACENKGYYYEIPSIGAI 421 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 53/154 (34%), Gaps = 20/154 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++ + + + L+ T+ G T+ +AMQ+ I S+ Sbjct: 382 CIMYDHSAYNCFTTNPLSTSLVSTMSSYV-----AGGGTSFTNAMQSVSSLISSTYP--- 433 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 N+ K ++ ++DGE++ D I S I + TI + Sbjct: 434 ----NHQSYKIVVLFMSDGEDSADEAVSITG-QLVSSHDIILHTIQLGGSSD-----NTG 483 Query: 419 LSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 L A F AN + L I+++ + + Sbjct: 484 LRQMAATGRGQFKRANDSASLAGIYQEIANHPVA 517 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLVGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIAMYAVGVG------KAVEAELREIASEP 499 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ ++ + R+G ++ +V S R ++ D T Sbjct: 55 QFLVGLLRGLNVGPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q A + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYAMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 499 >gi|14248667|gb|AAK57619.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DTV + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTVNLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + GST + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGSTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + + Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVVGVGQGIN 140 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 54/164 (32%), Gaps = 17/164 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ ++ +D + R+G ++ +V S G + + T Sbjct: 57 QFLVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRALVPLAQGTM 114 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+Q A + S E + V++TDG + + + +A+++GI Sbjct: 115 TGLAIQYAMNVAFSVAEGARPPEARVP---RIAVIVTDG---RPQDRVAEVAAQARARGI 168 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I + L ASP F S + + + Sbjct: 169 EIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFNLIQE 206 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 404 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 458 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 459 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 512 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 513 RRAKEEGIVMYAVGVG------KAVEEELRQIASEP 542 >gi|22761666|dbj|BAC11648.1| unnamed protein product [Homo sapiens] Length = 451 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHPSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|116622495|ref|YP_824651.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225657|gb|ABJ84366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 55/157 (35%), Gaps = 15/157 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N +M FN+R I+ ++T + GSTA+ DA+ A I Sbjct: 107 NTRDQMFVVHFNERARLGLPERTPFTGKIK-ELETAISSFDVGGSTALYDAILLAQSHIR 165 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++ ++++TDG + + G+ I I + Sbjct: 166 -----------GGVYGRRILLVITDGGDNSSKATLEEAVDAVAKAGVVIYAIGI-YDPND 213 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 +++ L++ A + F + ++ +I + Sbjct: 214 KDQNPKVLAHLAEVTGGEAFFPTALSDITRICEEIAA 250 >gi|258624850|ref|ZP_05719778.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582848|gb|EEW07669.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 128 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 31/89 (34%), Gaps = 15/89 (16%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN-------------KTQQEKARY 417 ++LL+DG NT + + N AK I T+ T Q+ Sbjct: 1 MILLSDGSNTAGVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLDEK 60 Query: 418 FLSNCAS--PNSFFEANSTHELNKIFRDR 444 L AS +F A + +L I+ Sbjct: 61 TLQTIASTTGGQYFRARNQQDLQSIYDTI 89 >gi|269126610|ref|YP_003299980.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311568|gb|ACY97942.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 315 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/149 (21%), Positives = 56/149 (37%), Gaps = 18/149 (12%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 R + TAI +A+ T+ I +IVLL+ Sbjct: 142 TPSGDRAAAISSIDTLTLAKRTAIGEAVFTSLQAIR-----SFDAQAGQDPPPAHIVLLS 196 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN---------KTQQEKARYFLSNCA--S 424 DG+NT AI + A++ + + TIAF T E + L+ A + Sbjct: 197 DGDNTTGRSVPEAI-DAARAADVPVSTIAFGTPYGTVEIDGETTPVEVNKVTLAGLAQGT 255 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERV 453 +EA +L++++ + IG + R+ Sbjct: 256 NGKAYEAADNDQLSQVYAN-IGTSLGWRI 283 >gi|1589785|gb|AAC47462.1| thrombospondin-related anonymous protein [Plasmodium knowlesi] Length = 533 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 62/208 (29%), Gaps = 25/208 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S V L +I ++ D++ + + F G Sbjct: 10 VDGSGSIGYANWITRVIPMLTGLIENLNLS---KDSINLYMSLFASHTTELIRLGSGPSM 66 Query: 319 LIRTIVKTFAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + G+T+++ A+ + N A + ++L+T Sbjct: 67 DKKQALNVVRDLRKGYEPYGNTSMSSALSEVEMHLKDRV--------NRPNAIQLVILMT 118 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----EKARYFLSNCASPNSFFE 430 DG + + + K + +++ I Q R +C F+ Sbjct: 119 DGIPN-NKYRALELSRALKERNVKLAVIGIGQGINHQYNKLMAGCRPRERSC----KFYS 173 Query: 431 ANSTHELNKIFRDRIGNEIFERVIRITK 458 + E + + I + V RI K Sbjct: 174 SADWSEAISLIKPFIAK-VCTEVERIAK 200 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 65/203 (32%), Gaps = 23/203 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + ++ F+ SS V++ + +++ + D D R+G + V Sbjct: 49 SSCENKRADVVFIIDSSRSVNTHDYAKVKEFIVDILQFL---DIGPDVTRVGLLQYGSTV 105 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ S K + + + T A+Q A + S E Sbjct: 106 KNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LREN 160 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-- 424 + I+++TDG + + + KA+ GI I I L S Sbjct: 161 VPRVIMIVTDG---RPQDSVAEVAAKARDTGILIFAIGVGQVDFNT------LKAIGSEP 211 Query: 425 -PNSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 212 HEDHVFLVANFSQMESLTSVFQN 234 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 60/185 (32%), Gaps = 23/185 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEIVKQFVTGIIDSLAISPKAA---RVGLLQYSTQVRTEFTLRN--FGSAKDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQVEGARPLSARVP---RVAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+N+ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEINEKLKKG 833 Query: 445 IGNEI 449 I + Sbjct: 834 ICEAL 838 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 65/203 (32%), Gaps = 23/203 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + ++ F+ SS V++ + +++ + D D R+G + V Sbjct: 49 SSCENKRADVVFIIDSSRSVNTHDYAKVKEFIVDILQFL---DIGPDVTRVGLLQYGSTV 105 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ S K + + + T A+Q A + S E Sbjct: 106 KNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LREN 160 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-- 424 + I+++TDG + + + KA+ GI I I L S Sbjct: 161 VPRVIMIVTDG---RPQDSVAEVAAKARDTGILIFAIGVGQVDFNT------LKAIGSEP 211 Query: 425 -PNSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 212 HEDHVFLVANFSQMESLTSVFQN 234 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 60/185 (32%), Gaps = 23/185 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 630 FEIVKQFVTGIIDSLAISPKAA---RVGLLQYSTQVRTEFTLRN--FGSAKDMKKAVAHM 684 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 685 KYMGKGSMTGLALKHMFERSFTQVEGARPLSARVP---RVAIVFTDGRAQDD---VSEWA 738 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+N+ + Sbjct: 739 SKAKANGITMYAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEINEKLKKG 792 Query: 445 IGNEI 449 I + Sbjct: 793 ICEAL 797 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|1575519|gb|AAC47463.1| thrombospondin-related anonymous protein [Plasmodium vivax] Length = 510 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 45/155 (29%), Gaps = 15/155 (9%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S V L +I S+ DT+ + F + G Sbjct: 10 VDGSGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSI 66 Query: 319 LIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 R + + G+T + A++ + N +A + ++L+T Sbjct: 67 DKRQALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMT 118 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 DG + K K + + + I Sbjct: 119 DGIPNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 152 >gi|297299828|ref|XP_002805471.1| PREDICTED: matrilin-2-like [Macaca mulatta] Length = 897 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 LMDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|149920662|ref|ZP_01909127.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] gi|149818449|gb|EDM77898.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] Length = 540 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 54/180 (30%), Gaps = 29/180 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR L + ++ D R+ F D + T Sbjct: 175 IDRVRQGLLQMREQLEPED------RVTLVGFGDEAQVIVE----NADKDSVELATAIAA 224 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGI 387 GST + ++TA++ ++ ++L++DG T +++ Sbjct: 225 LVPWGSTNLYAGLRTAFEQTDLYAQEGWQNR---------VLLVSDGVPTTGIVNSDKIE 275 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 + G + T+ + + N + SF+ + ++F + + Sbjct: 276 GLAEAWSGMGYGLTTVGIG-----NDFDIELMRNLSELGSGSFYYVEDPDAVIEVFSEEV 330 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 64/191 (33%), Gaps = 18/191 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVK 325 + + A V+ ++ + + +DR D + V L RT K Sbjct: 59 KSRMAAAKQAFNEVLDAVPEEVRLGIRTLGADYPGDDRKRGCKDTRQLYPVGPLDRTEAK 118 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T I A+ A + + +A + IVL+TDGE+T + Sbjct: 119 AAVATLAPTGWTPIGPALLGAAEDL------------EGGDATRRIVLITDGEDTCAPLD 166 Query: 386 GIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + + ++GI I T+ + + + R A+ ++ T EL+ Sbjct: 167 PCEVAREIAAKGIHLVIDTLGLVPDAKTRTQLRCIAE--ATGGTYTSVQHTDELSDRVSQ 224 Query: 444 RIGNEIFERVI 454 + V Sbjct: 225 LVDRAADPVVT 235 >gi|14248693|gb|AAK57632.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248695|gb|AAK57633.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|313676404|ref|YP_004054400.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312943102|gb|ADR22292.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 618 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 90/302 (29%), Gaps = 25/302 (8%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 G+ Q + + S + S S + +N F + Sbjct: 152 GLDENTFQEATQNPLSTFSIDVDAASYSNMRRFINSGQNPPKDAVRIEEMINYFNYDYKQ 211 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + + + L + P P+ + +D S K Sbjct: 212 P-SGQDPFSINTEVSQAPWNKKHQLVHIGLQGKVIPTENLPASNLVFLLDVSGSMFAQNK 270 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L++ L ++ +++ D V+ + + G + + + Sbjct: 271 LPLLKSGLKMLVDQLREEDKVS------IVVYAGAAGCVLPPTSGNE---KDKIIEALQN 321 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGI 387 GSTA ++ AY ++ + NN I+L TDG+ NE Sbjct: 322 LQAGGSTAGGAGIELAYKI----AKENFIKEGNNR-----IILATDGDFNVGASSNEAME 372 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + K + +G+ + + F + + K ++ ++ E K+ + G Sbjct: 373 DLIEKKRKEGVFLTVLGFGMGNYKDSKMEILADK--GNGNYAYIDNILEAKKVLVNEFGG 430 Query: 448 EI 449 + Sbjct: 431 TL 432 >gi|301610722|ref|XP_002934910.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 957 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 70/206 (33%), Gaps = 26/206 (12%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + ++ F+ S + +++ + ++ + ++G ++D Sbjct: 29 SSCRTAPNDLVFILDGSWSVGPENFEILKKWVVNITSNFNI---GPKFTQVGVVQYSDYP 85 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 I + G ++ + + + G+T +A++ A D + + + Sbjct: 86 ILEIPL--GSYESSDDLSRRMQSIQYLGGNTQTGNAIRFAIDNLFAR---------SLRP 134 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--- 423 K V+LTDG + +E + +A+ I + I E L A Sbjct: 135 LTKIAVVLTDG---KSQDEVKHVAEEARKNKITLFAIGVGS-----EIEESELRAIANKP 186 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEI 449 S F ++K R+ + ++ Sbjct: 187 SSTYVFYVEDYIAISK-IREIMKQKL 211 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 52/201 (25%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVAPNRMAAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 K TA + + TA I + + IVL +DG+ Sbjct: 162 -----KNALDKLQFADRTATGEGIFTALQAIATVG---AVIGGGDKPPPARIVLFSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T AK QG+ I TI+F E L A Sbjct: 214 TMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQL 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S + + A S EL ++ Sbjct: 274 SGGNAYNAASLQELKSVYATL 294 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 51/417 (12%), Positives = 102/417 (24%), Gaps = 65/417 (15%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E A + Y + + + +V + T + V Sbjct: 119 ETENVPEADKDKNYTFNAYVNYEGTDSDKDYTLYTGKYDVYNTATGTKDGETRTTDSNGV 178 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + S R + + + V + D S + Sbjct: 179 ITLKAGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSANGGQVKVTQEKD-SATTEP 237 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + Y N V+ +S P D Sbjct: 238 VSVGEVPRITVTNTVVTAPRYRKYIKANND----GTYDLSLNVTGTQSGSSQTTVSPADI 293 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN--VNDTVRMGATFFNDRVISD 309 + + S+ + + + A+ S+ + + +N + +RM F+ V + Sbjct: 294 VVVFDTSGSMSNPMGHNSRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFSTTVGNV 353 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +F+ + + G T + KK Sbjct: 354 SNFTDNAMD-----IVSAVNGLRADGGTNWEA----------ALKAANAKLTSGRKGVKK 398 Query: 370 YIVLLTDGENT--------------------------------------QDNEEGIAICN 391 YIV ++DG+ T A+ Sbjct: 399 YIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAE 458 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + + ++ S + T+ + S++ A ST ELNK F D IG Sbjct: 459 ANRRGDATLFSVGVSSDPTKMRGFADQ-----TKGSYYSATSTDELNKAFADIIGQI 510 >gi|14248707|gb|AAK57639.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 630 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 684 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 685 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 738 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 739 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 792 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 793 ICEALEDSDGR 803 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 630 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 684 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 685 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 738 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 739 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 792 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 793 ICEALEDSDGR 803 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|302670289|ref|YP_003830249.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302394762|gb|ADL33667.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 568 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 41/379 (10%), Positives = 97/379 (25%), Gaps = 30/379 (7%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + +S A+ + N + Sbjct: 25 TLAGCGGTSNMTDSTASEAEYYQNDTYTEGAAQAPVNSDAKSADERGKYSAGTNNNYKYS 84 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + + + + P + + L A+ +T S S + Sbjct: 85 DNTTDNYYGDKKVINNNESYTKPEENGFNLV-MNQPLSTFAADVDTASYSNVRRMIEDGY 143 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 V + + + +N F K + + + Sbjct: 144 AVGEIDPESVRP-----EEFINYFNYDLYEPEKGD-LFGITTEVSACPWNSDNQLMFVGM 197 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P + +D S K L++ + ++ S+ D + Sbjct: 198 KTGEIDMEEAPVSNLVFLIDVSGSMSSRNKLPLLQKSFDELVDSL------PDEGTISIV 251 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 ++ G + +K + G T MQ AY+ + Sbjct: 252 TYSGEEKVVL---SGEPMSNKKGIKKAIDKLHANGCTNGQAGMQKAYEI----AQKYFIE 304 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARY 417 NN +++ TDG+ + + K +G+ + + F + +K + Sbjct: 305 GGNNR-----VIMATDGDLNVGISDLDDLEKFITDKKDEGVFLSILGFGEGNYKDDKMQT 359 Query: 418 FLSNCASPNSFFEANSTHE 436 L++C ++ +S E Sbjct: 360 -LADC-GNGNYSYIDSLEE 376 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 669 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 723 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 724 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 777 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 778 SKAKANGITMYAVGVG------KAIEEELQEIASEPTTKHLFYAEDFSTMDEISEKLKKG 831 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 832 ICEALEDSDGR 842 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|226290246|gb|EEH45730.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 757 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 56/193 (29%), Gaps = 28/193 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L + A ++I ++ + D R+G F++ S + + Sbjct: 109 LDLTKHAARTIIETLNEND------RLGVVTFSNDAEVAYKIS-HMDDTNKKAALEAVEA 161 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------QDNE 384 + ST + ++ + + + + + +LTDG+ Sbjct: 162 LQPLASTNLWHGLKLGLNVL--------GEVDLRPRNVQALYVLTDGQPNHMCPTQGYVP 213 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + I + K + I T F + L + A ++ + +F Sbjct: 214 KLRPILERQKDRLPLIHTFGFGYDI-----RSGLLQSIAEVGGGTYSFIPDAGMIGTVFV 268 Query: 443 DRIGNEIFERVIR 455 I N + Sbjct: 269 HAIANLYTTFATQ 281 >gi|14248701|gb|AAK57636.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.8 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNLTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 53.5 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 66/211 (31%), Gaps = 24/211 (11%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFV---DSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 + N S+ + + + F+ S ++ +K L++ + S+ I Sbjct: 46 WNENDSVLADKKNEVCNEGIVDIGFIMDSSGSLGKNYKNEKDLLKT-----LASLFSIKP 100 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + F + + + + T I+ ++ A + Sbjct: 101 NGSQAGVITFSFYTEHSIKLNQFSDQDSFNDAVDRIPLMGHT----TRIDKGLRLAQKEM 156 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE--EGIAICNKAKSQGIRIMTIAFSVN 408 + K +VLLTDG TQ + I ++ + QG+ I+ I Sbjct: 157 FKVENGGRPGV------SKLLVLLTDGSQTQGKGVIDPAIIADEIRKQGVPIIAIGIGKE 210 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E + + + A+ +L + Sbjct: 211 INKNELIKIG----GGEANTYSADDFEKLKE 237 >gi|268608768|ref|ZP_06142495.1| hypothetical protein RflaF_04637 [Ruminococcus flavefaciens FD-1] Length = 453 Score = 53.5 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 67/191 (35%), Gaps = 18/191 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++A + ++ + + + F+ + +L I KT Sbjct: 199 MTAQKEAAKKFCEDTIGTNKNSN-HKFAVITLDSGSKTLTDFTNDIDELDSAIAKT---- 253 Query: 331 ENEMGSTAINDAMQTAYDTIISSNED--------EVHRMKNNLEAKKYIVLLTDGENTQD 382 GST + A++ A + + + D E L+D + + Sbjct: 254 -TAYGSTNYSAALRNAAELLSKVSADAVRNIVLCSDGNPYGGEEKSTGKYTLSDYSDYEY 312 Query: 383 NEEGIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 I + K + I T+ F S++ + R +L + AS +++ E N +L K Sbjct: 313 ANAAYDIAQEIK-KDYEIYTLGFFHSLSGEDLDFGRTYLKDVASYDSNYAEVNKVDDLQK 371 Query: 440 IFRDRIGNEIF 450 +F D GN + Sbjct: 372 VFADVAGNAVS 382 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 53.5 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 59/203 (29%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + + K Sbjct: 117 NRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKAALDKLQF 170 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 D TA +A+ TA I + + IVL +DG+ T Sbjct: 171 ADR-----TATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNP 222 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E + A S + + A Sbjct: 223 KGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAA 282 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 + ELN ++ +I IR Sbjct: 283 TLAELNSVYASL-QQQIGYETIR 304 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 53.5 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 63/186 (33%), Gaps = 17/186 (9%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 K L + A S + + D + D + +D G R + Sbjct: 445 GSESKLDLAKKAATSALGQLTDTDQMGLWAFTTDLPTPDTITADL-VGVGPLAQTRQPII 503 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + T + A + A + + + +V+LTDG N + + Sbjct: 504 DAISSLTPLNGTPLYAATREAAKAMNAQKDPNSINA---------VVVLTDGRNEYTDND 554 Query: 386 GIAICNK----AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + A+ G+R+ TIA+ + AS + ++A + ++K+F Sbjct: 555 LDGLLRELNASAEEDGVRVFTIAYGPDADLATLQEISE---ASRAAAYDARNPTSIDKVF 611 Query: 442 RDRIGN 447 D + N Sbjct: 612 SDVLSN 617 >gi|126732236|ref|ZP_01748037.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] gi|126707318|gb|EBA06383.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] Length = 710 Score = 53.5 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/395 (10%), Positives = 95/395 (24%), Gaps = 34/395 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + A ++++ + Sbjct: 166 AAALEAREEFDADAPAPRVPSRSAGDMTLR------RSTAPTGAAPQSGGGLSGGFGGAL 219 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLN-PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 ++N R + V + + P + + + T S Sbjct: 220 TGQVNGRIAPAPVPVPQFEPVPTPAPDTEAFPEAEQNPIKVTAEDPVSTFSIDVDTASWS 279 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 I+ + + + +N F + VGI D + Sbjct: 280 IIRNSL-TRGMLPSKDAVRIEEMVNYFSYDYPAPE-GDAPFATHVGISDSPWRE--GTQI 335 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + P ++ +D+S K L++ + ++ + + D + Sbjct: 336 VQIGLQGMLPEARPPMNLVFLIDTSGSMQDANKLPLLKQSFRLMLGQLGEED------MV 389 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + R + GSTA +Q AY T D Sbjct: 390 SIVTYAGSAGRVLEPT---KASDRQTILDALDRLEAGGSTAGQAGLQQAYATATEMARDG 446 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEK 414 ++L TDG+ + + + + G + + F Sbjct: 447 AVSR---------VILATDGDFNVGISDPDDMKDYIETQRGTGTYLSVLGFGRGNLDDAT 497 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + ++ E K+ D++ + Sbjct: 498 MQALAQH--GNGMAAYIDTLSEAQKVLVDQLTGAL 530 >gi|14248671|gb|AAK57621.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248673|gb|AAK57622.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|126343558|ref|XP_001372639.1| PREDICTED: similar to ITI-like protein, partial [Monodelphis domestica] Length = 1002 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 38/344 (11%), Positives = 82/344 (23%), Gaps = 31/344 (9%) Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 V E+ + + V L L R +G + + + Sbjct: 183 THVASRGDEVKKFRISTSVAGGGEVAFTLTYEELLPRLLGKYQHAVSVRPGQVVPNLRVE 242 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + + K + + + Y Sbjct: 243 VTISERTGIDYIHVLPLQSSHLLTNIVRGEADLPTSTTIEKEKTCARVIFMPSPSEQAAY 302 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ + + + + A+ ++ + D+ N Sbjct: 303 S----------------SQGITGDFVIQYDVTMKDVIGDVQTKKAMHVILGDLCPKDHFN 346 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 S + + + K + T +N A+ A + Sbjct: 347 IVTFSDTVHIWKAAGSIQAIPPNIQRA-----KAYVSRMKAARWTDMNAALLAAASILNQ 401 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNK 409 S + + I+ LTDGE T I A+ + + +A + Sbjct: 402 SIAGPLGEA-------RLIIFLTDGEPTAGVTSPARILANAQRALAGQVALFGLALGDDA 454 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R L N + + E + K F DRI + + Sbjct: 455 DLPLLRRLSLENRGTAHRIREDHDAASQLKGFYDRIAYPLLSDI 498 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 45/127 (35%), Gaps = 14/127 (11%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N + R+G + + + + +++ D TAI +A+Q A Sbjct: 124 NNLASDRIGLIAYAGSAVPQLPITTDYSSA-KMFLQSMNTDLVSSQGTAIAEAIQLAESY 182 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + EA K +V+++DGE+ + A GIRI+TI K Sbjct: 183 YS-----------EDTEASKVLVIISDGEDHEGEAL--DYAEAAAENGIRIITIGVGTEK 229 Query: 410 TQQEKAR 416 + Sbjct: 230 GGTIPIK 236 >gi|326921803|ref|XP_003207144.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1054 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 53/161 (32%), Gaps = 20/161 (12%) Query: 275 RDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 ++ + S+ +I ++ V V++ F+ V D F + K Sbjct: 30 KNFVLSLTDNIFQMKPVKSQKYDVKLAGMQFSSTVSVDHPF---IAWKNVQNFKEKIRAL 86 Query: 332 NEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G T A+ A + + K L+TDG + ++ I Sbjct: 87 VYIGQGTYSYYAISNATQLFKTEGREGSI---------KVAFLMTDGVDHPNSPSVEGIA 137 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 A+S GI TI +++ L + +SF Sbjct: 138 TAARSLGIHFFTIGL----SKKNVKEEKLRLISGDSSFKHV 174 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 49/164 (29%), Gaps = 19/164 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G F+ +V S + ++ T A++ A + Sbjct: 722 NHAKTRIGVINFSHKVELVSSLE--KYTTKESLKSAVDKMLYLGEGTYTASAIKKAINLF 779 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAF--SV 407 ++ K V++TDG+ +E + + +A + I I I Sbjct: 780 QAARPAVR----------KVAVVVTDGQADARDEVHLDMVVREAHAANIEIFVIGIVQET 829 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNE 448 + + + A+ F+ L+ + I Sbjct: 830 DPHYDNFLKE-MHLIATDPDEEHFYRIEDFKTLSALTDKLITKI 872 >gi|288800164|ref|ZP_06405623.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333412|gb|EFC71891.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] Length = 339 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/224 (9%), Positives = 61/224 (27%), Gaps = 50/224 (22%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + ++I S ++G F + Sbjct: 97 DISNSMLAQDVQPSRLDKSKLMIENLINSF-------INDKIGLVVFAGEAYVQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + T I A++ + + K I+L+T Sbjct: 150 YVSA-KMFLSDITPNLISAQGTDIARAIRVSLSSFTQQ-----------KGVGKAIILIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFLSN--- 421 DGE+ + +AK +G+ + + +K + ++ Sbjct: 198 DGEDNEGGAL--EAVKEAKEKGVNVFILGVGDSKGAPIPLGNGEYLKDNHGQTVMTALNE 255 Query: 422 --C-----ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 C A ++ ++T ++ + ++ K Sbjct: 256 NMCKEIAQAGSGTYIHIDNTS--------LAQEQLNNELSKLQK 291 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 81/362 (22%), Gaps = 66/362 (18%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + ++ + + +A + V Sbjct: 5 AAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTLSAP 64 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 ++ G + + + + I V+D S SM + QP Sbjct: 65 AREKTVTANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQP 124 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 ++ I D+ + + Sbjct: 125 TKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGT------------------- 165 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +K ND R G + +N + ++ V L Sbjct: 166 -------------------------EKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLT 199 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 T + G+T+ + A A + + AKK ++ TDGE Sbjct: 200 ST-----VNGLSASGATSADYAFNRAQAALT---------YQPRANAKKVVIFFTDGEPN 245 Query: 381 QDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + A K G I +I + + +S + + Sbjct: 246 HGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDAT 305 Query: 434 TH 435 Sbjct: 306 AT 307 >gi|47847422|dbj|BAD21383.1| mFLJ00114 protein [Mus musculus] Length = 1188 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +D S + D + +V+ +++ R Sbjct: 157 NFPTAQQECPKQDQDIVFLIDGSGSISS-TDFEKMLDFVKAVMSQLQRPST-----RFSL 210 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D +F+ + + G T A++ + ++ Sbjct: 211 MQFSDYFRVHFTFNNFISTSSPLSLLDSVRQLR--GYTYTASAIKHVITELFTTQSGARQ 268 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +A K ++++TDG DN ++ A++ I I E ++ L Sbjct: 269 ------DATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIRYAIGVGK-AFYNEHSKQEL 321 Query: 420 SNCA---SPNSFFEANSTHELNKI 440 A S F + L I Sbjct: 322 KAIASMPSHEYVFSVENFDALKDI 345 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 66/188 (35%), Gaps = 14/188 (7%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKL 319 ++ + + + A V+ ++ + V +D+ D V + Sbjct: 60 ATDIDGRSRISVAQQAFNEVVDALPDETELGIRVLGATYPGDDKEQGCQDTQQIVPVGPV 119 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R K G T + A+++A + + + + IVL+TDGE+ Sbjct: 120 DRVQAKAAVATLRPTGYTPVGLALRSAAEDLGTG------------STARRIVLITDGED 167 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 T + + + +QG +++ + ++ + + A+ ++ A S EL Sbjct: 168 TCAPPDPCEVARELAAQGTKLVVDTLGLAPDEKVRQQLLCIAGATGGTYTAAQSADELTG 227 Query: 440 IFRDRIGN 447 + + Sbjct: 228 RIKQLVDR 235 >gi|307302722|ref|ZP_07582478.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|307318570|ref|ZP_07598004.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306895910|gb|EFN26662.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306903086|gb|EFN33677.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 577 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 53/176 (30%), Gaps = 14/176 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK------------- 54 ++ +KS TG+ + AL+MP+++ G+ +D + L+ Sbjct: 5 TTAFRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAAD 64 Query: 55 -QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 +A+ A++ + + + + + + Sbjct: 65 VSSAEEAVLAHFRSNDLGFAVSTPSGLMTVDGKLLPNDVETAAKGVATIVRGRYVPDPAI 124 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + A QV L + D+ L L + + +K A +V Sbjct: 125 DAGQRFIRGALPADAVQVTLEKKGDIYLAALFSEPPDLSVIGTAASSKIAAFSVGS 180 >gi|262197272|ref|YP_003268481.1| hypothetical protein Hoch_4090 [Haliangium ochraceum DSM 14365] gi|262080619|gb|ACY16588.1| Myxococcales GC_trans_RRR domain protein [Haliangium ochraceum DSM 14365] Length = 602 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 15/132 (11%), Positives = 44/132 (33%), Gaps = 13/132 (9%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + R+ + G T I + ++ ++ + + R + ++LL+D Sbjct: 221 REAWRSEMHALVDTLQPGGGTNIYEGLERGFEIAKEARVNHPDRAQR-------VILLSD 273 Query: 377 GENTQDNEEGIAIC---NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 G T+ + +I G+ + T+ + + +F+ Sbjct: 274 GLATEGITDSASIIALSEAFIEGGMGLTTVGVGASFNVELMRGLAER---GAGNFYFVED 330 Query: 434 THELNKIFRDRI 445 + ++F + + Sbjct: 331 PEAVREVFTEEL 342 >gi|149732296|ref|XP_001503293.1| PREDICTED: collagen, type XXI, alpha 1 [Equus caballus] Length = 957 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ + D +++G ++D + + G H ++ G+T Sbjct: 60 LVNITRNFDIGPKFIQVGVVQYSDYPVLEIPL--GSHDSGENLMAAMESIHYLGGNTRTG 117 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A D + K++ K V+LTDG + +E A+ I + Sbjct: 118 KAIQFALDYLF---------AKSSRFLTKIAVVLTDG---KSQDEVKDAAEAARESKIIL 165 Query: 401 MTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I E L A S F ++KI Sbjct: 166 FAIGVGS-----ETEEAELRAIANKPSSTYVFYVEDYIAISKI 203 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 63/179 (35%), Gaps = 20/179 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 79 ILDILQFLDIGPDITRVGLLQYGSTVKNEFSLK--TFKRKSDVERAVKRMRHLSTGTMTG 136 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + +KA++ GI I Sbjct: 137 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVASKARNTGILI 190 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS---THELNKIFRDRIGNEIFERV 453 I + L S + F + L +F++++ ++ V Sbjct: 191 FAIGVG------QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQNKLCSKYGASV 243 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 17/156 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 664 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGS-MTG 721 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ Y+ + E + ++ TDG D KAK+ GI + Sbjct: 722 LALKHMYERSFTQVEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWARKAKANGITM 775 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 + + L AS F A Sbjct: 776 YAVGIG------KAIEEELQEIASEPIDKHLFYAED 805 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 60/187 (32%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +I +K+ D R+ F+ R +K+ Sbjct: 59 LETVKKAAMQLIERLKEGD------RICVIAFDHRAKVLVP---NQAIDNLNTIKSQIRQ 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 + G TAI++ ++ + + D V + + LLTDGEN DNE + + Sbjct: 110 LSADGGTAIDEGLKLGIEEVAKGKADAVSQ----------VFLLTDGENEHGDNERCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 + A + I T+ F + + L A + + + F Sbjct: 160 AHFAVEHKLTINTLGFGASWN-----QDVLEKIADSGSGTLCYIEQPEQAVQEFGRLFNR 214 Query: 448 EIFERVI 454 + Sbjct: 215 IQAVGLT 221 >gi|14248689|gb|AAK57630.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTIDLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 45/144 (31%), Gaps = 19/144 (13%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + T + G T+I ++ + II SN+ Sbjct: 21 MVTFESLATILNYLTDII-GDNAYEKITANLPREASGGTSICSGLRAGFQAIIHSNQSTS 79 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARY 417 I+LLTDGE+ Q + C + K G I TIA + Sbjct: 80 GSE---------IILLTDGEDDQISS-----CFEEVKQSGAVIHTIALG---PSAARELE 122 Query: 418 FLSNCASPNSFFEANSTHELNKIF 441 LSN F+ + L F Sbjct: 123 TLSNMTGGYRFYANKDINGLTDAF 146 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 53/133 (39%), Gaps = 20/133 (15%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R++++ + G T + + AY+ + S + +V++TDG+N Sbjct: 482 RSLLRERLLGLTLSGGTGLYNTTAAAYEKMTGSRRGDAINA---------VVVMTDGKNE 532 Query: 381 -QDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANST 434 + + K + + +R+ TI + + Q L A+ + ++++ Sbjct: 533 RPGGLDLDGLIAKLGARREESVRVFTIGYGEDADQN-----VLRRIAEAADGAAYDSSDP 587 Query: 435 HELNKIFRDRIGN 447 + + IF + I N Sbjct: 588 NTIGDIFTEVISN 600 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|56675026|gb|AAW19655.1| matrilin-1 [Cervus elaphus] Length = 230 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 55/168 (32%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I+ +D + R+G + V + + + T A+ Sbjct: 30 IESLDVGPNATRVGLVNYASSVKQEFPL---RAHSSKAELLQAVRRIQPLSTGTMTGLAI 86 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A +S E R + K ++++TDG QD+ ++ +A++ GI + I Sbjct: 87 HFAITKALSDAEGGRPRS---PDISKVVIVVTDGRP-QDSVRDVSA--RARASGIELFAI 140 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L+K F++ Sbjct: 141 GVGR------VDKATLRQIASEPQDEHVDYVESYRVIEKLSKKFQEAF 182 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 54/155 (34%), Gaps = 15/155 (9%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D V+ GA +++ + I + + G T A+Q A Sbjct: 870 ADVGRDRVQFGALMYSNNPEILFYL--NTYSSRSAITEHLKRPRDTRGDTYTAKALQHAN 927 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + E H + ++ ++++TDGE + D ++ + + +GI I + Sbjct: 928 ILFM-----EEHGSRLKQNVRQLMIVITDGE-SHDRDKLNDTARELRDKGITIFAVGVGR 981 Query: 408 NKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 982 ------ANQDELETMAGKKENTIHVDNFDKLRDIY 1010 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 40/355 (11%), Positives = 97/355 (27%), Gaps = 24/355 (6%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + + + + + S + + S ++ G+ Sbjct: 305 FSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGAAIEHMRTEGFSESSGSRKAQGV 364 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQW--VIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + S +E + ++ V F+ S+ ++ + + T Sbjct: 365 PQIAVLVT---HRASDDAVREAALDLRLQGVTMFAMSVQGANNTQLEDIVSYPSRQSIST 421 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SY G +K+ + + + + ++E + ++ Sbjct: 422 HSSYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKTGCVDTKEADIYFLIDGSSSIRR 481 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + + VR+G ++ R + S + Sbjct: 482 KEFEQIQIFMSSVVDMFPIGPNNVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQL 541 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + +H E Y+++LTDG++ E N Sbjct: 542 K--GGTFTGKALDFILPIIKKGKSERIH------EVPCYLIVLTDGKSNDSVLEP---AN 590 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 + +++ I I + E + L A + L K ++ I Sbjct: 591 RLRAEHITIHAVGIG------EANKTQLQQIAGKDERVSFGQNFDSL-KYIKNEI 638 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 54/179 (30%), Gaps = 24/179 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S R V Sbjct: 668 TNFEKMKTFMKNLVGKIEIGADRSQVGVVQFSDYNKEEFQLNKYS------TREEVYAAI 721 Query: 329 IDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + +T A+ NE + +K+++LLTDGE + Sbjct: 722 DGMSPINRNTLTGSALTFV-------NEYFDISKGGRPQVRKFLILLTDGEAQDEVGGPA 774 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ R L + S F + L I I Sbjct: 775 M---ALRSKSVTIFSVGV-YGAN-----RTQLEEISGEGSLVFHVENFDHLKTIESKLI 824 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 54/155 (34%), Gaps = 15/155 (9%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D V+ GA +++ + I + + G T A+Q A Sbjct: 870 ADVGRDRVQFGALMYSNNPEILFYL--NTYSSRSAITEHLKRPRDTRGDTYTAKALQHAN 927 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + E H + ++ ++++TDGE + D ++ + + +GI I + Sbjct: 928 ILFM-----EEHGSRLKQNVRQLMIVITDGE-SHDRDKLNDTARELRDKGITIFAVGVGR 981 Query: 408 NKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 982 ------ANQDELETMAGKKENTIHVDNFDKLRDIY 1010 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 40/355 (11%), Positives = 97/355 (27%), Gaps = 24/355 (6%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + + + + + S + + S ++ G+ Sbjct: 305 FSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGAAIEHMRTEGFSESSGSRKAQGV 364 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQW--VIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + S +E + ++ V F+ S+ ++ + + T Sbjct: 365 PQIAVLVT---HRASDDAVREAALDLRLQGVTMFAMSVQGANNTQLEDIVSYPSRQSIST 421 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SY G +K+ + + + + ++E + ++ Sbjct: 422 HSSYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKTGCVDTKEADIYFLIDGSSSIRR 481 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + + VR+G ++ R + S + Sbjct: 482 KEFEQIQIFMSSVVDMFPIGPNNVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQL 541 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + +H E Y+++LTDG++ E N Sbjct: 542 K--GGTFTGKALDFILPIIKKGKSERIH------EVPCYLIVLTDGKSNDSVLEP---AN 590 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRI 445 + +++ I I + E + L A + L K ++ I Sbjct: 591 RLRAEHITIHAVGIG------EANKTQLQQIAGKDERVSFGQNFDSL-KYIKNEI 638 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 42/112 (37%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G AI A++ A + + + K I+ Sbjct: 2065 TFDSESIMKNYIHTSLQQL--NGDAAIGLALEWAMEGLFLGTPNPR--------KHKVII 2114 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG I ++ R + AS Sbjct: 2115 VISAGENHEEKEFVKTVALRAKCQGYIIFVVSLG------STRRDEMEELAS 2160 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 54/179 (30%), Gaps = 24/179 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S R V Sbjct: 668 TNFEKMKTFMKNLVGKIEIGADRSQVGVVQFSDYNKEEFQLNKYS------TREEVYAAI 721 Query: 329 IDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + +T A+ NE + +K+++LLTDGE + Sbjct: 722 DGMSPINRNTLTGSALTFV-------NEYFDISKGGRPQVRKFLILLTDGEAQDEVGGPA 774 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ R L + S F + L I I Sbjct: 775 M---ALRSKSVTIFSVGV-YGAN-----RTQLEEISGEGSLVFHVENFDHLKTIESKLI 824 >gi|126658524|ref|ZP_01729672.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] gi|126620266|gb|EAZ90987.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] Length = 610 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 21/184 (11%), Positives = 53/184 (28%), Gaps = 24/184 (13%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A ++ + D ++ ++D + +K G Sbjct: 62 QATEKLVDFLSPDDLLS------VIIYDDVAEVIVP---HQAVTNKQEIKAKIKKIRARG 112 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T ++ + + + ++LLTDG + + A Sbjct: 113 CTNLSGGWLLG---------CSQVKSHLSTDKLNRVLLLTDGLANIGERKPEILLKTAAE 163 Query: 396 Q---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + GI T+ F N + +++ A +F+ S + +F + + + Sbjct: 164 KAQQGIVTTTLGFGSNFNEDLLI--GMADAAG-GNFYFIQSPDDSADVFHIEMESLLSLV 220 Query: 453 VIRI 456 + Sbjct: 221 TQNL 224 >gi|14248685|gb|AAK57628.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|262198293|ref|YP_003269502.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262081640|gb|ACY17609.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 419 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 53/189 (28%), Gaps = 22/189 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A V+ + + D R+ F+ + + R ++ Sbjct: 51 RLASAIVAARQVVEQLDERD------RLSVIAFDATARTIFGPMSVTDEA-RQTLEQALA 103 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T + M+ + + S + +VLLTDG+ + + + Sbjct: 104 GLRTGVGTNLAAGMKKGAEAVRSG--FVRGALSR-------LVLLTDGQPSLGITDNDRL 154 Query: 390 CNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 C A+ +G+ I T+ E + F S ++ F + Sbjct: 155 CALAQKEADRGVTITTMGLGQG-FDDELLADLAHS--GRGGFHYLASAADIPGAFGRELS 211 Query: 447 NEIFERVIR 455 + Sbjct: 212 GVFAIAATQ 220 >gi|254480861|ref|ZP_05094107.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038656|gb|EEB79317.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 726 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 47/134 (35%), Gaps = 20/134 (14%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 T + A + G T I D++ A +++ K I+L++ Sbjct: 333 TANRQATAINRTADALSAYGYTPIADSLTLAGHDLLA-----------IDAQKHMIILIS 381 Query: 376 DGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEA 431 DGE T A+ +S GI + I F ++ A+ + AS +F+A Sbjct: 382 DGEETCG-GFPAAVAANLRSLGIDLQTHVIGFDLD----ATAQQQMQAIASAGGGQYFDA 436 Query: 432 NSTHELNKIFRDRI 445 EL I Sbjct: 437 ADGDELGASLMRVI 450 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 42/141 (29%), Gaps = 17/141 (12%) Query: 317 HKLIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + N +G T I A++ A + + I+L++ Sbjct: 81 AARSAAGISSMIDSINAVGQQTPITYALERAQQDM------------EGWLGPRKIILIS 128 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG+ T ++ ++ GI + TI + L A F A + Sbjct: 129 DGQETC-QQDPESLAEAFSGAGITVDTIGIGPPGNFAQLGMIAL---AGAGEFHLAENLA 184 Query: 436 ELNKIFRDRIGNEIFERVIRI 456 L + G + I Sbjct: 185 ALQEAMATATGGSSTGAALPI 205 >gi|297622708|ref|YP_003704142.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297163888|gb|ADI13599.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 329 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 49/154 (31%), Gaps = 12/154 (7%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + G + +++ + G+T I +D+++ + + + Sbjct: 156 SSGFRASTLLQGFTS-DKALLEAALDRVGQRGNTPI-------WDSLLDTLDLHA-ADEG 206 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 A + ++L TDGE A +R+ TI + E Sbjct: 207 GQGASRVVLLFTDGEREGGQVAFGEALAAALESDVRVFTIGLGSDIDTAELQELAAE--- 263 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + +F S EL ++F+ I ++ Sbjct: 264 TGGTFANVASAAELEELFQRAFNAIRASGTITLS 297 >gi|14248675|gb|AAK57623.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|72180809|ref|XP_798930.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] gi|115975272|ref|XP_001180569.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] Length = 964 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 62/184 (33%), Gaps = 12/184 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+DAL++++ + + D N F + S V + R F Sbjct: 364 TKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGMLYSTKENVRRAKR-----FV 418 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + EM +T ++ A+ + + + + +E + + +++LTDG + Sbjct: 419 MGLQEMDNTNLHKAIISGVNMLRAESEQDPQEEEIVS----MLIVLTDGNPNHGEIDKTI 474 Query: 389 ICNK---AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I A + + I F + R L N E E + F + Sbjct: 475 IERNVHEAINGDFSLFCIGFGADADYPFLRRLSLQNHGVARRIPERADAGEHLENFYYEV 534 Query: 446 GNEI 449 + Sbjct: 535 ATPL 538 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 81/362 (22%), Gaps = 66/362 (18%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + ++ + + +A + V Sbjct: 5 AAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTLSAP 64 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 ++ G + + + + I V+D S SM + QP Sbjct: 65 AREKTVTANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQP 124 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 ++ I D+ + + Sbjct: 125 TKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGT------------------- 165 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +K ND R G + +N + ++ V L Sbjct: 166 -------------------------EKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLT 199 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 T + G+T+ + A A + + AKK ++ TDGE Sbjct: 200 ST-----VNGLSASGATSADYAFNRAQAALT---------YQPRANAKKVVIFFTDGEPN 245 Query: 381 QDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + A K G I +I + + +S + + Sbjct: 246 HGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDAT 305 Query: 434 TH 435 Sbjct: 306 AT 307 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 81/362 (22%), Gaps = 66/362 (18%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + ++ + + +A + V Sbjct: 5 AAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTLSAP 64 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 ++ G + + + + I V+D S SM + QP Sbjct: 65 AREKTVTANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQP 124 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 ++ I D+ + + Sbjct: 125 TKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGT------------------- 165 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +K ND R G + +N + ++ V L Sbjct: 166 -------------------------EKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLT 199 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 T + G+T+ + A A + + AKK ++ TDGE Sbjct: 200 ST-----VNGLSASGATSADYAFNRAQAALT---------YQPRANAKKVVIFFTDGEPN 245 Query: 381 QDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + A K G I +I + + +S + + Sbjct: 246 HGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDAT 305 Query: 434 TH 435 Sbjct: 306 AT 307 >gi|293571190|ref|ZP_06682227.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608742|gb|EFF38027.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1219 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 45/435 (10%), Positives = 110/435 (25%), Gaps = 79/435 (18%) Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 Q E + + + + A E + V+ +L Sbjct: 167 AVATQGTTEVSATSPTTAEEITSSQETETTAQTEEASEATSSEINSSRVVVGASNLEYLS 226 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY-QRDSEGQPLN 202 S+ + + +A + S + VI+ + N Sbjct: 227 GISAFSSINYSNIAPEYTTDATKGTFSTNSWQPAGQTNVINHQGNKERNFATWDGVTSWN 286 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSP-------YMVSCNKSLYYMLYPGPLDPSLSE 255 + Y + IR ++ + + P + + Sbjct: 287 GDPSDTTHSYIQYGEKQTDFAIRKYAKETSEPGLYDVYLNVKGNEQQDIKPVDIVLVVDM 346 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 ++SS + R+ + + +++IK + D V +G F+ + Sbjct: 347 SGSMESSQSNGWNDRAGAARNGVKNFLQTIKDA-GIGDYVNVGLVGFSSPGYVTGPNGYL 405 Query: 316 VHKLIRTIVKTFAIDENEM------GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + + N+ G T ++ + + K Sbjct: 406 TVPIGKASDTSHINAINDALKPKFTGGTYTQIGIEQGQQMLA-----------GSSNENK 454 Query: 370 YIVLLTDGENTQDN---------------------------------------------- 383 +++LTDG T Sbjct: 455 MMIVLTDGVPTFSKKVTAAQTIDGTTYATKFGNTLDEPRNTSKLNSSYEVGSWGNRTNIN 514 Query: 384 ---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN----CASPNSFFEANSTHE 436 + AK G+ I T+ ++K + + A+P + +A +T++ Sbjct: 515 STWPATLGAAKIAKDAGLTIHTLGIQLSKDGNFLTEQQVRDRASLIATPGKYKDAETTND 574 Query: 437 LNKIFRDRIGNEIFE 451 ++ ++ N + Sbjct: 575 VSDYLNEQAKNVVKS 589 >gi|163849593|ref|YP_001637636.1| TadE family protein [Methylobacterium extorquens PA1] gi|163661198|gb|ABY28565.1| TadE family protein [Methylobacterium extorquens PA1] Length = 135 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 43/138 (31%), Gaps = 8/138 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + K G + AL++P+++ + V+ Y + AA++A + L + Sbjct: 2 RSFRKDQRGVISVEFALVVPLLITLVFAAVEFGGILYT----QAAAESATNNVARQLATN 57 Query: 72 LEEVSSRAKNSF----TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + K + + ++ + + + N + A + Sbjct: 58 RITQAQAKKAVTPLLPVWARVNVDVAVTASNATKPESNLYTVATTIPMAQAAPTGFLKFI 117 Query: 128 AYQVVLSSRYDLLLNPLS 145 +S+ P S Sbjct: 118 YVDRNISATVTQQQEPTS 135 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 59/195 (30%), Gaps = 36/195 (18%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K++ + R+G F+ ++ L I T +N ++ Sbjct: 107 LKELTKSFPSDRIGLIIFSSEAFMQCPLTFDQSVLQLYIDGLN-TGLVPNFGTDLNAPLR 165 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A D ++ K ++L++DGEN D E I ++ K+ G+++ + Sbjct: 166 IALDRFQ--------NDESQEVKSKSVILISDGENFGDELENIG--SELKNLGVKVFALG 215 Query: 405 FSVNKTQQEK-------------------ARYFLSNCA--SPNSFFEANST----HELNK 439 + L A + +FE + +L K Sbjct: 216 IGTESGSTIPRGNGIVMDPQTGEPAQTVLDKRPLQQIAAETDGQYFEISDEVQEVADLIK 275 Query: 440 IFRDRIGNEIFERVI 454 G R + Sbjct: 276 RLERLEGGITGSRTV 290 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 53.5 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALRHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTTKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 L S + F + ++ Sbjct: 192 FAFGVGQVDFNT------LKAIGSEPHEDHVFLVANFSQIET 227 >gi|307294184|ref|ZP_07574028.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] gi|306880335|gb|EFN11552.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] Length = 417 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 40/352 (11%), Positives = 98/352 (27%), Gaps = 12/352 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L++ TG+ ++ A MP+++G G+ D V+W+ ++ +++ A +A + + + Q Sbjct: 8 RLLRDRTGNVLMMAAACMPILIGAAGLATDTVQWTLWKRQVQRQADSAALAGAYAVAQGF 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 S + ++ + T + V ++ + + Sbjct: 68 SASDSATSDI-----NRLSLITLSQTPTIENAPTTGSYAGNNKAVRVVLQTTKELPFSKI 122 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L ++L + + G + K+ A ++ + + + S Y Sbjct: 123 LGIGAPVILGEATAAIVGSGDYCVVSLEKSSATGITLQGNATVNLGCGMATNSRASNAVY 182 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 S T S + + + S L Sbjct: 183 AGGSSTVTATPVAAVGGLTSSSNYATGTTLLPYSIPVQDPYASLPTPTAANLTGCSNLNV 242 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 ++ K + ++ + ID + V A V Sbjct: 243 QPNNSRNITA---GCYKNIDV--KGTLNMAPGVYYIDAGSFDVGSQAVINGTDVTII--L 295 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + T I+ + + Y I+ + N Sbjct: 296 TSSNASNNPGSIATVNINGGATLNLTAPTDLNDPYHGILFYQDRRALDSGTN 347 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 452 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 453 DMTGGLRFYANKDLNSLIDAFSRI 476 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D +D VR+GA ++D G + ++++ + G T A+ Sbjct: 841 VKKADVGSDRVRIGALKYSDYPEILFHL--GKYSNRSSVIEHLRRRRSTGGDTYTARALD 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 E + + K+ ++++TDG + D K +++GI I + Sbjct: 899 HTNMMFT-----EEYGSRIQQNVKQMLIVITDGV-SHDRNLLNETALKLRNKGIDIYAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIF 441 + + L A S F ++ ++L I+ Sbjct: 953 VG------QADQLELEAMAGNKSKTFHVDNFNKLKDIY 984 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 49/157 (31%), Gaps = 21/157 (13%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDT 349 D +G F+D+ + + + + +T +A+ + Sbjct: 661 GPDRTHIGVIQFSDKTREEFQL---NKYFTQNEISDAIDRMSLIDKNTLTGNALISVDQY 717 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + KK+++L+TDGE + + +G+ I ++ Sbjct: 718 FTPAKGARIGI-------KKFLILITDGEAQDAVRDP---AKALRDKGVVIFSVGV-YGA 766 Query: 410 TQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 R L + + F+ S +L I I Sbjct: 767 N-----RTQLEEISGDGNLVFQVESFDDLKAIESKLI 798 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 61/204 (29%), Gaps = 26/204 (12%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI-- 288 + ++ + L +D+ IK+ Sbjct: 408 NFQKKLRNEIWSQISTRAEQMDLDRTGCIDTKEADIYFL-IDGSTSIQGKHFEQIKEFML 466 Query: 289 ------DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 D V++GA ++D++ + + + + + E G+T +A Sbjct: 467 AVTGMFSIGPDKVQVGAVQYSDKMRVEFYINDNSNNVN--LKNAILNIEQLQGNTYTGEA 524 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + I + R + Y+++LTDG +T D E + +++ + I Sbjct: 525 LNFTLSII------KEDRKLRTSQVPCYLIVLTDGRSTDDVLEP---AERLRAEQVTIHA 575 Query: 403 IAFSVNKTQQEKARYFLSNCASPN 426 + E + L A Sbjct: 576 VGIG------EAIKVQLQQIAGGE 593 >gi|329850448|ref|ZP_08265293.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] gi|328840763|gb|EGF90334.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] Length = 395 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 34/300 (11%), Positives = 78/300 (26%), Gaps = 27/300 (9%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 +I L +++ GG +D+ S H ++ A A++T S ++ Sbjct: 1 MIYGLSATLVIAAGGGALDLFNASNLRHDMQNALDAAVLTGVRA-------SSQMGTSAS 53 Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 KQ + + + ++ + T + L Sbjct: 54 NAFKQNVADDMDGASQSYSSAVSSSSASSSSYVTT---------------TLTGTASLES 98 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + F++ +G+ + K+ A+ + +V+D S S Sbjct: 99 PTYFMKLLGLNDLTVSVKSVAQGTTTIAPAGKP--CIYVLDPSGSQALLVNSGANVQALS 156 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEK--LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 T + N + +K + + N L G + + Sbjct: 157 CEIHVKSTGNPAAIFNSGSSLNFKKLCVQGTNIIKNSVTVPNLVTGCAASGDPYAGTLPT 216 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + A ++ + + F V + W V+ Sbjct: 217 PASSTCTYSNQNYSAATQNLTPGVYCGWFNFNN-SSATVNFAPGVYVIKNGGWNVNGGTW 275 >gi|89053332|ref|YP_508783.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88862881|gb|ABD53758.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 686 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 39/298 (13%), Positives = 80/298 (26%), Gaps = 23/298 (7%) Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 DL L PL + + T S ++ + ++ Sbjct: 209 GTGDLALAPLPEDFANADDNPLRVVADDPVSTFSIDVDTASYALLRSTL-NRGALPAPDA 267 Query: 195 DSEGQPLNCFGQPADRTVKSYSS-QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + +N F S V + + +P + + L P L Sbjct: 268 VRIEEMVNYFPYDYPAPTADDISPFRPNVQVFETPWNPDTQLVHIGIQGDLPVVEDRPPL 327 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D+S + K L+ + ++ + D V + + Sbjct: 328 NLVFLIDTSGSMNDPAKLPLLIQSFRLMLNRLSPEDEVA------IVTYAGSAGVALEPT 381 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + GST ++ AY M + E + + L Sbjct: 382 ---AASDTATINAALTTLQAGGSTNGVGGLEEAYRL--------AGEMMVDGEVSRVL-L 429 Query: 374 LTDGENTQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 TDG+ + A+ + + GI + + F Q + + N S+ Sbjct: 430 ATDGDFNVGLSDAGALEDYIAEQRDTGIYLSVLGFGRGNLQDDTMQALAQNGNGTASY 487 >gi|300727143|ref|ZP_07060562.1| BatB protein [Prevotella bryantii B14] gi|299775687|gb|EFI72278.1| BatB protein [Prevotella bryantii B14] Length = 340 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 25/224 (11%), Positives = 63/224 (28%), Gaps = 50/224 (22%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + +++ ++G F + + Sbjct: 97 DISNSMLAEDVAPSRLTKSKMLVENLVDHFN-------NDQIGLVVFAGQSYVQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++ + T I A+ T N + K I+++T Sbjct: 150 YVSA-KMFLQDIQPSLIQTQGTDIAGAINTCMHAFTP-----------NDKVGKAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QEKARYFLSN------------ 421 DGE+ + I A +GI + + +K +L + Sbjct: 198 DGEDHEGGA--IEAAKAAHDRGINVFILGIGDSKGAPIPMGNGDYLKDRSGNTVMTAVNE 255 Query: 422 --C-----ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 C A + ++T + N++ + + R+ K Sbjct: 256 QMCKQIAEAGSGKYIHVDNTKD--------AENQLNDDLSRLQK 291 >gi|301026928|ref|ZP_07190323.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879508|gb|EFI87719.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 575 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 93/330 (28%), Gaps = 30/330 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 142 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 256 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 257 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QATKGFIKGG 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 312 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ ++ E K+ + Sbjct: 365 I--ADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 60/166 (36%), Gaps = 19/166 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D ++G + RV + + H ++++ + G+ + A+ Y+ + Sbjct: 567 NRDVTQIGLVVYGSRVQTTFAL--DTHPTSSSLLQAISQAPYMDGAGSTGSALLHVYEEV 624 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ + + K +V++T+G T+D K ++ GI ++ IA Sbjct: 625 MTVQKGARPGV------SKAVVVITEGTGTEDAVVP---AQKLRNNGISVLVIAVG---- 671 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKI---FRDRIGNEIFERV 453 L SP+ S +L F +RI E V Sbjct: 672 -PVLKETLLRLAGSPDFLIHVASYEDLENYQDLFIERICEEAKRPV 716 >gi|16130205|ref|NP_416773.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89109088|ref|AP_002868.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161758|ref|YP_001459076.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|238901445|ref|YP_002927241.1| hypothetical protein BWG_2044 [Escherichia coli BW2952] gi|256022046|ref|ZP_05435911.1| hypothetical protein E4_01620 [Escherichia sp. 4_1_40B] gi|300948978|ref|ZP_07163036.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300956471|ref|ZP_07168759.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|301647634|ref|ZP_07247429.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|307138934|ref|ZP_07498290.1| hypothetical protein EcolH7_12533 [Escherichia coli H736] gi|331642908|ref|ZP_08344043.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] gi|2495629|sp|P76481|YFBK_ECOLI RecName: Full=Uncharacterized protein yfbK gi|1788606|gb|AAC75330.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675335|dbj|BAE76678.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067438|gb|ABV06693.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|238860346|gb|ACR62344.1| conserved protein [Escherichia coli BW2952] gi|260448637|gb|ACX39059.1| von Willebrand factor type A [Escherichia coli DH1] gi|300316719|gb|EFJ66503.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300451549|gb|EFK15169.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|301074238|gb|EFK89044.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|309702582|emb|CBJ01910.1| putative lipoprotein [Escherichia coli ETEC H10407] gi|315136904|dbj|BAJ44063.1| hypothetical protein ECDH1ME8569_2207 [Escherichia coli DH1] gi|331039706|gb|EGI11926.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] Length = 575 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 93/330 (28%), Gaps = 30/330 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 142 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 256 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 257 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QATKGFIKGG 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 312 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ ++ E K+ + Sbjct: 365 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQ 392 >gi|332531455|ref|ZP_08407359.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] gi|332039124|gb|EGI75546.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] Length = 346 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 46/177 (25%), Gaps = 46/177 (25%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS-----NEDEVHRMKNNLEAK---- 368 R + TAI + + + I + R Sbjct: 144 TTNREDLNAAIDRFQLQRGTAIGNGIVLSLAAIFPEVGIDLGQFSYGRPGQPRGPSLDMP 203 Query: 369 ----------------KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ- 411 I+LLTDG+ T + + A +G+R+ T+ + Sbjct: 204 PAGPPPTPVAPGSYGSAAIILLTDGQRTTG-IDPLEAAKLASERGVRVYTVGVGTVEGVT 262 Query: 412 ---------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 L + A + +F A S L +++ R+T Sbjct: 263 VGFEGWSMHARLDEESLKHIAQQTRAEYFHAASAEALTQVYETL--------SSRLT 311 >gi|323941033|gb|EGB37220.1| von Willebrand protein type A [Escherichia coli E482] Length = 565 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 93/330 (28%), Gaps = 30/330 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 74 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 132 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 133 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 192 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 193 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 246 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 247 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QATKGFIKGG 301 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 302 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 354 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ ++ E K+ + Sbjct: 355 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQ 382 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 81/362 (22%), Gaps = 66/362 (18%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + ++ + + +A + V Sbjct: 5 AAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTLSAP 64 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 ++ G + + + + I V+D S SM + QP Sbjct: 65 AREKTVTANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQP 124 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 ++ I D+ + + Sbjct: 125 TKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGT------------------- 165 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 +K ND R G + +N + ++ V L Sbjct: 166 -------------------------EKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLT 199 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 T + G+T+ + A A + + AKK ++ TDGE Sbjct: 200 ST-----VNGLSASGATSADYAFNRAQAALT---------YQPRANAKKVVIFFTDGEPN 245 Query: 381 QDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + A K G I +I + + +S + + Sbjct: 246 HGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDAT 305 Query: 434 TH 435 Sbjct: 306 AT 307 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 68/264 (25%), Gaps = 24/264 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 241 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 300 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 301 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 355 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 356 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 414 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 IVLLTDGE+ C G I TIA + LS Sbjct: 415 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG---PSAARELETLS 457 Query: 421 NCASPNSFFEANSTHELNKIFRDR 444 + F+ + L F Sbjct: 458 DMTGGLRFYANKDLNSLIDAFSRI 481 >gi|14248629|gb|AAK57600.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248679|gb|AAK57625.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248681|gb|AAK57626.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248683|gb|AAK57627.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|161086896|ref|NP_001104313.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform a [Mus musculus] gi|46576352|sp|O08532|CA2D1_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|1905817|gb|AAB50138.1| voltage-gated calcium channel alpha2/delta subunit, alpha2a isoform [Mus musculus] gi|148671294|gb|EDL03241.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Mus musculus] Length = 1103 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|14248687|gb|AAK57629.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|553647|gb|AAA60558.1| inter-alpha-trypsin inhibitor heavy chain old gene name 'ITI' [Homo sapiens] Length = 482 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 43/127 (33%), Gaps = 11/127 (8%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 1 GGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLSKIQKNVK 56 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + ++ + R N + T K F + ++ Sbjct: 57 ENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN----QVST 112 Query: 452 RVIRITK 458 ++R + Sbjct: 113 PLLRNVQ 119 >gi|14248691|gb|AAK57631.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|42407699|dbj|BAD08847.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408121|dbj|BAD09261.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] Length = 703 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 49/176 (27%), Gaps = 32/176 (18%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F+ + + K+ + G T I + + A Sbjct: 300 DRLAVVSFSYSARRVIRLTRMSEGGKASA-KSAVESLHADGCTNILEGLVEAAKVFD--- 355 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--------------SQGIRI 400 + A ++LL+DG++ + G + +K + + + Sbjct: 356 ------GRRYRNAVASVILLSDGQDNYNVNGGWG-ASNSKNYSVLVPPSFKRSGDRRLPV 408 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 T F + + A + +F + + F IG + V Sbjct: 409 HTFGFGTDHD-----ASAMHTIAEETGGTFSFIENQAVVQDAFAQCIGGLLSVPVQ 459 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 44/150 (29%), Gaps = 13/150 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + + +++ + + F + L + Sbjct: 37 ISELNEGINCFYDAVRSDETALYAAEIAVVTFGGSAV----LKTDFSTLEHQPDSP---N 89 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----EG 386 G T + +AM A D + ++ + +IVL+TDG+ D+ Sbjct: 90 FFANGGTPMGEAMNMALDLL--EKRKGEYKASGVDYYQPWIVLMTDGKPNGDSSEYARAV 147 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 C K++ + I I + A Sbjct: 148 QRTCEMIKNRKLTIFPIGIGEDADMNALAA 177 >gi|217966673|ref|YP_002352179.1| von Willebrand factor A [Dictyoglomus turgidum DSM 6724] gi|217335772|gb|ACK41565.1| von Willebrand factor type A [Dictyoglomus turgidum DSM 6724] Length = 888 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 49/164 (29%), Gaps = 29/164 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L +++ V+ ++ D G F+ + + Sbjct: 411 MKMELAKESAQLVLDLLEDKDY------FGLIAFDHSYQWIVPLQPLT---NKEEAASLI 461 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G TA+ +++A ++++ K+I+ +TDG+ + Sbjct: 462 SRISPGGGTALYPPLKSAGESLL-----------KVPIKSKHIIAITDGQTEGGD--FYN 508 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFE 430 + I + TI + L + A F+ Sbjct: 509 LVRNLAKYKITVSTIGIGEDANIP-----LLKDIANWGNGRFYH 547 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 63/196 (32%), Gaps = 20/196 (10%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + ++ F+ SS V++ + +++ + D D R+G + V Sbjct: 49 SSCENKRADPVFIIDSSRSVNTHDYAKVKEFIVDILQFL---DIGPDVTRVGLLQYGSTV 105 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ S K + + + T A+Q A + S E Sbjct: 106 KNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LREN 160 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-- 424 + I+++TDG + + + KA+ GI I I L + S Sbjct: 161 VPRVIMIVTDG---RPQDSVAEVAAKARDMGILIFAIGVGQVDFNT------LKSIGSEP 211 Query: 425 -PNSFFEANSTHELNK 439 + F + ++ Sbjct: 212 HEDHVFLVANFSQIET 227 >gi|149915102|ref|ZP_01903630.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] gi|149810823|gb|EDM70662.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] Length = 444 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 55/188 (29%), Gaps = 26/188 (13%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A ++ ++ D R+ F++ G + + + Sbjct: 63 AKRAADQIVAGLRPSD------RLAIVAFDNATEV---MFSGGPRGDGQAARAALSRIHA 113 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---C 390 G TA++D + I+ E + LL+DG + AI C Sbjct: 114 RGMTALHDGWLLGVEQSIAMREAGTPAR---------VFLLSDGVANVGLTDASAIAADC 164 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG--NE 448 + GI T + + A A + + + +L F Sbjct: 165 TRMAEHGITTSTCGLGMGFNEDLMAEM---ARAGRGNAYYGETAEDLQDPFEQEFDLLRN 221 Query: 449 IFERVIRI 456 I R +R+ Sbjct: 222 ICARGLRL 229 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 630 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 684 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 685 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 738 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 739 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 792 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 793 ICEALEDSDGR 803 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 430 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 484 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 485 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 538 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 539 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 592 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 593 ICEALEDSDGR 603 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 387 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 441 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 442 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 495 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 496 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 549 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 550 ICEALEDSDGR 560 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 731 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 785 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 786 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 839 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 840 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 893 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 894 ICEALEDSDGR 904 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 140 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 197 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 198 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 251 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 252 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 287 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 252 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 306 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 307 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 360 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 361 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 414 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 415 ICEALEDSDGR 425 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 456 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 510 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 511 KYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWA 564 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 565 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 618 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 619 ICEALEDSDGR 629 >gi|304320961|ref|YP_003854604.1| hypothetical protein PB2503_06982 [Parvularcula bermudensis HTCC2503] gi|303299863|gb|ADM09462.1| hypothetical protein PB2503_06982 [Parvularcula bermudensis HTCC2503] Length = 432 Score = 53.5 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 50/170 (29%), Gaps = 8/170 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 F T + G ++ L++ LG+GG++ D R L+ Sbjct: 7 FWTVARQTLARFCTEERGSIAVMMMLMLVPALGIGGLVFDGKRIHTAHLELEI------- 59 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + ++ + E+ L + + + Sbjct: 60 -VAESAAIAAALQLPSESSAKSAAVDYAEQNLPPSSYGEVVRASDVEIGSYDEGTGTFTA 118 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 SA +V D + ++F ++G S + A A + S S Sbjct: 119 GAGTSAVRVTAWRHEDRSNSLPTIFAGAIGRDSVNLSASAIAVSQSSSED 168 >gi|307180277|gb|EFN68310.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Camponotus floridanus] Length = 2214 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 40/121 (33%), Gaps = 22/121 (18%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + K F G T A+ A + K+ +A K + L+TDG + Sbjct: 167 LNKQFGNITYSGGGTYTRGALLEALTIL----------EKSREKANKVVFLITDGFSNGG 216 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + K+ G + T ++ L + ASP + + L F Sbjct: 217 DPRP--AADLLKNTGATVFTFGIRTGNVEE------LHDIASPPGY----THSYLLDSFA 264 Query: 443 D 443 + Sbjct: 265 E 265 >gi|161086900|ref|NP_001104315.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform c [Mus musculus] gi|1905821|gb|AAB50140.1| voltage-gated calcium channel alpha2/delta subunit, alpha2c isoform [Mus musculus] gi|109732367|gb|AAI15872.1| Cacna2d1 protein [Mus musculus] gi|148671297|gb|EDL03244.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Mus musculus] Length = 1086 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 51/166 (30%), Gaps = 21/166 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAI 339 +I I D VR G ++D + +K+ G T Sbjct: 646 MIELISTFRVGADHVRFGVVQYSDSPTVEFDIRQHSSVAQ---LKSAITKIWQTGGGTRT 702 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 +A+ D + ++++++TDG Q ++ + + + I Sbjct: 703 GEALTFMKRLFSEVARD---------KVLRFLIVITDG---QSQDQVAQAAEELRQENIT 750 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I I + + L S N F N L I ++ I Sbjct: 751 IYAIGV-----KSAVTKELLEISGSQNRMFFVNDFDSLKPIQQEVI 791 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 47/134 (35%), Gaps = 15/134 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G FND + + K +++ G T A+ Sbjct: 50 GRDQVRVGLAQFNDNIYKAFLLNQFPRKSD--VLEQILSLPYRTGGTRTGSALNFLRTEF 107 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + + + ++L+TDGE N+E +K K QG+ I + +V Sbjct: 108 FTESAGSRAK----DNVPQIVILVTDGE---SNDEVAEAASKLKGQGVSIYVVGINV--- 157 Query: 411 QQEKARYFLSNCAS 424 + L AS Sbjct: 158 ---QDVQELKTIAS 168 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/133 (14%), Positives = 43/133 (32%), Gaps = 11/133 (8%) Query: 276 DALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 A+ + ++ I D VR+G ++ + + Sbjct: 1203 TAMKTFMKQIVNSFTIGKDRVRIGVAQYSTNPQKEFYLNTFYSGAEINQHIDKITQLRTQ 1262 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 T ++ +N + NL + +V++TDG + + N + Sbjct: 1263 TYT--GKGLRFVKSFFEPAN-----GSRKNLHVLQSLVVITDGMSNDS---VVEAANDLR 1312 Query: 395 SQGIRIMTIAFSV 407 ++ I+I +I V Sbjct: 1313 NEKIQIFSIGIGV 1325 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 50/171 (29%), Gaps = 14/171 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + R I ++ D ++G + ++ + + I Sbjct: 223 TSSVSFNQMKDFICRVIDTLEVGRDKDQIGLAQYGNQGHVEF-LLNAYQNPVEMISHIQQ 281 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G+ + +Q +T Y V++T G Q + + Sbjct: 282 NFLPRGGARKTGNGLQYIQETFFQEEAGSRFLQGIPQ----YAVVITSG---QSEDLVLE 334 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 K K +G++IM + Q+ L A+P FE + + Sbjct: 335 KAQKLKERGVKIMVVGI------QDFDSRELKAMATPPLVFEIEGQDGIRQ 379 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 17/159 (10%), Positives = 44/159 (27%), Gaps = 19/159 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +++ + K D + V++G F+ + + K+ T + Sbjct: 829 MMQMVNKSDLGPEKVQIGLLQFSSNPQEEFRLNTYYSKVDILRAITGMVQIRA------- 881 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIR 399 + S E R + I++++ + + + + +GI+ Sbjct: 882 -GARVGSALSFSLPYFERSRGGRLNVPQYLIIIIS-----GKTGDAVKMPAKALRDKGIK 935 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 I I Q + + + L Sbjct: 936 IFAIGVHKANNSQLLEITGAQ-----DKVYYEENFDSLL 969 >gi|161086898|ref|NP_001104314.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform b [Mus musculus] gi|1905819|gb|AAB50139.1| voltage-gated calcium channel alpha2/delta subunit, alpha2b isoform [Mus musculus] Length = 1091 Score = 53.1 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 53/186 (28%), Gaps = 28/186 (15%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ S+ + ++ V + G R I Sbjct: 33 FELVKKFINQIVDSLDVSEQNAQVG---LVQYSSSVRQEFPL--GRFTSKRDIKAAVKKM 87 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+ D + + +K ++ TDG + + Sbjct: 88 TYMEKGTMTGTALNYLIDNTFAISSGARPGA------QKVGIVFTDGRSQDYISD---AA 138 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGN 447 KAK G ++ + L AS +F +N+ IG Sbjct: 139 KKAKDLGFKMFAVGVG------NAVEDELREIASDPVAEHYFYTADFKTINQ-----IGK 187 Query: 448 EIFERV 453 ++ +++ Sbjct: 188 KLQKKI 193 >gi|14248669|gb|AAK57620.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGIGQGIN 140 >gi|87199928|ref|YP_497185.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87135609|gb|ABD26351.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 153 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 35/132 (26%), Gaps = 3/132 (2%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+FI + +L G I A+L PV++ G V+V L+ A Sbjct: 2 TRFIPFLSRLRACRDGAIMIEVAVLTPVLVLFGLGTVEVSSLVARRSELQSAL---AEAV 58 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ L + S + + V + Sbjct: 59 AIALASKPDTQSKIDTIESVISASTGVSTANIDTAVIYRCGIDPGYVTLPGLCSQSGEIS 118 Query: 125 RKSAYQVVLSSR 136 + + + Sbjct: 119 KYVQLTIRDTYT 130 >gi|83749631|ref|ZP_00946614.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|207728099|ref|YP_002256493.1| hypothetical protein RSMK04469 [Ralstonia solanacearum MolK2] gi|207744165|ref|YP_002260557.1| hypothetical protein RSIPO_02350 [Ralstonia solanacearum IPO1609] gi|83723692|gb|EAP70887.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|206591344|emb|CAQ56956.1| conserved hypothetical protein [Ralstonia solanacearum MolK2] gi|206595569|emb|CAQ62496.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609] Length = 345 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/337 (10%), Positives = 87/337 (25%), Gaps = 21/337 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G +++ +L+ + L +G M VD+ + L+ AA A + + L + + Sbjct: 11 RRQRGAVGVLSPILLIIFLAIGAMAVDIAHLFVVRNELQNAADAAALAGAAGLYPANPKP 70 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + K+ + + + P + Sbjct: 71 NWSNGVAQGTSAVKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSNDVP---GV 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + +P + G +W+ A + + S F ++ D Sbjct: 128 QVTVTRSPGNNGGPVKGWLTWVFNGGAASIQATAVAVIAAPGSANPGSLFPVALNKCLFD 187 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 PY++ N S Sbjct: 188 LYWNYTTGQPLN-----------------DPSTGQPYVIDINTSYPPSSMTCASGEWTGF 230 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 D+S+ + ++ + ++ I I + + A + + G Sbjct: 231 NGPTDASTEKSLVSSGNPTNISIGQNIN-ISTGVKTSVYNAIPALPLTVTMPVVSPLTPG 289 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + GS + + TA +++ Sbjct: 290 ATSPVYAFAGFTITKIVTNGSHSYIEGHFTANQKVVN 326 >gi|282900972|ref|ZP_06308905.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] gi|281194063|gb|EFA69027.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] Length = 464 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 78/245 (31%), Gaps = 24/245 (9%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 ++ + + + + + E+ + + K +V ++ Sbjct: 34 KDIATSLPPTTFTFVIDTSGSMYEVVAGESKPTGVTYEQDGKEYREVTGGKSKIDIVIES 93 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L +++ S + R+ FND S + + +T N G T Sbjct: 94 LLALVNSGRL----KQQDRIAIVQFNDSASSIIGLTSATE-IKKTKKTAINNLRNFSGGT 148 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + ++ A+D + K ++L TDG+ T D ++ +I N ++ Sbjct: 149 RMGLGLRRAFDILSEQ-----------EMTVKRVLLFTDGQ-TFDEDQCQSIANDFATRN 196 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN-----STHELNKIFRDRIGNEIFER 452 I + + + + LS+C F+ + + + + I + Sbjct: 197 IPVTALGVGEEFNEDLLSH--LSDCTGGKLFYVVSGIAKGTETSILDLPGKIIAEYSEAQ 254 Query: 453 VIRIT 457 IT Sbjct: 255 QEVIT 259 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 63/185 (34%), Gaps = 20/185 (10%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTI 323 + ++++A+ +IR + D R+ FNDR V + + R I Sbjct: 71 NKPPSRLDVLKEAMKFIIRKLDDGD------RLSIVAFNDRPVKEYSTGLLNISGNGRRI 124 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + G TA+ A++ A + D + + +I+LLTDG++T Sbjct: 125 AEKKVDWLEARGGTALMPALEEAIRVLDCRPGDSRNSVG-------FILLLTDGDDTSGF 177 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS--THELNKIF 441 + N A + + T + + S ++ + ++ Sbjct: 178 RWSRDVINGAVGK-YPVHTFGLGAAHSSEALLHIAQE---SRGTYSFVDDENMDKIAGAL 233 Query: 442 RDRIG 446 IG Sbjct: 234 AVCIG 238 >gi|150251390|gb|ABR68007.1| matrilin-like 40 kDa protein [Lehmannia valentiana] Length = 390 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/166 (22%), Positives = 63/166 (37%), Gaps = 19/166 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 N R GA F+D V + + + K G+T N+ A+D I Sbjct: 242 GNSKARFGALLFSDFVENLFYL--NKYTSTADVSKAILRAPYHRGTTLTNE----AFDFI 295 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + + +V TDG++T+ +A + K Q +RI+ + Sbjct: 296 RTEGVFSTPKGGRSNAPDIVVVF-TDGQSTKP-ALTLAAADNLKRQNVRIVAVGIG---- 349 Query: 411 QQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 E ++ L AS + FEA+S L D I ++ + V R Sbjct: 350 -NEVSKEELRQVASSRDDVFEASSFENL-----DYIEQKLAKNVCR 389 >gi|148676905|gb|EDL08852.1| matrilin 2, isoform CRA_a [Mus musculus] Length = 836 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 75 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 130 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 131 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 185 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 186 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 236 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 237 PHKDHVFLVANFSQIESLTSVFQN 260 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + L S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTLKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 52/201 (25%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVAPNRMAAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 K TA + + TA I + + IVL +DG+ Sbjct: 162 -----KNALDKLQFADRTATGEGIFTALQAIATVG---AVIGGGDKPPPARIVLFSDGKE 213 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T AK QG+ I TI+F E L A Sbjct: 214 TMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQL 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 S + + A S EL ++ Sbjct: 274 SGGNAYNAASLQELKSVYATL 294 >gi|218516298|ref|ZP_03513138.1| hypothetical protein Retl8_22869 [Rhizobium etli 8C-3] Length = 432 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/263 (13%), Positives = 77/263 (29%), Gaps = 23/263 (8%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D Q + +N F K+ V + + + ++ Sbjct: 1 PDPQSVRVEEMINYFPYDWPGPEKADQPFKATVTVMPTPWNHDTELMHVAIKGYDIAPTT 60 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 P + +D S K L++ A ++ +K D V+ + + Sbjct: 61 APHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVNRLKADDTVS------IVTYAGNAGTV 114 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + + + GST + ++ AYD + + Sbjct: 115 LEPT---RVAEKPKILSAIDRLEAGGSTGGAEGIEAAYDL--------AKKAFVQDGVNR 163 Query: 370 YIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + L TDG+ +E+ I + + GI + + F + N Sbjct: 164 VM-LATDGDFNVGPSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQN--GNG 220 Query: 427 SFFEANSTHELNKIFRDRIGNEI 449 S ++ E K ++ G+ + Sbjct: 221 SAAYIDTLAEAQKTLVEQAGSTL 243 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 59/203 (29%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + + K Sbjct: 117 NRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKAALDKLQF 170 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 D TA +A+ TA I + + IVL +DG+ T Sbjct: 171 ADR-----TATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNP 222 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E + A S + + A Sbjct: 223 KGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAA 282 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 + ELN ++ +I IR Sbjct: 283 TLAELNSVYVSL-QQQIGYETIR 304 >gi|47229095|emb|CAG03847.1| unnamed protein product [Tetraodon nigroviridis] Length = 608 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 38/121 (31%), Gaps = 6/121 (4%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T IN A+ +A I + ++ LTDGE T G I AK Sbjct: 354 GTNINAALLSAAQLINPPSSSRHLSSHRVP----LVIFLTDGEATIGVTAGDTILTNAKK 409 Query: 396 Q--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + +AF + R L N +E K F D + + + V Sbjct: 410 ALGSASLFGLAFGDDADFLLLKRLALDNRGVARMVYEDADAALQLKGFYDEVASPLLSDV 469 Query: 454 I 454 Sbjct: 470 Q 470 >gi|2811055|sp|O07395|Y335_MYCAV RecName: Full=UPF0353 protein MAV335 gi|2183263|gb|AAC46199.1| MAV335 [Mycobacterium avium] Length = 335 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 59/221 (26%), Gaps = 37/221 (16%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + + +G + S + Sbjct: 108 SMRATDVAPNRMAAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 K TA + + TA + + + + IVL +DG+ Sbjct: 162 -----KNALDKLQFADRTATGEGIFTALQ-VQAIATVGAVIAGDKPPPAR-IVLFSDGKE 214 Query: 380 TQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA-- 423 T AK QG+ I TI+F E L A Sbjct: 215 TMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQL 274 Query: 424 SPNSFFEANSTHELNKIF--------RDRIGNEIFERVIRI 456 S + + A S EL ++ + I + +R+ Sbjct: 275 SGGNAYNARSLQELKSVYATLQQQIGYETIKGDASVGWVRL 315 >gi|161086902|ref|NP_001104316.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform d [Mus musculus] gi|1905823|gb|AAB50141.1| voltage-gated calcium channel alpha2/delta subunit, alpha2d isoform [Mus musculus] gi|148671298|gb|EDL03245.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_e [Mus musculus] Length = 1079 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|6753234|ref|NP_033914.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform e [Mus musculus] gi|1905825|gb|AAB50142.1| voltage-gated calcium channel alpha2/delta subunit, alpha2e isoform [Mus musculus] gi|148671296|gb|EDL03243.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Mus musculus] Length = 1084 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|219127465|ref|XP_002183955.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404678|gb|EEC44624.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 582 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 65/209 (31%), Gaps = 27/209 (12%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 Y + +D+S + L++ A VI ++ D Sbjct: 179 TCGVYDPRLRPWYIAASSGPKNVVLVLDTSGSMTDGNRLSLLKQAAKQVIETLTVGD--- 235 Query: 293 DTVRMGATFFNDRVI---SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F+ + D F + + + ++ T G+T DA A+ Sbjct: 236 ---RVAIVEFSSQAKLFAQDNKFLFTATQKNKELLATHIDSFTAAGATNFLDAFTAAFAV 292 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR---------- 399 + S + E H I+ LTDGE TQ A + GI Sbjct: 293 LNDSIDQEYHVGCTTA-----ILFLTDGEMTQPENVQEADVLDLVNTGISNLEARLGRSV 347 Query: 400 -IMTIAFSVNKTQQEKARYFLSNCASPNS 427 + T + S N + C++ ++ Sbjct: 348 FLFTFSISDNNNVHAFPKQI--ACSTGDN 374 >gi|47224105|emb|CAG13025.1| unnamed protein product [Tetraodon nigroviridis] Length = 983 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 76/215 (35%), Gaps = 23/215 (10%) Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 D + +P + Y VD+S + L+R ++ ++ ++ Sbjct: 50 DARKTPSKIDLYDVRRRPWYIQGAASPKDMLILVDASGSVSGLT-LKLIRTSVTEMLETL 108 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D VN +FN +V F V + ++K + G T + Sbjct: 109 SDDDYVN------VVYFNTQVKKTACFDHLVQANVRNKKLLKDAVQNITAKGITNYTKGL 162 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMT 402 + A++ + +N + K I+L TDG E AI K + +RI T Sbjct: 163 EFAFEQLSVTNV-------SRANCNKIIMLFTDG----GEERAQAILEKYNADKKVRIFT 211 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 FSV + +K C++ F+E S + Sbjct: 212 --FSVGQHNYDKGPIQWMACSNKGYFYEIPSIGAI 244 >gi|145491133|ref|XP_001431566.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398671|emb|CAK64168.1| unnamed protein product [Paramecium tetraurelia] Length = 636 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 61/188 (32%), Gaps = 23/188 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V+ +L +++ + D R+ F++ ++ ++ Sbjct: 176 KIEMVKASLIVLLQFLGDND------RLQLITFDNDAHRLTPLKTVTNQ-NKSYFTQIIK 228 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G I++A + A+ + S + LL+DG + E I Sbjct: 229 QIKANGGNRISEATKMAFYQLKSRKYINNVTS---------VFLLSDGVDYTYPEVKNQI 279 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF-LSNCASPNSFFEANSTHELNKIFRDRIGNE 448 ++ + T F + Q + L SF+ L++ F D +G Sbjct: 280 QTV--NEVFTLHTFGFGEDHDAQMMTQLCNLK----SGSFYFVQDVTLLDEFFADALGGL 333 Query: 449 IFERVIRI 456 I ++ Sbjct: 334 ISVVGEQL 341 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 64/181 (35%), Gaps = 23/181 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V++++ + + + R+G F + S L I + Sbjct: 118 TRLQAVQESVRKFVAA-------RQSDRIGLVIFASQAWPFAPVSEDKQALQTRINQ--L 168 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TAI DA+ + ++ N EA K +LLTDG +T Sbjct: 169 SPGMVGQQTAIGDALGVGVKLLDNT---------TNTEASKLAILLTDGNDTASQLAPAL 219 Query: 389 ICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSNCA---SPNSFFEANSTHELNKIFRD 443 A + +++ TIAF + + + N A S+ ANS L+ ++++ Sbjct: 220 AAQLAAAHHVQVHTIAFGDINSTGDDKVDLPLMQNIAQITGGQSWTAANSGAALDSVWKE 279 Query: 444 R 444 Sbjct: 280 I 280 >gi|323936560|gb|EGB32847.1| von Willebrand type A protein [Escherichia coli E1520] Length = 565 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 93/330 (28%), Gaps = 30/330 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 74 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 132 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 133 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 192 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 193 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 246 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 247 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QATKGFIKGG 301 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 302 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 354 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ ++ E K+ + Sbjct: 355 I--ADVGNGNYSYIDTLSEAQKVLNSEMRQ 382 >gi|303240107|ref|ZP_07326628.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302592376|gb|EFL62103.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 329 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 60/199 (30%), Gaps = 49/199 (24%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + S+I +++ R+G F+ + + Sbjct: 103 DRLSRAKNIIESIIDNLEG-------DRIGFIPFSSAAYIQMPLTDDYDLARMYLDVID- 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D G T + A+ A ++ + A + +++L+DGE N Sbjct: 155 TDMIAGGGTNVGTALNLAENSFEET-----------SSADRVVIILSDGEEHNSNSV--D 201 Query: 389 ICNKAKSQGIRIMTIAFS------VNKTQQEKARYF-------------------LSNCA 423 I + +++ TI V + + L A Sbjct: 202 ILKSFNDEHLKVFTIGIGTAKGGLVPDYGSDGGQKSGYKKDSNGEFVMSKLNSETLKKLA 261 Query: 424 SP--NSFFEAN-STHELNK 439 S S+++++ + E+ Sbjct: 262 STGKGSYYQSSLTGDEIAS 280 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 53.1 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 49/153 (32%), Gaps = 19/153 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDT 349 + R+G ++ +V S R ++ D T A+Q A + Sbjct: 67 GPNATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNV 123 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S E + + V++TDG + + + +A+++GI I + Sbjct: 124 AFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARGIEIYAVGVQRAD 177 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNK 439 L ASP F S + + Sbjct: 178 VGS------LRAMASPPLDEHVFLVESFDVIQE 204 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 361 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 415 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 416 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 469 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 470 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 499 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 44/146 (30%), Gaps = 18/146 (12%) Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+D S + K I + +T A++ + + Sbjct: 671 HFSDTSKEVFSLNKNTSKKSDIIQAVEDMSLI-GSTTNTGGALRFVSKYFKLAKQA---- 725 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K +VL+TDGE + + + ++ GI I ++ + L Sbjct: 726 ---RPSVNKILVLITDGEASDEVTAP---ATELRNDGIIIYSVGV------FNANKTQLE 773 Query: 421 NCAS-PNSFFEANSTHELNKIFRDRI 445 + P F + L I D I Sbjct: 774 EISGKPEKVFYVENFDILEDIKGDII 799 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 61/162 (37%), Gaps = 17/162 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 VI + K D N+ V+ GA ++ + + +I I +++ T Sbjct: 839 DFVIDIVNKSDVGNNRVQFGAVKYSAYPQILFNLNGNKADIIDKIKGDTLLNDT----TY 894 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A++ A + S + + ++++TDG + + + A+ + ++ GI Sbjct: 895 TAEALRHAENLFTESK-----GSRKRRGVPQLLMVITDGTSHDKD-KLDAVSTRIRNDGI 948 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNK 439 I I ++ R L A + + F ++ L Sbjct: 949 TIYAIGI------KDAKREELEIIAENKDHVFFVDTFDGLKN 984 Score = 43.1 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 63/181 (34%), Gaps = 18/181 (9%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D + IK D VR G +++ I F H I + K + Sbjct: 461 KDMKTFLKEVIKMFTVGPDHVRFGVVQYSN--IHRTEFEIDKHSTISNLEKAINNIQYLT 518 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T A+++ S+ + +++ Y+++LTDG+ +E ++ + Sbjct: 519 GDTYTGAALESMLGLFESARKQRKNKVPT------YLIVLTDGDPHDKVKEP---ADRLR 569 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 GI ++ I + L ++ + + L I + + + E Sbjct: 570 KAGINVIAIGVG------DIKWKGLQEI-GESNVYYVHQYASLKTIKDNIVQDICSEEAC 622 Query: 455 R 455 + Sbjct: 623 K 623 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 49/171 (28%), Gaps = 21/171 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +V+D L ++ SI +D ++ G F+D + K Sbjct: 1015 KNDFEIVKDFLTKLVDSISF----HDNIQFGMAQFSDIYSEEFPLG-HYQSKSELKNKIA 1069 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + T I ++ S + +K ++ TDGE+ + Sbjct: 1070 NVSMQAGLHTYIGKGLKEVKAFFKSPRRRVARNV-----HQKLLIF-TDGESKDSFTQP- 1122 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHEL 437 + +G+ I + + L SP + + L Sbjct: 1123 --AEDLRREGVEIHAVGVG------KIEHAKLQQITVSPERIYTTANYTGL 1165 >gi|206901991|ref|YP_002251775.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] gi|206741094|gb|ACI20152.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] Length = 890 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/362 (9%), Positives = 103/362 (28%), Gaps = 36/362 (9%) Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + + ++N + + + + + +++ E+ + Y+ + Sbjct: 213 KEAKLKINFDGTSQTHDLKNLQEDNIISLSLKSQNTGLKEIEAEIYSPEDTYRDNNVTSS 272 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + L +G K+ A ++ ++++ E Sbjct: 273 YIYIQGKPKVLYLIGRDFNPTFAKSLAVQGWDVVFDSKPYPQILNLNSYQAVIMDNIPQE 332 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK---SLYYMLYPGPLDPSLS 254 ++ + + + G + Y + + + +++ Sbjct: 333 DLSIDKMEILKNFVTEKGGTLLVLGGDKSFSAGNYQGTPLEEILPVTLRPEQILKKSNVA 392 Query: 255 EEHFVDSSSLRHVI----KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 +D+S K L +++ V+ +++ D G F+ Sbjct: 393 IVIVLDASGSMGSYSGGDMKMELAKESAQLVLDLLEEKDY------FGLIAFDHSYQWIV 446 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + + G TA+ +++A + +I + K+ Sbjct: 447 PLQPLT---NKEETASLISKISPGGGTALYPPLKSAGEALIKA-----------PIKSKH 492 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 I+ +TDG+ + + I + TI + L + A F Sbjct: 493 IIAITDGQTEGGD--FYNLVKYLAKYKITVSTIGIGEDANIP-----LLKDIANWGNGRF 545 Query: 429 FE 430 + Sbjct: 546 YH 547 >gi|89096888|ref|ZP_01169779.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89088268|gb|EAR67378.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 459 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 42/121 (34%), Gaps = 14/121 (11%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +T G T + A+ A + + N+ + K + +++DG T Sbjct: 229 KTAFNKSMDSFEASGWTPLAGAIDKARELLS---------AYNSTDYKNTLYIVSDGVET 279 Query: 381 QDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + + + + I ++ I F V+ Q++ + A ++ EL Sbjct: 280 C-DGDPVEAAQQLQGSNIEAKVNIIGFDVDDEGQKQLKEVAE--AGGGTYATVRDKDELE 336 Query: 439 K 439 Sbjct: 337 D 337 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 53/424 (12%), Positives = 108/424 (25%), Gaps = 39/424 (9%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + + + K + + Y + + V Sbjct: 365 AAVSTGSNADKGDHTDSGENKPSEKIEGKTYGHVLLNEESYAKAYVTTLNKMGVSVEVTE 424 Query: 123 N-PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + + + I + T + S +S Q Sbjct: 425 GLALTIHHNAFYNDKDISEDEVVRGLKEGDVINPADYAWNKDFLTYAGSSQDSIVISSQT 484 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV----SCN 237 + + G +S + + + + Sbjct: 485 ENRVDLYYNAIAEEKSEDEVTITGAIPRSYFRSVKADDNTTPGKVFPTKTAEWVDEANGI 544 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + + +Y P+ +DSS K A + + D V R+ Sbjct: 545 GKINFTIYGNPIRRGSDVILVIDSSGSMEGEKWSTAKTAAKGFIDNLYQNKDGVVSDDRI 604 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFA-----------------IDENEMGSTAIN 340 F+ + P + G ++ K + G T N Sbjct: 605 AIVDFDSSAKAYPGTNSGSETFLKVDDKITIKNKTYSAKDYLKSYVLDSQMKDTGGTDYN 664 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-------IAICNKA 393 A+QTA I + + YIV ++DGE + + Sbjct: 665 KALQTAQSVINNRRDSSRPA---------YIVFMSDGEPNGYWDWLTYRYYDGQKYATEL 715 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 KS G+ I ++ ++ T K L++ + T +L I+ D I + I Sbjct: 716 KSDGVTIYSLGLNIGSTNFNKFIVPLASDPTSTYAKNIVKTSDLVGIY-DAIASSIKIAG 774 Query: 454 IRIT 457 T Sbjct: 775 TEAT 778 >gi|260841558|ref|XP_002613979.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] gi|229299369|gb|EEN69988.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] Length = 1796 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 37/290 (12%), Positives = 80/290 (27%), Gaps = 40/290 (13%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + A + +++ I +++D S S+ + +E + F Sbjct: 25 NLAQVASTFQNQVNKYQDSRADIVFMLDSSGSVGENNFQTEITFVENFL----------- 73 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + + +++ + P + + R Sbjct: 74 -SQLTISPQASRVAIVSFDDAARTHIDYIKNPKNKCSFLRELKTVKYTGGSTNAEDAFRL 132 Query: 277 ALASV--IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE- 333 A + + V T F ++ P S +T N Sbjct: 133 AQELLRPKSAFANNQPVKQTEITFVENFLSQLTISPQASRVAVVSFDNHARTHIDYINSP 192 Query: 334 ----------------MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 GST D + A + + + ++ K+ +V LTDG Sbjct: 193 KNKCSFLRELKAVKYTGGSTNAEDGFRLAQELLRPQSAFTNYQP-----VKQVVVYLTDG 247 Query: 378 ENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +++ + N KS I +I + E ++C+SP Sbjct: 248 ---KPDKDPVGRANNLKSVYNAEIYSIGVDPYSKRYETDGVTSADCSSPG 294 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 66/222 (29%), Gaps = 49/222 (22%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + + A + + + TV++G F + + Sbjct: 97 SMRATDVQPSRLVAAQTAAKAFLTEL------PRTVKVGLVAFAGSAQVAQIPT-----V 145 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK---------- 369 R + + TAI + + + T+ ++ M++ E ++ Sbjct: 146 NREDLVSAIDRFQLQRGTAIGNGIVMSLATLFPDAGIDLQSMQSGRERQRGFAIDQEKKE 205 Query: 370 ---------------YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 I+LLTDG+ T + + A +G+R+ T+ + Sbjct: 206 AKEFTPVAPGSYNSAAIILLTDGQRTTG-VDSLDAAKLAADRGVRVYTVGIGTVDGETIG 264 Query: 413 --------EKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 L A+ +F A + +L K++ Sbjct: 265 FEGWSMRVRLDEETLKGIARATQAEYFYAGTATDLKKVYETL 306 >gi|14248643|gb|AAK57607.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|315615538|gb|EFU96170.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 575 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 93/330 (28%), Gaps = 30/330 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + NP + + + T S +++ ++ Sbjct: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 142 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 256 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 257 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QATKGFIKGG 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 312 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ ++ E K+ + Sbjct: 365 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQ 392 >gi|31542335|ref|NP_037051.2| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 1 [Rattus norvegicus] gi|11055592|gb|AAG28164.1|AF286488_1 voltage-gated calcium channel alpha2/delta-1 subunit [Rattus norvegicus] Length = 1091 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|6090615|gb|AAF03259.1| dihydropyridine receptor alpha 2 subunit [Homo sapiens] Length = 1110 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 77/264 (29%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I NK K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFNKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 69/203 (33%), Gaps = 26/203 (12%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + ++ + S + K ++A + + + F + +I + F Sbjct: 75 IIDKSGSMAPSYNNGEDKMLNAKEAAKGFVDLMDLTKHR----VAIVDFSSSNMIGNLPF 130 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + T K + N GSTA DA+ +A + + + IV Sbjct: 131 T-----TNPTEAKNYIDTINANGSTATGDAIDSAIALLANHRPEAQP----------VIV 175 Query: 373 LLTDGENTQDNEEGIAICNK----AKSQGIRIMTIAF--SVNKTQQEKARYFLSNCA-SP 425 ++TDG+ TQ + + + AK GI TIA S + L A + Sbjct: 176 IMTDGDATQPSTDPYGYAKQKALLAKDNGIIFYTIALLKSTDDPVTSGPNILLKEMATTS 235 Query: 426 NSFFEANSTHELNKIFRDRIGNE 448 + + L++I+ + Sbjct: 236 DHHHFVLGSTGLSQIYAAIVKEI 258 >gi|1705853|sp|P54290|CA2D1_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|203955|gb|AAA41088.1| dihydropyridine-sesitive L-type calcium channel alpha-2 subunit [Rattus norvegicus] Length = 1091 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 69/216 (31%), Gaps = 23/216 (10%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 D +P + Y VD S + L+R +++ ++ + Sbjct: 223 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLT-LKLIRTSVSEMLET 281 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDA 342 + +D + FN F V + ++K + G T Sbjct: 282 L------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKG 335 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIM 401 A++ +++ N + K I+L TDG E I K + +R+ Sbjct: 336 FTFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEERAQEIFAKYNKDKKVRVF 384 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + ++ C + ++E S + Sbjct: 385 T--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 418 >gi|262527579|sp|P54289|CA2D1_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor Length = 1103 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 77/264 (29%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I NK K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFNKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|149175889|ref|ZP_01854507.1| hypothetical protein PM8797T_24761 [Planctomyces maris DSM 8797] gi|148845336|gb|EDL59681.1| hypothetical protein PM8797T_24761 [Planctomyces maris DSM 8797] Length = 157 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 18/159 (11%), Positives = 47/159 (29%), Gaps = 8/159 (5%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +K G + +AL++P+++ + +DV ++ + + A Sbjct: 6 KRKIQKSYSERRGVAAVESALVVPLLVMIAMGTMDVSQYVNV---AQLVNDASYEGARRA 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 +++ S ++ + + + L N D +I + Sbjct: 63 SQNNVKNQSEVEQSVLNYFTSQFPNQASGEIGSALTVNVRDYLEANIPNGNLETVPSGS- 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 +S + N + G+ I+T Sbjct: 122 ----AVSVQVIFQFNSVRWLSGFAGLNGVSIETTTVMRR 156 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 52/192 (27%), Gaps = 30/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + K Sbjct: 117 SRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT-----KNAL 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 TA +A+ TA I + + IVL +DG+ T Sbjct: 166 DKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNP 222 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E + A S + + A Sbjct: 223 KGAYTAARTAKDQGVPISTISFGTPYGFVEIDDQRQPVPVDDETMKKVAQLSGGNSYNAA 282 Query: 433 STHELNKIFRDR 444 + EL ++ Sbjct: 283 TLAELRAVYSSL 294 >gi|320334211|ref|YP_004170922.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319755500|gb|ADV67257.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 609 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 55/181 (30%), Gaps = 18/181 (9%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ + +D + +T + ++D V + + +K G T + Sbjct: 58 ALKAAADFVDRLTETDVLSIVVYDDDVDT---LLDAQPVRDKAAIKDLLKGVRAGGITNL 114 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQ 396 + + + + + ++LLTDG+ + + A Sbjct: 115 SGGWLRGCELVAGARRADAVNR---------VLLLTDGQANHGVTDTGVLIKTAASKAEA 165 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 G+ T+ F + + AS +F+ S + +F + + Sbjct: 166 GVSTTTLGFGSSFEEDLLIGM---ARASGGNFYFIQSMDDAADVFGIELDTMKAVAAQNL 222 Query: 457 T 457 T Sbjct: 223 T 223 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 25/213 (11%), Positives = 58/213 (27%), Gaps = 15/213 (7%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P + + + + A + + R + D D R+G F+ Sbjct: 17 WGPSFVLAASGDAGDIMFVLDGSGSISADDFVSAKSFISRVVDAFDIAADFTRVGVVQFS 76 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + ++ + G T + + +T + + Sbjct: 77 SFFTEEFPL--DRYSDKASLKQAIGNIPQRGGGTLLGQVINYLVNTSFTEAKGARPLSDG 134 Query: 364 NLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + VL+TDG + + ++ GI +I R L Sbjct: 135 IP---RIAVLMTDGSAHDNPTTVLAPAIDALRASGIIAFSIGVG-----PSVNRDQLEAV 186 Query: 423 ASP-NSFFEA---NSTHELNKIFRDRIGNEIFE 451 A + F + ++ + +R+ I Sbjct: 187 AGDTDRVFLVGAYSVIDDIRDLLVERVREIIEL 219 >gi|161086906|ref|NP_001104318.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Rattus norvegicus] gi|27450704|gb|AAO14652.1|AF486276_1 calcium channel alpha-2 delta-1 subunit isoform e [Rattus norvegicus] Length = 1084 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 8/83 (9%) Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 K N + K++++++DG+ + + I N+ K GI+I T+A + L N Sbjct: 550 KPNPASVKHMIIISDGDPSPPSG---TILNQYKQAGIKITTVAVGTHGPA---GSTPLQN 603 Query: 422 C--ASPNSFFEANSTHELNKIFR 442 A+ ++ A + L +IF+ Sbjct: 604 IANATGGKYYVATNPKALPRIFQ 626 >gi|253701737|ref|YP_003022926.1| hypothetical protein GM21_3141 [Geobacter sp. M21] gi|251776587|gb|ACT19168.1| conserved hypothetical protein [Geobacter sp. M21] Length = 383 Score = 53.1 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/234 (9%), Positives = 58/234 (24%), Gaps = 23/234 (9%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI---------------- 69 +++ V L + G+ +D+ + L+ +A+ A +T + L Sbjct: 6 VTIMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKKRLLLQAQHSPGKLAQ 65 Query: 70 ------QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 QS + + + + N + N ++ + Sbjct: 66 VLADPLQSAARSVAVDTATGKHSASALVALMNDNGNALTENNDITVGFWNMSSRSYTPGG 125 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY-HKEHGVSIQWV 182 +A QV + L + S + V+ ++I Sbjct: 126 TPVNAMQVRARRTAESSSVGLGSLGTFVAKISGTASFGSTPVAVAALVPGTRSNIAICAA 185 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + ++ R S + + ++ M Sbjct: 186 ACEPSCGYPDVCSIPERRMSHLPWDPQRENSSANRYLYTSLLHPVTITNTMSDL 239 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 69/210 (32%), Gaps = 17/210 (8%) Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 S + S + E D + + V + +D+ + R Sbjct: 24 TSSKTVEAAYDYSNLSCTNEQGRDIVFVIQDTPDIQSHDPDQSRVTEVLSLMDDASSKDR 83 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G FN V + + + + + + + + + ++ ++ A D + Sbjct: 84 FGFVGFNKEVTKELALTNNIVQAKSKLNEFGKN-ISPYMANDLSKGLEKAVDELT----- 137 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 K + K IV++T G N+ NE + KA + I I TI+F Sbjct: 138 -----KKSTSNDKVIVIMTVG-NSIYNEVSKKLAAKAYEEDITIHTISFGDPLYADAP-- 189 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDR 444 FL+ A + ++ + + L + Sbjct: 190 -FLTEIAKLTGGNYTHSPNAAFLKDVLSKL 218 >gi|326798073|ref|YP_004315892.1| von Willebrand factor type A [Sphingobacterium sp. 21] gi|326548837|gb|ADZ77222.1| von Willebrand factor type A [Sphingobacterium sp. 21] Length = 622 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/292 (14%), Positives = 89/292 (30%), Gaps = 24/292 (8%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + K E + S + + + +N F ++ Sbjct: 156 VSPKKEPLSTFAVDVDGASYSNIRRFINNGQLPPKDAIRLEEMINYFQYNLPAP-QNNEP 214 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + +P+ +L P + +D S + LV+ + Sbjct: 215 VAIHTELSQAPWNPHHRLLRIALKAKAIDAAKLPPANLVFLIDVSGSMDGPNRLPLVKSS 274 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L ++ ++K D R+ + + W K+ +K + + GST Sbjct: 275 LKMLVDQLRKED------RVAIVTYAGTARIKLAPVWANEKMR---IKNAIDELDAGGST 325 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAK 394 A ++ AYD + + NN I+L +DG+ NE+ + K + Sbjct: 326 AGGAGLKMAYDL----AREHFKKDGNNR-----IILASDGDFNVGPSSNEDMETLIEKER 376 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 G+ + + F + + K + ++ ++ E K G Sbjct: 377 QSGVSLSVLGFGMVNLKDSKMELLANK--GHGNYAYIDNLMEAKKAMISEFG 426 >gi|182438638|ref|YP_001826357.1| hypothetical protein SGR_4845 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467154|dbj|BAG21674.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 578 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 60/202 (29%), Gaps = 24/202 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + VD S + LVR +LA + ++ D+V+ F+D Sbjct: 210 STAERPPAALTFVVDISGSMAETGRLDLVRKSLAVLTDELRDDDSVS------LVTFSDA 263 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + R +K + ST + + Y+ + + Sbjct: 264 AETRLPMT--RLQGNRNRIKDAVDEMRPEQSTNVEAGITRGYEESVEGHRKGATNR---- 317 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 +VLL+D + E I + + GI + + + R Sbjct: 318 -----VVLLSDALANTGDTEADGILERIDSTRREYGITLFGVGVGSDYGDAFMERLTNK- 371 Query: 422 CASPNSFFEANSTHELNKIFRD 443 + + K+F D Sbjct: 372 --GDGNTTYVGDEAQARKVFVD 391 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 71/215 (33%), Gaps = 18/215 (8%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 G ++ ++ F +S + +D + SVI I+ + VR+ Sbjct: 1823 GFPGCTNEIGEINDIPTDIAFALDASASFYEEGFQQEKDFIKSVIDKIELSSSG---VRV 1879 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G ++D F + K + K + T I+ ++ A + + Sbjct: 1880 GVLTYSDEAKIRIRFDYSFDKED--VKKAIDNIPYDSMGTRIDLGLEAAKELFLE----- 1932 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + +KK ++LLTDG+ T D ++ + + G+ I I + + Sbjct: 1933 --KSGGRGSSKKVLILLTDGQQTYIPDAKDPVDYAKELAEYGVDIFAIGIGDEINKVDLE 1990 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 P F ++ + L I + Sbjct: 1991 DLISK----PQHIFLSDDINSLITDLSKDISTALS 2021 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 34/106 (32%), Gaps = 18/106 (16%) Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-------GIRIMTIAFSVN 408 +++ + +K IV+L DG + + AK + G+ I TI Sbjct: 1346 FTSQGARSDKDIEKAIVILGDGYISGGGNRSRDLIESAKKEAAKLRELGVLIFTIGVGAE 1405 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 L N AS +++ K + IG + Sbjct: 1406 PN-----SLLLQNFASKKTYYITV------KDYGQLIGKIGALKTS 1440 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 53/168 (31%), Gaps = 13/168 (7%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R A + + +F L +T I+ A++ Sbjct: 31 VDSFSISQQNARFAALVYGSNASVEFNFVRYDSALEIKQAIQSLSYLKS--NTRIDKALE 88 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S ++++ Y+ DG T+ + ++ K G++I+ I Sbjct: 89 VAKSDLFSLQ----GKVRSRRPMILYVFF--DGTVTRSMSDLESVVQPLKDYGVKIIAIG 142 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 E RY L + N+ F S EL + + E+ Sbjct: 143 VG-----PEVNRYQLKKISEDNAIFSGKSFKELAPLLYSIVEQSCSEK 185 >gi|119899154|ref|YP_934367.1| hypothetical protein azo2864 [Azoarcus sp. BH72] gi|119671567|emb|CAL95480.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 339 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 53/181 (29%), Gaps = 36/181 (19%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR+G F + + L TAI + A + Sbjct: 125 DVRIGIVSFAGTATVVQAPTDNREDL-----LGAIDRLQLARHTAIGSGIIVALSALFPE 179 Query: 354 NED-EVHRMKNNLEAKKY-----------------IVLLTDGENTQDNEEGIAICNKAKS 395 M ++ E + ++LLTDG T E + A Sbjct: 180 ESFDPDPTMMSSAEPGRAPNAPREEVAPGSNGSAAVILLTDGRRTSG-PEPVDAARMAAV 238 Query: 396 QGIRIMTIAFSVNKTQQEK----------ARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +GIR+ T+ F + + L A + +F A + EL +I+ D Sbjct: 239 RGIRVFTVGFGTAEGATIQNEGWSVFMRFDEGTLRAIADLTQAKYFHAGTAAELQQIYHD 298 Query: 444 R 444 Sbjct: 299 L 299 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 53/167 (31%), Gaps = 16/167 (9%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N R+G F+ + L+ + + E+G TAI DA A Sbjct: 129 NKRPNDRIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGIIRELG-TAIGDATAAATKR 187 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + I+LLTDG+N Q E + A + G +I TI + + Sbjct: 188 LQ----------MSKDSKSRIIILLTDGDNNQGEIEPVPAAQLAAAIGAKIYTIGLGIEE 237 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 A F + I + ++ R+ Sbjct: 238 PSHLPAFDV-----DTGKFKHGPGGELIPTIMLQPANYSVLGQMSRL 279 >gi|157819693|ref|NP_001101483.1| matrilin-3 [Rattus norvegicus] gi|149050910|gb|EDM03083.1| matrilin 3 (predicted) [Rattus norvegicus] Length = 463 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 76/234 (32%), Gaps = 28/234 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS-----SSLRHVIKKKHLVRDA 277 G R SP S + Y G + + +D SS + V+ Sbjct: 41 GTRVPGGSPGHFSARATSTRAPYSGVRGSGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTF 100 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ +I ++ D R+ + V + + + + + A T Sbjct: 101 VSRIIDTL---DIGATDTRVAVVNYASTVKIEFQL--NTYSNKQALKQAVARITPLSTGT 155 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+QTA + + E K +++TDG + ++ + +A++ G Sbjct: 156 MSGLAIQTAMEEAFTV---EAGARGPTSNIPKVAIIVTDG---RPQDQVNEVAARARASG 209 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 I + + + + L AS + F + +L+ F++ Sbjct: 210 IELYAVGV----DRADMES--LKMMASKPLEDHVFYVETYGVIEKLSARFQETF 257 >gi|254776723|ref|ZP_05218239.1| hypothetical protein MaviaA2_18931 [Mycobacterium avium subsp. avium ATCC 25291] Length = 309 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 62/203 (30%), Gaps = 34/203 (16%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + A + + + +G F + I + K Sbjct: 92 NRLKAAEQAASQFASQLTPG------INLGLVGFAGTPYLLVPPTPQHQATIDALKKLDF 145 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI- 387 D STA +A+ TA I ++ + IVLL+DG + + Sbjct: 146 AD-----STATGEAIFTALHAISATA-----VAGGDTPPPARIVLLSDGGENKPSNPSDP 195 Query: 388 -----AICNKAKSQGIRIMTIAFSVNKTQQEKA---------RYFLSNCA--SPNSFFEA 431 AK +G+ I TI F + E + A S + A Sbjct: 196 HDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVSTDQMKMVAKLSGGQSYTA 255 Query: 432 NSTHELNKIFRDRIGNEIFERVI 454 + EL K + + I NEI R + Sbjct: 256 TNLGELQKSY-NAIENEIGYRTV 277 >gi|14248625|gb|AAK57598.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|161086904|ref|NP_001104317.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 3 [Rattus norvegicus] gi|17864880|gb|AAL47093.1|AF400662_1 L-type calcium channel alpha2/delta subunit [Rattus norvegicus] Length = 1079 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|14248705|gb|AAK57638.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 671 FEVVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVHTEFTLRNFNSA--KDMKKAVAHM 725 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 726 KYMGKGSMTGLALKHMFERSFTQGEGARPFSTRVP---RAAIVFTDGRAQDD---VSEWA 779 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E+++ + Sbjct: 780 SKAKANGITMYAVGVG------KAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKG 833 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 834 ICEALEDSDGR 844 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|332828898|gb|EGK01581.1| hypothetical protein HMPREF9455_02113 [Dysgonomonas gadei ATCC BAA-286] Length = 609 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/284 (12%), Positives = 79/284 (27%), Gaps = 24/284 (8%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + S + + +N F + Sbjct: 155 STFSVDVDAASYSNMRRFINRGELPYKDAVRIEELINYFSYDY-AEPAGNDPVRITTEVG 213 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + L P+ + +D S + LV+ +L ++ ++ Sbjct: 214 SCPWNSQNRLVKVGLKAKSLASDNLPASNFVFLIDVSGSMSGPTRLDLVKSSLKLLVNNL 273 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +K D R+ + + G + +K + + GSTA +Q Sbjct: 274 RKKD------RVAIVVYASSTGEVLPSTSG---ENKQKIKEALNNLSAGGSTAGGAGIQL 324 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 AY + + NN I+L TDG+ + + G+ + Sbjct: 325 AYKI----AKQNFIKGGNNR-----IILCTDGDFNVGVSSNEGLQRLIENERKTGVFLSI 375 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + + + K + A + ++ E NK+ + G Sbjct: 376 LGYGMGNYKDSKMQTLAQ--AGNGNHAYIDNLQEANKVLVNEFG 417 >gi|260578579|ref|ZP_05846489.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258603294|gb|EEW16561.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 646 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 58/188 (30%), Gaps = 23/188 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW-----GV 316 + + +DA + D + G N D V Sbjct: 53 TPDAGGQTRMAAAKDAAQ-LFSVAVPSDAELGFMVYGTEVGNSPEERDAGCKDVKTLLPV 111 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 K T + G T + A++ A + + + ++ IVL++D Sbjct: 112 EKGNVTKIPAEVGKVEASGHTPMGPALRQAAEEL-------------PKDGERSIVLVSD 158 Query: 377 GENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 GE+T + K +GI I T+ F V+ +++ A + +A T Sbjct: 159 GEDTCAPPPVCEVAKDLKKEGIDLTINTVGFLVDSKARKELECIAE--AGGGEYMDAKDT 216 Query: 435 HELNKIFR 442 L + Sbjct: 217 VSLADSMK 224 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 51/417 (12%), Positives = 102/417 (24%), Gaps = 65/417 (15%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E A + Y + + + +V + T + V Sbjct: 437 ETENVPEADKDKNYTFNAYVNYEGTDSDKDYTLYTGKYDVYNTATGTKDGETRTTDSNGV 496 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + S R + + + V + D S + Sbjct: 497 ITLKAGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSANGGQVKVTQEKD-SATTEP 555 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + Y N V+ +S P D Sbjct: 556 VSVGEVPRITVTNTVVTAPRYRKYIKANND----GTYDLSLNVTGTQSGSSQTTVSPADI 611 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN--VNDTVRMGATFFNDRVISD 309 + + S+ + + + A+ S+ + + +N + +RM F+ V + Sbjct: 612 VVVFDTSGSMSNPMGHNSRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFSTTVGNV 671 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +F+ + + G T + KK Sbjct: 672 SNFTDNAMD-----IVSAVNGLRADGGTNWEA----------ALKAANAKLTSGRKGVKK 716 Query: 370 YIVLLTDGENT--------------------------------------QDNEEGIAICN 391 YIV ++DG+ T A+ Sbjct: 717 YIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAE 776 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + + ++ S + T+ + S++ A ST ELNK F D IG Sbjct: 777 ANRRGDATLFSVGVSSDPTKMRGFADQ-----TKGSYYSATSTDELNKAFADIIGQI 828 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 43/146 (29%), Gaps = 21/146 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+D + ++ G T I ++ Sbjct: 455 LVVFDDAANWVLPLQRLPSVV---EIERALGSFGIGGGTNIRPGIEQ-----------AA 500 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + + K+++LLTDG + + IA + ++ G+ I T+A + Sbjct: 501 QALASADAKVKHVILLTDGIAESNYSDLIA---QMRAAGVTISTVAIGEDANPNLVDV-- 555 Query: 419 LSNCASPNSFFEANSTHELNKIFRDR 444 A + ++ +IF Sbjct: 556 --ANAGGGRSYRVTRIEDVPRIFLQE 579 >gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata obscuriglobus UQM 2246] Length = 821 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 98/360 (27%), Gaps = 29/360 (8%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + + +K + D + L++ G + ++ + + Sbjct: 123 EYLGNSLMKLSVFPIPPKGDQKIKLAYKFVAPKDGSVVEYVYPLKTDGKATRTLEEFSVS 182 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T+ + ++ S ++ R + + + G + Sbjct: 183 LTIKSRHAVQNVYSPTHAVNTVRKSDKEVSVTFERKQALLDKDFQLFYGHGDKDIGLSPL 242 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + + + +D+SS I K + A+ + Sbjct: 243 VYKPIQTEDGYFMFLISPQVEAEKKRVARDLVLVLDTSSSMSDI-KMQQAKKAVKFCLSQ 301 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPS--FSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ D R G F+ V S + L + + G TAI A Sbjct: 302 LQPED------RFGVVRFSTTVTKFRSELVAANTDYLD--LATKWIDGLKTSGGTAIWPA 353 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGIAICNK--AKSQG- 397 + A + + + +V TDG+ T D I AK+ G Sbjct: 354 LNDALAM-------------RSSDPSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGN 400 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 RI T + + S A EA +I N + V T Sbjct: 401 TRIFTFGVGDDVNAAMLDQLADSTRAVSTYVREAEDIEVKVSGLYAKISNPVLTDVQLAT 460 >gi|54112390|ref|NP_000713.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Homo sapiens] gi|109658756|gb|AAI17469.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|109659118|gb|AAI17471.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|119597396|gb|EAW76990.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] Length = 1091 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 77/264 (29%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I NK K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFNKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|68535223|ref|YP_249928.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] gi|68262822|emb|CAI36310.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] Length = 646 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 58/188 (30%), Gaps = 23/188 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW-----GV 316 + + +DA + D + G N D V Sbjct: 53 TPDAGGQTRMAAAKDAAQ-LFSVAVPSDAELGFMVYGTEVGNSPEERDAGCKDVKTLLPV 111 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 K T + G T + A++ A + + + ++ IVL++D Sbjct: 112 EKGNVTKIPAEVGKVEASGHTPMGPALRQAAEEL-------------PKDGERSIVLVSD 158 Query: 377 GENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 GE+T + K +GI I T+ F V+ +++ A + +A T Sbjct: 159 GEDTCAPPPVCEVAKDLKKEGIDLTINTVGFLVDSKARKELECIAE--AGGGEYMDAKDT 216 Query: 435 HELNKIFR 442 L + Sbjct: 217 VSLADSMK 224 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 54/199 (27%), Gaps = 22/199 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFND----RVISDPSFSWGVHKLIRTIVKTF 327 A ++ I+K+ + R G + V S + + + + Sbjct: 284 SNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATHPKIWVKVSDPDSSNADWVTKQLNEIN 343 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T A+Q Y + D++ N I+L+TDG + + I Sbjct: 344 YEDHKLKSGTNTKKALQAVYSMMS--WPDDIPPEGWNRTRH-VIILMTDGLHNMG-GDPI 399 Query: 388 AICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + ++ + + R + Q + F+ Sbjct: 400 TVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKD-NEQHVFKVKDM 458 Query: 435 HELNKIFRDRIGNEIFERV 453 L +F I + Sbjct: 459 ENLEDVFYQMIDESQSLSL 477 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 54/199 (27%), Gaps = 22/199 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTF 327 A ++ I+K+ + R G + V S + + + + Sbjct: 284 SNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSDPDSSNADWVTKQLNEIN 343 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T A+Q Y + D+V N I+L+TDG + + I Sbjct: 344 YEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VIILMTDGLHNMG-GDPI 399 Query: 388 AICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + ++ + + R + Q + F+ Sbjct: 400 TVIDEIRDLLYIGKDHKNPREDYLDVYVFGVGPLVNQVNINALASKKD-NEQHVFKVKDM 458 Query: 435 HELNKIFRDRIGNEIFERV 453 L +F I + Sbjct: 459 ENLEDVFYQMIDESQSLSL 477 >gi|302551540|ref|ZP_07303882.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] gi|302469158|gb|EFL32251.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] Length = 518 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 62/199 (31%), Gaps = 24/199 (12%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 +D S + L +++L+ + ++ D+V F+D + + Sbjct: 172 FVIDVSGSMSEPGRLDLAQESLSVMTDRLRDDDSVA------IVTFSDEAETVLPMT--R 223 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R ++ D ST + ++T Y+T + + +VL++D Sbjct: 224 LDGNRDEIQDVISDLATQDSTNLGAGVETGYETAVEGLREGATNR---------VVLVSD 274 Query: 377 GENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + + I + + GI + + + R + Sbjct: 275 ALANTGDTDADTILERIAGERREHGITLFGVGVGSDYGDALMERLADR---GDGHTVYVS 331 Query: 433 STHELNKIFRDRIGNEIFE 451 E +++F + + I Sbjct: 332 GPDEAHEVFCEELPRNIDL 350 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 52/192 (27%), Gaps = 30/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + K Sbjct: 117 NRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNRDAT-----KNAL 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 TA +A+ TA I + + IVL +DG+ T Sbjct: 166 DKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNP 222 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E L A S + + A Sbjct: 223 KGAFTAARTAKDQGVPISTISFGTPYGFVEINGQRQPVPVDDETLKKVAQLSGGNAYNAA 282 Query: 433 STHELNKIFRDR 444 + EL ++ Sbjct: 283 TLAELKSVYASL 294 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 60/194 (30%), Gaps = 30/194 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + +A+ + G TFF + V+ + V + Sbjct: 94 SMNNPFGRATRYDGAMEAVTAFTS-------YRQGDAFGLTFFGNEVLHWCPLTTDVSAI 146 Query: 320 IRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 G T I A++ +I E + IVL+TDG+ Sbjct: 147 NCATPFMRPGQLPPWFGGTLIAKALRACKAELIKRPEG-----------DRMIVLITDGD 195 Query: 379 N-TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-------SPNSFFE 430 + N + + K++GI + + ++ + N + + FE Sbjct: 196 SQDFANGADAEVAEELKAEGITVFAVVIG---NDRQFQNPIIRNGSVQTVTARTGGESFE 252 Query: 431 ANSTHELNKIFRDR 444 A + L +F+ Sbjct: 253 AGDPNALATVFKRI 266 >gi|304382531|ref|ZP_07365026.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] gi|304336362|gb|EFM02603.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] Length = 340 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 49/185 (26%), Gaps = 35/185 (18%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 ++G F + + ++ T I A+ A + Sbjct: 125 NFTNDKVGLIVFAGDAFIQLPITSDFVSA-KMFLQNIDPSLIATQGTDIAKAIDMAMKSF 183 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--N 408 K I+++TDGE+ + AK++G + + Sbjct: 184 TQQ-----------ENVGKAIIVITDGEDHEGGAL--EAAKTAKAKGYNVFILGVGTAKG 230 Query: 409 KTQQEKARYFLSN--------------C-----ASPNSFFEANSTHELNKIFRDRIGNEI 449 +L + C A ++ ++T+E KI + + Sbjct: 231 APIPTGDGSYLKDNTGQTVITALNEDMCKEIAAAGSGTYIHVDNTNEAQKILNNELTKMQ 290 Query: 450 FERVI 454 + Sbjct: 291 KGELT 295 >gi|295093780|emb|CBK82871.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 549 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 55/165 (33%), Gaps = 13/165 (7%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +ND +G ++ V + + R+ + + G TA DA+ Sbjct: 398 TNGAQYINDNNYVGLVSYSSSVTVEVPIAQ-FDLNQRSYFQGSVNNLMASGGTASYDAVV 456 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A I + + +AK + LL+DG I + + I + TI Sbjct: 457 VAMKMITDAKA-------EHPDAKCMLFLLSDGYANVGYS-MDEITSALRQSNIPVYTIG 508 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + E + N A+ + + ++ + +++ Sbjct: 509 YGGDADTDELTKLSGINEAASIN----ADSDDIIYKIKSLFNSQL 549 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/142 (21%), Positives = 43/142 (30%), Gaps = 19/142 (13%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 T TA +A+ TA I + IVL +DG+ Sbjct: 148 TDHTATDNALDHLQLAERTATGEAIFTALQAIDT---LAGVVGGGGTPPPARIVLESDGK 204 Query: 379 NTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA- 423 T + AK QG+ I TI+F + L A Sbjct: 205 QTVPTDLNDPRGAFTAARLAKEQGVPISTISFGTTHGAIDLNGSHIPVPVDDESLRRIAE 264 Query: 424 -SPNSFFEANSTHELNKIFRDR 444 S SFF A S EL +++ Sbjct: 265 LSGGSFFTATSADELQASYQNL 286 >gi|14248619|gb|AAK57595.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248621|gb|AAK57596.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|242078053|ref|XP_002443795.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] gi|241940145|gb|EES13290.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] Length = 423 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 61/215 (28%), Gaps = 49/215 (22%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 KK V+ A+ +I ++ D R+ F+ + + + K Sbjct: 140 KKMERVKRAMGFLIDNLGSDD------RLSVVAFSTDARRIIRLT-RMSDDGKAAAKRAV 192 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------ 382 GST I + A + ++LL+DG++ Q Sbjct: 193 ESLAASGSTNIRGGLDVAAMVLDGRRHKNAVAS---------VILLSDGQDNQSMHHEYL 243 Query: 383 NEEGIAICNKAKSQG--------------------IRIMTIAFSVNKTQQEKARYFLSNC 422 + + A S+G + + T F ++ + Sbjct: 244 PTSWVPKHSPAFSKGGYDVLVPPSFQRTAGGDHRCVTVHTFGFGIDHDAAA-----MHYI 298 Query: 423 --ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + ++F + + F IG + V + Sbjct: 299 SEVTGSTFSFIENHAVIQDAFARCIGGLLSVAVQK 333 >gi|89068992|ref|ZP_01156373.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] gi|89045361|gb|EAR51426.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] Length = 669 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/290 (12%), Positives = 79/290 (27%), Gaps = 22/290 (7%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + + S I + T S ++ + + + + Sbjct: 204 PAGTEAFPDVEPSSLKIAAEEPVSTFSIDVDTASWAVVRSSL-TRGQLPPRDAVRIEEMV 262 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N F + +G+ D + + L L P P L+ VD+ Sbjct: 263 NYFPYDYPAPAAGEAPFRATLGVMDSPWAEGRQLVHIGLQGALPPVEARPPLNLVFLVDT 322 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S K L++ + ++ ++ D + + + + R Sbjct: 323 SGSMDAPDKLGLLKQSFRLMLSELRPQD------EIAIVAYAGSAGEVLAPTP---AGER 373 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + GST ++ AY T + ED I+L TDG+ Sbjct: 374 ATILAALERLAAGGSTNGAGGLEQAYATAEAMTEDGEVSR---------ILLATDGDFNV 424 Query: 382 DNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 + A+ + + G + + F + N ++ Sbjct: 425 GLSDPSALEDFIADKRDSGTYLSVLGFGRGNLDDATMQALAQNGNGTAAY 474 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + +++ + ++G F + Sbjct: 98 DISNSMLAEDVSPSRLEKSKLLVENLMNKFSE-------DKIGLIVFAGDAFVQLPITGD 150 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + T I A+Q + ++ N + K I+L+T Sbjct: 151 YVSA-KMFLDNINPSLIGTQGTDIGKALQLSINSFTP-----------NSKVGKAIILIT 198 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSN 421 DGE+ + A+ +A+++GI++ + T L + Sbjct: 199 DGEDNEGGA--EAMAKQARNKGIKVFILGIGSKEGSTIPMPDGTELKD 244 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 21/162 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR G ++D++IS + + + + + G T A+ Sbjct: 24 GPDRVRFGVVQYSDKIISQFFLTQ--YASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVF 81 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ + +Y++++TDG++T + G+ I I Sbjct: 82 QNTARI---------DVARYLIVITDGQSTDP---VAEAAQGLRDIGVNIYAIGV----- 124 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 ++ L AS FF L I ++ I + Sbjct: 125 -RDANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDICSSE 164 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 48/169 (28%), Gaps = 17/169 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + D +++G F+ + + + + + T A+ Sbjct: 198 MVNQSNIGADEIQIGLLQFSSNPQEEFRL--NRYSSKVDMCRAILSVQQMSDGTHTGKAL 255 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +Y++++TDG +QDN + + I I I Sbjct: 256 NFTL-------PFFDSSRGGRPRVHQYLIVITDGV-SQDNVAPP--AKALRDRNIIIFAI 305 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + L + F+ + L + ++ + + Sbjct: 306 GVG-----NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 349 >gi|14248623|gb|AAK57597.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|258652510|ref|YP_003201666.1| hypothetical protein Namu_2300 [Nakamurella multipartita DSM 44233] gi|258555735|gb|ACV78677.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 320 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 63/193 (32%), Gaps = 24/193 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S + + V +G F S + RT+ K Sbjct: 106 NRLDAAKEAAQSFVDDLTPG------VNLGIVSFAGIATVLVSPT-----TDRTVAKQAI 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 TA +A+ ++ TI ++ + IVL+TDG+ T E A Sbjct: 155 DGLTLDERTATGEAIISSLQTIELFSKTLPPDGTDTGPPPARIVLMTDGKRTVGRTEQDA 214 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEANSTHEL 437 +A G+ + IAF + + A S F +A S EL Sbjct: 215 A-QRAADAGVPVSVIAFGTDNGSITVNDEVIPVPLDTEAMQQIAQISGGDFHQAASAEEL 273 Query: 438 NKIFRDRIGNEIF 450 I+ G +I Sbjct: 274 KSIYAQL-GEQIG 285 >gi|145297003|ref|YP_001139824.1| hypothetical protein cgR_2902 [Corynebacterium glutamicum R] gi|140846923|dbj|BAF55922.1| hypothetical protein [Corynebacterium glutamicum R] Length = 230 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/197 (10%), Positives = 52/197 (26%), Gaps = 17/197 (8%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + + + + + + + D + Sbjct: 5 DSSGSMVTPDAGGQSRSDAANQFIDELAGTFDLGLVTYGGNTGETPEDYEAGCQDITVVR 64 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G +K G T I ++++ A + I+L+ Sbjct: 65 GPTNGQAEQLKQHIDGLQPRGYTPIGESLRKAVAELPEGGSGT-------------ILLV 111 Query: 375 TDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 +DG T + + QG+ I T+ F+++++ + A ++ +A+ Sbjct: 112 SDGIATCTPPPVCEVAAELADQGVDLVINTVGFTLDESARAVLECIAQ--AGNGTYADAS 169 Query: 433 STHELNKIFRDRIGNEI 449 L + Sbjct: 170 DADSLVAELKQAATRTA 186 >gi|71896057|ref|NP_001025613.1| matrilin 1, cartilage matrix protein [Xenopus (Silurana) tropicalis] gi|60552391|gb|AAH91071.1| MGC108367 protein [Xenopus (Silurana) tropicalis] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/298 (11%), Positives = 80/298 (26%), Gaps = 33/298 (11%) Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + + + S S+++ + + S + Sbjct: 176 GVGRVDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQEAF-----CVTADLCSTGDHD 230 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + Y +C + + + + Sbjct: 231 CEQICVSTP-GSYTCACRDGFTLNEDGKTCNACGASAVDLVFLIDGSKSVRPENFELVKQ 289 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + + ++ +D + +G ++ V + G + + I T Sbjct: 290 FINQIVESMDVGDHRAHVGLVQYSSSVRQEFPL--GRYTSKKDIKSAVKKMSYMEKGTMT 347 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+Q D + + + K ++ TDG +QD A+ +AK G + Sbjct: 348 GQALQYLIDNSFAISSGGRPAV------PKVGIVFTDGR-SQDYINDAAL--RAKELGYK 398 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN----SFFEANSTHELNKIFRDRIGNEIFERV 453 + + L AS SF+ A+ K ++ IG ++ ++ Sbjct: 399 MFAVGVG------NAVEEELRMIASEPVVEHSFYTAD-----FKAMKE-IGKKLQMKI 444 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 55/165 (33%), Gaps = 20/165 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D + R+G + V ++ S HK +++ + T A+Q A Sbjct: 67 SLDVGANATRVGLVNYASTVKNEFSLK--THKAKPALLQAVKKVQPLSTGTMTGLAIQYA 124 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + E + K +++TDG + + I +A+ G+ I I Sbjct: 125 INIAFTEPEGAR---LKSPGINKVAIIVTDG---RPQDAVKDISARARESGLEIYAIGVG 178 Query: 407 VNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS S +L+K F++ Sbjct: 179 R------VDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQEAF 217 >gi|315298071|gb|EFU57340.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] Length = 581 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/332 (9%), Positives = 94/332 (28%), Gaps = 30/332 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + NP + + + T S +++ ++ Sbjct: 90 PTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLDVDTGSYANVRRFLNQ- 148 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 149 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 208 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 209 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 262 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 263 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIKGG 317 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T+ + + Sbjct: 318 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVR 370 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 371 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 400 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 48/384 (12%), Positives = 97/384 (25%), Gaps = 40/384 (10%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 ++ + + + ++ Y + T+ + A K Sbjct: 211 GEQQQLVTDIYATSTGQGELLLYENDELIHREAVELTEGTNVFTYKHAATAEGLVKYEAV 270 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V + N L+ LI + + + ++ V S Sbjct: 271 VQVGQDAIFENNKLTSVTMVQSEPHLLIVNGYDTASPIAAALGSQSIAYDVVAPKSLPNE 330 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP---- 246 + S V + + L Sbjct: 331 LSSYLQYNAIIFDNVPGHLVGEAKMSVIEQTVKNFGVGFTMVGGENSFGLGGYFKTPIET 390 Query: 247 ----------GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 PSL +D S K L ++A A + ++ D + Sbjct: 391 LLPVEMEIKGKEQLPSLGLVIVLDRSGSMQG-SKLELAKEAAARSVEMLRDEDTLGFIAF 449 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + +I S + + G T I ++ AY+ + Sbjct: 450 DDRPW---EIIETGPLSSKEEAVDT------ILSVTPGGGTEIYSSLAKAYENLAD---- 496 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +K+I+LLTDG++ N + + K GI + T+A + Sbjct: 497 -------LKLQRKHIILLTDGQSQAGNY--EDLITEGKEDGITLSTVAIGQDADANLLE- 546 Query: 417 YFLSNCASPNSFFEANSTHELNKI 440 LS+ F++ + I Sbjct: 547 -ALSDM-GSGRFYDVIDEQTIPSI 568 >gi|73990549|ref|XP_542778.2| PREDICTED: similar to alpha 3 type VI collagen isoform 4 precursor [Canis familiaris] Length = 1320 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 48/147 (32%), Gaps = 22/147 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAM 343 IK+ D V+ G ++D + + S +K D + G T +A+ Sbjct: 870 IKRFQIGPDRVQFGVVQYSDGINIQFALSQYSSMA---ELKAAIDDIQQRKGGTMTGEAL 926 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + V Y++++TDG++ E + +G+ I I Sbjct: 927 SRMAQVFVDTARSNVP---------WYLIIITDGKSEDPVAEP---AEALRGEGVIIYAI 974 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFE 430 + L A +FF Sbjct: 975 GV------KNANVMELKEIAKDKTFFT 995 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 35/128 (27%), Gaps = 12/128 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V R + + + D +++G F+ + + I + + T Sbjct: 1045 GFVKRMVNQANIGADEIQIGLLQFSSSPQEEFRL--NQYSSKADIHRAISKVVQMNDGTR 1102 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ +Y++++TDG D + + I Sbjct: 1103 TGKALTFTL-------PFFDSSRGGRPNVHQYLIVITDGVAQDDVAIP---AKALRDRNI 1152 Query: 399 RIMTIAFS 406 I I Sbjct: 1153 VIFAIGVG 1160 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 48/149 (32%), Gaps = 15/149 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I +D +D VR+G +ND + + L +++ G T Sbjct: 256 NFLYSVILGLDISSDQVRVGLAQYNDNIYPAFQL--NQYPLKSVVLEQIQNLPYRTGDTN 313 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + + ++L+TDGE N+E NK K G+ Sbjct: 314 TGSALEFIRMHYLTEAAGSRAK----DSVPQIVILVTDGE---SNDEVQEAANKLKEDGV 366 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + L AS Sbjct: 367 V------VYVVGVNVQDVQELQKIASEPF 389 >gi|301756400|ref|XP_002914037.1| PREDICTED: matrilin-2-like isoform 3 [Ailuropoda melanoleuca] Length = 957 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 56/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 81 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 138 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 139 LAIQYALNIAFSEAEGARP---LRENVLRVIMIVTDG---RPQDSVAEVAAKARDTGILI 192 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 193 FAIGVG------QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V I + R+G ++ +V ++ + + + K A + + Sbjct: 677 QFVTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHMKYMGKGSM 734 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + E + ++ TDG D +KA++ GI Sbjct: 735 TGLALKHMFERSFTQVEGARPLSTRVP---RVAIVFTDGRAQDD---VSEWASKAQANGI 788 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEI 449 + + + L AS F A E+++ + I + Sbjct: 789 TMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISEKLKKGICEAL 839 >gi|301756398|ref|XP_002914036.1| PREDICTED: matrilin-2-like isoform 2 [Ailuropoda melanoleuca] Length = 938 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 56/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 81 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 138 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 139 LAIQYALNIAFSEAEGARP---LRENVLRVIMIVTDG---RPQDSVAEVAAKARDTGILI 192 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 193 FAIGVG------QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V I + R+G ++ +V ++ + + + K A + + Sbjct: 677 QFVTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHMKYMGKGSM 734 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + E + ++ TDG D +KA++ GI Sbjct: 735 TGLALKHMFERSFTQVEGARPLSTRVP---RVAIVFTDGRAQDD---VSEWASKAQANGI 788 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEI 449 + + + L AS F A E+++ + I + Sbjct: 789 TMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISEKLKKGICEAL 839 >gi|301756396|ref|XP_002914035.1| PREDICTED: matrilin-2-like isoform 1 [Ailuropoda melanoleuca] Length = 957 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 56/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 81 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 138 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 139 LAIQYALNIAFSEAEGARP---LRENVLRVIMIVTDG---RPQDSVAEVAAKARDTGILI 192 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 193 FAIGVG------QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V I + R+G ++ +V ++ + + + K A + + Sbjct: 677 QFVTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHMKYMGKGSM 734 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + E + ++ TDG D +KA++ GI Sbjct: 735 TGLALKHMFERSFTQVEGARPLSTRVP---RVAIVFTDGRAQDD---VSEWASKAQANGI 788 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEI 449 + + + L AS F A E+++ + I + Sbjct: 789 TMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISEKLKKGICEAL 839 >gi|281350435|gb|EFB26019.1| hypothetical protein PANDA_001886 [Ailuropoda melanoleuca] Length = 942 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 56/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 81 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 138 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 139 LAIQYALNIAFSEAEGARP---LRENVLRVIMIVTDG---RPQDSVAEVAAKARDTGILI 192 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 193 FAIGVG------QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V I + R+G ++ +V ++ + + + K A + + Sbjct: 677 QFVTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHMKYMGKGSM 734 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + E + ++ TDG D +KA++ GI Sbjct: 735 TGLALKHMFERSFTQVEGARPLSTRVP---RVAIVFTDGRAQDD---VSEWASKAQANGI 788 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEI 449 + + + L AS F A E+++ + I + Sbjct: 789 TMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISEKLKKGICEAL 839 >gi|221115612|ref|XP_002166292.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 484 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 55/176 (31%), Gaps = 20/176 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 + +I K + + G F+ R + + + + + + MG Sbjct: 162 QEIEFLINLAKTFNISKNGAHAGVITFSSRAVLNIKLNQYY---DQEQFEFAIDEIPYMG 218 Query: 336 S-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----EGIAIC 390 T I+ A++ A + + + + LLTDGE + + Sbjct: 219 YVTRIDLALRKALEMFDE-------INGARKDIPQLLFLLTDGEQYAGDGVVDENPANVA 271 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI-FRDRI 445 + +GI I+ + Q E S F A + EL F RI Sbjct: 272 QLVRDRGIEIIAVGIGSGVNQSELNSIA----GSSEKVFLAENFDELINKNFLKRI 323 >gi|172037673|ref|YP_001804174.1| hypothetical protein cce_2760 [Cyanothece sp. ATCC 51142] gi|171699127|gb|ACB52108.1| unknown [Cyanothece sp. ATCC 51142] Length = 423 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 55/187 (29%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A ++ + D R+ F+ R VK Sbjct: 67 IKTVKEAAMRLVDGLGASD------RLSVVAFDHRAKVIVP---NQPVDDIERVKQAIER 117 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G T+I++ M+ + +D V + I LLTDGEN DNE + + Sbjct: 118 LKPEGGTSIDEGMKLGIKEVALGKDDRVSQ----------IFLLTDGENEHGDNERCLKL 167 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 A I + T+ F + L + A + + F Sbjct: 168 AQVAAEYNITVNTLGFG-----NHWNQDVLESIADAVGGTLCYIEQPEQALTEFSRLFTR 222 Query: 448 EIFERVI 454 + Sbjct: 223 IQSVGLT 229 >gi|14248645|gb|AAK57608.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248647|gb|AAK57609.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 52/192 (27%), Gaps = 30/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + K Sbjct: 117 SRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT-----KNAL 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 TA +A+ TA I + + IVL +DG+ T Sbjct: 166 DKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTXPPARIVLFSDGKETMPTNPDNP 222 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E + A S + + A Sbjct: 223 KGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAA 282 Query: 433 STHELNKIFRDR 444 + EL ++ Sbjct: 283 TLAELRAVYSSL 294 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + +++ + ++G F + Sbjct: 98 DISNSMLAEDVSPSRLEKSKLLVENLMNKFSE-------DKIGLIVFAGDAFVQLPITGD 150 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + T I A+Q + ++ N + K I+L+T Sbjct: 151 YVSA-KMFLDNINPSLIGTQGTDIGKALQLSINSFTP-----------NSKVGKAIILIT 198 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSN 421 DGE+ + A+ +A+++GI++ + T L + Sbjct: 199 DGEDNEGGA--EAMAKQARNKGIKVFILGIGSKEGSTIPMPDGTELKD 244 >gi|198426775|ref|XP_002120099.1| PREDICTED: similar to SD03168p [Ciona intestinalis] Length = 1474 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 58/377 (15%), Positives = 113/377 (29%), Gaps = 27/377 (7%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 +E + F ++ + + +V +N LNP S + Sbjct: 1 MESTFLNVFIISIISTTSAFNIIPSELTKSVTLNASVVNSTGPTYFGTSFFLNPSSGNHQ 60 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI-DFSRSMLDYQRDSEGQPLNCFGQPA 208 + L + S+ + S S A Sbjct: 61 LLVGSPLLNSGSQTEPSGGLSFCEFSFTDSVPSCKPISPSTPALVAGD----AVGLSFAA 116 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY-----PGPLDPSLSEEHFVDSSS 263 + + D +L + + + P + + F+ S Sbjct: 117 QADGNITACAPLRPSYCDGELEHLGYCYAGTEFGKTWMSISRIDPFECPKVDILFLLDGS 176 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 V ++++ + ++ S N + V +G F+ ++ F G+ I Sbjct: 177 GSIVESDFEIMKEWIENITLSFDISSNGS--VAVGLMQFSHFSLTKTEFQIGMFTTKEEI 234 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + G+T DA++ + S+ N+ +K IVLLTDGE T D Sbjct: 235 MAAMKNVTIKKGNTYTADALRRSIAVFQKSSR------YNDTNTRKVIVLLTDGEAT-DT 287 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS------NCASPNS-FFEANSTHE 436 + + +SQGI I + + E++ A S F +T Sbjct: 288 ASLSSTADLVRSQGITITAVLITEKVLPSERSAAVAQMQLIVNGVAGNPSGVFVVGTTAN 347 Query: 437 LNKIFRDRIGNEIFERV 453 L+ + R I I + Sbjct: 348 LDSVIR-AITQRIQSTL 363 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 41/122 (33%), Gaps = 16/122 (13%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G T I ++ AY+ + +K+I+LLTDG+ Sbjct: 423 NNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADM-----------KLQRKHIILLTDGQ 471 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + N + + + K GI + T+A + LS F+ + Sbjct: 472 SQPGNYD--DLIEQGKDNGITLSTVAIGQDADANLLE--ALSEM-GSGRFYNVIDEQTIP 526 Query: 439 KI 440 I Sbjct: 527 SI 528 >gi|41386751|ref|NP_958822.1| calcium-activated chloride channel regulator 4 [Rattus norvegicus] gi|37703077|gb|AAR01113.1| parturition-related protein PRP3 [Rattus norvegicus] Length = 923 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 55/192 (28%), Gaps = 24/192 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A ++ I + G F+ + Sbjct: 313 DVSGSMGSYDRLNRMNQAAKFFLQQILES-----RSWAGMVHFHSSATVKSELIQINSDV 367 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + G T+I ++TA+ + I+LL+DGE+ Sbjct: 368 ERNQLLETLPT-SASGGTSICSGIRTAFQVFKNKGYQTGGND---------ILLLSDGED 417 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH--EL 437 + + ++ K G + IA +++ ++N F + L Sbjct: 418 STAKDCL----DEVKDSGAVVHFIALGK---AFDQSISNMANVTGGKQLFATDEAQNNGL 470 Query: 438 NKIFRDRIGNEI 449 F Sbjct: 471 IDAFGALASENA 482 >gi|117928940|ref|YP_873491.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] gi|117649403|gb|ABK53505.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] Length = 177 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 46/154 (29%), Gaps = 3/154 (1%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 +V + S + G ++ V+LG ++VD+ + L+ A A Sbjct: 12 IVRSARVRGKSASQCRDDGGQISLLIVFFGLVILGFTTVIVDLSTVFLAQRVLQATADGA 71 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 +TA+ + + + + P E Y + TA Sbjct: 72 ALTAAQHVS---LAGAYTTELAEWLPLSDAEVYAAVADYVGEPGRAPQSCRSGTLSITAA 128 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 ++ V LS L + + G+ Sbjct: 129 TLDATDRTVSVSLSCTVSLPIVNVITGSWWRGVP 162 >gi|222082656|ref|YP_002542021.1| hypothetical protein Arad_9366 [Agrobacterium radiobacter K84] gi|221727335|gb|ACM30424.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 176 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 14/126 (11%), Positives = 40/126 (31%), Gaps = 6/126 (4%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + ++L+ G I ALL + + ++++ + A ++ Sbjct: 1 MMRARPFASFRRLLADRKGVAAIEFALLALPLFMLIFAIIEISVMFFVNS----AMDASV 56 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 S + S ++ F + ++ L+ ++ D +A Sbjct: 57 QKISRMIRTGEVASSKISQA--DFKAKICDDMLLAFSCSSNLLVKVDILSDLSSATSANP 114 Query: 122 MNPRKS 127 +N + Sbjct: 115 INASGN 120 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/122 (18%), Positives = 48/122 (39%), Gaps = 8/122 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D+ + I++ + G+T A++ Sbjct: 494 VKKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALK 551 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A E H + K+ ++++TDGE+ ++ +++GI I + Sbjct: 552 HANALFT-----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVG 605 Query: 405 FS 406 Sbjct: 606 VG 607 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 48/151 (31%), Gaps = 18/151 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 128 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 185 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T + + +++ I + + Sbjct: 186 KNGMKD------RMSKVPCYLIVLTDGMSTD---RVVEPAKRLRAEQITVHAVGIGA--- 233 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + L A + L I Sbjct: 234 ---ANKIELQEIAGKEERVSFGQNFDALKSI 261 >gi|14248581|gb|AAK57576.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 493 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 52/192 (27%), Gaps = 30/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + K Sbjct: 117 SRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT-----KNAL 165 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 TA +A+ TA I + + IVL +DG+ T Sbjct: 166 DKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNP 222 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E + A S + + A Sbjct: 223 KGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAA 282 Query: 433 STHELNKIFRDR 444 + EL ++ Sbjct: 283 TLAELRAVYSSL 294 >gi|327270786|ref|XP_003220169.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 975 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 62/202 (30%), Gaps = 27/202 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + ++ A + I + +G FN Sbjct: 309 DVSGSMTTYDRIARLKQAAELFLLQIIET-----GSWVGIVTFNSYATRQIGLRQITSDS 363 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +R +K + + G T I ++ + + IVLLTDGE+ Sbjct: 364 VRESLKNYLPT-SAGGGTIICSGVRQGFQV--------KQKKYQTSTKGCEIVLLTDGED 414 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--- 436 + + +S G I TIA N ++ + + + F A + + Sbjct: 415 NSVSSCFAEV----QSSGSIIHTIALGPNAAKELEMLADM----TGGLKFSATDSLDSNG 466 Query: 437 LNKIFRDRIGNE--IFERVIRI 456 L F I ++ I++ Sbjct: 467 LIDAFSRISSESGDISQQSIQL 488 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 68/197 (34%), Gaps = 29/197 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + + +G F S + + + Sbjct: 106 NRITAAKEAAQKFVEDL------PERFNVGVVAFARSASVVVSPTTDHQAVSAS-----L 154 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TAI +A+ ++++ + + + IVLL+DG+NT A Sbjct: 155 GNLTTRAGTAIGEAV---FNSLDAVRSFDQQAV--TDPPPAAIVLLSDGDNTSGRSVAEA 209 Query: 389 ICNKAKSQGIRIMTIAFS-----VNKTQQEKARYFLSNC------ASPNSFFEANSTHEL 437 + + A S + I TIA+ V+ ++ + +EA S +L Sbjct: 210 V-DAAMSARVPISTIAYGTQEGTVSIDGRDVNVPVNKATLQTLSEGTSGRAYEAESGSQL 268 Query: 438 NKIFRDRIGNEIFERVI 454 +++ IG + R + Sbjct: 269 REVYEQ-IGTSLGYRTV 284 >gi|148685682|gb|EDL17629.1| mCG133494, isoform CRA_a [Mus musculus] gi|148685684|gb|EDL17631.1| mCG133494, isoform CRA_a [Mus musculus] Length = 828 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +D S + D + +V+ +++ R Sbjct: 138 NFPTAQQECPKQDQDIVFLIDGSGSISS-TDFEKMLDFVKAVMSQLQRPST-----RFSL 191 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D +F+ + + G T A++ + ++ Sbjct: 192 MQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLR--GYTYTASAIKHVITELFTTQSGARQ 249 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +A K ++++TDG DN ++ A++ I I E ++ L Sbjct: 250 ------DATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIRYAIGVGK-AFYNEHSKQEL 302 Query: 420 SNCA---SPNSFFEANSTHELNKI 440 A S F + L I Sbjct: 303 KAIASMPSHEYVFSVENFDALKDI 326 >gi|10946646|ref|NP_067309.1| integrin alpha-X precursor [Mus musculus] gi|48428495|sp|Q9QXH4|ITAX_MOUSE RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor gi|6684131|gb|AAF23492.1|AF211864_1 leukocyte adhesion glycoprotein p150,95 alpha integrin subunit [Mus musculus] gi|189442099|gb|AAI67225.1| Integrin alpha X [synthetic construct] Length = 1169 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +D S + D + +V+ +++ R Sbjct: 138 NFPTAQQECPKQDQDIVFLIDGSGSISS-TDFEKMLDFVKAVMSQLQRPST-----RFSL 191 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D +F+ + + G T A++ + ++ Sbjct: 192 MQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLR--GYTYTASAIKHVITELFTTQSGARQ 249 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +A K ++++TDG DN ++ A++ I I E ++ L Sbjct: 250 ------DATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIRYAIGVGK-AFYNEHSKQEL 302 Query: 420 SNCA---SPNSFFEANSTHELNKI 440 A S F + L I Sbjct: 303 KAIASMPSHEYVFSVENFDALKDI 326 >gi|74215118|dbj|BAE41792.1| unnamed protein product [Mus musculus] Length = 1169 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +D S + D + +V+ +++ R Sbjct: 138 NFPTAQQECPKQDQDIVFLIDGSGSISS-TDFEKMLDFVKAVMSQLQRPST-----RFSL 191 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D +F+ + + G T A++ + ++ Sbjct: 192 MQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLR--GYTYTASAIKHVITELFTTQSGARQ 249 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +A K ++++TDG DN ++ A++ I I E ++ L Sbjct: 250 ------DATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIRYAIGVGK-AFYNEHSKQEL 302 Query: 420 SNCA---SPNSFFEANSTHELNKI 440 A S F + L I Sbjct: 303 KAIASMPSHEYVFSVENFDALKDI 326 >gi|332520546|ref|ZP_08397008.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332043899|gb|EGI80094.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 698 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/295 (12%), Positives = 85/295 (28%), Gaps = 27/295 (9%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + + T S K +++ +I+ ++ + + +N F + Sbjct: 250 LTNTSALSTFSIDVDKASYSNVRRMINNAQHIP-ADAVKIEEMINYFNYNYSQPKD-EHP 307 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + L Y P+ + +D S K ++ A Sbjct: 308 FAIHTEVAQTPWNNQTKLVKIGLQGKTYENKELPAANLTFLIDVSGSMSHELKL--LKSA 365 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ ++ D V+ + + G + + GST Sbjct: 366 FKLLVDQLRDKDKVS------IVVYAGAAGVVLEPTSGK---DKKKILKALNKLQSGGST 416 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---K 394 A + AY E+ ++ NN ++L TDG+ A+ + + Sbjct: 417 AGGAGINLAYKL----AEENFNKNGNNR-----VILATDGDFNVGASSNQAMEDLIIEKR 467 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 G+ + + F + +K + ++ E IF G + Sbjct: 468 KSGVFLSVLGFGYGNYKDDKLETLADK--GNGNHAYIDTMQEAKLIFGKEFGGTL 520 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 64/219 (29%), Gaps = 18/219 (8%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + K + + + +I I +D + Sbjct: 22 MDEGKGEKPYGTGSQQSKCKTGPLDIVFVIDSSRSVRPFEFETMRRFMIDIIHNLDIGPN 81 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G ++ +V + S + K T A+Q + ++ Sbjct: 82 ATRVGVIQYSSQVQNVFSLKSFF--TRAEMEKAINNIVPLAQGTMTGLAIQYVMNVAFTT 139 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 E K + V++TDG + + + +A++ GI I + Q Sbjct: 140 QEGARPLHKKIP---RVAVIVTDG---RPQDRVTEVSAQARAAGIEIYAVGV------QR 187 Query: 414 KARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 L ASP F S +L + F + +++ Sbjct: 188 ADMNSLRAMASPALEEHVFLVESF-DLIQQFGKQFQDKL 225 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 50/161 (31%), Gaps = 19/161 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G ++ RV ++ + K Sbjct: 373 FELVKQFVNQIVDFLDVSPHGT---RVGLVQYSSRVRTEFPL--NKFTTAADLKKAVQRV 427 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + T A++ + + E + ++ TDG +QD+ Sbjct: 428 QYMEKGTMTGLALKHMLEHSFTEAEGARPLS---QNVPRIGLVFTDGR-SQDDI--SEWA 481 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFF 429 ++K GI + + + L AS + F Sbjct: 482 RRSKEAGIIMFAVGVGKAVESE------LREIASEPVDKHF 516 >gi|226358120|ref|YP_002787859.1| hypothetical protein Deide_2p00900 [Deinococcus deserti VCD115] gi|226319763|gb|ACO47757.1| Conserved hypothetical protein [Deinococcus deserti VCD115] Length = 418 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 50/167 (29%), Gaps = 23/167 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F+DRV R V ++ GST ++ + + Sbjct: 78 DDRVSVVAFDDRVDVIVPSQLATS---REAVIQAIGTIDDRGSTNLHGGWLEGATQV--A 132 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKT 410 + ++LL+DG+ + I + + +GI TI + Sbjct: 133 QHLTPGALNR-------VILLSDGQANVGVTDRREIARQVRGLTERGISTTTIGLGSHYD 185 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 ++ A +F L F + + + + R T Sbjct: 186 EELLLAI---ANAGDGNFEHVEDPSRLPTFFEEEL-----QGLTRTT 224 >gi|332088403|gb|EGI93521.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 575 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/332 (9%), Positives = 93/332 (28%), Gaps = 30/332 (9%) Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ + N + + + T S +++ ++ Sbjct: 84 PTFARAAKAKATHIANLGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ- 142 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSL 240 + + +N F D K + + + + Sbjct: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IV 256 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + G H + + + GST ++ AY ++ Sbjct: 257 TYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQ--QAAKGFIKGG 311 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + I+L TDG+ ++ + + K + G+ + T + + Sbjct: 312 INR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEAMMVR 364 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 365 IAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 394 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 48/153 (31%), Gaps = 19/153 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDT 349 + R+G ++ +V S R + D T A+Q A + Sbjct: 66 GPNATRVGVIQYSSQVQSVFPL---RAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNV 122 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S E + + V++TDG + + + +A+++GI I + Sbjct: 123 AFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARGIEIYAVGVQRAD 176 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNK 439 L ASP F S + + Sbjct: 177 VGS------LRAMASPPLEEHVFLVESFDLIQE 203 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 401 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 455 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 456 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 509 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 510 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 539 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 48/153 (31%), Gaps = 19/153 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDT 349 + R+G ++ +V S R + D T A+Q A + Sbjct: 66 GPNATRVGVIQYSSQVQSVFPL---RAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNV 122 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S E + + V++TDG + + + +A+++GI I + Sbjct: 123 AFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARGIEIYAVGVQRAD 176 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNK 439 L ASP F S + + Sbjct: 177 VGS------LRAMASPPLEEHVFLVESFDLIQE 203 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 51/156 (32%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 360 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 414 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 415 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 468 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + AS Sbjct: 469 ARAKEEGIVMYAVGVGKAVEAD------WREIASEP 498 >gi|14248627|gb|AAK57599.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + K K + + + I Sbjct: 110 PNSK-YRALEVAKKLKQRNVSLAVIGIGQGIN 140 >gi|186590|gb|AAA59195.1| inter-alpha-trypsin inhibitor heavy chain [Homo sapiens] gi|226055|prf||1409219A inter alpha trypsin inhibitor H Length = 383 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 43/127 (33%), Gaps = 11/127 (8%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T IN+A+ A + +N + + I+L++DG+ T + I K Sbjct: 1 GGTNINEALLRAIFILNEANNLGLLDPNSVS----LIILVSDGDPTVGELKLSKIQKNVK 56 Query: 395 SQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I + ++ + R N + T K F + ++ Sbjct: 57 ENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYN----QVST 112 Query: 452 RVIRITK 458 ++R + Sbjct: 113 PLLRNVQ 119 >gi|126341668|ref|XP_001379927.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2568 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 55/152 (36%), Gaps = 13/152 (8%) Query: 292 NDTVRMGATFFNDRVISDPS--FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V F SD + + + + + G+T A+ +++ Sbjct: 783 KADVGRDRVQFGALKYSDTPEILFYLNDYQNKKSIIDKLKFQRKGGNTYTAKALNRSHEL 842 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 E H + K+ ++++TDG+ + D E + +++ I I + Sbjct: 843 FT-----EQHGSRIKRGVKQMLIVITDGK-SHDYLELETVGKALRAKNIIIYGVGV---- 892 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + L S +++F ++ +L +I+ Sbjct: 893 -AEASDKELLDITGSKDNYFMVDNFEKLKEIY 923 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/280 (12%), Positives = 83/280 (29%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + N F + S G+ + + S Sbjct: 97 VTLSNVSERKDNMRLGLSLTTNPKDNSFVACSPLWSHECGSSYYTTGMCSRVNANFRFSK 156 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 157 TVAPALQRCQTYMDIIIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 206 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + V+ + + + I++ T ++ A Sbjct: 207 VGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAYGIEFA--------RS 257 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE + D+ + + + ++ I +A ++ Sbjct: 258 EAFQKGGRKGAKKVMIVITDGE-SHDSPDLEQVIDASEKDNITRYAVAVLGYYNRRGINP 316 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G+ I Sbjct: 317 EAFLSEIKYIASDPDDKHFFNVTDEAALKD-IVDALGDRI 355 >gi|302382107|ref|YP_003817930.1| hypothetical protein Bresu_0994 [Brevundimonas subvibrioides ATCC 15264] gi|302192735|gb|ADL00307.1| Protein of unknown function DUF3520 [Brevundimonas subvibrioides ATCC 15264] Length = 625 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/353 (11%), Positives = 84/353 (23%), Gaps = 30/353 (8%) Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 Y + V ++ V+ + + Sbjct: 93 AYGAPPPPPPPPPPPPPAPAALVRPPVVVTNAAGQTTVDGVVVPGRPGTRVDTERYPDAT 152 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + + + + + +N F R Sbjct: 153 PNPVRRVADEPV--STFSIDVDTAAYANVRRFISEGQTPPRDAVRVEEMINYFDYGYARP 210 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNK-----SLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 ++ + S + + L P L+ VD S Sbjct: 211 GRADEPFAVSTAVAASPWSANAGAGGRQIVHIGLQGYELPAGERRPLNLTFMVDVSGSMQ 270 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K L + + +I ++ D R+ T++ V + + G KL ++ Sbjct: 271 SPDKLGLAQQTMNLIIDRLRPED------RVAVTYYASDVGTAVGPTPGSEKLK---LRC 321 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDN 383 N GSTA M AY+ + + + ++ TDG+ D+ Sbjct: 322 AVAALNAGGSTAGAQGMVNAYEQ--------AEAAFSPDKVNRILMF-TDGDFNVGVTDD 372 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + GI + F Q + + A + E Sbjct: 373 RRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQTIAQ--AGNGVAAYVDDLDE 423 >gi|332206577|ref|XP_003252372.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 [Nomascus leucogenys] Length = 1107 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 76/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 173 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 232 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 233 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 292 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 293 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 352 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 353 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 399 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 400 RGPIQWMACENKGYYYEIPSIGAI 423 >gi|291444745|ref|ZP_06584135.1| lipoprotein [Streptomyces roseosporus NRRL 15998] gi|291347692|gb|EFE74596.1| lipoprotein [Streptomyces roseosporus NRRL 15998] Length = 531 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 84/295 (28%), Gaps = 28/295 (9%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + ++ + + +R + + + + +++ E N F D Sbjct: 86 LSTFALDVDTASYGYARRTLGDGRLPAAEDVRPEEFVNSFRQGYERPKGNGFSVNIDGAR 145 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + G + + + P P + VD S + Sbjct: 146 INSGKGGGGGTGASD----WSLLRVGLATKTAPPTSERPPAALTFVVDISGSMAETGRLD 201 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LVR +L + ++ D+++ F+D + + K R +K + Sbjct: 202 LVRKSLTILADELRDDDSLS------LVTFSDEAETRLPMT--RVKDNRNRIKDVVSEMQ 253 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ST + ++ Y+ + + + +VLL+D E I K Sbjct: 254 PAQSTNVEAGIKLGYEESVEGHREGATNR---------VVLLSDALANTGETEAEGILKK 304 Query: 393 A----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + GI + + + + + + K+F D Sbjct: 305 IDSARREYGITLFGVGVGSDYGDAFMEQLTNK---GDGNTTYVGDETQARKVFVD 356 >gi|149928031|ref|ZP_01916279.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] gi|149823219|gb|EDM82455.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] Length = 543 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 50/149 (33%), Gaps = 4/149 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGG-MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +S G + I+T ++ + LG G + V ++ + L+ A +TA+ + E Sbjct: 8 RQSQRGAYSILTVFVLIMSLGALGVLAV--GHMAWEKTRLQGVADLVALTAARQMSNGPE 65 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++A + + + ++ N D + V + Sbjct: 66 FAEAQAIALANGVSEDDDLTIECIIDDAPT-NDCDNSITSRVTIVRPVNGLLVFLPNREV 124 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 + + + P + S G+ S A Sbjct: 125 TVLAEATVAPTVVGSVSSGLVSVDTNQSA 153 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/142 (21%), Positives = 44/142 (30%), Gaps = 19/142 (13%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 T TA +A+ TA I + + IVL +DG+ Sbjct: 148 TDHTATDNALDHLQLAERTATGEAIFTALQAIDT---LAGVLGGGSTPPPARIVLESDGK 204 Query: 379 NTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA- 423 T + AK QG+ I TI+F + L A Sbjct: 205 QTVPADLNDPRGAFTAARLAKEQGVPISTISFGTTHGAIDLNGSHIPVPVDDESLRRIAE 264 Query: 424 -SPNSFFEANSTHELNKIFRDR 444 S SFF A S EL +++ Sbjct: 265 LSGGSFFTATSADELQASYQNL 286 >gi|94970371|ref|YP_592419.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94552421|gb|ABF42345.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 356 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 49/155 (31%), Gaps = 13/155 (8%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + F+ F+ L + G TA+ DA+ A Sbjct: 166 RPKFDKAFVIGFDTTAEVTQDFTDDTDLLGK-----GVRMLRPGGGTAMYDAIYYAC--- 217 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 D++ + N +K ++LL+DGE+ Q A+ + I I+ + + Sbjct: 218 ----RDKLLKENGNTAMRKAMILLSDGEDNQSRVTREEAVEMAQRAEVIIYAISTNTSGL 273 Query: 411 QQEKARYFLS-NCASPNSFFEANSTHELNKIFRDR 444 + + A+ F ++ F + Sbjct: 274 KLRGDKVLERFAEATGGRAFFPFKISDVANAFSEI 308 >gi|118346119|ref|XP_977009.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288305|gb|EAR86293.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 685 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 67/207 (32%), Gaps = 18/207 (8%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S K V+ L ++I + F +F Sbjct: 87 NKYCILIDRSQSMMSENKLQNVKQYLCNLIEKANTNS------QFALISFGSSQKLIFNF 140 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE--AKKY 370 + H+ + +I K + G T I A++ A++ I + E + + + + Sbjct: 141 TQVTHENLESI-KGQINNIISTGDTNIIQALEVAHNIIKQDQQLENQKEEQTKKRIVRYS 199 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCAS--PN 426 LLTDG++ + K++ I + F ++ L S Sbjct: 200 AFLLTDGQDNMKEKAIFKFRENFKNKDMDYSINCLGFGIDHD-----PLLLGAITSYTGG 254 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERV 453 F+ + +F+D I N+ V Sbjct: 255 KFYYIKPEESVFSVFQDYIKNQQETLV 281 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 58/211 (27%), Gaps = 43/211 (20%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + A+ + +K R+G F + + Sbjct: 99 SNSMLAEDFQPNRLERTKYAIDKLFDGLK-------QDRVGLVVFAGDAVVQLPITSDYR 151 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + + + T I A+ A + ++ + IVL+TDG Sbjct: 152 MA-KAFARRISPSMVSVQGTDIGQALSLATMSFSEKGDNPAG---------RVIVLITDG 201 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------------------EKARYFL 419 E A +A QGIRI TI + + L Sbjct: 202 EGHDSGAIEAAE--RAAEQGIRIFTIGIGTPEGAPIQIGGEFIKDDKGEMVVSKLGEPLL 259 Query: 420 SNC--ASPNSFFEANST----HELNKIFRDR 444 A+ ++ + + E+ + + Sbjct: 260 EKIAQATDGAYIRSTNQSIGLDEIVRTINNM 290 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 52.7 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 62/196 (31%), Gaps = 20/196 (10%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + + F+ SS V++ + ++++ + D R+G + V Sbjct: 24 NTCNNKRLDLVFIIDSSRSVRPYDFEKVKEFILTILQFLDVS---PDATRVGLIQYGSTV 80 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + S + + I + + T A+Q A + S +E N Sbjct: 81 KHEFSLK--TFRRKQEIERAVRRMMHLATGTMTGLAIQYAVNIAFSESEGARP---LNQN 135 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + I+++TDG I KA++ GI I I L + S Sbjct: 136 VPRIIMIVTDGRPQDPVG---EIAAKARNSGILIFAIGVGR------VDMNTLKSIGSEP 186 Query: 427 ---SFFEANSTHELNK 439 F + ++ Sbjct: 187 HEEHIFLVANFSQIET 202 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 37/124 (29%), Gaps = 14/124 (11%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ F + + + ++T T I A+ A + Sbjct: 127 NNDKVAMIVFAGEAFTQLPITSDYISA-KMFLETINPSLISTQGTDIAGAINLAMKSFTP 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + IVL+TDGEN + +A +G+R+ + Sbjct: 186 -----------NEGVGRAIVLITDGENHEGGAV--EAAQEAAKKGVRVFVLGVGSPDGAP 232 Query: 413 EKAR 416 A Sbjct: 233 IPAE 236 >gi|14248677|gb|AAK57624.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + NK K + + + I Sbjct: 110 PNSKYTALE-VANKLKQRNVSLAVIGVGQGIN 140 >gi|116623628|ref|YP_825784.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] gi|116226790|gb|ABJ85499.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] Length = 543 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/262 (12%), Positives = 81/262 (30%), Gaps = 31/262 (11%) Query: 15 IKSCTGHFFIITALLMP-VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +G ++ LL+P +M+ + G+ +D + L+ A A + A L Sbjct: 8 RSRKSGSVMVLITLLLPSIMIPLVGLAIDASVARLVQLRLQAAVDGAAMGAGRLLGT--- 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + E L F + + + + + + Sbjct: 65 --------------PAVPETLAAEFLASNFRTDGSAGTWGAHDLHSTIVYTPGITKIIDI 110 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-LDY 192 + + L LFLR +G S ++ + + VID S +M + Sbjct: 111 DATAQVPL----LFLRILGKTSATVRARGSGTR--------TDSRVMLVIDRSGTMDVSD 158 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + N + +++ ++G+ S Y+ + Sbjct: 159 GTGLPTRIENAKTVAQTLFIPAFTEGADEIGLVAFDGSAYVAYPPSQPGWDPTTTSSSRG 218 Query: 253 LSEEHFVDSSSLRHVIKKKHLV 274 + +F D ++ ++I + + + Sbjct: 219 GPDMYFKDPNNPNNMINQVNAI 240 >gi|239987768|ref|ZP_04708432.1| hypothetical protein SrosN1_10718 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 84/295 (28%), Gaps = 28/295 (9%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + ++ + + +R + + + + +++ E N F D Sbjct: 70 LSTFALDVDTASYGYARRTLGDGRLPAAEDVRPEEFVNSFRQGYERPKGNGFSVNIDGAR 129 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + G + + + P P + VD S + Sbjct: 130 INSGKGGGGGTGASD----WSLLRVGLATKTAPPTSERPPAALTFVVDISGSMAETGRLD 185 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LVR +L + ++ D+++ F+D + + K R +K + Sbjct: 186 LVRKSLTILADELRDDDSLS------LVTFSDEAETRLPMT--RVKDNRNRIKDVVSEMQ 237 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ST + ++ Y+ + + + +VLL+D E I K Sbjct: 238 PAQSTNVEAGIKLGYEESVEGHREGATNR---------VVLLSDALANTGETEAEGILKK 288 Query: 393 A----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + GI + + + + + + K+F D Sbjct: 289 IDSARREYGITLFGVGVGSDYGDAFMEQLTNK---GDGNTTYVGDETQARKVFVD 340 >gi|47218379|emb|CAG01900.1| unnamed protein product [Tetraodon nigroviridis] Length = 683 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 45/279 (16%), Positives = 90/279 (32%), Gaps = 32/279 (11%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + ++ ++ + + D + + K + + Sbjct: 130 DFKRLVSYNTTAVHIPTDIYEGSTIILNELNWTEALEDVFRKNKEEDPSLLWQVFGSATG 189 Query: 238 KSLYYMLYPG-PLDPSLSEEHFVDSSSLRHVIKKKHLVRD------ALASV--------- 281 + YY P + S ++ D I+ +D A SV Sbjct: 190 LARYYPASPWMDISNSANKIDLYDVRRRPWYIQGAASPKDMLILVDASGSVSGLTLKLIQ 249 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAI 339 I K ++ ++D + +FND+ F V + ++K + G+T Sbjct: 250 ISVSKMLETLSDDDYVNVVYFNDKAKYASCFENLVQANVRNKRMLKKAVQNITAKGTTNY 309 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK-SQGI 398 + + A++ + + K I+L TDG E+ I K +Q + Sbjct: 310 SGGFELAFEQL-------AQMNVSRANCNKIIMLFTDG----GEEKAEEIFKKYNPNQEV 358 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 RI T FSV + +K CA+ ++E S + Sbjct: 359 RIFT--FSVGQHNYDKGPIQWMACANKGYYYEIPSIGAI 395 >gi|313233701|emb|CBY09871.1| unnamed protein product [Oikopleura dioica] Length = 663 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/358 (12%), Positives = 93/358 (25%), Gaps = 24/358 (6%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 E + L T P + S + +S Sbjct: 292 TTGAPEPTRSQPPGPPLTTTTGPPTSPLSPSTTLPPGPPATTTTDTPAESTTSSIGPTIS 351 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + + + G S Q + Q Sbjct: 352 QPPGPPVTTTMEPPGSSTTNQPNITNTLPPGPPATTTTGLIGSSTTAQLTTTNQSATTIS 411 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 +V N I C + + + +D + L Sbjct: 412 PTCPPSVPD-PCMNNSTCIMINATL---TDCACQRGWAGEFCDVVCPATLTGRLDLAVLL 467 Query: 266 HVIKKKHLVRDALASVIRS----IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 V + + + D V + V++ T F+D + D H Sbjct: 468 DVSGTIASNPNKDQDTFDFFQALLNEFDTV-NQVQLSITSFSDDAVVDLPMG---HYNEP 523 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + T IN+ +QTA T+ ++ + ++ ++DG ++ Sbjct: 524 DLFGAVKNVDWVGRLTDINEGLQTALSTMNTT-----------DDVPDIMIFVSDGFDSF 572 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 D + G+ ++++ F +N A +++ + F A++ EL Sbjct: 573 DPGAIGDNAADISNAGVDVVSVGFGLNGFVNFMALVTVADNEGA-NVFTASTGDELLS 629 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 53/184 (28%), Gaps = 23/184 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ V + ++ L +I R+G FN ++ + Sbjct: 89 SMNATDVVPSRIDRAKNELQGLIDRFSA-------DRIGIILFNSNAYLLTPLTFDTENI 141 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + GST + ++ + + ++ K +++TDGE Sbjct: 142 -RNTIGNLKTHMIDKGSTDFSPMLEMINEKLSVGTQNR----------GKVAIVVTDGET 190 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHE 436 + + + K I + + + A E + +E Sbjct: 191 HY--QIDEQLAKRLKQNNIHLFWLGVGTLGGGKIPEGRGFKKDAEGRDVVTALEVKNINE 248 Query: 437 LNKI 440 L K Sbjct: 249 LAKA 252 >gi|301758388|ref|XP_002915048.1| PREDICTED: matrilin-3-like [Ailuropoda melanoleuca] Length = 466 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 62/181 (34%), Gaps = 23/181 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++ +I ++ D R+ + V ++ + +++ + A Sbjct: 79 FTKVKTFVSQIIDTL---DIGAADTRVAVVNYASTVKTEFHLQ--TYSDKQSLKQAVARI 133 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+QTA D + E K +++TDG + ++ + Sbjct: 134 TPLSTGTMSGLAIQTAMDEAFTV---EAGARGPTSNIPKVAIIVTDG---RPQDQVNEVA 187 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +A++ GI + + + + L AS F + +L+ F++ Sbjct: 188 ARARASGIELYAVGV----DRADMES--LKMIASEPLDEHVFYVETYGVIEKLSSRFQET 241 Query: 445 I 445 Sbjct: 242 F 242 >gi|281344227|gb|EFB19811.1| hypothetical protein PANDA_002988 [Ailuropoda melanoleuca] Length = 395 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 62/181 (34%), Gaps = 23/181 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++ +I ++ D R+ + V ++ + +++ + A Sbjct: 26 FTKVKTFVSQIIDTL---DIGAADTRVAVVNYASTVKTEFHLQ--TYSDKQSLKQAVARI 80 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+QTA D + E K +++TDG + ++ + Sbjct: 81 TPLSTGTMSGLAIQTAMDEAFTV---EAGARGPTSNIPKVAIIVTDG---RPQDQVNEVA 134 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +A++ GI + + + + L AS F + +L+ F++ Sbjct: 135 ARARASGIELYAVGV----DRADMES--LKMIASEPLDEHVFYVETYGVIEKLSSRFQET 188 Query: 445 I 445 Sbjct: 189 F 189 >gi|239941305|ref|ZP_04693242.1| hypothetical protein SrosN15_09946 [Streptomyces roseosporus NRRL 15998] Length = 516 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 84/295 (28%), Gaps = 28/295 (9%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + ++ + + +R + + + + +++ E N F D Sbjct: 71 LSTFALDVDTASYGYARRTLGDGRLPAAEDVRPEEFVNSFRQGYERPKGNGFSVNIDGAR 130 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + G + + + P P + VD S + Sbjct: 131 INSGKGGGGGTGASD----WSLLRVGLATKTAPPTSERPPAALTFVVDISGSMAETGRLD 186 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 LVR +L + ++ D+++ F+D + + K R +K + Sbjct: 187 LVRKSLTILADELRDDDSLS------LVTFSDEAETRLPMT--RVKDNRNRIKDVVSEMQ 238 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ST + ++ Y+ + + + +VLL+D E I K Sbjct: 239 PAQSTNVEAGIKLGYEESVEGHREGATNR---------VVLLSDALANTGETEAEGILKK 289 Query: 393 A----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + GI + + + + + + K+F D Sbjct: 290 IDSARREYGITLFGVGVGSDYGDAFMEQLTNK---GDGNTTYVGDETQARKVFVD 341 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 52.3 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 61/187 (32%), Gaps = 22/187 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V A + I+S D +M FN+ V S + + Sbjct: 93 KMASVIAAARTFIQSSSPED------QMFVVNFNEDVTLGLS-TEIPFTNRPEDLTYAIS 145 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TA+ DA+ A + + + D KK +V+++DG + I Sbjct: 146 HSPPTGKTALYDAVWKAREWVARGSRD-----------KKVLVVVSDGGDNASTHTLSEI 194 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLS--NCASPNSFFEANSTHELNKIFRDRIGN 447 A I++ TI + +K L A+ F + E+ + I Sbjct: 195 LEAANKSNIQVFTIGI-FDPDDPDKNPGVLRQLARATGGEAFVPDELSEVV-AICESIAK 252 Query: 448 EIFERVI 454 +I + Sbjct: 253 DIRSQYT 259 >gi|156382099|ref|XP_001632392.1| predicted protein [Nematostella vectensis] gi|156219447|gb|EDO40329.1| predicted protein [Nematostella vectensis] Length = 298 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 21/204 (10%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D+S +++ + S++ D Sbjct: 97 CRKTIKKKSCPIDIAFLLDASGSIGRRSW-EEIKNFVKSIVDMCDISDQGTHVGI---IT 152 Query: 302 FNDRVISDPSFS-WGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + D +F + ++ +K + + G T I+ A+ A ++ + Sbjct: 153 FSTDPVIDIAFDKYKGVEMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFT------Q 206 Query: 360 RMKNNLEAKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +++K VL++DG T+D N K +G+++ T+ + E Sbjct: 207 EAGMREDSQKVAVLMSDGIQTKDRGPFTPTDIAANPLKMKGVQVYTVGIGADVDVFE--- 263 Query: 417 YFLSNCASP-NSFFEANSTHELNK 439 L AS S F +L Sbjct: 264 --LMAVASGITSMFSDRYLDDLKA 285 >gi|145491135|ref|XP_001431567.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398672|emb|CAK64169.1| unnamed protein product [Paramecium tetraurelia] Length = 590 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 60/188 (31%), Gaps = 23/188 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V+ +L +++ + D R+ F++ ++ ++ Sbjct: 184 KIEMVKASLIVLLQFLGDND------RLQLITFDNDAHRLTPLKTVTNQ-NKSYFTQIIK 236 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G I++A + + + + LL+DG + E I Sbjct: 237 QIQADGGNRISEATKMTFYQLKGRKYINNVTS---------VFLLSDGVDVTYPEVKKQI 287 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF-LSNCASPNSFFEANSTHELNKIFRDRIGNE 448 K ++ + T F + Q + L SF+ L++ F D +G Sbjct: 288 --KTVNEVFTLHTFGFGEDHDAQMMTQLCNLK----SGSFYFVQDVTLLDEFFADALGGL 341 Query: 449 IFERVIRI 456 I ++ Sbjct: 342 ISVVGEQL 349 >gi|330688339|gb|AEC32940.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 483 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 39/134 (29%), Gaps = 12/134 (8%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGS 336 + I + DT+ + F + G R + + G+ Sbjct: 1 MLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKRQALSKVTELRKSYSPYGT 60 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + A+ + N +A + ++L+TDG + + K K + Sbjct: 61 TNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGVPNSK-YRALEVAKKLKQR 111 Query: 397 GIRIMTIAFSVNKT 410 + + I Sbjct: 112 NVTLAVIGIGQGIN 125 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 45/147 (30%), Gaps = 18/147 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + A + +K + R+G F+ + + L+ + Sbjct: 107 KPDRLERAKREITDFLKVVK----GDRVGLVAFSGAAYTQCPLTLDYGALMMFLNILHPN 162 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T + A+ A E K I+L+TDGE+ + G+ Sbjct: 163 NI-PHPGTDLGAAVLGAIKAFDPK-----------SETDKVILLITDGEDNEK--RGLDA 208 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +A +GI+I A Sbjct: 209 AREAVRKGIKIFVFGMGDPAGGPIPAS 235 >gi|242007523|ref|XP_002424589.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508032|gb|EEB11851.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1003 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/283 (10%), Positives = 77/283 (27%), Gaps = 17/283 (6%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + + + + I + +++ + + Sbjct: 132 NSANMTFSHNFRQKVSYKESGVHIPLEIFDGYSNILNGVNWTSSLDEVFKENHRMDPNIG 191 Query: 212 VKSYSSQNGKVGIRD---EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + S +G + + Y D S H Sbjct: 192 WQFFGSFHGFLRVYPAFRWPDHSRYPDFFDVRRRSWYIQSSTSPKDVIILFDRSGSVHGP 251 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS-FSWGVHKLI--RTIVK 325 +++ +++ S+ + D + +FN+ V V + ++ Sbjct: 252 T-LDIMKITARALLNSLGEND------FVNVAWFNNDVKWVVPCLKTLVQATTQIKNLLA 304 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 E T+ A+ AY+ +E+ + K ++ L+DG + Sbjct: 305 DAIERLTESNLTSYVTALDFAYEEF--RKFEEIKKPWIGSNCHKIVMFLSDGGTEWPTDV 362 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 CN + ++ I+I T F+ C++ F Sbjct: 363 INRHCNNSNNENIKIFT--FACGPHPIPTVILKEMACSTGGYF 403 >gi|217978821|ref|YP_002362968.1| hypothetical protein Msil_2684 [Methylocella silvestris BL2] gi|217504197|gb|ACK51606.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 429 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 42/379 (11%), Positives = 89/379 (23%), Gaps = 19/379 (5%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ G ++ L +PV++G+ + ++ Y + ++ A +A + + Sbjct: 15 IAALGRRFRYDRGGGVALMIGLALPVIIGMIALGTEISFLLYKKFQMQSVADSAALGGAA 74 Query: 67 PLIQSLEEVSSRAKNSFTFP-----KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 L A+ +F + + N I + + Sbjct: 75 ALQSGHPAPGIEARGISSFLGFVDGAAGVTVTVNNPPATGSAANNASAVEVIISQPQTLS 134 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLF---LRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 M + + +R S L S G S A + Sbjct: 135 MVSLFVSGLFTVGARAVATRGTTSSCVLQLGSGGQFSMNNGAVANLTACGLAVDSTSAAG 194 Query: 179 IQWVI-----DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + S S++ S G +N G Sbjct: 195 LSLTGGAQLNALSVSVVGGASLSNGAAINPSSALKTSQANVADPYAGVTMPAMNSSCTTG 254 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH--VIKKKHLVRDALASVIRSIKKIDNV 291 + P S+++ K + A + + + Sbjct: 255 SPTTQVQGSNATISPGVYCNGISFNYGSATMNPGVYFIDKGVFNVAGGETLTATNVTIIL 314 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI---NDAMQTAYD 348 G + + + S IV F +T I A+ Sbjct: 315 TSKTTSGYATLSVSNGATVNLSAPTSGATAGIV-FFGDRNAPSSNTNIVAGGTALTVNGA 373 Query: 349 TIISSNEDEVHRMKNNLEA 367 S + + +N A Sbjct: 374 VYFPSQKLQFENGSSNASA 392 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/246 (12%), Positives = 62/246 (25%), Gaps = 43/246 (17%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + + + + ++++AL + + N V + Sbjct: 51 WTTDACRLNRWGQPYPSCYPGNGEKSRLDIMKEALELFLDDL----PDNVKVGILTYSAG 106 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + T G T A+ A ++ + Sbjct: 107 NNIDLLHEVKQLSDNNHKATLLTTIDGLEANGGTLTAGALYEAGSYFRGQYDNLPSPITP 166 Query: 364 NLEAKKYIVLLTDGENTQ----------------------------------DNEEGIAI 389 IV LTDG+ + Sbjct: 167 GCSNASNIVFLTDGQPNSMSYNGYSYRNSIINMTGSSCARSDDGKECSEKLAGFLSTVDQ 226 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGN 447 ++ TIAF++ R FL N A + A+ST L F+ I Sbjct: 227 IEDLTPSKVKTHTIAFALEDNNA---RTFLENVADAGNGQSYTADSTDGLVDAFKSSIQT 283 Query: 448 EIFERV 453 +I + + Sbjct: 284 DIEQSM 289 >gi|87199929|ref|YP_497186.1| hypothetical protein Saro_1912 [Novosphingobium aromaticivorans DSM 12444] gi|87135610|gb|ABD26352.1| conserved hypothetical protein [Novosphingobium aromaticivorans DSM 12444] Length = 435 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 95/381 (24%), Gaps = 27/381 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I ++L+++ G+ ++ A P ++ G VD +W ++ ++ AA A ++A+ Sbjct: 12 ISLFRRLLRARAGNATLLMAFGAPALIATAGFAVDTAQWYLWKREMQYAADQAALSAAYS 71 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + + A + Sbjct: 72 KSKGISTTAYETHAVQEYNANL----------------QLVTFSDTPTVSLAAYNGGTNN 115 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + V S+ +L + + L + S A A S I S Sbjct: 116 SVVVRASATRELAFSGIVLGKPTTVSVSAQATYTAGATYTSCIIATNATADGAITIGGSS 175 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + ++ T + + + + + + + Sbjct: 176 ILKSGC---------GIAALSNSTNAIKVDGSPTIDVNYVLAAGGIDDWFNTNTDDVVKE 226 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + VR V ++ D T N + Sbjct: 227 YVTSLADPFASLTPPTNNTAATPYRCVRS--GGVTQATLNPGTYTDITTSCNTVMNSGIY 284 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S+ + I S IN + SS + + Sbjct: 285 VINGGSFTIRAQDAVTANGVMIVLKNGASIKINGGAALNLTAMTSSQLTSAGVSATDAQK 344 Query: 368 KKYIVLLTDGENTQDNEEGIA 388 + I+ D ++ + Sbjct: 345 LEGILFFEDRNSSGNTSILNG 365 >gi|322437225|ref|YP_004219437.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321164952|gb|ADW70657.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 528 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 94/362 (25%), Gaps = 17/362 (4%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 ++ + G +I A M +LG + VDV + Y + L++AA A + + + + Sbjct: 74 RIFRDEDGSATLIAAFGMVAILGFLALAVDVGQLRYQKRGLQKAADAAALASVLEMSYCD 133 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ +E L + + ++ Y Sbjct: 134 GTLACG-VMQTAAKSAMVENGLTPDNIVTACGTVPSTGLTLMINHGPCAKGASDPNYGKT 192 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY-------HKEHGVSIQWVIDF 185 S ++ ++F + +G+ + + +AEA T + +++Q + Sbjct: 193 SSVEVLVMQAQPTIFAKVLGLSTGTVGARAEASTTGGTNCIFALDPTGSGALTVQGLASI 252 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC----NKSLY 241 + S L+C + P + L Sbjct: 253 TSPCGIMVESSSNSALSCAVLGVITASAINVVGGVSNFLCAIAPVPKTSASVPKPADPLA 312 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR-----DALASVIRSIKKIDNVNDTVR 296 Y+ P +S + + + V Sbjct: 313 YLAAPTMPGCGISTSSPYHGAGSTVTVGFGQSATFFADGAYCGGITVQSGGTAVFQPGVY 372 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 + + + + TF G T + ++ A +++ Sbjct: 373 ALTSNNGGATKLPGGLTIDLGTNVSGTGVTFYNYGPSGGITFLAPSLSLAGVRLVAPTSG 432 Query: 357 EV 358 Sbjct: 433 SY 434 >gi|156523168|ref|NP_001095998.1| matrilin-2 [Bos taurus] gi|146186873|gb|AAI40517.1| MATN2 protein [Bos taurus] gi|296480475|gb|DAA22590.1| matrilin 2 [Bos taurus] Length = 958 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 62/197 (31%), Gaps = 20/197 (10%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ F+ SS V++ + +++ + D D R+G + Sbjct: 50 GSACENKRADVVFIIDSSRSVNTHDYAKVKEFIVDILQFL---DIGPDVTRVGLLQYGST 106 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 107 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 161 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS- 424 + I+++TDG + + + KA+ GI I I L S Sbjct: 162 NVPRVIMIVTDG---RPQDSVAEVAAKARDTGILIFAIGVGQVDFNT------LKAIGSE 212 Query: 425 --PNSFFEANSTHELNK 439 + F + ++ Sbjct: 213 PHEDHVFLVANFSQIET 229 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 60/191 (31%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ V ++ + + + K A Sbjct: 673 FEIVKQFVTGIIDSLAISPKAA---RVGLLQYSTLVRTEFTLRNFSSA--KDMKKAVAHM 727 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 728 KYMGKGSMTGLALKHMFERSFTQVEGARPLSARVP---RVAIVFTDGRAQDD---VSEWA 781 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +KA++ GI + + + L AS F A E++ + Sbjct: 782 SKAQASGITMYAVGVG------KAIEEELQEIASEPTEKHLFYAEDFSTMGEISDKLQKG 835 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 836 ICEALEDSDGR 846 >gi|126334034|ref|XP_001370526.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 1247 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 62/190 (32%), Gaps = 24/190 (12%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + +VI K+ +++R+ +F+ + + Sbjct: 223 KSFVRAVINQFKET-----NTLFSLMQYSNRLKIHFTFADFQRSTNWGNLVNPILQLR-- 275 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T A++ + S A K ++++TDGE QD + + +A+ Sbjct: 276 GLTYTATAIRKVVTELFQSRNGAR------KNATKILIVITDGEKYQDKLQYKDVIPEAE 329 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS-------THELNKIFRDR 444 GI I + + A+ L+ AS F+ ++ +L + Sbjct: 330 QAGIIRYAIGVG-DAFEYASAQEELNIIASQPAKEHVFQVDNFSALKTIQEDLQEKIFSI 388 Query: 445 IGNEIFERVI 454 G + + Sbjct: 389 EGTQSKSSIS 398 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 62/226 (27%), Gaps = 42/226 (18%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + + + + + ++RS ++G F Sbjct: 85 TKNGKGIETVIALDISNSMLAQDVIPSRLDKSKLLIEDLLRSFD-------NDKVGLIVF 137 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + T I A+ A + + Sbjct: 138 AGDAFVQLPITSDFISA-KMFLNDINPSLIGTQGTDIGKAINLAMHSFSPT--------- 187 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QEKARYFLS 420 +A K I+++TDGE+ + A+ KA+ G I + L Sbjct: 188 --SKAGKAIIIITDGEDNEGGA--EAMAKKAQEAGFHIYILGIGSTSGAEIPIGNGEKLK 243 Query: 421 N--------------C-----ASPNSFFEANSTHELNKIFRDRIGN 447 + C A + ++ + +I + ++ Sbjct: 244 DKRGNIVVSHLNEDMCKGIADAGKGIYIHVDNNSDAQQILKKQLSK 289 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 54/148 (36%), Gaps = 21/148 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L +I + R G F + + + V L + + Sbjct: 111 TRLENAKTMLKMLISA-------KGEQRTGIVAFTSKAYTQCPITNDVEALKYFVNQLRP 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 N G TA+ A+Q A + + KK ++LLTDGE+ + + A Sbjct: 164 EMLNAKG-TALAPAVQRAAEMLSK------------YPGKKALILLTDGEDHEPEQIEEA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 I A+ +GI+I+ + + + + Sbjct: 211 I-KTAQKEGIKIIAVGIGTEEGEPIPEK 237 >gi|114614242|ref|XP_001160235.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 1 [Pan troglodytes] Length = 1110 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 76/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 ILDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA++ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRIIMIVTDG---RPQDSVAEVAAKARNTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 192 FAIGVG------QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 ILDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA++ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRIIMIVTDG---RPQDSVAEVAAKARNTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 192 FAIGVG------QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVAHMKYMGKGS-MTG 735 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ ++ + E + + ++ TDG D +KAK+ GI + Sbjct: 736 LALKHMFERSFTQVEGARPPS---TQVPRVAIVFTDGRAQDD---VSEWASKAKANGITM 789 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRIGNEIFERVI 454 + + L AS F A E+++ ++ I + + Sbjct: 790 YAVGVG------KAIEEELQEIASEPIDKHLFYAEDFSTMGEISEKLKEGICEALEDSGG 843 Query: 455 R 455 R Sbjct: 844 R 844 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 70/221 (31%), Gaps = 24/221 (10%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 + + + ++ + + VDSS + A + + ++ Sbjct: 31 NHAIDPSSTVSPASGSSCRNGPIDLVFIVDSSRSV----RPTEFEKAKEFLQDLVDSLEV 86 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D+ R+G + V + + A E T A++TA + Sbjct: 87 GLDSTRVGLVNYASTVRMEFPLKAHFS--KPALKGALARVEPLASGTMTGLAIRTAVEKA 144 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ E N+ + + V++TDG + E A A+ GI I + Sbjct: 145 FAA---EAGARLNSTKVARVAVVVTDGRPQDEVERVSA---AARESGIEIYAVGV----- 193 Query: 411 QQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 R L AS + F + +L FR+ + Sbjct: 194 -DRADRTSLRLMASQPHEDHVFYVETYGVIEKLTSRFRETL 233 >gi|148656912|ref|YP_001277117.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569022|gb|ABQ91167.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 543 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 55/176 (31%), Gaps = 21/176 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + L + + I D R+G F+ + R + Sbjct: 384 KLEAAKSGLGTFLSRILPED------RVGLIVFSTDARVVVPPAPLSEA--RIALDDAIA 435 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G TA+ DA+ T + R+ + IVLL+DG + + Sbjct: 436 QLNARGKTALYDALITGKQVFDDLPPPDEERI-------RAIVLLSDGLDNASRTTLDQV 488 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN-STHELNKIFRDR 444 GI I +A+ + L A+ + T ++ +IF + Sbjct: 489 RLAFDETGISIFPVAYGSDADLAA-----LEQIATFSRTIVVQGDTGDIGQIFENL 539 >gi|15965603|ref|NP_385956.1| hypothetical protein SMc00158 [Sinorhizobium meliloti 1021] gi|15074784|emb|CAC46429.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] Length = 577 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 53/176 (30%), Gaps = 14/176 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK------------- 54 ++ +KS TG+ + AL+MP+++ G+ +D + L+ Sbjct: 5 TTAFRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAAD 64 Query: 55 -QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 +A+ A++ + + + + + + Sbjct: 65 VSSAEEAVLAHFRSNDLGFAVSTPSGLMTVDGKLLPNDVETAAKGVATIVRGRYVPDPAI 124 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + A QV L + D+ L L + + +K A +V Sbjct: 125 DAGQRFIRRALPADAVQVTLEKKGDIYLAALFREPPDLSVIGTAASSKIAAFSVGS 180 >gi|296209823|ref|XP_002807089.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Callithrix jacchus] Length = 1094 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 76/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|116751108|ref|YP_847795.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter fumaroxidans MPOB] gi|116700172|gb|ABK19360.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter fumaroxidans MPOB] Length = 680 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 47/180 (26%), Gaps = 28/180 (15%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 ++ R FN GSTA+ D + AY Sbjct: 333 KMSANDRFRIVTFNTTAADFTGGYVPASPENVQTWMQRVKQIQAGGSTALFDGLDLAYRL 392 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + IVL+TDG K +R+ T + Sbjct: 393 LDGERTTG-------------IVLVTDGVCNVGPTRHDEFLGLLKQHDVRLFTFVIGNSA 439 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHEL--------NKIFRDRIGNEI----FERVIRIT 457 Q R S + + ++ K+F + + + E+V R+T Sbjct: 440 NQPLMDRLAKE---SGGFAMNVSESDDIAGRLIQAKAKVFHECLHGVVLKFHGEKVSRLT 496 >gi|297265499|ref|XP_002799207.1| PREDICTED: matrilin-3-like [Macaca mulatta] Length = 445 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 73/233 (31%), Gaps = 29/233 (12%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP---LDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G R ++P + + G L +DSS + V+ ++ Sbjct: 50 GRRPSSVAPDGAPASGASEPGRASGAGVCKSRPLDLVFLIDSSRSV-RPLEFTKVKTFVS 108 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTA 338 +I ++ D R+ + V + + +K T Sbjct: 109 RIIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGQITPLSTGTM 162 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+QTA D + ++ E + K +++TDG + ++ + +A++ GI Sbjct: 163 SGLAIQTAMDEVFTA---EAGARGPSSNIPKVAIIVTDG---RPQDQVNEVAARARASGI 216 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 ELYAVGV------DRADMQSLKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 263 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 295 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 351 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 352 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 410 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 411 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 443 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 304 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 360 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 361 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 419 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 420 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 452 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 70 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 126 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 127 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 185 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 186 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 218 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 831 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 887 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 888 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 946 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 947 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 979 Score = 43.8 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 51/184 (27%), Gaps = 17/184 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 + + + + ++ P+ ++ T ++E G T A Sbjct: 130 QGNGVWSGTEPICRQPYSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAA 189 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR-- 399 + + Y + + E + I+LLTDG++ + + I Sbjct: 190 LNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTAVDHIREILNINQK 248 Query: 400 ------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I + S F T L+++F V Sbjct: 249 RNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEHM------LDV 302 Query: 454 IRIT 457 ++T Sbjct: 303 SKLT 306 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 329 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 314 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 370 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 371 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 429 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 430 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 462 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 329 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 329 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|14248611|gb|AAK57591.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + NK K + + + I Sbjct: 110 PNSKYTALE-VANKLKQRNVSLAVIGIGQGIN 140 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 329 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 329 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 329 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 56/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 233 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 290 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 291 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 344 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 345 FAIGVG------QVDLNTLKAIGSEPHEDHVFLVANFSQIESLTSVFQN 387 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 52/165 (31%), Gaps = 17/165 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + R+G ++ +V ++ + + + K A + + Sbjct: 829 QFLTAIIDSLAVSPKAARVGLLQYSTQVRAEFTLRS--FSTAKDMKKAVAHMKYMGKGSM 886 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + E + ++ TDG D NKAK+ GI Sbjct: 887 TGQALKHMFERSFTQLEGARPLSARVP---RVAIVFTDGRAQDD---VSEWANKAKANGI 940 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 + + + L AS F A +N+I Sbjct: 941 TMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMNEI 979 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 49/150 (32%), Gaps = 21/150 (14%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + ++ +I ++ R+G + R + + + Sbjct: 106 SPSRLDKSKRIVSEIINNLGS-------DRIGIIAYAGRAVPQLPITTDFSAAKMFLNNL 158 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G TAI+DA++ A + + +V+++DGE+ Sbjct: 159 NTNMLSSQG-TAIDDAIRLAKTYYDDVEQ-----------TNRVLVIISDGEDHTGGAG- 205 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + +A +GI+ TI K + Sbjct: 206 -QLAEEATKEGIKTYTIGVGTTKGGPIPIK 234 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 58/190 (30%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + + + ++R + L RT KT Sbjct: 60 SRMAAAKQAFNEVLDATPEEVRLGIRTLGADYPGDNRKEGCKDTAQLYPVGPLDRTEAKT 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ A D + N K IVL++DGE+T + Sbjct: 120 AVATLSPTGWTPIGPALLKAADDLDGGNG------------SKRIVLISDGEDTCAPLDP 167 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ + + A+ ++ +L Sbjct: 168 CEVAREIAAKGIGLTIDTLGLVPDVKLNRQLSCIAE--ATGGTYTSVEHRDQLTDKVNQL 225 Query: 445 IGNEIFERVI 454 + V Sbjct: 226 VDRAADPVVT 235 >gi|159900457|ref|YP_001546704.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893496|gb|ABX06576.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 831 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 52/164 (31%), Gaps = 5/164 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++++ + G + +A+L VM+ + +D + + A+ A I Sbjct: 3 LSRMTRRVAQPVAGQSLVFSAILFFVMIAFAALAIDTGEAFSRQRQQQAASTAASIAGLE 62 Query: 67 PLIQ--SLEEVSSRAKNSFTFPKQKIEEYLIRNFE---NNLKKNFTDREVRDIVRDTAVE 121 + + + + I + N + + +N+ Sbjct: 63 SMNSEIDGTDGAVQQAIRDALAANGITNAVYINGDWGNLDPSQNYYRAFYTQRGSRVEYP 122 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + R ++ ++F +++GI + + + +A Sbjct: 123 VGSGGQVSTEFNGLRVEVRSARNTIFGQALGIDTLEVSAENKAT 166 >gi|198274643|ref|ZP_03207175.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] gi|198272090|gb|EDY96359.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] Length = 339 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 48/183 (26%), Gaps = 36/183 (19%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ F + + + ++T + T I A+ A + Sbjct: 127 NNDKVAMIVFAGEAFTQLPITSDYVSA-KMFLETISPSLITTQGTDIRGAIDLAMKSFTP 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + IVL+TDGEN + I +A +G+R+ + Sbjct: 186 -----------NEGVGRAIVLITDGENHEGGA--IEAAQEAAKKGMRVFVLGVGSPDGSP 232 Query: 413 EK--------------------ARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIF 450 A + + ++T+ K I Sbjct: 233 IPVEGTNEFRRDKDGNVVVTRLNEQMCQEIAKAGNGIYVRVDNTNNAEKALNAEINKLAK 292 Query: 451 ERV 453 V Sbjct: 293 ADV 295 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 61/191 (31%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 131 FEIVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRN--FGSAKDMKKAVASM 185 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 186 KYMGKGSMTGLALKHMFERSFTQIEGARPLSARVP---RVAIVFTDGRAQDD---VSEWA 239 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E++ + Sbjct: 240 SKAKANGITMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISDKLQKG 293 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 294 ICEALEDSDGR 304 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 61/191 (31%), Gaps = 23/191 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ + +I S+ R+G ++ +V ++ + + + K A Sbjct: 415 FEIVKQFVTGIIDSLTISPKAA---RVGLLQYSTQVRTEFTLRN--FGSAKDMKKAVASM 469 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D Sbjct: 470 KYMGKGSMTGLALKHMFERSFTQIEGARPLSARVP---RVAIVFTDGRAQDD---VSEWA 523 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +KAK+ GI + + + L AS F A E++ + Sbjct: 524 SKAKANGITMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISDKLQKG 577 Query: 445 IGNEIFERVIR 455 I + + R Sbjct: 578 ICEALEDSDGR 588 >gi|326674128|ref|XP_002664631.2| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 540 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 53/173 (30%), Gaps = 12/173 (6%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 I NV + + ++ + + T A+ Sbjct: 370 DFIYNVMSNCNFAIVQYGSLIRTELLLLDNEDRAGSLLKVKQIKQIY--NLTKTASAINH 427 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 I + + K I++L+DGE +D + NK + +G+ +I Sbjct: 428 VLTDIF------IPEKGSKNNTAKIIIVLSDGEILEDPMTLDEVLNKPQMKGVTRYSIGV 481 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF----RDRIGNEIFERVI 454 ++ A + A P ++ +S LN I R+ IG E Sbjct: 482 GDGILKKPNAVKEMMQIADPGKYYSVSSYGALNDILSSLEREIIGTEGDSVTT 534 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 53/181 (29%), Gaps = 8/181 (4%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A + + + + + ++ S ++ + Sbjct: 172 DDFQKAKDFIYNVMSNVWKTCFDCNFAIVQYGSLIRTELSLLDNEDRVGSLLKVKQIKQI 231 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T A+ I + + + K I++L+DG+ D + N Sbjct: 232 Y--NLTKTASAINHVLTDIF------IPENGSKDNSAKIIIVLSDGKILGDPMTLDEVLN 283 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 K + +G+ +I + A + A P ++ +S LN I I Sbjct: 284 KPQMKGVTRYSIGVGDGILKNLDATEEMMQIADPGKYYNVSSYRALNDIVSSLERGIIGT 343 Query: 452 R 452 + Sbjct: 344 K 344 >gi|221117277|ref|XP_002154725.1| PREDICTED: similar to HyTSR1 protein, partial [Hydra magnipapillata] Length = 3382 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/126 (20%), Positives = 43/126 (34%), Gaps = 13/126 (10%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G + A TA +TI ++ + K+ +L TDG + Sbjct: 3075 LESRGFSYTWSAAITARETIFTTQKGMR------SNVKRVAILFTDGVYN-GKHDTQKEW 3127 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNKIFRDRIGNEI 449 K QGI+++ I + AS +F F A + E + I NE Sbjct: 3128 QYVKDQGIQVVAIGIGSPINISNLELW-----ASSKNFVFNATTYEEADAFISSVILNEP 3182 Query: 450 FERVIR 455 V++ Sbjct: 3183 SSTVVK 3188 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 42/125 (33%), Gaps = 14/125 (11%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G + A TA +TI ++ + K+ +L TDG + Sbjct: 2816 LESRGFSYTWSAAITARETIFTTQKGMR------SNVKRVAILFTDGVYN-GKHDTQKEW 2868 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRIGNEI 449 K QGI+++ + + AS + F A + E F IG+ + Sbjct: 2869 QYVKDQGIQVIAVGIGSPINTDNLELW-----ASNKNSVFSATTYQE-ADAFISTIGDNV 2922 Query: 450 FERVI 454 + + Sbjct: 2923 CKSLT 2927 >gi|307944861|ref|ZP_07660199.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307772075|gb|EFO31298.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 862 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 46/153 (30%), Gaps = 29/153 (18%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + T G T ++ +++ A + E +VL +DG Sbjct: 105 SGSGADISTRINSIRPKGKTPLSQSVRLAAQELQYREEAAT------------VVLFSDG 152 Query: 378 ENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 T N + + + + GI I F + R L A + ++F+A+ Sbjct: 153 IETC-NADPCLLAEELERDGIDFTAHVIGFGIGSDA---DRKKLQCIAENTGGTYFDADD 208 Query: 434 THELNKIFRDRIGNE---------IFERVIRIT 457 L E + +R+T Sbjct: 209 AGSLKDALGQVTNAEPVSEQADTSLALVPLRLT 241 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 58/169 (34%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 79 ILDILQFLDIGPDITRVGLLQYGSTVKNEFSLK--TFKRKSDVERAVKRMRHLSTGTMTG 136 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + +KA++ GI I Sbjct: 137 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVASKARNTGILI 190 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 191 FAIGVG------QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 17/156 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 636 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGS-MTG 693 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ Y+ + E + ++ TDG D KAK+ GI + Sbjct: 694 LALKHMYERSFTQVEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWARKAKANGITM 747 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 + + L AS F A Sbjct: 748 YAVGIG------KAIEEELQEIASEPIDKHLFYAED 777 >gi|194220937|ref|XP_001501805.2| PREDICTED: matrilin 3 [Equus caballus] Length = 450 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 61/181 (33%), Gaps = 23/181 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++ +I ++ D R+ + V + H +++ + A Sbjct: 63 FTKVKTFVSQIIDTL---DIGPADTRVAVVNYASTVKIEFHL--NTHSDKQSLKQAVARI 117 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+QTA D + N K +++TDG + ++ + Sbjct: 118 TPLSTGTMSGLAIQTAMDEAFTVEAGARVPSYNIP---KVAIIVTDG---RPQDQVNEVA 171 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +A++ GI + + + + L AS F + +L+ F++ Sbjct: 172 ARARASGIELYAVGV----DRADMES--LKVIASEPLDEHVFYVETYGVIEKLSSRFQET 225 Query: 445 I 445 Sbjct: 226 F 226 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 58/169 (34%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 79 ILDILQFLDIGPDITRVGLLQYGSTVKNEFSLK--TFKRKSDVERAVKRMRHLSTGTMTG 136 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + +KA++ GI I Sbjct: 137 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVASKARNTGILI 190 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS---THELNKIFRD 443 I + L S + F + L +F++ Sbjct: 191 FAIGVG------QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 17/156 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V I + R+G ++ +V ++ + G + GS Sbjct: 636 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGS-MTG 693 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ Y+ + E + ++ TDG D KAK+ GI + Sbjct: 694 LALKHMYERSFTQVEGARPLSTRVP---RAAIVFTDGRAQDD---VSEWARKAKANGITM 747 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 + + L AS F A Sbjct: 748 YAVGIG------KAIEEELQEIASEPIDKHLFYAED 777 >gi|326681146|ref|XP_002665520.2| PREDICTED: collagen alpha-1(XXII) chain [Danio rerio] Length = 1623 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + +R +A+++ S D D R+ ++DR + + Sbjct: 37 YDLVFILDTSSSVG-KENFEKIRQWVANLVESF---DVGVDKTRVAVVRYSDRPTVEFNL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + +K + + + G+T DA+ I + K +K + Sbjct: 93 A--RYKTLEEVKRAAGNIRYLGGNTKTGDAISFTTTNIFTERAGARPAAKG---IQKVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFF 429 LLTDG+ +QD ++ A GIR+ + E + L A F Sbjct: 148 LLTDGQ-SQDFVLEPSVAAAA--AGIRLFAVGIG------EALKEELEEIAAEPKSAHVF 198 Query: 430 EANSTHELNKI 440 ++KI Sbjct: 199 HVTDFDAIDKI 209 >gi|330816722|ref|YP_004360427.1| Membrane protein [Burkholderia gladioli BSR3] gi|327369115|gb|AEA60471.1| Membrane protein [Burkholderia gladioli BSR3] Length = 622 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 45/163 (27%), Gaps = 8/163 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G F ++ A+ + V+ + G+ +DV L+Q A A + + Sbjct: 21 RRRERGSFAVMAAVFLVVIAAIFGV-LDVGNTYLQRRDLQQIADMAAAAGVQRVDNLCVQ 79 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + A NS T + + D V N ++ L+ Sbjct: 80 APTSATNSATVN----GLNTSQGDTIAVTCGRWDPTVNPAPSYYLANTNTSGDPNRLQLN 135 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + +N + + A + Sbjct: 136 A---VQVNVTRQVRHMFVGPLQTVHATSTARATAIDVFSVGAT 175 >gi|14248609|gb|AAK57590.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + NK K + + + I Sbjct: 110 PNSKYTALE-VANKLKQRNVSLAVIGIGQGIN 140 >gi|197102272|ref|NP_001124862.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Pongo abelii] gi|55726163|emb|CAH89855.1| hypothetical protein [Pongo abelii] Length = 1079 Score = 52.3 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 76/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|329744564|ref|NP_001192916.1| voltage-dependent calcium channel subunit alpha-2/delta-1 [Bos taurus] Length = 1091 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 76/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|114614244|ref|XP_001160279.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Pan troglodytes] gi|114614246|ref|XP_519175.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 3 [Pan troglodytes] Length = 1091 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 76/264 (28%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + YY P Sbjct: 169 PTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|14248703|gb|AAK57637.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G+T + A + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKTYTPYGTTNMTAAQDEVQKHLNDRV--------NREKAIQLVILMTDGV 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + NK K + + + I Sbjct: 110 PNSK-YRALEVANKLKQRNVSLAVIGVGQGIN 140 >gi|291229809|ref|XP_002734863.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 2065 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 56/151 (37%), Gaps = 23/151 (15%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI---VKTFAIDENEMGSTA 338 I +I + D R+ ++D + + + + ++ E G T Sbjct: 82 ISTIFSMSP--DEARVSVVTYSDSSKIVRQIDYIGSSVGKNKCTFLGELSLIRYEAGWTD 139 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ A + + + +VLLTDG++T+ + + I + +++GI Sbjct: 140 TKGALEEADRVLQHARSGAN----------RLVVLLTDGQSTEGD--PVGIATRIRNKGI 187 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 RI+ I L++ A+ F Sbjct: 188 RIVAIGVG------NVNMDELTSIATAQYVF 212 >gi|302346570|ref|YP_003814868.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302151211|gb|ADK97472.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 331 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 37/119 (31%), Gaps = 14/119 (11%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + + T I A+Q + ++ Sbjct: 134 LIVFAGDAFVQLPITSDYVSA-KMFLDNINPSLIGTQGTDIGKALQLSMNSFTP------ 186 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 N + K I+L+TDGE+ + + +A+S+GIR+ + + Sbjct: 187 -----NSKVGKAIILITDGEDNEGGA--EEMAKQAQSKGIRVFILGVGSTEGATIPMPD 238 >gi|260782980|ref|XP_002586557.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] gi|229271674|gb|EEN42568.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] Length = 534 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 59/190 (31%), Gaps = 27/190 (14%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D S + +++ A + ++D T R+G ++ V + Sbjct: 149 VFVIDGSGSISSVSFGSVMKFAADMSL----RLDISATTTRVGMVQYSTNVTPEFMLKEH 204 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 +++ K + G T A++ + + K+ +++T Sbjct: 205 T--TKKSVEKAIGDVKRLGGGTNTGKALKFVRTEM----------DWRDPPTKRVAIVVT 252 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANST 434 DG++ D + G+ + + + + L + NS Sbjct: 253 DGKSQDDVGTP---ATALRQAGVVLYAVGVGLPTDE-------LKEITGDPTKVYALNSY 302 Query: 435 HELNKIFRDR 444 EL I +D Sbjct: 303 DELQDIIQDI 312 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 62/198 (31%), Gaps = 30/198 (15%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + K + +D + +I + ++ V + I + Sbjct: 41 DGSNSMWGQVDGEAKITIAKDVMTDLISNWDDAVDLGLMVYGHRRKGDCSDIEVVAL--- 97 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 K+ R + + G T I+ + A ++ K +VL++ Sbjct: 98 PGKVNRPALIDKVQSISPRGKTPISKTLLLAATSV------------GYFSGKSSVVLVS 145 Query: 376 DGENTQDNEEGIAICNKAKSQGIR-----IMTIAFSVNKTQQEKARYFLSNCAS--PNSF 428 DG T D C +AK+ GI + I F E+ L A+ F Sbjct: 146 DGLETCD----ADPCAQAKALGIINPGFDVHVIGF----DVTEEEFKSLQCIATETGGKF 197 Query: 429 FEANSTHELNKIFRDRIG 446 F AN+ EL R + Sbjct: 198 FRANNAEELKDALRRTVA 215 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 86/271 (31%), Gaps = 18/271 (6%) Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 ++ + S + D N A Y + + + Y S Sbjct: 614 YDGVEKTASFITNTSDPLNNTGNPGTAVAYFDSVIYGKSKVEARLVNPIDPRYATSLKGL 673 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + ++ ++ + A+ + + + + + Sbjct: 674 KDKTVTKDIFTNPYFSKNSCSLATEIAYVVDYSSSMKAVD---PTNYRGKKMIELINQLK 730 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAI-DENEMGSTAINDAMQTAYDTIISSNEDEV 358 N + ++ + ++K E G+T I + A Sbjct: 731 AKNNIVIETNTKATVLGEGTTENVLKKDLYKASKEKGATDIFAGIDIALTKFS------- 783 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 N+ + K IV+++DG+ + + N+AK QG+++ T++ + Sbjct: 784 ----NDSKTSKAIVVVSDGK--TSKSKMTKVINEAKKQGVKVYTVSMGKKSQVNDATLMQ 837 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 LS+ + ++F A +L+++F+ I + Sbjct: 838 LSS-ETSGAYFHAIDNMQLHQVFQKLIDTIL 867 >gi|85705210|ref|ZP_01036309.1| hypothetical protein ROS217_17117 [Roseovarius sp. 217] gi|85670083|gb|EAQ24945.1| hypothetical protein ROS217_17117 [Roseovarius sp. 217] Length = 179 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 22/59 (37%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 F + ++ + G + L+ P LG+ +++ + L++ A+ Sbjct: 1 MTRAFKNFLRRFRRDERGTASLEFVLVAPFFLGIMIFSIELGFVTMRATFLERGLDMAV 59 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 17/98 (17%), Positives = 30/98 (30%), Gaps = 6/98 (6%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + A G T + +AM A D + ++ + +IVL+TD Sbjct: 75 AGFSTLEHQPDAPQFYADGGTPMGEAMNMALDML--EKRKSEYKASGVDYYQPWIVLMTD 132 Query: 377 GENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKT 410 G E C+ + + I I + Sbjct: 133 GMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDAD 170 >gi|73980586|ref|XP_540098.2| PREDICTED: similar to matrilin 3 precursor [Canis familiaris] Length = 481 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 61/181 (33%), Gaps = 23/181 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++ +I ++ D R+ + V + + +++ + A Sbjct: 98 FTKVKTFVSQIIDTL---DIGAADTRVAVVNYASTVKIEFHLQ--TYSDKQSLKQAVARI 152 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+QTA D + E K +++TDG + ++ + Sbjct: 153 TPLSTGTMSGLAIQTAMDEAFTE---EAGARGPTSNIPKVAIIVTDG---RPQDQVNEVA 206 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +A++ GI + + + + L AS F + +L+ F++ Sbjct: 207 ARARASGIELYAVGV----DRADMES--LKIIASEPLDEHVFYVETYGVIEKLSSRFQET 260 Query: 445 I 445 Sbjct: 261 F 261 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 60/217 (27%), Gaps = 53/217 (24%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + + VR+G F + L I K Sbjct: 110 NRLVASQNAAKAFLADL------PRNVRVGVVAFAGTAAVVQPPTVSREDLTAAIDKFQL 163 Query: 329 IDENEMGSTAINDAMQTAYDTIISS-----------------------------NEDEVH 359 TAI + + + + + Sbjct: 164 QR-----GTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLSLDQAGKDDGNGKKAFT 218 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ------- 412 + I+LLTDG+ T + + A +GIR+ T+ + + Sbjct: 219 PVAPGSYTSAAIILLTDGQRTTG-IDSLDAAKVAADRGIRVYTVGVGTVEGETIGFEGWS 277 Query: 413 ---EKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 + L A+ +F A + +L K+++ Sbjct: 278 MRVKLDEETLKGIARATQAEYFYAGTATDLKKVYQTL 314 >gi|323320820|gb|ADX36428.1| complement factor B [Apostichopus japonicus] Length = 913 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/232 (18%), Positives = 75/232 (32%), Gaps = 23/232 (9%) Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + + +S + N P D L D SS ++ + + Sbjct: 427 SALFEAAVSNETTNTNVDQTARFIPLSHDTGLDLYFMFDGSSSVG-EDNFNMAKKFAKEL 485 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM--GSTAI 339 ++ I D +++R+GA FN + T AID E G T I Sbjct: 486 VKEIGVTD-RPNSLRVGALVFNSEAEI----GFHTVAFDSTADVLDAIDSMEYKEGGTNI 540 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A + + +I N ++ K + L+TDG+ T+ + + Q + Sbjct: 541 AKAFEVLSNVMIPQT-----AKLNREKSFKTVFLITDGDATEGGDAQED-ARAVRDQDVT 594 Query: 400 IMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNE 448 I I + R LS AS F L + F + N+ Sbjct: 595 IHCIGI-----SENATRRTLSGMASEPLSEHLFFLKDYSTLEE-FIQIVTNQ 640 >gi|261880540|ref|ZP_06006967.1| BatB protein [Prevotella bergensis DSM 17361] gi|270332763|gb|EFA43549.1| BatB protein [Prevotella bergensis DSM 17361] Length = 342 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/211 (9%), Positives = 55/211 (26%), Gaps = 42/211 (19%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + + S+ ++G F + Sbjct: 101 SNSMKAEDVAPSRLDKSKMLVESMTEHFT-------NDKLGLVVFAGDAFVQLPITSDYV 153 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + ++ T I A+ + + + I+++TDG Sbjct: 154 SA-KMFLQNIDPSLVSTQGTDIARAITVGMRSFTQQ-----------EKVGRAIIVITDG 201 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFLSN----- 421 E+ + + +A+ +GI + + K + +S Sbjct: 202 EDHEGGA--MEAAKEARKRGINVFILGVGSTKGAPIPTGNGGYMTDATGQTVMSALNEEM 259 Query: 422 C-----ASPNSFFEANSTHELNKIFRDRIGN 447 C A ++ ++T + + D + Sbjct: 260 CKEVAKAGSGTYIHVDNTSDAQEQLNDELTK 290 >gi|156350148|ref|XP_001622163.1| hypothetical protein NEMVEDRAFT_v1g221080 [Nematostella vectensis] gi|156208611|gb|EDO30063.1| predicted protein [Nematostella vectensis] Length = 2040 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 55/164 (33%), Gaps = 24/164 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + G + + ++ F + G+T + ++ ++ ++ Sbjct: 242 IHTGVVSYGTKAKTNLGFDVHFSQANFNSAIDGIN--FPGGATDTGNGIRKVMKSLFKTS 299 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +VL+T G++T D + ++ G+R+ + Sbjct: 300 KRRSIPH--------ILVLITAGKSTGDPSLN---AEELRASGVRLFCVGVGG-----AY 343 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 R L AS ++T+ L F D G + V RITK Sbjct: 344 DRTQLDAIASSP-----STTYVLTAGFDDLSG-LVPTLVSRITK 381 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 16/184 (8%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + AL+ + + +G F F + I Sbjct: 425 ENFKQALSFFKTIYQAFWTPFGSTHVGLVVFGADSTMVFDFDNKLKD-KPAIDAAIDASV 483 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T + A+Q D + H ++ +V+L + ++ + Sbjct: 484 FPSGQTNLGAALQQTKDLSLGYLFGSKHNDG----HRRTLVVL---AGSTAEDDVFQGAS 536 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNE 448 + G+ + + L AS ++ S ++L I + I Sbjct: 537 DLRMNGVTVYCVG-----AGPNYDSAQLDGIASTPADSNVLTVASYNDLPGISQTLINKI 591 Query: 449 IFER 452 R Sbjct: 592 EEGR 595 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 37/124 (29%), Gaps = 12/124 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 +D R+G FN K + G + +A+ A + Sbjct: 1684 SDKTRVGVVVFNS-PSVQFPLDNYSSKEEVYSAIDTITQSSNSGPGTVGEAIAFANTNLF 1742 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 K + K ++++T ++ D AK + I ++ + ++ Sbjct: 1743 KG--------KTRSQTPKILIVITSKKSGDDVTSPS---QDAKRENITMLVVGVGPGASK 1791 Query: 412 QEKA 415 + Sbjct: 1792 SDMD 1795 >gi|254453558|ref|ZP_05066995.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] gi|198267964|gb|EDY92234.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] Length = 676 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 41/394 (10%), Positives = 103/394 (26%), Gaps = 34/394 (8%) Query: 58 QTAII--TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A++ SV + +L + ++ N + +E + + D + + Sbjct: 114 DAAVLDPAGSVSVAPALVDETTDDANVAALAEPVMEMAPAVVLSESSSREEADSAIGNPA 173 Query: 116 RDTAVEMNPRKSAYQVVLSS-----RYDLLLNPLSLFLRSM-----GIKSWLIQTKAEAE 165 + + +V ++ ++ E Sbjct: 174 TAAPLSLTRAAPTNDLVAEGMALGQVVSAPEPNVAGIGELDTESFANDTPNPLKITTEEP 233 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + S + + L + + + A + + Sbjct: 234 VSTLSIDVDTAAYALIRSSLTAGQLPPADAVRIEEMINYFPYAYPAPDG-QPFQPTINVF 292 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + L + P+L+ +D+S K L+R + ++ ++ Sbjct: 293 ETPWNADTQLVHIGLQGEMPSIQDRPALNLVFLIDTSGSMESADKLPLLRQSFRLMLDNL 352 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 D V + + RT + N GST ++ Sbjct: 353 APEDEVA------IVTYAGSTSIALQPTQ---ASERTTILAALNALNAGGSTNGQGGLEQ 403 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 AY MK + + + ++L TDG+ + + + G + Sbjct: 404 AYAL--------AETMKTDGDVSR-VILATDGDFNVGLSDPRGLQAYIEDKRDTGTYLSV 454 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + F Q + N + + S + Sbjct: 455 LGFGRGNLQDATMQSLAQNGNGTAVYIDTLSEAQ 488 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/260 (11%), Positives = 68/260 (26%), Gaps = 4/260 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 L K G ++ A+L+ + LG+ + +DV + L+ AA + + L Sbjct: 7 SLKKDQNGAVIVLVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLES 66 Query: 73 EEVSSRAKN--SFTFPKQKIEEYLIRNFENNLKKNFTDREVRD-IVRDTAVEMNPRKSAY 129 + N + + I E + N N D + N +A Sbjct: 67 GASVNPNANVIANNTAIENISEDVPVEVNYNAAANTGDVQRGHWSFAARQFTPNGSLTAI 126 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 V + DL ++ + + + + S + + + Sbjct: 127 DVGNYTTEDLDNPDPNINGGLINAVKV-VVRRQDRPASSFFAQIFGFENFGITAEAIAYI 185 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + + + + N +G S S P Sbjct: 186 GFSGTINPAELDQPIAICEESILDDNGNYNCNMGRMLNSGSNLNTSNTGGWTNFSQPCDT 245 Query: 250 DPSLSEEHFVDSSSLRHVIK 269 + + + + Sbjct: 246 ADASEMKALICGDGNPDEVT 265 >gi|327541351|gb|EGF27892.1| hypothetical protein RBWH47_01135 [Rhodopirellula baltica WH47] Length = 384 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/258 (10%), Positives = 65/258 (25%), Gaps = 4/258 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ---SLE 73 S G I +L + V ++ + + L ++A +A + + L + + Sbjct: 18 SRRGAAGIFGLILCVALFTVLAVVSEFAYINTARTELSRSADSAALAGAWALYDARVNGD 77 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 S + + D +N SA V + Sbjct: 78 HDGVSQVKSSADLISSANHIGQIQAHVGSVDHAIEIGSYDHATQVFEPINDWASANAVRV 137 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + L L+ G+ Q+ ++ T + ++ + Sbjct: 138 NLHLTNSA-HGELPLQLAGLTGRKSQSLHQSVTAAFQHNISGFKRPPNNGTKIDILPIAL 196 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + Q ++ D + S G G + L P + + + + Sbjct: 197 DEETWQLVSDKQTSDDLMYFNGSIVEGSDGFHECSLYPTGTGAPGNRGTVDIGSSNNSTS 256 Query: 254 SEEHFVDSSSLRHVIKKK 271 + + Sbjct: 257 DLRRQILYGISQDDFDAL 274 >gi|301626998|ref|XP_002942667.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 4207 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + + + E G T Sbjct: 626 LSDFPVVPSATRVAIITFSSKTNVQTRVDYISSSEPHQHKCSLLNREIPAITYKGGGTFT 685 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + + + K I L+TDG + + I + G+ Sbjct: 686 KGAFQQAAQILR----------YSRSNSTKVIFLITDGYSNGGDPRP--IAANLRDLGVE 733 Query: 400 IMTIAF 405 I T+ Sbjct: 734 IFTVGI 739 >gi|119512059|ref|ZP_01631153.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] gi|119463285|gb|EAW44228.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] Length = 435 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 57/200 (28%), Gaps = 33/200 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDT--VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 V A+ ++ ++ D+ T R+ F + +I Sbjct: 59 IATVIQAVEQLLDRLQPSDSETPTSGDRISVVAFAGEAQVIIP--NQTLQDTASIKAQIH 116 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 G TAI + +Q ++ + V + LLTDG Sbjct: 117 KKLKASGGTAIAEGLQLGITELMKGTKGAVSQA----------FLLTDGHGESSLKIWKF 166 Query: 382 -----DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANST 434 DN+ + +KA + I T+ F + + L A + Sbjct: 167 EIGKDDNKRCLEFAHKATKINLTINTLGFG-----NDWNQDLLEKIADAGGGTLAYIERP 221 Query: 435 HELNKIFRDRIGNEIFERVI 454 + + FR + R+ Sbjct: 222 EQALEQFRHLLQRIQSVRLT 241 >gi|223694808|gb|ACN18090.1| von Willebrand factor type A [uncultured bacterium BLR5] Length = 347 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 41/121 (33%), Gaps = 7/121 (5%) Query: 328 AIDENEMGST--AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + T + +D + S + + R ++ I+LLTDG++T Sbjct: 173 ISGISSGPGTIANPREGSTAIWDAVTVSAGEILTRSPGRR--RRAIILLTDGQDTSSRVT 230 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +KA I I + + + L+N A + F +L +F + Sbjct: 231 RGTAIDKALEAETVIYAIGIG-DSKYEGIDKGALNNVAERTGGRAFFPKRGADLTSVFTE 289 Query: 444 R 444 Sbjct: 290 I 290 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 40/122 (32%), Gaps = 16/122 (13%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G T I ++ AY+ + +K+I+LLTDG+ Sbjct: 463 NNKEEAVDTILSVTPGGGTEIYGSLAKAYENLAD-----------IKLQRKHIILLTDGQ 511 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + N + + K GI + T+A + LS F+ + Sbjct: 512 SQPGNY--EDLIEQGKDNGITLSTVAIGQDADANLLE--ALSEM-GSGRFYNVIDEQTIP 566 Query: 439 KI 440 I Sbjct: 567 SI 568 >gi|14248589|gb|AAK57580.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|304441881|gb|ADM34176.1| martilin [Aplysia californica] Length = 348 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 50/125 (40%), Gaps = 16/125 (12%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T+ + ++ ++S A++ +++TDG N+Q++ + +A Sbjct: 59 GRYTSTGEGIEYMATAQLASQ-------FTRSWAERVGLVITDG-NSQESAKTKEAARQA 110 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFER 452 + GI + I K Q+ L N A F+ +S EL + + ++ R Sbjct: 111 RESGITMFAIGVGNVKDQE------LVNIAGDASRVFKVDSYDELEN-IKQTLAHQTCIR 163 Query: 453 VIRIT 457 ++ T Sbjct: 164 QLKTT 168 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 51/417 (12%), Positives = 104/417 (24%), Gaps = 65/417 (15%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E A + Y + + + +V + T + V Sbjct: 460 ETENVPEADKDKNYTFNAYVNYEGTDSDKDYTLYTGKYDVYNTATGTKDGETRTTDSNGV 519 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + S R + + + V + D + + Sbjct: 520 ITLKAGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSANGGQVKVTQEKDSATTEPV 579 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + A R K + N V+ +S P D Sbjct: 580 SVGEVPRTTVTNTVVTAPRYRKYIKANNDGT-----YDLSLNVTGTQSGSSQTTVSPADI 634 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN--VNDTVRMGATFFNDRVISD 309 + + S+ + + + A+ S+ + + +N + +RM F+ + Sbjct: 635 VVVFDTSGSMSNPMGHNSRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFSTTAGNV 694 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +F+ + + G T + KK Sbjct: 695 SNFTDNAMD-----IVSAVNGLGADGGTNWEA----------ALKAANAKLTSGRKGVKK 739 Query: 370 YIVLLTDGENT--------------------------------------QDNEEGIAICN 391 YIV ++DG+ T A+ Sbjct: 740 YIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAE 799 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + + ++ S + T+ + S++ A ST ELNK F D IG Sbjct: 800 ANRRGDATLFSVGVSSDPTKMRGFADQ-----TKGSYYSATSTDELNKAFADIIGQI 851 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 53/165 (32%), Gaps = 19/165 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGST 337 ++ ++ + + R+G ++ +V S R ++ D T Sbjct: 55 QFLVGLLRGLKVGANATRVGVIQYSSQVQSVFPL---RAFSRREDMERAIRDLVPLAQGT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 A+Q + S E + + V++TDG + + + +A+++G Sbjct: 112 MTGLAIQYVMNVAFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARG 165 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 I I + L ASP F S + + Sbjct: 166 IEIYAVGVQRADVGS------LRAMASPPLDEHVFLVESFDLIQE 204 >gi|172062967|ref|YP_001810618.1| hypothetical protein BamMC406_3936 [Burkholderia ambifaria MC40-6] gi|171995484|gb|ACB66402.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 346 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/282 (9%), Positives = 76/282 (26%), Gaps = 6/282 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEE 74 S G + + + +L G + +D+ + + L+ AA A + A+ ++ Sbjct: 14 SERGSVALFFLMFLIPLLSFGALAIDIAWVATVRNQLQNAADAAALAAADAMMSPTGGPL 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S+A + + + + + V ++ Sbjct: 74 NWSQAAPAANGVIAQNSAAGAALATGTVTTGYWNVARNPATMQPTTITPGAYDVPAVQVT 133 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-LDYQ 193 +N S+ L G+ + + + + I + + Y Sbjct: 134 VSRAPGINGGSIPLLLGGLLGVPGTSGSATAIAVLAAPGGVAAGGLFPIAIDQCVYNQYW 193 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYPGPLD 250 + QPL + Q + + + + + M+ P Sbjct: 194 NAATNQPLINPLTGQPYEFSITNGQTYGALCMGGQWTSFQSTATDTTTMGGLMVTGNPTT 253 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ + ++ + + + + V+ V Sbjct: 254 LNIGDSIYLATGVKATLYTSVPVGATVVLPVVTQTSSSTYVP 295 >gi|148234427|ref|NP_001080470.1| matrilin 2 [Xenopus laevis] gi|28175657|gb|AAH45220.1| Matn2-prov protein [Xenopus laevis] Length = 589 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I +K +D DT R+G + V ++ S ++K I + + T Sbjct: 75 LITMLKFLDIGPDTTRVGLLQYGSTVKNEFSLK--MYKRKSDIERAVKRMMHLATGTMTG 132 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E N + +++TDG I KA+ GI I Sbjct: 133 LAIQYAMNIAFSEAEGARP---LNQYVPRIAMIVTDGRPQDP---VEEISAKARMSGILI 186 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHE---LNKIFRD 443 I L S F + + L +F++ Sbjct: 187 FAIGVGR------VDMSTLKTIGSEPHSEHVFLVANFSQIETLTSVFQN 229 >gi|153008592|ref|YP_001369807.1| hypothetical protein Oant_1261 [Ochrobactrum anthropi ATCC 49188] gi|151560480|gb|ABS13978.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 576 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 9/48 (18%), Positives = 18/48 (37%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ 55 + +++ G+ + AL+ P+ L V VD L+ Sbjct: 2 RGLVSRFLRARGGNLATMAALVSPIFLAVAAFSVDTSSLFLERRQLQS 49 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 44/152 (28%), Gaps = 15/152 (9%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G F R + S L + + TA+ D + A + Sbjct: 125 NHPGDRIGLVAFGSRAYAVMPPSADRAALTGALARLAVGA--AGKRTAMGDGLGLAVKRL 182 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + V+ DG + A +G+ + ++ ++ Sbjct: 183 SDAPGLS-----------RLAVVFGDGRSNAGEVSPEDAAKAASERGVTVYSVGVGGDEP 231 Query: 411 QQEKARYFL--SNCASPNSFFEANSTHELNKI 440 + L S + + +A + + K Sbjct: 232 APFLVTHPLLGSQIVTEKAAVDATTLAAMAKA 263 >gi|149919193|ref|ZP_01907676.1| hypothetical protein PPSIR1_02211 [Plesiocystis pacifica SIR-1] gi|149819907|gb|EDM79329.1| hypothetical protein PPSIR1_02211 [Plesiocystis pacifica SIR-1] Length = 718 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/416 (9%), Positives = 90/416 (21%), Gaps = 48/416 (11%) Query: 56 AAQTAI--ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A A + A+ L+ L R+ + L Sbjct: 163 ALDRAQAEVAATRALVPDLPGEQVRSYVAV---------VLTDGLWTGADGTTALAPADQ 213 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 TA + V++ D + + L G Y Sbjct: 214 DPAITATALLDEADVPTYVITIDGDAQVEEAADALAFAGGTGAAFHGGTPGLLDGAIYGL 273 Query: 174 EHGV--------------SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + + ++D S S L+ + + R + Sbjct: 274 NQELLHDHALPGCSRQHPRVMVLLDASSSTLNDFGGTVPGTMGETAWDEIRAGLTAPDSL 333 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 V V + P S + ++ + Sbjct: 334 FDVDEGLAPQDMLHVGLTIFGH-DDAPEEQRVLASYGPCM----RDNLAWALDPEVSCVE 388 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 D + + D T Sbjct: 389 PGCD-----DPWGGPTIAWTYQDGS----AGPPGFDLQTNSHMPRCEGPADFCWGSGTFT 439 Query: 340 NDAMQTAY---DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAK 394 + ++T ++++ + + +L+TDG + + + + Sbjct: 440 HRGLETVRLHQIDYAAASQTDDALYPTAPDTPYINILITDGMYSGYSSDAQVQSELEAMF 499 Query: 395 SQGIRIMTIAFSVNKTQQEKARYF--LSNCASPNS--FFEANSTHELNKIFRDRIG 446 + GI I + + +++ S + + A + EL + +G Sbjct: 500 NAGITTYVIGYGDLFGFPDAVAQVEQMADWGSGGANDAYLAQNQVELEDALYETLG 555 >gi|297288988|ref|XP_002803438.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Macaca mulatta] Length = 997 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 77/264 (29%), Gaps = 32/264 (12%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG-PLDP 251 D + + + + + + + + YY P Sbjct: 169 PTDIYEGCTIVLNELNWTSALNEVFKKNREENPSLLWQVFGSATGLARYYPASPWVDNSR 228 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASV---------------IRSIKKIDNVNDTVR 296 + ++ D I+ +D L V + ++ ++D Sbjct: 229 TPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDF 288 Query: 297 MGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + FN F V + ++K + G T A++ +++ N Sbjct: 289 VNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYN 348 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNKTQQE 413 + K I+L TDG E I K K + +R+ T FSV + + Sbjct: 349 V-------SRANCNKIIMLFTDG----GEERAQEIFTKYNKDKKVRVFT--FSVGQHNYD 395 Query: 414 KARYFLSNCASPNSFFEANSTHEL 437 + C + ++E S + Sbjct: 396 RGPIQWMACENKGYYYEIPSIGAI 419 >gi|260823774|ref|XP_002606843.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] gi|229292188|gb|EEN62853.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] Length = 1317 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 33/116 (28%), Gaps = 26/116 (22%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T AM A + + + A K + L+TDG + + Sbjct: 154 YAGGGTYTKGAMIKAQEVLRHARP----------NATKAVFLMTDGYSNGGDPLPE--AR 201 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 K K ++I T + L N A+ E + F D Sbjct: 202 KLKQNDVQIFTFGI------RSGNVKELQNMAT--------DPAEEHSYFLDSFAE 243 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 48/162 (29%), Gaps = 21/162 (12%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +D+SS ++ + ++ + R+ F+ ++ Sbjct: 138 VFLIDASSSVGKANFYSEIK-FVKKLLSDFNVSY---NYTRVAVITFSSQMKIFRHIDQI 193 Query: 316 VHKLIRTIVKTFAIDENE-----MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + G T A++ A + ++ ++KK Sbjct: 194 STSVEDNDKCLLLNYQIPKIEFSGGGTYTYGALKEAEEIFQNA----------RADSKKI 243 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 I L+TDG + + + K + + I +I + Sbjct: 244 IFLITDGFSNGRDPIP--LAESLKRKNVVIYSIGIQSGNYAE 283 >gi|149046637|gb|EDL99462.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Rattus norvegicus] Length = 939 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|74214868|dbj|BAE33446.1| unnamed protein product [Mus musculus] Length = 1169 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +D S + D + +V+ +++ R Sbjct: 138 NFPTAQQECPKQDQDIVFLIDGSGSISS-TDFEKMLDFVKAVMSQLQRPST-----RFSL 191 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D +F+ + + G T A++ + ++ Sbjct: 192 MQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLR--GYTYTASAIKHVITELFTTQSGARQ 249 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +A K ++++TDG DN ++ A++ I I E ++ L Sbjct: 250 ------DATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIRYAIGVGK-AFYNEHSKQEL 302 Query: 420 SNCA---SPNSFFEANSTHELNKI 440 A S F + L I Sbjct: 303 KAIASMPSHEYVFSVENFDTLKDI 326 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 55/187 (29%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ A +I +++ D R+ F+ R + Sbjct: 59 LETVKSAALGLIDRLEEDD------RLSVIAFDHRAKIVIE---NQQVRNGAAIAKAIER 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ K + +I LLTDGEN DN+ + + Sbjct: 110 LKAEGGTAIDEGLKLGIQE----------AAKGKEDRVSHIFLLTDGENEHGDNDRCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGN 447 A + + T+ F + L A+ S + E FR Sbjct: 160 GTVASDYKLTVHTLGFG-----DHWNQDVLEAIAASAQGSLSYIENPSEALHTFRQLFQR 214 Query: 448 EIFERVI 454 + Sbjct: 215 MSNVGLT 221 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 74/263 (28%), Gaps = 23/263 (8%) Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + + T ++ N + + +L P + Sbjct: 1 MHSMRWDISQKEVTHMGVTNSNKVINTDRIPCD---GTLRVTLALTAAPDIVSNPTDIVL 57 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIK-KIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 +D S + + + I I D+ D + S+ + + Sbjct: 58 VLDRSGSMTGTPLADM-KLGAKTFIDLIDEATDSSQDGQIGSGSRMGVVSFSNTAVADTQ 116 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +K + + GST DA A ++ + K +V+ TD Sbjct: 117 LITSVDALKAAVDNLSAGGSTNHADAFAKAIQLFDPASANA-----------KVMVMFTD 165 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA---NS 433 G NT + A++QGI I I + L++ A+ + Sbjct: 166 G-NTTIGAPPAPVAAAARAQGIIIYCIGLIGSDG---LDITALNDWATDPDASHVAVTPN 221 Query: 434 THELNKIFRDRIGNEIFERVIRI 456 +L ++F + N I Sbjct: 222 AADLEELFAELAANISKPGATEI 244 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 52/192 (27%), Gaps = 30/192 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + +G + S + K Sbjct: 109 SRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREAT-----KNAL 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 TA +A+ TA I + + IVL +DG+ T Sbjct: 158 DKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNP 214 Query: 385 -EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY---------FLSNCA--SPNSFFEAN 432 AK QG+ I TI+F E + A S + + A Sbjct: 215 KGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAA 274 Query: 433 STHELNKIFRDR 444 + EL ++ Sbjct: 275 TLAELRAVYSSL 286 >gi|198436258|ref|XP_002122777.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 4258 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 47/353 (13%), Positives = 92/353 (26%), Gaps = 24/353 (6%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 N N+ T + E ++ LS + GI S Sbjct: 1925 TNTCNSGTMTRTRICMNGDFGMVGCEGPSSQTKDCENLSPCAQWSPWSPGFCSVTCGIGS 1984 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + S + + S D +C G +R+ Sbjct: 1985 INRFRLCIGGDAGGPGCEGSASSEEPCVQQSCPYWDTWSSWTECSASCDGGVEERSRTCI 2044 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + G + + D + + + SS +VR Sbjct: 2045 NGDPGDGNCTGAPVDVLACNTIACSDLPPDGVREDCAQLDIAIIIDSSSSIGNANFEIVR 2104 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 L V+ + ++ R+ A FN + G Sbjct: 2105 SFLRRVMAQLNV---GLNSTRIAALRFN---RDVIPLWSFAQSTSSEDLIERIDRVTYDG 2158 Query: 336 -STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 T A+ A + + + E + + + V+LTDG + + + K Sbjct: 2159 SGTHTGKALTYAANRLFTEAEGDRPDVPDLA------VVLTDGRAQDNPGDTV---QALK 2209 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 ++G++++ IA + E AS + F N L + + Sbjct: 2210 NKGVKVIAIAVTNRVDINEIY-----AIASDPDEQNAFFVNDFEGLFSVVENI 2257 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + D R+G + RV ++ S K + + T A+Q Sbjct: 65 IDGLSVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST--GTMTGLAIQ 122 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + + K +++TDG QDN I +A+ GI I I Sbjct: 123 FAMNVAFSEAEGGR----KSPDISKVAIIVTDGRP-QDNIR--DIAARAREAGIEIFAIG 175 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L K F++ Sbjct: 176 VGR------VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQEAF 216 >gi|27378609|ref|NP_770138.1| hypothetical protein blr3498 [Bradyrhizobium japonicum USDA 110] gi|27351757|dbj|BAC48763.1| blr3498 [Bradyrhizobium japonicum USDA 110] Length = 445 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/338 (7%), Positives = 83/338 (24%), Gaps = 23/338 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L++S G T + + ++G + + W + AA +A + ++ L + Sbjct: 2 RNLLRSRQGSAAFATVIALVPLIGAVALGAEAGSWYVTHQHAQNAADSAAYSGALRLSCT 61 Query: 72 LEEVS---------SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + ++ + + + +R R + D Sbjct: 62 MAGAACGTQSVDYLAKEFAAQNGFCNSSPQDSTPYPGTQCAPSLPNRISRAVQIDIGTYT 121 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--- 179 + + ++ + + + A+A + + K +++ Sbjct: 122 AGTFTTPPAGTGNAVRARVSQQQPAYLAAVLGLTTVNIPAQAIALVQQPTKACALALGPD 181 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + ++ + + + + +C+ Sbjct: 182 SGALKLAGNLSNNGTGCALMSDTSVQLASTPSFTGSG-----WAVYGVSGCTPSGTCSNI 236 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y + + +S + R A + Sbjct: 237 SVPHNYFMTYANNPLRKLDTESFNSRTGNTSPSCPVQADG--WKHCAPNSAGTGAYGNFT 294 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 D+ + G + + +K G T Sbjct: 295 VQNGDKYV----LDPGTYIFSKATIKVTGGQLKGTGVT 328 >gi|326779287|ref|ZP_08238552.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] gi|326659620|gb|EGE44466.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] Length = 575 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 60/202 (29%), Gaps = 24/202 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + VD S + LVR +LA + ++ D+V+ F+D Sbjct: 207 STAERPPAALTFVVDISGSMAETGRLDLVRKSLAVLTDELRDDDSVS------LVTFSDA 260 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + R +K + ST + + Y+ + + Sbjct: 261 AETRLPMT--RLQGNRNRIKDVVEEMRPEQSTNVEAGITRGYEESVEGHRKGATNR---- 314 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSN 421 +VLL+D + E I + + GI + + + R Sbjct: 315 -----VVLLSDALANTGDTEADGILERIDSTRREYGITLFGVGVGSDYGDAFMERLTNK- 368 Query: 422 CASPNSFFEANSTHELNKIFRD 443 + + K+F D Sbjct: 369 --GDGNTTYVGDEAQARKVFVD 388 >gi|281420094|ref|ZP_06251093.1| BatB protein [Prevotella copri DSM 18205] gi|281405894|gb|EFB36574.1| BatB protein [Prevotella copri DSM 18205] Length = 345 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/223 (10%), Positives = 56/223 (25%), Gaps = 42/223 (18%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + V + + + +++ + ++G F Sbjct: 94 ICLDISNSMLCQDVVPSRLDKSKMLIENLVD-------NFNNDKIGLIVFAGDAFVQLPI 146 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + ++ + T I A+ A + + I+ Sbjct: 147 TTDYVSA-KMFLQNITPGLIQTQGTNIGAAIDLASKSFTQQ-----------ENVGRAII 194 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFL-- 419 ++TDGEN + + A +GI + + K + Sbjct: 195 VITDGENHEPGAQEAAAAAN--KKGINVFILGIGNTKGAPIPMGDGSYLKDNAGNTVMTA 252 Query: 420 ---SNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 C A + ++T + + D I V Sbjct: 253 LNEQMCKELAQAGKGQYIHVDNTSDAERALNDDIAKLQKGDVT 295 >gi|307154093|ref|YP_003889477.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984321|gb|ADN16202.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 240 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 47/139 (33%), Gaps = 19/139 (13%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + +K N G T ++ T D + ++ E ++LLTDG Sbjct: 97 DQAAIKNQINRINARGCTNLSGGWLTGCDHVKANLSAERLNR---------VLLLTDGLA 147 Query: 380 TQDNEEGIAICNKA---KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST 434 N + + A QGI T+ F L N A +F+ S Sbjct: 148 NVGNSDPKILTKTATEKAEQGIITTTLGFGTYFN-----EDLLINMANGGKGNFYFIQSP 202 Query: 435 HELNKIFRDRIGNEIFERV 453 + ++F I + + + V Sbjct: 203 QDAAQVFEIEIESLVSDAV 221 >gi|14248579|gb|AAK57575.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 493 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 58/178 (32%), Gaps = 29/178 (16%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D++ D + +G F + S + + + D+ TA + + A Sbjct: 121 ADDLTDGINLGLISFAGTASTLVSPTPDHSATKNALDRLKLADK-----TATGEGIFAAL 175 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-----AICNKAKSQGIRIMT 402 I + + IVLL+DG+ T KAK +G+ + T Sbjct: 176 QQIDT---LNAVLGGPSGAPPARIVLLSDGKQTVPESPDDPRGGFTAARKAKEKGVPVST 232 Query: 403 IAFSVNKTQQEK--------------ARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 I+F + L A S FF A+S ELNK++ Sbjct: 233 ISFGTLTGTVDLETPGGGVERVPVPVDDESLRKIANLSGGDFFTASSLDELNKVYSTL 290 >gi|109087573|ref|XP_001097885.1| PREDICTED: collagen alpha-1(XXII) chain-like [Macaca mulatta] Length = 232 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 65/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + K+ + Sbjct: 93 --GLFGSREEVKAAARRLAYHGGNTNTGDALRYITALSFSPRAGGRP---GDRAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + + + L AS F Sbjct: 148 LLTDG---RSQDLVLDAAAAAHRAGIRIFAVGVG------KALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 73/200 (36%), Gaps = 26/200 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 +D L + L + + K + +DA+A I+ + K R+ F Sbjct: 311 YANVKTIDIVLCLDVSGSMRELSNGMPKIEIAKDAVAQYIQFLSKT-----NDRLAMVLF 365 Query: 303 NDRVISDPSFSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 N R WG+H++ R G T + +A++ + + + Sbjct: 366 NFRADVL----WGLHQVRRYWQQMNYMLKYVYAGGGTNLANALERSREVLT--------- 412 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 + K+++ +TDG T ++ I + + G I TIA N + R LS Sbjct: 413 --RSRSNSKHVICVTDGR-TVNSSMCIKEAVRLRRSGTTISTIAIGENSDDELLMR--LS 467 Query: 421 NCASPNSFFEANSTHELNKI 440 F + +S H+L K Sbjct: 468 KI-GGGLFIKISSIHDLGKA 486 >gi|292627386|ref|XP_001332035.3| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 1056 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 74/217 (34%), Gaps = 21/217 (9%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++++ ++ S + L +V D + ++G + Sbjct: 90 DDVRAGCSTAVNDLVYIMDGSWSVGDVDFETAKRWLINVTS---GFDVSSHYSQVGVVQY 146 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 +D + G HK + +++ G+T A++ A D + S+ Sbjct: 147 SDTPRLEIPL--GQHKTTQQLIEAIEKISYLGGNTQTGRAIKFAVDHVFPSSR------- 197 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 N + V++TDG+ +QD+ ++ A++QGI + + T E N Sbjct: 198 RNDVKNRIAVVVTDGK-SQDDVTDASL--DARTQGITVFAVGVGSEITNSELVTIA--NT 252 Query: 423 ASPNSFFEA---NSTHELNKIFRDRIGNEIFERVIRI 456 + + A + + ++ E RI Sbjct: 253 PAGDYVLFAEDYTNIERIRDAMEQKLCEE-SVCPTRI 288 >gi|221130232|ref|XP_002156394.1| PREDICTED: similar to tyrosine kinase receptor [Hydra magnipapillata] Length = 1746 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 67/235 (28%), Gaps = 22/235 (9%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + V + L D + S + K+ ++ Sbjct: 1108 KPNFWSQSSYHWNHTAVPYKINSDTNLNLKSPDCVGIVDVGFILDSSGSLEKEYPQEKEF 1167 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L ++ + F+ + + + G T Sbjct: 1168 LINLASTFGI---NPSGAHAAVVTFSSDAVLSIKLNDYFEQAPFNKSVYEIEHM--NGWT 1222 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQDNEEGIAICNKAK 394 I+ A++ + + + + LLTDG N E+ + + + Sbjct: 1223 RIDLALRKSLEMF-------EEINGARKNVPRLLFLLTDGKQETNEGGAEDPVNVAQLLR 1275 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKI-FRDRIGN 447 +G+ I+ + + E L+N A S + F A + EL K F +I Sbjct: 1276 DRGVEIIAVGIGKGVNRLE-----LNNIAGSSDKVFLAENFDELIKQDFLKKIKE 1325 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 40/144 (27%), Gaps = 11/144 (7%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + G F+ Sbjct: 1608 SQEVDFLKQLASSFGISKQGSHAGVVTFSSEAKLSIQLDKYFTDADFNKAVDDIPYM--G 1665 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I+ A++ A + + + EA K + LLTDG E + N+ K Sbjct: 1666 GGTRIDLALEKAIELFDTKK-------GSRNEAPKLLFLLTDGVQEPKMEIPVP--NEIK 1716 Query: 395 SQGIRIMTIAFSVNKTQQEKARYF 418 + I++ I N + E + Sbjct: 1717 QKIIQLFAIGIGSNVNKDELIKIV 1740 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 92/301 (30%), Gaps = 64/301 (21%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 I V+D S SM +P + T + + V I+++ Sbjct: 93 EIVLVMDTSTSME-----CLVEPESYDIDNCVPTKEGHI-----VYIKNKSYLVNTAFLQ 142 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHV-IKKKHLVRDALASVIRSIKKIDNVNDTVR 296 S + + Y + + + + +++ ++ + + + Sbjct: 143 GSRHKLFYITIGTTNYYIQGNKCYRQSSYNEKNRLQHAKESAIKFVQKFE----NDKNIS 198 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G F+ S + ++++ +I D G+T I +++A + Sbjct: 199 IGLVSFDTTANSQKDITSKLNEVEDSINSLKVAD---NGATNIEAGLKSAQQLL------ 249 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--------------------------- 389 K N +A KY++L++DG T + G + Sbjct: 250 ----KKGNKDADKYVILMSDGFPTAFDYAGEKVEKNFNYHEIQDNTFINFGYYDYSGYAM 305 Query: 390 ------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 N K GI I FS ++ A+ + EA +T LN + Sbjct: 306 KHSINQANSLKKDGINSFIIGFSEGANSEKLNNI---AKAAGGEYEEAKNTDTLNGAYDK 362 Query: 444 R 444 Sbjct: 363 L 363 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/136 (13%), Positives = 43/136 (31%), Gaps = 20/136 (14%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG--VHKLIR 321 + V+ + ++ + ++ + S ++ Sbjct: 718 NEKDRSRLDSVKKVANDFVDKF----KNDENTEIAIVRYSSKANIVLDGSNKIFLNGKDN 773 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I+K G T I D ++ +Y + K + +++KY++L+TDG T Sbjct: 774 EIIKKRINSLKADGGTNIGDGIRKSYSIL----------DKCDKDSEKYMILMTDGVPTA 823 Query: 382 DNEEGIAICNKAKSQG 397 N K+ Sbjct: 824 YTC----YANTIKASN 835 >gi|149046634|gb|EDL99459.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Rattus norvegicus] Length = 927 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|14248597|gb|AAK57584.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248591|gb|AAK57581.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|124008260|ref|ZP_01692956.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123986209|gb|EAY26038.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 552 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/287 (12%), Positives = 81/287 (28%), Gaps = 24/287 (8%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + K + S + + + +N F + + Sbjct: 96 LSVKTAPLSTFSIDVDNASYSRARKSINNGQLPSTSSVRLEEFINYFNYQYKQPEGQHP- 154 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + +P + L + +D S K L+R A Sbjct: 155 FSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLSNLVFLIDVSGSMSAPDKLPLLRKA 214 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ ++ + D R+ + + G + + GST Sbjct: 215 FKMLVNNLGEED------RVAIVVYAGNAGLVLPATQGT---DKQKIMEALDKLQSGGST 265 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---K 394 A ++ AY + + NN I+L TDG+ A+ N + Sbjct: 266 AGGAGIKLAYKI----AKQNFIKEGNNR-----IILATDGDFNLGASSDQAMQNLIEEKR 316 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +G+ I + + + K +++ ++ +E K+F Sbjct: 317 KEGVFITVLGLGMGNYRDSKMEIIADK--GNGNYYYLDNLNEAYKVF 361 >gi|14248593|gb|AAK57582.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|114047772|ref|YP_738322.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-7] gi|113889214|gb|ABI43265.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-7] Length = 755 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/267 (11%), Positives = 72/267 (26%), Gaps = 19/267 (7%) Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + Q + ++T K++++Q+ + N SL +L Sbjct: 300 VPANRDFVLQWRLKQGTSPVAWVFNQTGKTHTTQDDNASADTGPTGNSSNTDNYSLVMVL 359 Query: 245 YPG-PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 P L+ + + A ++ +++ + FN Sbjct: 360 PPKVEASEQLNLPRELILVIDTSGSMAGDSIIQAKNALRYALRGLRP---QDSFNIIEFN 416 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 V + + F G T + A+ A + Sbjct: 417 SDVSLLSPTPLPATATNLAMARQFVNRLQADGGTEMAQALNAAL------PRQAFNTASG 470 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 ++ + ++ +TDG ++ I N+ R+ T+ F+ A Sbjct: 471 EDKSLRQVIFMTDGSVGNESALFELIRNQIGDN--RLFTVGIGSAPNSH-----FMQRAA 523 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNE 448 +F E+ + + Sbjct: 524 ELGRGTFTYIGDVDEVEQKISQLLAKI 550 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 56/189 (29%), Gaps = 27/189 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 V+ A + ++ + ++ R+ F+ + +K Sbjct: 55 TPLQTVKQAASRIVDRL------SNRDRISIIAFDHKAEV---LISNELASDPQAIKRRI 105 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGI 387 G T I+D ++ + + S E + + ++LLTDGEN DN I Sbjct: 106 NSLRAGGGTCIDDGLKAGIEQLASGKEGYISQ----------LLLLTDGENEHGDNSRAI 155 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRI 445 + + A + + T+ F + L A S E F Sbjct: 156 KLADVAIGYNLTVNTLGFG-----DHWNQDVLEQIADAGGGSLSYIEHAEEAIATFGRLF 210 Query: 446 GNEIFERVI 454 + Sbjct: 211 TRMQSVSLT 219 >gi|14248575|gb|AAK57573.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248577|gb|AAK57574.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248569|gb|AAK57570.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248571|gb|AAK57571.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248573|gb|AAK57572.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|162447313|ref|YP_001620445.1| surface-anchored VWFA domain-containing protein [Acholeplasma laidlawii PG-8A] gi|161985420|gb|ABX81069.1| surface-anchored VWFA domain protein [Acholeplasma laidlawii PG-8A] Length = 486 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/291 (14%), Positives = 74/291 (25%), Gaps = 25/291 (8%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 I ++ S S +D + +N F ++ + Sbjct: 52 IDVSVNNKSNISLSANTASYSFIRSQINSGRAVDRNAVRIEEMVNFFNYNYNQPETDKT- 110 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 K + + L D + +D S K L + A Sbjct: 111 FGFKSELIQTPWNNETHLLLIGL-ETKQVDLGDIPSNIVILLDVSGSMSATNKLSLAKKA 169 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + +I +K D ++ ++ G + + GST Sbjct: 170 MELLIEQMKPNDVIS------LVTYSSGEKVVFK---GKSIDDMAYMTSQIRLLKASGST 220 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---K 394 A + AY E+ NN I+L TDG+ + + Sbjct: 221 AGKKGLDMAYKV----AEEYFIEGGNNR-----IILATDGDFNVGISSTDMLIEYISEKR 271 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 GI F + EK A ++ + K F D I Sbjct: 272 ESGIYFSAYGFGYGNFKDEKLERVAK--AGNGTYHYIDDIISARKAFVDNI 320 >gi|296118874|ref|ZP_06837447.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] gi|295967972|gb|EFG81224.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] Length = 674 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 61/175 (34%), Gaps = 21/175 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNV----NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + + A ++ S+ + + A +R D V ++ + Sbjct: 77 TRLDAAKQAANQLVNSLPETAVMGMLAYGASGSNAPDNRERGCQDIDVLAPVERIDNEEL 136 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K+ G T + +A++ A D + E + I+L++DG +T Sbjct: 137 KSEIGALEAQGYTPMGNALRAAADEL-------------GSEGDRSIILVSDGIDTCAPP 183 Query: 385 EGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + G + I T+ F ++ Q + AS ++ EA + EL Sbjct: 184 PACEVAEELAGDGFDLAIHTVGFKPDEAAQAELECISE--ASGGTYVEAENAEEL 236 >gi|14248605|gb|AAK57588.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 46/153 (30%), Gaps = 19/153 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDT 349 + R+G ++ +V S ++ T A+Q A + Sbjct: 69 GPNATRVGVIQYSSQVQSVFPLG---AFSRPEDMERAIRALVPLAQGTMTGLAIQYAMNV 125 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S E + + V++TDG + + + +A+++GI I + Sbjct: 126 AFSVAEGARPPEERVP---RVAVIVTDG---RPQDRVAEVAAQARARGIEIYAVGVQRAD 179 Query: 410 TQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 L AS F S + + Sbjct: 180 VGS------LRAMASHPLDEHVFLVESFDLIQE 206 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 363 FELVKRFVNQIVDFLDVSPDGT---RVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 417 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 418 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 471 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 472 ARAKEEGIVMYAVGVG------KAVEEELREIASEP 501 >gi|149918749|ref|ZP_01907236.1| batB protein [Plesiocystis pacifica SIR-1] gi|149820350|gb|EDM79766.1| batB protein [Plesiocystis pacifica SIR-1] Length = 421 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/119 (21%), Positives = 41/119 (34%), Gaps = 25/119 (21%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHR------------------MKNNLEAKKYIVLLTD 376 G TAI A+ + D +++ D+ EA + IVLLTD Sbjct: 184 GGTAIGKALDKSIDLLVAVRRDDSGARADQVEGEGEDESGAPEAAPALSEADQVIVLLTD 243 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 GE+T + +A+ GIRI T+ + + + A A Sbjct: 244 GEDTVG--RPEEVAARAEQLGIRIYTVGIGSDSGEP-----IMRYDADGQPAGYATDAD 295 >gi|149922101|ref|ZP_01910541.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] gi|149817038|gb|EDM76520.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] Length = 350 Score = 51.9 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 47/145 (32%), Gaps = 16/145 (11%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + A + + ST +N A+ + + E + ++ +V+ Sbjct: 152 TGSEGAAEARANSLAGFQPKDKSTNLNGAIVQGLEVLDEGLEKAKNPLRLGT-----LVV 206 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 TDG + + + K + I ++ + L + A Sbjct: 207 FTDGTDRASRVSEDEMLSAVKDTPYEVFAIGLGAELSEGD-----LQKIGKSGTA-LAKD 260 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 + E+ F I ER+ ++T+ Sbjct: 261 SGEIQTAF-----ESIGERIEKLTR 280 >gi|301770299|ref|XP_002920606.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Ailuropoda melanoleuca] Length = 1081 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/279 (12%), Positives = 86/279 (30%), Gaps = 23/279 (8%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 + ++ V+ + + + LN + +++ + Sbjct: 128 EDSNFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGS 187 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + + L ++ +S + V +++ + + K Sbjct: 188 ATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKT 247 Query: 289 ------DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAIN 340 D ++D + FN++ F+ V + + K G+T Sbjct: 248 SVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYK 307 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGI 398 + A+D + +SN K I++ TDG + + K ++ + Sbjct: 308 AGFEYAFDQLQNSN-------ITRANCNKVIMMFTDG----GEDRVQDVFEKYNWPNRTV 356 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 R+ T FSV + + CA+ +FE S + Sbjct: 357 RVFT--FSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 393 >gi|297285706|ref|XP_001090735.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Macaca mulatta] Length = 1417 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 427 VYYDAKADAELDDPESEDVERGFKASTLRLDFIEDPNFKNKVNYSYTAVQIPTD-----I 481 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 482 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 541 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 542 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 601 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 602 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 654 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 655 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 708 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 709 ACANKGYYFEIPSIGAI 725 >gi|281338317|gb|EFB13901.1| hypothetical protein PANDA_009310 [Ailuropoda melanoleuca] Length = 1046 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/279 (12%), Positives = 86/279 (30%), Gaps = 23/279 (8%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 + ++ V+ + + + LN + +++ + Sbjct: 126 EDSNFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGS 185 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + + L ++ +S + V +++ + + K Sbjct: 186 ATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKT 245 Query: 289 ------DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAIN 340 D ++D + FN++ F+ V + + K G+T Sbjct: 246 SVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYK 305 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGI 398 + A+D + +SN K I++ TDG + + K ++ + Sbjct: 306 AGFEYAFDQLQNSN-------ITRANCNKVIMMFTDG----GEDRVQDVFEKYNWPNRTV 354 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 R+ T FSV + + CA+ +FE S + Sbjct: 355 RVFT--FSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 391 >gi|241767791|ref|ZP_04765389.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241361168|gb|EER57806.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 277 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 56/223 (25%), Gaps = 50/223 (22%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++A + I + VR+G F + L Sbjct: 27 SMRATDVQPDRITAAQNAAKAFIAEL------PRHVRVGIVAFAGSAQLAQLPTQSHEDL 80 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------------------------- 354 + TA + + A T+ Sbjct: 81 AKA-----IDSFQLQRGTATGNGIMLALATLFPDAGIDIAALGGRQAMHPRPLDEVTRQD 135 Query: 355 -EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 + I++LTDG+ T + + A +G+R+ T+ + + Sbjct: 136 PAKPFTPVAPGSYTSAAIIMLTDGQRTTG-VDPLEAAQWAADRGVRVYTVGVGTVQGETI 194 Query: 413 ---------EKARYFLSNCASPNS--FFEANSTHELNKIFRDR 444 L A +F A + +L K++ Sbjct: 195 GFEGWSMRVRLDEETLKAVAGRTHAEYFHAATAADLKKVYETL 237 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 68/188 (36%), Gaps = 35/188 (18%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ L+++I D +R+G + ++ ++ S + +++ + Sbjct: 260 IKEFLSALISPFNIAQ---DKIRVGLSQYSSDPRTEWDLS--AYATRDQVLEAVRNLRYK 314 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T A+ + ++ LEA+K ++LLTDG+ +QD+ Sbjct: 315 GGNTFTGLALTHVLE------QNLKPDAGARLEAEKLVILLTDGK-SQDDANL--AAQTL 365 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIF 450 K+ GI I I + L AS + + F + Sbjct: 366 KNMGIEIFAIGV------KNADEAELKQVASEPLELTVYNVLD-------FP-----LLS 407 Query: 451 ERVIRITK 458 V R+T+ Sbjct: 408 SLVGRLTR 415 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 54/187 (28%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ A ++ +K D R+ F+ R +K Sbjct: 59 LEIVKQAAIRLVDRLKTGD------RLSVVAFDHRAKVLVP---NQVIDNPEQIKKQINR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G TAI++ ++ + + ++ + + LLTDGEN DN + Sbjct: 110 LAADGGTAIDEGLRLGIEELAKGKKETISQA----------FLLTDGENEHGDNNRCLKF 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 A + + T+ F N + L A S + F Sbjct: 160 AQLAAGYNLTLNTLGFGDNWN-----QDVLEKIADAGLGSLSYIQKAEQAVDEFGRLFSR 214 Query: 448 EIFERVI 454 + Sbjct: 215 IQTVGLT 221 >gi|87306384|ref|ZP_01088531.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] gi|87290563|gb|EAQ82450.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] Length = 1030 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 36/88 (40%), Gaps = 8/88 (9%) Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 N A K++++++DG+ + N ++ GI++ T+A + L A Sbjct: 542 NEVAVKHMIIISDGDPSPANPFTLS---AIAKAGIKVTTVAIGTHGPANSLE---LKKIA 595 Query: 424 S--PNSFFEANSTHELNKIFRDRIGNEI 449 S ++E + L +I++ Sbjct: 596 SATGGKYYEVTNPKALPRIYQREARRIA 623 >gi|301778755|ref|XP_002924795.1| PREDICTED: integrin alpha-X-like [Ailuropoda melanoleuca] Length = 1160 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 57/159 (35%), Gaps = 13/159 (8%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F+D +F ++ G T A+Q + + S+ Sbjct: 186 NTQFSLMQFSDDFRVHFTFKDFTDSSNPLVLLDSVYQL--GGYTHTATAIQMVTNQLFST 243 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + +A K ++++TDG+ D + A++ GI + + ++ Sbjct: 244 SSGAR------KDASKILIVITDGQKQGDYLNYEDVIPMAEAAGIIRYAVGVGLAFRKRH 297 Query: 414 KARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEI 449 + L++ A S F+ ++ L ++++ +I Sbjct: 298 SWKE-LNDIASKPSNEYIFKVDNFDVLRD-IQNQLKEKI 334 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 54/167 (32%), Gaps = 26/167 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ R + G T+I +++A+ I Sbjct: 345 WVGMVTFDSAAYVQSELVQINSATERDTLTKSLPT-TASGGTSICSGLRSAFTVIKKKYP 403 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 + I+LLTDGE+ I+ C ++ K G I T+A + Q+ + Sbjct: 404 TDGAE----------IILLTDGEDN-----TISACFDEVKQSGAIIHTVALGPSAAQELE 448 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDRI--GNEIFERVIRI 456 + + A+ + L F + +R I++ Sbjct: 449 QMSKM----TGGLQTYASDQVQNNGLVDAFAALSSGNKAVSQRSIQL 491 >gi|14248585|gb|AAK57578.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|310824462|ref|YP_003956820.1| hypothetical protein STAUR_7237 [Stigmatella aurantiaca DW4/3-1] gi|309397534|gb|ADO74993.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 913 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 43/123 (34%), Gaps = 16/123 (13%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + A + G + +A++ I+ + +A ++++L +D ++++ Sbjct: 462 LNKVARGFSGGGGIFVGEALREGKTQIL-----------RSDKATRHVLLFSDAADSEEP 510 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 ++ A + + + + I ++ L A + A L +IF Sbjct: 511 DDYRATLAALRRENVTVSVIGLGTP---KDSDADLLREVAQLGGGRIYFAEDALSLPRIF 567 Query: 442 RDR 444 Sbjct: 568 SQE 570 >gi|115377018|ref|ZP_01464236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365996|gb|EAU65013.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 884 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 43/123 (34%), Gaps = 16/123 (13%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + A + G + +A++ I+ + +A ++++L +D ++++ Sbjct: 433 LNKVARGFSGGGGIFVGEALREGKTQIL-----------RSDKATRHVLLFSDAADSEEP 481 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 ++ A + + + + I ++ L A + A L +IF Sbjct: 482 DDYRATLAALRRENVTVSVIGLGTP---KDSDADLLREVAQLGGGRIYFAEDALSLPRIF 538 Query: 442 RDR 444 Sbjct: 539 SQE 541 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 53/169 (31%), Gaps = 26/169 (15%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S + + AL I DNV T F+ S Sbjct: 390 IDTSGSMRQENRLREAKTALGDFIDIFADQDNV------QVTIFSTNATELSDLSPI--G 441 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 R + T G T + + Y I E + + +V+LTDGE Sbjct: 442 PKRADLHTRIDGLVADGETRLYSTIGEVYTDIQQQTEVQRI---------RALVVLTDGE 492 Query: 379 NTQDNEEGIAICNKAKSQ----GIRIMTIAFSVNKTQQEKARYFLSNCA 423 +T + + + + I+I TIA+ + + L A Sbjct: 493 DTASSLSLEQLNEQIRQDESGTSIKIFTIAYGSDAN-----QEVLQRIA 536 >gi|14248583|gb|AAK57577.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248563|gb|AAK57567.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248565|gb|AAK57568.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248567|gb|AAK57569.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248607|gb|AAK57589.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248633|gb|AAK57602.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248615|gb|AAK57593.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248617|gb|AAK57594.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248639|gb|AAK57605.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248595|gb|AAK57583.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|118384116|ref|XP_001025211.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89306978|gb|EAS04966.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 631 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 51/162 (31%), Gaps = 14/162 (8%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N+ R+ F+ F + ++ +K + GST I M+ I Sbjct: 175 NENDRICLISFDSVEKILTPFLRN-NLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIK 233 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + E + LL+DG++ + + +S I+ I + Sbjct: 234 NRKE---------KNPITCMFLLSDGQDDSPQVDLR-VQKLIQSYDIQDTFI-VNTYGYG 282 Query: 412 QEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFE 451 + + N A ++ ++++ F I + Sbjct: 283 ADHDATQMRNIAETHKGGYYYIEDVKKVSEWFVLSISGLLSA 324 >gi|331647928|ref|ZP_08349020.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] gi|330912098|gb|EGH40608.1| hypothetical protein ECAA86_02478 [Escherichia coli AA86] gi|331043652|gb|EGI15790.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] Length = 580 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/394 (9%), Positives = 107/394 (27%), Gaps = 31/394 (7%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 Q + + + K+ E + K + + + Sbjct: 27 KSQQQQSTTPTDQQVLAAQQAATKVAEQSAAKAAADAKALAQQEVQQYSDKQALLGRLQA 86 Query: 126 KSAYQVVLSSRYDLLL-NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 YQ + + NP + + + T S +++ ++ Sbjct: 87 APKYQHAAREKAAYQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLN 146 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNK 238 + + +N F D K + + + Sbjct: 147 Q-GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 205 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + P+ + +D+S ++ L++ +L +++ +++ DN+ Sbjct: 206 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------ 259 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + G H + + + GST ++ AY ++ Sbjct: 260 IVTYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFVK 314 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKA 415 + I+L TDG+ ++ + + K + G+ + T+ + + Sbjct: 315 GGVNR-------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMM 367 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ E K+ + + Sbjct: 368 VRIAD--VGNGNYSYIDTLSEAQKVLNSEMRQTL 399 >gi|14248599|gb|AAK57585.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248601|gb|AAK57586.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248603|gb|AAK57587.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 47/379 (12%), Positives = 95/379 (25%), Gaps = 33/379 (8%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + ++ + +N V D + + V Sbjct: 173 ASAAVSDEQKAEEAPKSLKLAIDAIDASNYPTIKVKLAVEDGSEQSDLSSGQVAIKENTV 232 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 ++ N + + E + S+ L Sbjct: 233 AQKTAEVNANTADKTYEIVYDTTVSNTNPPNGEQRVVDLVIGDNKLSESYKSPSQKKLHI 292 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 S + + + + Sbjct: 293 DDVSYNTDEYPKVNVYFSLYDENNQLVEDMNPVKTAFTVKEGDKETKNASFSKLTEKPQA 352 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +S +D S K V+DA + ++ V +F + I F Sbjct: 353 ISTNLVIDVSDSMSEDNKLTKVKDAATQFLSHASFA---SNDVVGLMSFSDASNIRQSDF 409 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + + G TA+ +A+ A + N++E KY+V Sbjct: 410 TTEIESIKSS-----IAGMQTSGCTALYEALNQAVS----------NTAYNSVEGSKYVV 454 Query: 373 LLTDGENT----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPN 426 + TDG+NT + + N A G+ I I + L A + Sbjct: 455 VFTDGKNTICDGTNWVSPSTVINNALQWGVPIYAIGVEEDAD--------LQQIAEQTNG 506 Query: 427 SFFEA-NSTHELNKIFRDR 444 + N +LN I+ D Sbjct: 507 QYHVLGNDFTDLNAIYSDI 525 >gi|303246237|ref|ZP_07332517.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] gi|302492300|gb|EFL52172.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] Length = 382 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/303 (11%), Positives = 74/303 (24%), Gaps = 17/303 (5%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G +I AL M V+ G + VD Y L+ AA A + + L+Q + Sbjct: 13 QAGSVAVIVALSMIVLAGFATLAVDYGFLEYKRSQLQNAADAAALAGASVLVQYGANQEA 72 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP-----------RK 126 + + + + + + V + Sbjct: 73 VTDTAVLYGQANLNDSDSKEMAIRNSDVTYPDAVSVRATVGRTQERGNPVEMFLGRILGW 132 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + L + S L+ K + + + Y+ + S Sbjct: 133 NTQDIAATGVAALFCSKSSKCLKPW-SPPAKFTWKDDCDADKKYYNNNQLDAGSVCEMNS 191 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + Y + G P+ TV G D + Y Sbjct: 192 VEVQGYDNNDVGTPIILKFGDPSGTV-----TPGHYQPVDYPPASSGTPMTGGDAYRDNI 246 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 S + L ++ + T + + ++D + Sbjct: 247 AGCTGSNDGIVNTGDELQVEPGDMVGPTKQGLKELLDQDPGASWDSATNSIVDSAYSDPL 306 Query: 307 ISD 309 S Sbjct: 307 SSP 309 >gi|149046636|gb|EDL99461.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Rattus norvegicus] Length = 920 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 23/233 (9%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + + D +P + Y VD S Sbjct: 207 VFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSG 266 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVK 325 + L+R +++ ++ ++ +D + FN F V + ++K Sbjct: 267 LT-LKLIRTSVSEMLETL------SDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLK 319 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T A++ +++ N + K I+L TDG E Sbjct: 320 DAVNNITAKGITDYKKGFSFAFEQLLNYNV-------SRANCNKIIMLFTDG----GEER 368 Query: 386 GIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + +R+ T FSV + ++ C + ++E S + Sbjct: 369 AQEIFAKYNKDKKVRVFT--FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + K D T A+Q Y + D++ N I+ Sbjct: 329 SSNADWVTKQLNKINYEDHKLKSGTNTKKALQAVYSMMS--WPDDIPPEGWNRTRH-VII 385 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 386 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 444 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 445 KKD-NEQHVFKVKDMENLEDVFFQMIDESQSLSL 477 >gi|309792307|ref|ZP_07686777.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225622|gb|EFO79380.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 423 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 54/178 (30%), Gaps = 21/178 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V+DA ++ + D ++ FNDR H+ +K Sbjct: 61 RLQQVKDAANRIVDMLTDDDYLS------LVTFNDRAEVVIPAQRPQHRPD---LKRMIS 111 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + M A M + I+LLTDG D + I Sbjct: 112 GIEAAGGTEMATGMALALQETQ------RPLMGRGVSR---ILLLTDGRTYGDEGRCVEI 162 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 +A+ +G+ + + + + S E+ +IF D + Sbjct: 163 ARRAQGRGVGLTALGIGGEWNEDLLETMSARE---NSRTQYITSAAEITQIFTDEVKR 217 >gi|14248641|gb|AAK57606.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|238027555|ref|YP_002911786.1| membrane protein [Burkholderia glumae BGR1] gi|237876749|gb|ACR29082.1| Membrane protein [Burkholderia glumae BGR1] Length = 620 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 47/163 (28%), Gaps = 8/163 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G F ++T + M VM+ V GM +D+ + L++ A A + L + + Sbjct: 21 RRRERGAFAMMTIIFMTVMIAVLGM-LDIGNVFFQRRDLQRIADMAALAGVQRLDATCSQ 79 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 A S ++ D NP S+ V L+ Sbjct: 80 APVSASRSAASN----GLNTGAGDTISVGCGRWDPTANPAPSYYVPVANPGASSPAVQLN 135 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + + S + + + + Sbjct: 136 A---VQVAVSRQVPYFFLGPSRTVYAMSTSRATAIDVFSVGAT 175 >gi|156742135|ref|YP_001432264.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233463|gb|ABU58246.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 425 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 58/179 (32%), Gaps = 23/179 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V+DA A ++ + + D + FNDR + + +K Sbjct: 61 RLMQVKDAAARIVDQLGQDDYFS------LVVFNDRADVVIP---AQRAIKKADLKAAIA 111 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + M A + R + I+LLTDG D + I Sbjct: 112 QIEAAGGTEMAQGMALALQEVQ--------RPFLTRGISR-IILLTDGRTYGDESRCVEI 162 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGN 447 + +S+GI + + + AS + + ++ K+F D + Sbjct: 163 ARRGQSRGIGLTALGIGTEWNEDLLETMT----ASENSRAQYIATAQDVVKVFADEVKR 217 >gi|117920853|ref|YP_870045.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. ANA-3] gi|117613185|gb|ABK48639.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. ANA-3] Length = 751 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 43/441 (9%), Positives = 104/441 (23%), Gaps = 63/441 (14%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + A + + +I+ + + +Q E + + + N EV + Sbjct: 127 EAAVDSLRLHIGERIIEGQIHP---KQQAKQIFEQAKSEGKRASLVSQERPNMFTTEVAN 183 Query: 114 IVRDTAVEMNPRKSA------------YQVVLSSRYDLLLNPLS------LFLRSMGIKS 155 + D + + + +V++ RY L + Sbjct: 184 LAPDEELVVEISYQETIHYEDGLFSLRFPLVVAPRYIPGLTLGGNNGERVTSSQVFDADR 243 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + + + V + +D S + Y + + V + Sbjct: 244 IIAPIRDANSEADPVLKADIKVKLGEGVDKSAVVSPYHPITIDEKQGQLTVALANRVPAN 303 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV-------- 267 + + + SP N++ + + + S++ + Sbjct: 304 RDFVLQWRL-KQGTSPVGWVFNQAGKTHVSQDDNASADTGPTGKSSNTDNYSLVMVLPPK 362 Query: 268 -------------IKKKHLVRDALASVIRSIKKIDNV-----NDTVRMGATFFNDRVISD 309 I I K FN V Sbjct: 363 VEASEQLNLPRELILVIDTSGSMAGDSIIQAKNALRYALRGLRPQDSFNIIEFNSDVSLL 422 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + F G T + A+ A + ++ + Sbjct: 423 SPTPLPATASNLAMARQFVNRLQADGGTEMAQALNAAL------PRQAFNAASAEDKSLR 476 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNS 427 ++ +TDG ++ I N+ R+ T+ F+ A + Sbjct: 477 QVIFMTDGSVGNESALFELIRNQIGDN--RLFTVGIGSAPNSH-----FMQRAAELGRGT 529 Query: 428 FFEANSTHELNKIFRDRIGNE 448 F E+ + + Sbjct: 530 FTYIGDVDEVEQKISQLLAKI 550 >gi|218690433|ref|YP_002398645.1| hypothetical protein ECED1_2737 [Escherichia coli ED1a] gi|218427997|emb|CAR08918.2| conserved hypothetical protein [Escherichia coli ED1a] Length = 580 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 92/322 (28%), Gaps = 30/322 (9%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + NP + + + T S +++ ++ + Sbjct: 99 ASQIANPGTARYQQFDDNPIKQVAQNPLVTFSLDVDTGSYANVRRFLNQ-GLLPPPDAVR 157 Query: 197 EGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F D K + + + + Sbjct: 158 VEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEEL 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ + Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIAL 271 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G H + + + GST ++ AY ++ + Sbjct: 272 PSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIKGGINR------- 319 Query: 371 IVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++ + + K + G+ + T+ + + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIAD--VGNGN 377 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 + ++ E K+ + + Sbjct: 378 YSYIDTLSEAQKVLNSEMRQTL 399 >gi|162456414|ref|YP_001618781.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] gi|161166996|emb|CAN98301.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] Length = 507 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 56/180 (31%), Gaps = 22/180 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R A + + + D V+ F D + + + + + Sbjct: 138 KMENARAAAQAFVDRLPDGDLVS------VASFADTAQARVAPTVLGRSTRPAVARA-IA 190 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-- 387 GST + ++ A +++ A + +VL++DG+ Sbjct: 191 ALGPDGSTNLFAGLKLAEQHALAAPS---------THAVRRVVLISDGQANIGPSSPDIL 241 Query: 388 -AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A+ + + G++I +I + ++ S + E++ + + Sbjct: 242 GALAQRGAAHGVQITSIGVGADYDERTLNAL---AVGSSGRLYHLTEAREMSSVLERELA 298 >gi|126309704|ref|XP_001376377.1| PREDICTED: similar to complement C2 [Monodelphis domestica] Length = 822 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 59/193 (30%), Gaps = 19/193 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + +++ ++ I D V +G F S + + Sbjct: 331 STEDFAIFKESAQLMVDRIFSFDVN---VSVGIVIFAKSPKVILSVT-HKDSRDEMEIAK 386 Query: 327 FAIDENEM-----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 D T IN AM Y+ + + R K+ + + I+LLTDG++ Sbjct: 387 KLEDLKYGDPDIGTGTNINKAMMQIYEMMNNEMAIFGGRQKDWEKIRHVIILLTDGKSNM 446 Query: 382 DNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 A+ K K IR I I ++ S F Sbjct: 447 GGSPTEAV-KKIKEVLNIRQERTDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILP 505 Query: 433 STHELNKIFRDRI 445 + L ++F + + Sbjct: 506 DSKALLQVFENML 518 >gi|326675803|ref|XP_002665305.2| PREDICTED: collagen alpha-1(XII) chain [Danio rerio] Length = 3039 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 55/177 (31%), Gaps = 24/177 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + V++ ++ ++ + +VK GST AM Sbjct: 435 VNSFDIGPNKVQISLVQYSRDPHTEFAL--NKFDDNAAMVKAVRTFPYRGGSTNTGKAMT 492 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + I S + +VL+TDG+++ ++ NK + + I + Sbjct: 493 YVREKIFVSGRGAR------DNVPRVMVLITDGKSSDSFKDP---ANKLRDTDVEIFAVG 543 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ R L A+ N FE F +RI + Sbjct: 544 V------KDAVRSELEAIANVPADNHVFEVED----FDAFERISKELTASICLRIEQ 590 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 57/181 (31%), Gaps = 20/181 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + R + D D V++G ++ ++ H+ ++ A + Sbjct: 1185 FKTIRNFIARMVGVFDIGPDRVQIGLAQYSGDPKTEWHL--NAHRTRTQLLDAVANLPYK 1242 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T A+ + + ++K VL+TDG+ +QD+ + Sbjct: 1243 GGNTLTGLALNYILQ------NNFKPNVGMRPNSRKIGVLVTDGK-SQDDIVANS--QNL 1293 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIF 450 + QGI + I + L AS + L I D N Sbjct: 1294 RDQGIELYAIGV------KNADENELRTIASDPDDIHMYNVADFSFLLDIVDDLTNNLCN 1347 Query: 451 E 451 Sbjct: 1348 S 1348 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 51/177 (28%), Gaps = 20/177 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D D R+ ++ ++ S H +++ + G+T DAM Sbjct: 134 MAGAFDIEEDKTRVAVVQYSSDTRTEFSL--NTHFRRPDVLRAIKNLPYKGGNTMTGDAM 191 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + K +++TDG++ + ++ G+ I + Sbjct: 192 DYLVKNTFT------QAAGARKGFPKVAMIITDGKSQDP---VEEYAERLRNIGVEIFVL 242 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + L AS + + +N++ + I ++ Sbjct: 243 GI------KGADEDELKEIASRPHSKHVYNVPNFDMINQVQKALITEVCSGVEEQLN 293 >gi|327265755|ref|XP_003217673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Anolis carolinensis] Length = 1078 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 37/309 (11%), Positives = 92/309 (29%), Gaps = 23/309 (7%) Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + + G + + + + + + V+ + + + Sbjct: 97 YDSKADTEYDDLDGEEIERDTSNSLKLEFTDDENFKTKVNYSYAAVQIPTDIYKGSTVIL 156 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 LN D V++ + + + + L + Sbjct: 157 NELNWTEALEDVFVENRKEDPSLLWQVFGSATGVTRFYPATPWRAPNKIDLYDVRRRPWY 216 Query: 259 VDSSSLRHVIKKKHLVRDALASVI------RSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +S + V +++ + +D ++D + FN + + F Sbjct: 217 IQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVYDMLDTLSDDDYVNVASFNQKAQAVSCF 276 Query: 313 SWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + V + + K G+T + A++ + +SN + K Sbjct: 277 THLVQANIRNKKVFKEKVEVMEARGTTDYKAGFEFAFEQLQNSN-------ISRANCNKM 329 Query: 371 IVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I++ TDG + + K ++ +R+ T FSV + + CA+ + Sbjct: 330 IMMFTDG----GEDRVQDVFEKYNWPNKTVRVFT--FSVGQHNYDVTPLQWMACANKGYY 383 Query: 429 FEANSTHEL 437 FE S + Sbjct: 384 FEIPSIGAI 392 >gi|149197491|ref|ZP_01874542.1| hypothetical protein LNTAR_00880 [Lentisphaera araneosa HTCC2155] gi|149139509|gb|EDM27911.1| hypothetical protein LNTAR_00880 [Lentisphaera araneosa HTCC2155] Length = 890 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 57/184 (30%), Gaps = 25/184 (13%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 +++ K L + A I + +D+V+ + + + Sbjct: 401 TVKGGKTKMELANEGAAQTIELLGAMDSVS------VIAVDTEAHAIVPQTVLKDAPEIA 454 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + G + ++ ++ + E +K+++L +D ++++ Sbjct: 455 SQARRVKSQ--GGGIYVYTGLEESWRQL------------EGREGQKHVILFSDSNDSEE 500 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + K +G+ + IA + + FL + A F + L I Sbjct: 501 PGRYKELLADMKDEGMTVSVIALG---ERTDVDSPFLIDIANRGRGRIFFTDDPLSLPSI 557 Query: 441 FRDR 444 F Sbjct: 558 FAQE 561 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 21/192 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A VI ++ + + D+ + RT KT Sbjct: 55 SRMSAAKQAFNDVIDAVPEEVELGIRTLGADYPGEDKARGCKDTRQLYPVGPIDRTEAKT 114 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ A D + + + IVL++DGE+T + Sbjct: 115 AVATLSPTGWTPIGPALLGAADDLD-----------GDEGGSRRIVLISDGEDTCGPLDP 163 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 + + ++G+ I T+ K R LS A ++ +L+ + Sbjct: 164 CEVAREIAAKGVDLVIDTLGL----VPNAKIRQQLSCIAGATGGTYTAVQHKEDLSDKVK 219 Query: 443 DRIGNEIFERVI 454 + V Sbjct: 220 QLVDRAADPVVT 231 >gi|14248613|gb|AAK57592.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|14248637|gb|AAK57604.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 53/168 (31%), Gaps = 22/168 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ VIR + D R+ FND V S V R+I Sbjct: 98 SRLDVLKASMKFVIRKLADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 151 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G TA+ A++ A + ++ +I+LLTDG++T Sbjct: 152 IDRLQARGGTALMPALEEAVKILDERQ-------GSSRNHVGFILLLTDGDDTTGFRWTR 204 Query: 388 AICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + T + + S ++ + Sbjct: 205 ---DAIHGAVFKYPVHTFGLGASHDPEALLHIAQ---GSRGTYSFVDD 246 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 21/192 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A VI ++ + + D+ + RT KT Sbjct: 54 SRMSAAKQAFNDVIDAVPEEVELGIRTLGADYPGEDKARGCKDTRQLYPVGPIDRTEAKT 113 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ A D + + + IVL++DGE+T + Sbjct: 114 AVATLSPTGWTPIGPALLGAADDLD-----------GDEGGSRRIVLISDGEDTCGPLDP 162 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 + + ++G+ I T+ K R LS A ++ +L+ + Sbjct: 163 CEVAREIAAKGVDLVIDTLGL----VPNAKIRQQLSCIAGATGGTYTAVQHKEDLSDKVK 218 Query: 443 DRIGNEIFERVI 454 + V Sbjct: 219 QLVDRAADPVVT 230 >gi|302865239|ref|YP_003833876.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568098|gb|ADL44300.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 429 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 62/181 (34%), Gaps = 14/181 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + + A V+ ++ + V D+ V + RT K Sbjct: 61 SRISVAQQAFNEVVDALPDETQLGIRVLGATYRGKDKKQGCLDTQQIVPVGPVDRTQAKA 120 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A+++A + + + IVL+TDGE+T + Sbjct: 121 AVAGLRPTGFTPVGLALRSAAQDLGTG------------STARRIVLITDGEDTCAPPDP 168 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + +QG R++ + ++ + + A+ ++ A S EL + + Sbjct: 169 CEVARELAAQGTRLVVDTLGLAPDEKVRKQLLCIAGATGGTYTAAQSADELTGRIKQLVD 228 Query: 447 N 447 Sbjct: 229 R 229 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 62/200 (31%), Gaps = 24/200 (12%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + +D S+ K + ++ L + + D R+G F++ Sbjct: 78 YSEKRVYNIIIALDVSNSMKEKNKLKISKEILRDFLLKRDEED------RIGILVFDNLP 131 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + L+R ++ ++G TA+ D + A + + + Sbjct: 132 FRLMPLTSDRGALLR-VISIIRPAMVDVGGTAMYDGLVEALNMFMKDRRN---------- 180 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K I+LLTDG + + + G +I TI S R A+ Sbjct: 181 --KIIILLTDGGDINSKYTLEDVVRFNQDIGAKIYTIGVSSGMNFYVLERLSE---ATGG 235 Query: 427 SFFEANS--THELNKIFRDR 444 F L +F + Sbjct: 236 KAFFVTKDYQKALRSVFDEI 255 >gi|194671641|ref|XP_591137.3| PREDICTED: matrilin 3 [Bos taurus] gi|297480578|ref|XP_002691564.1| PREDICTED: matrilin 3 [Bos taurus] gi|296482389|gb|DAA24504.1| matrilin 3 [Bos taurus] Length = 574 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 66/220 (30%), Gaps = 26/220 (11%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + L +DSS + V+ ++ +I ++ D Sbjct: 150 YSGTGQPGGARGAGVCKSRPLDLVFIIDSSRSV-RPLQFTKVKTFVSKIIDTL---DIGP 205 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTII 351 R+ + V + + +K T A+QTA D Sbjct: 206 MDTRVAVVNYASTVKIEFHLQT---HSDKQSLKRAVARITPLSTGTMSGLAIQTAMDEAF 262 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + E + K +++TDG + ++ + +A++ GI + + + Sbjct: 263 TV---EAGARGPSSNIPKVAIIVTDG---RPQDQVNEVAARARASGIELYAVGV----DR 312 Query: 412 QEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS F + +L+ F++ Sbjct: 313 ADMES--LKMMASEPLDEHVFYVETYGVIEKLSSRFQETF 350 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 54/194 (27%), Gaps = 22/194 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTFAIDEN 332 A ++ I+K+ + R G + V S + + + + D Sbjct: 289 AKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEPDSSNADWVTKQLNEINYEDHK 348 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+Q Y + D++ N I+L+TDG + + I + ++ Sbjct: 349 LKSGTNTKKALQAVYSMMS--WPDDIPPEGWNRTRH-VIILMTDGLHNMG-GDPITVIDE 404 Query: 393 AKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + R + Q + F+ L Sbjct: 405 IRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKD-NEQHVFKVKDMENLED 463 Query: 440 IFRDRIGNEIFERV 453 +F I + Sbjct: 464 VFFQMIDESQSLSL 477 >gi|119612173|gb|EAW91767.1| matrilin 2, isoform CRA_c [Homo sapiens] Length = 451 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + V ++ S K + + + T Sbjct: 80 IVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTG 137 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 138 LAIQYALNIAFSEAEGARP---LRENVPRVIMIVTDG---RPQDSVAEVAAKARDTGILI 191 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNK 439 I L + S + F + ++ Sbjct: 192 FAIGVGQVDFNT------LKSIGSEPHEDHVFLVANFSQIET 227 >gi|14248635|gb|AAK57603.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V L +I S+ DT+ + F + G R Sbjct: 1 SGSIGYPNWITKVIPMLNGLINSLSLS---RDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A+ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/141 (10%), Positives = 43/141 (30%), Gaps = 13/141 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + K +++D I + + + + ++ Sbjct: 77 IDRSGSMGERYDGSRQTKMEVLKDTAKDFITQLSAQSPASQ---VSVVSYASDSKTNIGL 133 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + G+T + ++ AY + +++ K++++ Sbjct: 134 TSLDTQENIQSLNRAIDKLWASGATRSDLGLEDAYSVLGAADSGN----------KQFVI 183 Query: 373 LLTDGENTQDNEEGIAICNKA 393 LTDGE + I +A Sbjct: 184 FLTDGEPNSYSGFDREIAARA 204 >gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 687 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 44/155 (28%), Gaps = 20/155 (12%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 K N R+ FND + V + G T + A+Q+ Sbjct: 333 KGFAKFNRNDRVRVIMFNDNATEVTNGWVQATPENLQQVVGAVENAGPSGGTNLMSAIQS 392 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + + + I L+TDGE + A + + IR+ T Sbjct: 393 ALTGLDADRTNA-------------IWLVTDGEANVGETKQKAFIELLEKKDIRLFT--- 436 Query: 406 SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 R L S +++ ++ Sbjct: 437 --FIMGNSANRPLLEAITKHSNGFAISVSNSDDII 469 >gi|17545367|ref|NP_518769.1| transmembrane protein [Ralstonia solanacearum GMI1000] gi|17427659|emb|CAD14178.1| putative transmembrane protein [Ralstonia solanacearum GMI1000] Length = 347 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 40/330 (12%), Positives = 93/330 (28%), Gaps = 4/330 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G +I ++ + VG + VDV R + L+ AA A + + L + Sbjct: 11 ARRQRGAVGLIMPAMLIAIFSVGALAVDVARLIVVRNELQNAADAAALAGAAGLYPTNPT 70 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-VVL 133 + + K+ + + + + + P + V + Sbjct: 71 PNWSNGVAQGTSAVKLNKSSNVQLVSGTVQAGYWNLTGTPAGLQSQSITPGANDVPAVQV 130 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD-Y 192 + N + I L + + G + + S+ + D Y Sbjct: 131 TINRSAGNNGGPVATLLAPIFGALSANSSATAVAVIAAPGSAGPGALFPLAISKCLYDLY 190 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + GQP + +S + + + + S L + Sbjct: 191 WNYATGQPKIDPSTGKPYVFQINTSYPSSSNCTSGEWTGFNGPTDASTLKGLVQNGNTST 250 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 LS + ++++ V ++ + + +I ++ +N F + Sbjct: 251 LSIGNMINTALASGVKSSVYMAIPST-PLNVTIPVVNPLNPGASEPIYAFA-GIQITKVV 308 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 G + I + N G T Sbjct: 309 CCGTNSTIWASFISNWKVSNSGGGTGPYYG 338 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 54/167 (32%), Gaps = 26/167 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ R + G T+I +++A+ I Sbjct: 343 WVGMVTFDSAAYVQSELVQINSATERDTLTKSLPT-TASGGTSICSGLRSAFTVI----- 396 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K I+LLTDGE+ I+ C ++ K G I T+A + Q+ + Sbjct: 397 -----KKKYPTDGAEIILLTDGEDN-----TISACFDEVKQSGAIIHTVALGPSAAQELE 446 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDRI--GNEIFERVIRI 456 + + A+ + L F + +R I++ Sbjct: 447 QMSKM----TGGLQTYASDQVQNNGLVDAFAALSSGNKAVSQRSIQL 489 >gi|329850149|ref|ZP_08264995.1| hypothetical protein ABI_30470 [Asticcacaulis biprosthecum C19] gi|328842060|gb|EGF91630.1| hypothetical protein ABI_30470 [Asticcacaulis biprosthecum C19] Length = 399 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/207 (10%), Positives = 55/207 (26%), Gaps = 6/207 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + +G +I A+ +L G ++ S ++ L+ AA + + L + Sbjct: 8 NMWRCESGQSAMIFAIAAVPLLVAIGGALEYSDMSGLKNRLQTAADAGALAGAGRLSLAS 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 A+ Q + F ++ + V + + Sbjct: 68 TTGDEDARRVAIRTAQDNLKGDSATFTVDIDRTAGTLTVNAAAEHHGITGVLGDKVLRAS 127 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 L L + + ++ A + H + +S+ + ++ Sbjct: 128 AVGEA---LQKTPLCVLQVETGGIVLSQTAAIKAPGCMIHANNNISVTQSAMITGKLIQA 184 Query: 193 Q---RDSEGQPLNCFGQPADRTVKSYS 216 + N P + Sbjct: 185 SGTVTGATNPAGNGGALPIPDPFNGMN 211 >gi|302392065|ref|YP_003827885.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] gi|302204142|gb|ADL12820.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] Length = 230 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 58/213 (27%), Gaps = 25/213 (11%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 G + + V + + + + AL Sbjct: 22 GITELPFNQPQNMKVDQRQLPTFTDANNVEIIWDVSGSMWGKVEKNKKYLR---AKKALK 78 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ I + + + + + + ++ + + G + I Sbjct: 79 DIVMMI----PDHVNIGLRTFGKGEDSDRSTNLVVNIATNNKSTLLAKINNLKPAGKSPI 134 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+ A NL+ +I+L+TDG +T N + ++ GI Sbjct: 135 GKALSQA------------GVDLINLDGNNHILLVTDGRDT-GNIVPSRVADRLSKNGIY 181 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFE 430 I + QQ R L + A +F Sbjct: 182 IHVLKIGAVGNQQ---RAVLKSIARLGGGRYFT 211 >gi|83648073|ref|YP_436508.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83636116|gb|ABC32083.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 261 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 58/201 (28%), Gaps = 22/201 (10%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 +D++ +K + + A+ + + +G F+ + Sbjct: 72 KNYYIVFDGSGSMDNTDCGDGKRKLDVAKTAVKKFVEQL------PADANVGVYAFDGQG 125 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + R +VK G T ++ ++ + K Sbjct: 126 VGE---RTHLATQNRPLVKQMIDQLVAGGGTPLSAGLEDGKAALT------AQAGKQLGY 176 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + ++V++TDG + E A+ + I I TI F + Sbjct: 177 GEYHLVIITDGLASLGYETDGAVQTILQDTPINIHTIGFCIGDDHSLHQPGL-------T 229 Query: 427 SFFEANSTHELNKIFRDRIGN 447 + A+ L + + Sbjct: 230 FYRSASDPDSLMAGLNEVLAE 250 >gi|330688327|gb|AEC32934.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 474 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 12/129 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGSTAIND 341 I + DT+ + F + G R + + G+T + Sbjct: 5 INSLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKRQALSKVTELRKTYSPYGTTNMTA 64 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ + N +A + ++L+TDG + + K K + + + Sbjct: 65 ALDEVQKHLNDRV--------NREKAIQLVILMTDGVPNSK-YRALEVAKKLKQRNVSLA 115 Query: 402 TIAFSVNKT 410 I Sbjct: 116 VIGIGQGIN 124 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/220 (10%), Positives = 55/220 (25%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + +A ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLVAQLVDKM-------QNDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ TAI A+ A + + I+++T Sbjct: 150 YISA-KMFLESIDPSLISKQGTAIGAAINLATRSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A +GI++ + + Sbjct: 198 DGENHEGGAV--EAAKAATEKGIQVNVLGVGMPDGAPIPMEGTNDYRRDRDGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + + ++T+ K I V Sbjct: 256 EEMCQEIAKAGNGIYVRVDNTNSAQKAISQEINKMAKADV 295 >gi|288802179|ref|ZP_06407619.1| BatB protein [Prevotella melaninogenica D18] gi|288335146|gb|EFC73581.1| BatB protein [Prevotella melaninogenica D18] Length = 331 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 37/119 (31%), Gaps = 14/119 (11%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + + T I A+Q + ++ Sbjct: 134 LIVFAGDAFVQLPITSDYVSA-KMFLDNINPSLIGTQGTDIGKALQLSMNSFTP------ 186 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 N + K I+L+TDGE+ + + +A+S+GIR+ + + Sbjct: 187 -----NSKVGKAIILITDGEDNEGGA--EEMAKQAQSKGIRVFILGVGSTEGATIPMPD 238 >gi|332831234|ref|XP_528236.3| PREDICTED: collagen alpha-1(XXII) chain, partial [Pan troglodytes] Length = 695 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 65/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S R K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPRAGGRPRD---RAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + E + L AS F Sbjct: 148 LLTDG---RSQDLVLDAAAAAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|156324784|ref|XP_001618484.1| hypothetical protein NEMVEDRAFT_v1g2784 [Nematostella vectensis] gi|156199073|gb|EDO26384.1| predicted protein [Nematostella vectensis] Length = 410 Score = 51.5 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 68/217 (31%), Gaps = 16/217 (7%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + N + + ++ Y + FV S + Sbjct: 194 NHLNDQQSDVHSPHNHDSCDHSQFCEYQAGLTENSRLAVDLAFVLDGSTSINNADPGNFQ 253 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +I +K ++ +G ++ + F++ + +IVK + G Sbjct: 254 LLKNFMINIVKSFKISSERTHVGLVLYS--FFTQLMFNFDKYSDSASIVKAINTTDYPKG 311 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T +A++ A + + + K +++LTDG ++ E K Sbjct: 312 GTRTGEALKMAKSQLFGA---------SMRSVPKVLIVLTDGRSSDKVEAPS---KALKD 359 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 +G+ I + E +++ + + F+A Sbjct: 360 EGVVIFAVGVGDQIDPSELN--VMASDSKSDHVFKAK 394 >gi|330818825|ref|YP_004351042.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] gi|327374367|gb|AEA65719.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] Length = 660 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/146 (23%), Positives = 47/146 (32%), Gaps = 16/146 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 V + SF G ++ R + STA AM A +I Sbjct: 521 DVPFEVISYAAGYQIIKSFDDGWEEVSRRSHAVA----SVGRSTATGMAMTVALSNLI-- 574 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 E ++ +VLLTDG D A AK G+ ++TI + Sbjct: 575 ---------LRDEDRRMMVLLTDGAAG-DPVMTAASYQAAKEAGVEVVTIFIGRDIQAIA 624 Query: 414 KARYFLSNCASPNSFFEANSTHELNK 439 R L+ F NS EL K Sbjct: 625 LTRSILNATGFGQHFSNVNSPDELAK 650 >gi|149624864|ref|XP_001517479.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1), partial [Ornithorhynchus anatinus] Length = 249 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 50/172 (29%), Gaps = 22/172 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 + + I+ +D + R+G + V + + + T Sbjct: 50 LTQVIESLDVGPNATRVGVVNYASAVKHEFPLK---AHRSKASLLQAVRRLEPLSTGTMT 106 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+Q A S E K +++TDG + ++ + +A+ GI Sbjct: 107 GLAIQFAISRAFSEVEGARPLS---PALSKVAIVVTDG---RPQDDVKDVSARAREAGIE 160 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE------LNKIFRDRI 445 + I + L AS + L K F++ Sbjct: 161 LFAIGVGR------VDKTTLRRIASEPLAEHVDYVESYSVIEKLAKKFQEAF 206 >gi|218559186|ref|YP_002392099.1| hypothetical protein ECS88_2420 [Escherichia coli S88] gi|218365955|emb|CAR03699.1| conserved hypothetical protein [Escherichia coli S88] Length = 580 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 92/322 (28%), Gaps = 30/322 (9%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + NP + + + T S +++ ++ + Sbjct: 99 ASQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ-GLLPPPDAVR 157 Query: 197 EGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F D K + + + + Sbjct: 158 VEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEEL 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ + Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIAL 271 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G H + + + GST ++ AY ++ + Sbjct: 272 PSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIKGGINR------- 319 Query: 371 IVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++ + + K + G+ + T+ + + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIAD--VGNGN 377 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 + ++ E K+ + + Sbjct: 378 YSYIDTLSEAQKVLNSEMRQTL 399 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/108 (27%), Positives = 43/108 (39%), Gaps = 12/108 (11%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T N A++T D S + +VL TDG E+ Sbjct: 465 SGTTFTNGALKTMADLFQKSKRADAKLK---------VVLFTDG---YSAEDTSEGAEAL 512 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 KSQG+ + T+ S K+ + SPN FF A+ EL+K F Sbjct: 513 KSQGVVVYTVGISTEKSTGLNMKELHGMATSPNHFFNASDFVELSKNF 560 >gi|47551047|ref|NP_999700.1| complement factor B [Strongylocentrotus purpuratus] gi|3928787|gb|AAC79682.1| factor B SpBf [Strongylocentrotus purpuratus] Length = 833 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 48/154 (31%), Gaps = 16/154 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+GA ++ + + + + A T + A++T I + Sbjct: 428 TRVGAVSYSSEARRLFNVNDFTSTVD-VVKAIEANVNYTNKGTNLPAALETIGVMITETA 486 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK-SQGIRIMTIAFSVNKTQQE 413 ++ K+ + ++TDG + K ++I I + Sbjct: 487 DES-----GYSSRKRILFIITDGFSNVGGA-PSKSAQPLKEDAALKIHCIGI-----SRN 535 Query: 414 KARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + L+ ASP F + +EL + Sbjct: 536 TDKTALAEIASPPVSEHVFYLSDYNELERAVEAI 569 >gi|73974062|ref|XP_548552.2| PREDICTED: similar to matrilin 2 isoform a precursor [Canis familiaris] Length = 978 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 55/169 (32%), Gaps = 20/169 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ ++ +D D R+G + + ++ S K + + + T Sbjct: 116 ILDILQFLDISPDLTRVGLLQYGSTIKNEFSLK--TFKKKSEVERAVKRMRHLSTGTMTG 173 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+Q A + S E + I+++TDG + + + KA+ GI I Sbjct: 174 LAIQYALNIAFSEAEGARP---LRENVLRVIMIVTDG---RPQDSVAEVAAKARDTGILI 227 Query: 401 MTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRD 443 I + L S + F + L +F+ Sbjct: 228 FAIGVG------QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQK 270 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 61/185 (32%), Gaps = 23/185 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+ +A +I S+ R+G ++ +V ++ + R + K A Sbjct: 707 FEIVKQFVAGIIDSLAVSPKAA---RVGLLQYSTQVRTEFTLGDFSSA--RDMKKAVAHM 761 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + + A++ ++ + E + ++ TDG D A Sbjct: 762 KYMGKGSMTGLALKHMFERSFNPVEGARPVSPG---VSRVAIVFTDGRAQDD---VSAWA 815 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +A++ GI + + + L AS F A E+++ + Sbjct: 816 RRARAGGITMYAVGVG------KAIEEELQEIASEPTDKHLFYAEDFSTMGEISEKLKKG 869 Query: 445 IGNEI 449 I + Sbjct: 870 ICEAL 874 >gi|330688329|gb|AEC32935.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 474 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 12/129 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGSTAIND 341 I + DT+ + F + G R + + G+T + Sbjct: 5 INSLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKRQALSKVTELRKTYSPYGTTNMTA 64 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ + N +A + ++L+TDG + + K K + + + Sbjct: 65 ALDEVQKHLNDRV--------NREKAIQLVILMTDGVPNSK-YRALEVAKKLKQRNVSLA 115 Query: 402 TIAFSVNKT 410 I Sbjct: 116 VIGIGQGIN 124 >gi|316975135|gb|EFV58594.1| putative calcium binding EGF domain protein [Trichinella spiralis] Length = 2537 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/280 (13%), Positives = 77/280 (27%), Gaps = 23/280 (8%) Query: 183 IDFSRSML-DYQRDSEGQPLNCFGQPADRTVKSYSSQN---GKVGIRDEKLSPYMVSCNK 238 +D S + R C + + G + + E + Sbjct: 405 VDVSSTYGLPPGRQCSNTTKMCARPETNSCDPEAVCIDKPDGYICVCPEGYVDVSSNAKL 464 Query: 239 --SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 L +D S + + V + D D R Sbjct: 465 QPGRVCTLQTLCPTQPTDLVFLIDGSGSIGTEIFY---KYVMRFVHEFVTLFDINEDRTR 521 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + +V + HK I + + G T A+Q ++ Sbjct: 522 VGIIQYTGQVKPEFYL--NQHKNIDQLQQAIRNIRYVGGLTKTGAALQFMTKNTFTAQMG 579 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + K V++TDG + + ++A+ I + + Sbjct: 580 AR---TRDPNVYKIGVVITDGRAQDNVKIP---ADEARRHNISLYAVGV-----TNHVLE 628 Query: 417 YFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L A + +F + ELN R +I E+ + +++ Sbjct: 629 SELEQIAGSKDRYFIVGTFAELNTRLRAKIQKEMCKGIVK 668 >gi|307294186|ref|ZP_07574030.1| TadE family protein [Sphingobium chlorophenolicum L-1] gi|306880337|gb|EFN11554.1| TadE family protein [Sphingobium chlorophenolicum L-1] Length = 126 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 13/120 (10%), Positives = 34/120 (28%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +LI++ G I L +P L + + + + L+ A AS+ + Sbjct: 3 RLIRNECGAAAIEFVLALPPFLILLMGAIQLGVIACARTGLQHAVDEGARYASIYPTPTD 62 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++++R + +T+ + + + Sbjct: 63 AQITARVTSREFGLDPAYSNVPTITHGTQYGVPYTEITMTYSRPLNFIVFQTAPLSISYT 122 >gi|297683708|ref|XP_002819511.1| PREDICTED: collagen alpha-1(XXII) chain-like [Pongo abelii] Length = 259 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 65/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 12 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 67 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + K+ + Sbjct: 68 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPRTGGRP---GDRAYKQVAI 122 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + E + L AS F Sbjct: 123 LLTDG---RSQDLVLDAAAAAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 173 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 174 HVSDFNAIDKI 184 >gi|254229333|ref|ZP_04922750.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262393419|ref|YP_003285273.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] gi|151938141|gb|EDN56982.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262337013|gb|ACY50808.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] Length = 422 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/244 (9%), Positives = 52/244 (21%), Gaps = 10/244 (4%) Query: 35 GVGGMLVDVVRWSYYEHALKQAAQTAIITAS-VPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 GV +D+ + L+ A TA + + V + A + Sbjct: 30 GVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVNQAETAVRTTLGGISTEPGN 89 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 F ++ +++ V + + V ++ ++ GI Sbjct: 90 SELTFSDSNTAVTFSHDMQTFVSAASFTPPLGEYDIYVRVAVT---DISLTQYLSGLFGI 146 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + A A + + + I D + + D + Sbjct: 147 DK-EVSASAVAGRSAAIAYTCNLTPIAMCGDPAGDVSDAWGYRPPGYDPNVDMDPSLVHE 205 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + P G + + + Sbjct: 206 LKVGDQNNTD-----MGPGNFQLLDFGQGTGNSGAALVRDALSGAYNGCASIGDTVTTKP 260 Query: 274 VRDA 277 A Sbjct: 261 GNSA 264 >gi|146304257|ref|YP_001191573.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702507|gb|ABP95649.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 383 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 44/121 (36%), Gaps = 16/121 (13%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + G TA A+ TA++ N Y++LLTDG Sbjct: 91 VDPEDLTAEISSLSAGGQTAFFTALLTAFNL------------HNKHGIPSYVILLTDGN 138 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T D+ A G++ TI+F + E L++ S F+ N E+ Sbjct: 139 PT-DDTNVETYKRIAIPNGVQ--TISFGLGDDYNETILKSLADR-SGGVFYHVNDAMEIP 194 Query: 439 K 439 + Sbjct: 195 E 195 >gi|125829720|ref|XP_698253.2| PREDICTED: collagen alpha-1(VI) chain [Danio rerio] Length = 1000 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 61/188 (32%), Gaps = 28/188 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + ++ I +K + D A ++D R +K Sbjct: 56 INQIKTFTKLFIDELKDLRQPCDRDVTWNSGALHYSDDTELVMGLVDL--NTKRADLKAA 113 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +G T + A++ ++ R ++ KYIV++TDG +E Sbjct: 114 IDRIKYIGKGTYTDCAIKEGIAELL--------RAGSHYHENKYIVVVTDGHPITGYKEP 165 Query: 387 ----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEA--NSTHEL 437 N+A+ I++ +A ++ LS A+ ++ F A NS Sbjct: 166 CGGIQEAANEARQHAIKVFAVAI-----SPDQEDTRLSVIATDINYRQNFTAADNSRSTQ 220 Query: 438 NKIFRDRI 445 R I Sbjct: 221 MSTIRSII 228 >gi|125540180|gb|EAY86575.1| hypothetical protein OsI_07955 [Oryza sativa Indica Group] Length = 477 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 44/155 (28%), Gaps = 31/155 (20%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + K+ G T I + A + A ++LL+ Sbjct: 168 MSDDGKASAKSAVESLAAGGGTNILKGLVEAAKVFD---------GRRYRNAVASVILLS 218 Query: 376 DGENTQDNEEGIAICNKAK--------------SQGIRIMTIAFSVNKTQQEKARYFLSN 421 DG++T + G + +K + + + T F + + Sbjct: 219 DGQDTYNVNSGWG-ASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDAAA-----MHA 272 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A + +F + + F IG + V Sbjct: 273 IAEETGGTFSFIENQAVVQDAFAQCIGGLLSVTVQ 307 >gi|26248659|ref|NP_754699.1| hypothetical protein c2813 [Escherichia coli CFT073] gi|91211565|ref|YP_541551.1| hypothetical protein UTI89_C2553 [Escherichia coli UTI89] gi|117624462|ref|YP_853375.1| hypothetical protein APECO1_4292 [Escherichia coli APEC O1] gi|227887329|ref|ZP_04005134.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|300983332|ref|ZP_07176546.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301049025|ref|ZP_07196011.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|26109064|gb|AAN81267.1|AE016763_226 Hypothetical protein yfbK [Escherichia coli CFT073] gi|91073139|gb|ABE08020.1| hypothetical protein YfbK [Escherichia coli UTI89] gi|115513586|gb|ABJ01661.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|227835679|gb|EEJ46145.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|294491429|gb|ADE90185.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|300299173|gb|EFJ55558.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300408590|gb|EFJ92128.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554335|gb|ADN47110.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] gi|307626191|gb|ADN70495.1| hypothetical protein UM146_05450 [Escherichia coli UM146] gi|315285863|gb|EFU45301.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|315292192|gb|EFU51544.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|323952065|gb|EGB47939.1| von Willebrand protein type A [Escherichia coli H252] gi|323956039|gb|EGB51792.1| von Willebrand protein type A [Escherichia coli H263] Length = 580 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 92/322 (28%), Gaps = 30/322 (9%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + NP + + + T S +++ ++ + Sbjct: 99 ASQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ-GLLPPPDAVR 157 Query: 197 EGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F D K + + + + Sbjct: 158 VEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEEL 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ + Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIAL 271 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G H + + + GST ++ AY ++ + Sbjct: 272 PSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIKGGINR------- 319 Query: 371 IVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++ + + K + G+ + T+ + + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIAD--VGNGN 377 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 + ++ E K+ + + Sbjct: 378 YSYIDTLSEAQKVLNSEMRQTL 399 >gi|332255311|ref|XP_003276777.1| PREDICTED: collagen alpha-1(XXII) chain-like, partial [Nomascus leucogenys] Length = 695 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 65/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPRAGGRP---GDRAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + E + L AS F Sbjct: 148 LLTDG---RSQDLVLDAAAAAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|171320626|ref|ZP_02909646.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5] gi|171094139|gb|EDT39226.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5] Length = 346 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/282 (9%), Positives = 74/282 (26%), Gaps = 6/282 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEE 74 S G + + + +L G + +D+ + + L+ AA A + A+ ++ Sbjct: 14 SERGSVALFFVMFLIPLLSFGALAIDIAWVATVRNQLQNAADAAALAAADAMMSPTGGPL 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S+A + + + + + V ++ Sbjct: 74 NWSQAAPAANGVIAQNSAAGAALSTGTVTTGYWNVTRNPASMQPTTITPGAYDVPAVQVT 133 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-LDYQ 193 +N S+ L G+ + + + + + + + Y Sbjct: 134 VSRAPGINGGSIPLLLGGLLGIPGTSGSATAIAVLAAPGGVAAGGLFPVAIDQCVYNQYW 193 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYPGPLD 250 QPL + Q + + + + + M+ P Sbjct: 194 NAVTNQPLINPLTGLPFEFSITNGQTYGALCMGGQWTSFQSTATDTTTMSGLMVTGNPTT 253 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ + + + + + + V+ V Sbjct: 254 LNIGDSINLATGVKATLYTSVPIGATVVLPVVTQTSSSTTVP 295 >gi|167744177|ref|ZP_02416951.1| hypothetical protein Bpse14_39258 [Burkholderia pseudomallei 14] gi|167851185|ref|ZP_02476693.1| hypothetical protein BpseB_38456 [Burkholderia pseudomallei B7210] gi|167916479|ref|ZP_02503570.1| hypothetical protein Bpse112_38762 [Burkholderia pseudomallei 112] Length = 396 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 93/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPGYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 59/160 (36%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I + +D ++ R+G ++ +V S+ S L + + G T Sbjct: 30 MIDILNTLDIGLNSTRVGVVQYSSQVRSEFSLRSH-ASLDSMVKAIQEMVPLAQG-TMTG 87 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ + ++ E + ++ N +V++TDG + + + +A+ +G+ I Sbjct: 88 LAIRYTMNVAFTAAEGDRPKVPN------VVVIVTDG---RPQDRVAEVAAEARERGMEI 138 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 139 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 172 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 46/134 (34%), Gaps = 9/134 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I ++ V +G ++ V + + R++ G+TA DA+ Sbjct: 416 INGSKYISSDVSIGLVSYSTDVNINLPIA-KFDLNQRSLFVGAVESLAAGGNTATFDAII 474 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + N AK + +L+DG + I + K+ GI I TI Sbjct: 475 VATKMLKEEKA-------KNPNAKLMLFVLSDGVTNYGHSLND-IKDMMKTFGIPIYTIG 526 Query: 405 FSVNKTQQEKARYF 418 ++ N E Sbjct: 527 YNANIKALETLSQI 540 >gi|315501784|ref|YP_004080671.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408403|gb|ADU06520.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 430 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 62/181 (34%), Gaps = 14/181 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + + A V+ ++ + V D+ V + RT K Sbjct: 61 SRISVAQQAFNEVVDALPDETQLGIRVLGATYRGKDKKQGCLDTQQIVPVGPVDRTQAKA 120 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A+++A + + + IVL+TDGE+T + Sbjct: 121 AVAGLRPTGFTPVGLALRSAAQDLGTG------------STARRIVLITDGEDTCAPPDP 168 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + +QG R++ + ++ + + A+ ++ A S EL + + Sbjct: 169 CEVARELAAQGTRLVVDTLGLAPDEKVRKQLLCIAGATGGTYTAAQSADELTGRIKQLVD 228 Query: 447 N 447 Sbjct: 229 R 229 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 45/153 (29%), Gaps = 21/153 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + V + + + ++ K +G F + Sbjct: 98 DISNSMLATDVVPSRLDKSKLMVEGLMNKFTKNK-------LGLIVFAGDAFVQLPITSD 150 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + T I A+ A + N + K IV++T Sbjct: 151 YVSA-KMFLDNINPSLIGTQGTDIGKAINLAMHSFTP-----------NTQTGKAIVVIT 198 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 DGE+ + A+ +A+ +GI++ + Sbjct: 199 DGEDNEGGA--EAMAKQAQEKGIKVFILGIGST 229 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 110/361 (30%), Gaps = 20/361 (5%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 +E Y ++ N ++ + +++ ++ + AY+ + + + S Sbjct: 527 LEHYDSVMKIYSIFTNLSNYDRTVLLKYQGGKLLGQVEAYKKRVEA-LKESADAASRPSG 585 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 ++ Y + V + + S + D + A Sbjct: 586 KDRYTKVMVTLVRPGGEPVTDYQGTVKIKFDGV-EKTASFITNTSDPLNNTGSPGTAVAY 644 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 Y + + + Y + ++ ++ + Sbjct: 645 FDSIIYGKSKVEATLVNPIDPRYATILKGLKDKTVTKDIFTNPNFSKNSCSLATEMAYVV 704 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 A+ + + + + + N + ++ + ++K Sbjct: 705 DYSSSMKAVD---PTNYRGKKMIEFINQLKAKNNIVIETNTKATILGEGTTDAVLKKDLY 761 Query: 330 DENEM-GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ++ G+T I + A N+ + K IV+++DG+ + I Sbjct: 762 KASKDKGATDIFAGIDIALTKFS-----------NDTKTAKAIVVVSDGK--TSKSKMIK 808 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 N AK QG++I T++ + LS + +++ A +L+++F+ I Sbjct: 809 AINDAKKQGVKIYTVSMGKKSQINDATLMQLST-ETGGAYYHALDNLQLHQVFQKLIDAI 867 Query: 449 I 449 + Sbjct: 868 L 868 >gi|294673503|ref|YP_003574119.1| BatB/BatC protein [Prevotella ruminicola 23] gi|294471951|gb|ADE81340.1| putative BatB/BatC protein [Prevotella ruminicola 23] Length = 566 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 19/210 (9%), Positives = 49/210 (23%), Gaps = 43/210 (20%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + +++ ++G F + + + + Sbjct: 108 TPSRLDRSKMMVENLVDHFT-------NDKIGLLVFAGDAFVQLPITSDYVSA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T I A+ A + K I+++TDGE+ + Sbjct: 160 IDPSMMATQGTDIARAIDMATHSFTQE-----------EGIGKAIIVITDGEDHEGGAL- 207 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ------------EKARYFLSN-----C-----AS 424 AK G+R+ + + +S C A Sbjct: 208 -ESAEAAKKAGMRVYVLGVGSTQGAPIPIPGTGDYMKDNTGNTVMSALNEDMCRQVAQAG 266 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ + + + + Sbjct: 267 GGAYIHVENNSAAQDQLDNELSKLAKKETT 296 >gi|317127857|ref|YP_004094139.1| von Willebrand factor A [Bacillus cellulosilyticus DSM 2522] gi|315472805|gb|ADU29408.1| von Willebrand factor type A [Bacillus cellulosilyticus DSM 2522] Length = 282 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 40/201 (19%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++A VI + D + F+D+V + H + ++K + Sbjct: 63 CKEACNFVINQLTDKD------ILSVVVFDDQVETIIEPQKVTH---KDLLKEYIQRIET 113 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T ++ + ++ ++LL+DG+ + A+ A Sbjct: 114 RGITNLSGGLIQGCQHVLKQEVKNYVNR---------VILLSDGQANAGITDKEALVKLA 164 Query: 394 ---KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR---- 444 +S G+ I T+ + L A +F N + IF Sbjct: 165 DDYQSAGLVISTLGV-----SEHFDEELLEGVADSGRGNFHFINEVENIPSIFEQELDGL 219 Query: 445 ---IGNEIFERVI-----RIT 457 IG I ++ RIT Sbjct: 220 LNVIGQNITLNILPKKGVRIT 240 >gi|218673728|ref|ZP_03523397.1| hypothetical protein RetlG_20318 [Rhizobium etli GR56] Length = 176 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 44/132 (33%), Gaps = 8/132 (6%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M +F + ++L+ G I A+L + + +++V + A + Sbjct: 1 MTARKRFAPF-RRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVNS----ALDAS 55 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + S + + +K + K KI + ++ F + + D+ T+ Sbjct: 56 VHKISRMIRTGE---VASSKITLAGFKSKICDDMLLAFSCSSGLVVKVSVLSDLSSATSA 112 Query: 121 EMNPRKSAYQVV 132 + V Sbjct: 113 DPIDNSGKLTVT 124 >gi|154149676|ref|YP_001403294.1| von Willebrand factor, type A [Candidatus Methanoregula boonei 6A8] gi|153998228|gb|ABS54651.1| von Willebrand factor, type A [Methanoregula boonei 6A8] Length = 1081 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 41/370 (11%), Positives = 89/370 (24%), Gaps = 20/370 (5%) Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 YL + + + + V TA N + VV+ S + + + Sbjct: 477 YLSELTTISSQNVNVNGKFYVTVSLTADGWNLTGTPADVVIVSDLAAGIGGATRLTHTKA 536 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + I+ + V+ + + D S + + R Sbjct: 537 AEVGFIKNATDNTYVALASFGSAPNAGSTPYDSSDTQNLWNLQLSKS----NTTYTYRPF 592 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 Y + + + S P SL V+ ++ + + Sbjct: 593 NPYGNVWDYNLVNPANWNSISSSTAYCFNSSSQKNPSSQSLGLRACVN-ATSPYGYTYLN 651 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 D+ + + T A + + + + + + Sbjct: 652 PWSDS--KIDADLMNAGPTYKTTNQNALVNTVNAYTAYGGTDYAAGINAALQELQSKGNP 709 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-------TQDNEE 385 T I M + + + + +++G N Sbjct: 710 SHNQTII--IMGDGVNMMAPIAPGSFESYWPSDWNPRNGTGISEGPNLWYLDESDVGKAA 767 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQ--QEKARYFLSNCASPN--SFFEANSTHELNKIF 441 + AK+ GI I I F F S +++ A + IF Sbjct: 768 ALNASTTAKNLGITIYGIQFPTPDNYGHNINDTAFFQQMVSSPTSTWYYAPDPTTMTGIF 827 Query: 442 RDRIGNEIFE 451 + G Sbjct: 828 QQIEGQIQNT 837 >gi|310115525|ref|XP_003120135.1| PREDICTED: collagen alpha-1(XXII) chain-like [Homo sapiens] Length = 788 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + R K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPRD---RAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + + A GIRI + E + L AS F Sbjct: 148 LLTDG---RSQDLVLDAAAAAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|294011131|ref|YP_003544591.1| hypothetical protein SJA_C1-11450 [Sphingobium japonicum UT26S] gi|292674461|dbj|BAI95979.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 418 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/241 (9%), Positives = 65/241 (26%), Gaps = 22/241 (9%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK------------------ 54 +L++ TG+ ++ A MP+++G G+ D V+W+ ++ ++ Sbjct: 8 RLLRDRTGNVLMMAAASMPLLVGAAGLATDTVQWTLWKRQIQRQADSAALAGAYAVAQGF 67 Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTF---PKQKIEEYLIRNFENNLKKNFTDREV 111 A+ +A S + +L + + + + +++ + Sbjct: 68 NASDSATADISRMALVALTQTPTIENAPTSGPFAGNAQAVRVVLQTSAELPFSKILGVKA 127 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 I + + V L + + ++G + A S Sbjct: 128 PVIYGEATAAVVGSGDYCVVSLEKTSTVGITLQGNATVNLGCGIVTNSRASNAVYAGGSS 187 Query: 172 HK-EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 V+ + S + + T + + Sbjct: 188 TVAATPVAAVGGLTSSSNYVSPTTLLPYSIPVQDPYAGLPTPNVSGCSAKVSVNPQQSVV 247 Query: 231 P 231 Sbjct: 248 S 248 >gi|324006620|gb|EGB75839.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 580 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 92/322 (28%), Gaps = 30/322 (9%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + NP + + + T S +++ ++ + Sbjct: 99 ASQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ-GLLPPPDAVR 157 Query: 197 EGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F D K + + + + Sbjct: 158 VEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEEL 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ + Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIAL 271 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G H + + + GST ++ AY ++ + Sbjct: 272 PSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIKGGINR------- 319 Query: 371 IVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++ + + K + G+ + T+ + + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIAD--VGNGN 377 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 + ++ E K+ + + Sbjct: 378 YSYIDTLSEAQKVLNSEMRQTL 399 >gi|313239872|emb|CBY14719.1| unnamed protein product [Oikopleura dioica] Length = 982 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 64/189 (33%), Gaps = 27/189 (14%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI--RTI 323 + ++ VR + ++ + ++ T R+G ++D + S G+ K + + Sbjct: 14 NPLEDFEKVRQWIGKLVDTFDIEEDGGGT-RVGVVIYSDAPRMEISLGNGLGKTDLIKAV 72 Query: 324 VKTFAID---------ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + E G+T ++++ A + S + + +++L Sbjct: 73 LVIYLNLIDFLFAQSLMYERGNTLTGESIRYASEVAFSETSGARALSEG---INRIMIVL 129 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEA 431 TDG + I A+ GI I + + L AS + F Sbjct: 130 TDGRAQDNVAGPAVI---AQEDGIVIYAVGVG------HAIKDELDEIASKPTHRHKFSV 180 Query: 432 NSTHELNKI 440 + + I Sbjct: 181 SEYGAIESI 189 >gi|313212957|emb|CBY36854.1| unnamed protein product [Oikopleura dioica] Length = 1117 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 64/189 (33%), Gaps = 27/189 (14%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI--RTI 323 + ++ VR + ++ + ++ T R+G ++D + S G+ K + + Sbjct: 14 NPLEDFEKVRQWIGKLVDTFDIEEDGGGT-RVGVVIYSDAPRMEISLGNGLGKTDLIKAV 72 Query: 324 VKTFAID---------ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + E G+T ++++ A + S + + +++L Sbjct: 73 LVIYLNLIDFLFAQSLMYERGNTLTGESIRYASEVAFSETSGARALSEG---INRIMIVL 129 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEA 431 TDG + I A+ GI I + + L AS + F Sbjct: 130 TDGRAQDNVAGPAVI---AQEDGIVIYAVGVG------HAIKDELDEIASKPTHRHKFSV 180 Query: 432 NSTHELNKI 440 + + I Sbjct: 181 SEYGAIESI 189 >gi|320160484|ref|YP_004173708.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] gi|319994337|dbj|BAJ63108.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] Length = 802 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 63/194 (32%), Gaps = 31/194 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + AL V+ + D R F+++V GV + + + + Sbjct: 313 KFQQAKQALEYVLSRLNPQD------RFNLLSFSNQVEVFAPEMEGVEAIPQA--QKWVA 364 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T I+ A+ A + S Y++ LTDG T + I Sbjct: 365 GLSAAGGTNIHRALLDAIQFVRSQRPT-------------YLIFLTDGLPTVGITDREQI 411 Query: 390 CNK-AKS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI---FRD 443 + A+ +G+R+ + L A +LN+ F + Sbjct: 412 LDDFARQAPRGLRLFVFGVGYDVDTFLLDELAL---AHHGLSLYVRPEEDLNQAVAGFFE 468 Query: 444 RIGNEIFERVIRIT 457 +I + + +T Sbjct: 469 KISTPVLTDLT-LT 481 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 62/197 (31%), Gaps = 30/197 (15%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + K + ++ + +I + ++ TV + I + Sbjct: 21 DGSNSMWGQVDGEAKITIAKEVMTDLITNWDDSVDLGLTVYGHRRKGDCADIEVVAM--- 77 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 K+ R + G T I+ + A ++ K +VL++ Sbjct: 78 PGKVDRQALIDKVQSITPRGKTPISKTLSLAALSV------------GFFSGKSSVVLVS 125 Query: 376 DGENTQDNEEGIAICNKAKSQGIR-----IMTIAFSVNKTQQEKARYFLSNCAS--PNSF 428 DG T + C +AKS GI + I F E+ L A+ F Sbjct: 126 DGLETCN----ADPCAQAKSLGIINPGFDVHVIGF----DVTEEEFKSLQCIATETGGKF 177 Query: 429 FEANSTHELNKIFRDRI 445 F AN+ EL R + Sbjct: 178 FRANNAEELKDALRQTV 194 >gi|158335198|ref|YP_001516370.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305439|gb|ABW27056.1| von Willebrand factor type A domain protein, putative [Acaryochloris marina MBIC11017] Length = 573 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 65/208 (31%), Gaps = 28/208 (13%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + VDSS K V+ L + ++ + + V F+ Sbjct: 383 SWQTAAKKPSQVVIVVDSSGSMTG-TKLAAVQSTLQTYLKGLGPKEKVT------LIDFD 435 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 V S + + F + G+T + D++ A + + Sbjct: 436 SVVRKPVSVDGSPEGQSKGLE--FVVALKADGNTKLYDSILAAQTWLTQNLRPNAINA-- 491 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG------IRIMTIAFSVNKTQQEKARY 417 +++LTDGE++ ++ + + K G I I T+ + + A Sbjct: 492 -------VIVLTDGEDSGSGQQLPQLLSALKKSGFEGEQRIAIFTVGYG---NAGDFAPD 541 Query: 418 FLSNCA-SPNSFFEANSTHELNKIFRDR 444 L A + ++ + ++ D Sbjct: 542 VLKQIAEANGGYYRQGDPASIAQLMADL 569 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 53/185 (28%), Gaps = 22/185 (11%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + K+ + V S S + + K++ N G TA + Sbjct: 96 MTNAKEAAKGFVDLMDMTKHRVAVVDFSSSASSFPFTVDKDAAKSYINTINSGGGTATGN 155 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------AICNKAKS 395 A+ A + + EA+ IVL+TDG T+ + AK Sbjct: 156 AIDAAVALLAD----------HRTEAQPVIVLMTDGAATESPKNTDPFDYALQRAQAAKD 205 Query: 396 QGIRIMTIAFSVNKTQQ--EKARYFLSNCASPNSFFEANSTH----ELNKIFRDRIGNEI 449 G+ TIA + N A+ + ++ IG Sbjct: 206 AGVIFYTIALLNPNEDPITSAPNVLMKNMATTATHHHFVLGSKGLNQIYAAIVKEIGMAS 265 Query: 450 FERVI 454 V Sbjct: 266 AYDVT 270 >gi|303241521|ref|ZP_07328022.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] gi|302590939|gb|EFL60686.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] Length = 1061 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 23/101 (22%), Positives = 43/101 (42%), Gaps = 3/101 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 L K G + A+++ VML + G+LVD R E ++ + A + L Sbjct: 2 NLFKKNEGAITVYLAIILSVMLILTGVLVDGARARTAEAQVQSTTEAAANSLLANYNNIL 61 Query: 73 EE---VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 +E + S ++N+ ++++ YL RN L + Sbjct: 62 KEWFGLMSLSENNSDVLEEELMYYLNRNLMTELGAEKANLS 102 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 63/188 (33%), Gaps = 14/188 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKT 326 + + A V+ ++ + + +DR D + V L RT KT Sbjct: 60 SRMSAAKQAFNDVLDAVPEEVQLGIRTLGANYPGDDRKVGCKDTKQLYPVGPLDRTEAKT 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ A D + +A + IVL+TDGE+T + Sbjct: 120 AVATLAPTGWTPIGPALLGAADDLD------------GGDATRRIVLITDGEDTCGPLDP 167 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + ++GI ++ + + + + A+ ++ EL+ + + Sbjct: 168 CEVARDIAARGIHLVIDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHADELSGRVKQLVD 227 Query: 447 NEIFERVI 454 + Sbjct: 228 RAAEPTIT 235 >gi|170703647|ref|ZP_02894384.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170131446|gb|EDT00037.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 346 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 27/282 (9%), Positives = 76/282 (26%), Gaps = 6/282 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEE 74 S G + + + +L G + +D+ + + L+ AA A + A+ ++ Sbjct: 14 SERGSVALFFLMFLIPLLSFGALAIDIAWVATVRNQLQNAADAAALAAADAMMSPTGGPL 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S+A + + + + + V ++ Sbjct: 74 NWSQAAPAANGVIAQNSAAGAALATGTVTTGYWNVTRNPATMQPTTITPGAYDVPAVQVT 133 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-LDYQ 193 +N S+ L G+ + + + + I + + Y Sbjct: 134 VSRAPGINGGSIPLLLGGLLGVPGTSGSATAIAVLAAPGGVAAGGLFPIAIDQCVYNQYW 193 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYPGPLD 250 + QPL + Q + + + + + M+ P Sbjct: 194 NAATNQPLINPLTGQPYEFSITNGQTYGALCMGGQWTSFQSTATDTTTMGGLMVTGNPTT 253 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ + + + + + + V+ +V Sbjct: 254 LNIGDTINLATGVKATLYTSVPVGATVVLPVVTQTSSSTSVP 295 >gi|86748912|ref|YP_485408.1| TadE-like [Rhodopseudomonas palustris HaA2] gi|86571940|gb|ABD06497.1| TadE-like [Rhodopseudomonas palustris HaA2] Length = 181 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 16/178 (8%), Positives = 56/178 (31%), Gaps = 1/178 (0%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTA 60 + ++ + +++ ++ G I A++ P+ + + +++ L+ A +A Sbjct: 4 LLLSRAVLLARRFSRNRRGSAAIEFAMIAPIFIALLFAIIETAFVFLASQVLETAVQDSA 63 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + + S + K + + +F+ + + + Sbjct: 64 RLILTGQAQAASYTQSQFKTDLCNRLKALFSCDGVYVDVQSYGSDFSTVSITTPIDSSKN 123 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 +N K + LF+ +G + + + ++ E + Sbjct: 124 FVNTMKYSPGAAGDIVVVRAFYQWPLFVTGLGWNIANLADSKRLLSATAAFRNEPYSN 181 >gi|118356063|ref|XP_001011290.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293057|gb|EAR91045.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 520 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 47/158 (29%), Gaps = 16/158 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+ FN + + + I KT + G T I MQ A+D I Sbjct: 129 GDDRISLVGFNSQAKVLLELTQLTKNSKKKIQKT-VDELQAGGGTQIGFGMQKAFDIIKE 187 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK-SQGIRIMTIAFSVNKTQ 411 + I LL+DG++ + N++K I F + Sbjct: 188 RTNSKNLAS---------IFLLSDGQDNCGFSQTQHFMNQSKIEYPFCIDCFGFGDDHDS 238 Query: 412 QEKAR-YFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 ++ L +F +++ F + Sbjct: 239 LTLSKINQLQQ----GTFNFIRDISQIDDAFTIILAGI 272 >gi|14248587|gb|AAK57579.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 42/152 (27%), Gaps = 15/152 (9%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S V + + I + DT+ + F + G R Sbjct: 1 SGSIGYPNWITKV---IPMLNGLINGLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKR 57 Query: 322 TIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + G+T + A++ + N +A + ++L+TDG Sbjct: 58 QALSKVTELRKSYSPYGTTNMTAALEEVQKHLNDRV--------NREKAIQLVILMTDGI 109 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + K K + + + I Sbjct: 110 PNSKYTALE-VAKKLKQRNVSLAVIGIGQGIN 140 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 40/114 (35%), Gaps = 13/114 (11%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T A+ A I + + +L+TDG + N + + + +A Sbjct: 124 GQGTYSTYAISNATQLFIRETSGQS---------VRVSLLMTDGSDHPRNPDIMTVVAEA 174 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN--SFFEANSTHELNKIFRDRI 445 KS I+I I S+ + L AS +F + + L + +I Sbjct: 175 KSHNIKIFAIGLSMRA--MDSNSAKLRAVASSPAQQYFHSLTDRGLEERLLQQI 226 >gi|296224421|ref|XP_002758053.1| PREDICTED: matrilin-3 isoform 2 [Callithrix jacchus] Length = 445 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 73/232 (31%), Gaps = 27/232 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G R ++P + + G + F+ SS + V+ ++ Sbjct: 50 GSRPSPVAPDGAPSSGASEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSR 109 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I ++ D R+ + V + + +K T Sbjct: 110 IIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGRITPLSTGTMS 163 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+QTA D + E + K +++TDG + ++ + +A++ GI Sbjct: 164 GLAIQTAMDEAFT---LEAGARGPSSNVPKVAIIVTDG---RPQDQVNEVAARARASGIE 217 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + + + L AS F + +L+ F++ Sbjct: 218 LYAVGV----DRADMES--LKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 263 >gi|296224419|ref|XP_002758052.1| PREDICTED: matrilin-3 isoform 1 [Callithrix jacchus] Length = 487 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 73/232 (31%), Gaps = 27/232 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G R ++P + + G + F+ SS + V+ ++ Sbjct: 50 GSRPSPVAPDGAPSSGASEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSR 109 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I ++ D R+ + V + + +K T Sbjct: 110 IIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGRITPLSTGTMS 163 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+QTA D + E + K +++TDG + ++ + +A++ GI Sbjct: 164 GLAIQTAMDEAFT---LEAGARGPSSNVPKVAIIVTDG---RPQDQVNEVAARARASGIE 217 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + + + L AS F + +L+ F++ Sbjct: 218 LYAVGV----DRADMES--LKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 263 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 58/190 (30%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + + + +DR + L RT KT Sbjct: 53 SRMAAAKQAFNEVLDATPEEVELGIRTLGANYPGDDRKEGCKDTAQLYPVGPLNRTEAKT 112 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ A D + + K IVL++DGE+T + Sbjct: 113 AVATLAPTGWTPIGPALLKAADDLD------------GGDGSKRIVLISDGEDTCAPLDP 160 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N + A+ ++ EL + Sbjct: 161 CEVAREIAARGIGLTIDTLGLVPNAKLSRQLSCIAE--ATGGTYTSVEHQDELTDRVNEL 218 Query: 445 IGNEIFERVI 454 + V Sbjct: 219 VDRAAEPVVT 228 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 67/162 (41%), Gaps = 18/162 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ID +D R+G + F F+ +L + + + DE G T A++ Sbjct: 322 IDKFDDDFRIGISKFTGTYTKMCDFTDDRTELRKVLNRIRTEDEIFDG-TYNQTALKKCI 380 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAICNKAKSQGIRIMTIAFS 406 + ++ + + IV+L+DGE+ + + E ++ N A + + ++T+ Sbjct: 381 NEFSAAGDGKYVN---------IIVMLSDGESDEVDAETIESLSNLANEKSVIVLTVGLG 431 Query: 407 VNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 +E R +L A + ++ A+ L+ +++ + Sbjct: 432 -----REIDRAWLQEVAYSTGGKYYSASDATSLDDVYKQIVT 468 >gi|237737389|ref|ZP_04567870.1| BatB protein [Fusobacterium mortiferum ATCC 9817] gi|229421251|gb|EEO36298.1| BatB protein [Fusobacterium mortiferum ATCC 9817] Length = 322 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 66/218 (30%), Gaps = 50/218 (22%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++I+S+K R+G F+D + + + + Sbjct: 98 YPNRLEAGKRVLTNLIQSLKG-------DRVGFIPFSDSAYIQMPLTDDY-NITQNYINA 149 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ A + + K +++++DG + Sbjct: 150 IDTTLISGGGTELYQALELAEKSFKEIGSEN-----------KTVIVISDGGDFDKKSL- 197 Query: 387 IAICNKAKSQGIRIMTIAFSV-------------------NKTQQE----KARYFLSNCA 423 + K I + +I +++ FL + Sbjct: 198 ----DFVKENKIDVYSIGVGTKEGNVIPEYLNGVKRGFIKDESGSAVISKLNSDFLQKIS 253 Query: 424 SP--NSFFEANSTHELNKIF-RDRIGNEIFERVIRITK 458 + ++E N+ + +K F D I E + TK Sbjct: 254 NENNGKYYEVNNLVDTSKNFVNDTINLERKNQRNEKTK 291 >gi|120609754|ref|YP_969432.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] gi|120588218|gb|ABM31658.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] Length = 355 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 62/226 (27%), Gaps = 48/226 (21%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + +DA + I + VR+G F + L Sbjct: 97 SMRAADVHPDRLTAAQDAAKAFIADL------PRHVRVGIVAFAGSAQLAQLPTQNHEDL 150 Query: 320 IRTIVKTFAIDENEMGS------------TAINDAMQTAYDTI-----ISSNEDEVHRMK 362 R I G+ T I+ + + + HR Sbjct: 151 FRAIDSFQLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMDEIGRPPHRGG 210 Query: 363 NNLEAKKY------------IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 N A + I++LTDG+ T + + A +G+R+ T+ Sbjct: 211 NGKGADRPAPVAPGSYTSAAIIMLTDGQRTTG-VDPMEAAQWAADRGVRVYTVGVGTVAG 269 Query: 411 QQ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + +F A + +L K++ Sbjct: 270 ETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETL 315 >gi|91977980|ref|YP_570639.1| TadE-like [Rhodopseudomonas palustris BisB5] gi|91684436|gb|ABE40738.1| TadE-like [Rhodopseudomonas palustris BisB5] Length = 181 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 11/72 (15%), Positives = 29/72 (40%), Gaps = 1/72 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTA 60 +F ++ ++ ++ G I AL+ P+ G+ +++V + L+ A +A Sbjct: 4 LFVSRAAVLMRRFGRNRRGSAAIEFALIAPIFFGLLFAIIEVALMFFAGQVLETAVQDSA 63 Query: 61 IITASVPLIQSL 72 + + Sbjct: 64 RLILTGQAQGGS 75 >gi|295669664|ref|XP_002795380.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] gi|226285314|gb|EEH40880.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] Length = 773 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 57/193 (29%), Gaps = 28/193 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L + A ++I ++ + D R+G F++ S + + Sbjct: 109 LDLTKHAARTIIETLNEND------RLGVVTFSNDAEVAYKIS-HMDDTNKKAALEAVEA 161 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------QDNE 384 + ST + ++ + ++ + + + +LTDG+ Sbjct: 162 LQPLASTNLWHGLKLGLSVL--------GKVDLRPQNVQALYVLTDGQPNHMCPRQGYVP 213 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + I + K + I T F + L + A ++ + +F Sbjct: 214 KLRPILERQKDRLPLIHTFGFGYDI-----RSGLLQSIAEVGGGTYSFIPDAGMIGTVFV 268 Query: 443 DRIGNEIFERVIR 455 I N + Sbjct: 269 HAIANLYTTFATQ 281 >gi|192359695|ref|YP_001982895.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685860|gb|ACE83538.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 660 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 27/169 (15%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F V + + K+ + + T I A++ A +++ ++ Sbjct: 83 WTFGQSVNLLVPYRLVDESWRQQAAKSASAINSVALHTHIGAALEKAAQDVVAGDDGFRR 142 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICN----------KAKSQGIRIMTIAFSVNK 409 +VLLTDG D E + I + K+ G + TIA Sbjct: 143 N----------LVLLTDGVVDIDPEAVVNIQERKRILTELLPQLKAAGYVVHTIAL---- 188 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 Q+ + + A + F A S EL + F + + + Sbjct: 189 -SQDADQELMKKLALTTDGVFAVAQSADELMQAFLTIFDQAVPAERVPL 236 >gi|85859136|ref|YP_461338.1| hypothetical protein SYN_01495 [Syntrophus aciditrophicus SB] gi|85722227|gb|ABC77170.1| hypothetical exported protein [Syntrophus aciditrophicus SB] Length = 364 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/274 (10%), Positives = 57/274 (20%), Gaps = 13/274 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL------ 68 ++ G +I AL M V LG+ M VDV ++ L AA + + L Sbjct: 12 LRERKGAVAVIVALAMTVFLGIAAMAVDVGHIMVVKNELHNAADADALARANLLYAHTPS 71 Query: 69 ---IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + A S + + + ++ + P Sbjct: 72 GFTSATPPTPDWAAAESAASTIDPANKSDGVTLTSYEVETGYWNLDQNPAGLQPKSITPG 131 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 V + + G + + A A ++ V Sbjct: 132 TRDVAAVKVTVRRVDGTNGGSIRHWFGAFVGNLTSNASATAIAICSSPGTAKPGTLV--- 188 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + N + V + + + SL Sbjct: 189 -PMAIPLWIAKRASHYNSPSNLLTIGSAYHYDAYAPVNPDGDPYTNTVAGQWTSLTSDPI 247 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 + + + Sbjct: 248 NNANHLKDIIVNGNSNPLSIGDPLYIEPGTMDVG 281 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 81/305 (26%), Gaps = 29/305 (9%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 PL+ + + S + + + S +P Sbjct: 265 PPLTPLVAIYPAEGTFWHDNPFIIMASANSDERDAAERFYEFLLSEESQRAAMSFGFRPA 324 Query: 202 NCFGQPADRTVKSYSSQNGKVGIR-DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 N D ++ V + +V+ S + + ++ Sbjct: 325 NPNVPLTDPISPAFGVDPQGVQTVLAVPTAEVIVAIKNSWSLNRKRADIVLVVDTSGSME 384 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 K +V+ + + + I D R+G F + Sbjct: 385 GD-------KLTMVKAGIETFLMRILPED------RLGLITFASAARLVVPMAPLSD--N 429 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R ++ G TA+ DA+ + + + + IVLL+DG + Sbjct: 430 RIALQDAVQAMRASGRTALFDALVLGKQVL-------EQLPPADDDRIRAIVLLSDGADN 482 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 I GI I +A+ + R L A + T ++ + Sbjct: 483 SSQASLDQIRTLFDESGISIFPVAYG-----NDADRQVLDAIAEFSRTIVVVGDTGDIAQ 537 Query: 440 IFRDR 444 IF + Sbjct: 538 IFENL 542 >gi|254420639|ref|ZP_05034363.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] gi|196186816|gb|EDX81792.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] Length = 613 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/370 (11%), Positives = 86/370 (23%), Gaps = 28/370 (7%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 + +IE A +A V S P Sbjct: 76 TQSRIEGVPPPPPPPPPPPPPAPPAPMRPAVVAAAPGQAPLNAVVVTGSRIMPGAPAPSD 135 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + +T + + S + +N F Sbjct: 136 TETYPDATPNPVKRTADQPVSTFSIDVDTAAYSNVRRFIDEGRSPPADAVRVEELINAFD 195 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK-----SLYYMLYPGPLDPSLSEEHFVD 260 R + +P + L P L+ VD Sbjct: 196 YGYARPTSLARPFAITTAVVASPWAPRTERGGRQIVHIGLQGYELPQGEQRPLNLTFLVD 255 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S K L + A+ I ++ D ++ T++ + + + G Sbjct: 256 VSGSMRSPDKLDLAKQAMNLAIDRLRPQDTLS------VTYYAEGAGTTLQPTPG---DQ 306 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + ++ G TA M AYD + + ++ TDG+ Sbjct: 307 KLKMRCAVASLRASGGTAGATGMTNAYDQ--------AQASFARDKVNRILMF-TDGDFN 357 Query: 381 ---QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 DN+ + + G+ + F Q + + A + Sbjct: 358 VGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARMQTIAQ--AGNGVAAYVGDLRDA 415 Query: 438 NKIFRDRIGN 447 ++F Sbjct: 416 RRLFGPMFDK 425 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 45/160 (28%), Gaps = 37/160 (23%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISS------------------------NED 356 R V T G TA+ + M A T++ Sbjct: 157 REAVATAIERLQPQGGTALGNGMLIALTTLLPELTPDAERLMNDDTPPPRKPRALANPPA 216 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------ 410 + +K IVL +DGE+ A A G+RI T+ + Sbjct: 217 DTEPVKPGSYTSGAIVLFSDGESNAGPAALRAA-QLAAEHGVRIYTVGVGTPEGVVLSVD 275 Query: 411 ----QQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 + L A+ +F EL +++R Sbjct: 276 GWSARVRLDEKVLKEVADATSAEYFRLEDAAELKRVYRAL 315 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + D R+G + RV ++ S K + + T A+Q Sbjct: 65 IDGLSVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST--GTMTGLAIQ 122 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + + K +++TDG QDN I +A+ GI I I Sbjct: 123 FAMNVAFSEAEGGR----KSPDISKVAIIVTDGRP-QDNIR--DIAARAREAGIEIFAIG 175 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L K F++ Sbjct: 176 VGR------VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQEAF 216 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 84/284 (29%), Gaps = 34/284 (11%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + + + ++ + F Q + V+ + +N + + + Sbjct: 212 VIDRVTGLYKDNCVFIPDKNQREKASIMFNQNINSVVEFCTEKNHNKEAPNAQNQRCNLR 271 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 + ++ + + SL ++ + + ++S +++ +N Sbjct: 272 STWEVIQESEDFKQTTPMTAQPPAPTFSLLQTRQRIVCLVLDKSGSMQSDNRLNRMNQAS 331 Query: 296 RMG------------ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 R+ F+ + R ++ G T+I + Sbjct: 332 RLFLLQTVEQGSWVGMVTFDSTAYVQSELTQLNSGADRDLLIKRLPTV-ASGGTSICSGL 390 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMT 402 Q A+ +I + IVLLTDGE+ I+ C + K+ G I T Sbjct: 391 QAAFTSIKKKYPTDGAE----------IVLLTDGEDN-----TISSCFDLVKNSGAIIHT 435 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEAN--STHELNKIFRDR 444 +A K LS + ++ + L F Sbjct: 436 VALG---PSAAKELEQLSKMTGGLQTYSSDQIQNNGLVDAFAAL 476 >gi|302832626|ref|XP_002947877.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] gi|300266679|gb|EFJ50865.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] Length = 733 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 54/181 (29%), Gaps = 21/181 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 ID + +G + V D RT+ + GSTA+ D + Sbjct: 254 IDQLTSDDFLGLISYAHDVREDLPLLRMTP-ASRTLAHAVVEELVAGGSTALYDGLVAGL 312 Query: 348 DTIISSNEDEVHRMKNNLEAKKY-----------IVLLTDGENTQDNEEGIAICNKAKS- 395 +++ D + A L TDG+ T +I ++ Sbjct: 313 RQQMAAERDLGGGNGASGGASDSSSPSSLSLVHSCFLFTDGQATDGPSNPASIIEGLQAA 372 Query: 396 -----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 Q + + T F + + + A ++ + ++ F D +G + Sbjct: 373 QAPSGQHVTVHTFGFGNGHSVELLQQVAE---AQSGVYYYISCEEDIACGFGDALGGLLA 429 Query: 451 E 451 Sbjct: 430 V 430 >gi|148656823|ref|YP_001277028.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568933|gb|ABQ91078.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 851 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 56/173 (32%), Gaps = 25/173 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 I K + ++A I +++ R+G F+ F L V+ Sbjct: 410 ISKFTMAKEAA------IMATESLRQEDRIGVLAFDVSTRWVVDFQPVGVGLSLADVQRR 463 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEG 386 G T I +A+Q + H VLLTDG T D + Sbjct: 464 ISTLPLGGGTDIYNALQEGLPALAQQPGRVRHA-----------VLLTDGRSFTDDRQAY 512 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + +A+SQ I + TIA + L A + A +++ Sbjct: 513 RMLLEEARSQNITLSTIAIGTDADIN-----LLQELARWGAGRYHYAAEPNDI 560 >gi|90020471|ref|YP_526298.1| arginine biosynthesis bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase [Saccharophagus degradans 2-40] gi|89950071|gb|ABD80086.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 708 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 45/400 (11%), Positives = 111/400 (27%), Gaps = 44/400 (11%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A + ++T ++S + S + + + + Sbjct: 164 SAALEEVVVTGMRSSAAESAKLSKKPAASQRQVSAIRAQDIGALPDQSNAVALQRIAGMP 223 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + DT V P+ + + ++ +++ AEA + S Sbjct: 224 VDGDTIVAPAPQGND--------------------KFEHVEENSVKSVAEAPVSTFSIDV 263 Query: 174 EHG-VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + S S + + + +N F S + + + D + Sbjct: 264 DTASYSFVRRQLNSGYLPEKDAIRAEELINYFDYNYPLPSDSTAPFKPNITVIDSPWAKG 323 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + L P + +D S + K LV+ ++ ++ ++ D V Sbjct: 324 KKLVHIGLKGYDIAPDQKPRTNLVFLLDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVA 383 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 V + + + + + GSTA + AYD + Sbjct: 384 IVV---------YAGAAGTVLEPTPAKDKQKILSAMQRLQAGGSTAGGAGIALAYDLAEA 434 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNK 409 + + + ++L TDG+ + + + + +GI + + F Sbjct: 435 NFDKKAVNR---------VILATDGDFNVGSTNNETLQGFVERKREKGIFLSVLGFGQGN 485 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + N ++ E K+ + + Sbjct: 486 YNDHLMQTLAQN--GNGVAAYIDTVSEAQKVLVQEASSSL 523 >gi|319940449|ref|ZP_08014794.1| hypothetical protein HMPREF9464_00013 [Sutterella wadsworthensis 3_1_45B] gi|319806075|gb|EFW02824.1| hypothetical protein HMPREF9464_00013 [Sutterella wadsworthensis 3_1_45B] Length = 529 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/376 (10%), Positives = 90/376 (23%), Gaps = 18/376 (4%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + ++++ +G + + + + +VD R S LKQA A Sbjct: 4 LLCKRAQLLLARVLREESGSAAVNVVFFLIGGIALTAFVVDATRVSADGARLKQATDAAA 63 Query: 62 ITASVPLIQSLEEVS---SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + E ++ + K + + R Sbjct: 64 QAVAMEAAKDSETDVLGMAQRYVAVNLGLDKEQLSRDLSVAVEPVTWNDYEGYRVSASFR 123 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL----IQTKAEAETVSRSYHKE 174 A+ V ++S + NPL + L + + E Sbjct: 124 ALPSLLGGPGKTVEVASAAVAIYNPLEIALVVPSTINETQRDMQAIRDIGEAFYDEVIDG 183 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD--EKLSPY 232 ++ +S + + + + P + + G+ + P Sbjct: 184 RADRWMALVPYSDGVNVWDDAKGVSRIRSWALPDRIEPQWFRYIKQGAGVSNLASPDMPD 243 Query: 233 M---------VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + L+ P +S E + + VI Sbjct: 244 VRKKILHVRRGMRGGELFDWTEPPGGSFEISAETCYGGNCIMSNYPGGWPYITWRGPVIP 303 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S R A + + + K + K E + +N AM Sbjct: 304 SAGNGLTGPTDQRWIAADGTVPLTALLPLTDEREKFTARLQKMIGDHEEINHAINMNIAM 363 Query: 344 QTAYDTIISSNEDEVH 359 + Sbjct: 364 GWGAMALSPGFRGIDG 379 >gi|297671247|ref|XP_002813757.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Pongo abelii] Length = 1074 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 91 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 145 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 146 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 205 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 206 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 265 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 266 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 318 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 319 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 372 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 373 ACANKGYYFEIPSIGAI 389 >gi|297671245|ref|XP_002813756.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Pongo abelii] Length = 1081 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 91 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 145 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 146 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 205 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 206 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 265 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 266 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 318 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 319 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 372 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 373 ACANKGYYFEIPSIGAI 389 >gi|2781441|gb|AAB96914.1| alpha 2 delta calcium channel subunit isoform II [Homo sapiens] Length = 1076 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 91 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 145 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 146 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 205 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 206 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 265 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 266 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 318 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 319 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 372 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 373 ACANKGYYFEIPSIGAI 389 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 74/272 (27%), Gaps = 26/272 (9%) Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 D + + S G + F + + D+ + + Sbjct: 634 VVSFDTNTKDYNNGTGSRGSAVVYFDDILYGKAQVKARLIDLDPRYDKVVQSLKNTTVSE 693 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + E + +K + V ++IK+I V Sbjct: 694 QIFANPRYENNLCSVESEVMFVVDHSGSMKARDAKNYTANKVKQTIKQIGANPSHVYRFN 753 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 N + L+ + ST I A++TA Sbjct: 754 NRPNHEATDKADIVSSIDSLLTYKNENR--------STNIVKALETAI-----------G 794 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 N K IVL+TDG + + + AK +GI I T++ L Sbjct: 795 NFTTNQYTSKAIVLVTDGYSNSN--GLEQVLRDAKLKGIAIHTVSVGS---YTTVNEKLL 849 Query: 420 SNCASP--NSFFEANSTHELNKIFRDRIGNEI 449 + +S ++ S L+ + I + Sbjct: 850 KDISSETNGTYQNITSIENLHGSLQAIITTIL 881 >gi|99081992|ref|YP_614146.1| TadE-like [Ruegeria sp. TM1040] gi|99038272|gb|ABF64884.1| TadE-like protein [Ruegeria sp. TM1040] Length = 182 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 10/48 (20%), Positives = 23/48 (47%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 ++ + G+ + A+L P+ML V V++ ++ + L +A Sbjct: 11 LRQFRRDEEGNATVEFAMLFPLMLMVLFASVELGMITFRQIMLDRAMD 58 >gi|23008710|ref|ZP_00050041.1| hypothetical protein Magn03003732 [Magnetospirillum magnetotacticum MS-1] Length = 167 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 48/164 (29%), Gaps = 10/164 (6%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +K L + G ++ L + ++G G+ VD + + L AA + Sbjct: 10 RLRGRAKALHRDSGGALNVVIGLTLIPIVGFIGLGVDYGMATAGKTRLDNAA------DA 63 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L + + N+ + +N + + V ++ Sbjct: 64 AALAAVVSAKAYIVANAKQSNVTTLALTEGQNQAVKAFNINAGKLLYGTVSLATPQVTRN 123 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + ++ + +LF R G ++ A Sbjct: 124 RQTLSSSITYTATIQ----TLFGRIFGSQATTFTNTVTASADIA 163 >gi|194221273|ref|XP_001915997.1| PREDICTED: similar to Voltage-dependent calcium channel subunit alpha-2/delta-2 precursor (Voltage-gated calcium channel subunit alpha-2/delta-2) [Equus caballus] Length = 1127 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/279 (12%), Positives = 86/279 (30%), Gaps = 23/279 (8%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 + ++ V+ + + + LN + +++ + Sbjct: 199 EDSNFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGS 258 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + + L ++ +S + V +++ + + K Sbjct: 259 ATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKT 318 Query: 289 ------DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAIN 340 D ++D + FN++ F+ V + + K G+T Sbjct: 319 SVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYK 378 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGI 398 + A+D + +SN K I++ TDG + + K ++ + Sbjct: 379 AGFEYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTV 427 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 R+ T FSV + + CA+ +FE S + Sbjct: 428 RVFT--FSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 464 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 58/183 (31%), Gaps = 14/183 (7%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + D + + I+++NDT ++G F+ T Sbjct: 331 DYSASMYEKNGDITRMALSAKDLIESLNDTHQVGVVEFHRPDEPPAILQDFT-----TYK 385 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 ++ S I + +D ++ + K + + K +V L+DG + Sbjct: 386 NAAIEAVSQFSSGKIYRDFSSCWDAVLKG--LKQFPEKPDPDIFKTLVFLSDGFDNSSFS 443 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 + + AK + + I + L N A + ++ A + + F+ Sbjct: 444 TPGNVISLAKERDVHIYILGIGRGS-----EEEVLKNIALETGGTYVHAENIAVFRERFK 498 Query: 443 DRI 445 I Sbjct: 499 QTI 501 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 60/190 (31%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND--RVISDPSFSWGVHKLIRTIVKT 326 + + A V+ + + + ++ D + + V L RT KT Sbjct: 60 TRMAAAKRAFNEVLDATPEEVQLGIRTLGANYPGDNQKTGCKDTAQLYPVSTLDRTEAKT 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T I A+ + S K IVL++DGE+T + Sbjct: 120 QVATLSPTGWTPIGPALLKSAGDFTDSAS------------SKRIVLISDGEDTCAPLDP 167 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N +++ A+ ++ EL Sbjct: 168 CEVAREIGAKGIGLTIDTLGLVPNTKMRKQLSCIAE--ATGGTYTSVEHADELTDKVNQL 225 Query: 445 IGNEIFERVI 454 + V Sbjct: 226 VDRAADPVVT 235 >gi|145494949|ref|XP_001433468.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400586|emb|CAK66071.1| unnamed protein product [Paramecium tetraurelia] Length = 611 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 46/160 (28%), Gaps = 16/160 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + R+ FN+ K + + + G T I+ A A+ + Sbjct: 223 EQDRLQIITFNEHAQRLTPLKCLTEK-NKQYFQAVISQISAEGLTKISSATYIAFKQL-- 279 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K + LL+DG + E K I T F + Q Sbjct: 280 -------KEKVYRNNVTSVFLLSDGHDGDALFEISDQIRHVKEV-FTISTFGFGDDHDAQ 331 Query: 413 EKARYF-LSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 L N +F+ L++ F +G + Sbjct: 332 MMTSISNLKN----GNFYYVKDITLLDEFFAHALGGIVSV 367 >gi|75676719|ref|YP_319140.1| hypothetical protein Nwi_2535 [Nitrobacter winogradskyi Nb-255] gi|74421589|gb|ABA05788.1| conserved hypothetical protein [Nitrobacter winogradskyi Nb-255] Length = 554 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/225 (12%), Positives = 62/225 (27%), Gaps = 12/225 (5%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ I G+F I+ A M ++LG + VDV + + Sbjct: 2 RNLLRRFISDTRGNFAIMGAGCMVLVLGCTALGVDVGAIYADRRKAQS---------AAD 52 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L + + S Y R + FT + Sbjct: 53 LAAIVAAGNLGKAASAAAATVTANGYPARALLSVELGTFTANAALTPQARFVTPATGAPN 112 Query: 128 AYQVVL---SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 A +V L + Y + + + ++ T + + +G + V+ Sbjct: 113 AARVTLRTETPLYFARMFAGGVNHFDITTRATAASTALASFAIGSRLLALNGGLLNAVLG 172 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + Q L A + + +++ G+ + + Sbjct: 173 RMLGTTLSLSVMDYQALIDARIDAFDFLNALATRLDLTGVTYDSM 217 >gi|71028596|ref|XP_763941.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350895|gb|EAN31658.1| thrombospondin-related protein [Theileria parva] Length = 552 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/314 (13%), Positives = 87/314 (27%), Gaps = 24/314 (7%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + P+ R + + E + S + I S+ Sbjct: 99 SVLPMETLDRLSEAITRSSEHPVTFEALDGGSVVVTNNSDTFSIKLYPSLPGLNLTPGML 158 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 P N + T K + D S Y ++ LY P E Sbjct: 159 PTNKPNSHINFTGNHNEHALLKHALHDLSTSNY----DRGLYPDGIKKPSSYCHRELDLT 214 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 I K + + +K I + V + F+ + SF Sbjct: 215 ILVDESSSIMK-EEWEKLIPFLKSLVKSISISPNYVHLSVVTFSTSIRWLISF-LNPSGK 272 Query: 320 IRTIVKTFAIDENEM----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + G T A+ + + ++ A K I+L+T Sbjct: 273 DENLALRVIDELKNSKPVFGFTFTGQALNFITEAV--------YQFGARQNAPKAIILIT 324 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANS 433 DG +TQ N + G+ I+ + + ++ ++ C + F + Sbjct: 325 DGSSTQPN-VTSQASAMLREAGVTILVVGVGM---ARDYECRAVAGCPVKGNCPHFFMTN 380 Query: 434 THELNKIFRDRIGN 447 +E+ + + + Sbjct: 381 WNEIIRKVGELMAE 394 >gi|332216457|ref|XP_003257368.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Nomascus leucogenys] Length = 1094 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 83/271 (30%), Gaps = 23/271 (8%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V+ + + + LN + +++ + + Sbjct: 136 VNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYY 195 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DN 290 + + L ++ +S + V +++ + + K D Sbjct: 196 PATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDT 255 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYD 348 ++D + FN++ F+ V + + K G+T + A+D Sbjct: 256 LSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFD 315 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFS 406 + +SN K I++ TDG + + K ++ +R+ T FS Sbjct: 316 QLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FS 362 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 V + + CA+ +FE S + Sbjct: 363 VGQHNYDVTPLQWMACANKGYYFEIPSIGAI 393 >gi|291290994|ref|NP_001167522.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform c [Homo sapiens] Length = 1150 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 160 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 214 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 215 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 274 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 275 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 334 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 335 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 387 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 388 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 441 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 442 ACANKGYYFEIPSIGAI 458 >gi|51245384|ref|YP_065268.1| hypothetical protein DP1532 [Desulfotalea psychrophila LSv54] gi|50876421|emb|CAG36261.1| unknown protein [Desulfotalea psychrophila LSv54] Length = 420 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 70/259 (27%), Gaps = 22/259 (8%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ------ 55 + + SKK G ++TA+LM V++ + VD+ + L+ Sbjct: 1 MMKSILKTGSKKG---EEGAVAVLTAILMAVLIMFAALAVDLGYLYGVRNELQNGADAGA 57 Query: 56 --------AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 + I+T + ++ VS + + + IE + N T Sbjct: 58 LAGAHELLDVENGILTRDDAIAEAERVVSLNSTGNDAVQFKPIETGHWSFTTSTFSPNPT 117 Query: 108 DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 D + + A V + + + F R G S+L+ T+A A Sbjct: 118 DTQGEWQEKSFAELDADLNFINAVRV---VSFRDDTPAFFARIFGFASFLVNTEAIAYIG 174 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL--NCFGQPADRTVKSYSSQNGKVGIR 225 + + I + + + + T + Sbjct: 175 FAGDLYPGEIDLPIGICKETITDGDSFNCNMGRMLNSGGNAATEMTAMWTNFSQEPCSTA 234 Query: 226 DEKLSPYMVSCNKSLYYML 244 + S Sbjct: 235 SNSDMQDLTSGCSGSNITD 253 >gi|32477946|ref|NP_870940.1| hypothetical protein RB13238 [Rhodopirellula baltica SH 1] gi|32448503|emb|CAD78018.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 164 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 52/157 (33%), Gaps = 8/157 (5%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + I +++ + G + A+++PV++ +V++ R L+ A TA Sbjct: 1 MASQARQIKAQRRVPR--QGAALVEFAVVLPVIMLFLTAMVEISRILM----LQHTADTA 54 Query: 61 IITASV-PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A+ ++ + + + + + + VR V Sbjct: 55 AYEAARCAMVPGATVTEAEWEAYALIEAAGLTNTAVTVTPAEITEETAFITVRVEVPAND 114 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 VV++S L L S +R GI Sbjct: 115 NSWMLSSQFTDVVVASEVTL-LTERSPIVRLTGIPGL 150 >gi|116620210|ref|YP_822366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223372|gb|ABJ82081.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 311 Score = 50.8 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 22/160 (13%) Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + ++ GSTA+ DA+ A + Sbjct: 125 FFLVEFDSSPRLVVPLTSDT-----GTIEDHLTFSRSHGSTALLDAIFLALHEM------ 173 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF--SVNKTQQEK 414 K++ + KK +++++DG + + + K + I +I ++ Sbjct: 174 -----KHSKKNKKALLIISDGGDNHSRYSEKEVSSVVKESDVLIYSIGVFGGGGSPEEAG 228 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 LS + + FEA S EL +IG E+ R Sbjct: 229 GPGLLSKVSEQTGGRLFEA-SAVELPD-IAKKIGIELRNR 266 >gi|313238339|emb|CBY13421.1| unnamed protein product [Oikopleura dioica] Length = 501 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/362 (11%), Positives = 103/362 (28%), Gaps = 29/362 (8%) Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 + + K + + + ++ + LS + + LR M + Sbjct: 60 IADFVDKFDVSQTATRVSVVQFTDNVSDRNGFGFPLSGQA-ARVKSDLANLRYMTGNTHA 118 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS--Y 215 + S ++ + V+ S + ++ + + + S Sbjct: 119 GNALRYVKNNVLSRARKSASKVLIVLTDGVSQDEIEKAASDLIGDKVLVFSVGIGNSVDA 178 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI------- 268 + G+ + + + LY S+ I Sbjct: 179 NELEKIAGLPEYVFKTANYNALTGITDTLYNKLCSSLKSDNCISSVKQDLSFIVDSSSSI 238 Query: 269 --KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS---DPSFSWGVHKLIRTI 323 ++ + S+I ++ DN + + + + R +P ++ + Sbjct: 239 TISDYQKLKTWMKSIIEKLEIGDNASRVSILQFSGQSARPQGRWINPVLTFDRSTSKEAV 298 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + + G T I +A+ Y + +S + ++ ++++TDG+ Sbjct: 299 IGAIDGMKKLNGDTCIGEALDYFYRNMFTSQ------AGQRSDVEQRVIVMTDGKRNCPA 352 Query: 384 EEGIAICNKAKSQGIRIMTIAF----SVNKTQQEKARYFLSNCAS---PNSFFEANSTHE 436 E ++Q I I + R L AS FE N+ + Sbjct: 353 EIAKP-AELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQ 411 Query: 437 LN 438 L Sbjct: 412 LI 413 >gi|296227162|ref|XP_002759255.1| PREDICTED: hypothetical protein LOC100397584 [Callithrix jacchus] Length = 1319 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++D+ + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDQPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + + K+ + Sbjct: 93 --GLFGSREEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRP---GDRAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A A GIRI + E + L AS F Sbjct: 148 LLTDGRSQDLVLDAAA---AAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|119585523|gb|EAW65119.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Homo sapiens] Length = 1146 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 160 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 214 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 215 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 274 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 275 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 334 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 335 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 387 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 388 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 441 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 442 ACANKGYYFEIPSIGAI 458 >gi|291231970|ref|XP_002735935.1| PREDICTED: chloride channel accessory 2-like, partial [Saccoglossus kowalevskii] Length = 849 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 57/162 (35%), Gaps = 17/162 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D +G FND+ + +++ ++ +T I + A D + + Sbjct: 334 DDDELGMVVFNDQPSTRSQMVTISESTRLDLLELIPTRDDIGDATGIGSGLSEAIDVLEN 393 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 D I+L++DGE + + + +G+ + TIA V+ + Sbjct: 394 GGNDAAGGC---------IILVSDGEENRSPYI-DDVQSTIVDKGVCVHTIALGVDASHN 443 Query: 413 EKARYFLSNCASPN-SFFEANST--HELNKIFRDRIGNEIFE 451 + A+ SF+ + + + LN+ F + Sbjct: 444 MEQLPL----ATDGKSFYYSENPYSNALNEAFITIAKQDTNA 481 >gi|225028889|ref|ZP_03718081.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] gi|224953773|gb|EEG34982.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] Length = 1070 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 106/382 (27%), Gaps = 33/382 (8%) Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM- 151 +N ++ DI V + +L L P L + ++ Sbjct: 225 DSAQNVLEKGNVSYDGFLGYDISFYNKEGKEIEPEEGSVRVEVDMNLNLLPKDLQMDTLQ 284 Query: 152 ------GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 K ++T A+A S K + V FS +L + D+ G Sbjct: 285 MQHLKEEKKDRTVETVAKAADHKLSVSKSKVTAKFEVKSFSDFILTWNIDATPADPLETG 344 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 A K + + +RD+ ++ LD + Sbjct: 345 DNAASIEKQINHEKYAT-LRDDGTYDLTLTVAGKKGTETNKAKLDVIYILDKSGSMKEDF 403 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 K+ +A+ ++ +S+K N N R F+ + V Sbjct: 404 GGTSKRIAASNAITALTKSLK--QNANIDARFSMVTFSGNKTTGMWGQGDTKTWDDAEVA 461 Query: 326 TF----------AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 G T ++TA + + S + + + Sbjct: 462 VSWTTDAGTIERGSKPTSNGGTNYQAGIRTAKELLTSKRAGAMTAVIFISDGDPTFYYNP 521 Query: 376 DG-------ENTQDNEEGIAIC-NKAKSQ----GI-RIMTIAFSVNKTQQEKARYFLSNC 422 DG + + + +C + AK++ G+ T+ + ++ Sbjct: 522 DGYTRGDGNNDGNGGADNLKVCLDAAKNEIANLGVNYFYTVGVGKANDYVNLSDLCSASG 581 Query: 423 ASPNSFFEANSTHELNKIFRDR 444 S F+ +T EL K F Sbjct: 582 VSGAKNFDGTNTDELTKAFSTI 603 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 58/171 (33%), Gaps = 17/171 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D++ D R+G FN S K +K ++ + G T ++ MQ A + Sbjct: 230 DHLEDDDRLGLVLFNTGAELAEPVSLVGAK-NMQKLKGDVLEISATGGTRLSAGMQMATE 288 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAF 405 + N E + I+ LTD + + ++ + I Sbjct: 289 L------YDEFLEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANANKNVYTTFIGI 342 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 V+ + A +++ +S + F++R+ +E V + Sbjct: 343 GVDFNTELVEYITKIRGA---NYYSVHSATQ----FKERMDDEFEYMVTPL 386 >gi|54112392|ref|NP_001005505.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform a [Homo sapiens] gi|2781439|gb|AAB96913.1| alpha 2 delta calcium channel subunit isoform I [Homo sapiens] gi|3043640|dbj|BAA25484.1| KIAA0558 protein [Homo sapiens] gi|3695006|gb|AAC70914.1| putative tumor suppressor gene 26 protein alpha 2 delta calcium channel subunit [Homo sapiens] gi|119585525|gb|EAW65121.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_d [Homo sapiens] gi|156230959|gb|AAI52439.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Homo sapiens] gi|168267416|dbj|BAG09764.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 isoform b [synthetic construct] Length = 1145 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 160 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 214 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 215 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 274 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 275 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 334 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 335 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 387 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 388 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 441 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 442 ACANKGYYFEIPSIGAI 458 >gi|74725352|sp|Q9NY47|CA2D2_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|7414318|emb|CAB86193.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] Length = 1150 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 160 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 214 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 215 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 274 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 275 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 334 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 335 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 387 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 388 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 441 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 442 ACANKGYYFEIPSIGAI 458 >gi|281179360|dbj|BAI55690.1| conserved hypothetical protein [Escherichia coli SE15] Length = 580 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 92/322 (28%), Gaps = 30/322 (9%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + NP + + + T S +++ ++ + Sbjct: 99 ASQIANPATARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ-GLLPPPDAVR 157 Query: 197 EGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +N F D K + + + + Sbjct: 158 VEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEEL 217 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P+ + +D+S ++ L++ +L +++ +++ DN+ + Sbjct: 218 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIAL 271 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G H + + + GST ++ AY ++ + Sbjct: 272 PSISGSH---KAEINAAIDSLDAEGSTNGGAGLEMAYQ--QAAKGFIKGGINR------- 319 Query: 371 IVLLTDGENTQDNEEGIA---ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+L TDG+ ++ + + K + G+ + T+ + + + Sbjct: 320 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIAD--VGNGN 377 Query: 428 FFEANSTHELNKIFRDRIGNEI 449 + ++ E K+ + + Sbjct: 378 YSYIDTLSEAQKVLNSEMRQTL 399 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 57/177 (32%), Gaps = 24/177 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D V++ ++ ++ + H I +V+ GST AM+ Sbjct: 422 VNSFDIGPSKVQISLVQYSRDPHTEFAL--NTHHDINAVVRAVRTFPYRGGSTNTGKAMK 479 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 D I ++ + + +VL+TDG+++ ++ + I + Sbjct: 480 YVKDKIFVASRGARQNV------PRVMVLITDGKSSDS---FKDAATNLRNIDVEIFAVG 530 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ R L A+P FE F+ +RI + Sbjct: 531 V------KDAVRSELEAIANPPADNHVFEVED----FDAFQRISKELTQSICLRIEQ 577 Score = 37.7 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 31/95 (32%), Gaps = 12/95 (12%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + ++K VL+TDG+ +QD+ +++ I + + + L Sbjct: 1168 NVGMRRNSRKIGVLVTDGK-SQDDVHEK--AQNLRNENIELYAVGV------KNAEENEL 1218 Query: 420 SNCASPN---SFFEANSTHELNKIFRDRIGNEIFE 451 + AS + L I + N Sbjct: 1219 RSIASDPDDIHMYNVADFSFLLDIVDNLTNNLCNS 1253 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 59/165 (35%), Gaps = 13/165 (7%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +ND +G +++ V + + R+ + + G TA DA+ Sbjct: 399 TNGAQYINDNNYVGLVSYSNSVTIEVPIAQ-FDLNQRSYFQGAVNNLIASGGTASYDAVV 457 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A I + + +AK + LL+DG I + ++ GI + TI Sbjct: 458 VAVKMITEAKA-------QHPDAKCMLFLLSDGYANNGYS-MDEITSALRTSGIPVYTIG 509 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + E AR N A+ + + ++ + +++ Sbjct: 510 YGDDADTGELARLSGINEAASIN----ADSDDIIYKIKSLFNSQL 550 >gi|326927692|ref|XP_003210025.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Meleagris gallopavo] Length = 1108 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 38/307 (12%), Positives = 91/307 (29%), Gaps = 19/307 (6%) Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + F G + ++ + + + + V+ + + + Sbjct: 130 YDSKADTEFDDPDGEEIEREKSNSLKLEFTDDANFKTKVNYSYAAVQIPTDIYKGSTVIL 189 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 LN D +++ + + + + L + Sbjct: 190 NELNWTQALEDVFIENRKEDPSLLWQVFGSATGVTRYYPATPWRAPNKIDLYDVRRRPWY 249 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFNDRVISDPSF 312 + +S + V +++ + + K D ++D + FN++ F Sbjct: 250 IQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVHEMLDTLSDDDYVNVASFNEKAKPVSCF 309 Query: 313 SWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 V + + K G+T + A+D + +SN K Sbjct: 310 KHLVQANIRNKKVFKEDVQGMVAKGTTDYKAGFEYAFDQLQNSN-------ITRANCNKM 362 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 I++ TDG + + ++ +R+ T FSV + + CA+ +FE Sbjct: 363 IMMFTDGGEDRVQDVFEKY--NWPNKTVRVFT--FSVGQHNYDVTPLQWMACANKGYYFE 418 Query: 431 ANSTHEL 437 S + Sbjct: 419 IPSIGAI 425 >gi|322418525|ref|YP_004197748.1| hypothetical protein GM18_0996 [Geobacter sp. M18] gi|320124912|gb|ADW12472.1| hypothetical protein GM18_0996 [Geobacter sp. M18] Length = 389 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 43/166 (25%), Gaps = 22/166 (13%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII----------------------T 63 +++ V L V G+ +D+ E L+ +A+ A + Sbjct: 13 VTIMLVVFLVVTGLAIDIGYMYVSEEDLQHSAEMAALTGAQTIKQRYLYQAQTDPARLPA 72 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 S +Q+ ++ + + L N N ++ + Sbjct: 73 ISSDPVQAPARNAAVDLVTGKHDAAALVGLLNNNGNALTGDNDITVGFWNMSSRSYTPGG 132 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 +A QV + L + S + + Sbjct: 133 TPVNAMQVRTRRTAESSSVGLGTVGTFIAKISGTENFGSTPVATAA 178 >gi|54112394|ref|NP_006021.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform b [Homo sapiens] gi|7414316|emb|CAB86192.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] gi|119585522|gb|EAW65118.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Homo sapiens] Length = 1143 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ + ++ ++Q D + Sbjct: 160 VYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTD-----I 214 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 215 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 274 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 275 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 334 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + +SN Sbjct: 335 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSN-------I 387 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 388 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 441 Query: 421 NCASPNSFFEANSTHEL 437 CA+ +FE S + Sbjct: 442 ACANKGYYFEIPSIGAI 458 >gi|326334019|ref|ZP_08200248.1| BatA protein [Nocardioidaceae bacterium Broad-1] gi|325948168|gb|EGD40279.1| BatA protein [Nocardioidaceae bacterium Broad-1] Length = 336 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 71/211 (33%), Gaps = 38/211 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A S I + +G F+ S + ++R+I Sbjct: 106 NRLEAAKAAAISFIDQL------PARYNVGLVSFSSSARVVTSPTTDHALVVRSIDGLG- 158 Query: 329 IDENEMGSTAINDAMQTAYDTIIS----------------SNEDEVHRMKNNLEAKKYIV 372 G TAI +A+ ++ D + S E + ++V Sbjct: 159 ---PPDGGTAIGEAVYSSIDDLQQILEEAAASGPSQEPEESEGSEKSEESEEERSPAHLV 215 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA--------RYFLSNCA- 423 LL+DG N+ +A A+ G+ TIA+ + L N A Sbjct: 216 LLSDGGNSAGRS-PVAAAEAAREAGLPTSTIAYGTEGSSATLPGGQSVEVREDTLRNLAD 274 Query: 424 -SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ A+S EL +++ D IG + + Sbjct: 275 TTGGRFYRASSADELREVYDD-IGTLVGHHI 304 >gi|254416823|ref|ZP_05030572.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196176369|gb|EDX71384.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 538 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 49/174 (28%), Gaps = 18/174 (10%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ID + + ++D + +K G T ++ Sbjct: 67 NLIDYLTADDFVSVVIYDDTAEVIIPPQL---VGDQAALKAKIGKIRARGCTNLSGGWLL 123 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA---KSQGIRIMT 402 + ++ E ++LLTDG ++ + A I T Sbjct: 124 GCSQVQANQSPERINR---------VLLLTDGLANYGIKDPQVLTKTALEKAEADIVTTT 174 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + F + A+ +F+ S + +++F + + + + Sbjct: 175 LGFGNYFNEDLLINM---ANAARGNFYFIQSPDDASQVFEIEMESLVSVVAQNL 225 >gi|94501046|ref|ZP_01307570.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] gi|94426793|gb|EAT11777.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] Length = 867 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 53/166 (31%), Gaps = 29/166 (17%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F V + + + + + T I +AM+ A + + ++ E Sbjct: 87 VWTFGQWVNMLIPPAEVNSEWRTNAKEKAKMINSFGLRTNIGEAMERATWKLAADSDFEQ 146 Query: 359 HRMKNNLEAKKYIVLLTDG-----------ENTQDNEEGIAICNKA----KSQGIRIMTI 403 H +LLTDG ++ ++ E I K+ G++I TI Sbjct: 147 HA-----------ILLTDGIVDIAADDDPQKDNKNEAERQRILTDVLSEYKNLGVKIHTI 195 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 A S + + L + ++ +L K F + Sbjct: 196 ALSNAADKVLLEKLALE---TGGMAEVVENSEQLVKAFLNAFDKAA 238 >gi|52001201|gb|AAU21481.1| inter-alpha trypsin inhibitor [Fundulus heteroclitus] Length = 201 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 30/88 (34%), Gaps = 3/88 (3%) Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 ++LLTDG+ T I + K + + + F + + + L N Sbjct: 9 ILILLTDGDPTTGVTNINTIRSNVKQKIAGKYPLYCLGFGFDVNFEFLKKMSLENNGVAR 68 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 +E + K F D + + V Sbjct: 69 RIYEDSDADLQLKGFYDEVATPLLTDVT 96 >gi|56696061|ref|YP_166415.1| hypothetical protein SPO1165 [Ruegeria pomeroyi DSS-3] gi|56677798|gb|AAV94464.1| conserved domain protein [Ruegeria pomeroyi DSS-3] Length = 257 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 47/177 (26%), Gaps = 26/177 (14%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + + S + + G+TA+ DA+ A Sbjct: 90 YGAAMDGDADGDLCKRVSMPFTPSPNAAGQILNLIDAIEPDGNTALTDAVNLAARVFDQP 149 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-GIRIMTIAF------- 405 IVL+TDG+ T + A+ G+ + I F Sbjct: 150 PRPG------------VIVLVTDGDETCGGAPCALAADLARDTPGLTVHVIGFRVRSQFF 197 Query: 406 ---SVNKTQQEKARYFLSNC---ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + C A+ + + EL + +G ++ + + Sbjct: 198 GWEGGDGNPTVPETISPAECLAQATGGEYVSTETVEELIRALNQTLGCPLYGALAPL 254 >gi|296394903|ref|YP_003659787.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296182050|gb|ADG98956.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 72/221 (32%), Gaps = 32/221 (14%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + R A IK +D + TV++G F S Sbjct: 104 DISKSMAATDVKPSRVDAARAAA------IKFVDGMAPTVQLGVVTFAGSAQPLVRPSTD 157 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + + TA + + TA I + N IVL++ Sbjct: 158 HETAKKVIDQ-MIRADKLEKQTATGEGIYTALQQIETIAGA---LGGKNHTPPARIVLVS 213 Query: 376 DGENTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQEKARYF---------LSN 421 DG+ T ++ A AK + I + T+AF + L Sbjct: 214 DGKETVPDDLNAPRGAYAAARTAKEKHIPVCTVAFGTKSGKITIDNQVDEVPVDLDSLKK 273 Query: 422 CA----SPNS---FFEANSTHELNKIFRDRIGNEIFERVIR 455 + SP + FF A S EL +I++ +I +R Sbjct: 274 ISDLSNSPGNSCRFFPAESQAELAQIYQSL-NEDIGYENVR 313 >gi|219841908|gb|AAI44536.1| COL22A1 protein [Homo sapiens] Length = 1319 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + R K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPRD---RAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A A GIRI + E + L AS F Sbjct: 148 LLTDGRSQDLVLDAAA---AAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|148555258|ref|YP_001262840.1| hypothetical protein Swit_2343 [Sphingomonas wittichii RW1] gi|148500448|gb|ABQ68702.1| hypothetical protein Swit_2343 [Sphingomonas wittichii RW1] Length = 166 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 40/155 (25%), Gaps = 11/155 (7%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +L G I ALL + V +D+ W L A + + Sbjct: 2 IRALLHRLRDDRRGVATIEFALLSVLFFFVMTAGLDIAMWYQQRLRLDSAVEQGA-MIAF 60 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV----------RDIVR 116 S+++ + + I N N Sbjct: 61 NSRASVDQSAIGTYVAAAAKLSSAPTVSISCNGNATCANSGRTCACISGGAPTYSALGCD 120 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 T + + ++ ++ LL P ++ +M Sbjct: 121 KTCSDGSLPGYYMRISATATASTLLVPAAMLGGTM 155 >gi|46445753|ref|YP_007118.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] gi|46399394|emb|CAF22843.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] Length = 362 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 62/226 (27%), Gaps = 24/226 (10%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 L + K L++ I + +G F Sbjct: 99 YLIVDQSSSMNEKISSNSFFERGRSFTKIDLLKVMAGQFILH-------RPSDLIGLVAF 151 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + LI + AI+ E TA+ A+ I+++ K Sbjct: 152 ARVPKILSPLTLDHELLINQLNDLKAINSMEEDGTAMGYAIYKTAHLIVATKHFAQELQK 211 Query: 363 NNLEAK----KYIVLLTDGENTQDNEE---------GIAICNKAKSQGIRIMTIA----F 405 A +V+LTDG + + AK+ GI + I F Sbjct: 212 KGKPAYEIKNAIMVVLTDGFQDPNRLDYGNRLRTIELDEAIAYAKNAGIHLYIINVDPKF 271 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 S + + + ++ + AN +L ++F E Sbjct: 272 SSPQFAPHRRQIETLAESTGGQLYLANQEKDLKRVFDTIDRLEKSS 317 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + D R+G + RV ++ S K + + T A+Q Sbjct: 65 IDGLSVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST--GTMTGLAIQ 122 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S E + + K +++TDG QDN I +A+ GI I I Sbjct: 123 FAMNVAFSEAEGGR----KSPDISKVAIIVTDGRP-QDNIR--DIAARAREAGIEIFAIG 175 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 L AS + S +L K F++ Sbjct: 176 VGR------VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQEAF 216 >gi|119612600|gb|EAW92194.1| collagen, type XXII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1626 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + R K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPRD---RAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A A GIRI + E + L AS F Sbjct: 148 LLTDGRSQDLVLDAAA---AAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|40805823|ref|NP_690848.1| collagen, type XXII, alpha 1 [Homo sapiens] gi|296434458|sp|Q8NFW1|COMA1_HUMAN RecName: Full=Collagen alpha-1(XXII) chain; Flags: Precursor gi|225000822|gb|AAI72420.1| Collagen, type XXII, alpha 1 [synthetic construct] Length = 1626 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + R K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPRD---RAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A A GIRI + E + L AS F Sbjct: 148 LLTDGRSQDLVLDAAA---AAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|22652113|gb|AAN03620.1|AF406780_1 alpha 1 type XXII collagen [Homo sapiens] Length = 1626 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + D R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + R K+ + Sbjct: 93 --GLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPRD---RAYKQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LLTDG + + A A GIRI + E + L AS F Sbjct: 148 LLTDGRSQDLVLDAAA---AAHRAGIRIFAVGVG------EALKEELEEIASEPKSAHVF 198 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 199 HVSDFNAIDKI 209 >gi|53723209|ref|YP_112194.1| hypothetical protein BPSS2192 [Burkholderia pseudomallei K96243] gi|52213623|emb|CAH39677.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 396 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 94/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|218548279|ref|YP_002382070.1| hypothetical protein EFER_0898 [Escherichia fergusonii ATCC 35469] gi|218355820|emb|CAQ88433.1| conserved hypothetical protein; putative exported protein [Escherichia fergusonii ATCC 35469] Length = 543 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 30/297 (10%), Positives = 79/297 (26%), Gaps = 35/297 (11%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------- 217 T S +++ + + S+ + +N F Sbjct: 87 ATFSFDVDTGSYANVRRFL-KTGSLPGADVVRVEELVNYFPLTEATKKNIPGCKGCEENS 145 Query: 218 -QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + + + + P + +D+S + ++ L++ Sbjct: 146 PFSINYELTPAPWNEKHTLLRLDIAANDIARSKLPPTNLVFLIDTSGSMNSDERLPLIKS 205 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 +L ++ ++ D ++ S + + + G Sbjct: 206 SLKLLVNELRDQDRISIV---------TYAGSARLLLSSTSGAEKNTILNAIANLQAGGG 256 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAK 394 T + AY+ + + N I+L TDG+ + + K + Sbjct: 257 TNGGAGVAMAYE----QAQAGYIKGGVNR-----ILLATDGDFNIGDDPSSVEDLVKKQR 307 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 GI + T+ + A ++ +S E K+ + + + Sbjct: 308 ESGITLSTLGVG----DNNYNEAMMVKIADTGNGNYSYLDSLSEAQKVLSNEMNQTL 360 >gi|198421591|ref|XP_002123589.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 401 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 40/389 (10%), Positives = 89/389 (22%), Gaps = 43/389 (11%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY------QVVLSSRYDL 139 + K YL + + + I T+ Q+ + Sbjct: 24 VETKALRYLQSPVVRAVSNSASSTNPGQIPSGTSPSYFGYDFHLGKTGNQQLRFTVGAPK 83 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 ++P L++ S T E + + ++ Sbjct: 84 SIDPSQPSLQTTPSASAAPATGDLLECPVSNLFSSNP------PPRPACNSRNPPGAQRG 137 Query: 200 PLNCFGQPADRTVKSYSSQNGKV-GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + + K + + N + + + Sbjct: 138 DAFGLSVDVSPNGRLSACSPTKQQNCPPDSIYSPGYCYNSMNRGSTWAPGPETNKIRCPI 197 Query: 259 VDSS-------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF------NDR 305 +D S +V++ V S D + + + + + Sbjct: 198 IDLDMLFVLDGSGSVGKDNFEIVKNWTIKVANSFDISDGYTQVGVIQYSHYWATEPLDKQ 257 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G ++ + +T A+ S+ + Sbjct: 258 SYIKTEVPLGKYRNKQEFSAAVRNISLHEYTTYTAHALNKTVFDFQQSSRWNRPK----- 312 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-- 423 K ++LLTDG +T + N +S I + E L A Sbjct: 313 -TSKVLILLTDGLSTDKQLLPSS-ANYVRSLNITTFAVGVG------EANEKELQEIANG 364 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEIF 450 + + ++ LNKI + Sbjct: 365 QGTNERVYYTSNFAGLNKIVSQLRSAILN 393 >gi|110833078|ref|YP_691937.1| hypothetical protein ABO_0217 [Alcanivorax borkumensis SK2] gi|110646189|emb|CAL15665.1| hypothetical protein ABO_0217 [Alcanivorax borkumensis SK2] Length = 151 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 9/128 (7%), Positives = 35/128 (27%), Gaps = 3/128 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +++ K G + L+ P+++G+ ++ ++ + A A+ Sbjct: 2 RRLIRRVKKIVKGVVALEFLLVSPLVIGLVYAAATYGVLFSWQMRMQVSVDRAAAAATTL 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + + + + + + + + S Sbjct: 62 DRNTTSDPGV---MAASLANGALANNVPTFMGSVPADACVTVGAEVVCDLSIALTDGGCS 118 Query: 128 AYQVVLSS 135 V + Sbjct: 119 EASGVATG 126 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 56/190 (29%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + K + +DR + L RT K Sbjct: 61 SRMAAAKQAFNEVLDATPKEVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPLDRTEAKA 120 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I ++ A D + K IVL++DGE+T + Sbjct: 121 AVATLTPTGWTPIGPSLLKAADDL------------EGGNGSKRIVLISDGEDTCAPLDP 168 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N + A+ ++ EL + Sbjct: 169 CEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAE--ATGGTYASVEHQDELTDRVNEL 226 Query: 445 IGNEIFERVI 454 + V Sbjct: 227 VDRAAEPVVT 236 >gi|110679844|ref|YP_682851.1| hypothetical protein RD1_2615 [Roseobacter denitrificans OCh 114] gi|109455960|gb|ABG32165.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 181 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 8/55 (14%), Positives = 24/55 (43%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 F Y ++ + G + +L+P++ + +++ +S + L + A+ Sbjct: 5 FRTYLRRFRREEDGQIAVEFVILVPLVFTIFMTAMELGIYSMRQMWLDRGLDIAV 59 >gi|87199536|ref|YP_496793.1| hypothetical protein Saro_1518 [Novosphingobium aromaticivorans DSM 12444] gi|87135217|gb|ABD25959.1| hypothetical protein Saro_1518 [Novosphingobium aromaticivorans DSM 12444] Length = 520 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/317 (11%), Positives = 85/317 (26%), Gaps = 17/317 (5%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII------ 62 + +K TG + AL + ++ + G+ DV + + L+ AA A + Sbjct: 4 TLIRSGLKDETGAVAVTYALALTGLIAMVGIGYDVSQVLTLDSELQNAADQAALAAVTQL 63 Query: 63 -----TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + +++ + I+E + D R Sbjct: 64 DRQSGAIDRAKNAANSLIANNTLLAKDAGVVTIKEAGFAFYATRADAEAGANPTTDNARA 123 Query: 118 TAVEMNPRKSAYQVVLS---SRYDLLLNPLSLFLRSMGIKSW---LIQTKAEAETVSRSY 171 ++ + Q L+ S ++P GI S + K A + S+ Sbjct: 124 RFAKVEVNGRSAQYALTPIGSAVASWVSPTLSAAAVAGIGSAICKVPPMKICAPSNDLSF 183 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 ++++ + S S L Q + + + + + + Sbjct: 184 NQDYVGKGLLLDTTSNSGLQLQFLGDASNTPSIRKALSADSPTGDCLDEENLAVGSQAGS 243 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + + + + + G + S +L A ++ Sbjct: 244 AVAAALNTRFDIYDAGGNSQNACPSGVTCSPALNSSKDLIRKTNSGGACLLSQNGTPAGS 303 Query: 292 NDTVRMGATFFNDRVIS 308 + F+ Sbjct: 304 WFWPELSTYGFDASGTV 320 >gi|326315855|ref|YP_004233527.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323372691|gb|ADX44960.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] Length = 355 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 62/226 (27%), Gaps = 48/226 (21%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + +DA + I + VR+G F + L Sbjct: 97 SMRAADVHPDRLTAAQDAAKAFIAEL------PRHVRVGIVAFAGSAQLAQLPTQNHEDL 150 Query: 320 IRTIVKTFAIDENEMGS------------TAINDAMQTAYDTI-----ISSNEDEVHRMK 362 + I G+ T I+ + + + HR Sbjct: 151 FKAIDSFQLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMDEIGRPPHRGS 210 Query: 363 NNLEAKKY------------IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 N A + I++LTDG+ T + + A +G+R+ T+ Sbjct: 211 NGRGADRPAPVAPGSYSSAAIIMLTDGQRTTG-VDPMEAAQWAADRGVRVYTVGVGTVAG 269 Query: 411 QQ----------EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + +F A + +L K++ Sbjct: 270 ETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETL 315 >gi|253584082|ref|ZP_04861280.1| batA protein [Fusobacterium varium ATCC 27725] gi|251834654|gb|EES63217.1| batA protein [Fusobacterium varium ATCC 27725] Length = 325 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 63/208 (30%), Gaps = 50/208 (24%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++++ +K R+G F+D + + + + Sbjct: 101 YPNRLEAAKRTLENLLQGLKG-------DRIGFIPFSDSAYIQMPLTDDYS-IGKNYINA 152 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T + A++ A + N D K I++L+DG + + Sbjct: 153 LDTNLISGGGTELYQALELAEKSFKEINSDN-----------KTIIILSDGGDFDEKSL- 200 Query: 387 IAICNKAKSQGIRIMTIAFSV-------------------NKTQQE----KARYFLSNCA 423 K + + +I ++ FL + Sbjct: 201 ----KFVKDNKMNVFSIGIGTEEGTIIPEYVNGKKVGFIKDQNGSAVISKLNSDFLKKLS 256 Query: 424 --SPNSFFEANS-THELNKIFRDRIGNE 448 S ++E N+ + + F+D I E Sbjct: 257 SESDGKYYEVNNLKDDSSNFFKDTINLE 284 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 51/427 (11%), Positives = 118/427 (27%), Gaps = 91/427 (21%) Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 + + + + ++ + L+S+++L N S + + Sbjct: 7 VASIVVSLIFTVTELICIPTFAEENLKRNNLTSQFELKQNINSNNYTVDESFNINYDISS 66 Query: 163 EAETVSRSYHKEHGVS--IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 Y + I V+D S SM D + A+ + Sbjct: 67 LMGIDKNDYDAVNSKDKEIVLVVDTSGSMN----DRNVEGKYKINYSAEPIFSEEEYRWY 122 Query: 221 K---------VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL---RHVI 268 + I + +P Y Y ++ + + Sbjct: 123 RRVSLGNGQKYIIHELIDTPEREIKIGKETYNEYIEINKKRYYLKYDYSDGNWYAIAYSE 182 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K ++ A + + + + ++G + + S + + ++ +I Sbjct: 183 PKIDELQKAAKNFVNKFE----IKANTKIGLVSYGNNGKEVHSLTNELDRINSSIDS--- 235 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHR------------------MKNNLEAKKY 370 + G T + D ++ A + + ++ + + KNN KY Sbjct: 236 -GLSIGGGTNVGDGIRMANGILNNGSDADKYIVLMTDGMPTAATCYNDIYYKNNRFYSKY 294 Query: 371 I------------------------------------VLLTDGENTQDNEEG---IAICN 391 +++ G+N N Sbjct: 295 GEILNGNGNPLSYFNYIGNYKYKFEYNPNDYTHEDEKIIMNYGDNDYGNIALNYSKEALK 354 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FF-EANSTHELNKIFRDRIGNEI 449 +A G+ I FS R L+ A+ + ++ EA EL +++ I +EI Sbjct: 355 RASESGVNNFVIGFSNGIN-----REKLNGIATEGNGYYREAMHGDELTDVYKR-IADEI 408 Query: 450 FERVIRI 456 V++ Sbjct: 409 NNPVVKN 415 >gi|163857684|ref|YP_001631982.1| hypothetical protein Bpet3372 [Bordetella petrii DSM 12804] gi|163261412|emb|CAP43714.1| putative membrane protein [Bordetella petrii] Length = 706 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 46/168 (27%), Gaps = 10/168 (5%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT---------ASVP 67 G + TA + + L + + + Y + L++AA A ++ A+ Sbjct: 13 RQRGSIAVSTAFAVLIGLVML-LSAQIGYLFYMKRELQKAADLAALSAVQVLAPTGAASD 71 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + ++ ++ N + K R + + Sbjct: 72 CAAGSPVAVAAQTSAVANVPAFVDSIAAANVTVDCKFWDPARADSTGMHLFEPDAASGGR 131 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 V + L S+ MG + A + ++ E Sbjct: 132 VNAVRVRIDKTLTALIPSVIGNWMGGTEVSVVAVASNTAPTAAFSVES 179 >gi|309777426|ref|ZP_07672384.1| hypothetical protein HMPREF0983_03048 [Erysipelotrichaceae bacterium 3_1_53] gi|308914822|gb|EFP60604.1| hypothetical protein HMPREF0983_03048 [Erysipelotrichaceae bacterium 3_1_53] Length = 365 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 46/126 (36%), Gaps = 3/126 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ---TAIITASVPLIQS 71 ++ G I +L+ ML V G+ VDV R+ + A+ AA + + ++ Sbjct: 1 MRKEHGSVTIFLTILLVPMLIVAGVFVDVGRYQLSKAAVTSAADLTLRSYLANYDTALKE 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + ++ Y N N + + ++ I +++ + + Sbjct: 61 VYGLYAVSQKDEGNRNSLYASYFDDNLGWNDQDSMDFLKLYKITSESSDNFSVKGMEGSS 120 Query: 132 VLSSRY 137 + + Sbjct: 121 LANPDM 126 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 31/78 (39%), Gaps = 2/78 (2%) Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-QEKARYFLSNCASPNS-FFEANSTHEL 437 + AK+ GI + TI FS +K LSNCAS +S F AN + L Sbjct: 58 NALDALTTQAYTNAKAAGISVYTIGFSTPSDSIDDKGLSLLSNCASSSSQAFVANDANTL 117 Query: 438 NKIFRDRIGNEIFERVIR 455 F + R+ R Sbjct: 118 ISAFNQIAKSVGSLRLTR 135 >gi|325108192|ref|YP_004269260.1| hypothetical protein Plabr_1627 [Planctomyces brasiliensis DSM 5305] gi|324968460|gb|ADY59238.1| protein of unknown function DUF1355 [Planctomyces brasiliensis DSM 5305] Length = 938 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 44/143 (30%), Gaps = 18/143 (12%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + G + +A+ A + ++ S Sbjct: 447 DSTPHTIVPLTNVSNPDDIAQQVLGIQSM--GGGIFVYEALVAAGNELMKS--------- 495 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A K+I+L +D ++++ ++ ++ GI + I + L Sbjct: 496 --DLATKHIILFSDAADSEEPGAYRSLIKDYENAGITVSVIGLGTT---ADVDAKLLQEI 550 Query: 423 A--SPNSFFEANSTHELNKIFRD 443 A + EL ++F + Sbjct: 551 ATLGSGNIMFTQDVAELPRLFTE 573 >gi|308472951|ref|XP_003098702.1| hypothetical protein CRE_04179 [Caenorhabditis remanei] gi|308268302|gb|EFP12255.1| hypothetical protein CRE_04179 [Caenorhabditis remanei] Length = 363 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 43/143 (30%), Gaps = 10/143 (6%) Query: 291 VNDTVRMGATFFNDRVISDPSF---SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 R+G +N + + + ++ + + TA Sbjct: 41 NFPYTRVGFVTYNYVATVNADLNKFKSPSALSQGVYNSYNLDNISPEKTSFLGTGLTTAG 100 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D + + K I++ N + + + N KS GI I+TI Sbjct: 101 DILT-----VQGSADGRVNNPKVIIVYASVLNGTGFVDPLLVANTLKSAGITIITIPLDT 155 Query: 408 NKTQQEKARYFLSNCASPNSFFE 430 + + L + ASP F+ Sbjct: 156 DHNG--VIQKQLVSIASPGFAFD 176 >gi|307294426|ref|ZP_07574270.1| hypothetical protein SphchDRAFT_1896 [Sphingobium chlorophenolicum L-1] gi|306880577|gb|EFN11794.1| hypothetical protein SphchDRAFT_1896 [Sphingobium chlorophenolicum L-1] Length = 556 Score = 50.8 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/305 (11%), Positives = 78/305 (25%), Gaps = 9/305 (2%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K L++S G AL + ++ VGG+ D R + + L+ AA A + A+ L Sbjct: 3 RIRKGLLQSTGGAVAPTVALSLFGLIAVGGIAFDYARMASLDSELQNAADQAALAAATQL 62 Query: 69 IQSLEEVSSRAKNSFTFPKQKI--EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 S A + + + N + ++ + Sbjct: 63 DGKTGACSRAANAASALIRNDARFANDGNASGLAITVANEATCDRTGFIKFYKNKDRSDT 122 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + ++ +N + + + A+ + V + + Sbjct: 123 GTLADADVNFVEVTVNSRTARFALTPVVAMFSSGPLSAKAYAGLGEAICKVPPVMICNPD 182 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + ++ A+R K G + S Sbjct: 183 EPIG-------NTDVDYDFAVANRIGKGLKLVTVGNGNSAWAPGNFGYLDTGSSTSNPNV 235 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + D SSL V + ++ + N+ + G + Sbjct: 236 ELREALGWISVPGDCSSLEGVKTRTGAGTTVTQAINTRFDIYERANNNAQSGTGNNGNGN 295 Query: 307 ISDPS 311 + Sbjct: 296 NASCP 300 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 68/197 (34%), Gaps = 29/197 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + V+ + S++ + ++ R+G F ++ + + + + Sbjct: 117 SRLDAVKQGVMSLLDA-------SEGNRIGLIAFGEQSFVMSDLTAYGDTVRYMV--SQL 167 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + D + A + + + +VL+TDG +T + + Sbjct: 168 ETGFAGDSTRLGDGVGYAVSLLADVDSERA-----------IVVLITDGNDTGSDLPPVE 216 Query: 389 ICNKAKSQGIRIMTIAFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 AK+ +++ A + ++ L A + +FF + + +++ Sbjct: 217 AARLAKALDVKLYVAAVGADVSTDREPIDEALLRRLAERTGGAFFRIAQVIDFDAMWQTL 276 Query: 445 IG-----NEIFERVIRI 456 G + ERV RI Sbjct: 277 EGLEPKQERVSERVKRI 293 >gi|281345873|gb|EFB21457.1| hypothetical protein PANDA_003362 [Ailuropoda melanoleuca] Length = 399 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 45/161 (27%), Gaps = 17/161 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V R + + D D +++G F+ + K+ I + + T Sbjct: 243 GFVKRMVNQADISTDEIQIGLLQFSSTPQEEFRLDQYSSKVD--IHRAITNVQQMNDGTR 300 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ + ++Y++++TDG D + + I Sbjct: 301 TGKALNF-------TRPFFDSSRGGRPNVQQYLIVITDGVAQDDVVMP---AKALRDRNI 350 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I I + K L P + + L Sbjct: 351 VIFAIGVG-----EAKNAQLLQITDDPQKVYYEENFESLQN 386 >gi|326676330|ref|XP_003200547.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 1164 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 63/182 (34%), Gaps = 25/182 (13%) Query: 272 HLVRDALASVIRSIKKIDN-VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + + + +D D ++ F+D ++ S S ++ Sbjct: 463 DNFQKIIRFLHSTAGALDQIGPDGTQVAIAQFSDDARTEFSLSSHSS--KEELLTAIQRV 520 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+T AM+ D++ K +V+LTDG +QD+ ++ Sbjct: 521 SYKGGNTKTGRAMKHVKDSVF------APVGGARRGVPKVLVVLTDGR-SQDDVLQVS-- 571 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRIGN 447 + +++G + I F+ + L + AS F + FR + N Sbjct: 572 QELQAEGYIVFAIGFADADYGE------LLSIASRPGDRHVFFVDD----LDAFRTIVEN 621 Query: 448 EI 449 + Sbjct: 622 IL 623 >gi|253996767|ref|YP_003048831.1| hypothetical protein Mmol_1398 [Methylotenera mobilis JLW8] gi|253983446|gb|ACT48304.1| conserved hypothetical protein [Methylotenera mobilis JLW8] Length = 463 Score = 50.4 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/366 (9%), Positives = 93/366 (25%), Gaps = 14/366 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G +I A+ + +++G+ G+++D+ + L+ AA ++ ++ + L + Sbjct: 12 MQRQRGAVALIVAICLVLLVGMLGLVLDLGHLYVTKTELQNAADSSSLSGARELNGKVTG 71 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++S + + N + D + A Sbjct: 72 INSAITRAIEAAGKNNFNLNSTAVTVNASNMWVGSCPSDGCMVPISSVTTDALAGDKTFL 131 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE-HGVSIQWVIDFSRSMLDYQ 193 L + F++ + T+ V+ Y + ++I D Sbjct: 132 KVDTGLRTINTWFIQVLP--GVANTTQTFGMAVAGKYAVDITPIAICEQPDPGTVHELGF 189 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE----KLSPYMVSCNKSLYY-MLYPGP 248 + T+ ++ G+ PY+ + + Sbjct: 190 ERGVSYNVQYANPLGPGTLFWIDPESNAPGVCPVISTNATLPYVCTGKVGFTPIIGDTVN 249 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + +S +S ++ A A ++K+ N V + Sbjct: 250 TNTGVSVPQLAALNSRFGDYTPQNKCDVATAPPDTNVKEYIYSNTGVGAPRDWMGTDPDK 309 Query: 309 DPSFSWGVHKLIRTIVKT----FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 T + + + + + Sbjct: 310 QSILFVDRATNGACKPSAPCKPVPYSLRAASGTDY--GVLWSGFRRAGATVAQWPTLYPG 367 Query: 365 LEAKKY 370 A Y Sbjct: 368 NTATSY 373 >gi|213961715|ref|ZP_03389981.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] gi|213955504|gb|EEB66820.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] Length = 550 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/370 (11%), Positives = 101/370 (27%), Gaps = 25/370 (6%) Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + ++ L N + ++ S Sbjct: 18 RNSSNSANQLPPPAPATTVEELTANKMASPETPPPPPPPPAYDAVVEEMEIANSEEPSQQ 77 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 + + + ++ + T S + +++ ++ + + + Sbjct: 78 QLRSNETYKEISENPFVAVAQQPVTTFSADVDRASYANLRRMLGY-GQLPPKDAIRIEEM 136 Query: 201 LNCFGQPADRTVKSYSSQNGKV-GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 +N F K +S + +P + L P + + Sbjct: 137 INYFDYDYPAPTKEATSPLRVTPELAPTPWNPEHLLLRIGLQAKKLDLAQAPPSNIVFLI 196 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S K L++ + ++ +K D R+ + S + Sbjct: 197 DVSGSMDEPNKLPLLKSSFKLLLTQLKPTD------RVAIVTYASGTKVALSSTPVKE-- 248 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R ++ + GST+ + +Q AY + + NN I+L TDG+ Sbjct: 249 -RQKIEKVLDNLYASGSTSGSSGIQLAYK----EAQKNFIKNGNNR-----IILATDGDF 298 Query: 380 TQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + K + GI + + F + + + A ++ + E Sbjct: 299 NVGISNPRELEKFIEKQRESGIYMSVLGFGMGNYRDDMAETIADK--GNGNYAYIDDLTE 356 Query: 437 LNKIFRDRIG 446 K+ + Sbjct: 357 AKKVLVNEFS 366 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/278 (11%), Positives = 74/278 (26%), Gaps = 37/278 (13%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF---GQP 207 + + + E + + S + ++D S + + N Sbjct: 35 VNQEQNIEVGPIVNEESTEGDEGQKDDSYEDLLDESVRTSESNKLETVIQTNLNQSLFVT 94 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 + VK + + L + + +D S Sbjct: 95 SASNVKPQIDFTYLGITPINPMQGQEFTVRYKLTPNPFQHNISKPKEIVLVLDGSGSMSG 154 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK--------- 318 K +++A I +K +DN+ + + ++ IS + Sbjct: 155 -TKLTNLKNAAKDFIDRLKGVDNLKVAIVVFSSNATINPISVSGTTKIKSTDKSSESSIP 213 Query: 319 ------------LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + T + + G T D ++ A + + Sbjct: 214 NYKTLQNEYFLDINDSRLITMINNIDAQGGTNTGDGLRKAEYLLSQKGDSVA-------- 265 Query: 367 AKKYIVLLTDGENTQDNEEGIA---ICNKAKSQGIRIM 401 K I+L++DG T + + + K + I Sbjct: 266 -NKTIILMSDGLPTYYSGSTESGVNYYKEIKDDVVGIF 302 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 51/417 (12%), Positives = 105/417 (25%), Gaps = 65/417 (15%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E A + Y + + + +V + T + V Sbjct: 440 ETENVPEADKDKNYTFNAYVNYEGTDSDKDYTLYTGKYDVYNTATGTKDGETRTTDSNGV 499 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + S R + ++ + V + D + + Sbjct: 500 ITLKADQYAVLLGSDAKRITESSKYKVKEINVDQDTYAVSANGGQVKVTQEKDSATTEPV 559 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + A R K + N V+ +S P D Sbjct: 560 SVGEVPRTTVTNTVVTAPRYRKYIKANNDGT-----YDLSLNVTGTQSGSSQTTVSPADI 614 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN--VNDTVRMGATFFNDRVISD 309 + + S+ + + + A+ S+ + + +N + +RM F+ + Sbjct: 615 VVVFDTSGSMSNPMGHNSRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFSTTAGNV 674 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +F+ + + G T + KK Sbjct: 675 SNFTDNAMD-----IVSAVNGLGADGGTNWEA----------ALKAANAKLTSGRKGVKK 719 Query: 370 YIVLLTDGENTQ--------------------------------------DNEEGIAICN 391 YIV ++DG+ T A+ Sbjct: 720 YIVFMSDGDPTYRTSSVRTGTDWLGRPIYDADDGWGLPAGVHGSGLSDRYGANLSSAVAE 779 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + + ++ S + T+ + S++ A ST ELNK F D IG Sbjct: 780 ANRRGDATLFSVGVSSDPTKMRGFADQ-----TKGSYYSATSTDELNKAFADIIGQI 831 >gi|294632153|ref|ZP_06710713.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] gi|292835486|gb|EFF93835.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] Length = 403 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 49/138 (35%), Gaps = 16/138 (11%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L RT KT + G T I A+ A + + + IVL++DGE Sbjct: 94 LNRTEAKTAVATLSPTGWTPIGPALLKAATDL------------EGGDGTRRIVLISDGE 141 Query: 379 NTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 +T + + + ++GI I T+ + +++ A+ ++ + E Sbjct: 142 DTCQPLDPCEVAREIAAKGIGLTIDTLGLVPDAKTRDQLSCIAD--ATGGTYTDVRHKDE 199 Query: 437 LNKIFRDRIGNEIFERVI 454 L+ + V Sbjct: 200 LSDRVGQLVDRAADPVVT 217 >gi|149921342|ref|ZP_01909796.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] gi|149817775|gb|EDM77239.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] Length = 739 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 40/119 (33%), Gaps = 8/119 (6%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICN 391 T + ++ +D ++ + + + + +L+TDG+ + + + Sbjct: 438 GSGTFTHRGLELVFDNLLDYQANPPALYPADEDTQYVNILITDGQYSTYSTDAQVQNALE 497 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-----FEANSTHELNKIFRDRI 445 G + I F E + N A+ S F+A+S EL I Sbjct: 498 ALLDAGSKTYVIGFGDGLNTTEAQLQLM-NMATWGSGGTEMPFDADSQAELELALGAII 555 >gi|90578363|ref|ZP_01234174.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] gi|90441449|gb|EAS66629.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] Length = 259 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 22/224 (9%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P + + N + +D+++ + KK + A+ + I I N Sbjct: 55 PALNNDNALVSDNWLATNYLLIFDGSGSMDNTNCGNGQKKIVAAKQAIQTFINDIPNSAN 114 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 V V D S + R +K D G+T + ++ ++Y + Sbjct: 115 VGLYVFD---------NKDASLRVPLGNNNRATLKQAIYDVTAGGATPLKSSLDSSYSAL 165 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 E K + +V++TDG+ +Q AI + + I TI F + + Sbjct: 166 ------ERQASKQLGYGEYNVVIVTDGDASQGENPQPAINRIYRDSPVTIHTIGFCIGEQ 219 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A+ + AN+ +L ++ + V Sbjct: 220 HALNAKGI-------TYYQSANNPEKLLAGLKNVLAESAEFDVA 256 >gi|239832733|ref|ZP_04681062.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] gi|239825000|gb|EEQ96568.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] Length = 579 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 55/201 (27%), Gaps = 3/201 (1%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + + + + G+ + AL+ P+ L V VD L+ A A + + Sbjct: 2 RGFVSRFLGARGGNLATMAALVSPLFLAVAAFCVDTSSLFLERRQLQNMADLAAVAGAAS 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD--REVRDIVRDTAVEMNPR 125 L Q+ E R + + + + N N T E + D + R Sbjct: 62 LSQAN-EAVLRQLQANGVDPVLMTDGYDPSIVNGKADNKTRVWVEKGNYFPDKGRAVEDR 120 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 A + L P +L+ I + A T + + + D Sbjct: 121 FVAGGANPDAVRVRLARPGNLYFGQSFISRPALGATGTAATKAEAAFSVGSRLLSLNTDQ 180 Query: 186 SRSMLDYQRDSEGQPLNCFGQ 206 S Sbjct: 181 SVLNGLLGGLLGTSLNLKLID 201 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 53/201 (26%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 +S + ++A + + +G F + + Sbjct: 108 SMASTDVPPNRLAAAKEAGKQFADQLTPA------INLGLVEFAANATLLVPPTTNRSAV 161 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 K TA + + TA I + V + IVL +DG Sbjct: 162 -----KAGIDSLQPAPKTATGEGIFTALQAI--ATVGSVMGGGEGPPPAR-IVLESDGAE 213 Query: 380 TQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA-- 423 + AK++G++I TI+F + L Sbjct: 214 NVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEI 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + F A+S L ++ Sbjct: 274 TDGQAFHADSLDSLKNVYSTL 294 >gi|320537259|ref|ZP_08037219.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145887|gb|EFW37543.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 332 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 54/217 (24%), Gaps = 41/217 (18%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + + +D + +++ G T + + Sbjct: 104 AKDINEKTRIQAAKDIITDFVQTYPA-------DAFGLTALASTAALVIPPTIQHEQFFA 156 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + TA+ + A N + I+LLTDGE+ Sbjct: 157 RLNSLQIGEL--GEGTALGMGLAVA-----------AAHFAKNTVKTQSIILLTDGESNT 203 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK------------------ARYFLSNCA 423 KS+ I I + + L A Sbjct: 204 GEIHPNLAAELIKSKKIGFYIIGIGKDGYANLEYVDPSTGEKREGTLQTIFNERELRELA 263 Query: 424 --SPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 + A S L +IF++ I I R ++ Sbjct: 264 HRGNGIYVSAKSFASLQEIFKN-ISQNISPTPARFSE 299 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 56/190 (29%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + K + +DR + L RT K Sbjct: 54 SRMAAAKQAFNEVLDATPKEVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPLDRTEAKA 113 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I ++ A D + K IVL++DGE+T + Sbjct: 114 AVATLTPTGWTPIGPSLLKAADDL------------EGGNGSKRIVLISDGEDTCAPLDP 161 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N + A+ ++ EL + Sbjct: 162 CEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAE--ATGGTYASVEHQDELTDRVNEL 219 Query: 445 IGNEIFERVI 454 + V Sbjct: 220 VDRAAEPVVT 229 >gi|156404155|ref|XP_001640273.1| predicted protein [Nematostella vectensis] gi|156227406|gb|EDO48210.1| predicted protein [Nematostella vectensis] Length = 1128 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 58/434 (13%), Positives = 114/434 (26%), Gaps = 60/434 (13%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 L+ A + I L + K L N N + Sbjct: 24 TTLQTAPGELAKKLAELAINGLGTSEMQGYYDKLTFK---SLDLDGNSILNDLATRFANK 80 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRY--------DLLLNPLSLFLRSMGIKSWLIQTKA 162 ++ V + + +Y + L S F + + + Sbjct: 81 LQTKVTIARKIKDAVEVSYAKSATVTSRTECCKADTRWLKYDSRFRTKVNLDEMCVIISG 140 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 A + + +++ I+ + ++ S S Sbjct: 141 AASSNPKQLQDNVLQTMKQNIENNPTL----TWQYFGSEEGLYTNYPMIRDSSSCS---- 192 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +Y+ P + S + + ++A +V+ Sbjct: 193 ----------SYDPRYRPWYVEAASPQPKDVILVVDYSGSMGGS---RLPIAKEAAKTVL 239 Query: 283 RSIKKIDNV------NDTVRMGATFFNDRVISDPSFSWGVHK-LIRTIVKTFAIDENEMG 335 ++ D V + R+ T + + S + I+K F E G Sbjct: 240 DTLNPRDRVAFLAFESGVRRVKVTSGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASG 299 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A A+D + + + N + I+ +TDG D I N K+ Sbjct: 300 GTMYAIAFNAAFDILDKYYK------EKNTTRRPVILFMTDGAPNDD---PGTILNTVKT 350 Query: 396 QGIRIMT----IAFSVNKTQQEKARYFLSNCA----SPNSFFEANSTHELNKIFRDRIGN 447 + + T + F + L + A + FE + T L + R + Sbjct: 351 RNQGLSTKADILTFGMGGGISPAGVDLLQSLAEQTLDGGARFEVSLTTALRDVSRHLLAV 410 Query: 448 EIFER----VIRIT 457 R V R T Sbjct: 411 ARSARKLVQVGRYT 424 >gi|298713908|emb|CBJ33776.1| conserved unknown protein [Ectocarpus siliculosus] Length = 977 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 46/146 (31%), Gaps = 17/146 (11%) Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 SD + G + G T I + + + + Sbjct: 117 FSDAASDGGTFGSEAEFNEFVDNASWIEGDTNIEAGLSKGRELLANGT----------ST 166 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP- 425 +++L+TDG+ + + + A+ +G + + + + L + Sbjct: 167 RTSFLILITDGDWNRG-GDPQIEADAARDEGTIVYAVGVG-----PDVSEATLLSIGGDL 220 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 + F+A++ EL+ + + + Sbjct: 221 TNVFDASNFTELDNTLDEIVSTSVDS 246 >gi|182414211|ref|YP_001819277.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841425|gb|ACB75677.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 611 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 61/187 (32%), Gaps = 40/187 (21%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F+ S L + G T + + TA ++ Sbjct: 132 RVGLIVFSGTAFLQSPLSSDYEILREFLPALDPTFL-PEGGTNYDALINTALTAFGATGA 190 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---- 411 A +++++L+DGE T+D+ + + K++GIR++ + Sbjct: 191 -----------ADRFLIILSDGEATEDDWR--SHVAELKNRGIRVIALGVGTTAGAMIPD 237 Query: 412 ---------------QEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRI-----GNEI 449 + L AS + +A++ +L + + G + Sbjct: 238 GAGGLVKDERGAVVLSKLESNTLRELASATSGVYEDASTWVDLAAVLAATVEAGQKGQFL 297 Query: 450 FERVIRI 456 + IR+ Sbjct: 298 EKNTIRL 304 >gi|219848048|ref|YP_002462481.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542307|gb|ACL24045.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 446 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 53/197 (26%), Gaps = 24/197 (12%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 P + + S + AL SV+ + D RM + Sbjct: 79 PVWQLANPLSSEARSQFSSPIDYTVRALHSVVERLTPDD------RMALIACASDALVLA 132 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + G + ST + +Q A + + A + Sbjct: 133 PSTPGHRRTDLIGAIARLPVLRLGESTNLAQGLQLALAQF----------VVTDEPAVRR 182 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSF 428 +VLLTDG T D A+ +A + I I TI L+ A S Sbjct: 183 VVLLTDG-FTTDTTMCTALAREAADRSITISTIGLG-----NTFEETLLTQIADLSGGRA 236 Query: 429 FEANSTHELNKIFRDRI 445 + I + Sbjct: 237 SFVQEAGHIPTIISAEL 253 >gi|108761783|ref|YP_632825.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] gi|108465663|gb|ABF90848.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] Length = 694 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/373 (7%), Positives = 86/373 (23%), Gaps = 9/373 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-TAS 65 F ++ + G ++ AL++ +M V++ + L+ A +A A+ Sbjct: 2 FTRTLRQSFRRQEGQALVLAALMVLIMSIAVLTTVNIGHTVHERIRLQNTADSASYSMAA 61 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + L+ + + A + N Sbjct: 62 MEARAFNFYAYANRTQVSHYVSAMMWQSLLSLIYSAEAFLTDIYGFMRTLNPCAGKTNNL 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + ++ + + + + + ++R Sbjct: 122 FWKIACPILENVIPYVSAVIKAIGRIMTAYRNYFLRPIQQVITRLNPDRIVGEWIIPAHR 181 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + + Y + + + + + C S + + Sbjct: 182 VMNGVMYFASQAVMMSASTHVTQTTQTVIDDNDSNISSLASQLATGLYSQCLFSQAHSPH 241 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND-------TVRMG 298 G + + + K + A S+ D + Sbjct: 242 AGGKPLAPNTWKNPFGALDVTKKAAKDPIARAKRSMGGITNATRYACDANGGACPQGFIT 301 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST-AINDAMQTAYDTIISSNEDE 357 + D + S+ + + ++G T ++ A + Sbjct: 302 SRRMGDLLPLPDMLSFMRDFFNGGVDIPGIFEFGKIGQTRMLSTGFPNANAIRAGRGPGQ 361 Query: 358 VHRMKNNLEAKKY 370 R Sbjct: 362 EGRNYIRDWNDNI 374 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 49/181 (27%), Gaps = 23/181 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++ A+ VI + +D R+ F+ + +K Sbjct: 74 KLQSMKRAMQFVIMKLTPVD------RLSVVSFSSSATRHCPLRSVTQAA-QAELKGIVD 126 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I + TA I + + L++DG+ T + + Sbjct: 127 GLVANGGTNIKAGLDTALAVIAGRATTKARTPN--------VFLMSDGQQTDGDARQVD- 177 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + T F + + +F ++ F +G + Sbjct: 178 -----PGNVAVYTFGFGKDAD-HALLSDVAKK-SPGGTFNSVPDGGNVSAPFSQLLGGLL 230 Query: 450 F 450 Sbjct: 231 S 231 >gi|269128332|ref|YP_003301702.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268313290|gb|ACY99664.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 149 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 41/138 (29%), Gaps = 9/138 (6%) Query: 7 FIFYSKKLIKS-CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ G + T LL P +L + G+LVD + A+ A + Sbjct: 4 LRRALRRAGTDGDRGTIAMYTVLLTPSVLLLAGLLVDGGLAIHARQRAADMAEQAARAGA 63 Query: 66 --------VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + + ++ +++ +E R+ ++ +R + Sbjct: 64 NEIDVQTLRATGEPRIDAAAACASAYDLLDAYGQEVADRDCLPGAEEVRITVTIRVRPQL 123 Query: 118 TAVEMNPRKSAYQVVLSS 135 A+ S+ Sbjct: 124 LAIVPGLGHFTMTSTASA 141 >gi|254447511|ref|ZP_05060977.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] gi|198262854|gb|EDY87133.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] Length = 670 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 39/403 (9%), Positives = 99/403 (24%), Gaps = 38/403 (9%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + + + + E N + N E + + Sbjct: 124 EIREKAQAKAIYQDAKKQGKRAALVEQQRPNLFTSKVANIAPGETIHVELRYTEALAIDG 183 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + L + +P ++ + + + + + S Sbjct: 184 REFSLRLPTTMTSRFHPQESSIKPVEQGPIVPSSAVGQSSHLADITVDIDGGWPIQNIES 243 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY----------MVSC 236 S +R G V + + + + Sbjct: 244 PSHPFVERSLGRGYRVHMGSSFSDKVAMDQDVVLRWQLDPVASASGAVFSEEYKGEHYAL 303 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 +D+S + + + AL+ + + D R Sbjct: 304 VMLRTPDEMTSGPRMPREVVFVIDTSGSMAGQ-RMYHAKQALSQAVERLSPDD------R 356 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 FN++ S + + G T + A++ A Sbjct: 357 FNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSV------- 409 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +++ + ++L+TD + NE I + + +G R+ T+ V+ Sbjct: 410 -----RSDPAYLRQVILITD--ASVGNEAEILRVVERQRKGARLFTVGIGVSPN-----S 457 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 Y L A + S E+ + + ++ Sbjct: 458 YLLRKAAQVGQGDYVYIASGQEVKARMQRLFAKLENPVLKQLN 500 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 48/129 (37%), Gaps = 12/129 (9%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + R K G T + A+++A + + + IVL+TDGE Sbjct: 110 VDRVQAKAAVATLRPTGFTPVGLALRSAAQDLGTG------------STARRIVLITDGE 157 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 +T + + + +QG +++ + ++ + + A+ ++ A S EL Sbjct: 158 DTCAPPDPCEVARELAAQGTKLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAAQSADELT 217 Query: 439 KIFRDRIGN 447 + + Sbjct: 218 GRIKQLVDR 226 >gi|149913213|ref|ZP_01901747.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] gi|149813619|gb|EDM73445.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] Length = 512 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 55/164 (33%), Gaps = 6/164 (3%) Query: 3 FDTKFI---FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 ++ I S++++ G + + ++L +GG+ +D Y+ LKQ A Sbjct: 1 MGSRAINPSRTSRRILHQEDGAGSVFGIFAVVMILLLGGVALDATNLWRYQQMLKQTADV 60 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A +V L + ++ +F + + + N N + + Sbjct: 61 AAHAGTVQLASGGDATNAY-NAAFALVEANMPQSWYGNLFANPQADIEVVHYD--PDTDL 117 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + + +A V L D + LR I + T Sbjct: 118 LGGSGPLNAVAVTLRRDGDSGNPVPTYLLRIADIMNMGDSTDLS 161 >gi|94732992|emb|CAK03801.1| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1056 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 38/299 (12%), Positives = 87/299 (29%), Gaps = 33/299 (11%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS- 216 +KAE + + + + +V D S + Q + A + + Sbjct: 91 YNSKAEMDYDGEDIDSQMSLKLDFVYDPSFKNQVNYSHTAVQIPTDIYKGAPVILNELNW 150 Query: 217 ----SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + RD+ + + + YP + + D I+ Sbjct: 151 TQALERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQGAS 210 Query: 273 LVRDALASVIRSIKKI---------------DNVNDTVRMGATFFNDRVISDPSFSWGVH 317 +D + V S D ++D + FN++ + + Sbjct: 211 SPKDMVILVDVSGSVSGLTLKLIKASVTEMLDTLSDDDYVNVARFNEKAEAVVPCFDHLV 270 Query: 318 KLI---RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + I K G+T A++ +++ K I+L Sbjct: 271 QANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNKTNVPRANCN------KIIMLF 324 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 TDG + + ++ +R+ T + + ++ C++ +FE S Sbjct: 325 TDGGEDRAQDIFEQY--NWPNKTVRVFTFSVGQHNYDVTPLQWI--ACSNKGYYFEIRS 379 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 65/231 (28%), Gaps = 28/231 (12%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF----VDSSSLRHVIKK 270 +G S Y +C + S +D S ++ Sbjct: 91 NGVDHGCEFQCVSVGSSYRCACPEGRRLQADGKSCGRCGSGPTDLVLVIDGSKSVG-AQQ 149 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ S+ R+G F+ RV ++ G H Sbjct: 150 FELVKRWVGELVGSLDVSPAGT---RVGLVQFSSRVRTEFPL--GRHGTKAEAEAAVRAV 204 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A++ + S E ++ +L+TDG +QD+ Sbjct: 205 TPMDKGTMTGLALRHLVERGFSEAEGARPGS------RRVGLLVTDGR-SQDDVSP--WA 255 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 +AK +GI + + + L AS A L Sbjct: 256 ARAKDRGIVMFAVGVG------KAVEEELREIASDPVERHVSYAADFGALT 300 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 29/189 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAYVQSELVQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPT-AAAGGTSICTGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAVIHTVALGPSAAQELEQLSKM----TGGFQTYASDQAQNN 466 Query: 437 -LNKIFRDR 444 L F Sbjct: 467 GLIDAFGAL 475 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 873 VKKADVGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 930 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 931 HSNVLFT-----EEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 984 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 985 VG------NANQDELETMAGKKENTVHVDNFDKLRDIY 1016 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 93/335 (27%), Gaps = 19/335 (5%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ + + ++ ++ V ++ + S S Sbjct: 308 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 367 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP--ADRT 211 + S +E + ++ +M ++ + T Sbjct: 368 QGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 427 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SYS G +K+ + + + + ++E + +K Sbjct: 428 HSSYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRK 487 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + I + VR+G ++ + + S + + K + Sbjct: 488 KEFEQIQIFMSSVIDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGID--LKKAVFNIK 545 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + + + A Y+++LTDG++ E N Sbjct: 546 QLKGLTFTGKALDFILPLI------KKGKTERTDRAPCYLIVLTDGKSNDSVLEP---AN 596 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +++ I I I E + L A + Sbjct: 597 RLRAEQITIHAIGIG------EANKTQLRQIAGKD 625 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S + Sbjct: 674 TNFETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPI 733 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G + NE + +K+++LLTDG+ + Sbjct: 734 NRNTLTGG------------ALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGP-- 779 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ +A+ L + S F + L I I Sbjct: 780 -ATALRSKSVTIFSVGV----YGANRAQ--LEEISGDGSLVFHVENFDHLKAIESKLI 830 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 41/112 (36%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G I A+Q A + + + K I+ Sbjct: 2066 TYDNQSIMKNYIYTSLQQL--NGDATIGLALQWAMEGLFLGTPNPR--------KHKVII 2115 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG + I+ R + AS Sbjct: 2116 VISAGENHEEKEFVKTVALRAKCQGYVVFVISLG------STQRDEMEELAS 2161 >gi|260912479|ref|ZP_05919015.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] gi|260633398|gb|EEX51552.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] Length = 591 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/213 (9%), Positives = 58/213 (27%), Gaps = 42/213 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + V + + + +++ R+G F + Sbjct: 134 DISNSMMAQDVVPSRLEKSKLLIENLVDHFTH-------DRIGLVVFAGDAFVQLPITTD 186 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++ T I A+ + + + K ++++T Sbjct: 187 YVSA-KMFLQNIDPALIATQGTDIAKAINLSMRSFSQQ-----------KDIGKAVIVIT 234 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-----------ARYFLSN--- 421 DGE+ + A +GIR+ + K + L+ Sbjct: 235 DGEDHEGGAL--EAAKAANERGIRVFILGIGSTKGSPIPLAEGGYLADRSGQTVLTALNE 292 Query: 422 --C-----ASPNSFFEANSTHELNKIFRDRIGN 447 C A ++ ++T++ + + + Sbjct: 293 SMCKQIAQAGNGTYIHVDNTNDAQEKLNNELAK 325 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 873 VKKADVGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 930 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 931 HSNVLFT-----EEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 984 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 985 VG------NANQDELETMAGKKENTVHVDNFDKLRDIY 1016 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 93/335 (27%), Gaps = 19/335 (5%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ + + ++ ++ V ++ + S S Sbjct: 308 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 367 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP--ADRT 211 + S +E + ++ +M ++ + T Sbjct: 368 QGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 427 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SYS G +K+ + + + + ++E + +K Sbjct: 428 HSSYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRK 487 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + I + VR+G ++ + + S + + K + Sbjct: 488 KEFEQIQIFMSSVIDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGID--LKKAVFNIK 545 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + + + A Y+++LTDG++ E N Sbjct: 546 QLKGLTFTGKALDFILPLI------KKGKTERTDRAPCYLIVLTDGKSNDSVLEP---AN 596 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +++ I I I E + L A + Sbjct: 597 RLRAEQITIHAIGIG------EANKTQLRQIAGKD 625 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S + Sbjct: 674 TNFETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPI 733 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G + NE + +K+++LLTDG+ + Sbjct: 734 NRNTLTGG------------ALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGP-- 779 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ +A+ L + S F + L I I Sbjct: 780 -ATALRSKSVTIFSVGV----YGANRAQ--LEEISGDGSLVFHVENFDHLKAIESKLI 830 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 41/112 (36%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G I A+Q A + + + K I+ Sbjct: 2066 TYDNQSIMKNYIYTSLQQL--NGDATIGLALQWAMEGLFLGTPNPR--------KHKVII 2115 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG + I+ R + AS Sbjct: 2116 VISAGENHEEKEFVKTVALRAKCQGYVVFVISLG------STQRDEMEELAS 2161 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 57/173 (32%), Gaps = 20/173 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR +A+++ + + + R+G ++DR + G + Sbjct: 54 FEKVRQWVANLVDTF---EVGPERTRVGVVRYSDRPATAFEL--GRFGSRAAVRAAARQL 108 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T DA++ + + K+ +LLTDG + + + Sbjct: 109 AYHGGHTHTGDALRFITRHSFTPRAGGRP---GDRAFKQVAILLTDG---RSQDLVLPAA 162 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 A+ GIRI + E R L AS F + ++KI Sbjct: 163 TAARRAGIRIFAVGVG------EALREELEEIASEPTAAHVFHVSDFDAIDKI 209 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 57/173 (32%), Gaps = 20/173 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 VR +A+++ + + + R+G ++DR + G + Sbjct: 54 FEKVRQWVANLVDTF---EVGPERTRVGVVRYSDRPATAFEL--GRFGSRAAVRAAARQL 108 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T DA++ + + K+ +LLTDG + + + Sbjct: 109 AYHGGHTHTGDALRFITRHSFTPRAGGRP---GDRAFKQVAILLTDG---RSQDLVLPAA 162 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 A+ GIRI + E R L AS F + ++KI Sbjct: 163 TAARRAGIRIFAVGVG------EALREELEEIASEPTAAHVFHVSDFDAIDKI 209 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 67/200 (33%), Gaps = 26/200 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 +D L + K + ++A+ IR + + R+ F Sbjct: 291 YADVKLMDIVLCLDTSGSMKEFSGAYMKMDIAKEAIVKYIRYLSRT-----NDRLSMVLF 345 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAID--ENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 N R WG H + + I + + G T I +A++ A + SN Sbjct: 346 NFRADIL----WGPHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSN------ 395 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K+I+ +TDG T + I K + G+ + T+A N R LS Sbjct: 396 -----YPNKHIICITDGR-TVNASSCIKEAVKLRRMGVTLSTVAVGDNSDFDLLMR--LS 447 Query: 421 NCASPNSFFEANSTHELNKI 440 F + N L+K Sbjct: 448 KI-GNGLFIKINDISNLDKA 466 >gi|312135597|ref|YP_004002935.1| von willebrand factor type a [Caldicellulosiruptor owensensis OL] gi|311775648|gb|ADQ05135.1| von Willebrand factor type A [Caldicellulosiruptor owensensis OL] Length = 667 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/210 (12%), Positives = 60/210 (28%), Gaps = 36/210 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-- 327 + ++ + N + +T + + + S + + + + Sbjct: 121 DFDTFVNDNSNFYVKGFYLQNGSWQKGNSSTIYGPYSLPNTCTSSLLDLTNTSAINSAKN 180 Query: 328 -AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-------- 378 + N G T + A+ A + + +S KY++ +TDG Sbjct: 181 LIDNMNASGGTNMEAALNKAKNLLNASPSGN----------DKYVIFITDGMPTFYLNGT 230 Query: 379 --------------NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 N E ++ G ++ + +K L + Sbjct: 231 HNGYPLVDGPGLQPNNTTKSETLSAVQSLSQSGTKLFVVGVDTTGADVDKTFIELMASTA 290 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +ST+ LN I +D I V Sbjct: 291 NGKSYYISSTNALNSILQDIF-KIINLAVT 319 >gi|73538307|ref|YP_298674.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121644|gb|AAZ63830.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 353 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/230 (13%), Positives = 58/230 (25%), Gaps = 58/230 (25%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A +I + VR+G F + R + Sbjct: 106 TRISAAQQAARDLIVGLPAS------VRLGIVSFAGTATVVLRPTSN-----RQDMLDAI 154 Query: 329 IDENEMGSTAINDAMQTAYDTIIS-----------------------------------S 353 TA + A + Sbjct: 155 DRFQLQRGTATGSGLIQALAVLFPDDGIDLEAILFADEPVFSTRRAVPLDEAAAADAVRK 214 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV-NKTQQ 412 E + ++LL+DG T + + A +G+R+ T+ F Sbjct: 215 REQATQSAQPGSYRHGAVILLSDGRRTVG-PDPVDAARMAAQRGVRVYTVGFGTLGGNAP 273 Query: 413 EK--------ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 E L A + +F+A S +L++++R ER Sbjct: 274 ETSLSYYMQLDEPALRAVATITGAEYFQAGSAADLSQVYRQLSARFALER 323 >gi|313243983|emb|CBY14858.1| unnamed protein product [Oikopleura dioica] gi|313245509|emb|CBY40220.1| unnamed protein product [Oikopleura dioica] Length = 1393 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 63/208 (30%), Gaps = 22/208 (10%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + +D S + K +DA S+I S+ + FN Sbjct: 778 NAQSSSSAPYNVVFVMDKSGSM-IGTKLDQTKDAFRSMISSLD------RNAKFSIVGFN 830 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + ++F + G T ++ A+ A + S + V Sbjct: 831 YATTAWRNKLVRATNYNVEEARSFISRISAGGGTNMHAALLDAIELCNSESSSTVP---- 886 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQ-----GIRIMTIAFSVNKTQQEKARY 417 I+ +TDG T E I + KS+ I + I F + +R Sbjct: 887 -----CMIMFMTDGTATVGVTEESRILADVTKSRQQGKANIALNVIGFGAGISYSFLSRL 941 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRI 445 + N FE + + F D + Sbjct: 942 SVLNSGIARQIFEDTNAAFQMQGFFDEV 969 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 832 VKKADIGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 889 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 890 HSNILFT-----EEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 943 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 944 VG------NANQDELETMAGKKENTVHVDNFDKLRDIY 975 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 93/335 (27%), Gaps = 19/335 (5%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ + + ++ ++ V ++ + S S Sbjct: 267 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 326 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP--ADRT 211 + S +E + ++ +M ++ + T Sbjct: 327 QGVPQIAVLVTHRASDDVVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 386 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SYS G +K+ + + + + ++E + +K Sbjct: 387 HSSYSHLESYSGNFLKKIHNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRK 446 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + I + VR+G ++ + + S + + K + Sbjct: 447 KEFEQIQVFMSSVIDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGID--LKKAVFNIK 504 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + + + A Y+++LTDG++ E N Sbjct: 505 QLKGLTFTGKALDFILPLI------KKGKTERTDRAPCYLIVLTDGKSNDSVLEP---AN 555 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +++ I I I E + L A + Sbjct: 556 RLRAEQITIHAIGIG------EANKTQLRQIAGKD 584 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 41/112 (36%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G I A+Q A + + + K I+ Sbjct: 2025 TYDNQSIMKNYIYTSLQQL--NGDATIGLALQWAMEGLFLGTPNPR--------KHKVII 2074 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG + I+ R + AS Sbjct: 2075 VISAGENHEEKEFVKTVALRAKCQGYVVFVISLG------STQRDEMEELAS 2120 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S + Sbjct: 633 TNFETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPI 692 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G + NE + +K+++LLTDG+ + Sbjct: 693 NRNTLTGG------------ALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPAM 740 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ +A+ L + S F + L I I Sbjct: 741 ---ALRSKSVTIFSVGV----YGANRAQ--LEEISGDGSLVFHVENFDHLKAIESKLI 789 >gi|294011133|ref|YP_003544593.1| tight adherence protein TadE [Sphingobium japonicum UT26S] gi|292674463|dbj|BAI95981.1| tight adherence protein TadE [Sphingobium japonicum UT26S] Length = 126 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 11/120 (9%), Positives = 33/120 (27%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L + G I L +P + + + + + L+ A AS+ + Sbjct: 3 RLARDQRGAAAIEFVLAVPPFIMLLMGALQLGIIACARTGLQHAVDEGARYASIFPRPTD 62 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 EE+ ++ ++ ++ + + + + Sbjct: 63 EEIIAKVRSREFGLDPAYTNDPTVVQGTQYGVSYREITMTYSRPLNFIVYQTPAISISYT 122 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 620 VKKADIGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 677 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 678 HSNILFT-----EEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 731 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 732 VG------NANQDELETMAGKKENTVHVDNFDKLRDIY 763 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 93/335 (27%), Gaps = 19/335 (5%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ + + ++ ++ V ++ + S S Sbjct: 55 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 114 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP--ADRT 211 + S +E + ++ +M ++ + T Sbjct: 115 QGVPQIAVLVTHRASDDVVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 174 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SYS G +K+ + + + + ++E + +K Sbjct: 175 HSSYSHLESYSGNFLKKIHNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRK 234 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + I + VR+G ++ + + S + + K + Sbjct: 235 KEFEQIQVFMSSVIDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGID--LKKAVFNIK 292 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + + + A Y+++LTDG++ E N Sbjct: 293 QLKGLTFTGKALDFILPLI------KKGKTERTDRAPCYLIVLTDGKSNDSVLEP---AN 343 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +++ I I I E + L A + Sbjct: 344 RLRAEQITIHAIGIG------EANKTQLRQIAGKD 372 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S + Sbjct: 421 TNFETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPI 480 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G + NE + +K+++LLTDG+ + Sbjct: 481 NRNTLTGG------------ALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPAM 528 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ +A+ L + S F + L I I Sbjct: 529 ---ALRSKSVTIFSVGV----YGANRAQ--LEEISGDGSLVFHVENFDHLKAIESKLI 577 >gi|170741515|ref|YP_001770170.1| hypothetical protein M446_3336 [Methylobacterium sp. 4-46] gi|168195789|gb|ACA17736.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 407 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 20/193 (10%), Positives = 44/193 (22%), Gaps = 1/193 (0%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 D + +G+ I L + ++ + G VD+ R L+ A + Sbjct: 1 MDHAMWVSLSAFRTNVSGNVAAIFGLTLLPLMFLLGSSVDLSRTLRERARLQALTDQAAL 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 +A S + + T +++ + Sbjct: 61 SAVTSSSASQDPADTVQTFFPQPSDPDQRRLAPTATVTVEGSTVTVTSTQNVATAFTGIL 120 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + ++ P L + I A V+ + S Sbjct: 121 GVAQIPITARSTA-APGNDGPPVCVLALNPTATDAILFSGNATVVASNCVIYSNSSAANA 179 Query: 183 IDFSRSMLDYQRD 195 + S Sbjct: 180 LTRQGSASVQATG 192 >gi|119952971|ref|YP_945180.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861742|gb|AAX17510.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 341 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 41/124 (33%), Gaps = 13/124 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 IK + +G F + + + + +D +A+ + Sbjct: 130 IKYFVYQRENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDL--GNGSALGLGIS 187 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + K++ K+ +++LTDG D + N A+ ++I +I Sbjct: 188 IALSHL-----------KHSEAPKRSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSIG 236 Query: 405 FSVN 408 + Sbjct: 237 IGSS 240 >gi|126309708|ref|XP_001376394.1| PREDICTED: similar to B-factor, properdin [Monodelphis domestica] Length = 764 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/232 (12%), Positives = 56/232 (24%), Gaps = 30/232 (12%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRD--------ALASVIRSIKKIDNVNDTVRMG 298 + + +D + ++ A + I K+ + R Sbjct: 255 YSPGGQQNRKIVLDPAGSMNIYLVLDASDSIGKNNFTGAKKCLSSLIDKVASYGVEPRYA 314 Query: 299 ATFFNDRVISDPSFS----WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + + S + + + K D T A+ Y+ +I Sbjct: 315 VVTYATEAKAVVKLSDKESSNADWVKQELEKIKYSDHRLKAGTNTKKALTMLYEMMILQE 374 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS--------QGIR-----IM 401 IVL+TDG + +A + + + R + Sbjct: 375 SQNDINWNKTRH---VIVLMTDGNYNMG-GDPVAAIEQIREFLDIGRNRKNPRENYLDVY 430 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 QEK S F+ +L +F I + Sbjct: 431 VFGIG-PLVDQEKINALASKKDGEKHVFKVKDMEDLENVFYMMIDESKALSL 481 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 23/163 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 +++ L+++I + D +R+G + ++ ++ S + +++ + Sbjct: 280 IKEFLSNLISPFSIAE---DKIRVGLSQYSSDPRTEWELS--AYSTREQVLEAVRNLRYK 334 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T A+ + ++ LEA+K ++LLTDG+ +QD+ Sbjct: 335 GGNTFTGLALTHVLE------QNLKPDAGARLEAEKLVILLTDGK-SQDDANL--AAQTL 385 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 K+ GI I I + L AS S + Sbjct: 386 KNLGIEIFAIGV------KNADEAELRQVASEPLELSVYNVLD 422 >gi|145547190|ref|XP_001459277.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427101|emb|CAK91880.1| unnamed protein product [Paramecium tetraurelia] Length = 603 Score = 50.4 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 64/178 (35%), Gaps = 26/178 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K +LV+D+L +++ + D R+ F SF + ++K Sbjct: 135 RKINLVKDSLRYLMKILGPED------RICIIVFTTVAHIVTSFIRNTQENKP-LLKKAI 187 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEG 386 ++ + ST I+D M A + + I LL+DG++ + Sbjct: 188 LELKGLASTNISDGMNKALWMLKNR---------KYKNPVSCIFLLSDGQDDYKGAEQRV 238 Query: 387 IAICNKAK-SQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIF 441 K + I T + Q+ Y ++ A +F+ ++ ++ + F Sbjct: 239 FDQLQLLKIEEKFVIHTFGYG-----QDHDAYVMNQIAKYREGNFYYIDNINKASDYF 291 >gi|330688331|gb|AEC32936.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 482 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 40/134 (29%), Gaps = 12/134 (8%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGS 336 + I + DT+ + F + G R + + G+ Sbjct: 1 MLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKRQALSKVTELRKSYSPYGT 60 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + A+ + + N +A + ++L+TDG + NK K + Sbjct: 61 TNMTAALDEVHKHLNDRV--------NREKAIQLVILMTDGIPNSKYTALE-VANKLKQR 111 Query: 397 GIRIMTIAFSVNKT 410 + + I Sbjct: 112 NVSLAVIGIGQGIN 125 >gi|167919151|ref|ZP_02506242.1| hypothetical protein BpseBC_11405 [Burkholderia pseudomallei BCC215] Length = 594 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 12 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 70 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 71 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 130 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 131 YFFLGPSRTVSATSTAKAT 149 >gi|167821376|ref|ZP_02453056.1| hypothetical protein Bpse9_40018 [Burkholderia pseudomallei 91] Length = 396 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 94/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|321460552|gb|EFX71593.1| hypothetical protein DAPPUDRAFT_255504 [Daphnia pulex] Length = 983 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 55/185 (29%), Gaps = 21/185 (11%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + +++ S I++ + + +G F+ + + Sbjct: 335 DISGSMKEFDRIGKLSESIRSWIKTD-----LRNGSHLGMVQFSATAEILSELTMISDEK 389 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + +T I ++ A + K E IVL+TDG+N Sbjct: 390 SREEMIAKLPK-QLQAATCIGCGLELAVQMLN--------ENKGTSETGGVIVLVTDGKN 440 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS---THE 436 + + IR++TIAF + + + Sbjct: 441 SPGYLHISDVQEDILKAKIRVITIAFGEKADKN--LEDLARQ--TDGKSYFVKDEDGGAA 496 Query: 437 LNKIF 441 L + F Sbjct: 497 LQEAF 501 >gi|218670347|ref|ZP_03520018.1| hypothetical protein RetlG_01180 [Rhizobium etli GR56] Length = 125 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 29/64 (45%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 +TA+L PV+LG GM V V + L++AA +A + + L + S + Sbjct: 1 MTAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGTIQTSQAEAFARN 60 Query: 85 FPKQ 88 F Sbjct: 61 FVAG 64 >gi|257469960|ref|ZP_05634052.1| von Willebrand factor (vWA) type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] Length = 322 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 63/208 (30%), Gaps = 50/208 (24%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++++ +K R+G F+D + + + + Sbjct: 98 YPNRLEAAKRTLENLLQGLKG-------DRIGFIPFSDSAYIQMPLTDDYS-IGKNYINA 149 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T + A++ A + N D K I++L+DG + D Sbjct: 150 LDTNLISGGGTELYQALELAEKSFKEINSDN-----------KTIIVLSDGGDFDDKSL- 197 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ-----------------------QEKARYFLSNCA 423 K + + +I ++ + FL + Sbjct: 198 ----KFVKDNKMNVFSIGIGTDEGTIIPEYVNGKKVGFIKDQKGSAVISKLNSDFLKKLS 253 Query: 424 SP--NSFFEANS-THELNKIFRDRIGNE 448 S ++E N+ + + FRD E Sbjct: 254 SESNGKYYEVNNLKDDTSNFFRDTANLE 281 >gi|148655604|ref|YP_001275809.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567714|gb|ABQ89859.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 425 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 58/179 (32%), Gaps = 23/179 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + V++A A ++ + D + FNDR + ++ +K Sbjct: 61 RLMQVKEAAARIVDQLGPDDYFS------LVVFNDRADVVIP---AQRAIKKSDLKAAIA 111 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T + + A + R + ++LLTDG D + I Sbjct: 112 QIEAAGGTEMAQGLALALQEVQ--------RPFLTRGISR-LILLTDGRTYGDESRCVEI 162 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGN 447 + +S+GI + + + AS + + ++ K+F D + Sbjct: 163 ARRGQSRGIGLTALGIGTEWNEDLLETMT----ASENSRAQYIATAQDVVKVFADEVKR 217 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 36/410 (8%), Positives = 105/410 (25%), Gaps = 34/410 (8%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + ++ + + K + E N N N E + + + Sbjct: 166 ITGEIMPKQQAKAYFETAKEQGRKASLIEQKRPNLFTNAIANIGPNETISVTIEYQQVIR 225 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRS--------MGIKSWLIQTKAEAETVSRSYHKEH 175 + + V +P + + + W + + + Sbjct: 226 FDQQTFSVRFPMTITPRYSPRASQIPLINQTAVAKVNSNGWGESVSVLSRQIKTPDEPAN 285 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK--LSPYM 233 + I ++ S+ SE P++ + + +D P + Sbjct: 286 PIRISVELNSGFSISPEDIVSEHHPIDITPKDQQAGHYQIELTKEHIANQDFSLRWKPTV 345 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI----- 288 + ++ G ++ K + + ++ + + Sbjct: 346 GDIPSAAHFSETIGNYRYAMVMLTPPRQDDADDKSTKTPVSAREVVFLLDTSGSMAGESI 405 Query: 289 -----------DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 ++ + FND + + + + + + G T Sbjct: 406 VQAKRAVDFALTQLHPEDSVNVIEFNDAPQALWNLAMPATANNIQRARNWVASLSANGGT 465 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + A+ A ++ + + + +V +TDG + ++ I N+ Sbjct: 466 EMAPALSMALHKTNLEQQNIN---EGSPVQLRQVVFITDGSVSNEDALMSLIENQLADS- 521 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R+ TI + A +F +++ + + Sbjct: 522 -RLFTIGIGSAPNSYFMTQAAQ---AGRGTFTYIGDINQVQQKMTELFNK 567 >gi|313238855|emb|CBY13854.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 70/231 (30%), Gaps = 24/231 (10%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPL-------DPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 K + Y + + F+ SS I ++ Sbjct: 115 YVFKTANYNALTGITDTLYNKLCSSLKSDNCISSVKQDLSFIVDSSSSITISDYQKLKTW 174 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVIS---DPSFSWGVHKLIRTIVKTFAIDENEM 334 + S+I ++ DN + + + + R +P ++ ++ + Sbjct: 175 MKSIIEKLEIGDNASRVSILQFSGQSARPQGRWINPVLTFDRSTSKEAVIGAIDGMKKLN 234 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I +A+ Y + +S + ++ ++++TDG+ E + Sbjct: 235 GDTCIGEALDYFYRNMFTSQ------AGQRSDVEQRVIVMTDGKRNCPAEIAKP-AELIR 287 Query: 395 SQGIRIMTIAF----SVNKTQQEKARYFLSNCAS---PNSFFEANSTHELN 438 +Q I I + R L AS FE N+ +L Sbjct: 288 AQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQLI 338 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 47/138 (34%), Gaps = 13/138 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K+ +++ + S+ S K +N + T V+ + + F Sbjct: 374 KRFDYMKNWVKSIAASFKVGENFARFSVVQYTKTAKTVVDFQTLDLSSISQKIDSMIYFQ 433 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T +A++ A+ + S K+ ++LLTDG ++E + Sbjct: 434 GRNGRGGKTFTGNALERAHTLLKESEPGR----------KRIVLLLTDG---SSDDEYGS 480 Query: 389 ICNKAKSQGIRIMTIAFS 406 + + + I + Sbjct: 481 VAKAIRDDKVDIFAVGVG 498 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 27/212 (12%), Positives = 51/212 (24%), Gaps = 46/212 (21%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKH-LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + ++ + + + D Sbjct: 36 NSAAINGGSSTTAPSDAYDLDVDLVPYNDDTRWRPMWNDVSYYPDWSWSYGVGRQPVAYC 95 Query: 309 DPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN------------- 354 + R+ ++ G T + M + ++ Sbjct: 96 PTEAKRLQNYHNNRSGFVSYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVT 155 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE-----------------------NTQDNEEGIA--- 388 + + KKY++ +TDG+ T DN +A Sbjct: 156 DPDNPAKIRGFSVKKYMIFMTDGDMSPTWNDYSAYGIEYLDGRVNGSPTTDNAALLARHL 215 Query: 389 -----ICNKAKSQGIRIMTIAFSVNKTQQEKA 415 CN AK++GI I IAFS T Sbjct: 216 QRFRMACNAAKAKGIDIWVIAFSTTLTADMTN 247 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 19/217 (8%), Positives = 53/217 (24%), Gaps = 4/217 (1%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + + + G + AL++PV+ + ++ +Y L+ A + AS Sbjct: 246 TNLITRFLTATRGATVLEFALILPVLCALLAGGFELGYRAYVNAILQGAL----LEASRQ 301 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + K I++ + NF++ + + S Sbjct: 302 ATVGDRSGAQIDKTITDRMATLSGSISIQSIKKESFYNFSNVGKPEKLTFDRNGDGAYDS 361 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + + + + ++ S Q + + Sbjct: 362 TQDCYEDANNNGAYDVKTNSGIGTADDIVRYTVSLQYPSIMPVGSLFGWGSQQEITTSTV 421 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + V+ S+ + Sbjct: 422 LRNQPFTSRAMPTIAATSADNAARVRDSISEADVNEV 458 >gi|296225305|ref|XP_002758277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Callithrix jacchus] Length = 1251 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 83/271 (30%), Gaps = 23/271 (8%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V+ + + + LN + +++ + + Sbjct: 171 VNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYY 230 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DN 290 + + L ++ +S + V +++ + + K D Sbjct: 231 PATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDT 290 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYD 348 ++D + FN++ F+ V + + K G+T + A+D Sbjct: 291 LSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFD 350 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFS 406 + +SN K I++ TDG + + K ++ +R+ T FS Sbjct: 351 QLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FS 397 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 V + + CA+ +FE S + Sbjct: 398 VGQHNYDVTPLQWMACANKGYYFEIPSIGAI 428 >gi|225030986|gb|ACN79500.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Nilaparvata lugens] Length = 315 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 43/136 (31%), Gaps = 7/136 (5%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 K F ST I D ++ + E + ++ + LTDGE D Sbjct: 152 TAKEFVELIKPTTSTNIIDGLRKGLKLV---KEGKETLDTTKEPSQPIMFFLTDGEPNVD 208 Query: 383 NEEGIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + + I N+ S I ++AF + +N + +E + Sbjct: 209 LTDPVEIVNETSSLNEQLKTPIYSLAFGQGADITFLKKLSKANHGFARNIYEGSDATLQL 268 Query: 439 KIFRDRIGNEIFERVI 454 F I + + V Sbjct: 269 NNFYKEISSPLLANVT 284 >gi|330688333|gb|AEC32937.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 482 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 39/134 (29%), Gaps = 12/134 (8%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGS 336 + I + DT+ + F + G R + + G+ Sbjct: 1 MLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKRQALSKVTELRKSYSPYGT 60 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + A+ + N +A + ++L+TDG + NK K + Sbjct: 61 TNMTAALDEVQKHLNDRV--------NREKAIQLVILMTDGIPNSKYTALE-VANKLKQR 111 Query: 397 GIRIMTIAFSVNKT 410 + + I Sbjct: 112 NVNLAVIGIGQGIN 125 >gi|47214619|emb|CAG01460.1| unnamed protein product [Tetraodon nigroviridis] Length = 1723 Score = 50.4 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 54/177 (30%), Gaps = 24/177 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D D V++ ++ ++ H + +V GST AM Sbjct: 465 VNTFDIGPDKVQISLVQYSRDPHTEFYL--DSHHNLEAVVTALRTFPYRGGSTNTGRAMT 522 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 +T+ ++ + +L+TDG+++ +E ++ + I + Sbjct: 523 YVRETVFQASRGAR------AHVPRVTILITDGKSSDAFQEP---AANLRNSDVEIFAVG 573 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 ++ R L A+ + F+ +RI + Sbjct: 574 V------KDAVRSELEAIANAPAETHVYTVED----FDAFQRISTELTQSICLRIEQ 620 >gi|149277251|ref|ZP_01883393.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] gi|149232128|gb|EDM37505.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] Length = 629 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 50/447 (11%), Positives = 123/447 (27%), Gaps = 48/447 (10%) Query: 28 LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPK 87 L + + L G D R K A++ + + + + Sbjct: 6 LSLFIALIAFGFTADTARTITGTVTDKNT--GAVLPGVAVRAKGTKTGVTTDGKGRYSIR 63 Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDT---------AVEMNPRKSAYQVVLSSRYD 138 + + ++ + + +V V + + ++ + Sbjct: 64 VQAQVSVLTFSSIGFETLNLPIGKKTVVNAAMEQSQQALNEVAVVGHAAQKRMETTGSVM 123 Query: 139 LLLNPLSLFL------------RSMGIKSWLIQTKAEAETVSRSYHKEHG-VSIQWVIDF 185 ++ F+ GI + + S + S + Sbjct: 124 VMAPSRGSFVHKNIVYRPQETESYAGINENRFHDPLKDPLSTFSIDVDAASYSNVRRMIN 183 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + + + + +N F + N I + L Sbjct: 184 NGGLPEKDAVRIEEMINYFDYDYPQP-AGDDPVNIITEIAAAPWNKKHKLVQIGLQGKTI 242 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 PS + +D S + K L+ + + ++K D R+ + Sbjct: 243 STAKLPSSNLVFLIDVSGSMNDSNKLPLLVSSFKLLTDQLRKTD------RVAIVVYAGN 296 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + G +T +K + GSTA ++ AY+ + NN Sbjct: 297 SGLVLPSTSG---DQKTTIKDALNKLSAGGSTAGGAGIRLAYEV----AAKNYIKGGNNR 349 Query: 366 EAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++L TDG+ +E+ + + + G+ + + F + + K Sbjct: 350 -----VILATDGDFNVGASSDEDMEKLIEEKRKSGVFLTVLGFGMGNLKDSKMEVLADK- 403 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ ++ +E K+ + G + Sbjct: 404 -GNGNYAYIDNINEARKVLVNEFGGTL 429 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 52/167 (31%), Gaps = 26/167 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ R + G T+I +++A+ I Sbjct: 343 WVGMVAFDSAAYVKSELVQINSAAERDALARSLPT-AASGGTSICSGLRSAFTVI----- 396 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C + K G I T+A + ++ + Sbjct: 397 -----KKKYPTDGSEIVLLTDGEDN-----TISACFPEVKQNGAIIHTVALGPSAAKELE 446 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDRI--GNEIFERVIRI 456 + + A+ E L F +R I++ Sbjct: 447 ELSQM----TGGLQTYASDQAENNGLIDAFGALSSGNRAASQRSIQL 489 >gi|26350291|dbj|BAC38785.1| unnamed protein product [Mus musculus] Length = 810 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + +DSS + V++ + +++ + D D R+G + Sbjct: 49 SSCENKRADLVFIIDSSRSVNTY-DYAKVKEFILDILQFL---DIGPDVTRVGLLQYGST 104 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V ++ S K + + + T A+Q A + S E Sbjct: 105 VKNEFSLK--TFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARP---LRE 159 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + I+++TDG + + + KA++ GI I I + + S Sbjct: 160 NVPRIIMIVTDG---RPQDSVAEVAAKARNTGILIFAIGVG------QVDLNTVKAIGSE 210 Query: 426 ---NSFFEANS---THELNKIFRD 443 + F + L +F++ Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 >gi|115358179|ref|YP_775317.1| hypothetical protein Bamb_3429 [Burkholderia ambifaria AMMD] gi|115283467|gb|ABI88983.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 346 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/282 (9%), Positives = 76/282 (26%), Gaps = 6/282 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEE 74 S G + + + +L G + +D+ + + L+ AA A + A+ ++ Sbjct: 14 SERGSVALFFLMFLIPLLSFGALAIDIAWVATVRNQLQNAADAAALAAADAMMSPTGGPL 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S+A + + + + + V ++ Sbjct: 74 NWSQAAPAANGVIARNSAAGAALATGTVTTGYWNVTRNPASMQPTTITPGAYDVPAVQVT 133 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM-LDYQ 193 +N S+ L G+ + + + + I + + Y Sbjct: 134 VSRAPGVNGGSIPLLLGGLLGVPGTSGSATAIAVLAAPGGVAAGGLFPIAIDQCVYNQYW 193 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YMLYPGPLD 250 + QPL + Q + + + + + M+ P Sbjct: 194 NAATNQPLINPLTGQPYEFSITNGQTYGALCMGGQWTSFQSTATDTTTMGGLMVTGNPTT 253 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ + + + + + + V+ +V Sbjct: 254 LNIGDTINLATGVKATLYTSVPVGATVVLPVVTQTSSSTSVP 295 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 59/195 (30%), Gaps = 26/195 (13%) Query: 268 IKKKHLVRDALASVIR------SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + A V+ + D L R Sbjct: 58 KSRMSAAKQAFNEVLDAVPEEVRLGIRTLGADYPGQDRKQGCKDTRQLYPVGP----LDR 113 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 T KT G T I A+ A + +A + IVL+TDGE+T Sbjct: 114 TEAKTAVASLAPTGWTPIGPALLGAAKDL------------EGGDATRRIVLITDGEDTC 161 Query: 382 DNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + ++GI I T+ + +++ A+ ++ + T +L++ Sbjct: 162 APLDPCQVAREIAAKGIHLVIDTLGLVPDAKTRQQLTCIAE--ATGGTYTSVHRTEDLSR 219 Query: 440 IFRDRIGNEIFERVI 454 R + V Sbjct: 220 RVRQLVDRAADPVVT 234 >gi|297668115|ref|XP_002812300.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-3-like [Pongo abelii] Length = 493 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 54/168 (32%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I +D R+ + V + + +K T A+ Sbjct: 117 IDTLDIGPADTRVAVVNYASTVKIEFQLQTYT---DKQSLKQAVGRITPLSTGTMSGLAI 173 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 QTA D I + E + K +++TDG + ++ + +A++ GI + + Sbjct: 174 QTAMDEIFTV---EAGARGPSSNIPKVAIIVTDG---RPQDQVNEVAARAQASGIELYAV 227 Query: 404 AFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 228 GV----DRADMES--LKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 269 >gi|307109844|gb|EFN58081.1| hypothetical protein CHLNCDRAFT_142385 [Chlorella variabilis] Length = 654 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 42/422 (9%), Positives = 118/422 (27%), Gaps = 48/422 (11%) Query: 36 VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI 95 V + VD R A ++ + + + T + + Sbjct: 239 VLALGVDYSRLVEPGSN-ATGIDPAALSGPLGWLSGSIGAIEVHSRALTAAEIAAKYEGQ 297 Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL--------LLNPLSLF 147 + + + + + + ++ ++ V + Sbjct: 298 SSRYSESLCSLESPKPPSTLDSSTAAVSGSGASGSVANTLTMTGGEELSGLQTDPVKQEQ 357 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHK-------------EHGVSIQWVIDFSRSMLDYQR 194 L+ + S + + + + + V +++ + + + Sbjct: 358 LKQAILASLNLPPGYTTDNIQINVVSVTQEGGRRRLQAPQTKVVVEYTLTATPEVQVEVL 417 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + F + + +S + I D + + + + + ++ + Sbjct: 418 AKAEASVYLFIRVQQPAIPWWSGPTATICISDSEGACNVNTPPITPPVVIQDCSANVCFL 477 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +++ R + S+ D +F++ V + Sbjct: 478 LDGSKSLTNVDGWNDVVASARSIMYSLNDPAAVFD---------VFWFSNDVEKIGHATG 528 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + V + T + A+ T ++ + +A + IVL+ Sbjct: 529 AEVAANNSFVSMVVNTTPDAHGTWMAQAITTCQGVLLEE----------DTQASRTIVLI 578 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 TDG+ T D + + AK++GI+++ + L AS F AN+ Sbjct: 579 TDGKPT-DPQPTFEAADAAKARGIKMVVVGAGEIDYAT------LKALASGPQFVFANTN 631 Query: 435 HE 436 + Sbjct: 632 LD 633 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 65/271 (23%), Gaps = 25/271 (9%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 LR G Y+ + ++ + S + + Sbjct: 557 LRPTGCGEQYAINNGNYARCRTEYNSDPYINNVLNCNPSSRIAAVANLAFSAGATSSRSF 616 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY------PGPLDPSLSEEHFVDS 261 T + + Q G + N L +D Sbjct: 617 GFHTCGTNNCQRGIRESNLGLYNTQGAFPNGDRIVYTACRGSAGSSSCAAPLDLFFLLDG 676 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + + + A+ V D R+G ++DR + G Sbjct: 677 SGSVNAANFVKVKQFAVNVV----NTFDVSLTATRVGVVQYSDRNTLVFNL--GNKVNKP 730 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + V + G T A+Q + K I++LTDG+++ Sbjct: 731 STVSAINNIVYQSGGTNTGAALQYVRQ----------YAAWRGGNVPKVIIVLTDGKSSD 780 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + G+ + I Q Sbjct: 781 SVSGPS---QNLVAAGVEVYAIGVGSFDHGQ 808 >gi|308493174|ref|XP_003108777.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] gi|308248517|gb|EFO92469.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] Length = 425 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 56/178 (31%), Gaps = 10/178 (5%) Query: 269 KKKHLVRDALASVIRSIKKI-DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V ++ + + + ++ A F + F+ + +++ Sbjct: 248 TTTDPVYNSYKDLSKRLVSQLKIGPHYTQVAAVTFATVGRTRVRFNLKKYTTQEEVLRGI 307 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ G+TAI ++ A I S K +++ TDG + + + Sbjct: 308 DKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT------KVMIVFTDGWSNKG-PDPE 360 Query: 388 AICNKAKSQGIRIMTIAFSVN-KTQQEKARYFLSNC-ASPNSFFEANSTHELNKIFRD 443 A + G + T+A++ LS S F + L + Sbjct: 361 KRAKDAVNAGFEMYTVAYTARAPNSVTLNNETLSAISGSSGHAFTDVTFQSLVDKIKQ 418 >gi|301609300|ref|XP_002934201.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 919 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 39/260 (15%), Positives = 74/260 (28%), Gaps = 29/260 (11%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + +R S S+ + + D K +P N + Sbjct: 236 LPSVTEFCDKNHNTEAPTLQNRICNSRSTWDVIMNSTDIKATPPQADSNLPVPTFTLLQS 295 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D ++ +D S + + A + I V + +G F+ Sbjct: 296 SDRVVTLV--LDVSGSMASDGRIGRLYQAAEVFVMQI-----VEEGSHVGIVSFSTSTTV 348 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + R +K F + + +G T I ++ + + Sbjct: 349 LSKLVQVIDDTQRNHLK-FLLPKTAVGGTNICAGIREGIKV---------NNQHDGSSYG 398 Query: 369 KYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 IVLLTDGE+ + ++C + GI + IA N + + + Sbjct: 399 TEIVLLTDGEDNYN----TSLCFPDISNSGIIVHFIALGPNPNPNLETIVDM----TGGL 450 Query: 428 FFEAN---STHELNKIFRDR 444 F A L F Sbjct: 451 RFLATDKVDAQGLIDAFSSL 470 >gi|167902891|ref|ZP_02490096.1| hypothetical protein BpseN_11585 [Burkholderia pseudomallei NCTC 13177] Length = 593 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 11 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 69 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 70 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 129 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 130 YFFLGPSRTVSATSTAKAT 148 >gi|15827966|ref|NP_302229.1| hypothetical protein ML1808 [Mycobacterium leprae TN] gi|221230443|ref|YP_002503859.1| hypothetical protein MLBr_01808 [Mycobacterium leprae Br4923] gi|81536900|sp|Q9CBL9|Y1808_MYCLE RecName: Full=UPF0353 protein ML1808 gi|254800638|sp|B8ZS82|Y1808_MYCLB RecName: Full=UPF0353 protein MLBr01808 gi|13093519|emb|CAC30761.1| possible membrane protein [Mycobacterium leprae] gi|219933550|emb|CAR71903.1| possible membrane protein [Mycobacterium leprae Br4923] Length = 335 Score = 50.0 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 44/155 (28%), Gaps = 19/155 (12%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + R K TA +A+ TA I + + Sbjct: 143 AYAGTATVLVSPTTNRYATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGEM 199 Query: 366 EAKKYIVLLTDGENTQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQE------- 413 IVL +DG+ T AK QG+ I TI+F E Sbjct: 200 PPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTVYGFVEINGQRQP 259 Query: 414 --KARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A S + + A + EL ++ Sbjct: 260 VPVDDETMKKVAQLSGGNSYNAATLAELKAVYASL 294 >gi|260797332|ref|XP_002593657.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] gi|229278884|gb|EEN49668.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] Length = 949 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 52/165 (31%), Gaps = 23/165 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D D R+G ++D + + GVH + + + + G TA A++ Sbjct: 55 VNAFDIGTDLTRVGVVQYSDTPTMEFNL--GVHADKGSTIAAVNNIQYQNGGTATGAALE 112 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + K ++++TDG++ D A +G+ + I Sbjct: 113 FA----------RANANWRGAPVPKVMIVVTDGKSGDDVT---AAAQALAGEGVAVYAIG 159 Query: 405 FSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGN 447 L A+ N+ E + L G Sbjct: 160 VG------NYDLPELQQIANGNNNNVIELQDYNALTAAIDQIAGQ 198 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 43/150 (28%), Gaps = 22/150 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++ V + S H ++ G T A+ D Sbjct: 723 VIQYSSTVQEEFSL--NAHFTKTAVLNAIDNIVYMGGGTLTGAAITYMKD---------- 770 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + K +++TDG+++ D + A+ GI + I N + Sbjct: 771 -NSQWRPGVAKIAIVVTDGKSSDDVGPPSS---AAQQTGITMHAIGVGANVD-----QTE 821 Query: 419 LSNCASPNSF-FEANSTHELNKIFRDRIGN 447 LS AS + + L+ + Sbjct: 822 LSQIASTSQYVTTVADYDALDAQMAQLTAS 851 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 29/263 (11%), Positives = 61/263 (23%), Gaps = 26/263 (9%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + GQ C ++ + PG Sbjct: 256 GQCDSINNGQAYQCSCPTGFVGANCQTANYCVNHQCQNGAACVNQPGQNGYTCNCLPGWE 315 Query: 250 DPSLSEEHFVDS----SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P ++ + + + N ++G ++ Sbjct: 316 GPHCETPSCQNTVSLCGNSPGWPDVGLCSESYVLAACPEFCGTCTFNCNKQIGVIQYSST 375 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + S H ++ G T A+ D + Sbjct: 376 VQEEFSL--NAHFTKTAVLNAIDNIVYMGGGTLTGTAITYMKD-----------NSQWRP 422 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K +++TDG+++ D + + GI + I N + LS AS Sbjct: 423 NVAKIAIVVTDGKSSDDVAAPSSAAQQ---AGITMHAIGVGANVD-----QTELSQIAST 474 Query: 426 NSF-FEANSTHELNKIFRDRIGN 447 + + L+ + Sbjct: 475 SQYVTNVADYDALDAQMAQLTAS 497 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 41/125 (32%), Gaps = 24/125 (19%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + + L GST + + + TAY + Sbjct: 129 LIEFGSKAKIISGLTQNAKHL-----YKAINRLKTNGSTNMTEGLTTAYLKL-------- 175 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 N++ ++I+LLTDG I + + GI ++TI + + + Sbjct: 176 ----KNVDDPRFIILLTDGLPNHPK-NTQQIAQEICADGIELITIGTG------DADKTY 224 Query: 419 LSNCA 423 L + A Sbjct: 225 LQSLA 229 >gi|330688335|gb|AEC32938.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|330688337|gb|AEC32939.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 481 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 37/129 (28%), Gaps = 12/129 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA---IDENEMGSTAIND 341 I + DT+ + F + G R + + G+T + Sbjct: 5 INSLSLSRDTINLYMNLFGNYTTELIRLGSGQSIDKRQALSKVTELRKSYSPYGTTNMTA 64 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ + N +A + ++L+TDG + K K + + + Sbjct: 65 ALDEVQKHLNDRV--------NREKAIQLVILMTDGIPNSKYTALE-VAKKLKQRNVSLA 115 Query: 402 TIAFSVNKT 410 I Sbjct: 116 VIGIGQGIN 124 >gi|302381763|ref|YP_003817586.1| TadE family protein [Brevundimonas subvibrioides ATCC 15264] gi|302192391|gb|ADK99962.1| TadE family protein [Brevundimonas subvibrioides ATCC 15264] Length = 181 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 36/131 (27%), Gaps = 1/131 (0%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + ++ + +G + ALL PVM+ + + + + + A + Sbjct: 1 MIARARVGLRRFWRDESGVSAVEFALLAPVMIALYFGSAEFCQGFMAQKRMDHATSQ-VA 59 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + ++ + + +N + D R + + Sbjct: 60 DITSQDGVVTRDELDDTLAVAQLIMSPFPTTPLKMRVSGVTRNASGVAKIDWSRGSGMTA 119 Query: 123 NPRKSAYQVVL 133 + V Sbjct: 120 LGTGAVVTVPA 130 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 68/222 (30%), Gaps = 18/222 (8%) Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 T + + ++S + S D +D+S Sbjct: 1 MATHVASRLAALALLAVFLRVSAQNAAQRASSLQNQVKKYQDSRADIVFLLDNSGSVGRY 60 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +++ + + R+ F+D + + + + Sbjct: 61 NFEEVEIAFVENLLSQLTISPQAS---RVAVVSFDDVART--HIDYIKYPKNKCSFLREL 115 Query: 329 IDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +G T DA + A + + + KN K+ ++LLTDG T+ + + Sbjct: 116 KTVKYIGEWTNTEDAFRLAQELLRPPSAF-----KNERPVKQVVILLTDGRPTRG-GDPV 169 Query: 388 AICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 N KS I +I N + L +CA+ Sbjct: 170 KRANNLKSVYNAEIFSIGIGGNLN-----KQQLEDCATDAQH 206 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 36/275 (13%), Positives = 87/275 (31%), Gaps = 26/275 (9%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V + S S G + F K ++ + R + Sbjct: 618 VEKTASFITNTSDPLNNTGSPGTAVAYFDSIIYGKSKVEATLVNPIDPRYATIL------ 671 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 D + +S SL + ++ +V + + + + + Sbjct: 672 ---KGLKDKTVSKDIFTNPYFSKNSCSLATEVAYVVDYSSSMKAVDPTNYRGKKMIEFIN 728 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI-DENEMGSTAINDAMQTAYDTIISSNE 355 N + ++ + ++K E G+T I + A Sbjct: 729 QLKAKNNIVIETNTKATILGEGTTDAVLKKDLYKASKEKGATDIFAGIDIALTKFS---- 784 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 N+ + K IV+++DG+ + + N+AK QG++I T++ + Sbjct: 785 -------NDTKTAKAIVVVSDGK--TSKSKMTKVINEAKKQGVKIYTVSMGKKSQIND-- 833 Query: 416 RYFLS-NCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + + +++ A +L+++F+ I + Sbjct: 834 ATLMQVSTETGGAYYYALDNLQLHQVFQKLIDAIL 868 >gi|254436533|ref|ZP_05050027.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] gi|198251979|gb|EDY76293.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] Length = 613 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/391 (9%), Positives = 96/391 (24%), Gaps = 26/391 (6%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + A A + + A +E + R Sbjct: 55 SDSVSVAARQADEGEMSDPIMEMAPAVVMEETATHIRDEDATLVPQAAPAPQQLTRAAPA 114 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + + + + D ++ ++ + S Sbjct: 115 GNDMAGNLRSMAEPSNDGFVHVLRDGSTFYEEYDETFANDTPNPLKITSDEPVSTFSIDV 174 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + +R L + + + A + + + + + + Sbjct: 175 DTAAYALIRSSLTRGQLPPTDAVRIEEMINYFPYAYPAPEGEAPFRPTINVFETPWNADT 234 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + + + P L+ +D+S K L+R + ++ +++ D V Sbjct: 235 QLVHIGIQGEMPAIEDRPPLNLVFLIDTSGSMESADKLPLLRQSFRLMLDNLRPEDEVA- 293 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + + R + N GST ++ AY Sbjct: 294 -----IVTYAGSTSIALEPTQ---ASERATIIAALNALNAGGSTNGQGGLEQAYAL---- 341 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ--------GIRIMTIAF 405 MK + + ++L TDG+ + + + + G + + F Sbjct: 342 ----AETMKTAGDVSR-VILATDGDFNVGLSDPRGLQAYIEDKRDDAQQTGGTYLSVLGF 396 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 Q + N ++ + S + Sbjct: 397 GRGNLQDATMQSLAQNGNGTAAYIDTLSEAQ 427 >gi|311253076|ref|XP_003125384.1| PREDICTED: matrilin-3-like [Sus scrofa] Length = 488 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 54/168 (32%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I +D + R+ + V + + +K T A+ Sbjct: 112 IDNLDIGAEDTRVAVVNYASTVKIEFHLQT---HSDKQALKRAVARIAPLSTGTMSGLAI 168 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 QTA D + E N K +++TDG + ++ + +A++ GI + + Sbjct: 169 QTAMDEAFTV---EAGARGPNSNIPKVAIIVTDG---RPQDQVNEVAARARASGIELYAV 222 Query: 404 AFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 223 GV----DRADMES--LRLMASEPLDEHVFYVETYGVIEKLSSRFQETF 264 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 44/148 (29%), Gaps = 1/148 (0%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G I AL++ +++ + G+ VDV E +A A + + Sbjct: 17 KHGQSIPIIALMILILVAMVGLSVDVGNTFSKERQAVASANAASLAGMSAYMARSSSTLD 76 Query: 78 RA-KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + T Q + N ++ + + + + +A R Sbjct: 77 TTIYQAITASLQSNGLVVGDGTNNTVEVTANYLDSQGNLLAGHPVVGSGGTAPNGAAYIR 136 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 L + F R +G I A A Sbjct: 137 VQLSGMVNTSFARVVGRDDLPINADAHA 164 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 33/260 (12%), Positives = 70/260 (26%), Gaps = 15/260 (5%) Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 R N S+ + + G + + + + V Sbjct: 321 RMVGSGNNASVRIVNFGSFIIVASGRDKNRPYFDMIYLGPPTRQYNVCSQMPPPPAETNL 380 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + F S V + + + PS Sbjct: 381 LDLAGNVSFYPEYQIIPTSQKPIQYVVVLDASGSMSANFDGQCNNSGGVKQCANGPSGFP 440 Query: 256 EHFVDSSSLRHVIK-----KKHLVRDALASVIRSIK-----KIDNVNDTVRMGATFFNDR 305 + V ++ + + ++ + AL ++ N + +M +FND Sbjct: 441 DVQVSNTGYDYWWTTESQRRIYVAKKALERLVTLSNMPGNPGYTNTRPSDQMAVVWFNDG 500 Query: 306 VISDPS--FSWGVHKLIRTI--VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 V S + F+ L I + + G T + A + + V Sbjct: 501 VSSSQTQAFTNNPTTLKNYITTLNNVNGNYRSAGGTNGAGGLYRA-SLLYQNAPKTVSFN 559 Query: 362 KNNLEAKKYIVLLTDGENTQ 381 N+E K+ ++ +TDG + Sbjct: 560 GTNVEYKRVVLFVTDGVSNY 579 >gi|325673441|ref|ZP_08153132.1| hypothetical protein HMPREF0724_10914 [Rhodococcus equi ATCC 33707] gi|325555462|gb|EGD25133.1| hypothetical protein HMPREF0724_10914 [Rhodococcus equi ATCC 33707] Length = 321 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 20/47 (42%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 ++ G + A+LM V+LG + VD+ + L+ A Sbjct: 6 SRMSNDERGVVAVFVAILMVVLLGCAAISVDIGANYVVKRQLQNGAD 52 >gi|291228410|ref|XP_002734180.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 945 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 14/200 (7%) Query: 233 MVSCNKSLYYM--LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 +Y+ P P + + + ++L ++ DA +V+ ++ D Sbjct: 167 NYDHRFRPWYVEAATPEPKNVVIVIDTSGSMANLHSGKSLINIAIDAAITVLDTMNPNDK 226 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAY 347 V ++ ++ + +K F + G T A A+ Sbjct: 227 VGVIAFSDELKLPPKIGDASCYANELALATTINIQNLKQFVLSLVARGGTHYGKAFDAAF 286 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG------IRIM 401 + + S + + +E + I+ LTDGE D +I K +S + I+ Sbjct: 287 NLLKESYTLDADNERGKIERDQVIIFLTDGEPLDDKT---SIMRKIRSNNEEMENKVTIL 343 Query: 402 TIAFSVNKTQQEKARYFLSN 421 T ++ + N Sbjct: 344 TFGLGLDSGINFLEDIAVQN 363 >gi|332798629|ref|YP_004460128.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] gi|332696364|gb|AEE90821.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] Length = 185 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 12/140 (8%), Positives = 40/140 (28%), Gaps = 8/140 (5%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-----QTAIIT 63 +KK ++ G+ +I A M ++ G+ +DV ++++ + Sbjct: 3 KMAKKFLQKEDGNILLIFAGSMVLIAFFIGICLDVSMIYVKRNSMQNILQIIREERFTYQ 62 Query: 64 ASVPLIQSLEEVS---SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ + + + + + + Y +++ R + Sbjct: 63 DTIRYSDNPALTTYHIAYSAAAENGFDGIVTVYFHEEDPEPNYRSYQVRILLTDECPFYF 122 Query: 121 EMNPRKSAYQVVLSSRYDLL 140 + +S Sbjct: 123 GRIFGLDTVPLNVSLDGGES 142 >gi|317419404|emb|CBN81441.1| von Willebrand factor A domain-containing protein 2 [Dicentrarchus labrax] Length = 761 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 47/158 (29%), Gaps = 17/158 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D +M + R + + H I+K G + A+ + Sbjct: 531 FDINRDVAQMALVAYGRRATTVFNL--DTHDTGSAILKAVGDANYMGGVASTGTALLHVH 588 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 I++ + + K +V++TDG D K + G+ + I Sbjct: 589 SDILTVAKGARPGVN------KAVVVVTDGSGGDDAAVP---AQKLRDNGVSLFVIGIG- 638 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L S S +L K F D + Sbjct: 639 ----DIQKERLLQIAGSEEHMISVLSYEDL-KYFEDVL 671 >gi|326677363|ref|XP_691404.5| PREDICTED: collagen alpha-1(XII) chain-like [Danio rerio] Length = 3085 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 19/176 (10%), Positives = 55/176 (31%), Gaps = 20/176 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D D R+G ++ ++ + + ++ +++ + G+T +A+ Sbjct: 146 AGAFDLGEDKTRVGVVQYSTDTRTEFNLNQHFRRVD--LLRAINNLPYKGGNTMTGEALD 203 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + + + V++TDG++ K K+ G+ + T+ Sbjct: 204 YLLKNMFTEA------AGARKGFPRVAVVITDGKSQDP---VEGYAKKLKNAGVELFTLG 254 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +E L +S + + + + + I ++ Sbjct: 255 I------KEADEEELKQMSSTPYRTHVYTVPNFDMIKAVEKSFIAQVCSSVDDQLN 304 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A + R + D D V++G ++ ++ H +++ A + Sbjct: 1197 FKTIRAFIGRMVGVFDIGPDKVQIGLAQYSGDPKTEWHL--NAHPTRASLLDAVANLPYK 1254 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T A+ + + +++K VL+TDG+ +QD + + Sbjct: 1255 GGNTMTGMALNYILQ------NNFRPNVGMRPDSRKIGVLVTDGK-SQDEIVVNS--QRL 1305 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + GI + I + L + A+ + N L I D Sbjct: 1306 RDSGIELYAIGV------KNADENELRSIATDPDEIHMYNVNDFSFLLDIVDDL 1353 >gi|317064189|ref|ZP_07928674.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313689865|gb|EFS26700.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 325 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 63/208 (30%), Gaps = 50/208 (24%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L ++++ +K R+G F+D + + + + Sbjct: 101 YPNRLEAAKRTLENLLQGLKG-------DRIGFIPFSDSAYIQMPLTDDYS-IGKNYINA 152 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G T + A++ A + N D K I++L+DG + D Sbjct: 153 LDTNLISGGGTELYQALELAEKSFKEINSDN-----------KTIIVLSDGGDFDDKSL- 200 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQ-----------------------QEKARYFLSNCA 423 K + + +I ++ + FL + Sbjct: 201 ----KFVKDNKMNVFSIGIGTDEGTIIPEYVNGKKVGFIKDQKGSAVISKLNSDFLKKLS 256 Query: 424 SP--NSFFEANS-THELNKIFRDRIGNE 448 S ++E N+ + + FRD E Sbjct: 257 SESNGKYYEVNNLKDDTSNFFRDTANLE 284 >gi|241113142|ref|YP_002972977.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861350|gb|ACS59016.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 176 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 41/139 (29%), Gaps = 10/139 (7%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M +F ++L G I A+L + + +++V + A + Sbjct: 1 MRRGKRFAS-LRRLFGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVNS----ALDAS 55 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + S + S F + + L+ ++ + Sbjct: 56 VHKISRMIRTGEVASSKITLA--DFKARICNDMLLSFSCSSGLLVKVIVLSDLSSAASTD 113 Query: 121 EMNPRKSAYQVVLSSRYDL 139 ++ + + ++ YD+ Sbjct: 114 PIDDSGN---LTVTETYDI 129 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 62/172 (36%), Gaps = 22/172 (12%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTI 323 + +++DA+ +IR ++ D R+ FNDR V + + R I Sbjct: 71 NKPPSRLDVLKDAMKFIIRKLEDGD------RLSIVAFNDRPVKEYSTGLLDISGNGRRI 124 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + G TA+ A++ A + D +R+ +I+LLTDG++T Sbjct: 125 AEKKVDWLEGRGGTALMPALEEAIRVLDCRPGDSRNRVG-------FILLLTDGDDTSGF 177 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + N A + + T + L A S ++ + Sbjct: 178 RWSRDVINGAVGK-YPVHTFGLGA-----AHSSEALLYIAQESRGTYSFVDD 223 >gi|171681714|ref|XP_001905800.1| hypothetical protein [Podospora anserina S mat+] gi|170940816|emb|CAP66465.1| unnamed protein product [Podospora anserina S mat+] Length = 648 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 46/174 (26%), Gaps = 18/174 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LVR A +++ ++ D R+G F+ ++ + + Sbjct: 105 LDLVRHAAKTILETLDDHD------RLGIVTFSTSSKVVRELTYMTP-ANKAKILKQLDA 157 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + T + ++ ++ + R +++LTDG Sbjct: 158 LQPLSMTNLWHGIRDGLSLFNNNLKAVNDRRNPGSGRVPALLVLTDGMPNHQCPN-QGYV 216 Query: 391 NKAKSQGI---RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 K + I T F L + A ++ + Sbjct: 217 AKLRQWSTLPASIHTFGFGY-----SLRSGLLKSIAEVGGGNYSFIPDAGMITT 265 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 54/201 (26%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 +S + ++A + + +G F + Sbjct: 108 SMASTDVPPNRLAAAKEAGKQFADQLTPA------INLGLVEFAANATLLVPPT-----T 156 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R VK TA + + TA I + V + IVL +DG Sbjct: 157 NRAAVKAGIDSLQPAPKTATGEGIFTALQAI--ATVGSVMGGGEGPPPAR-IVLESDGAE 213 Query: 380 TQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA-- 423 + AK++G++I TI+F + L Sbjct: 214 NVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEI 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + F A+S L ++ Sbjct: 274 TDGQAFHADSLDSLKNVYSTL 294 >gi|45384318|ref|NP_990352.1| collagen alpha-1(XII) chain precursor [Gallus gallus] gi|2506307|sp|P13944|COCA1_CHICK RecName: Full=Collagen alpha-1(XII) chain; AltName: Full=Fibrochimerin; Flags: Precursor gi|222811|dbj|BAA00701.1| alpha 1 chain of type XII collagen [Gallus gallus] gi|2326442|emb|CAA43358.1| collagen type XII alpha 1 chain [Gallus gallus] Length = 3124 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 51/150 (34%), Gaps = 22/150 (14%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 FS + ++ I++ GST AM + + +++ + + + Sbjct: 491 FSLNRYNRVKDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTSKGSRPNV------PRVM 544 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SF 428 +L+TDG+++ +E K + + I + ++ R L ASP Sbjct: 545 ILITDGKSSDAFKEPAI---KLRDADVEIFAVGV------KDAVRTELEAIASPPAETHV 595 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRITK 458 + F+ +RI + Sbjct: 596 YTVED----FDAFQRISFELTQSVCLRIEQ 621 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 40/415 (9%), Positives = 94/415 (22%), Gaps = 43/415 (10%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + EN ++ + +V + Sbjct: 977 AAYSSGEGDPVEGEAF-TDVSQSARTVTVDNETENTMRVSVAALTWEGLVLARVLPNRSG 1035 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 +++ ++ + + I + ++ + Sbjct: 1036 GRQMFGKVNASATSIVLKRLKPRTTYDLSVVPIYDFGQGKSRKAEGTTASPFKPPRNLRT 1095 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN--GKVGIRDEKLSPYMVSCNKSLYYM 243 S S + R + + +N V + + Y + Sbjct: 1096 SDSTMSSFRVTWEPAPGRVKGYKVTFHPTEDDRNLGELVVGPYDSTVVLEELRAGTTYKV 1155 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIK-----------KKHLVRDALASVIRSIKKIDNVN 292 G D S + + D + V S Sbjct: 1156 NVFGMFDGGESNPLVGQEMTTLSDTTTEPFLSRGLECRTRAEADIVLLVDGSWSIGRPNF 1215 Query: 293 DTVRMGA-------------TFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTA 338 TVR S + W ++ A+ Sbjct: 1216 KTVRNFISRIVEVFDIGPDKVQIGLAQYSGDPRTEWNLNAYRTKEALLDAVTNLPYKGGN 1275 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 M + + A+K VL+TDG+ +QD+ + + + +G+ Sbjct: 1276 TLTGMALDFIL----KNNFKQEAGLRPRARKIGVLITDGK-SQDDVVTPS--RRLRDEGV 1328 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + I + L A+ A + + + F IG ++ + Sbjct: 1329 ELYAIGI------KNADENELKQIATDPDDIHAYNVADFS--FLASIGEDVTTNL 1375 >gi|304320962|ref|YP_003854605.1| hypothetical protein PB2503_06987 [Parvularcula bermudensis HTCC2503] gi|303299864|gb|ADM09463.1| hypothetical protein PB2503_06987 [Parvularcula bermudensis HTCC2503] Length = 439 Score = 50.0 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 55/156 (35%), Gaps = 9/156 (5%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++++L G + +L+ LG+ ++VD R L+ A++ A++ Sbjct: 21 RHARRLA-HQRGSIATLLMILLVPSLGLSALIVDGSRMRNAYLELEIVAES----AALAA 75 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 Q L + + + + + ++ + F + + +A Sbjct: 76 AQQLPDTDAATSAAVDYAETNLDPTVYGEVVKASDIEFGQYD----EASSTFTSGGDATA 131 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 +V + + ++LF +G K + A A Sbjct: 132 VKVTARREAERGNSLVTLFGAVIGRKEINLSATAIA 167 >gi|296168869|ref|ZP_06850541.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896486|gb|EFG76136.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 327 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 56/173 (32%), Gaps = 28/173 (16%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F + I + K D STA +A+ TA + ++ Sbjct: 134 LVGFAGTPYLLVPPTPQHQATIDALKKLDFAD-----STATGEAIFTALHAVSATA---- 184 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGI------AICNKAKSQGIRIMTIAFSVNKTQQ 412 + IVLL+DG + + A+ +G+ I TI+F + Sbjct: 185 -ITGGDTPPPARIVLLSDGRENKPSNPSDPHDGVYTAARLARDEGVPISTISFGTKTGEI 243 Query: 413 EKA---------RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 E + A S + A + ELNK + + I +I R + Sbjct: 244 EMDGQRVAVPVSTDQMKTIAKLSGGQSYTAGNLAELNKSY-NAIEKDIGYRTV 295 >gi|146338945|ref|YP_001203993.1| hypothetical protein BRADO1888 [Bradyrhizobium sp. ORS278] gi|146191751|emb|CAL75756.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 408 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 30/320 (9%), Positives = 87/320 (27%), Gaps = 2/320 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G +I A+ + ++ G VD + S L++A +A++ V S + Sbjct: 17 FWRAQDGQISMIFAIAIIPIVIAIGTAVDFSKSSDTRTQLQKAVDSAVLAGVV--QPSGQ 74 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 +VS+ + +F N + T + M Sbjct: 75 QVSTANAIFRGDFGGRFGTAASASFTANSDGSLTGSASATVNTSFLTVMGTSSLGISASA 134 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +++ L + S + + A+ + + + F+ ++ + Sbjct: 135 TAKPGAQSKSPVCILLVSTLNSQSLLVNSGAQLNAPTCEVHVLSTQNPAALFNATLNVKR 194 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 +G + G S ++ + ++ + N +Y Sbjct: 195 ICIKGSTIIKNGGVTPPAETSCAAISDPFAGSLPTVTVGSCTTNNKVYDPGSVTLSPGVY 254 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + S + ++++ + + V + + + + Sbjct: 255 CGSTNFNGSGTLTLNPGLYIIKGTMTFNSGWTVTGNGVTFYLVDQNATLTFNGNVNATLA 314 Query: 314 WGVHKLIRTIVKTFAIDENE 333 I+ + Sbjct: 315 APTTGTYANILMYEPTGLST 334 >gi|188581386|ref|YP_001924831.1| hypothetical protein Mpop_2134 [Methylobacterium populi BJ001] gi|179344884|gb|ACB80296.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 543 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 33/137 (24%), Gaps = 12/137 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ------------TA 60 + G I+TAL + +LG +D+ R + ++ AA Sbjct: 2 RFRSDQRGAVSIVTALGLTTLLGAAAFGLDLSRLYGTQRRVQGAADLAALSAASDLSTAD 61 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + + + + R N + V T Sbjct: 62 AAARRALADNGFGDGTRIGVQAGAYLRSAAVANGSRFTPGATAPNAVRVSLTAPVPLTFG 121 Query: 121 EMNPRKSAYQVVLSSRY 137 + Y V Sbjct: 122 RYLGLPTHYDVSAVGTA 138 >gi|123446482|ref|XP_001311991.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121893822|gb|EAX99061.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 722 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 43/348 (12%), Positives = 90/348 (25%), Gaps = 32/348 (9%) Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF--LRSMGIKSWLIQTKAEA 164 ++ + + + P ++ L + +N F T + Sbjct: 98 LGEDIGNGLSSFQLGNLPAGKTAEIHLKVSFLADINENGYFYKFPLTHRYQSGAVTSNNS 157 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + + + D S+ + + + K Sbjct: 158 DKPESFHFATTIKTQRDIQDLKVSVDGTKDVKDAHNATFVTNNPPAKDAIVIETHIKDED 217 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 ++ +S Y + GP+D + VD S + + + I+S Sbjct: 218 KNVAISSDGYIA--ITTYPYFEGPIDSNSEFYFIVDCSGSMSC-SRINNAIKCMRLFIQS 274 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R F + + + + MG T I +Q Sbjct: 275 LPVG------CRFSILRFGSHFETVLPPCDYTDENVANAMNLLDNISANMGGTNILAPLQ 328 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGI-RIMT 402 D S K I LTDGE + IC K++ RI + Sbjct: 329 HVSDLQASEGF------------VKQIFFLTDGEVDNSDI----ICATALKNRSTNRIFS 372 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 I + S ++ +N+ + + + I Sbjct: 373 IGLGSGAD-PGLIKGMARK--SGGNYAIIGDNDNMNEKVIEMLSSAIS 417 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 72/235 (30%), Gaps = 27/235 (11%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR-DSEGQPLNCFGQPADRTVK 213 S+ I + I ++D S SM+D + + ++ K Sbjct: 60 SYDIGSLMGVNKKDYDDANSKDKEIVLLVDTSGSMIDRKVEGKYRINYSVDQVFIEKEWK 119 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI----- 268 Y + + G + S + ++ +++ Sbjct: 120 WYRRVSLENGQKYIIHELLYTSERDIIINGQIYTEYIEIDNKRYYLKYDYSDGSWYAIAY 179 Query: 269 --KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K ++ A + + + N +G + ++ S + + ++ +I Sbjct: 180 SESKIDELQKAAKNFVNKFETKSNTK----IGLVSYGNKGEVVHSLTNELDRINSSID-- 233 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + G+T I D ++ A + N A KYIVL+TDG T Sbjct: 234 --YGLSVYGATNIGDGIRKANGLLN-----------NGSNADKYIVLMTDGIPTA 275 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 48/437 (10%), Positives = 110/437 (25%), Gaps = 23/437 (5%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ + V + G+ + V + L Q + E + + Sbjct: 7 ISILF--MSLVFIISQGIGIKVFATENKKSNLTSQFQLKQNINAKNYTVG--ETFDISYD 62 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + ++Y N ++ D I R + S QV + + Sbjct: 63 IGSLMGVNKKDYDDANSKDKEIVLLVDTSGSMIDRKVEGKYRINYSVDQVFIEKEWKWYR 122 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + I L ++ + + Y + + + D + Sbjct: 123 RVSLENGQKYIIHELLYTSERDIIINGQIYTEYIEIDN-----KRYYLKYDYSDGSWYAI 177 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 D K+ + K + + NK LD S +D Sbjct: 178 AYSESKIDELQKAAKNFVNKFETKSNTKIGLVSYGNKGEVVHSLTNELDRINS---SIDY 234 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + ++ + D + + + + Sbjct: 235 GLSVYGATNIGDGIRKANGLLNNGSNADK---YIVLMTDGIPTAATCYSNVKSERTEHYS 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + D +G+ + +I+ + + +++ G+N Sbjct: 292 NFYRRNIYD-TPVGAILDGYGRELEKFNLITDYNYKFEYNPKDYIYSDENIIINRGDNDY 350 Query: 382 DNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 N K G+ + F + + A + EA EL Sbjct: 351 GNIALNYAKESLKRASENGVNNFVVGF---SNGISRDKLTQIADAGNGYYREAMHGDELE 407 Query: 439 KIFRDRIGNEIFERVIR 455 ++ +RI +EI V++ Sbjct: 408 DVY-NRIADEIKNPVVK 423 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 42/123 (34%), Gaps = 13/123 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + ++ D TAI+DA++ A + Sbjct: 129 DRIGIIAYAAQAYPQLPITTDF-GAAKMFLQGMNTDMLSSQGTAISDAIELATTYYNDAA 187 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + +++DGE+ + +KA GI+I TI K Sbjct: 188 Q-----------TNRVLFIVSDGEDHS-EGGAVNAVSKATEAGIKIFTIGVGTEKGAPIP 235 Query: 415 ARY 417 + Sbjct: 236 IKK 238 >gi|226196351|ref|ZP_03791933.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|254297788|ref|ZP_04965241.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157807412|gb|EDO84582.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|225931568|gb|EEH27573.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 602 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 47/169 (27%), Gaps = 5/169 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + ++ Sbjct: 20 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQ 75 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + L D + P +A QV + Sbjct: 76 CAQPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTAT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 135 QSVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 183 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 72/200 (36%), Gaps = 26/200 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 +D L + L + K + +DA++ I+ + K R+ F Sbjct: 314 YANVKTIDIVLCLDVSGSMRELSSGMPKIEIAKDAVSQYIQFLSKT-----NDRLAMVLF 368 Query: 303 NDRVISDPSFSWGVHKLIRTIV--KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 N R WG+H++ R G T + +A++ + + + Sbjct: 369 NFRADVL----WGLHQVRRYWQQMNYMLKYVYAGGGTNLANALERSREVLT--------- 415 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 + K+++ +TDG T ++ + + + G I TIA N + R LS Sbjct: 416 --RSKSNSKHVICVTDGR-TVNSSMCVKEAVRLRRNGTTISTIAIGENSDDELLMR--LS 470 Query: 421 NCASPNSFFEANSTHELNKI 440 F + +S H+L K Sbjct: 471 KI-GGGLFIKISSIHDLGKA 489 >gi|307107982|gb|EFN56223.1| hypothetical protein CHLNCDRAFT_35166 [Chlorella variabilis] Length = 329 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 32/310 (10%), Positives = 81/310 (26%), Gaps = 30/310 (9%) Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF------SRSML 190 + F ++ + T E +++ Y+ + + S + Sbjct: 26 ATGGAQDVENFRTNLEAGYLPLPTDLTYEGLAKDYYFDTSSNATKPCTKLFCPLYSVGLS 85 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV-GIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + + K + + YY Sbjct: 86 PDPLLGTPASSEFYMAVGLDSGMKAADFARKQLNLVVVLDVSGSMGSPFDSYYYDQTVQP 145 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + +E KK + ++ LA ++ ++ D+++ F+D Sbjct: 146 TAGVPDEGE--------TKKKIDVAKEVLAGIVGLLRPDDSLS------VVLFSDAACVP 191 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + + + + D EMG T + + Sbjct: 192 KPLGPVRCADVDKLKEQISADVVEMGGTNFQAGIDAGGAQLTGCAACMEANASLVENR-- 249 Query: 370 YIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +V LTD + ++ + + K + GI I ++ Q + Sbjct: 250 -VVFLTDAQPNAGDDSEQGLLARIKALSADGIYTTIIGVGLDFNTQLVESIG---KVRGS 305 Query: 427 SFFEANSTHE 436 ++F ++ E Sbjct: 306 NYFSVHTPGE 315 >gi|315498270|ref|YP_004087074.1| hypothetical protein Astex_1248 [Asticcacaulis excentricus CB 48] gi|315416282|gb|ADU12923.1| hypothetical protein Astex_1248 [Asticcacaulis excentricus CB 48] Length = 409 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 33/313 (10%), Positives = 82/313 (26%), Gaps = 10/313 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + + + G+ + AL + + G G VD+ L+ A +++ Sbjct: 10 MSARLVTRLRHYVSHKNGNTTLAFALSLVAVAGAVGGAVDMHAAYSVRAQLQDAMDASVL 69 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 +V+ + F + + T V D + Sbjct: 70 GGLRQEQVQRSDVA-----TRIFKSNLPASMSVTPKYVTDETTLTGTAVYDSPTVLLGLL 124 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS--YHKEHGVSIQ 180 V ++ + S + + A+ + + + Sbjct: 125 GISSLKINVTATAIGKAS-SQGPCIQVLDPSGSQALLVNSGAKITAPNCEIQVRSKGNPA 183 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + S+ + EG + DR S + + ++ + + Sbjct: 184 AIFNSGSSLNFKKVCIEGSTVIRNSTTVDRLSLSCPTVADQWAALIPTVASTPCTVSNGN 243 Query: 241 YYMLYPG-PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-RMG 298 Y +F +SS+ +++R +V V Sbjct: 244 YNTSNVSLTPGVYCGWFNFNNSSATVSFAPGLYVIRSGGWNVNGGTWTGKGVTFYFEDTS 303 Query: 299 ATFFNDRVISDPS 311 FN + +D + Sbjct: 304 KIQFNSGISADLT 316 >gi|328947150|ref|YP_004364487.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447474|gb|AEB13190.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 333 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 41/135 (30%), Gaps = 31/135 (22%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 TAI + +A + S KK IVL+TDGEN A Sbjct: 168 GDGTAIGTGLSSAIYHLEKS-----------KSPKKSIVLITDGENNSGAVHPHTAARLA 216 Query: 394 KSQGIRIMTIAFS----VNKTQQEK--------------ARYFLSNCASPNS--FFEANS 433 ++ I + + V + ++ AS + FFE S Sbjct: 217 VNKDISLYILGVGTRGVVPIDYVDPKSNKIYSGYLESKFDTSSIARIASEGNGKFFEIES 276 Query: 434 THELNKIFRDRIGNE 448 L++ +E Sbjct: 277 ISALSQAISSISKSE 291 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 28/356 (7%), Positives = 86/356 (24%), Gaps = 36/356 (10%) Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH------- 172 + + V ++ ++ + ++ A+ + S + Sbjct: 24 TMVLLSINTSFVSAEGNSSSSEKTITNSIQIDNMNEGEVRVFKTAKPIPNSINRWEISID 83 Query: 173 -----KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF------GQPADRTVKSYSSQNGK 221 K I V+D S SM + ++ + + Sbjct: 84 VFGRLKREPSDIVLVLDTSGSMDPQKNPQGIDRISKAKREAIHFVNEIFERDASARVALV 143 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + + +L + + + + + + Sbjct: 144 SYGTKVSSNSFHTKQESNLLINEIKSLKAEGGTFTQGALYEAKMLLNQSSAPNKTIVLLS 203 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRV-----ISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + V ++ V + + + + + Sbjct: 204 DGQPTYRYPLKAKVNQDLLRYDGNVIVQKRYNGQQRPFDIGITSSSNQAIPGYRFKSRPN 263 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + + + + + Y V ++ + N+ + Sbjct: 264 TNVF-DYNAMVYGTGNEYYLDELGELRSQGNQNYFVYMSSADAAIIES------NQIHQE 316 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-LNKIFRDRIGNEIFE 451 I + I F + + + S N++++A+S+ + L+ IF+ N Sbjct: 317 QIHLYAIGFDTDARGTDILKRI-----SNNNYYDASSSRDNLDDIFKKISNNIYSA 367 >gi|167824377|ref|ZP_02455848.1| hypothetical protein Bpseu9_11953 [Burkholderia pseudomallei 9] Length = 589 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 47/169 (27%), Gaps = 5/169 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + ++ Sbjct: 7 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQ 62 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + L D + P +A QV + Sbjct: 63 CAQPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTAT 121 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 122 QSVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 170 >gi|167816001|ref|ZP_02447681.1| hypothetical protein Bpse9_12727 [Burkholderia pseudomallei 91] Length = 588 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 6 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 64 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 65 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 124 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 125 YFFLGPSRTVSATSTAKAT 143 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 50.0 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 49/183 (26%), Gaps = 28/183 (15%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIV 324 + A+ + + K G TFF D + + V + Sbjct: 114 GDGNRYDASMKAIDTFLDFRKG-------DAFGLTFFGDAFVHWVPLTTDVTAIRCSPPF 166 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G TAI A+ + E K IVL+TDG + Sbjct: 167 MRPETVPPPFGGTAIAKALNGCKTELR-----------RRDEGDKMIVLITDGFSYDLTG 215 Query: 385 EGIAICNKAKSQGIRIMTI---AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I ++G+ + I F R + F A+ L +F Sbjct: 216 NDEEIARTLSAEGVAVFCIIVGGFEPQAEIVNICRL------TGGEAFRADDPDALPAVF 269 Query: 442 RDR 444 + Sbjct: 270 KKI 272 >gi|157819015|ref|NP_001100328.1| procollagen, type VII, alpha 1 [Rattus norvegicus] gi|149018489|gb|EDL77130.1| procollagen, type VII, alpha 1 (predicted) [Rattus norvegicus] Length = 2588 Score = 50.0 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 40/146 (27%), Gaps = 21/146 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 VR ++D ++ + G+T A+ D + Sbjct: 74 NAQGVRFATVQYSDDPQTEFGLDTLGSGGDTIRAIRELS--YKGGNTRTGAALLHVSDRV 131 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 K +L+TDG + + K K QG+++ + Sbjct: 132 F-------LPHLTRPGIPKVCILITDG---KSQDLVDTAAQKLKRQGVKLFAVGI----- 176 Query: 411 QQEKARYFLSNCAS---PNSFFEANS 433 + L AS + FF N Sbjct: 177 -KNADPEELKRIASQPTSDFFFFVND 201 >gi|307196324|gb|EFN77937.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Harpegnathos saltator] Length = 2255 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 21/112 (18%), Positives = 36/112 (32%), Gaps = 22/112 (19%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ A + K+ EA K + L+TDG + + N Sbjct: 148 YSGGGTYTRGALLEALAIL----------EKSREEASKVVFLITDGFSNGGDPRP--AAN 195 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 K+ G + T ++ L + AS + + L F + Sbjct: 196 LLKNAGATVFTFGIRTGNVEE------LHDIASFPGY----THSYLLDSFAE 237 >gi|148665999|gb|EDK98415.1| matrilin 3 [Mus musculus] Length = 482 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 54/167 (32%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I +D R+ + V + + + + + A T A+Q Sbjct: 105 IDTLDIGATDTRVAVVNYASTVKIEFQL--NTYSDKQALKQAVARITPLSTGTMSGLAIQ 162 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + M K +++TDG + ++ + +A++ GI + + Sbjct: 163 TAMEEAFTVEAGARGPM---SNIPKVAIIVTDG---RPQDQVNEVAARARASGIELYAVG 216 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 V----DRADMES--LKMMASKPLEEHVFYVETYGVIEKLSARFQETF 257 >gi|306518578|ref|NP_034900.4| matrilin-3 precursor [Mus musculus] gi|6918887|emb|CAB72265.1| matrilin-3 [Mus musculus] gi|26339344|dbj|BAC33343.1| unnamed protein product [Mus musculus] Length = 481 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 54/167 (32%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I +D R+ + V + + + + + A T A+Q Sbjct: 105 IDTLDIGATDTRVAVVNYASTVKIEFQL--NTYSDKQALKQAVARITPLSTGTMSGLAIQ 162 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + M K +++TDG + ++ + +A++ GI + + Sbjct: 163 TAMEEAFTVEAGARGPM---SNIPKVAIIVTDG---RPQDQVNEVAARARASGIELYAVG 216 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 V----DRADMES--LKMMASKPLEEHVFYVETYGVIEKLSARFQETF 257 >gi|14548114|sp|O35701|MATN3_MOUSE RecName: Full=Matrilin-3; Flags: Precursor gi|2342635|emb|CAA71532.1| matrilin-3 [Mus musculus] Length = 481 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 54/167 (32%), Gaps = 20/167 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I +D R+ + V + + + + + A T A+Q Sbjct: 105 IDTLDIGATDTRVAVVNYASTVKIEFQL--NTYSDKQALKQAVARITPLSTGTMSGLAIQ 162 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA + + M K +++TDG + ++ + +A++ GI + + Sbjct: 163 TAMEEAFTVEAGARGPM---SNIPKVAIIVTDG---RPQDQVNEVAARARASGIELYAVG 216 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 V----DRADMES--LKMMASKPLEEHVFYVETYGVIEKLSARFQETF 257 >gi|296220039|ref|XP_002756140.1| PREDICTED: integrin alpha-X [Callithrix jacchus] Length = 1078 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 59/185 (31%), Gaps = 18/185 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +++ + +V+ ++ + F+ + + +F + Sbjct: 164 YNNFAMMKSFVRAVMSHFQRPST-----QFSLMQFSSKFKTHFTFEEFRESSNPLSLLAS 218 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T A+Q + ++ EA K ++++TDG+ D + Sbjct: 219 VDQL--GGYTHTATAIQKVVKELFLASNGAR------REATKILIVITDGKKEGDWLDYD 270 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + A GI I + + L++ AS F+ L +++ Sbjct: 271 DVIPMADKAGIIRYAIGVGSAFQNRNSWKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQ 328 Query: 445 IGNEI 449 + +I Sbjct: 329 LKEKI 333 >gi|238060187|ref|ZP_04604896.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881998|gb|EEP70826.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 779 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 62/196 (31%), Gaps = 18/196 (9%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + + A + + D V T G Sbjct: 597 DTSGSMNARVGGRTRFQVATTAADRAVGLLNSADRVALWSFSSETDQRRGKPYSEEIRLG 656 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + R G+TA+ ++ A+ ++ +++ + +V+LT Sbjct: 657 PY--DRAAFTRRLTGLRVGGNTALYATVRAAHRRLLDNHDPDRINA---------VVVLT 705 Query: 376 DGENTQD-NEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 DG+N + + + + ++ +++ +AF R AS F+A Sbjct: 706 DGKNEYPRDNDLDRLLADIELDPNRPVKVFCVAFDRESDLAALDRI---AGASAGKAFDA 762 Query: 432 NSTHELNKIFRDRIGN 447 +++ F + + Sbjct: 763 TDPATIDEAFVKLVSS 778 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 21/220 (9%), Positives = 58/220 (26%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + +A ++ + + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLQKAKRLVAQLVDKM-------ENDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ TAI A+ A + + ++++T Sbjct: 150 YISA-KMFLESIDPSLISKQGTAIGAAINLASRSFTPQ-----------EGVGRAVIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + + A +GI++ + + + Sbjct: 198 DGENHEGDAV--EAAKDAAEKGIQVNVLGVGMPEGAPIPAEGTNDYRRDRDGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + + ++T+ K I V Sbjct: 256 EQMCQEIAKAGNGIYVRVDNTNGAQKAISREINKMAKADV 295 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 54/201 (26%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 +S + ++A + + +G F + Sbjct: 108 SMASTDVPPNRLAAAKEAGKQFADQLTPA------INLGLVEFAANATLLVPPT-----T 156 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R VK TA + + TA I + V + IVL +DG Sbjct: 157 NRAAVKAGIDSLQPAPKTATGEGIFTALQAI--ATVGSVMGGGEGPPPAR-IVLESDGAE 213 Query: 380 TQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA-- 423 + AK++G++I TI+F + L Sbjct: 214 NVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEI 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + F A+S L ++ Sbjct: 274 TDGQAFHADSLDSLKNVYSTL 294 >gi|293348732|ref|XP_001072793.2| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] gi|293360639|ref|XP_243609.5| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] Length = 1613 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 64/195 (32%), Gaps = 21/195 (10%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D+SS + VR +A+++ + + R+G ++DR + Sbjct: 42 KSVQYDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPGHTRVGVVRYSDRPTT 97 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + G+T DA++ S+ N K Sbjct: 98 AFEL--GHFSSREEVKAAARRITYHGGNTNTGDALRYITSRSFSAQAGGRP---GNRAFK 152 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---P 425 + +LLTDG + + A A + GIRI + + L AS Sbjct: 153 QVAILLTDGRSQDLVLDAAA---AAHAAGIRIFAVGVGA------ALKEELDEIASEPKS 203 Query: 426 NSFFEANSTHELNKI 440 F + + ++KI Sbjct: 204 AHVFHVSDFNAIDKI 218 >gi|156409373|ref|XP_001642144.1| predicted protein [Nematostella vectensis] gi|156229285|gb|EDO50081.1| predicted protein [Nematostella vectensis] Length = 203 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 60/164 (36%), Gaps = 29/164 (17%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+GA F+ R +F ++ +R ++ G T A++ Y + S Sbjct: 45 QTRIGAVLFSSRPYLMFNFQ--KYRTVRQVLAALQRIRYPRGGTKTGRALRYTYSRLFRS 102 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + +++LTDG++ + A K+QG+ + I N +++ Sbjct: 103 RSRVRKQA---------LIVLTDGKSQDSVGQPAAF---IKNQGVELFAIGVGRNYRRRD 150 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDR--IGNEIFERVIR 455 L+ AS + F A F + I I +V R Sbjct: 151 -----LNQIASRGNVFTAK--------FENLGRIIGAIKSKVCR 181 >gi|76811718|ref|YP_333349.1| hypothetical protein BURPS1710b_1950 [Burkholderia pseudomallei 1710b] gi|237812074|ref|YP_002896525.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254188638|ref|ZP_04895149.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254261132|ref|ZP_04952186.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|76581171|gb|ABA50646.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|157936317|gb|EDO91987.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|237503192|gb|ACQ95510.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254219821|gb|EET09205.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 602 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 20 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 139 YFFLGPSRTVSATSTAKAT 157 >gi|320161334|ref|YP_004174558.1| hypothetical protein ANT_19320 [Anaerolinea thermophila UNI-1] gi|319995187|dbj|BAJ63958.1| hypothetical protein ANT_19320 [Anaerolinea thermophila UNI-1] Length = 652 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 47/413 (11%), Positives = 101/413 (24%), Gaps = 45/413 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY--------------------EHAL 53 + + G +I A+ + +L G+ VD Sbjct: 1 MAQRQHGQVLVIFAVALVALLLFVGLAVDSGVLYVTYGQLKRASDAAAVAAANNFKRGQT 60 Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + + A + + + + + D + + Sbjct: 61 ELEMKAAAMEIYRLHNVDTSTLDLNVQICDDDGDGFTDGTYKSGSDVLFDAGLLDSDEQK 120 Query: 114 IVR-------DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + T ++ + FL +GI ++T + E Sbjct: 121 VANFFEKICPRTDPTVSAGQQTASPRKLVYVSATQRANFYFLSLLGITHANLRTYSVGEA 180 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQ-RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 S V + VID S SM F P G Sbjct: 181 AS--------VDLVIVIDTSESMASECQSYDASGVCAFFKSPGYNNTTLDDYDPYAPGNC 232 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + Y ++ Y L + + ++ + D + ++ I Sbjct: 233 NTIPTTYAGITVQNSCYPLVQAEVSAISLVRTLYEGYDNVGLVTYDSVAVDRMGGLVNMI 292 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSW-------GVHKLIRTIVKTFAIDENEMGSTA 338 ++ + +D +W ++ R +G T Sbjct: 293 NRVA--IENAIASVKVHDDAPYGRLWPTWRVPGRFNPINPEDRDGDGLDVDTPARVGYTC 350 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 + DA + N +A ++ TDG+N D+ +I + Sbjct: 351 LLDADRWDESIDPYGWGGVPCDDDNLFDAFNWVDENTDGDNNPDSNPEFSIAD 403 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 23/84 (27%), Gaps = 26/84 (30%) Query: 396 QGIRIMTIAFSVNKTQQ-----EKARYFLSNCA--------------------SPNSFFE 430 I I TI + L A S +++ Sbjct: 569 NDIAIYTIGLGAVTPDNPPAWLDTGEKILRYMAAVGDDGDRVTDPCASVGPKRSCGNYYY 628 Query: 431 ANSTHELNKIFRDRIGNEIFERVI 454 A S L IF D I + I+ R+ Sbjct: 629 APSGDALRAIFDD-IASRIYTRIS 651 >gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 775 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D R+ F+D V + + R+ + ++ G T + A+ A + Sbjct: 298 DGDRVNVMAFDDGVDALFLRPVPISAERRSQAVEYIDRLSDGGGTDLAGALAEA----LD 353 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMTIAFSVNKTQ 411 + I+ LTDG+ D++ + + G R+ TI Sbjct: 354 AQHPSESEADTGSRPH-VILFLTDGQ--SDSQATLQVARG--DAGDARVFTIGVGDGV-- 406 Query: 412 QEKARYFLSNCASP--NSFFEANSTHELN 438 + L+ AS F S E+ Sbjct: 407 ---EKPLLARLASEKRGRFTFIASPSEIE 432 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 61/199 (30%), Gaps = 21/199 (10%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + FV SS + V+ + ++++ + + + R+G + Sbjct: 45 GNPCKAVPLDFVFVIDSSRSIRPRDYEKVKTFIVNLVQFL---EVGPEATRVGLLQYGSV 101 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 V + S S + + ++ T A+Q A +T + + Sbjct: 102 VQPEFSLS--TFSTKAEVEQAVRNMKHLATGTMTGLAIQYAAETSFTEADGARPAHL--- 156 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + V++TDG + + + +A+ GI+I I L S Sbjct: 157 HIPRIAVVVTDG---RPQDRVEEVAAQARQAGIQIFAIGVGR------VDMKTLKTIGSE 207 Query: 426 NSFFEANSTHELNKIFRDR 444 + L F Sbjct: 208 PHSEHVH----LVASFSQM 222 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 42/380 (11%), Positives = 92/380 (24%), Gaps = 69/380 (18%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + ++ + + +A Sbjct: 9 AAVGAAAGRSGKRLVAVVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V ++ G + + + + I V Sbjct: 69 LSTEGTNGVPDDPTLSAPAREKTVTANGDGTYKVALNVTGAKSAGTGAIVTNQPLDIVLV 128 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 +D S SM D + ++ + + + + +N K+ Sbjct: 129 LDVSGSMADNLSG-GPKKIDALKTAVNGFIDATADENAKI-------------------- 167 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + + + LV+ A +K ND R G + + Sbjct: 168 --------------------TDQSQRNRIALVKFAG------TEKTSVGNDFYREGWSSY 201 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N + ++ V L T + G+T+ + A A + + Sbjct: 202 N-YTQIVSNLTYDVSGLTST-----VNGLSASGATSADYAFNRAQAALT---------YQ 246 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKTQQEKA 415 AKK ++ TDGE + + A K G I +I + + Sbjct: 247 PRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTS 306 Query: 416 RYFLSNCASPNSFFEANSTH 435 +S + + Sbjct: 307 SNLNKYMHGISSNYPDATAT 326 >gi|296473276|gb|DAA15391.1| integrin, alpha D [Bos taurus] Length = 1165 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 56/167 (33%), Gaps = 13/167 (7%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISD---PSFSWGVHKLIRTIVKTFAIDENEMGS 336 R + V D + T F+ S+ F++ R+ G Sbjct: 170 DFKRMKNFVRAVMDRSKGTNTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGL 229 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T ++T + S A+K I+++TDGE +D E + +A+ Sbjct: 230 TFTATGIRTVVRELFHSKNGAR------KSARKIIIVITDGEKYKDPLEYKDVIPEAEKA 283 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I I + Q AR L S + F+ +S L+ I Sbjct: 284 NIIRYAIGVG-DAFQAHAAREELKIIGSVPSEDHVFKVDSFAALSSI 329 >gi|156523104|ref|NP_001095966.1| integrin alpha-D [Bos taurus] gi|151556938|gb|AAI49717.1| ITGAD protein [Bos taurus] Length = 1165 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 56/167 (33%), Gaps = 13/167 (7%) Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISD---PSFSWGVHKLIRTIVKTFAIDENEMGS 336 R + V D + T F+ S+ F++ R+ G Sbjct: 170 DFKRMKNFVRAVMDRSKGTNTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGL 229 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T ++T + S A+K I+++TDGE +D E + +A+ Sbjct: 230 TFTATGIRTVVRELFHSKNGAR------KSARKIIIVITDGEKYKDPLEYKDVIPEAEKA 283 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 I I + Q AR L S + F+ +S L+ I Sbjct: 284 NIIRYAIGVG-DAFQAHAAREELKIIGSVPSEDHVFKVDSFAALSSI 329 >gi|327273523|ref|XP_003221530.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Anolis carolinensis] Length = 1091 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 72/223 (32%), Gaps = 29/223 (13%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD------ALASV----- 281 + Y + ++ D I+ +D A SV Sbjct: 216 SATGLARYYPASPWVDKSRTQNKIDLYDVRRRPWYIQGAASPKDMLILVDASGSVSGLTL 275 Query: 282 ----IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMG 335 I+ ++ ++D + FN+ + F+ V + +K G Sbjct: 276 KLIRTSVIEMLETLSDDDFVNVVSFNENAQNVSCFNHLVQANVRNKKKLKEAVYKIQAKG 335 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK- 394 T A++ +++ N N K I+L TDG E+ I +K Sbjct: 336 ITDYKKGFSYAFEQLLNHNHSVFRANCN-----KIIMLFTDG----GEEKAQEIFDKYNV 386 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + +R+ T FSV + +K CA+ ++E S + Sbjct: 387 EKKVRVFT--FSVGQHNYDKGPIQWMACANKGYYYEIPSIGAI 427 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 60/190 (31%), Gaps = 20/190 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT--------VRMGATFFNDRVISDPS 311 DS + + + + + S ++ T ++ F + + Sbjct: 396 DSVTEPTDKLPVNDCQYDVGIIFDSSGSLEKNFQTQLQIANKLFQVAIVQFAGKSKTRVL 455 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + +K + K G+T N A++ +S D K + Sbjct: 456 ADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRMALLFEASKRDNC--------KMKLL 507 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 V TDG E+ K QGI + T+ S +K SP+ +F++ Sbjct: 508 VF-TDG---YSAEDTAEGIEALKRQGITVYTVGISTDKNAGLNVSELKGMATSPSHYFDS 563 Query: 432 NSTHELNKIF 441 + L K F Sbjct: 564 SDFDNLLKHF 573 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 55/190 (28%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + + + +DR + L RT KT Sbjct: 51 SRMAAAKQAFNEVLDATPEEVELGIRTLGADYPGDDRKTGCKDTAQLYPVGPLDRTEAKT 110 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ A + N K IVL++DGE+T + Sbjct: 111 AVATLTPTGWTPIGPALLKAAGDLDGGNG------------SKRIVLISDGEDTCAPLDP 158 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ + A+ ++ EL Sbjct: 159 CEVAREIAAKGIGLTIDTLGLVPTAKLSRQLSCIAE--ATGGTYTSVEHQDELTDRVNQL 216 Query: 445 IGNEIFERVI 454 + V Sbjct: 217 VDRAADPVVT 226 >gi|330465656|ref|YP_004403399.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328808627|gb|AEB42799.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 410 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 58/181 (32%), Gaps = 14/181 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + + A V+ ++ + V D+ V + R K Sbjct: 37 SRISVAQQAFGEVVDALPDETQLGIRVLGATYRGEDKQQGCLDTQQIVPVGPVNRERAKA 96 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ A + + IVL+TDGE+T + Sbjct: 97 AVATLRPTGFTPVGLALREAAKDL------------GGGTTARRIVLITDGEDTCAPPDP 144 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + +QG ++ + ++ + + A+ ++ A S +L + + Sbjct: 145 CQVARELAAQGTTLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAATSAEDLTGRLKQLVD 204 Query: 447 N 447 Sbjct: 205 R 205 >gi|324508820|gb|ADY43721.1| C-type lectin protein 160 [Ascaris suum] Length = 534 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/247 (16%), Positives = 81/247 (32%), Gaps = 17/247 (6%) Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + T + ++S + R +P V + LD + + Sbjct: 137 STGSPQHVNTAVTKRRWTSTSYTTTQRPVPTTPSQVLKERQCACDPAKLYLDIVVVVDSS 196 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 + + + L A ++ +V VR+G F+++ + + Sbjct: 197 LSMTKDGLIEVAADL---ATVFQWMNVSSGTDVGQFVRVGLVTFSNQAFVNGNL-DDFTS 252 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + F + I A+Q+A D + S + A+ I+L T Sbjct: 253 YNSLVKRLFQMPYLGGSELNIESALQSASDILQS----------SRYYARTAILLYTSAY 302 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + AI N+ K G +I+T+AF + LS+ ASP F A+ + Sbjct: 303 GEGGFTDPKAIANQIKESGTKIITVAFRQQPEGSLVEK--LSHLASPG-FSFASRQSIIT 359 Query: 439 KIFRDRI 445 + Sbjct: 360 DEILRAL 366 >gi|319949307|ref|ZP_08023384.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] gi|319437027|gb|EFV92070.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] Length = 326 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 47/143 (32%), Gaps = 19/143 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V + TA +A+ TA I + + IVLL+DG Sbjct: 147 TTDRGPVVRAVDRLSLDERTATGEAIYTATQAITT---FTESLGGPDQAPPARIVLLSDG 203 Query: 378 ENTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA 423 + T + +A GI + TI+F + L A Sbjct: 204 KETVPADPTEERGAFTAAERAAEAGIPVSTISFGTLYGTVDIQGRPQPVPVDDASLRTIA 263 Query: 424 --SPNSFFEANSTHELNKIFRDR 444 S FF A++ EL+ ++R Sbjct: 264 ELSGGDFFTASTLEELDSVYRTL 286 >gi|266631670|emb|CBH29316.3| integrin, alpha X (complement component 3 receptor 4 subunit [Equus caballus] Length = 1160 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + + G T A+Q + S+ Sbjct: 187 TQFSLMQFSNKFLVHFTFKDFMDSSDPLGLLNSVSQLR--GLTHTASAIQVVIKELFSAT 244 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D + A++ GI + Q + Sbjct: 245 RGAR------KDASKILIVITDGQKQGDYLGYDDVIPMAEAAGIIRYAVGVGS-AFQSTQ 297 Query: 415 ARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEI 449 A L++ AS S F+ + L ++++ +I Sbjct: 298 AWQELNDIASKPSHEHIFKVDDFDALRD-IQNQLKEKI 334 >gi|166795266|ref|NP_001107649.1| integrin alpha-X [Equus caballus] gi|164507177|gb|ABY59790.1| integrin alpha X [Equus caballus] Length = 1160 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + + G T A+Q + S+ Sbjct: 187 TQFSLMQFSNKFLVHFTFKDFMDSSDPLGLLNSVSQLR--GLTHTASAIQVVIKELFSAT 244 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D + A++ GI + Q + Sbjct: 245 RGAR------KDASKILIVITDGQKQGDYLGYDDVIPMAEAAGIIRYAVGVGS-AFQSTQ 297 Query: 415 ARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEI 449 A L++ AS S F+ + L ++++ +I Sbjct: 298 AWQELNDIASKPSHEHIFKVDDFDALRD-IQNQLKEKI 334 >gi|83643000|ref|YP_431435.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631043|gb|ABC27010.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 733 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 59/188 (31%), Gaps = 26/188 (13%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ A +V++++ + R FN + + + F Sbjct: 372 SIQQARDAVLQALDTLTPR---DRFNVIEFNSHARKLFPQAVPAQERALQQARRFVRGLK 428 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T I +A+ A + + + + +V LTDG + I + Sbjct: 429 ADGGTEIAEALDRA---LSDAAPEGY---------VRQVVFLTDGSVGNELALFKQIDQQ 476 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG--NE 448 R+ T+ R+F+ A ++ N T E++ + + Sbjct: 477 LGDS--RLFTVGIG-----PSPNRFFMRKAAQFGRGAYSHINDTAEVSDKIAELTAALRQ 529 Query: 449 IFERVIRI 456 R +R+ Sbjct: 530 PALRDVRL 537 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 50/167 (29%), Gaps = 23/167 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K D D R+G F+ + H +++ + G+T A+ Sbjct: 35 VKAFDVDLDRTRIGLAQFSGEPRIEWHL--NTHTTKEAVMEAARNLPYKGGNTLTGLALT 92 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + S + K VLLTDG+ +QD+ + + G+ + I Sbjct: 93 FILENSFS------PESGSRPGIPKIGVLLTDGK-SQDDVIPP--AQRLRDAGVEVFAIG 143 Query: 405 FSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 + L AS + ++ + I Sbjct: 144 V------KNADEGELRAIASVSEDTHVYNVADFHLMADIVDVLTRTI 184 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 54/148 (36%), Gaps = 24/148 (16%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D ++ ++ +++ + + G+T A+Q D++ + Sbjct: 1311 VQFSDEPRTEVQLKS--YRKKERLLEAISSISYKGGNTKTGRAIQHMKDSVFTE------ 1362 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 K +VLLTDG +QD+ ++ + + QG + I F+ + L Sbjct: 1363 EGGARTAVPKVLVLLTDGR-SQDDVSKVS--KELQKQGFVVFAIGFADADYGE------L 1413 Query: 420 SNCA---SPNSFFEANSTHELNKIFRDR 444 N A S F + F++ Sbjct: 1414 VNVASKPSSTHVFFVDD----LDAFKEM 1437 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/279 (11%), Positives = 80/279 (28%), Gaps = 31/279 (11%) Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 A T + + + + D ++++ D + ++ + + + G Sbjct: 37 NAARTTTTDNVTINNWHDANALDDSTKNVGRIWTDKSVSAGDVTLTSREKESGTATIKKG 96 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 LS + + + + D K+ ++ A+ S Sbjct: 97 ADSDFLVGLSALSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDALKAAVNS 156 Query: 281 VIR-----------------------SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 I + K D + + ++ + + Sbjct: 157 FIDGSAKVNDQRADVNKQNRIAVVKFAGNKTDKIGNDQYSQNRYWYNYTQVVSGYKAYTS 216 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ +T G TA + AM T++ ++ + + + K+ ++ TDG Sbjct: 217 G-NKSEWETTVNALKPAGCTAADYAMDL-TKTLVDQSKTDANNNADRKNVKRVVIFFTDG 274 Query: 378 ENTQDNEEGIAICNKA--KSQGIR----IMTIAFSVNKT 410 E + + N A ++ I+ I TI Sbjct: 275 EPNHQSGFDDDVANDAITSAKTIKTDADIYTIGIFSGAD 313 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + R+ F++ V F+ + ++ R + +TA+ D + A Sbjct: 152 DILRPQDRLSVYAFSETVEEIVPFTSDLRRIDR-----GISEIIAGSATAMYDTIFLASK 206 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---AF 405 ++ + +K +VL+TDG +T + A + +I Sbjct: 207 ALMK------------HDGRKVMVLITDGGDTFSSTSYEQAARAATQSETLLYSIIVVPV 254 Query: 406 SVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + + + + L + + + A L+ F+ Sbjct: 255 ANSAGRDTGGEHALIQISQDTGGKHYYATDMGSLDVAFKQI 295 >gi|312879450|ref|ZP_07739250.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] gi|310782741|gb|EFQ23139.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] Length = 813 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 46/377 (12%), Positives = 98/377 (25%), Gaps = 22/377 (5%) Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD--IVRDTAVEMNPRKSAYQVVLSSR 136 + T + + T V PR + +S Sbjct: 155 DVFNSTTFVGAASTATSQANGGYTLGHETGHYVYALYDEYVATYTNPPRSWPWSSDVSPS 214 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 ++N +++ A T + + K W + D Sbjct: 215 GAAIMNSQWNAMKNNDGGWLNHSIAAFYVTSTDTAQKRMYGKSCWQVLLQAPSQDGVSGD 274 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + + +S + + + K L+ L Sbjct: 275 TTVKPSRVQY--SFSKPVTTSGDYLNAVTPGTHLAAAQANLKILWLAEATMVYQIVLDRS 332 Query: 257 HFVDSSSL-RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + A +++ S+ V +G F+D + Sbjct: 333 GSMGTNPDKPDDPTPLSYAKTAACNLVDSL------PKNVYVGIVQFDDSTSQVYPITLI 386 Query: 316 VHK-----LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 R K GSTAI DA A + + + + Sbjct: 387 ASNDAAAAATRAAAKAAINGLTSGGSTAIYDAASYALS------QFVAQKTALSADLLGV 440 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 LLTDGE+ ++ + + ++Q + ++T+ + A L++ F Sbjct: 441 TYLLTDGEDNSSSKSVGEVIGEYQAQKVPLITVGYGAGGQAGSFALTQLADGTGGQYFAS 500 Query: 431 ANSTHELNKIFRDRIGN 447 L ++F +G Sbjct: 501 PVDQAALQQVFFAALGK 517 >gi|221108467|ref|XP_002170770.1| PREDICTED: similar to collagen, type XXIX, alpha 1, partial [Hydra magnipapillata] Length = 592 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 66/193 (34%), Gaps = 24/193 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S ++ +K ++ A+++ S+ R F+ L Sbjct: 411 SGSLRKYYQNEKDFLKSAISAFGVSVNGT-------RAAVITFSYHAQLSIKL-NKYSNL 462 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I +T I+ A++ A + + K + LLTDG Sbjct: 463 NSFKEAVDNIVLM-GSTTRIDKALRLAQKEVFELENGARPGVA------KILFLLTDGSQ 515 Query: 380 TQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 TQ +E +AI N+ +S G+ I+ I L + A + + A+S + Sbjct: 516 TQERGSENPVAIANELRSAGVTIIVIGI-----TNAVDVSELFDIAGGEENAYFADSFEK 570 Query: 437 LNKI-FRDRIGNE 448 L + F + + + Sbjct: 571 LKDVKFLEVVKRK 583 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 65/190 (34%), Gaps = 24/190 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S ++ +K ++ +++ S R F+ L Sbjct: 215 SGSLRKYYQNEKDFLKSVISAFGVSFNGT-------RAAVITFSYHAQLSIKL-NKYSNL 266 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I +T I+ A++ A + + K + LLTDG Sbjct: 267 NSFKEAVDNIVLM-GSTTRIDKALRLAQKEVFELENGARPGVA------KILFLLTDGSQ 319 Query: 380 TQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 TQ +E +AI N+ +S G+ I+ I + E LS+ A + + A S + Sbjct: 320 TQERGSENPVAIANELRSAGVTIIVIGITNAVNVSE-----LSDIAGGEENAYAAESFEK 374 Query: 437 LNKI-FRDRI 445 L + F D I Sbjct: 375 LKDVNFLDVI 384 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 64/190 (33%), Gaps = 24/190 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S ++ +K ++ +++ S R F+ L Sbjct: 19 SGSLRKYYQNEKDFLKSVISAFGVSFNGT-------RAAVITFSYHAQLSIKL-NKYSNL 70 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I +T I+ A++ A + + K + LLTDG Sbjct: 71 NSFKEAVDNIVLM-GSTTRIDKALRLAQKEVFELENGARPGVA------KILFLLTDGSQ 123 Query: 380 TQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 TQ +E +AI N+ +S G+ I+ I + E LS+ A + + S + Sbjct: 124 TQERGSENPVAIANELRSAGVTIIVIGITNAVNVSE-----LSDIAGGEENAYATESFEK 178 Query: 437 LNKI-FRDRI 445 L + F D I Sbjct: 179 LKDVNFLDVI 188 >gi|149027999|gb|EDL83450.1| complement factor B, isoform CRA_c [Rattus norvegicus] Length = 739 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 37/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + N I+ Sbjct: 304 SSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS--WPGDAPPEGWNRTRH-VII 360 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + + + + R + Sbjct: 361 IMTDGLHNMG-GDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALAS 419 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ +L +F I + Sbjct: 420 KKN-NEQHVFKVKDMEDLENVFYKMIDETKSLGL 452 >gi|149028000|gb|EDL83451.1| complement factor B, isoform CRA_d [Rattus norvegicus] Length = 543 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 37/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + N I+ Sbjct: 108 SSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS--WPGDAPPEGWNRTRH-VII 164 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + + + + R + Sbjct: 165 IMTDGLHNMG-GDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALAS 223 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ +L +F I + Sbjct: 224 KKN-NEQHVFKVKDMEDLENVFYKMIDETKSLGL 256 >gi|218156285|ref|NP_997631.2| complement factor B [Rattus norvegicus] gi|149027998|gb|EDL83449.1| complement factor B, isoform CRA_b [Rattus norvegicus] Length = 763 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 37/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + N I+ Sbjct: 328 SSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS--WPGDAPPEGWNRTRH-VII 384 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + + + + R + Sbjct: 385 IMTDGLHNMG-GDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALAS 443 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ +L +F I + Sbjct: 444 KKN-NEQHVFKVKDMEDLENVFYKMIDETKSLGL 476 >gi|33086684|gb|AAP92654.1| Da1-24 [Rattus norvegicus] Length = 1116 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 37/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + N I+ Sbjct: 681 SSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS--WPGDAPPEGWNRTRH-VII 737 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + + + + R + Sbjct: 738 IMTDGLHNMG-GDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALAS 796 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ +L +F I + Sbjct: 797 KKN-NEQHVFKVKDMEDLENVFYKMIDETKSLGL 829 >gi|46237594|emb|CAE83972.1| B-factor, properdin [Rattus norvegicus] gi|56268879|gb|AAH87089.1| Complement factor B [Rattus norvegicus] Length = 761 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 37/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + N I+ Sbjct: 326 SSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS--WPGDAPPEGWNRTRH-VII 382 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + + + + R + Sbjct: 383 IMTDGLHNMG-GDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALAS 441 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ +L +F I + Sbjct: 442 KKN-NEQHVFKVKDMEDLENVFYKMIDETKSLGL 474 >gi|218200417|gb|EEC82844.1| hypothetical protein OsI_27660 [Oryza sativa Indica Group] Length = 423 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 50/176 (28%), Gaps = 32/176 (18%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F+ + + + + K+ + G T I + A Sbjct: 20 DRLAVVSFSYNARRVIRLT-RMSEDGKASAKSAVESLHADGCTNILKGLVEAAKVFD--- 75 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--------------SQGIRI 400 + A ++LL+DG++ + G + +K + + + Sbjct: 76 ------GRRYRNAVASVILLSDGQDNYNVNGGWG-ASNSKNYSVLVPPSFKRSGDRRLPV 128 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 T F + + A + +F + + F IG + V Sbjct: 129 HTFGFGTDHDAAA-----MHAIAEETGGTFSFIENQAVVQDAFAQCIGGLLSVTVQ 179 >gi|50913505|ref|YP_059477.1| collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|19108443|gb|AAL11466.1| FctX [Streptococcus pyogenes] gi|50902579|gb|AAT86294.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|198417345|gb|ACH87890.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417347|gb|ACH87891.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1036 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/314 (12%), Positives = 85/314 (27%), Gaps = 47/314 (14%) Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSE------GQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 K++ + I V+D S SM + + + + + Q YSS G+ R Sbjct: 436 KKNPLDILVVVDKSGSMQEGIGSVQRYRYYAQRWDDYYSQWVYHGTFDYSSYQGESFNRG 495 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR-DALASVIRSI 285 + Y + S + L + + K ++ A Sbjct: 496 QIHYRYRGIVSVSDGIRRDDAVKNSLLGVNGLLQRFVNINPENKLSVIGFQGSADYHAGK 555 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 D N R L + + T + A+ Sbjct: 556 WYPDQSPRGGFYQPNLNNSRDAELLKGWSTNSLLDPNTLTALHNN-----GTNYHAALLK 610 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----------------NEEGIA- 388 A + + +D +K ++ ++DG T N + Sbjct: 611 AKEILNEVKDDGR---------RKIMIFISDGVPTFYFGEDGYRSGNGSSNDRNNVTRSQ 661 Query: 389 -----ICNKAKSQ--GIRIMTIAFSVNKTQQEKARY--FLSNCASPNSFFEANSTHELNK 439 ++ K++ + I ++ S + + L + ++ T EL K Sbjct: 662 EGSKLAIDEFKARYPNLSIYSLGVSKDINSDTASSSPVVLKYLSGEEHYYGITDTAELEK 721 Query: 440 IFRDRIGNEIFERV 453 + + ++ Sbjct: 722 TLNKIVEDSKLSQL 735 >gi|148749338|gb|ABR09545.1| CD11c protein [Callithrix jacchus] Length = 1161 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 59/185 (31%), Gaps = 18/185 (9%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +++ + +V+ ++ + F+ + + +F + Sbjct: 164 YNNFAMMKSFVRAVMSHFQRPST-----QFSLMQFSSKFKTHFTFEEFRESSNPLSLLAS 218 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T A+Q + ++ EA K ++++TDG+ D + Sbjct: 219 VDQL--GGYTHTATAIQKVVKELFLASNGAR------REATKILIVITDGKKEGDWLDYD 270 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + A GI I + + L++ AS F+ L +++ Sbjct: 271 DVIPMADKAGIIRYAIGVGSAFQNRNSWKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQ 328 Query: 445 IGNEI 449 + +I Sbjct: 329 LKEKI 333 >gi|315186713|gb|EFU20471.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 331 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 37/149 (24%), Gaps = 40/149 (26%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T + + A + ++ ++LL+DGE + A Sbjct: 165 SPGTDVASGLSRALE-----------AFPEQSNRQRLVILLSDGEALTGEIGP--VLELA 211 Query: 394 KSQGIRIMTIAFSVNKTQQEK-------------------ARYFLSNCA--SPNSFFEAN 432 ++ G+ + T+ L A + FF Sbjct: 212 RNLGVAVHTVGIGTESGGPVPLEGEEVLKKPSGEPVISRLDASLLKRIAEITGGRFFSVE 271 Query: 433 STHELN-----KIFRDRIGNEIFERVIRI 456 + + I E E + R+ Sbjct: 272 NAEGQTFQHVVSTIEETIAREEREGI-RL 299 >gi|147902754|ref|NP_001082889.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Danio rerio] gi|94732178|emb|CAK04720.1| novel protein similar to vertebrate calcium channel voltage-dependent alpha 2 delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1052 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/272 (12%), Positives = 79/272 (29%), Gaps = 22/272 (8%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + I + L++ + E + +++ S G Sbjct: 111 NYSHTAVQIPTDIYKGAPVILNELNWTQALERVFIENSRDDPSLLWQAFGSATGVTRYYP 170 Query: 227 EK--LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 +P + Y VD S + L++ ++ ++ + Sbjct: 171 AAPWRAPDKIDLYDVRRRPWYIQGASSPKDMVILVDVSGSVSGLT-LKLIKASVTEMLDT 229 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVKTFAIDENEMGSTAIND 341 + D VN FN++ + + + + I K G+T Sbjct: 230 LSDDDYVN------VARFNEKAEAVVPCFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKS 283 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A++ +++ K I+L TDG + + ++ +R+ Sbjct: 284 GFHFAFNQLLNKTNVPRANCN------KIIMLFTDGGEDRAQDIFEQY--NWPNKTVRVF 335 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 T + + ++ C++ +FE S Sbjct: 336 TFSVGQHNYDVTPLQWI--ACSNKGYYFEIRS 365 >gi|167924336|ref|ZP_02511427.1| hypothetical protein BpseBC_37618 [Burkholderia pseudomallei BCC215] Length = 396 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 94/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNHALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|167829718|ref|ZP_02461189.1| hypothetical protein Bpseu9_38925 [Burkholderia pseudomallei 9] Length = 396 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 94/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|17554334|ref|NP_498645.1| MUscle Positioning family member (mup-4) [Caenorhabditis elegans] gi|15789306|gb|AAA28092.5| Muscle positioning protein 4 [Caenorhabditis elegans] Length = 2104 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/253 (11%), Positives = 65/253 (25%), Gaps = 21/253 (8%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 D + Y+ + + + L Sbjct: 381 CSDKSLNSCDENADCVQLP-DGYTCKCFAGYVDVSSNAN----LPPGRVCTLSTACPAQP 435 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + V LA R+ ++D++ + Sbjct: 436 TDLVFLIDGSGSIGSYVFQTEVLRFLAEFTELFDIAPQKT---RVSVVQYSDQIRHEFGL 492 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + +++ E G T A++ + S + + + Sbjct: 493 DN--YSDRKSLQNAIRNIEYLTGLTRTGAAIEHVANEAFSERRGARPVGQ----VSRVAI 546 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 ++TDG + + + A+ Q I++ + + + E S + F + Sbjct: 547 VITDGRSQDNVTRPS---DNARRQDIQLFAVGVTNHVLDAELEEIS----GSKDRTFHVS 599 Query: 433 STHELNKIFRDRI 445 +LN R I Sbjct: 600 GFEDLNTRLRSAI 612 >gi|14579227|gb|AAK69172.1|AF289202_1 transmembrane matrix receptor MUP-4 [Caenorhabditis elegans] Length = 2104 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/253 (11%), Positives = 65/253 (25%), Gaps = 21/253 (8%) Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 D + Y+ + + + L Sbjct: 381 CSDKSLNSCDENADCVQLP-DGYTCKCFAGYVDVSSNAN----LPPGRVCTLSTACPAQP 435 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + V LA R+ ++D++ + Sbjct: 436 TDLVFLIDGSGSIGSYVFQTEVLRFLAEFTELFDIAPQKT---RVSVVQYSDQIRHEFGL 492 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + +++ E G T A++ + S + + + Sbjct: 493 DN--YSDRKSLQNAIRNIEYLTGLTRTGAAIEHVANEAFSERRGARPVGQ----VSRVAI 546 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 ++TDG + + + A+ Q I++ + + + E S + F + Sbjct: 547 VITDGRSQDNVTRPS---DNARRQDIQLFAVGVTNHVLDAELEEIS----GSKDRTFHVS 599 Query: 433 STHELNKIFRDRI 445 +LN R I Sbjct: 600 GFEDLNTRLRSAI 612 >gi|170742065|ref|YP_001770720.1| hypothetical protein M446_3920 [Methylobacterium sp. 4-46] gi|168196339|gb|ACA18286.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 418 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/275 (12%), Positives = 66/275 (24%), Gaps = 13/275 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G ++ +LL PV LGV + +D+ + LK A + Sbjct: 2 RHFFRDRAGQITVLASLLSPVGLGVAALAIDLSTLQMVKQRLKVTA--------DAASLA 53 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 V + + + + D T +A QV Sbjct: 54 AVAVLPDTGTALSRALAIAADNAGAGAGSVTTAADVQFGSYDSATRTFTAGATPANAVQV 113 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY-HKEHGVSIQWVIDFSRS-- 188 S ++ F R++G + + A A S +Y S + S + Sbjct: 114 TASRSEARGNPVITGFARALGWATPDLSASAVAVRFSPAYCFLVLDPSASGALSVSGTGR 173 Query: 189 -MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + + + S G SP ++ Sbjct: 174 LSVPNCGVQVNSTSASAATTGNNSTAQARSFCITGGYSGSSFSPVPIT-RCLPAADPLAD 232 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +P+ + + Sbjct: 233 IPEPAPPTAGCYYNGLNTGSGMTLPSNVTYCGKIT 267 >gi|167899815|ref|ZP_02487216.1| hypothetical protein Bpse7_39190 [Burkholderia pseudomallei 7894] Length = 396 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 94/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLAFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|317505805|ref|ZP_07963650.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316255887|gb|EFV15112.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 343 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 66/192 (34%), Gaps = 26/192 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 IK +D + TV++G F S + I + + TA + + Sbjct: 127 IKFVDGMAPTVQLGVVTFAGNAQPLVRPSTDHETAKKVIDQ-MIRPDKLEKQTATGEGIY 185 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-----AICNKAKSQGIR 399 TA I + N IVL++DG+ T ++ A AK + + Sbjct: 186 TALQQIETIAGA---LGGKNHAPPARIVLVSDGKETVPDDLNAPRGAYAAARTAKEKHVP 242 Query: 400 IMTIAFSVNKTQQEKARYF---------LSNCA----SPNS---FFEANSTHELNKIFRD 443 + T+AF + L + SP + FF A S EL +I++ Sbjct: 243 VCTVAFGTKAGKITLDNQVDEVPVDLESLKKISDLSNSPGNSCRFFPAESQGELAQIYQS 302 Query: 444 RIGNEIFERVIR 455 +I +R Sbjct: 303 L-NEDIGYENVR 313 >gi|311743549|ref|ZP_07717355.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311312679|gb|EFQ82590.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 445 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/242 (11%), Positives = 57/242 (23%), Gaps = 9/242 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS---VPLIQS 71 I G IITA++M +L +G + +D S+ + AA + + Sbjct: 12 IHEDRGATAIITAVIMTSLLSIGALTIDYGAASHVRRQTQNAADATTRSIVENCAKQAAA 71 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR------ 125 + + N + V T E Sbjct: 72 AGQNVLDGACVSATGTADASTIVQGNAPGSAPDAPVIGGAGREVSVTVAEPVDYRLAQLL 131 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 VV SS N + + T + + + + V+ Sbjct: 132 GKDSDVVRSSATAEWTNLRPVEGFPVVPFGVAQCTYNDFRPSAGNVDGARTLIRSDVLQT 191 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 RS+++ + G + + +++ + Sbjct: 192 VRSLVNNATGLLSGIAGFLRGILFPVNDLIGAITGSDAECGDGSANDVLAMQGGAWLTEA 251 Query: 246 PG 247 Sbjct: 252 ID 253 >gi|323341955|ref|ZP_08082188.1| hypothetical protein HMPREF0357_10368 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322464380|gb|EFY09573.1| hypothetical protein HMPREF0357_10368 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 162 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 48/160 (30%), Gaps = 14/160 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 G+ I+ A ++ +LG + DVV + L + A + + Sbjct: 3 NFKNEEDGNILIMFAFILMFLLGFVALSTDVVLAMNKKDKL---TEVAYL--VREARLDI 57 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E ++ E L RN + + E R + Y + Sbjct: 58 TEEMLHSQQGQQIALDNAHEILKRNGIDPSQFELVWNEQRGM---------SPNGNYMIS 108 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 L+ L + L+++GI + I+ + Sbjct: 109 LTGTIILRDVYKTTTLKALGIDEFPIKVELPVWQNFTDQQ 148 >gi|89094360|ref|ZP_01167301.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] gi|89081419|gb|EAR60650.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] Length = 707 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/205 (10%), Positives = 49/205 (23%), Gaps = 22/205 (10%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 L + + + K + + + + ++ D Sbjct: 290 NNKGTYKLTLTPGTDLPAFNQGRDWVFVLDISGSMKGKFAALVEGVREGLSNLSPND--- 346 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R FN++ S K T + +QT + + S Sbjct: 347 ---RFRIVLFNNQARSFTQGYLPADKTTVENTLNQLDQIQPGQGTNLYAGLQTGINQLDS 403 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 IVL+TDG N + + +R+ T + + Sbjct: 404 DRSTA-------------IVLVTDGVANVGTTHKSKFLNLLEQKDVRLFTFIMGNSANRP 450 Query: 413 EKARYFLSNCASPNSFFEANSTHEL 437 S +++ ++ Sbjct: 451 LLEEMT---RVSNGFAMSISNSDDI 472 >gi|39936736|ref|NP_949012.1| hypothetical protein RPA3674 [Rhodopseudomonas palustris CGA009] gi|192292562|ref|YP_001993167.1| TadE family protein [Rhodopseudomonas palustris TIE-1] gi|39650592|emb|CAE29115.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286311|gb|ACF02692.1| TadE family protein [Rhodopseudomonas palustris TIE-1] Length = 177 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 14/168 (8%), Positives = 50/168 (29%), Gaps = 1/168 (0%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITASV 66 ++K ++ G + A++ P+ + +++V + L+ A ++ + + Sbjct: 9 RAMARKFRRNRKGSAAVEFAIVAPIFFALLFAIIEVAMIFFASQVLETAVQDSSRLIFTR 68 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++R N +F + + ++ + Sbjct: 69 QAQDASMTQDQFKTEVCKRLISLFDCSIVRVDVQNYGSDFGTVSITTPIDSNKKFVDNMQ 128 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 LF+ +G + + + + ++ E Sbjct: 129 YNIGKAGDIIVVRAFYQWPLFVTGLGFDTSNLAGGKRLLSATAAFRNE 176 >gi|218261918|ref|ZP_03476586.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] gi|218223694|gb|EEC96344.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] Length = 339 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 61/203 (30%), Gaps = 43/203 (21%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L+ + ++G F + + + + + Sbjct: 108 SPDRLSKAKQMLSKLTD-------GFSNDKVGLIVFAGDAFTQLPITSDYVSA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + S + K I+L+TDGEN +D+ Sbjct: 160 INPSMVSTQGTAIGAAINLAVRSFTPS-----------ETSDKAIILITDGENHEDDAVK 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ---EKARYFLSN--------------C-----AS 424 A A +GI + + K + + ++ + C A Sbjct: 209 AAA--AAAEKGIHVNIVGMGDPKGSPIPVDGSNNYMKDKDGNVVITKLNEEMCQEIAAAG 266 Query: 425 PNSFFEANSTHELNKIFRDRIGN 447 ++ A++T+ + + I Sbjct: 267 HGTYVRADNTNSALRALQKEIDK 289 >gi|167908132|ref|ZP_02495337.1| hypothetical protein BpseN_38271 [Burkholderia pseudomallei NCTC 13177] Length = 396 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 93/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAAAT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|163754946|ref|ZP_02162067.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] gi|161325013|gb|EDP96341.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] Length = 593 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/375 (12%), Positives = 100/375 (26%), Gaps = 26/375 (6%) Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-PRKSAYQVVLSSRY 137 A + N ++ + T V ++ S Sbjct: 61 ANKGQILVFSYVGFTSQEIKITQPVVNVQLKQGEALECVTVVAYGVAKEKKALAYAVSTV 120 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + + + A T S + +I+ +I+ + Sbjct: 121 HGSSVENRESYKKINENRFKRTSLAPLSTFSIDVDRAAYSNIRRMINN-GEKIPVNAVKI 179 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + +N F + + + + +L P P + Sbjct: 180 EEMINYFEYDYAQPTDKHP-MVIHTELGQTPWNKDTRLLKIALKGKEIPLEEIPPSNFTF 238 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 +D S K L++ A ++ ++ D V V GA Sbjct: 239 LIDVSGSMFAQNKLPLLKSAFTLMVNKMRPEDKVAIVVYAGAAGMVLEP---------TS 289 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + + GSTA ++ AY T + NN +++ TDG Sbjct: 290 GNNKQKILDALNNLRAGGSTAGGAGIELAYKTATE----NFIKNGNNR-----VIMATDG 340 Query: 378 ENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + + + + G+ + + F + Q ++ + ++ Sbjct: 341 DFNVGATSETAMETLIEQKRETGVYLSVLGFGMGNYQDDRLELLADK--GNGNHAYVDTM 398 Query: 435 HELNKIFRDRIGNEI 449 E +KIF G + Sbjct: 399 QEAHKIFGKEFGGTL 413 >gi|311264542|ref|XP_003130217.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Sus scrofa] Length = 998 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 49/145 (33%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNV---NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ + ++ V +++ A F+ V DP FS Sbjct: 65 FDKQKDFVDSLSDKLFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSSWKDLHTFKQRVKS 124 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 T A+ A + ++ K +L+TDG + N + Sbjct: 125 MNFI--GQGTFSYYAIANATRLLKREGR---------KDSVKVALLMTDGIDHPKNPDVQ 173 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A++ GI +TI S + Sbjct: 174 SISEDARNAGIIFITIGLSTVVNET 198 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 48/155 (30%), Gaps = 17/155 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + + + + T A+ A +S Sbjct: 688 TARIGIINYSHKVEEVAHLTQ--FSSKDALKRAVDNMQYLGEGTYTATALHAANRMFEAS 745 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 E KK +++TDG+ +T+D + + A + I I Sbjct: 746 RP----------EVKKVALVITDGQTDTRDEKNLTDVVKNASDINVEIFVIGVVKKNDPN 795 Query: 413 -EKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 E ++ A+ ++ + L + Sbjct: 796 FEMFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 830 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 56/460 (12%), Positives = 120/460 (26%), Gaps = 86/460 (18%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ Q + + + + E +N + + T + + + Sbjct: 49 ALESEQETPQEEGGSVSESGNEDGRGNEGEGQNDGSEADEAETGQVADPSSSRSVNVLRA 108 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + L+ L + G +I +A A + I V+D Sbjct: 109 GGLVRGDTGGTSGGEGDLVLAKSLETDGQGDKVITLEAYASGSMSIVEQGIPCDIVLVLD 168 Query: 185 FSRSMLD-YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 S SM D + L+ + + + + G VS SL Y Sbjct: 169 QSGSMDDRFGSQGSYHALSGYSNKRLGDLAENGNLYVRGGDGSYVAVDVDVSGFISLKYS 228 Query: 244 LYPGPLDPSLSEEHFVDSSS----------LRHVIKKKHLVRDALASVIRS--------- 284 L+ L+ E + + ++DA S ++S Sbjct: 229 YTWEGLEAPLTSEGRYTVPQFEGVTFYSFQTDQTVTRIAALKDAANSFVQSVRSNSLGED 288 Query: 285 -----------------IKKIDNVNDTVRMGATFFNDRVISD--PSFSWGVHKLIRTIVK 325 DN V + + S +F ++ Sbjct: 289 GIAGTVDDVPHRIAVIGFASGDNTELFVGSASYPYGQSAQSQYGSAFQDMTTEMGFDNAL 348 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T ++D + A ++ + + V+LTDG + Sbjct: 349 ASIGQLSADGGTLVDDGLDMANGVFSAN------PLVTGELRNRVTVVLTDGAPGLYGND 402 Query: 386 ------GIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFL------------------ 419 I+ ++ K+ G + +I + + + + Sbjct: 403 RGVANEAISQASELKTAGSTVFSIGIFPGADASGDLPDQSAMGWGDNDSNRFMHLLSSNY 462 Query: 420 ---------------SNCASPNSFFEANSTHELNKIFRDR 444 +SP+ + A+ + LN IF+ Sbjct: 463 PDASSMGSPGARFVDEEGSSPDYYLSASDSAGLNSIFQSI 502 >gi|254692857|ref|NP_081450.1| collagen, type XXII, alpha 1 [Mus musculus] Length = 1613 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 64/195 (32%), Gaps = 21/195 (10%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D+SS + VR +A+++ + + R+G ++DR + Sbjct: 42 KSVQYDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPGHTRVGVVRYSDRPTT 97 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 G + G+T DA++ S+ N K Sbjct: 98 AFEL--GHFNSREEVKAAARRITYHGGNTNTGDALRYITSRSFSAQAGGRP---GNRAFK 152 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---P 425 + +LLTDG + + A A + GIRI + + L AS Sbjct: 153 QVAILLTDGRSQDLVLDAAA---AAHAAGIRIFAVGVGA------ALKEELDEIASEPKS 203 Query: 426 NSFFEANSTHELNKI 440 F + + ++KI Sbjct: 204 AHVFHVSDFNAIDKI 218 >gi|224531962|ref|ZP_03672594.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] gi|224511427|gb|EEF81833.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] Length = 333 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/117 (14%), Positives = 38/117 (32%), Gaps = 13/117 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F + + + + +D +A+ + A + Sbjct: 129 RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDDIYIMDL--GNGSALGLGISIALSHL- 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 K++ K+ I++LTDG D + N A+ ++I +I Sbjct: 186 ----------KHSEAPKRSIIVLTDGVVNSDEIYKDQVINLAQGLNVKIYSIGIGSF 232 >gi|149371022|ref|ZP_01890617.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] gi|149355808|gb|EDM44366.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] Length = 325 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 39/122 (31%), Gaps = 14/122 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G + + + + + D TAI++A+Q + Sbjct: 108 DRVGIIGYAGSAFPQVPITTDFAS-TKLFLNSMDTDMVSSQGTAISEAVQMSTTYFDDEE 166 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + + +++DGE+ + N + A GI++ I K Sbjct: 167 QKN-----------RVLFVISDGEDHEGN--FESAIENATENGIKVYAIGVGTEKGGPIP 213 Query: 415 AR 416 + Sbjct: 214 IK 215 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 58/171 (33%), Gaps = 17/171 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D++ D R+G FN S K +K ++ + G T ++ MQ A + Sbjct: 1004 DHLEDDDRLGLVLFNTGAELAEPVSLVGAK-NMQKLKGDVLEISATGGTRLSAGMQMATE 1062 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAF 405 + N E + I+ LTD + + ++ + I Sbjct: 1063 L------YDEFLEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANANKNVYTTFIGI 1116 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 V+ + A +++ +S + F++R+ +E V + Sbjct: 1117 GVDFNTELVEYITKIRGA---NYYSVHSATQ----FKERMDDEFEYMVTPL 1160 >gi|167725250|ref|ZP_02408486.1| hypothetical protein BpseD_39891 [Burkholderia pseudomallei DM98] Length = 396 Score = 49.6 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 93/371 (25%), Gaps = 12/371 (3%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 I+ AL++ V++G G+ +D+ + L+ +A A A+ + +S Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAA 59 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + + N T + + S V ++ ++ Sbjct: 60 GITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIV 119 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N L + + + + + + ++ + Sbjct: 120 NWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIG 179 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + G + + + + Y L + Sbjct: 180 DWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATT 239 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + T + ++D V +F+ Sbjct: 240 NAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLAFA-------- 291 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA---KKYIVLLTDGE 378 + N GS + A + + E + + + A VL+ D Sbjct: 292 SYQGDLITGINTGGSYNPSYYAAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPM 351 Query: 379 NTQDNEEGIAI 389 + + + + Sbjct: 352 GSGGSATPVHL 362 >gi|224171915|ref|XP_002198669.1| PREDICTED: similar to collagen, type XX, alpha 1, partial [Taeniopygia guttata] Length = 152 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 48/138 (34%), Gaps = 20/138 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + ++ ++ S + +++ + G+T A+ + ++ Sbjct: 1 LSQYSSDPRTEWELS--TYSTREQVLEAVRNLRYKGGNTFTGLALTHVLE------QNLK 52 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 LEA+K ++LLTDG+ +QD+ K+ GI I I + Sbjct: 53 PDAGARLEAEKLVILLTDGK-SQDDANL--AAQTLKNLGIEIFAIGV------KNADEAE 103 Query: 419 LSNCASPN---SFFEANS 433 L AS S + Sbjct: 104 LRQVASEPLELSVYNVLD 121 >gi|301780866|ref|XP_002925835.1| PREDICTED: integrin alpha-D-like [Ailuropoda melanoleuca] Length = 1040 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 57/159 (35%), Gaps = 13/159 (8%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +++ + +F+ + + G T ++T + S Sbjct: 198 NTLFSLIQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQL--NGLTFTATGIRTVVQELFHS 255 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 A+K ++++TDG+ +D E + +A+ GI I + Q+ Sbjct: 256 KNGAR------KTARKILIVITDGQKYKDPLEYSDVIPQAERAGIVRYAIGVG-DAFQKL 308 Query: 414 KARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 AR L+ S + F ++ L+ ++++ +I Sbjct: 309 TARQELNTIGSKPSQDHVFRVDNFAALSN-IQEQLQEKI 346 >gi|203284094|ref|YP_002221834.1| hypothetical protein BDU_172 [Borrelia duttonii Ly] gi|201083537|gb|ACH93128.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 341 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/117 (13%), Positives = 39/117 (33%), Gaps = 13/117 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F + + + + +D +A+ + A + Sbjct: 137 RENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDL--GNGSALGLGISIALSHL- 193 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 K++ KK +++LTDG D + N A+ ++I ++ + Sbjct: 194 ----------KHSEAPKKSVIVLTDGVVNSDEVYKDQVINLAQGLNVKIYSVGIGSD 240 >gi|20091285|ref|NP_617360.1| hypothetical protein MA2454 [Methanosarcina acetivorans C2A] gi|19916409|gb|AAM05840.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 551 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 46/367 (12%), Positives = 97/367 (26%), Gaps = 42/367 (11%) Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + ++ L+ K TD + + ++SS Y Sbjct: 179 NVAGILLTKSKKDELVDKHGEINTKTITDAVANKELAMGYTNPFASSTGLNFLISSLYTF 238 Query: 140 LLNPLSLFLRSMGIKSWLIQT----------KAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + L G +S+ I + A++ + D Sbjct: 239 DSSNLLSDKAKSGFESFQINIPLVAYTTLQMRESAKSGVLDGFVLEYQTYVNTPDIRYYE 298 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 P+ G+ + K+ + + N Y P+ Sbjct: 299 FTPFGFRHDNPMYAIGELSGEKRKTLNEFVEFSQQEKYQTLGTEYGFNGFNEYQSETEPV 358 Query: 250 DPSLS---EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI-----DNVNDTVRMGATF 301 + ++ + + + + + + ++K + + +G Sbjct: 359 SGDVIFQAQKLWKEKKKPICAVFVADVSGSMMGEPLNNLKDSLLRGQYYIGEDNMIGLVS 418 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +++ V D + R D G TA D + A I Sbjct: 419 YSNDVNIDLPIA-KFDLNQRASFAGAVNDLQAGGGTATFDGIAVAMKMIQEQRAA----- 472 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK----SQGIRIMTIAFSVNKTQQEKARY 417 + + I +L+DGE + + N K GI I TI ++ N Sbjct: 473 --DPNIRPVIFVLSDGETNKGHPL-----NDIKGIVEDTGIPIYTIGYNANI-------P 518 Query: 418 FLSNCAS 424 L +S Sbjct: 519 ALQAISS 525 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 51/199 (25%), Gaps = 22/199 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP----SFSWGVHKLIRTIVKTF 327 A ++ I+K+ + R G + S + + + + Sbjct: 284 SNFTGAKRCLVNLIEKVASYGVRPRYGLVTYATYPNVLVRVSDPKSSDANWVTEKLNQIS 343 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T A+ Y+ + +V N I+L+TDG + + + Sbjct: 344 YEDHKLKTGTNTKRALVEVYNMMS--WPGDVPPEGWNRTRH-VIILMTDGLHNMG-GDPV 399 Query: 388 AICNKAKS-QGIR------------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + N+ + I + + S + F Sbjct: 400 TVINEIRDLLNIGKDRKNPREDYLDVYVFGVG-PLVEPANINALASKKENEQHVFRVKDM 458 Query: 435 HELNKIFRDRIGNEIFERV 453 L +F I + Sbjct: 459 EHLEDVFFQMIDESQSLGL 477 >gi|47229708|emb|CAG06904.1| unnamed protein product [Tetraodon nigroviridis] Length = 990 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/268 (13%), Positives = 82/268 (30%), Gaps = 22/268 (8%) Query: 171 YHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK-- 228 + I + L++ + E + + +++ S G Sbjct: 137 TAVQIPTDIYKGAPVILNELNWTQALEKVFMENSQEDPSLLWQAFGSATGVTRYYPATPW 196 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 SP + Y VD S + L++ ++ ++ ++ Sbjct: 197 KSPDKIDLYDVRRRPWYIQGASSPKDMVILVDVSGSVSGLT-LKLIKASVMEMLDTLSDD 255 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVKTFAIDENEMGSTAINDAMQT 345 D VN FN++ + + + + I K G+T Sbjct: 256 DYVN------VARFNEKAEAVVPCFKHLVQANVRNKKIFKDAVQQMQAKGTTDYKSGFHF 309 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A++ +++ K I+L TDG +D + + + ++ +R+ T + Sbjct: 310 AFNQLLNKTNVPRANCN------KIIMLFTDG--GEDRAQDVFMQYNWPNKTVRVFTFSV 361 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANS 433 + ++ C + +FE S Sbjct: 362 GQHNYDVTPLQWI--ACTNKGYYFEIRS 387 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 45/314 (14%), Positives = 92/314 (29%), Gaps = 35/314 (11%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 G + Q + + + + I+ +S + + Sbjct: 63 DGSQPAQGQQQQNSAQPQPGQQGKTDNGLVMPIENGQSEAPVPKSPNQPGDTVNVPASSS 122 Query: 211 TVKSYSSQNGKVGIRDEK--------LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + + G+ K + + + + + E + S Sbjct: 123 RGNGQNPDSEVGGVYTFKKQVEEVRLHATVVDDRQRLITTLDKTSFTVYENGEPQQITSF 182 Query: 263 SLRHVIKKKHLVRDALASVIR--------SIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + +V D S+ +I + N + FND D ++ Sbjct: 183 RHEDIPVALGVVIDNSGSMRDKRPAVNAATINLVKASNPEDEVFVVNFNDDYYLDQDYTD 242 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 V KL + K G TA+ DA+ + ++ + + E KK + ++ Sbjct: 243 SVAKLKEALEKYE-----TRGGTALYDAVLASNAHLMKAPKLE----------KKVLFIV 287 Query: 375 TDGENTQDNEEGIAICNKAK-SQGIRIMTIAFSVNKTQQE-KARYFLSNCA--SPNSFFE 430 TDGE+ K + G I TI + +A+ L A + F Sbjct: 288 TDGEDDASLNTLEQTIRKVQQENGPTIYTIGILDETGGHKRRAQRALREMAESTGGVAFF 347 Query: 431 ANSTHELNKIFRDR 444 S E+++I + Sbjct: 348 PQSLDEVSRITQQI 361 >gi|34540039|ref|NP_904518.1| von Willebrand factor type A domain-containing protein [Porphyromonas gingivalis W83] gi|34396350|gb|AAQ65417.1| von Willebrand factor type A domain protein [Porphyromonas gingivalis W83] Length = 1226 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 44/196 (22%), Gaps = 20/196 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D + K D + + ID V T + + Sbjct: 252 DGLATEQYPVKNVTTADFIGKTGNANDPIDLVIQGAINFPTNYVSN-NPSTPLTPNYPTH 310 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + Y + + + E + Sbjct: 311 SSKVGRRNLPESKFD------------YSNLSARITFDGVAGALVYEPRFPHPYYYY--- 355 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I AK+ G I TI + + L A+ + F + L Sbjct: 356 -FPCNAAINEAQFAKNSGYTIHTIGYDLGDFALANNS--LKLTATDENHFFTATPANLAA 412 Query: 440 IFRDRIGNEIFERVIR 455 F + I I + R Sbjct: 413 AFDN-IAQTINIGIQR 427 >gi|321460550|gb|EFX71591.1| hypothetical protein DAPPUDRAFT_255495 [Daphnia pulex] Length = 930 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 42/314 (13%), Positives = 89/314 (28%), Gaps = 31/314 (9%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV------IDFSRSMLD 191 ++ +N S K + Q + + + Y + + S+ Sbjct: 193 EIQVNLCSDTPPIFTTKDLVTQGPCKIDPATGVYDNNCAYQLSPAFLPDSSLMSVHSLDS 252 Query: 192 YQRDSEGQPLNCFGQPADRTVKS---YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + A + +S + + + + + + Sbjct: 253 VVDFCKEDQALSHVFDAPNKHNAMCGGTSTWTVITLHPDFAMNNNPPNDVAFIEPNFQIV 312 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D+S + + + ++ I+ ++D ++G F + Sbjct: 313 RSSGSRFVLVSDTSGSMNDYNRIVRLYESSRRWIKYD-----ISDGSKLGMVQFANNARI 367 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 R + +G T I +Q A D + Sbjct: 368 LSPIVEINGDASREALIARL-PVTAVGGTCIGCGLQKALDLLRPGGPGG----------- 415 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I+LLTDGE T I + K G R+++IAF ++K + + F Sbjct: 416 -VILLLTDGEETDRPFINDVISDVIK-SGARVVSIAFGRKA--EDKIEDLATKTNGKSYF 471 Query: 429 FEANSTHE-LNKIF 441 + N + + LN F Sbjct: 472 IDDNDSSQGLNDAF 485 >gi|270008952|gb|EFA05400.1| hypothetical protein TcasGA2_TC015572 [Tribolium castaneum] Length = 767 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 40/130 (30%), Gaps = 6/130 (4%) Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D+ MG T I + + + + + + I+ LTDG Sbjct: 270 DDDVNMGCTNIIGGLAVGLYLVR-RTLQKFYEKNVATKHQPMIIFLTDGLPNVGISNPDE 328 Query: 389 ICNKAK--SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I +QG I +++F + + + N +EA + F Sbjct: 329 ITKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYR 388 Query: 444 RIGNEIFERV 453 + + + V Sbjct: 389 TVSSPLLRDV 398 >gi|189238321|ref|XP_972429.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 653 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 40/130 (30%), Gaps = 6/130 (4%) Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D+ MG T I + + + + + + I+ LTDG Sbjct: 344 DDDVNMGCTNIIGGLAVGLYLVR-RTLQKFYEKNVATKHQPMIIFLTDGLPNVGISNPDE 402 Query: 389 ICNKAK--SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I +QG I +++F + + + N +EA + F Sbjct: 403 ITKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYR 462 Query: 444 RIGNEIFERV 453 + + + V Sbjct: 463 TVSSPLLRDV 472 >gi|189347764|ref|YP_001944293.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341911|gb|ACD91314.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 340 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 50/184 (27%), Gaps = 33/184 (17%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F + ++ ++ E T+ A++ A S Sbjct: 128 GRRALLLFAGSPLVQCPLTYDREAF-SALLGMATPALIEEQGTSFLPAVELALKLFTGSV 186 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV--NKTQQ 412 + E ++ +VLL+DGE+ + A K + G+ + + F Sbjct: 187 PLD---SDGTAEGERIVVLLSDGEDHEGATAAAAA--KLRRNGVSLFVLGFGSRNGADIP 241 Query: 413 EKARYFLS---------------------NCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + R L AS +FE D + I Sbjct: 242 DPLRPGLKKLDGAGRVVTTRFSPQTLRQLASASGGFYFEGAGGD----AVYDEVALRIDR 297 Query: 452 RVIR 455 V R Sbjct: 298 IVSR 301 >gi|302037436|ref|YP_003797758.1| hypothetical protein NIDE2113 [Candidatus Nitrospira defluvii] gi|300605500|emb|CBK41833.1| conserved protein of unknown function [Candidatus Nitrospira defluvii] Length = 359 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/334 (10%), Positives = 94/334 (28%), Gaps = 11/334 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI----- 69 G I+ A+ + ++ + +++D +H L+ A A + + L Sbjct: 21 RHDQRGAVAIVVAVSLSALVSLAALVLDGGAALVSKHQLQNLADAAALAGARQLGKLYES 80 Query: 70 --QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 ++ + + + + E RN + T +VR + A Sbjct: 81 QPAAISPPQPLSLANRARVQAVVAEVAERNRDATKLAAVTLSDVRVGRWNAATRTVVSSQ 140 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + R + + + +GI+ + A + + + + +++ + + Sbjct: 141 NGLDAVLVRAEGSMP--TFLAGVVGIRRLSVSATAVSALSALAEVQPGALTLPVGLSSAW 198 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 L Q G +++ +G + + S + Sbjct: 199 MALGPIDGRRLQVYAPGGGDPCAGWTTFTDTPPTMGRLQTIVRSLTGNSYTSPATVAGQT 258 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + KK+ +V+ ++ + + F +I Sbjct: 259 SFQFVAGNPAVIAPDIMALYNAKKNPATGQWTTVVPVYRQAQCAAPSGPLPVVGFATAII 318 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + G +V+ + G T Sbjct: 319 TSVATPSGPAL--DAVVRIGFVQLGRGGGTDYGT 350 >gi|308462096|ref|XP_003093334.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] gi|308250345|gb|EFO94297.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] Length = 384 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 55/155 (35%), Gaps = 10/155 (6%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 I N T R+G +N + + + + N + ST+ + Sbjct: 72 TSIPNQPKTTRLGLVTYNWNATIQAGL--DKFQSQQDVFENIFNALNSVSSTSES----Y 125 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + ++++ +K IVL + + IAI ++ K GI I+T+ + Sbjct: 126 LANGLVAAENVLARGPNRGNNYQKVIVLFA--ASYSSHSNPIAIADRLKQAGITIITMGY 183 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + L+ ASPN F S ++ I Sbjct: 184 NNVGDPNFYQN--LAKIASPNKSFTEKSLSQIGDI 216 >gi|170732854|ref|YP_001764801.1| membrane protein [Burkholderia cenocepacia MC0-3] gi|169816096|gb|ACA90679.1| membrane protein [Burkholderia cenocepacia MC0-3] Length = 608 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/166 (9%), Positives = 47/166 (28%), Gaps = 1/166 (0%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 T+ + + G F ++ + + + G + V + +++ A A + Sbjct: 1 MTTRHSGGTIRRASRQRGAFSVMAIIATLIAITTLG-AIGVGNLFFQRRDVQRIADMAAL 59 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + ++ A ++ N E + T+ Sbjct: 60 AAVQRMDDACSQPTATATSNAQSNGLDASNGDTINIECGRWDTSVNPAPSYYAAATSGTT 119 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 + V + + P ++ S + + A + Sbjct: 120 QLNAAKVVVTRQVPFFFVGPPQTVSAVSTARSTNIDTFSVGATLAA 165 >gi|269126097|ref|YP_003299467.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268311055|gb|ACY97429.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 138 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 12/72 (16%), Positives = 31/72 (43%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G + A L+P+ML + M++D R + + +A + + TA + + + Sbjct: 20 ARTDRGAVTVEMAFLLPIMLMLIFMIIDFGRAFNAQLRINEAVRQGVRTAVLTQPPADPK 79 Query: 75 VSSRAKNSFTFP 86 ++ + Sbjct: 80 TAAADIMTTALG 91 >gi|125625081|ref|YP_001033564.1| hypothetical protein llmg_2320 [Lactococcus lactis subsp. cremoris MG1363] gi|124493889|emb|CAL98883.1| hypothetical protein predicted by Glimmer/Critica [Lactococcus lactis subsp. cremoris MG1363] Length = 1444 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/419 (9%), Positives = 92/419 (21%), Gaps = 37/419 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNS--FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A + + S +E +K + + I++ + +N+ + + Sbjct: 153 AASLAETTEESSSNKEDKEISKQNNESNLSFSYPSNFGIKSLFDEKAQNYENISPKYTED 212 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +T + + + S + GI SW + + Sbjct: 213 ETGIYPSQSWMPENNTKVINHQGWNAFSSNW---DGIDSWDGNPANLTNSYIEHAGTNNP 269 Query: 177 VSIQWVIDFSRSMLDYQRDS---------EGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 V + D D + ++ G + + Sbjct: 270 VDFALRKYARETETPGLYDVFLNVRGNVQNPIKPVDIVLVVDMSGSMEGAREGAIKQGVK 329 Query: 228 KLSPYMVSCNKSLYYM-LYPGPLDPSLSEEHFVDSSSLRHVIKK--KHLVRDALASVIRS 284 + + + Y G P + + + + AL Sbjct: 330 SFLSSIENTAYAQYVNVGLVGYSSPGYISNSGYITVPMESLATDGHVSAMNKALERQFVG 389 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 V + ++ + T + Sbjct: 390 GTFTQLGIRQGAQMLKEDASGNEKMIILMTDGVPTFSNKVSSAQLEGGVLYGTDFDS--- 446 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAKSQGIRIMTI 403 + N + GE + AK G I T+ Sbjct: 447 ----NSLDEPSFTSQLWMMNGNNRTPAPYTVSGETINDTWAATLGEAKIAKDDGAEIHTL 502 Query: 404 AFSVNKTQ---QEKARYFLSN---------CASPNSFFEANSTHELNKIFRDRIGNEIF 450 + K + + +LS AS + +A+S + +++ + Sbjct: 503 GIQLGKDSGYTNDSSNTYLSQEEVRKRTSLIASSGLYQDADSAENITDYLKNQADVVLS 561 >gi|119716456|ref|YP_923421.1| TadE family protein [Nocardioides sp. JS614] gi|119537117|gb|ABL81734.1| TadE family protein [Nocardioides sp. JS614] Length = 150 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 36/129 (27%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + + + + G + A M+ + GM VD+ + + + AA A Sbjct: 1 MMTARIRSLTTRRSRDERGSISVWFATAALAMIILVGMAVDLGGKVHTQQQARSAAAQAA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 T + + S + T K +YL + + DT Sbjct: 61 RTGAQEVQGSTAVRGEDLRVDLTAAKTAAMDYLRAAGVEGTVRVVDGDTLIVTTTDTYTS 120 Query: 122 MNPRKSAYQ 130 Sbjct: 121 KFLGIIGLD 129 >gi|84685160|ref|ZP_01013059.1| hypothetical protein 1099457000257_RB2654_09844 [Maritimibacter alkaliphilus HTCC2654] gi|84666892|gb|EAQ13363.1| hypothetical protein RB2654_09844 [Rhodobacterales bacterium HTCC2654] Length = 208 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 11/73 (15%), Positives = 27/73 (36%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + + G + ++MP+ L + L+D R + ++A + A+ A V Sbjct: 3 LRGHIIRFRRDEGGAALVEMGIIMPLFLLLAFGLIDFGRLGFAYVMAQKATEQAVRAAVV 62 Query: 67 PLIQSLEEVSSRA 79 ++ Sbjct: 63 RTPACDGVPAATT 75 >gi|242078369|ref|XP_002443953.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] gi|241940303|gb|EES13448.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] Length = 567 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 53/200 (26%), Gaps = 34/200 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI + D R+ F++ + + + K Sbjct: 96 KLALLKQAMCFVIDQLGPAD------RLSVVTFSNDASRLTRLA-RMSDAGKASAKIAVE 148 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---- 385 G T I + A + + E V ++LL+DG + Sbjct: 149 SLAVQGFTNIKQGIHVAAEVLAGRREKNVVAG---------MILLSDGHDNCGGTSVRPD 199 Query: 386 -----------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + + + I T F + A+ +F Sbjct: 200 GTKSYVNLVPPSLTVAAGSSRPAAPIHTFGFGTSHDAGAMHAVAE---ATGGTFSFVGDE 256 Query: 435 HELNKIFRDRIGNEIFERVI 454 + F +G + V Sbjct: 257 AAIQDSFARCVGGLLSVAVQ 276 >gi|171911164|ref|ZP_02926634.1| hypothetical protein VspiD_08320 [Verrucomicrobium spinosum DSM 4136] Length = 854 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 46/154 (29%), Gaps = 17/154 (11%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D+ + V T+ E+ G + ++ A+ + Sbjct: 431 DSDMACVYAVDSEPHEVSPL-VAVGSNRGTLQNAVRRVESTGGGIYVYQGLKRAWAELEK 489 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + +++I+L D + ++ E A+ K + + I ++ Sbjct: 490 AKVG-----------QRHIILFADAADAEEPGEYKALLEKMSKEKGTVSVIGLGT---EK 535 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + FL + A F + EL +F Sbjct: 536 DSDADFLKDVALRGNGRIFFNSDPKELPALFAQE 569 >gi|47216852|emb|CAG11659.1| unnamed protein product [Tetraodon nigroviridis] Length = 1042 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 58/198 (29%), Gaps = 19/198 (9%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK--KKHLVRDALASVIRSIKKIDNV 291 + + + S S + R LV+D + +++ + Sbjct: 660 GTLENRDHLDKWLWFPVQSRSSGTGLCDRRFRERRTPKNFELVKDFVNALVERVWLSQ-- 717 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + RMG F+ ++ S + + T T A+ A+ Sbjct: 718 -EAARMGVVLFSHTSLAVASLQ--PRSSLSELKATVRSMPYLGEGTFTGSAIHRAHQLFQ 774 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKT 410 +S K ++LTDG+ + + + +A+++GI + I Sbjct: 775 ASRPGVR----------KVALVLTDGQADPRDVVQVGVSAAEAQARGIEMFVIGVMNESH 824 Query: 411 QQEKA-RYFLSNCASPNS 427 + AS Sbjct: 825 PLYPDFSAEMKAIASNPK 842 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 30/94 (31%), Gaps = 12/94 (12%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T A+ A + + + +L+TDG + + + +A Sbjct: 154 GHGTYSAYAIANATQLFSQE---------TSSNSLRVALLMTDGVDHPRSPSAVTAAAEA 204 Query: 394 KSQGIRIMTIA-FSVNKTQQEKARYFLSNCASPN 426 K IR+ I+ + + L + AS Sbjct: 205 KQHSIRLFAISLLGLPDQGPAGTK--LRSIASTP 236 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 25/186 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L + A + + + D V + R ++K Sbjct: 439 TKLELAKQAAINALPQLGPNDQVGLWMFSTNQDGGKDYRELVPMGRN----NRDLLKKRI 494 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-EGI 387 G T + D + AY T++ + ++V +V+LTDG+N DN Sbjct: 495 QGLIPGGGTGLYDTTRAAYRTVLERHSNDVINA---------VVVLTDGKNEDDNSISLE 545 Query: 388 AICNKAK----SQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIF 441 + + + + +R+ TIA+ + L + + +++ ++++F Sbjct: 546 DLLAELRTETGQETVRVFTIAYG-----NDADLEVLRQISQVTDAAAYDSREPGSIDQVF 600 Query: 442 RDRIGN 447 + N Sbjct: 601 TAVLSN 606 >gi|326669364|ref|XP_695742.5| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Danio rerio] Length = 3651 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 26/72 (36%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + ++ A K I L+TDG + + + Sbjct: 158 GGGTYTRGAFQRAAQILR----------QSRENATKVIFLITDGYSNGGDPRP--VAAAL 205 Query: 394 KSQGIRIMTIAF 405 + +G+ I T+ Sbjct: 206 RERGVEIFTLGI 217 >gi|221104447|ref|XP_002170122.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 898 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 57/180 (31%), Gaps = 22/180 (12%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 R + + + N+ G F+ + + + D M Sbjct: 108 RQEVEFLKSLARTFKISNNGAHAGVVTFSSIAELSIKLNQYY---DQEQFERAVDDIPYM 164 Query: 335 GS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEEGIAI 389 G T I+ A++ + + + + LLTDGE +E ++I Sbjct: 165 GYVTRIDLALRKSLEMFDE-------INGARKSIPQILFLLTDGEQYAGKGVVDENPVSI 217 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKI-FRDRIGN 447 + +GI I I + L++ A S F A + +EL F +I Sbjct: 218 AKLLRDKGIVIFAIGIGS-----AVRQSQLNDIAGSSEKAFLAKNFNELVNSDFLKKIKE 272 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 37/312 (11%), Positives = 80/312 (25%), Gaps = 22/312 (7%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 ++ S G S I K V+++ + Sbjct: 603 SYGDRVPGVWFHSSGDGSLQISAPING--NLDRVFKTAPVTVKEWSHVEINQQLVNNAYV 660 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 +S + + + P K L + D ++ + Sbjct: 661 YTIKLNGKVVFFEENMQATSFDNVMIYASDPWHPAQDGSIKDLTIINGKSESDCAVIVDV 720 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 S + + L ++ + G F+ + + Sbjct: 721 GFIIDSSGSLENHYQQEVEFLINLASTFNISKYGAH---AGVVTFSYDAFLSIKLNDYFN 777 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + G T I+ A++ + + + + LLTDG Sbjct: 778 QAQFNNAVKDISYL--NGGTRIDLALEKSLEMFDE-------LNGARKNTPQILFLLTDG 828 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 E + ++ + I + + +GI + L+N + F A + +E Sbjct: 829 EQS-GDKNPVDIAKRLRDRGII-----IFAIGIGSYVNKTELNNIVGSNDKAFLAENFNE 882 Query: 437 L-NKIFRDRIGN 447 L N F +I Sbjct: 883 LVNNDFLKKIKE 894 >gi|73958318|ref|XP_547049.2| PREDICTED: similar to integrin alpha X precursor [Canis familiaris] Length = 1149 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 59/158 (37%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F++ +F + + G T A++ + + S++ Sbjct: 176 TQFSLMQFSNNFRVHFTFEVFTYSSNPLALLDSVSQL--GGLTHTATAIRIVTNELFSAS 233 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +A K ++++TDG+ D+ + A++ GI + Q+ + Sbjct: 234 KGAR------KDASKILIVITDGQKKGDSLGYEDVIPMAEAAGIIRYAVGVGT-AFQKMQ 286 Query: 415 ARYFLSNCASPNSF---FEANSTHELNKIFRDRIGNEI 449 + L++ AS S F+ + L ++++ +I Sbjct: 287 SWKELNDIASKPSHEYIFKVENFDALRD-IQNQLKEKI 323 >gi|94498562|ref|ZP_01305117.1| hypothetical protein SKA58_08314 [Sphingomonas sp. SKA58] gi|94422005|gb|EAT07051.1| hypothetical protein SKA58_08314 [Sphingomonas sp. SKA58] Length = 216 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/217 (10%), Positives = 58/217 (26%), Gaps = 5/217 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + +KL + +G + L +P+ LG +++ + + ++ + A Sbjct: 2 RVPHFVQKLARDRSGLALVEFGLALPLFLGFVLSGIEMANYVMANNRTQRLT---TMAAD 58 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + N+ + T + D D VE Sbjct: 59 LVAQSGAGAIGTSEAQIYDLFSALDLTAQPFDLRNHGRIVITAVKGTDTNADNVVESRIL 118 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--QWVI 183 + ++ + G ++ + V+ Y + + + Sbjct: 119 WQRFDGKYVVSPEVGCIQSTSLATLPGNRTLALDELLFHVQVTYRYQPVFSSAPFRMFSL 178 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 S R Q +S G Sbjct: 179 PVDFSRHAMFRARSTQFQTPAADSRFPPKNKCASVTG 215 >gi|291411003|ref|XP_002721794.1| PREDICTED: integrin, alpha X (complement component 3 receptor 4 subunit-like [Oryctolagus cuniculus] Length = 1100 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 57/177 (32%), Gaps = 17/177 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +++ + +V+ + + F++ + +F+ V + Sbjct: 164 SFSNFATMKNFVKAVMSQFPRPST-----QFSLMQFSNEFQTHFTFNDFVSSTNPLQLLD 218 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G+T A+ D + ++ +A K ++++TDG+ D Sbjct: 219 RVYQLM--GTTHTATAILRVVDQLFHAS------TGARKDATKILIVITDGQKLDDPLGY 270 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 + KA++ GI I + R L + AS F + L I Sbjct: 271 EDVIPKAEAAGIIRYAIGVGLAFQVVSSLRE-LHDIASEPAHEHVFRVENFDALRDI 326 >gi|260823583|ref|XP_002606160.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] gi|229291499|gb|EEN62170.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] Length = 515 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 51/176 (28%), Gaps = 16/176 (9%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 V+ + + R+G ++D V ++ + + + + + Sbjct: 191 DNFETVKDFVVSVVDGFEIGQSRTRIGVVQYSDEVQNEFNLTE--YGNKADVQSAISNIT 248 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A++ D S + K +++TDGE T + + + Sbjct: 249 YLQGRTYTGAALRYMTDVSFSEEAGARPPYQAIP---KVGIVVTDGEATDNVQGP---AS 302 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN--SFFEANSTHELNKIFRDRI 445 A G+ + I L A+ + A +D + Sbjct: 303 SAHEAGVNVFAIGIGG------YDVRELRQIATDPDATHVFAVDNFAATDYIKDAL 352 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ + + ++ + G T + A + A + Sbjct: 339 DRLGVVVFSSGHRVLFPPRPMTAQAKKEA-ESLLLSLRAGGGTVLGGAFREAVRLLQ--- 394 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K +++LTDG + I + A++ G+ + +A + Sbjct: 395 --------GVPGERKAVLVLTDGL---IADAKEPILDLAQTSGVEVSALALGPDADAPFL 443 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIF 441 F++A S EL ++F Sbjct: 444 KELARR---GGGRFYQAPSPRELPRLF 467 >gi|221110023|ref|XP_002170779.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 671 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 56/199 (28%), Gaps = 20/199 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + + +D L + S + + G F+ + Sbjct: 47 DSSGSLKSQYWKEKDFLKKLANSFGISNKGSH---AGVVTFSHYAELSIRLDAFYSSIDF 103 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T I+ A+ A + N + + LLTDG+ + Sbjct: 104 NDAVDRISHMDSF--TRIDLALAKALELFDIKN-------GARNDVPNLLFLLTDGK-QE 153 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 I ++ K +GI++ + + L P + F + +L Sbjct: 154 PEMPLTHISDEIKQKGIQLFAVGIGAGAN-----KTELEKIVGNPENVFMVDDFDKLLNG 208 Query: 441 -FRDRIGNEIFERVIRITK 458 F ++ V+ K Sbjct: 209 DFLKKVKQGSCSSVLENLK 227 >gi|260425606|ref|ZP_05779586.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423546|gb|EEX16796.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 178 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 23/55 (41%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 ++ G I AL P+M+G+ +++ + + AL++A + Sbjct: 5 LKNALRRFRDDDEGSAVIPFALWTPLMVGMALSAIEMGALTVRQTALERALDQTV 59 >gi|254497958|ref|ZP_05110722.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] gi|254352852|gb|EET11623.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] Length = 607 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 55/175 (31%), Gaps = 26/175 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 VN+ + + F++ + K + +K +++ T I+ A+ Sbjct: 64 VNNRAVVSVSTFSNDLEQIIPPQIVTAKFQESFLKKKKQIKSDGAWTNIDAAL------- 116 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN---------EEGIAICNKAKSQGIRIM 401 K+ + KK I+LLTDG + + + + +++ Sbjct: 117 -------NGVNKSWSKNKKVIILLTDGMLDLGSDALNKKSTQQLNETTIPILQREHVQVY 169 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 TI S + L A F S +L+ + I + I Sbjct: 170 TIGLSNEADSTLLSNISLKTNA---LFQPVISAKDLDNALYAIFSSVISVQEAPI 221 >gi|94498563|ref|ZP_01305118.1| TadE-like protein [Sphingomonas sp. SKA58] gi|94422006|gb|EAT07052.1| TadE-like protein [Sphingomonas sp. SKA58] Length = 193 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 17/155 (10%), Positives = 49/155 (31%), Gaps = 3/155 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA-AQTAIITASVP 67 + L K TG + A+++P +L + +++ L+ A + A I+ + Sbjct: 2 RHFAPLAKDRTGVSTVEFAIILPALLTLMCGAIELGHMLLARVVLEGAMTEAARISTASL 61 Query: 68 LIQSLEEVSSRAKNSFTFP-KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++ + + + + + NF+ A Sbjct: 62 ETAEAQRTTLMEESIEQAMGNFPLADGAHVSVQTIVYGNFSSAHPETYEDANANGHYDLG 121 Query: 127 SAYQVV-LSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 +Y + ++D + S+ ++ Sbjct: 122 ESYVDRNANGKWDAATPKTGTLGGPGDVVSYTVRF 156 >gi|332817190|ref|XP_003309914.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Pan troglodytes] Length = 1241 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/276 (14%), Positives = 86/276 (31%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 244 NFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATG 303 Query: 238 KSLYYMLYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY PL P ++ +S + V +++ + + K Sbjct: 304 VTRYYPGRYQPLSPWALPXQRHLLGRYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 363 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 364 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 423 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 424 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 472 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + CA+ +FE S + Sbjct: 473 T--FSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 506 >gi|170738572|ref|YP_001767227.1| hypothetical protein M446_0220 [Methylobacterium sp. 4-46] gi|168192846|gb|ACA14793.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 494 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 36/111 (32%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K ++ + + +I L PV+ GV G+ VD W + L+ AA A + Sbjct: 1 MRGKLAKLARSFWLAGAANVLVIFGLGAPVLFGVVGITVDYAIWLNQKLILQNAADAAAL 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + + ++ + + ++ + + D Sbjct: 61 AIVADIQIAAPQLERMQAIAESQVRRFVPLKEGDGPVAVRTAPRYRKTSDD 111 >gi|291461066|ref|ZP_06026725.2| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] gi|291379168|gb|EFE86686.1| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] Length = 529 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 44/139 (31%), Gaps = 18/139 (12%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + + L ++ G T+I +++ + + + + Sbjct: 236 DESCGANELIYPIEDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDL---------KALD 286 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSN 421 + + ++TDG T + I + K + I + I F+V+ Q L Sbjct: 287 GEKTLNILYIITDGIETCG-GNPVEIAKQLKGENTNIVLGIIGFNVDANQN----RLLKQ 341 Query: 422 CAS--PNSFFEANSTHELN 438 A + N +L Sbjct: 342 IADAAGGYYSSVNDADKLT 360 >gi|91201645|emb|CAJ74705.1| hypothetical protein kuste3942 [Candidatus Kuenenia stuttgartiensis] Length = 336 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 35/141 (24%), Gaps = 45/141 (31%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK- 394 T I +A+ A +T + KK I+LLTDGE + + Sbjct: 174 GTNIGNALIAAMETFSNEAG------------KKIIILLTDGEEQL--LRRSQVVEAIRL 219 Query: 395 ---SQGIRIMTIAFSVNKTQQE-------------------------KARYFLSNCA--S 424 I I L A + Sbjct: 220 LLEKNDISTYIIGIGDPNNSTSIPKRDRLGNKSGYERTEDGEIIYTKPDPKLLREIAEMA 279 Query: 425 PNSFFEANSTHELNKIFRDRI 445 S+ + EL IF+ I Sbjct: 280 GGSYQHDATGAELQNIFKQVI 300 >gi|185132132|ref|NP_001118067.1| complement factor B [Oncorhynchus mykiss] gi|3982897|gb|AAC83699.1| complement factor Bf-1 [Oncorhynchus mykiss gairdneri] Length = 743 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 55/205 (26%), Gaps = 23/205 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR------TI 323 + R+A+ +I + + FF V+ + + Sbjct: 247 DFNSARNAVKKLITKVSSFSVSPNYEI---IFFASDVLEVVNIIDFSGDKRKPLVDVLAE 303 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + F D + T +N A +T + + + ++ TDG Sbjct: 304 LNNFKYDARDNVGTNLNLAFKTILERMAIQKKRNEMLFMEIHH---VLIFFTDGAYNMGG 360 Query: 384 EEGIAIC---------NKAKSQGI-RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + NK K + + + ++ +F+ + Sbjct: 361 SPENTMAKIRESVYMNNKTKREKYLDVYVFGVGSDIFDEDIMPLVTKRN-GERHYFKLKN 419 Query: 434 THELNKIFRDRIGNEIFERVIRITK 458 +L + F D I V + K Sbjct: 420 VIDLERTFDDIIDESEVVGVCGLHK 444 >gi|254412101|ref|ZP_05025876.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196181067|gb|EDX76056.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 570 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 42/381 (11%), Positives = 90/381 (23%), Gaps = 36/381 (9%) Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD---TAVEMNPRKSAYQV 131 + K + ++ Y + + K + S Y+ Sbjct: 211 AAVSGKRPEDLTAEDVKNYESQVQQIQSKITRYGVSTDSLAEAMVQNGPFWASVGSVYES 270 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-SRSYHKEHGVSIQWVIDFSRSML 190 + + + ++ +A S ++ S Sbjct: 271 SVIAANANPQPNQPRYQAVYPNATFTSNMRAIIPNAPWVSEEEKAAAEKVVTYLQSPEAQ 330 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + +P + + + P +V + Sbjct: 331 QIATELGLRPGTPGVALGPKFIPEFGVDPQAKYDSYRPPQPEVVEA----MLTSWQEVAK 386 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 VD+S K V++ L + I S+ D + FND + Sbjct: 387 KPSQVVVVVDTSGSMQGN-KLPAVQNTLQNYINSLGSKDKIA------LIDFNDEISQ-- 437 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 R F G T + DA A + + + + Sbjct: 438 PVLVEGTDAGRNRGLEFISGLQAYGGTKLYDAALYARNWLQDNPRPDAINA--------- 488 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQG------IRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++LTDGE++ + + + G I TI + L A Sbjct: 489 VLILTDGEDSGSQINLNQLEQELQQSGFNSDQRIAFFTIGYGKEGDF---DPEALKAIAD 545 Query: 425 -PNSFFEANSTHELNKIFRDR 444 ++ + + D Sbjct: 546 LNAGYYRKGDPETIATVMDDL 566 >gi|194666191|ref|XP_604080.4| PREDICTED: collagen, type XXVIII-like [Bos taurus] Length = 1147 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 52/146 (35%), Gaps = 16/146 (10%) Query: 271 KHLVRDALASVIRSIKKIDNV---NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ V +++ A F+ V DP FS K Sbjct: 63 FDKQKDFVGSLSDKIFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSSWKDL---QTFKQR 119 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 N +G T A+ A + + K VL+TDG + N + Sbjct: 120 VKSMNFIGQGTFSYYAIANATRLLKREGR---------KDGMKVAVLMTDGIDHPKNPDV 170 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A++ GI +TI S + Sbjct: 171 QSISEDARTAGILFITIGLSAVVNET 196 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 54/178 (30%), Gaps = 20/178 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++++ + ++ + T R+G ++ +V + K + + Sbjct: 813 FQIIKNFVKTLTDRVAL---DLATTRVGIINYSHKVEEVAHLTQFSSKDD--LKRAVDNM 867 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAI 389 + T A+ A ++ KK +++TDG+ +T+D + + Sbjct: 868 QYLGEGTYTATALHAANRMFEAAKPG----------VKKVALVITDGQTDTRDEKNLTEV 917 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA-RYFLSNCASPN---SFFEANSTHELNKIFRD 443 A + I I ++ A+ + + L + Sbjct: 918 VKNASDINVEIFVIGVVKRNDPNFHIFHQEMNLIATDPDSEHVYLFDDFITLEDTLKQ 975 >gi|262165918|ref|ZP_06033655.1| hypothetical protein VMA_002367 [Vibrio mimicus VM223] gi|262025634|gb|EEY44302.1| hypothetical protein VMA_002367 [Vibrio mimicus VM223] Length = 406 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/296 (9%), Positives = 79/296 (26%), Gaps = 9/296 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I + + G ++ M ++ + + +DV L+ A A + A++ Sbjct: 3 ITQHRNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMV 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ T + + + NF D + + Sbjct: 63 ADATHDTATATDVAKSTLAS--MHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGD 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE-HGVSIQWVIDFS 186 Y V + L F++++G+ + + S + + + + Sbjct: 121 IYVRVSVADLSLT----EFFMQALGMAKQVSAAAVAGPSSSINTIGNVVPIGVCKGSESD 176 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ--NGKVGIRDEKLSPYMVSCNKSLYYML 244 + D + A + +G +R+ Y + Sbjct: 177 GIYGYHSSDVYALKVGDSSMTAMGSGNYNLLDFGSGADTVREALGGGYEGTVEIGGDIDT 236 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 G + + + + D + + + +D+ V A Sbjct: 237 QTGVATGPVGQGINTRLGIYKGGVSSSDYPPDYVTELPATPATLDSSGHVVYDYAN 292 >gi|119585524|gb|EAW65120.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_c [Homo sapiens] Length = 664 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 83/271 (30%), Gaps = 23/271 (8%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V+ + + + LN + +++ + + Sbjct: 201 VNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYY 260 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DN 290 + + L ++ +S + V +++ + + K D Sbjct: 261 PATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDT 320 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYD 348 ++D + FN++ F+ V + + K G+T + A+D Sbjct: 321 LSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFD 380 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFS 406 + +SN K I++ TDG + + K ++ +R+ T FS Sbjct: 381 QLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FS 427 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 V + + CA+ +FE S + Sbjct: 428 VGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|328951281|ref|YP_004368616.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451605|gb|AEB12506.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 328 Score = 49.2 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/204 (9%), Positives = 57/204 (27%), Gaps = 31/204 (15%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + R + + + V R+G F+ + Sbjct: 93 DTSKSMIAVDQSPSRLEAARAIVRTFLDR------VPRGARVGLVSFSAYASVLVLPT-- 144 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY----- 370 V+ +T++ A+ A + + + + Sbjct: 145 ---ARHVEVRKALEALEPQEATSLGAAILAAVRALPGRERAGEELLGRDPVPPELQELPP 201 Query: 371 --IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK----------ARYF 418 ++L++DG +T + + A++ +RI T+ + ++ Sbjct: 202 ATVLLISDGVSTSG-LDPLEAARVARAHQVRIYTVGVGSPRGSIQEVDGQLSFIPFDPSG 260 Query: 419 LSNCA--SPNSFFEANSTHELNKI 440 L A + + + L + Sbjct: 261 LEQIAALTGGRYVYPPTPDALEAV 284 >gi|300071886|gb|ADJ61286.1| hypothetical protein LLNZ_11975 [Lactococcus lactis subsp. cremoris NZ9000] Length = 1438 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 38/419 (9%), Positives = 92/419 (21%), Gaps = 37/419 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNS--FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A + + S +E +K + + I++ + +N+ + + Sbjct: 147 AASLAETTEESSSNKEDKEISKQNNESNLSFSYPSNFGIKSLFDEKAQNYENISPKYTED 206 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +T + + + S + GI SW + + Sbjct: 207 ETGIYPSQSWMPENNTKVINHQGWNAFSSNW---DGIDSWDGNPANLTNSYIEHAGTNNP 263 Query: 177 VSIQWVIDFSRSMLDYQRDS---------EGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 V + D D + ++ G + + Sbjct: 264 VDFALRKYARETETPGLYDVFLNVRGNVQNPIKPVDIVLVVDMSGSMEGAREGAIKQGVK 323 Query: 228 KLSPYMVSCNKSLYYM-LYPGPLDPSLSEEHFVDSSSLRHVIKK--KHLVRDALASVIRS 284 + + + Y G P + + + + AL Sbjct: 324 SFLSSIENTAYAQYVNVGLVGYSSPGYISNSGYITVPMESLATDGHVSAMNKALERQFVG 383 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 V + ++ + T + Sbjct: 384 GTFTQLGIRQGAQMLKEDASGNEKMIILMTDGVPTFSNKVSSAQLEGGVLYGTDFDS--- 440 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAKSQGIRIMTI 403 + N + GE + AK G I T+ Sbjct: 441 ----NSLDEPSFTSQLWMMNGNNRTPAPYTVSGETINDTWAATLGEAKIAKDDGAEIHTL 496 Query: 404 AFSVNKTQ---QEKARYFLSN---------CASPNSFFEANSTHELNKIFRDRIGNEIF 450 + K + + +LS AS + +A+S + +++ + Sbjct: 497 GIQLGKDSGYTNDSSNTYLSQEEVRKRTSLIASSGLYQDADSAENITDYLKNQADVVLS 555 >gi|169629808|ref|YP_001703457.1| hypothetical protein MAB_2724c [Mycobacterium abscessus ATCC 19977] gi|169241775|emb|CAM62803.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 336 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 50/166 (30%), Gaps = 20/166 (12%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + R + TA + + TA I + + Sbjct: 144 AYAGTATVLVSPTTNRDATVNALDNLQLADRTATGEGIFTALQAIATVG---AVIGGGDK 200 Query: 366 EAKKYIVLLTDGENTQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKA----- 415 IVL++DG+ T + AK Q + I TIAF E Sbjct: 201 PPPARIVLMSDGKETVPSNPDNPKGAYTAARTAKDQQVPISTIAFGTKDGYVEINGQRQN 260 Query: 416 ----RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 + A S + A++ +L +++ + +I IR Sbjct: 261 VPYAPDMMEKVAKLSGGETYTASTLGQLKEVYANL-QQQIGYETIR 305 >gi|149921119|ref|ZP_01909577.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] gi|149818006|gb|EDM77465.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] Length = 428 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 35/262 (13%), Positives = 66/262 (25%), Gaps = 49/262 (18%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 +V + S+ + LDP+ + ++ V A+ + I+ Sbjct: 94 TYVPPQVMLVVDASGSMVNNSWDHDLDPNTPQVTRWNTL-HGVVSTVMDNFGPAMYAGIQ 152 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISD-PSFSWGVHKLIRTIVKTFAIDENEM-----GST 337 + D M + +N GV V G T Sbjct: 153 RFPSEEACPDATPMSSNCYNSGSCIVGTQPEVGVSLDNGASVIAAIPGPTAGNTEIVGGT 212 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------------QDNEE 385 M +A + E +Y++L+TDG Q +E Sbjct: 213 PATKGMNSAVSHLEQQPEA----------FPRYVLLITDGAANCDQALSFPDYIEQYDET 262 Query: 386 GIAICNKAKSQGIRIMTI---------AFSVNKTQQEKARYFLSNCASPN---------- 426 A GI + + + + L++ A Sbjct: 263 LPTTVQAAFDGGITTFVVGIDIEDMLQGVGTDGSPEANPFERLNDVAIAGGAPKNEGMDL 322 Query: 427 -SFFEANSTHELNKIFRDRIGN 447 F+ + EL + +G Sbjct: 323 EKFYNTTNQQELLDAIQAILGE 344 >gi|116249090|ref|YP_764931.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253740|emb|CAK12133.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 176 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 40/146 (27%), Gaps = 13/146 (8%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L+ G I A+L + + +++V + A ++ S + Sbjct: 10 LRRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVNS----ALDASVHKISRMIRT 65 Query: 71 SLEEVSSRAKNSFTF---------PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 S+ F + N ++L + + D T E Sbjct: 66 GEVASSNITLADFKAGICNDMLLSFSCSSGLLVKVNVLSDLSSAASADPIDDSGNLTVTE 125 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLF 147 +L + ++ F Sbjct: 126 TYDIGKGSDYILVQTFLPWTAVVNFF 151 >gi|29789036|ref|NP_036843.1| integrin alpha-M [Rattus norvegicus] gi|8917587|gb|AAF81280.1| integrin beta 2 alpha subunit [Rattus norvegicus] Length = 1151 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 62/191 (32%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +D S + I +++ +++V+ +K + ++D + F Sbjct: 149 SNIAFLIDGSGSINTI-DFQKMKEFVSTVMDQFQKSKTL-----FSLMQYSDEFRT--HF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K G T ++ + A K +V Sbjct: 201 TFNDFKRNPDPKSHVRPIRQLNGRTKTASGIRKVVRELFQKINGAR------DNAAKILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + +A+ GI I + + R L AS + F Sbjct: 255 VITDGEKFGDPLNYEDVIPEAEEAGIIRYVIGVGNAFHKPQSRRE-LDTIASKPAGDHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|228471033|ref|ZP_04055877.1| BatB protein [Porphyromonas uenonis 60-3] gi|228307253|gb|EEK16276.1| BatB protein [Porphyromonas uenonis 60-3] Length = 342 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 17/162 (10%), Positives = 45/162 (27%), Gaps = 21/162 (12%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + ++ + R+ F + + Sbjct: 101 SNSMAARDVKPDRIGFAKQIVTHTMQQLAGS-------RIAMIVFAGGAYIRLPLTPDLP 153 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + T + A++ + + + + A K ++LLTDG Sbjct: 154 TART-FLADIQPGMVSNQGTNLGQALERSAQALSAPSR-----------AGKAVILLTDG 201 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 E+ + + K QGI+ + + + L Sbjct: 202 EDHEG--GLEEGIKRLKEQGIKAYVVTIGLPEGATIPIGETL 241 >gi|116694147|ref|YP_728358.1| hypothetical protein H16_B0192 [Ralstonia eutropha H16] gi|113528646|emb|CAJ94993.1| Hypothetical protein H16_B0192 [Ralstonia eutropha H16] Length = 562 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 30/251 (11%), Positives = 66/251 (26%), Gaps = 20/251 (7%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G ++ ALL+ + + +DV + L+ A ++A+ L ++ + A Sbjct: 17 GAISVMAALLIATVAIAALVSIDVGHVFMRQRQLQNVVDLAAMSAAQQLKRADSAANLNA 76 Query: 80 KNSFTFPKQKIEEYLIRNFENN---------------LKKNFTDREVRDIVRDTAVEMNP 124 T + L T Sbjct: 77 AVLGTVRNIGAKNGYPSGIAMGCGDATGGGADAMTACLGVWDPASGGPKHFSATYEATKV 136 Query: 125 RKSAYQVVLSSRYDLLL-----NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 +A +V + +L + LF ++ S + + + +S+ Sbjct: 137 SPNAVRVQATQTVPILFVIDGGSGRQLFAEAIASGSPPVAAFSLGTGLLDVSTANSMLSV 196 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 S S++D+Q + N + +RD + + Sbjct: 197 LLGNTVSLSLVDWQGLVNTTVTLDQLRLKAGVGTVDQLLNTSLNLRDFYALVLGAAGRDA 256 Query: 240 LYYMLYPGPLD 250 L + P Sbjct: 257 LLTTMLGSPPT 267 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 48/153 (31%), Gaps = 24/153 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ + R + G T+I +++A+ I Sbjct: 343 WVGMVTFDSAAYVQSALRQIKGGTDRDALTKSLPTV-ASGGTSICSGLRSAFTVIR---- 397 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C N+ K G I T+A + + + Sbjct: 398 ------KKYKTDGSEIVLLTDGEDN-----TISSCFNEVKQSGAIIHTVALGPSAAAELE 446 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDR 444 + + A+ + L F Sbjct: 447 ELSKM----TGGLQTYASDQAQNNGLIDAFGAL 475 >gi|156402479|ref|XP_001639618.1| predicted protein [Nematostella vectensis] gi|156226747|gb|EDO47555.1| predicted protein [Nematostella vectensis] Length = 412 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 36/323 (11%), Positives = 79/323 (24%), Gaps = 25/323 (7%) Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS-----MGIKSWLIQT 160 + + V T + + L+ S R GI S + Sbjct: 68 YDAASITAAVNATKYPGMGTYAGKGLTLAKDDLYSAPVRSGVPRILIVMTDGISSDDVAG 127 Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA------DRTVKS 214 A+A G+ + SM + + D+ Sbjct: 128 PAKALRDMGVEIFALGIGKNYDQGQLDSMGSDPKPDHVVTADFDKLDPVIQTIKDKACNG 187 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + + + + N++ + + +D S K + Sbjct: 188 IDKEQTESHSASPVDDGALKTKNQNRFRSSSVTVCKAKVDVGFLLDGSGSVEFYAKGN-F 246 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + +K D G F+ + F +I Sbjct: 247 QRCKNFINKFVKSFMVSKDDSHFGLVLFSSDSNVEFKFDDHYDAA--SITAAVNATNYPG 304 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 T + A D + S+ + ++++TDG ++ D + Sbjct: 305 MGTYAGKGLTLAKDDLYSAP--------VRSGVPRILIVMTDGISSDDVAGP---AKALR 353 Query: 395 SQGIRIMTIAFSVNKTQQEKARY 417 G+ I + N Q + Sbjct: 354 DMGVEIFALGIGKNYDQGQLDSM 376 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 37/133 (27%), Gaps = 13/133 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K D G F+ + F +I + T + Sbjct: 37 VKSFMVSKDDSHFGLVLFSSDSNVEFKFDDHYDAA--SITAAVNATKYPGMGTYAGKGLT 94 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A D + S+ + ++++TDG ++ D + G+ I + Sbjct: 95 LAKDDLYSAP--------VRSGVPRILIVMTDGISSDDVAGP---AKALRDMGVEIFALG 143 Query: 405 FSVNKTQQEKARY 417 N Q + Sbjct: 144 IGKNYDQGQLDSM 156 >gi|149725809|ref|XP_001495640.1| PREDICTED: similar to leukocyte adhesion glycoprotein [Equus caballus] Length = 1140 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 57/177 (32%), Gaps = 17/177 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +++ + V+ KK + ++D + +F + + Sbjct: 162 YENDFQKMKEFVTIVMNQFKKSKTL-----FSLMQYSDTFQTHFTFKEFANNPNPGSLVR 216 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T ++ + S A K +V++TDGE D E Sbjct: 217 PINQL--GGRTHTATGIRKVVRELFHSRNGAR------KNALKILVVITDGEKFGDRLEY 268 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 + +A +GI I + EK+R L+ AS + F N+ L I Sbjct: 269 EDVIPEADQEGIIRYVIGVGI-AFSIEKSREELNTIASKPARDHVFRVNNFEALKTI 324 >gi|114705831|ref|ZP_01438734.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] gi|114538677|gb|EAU41798.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] Length = 548 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 38/127 (29%), Gaps = 2/127 (1%) Query: 8 IFYSKKLIK-SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 I I S G+ + ALL+P++LG GG D+ +L+ A A + A Sbjct: 2 IRSLSAFISGSDRGNIAVSFALLLPLILGFGGFATDLGLAYLERRSLQAATDAAALAAIR 61 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + + + + + E V VE Sbjct: 62 HPHDAEAIAAKV-FAASSQKDATFDVVFGSYEPDRNRDERFRPEGSSAVSVVRVEARKPH 120 Query: 127 SAYQVVL 133 + Sbjct: 121 HYTLARV 127 >gi|270006429|gb|EFA02877.1| hypothetical protein TcasGA2_TC008029 [Tribolium castaneum] Length = 1868 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 35/112 (31%), Gaps = 22/112 (19%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A + A + S D +K + L+TDG + + + Sbjct: 156 YTGGGTNTLKAFEVAKEIFTQSRNDS----------EKVLFLITDGFSNGGDPIP--LAA 203 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + K ++I TI + + ++P + L F + Sbjct: 204 ELKKDQVKIFTIGIANGNY-----KELYELASTPGEIY-----SYLLDSFEE 245 >gi|149202125|ref|ZP_01879098.1| hypothetical protein RTM1035_12398 [Roseovarius sp. TM1035] gi|149144223|gb|EDM32254.1| hypothetical protein RTM1035_12398 [Roseovarius sp. TM1035] Length = 178 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 23/59 (38%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 F + ++ + TG + L+ P +G+ +++ + L++ A+ Sbjct: 1 MTRVFKNFLRRFRQDDTGTASLEFVLVAPFFIGIMIFSIEMGFVTMRATLLERGLDMAV 59 >gi|326385751|ref|ZP_08207380.1| TadE-like protein [Novosphingobium nitrogenifigens DSM 19370] gi|326209730|gb|EGD60518.1| TadE-like protein [Novosphingobium nitrogenifigens DSM 19370] Length = 201 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 49/140 (35%), Gaps = 1/140 (0%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA- 64 + + +L++ + A+ P ++G+ +++ + AL+ A+ A Sbjct: 9 RPFAFLARLMRDEKASTAMEFAIAAPFLIGILMATFEIIILFLAQAALETTAEGAARYVL 68 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + V T P+Q+ Y+ + + N +V T V+++ Sbjct: 69 TGQAQTNFTGVKDSNGKVITTPQQQFAAYVCTQMSSFMSCNNLYVDVNSGSDYTTVDLSV 128 Query: 125 RKSAYQVVLSSRYDLLLNPL 144 + + + + N Sbjct: 129 PQFTFDATNNYKVTNTFNYN 148 >gi|224048537|ref|XP_002190467.1| PREDICTED: similar to collagen, type XII, alpha 1 [Taeniopygia guttata] Length = 3122 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 50/150 (33%), Gaps = 22/150 (14%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 FS + + I++ GST AM + + +++ + + + Sbjct: 491 FSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTSKGSRPNV------PRVM 544 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SF 428 +L+TDG+++ +E K + + I + ++ R L ASP Sbjct: 545 ILITDGKSSDAFKEPAI---KLRDADVEIFAVGV------KDAVRTELEAIASPPADTHV 595 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRITK 458 + F+ +RI + Sbjct: 596 YTVED----FDAFQRISFELTQSVCLRIEQ 621 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 50/149 (33%), Gaps = 20/149 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D V++G ++ ++ + ++ +++++ A + G+T A+ Sbjct: 1227 FDIGPDRVQIGLAQYSGDPRTEWNL--NAYRTKQSLLEAVANLPYKGGNTLTGMALDFIL 1284 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + A+K VL+TDG++ D K + +G+ + I Sbjct: 1285 R------NNFKPVAGLRPRARKIGVLITDGKSQDDVVAPS---RKLRDEGVELYAIGI-- 1333 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANS 433 + L A+ + Sbjct: 1334 ----KNADENELKQIATDPDDIHAYNVAD 1358 >gi|188994155|ref|YP_001928407.1| hypothetical protein PGN_0291 [Porphyromonas gingivalis ATCC 33277] gi|188593835|dbj|BAG32810.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC 33277] Length = 1228 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 49/188 (26%), Gaps = 16/188 (8%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 ++ + A I ++ D V GA F +S+ + K Sbjct: 256 TEQYPVKNVTTADFIGETGNANDPIDLVIQGAINFPTNYVSNNPSTPLTPNYPTHSSKVG 315 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + E + Y + + + E + I Sbjct: 316 RRNLPES---------KFDYSNLSARITFDGVAGALVYEPRFPHPYYYY----FPCNAAI 362 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 AK+ G I TI + + L A+ + F + L F + I Sbjct: 363 NEAQFAKNSGYTIHTIGYDLGDFALANNS--LKLTATDENHFFTATPANLAAAFDN-IAQ 419 Query: 448 EIFERVIR 455 I + R Sbjct: 420 TINIGIQR 427 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 29/65 (44%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 +++ +G TA+L+P +L ML D++ + + L+ A A + A+ + Sbjct: 1 MLRDESGSVTAATAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAARYTGERPS 60 Query: 74 EVSSR 78 V Sbjct: 61 AVRQA 65 >gi|254179953|ref|ZP_04886552.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184210493|gb|EDU07536.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 602 Score = 49.2 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 47/169 (27%), Gaps = 5/169 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + ++ Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQ 75 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + L D + P +A QV + Sbjct: 76 CAQPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTAT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 135 QSVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 183 >gi|311070191|ref|YP_003975114.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] gi|310870708|gb|ADP34183.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] Length = 226 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 46/136 (33%), Gaps = 15/136 (11%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G+H ++ N G T I A+ + K N E K + L Sbjct: 100 YGLHPYEGSLFNNALSQINPTGWTPIAKALSDTREEFQ----------KVNAEGKNVVYL 149 Query: 374 LTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +TDGE T + I K + + + I F+ + +K + A + A Sbjct: 150 ITDGEETCGGDPAKEI-EKLREANVDTIVNIIGFNFDVKGNQKMKQA--AVAGGGEYISA 206 Query: 432 NSTHELNKIFRDRIGN 447 + E + + + Sbjct: 207 KNAEEFEQAWENEAQK 222 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 112/365 (30%), Gaps = 24/365 (6%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS-----SRYDLLLNPLS 145 + L + + + + + R + + + +V+ + L NP Sbjct: 119 TDPLNPDTDGDEMLDGYEVIAGTSPRLDLTDGSTPDNTRVMVVEKRLNGCKLHLSGNPNI 178 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHG---VSIQWVIDFSRSMLDYQRDSEGQPLN 202 + + I + + + ++ E+ +I+ +D SR + + + Sbjct: 179 AGASIEKLDLFGISSNPSVVSAAYDFYSEYPCMEATIKISVDESRLRKCGCEFKDLKVMR 238 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + S N + ++ P L + Sbjct: 239 FDVDSREYIPLESRSDNKRRYVQATITEPGTYVVGADASVNEAATTRVAFLIDNSGSMYP 298 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 ++ V +S+ ID + R+G + F F+ L Sbjct: 299 KELCPTSSENDVDFKRLDFTQSL--IDKFDSDFRIGISKFTGTYTRMCGFTDDRTALSDV 356 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 I + DE G T A++ + ++ + + IV+L+DGE+ + Sbjct: 357 IKRIRTEDEIFDG-THNQTALKRCIEEFTATGDGKYVN---------IIVMLSDGESDET 406 Query: 383 NEEG-IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 N E + A + + ++T+ + S + ++ A+ + L+ ++ Sbjct: 407 NAESIKNLARLANEKSVIVLTVGLGREIDRAWLQEMAYS---TGGKYYSASEANALDDVY 463 Query: 442 RDRIG 446 + + Sbjct: 464 KQIVT 468 >gi|254784286|ref|YP_003071714.1| matrixin family protein [Teredinibacter turnerae T7901] gi|237683907|gb|ACR11171.1| matrixin family protein [Teredinibacter turnerae T7901] Length = 877 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/404 (9%), Positives = 101/404 (25%), Gaps = 30/404 (7%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRN---FENNLKKNFTDREVRDIVR 116 A+ + SV ++ + + + + + Y ++ Sbjct: 235 AVSSTSVTWQNNVRSLEADDEMAAARSNPGMGFYPTGTAPWTTGSISGRVRQINGTAAQG 294 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 N + V + + + + Sbjct: 295 VRVWAYNTADLSAPVYEAFTATQFDAEAGVSSGEYEFRGVPAGSYYVCIVPWGVNVPNAD 354 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + S + + S + +V + + +P + Sbjct: 355 LDNTGRYNLSTLNGSGNTGFATECYDDAPAGMASPDFSEIDRLREVDVATGETTPNINFV 414 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + + SS+ + K ++ A + + + R Sbjct: 415 TGAQNTDVVLVMDRSGS----MNLSSAPDPSVSKMDALKYAANVFMDFLDL----DAGHR 466 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVK---TFAIDENEMGSTAINDAMQTAYDTIISS 353 G F V+ S ++ + + + T G T I D + + ++ Sbjct: 467 AGLVQF-HEVVVPFSPAFNLQPVNAASLSAAQTAINSMTAGGMTNIIDGVNEGIAQLTTA 525 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++ ++LLTDG + + I + + + ++ F + + Sbjct: 526 VDPSD---------RQIMLLLTDGLHNRPVGTSVTDITAPLLASEVTLYSVGFGTSTNEA 576 Query: 413 EKARYFLSNCASPNSFFEANSTHELN--KIFRDRIGNEIFERVI 454 E LS + E +L K F + + Sbjct: 577 ELTPLALS---TGGVHLENKDVSDLQLRKHFLSIAASAADSTTL 617 >gi|157961836|ref|YP_001501870.1| hypothetical protein Spea_2015 [Shewanella pealeana ATCC 700345] gi|157846836|gb|ABV87335.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345] Length = 445 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 50/153 (32%), Gaps = 2/153 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE-- 73 ++ +G ++ + + +L + +D + L+ A A + A+ L Q Sbjct: 16 RNQSGAILVMFTIGLFSILAFAALALDGGHILLSKGRLQNAVDAAALNAAKELQQGATLF 75 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 E A E + N + +F +V + E + V Sbjct: 76 EARQAAYTILLQNLGFSENGELNNSISLSSPDFNTTQVTARLHVEFSEQPDPFNPILVEG 135 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 S + + ++L I ++ + +A A Sbjct: 136 SEYVRVRVENVNLSNFLADILNFDKKIRASAVA 168 >gi|147906540|ref|NP_001087052.1| von Willebrand factor A domain-containing protein 2 precursor [Xenopus laevis] gi|82235697|sp|Q6DCQ6|VWA2_XENLA RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|50415038|gb|AAH77945.1| MGC80919 protein [Xenopus laevis] Length = 790 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/267 (11%), Positives = 73/267 (27%), Gaps = 18/267 (6%) Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 F+ ++ + S + + ++ + + Sbjct: 414 FMGGGLFTGKALRYVTQYGFKSTPVFSDVRDDLPRLVVLLTGSKSQDSVTGPATYARDQE 473 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR- 265 V S S++ I L+ S + L+ L + + L Sbjct: 474 VFLIGVTSDSNKGEMAEIVGNPLNLVTYSNPQQLFNQLPQLQKRICSIDVQGCQAQPLDL 533 Query: 266 ------HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + V + D D ++G ++ R + H Sbjct: 534 AFVLDASTAVGQEKFNRLKNFVTMVSLQFDINRDVTQIGLVTYSSRPETVFGL--DTHDS 591 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++++ GS + A+ Y+ +++ + + K +V++TDG Sbjct: 592 GSSLLQGIGRASYMGGSASTGSALLRVYNDVMTVQKGARPGVN------KAVVVITDGRG 645 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFS 406 +D K + GI + I Sbjct: 646 AEDAAVP---AQKLRDNGIMVYVIGIG 669 >gi|332283431|ref|YP_004415342.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] gi|330427384|gb|AEC18718.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] Length = 585 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 58/170 (34%), Gaps = 18/170 (10%) Query: 277 ALASVIRSIKKIDNVNDTVRMGAT-FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 V+ + K N + + A + SD ++ R G Sbjct: 49 TARKVMGELVKSWPENTNLGLIAYGHRSAGSCSDIETMIEPQRVDRDAFIKTVNAITPKG 108 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T I+ +++ A D + + +VL++DG + + + A+ + K Sbjct: 109 KTPISASLKQAADVLQ------------YRDHNATVVLISDGLESC-HGDPCAVAAELKE 155 Query: 396 QGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 +G+ + + F +++ E N + F A++ EL + Sbjct: 156 KGVDFKAHVVGFDLDQEGNEALSCIAKN--TGGIFVPASNADELQDALQQ 203 >gi|194675927|ref|XP_001788181.1| PREDICTED: integrin, alpha E [Bos taurus] gi|297486662|ref|XP_002695836.1| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Bos taurus] gi|296476814|gb|DAA18929.1| integrin, alpha E [Bos taurus] Length = 1163 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 55/182 (30%), Gaps = 15/182 (8%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A + +K R + + + ++ R + Sbjct: 196 DPPDFQKAKDFIYDMMKNFYTKCFECRFAVVQYGEVIQTELDLLDSQDV--RASLDRVKN 253 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T AMQ D I + + +A K +V+LTDGE QD + Sbjct: 254 ISQVGKITKTASAMQHVLDNIFT------PNQGSRAKASKVMVVLTDGEIFQDPLNLTTV 307 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANS---THELNKIFRD 443 N K G+ I + KA + L AS + F+ + L ++ Sbjct: 308 INSPKMHGVERFAIGVG-EAFNKSKAYHELKLIASDPDEDHAFKVTNYMALDGLLNKLQE 366 Query: 444 RI 445 I Sbjct: 367 SI 368 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V++GA +++ + I + + G T A+Q Sbjct: 873 VKKADVGRDRVQIGALTYSNHPEILFYL--NTYSSGSAIAEHLRRPRDTGGETYTAKALQ 930 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + ++ ++++TDG + D ++ + + +GI I + Sbjct: 931 HSNVLFT-----EEHGSRLTQNVRQLMIVITDGV-SHDRDKLDEAARELRDKGITIFAVG 984 Query: 405 FSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIF 441 + L A + ++ +L I+ Sbjct: 985 VG------NANQDELETMAGKKENTVHVDNFDKLRDIY 1016 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 93/335 (27%), Gaps = 19/335 (5%) Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 L+ + + ++ ++ V ++ + S S Sbjct: 308 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 367 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP--ADRT 211 + S +E + ++ +M ++ + T Sbjct: 368 QGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 427 Query: 212 VKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 SYS G +K+ + + + + ++E + +K Sbjct: 428 HSSYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLIDGSSSIRK 487 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + I + VR+G ++ + + S + + K + Sbjct: 488 KEFEQIQIFMSSVIDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGID--LKKAVFNIK 545 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A+ I + + + A Y+++LTDG++ E N Sbjct: 546 QLKGLTFTGKALDFILPLI------KKGKTERTDRAPCYLIVLTDGKSNDSVLEP---AN 596 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +++ I I I E + L A + Sbjct: 597 RLRAEQITIHAIGIG------EANKTQLRQIAGKD 625 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ + +++ I+ + + + + +N +S + Sbjct: 674 TNFETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPI 733 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G + NE + +K+++LLTDG+ + Sbjct: 734 NRNTLTGG------------ALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGP-- 779 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 +S+ + I ++ +A+ L + S F + L I I Sbjct: 780 -ATALRSKSVTIFSVGV----YGANRAQ--LEEISGDGSLVFHVENFDHLKAIESKLI 830 >gi|294011132|ref|YP_003544592.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] gi|292674462|dbj|BAI95980.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] Length = 157 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 43/130 (33%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L+ G+ + A++MPV++ + M DV + L++AA+ A+ + Sbjct: 3 KRLLSDRYGNSTVELAIIMPVLVLLTCMAGDVAMAFKAKIGLQRAAERTAQLAAAGGYTN 62 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +S+A N+ N + K + Sbjct: 63 DTTDTSKAYNNLAADAAAAAGVPTGNVTVTPTLLCNATVQTASPEVPCPDGQQTKRYVAI 122 Query: 132 VLSSRYDLLL 141 +S Y + Sbjct: 123 AISGTYTPMF 132 >gi|553536|gb|AAA36226.1| MHC factor B [Homo sapiens] Length = 171 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 36/131 (27%), Gaps = 18/131 (13%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A+Q Y + D+V N I+L+TDG + + I + ++ + Sbjct: 14 GTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VIILMTDGLHNMG-GDPITVIDEIRD 69 Query: 396 --------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + R + Q + F+ L +F Sbjct: 70 LLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKD-NEQHVFKVKDMENLEDVFY 128 Query: 443 DRIGNEIFERV 453 I + Sbjct: 129 QMIDESQSLSL 139 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 55/199 (27%), Gaps = 22/199 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFND----RVISDPSFSWGVHKLIRTIVKTF 327 A ++ I+K+ + R G + V + S + + + + Sbjct: 197 SNFTGAKKCLVNLIEKLASYGVKPRYGLVTYATXXXIWVKVSEAVSSNADWVTKQLNEIN 256 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D T +A+Q Y + D+V N I+L+TDG + + I Sbjct: 257 YEDHKLKSGTNTEEALQAVYSMMS--WPDDVPPEGWNRTRH-VIILMTDGLHNMG-GDPI 312 Query: 388 AICNKA-------KSQG------IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 + + K + + + Q + F+ Sbjct: 313 TVIDXXXXXXYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKD-NEQHVFKVKDM 371 Query: 435 HELNKIFRDRIGNEIFERV 453 L +F I + Sbjct: 372 ENLEDVFYQMIDESQSLSL 390 >gi|324514578|gb|ADY45916.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 432 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 60/154 (38%), Gaps = 11/154 (7%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+ F+ S F+ + + I++ E+ G+TA+ + ++ A Sbjct: 275 GPRFTRVAFITFSSVGKSRTHFNLNRYDNAQQIIEAIRRVESTGGTTAVGEGIRIATQ-- 332 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +++ + AKK +++ TDG + + + + +AK+ G + +I + N Sbjct: 333 ---QQEKRMGGRPISIAKKAMLIFTDGWSNKG-PDPEEMSKEAKAAGFVLYSIGYEGNGR 388 Query: 411 QQEK----ARYFLSNCASPNSF-FEANSTHELNK 439 + +Y + A + + EL + Sbjct: 389 PDAEFAGLNQYTMDAIADTMHHVYSERNFSELVE 422 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +KK D D V+ GA ++D + I++ + GST A+ Sbjct: 841 VKKSDVGRDRVQFGALRYSDDPDILFYL--NTYSNRSAIIEHLRRRRDTGGSTFTAKALG 898 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + E H + K+ ++++TDGE + D K +++GI I+ + Sbjct: 899 RSATLFE-----EQHGSRIKQNVKQMLIIITDGE-SHDRHLLNDTALKLRNKGITIIAVG 952 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIF 441 + + L A + +L ++ Sbjct: 953 VG------KANQEELEAMAGNKENTIHVKDFDKLKDVY 984 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 67/190 (35%), Gaps = 23/190 (12%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 VD SS +++ + +++ I+ D ++G F+D + Sbjct: 631 FLVDGSSSIGYAN-FEKMKNFMQTLLAKIQI---GADKTQIGVAQFSDYNKEEFPL--NK 684 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + I G+T A++ S KK+++L+TD Sbjct: 685 YFTQKEISDAIDRMLLITGNTLTGSALKFIDTYFTQSK-------GARHGVKKFLILITD 737 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTH 435 GE QD+ A+ + +G+ I+++ R L + S F + Sbjct: 738 GEA-QDDVREPAV--ALRDKGVIILSVGV-YGAN-----RTQLEEISGDGSLVFHVENFE 788 Query: 436 ELNKIFRDRI 445 +L +I R I Sbjct: 789 DLKEIERKLI 798 >gi|301780868|ref|XP_002925836.1| PREDICTED: integrin alpha-D-like, partial [Ailuropoda melanoleuca] Length = 926 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 57/154 (37%), Gaps = 13/154 (8%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 +++ + +F+ + + G T ++T + S Sbjct: 3 LIQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQL--NGLTFTATGIRTVVQELFHSKNGAR 60 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 A+K ++++TDG+ +D E + +A+ GI I + Q+ AR Sbjct: 61 ------KTARKILIVITDGQKYRDPLEYSDVIPQAERAGIVRYAIGVG-DAFQKLTARQE 113 Query: 419 LSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 L+ S + F ++ L+ ++++ +I Sbjct: 114 LNTIGSKPSQDHVFRVDNFAALSN-IQEQLQEKI 146 >gi|290956351|ref|YP_003487533.1| hypothetical protein SCAB_18411 [Streptomyces scabiei 87.22] gi|260645877|emb|CBG68968.1| putative secreted protein [Streptomyces scabiei 87.22] Length = 422 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 59/190 (31%), Gaps = 18/190 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A V+ + K + +DR + L RT KT Sbjct: 59 SRMSAAKQAFNEVLDATPKEVQLGIRTLGADYPGDDRKTGCKDTAQLYPVGPLDRTEAKT 118 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ A D + E K IVL++DGE+T + Sbjct: 119 AVATLAPTGWTPIGPALLKAADDL------------EGGEGTKRIVLISDGEDTCAPLDP 166 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + ++GI I T+ N + + A+ ++ EL Sbjct: 167 CEVAREIAAKGIGLTIDTLGLVPNAKLRVQLSCIAE--ATGGTYTSIEHRDELTDRVNQL 224 Query: 445 IGNEIFERVI 454 + V+ Sbjct: 225 VDRAADPVVV 234 >gi|121599267|ref|YP_993091.1| hypothetical protein BMASAVP1_A1771 [Burkholderia mallei SAVP1] gi|124385206|ref|YP_001026132.1| hypothetical protein BMA10229_A0124 [Burkholderia mallei NCTC 10229] gi|126450090|ref|YP_001080596.1| hypothetical protein BMA10247_1041 [Burkholderia mallei NCTC 10247] gi|167002573|ref|ZP_02268363.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|217423806|ref|ZP_03455307.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238563902|ref|ZP_00438064.2| membrane protein [Burkholderia mallei GB8 horse 4] gi|254177770|ref|ZP_04884425.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254206222|ref|ZP_04912574.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|121228077|gb|ABM50595.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293226|gb|ABN02495.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126242960|gb|ABO06053.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147753665|gb|EDK60730.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|160698809|gb|EDP88779.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|217393664|gb|EEC33685.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238519717|gb|EEP83185.1| membrane protein [Burkholderia mallei GB8 horse 4] gi|243061787|gb|EES43973.1| conserved hypothetical protein [Burkholderia mallei PRL-20] Length = 602 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 47/169 (27%), Gaps = 5/169 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + ++ Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQ 75 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + L D + P +A QV + Sbjct: 76 CAQPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTAT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 135 QSVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 183 >gi|126451635|ref|YP_001066065.1| hypothetical protein BURPS1106A_1796 [Burkholderia pseudomallei 1106a] gi|242315320|ref|ZP_04814336.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|126225277|gb|ABN88817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|242138559|gb|EES24961.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 602 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 47/169 (27%), Gaps = 5/169 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I G F ++ A+ M V + G VD+ + L++ A A + + + ++ Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQ 75 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + L D + P +A QV + Sbjct: 76 CAQPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTAT 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 135 QSVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 183 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 45/159 (28%), Gaps = 33/159 (20%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN------------- 364 + T G TA+ + + A T++ + R+ N Sbjct: 154 SRSKDAAATAIDRLKPQGGTALGNGLLIALTTLLPQTAGDAERLMNGDTTPLQKPDASHS 213 Query: 365 -------LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT------- 410 IVL +DGE+ A A + G+R+ T+ Sbjct: 214 GEAVTPGSYPSGAIVLFSDGESNTGPAATQAA-QLAAAHGVRVYTVGVGTTDGVVLSVDG 272 Query: 411 ---QQEKARYFLS--NCASPNSFFEANSTHELNKIFRDR 444 + L A+ +F +L +++R Sbjct: 273 WSARVRLDEKVLKEVANATGAEYFPLADAAQLKRVYRAL 311 >gi|126730248|ref|ZP_01746059.1| hypothetical protein SSE37_10849 [Sagittula stellata E-37] gi|126708981|gb|EBA08036.1| hypothetical protein SSE37_10849 [Sagittula stellata E-37] Length = 183 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 5/55 (9%), Positives = 20/55 (36%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 ++ ++ + + G+ + A + V V++ + L+++ Sbjct: 8 SRLGRFAGRFCREENGNSTLEFAFYFSIFFLVLAAGVEIAYMNLRHAMLERSVDR 62 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 41/130 (31%), Gaps = 16/130 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 257 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLT-RMSDAGRQLALQAV 309 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ + E I+LL+DG +T Sbjct: 310 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS---------IILLSDGRDTYTTNHPDP 360 Query: 389 ICNKAKSQGI 398 A +Q I Sbjct: 361 SYKDALAQCI 370 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 41/130 (31%), Gaps = 16/130 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+++ D R+ F+ + + R + Sbjct: 258 TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLT-RMSDAGRQLALQAV 310 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I D ++ + E I+LL+DG +T Sbjct: 311 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS---------IILLSDGRDTYTTNHPDP 361 Query: 389 ICNKAKSQGI 398 A +Q I Sbjct: 362 SYKDALAQCI 371 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 17/157 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 214 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 268 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 269 EYMERGTMTGLALRHMVEHSFSEAQGARPRDLNVP---RVGLVFTDGR-SQDDISV--WA 322 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +AK +GI + + + L AS S Sbjct: 323 ARAKEEGIVMYAVGVG------KAVEEELREIASEPS 353 >gi|301616677|ref|XP_002937788.1| PREDICTED: collagen alpha-6(VI) chain-like, partial [Xenopus (Silurana) tropicalis] Length = 1529 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/310 (9%), Positives = 81/310 (26%), Gaps = 22/310 (7%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 ++ ++L + L + I +I + S + + ++ Sbjct: 73 SIDNMNLMGNVAKTGNALANVADYFTEAKGARPSSKVSKILLLITDNPSQDEVKVPADSL 132 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY------PGPLDPSL 253 N + + + ++ K++PY S +K + Sbjct: 133 RSNGLIIYSIGGFSANKKELQEIS---GKITPYYQSFDKLQTIENDLLFRICNPEEECKR 189 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 E + ++ ++ + K D V+ GA ++D + Sbjct: 190 IEVADIVFVIDSSGSIDYTEYKEMQNFMVSLVNKSAVGPDNVQFGALKYSDY---NTELF 246 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + + G + + +++ Sbjct: 247 YLNRYTNKVDIINHINKDTTQGGNTYTAGAVRFSKEFFTEKHGSR----KARGVPQIVMV 302 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +TDG++ ++ + + +GI I I E + +F + Sbjct: 303 ITDGDSHDKDKLNET-ARQLEQEGIIIYAIGIDQ-ANTNELETLA----GTEGKWFMVAN 356 Query: 434 THELNKIFRD 443 L I Sbjct: 357 FSGLQDILVQ 366 >gi|227819050|ref|YP_002823021.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] gi|227338049|gb|ACP22268.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] Length = 155 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/146 (10%), Positives = 40/146 (27%), Gaps = 3/146 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + +K +G +I A+ +PV++G G+ + W + L+ AA + A+ Sbjct: 12 RRSGLPRGWLKDESGTVAVIAAVTLPVLVGAMGLGAETGYWYLKDRKLQHAADVSAHAAA 71 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V ++ + Y + + + Sbjct: 72 VRYRAGDQKPALETTAKRIAM---ASGYSPGSLSVSTGAGAGGSHKVTVELSETHPRLFS 128 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSM 151 ++ + R Sbjct: 129 SIFSSEPVTMSARAPWPRSMVAPRLA 154 >gi|126465452|ref|YP_001040561.1| von Willebrand factor, type A [Staphylothermus marinus F1] gi|126014275|gb|ABN69653.1| von Willebrand factor, type A [Staphylothermus marinus F1] Length = 416 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 60/193 (31%), Gaps = 28/193 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A ++ ++ D V F + V R V+ I Sbjct: 59 AKQAALRLLDILRDKDYVGVYG------FAGKFYKVLEP---VPATNRNEVEKAIIGLKL 109 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI---AIC 390 T I YDT+ E+ +++ + I+ +TDGE T ++ + Sbjct: 110 GSGTNI-------YDTLKKLVEETKKVLESGAISLVRIIFITDGEPTTGQKKPEKILEMA 162 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD------R 444 K + G + I ++ +R + F + L K+ + Sbjct: 163 KKLREAGASALIIGVGTEYNEKLLSRMAM---VLNGEFEHVSDPASLEKLISEYAKSTQE 219 Query: 445 IGNEIFERVIRIT 457 I + + R++ Sbjct: 220 ISAKNVAVLFRLS 232 >gi|149620125|ref|XP_001521750.1| PREDICTED: similar to Collagen, type VI, alpha 1 [Ornithorhynchus anatinus] Length = 1163 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 63/186 (33%), Gaps = 24/186 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I + D + A ++D V + + + Sbjct: 207 VEQVKIFTKKFIDKLTDRYYRCDRNLVWNAGALHYSDSVELIQGLTRMPSG-QKNLKDRV 265 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 + T + A++ + ++ ++ + KY++++TDG + +E Sbjct: 266 EAVQYIGKGTHTDCAIKRGIEELL--------IGGSHQKENKYLIVVTDGHPLEGYKEPC 317 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK GI++ +IA LS A+ +++ F NS +++ Sbjct: 318 GGLEDAVNEAKHLGIKVFSIAI-----SPGHLEPRLSIIATDHTYRRNFTVNSEENVDES 372 Query: 441 FRDRIG 446 I Sbjct: 373 IDTMIN 378 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 61/220 (27%), Gaps = 24/220 (10%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + S + K V+ + + K VR+ Sbjct: 957 YTCPIKFTSSSDITILVDSSTSVGSHNFKTSKQFVKRLAERFLSADK---TDTADVRVSV 1013 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEV 358 ++ R + T + + D +T +N A++ S+ Sbjct: 1014 VQYSGRNQQKLEAQFL---QNYTEIASIIDDMEFINDATDVNAAIRYVTTLYQKSSP--- 1067 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQE---- 413 KK ++L +DG + + I +A+ GI I + + + Sbjct: 1068 ------RGVKKRLLLFSDGNSQGITGKAIEAAVQEAQRAGIEIYVLVVGRHANEPNIRVL 1121 Query: 414 -KARYFLSNCASPNSF-FEANSTHELNK-IFRDRIGNEIF 450 + + A F H L + +F + +I Sbjct: 1122 VTGKTTEYDVAYGERHLFRVPDYHALLQGVFYQTVSRKIS 1161 >gi|319792023|ref|YP_004153663.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315594486|gb|ADU35552.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 345 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 62/207 (29%), Gaps = 38/207 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A S I+ + TV++G F + L+ I Sbjct: 106 NRLVAAQEAAKSFIKDL------PRTVKVGIVAFAGSAQVAQLPTTNHDDLVTAIDSFQL 159 Query: 329 IDENEMGS-------TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY----------- 370 G+ T DA S + + + K + Sbjct: 160 QRATATGNAIVVSLATLFPDAGIDVEQFSAPSRQRGTPIDQTEKKLKDFTPVAPGSFTSA 219 Query: 371 -IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------EKARYFL 419 I++LTDG+ T + + A +G+RI T+ + L Sbjct: 220 AIIMLTDGQRTTG-VDPLDAAKAAADRGVRIYTVGVGTVDGETIGFEGWSMRVRLDEETL 278 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDR 444 A + +F A + ++L K++ Sbjct: 279 KAVANKTNAEYFYAGTANDLKKVYETL 305 >gi|156383823|ref|XP_001633032.1| predicted protein [Nematostella vectensis] gi|156220096|gb|EDO40969.1| predicted protein [Nematostella vectensis] Length = 182 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 29/123 (23%), Gaps = 19/123 (15%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + A G T +A+ + + K +V+ Sbjct: 61 DDYDHINDIKAAVDAFSHQHGGQTRTGEALTFTLEEVFKKAP--------RPSVKNVLVV 112 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFE 430 LTDG+ + + G+ + I L + AS F Sbjct: 113 LTDGKAQGNVTGP---AQDVRDHGVEVFAIGVG-----PHSNEAQLKDIASDPDDKHVFH 164 Query: 431 ANS 433 Sbjct: 165 VTD 167 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 41/153 (26%), Gaps = 18/153 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 76 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 132 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 133 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 191 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + F+ L +F I Sbjct: 192 KKD-NEQHVFKVKDMENLEDVFYQMIDESQSLS 223 >gi|221109964|ref|XP_002168937.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 221 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 65/193 (33%), Gaps = 24/193 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 +S ++ +K ++ +++ SI R F+ L Sbjct: 40 SASLRKYYQNEKDFLKSLISAFGVSINST-------RAAVITFSYHAQLSIKL-NKYSNL 91 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I +T I+ A++ A + + K + LLTDG Sbjct: 92 NSFKEAVDNIVLM-GSTTRIDKALRLAQKEVFELENGARPGVA------KILFLLTDGSQ 144 Query: 380 TQ--DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHE 436 TQ +E + I N+ +S G+ I+ I L + A + + A+S + Sbjct: 145 TQERGSENPVVIANELRSAGVTIIVIGI-----TNAVDVSELFDIAGGEENAYFADSFEK 199 Query: 437 LNKI-FRDRIGNE 448 L + F + + + Sbjct: 200 LKDVNFLEVVKRK 212 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 43/427 (10%), Positives = 109/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + L + + + ++ V + +S L L Sbjct: 101 YHSQANKLILDNKEYDIIDETENKKDTDVSVPKVDEIEEESSKENENSVS---PFTLPTL 157 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 158 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 214 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 215 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 274 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 275 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 333 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + + +V R+ Sbjct: 334 FDSVKNQVKSITPSRTNGGTFTQKALRDAGNMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 393 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 394 HAQSSSNYYGTQFSNTQDQPGNTSRISNSYYAPDQNNLSRRIDSTFIATIGEAMALKERG 453 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 454 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 512 Query: 448 EIFERVI 454 +I V Sbjct: 513 QISATVS 519 >gi|292624276|ref|XP_002665574.1| PREDICTED: collagen alpha-1(XXVIII) chain [Danio rerio] gi|225310547|emb|CAQ19234.1| collagen type XXVIII alpha 1 c precursor [Danio rerio] Length = 1170 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 56/168 (33%), Gaps = 21/168 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +V+D + S+I + + R+G ++ + S + VKT Sbjct: 818 YEVVKDFVNSLIDHVSVS---REATRVGVVLYSHVEVVVASLQQLY---DQAAVKTAVRR 871 Query: 331 ENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIA 388 +G T A++ A ++ K V+LTDG + +D Sbjct: 872 MPYLGEGTFTGSAIRRATQLFQAARPGVR----------KVAVVLTDGLADNRDAVSLKD 921 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQ-EKARYFLSNCASPN--SFFEANS 433 A S GI I + N Q + + ++ AS ++ Sbjct: 922 AAEGAHSAGIEIFVVGIVNNSDSQYAEFKNEMNILASDPDENYVYLTD 969 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 46/134 (34%), Gaps = 16/134 (11%) Query: 316 VHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ +G T A+ A + + +LL Sbjct: 107 RDWQDIDVFQSRVDSMTYIGHGTYSAYAISNATQLFTQE---------TAPTSFRAALLL 157 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCAS--PNSF-FE 430 TDG++ + + + AK I + I ++ +Q+ R L + AS P + F Sbjct: 158 TDGDDHPRSPSAVTAASDAKIHDIHMFVIGLPGSSRDEQDYGR--LRSIASAPPQQYLFS 215 Query: 431 ANSTHELNKIFRDR 444 +K+FR+ Sbjct: 216 LTDPQLDDKLFREL 229 >gi|239814248|ref|YP_002943158.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239800825|gb|ACS17892.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 345 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 32/93 (34%), Gaps = 13/93 (13%) Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------E 413 +++LTDG+ T + + A +G+RI T+ + Sbjct: 214 GSYTSAAVIMLTDGQRTTG-VDPLDAAKAAADRGVRIYTVGVGTVDGETIGFEGWSMRVR 272 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 L A + +F A + +L K++ Sbjct: 273 LDEETLKAVANKTQAEYFYAGTAADLKKVYETL 305 >gi|203287632|ref|YP_002222647.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] gi|201084852|gb|ACH94426.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] Length = 341 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 44/136 (32%), Gaps = 13/136 (9%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ L I+ + +G F + + + + +D Sbjct: 118 SSKNRLDFAKELIEYFVYQRENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDL- 176 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 +A+ + A + K++ KK +++LTDG D + N Sbjct: 177 -GNGSALGLGISIALSHL-----------KHSEAPKKSVIVLTDGVVNSDEVYKDQVINL 224 Query: 393 AKSQGIRIMTIAFSVN 408 A+ ++I ++ + Sbjct: 225 AQGLNVKIYSVGIGSD 240 >gi|134282290|ref|ZP_01768995.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246328|gb|EBA46417.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 602 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 139 YFFLGPSRTVSATSTAKAT 157 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 62/193 (32%), Gaps = 28/193 (14%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + A ++ ++ D V F + V R V+ I Sbjct: 59 AKQAALGLLDILRDKDYVGVYG------FAGKFYKVLEP---VPATKRGEVERAIISLKL 109 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI---AIC 390 T I YDT+ E+ ++N + I+ +TDGE T + + Sbjct: 110 GSGTNI-------YDTLKKLVEETKKVLQNGALSLVRIIFITDGEPTVGKKNPKKILEMA 162 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI--GNE 448 K + G + I ++ +R + A F + L K+ + E Sbjct: 163 KKLREAGASALIIGVGTEYNEKLLSRMAM---ALNGEFEHISDPASLEKLISEYAKSTQE 219 Query: 449 IFER----VIRIT 457 + + ++R++ Sbjct: 220 VSAKNVAVLLRLS 232 >gi|242051338|ref|XP_002463413.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] gi|241926790|gb|EER99934.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] Length = 491 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 55/197 (27%), Gaps = 31/197 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A+ VI + D R+ F+ + + + + K Sbjct: 90 KLELLKQAMCFVIHQLGPAD------RLSIVTFSRHATRQIRLA-RMSDVGKASAKFAVG 142 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------N 383 + T I ++ + E ++LL+DG++T Sbjct: 143 ALCAVRGTNIGQGLRVGAQVLAGRRERNAVAG---------MILLSDGQDTSGCWTTVRP 193 Query: 384 EEGIAICNKA------KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + N S+ I T F + + + +F + + Sbjct: 194 DGTKTYANLVPPSTSFSSRPAPIHTFGFGTDHDAAAMHAIAEA---TGGTFSFVGNEAAI 250 Query: 438 NKIFRDRIGNEIFERVI 454 F +G + V Sbjct: 251 QDSFARCVGGLLSVAVQ 267 >gi|94312593|ref|YP_585802.1| hypothetical protein Rmet_3661 [Cupriavidus metallidurans CH34] gi|93356445|gb|ABF10533.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] gi|222832771|gb|EEE71248.1| predicted protein [Populus trichocarpa] Length = 575 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 12/84 (14%), Positives = 32/84 (38%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ ALL+ + + +DV Y + L++ A ++ + L + ++ + Sbjct: 12 RQRGTVSLMAALLIAAIGVAALVSLDVGFVFYTQRQLQKLVDVAALSGAQQLKSADDQAT 71 Query: 77 SRAKNSFTFPKQKIEEYLIRNFEN 100 + A + + + N Sbjct: 72 TNANVLSSVTSAAAQNGYTKAVAN 95 >gi|325496693|gb|EGC94552.1| hypothetical protein ECD227_0790 [Escherichia fergusonii ECD227] Length = 496 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/297 (10%), Positives = 79/297 (26%), Gaps = 35/297 (11%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------- 217 T S +++ + + S+ + +N F Sbjct: 40 ATFSLDVDTGSYANVRRFL-KTGSLPGADVVRVEELVNYFPLTEATKKNIPGCKGCEENS 98 Query: 218 -QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + + + + P + +D+S + ++ L++ Sbjct: 99 PFSINYELTPAPWNEKHTLLRLDIAANDIARSKLPPANLVFLIDTSGSMNSDERLPLIKS 158 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 +L ++ ++ D ++ S + + + G Sbjct: 159 SLKLLVNELRDQDRISIV---------TYAGSARLLLSSTSGSEKNTILNAIANLQAGGG 209 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAK 394 T + AY+ + + N I+L TDG+ + + K + Sbjct: 210 TNGGAGVAMAYE----QAQAGYIKGGVNR-----ILLATDGDFNIGDDPASVEDLVKKQR 260 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 GI + T+ + A ++ +S E K+ + + + Sbjct: 261 ESGITLSTLGVG----DNNYNEEMMVKIADTGNGNYSYLDSLSEAQKVLSNEMNQTL 313 >gi|218509982|ref|ZP_03507860.1| hypothetical protein RetlB5_22280 [Rhizobium etli Brasil 5] Length = 168 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 42/127 (33%), Gaps = 7/127 (5%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L+ G I A+L + + +++V + A ++ S + Sbjct: 10 LRRLLGDRQGVAAIEFAILALPLFIMLFGIIEVSLMFFVNS----ALDASVHKISRMIRT 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + +K + K KI + ++ +F + + D+ T+ + Sbjct: 66 GE---VASSKITLAGFKAKICDDMLLSFNCSSDLVVKVNVLSDLSSATSTDPIDNSGNLA 122 Query: 131 VVLSSRY 137 V + Sbjct: 123 VTQTFDV 129 >gi|167894484|ref|ZP_02481886.1| hypothetical protein Bpse7_12104 [Burkholderia pseudomallei 7894] Length = 587 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 5 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 63 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 64 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 123 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 124 YFFLGPSRTVSATSTAKAT 142 >gi|220922540|ref|YP_002497842.1| hypothetical protein Mnod_2575 [Methylobacterium nodulans ORS 2060] gi|219947147|gb|ACL57539.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 506 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/328 (9%), Positives = 88/328 (26%), Gaps = 8/328 (2%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 F + + L G ++++ V+ G+ + +D +R+ + ++ AA + Sbjct: 19 FMHSLLSQCRSLWADSRGIMLPYVSIMLVVIAGLSLLGLDGIRYMSLQTQMQAAADALAL 78 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + L + + + + V + Sbjct: 79 AGAAELNGKAGARARAQAAIDNLLSNGLSGMGVTAPVQVTVQFKRTLPVASQLPTGGTAA 138 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 A+ V++ + + ++F S + + A++V+ + ++ Sbjct: 139 TADSDAHFVIVDAS---PITLNTIFPASFVNSALANTFTSGAQSVAGMNQSICSIPPVYL 195 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + + P Q + S + + + + Sbjct: 196 CNPWEGTGIDLTTALATPGQARRQLKLLNDGTSSPGHFGWLVPPDNNVSASNLQDWISRT 255 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 D +S S+L + + DA + ++ V A + Sbjct: 256 TPKTCYRDGYVSLNTGAKQSALAGFNTRFDIGADAT-----HVPDVNVRKGRVSNSANWC 310 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAID 330 N + +D + + A+ Sbjct: 311 NAQPDTDRALALPQDSTFTNFASGGAMG 338 >gi|154250534|ref|YP_001411358.1| TadE family protein [Parvibaculum lavamentivorans DS-1] gi|154154484|gb|ABS61701.1| TadE family protein [Parvibaculum lavamentivorans DS-1] Length = 140 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 37/143 (25%), Gaps = 4/143 (2%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G I A + V L + + AL A A L Sbjct: 2 SRFGRDERGSVAIEFAFIAAVFLAILFGTISYGFQFATRIALSYAVTEGGRAAVAGLSDQ 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E + RA ++ + R + L + + EV + R + Sbjct: 62 --ERTQRAADAIYAVVDAYAPLIDRGGISLLDPQWRETEVGRTGDIAIEYTDARFTFLP- 118 Query: 132 VLSSRYDLLLNPLSLFLRSMGIK 154 + + F+ + Sbjct: 119 -FVPAIPGTMRVQTTFVVADPSG 140 >gi|126440828|ref|YP_001058814.1| hypothetical protein BURPS668_1775 [Burkholderia pseudomallei 668] gi|126220321|gb|ABN83827.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 602 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 36/139 (25%), Gaps = 14/139 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSL 72 I G F ++ A+ M V + G VD+ + L++ A A + + + Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 EEVSSRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ A + + T + +V Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 122 MNPRKSAYQVVLSSRYDLL 140 + V +S Sbjct: 139 YFFLGPSRTVSATSTAKAT 157 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/280 (12%), Positives = 82/280 (29%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + N F + S G+ S + S Sbjct: 433 VTLSNVSERKDNMRLGLSLTTNPKDNSFVACSPLWSHECGSSYYTTGMCSRVNSNFRFSK 492 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 493 TVAPALQRCQTYMDIIIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 542 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + V+ + + + I++ T ++ A Sbjct: 543 VGVVQYGEDVVHEFHL-NDYRSVKEVVEAASHIEQRGGTETRTAFGIEFA--------RS 593 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE + D+ + + ++ + +A ++ Sbjct: 594 EAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINP 652 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G+ I Sbjct: 653 EAFLNEIKYIASDPDDKHFFNVTDEAALKD-IVDALGDRI 691 >gi|154492260|ref|ZP_02031886.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] gi|154087485|gb|EDN86530.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] Length = 339 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 61/203 (30%), Gaps = 43/203 (21%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L+ + ++G F + + + + + Sbjct: 108 SPDRLSKAKQMLSKLTD-------GFSNDKVGLIVFAGDAFTQLPITSDYVSA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + S + K I+L+TDGEN +D+ Sbjct: 160 INPSMVSTQGTAIGAAINLAMRSFTPS-----------ETSDKAIILITDGENHEDDAVK 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ---EKARYFLSN--------------C-----AS 424 A A +GI + + K + + ++ + C A Sbjct: 209 AAA--AAAEKGIHVNIVGMGDPKGSPIPVDGSNNYMKDKDGNVVITKLNEEMCQEIAAAG 266 Query: 425 PNSFFEANSTHELNKIFRDRIGN 447 ++ A++T+ + + I Sbjct: 267 HGTYVRADNTNSALRALQKEIDK 289 >gi|268532310|ref|XP_002631283.1| C. briggsae CBR-CLEC-60 protein [Caenorhabditis briggsae] gi|187036877|emb|CAP23542.1| CBR-CLEC-60 protein [Caenorhabditis briggsae AF16] Length = 408 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 48/154 (31%), Gaps = 8/154 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI-VKTFAIDENEMGSTAIND 341 I T R+G +N + + + T + + I Sbjct: 74 TRIGTDPYEPRTTRLGLVTYNAVANTVANLDTYQSLDDVYDGIFTALSQVSSSDESYIVH 133 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A D + + KN ++ +++ + I + ++ K+ G+ I+ Sbjct: 134 GLAQAEDILEDGKSN-----KNRTHYQRVVIVYASTYKGTGPLDPIPVADRLKTAGVTIV 188 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 T+A+ + A L A+P F Sbjct: 189 TVAYDQDGDGALLAD--LQLIATPPYNFSNTDQA 220 >gi|327280282|ref|XP_003224881.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XIV) chain-like [Anolis carolinensis] Length = 1885 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 33/122 (27%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ + G+T A+ + + K +L+ Sbjct: 214 NTYSTKDAVLDAVRNLPYKGGNTLTGLALTFILE------NNFKSEAGARPGVPKIGILI 267 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ K GI + I + L AS + Sbjct: 268 TDGK-SQDDVIPP--AKNLKDAGIELFAIGV------KNADETELKEIASEPDNTHVYNV 318 Query: 432 NS 433 Sbjct: 319 AD 320 >gi|315497471|ref|YP_004086275.1| tade family protein [Asticcacaulis excentricus CB 48] gi|315415483|gb|ADU12124.1| TadE family protein [Asticcacaulis excentricus CB 48] Length = 187 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 37/128 (28%), Gaps = 4/128 (3%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G + AL+ G+ +++ + +L A S L Sbjct: 25 LWRDRRGATAVEFALIAFPFFGLIMGCIELAIVLFAGVSL----DLATAKVSRELRTGTS 80 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++ + T ++ T + + + V ++ + + Q + Sbjct: 81 GKATTSAIFITKVCNEMAWIGSDCKSKLRVDVRTFTNFQMVSQAPDVIVDGKFVSMQYTV 140 Query: 134 SSRYDLLL 141 + L Sbjct: 141 GGSSQIQL 148 >gi|319940447|ref|ZP_08014792.1| hypothetical protein HMPREF9464_00011 [Sutterella wadsworthensis 3_1_45B] gi|319806073|gb|EFW02822.1| hypothetical protein HMPREF9464_00011 [Sutterella wadsworthensis 3_1_45B] Length = 184 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 44/143 (30%), Gaps = 8/143 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY----EHALKQAA 57 +F ++ + ++ G + TA PV+L ++V+ + E A+ A Sbjct: 5 IFRSRIFTAGRAFLRDGRGVSAVETAFAFPVILAAFFIIVEFANMALTIQVGESAVSSAL 64 Query: 58 ----QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + AS + ++ + + T+ D Sbjct: 65 LRFRDAGELGASAENDIRQGIAAYSFGYLKPSNVSRVTVEAYESLDALGNPGGTNGAGED 124 Query: 114 IVRDTAVEMNPRKSAYQVVLSSR 136 TA + + VV+ S+ Sbjct: 125 DEEGTAEADSSYPAWKVVVVISK 147 >gi|298705150|emb|CBJ28593.1| glycin-rich protein [Ectocarpus siliculosus] Length = 1238 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 42/133 (31%), Gaps = 21/133 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + D G T I D + + + +S ++++ Sbjct: 137 GTFYSLEDFNAFVDADAKYSGGTDIIDGIAKGRELLSAS-----------PTTTSFMIVT 185 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANS 433 TDG + + A+++G + + +Q+ L + F+ ++ Sbjct: 186 TDGVA----PDPQDEADAARAEGTILYAVGVGSGPSQEN-----LLAIGGDEANVFDVDN 236 Query: 434 THELNKIFRDRIG 446 EL+ D + Sbjct: 237 FEELDLALDDIVS 249 >gi|299138185|ref|ZP_07031365.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600115|gb|EFI56273.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 382 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 44/145 (30%), Gaps = 1/145 (0%) Query: 262 SSLRHVIKKKHLVRDALASVIR-SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + + R A + + + ++ + F V + KL Sbjct: 91 DTSQSQRTVIDSERSASGTFLDKMLAPASANRESDKAFVVQFAREVELLQDPTDSRSKLK 150 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + + + + ST + T+ + + + + ++ ++LLTDG + Sbjct: 151 QALKELDTTAPSTSSSTGDDSGHAHGGTTLYDAVFLSADEVTSKQKGRRALILLTDGVDR 210 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAF 405 E A+ I I F Sbjct: 211 NSKESIAEAIEAAQRADTIIYAIYF 235 >gi|323702113|ref|ZP_08113781.1| Protein of unknown function DUF2134, membrane [Desulfotomaculum nigrificans DSM 574] gi|323532995|gb|EGB22866.1| Protein of unknown function DUF2134, membrane [Desulfotomaculum nigrificans DSM 574] Length = 342 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/257 (8%), Positives = 67/257 (26%), Gaps = 21/257 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHA------------L 53 + K+L G ++ AL M V++G ++ D + + + Sbjct: 33 RLSNMLKQLASREHGSAMVLLALAMTVLIGAVALVTDFGLVALNKSRLANAADAAALAGV 92 Query: 54 KQAAQ--TAIITASVPLIQSLEEVS-------SRAKNSFTFPKQKIEEYLIRNFENNLKK 104 ++ + + + Q + + + ++ N+ Sbjct: 93 QELPEYPMEALAVARSFGQQNGITDEQMQIKLVQDLATSRYVGVEVTATKTVNYIMAKVL 152 Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 +T +V+ + + A + + L+ + Sbjct: 153 GYTTVDVQAKAAAQVIPVTSAYGAVPLFVPDTQQLIFGQQYTLRSKDADFGPGNFGALDF 212 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + K Q VI S + + + + + + + + Sbjct: 213 GSGANDFEKTLAQGYQGVIKVSYEIDTLTGVKKNKTIWGIEERIRGCTHGCTYDHFQPDC 272 Query: 225 RDEKLSPYMVSCNKSLY 241 + P + + + Sbjct: 273 PRVMILPVVHYDKLNGH 289 >gi|189485267|ref|YP_001956208.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287226|dbj|BAG13747.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 330 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 54/174 (31%), Gaps = 43/174 (24%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + ++ +H L + + +G T I+ A+ A + Sbjct: 131 VVFSGTAMWQCPLTFDLHALKMFLQSVETTNL-PLGGTRISSAIMLASKAASCESAGS-- 187 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY-- 417 + ++L++DGEN + + N AK G+RI++I + + Sbjct: 188 ---------RVMILISDGENH--DSKIKEAVNAAKKAGLRIISIGIGKKEGAPIPVKDET 236 Query: 418 ----------------------FLSNCA--SPNSFFEANSTH---ELNKIFRDR 444 L N A +F+A L K R+ Sbjct: 237 GTVIDYVKDVNGKVVISRANPVLLKNVAEEMGGKYFDALDDDVYPSLVKAVRNL 290 >gi|167624203|ref|YP_001674497.1| hypothetical protein Shal_2279 [Shewanella halifaxensis HAW-EB4] gi|167354225|gb|ABZ76838.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4] Length = 446 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 15/183 (8%), Positives = 47/183 (25%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 I +G ++ + + ++ + +D + L+ A A + A+ L + Sbjct: 15 IGRQSGAILVMFTIGIFAVIAFAALALDGGHMLLSKGRLQNAVDAAALNAAKELQEGATL 74 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + +R + + + + R ++ Sbjct: 75 LEAREAAYTILLQNLSFTENGELNTSVSLSSPDFNNTQVTPRLQVEFSELPDPFNPILAE 134 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + + ++ L + + A V+ V+ + Sbjct: 135 GSEYVRVRVENVKLSNFLADILNFDKEIRASAVAGRSQDLACVNKILPLLVCGKEGSTAE 194 Query: 195 DSE 197 D+ Sbjct: 195 DNY 197 >gi|150005796|ref|YP_001300540.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294776175|ref|ZP_06741664.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|149934220|gb|ABR40918.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294449998|gb|EFG18509.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] Length = 340 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/221 (10%), Positives = 57/221 (25%), Gaps = 44/221 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAEDVTPSRLDKSKKLISRLVDTFN-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++T T I A++ A + + I+++T Sbjct: 150 YVSA-KMFLETINPSLITTQGTDIGAAIRLAMKSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------- 418 DGEN + A K G+++ + A Sbjct: 198 DGENHEGGAVEAAQEAAEK--GMQVFVLGVGSPDGSPIPAEDGSNNFRRDKDGNVIVTRL 255 Query: 419 -LSNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 C A + ++T+ K+ + I V Sbjct: 256 NEQMCQEIAKAGNGMYIRVDNTNSAEKVLNNEIAKLSKADV 296 >gi|186681468|ref|YP_001864664.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463920|gb|ACC79721.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 426 Score = 48.8 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 51/193 (26%), Gaps = 37/193 (19%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A+ +I +K D R+ F+ V +I + G Sbjct: 64 QAVEGLIDRLKVGD------RISVVAFSGSVEVIIPNQVIEDP--ESIKSQIKSKLSASG 115 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------------DN 383 TAI + ++ ++ V + LLTDG DN Sbjct: 116 GTAIAEGLELGITELMKGTRGAVSQA----------FLLTDGHGESSLRIWKWDIGRDDN 165 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + + + KA + I T F + L A + + + F Sbjct: 166 KRCLKLAQKAAKLNLTINTFGFG-----NSWNQDLLEKIADVGGGTLAHIEHPEQAVEQF 220 Query: 442 RDRIGNEIFERVI 454 G + Sbjct: 221 SRLFGRIQSIGLT 233 >gi|324112804|gb|EGC06780.1| von Willebrand protein type A [Escherichia fergusonii B253] Length = 530 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/297 (10%), Positives = 78/297 (26%), Gaps = 35/297 (11%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------- 217 T S +++ + + S+ + +N F Sbjct: 74 ATFSLDVDTGSYANVRRFL-KTGSLPGADVVRVEELVNYFPLTEATKKNIPGCKGCEENS 132 Query: 218 -QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + + + + P + +D+S + ++ L++ Sbjct: 133 PFSINYELTPAPWNEKHTLLRLDIAANDIARSKLPPANLVFLIDTSGSMNSDERLPLIKS 192 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 L ++ ++ D ++ S + + + G Sbjct: 193 LLKLLVNELRDQDRISIV---------TYAGSARLLLSSTSGSEKNTILNAIANLQAGGG 243 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAK 394 T + AY+ + + N I+L TDG+ + + K + Sbjct: 244 TNGGAGVAMAYE----QAQAGYIKGGVNR-----ILLATDGDFNIGDDPSSVEDLVKKQR 294 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 GI + T+ + A ++ +S E K+ + + + Sbjct: 295 ESGITLSTLGVG----DNNYNEAMMVKIADTGNGNYSYLDSLSEAQKVLSNEMNQTL 347 >gi|254882022|ref|ZP_05254732.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|319640970|ref|ZP_07995678.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] gi|254834815|gb|EET15124.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|317387415|gb|EFV68286.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] Length = 340 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/221 (10%), Positives = 57/221 (25%), Gaps = 44/221 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAEDVTPSRLDKSKKLISRLVDTFN-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++T T I A++ A + + I+++T Sbjct: 150 YVSA-KMFLETINPSLITTQGTDIGAAIRLAMKSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------- 418 DGEN + A K G+++ + A Sbjct: 198 DGENHEGGAVEAAQEAAEK--GMQVFVLGVGSPDGSPIPAEDGSNNFRRDKDGNVIVTRL 255 Query: 419 -LSNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 C A + ++T+ K+ + I V Sbjct: 256 NEQMCQEIAKAGNGMYIRVDNTNSAEKVLNNEIAKLSKADV 296 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 42/113 (37%), Gaps = 7/113 (6%) Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + + R + ++LLTDG++ A+ ++ K + I + F Sbjct: 159 WTWLYQATLEAMERAVQAPTTRAAVILLTDGKDEGSPVSEEAVLDRIKGAQVPIYAMGFG 218 Query: 407 VNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 +L AS +F +L +++ + + ++R T Sbjct: 219 SKA-----QVDYLQKVASASQGAFLSTPQAADLTNLYQTVLDYLKNQYILRFT 266 >gi|220897447|emb|CAX15331.1| complement component 2 (within H-2S) [Mus musculus] Length = 1276 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 841 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 897 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 898 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 956 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 957 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 989 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 48/185 (25%), Gaps = 18/185 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAIND 341 + + ++ + ++ + + L + + T + Sbjct: 136 QSNGVWSGSEPICRQPYSYDFPEDVASALDTSLTNLLGATNPTQNLLTNHENATGTNTYE 195 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR- 399 + Y + S + E + I+LLTDG++ + + + I Sbjct: 196 VLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDS-PKKAVTRIRELLSIEQ 254 Query: 400 -------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 I I ++ S F L +IF Sbjct: 255 NRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEHM------LD 308 Query: 453 VIRIT 457 V ++T Sbjct: 309 VSKLT 313 >gi|220897445|emb|CAX15329.1| complement component 2 (within H-2S) [Mus musculus] Length = 970 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 535 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 591 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 592 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 650 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 651 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 683 >gi|218156291|ref|NP_001136178.1| complement factor B isoform 2 [Mus musculus] gi|220897443|emb|CAX15327.1| complement factor B [Mus musculus] Length = 713 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 328 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 384 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 385 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 443 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 444 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|218156289|ref|NP_032224.2| complement factor B isoform 1 [Mus musculus] gi|220897442|emb|CAX15326.1| complement factor B [Mus musculus] Length = 763 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 328 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 384 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 385 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 443 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 444 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|148694792|gb|EDL26739.1| complement factor B, isoform CRA_b [Mus musculus] Length = 760 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 325 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 381 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 382 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 440 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 441 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 473 >gi|148694794|gb|EDL26741.1| complement factor B, isoform CRA_d [Mus musculus] Length = 731 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 296 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 352 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 353 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 411 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 412 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 444 >gi|148694797|gb|EDL26744.1| complement factor B, isoform CRA_g [Mus musculus] Length = 541 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 106 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 162 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 163 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 221 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 222 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 254 >gi|387437|gb|AAA39549.1| MHC factor B [Mus musculus] Length = 477 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 42 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 98 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 99 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 157 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 158 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 190 >gi|74195997|dbj|BAE30555.1| unnamed protein product [Mus musculus] Length = 436 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 1 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 57 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 58 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 116 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 117 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 149 >gi|116126|sp|P04186|CFAB_MOUSE RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|192412|gb|AAA37379.1| factor B [Mus musculus] gi|192414|gb|AAA63293.1| factor B [Mus musculus] gi|2944424|gb|AAC05283.1| complement factor B [Mus musculus] gi|3986764|gb|AAC84160.1| Bf [Mus musculus] gi|13529437|gb|AAH05451.1| Complement factor B [Mus musculus] gi|74146351|dbj|BAE28943.1| unnamed protein product [Mus musculus] gi|148694791|gb|EDL26738.1| complement factor B, isoform CRA_a [Mus musculus] Length = 761 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 38/154 (24%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + D T A+Q Y + + + I+ Sbjct: 326 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWNRTRH---VII 382 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 ++TDG + + + ++ + R + Sbjct: 383 IMTDGLHNMG-GNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALAS 441 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + F+ +L +F I + Sbjct: 442 KKD-NEHHVFKVKDMEDLENVFYQMIDETKSLSL 474 >gi|119386037|ref|YP_917092.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376632|gb|ABL71396.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 855 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 58/192 (30%), Gaps = 24/192 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDN-VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 I K + RD + ++ N T + + + G + Sbjct: 45 INKITIARDVVGDIVSDFPADQNLGFVTYGHRERGQCADIETLVEPAPGTA----AEIAG 100 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 N G T + DA+ TA + + + ++L++DG T N + Sbjct: 101 IVEGLNPRGMTPMTDAVVTAAQALRHTEQAAT------------VILVSDGIETC-NPDP 147 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 A + G+ I F V + + A + F A++ ELN+ R Sbjct: 148 CAAARALEEAGVDFTAHVIGFDVRGEADALLQ--MQCIAEETGGRFLTADNAQELNEALR 205 Query: 443 DRIGNEIFERVI 454 + Sbjct: 206 EVTAAPASGSFT 217 >gi|330811037|ref|YP_004355499.1| hypothetical protein PSEBR_a4091 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327379145|gb|AEA70495.1| Hypothetical protein; putative exported protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 426 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/252 (8%), Positives = 50/252 (19%), Gaps = 8/252 (3%) Query: 42 DVVRWSYYEHALKQAAQTAIITASVPLIQ---SLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 D + L+ A A + + L Q + S+ + + Sbjct: 21 DGGHMLLNKTRLQNAVDAAALGGAKTLSQVTGGMNMASTTRAAALDTLSRNASAVGNAEL 80 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + N ++ + + + + S + Sbjct: 81 ATAVAGNPGAFAAVELSSSVYGPFSYPGPTDAKYVRVSVANYQLNGFFWSFVQSMGSGGL 140 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 KA A + + ++ + + N + V ++ Sbjct: 141 GGKAVAAIATAGPSPTSPCDVAPLM-----VCGDASQYDPAAGNFWSYHFGDLVVLKTAA 195 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 I S G + V +S V + Sbjct: 196 GNTSPIGPGNFQLLDFGSGGSSVREDLAGGGSVCRAVGDNVQTSPGNTVGPASQGLNTRF 255 Query: 279 ASVIRSIKKIDN 290 + D Sbjct: 256 GIYNGPVSASDY 267 >gi|118579649|ref|YP_900899.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118502359|gb|ABK98841.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 337 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 47/139 (33%), Gaps = 14/139 (10%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + + + + I+K + + A+ T Sbjct: 138 FASKGVPIVPLTNDYGYC-QYILKHANDSTISTPGSDLGQAITTGIYLF----------E 186 Query: 362 KNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 +++ + K IVL++DGE+ +D+ +A ++GI I T+ + R + Sbjct: 187 ESSRTSVKSIVLISDGEDINEDSSVMHEAAQRAAAKGIAIYTVGTGRGQGVMVPIRDAIG 246 Query: 421 NCASPNSFFEANSTHELNK 439 A +++ L Sbjct: 247 A-AIEG-YYQDEDGSYLKT 263 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 56/169 (33%), Gaps = 32/169 (18%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D +R+G T ++ ++ + +++ + G+T A+ + Sbjct: 282 DKIRVGLTQYSSDPRTEWDL--NTYATRDEVLEALRSLRYKGGNTFTGLALTHVLE---- 335 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + EA K I+LLTDG +QD+ K+ GI+I + + Sbjct: 336 --HNLKADTGARSEAPKLIILLTDG-KSQDDANPP--AQVLKNMGIQIFAVGV------K 384 Query: 413 EKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L AS + + F + V R+T+ Sbjct: 385 NADETELRQVASDPLELNVYNVLD-------FP-----LLSSLVGRLTR 421 >gi|302868694|ref|YP_003837331.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504835|ref|YP_004083722.1| von willebrand factor type a [Micromonospora sp. L5] gi|302571553|gb|ADL47755.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315411454|gb|ADU09571.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 316 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 68/205 (33%), Gaps = 31/205 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + ++A + + ++ +G F + + Sbjct: 105 SPNRLEAAQEAAKQFVGEL------PESYNLGLVSFAKSANVLVPPTKDRAAVTTA---- 154 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 +TA +A+ T + I S IVLL+DG T Sbjct: 155 -IDGLVLAEATATGEAVFTCLEAIRSV-----PADGAAGIPPARIVLLSDGYRTSGRSVE 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEANSTH 435 A A++ + + TIAF + Q + R LS A + F+EA S Sbjct: 209 EAA-AAAQAANVPVSTIAFGTDSGQVDIGGQLQRVPVDRTALSQLAETTQGFFYEAASAS 267 Query: 436 ELNKIFRDRIGNEIFERVI--RITK 458 EL ++++D G+ I R IT+ Sbjct: 268 ELKQVYQDM-GSSIGYRTEPREITQ 291 >gi|261883870|ref|ZP_06007909.1| hypothetical protein CfetvA_01086 [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 216 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 58/186 (31%), Gaps = 18/186 (9%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN----DRVISDPSFSWG 315 +++ + + A+ S + S+ + NV + + R S + Sbjct: 42 SMANIEGDQTRMEAAKSAINSFMESLPQEANVGLRIYGHEGSGSKTDKARSCSSSELVYS 101 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + K + G T I A++ A + + L++ Sbjct: 102 IDKYDPAKFNQALVKAKPAGWTPIGFALKEAQKDL---------AAFQGDANTNIVYLVS 152 Query: 376 DGENTQDNEEGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 DG +T D+ + + I I I F+V++ Q++ + + ++ Sbjct: 153 DGISTCDD-DPVGSAKALYDSDITPIINVIGFNVDQEGQKQLQEVAK--VTEGTYQNVQM 209 Query: 434 THELNK 439 L Sbjct: 210 AEGLYD 215 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 54/156 (34%), Gaps = 15/156 (9%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + V+ GA ++ I + + G T A++ A Sbjct: 845 ANVGRGQVQFGALKYSKEPEDLFYL--NTFSKGAAITENLRRRRDTYGETYTAKALEHAN 902 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 E H + K+ ++++TDG + D E+ K +++GI I + Sbjct: 903 SQFT-----EEHGSRIKQNVKQMLIVITDGV-SHDREQLSDTALKLRNKGIIIYAVGVG- 955 Query: 408 NKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFR 442 E +Y L A N+ ++ +L I++ Sbjct: 956 -----EADQYELEAIAGDKNNTRHVDNFDKLKDIYQ 986 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 46/130 (35%), Gaps = 14/130 (10%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 FS GV+ + K + G T A+ I R + + Y+ Sbjct: 493 FSIGVYSNDVDLRKAVLNIKQLTGDTHTGAALAFMLPKI------REGRKQRPSKVPCYL 546 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFE 430 ++LTDG+ +QD+ + N+ +++G+ I + E + L A Sbjct: 547 IVLTDGQ-SQDHHRILETANRIRAEGVTIHAVGIG------EADKTELQQIAGNEERVHF 599 Query: 431 ANSTHELNKI 440 + L I Sbjct: 600 GQNFDSLKSI 609 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 46/157 (29%), Gaps = 21/157 (13%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDT 349 D ++G F+ + + + + +T A++ Sbjct: 662 GPDKTQIGVVQFSTDAKEEFQL---NKYFTQKEISDAIDRMSLINQNTYTGHALEFV--- 715 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 ++ H KK+++L+TDGE + + + + I ++ Sbjct: 716 ----DQYFSHLKGARPGVKKFLILITDGEARDGVRDP---ARALREKEVVIFSVGV-YGA 767 Query: 410 TQQEKARYFLSNCASPNS-FFEANSTHELNKIFRDRI 445 R L + S F+ +L I I Sbjct: 768 N-----RTQLEEISGDGSLVFQVEKFDDLKAIESKLI 799 >gi|157694070|ref|YP_001488532.1| hypothetical protein BPUM_3319 [Bacillus pumilus SAFR-032] gi|157682828|gb|ABV63972.1| hypothetical protein YwmC [Bacillus pumilus SAFR-032] Length = 233 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 49/145 (33%), Gaps = 23/145 (15%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +GV + + G T I A++ A T N H + L Sbjct: 108 YGVQPYEKESFENSLNGLGPNGWTPIARALEHAKQTDEQLNNGTKH----------IVYL 157 Query: 374 LTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +TDGE T + + + + + + I N + + + A +++A Sbjct: 158 ITDGEETCG-GDPVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAK--AGKGHYYQA 214 Query: 432 NSTHELNKIFRDRIGNEIFERVIRI 456 ++ E+ G+ + +++ Sbjct: 215 STGKEM--------GSILSAESLKL 231 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 171 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 225 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 226 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 279 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 280 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 309 >gi|186470651|ref|YP_001861969.1| putative transmembrane protein [Burkholderia phymatum STM815] gi|184196960|gb|ACC74923.1| putative transmembrane protein [Burkholderia phymatum STM815] Length = 372 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/249 (11%), Positives = 69/249 (27%), Gaps = 11/249 (4%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI---------ITASV 66 + G +ITA+ M +LG+ + +D+ + L+ AA A Sbjct: 7 QKQRGSVAVITAVSMVSLLGLAALAIDIGNLLVSRNELQNAADAAALAGAPCLYQRAQCG 66 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF--ENNLKKNFTDREVRDIVRDTAVEMNP 124 + + ++ + + +F + + + + +T Sbjct: 67 NTTATEPDWTTATQKASSFATASTSNTVQGSVIKVAQTGSGYWNITGSPGTLETVPFTPG 126 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + ++ S+ + I + + T + S G + + Sbjct: 127 TNDLPAIQVTITKSAANANGSVPVYLASILGVTSLSASATATAAVSRPGYVGPGGLFPLA 186 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S+ + D DS A + ++ I S + C + L Sbjct: 187 MSKCLFDNYWDSSTNSPKLATSTAKIPGQDFNQTPNTPYIFQVSSSYQVNGCEAGQWTTL 246 Query: 245 YPGPLDPSL 253 + + Sbjct: 247 TSQQNNVTF 255 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 212 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 266 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 267 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 320 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 321 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 350 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 42/154 (27%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + +++ N I+ Sbjct: 831 SSNADWVTKKLSEINYEDHKLKSGTNTKRALQAVYSMMS--WPEDIPPEGWNRTRH-VII 887 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 888 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVDQVNINALAS 946 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + F+ L +F I + Sbjct: 947 KKD-NEQHVFKVKDMENLEDVFFQMIDESQSLSL 979 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 54/184 (29%), Gaps = 17/184 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 + + + + ++ P+ ++ T ++E G T A Sbjct: 130 QGNGVWSGTEPICRQPYSYDFPEDVAPALGTSFSHMLGATNPTQRTKDHENGTGTNTYAA 189 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR-- 399 + + Y + + + + E + I+LLTDG++ ++ + I Sbjct: 190 LNSVYLMMNNQMQLLGMKTMAWQEIRHAIILLTDGKSNMGGS-PKTAVDQIREILNINQK 248 Query: 400 ------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I + S F T L+++F V Sbjct: 249 RNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEHM------LDV 302 Query: 454 IRIT 457 ++T Sbjct: 303 SKLT 306 >gi|261194779|ref|XP_002623794.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239588332|gb|EEQ70975.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 756 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/221 (10%), Positives = 56/221 (25%), Gaps = 29/221 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + + + L + A ++I ++ D R+G F Sbjct: 81 DISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDND------RLGVVAF 134 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S + +K + ST + ++ + + + + Sbjct: 135 STDAEVVYKISNMNEDNKKAALKA-VEALWPLSSTNLWHGLKLSLEALEEVTPIPQNV-- 191 Query: 363 NNLEAKKYIVLLTDGENT------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + +LTDG + +I + K + I T F Sbjct: 192 ------QALYILTDGMPNHMCPRQGYVPKLRSILQQ-KDRLPMIHTFGFGYYI-----RS 239 Query: 417 YFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L ++ + +F I N + Sbjct: 240 GLLQAISEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFATQ 280 >gi|156744078|ref|YP_001434207.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235406|gb|ABU60189.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 429 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 63/183 (34%), Gaps = 21/183 (11%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 K + R AL ++I + + NV SD + L R + Sbjct: 189 TPKTVIARQALIALINRLPETTNVALRTYGHRR---ADDCSDTELIQALAPLQRDALIAR 245 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + A ++ +D N IVL++DG+ T + + Sbjct: 246 INAIRPVNGGRTPIA-----QSLADMAQDLAGIEGNV-----LIVLVSDGDETCG-GDPV 294 Query: 388 AICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLS--NCASPNSFFEANSTHELNKIFRD 443 A + ++ +RI I F V +QE+ R L A ++F+A++ +L Sbjct: 295 ATASMLRAANSQLRISVIGFDV---EQEEWRRRLEGIAVAGGGAYFDASNAEQLADALDQ 351 Query: 444 RIG 446 I Sbjct: 352 AIA 354 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 63/197 (31%), Gaps = 21/197 (10%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN-DRVISDPSFSWGVHKLIRT 322 R AL +I+ + + D+ + + G+ K Sbjct: 176 DDSSSVDDTAFRSALNWIIQVVSYFSSYIDSGDLRVGVYGFSNDDHRSGIRIGLRKWTSA 235 Query: 323 IVKTFAIDE----NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +K + + T I+ A++ ++ K I+LLTDG Sbjct: 236 TLKKQIGELLNVKSTGAGTYISHAIKETVKVFEANGR---------KGISKEIILLTDGG 286 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHEL 437 + D + A+ +GI ++++ + L A + F+A + L Sbjct: 287 AS-DWWLLKGEADTARDKGIVLVSVGVGTSVN-----NDQLLAIAGNKSRVFQATDYNTL 340 Query: 438 NKIFRDRIGNEIFERVI 454 +++ + + ++ Sbjct: 341 DEVVNNVVSTIDAIKIT 357 >gi|268572467|ref|XP_002648969.1| Hypothetical protein CBG21291 [Caenorhabditis briggsae] Length = 427 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 59/178 (33%), Gaps = 10/178 (5%) Query: 269 KKKHLVRDALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V ++ + + + +++ ++ A F + F+ + +++ Sbjct: 250 TTTDPVYNSYKDLSKRLVQQLKIGPHYTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGI 309 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ G+TAI ++ A I S K +++ TDG + + + Sbjct: 310 DKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT------KVMIVFTDGWSNKG-PDPE 362 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ-EKARYFLSNC-ASPNSFFEANSTHELNKIFRD 443 A + G + T+A++ LS S F + L + Sbjct: 363 KRARDAVNAGFEMYTVAYTARAPGSVTLNNETLSAISGSSGHAFTDVTFQTLVDKIKQ 420 >gi|326678379|ref|XP_002666248.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Danio rerio] Length = 1089 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/299 (13%), Positives = 89/299 (29%), Gaps = 34/299 (11%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS- 216 +KAE + + + + +V D S + Q + A + + Sbjct: 127 YNSKAEMDYDGEDIDSQMSLKLDFVYDPSFKNQVNYSHTAVQIPTDIYKGAPVILNELNW 186 Query: 217 ----SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + RD+ + + + YP + + D I+ Sbjct: 187 TQALERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQGAS 246 Query: 273 LVRDALASVIRSIKKI---------------DNVNDTVRMGATFFNDRVISDPSFSWGVH 317 +D + V S D ++D + FN++ + + Sbjct: 247 SPKDMVILVDVSGSVSGLTLKLIKASVTEMLDTLSDDDYVNVARFNEKAEAVVPCFDHLV 306 Query: 318 KLI---RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + I K G+T A++ ++++N K I+L Sbjct: 307 QANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNTNV-------PRANCNKIIMLF 359 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 TDG + + ++ +R+ T + + ++ C++ +FE S Sbjct: 360 TDGGEDRAQDIFEQY--NWPNKTVRVFTFSVGQHNYDVTPLQWI--ACSNKGYYFEIRS 414 >gi|270008949|gb|EFA05397.1| hypothetical protein TcasGA2_TC015569 [Tribolium castaneum] Length = 2194 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT--FAIDENEMGSTAINDAMQTAYDTI 350 D + D +++ + V I K D+ MG T I + + Sbjct: 307 DFMEKFDYENLDSCLTELNLGNAVQFTEDNIKKAKGIKNDDMHMGCTNIIGGLVVGLFLV 366 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIR---IMTIAF 405 + + + + I+LLTDG + + I +QG I +++F Sbjct: 367 R-RTLKKNYEQNVETKHQPMIILLTDGLPNVGLSDPVEITKIVTKINQGTNRAAIFSLSF 425 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + N +EA + F + Sbjct: 426 GEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRTV 465 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 38/124 (30%), Gaps = 6/124 (4%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + + + + + + I+ LTDG + I Sbjct: 1108 GCTNIIGGLVVGLYLVR-RTLQKFYEKNVETKHQPMIIFLTDGLPNEGISNPDKITKIVT 1166 Query: 395 --SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +QG I +++F + + + N +EA + F + + + Sbjct: 1167 KINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRTVSSPL 1226 Query: 450 FERV 453 V Sbjct: 1227 LRDV 1230 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 53/189 (28%), Gaps = 13/189 (6%) Query: 271 KHLVRDALASVIRSIK-KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 V + + IK + + FN + + K + Sbjct: 1837 IVSVWNFEKNRFDQIKFAQKPEYENLDSFLAEFNLG-DAAQVSEDNIKKAKEIKDHDVDM 1895 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T I + + + E + + + + I+ LTDG I Sbjct: 1896 D-----CTNIIGGLVVGLYLVRQTLE-KFYEKNIETKHQPMIIFLTDGLPNVGLIIRDEI 1949 Query: 390 CNKAK--SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + +QG I +++F + + + N +EA + F Sbjct: 1950 TDVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRT 2009 Query: 445 IGNEIFERV 453 + + + V Sbjct: 2010 VSSPLLRDV 2018 >gi|189238315|ref|XP_972124.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT--FAIDENEMGSTAINDAMQTAYDTI 350 D + D +++ + V I K D+ MG T I + + Sbjct: 352 DFMEKFDYENLDSCLTELNLGNAVQFTEDNIKKAKGIKNDDMHMGCTNIIGGLVVGLFLV 411 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIR---IMTIAF 405 + + + + I+LLTDG + + I +QG I +++F Sbjct: 412 R-RTLKKNYEQNVETKHQPMIILLTDGLPNVGLSDPVEITKIVTKINQGTNRAAIFSLSF 470 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + N +EA + F + Sbjct: 471 GEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRTV 510 >gi|221135528|ref|XP_002156134.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 194 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 50/179 (27%), Gaps = 19/179 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + + +D L + S + + G F+ + Sbjct: 20 DSSGSLKSQYWKEKDFLKKLANSFGISNKGSH---AGVVTFSHYAELSIRLDAFYSSIDF 76 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T I+ A+ A + N + + LLTDG+ + Sbjct: 77 NDAVDRISHMDSF--TRIDLALAKALELFDIKN-------GARNDVPNLLFLLTDGK-QE 126 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNK 439 I ++ K +GI++ + + L P + F + +L Sbjct: 127 PEMPLTHISDEIKQKGIQLFAVGIGAGAN-----KTELEKIVGNPENVFMVDDFDKLLN 180 >gi|116625802|ref|YP_827958.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228964|gb|ABJ87673.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 326 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 64/189 (33%), Gaps = 31/189 (16%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + + + + +A+ +++++ D F+DR + ++ Sbjct: 104 FDTSDSMQPRMNKAHEAVEALLKNANPAD------EFFLVQFSDRARLVAGMTKDSEEIS 157 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R GSTA+ DA+ A + + K+ +K +V+++DG++ Sbjct: 158 RRAASMRI-----GGSTALLDAVAMAMEEM-----------KSAHYLRKVMVIISDGDDN 201 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNK-------TQQEKARYFLSNCA--SPNSFFEA 431 + + + I I + + + L+ A + FE Sbjct: 202 SSRCPVNDLKRIVREGDVTIYAIGITDDNVPLAYPQRDRLTGAALLNEIATQTGGRLFEV 261 Query: 432 NSTHELNKI 440 + +L +I Sbjct: 262 HKLKQLPEI 270 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/280 (13%), Positives = 79/280 (28%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + N F + S G+ S + S Sbjct: 97 VTLSNVSERKDNMRLGLSLATNPKDNSFLACSPLWSHECGSSYYTTGMCSRVNSNFRFSK 156 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 157 TVAPALQRCQTYMDIVIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 206 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + V+ + + + I++ T ++ A Sbjct: 207 VGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEFA--------RS 257 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE+ + I K + +A ++ Sbjct: 258 EAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSEKD-NVTRYAVAVLGYYNRRGINP 316 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G+ I Sbjct: 317 EAFLNEIKYIASDPDDKHFFNVTDEAALKD-IVDALGDRI 355 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/324 (11%), Positives = 80/324 (24%), Gaps = 71/324 (21%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + ++ N +++ + Sbjct: 66 IPKPFETAIPAKEIVLVLDISGSMDEEIENPCTNKRVRYCTRHSSSDPNHEEWFLSWHRW 125 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI----------------RSIKKIDNVN 292 ++ E + ++ KK ++ A I SI I+ + Sbjct: 126 INDYCVEHNTSGEHNITANNKKIDELKRAANGFIERMKDVPNLKIGIVAYSSIATINPNS 185 Query: 293 DTVRMGATFFNDR----VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 + + V + S + + + + + +G T I + M+ A Sbjct: 186 KSGTKKVKSLDSNSSHDVTNYNSLGANFLQSNDSRLTSVINNLEALGGTNIGEGMRKAVY 245 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-------------NEEGIAIC----- 390 + S + A K IVL+TDG T + I Sbjct: 246 MLDSG----------DKSASKTIVLMTDGLPTFYSVTGSNKNNYMTIDNTEPKIAGIGTG 295 Query: 391 -------------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS------PNSF--- 428 KS+G +I + ++ + P+ + Sbjct: 296 LDTKSINYSRAVGEIIKSRGYNSFSIGYGLDTDGNDTLLSIHEAMTGVSIKGKPDLYESS 355 Query: 429 -FEANSTHELNKIFRDRIGNEIFE 451 F ST+ + +F + Sbjct: 356 GFFPTSTNAIQAVFNRIATQILDS 379 >gi|170720024|ref|YP_001747712.1| hypothetical protein PputW619_0838 [Pseudomonas putida W619] gi|169758027|gb|ACA71343.1| conserved hypothetical protein [Pseudomonas putida W619] Length = 651 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 61/216 (28%), Gaps = 6/216 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G ++ + + L ++VD R + L++ A A + A+ Sbjct: 6 AARQRGAIGLMAVTTLALALLFMLVVVDSGRLYLEQRKLQRIADMAALEAAGQSAVCTGN 65 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + L + + T N R A +V +S Sbjct: 66 GPQATAIASVAAARNGHT-----PGSPLVASCGYLQTGANSLRTFTSDNARNEAIRVDVS 120 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + L G +A A S+ + +SI+ + S Sbjct: 121 NTVTTSFAAGIYILTQGGEVPLTSTLQAHAV-ASKPLPPQAMLSIRTTLATVDSRQSALL 179 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 D L Q K ++ + V ++L+ Sbjct: 180 DGLLGALGGNVQLGLAGWKGIAATDLNVLNYLDQLA 215 >gi|17533681|ref|NP_496741.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876681|emb|CAB03055.1| C. elegans protein F35C5.5a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 389 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 53/160 (33%), Gaps = 9/160 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSF--SWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I T R+G +N L T+ + D + + Sbjct: 66 TRIGTNYIDKRTTRVGLVTYNTEATIQADLNRFQSPDDLFSTVFQILPNDLSTSEDVFLA 125 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + A + + + K+ +++ N + E+ I + K+ G+ I Sbjct: 126 KGIGAAEQLLAAG-----RKNNTRKNYKQMVIVYASAYNDEGEEDPRPIAERLKASGVSI 180 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 T+AF ++T E+ + A+P F + + +I Sbjct: 181 ATVAF--DQTGDEEMIKLIGEIATPGFNFTNEDENLVKEI 218 >gi|260577971|ref|ZP_05845896.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] gi|258603897|gb|EEW17149.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] Length = 551 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/322 (13%), Positives = 87/322 (27%), Gaps = 45/322 (13%) Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP-------- 200 ++ GI ++ ++ E D+ S L ++ + Sbjct: 245 KADGIFNYQSVLQSMKENDGVDIEVIVPGDAAGAADYPLSALTKPGETGPEADTVAKVEA 304 Query: 201 -LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD----PSLSE 255 + F ++ + ++ Y + Sbjct: 305 LADWFKVHSEEVGERTHLIVSGKDAENKGTQNYYEYPRSKDHIRSLVSLYRDDLRRPADS 364 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND----TVRMGATFFNDRV---IS 308 +D+S + ++ L +I + F+ +V + Sbjct: 365 NFLLDTSGSM-RGNRLADLKGILNRLIDGTAGEAGNPKGFGRRETITLMPFSSKVADGYT 423 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + R ++ + G TAI DA+ AYD + Sbjct: 424 QEHYDPDSAEQSRG-LRDYVNGLQPRGETAIYDAVLRAYDRVGEGGGSLN---------- 472 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSNC 422 IVL+TDGE+ I K K + I + I + + + Sbjct: 473 -SIVLMTDGESNSGTNRQEFI-TKMKRKMAETDRKIPVFVILYG----EASEDEMNFLAD 526 Query: 423 ASPNSFFEANSTHELNKIFRDR 444 + F A S +L+K F + Sbjct: 527 FTGGKVFNARSG-DLSKAFEEI 547 >gi|307719356|ref|YP_003874888.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] gi|306533081|gb|ADN02615.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] Length = 331 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 36/149 (24%), Gaps = 40/149 (26%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T + + A + ++ ++LL+DGE + A Sbjct: 165 SPGTDVASGLSRALE-----------AFPQQSNRQRLVILLSDGEALTGEIGP--VLELA 211 Query: 394 KSQGIRIMTIAFSVNKTQQEK-------------------ARYFLSNCA--SPNSFFEAN 432 ++ G+ + T+ L A + FF Sbjct: 212 RNLGVAVHTVGIGTESGGPVPLEGEDVLKKPSGEPVISRLDASLLKRIAEITGGRFFSVR 271 Query: 433 STHELN-----KIFRDRIGNEIFERVIRI 456 + I E E + R+ Sbjct: 272 DAEGQTFQHVVSTIEETIAREEREGI-RL 299 >gi|291575288|gb|ADE10213.1| hypothetical protein [Actinoplanes liguriensis] Length = 432 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 46/141 (32%), Gaps = 9/141 (6%) Query: 12 KKLIKS---CTGHFFIITALLMP-----VMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 ++LI + G +ITAL+ V+LG+ +++D+ L+ A A Sbjct: 2 RRLIHALFPPRGEHGVITALVAVLAGAGVLLGMAALVIDIGALYAEREQLQSGADAASWK 61 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + + A + + + Y RN ++ + T Sbjct: 62 VAQACAGTAGRDLTSATCTVAAQRDNAQRYADRNAKDLVSDVQFCITTVSAAGVT-TADA 120 Query: 124 PRKSAYQVVLSSRYDLLLNPL 144 S++ ++ + Sbjct: 121 GCPSSWNTPVTCPAPPSASGP 141 >gi|190894969|ref|YP_001985262.1| hypothetical protein RHECIAT_PC0000635 [Rhizobium etli CIAT 652] gi|190700630|gb|ACE94712.1| hypothetical conserved protein [Rhizobium etli CIAT 652] gi|327193255|gb|EGE60161.1| hypothetical protein RHECNPAF_1700074 [Rhizobium etli CNPAF512] Length = 176 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 42/127 (33%), Gaps = 7/127 (5%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L+ G I A+L + + +++V + A ++ S + Sbjct: 10 LRRLLGDRQGVAAIEFAILALPLFIMLFGIIEVSLMFFVNS----ALDASVHKISRMIRT 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + +K + K KI + ++ +F + + D+ T+ + Sbjct: 66 GE---VASSKITLAGFKAKICDDMLLSFNCSSDLVVKVNVLSDLSSATSTDPIDNSGNLA 122 Query: 131 VVLSSRY 137 + + Sbjct: 123 ITQTFDV 129 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 73/233 (31%), Gaps = 21/233 (9%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 + + N+ E ++ R + + I + Sbjct: 2 TATVVSEVNRPYVPTGGAKLTAEIEVEPGRLEEPPTRQIALCIDASGSMAGNDIEQARAG 61 Query: 289 DNV-----NDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDA 342 ++ + F++ V + + + WG + R D + G T + Sbjct: 62 AEWVFGLLDEDDYVSIIAFDNEVTTVLAPTRWGT--ISRETAVDAVADISAGGGTDMYSG 119 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG-IAICNKAKSQGIRIM 401 + A ++ + + + ++LL+DG++ + E + + ++GIRI Sbjct: 120 LLEAKASLQD--------LPTDDNTARRVLLLSDGKDNSHDPEAFGTLAREIDTEGIRIK 171 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +E+ L A + ++ ++ F D + E V Sbjct: 172 AAGIGSD--YREETIRTLGTVA-RGEWTHLDAAGDIESFFGDAV-EEAGTVVA 220 >gi|330466229|ref|YP_004403972.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328809200|gb|AEB43372.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 319 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 71/200 (35%), Gaps = 25/200 (12%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + A + + +G F S L ++ Sbjct: 104 DPDRLSAAKSAARDFVDGL------PREFNVGLVAFAGSAAVLVPPSTDREAL-HDGIRR 156 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 A + TAI +A+ T+ + + +E + I++L+DG NT + Sbjct: 157 LAEGITGVQGTAIGEAISTSLGAVKALDEQATTQPPPAR-----IIVLSDGANTSG-MDP 210 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEK---------ARYFLSNCA--SPNSFFEANSTH 435 + +A + + + TI+F ++ L A + +F +A+++ Sbjct: 211 MEAAAEAVAFEVPVHTISFGTPGGFVDRGGRPIQVPVDGETLQAVAEQTGGAFHQADTSD 270 Query: 436 ELNKIFRDRIGNEIFERVIR 455 EL+ ++ D IG+ + R R Sbjct: 271 ELHAVYDD-IGSSVGWRKER 289 >gi|251789655|ref|YP_003004376.1| hypothetical protein Dd1591_2051 [Dickeya zeae Ech1591] gi|247538276|gb|ACT06897.1| conserved hypothetical protein [Dickeya zeae Ech1591] Length = 526 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 60/533 (11%), Positives = 124/533 (23%), Gaps = 99/533 (18%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 F ++ + G L M +L G ++D ++ + I Sbjct: 4 LRAHFGDSLRRFWRQERGAGSAFYVLGMMGLLLTGAYVIDT---------VQTTGDASQI 54 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + +KN+ F + +NL N + D + E Sbjct: 55 KRAADAAALAVGREALSKNNSDFSAADRQRLAWEYVRSNLGMNSKLAQQLDAADISVTEG 114 Query: 123 NPRKSAYQVVLS--------------------SRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 + +S S +++ P + L + A Sbjct: 115 RRSNARRTFTVSVSFASQPVLLKVAQRQHEIYSTVEVIYRPSEVALVLPNTSTETAAELA 174 Query: 163 EAETVSRSYHKEHGVSIQ------WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + +++ + S S+ D S S Sbjct: 175 VLRRLGKNFATDFLADYPDRRLALVPYSQSVSVYDDANWSSRIRNWSLSGALKPVELSSL 234 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 +N GI + S M + + + +F +++ Sbjct: 235 FRNNDYGIANLA-SRMMPDLRSKRMCL----YRGLNQGDNYFWETAPAGSFRVHYRHDLP 289 Query: 277 ALASVIRSIKKIDNVNDTVRMGATF---FNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 A + +I+ I D + + P+ + + Sbjct: 290 INAPDMPAIEWIGPNPDFGQANGVNDTRYIIGDKGCPTAALLPLTDDLNAISARLDQMEP 349 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVH---------RMKNNLEAKKYIVLLTDGENTQDNE 384 +T AM A ++ S N K IV++ + + Sbjct: 350 QFNTNYAIAMGWAAMSLAPSFRGSDGWGDGEYPLDFNDNGSGNIKAIVMMGNTTGDWFDT 409 Query: 385 ---------------------------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++CN +S+ + I ++ R Sbjct: 410 DSYNAYVGEAIDGDGQGGDLGKQAATLRFRSLCNSFRSRDLLFYFIGV-RPGDPEDFGRT 468 Query: 418 F--------LSNCAS--PNSFFEANST---------HELNKIFRDRIGNEIFE 451 L CA+ A+S+ EL D I Sbjct: 469 LFGAVAVPGLQACATNDGGYIRFADSSSFSGAEGQISELLDGILDDIDKHSSA 521 >gi|116750907|ref|YP_847594.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116699971|gb|ABK19159.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 268 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 51/182 (28%), Gaps = 27/182 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K + ALA + + + R Sbjct: 101 TKSEAAKTALAQF---ARITPKDANMGLAVFDAYGIAERVPLGLE------NRDKFIAAV 151 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T ++DA+ Y + + + + ++V++TDG+ N++ Sbjct: 152 NATAPGNGTPLHDALLLGYRRLEETARRQAG------YGEYHLVVITDGQAYPQNQDPTP 205 Query: 389 ICNKA-KSQGIRIMTIAF--SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + I TI F + + + R A++ EL + + + Sbjct: 206 VVAYILRQSPVVIHTIGFCIGTDHSLNQPGRTVYRA---------ADNPRELQQGLEEVL 256 Query: 446 GN 447 Sbjct: 257 AE 258 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 60/199 (30%), Gaps = 12/199 (6%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 P D + D + + ++++ + R+ F Sbjct: 356 DSVTEPTDKLPVNDCQYDVGIIFDSSGSLEKNFQTQLQIANKLQQMPIRPNLTRVAIVQF 415 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + +K + K G+T N A++ +S D Sbjct: 416 AGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRMALLFEASKRDNC---- 471 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K +V TDG E+ K QGI + T+ S +K Sbjct: 472 ----KMKLLVF-TDG---YSAEDTAEGIEALKRQGITVYTVGISTDKNAGLNVSELKGMA 523 Query: 423 ASPNSFFEANSTHELNKIF 441 SP+ +F+++ L K F Sbjct: 524 TSPSHYFDSSDFDNLLKHF 542 >gi|269929102|ref|YP_003321423.1| hypothetical protein Sthe_3201 [Sphaerobacter thermophilus DSM 20745] gi|269788459|gb|ACZ40601.1| Protein of unknown function DUF2134, membrane [Sphaerobacter thermophilus DSM 20745] Length = 341 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 61/199 (30%), Gaps = 11/199 (5%) Query: 12 KK-LIKSC-TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++ L + G ++ A+ + M+G+ G+ +D+ + A++ AA A Sbjct: 7 RRWLWHARARGQVLVLFAVALVGMIGMTGLAIDLGYTFSQKRAIQNAA---DAGALAGAQ 63 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + Q + + L +++ D + E + Sbjct: 64 YLTKSTPDTRYPVRNEVAQVVAQNGFGGATPQLVSCQYVDASDNLLGDCSQEPPDEATGV 123 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +V + + + F++ + T+A A G + V + + Sbjct: 124 EVRVRETH------RTFFIQIVPGAPDTASTEAMARARVFKVTNAPGDAPFVVCGYDTEL 177 Query: 190 LDYQRDSEGQPLNCFGQPA 208 S +P PA Sbjct: 178 AGGGTFSILKPGTNEIDPA 196 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/318 (10%), Positives = 84/318 (26%), Gaps = 22/318 (6%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD-SEG 198 N + R + S + + + + ++ + Sbjct: 22 TFNMDTRNPRVIAGPSAAFFGYTVQQH---DISGKKWLVVGAPLETNGHQKTGDVYKCPV 78 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 NC R S + + + + + + Sbjct: 79 TQGNCTKLNLGRVTLSNACSPLWSHECGSSYYTTGMCSRVNSNFRFSKTVAPALQRCQTY 138 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D + + + +I + K +++G + + + + Sbjct: 139 MDIVIVLDGSNSIYPWVEVQHFLIEILTKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYKS 197 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + I++ T ++ A E + AKK ++++TDGE Sbjct: 198 VKDVVEAASHIEQRGGTETRTAFGIEFA--------RSEAFQKGGRKGAKKVMIVITDGE 249 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL----SNCASPN---SFFEA 431 + D+ + + +++ + +A ++ AS FF Sbjct: 250 -SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNV 308 Query: 432 NSTHELNKIFRDRIGNEI 449 L D +G+ I Sbjct: 309 TDEAALKD-IVDALGDRI 325 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 54/181 (29%), Gaps = 23/181 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ L+ VI ++ + R+ + V + K + Sbjct: 167 FEKVKTFLSQVIDTLDIGETAT---RVAVVNYASTVKVEFHLQTHSDKESLKQAVSRIAP 223 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A++TA D + + N K +V++TDG + ++ Sbjct: 224 LAT--GTMSGLAIRTAMDEVFTVEAGARAPAFNIP---KVVVIVTDG---RPQDQVQEAV 275 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDR 444 +A++ GI I + L AS F + +L FR Sbjct: 276 AQAQASGIEIYAVGVGR------ADMQSLRQLASEPVETHAFYVETYGVIEKLTSTFRKT 329 Query: 445 I 445 Sbjct: 330 F 330 >gi|188586927|ref|YP_001918472.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] gi|179351614|gb|ACB85884.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] Length = 301 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 12/41 (29%), Positives = 20/41 (48%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 KL+K G +I AL++ V++ ++VDV L Sbjct: 5 KLLKDEEGSSVVIVALMLTVLMSFTALVVDVGLMYAERSRL 45 >gi|115767164|ref|XP_001193368.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] gi|115976266|ref|XP_001179968.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] Length = 797 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 53/150 (35%), Gaps = 18/150 (12%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ + + R + + +G T I + + + + S + Sbjct: 303 VVFDSTSQISGNLTDITETASRQRLVNAL-PPSPVGGTCIGCGILSGIEVLGSYAQGG-- 359 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 YI+LL+DGE T ++ K+ G+ I TI S + QQ + Sbjct: 360 ----------YIILLSDGEETDAPFIMDTY-DEIKNSGVIIDTITISDSADQQMEDLST- 407 Query: 420 SNCASPNSFF--EANSTHELNKIFRDRIGN 447 N + +F +A + L + F+ I Sbjct: 408 -NTSGIANFCSDDARTGIRLIQAFQSTITE 436 >gi|296127472|ref|YP_003634724.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019288|gb|ADG72525.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 338 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 55/185 (29%), Gaps = 31/185 (16%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D + + F F+ + I+ + + T I DA+ TA +T Sbjct: 124 KNTDNLAVSLVGFAGTSFIACPFTQDMETF-SYILDNLSTKSVTLQGTRIADALVTAKNT 182 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 KK IVL+TDGE+ + + K I + TI ++ Sbjct: 183 FNVDAVS-----------KKSIVLITDGEDHGGY--FDEVLKELKDMNISVYTIGVGTSQ 229 Query: 410 TQQ--------------EKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFERV 453 ++ L A + A + L IF D N Sbjct: 230 GAAISTDLGVREKSVISKRDDNTLKLIADSTLGKSYIAENVS-LESIFDDMKKNMDSVSA 288 Query: 454 IRITK 458 +R + Sbjct: 289 VRNNR 293 >gi|299532826|ref|ZP_07046213.1| membrane protein-like protein [Comamonas testosteroni S44] gi|298719050|gb|EFI60020.1| membrane protein-like protein [Comamonas testosteroni S44] Length = 628 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/233 (11%), Positives = 62/233 (26%), Gaps = 12/233 (5%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + S+ I+ +G + AL LM ++ + +D+ + + L++ A A Sbjct: 1 MQSFATGLSRTRIRRQSGSVATLGALWLMIAVICLA--TIDIGNVFWQKRELQKIADLAA 58 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + Q + +A + + N+T R V Sbjct: 59 LAGA----QGETPAACQANAARIATLNGMT-----GVPQVECGNWTPSPGVADTRTYFVN 109 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 +A V ++ L +K+ ++ R + + Sbjct: 110 GASPLNASCVTVARTVPYLFLFSVTAANGRQVKAVATAARSRPLAQLRIRSTLLVIDDKK 169 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + + + A+ V G + V Sbjct: 170 SAILNPIIGGLLGGNLNIQAVGWNGLANAHVSLLDFFKELGGKKGLIDINATV 222 >gi|209886336|ref|YP_002290193.1| TadE family protein [Oligotropha carboxidovorans OM5] gi|209874532|gb|ACI94328.1| TadE family protein [Oligotropha carboxidovorans OM5] Length = 145 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 48/131 (36%), Gaps = 4/131 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ-AAQTA- 60 T+F + + K+ +G + A+++PV L + +V + H ++Q AA+ A Sbjct: 1 MTTRFRRF-RDFSKARSGASAVEFAIVLPVFLMLVFGIVMFGAYLALVHDVQQLAAEAAR 59 Query: 61 -IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + + S A+N+ ++P + + ++ N V T Sbjct: 60 TSVAGLNETERRSLAASYVAQNAASYPLIVPAQLSVNAATSSTDPNVFIVTVNYDASHTF 119 Query: 120 VEMNPRKSAYQ 130 + P Sbjct: 120 IYTLPSFVPAP 130 >gi|17533679|ref|NP_496742.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876680|emb|CAB03054.1| C. elegans protein F35C5.5b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 387 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 53/160 (33%), Gaps = 9/160 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSF--SWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I T R+G +N L T+ + D + + Sbjct: 66 TRIGTNYIDKRTTRVGLVTYNTEATIQADLNRFQSPDDLFSTVFQILPNDLSTSEDVFLA 125 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + A + + + K+ +++ N + E+ I + K+ G+ I Sbjct: 126 KGIGAAEQLLAAG-----RKNNTRKNYKQMVIVYASAYNDEGEEDPRPIAERLKASGVSI 180 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 T+AF ++T E+ + A+P F + + +I Sbjct: 181 ATVAF--DQTGDEEMIKLIGEIATPGFNFTNEDENLVKEI 218 >gi|326435586|gb|EGD81156.1| hypothetical protein PTSG_11196 [Salpingoeca sp. ATCC 50818] Length = 1445 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 42/131 (32%), Gaps = 7/131 (5%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D DT R+ ++ V FS+ + I+ T G T A+ A Sbjct: 213 FDVSPDTTRVALMVYSSSVTEIFDFSYVLSNTRDEIITTIRNTNYPGGGTRTGSALDYAR 272 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF-S 406 + ++ ++ + +++ DG++ + +++ + I I Sbjct: 273 TNMFLTSRGARP---SSAGVPRVAIVIIDGQSGDSVAQP---AENLRNENVNIFAIGISG 326 Query: 407 VNKTQQEKARY 417 + + Sbjct: 327 ADVNELNMIAS 337 >gi|327283159|ref|XP_003226309.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1076 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 21/174 (12%), Positives = 56/174 (32%), Gaps = 19/174 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +++D + +++ + + R+G ++ V + + + + + Sbjct: 820 FEIIKDFVTALVDRVTV---GRNATRIGLVLYSLEVRLEFGL--NRYTTQQDVKQAIRKM 874 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A++ A +K ++LTDG+ + + I Sbjct: 875 LYMGEGTYTGTAIRKATQ---------EGFFGARTGVRKVAIVLTDGQADKREAVKLDIA 925 Query: 391 -NKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 +A + I + I + + Q + L+ AS + + + L Sbjct: 926 VREAHAANIEMYAIGIVNTSDPTQVEFLRELNLIASDPDREHMYLIDDFNTLPA 979 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 42/324 (12%), Positives = 85/324 (26%), Gaps = 30/324 (9%) Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 V S + S +++ + + + + + I Sbjct: 178 NGKNIPVKCSGAITGTNKVNACSGGSCSVRNCRTDPQTGKLNKGCKFIPDEFQTEKASIM 237 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 F +S+ + N A + S + ++ ++ + Sbjct: 238 FMQSINSVVEFCTEKNHNKDAPNAHNQKCNLRSTWEVIKDSEDYKKSNPMTAAQPPKPTF 297 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + +D S + + + A + I + G F+ Sbjct: 298 SLKQIRERI-LVLVLDKSGSMASGDRLNRLNQASKLFLLQIIE-----KGSWAGMVTFDS 351 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 R + T G T+I ++TA+ I K Sbjct: 352 SATIQSELIQIETDAQRNSLITRLPTV-AGGGTSICSGLRTAFTVI----------KKKF 400 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 IVLLTDGE+ I+ C ++ K G I T+A + + + Sbjct: 401 STDGSEIVLLTDGEDN-----TISTCFDEVKQSGAIIHTVALGPSADPGLEKLAEM---- 451 Query: 424 SPNSFFEANSTHE---LNKIFRDR 444 + A + L F Sbjct: 452 TGGMKTTATDNAQNNGLIDAFSAL 475 >gi|223939755|ref|ZP_03631626.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223891531|gb|EEF58021.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 806 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 41/161 (25%), Gaps = 15/161 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 ND R F+ V K R F + MG TAI++A++ A Sbjct: 344 NDGDRFEIIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSLES 403 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI---RIMTIAFSVN 408 +V LTDG T + I + + RI + Sbjct: 404 KEGRPF------------VVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIGTD 451 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 R A F +I + Sbjct: 452 VNTHLLDRIAEETRAFSQYVLPEEDLEVKVSSFFSKINEPV 492 >gi|15594518|ref|NP_212307.1| hypothetical protein BB0173 [Borrelia burgdorferi B31] gi|3915348|sp|O51195|Y173_BORBU RecName: Full=Uncharacterized protein BB_0173 gi|2688067|gb|AAC66565.1| predicted coding region BB0173 [Borrelia burgdorferi B31] Length = 341 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 44/143 (30%), Gaps = 20/143 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + I + +G F + + + Sbjct: 118 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 170 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D Sbjct: 171 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIX 217 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I +I + Sbjct: 218 KDQVINLAQGLNVKIYSIGIGSS 240 >gi|284029672|ref|YP_003379603.1| hypothetical protein Kfla_1709 [Kribbella flavida DSM 17836] gi|283808965|gb|ADB30804.1| hypothetical protein Kfla_1709 [Kribbella flavida DSM 17836] Length = 144 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 41/132 (31%), Gaps = 10/132 (7%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G ++ ++L + ++ D+ + L A A++ A+ L + Sbjct: 6 RRDERGQMTVLIIGFTAIVLLMIAVVTDISKVFLVRRDLDATADGAVLAAANGLAAIYGQ 65 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S+ + + L + + + + + T + V L+ Sbjct: 66 TSAGSTAELDVEQ---ARRLAAEYLDTVAASNRFDGLDWSAEVTGTTV-------TVELT 115 Query: 135 SRYDLLLNPLSL 146 S DL P Sbjct: 116 STVDLPFQPPGW 127 >gi|306820467|ref|ZP_07454103.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304551542|gb|EFM39497.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 538 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 44/143 (30%), Gaps = 14/143 (9%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 S + L + + G T+I D+++ + + E+ Sbjct: 250 TESCSANELIHPIETLNTSAISKALSSVEATGWTSIADSIKNGGEDLSKFKEEGAVN--- 306 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSN 421 + ++TDG T + I K+ G + + I F+VN TQ + Sbjct: 307 ------ILYIVTDGIETCG-GDPIEAAQTLKNSGTNVVLGIIGFNVNATQDAVLKKIAE- 358 Query: 422 CASPNSFFEANSTHELNKIFRDR 444 A + AN L Sbjct: 359 -AGGGHYAIANDAGTLTSELYKI 380 >gi|115453449|ref|NP_001050325.1| Os03g0403600 [Oryza sativa Japonica Group] gi|113548796|dbj|BAF12239.1| Os03g0403600 [Oryza sativa Japonica Group] Length = 416 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 38/148 (25%), Gaps = 29/148 (19%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R + G+T I ++ + + R N ++L++DG Sbjct: 3 DAARADLGALVDGLAADGNTNIRAGLEIGLA-VAAGRRLTAGRAVN-------VMLMSDG 54 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANST 434 + + + + G+ + T + L A F Sbjct: 55 QQNRGD------ATRLDPGGVPVHTFGLGADHDPA-----VLQAIAGKSREGMFHYVADG 103 Query: 435 HELNKIFRD-------RIGNEIFERVIR 455 L F I ++ V R Sbjct: 104 VNLTAPFSQLLGGLLTIIAQDLELTVTR 131 >gi|2689175|emb|CAA06010.1| hypothetical protein [Borrelia burgdorferi] Length = 328 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 20/143 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + I + +G F + + + Sbjct: 110 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D + Sbjct: 163 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEID 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I +I + Sbjct: 210 KDQVINLAQGLNVKIYSIGIGSS 232 >gi|48734898|gb|AAH71224.1| Matrilin 3 [Mus musculus] Length = 481 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 59/181 (32%), Gaps = 23/181 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+ ++ +I ++ R+ + V + + + + + A Sbjct: 94 FTKVKTFVSRIIDTLGI---GATDTRVAVVNYASTVKIEFQL--NTYSDKQALKQAVARI 148 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+QTA + + M K +++TDG + ++ + Sbjct: 149 TPLSTGTMSGLAIQTAMEEAFTVEAGARGPM---SNIPKVAIIVTDG---RPQDQVNEVA 202 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDR 444 +A++ GI + + + + L AS F + +L+ F++ Sbjct: 203 ARARASGIELYAVGV----DRADMES--LKMMASKPLEEHVFYVETYGVIEKLSARFQET 256 Query: 445 I 445 Sbjct: 257 F 257 >gi|311268950|ref|XP_003132277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Sus scrofa] Length = 1153 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTIILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTGRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + CA+ +FE S + Sbjct: 428 T--FSVGQHNYDVTPPAXXGCATKGYYFEIPSIGAI 461 >gi|297488656|ref|XP_002697119.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] gi|296474935|gb|DAA17050.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1192 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 39/317 (12%), Positives = 95/317 (29%), Gaps = 28/317 (8%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ D L+ G K+ ++ ++ ++ ++Q D + Sbjct: 209 VYYDAKADAELDDPESEDVERGSKASTLRLDFVEDSNFKNKVNYSYTAVQIPTD-----I 263 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 LN + +++ + + + + L Sbjct: 264 YKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLY 323 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTVRMGATFFND 304 ++ +S + V +++ + + K D ++D + FN+ Sbjct: 324 DVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE 383 Query: 305 RVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + F+ V + + K G+T + A+D + + Sbjct: 384 KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQN-------PNI 436 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLS 420 K I++ TDG + + K ++ +R+ T FSV + + Sbjct: 437 TRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVFT--FSVGQHNYDVTPLQWM 490 Query: 421 NCASPNSFFEANSTHEL 437 C + +FE S + Sbjct: 491 ACTNKGYYFEIPSIGAI 507 >gi|270008951|gb|EFA05399.1| hypothetical protein TcasGA2_TC015571 [Tribolium castaneum] Length = 902 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 37/124 (29%), Gaps = 6/124 (4%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + + + E + + I+ LTDG I Sbjct: 342 GCTNIIGGLVVGLFLVRRTLEKNYEKNVALK-HQPMIIFLTDGLPNVGISNPDEITKLVT 400 Query: 395 --SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +QG I +++F + + + N +EA + F + + + Sbjct: 401 KINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRTVSSPL 460 Query: 450 FERV 453 V Sbjct: 461 LRDV 464 >gi|189238319|ref|XP_972336.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 713 Score = 48.4 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 37/124 (29%), Gaps = 6/124 (4%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + + + E + + I+ LTDG I Sbjct: 358 GCTNIIGGLVVGLFLVRRTLEKNYEKNVALK-HQPMIIFLTDGLPNVGISNPDEITKLVT 416 Query: 395 --SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +QG I +++F + + + N +EA + F + + + Sbjct: 417 KINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRTVSSPL 476 Query: 450 FERV 453 V Sbjct: 477 LRDV 480 >gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4] gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-4] Length = 759 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 42/157 (26%), Gaps = 15/157 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 FN V S + + F G T + A+ +A Sbjct: 411 QDSFNIIEFNSDVSLLSSTPLPATATNLAMARQFVNRLQADGGTEMAQALNSAL------ 464 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + ++ + ++ +TDG ++ I N+ R+ T+ Sbjct: 465 PRQAFNTASGEDKSLRQVIFMTDGSVGNESALFELIRNQIGDN--RLFTVGIGSAPNSH- 521 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 F+ A +F E+ + + Sbjct: 522 ----FMQRAAELGRGTFTYIGDVDEVEQKISKLLAKI 554 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 44/155 (28%), Gaps = 19/155 (12%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + R K TA +A+ TA I + + Sbjct: 54 AYAGTATVLVSPTTNREATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDT 110 Query: 366 EAKKYIVLLTDGENTQDNE-----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY--- 417 IVL +DG+ T AK QG+ I TI+F E Sbjct: 111 PPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQP 170 Query: 418 ------FLSNCA--SPNSFFEANSTHELNKIFRDR 444 + A S + + A + EL ++ Sbjct: 171 VPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSL 205 >gi|332970881|gb|EGK09858.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 442 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 49/147 (33%), Gaps = 19/147 (12%) Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + + G T + A+Q A + + K Sbjct: 193 CQSNELVYPLKSYDSSQFEQSLNQFKPTGWTPLASAIQAAREDL-----------KEWAG 241 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYFLSNCAS 424 A+ + +++DG T + + K GI + I F V+ Q++ + A+ Sbjct: 242 ARNIVYVVSDGVETCG-GDPVREAKKLGESGIEPMVKIIGFDVDDAGQQQLKKVAE--AA 298 Query: 425 PNSFFEANSTHELNKIF---RDRIGNE 448 S+ S +L + ++ I E Sbjct: 299 DGSYQTVTSGDDLKEYLKGEKESIKQE 325 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ + + ++ + G T + A + A + Sbjct: 339 DRLGVVVFSSGHRVLFPPRPMTAQAKKEA-ESLLLSLRAGGGTVLGGAFREAVRLL---- 393 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +K +++LTDG + I + A++ G+ + +A + Sbjct: 394 -------HGVPGERKAVLVLTDGL---IADAKEPILDLAQTSGVEVSALALGPDADAPFL 443 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIF 441 F++A S EL ++F Sbjct: 444 KELARR---GGGRFYQAPSPRELPRLF 467 >gi|332519332|ref|ZP_08395799.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045180|gb|EGI81373.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 345 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 21/148 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + +I ++ R+G + + + + ++ Sbjct: 110 NRLEKSKQLVTQIINNLAS-------DRVGIIAYAGKAFPQLPITTDYASA-KMFLQNMN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D TAIN+A++ A + + +V+++DGE+ + +A Sbjct: 162 TDMLSSQGTAINEAIELAKTYYDDDQQ-----------TNRVLVIISDGEDHSEAAANVA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +A ++GIRI TI K + Sbjct: 211 E--EASNEGIRIFTIGVGDAKGGPIPIK 236 >gi|310824614|ref|YP_003956972.1| vault protein, inter-alpha-trypsin domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397686|gb|ADO75145.1| Vault protein, inter-alpha-trypsin domain protein [Stigmatella aurantiaca DW4/3-1] Length = 749 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 20/184 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +DAL + + D F+ V + F Sbjct: 294 SRMQIAKDALKYCVTRLNPQDT------FNVVRFSTDVEALFPALKSAQPENIQKAVAFV 347 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +G TAI++A+ A ++ +TDG+ T + A Sbjct: 348 EQLEAIGGTAIDEALVRGLQ-----------DNDGKSSAPHLLMFITDGQPTIGETDEGA 396 Query: 389 ICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I AK R+ T + + R + + + F D++ Sbjct: 397 IAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDKV 456 Query: 446 GNEI 449 N + Sbjct: 457 SNPV 460 >gi|194209663|ref|XP_001495019.2| PREDICTED: similar to collagen, type XXVIII [Equus caballus] Length = 1127 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 47/145 (32%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIR---SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ + + ++ +++ A F+ V D FS Sbjct: 64 FDKQKDFVDSLSDKVFQLTPVRSLKYDIKLAALQFSSSVQIDSPFSSWKDLQTFKQRVKS 123 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 T A+ A + + K +L+TDG + N + Sbjct: 124 MNLI--GQGTFSYYAISNATRLLQREGR---------KDGVKVALLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A++ GI +TI S + Sbjct: 173 SISEDARTAGILFITIGLSTVVNEA 197 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 55/178 (30%), Gaps = 20/178 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++++ + ++ + DT R+G ++ +V + K Sbjct: 814 FQIIKNFVKTLTDQVAL---DLDTARIGIINYSHKVEMVAHLTQFSSKDD--FKLAVDNM 868 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAI 389 + T A+ A ++ KK +++TDG+ +T+D + + Sbjct: 869 QYLGEGTYTATALHAANHMFEAARPG----------VKKVALVITDGQTDTRDEKNLTEV 918 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ-EKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 A + I I E ++ A+ ++ + L + Sbjct: 919 VKNASDASVEIFVIGVVKRNDPNFEMFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 976 >gi|327290735|ref|XP_003230077.1| PREDICTED: collagen alpha-1(XIV) chain-like, partial [Anolis carolinensis] Length = 562 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 33/122 (27%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ + G+T A+ + + K +L+ Sbjct: 107 NTYSTKDAVLDAVRNLPYKGGNTLTGLALTFILE------NNFKSEAGARPGVPKIGILI 160 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ K GI + I + L AS + Sbjct: 161 TDGK-SQDDVIPP--AKNLKDAGIELFAIGV------KNADETELKEIASEPDNTHVYNV 211 Query: 432 NS 433 Sbjct: 212 AD 213 >gi|309355882|emb|CAP38139.2| hypothetical protein CBG_21291 [Caenorhabditis briggsae AF16] Length = 430 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 59/178 (33%), Gaps = 10/178 (5%) Query: 269 KKKHLVRDALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V ++ + + + +++ ++ A F + F+ + +++ Sbjct: 253 TTTDPVYNSYKDLSKRLVQQLKIGPHYTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGI 312 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ G+TAI ++ A I S K +++ TDG + + + Sbjct: 313 DKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT------KVMIVFTDGWSNKG-PDPE 365 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ-EKARYFLSNC-ASPNSFFEANSTHELNKIFRD 443 A + G + T+A++ LS S F + L + Sbjct: 366 KRARDAVNAGFEMYTVAYTARAPGSVTLNNETLSAISGSSGHAFTDVTFQTLVDKIKQ 423 >gi|156347845|ref|XP_001621774.1| hypothetical protein NEMVEDRAFT_v1g221583 [Nematostella vectensis] gi|156208029|gb|EDO29674.1| predicted protein [Nematostella vectensis] Length = 357 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/268 (11%), Positives = 73/268 (27%), Gaps = 26/268 (9%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP-YMVS 235 + D + + ++ + N + + + + V Sbjct: 67 IDWSQWPDMTHVISACASWAKRKNYNDKFGVQFYGECWADADSSRFNDHGLATNCINGVG 126 Query: 236 CNKSLYYMLYPGPLDPSL-SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + S + G P + +D+S + + ++ + S Sbjct: 127 EHWSNFVYSMKGVCPPLKMNLVFLIDNSGSIND-TEFDNFKEFAKKLAESFTISAT---Y 182 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + A +FN + + ++ + I N G T I A+ D + Sbjct: 183 THVAAVYFNTLANFGFNLKYDINVIKTAID----NLPNIGGGTHIGKALTYTLDNVFKVA 238 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +V+LTDG++ A G+ + + + + Sbjct: 239 P--------RQNVKNVLVVLTDGKSHDSVTLPAAAVRNY-GPGVEVFAVGVGAGDSFVAQ 289 Query: 415 ARYFLSNCASP---NSFFEANSTHELNK 439 L+ AS + F ++ Sbjct: 290 ----LNVIASDPDEDHVFHVEHFSQIES 313 >gi|311268952|ref|XP_003132278.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Sus scrofa] Length = 1146 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTIILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTGRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + CA+ +FE S + Sbjct: 428 T--FSVGQHNYDVTPPAXXGCATKGYYFEIPSIGAI 461 >gi|149067646|gb|EDM17198.1| integrin alpha M [Rattus norvegicus] Length = 1151 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 61/187 (32%), Gaps = 18/187 (9%) Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 +D S + I +++ +++V+ +K + ++D + F++ Sbjct: 153 FLIDGSGSINSI-DFQKMKEFVSTVMDQFQKSKTL-----FSLMQYSDEFRT--HFTFND 204 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 K G T ++ + A K +V++TD Sbjct: 205 FKRNPDPKSHVRPIRQLNGRTKTASGIRKVVRELFQKINGAR------DNAAKILVVITD 258 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 GE D + +A+ GI I + + R L AS + F+ ++ Sbjct: 259 GEKFGDPLNYEDVIPEAEEAGIIRYVIGVGNAFHKPQSRRE-LDTIASKPAGDHVFQVDN 317 Query: 434 THELNKI 440 LN I Sbjct: 318 FEALNTI 324 >gi|149184583|ref|ZP_01862901.1| hypothetical protein ED21_27733 [Erythrobacter sp. SD-21] gi|148831903|gb|EDL50336.1| hypothetical protein ED21_27733 [Erythrobacter sp. SD-21] Length = 144 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 30/81 (37%), Gaps = 1/81 (1%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ-AAQTAIITASVPL 68 +L + G I ALL PV+LG+ L+ + ++L+ A+ TA + Sbjct: 5 LFHRLRRDERGTAIIEFALLAPVILGLFFGLIQIGISMQAYNSLRGVASDTARYAVVEYM 64 Query: 69 IQSLEEVSSRAKNSFTFPKQK 89 + ++ + Sbjct: 65 NGDTIDDTTIENRAKAIATGA 85 >gi|319426151|gb|ADV54225.1| Vault protein inter-alpha-trypsin domain protein [Shewanella putrefaciens 200] Length = 757 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 50/407 (12%), Positives = 107/407 (26%), Gaps = 36/407 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + + +V++ A N + +E + + R I Sbjct: 165 RASLVSQERPNMFTTDVANLAPNEQLVVEISYQENIKYEDGLFSLRFPLVVAPRYIPGFI 224 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + +A ++S + LR K + +++ K S Sbjct: 225 HDKASANDNAANSRVTSSEVFDAERIVAPLRRANSKQDPVLNANIEVRLAKGVDKSTIAS 284 Query: 179 IQ--WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS----SQNGKVGIRDEKLSPY 232 +D + + N R + S N + + S Sbjct: 285 PYHQIKLDEPHHGIINVSLTNSVVANRDFVLQWRAKQGMSPMALVFNQQGKTHGDGASED 344 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI---- 288 VS N+ Y + P ++EH + + +I I K Sbjct: 345 NVSENRQSDYHYSLVMVLPPKTDEHALSTLPRE-LILVIDTSGSMAGDSIVQAKSALLYA 403 Query: 289 -DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST----AINDAM 343 + + FN + S ++ + F G T A+N A+ Sbjct: 404 LNGLKAEDSFNIIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMALALNAAL 463 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + S++ ++ +TDG + I + R+ T+ Sbjct: 464 PRGINRLSESSQSLRQ-----------VIFMTDGSVGNEQALFDLIRYQIGES--RLFTV 510 Query: 404 AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 F+ A +F + E+ + + Sbjct: 511 GIGSAPNSH-----FMQRAAELGRGTFTYIGNVDEVEQKISQLLSKI 552 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 78/291 (26%), Gaps = 12/291 (4%) Query: 98 FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 + + TD V V + + + +S D + Sbjct: 583 VDISAALTDTDGSETLSVIIKNVPASATLESSKYEVSKNSDGSYTVKVPQGETSISDKLT 642 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 ++ E + + D ++ P G D + + Sbjct: 643 MKVPQEDAKNINLQIEAKATEARDNEDGQNFKTATDSTTDKTPTLVVGSNKDSVINGGAG 702 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 ++ +G N Y + +E ++ I + L++DA Sbjct: 703 KDILIGDTGGTELNVQAGKN----YNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDA 758 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L ++ S+K D + + F+ +F+ K I ++ T G T Sbjct: 759 LKNLADSLKGHD---GKINVSIIDFDTNAKEPITFNDLTSKNISDLI-TKIDALKAEGGT 814 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 DA + + + LTDG+ T N + Sbjct: 815 NYEDAFLKTTSWFDTQS----VTYGKAQGYENLTYFLTDGDPTFSNRDTKN 861 >gi|291290987|ref|NP_001167519.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 3 [Mus musculus] Length = 1150 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290985|ref|NP_001167518.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 1 [Mus musculus] Length = 1156 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290989|ref|NP_001167520.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 4 [Mus musculus] gi|187957756|gb|AAI58059.1| Cacna2d2 protein [Mus musculus] Length = 1147 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|149018615|gb|EDL77256.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Rattus norvegicus] Length = 1157 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|148689378|gb|EDL21325.1| procollagen, type VII, alpha 1 [Mus musculus] Length = 2944 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ + G+T A+ D + Sbjct: 78 VRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELS--YKGGNTRTGAALHHVSDRVF--- 132 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + K K QG+++ + + Sbjct: 133 ----LPRLTRPGVPKVCILITDG---KSQDLVDTAAQKLKGQGVKLFAVGI------KNA 179 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 180 DPEELKRVASQPTSDFFFFVND 201 >gi|148689241|gb|EDL21188.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Mus musculus] Length = 1149 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 200 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 259 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 260 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 319 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 320 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 379 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 380 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 428 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 429 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 462 >gi|115647999|ref|NP_031764.2| collagen alpha-1(VII) chain precursor [Mus musculus] gi|143955303|sp|Q63870|CO7A1_MOUSE RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor Length = 2944 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ + G+T A+ D + Sbjct: 78 VRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELS--YKGGNTRTGAALHHVSDRVF--- 132 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + K K QG+++ + + Sbjct: 133 ----LPRLTRPGVPKVCILITDG---KSQDLVDTAAQKLKGQGVKLFAVGI------KNA 179 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 180 DPEELKRVASQPTSDFFFFVND 201 >gi|40737976|gb|AAR89454.1| voltage-gated calcium channel alpha2-delta2 subunit [Mus musculus] Length = 1186 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 238 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 297 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 298 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 357 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 358 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 417 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 418 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 466 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 467 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 500 >gi|161353447|ref|NP_064659.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 2 [Mus musculus] gi|81892698|sp|Q6PHS9|CA2D2_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Protein ducky; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|38614142|gb|AAH56389.1| Cacna2d2 protein [Mus musculus] gi|148689240|gb|EDL21187.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Mus musculus] Length = 1154 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290991|ref|NP_001167521.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 5 [Mus musculus] gi|26336631|dbj|BAC31998.1| unnamed protein product [Mus musculus] Length = 1148 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|15553133|gb|AAL01650.1|AF247141_1 voltage-dependent calcium channel alpha-2-delta-2 mutant subunit 2 [Mus musculus] Length = 1084 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 127 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 186 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 187 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 246 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 247 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 306 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 307 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 355 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 356 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 389 >gi|12044402|gb|AAG47846.1|AF247139_1 voltage-dependent calcium channel alpha-2-delta-2 subunit [Mus musculus] Length = 1156 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|28212250|ref|NP_783182.1| voltage-dependent calcium channel subunit alpha-2/delta-2 [Rattus norvegicus] gi|81871226|sp|Q8CFG6|CA2D2_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|27450706|gb|AAO14653.1|AF486277_1 calcium channel alpha-2 delta-2 subunit [Rattus norvegicus] Length = 1157 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 199 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 258 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 259 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 318 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 319 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 378 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 379 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 427 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 428 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|2326168|gb|AAB66593.1| type VII collagen [Mus musculus] Length = 2944 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ + G+T A+ D + Sbjct: 78 VRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELS--YKGGNTRTGAALHHVSDRVF--- 132 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + K K QG+++ + + Sbjct: 133 ----LPRLTRPGVPKVCILITDG---KSQDLVDTAAQKLKGQGVKLFAVGI------KNA 179 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 180 DPEELKRVASQPTSDFFFFVND 201 >gi|26006175|dbj|BAC41430.1| mKIAA0558 protein [Mus musculus] Length = 1098 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 84/276 (30%), Gaps = 29/276 (10%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +++S + + D + + + + Sbjct: 147 NFKNKVNYSYTAVQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSATG 206 Query: 238 KSLYYMLYPGPLDPSLS------EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--- 288 + YY P + ++ +S + V +++ + + K Sbjct: 207 VTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVC 266 Query: 289 ---DNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAINDAM 343 D ++D + FN++ F+ V + + K G+T Sbjct: 267 EMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGF 326 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIM 401 + A+D + +SN K I++ TDG + + K ++ +R+ Sbjct: 327 EYAFDQLQNSN-------ITRANCNKMIMMFTDG----GEDRVQDVFEKYNWPNRTVRVF 375 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 T FSV + + C + +FE S + Sbjct: 376 T--FSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 409 >gi|239617869|ref|YP_002941191.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] gi|239506700|gb|ACR80187.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] Length = 1706 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/200 (10%), Positives = 57/200 (28%), Gaps = 33/200 (16%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S ++ + +DA + + + + + F+ + +F Sbjct: 292 PSLNFVLEVDRSGSMKPVMEKAKDAASYFLDLL------PENSELALIAFDTEIEVLKNF 345 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + +L R + G+T + D + + + ++++ Sbjct: 346 TRDREQLKRALAI-----IKARGATPLYDTVAKGIELLSE------------RSGPRFLI 388 Query: 373 LLTDG-ENTQDNEEG------IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 L+TDG + + + A+ + I I + ++ Sbjct: 389 LVTDGVDANYGDTAPGSEKTLSEVIRLARENNVVIFAIGLGTRIDEFSLGTL---ARSTG 445 Query: 426 NSFFEANSTHELNKIFRDRI 445 F ++ + L F + Sbjct: 446 GMFLKSPTIDNLKTAFNSLL 465 >gi|156370019|ref|XP_001628270.1| predicted protein [Nematostella vectensis] gi|156215242|gb|EDO36207.1| predicted protein [Nematostella vectensis] Length = 1552 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 15/108 (13%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK-- 392 G T I+ A++ + + + + L KK ++LTDG T+D + Sbjct: 28 GLTFIDKALKISAEKLFTEKNGMR------LNRKKVALVLTDGIQTKDKGPFTPLQKASQ 81 Query: 393 -AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELN 438 K +G+ + ++ + E + AS + F A S EL Sbjct: 82 PMKDKGVEVYSLGIGSDIDVSELITF-----ASGEKYVFNAKSFDELQ 124 >gi|154411703|ref|XP_001578886.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121913087|gb|EAY17900.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 710 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 42/328 (12%), Positives = 91/328 (27%), Gaps = 33/328 (10%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--QWVIDFSRSMLDYQRD 195 + +N FL + ++ + + S + + I + + Sbjct: 119 KVEINLKITFLADINENGYIYKFPLTHKYQKGSVTNDLPENFEFTIKIMTMKEIKKLNVT 178 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV----SCNKSLYYMLYPGPLDP 251 ++G A K+ + + K + ++ Y Sbjct: 179 AKGNMKIIDNHNAIFETKTTPRDDAIIIETSIKDEDKGIAISSDGYIAISTYPYFEGKVE 238 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + SE +F+ S + + L +I S+ R F Sbjct: 239 NNSEFYFLIDCSGSMYGSRIENAKFCLNLLIHSL------PIDSRFSIIKFGTSYEEIFP 292 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 +K ++ ++ + +M T I ++ Y + + I Sbjct: 293 ICDYTNKNVKIAMRQIKDLDADMDGTDILSPLEYVYTQTTKNG------------YHRKI 340 Query: 372 VLLTDGENTQDNEEGIAICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 LLTDG+ + IC+ A K RI I L S ++ Sbjct: 341 FLLTDGQVHNSDV----ICSLAQEKRDNNRIYAIGLGSGADPGLIKNVSLK---SWGNYV 393 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRIT 457 +N+ + + + I + I+ Sbjct: 394 LIADKDNMNEKVIELMESSISPYLTNIS 421 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 51/167 (30%), Gaps = 21/167 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + VR G + ++ + T G T +A+++ Sbjct: 683 GANNVRFGVVQYASESKTEIIIGQHSQMMRLTEAIENINQI--GGGTRTGNALRSMKSLF 740 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + V + ++++TDG + ++ + QGI I I Sbjct: 741 QMAYRENVP---------QILIVITDG---KSEDKVNQAARDLRQQGIVIYAIGI----- 783 Query: 411 QQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ + L A + N F N L I + + V + Sbjct: 784 -KDAVQQELEEIAETKNRMFFVNDFDSLKHIKHEIVQEVCSTNVCKN 829 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 42/399 (10%), Positives = 104/399 (26%), Gaps = 19/399 (4%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L++ + A + ++ V+ + + + + +N Sbjct: 1005 NLQKLCKRAEAADIMFVVHGSSGVTDLQFKNMLQLVEAVVNNSVVGKDNVQFGVL--VYS 1062 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++ KS + + S L P + S +++ + A +++ + Sbjct: 1063 SNPEVQFSLNSYASKSQIREAVFSLKPLSGQPFTARALSFARQTFGVNYGGRASSLAVAR 1122 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 V ++ R + + + + + + R Sbjct: 1123 ILVLITDEPTVPSDRDNLPMAIRALKEDKIVLIAVGVSKASREELEEITEDQKRLFFAQS 1182 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH--LVRDALASVIRSIKKID 289 Y N + H D L + + + D Sbjct: 1183 YDALENMHENLTQTVCESSKPVCSNHVADLIFLIDGSESISENSFSTMKTFMKDVVDSFD 1242 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D VR+G ++ + + I ST ++ + Sbjct: 1243 ISRDKVRLGVVQYSQEPQREFYLNEFYSDTIIKEQINRIEQLRS--STFTGKGLRF-VQS 1299 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + S + +V++TDG + D+ + A+ +S GI + + + Sbjct: 1300 LFESANGGRKNQGVSQN----LVVITDG-YSADSVDDAAM--ALRSNGIHVFAVGVGIVN 1352 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 + L F + + L I I Sbjct: 1353 SF-----ELLRIAGDARRVFTVENFNALKTIKSTIINEI 1386 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 63/199 (31%), Gaps = 22/199 (11%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 VD+S+ V++ L+S++ S+ D D +R+G ++ Sbjct: 36 QTACSKATVADVVFIVDTSTSIAQEN-FQKVKNFLSSLVSSL---DIGLDMIRVGLAQYS 91 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + L ++ G T A+ + + + Sbjct: 92 DEAYQVFLL--NQYLLKSDVLDQIGNLPYRGGETYTGRALDFVSTRYFTESAGSRAKGY- 148 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 + VL+T GE+ + E+ K + +GI I + + + L A Sbjct: 149 ---VPQLAVLITSGESNDEVEQP---AKKLRYRGISIYVVGIGI------QNTTELQQIA 196 Query: 424 SPN--SF-FEANSTHELNK 439 S + + S +L Sbjct: 197 SKPFRRYLYSIGSFDDLPD 215 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 40/316 (12%), Positives = 93/316 (29%), Gaps = 24/316 (7%) Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS---------MGIKSWLIQTKAEAETVS 168 N R Q S+ ++++ S +R +G + +++ Sbjct: 681 QLGANNVRFGVVQYASESKTEIIIGQHSQMMRLTEAIENINQIGGGTRTGNALRSMKSLF 740 Query: 169 RSYHKEHGVSIQWVIDFSRS---MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + ++E+ I VI +S + RD Q + + V+ + + R Sbjct: 741 QMAYRENVPQILIVITDGKSEDKVNQAARDLRQQGIVIYAIGIKDAVQQELEEIAETKNR 800 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 ++ + + + + + + + + + + Sbjct: 801 MFFVNDFDSLKHIKHEIVQEVCSTNVCKNVRADIVFLVDSSNSIRAAEFQKIKDFMQSFV 860 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 K+D D VR+G F+ + + + I + + + T A+ Sbjct: 861 IKVDVGLDNVRIGLIQFSSEIREEFQL--DRYSTIADVQRAIQEMQQIKLGTLTGKALTF 918 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A E K+Y++++TDGE + + +GI I I Sbjct: 919 A-------ASYFDRPKGGRPELKQYLIVITDGEAQDSVKSP---ARAIRDKGITIYAIDM 968 Query: 406 SVNKTQQEKARYFLSN 421 Q + Sbjct: 969 LQANNSQLVEITGAQD 984 >gi|37676927|ref|NP_937323.1| hypothetical protein VVA1267 [Vibrio vulnificus YJ016] gi|37201471|dbj|BAC97293.1| uncharacterized protein [Vibrio vulnificus YJ016] Length = 688 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 16/144 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F++RV + V++ T G T + DA++ A + S Sbjct: 343 DRFRILMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYDALERAVSGLDSDR 402 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 I+L+TDG E + +R+ T F + + Sbjct: 403 TTG-------------IILVTDGVANVGVTEKKQFLKLMQRYDVRLYT--FIMGNSANTP 447 Query: 415 ARYFLSNCASPNSFFEANSTHELN 438 ++ S +++ ++ Sbjct: 448 LLEPMTQ-VSNGFATSISNSDDIL 470 >gi|308472927|ref|XP_003098690.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] gi|308268290|gb|EFP12243.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] Length = 411 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 55/160 (34%), Gaps = 10/160 (6%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + V + FN + ++ S L + + + N S I + Sbjct: 80 NSSEPITTRVGLITYNFNATLNANLSQFQSYDDLSNGVFHSLSNVTNSTDS-FIGTGLAM 138 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + N + KK I++ ++E I ++ K G++I+T+ + Sbjct: 139 AEQLLRRQNFNT-----TRDHYKKVIIVYASAFQRNEDETPEWIADRLKGSGVKIITVGY 193 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + LSN ASP F ++ L + + Sbjct: 194 GNSHG----LIKSLSNIASPGLSFNSSGDGNLINQIQTSL 229 >gi|320105085|ref|YP_004180676.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752367|gb|ADV64127.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 323 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 51/183 (27%), Gaps = 27/183 (14%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI--RTI 323 + + A+ + S G TFF + + + V + Sbjct: 119 GDGTRYDMSMKAIDKFLDS-------RRGDAFGLTFFGNNYLHWVPLTSDVSAIKCAPPF 171 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQD 382 +K G T I A+ T++ E + I+L++DG + Sbjct: 172 MKPEVAPLWMSG-TEIGKALLGCRRTLVERQEG-----------DRAIILISDGASFDLG 219 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIF 441 + K GI + I + C+ + F + L IF Sbjct: 220 GGNDEEVARLLKRDGIVVYAIHID-ETEIPDPIVTI---CSITGGDAFAPDDPSALEAIF 275 Query: 442 RDR 444 + Sbjct: 276 KRI 278 >gi|209546921|ref|YP_002278839.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538165|gb|ACI58099.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 176 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 40/132 (30%), Gaps = 8/132 (6%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M +F ++LI G I A+L + + +++V + A + Sbjct: 1 MRRGKRFAP-LRRLIGDRKGAAAIEFAILALPLFIILFGIIEVSLMFFVNS----ALDAS 55 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + S + S S K KI + ++ F+ + + D+ Sbjct: 56 VHKISRMIRTGEVASSKITLAS---FKAKICDDMLLAFDCSSGLVVKVNVLSDMSSAAHT 112 Query: 121 EMNPRKSAYQVV 132 + V Sbjct: 113 DPIDSSGKLAVT 124 >gi|254477813|ref|ZP_05091199.1| TadE-like protein [Ruegeria sp. R11] gi|214032056|gb|EEB72891.1| TadE-like protein [Ruegeria sp. R11] Length = 186 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 9/77 (11%), Positives = 28/77 (36%), Gaps = 4/77 (5%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI--- 61 T+ + ++ + G I A+++P L + V++ + + L++A + Sbjct: 11 TRALTGLRRFRQREDGTATIEFAIVIPAFLFLLMNTVELGLITIQQSMLERALDQTVRDL 70 Query: 62 -ITASVPLIQSLEEVSS 77 + + + Sbjct: 71 RLGTGAAQQHNEIRDAV 87 >gi|83720829|ref|YP_443052.1| hypothetical protein BTH_I2535 [Burkholderia thailandensis E264] gi|167582052|ref|ZP_02374926.1| hypothetical protein BthaT_28172 [Burkholderia thailandensis TXDOH] gi|167620215|ref|ZP_02388846.1| hypothetical protein BthaB_28164 [Burkholderia thailandensis Bt4] gi|83654654|gb|ABC38717.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 602 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 49/180 (27%), Gaps = 5/180 (2%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + + + G F + A+ M V + G+ VD+ + L++ A A + Sbjct: 9 EHRLSACGRGTRARERGSFAVAAAIWMLVAIAALGV-VDIGNVFFVRRDLQRVADMAALA 67 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + ++ S + + + D + + + Sbjct: 68 GAQRM---DDQCSQPNAAAAANASSNGFDRAASGNTLTVSCGRWDTQSNAAPSYFSTT-S 123 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 +A QV + S + + + T S S+ + Sbjct: 124 TPLNAVQVTATQNVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLASLQGGLVNSVLNAL 183 >gi|297154321|gb|ADI04033.1| hypothetical protein SBI_00912 [Streptomyces bingchenggensis BCW-1] Length = 423 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 64/193 (33%), Gaps = 18/193 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH--KLIRTIVKT 326 + + A VI ++ + DR ++ RT KT Sbjct: 60 SRMAAAKQAFNEVIDAVPDEVRLGIRTLGATYPGQDRKTGCLDSKQLYPVGRVDRTEAKT 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A++ A + +A + IVL+TDGE++ + Sbjct: 120 AVATLRPTGWTPIGLALRGASKDL------------AGGDATRRIVLITDGEDSCGQPDP 167 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + +QG + T+ +++ +++ A+ ++ T +L+ + Sbjct: 168 CDVARELAAQGTHLVVDTLGLTLDSKVRDQLSCIAE--ATGGTYTAVQHTDQLSTRIKQL 225 Query: 445 IGNEIFERVIRIT 457 + V T Sbjct: 226 VRRAADTPVQTPT 238 >gi|223670962|dbj|BAH22728.1| complement factor B precursor [Nematostella vectensis] Length = 858 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 43/161 (26%), Gaps = 18/161 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + A + + + ++++ + G TA A+ T Sbjct: 425 TSWKPGGTHVAAITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTI 484 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ ++K + +TDG + K +G +I I Sbjct: 485 RQVVP---------FTREGSQKALFFITDGHSNIGGS-PRKAAKILKDKGFQIYAIGVG- 533 Query: 408 NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRI 445 ++ R L AS +L + + Sbjct: 534 ----KKVRRRELMEIASEPEDEYVISVRKYKQLLSAVKKAV 570 >gi|156390865|ref|XP_001635490.1| predicted protein [Nematostella vectensis] gi|156222584|gb|EDO43427.1| predicted protein [Nematostella vectensis] Length = 851 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 43/161 (26%), Gaps = 18/161 (11%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + A + + + ++++ + G TA A+ T Sbjct: 418 TSWKPGGTHVAAITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTI 477 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ ++K + +TDG + K +G +I I Sbjct: 478 RQVVP---------FTREGSQKALFFITDGHSNIGGS-PRKAAKILKDKGFQIYAIGVG- 526 Query: 408 NKTQQEKARYFLSNCAS---PNSFFEANSTHELNKIFRDRI 445 ++ R L AS +L + + Sbjct: 527 ----KKVRRRELMEIASEPEDEYVISVRKYKQLLSAVKKAV 563 >gi|330447678|ref|ZP_08311326.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491869|dbj|GAA05823.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 257 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 76/224 (33%), Gaps = 22/224 (9%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDN 290 P + + S +D+++ + +K ++A+ + I I Sbjct: 53 PTLNNTQTSSSDNWLAANYLLIFDGSGSMDNTNCGNGQRKIVAAKEAMQTFINDI----- 107 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 +G F+ +D S + R +K D G+T + ++ + Y + Sbjct: 108 -PQDANVGLYVFD---NADSSLRVPLGINNRATLKQAIYDVKAGGTTPLKSSLTSGYTAL 163 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 E ++ + N +V++TDG+ + E +AI ++ + I TI F + Sbjct: 164 EKQAEKQLGYGEYN------VVIVTDGDASVGEEPEVAISRIYQNSPVTIHTIGFCIGNR 217 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A + AN+ +L + + V Sbjct: 218 HALNAEGI-------TYYQSANNPEKLLAGLQSVLAESAQFDVS 254 >gi|296134298|ref|YP_003641545.1| Protein of unknown function DUF2134, membrane [Thermincola sp. JR] gi|296032876|gb|ADG83644.1| Protein of unknown function DUF2134, membrane [Thermincola potens JR] Length = 314 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/248 (9%), Positives = 63/248 (25%), Gaps = 5/248 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +L+K G +I A+ V LG ++ D+ ++ + A A + + Sbjct: 2 KHLISRLLKKEEGSVIVIVAIAFTVFLGFVAIVTDIGLLYVKQNKVAHAVDAAALAGAQE 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L + + AK+ + +N T + ++V + Sbjct: 62 LPDNPANAVAIAKSYAQLNGLDPLQVTATVAPDNKSITVTSNDTVNLVFARLFGTDFSTV 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + L F + + + S +D++ Sbjct: 122 SASAKARVAPARTLTGAVPFSVLKQELHYGEEYVLKQGAGGVSNDGGRWHGWFGALDYTG 181 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY---YML 244 + + + ++G + Y ++ ++ + Sbjct: 182 --GGGGASEYRETIKNGYAGSVSIGDVIDIKSGNMSGPTSDGVNYRINLCRTRHSPECTW 239 Query: 245 YPGPLDPS 252 D Sbjct: 240 DSYQPDCP 247 >gi|260856317|ref|YP_003230208.1| hypothetical protein ECO26_3261 [Escherichia coli O26:H11 str. 11368] gi|260868996|ref|YP_003235398.1| hypothetical protein ECO111_3021 [Escherichia coli O111:H- str. 11128] gi|300903656|ref|ZP_07121573.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301303269|ref|ZP_07209394.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|257754966|dbj|BAI26468.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257765352|dbj|BAI36847.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|300404332|gb|EFJ87870.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300841443|gb|EFK69203.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255206|gb|EFU35174.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323156423|gb|EFZ42578.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323176802|gb|EFZ62392.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 584 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 41/433 (9%), Positives = 117/433 (27%), Gaps = 46/433 (10%) Query: 28 LLMPVM--LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 ++M +M L + G + + +++E R+ Sbjct: 6 IIMLLMSSLILSGCGPESENKESQQQQPSTPTDQ---QVLAAQQAAIKEAEQRSVADKAT 62 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 K + + + K ++ + S NP + Sbjct: 63 ADAKAKALAQQEAQQYSDKQALQGRLQAAPKYQHAAREKAASQI-----------ANPGT 111 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + T S +++ ++ + + +N F Sbjct: 112 ARYKQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNH-GLLPPPDAVRVEEIVNYFP 170 Query: 206 QPADRTVKSY------SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 D K + + + + + + P+ + + Sbjct: 171 YDWDIKDKQSIPATKPIPFAMRYELAPAPWNEQLTLLKIDILAKDHKSEELPASNLVFLI 230 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D+S ++ L++ +L +++ +++ DN+ + G H Sbjct: 231 DTSGSMISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIALPSISGSH-- 282 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + + GST ++ AY ++ + I+L TDG+ Sbjct: 283 -KAEINAAIDSLDAEGSTNGGAGLELAYQ--QAAKGFIKGGINR-------ILLATDGDF 332 Query: 380 TQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 ++ ++ K + G+ + T + + ++ ++ E Sbjct: 333 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI--ADVGNGNYSYIDTLSE 390 Query: 437 LNKIFRDRIGNEI 449 K+ + + Sbjct: 391 AQKVLNSEMRQTL 403 >gi|139948509|ref|NP_001077327.1| complement factor B [Danio rerio] gi|125858059|gb|AAI29235.1| Zgc:158446 protein [Danio rerio] Length = 751 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 73/201 (36%), Gaps = 11/201 (5%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +D + ++I I + + + DR+IS F G K + I++ Sbjct: 263 DEEDFERAKDVIKTLIEKISYYEVSPNYEILLFATDTDRIISMREFKNGQGKDLLKIIQK 322 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL-TDGENTQDNEE 385 + + AY+ I S E+ K + +A ++IV++ TDG+ Sbjct: 323 LQDYAYDKKGQRSGTNIAQAYNKIYESMTIELMTNKEDFKATQHIVIMFTDGQANMGGS- 381 Query: 386 GIAICNKAK--------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + +K K + + + + ++ ++ A+ FF+ S +L Sbjct: 382 PKPLVDKIKSLVRQNSVEEKLELYVFGLGNDVHAEDIN-DLKTDRANEKFFFKLKSLDDL 440 Query: 438 NKIFRDRIGNEIFERVIRITK 458 + F + I + + K Sbjct: 441 KETFDNMIDEGNSVELCGLYK 461 >gi|159900724|ref|YP_001546971.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893763|gb|ABX06843.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 415 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 63/192 (32%), Gaps = 27/192 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS--FSWGVHKLIRTIVKT 326 K +R+A+ ++ +++ ID V+ F+D + + + L + Sbjct: 59 DKIQHLREAVREIVANLRPIDAVS------IVLFDDTLEVLVPARLADDLPAL-----QN 107 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 E G TA++ +Q + + ++LLTDG+ D + Sbjct: 108 AIESIGEQGGTAMSLGLQAGLAELQKFQAADRVGR---------VLLLTDGQTWGDEDTC 158 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + G+ I + + AS +++K F+ + Sbjct: 159 RDLAKQIGDLGVSITALGLGTEWNEALLDDL---ATASNGESDYIADPSQISKYFQQTLQ 215 Query: 447 NEIFERV--IRI 456 + V R+ Sbjct: 216 SAQTTTVVNARL 227 >gi|27367212|ref|NP_762739.1| hypothetical protein VV2_0803 [Vibrio vulnificus CMCP6] gi|27358780|gb|AAO07729.1| Uncharacterized protein [Vibrio vulnificus CMCP6] Length = 688 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 16/144 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F++RV + V++ T G T + DA++ A + S Sbjct: 343 DRFRILMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYDALERAVSGLDSDR 402 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 I+L+TDG E + +R+ T F + + Sbjct: 403 TTG-------------IILVTDGVANVGVTEKKQFLKLMQRYDVRLYT--FIMGNSANTP 447 Query: 415 ARYFLSNCASPNSFFEANSTHELN 438 ++ S +++ ++ Sbjct: 448 LLEPMTQ-VSNGFATSISNSDDIL 470 >gi|301607027|ref|XP_002933125.1| PREDICTED: collagen alpha-1(VII) chain-like [Xenopus (Silurana) tropicalis] Length = 2671 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/129 (19%), Positives = 48/129 (37%), Gaps = 19/129 (14%) Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + F++ H+ +V+ + G+T ++ A D Sbjct: 66 TRTEFTFTTHRNGTELVQAIRNLGYKGGNTRTGTGLRYAADNFF-------GPTIIRPNV 118 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--- 424 K +L+TDG+ +QD+ + + KSQGI++ + + L+ AS Sbjct: 119 PKVAILITDGK-SQDDVDPPT--QRLKSQGIKMFAVGI------KNADSRELTRVASTPT 169 Query: 425 PNSFFEANS 433 + FF N Sbjct: 170 EDFFFYVND 178 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 60/468 (12%), Positives = 115/468 (24%), Gaps = 73/468 (15%) Query: 47 SYYEHALKQAAQTA-IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 + +A Q A L+ S S+ + ++ L+ Sbjct: 112 LRDRTDIARALQVAGASGVGRLLLLSDGAESAGDALAALPELPVDVLHVRGPESVRLEAL 171 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 R + +V L + D P S G + A Sbjct: 172 HAPESARPGETVEVTAVVASDLETEVTLRATRDGEALPPITRTVSPGRTPLSFSLELGA- 230 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE-GQPLNCFGQPADRTVKSYSSQNGKVGI 224 + S E + + + F + +P+ PA + + + G Sbjct: 231 GAAGSVAVEARLEVPFEQPFQDDLQRLDIAVAADEPVLVINDPAMAELLEVQGFHVRRGR 290 Query: 225 RDEKLSPYMVS------CNKSLYYML------------------YPGPLDPSLSEEHFVD 260 + +P S + P V+ Sbjct: 291 PADVTAPLNYSAVVLREGAGAFTPGQLELLGSYVQGGGGLMMTGGPDAFGLGGWYRTPVE 350 Query: 261 SSSLRHVIKKKHLVRDALASVI-----RSIKKIDNVN----------------DTVRMGA 299 + + + +A VI +S+ + + +G Sbjct: 351 AVLPVTTDLRTEVEVPLVALVIVMDVSQSMTAGNPSRLELAKEGAVGVVDLAYERDMLGF 410 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+DR + R + G T A + A D +++ Sbjct: 411 ITFSDRPEWVFRPRQATLQGKREMTAAILN-VAPQGGTIFEPAYREALDVLMAQ------ 463 Query: 360 RMKNNLEAKKYIVLLTDGENTQDN--------EEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 A K++++LTDG+ + + + GI TIA Sbjct: 464 -----EAAVKHVIVLTDGKFADGTGPFSRGPAPDFGRLAALGRRSGITTSTIAIGDGADP 518 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRI--GNEIFERVIRIT 457 Q+ A ++EA L +IF R +T Sbjct: 519 QQLTTI---ARAGGGRYYEALDVSTLPRIFTTEALSATRSLLREGPLT 563 >gi|113682008|ref|NP_001038479.1| hypothetical protein LOC563353 [Danio rerio] gi|94732542|emb|CAK03688.1| novel collagen protein [Danio rerio] Length = 873 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 57/173 (32%), Gaps = 10/173 (5%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + A +I K D + ++ ++D Sbjct: 2 GKYFAGCSTTPNDLAFIIDGSSSLGVPNFETAKRWLINITKGFDVSSRHTQVAVVQYSDT 61 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + G H+ + +V+ G+T A++ A D + + + Sbjct: 62 PRLEIPL--GKHQNSQELVEAVGSVSYLGGNTRTGRAIKFATDHV-----FGMPNHTSQS 114 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + V+LTDG +QD+ E A+ +A++Q I + + T E Sbjct: 115 PRNRIAVVLTDGR-SQDDVEDAAM--EARAQNIVLFAVGVGNEITNSELVSMA 164 >gi|297243668|ref|ZP_06927599.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] gi|296888419|gb|EFH27160.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] Length = 560 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 45/131 (34%), Gaps = 20/131 (15%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + T + G TA+ DA+ T + ++ T Sbjct: 251 TYTNDKDRLLTGIDNMTPYGMTALYDALYTGITNASNHPGFNC------------VIAFT 298 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 DG++ + + + AK +GI + I E L N A + F++ NS Sbjct: 299 DGQDNESTHTADEVISLAKEKGIPVYLIG------TSEADSSTLQNIANETNGYFWDMNS 352 Query: 434 THELNKIFRDR 444 ++ ++ Sbjct: 353 ISDMQEVMNRI 363 >gi|221197788|ref|ZP_03570834.1| membrane protein [Burkholderia multivorans CGD2M] gi|221181720|gb|EEE14121.1| membrane protein [Burkholderia multivorans CGD2M] Length = 634 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/281 (9%), Positives = 75/281 (26%), Gaps = 4/281 (1%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F + ++ G ++ A+ + V L V G +DV + L++ A + A Sbjct: 11 FARSQRAGSRAQRGSVAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQ 69 Query: 67 PLIQS--LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + A S + F+ N Sbjct: 70 SVNDLCPQTDTTVTASGSNAVVTAAYRGAALNGFDAQASGNSMSIACGRWDVSDYGAAAG 129 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 L++ + + LF ++ + A+A + D Sbjct: 130 YFGTATNQLNAVRVVATKTVPLFF-IGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQD 188 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + + + G G + ++ + K+ + + + Sbjct: 189 CAGNSVSADQRNTGLVNALLGALLNTSLSLNIGSYQALACTRVKVGDLVKAQVGAGTVDQ 248 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + ++ + + A ++ + ++ Sbjct: 249 LLATKLTLNQLVSVMATALKNTSVANLGIYSSASSAALDTL 289 >gi|218782851|ref|YP_002434169.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218764235|gb|ACL06701.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 684 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 5/85 (5%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I+L++DG+ T + E A + K+ GI + + F V +E+ + A + Sbjct: 142 ILLVSDGKETC-DPEPCAFVAELKALGINFVMHVVGFDVGGETEEELKCM--AAAGDGEY 198 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 F A++ +L + I + + + Sbjct: 199 FPASNADKLKEALATVIEKTVEKNL 223 >gi|326673138|ref|XP_001334803.4| PREDICTED: collagen alpha-4(VI) chain-like [Danio rerio] Length = 1356 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 61/180 (33%), Gaps = 18/180 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + ++ + D VR+G ++D ++ S ++ I+ Sbjct: 45 LDNFQQIRQFLSSLVENFEVAPDKVRIGLVQYSDTPRTEFSL--NTYQNKEEILDYIRNL 102 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T ++ I+ + E + + +++TDG+ +QD + Sbjct: 103 RYKTGGTHTGQGLEF----ILKQHFIEEAGSRAQQNVPQIAIVITDGD-SQDEVDLQ--A 155 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGN 447 + + +GI+I I ++ L A+ + + L I + + Sbjct: 156 QELRQRGIKIFAIGI------KDADVRLLRQIANEPYDQYVYSVSDFAALQGISQSVVRE 209 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 52/146 (35%), Gaps = 15/146 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + +I ++D D +++ ++D +D I++ G T Sbjct: 633 DFLENTIGRLDVGKDKIQIAVILYSDFPRADVYL--NTFSNKNDILRYINTLPYGRGKTY 690 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ A + + + + ++ V++TDG++T D A + + G+ Sbjct: 691 TGAALRFAKEHVFTKARGSR----RDKYVQQVAVVITDGKSTDDAASAAA---ELRRSGV 743 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS 424 I + ++ L AS Sbjct: 744 SIFALGI------KDTKEDDLREIAS 763 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 50/157 (31%), Gaps = 14/157 (8%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++I +K++D D R+G F+D ++ + + + Sbjct: 1179 KPPSWDILKQTMIGIVKELDIAKDKWRVGVAQFSDILLHQFYL--NTYTSFAEVEEAINN 1236 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + T DA++ N ++L+TDGE +E+ + Sbjct: 1237 IKQRKQGTNTWDALKLIKYYFTKEN-GSRIEGGVAQN----LLLITDGEAN--DEKDLNA 1289 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K++ I I I E + L A Sbjct: 1290 LADLKNKKIAITVIGIG-----NEIKKSELREIAGSP 1321 >gi|319782171|ref|YP_004141647.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168059|gb|ADV11597.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 509 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 8/49 (16%), Positives = 23/49 (46%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 + G I+ ++ +P ++G + +D+ R + + L++ A + Sbjct: 5 IRAFWNDQRGIALILVSITLPAIIGFSLLAIDMSRINNLHNDLQKGADS 53 >gi|28210485|ref|NP_781429.1| membrane-associated protein [Clostridium tetani E88] gi|28202922|gb|AAO35366.1| membrane-associated protein [Clostridium tetani E88] Length = 842 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 53/162 (32%), Gaps = 24/162 (14%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +KK L + I++++ D +G F+D + F + + Sbjct: 428 VKKIELAKQGAIETIKALESEDY------IGILGFSDTIDWVVPF---QKAENKEKLIKE 478 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G T I + T+ S+ K+++LLTDG+ ++ Sbjct: 479 VGKLKPKGGTLIIPGLIEGVKTLSSA-----------KTKVKHMILLTDGQAEKN--GFD 525 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 K + + T+ + ++ LS+ +F Sbjct: 526 KYLENMKKNNMTLSTVGLGEDSDREVLTH--LSDFTGGRKYF 565 >gi|291388581|ref|XP_002710673.1| PREDICTED: collagen, type XXII, alpha 1 [Oryctolagus cuniculus] Length = 1571 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 62/198 (31%), Gaps = 35/198 (17%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + + R+G ++DR + Sbjct: 37 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPEHTRVGVVRYSDRPSTAFEL 92 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G + G+T DA++ S + ++ + Sbjct: 93 --GHFSSREEVKAAARRIAYHGGNTNTGDALRFITARSFSPQAGGRP---GDRAFQQVAI 147 Query: 373 LLTDGENTQDNEEGIAICNKAKSQ-------GIRIMTIAFSVNKTQQEKARYFLSNCAS- 424 LLTDG + + + GIRI + E R L AS Sbjct: 148 LLTDGRSQ----------DLVRDAAAAAHAAGIRIFAVGVG------EALREELHEIASE 191 Query: 425 --PNSFFEANSTHELNKI 440 F + ++KI Sbjct: 192 PTSAHVFHVSDFDAIDKI 209 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 58/189 (30%), Gaps = 29/189 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I I+ + T F+ + Sbjct: 312 DKSGSMAGEDRLNRLNQASQLFLLQI--IEKGSWTG---MVTFDSSATIQSALIQIETDA 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + G T+I ++TA+ I K IVLLTDGE+ Sbjct: 367 QRNSLISRLPT-AAGGGTSICSGLRTAFTVI----------KKKFSTHGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + I+ C ++ K G I T+A + + + + A + Sbjct: 416 S-----TISSCFDEVKQSGAIIHTVALGPSADPGLEKLAEM----TGGMKTSATDNAQNN 466 Query: 437 -LNKIFRDR 444 L F Sbjct: 467 GLIDAFSAL 475 >gi|170672288|gb|ACB29772.1| matrilin-3 alternative transcript [Homo sapiens] Length = 444 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 68/232 (29%), Gaps = 27/232 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G R +P + + G + F+ SS + V+ ++ Sbjct: 49 GRRPSPAAPDGAPASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSR 108 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I ++ D R+ + V + + +K T Sbjct: 109 IIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGRITPLSTGTMS 162 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+QTA D + K +++TDG + ++ + +A++ GI Sbjct: 163 GLAIQTAMDEAFTVEAGAREPS---SNIPKVAIIVTDG---RPQDQVNEVAARAQASGIE 216 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 LYAVGV------DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 262 >gi|162448822|ref|YP_001611189.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] gi|161159404|emb|CAN90709.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] Length = 563 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 61/180 (33%), Gaps = 12/180 (6%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + L + A ++ + + S ++ + L + + T Sbjct: 48 YGDRLELAKKAAKIYWNTLVSSNVPASQSFTELLGVASYSDT-SSVTYPLTALPASGLDT 106 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 GST+I ++ A D +IS + A++ ++LL+DG++ Sbjct: 107 AVDALVADGSTSIGAGLEEALDMLISESP--------TKSARECVILLSDGQHNTPPAPS 158 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + +IA + + + SP + A+S +++ + IG Sbjct: 159 DFYADYFSRVD-EVHSIALGSGADEAMMSDIAANYGFSPGLYLRADSDTAIDQ--LELIG 215 >gi|158255148|dbj|BAF83545.1| unnamed protein product [Homo sapiens] Length = 486 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 68/232 (29%), Gaps = 27/232 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G R +P + + G + F+ SS + V+ ++ Sbjct: 49 GRRPSPAAPDGAPASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSR 108 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I ++ D R+ + V + + +K T Sbjct: 109 IIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGRITPLSTGTMS 162 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+QTA D + K +++TDG + ++ + +A++ GI Sbjct: 163 GLAIQTAMDEAFTVEAGAREPS---SNIPKVAIIVTDG---RPQDQVNEVAARAQASGIE 216 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 LYAVGV------DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 262 >gi|219804724|ref|NP_001137337.1| collagen alpha-1(VI) chain [Bos taurus] gi|296490826|gb|DAA32939.1| collagen, type VI, alpha 1 [Bos taurus] Length = 1027 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 64/189 (33%), Gaps = 25/189 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I ++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLNDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY+V++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEELL--------VGGSHLKENKYLVVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK GI++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEI 449 + I I Sbjct: 225 -EEVISQTI 232 >gi|78066138|ref|YP_368907.1| membrane protein [Burkholderia sp. 383] gi|77966883|gb|ABB08263.1| membrane protein [Burkholderia sp. 383] Length = 603 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 16/166 (9%), Positives = 47/166 (28%), Gaps = 1/166 (0%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 T+ + G F ++ + + + + G+ + V Y +++ A A + Sbjct: 1 MKTRHSRGGFRFSSRQRGAFSVMAIVALLIAVTTLGV-IGVGNLFYQRRDVQRIADMAAL 59 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + ++ A ++ + E + + T Sbjct: 60 AAVQRMDDTCTQPTATATSNAQSNGLNASNGDSISIECGRWDTAVNPKPSYYAAATPGTT 119 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 + V + + P + S + + A + Sbjct: 120 QLNAAKVTVTRQVPFFFVGPPQPVSAVSTARSTNIDTFSVGATLAA 165 >gi|11321565|ref|NP_002372.1| matrilin-3 precursor [Homo sapiens] gi|14548113|sp|O15232|MATN3_HUMAN RecName: Full=Matrilin-3; Flags: Precursor gi|3647275|emb|CAA12110.1| matrilin-3 [Homo sapiens] gi|62630192|gb|AAX88937.1| unknown [Homo sapiens] gi|119621242|gb|EAX00837.1| matrilin 3 [Homo sapiens] gi|146218451|gb|AAI39908.1| Matrilin 3 [Homo sapiens] Length = 486 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 68/232 (29%), Gaps = 27/232 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G R +P + + G + F+ SS + V+ ++ Sbjct: 49 GRRPSPAAPDGAPASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSR 108 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I ++ D R+ + V + + +K T Sbjct: 109 IIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGRITPLSTGTMS 162 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+QTA D + K +++TDG + ++ + +A++ GI Sbjct: 163 GLAIQTAMDEAFTVEAGAREPS---SNIPKVAIIVTDG---RPQDQVNEVAARAQASGIE 216 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + L AS F + +L+ F++ Sbjct: 217 LYAVGV------DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 262 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 64/176 (36%), Gaps = 22/176 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH-KLIRTIVKTF 327 + + +++ + I ++V ++G F+ K+ + ++K Sbjct: 340 NRIDKLGESVRAWI-----KNDVPTGSQLGMVMFSSTAHIVSELQVISDMKIRQEMMKKV 394 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 D + T I + A + ++ IVL+TDG N+ + Sbjct: 395 PKDLYSI--TCIGCGLDLAVQMLQEKGNNKTGG---------IIVLVTDGRNSAGYLDIS 443 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF--EANSTHELNKIF 441 + GIR++T+AF + L++ S++ + +S+ L + F Sbjct: 444 DVEEDIVKAGIRVVTVAFGSEADSNIER---LADVTGGKSYYIKDGDSSEALQRAF 496 >gi|163751746|ref|ZP_02158964.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] gi|161328398|gb|EDP99557.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] Length = 447 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 18/179 (10%), Positives = 49/179 (27%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F +S + G ++ + + +L V + +D + L+ A ++ + A+ Sbjct: 13 FKQFSLAKLGKQGGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSALYAAK 72 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L +R + + E + + + Sbjct: 73 ILQNGGSLFDAREAATLILMQNLGFEENSELNTSVDLSSPDYNATQVTANIFIEFSLWPD 132 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 V S + + ++ L + + K A V+ ++ + Sbjct: 133 PFIPVFDDSSKYVRIRVENVGLNNFFAQILNFDKKVRASAVAGKSTDIECLNRVVPMMV 191 >gi|162452618|ref|YP_001614985.1| keratin associated protein 5-1 [Sorangium cellulosum 'So ce 56'] gi|161163200|emb|CAN94505.1| keratin associated protein 5-1 [Sorangium cellulosum 'So ce 56'] Length = 1031 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 42/182 (23%), Gaps = 27/182 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLI--------RTIVKTFAIDENEMGSTAINDA 342 ++ F D + G T+ + Sbjct: 653 NCNSSSSCGVAFGDGCRNALIPFDGTAARRLTEDPAPWDQQENALVNALFPGPGTSGDTP 712 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKK--YIVLLTDGENTQDNEEGIAICNKAKSQ---- 396 M A N ++ +VL+TDG + + I A++ Sbjct: 713 MYPALHGAT-----TWAINYKNAHPEEEVAVVLVTDGIPSDCDTVPNHIAALARNAFANH 767 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH----ELNKIFRDRIGNEIFER 452 G+R+ + F + L F N+ L G + Sbjct: 768 GVRVHAVGFGSS----NAEIINLIADQGGGQAFNLNAGTTLQGNLLNALVSIRGEALPCD 823 Query: 453 VI 454 V Sbjct: 824 VT 825 >gi|156120138|ref|NP_001095285.1| complement C2 [Sus scrofa] gi|148724911|emb|CAN87699.1| complement component 2 [Sus scrofa] Length = 752 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 61/201 (30%), Gaps = 23/201 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + + ++ I + + + F + S + + +V + Sbjct: 270 DFGVFKRSASLLVDRIFSFE---INISVAIITFASKPKIIMSVLKDKSRDVTEVVHSLEN 326 Query: 330 ----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 D T I +A+ + Y + + ++ E + I+LLTDG++ Sbjct: 327 INYKDHENGTGTNIYEALNSVYIMMNNQMQNLGMNTMAWQEIRHAIILLTDGKSNMGGS- 385 Query: 386 GIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + K I+ I I + + S F T Sbjct: 386 PKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEVDWRELNELGSKKDGERHAFILKDTEA 445 Query: 437 LNKIFRDRIGNEIFERVIRIT 457 L+++F V ++T Sbjct: 446 LSQVFEHM------LDVSQLT 460 >gi|38455778|gb|AAR20890.1| complement C2 [Sus scrofa] Length = 734 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 61/201 (30%), Gaps = 23/201 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + + + ++ I + + + F + S + + +V + Sbjct: 270 DFGVFKRSASLLVDRIFSFE---INISVAIITFASKPKIIMSVLKDKSRDVTEVVHSLEN 326 Query: 330 ----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 D T I +A+ + Y + + ++ E + I+LLTDG++ Sbjct: 327 INYKDHENGTGTNIYEALNSVYIMMNNQMQNLGMNTMAWQEIRHAIILLTDGKSNMGGS- 385 Query: 386 GIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + K I+ I I + + S F T Sbjct: 386 PKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEVDWRELNELGSKKDGERHAFILKDTEA 445 Query: 437 LNKIFRDRIGNEIFERVIRIT 457 L+++F V ++T Sbjct: 446 LSQVFEHM------LDVSQLT 460 >gi|320159019|ref|YP_004191397.1| hypothetical protein VVM_02412 [Vibrio vulnificus MO6-24/O] gi|319934331|gb|ADV89194.1| uncharacterized protein [Vibrio vulnificus MO6-24/O] Length = 688 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 16/144 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F++RV + V++ T G T + DA++ A + S Sbjct: 343 DRFRILMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYDALERAVSGLDSDR 402 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 I+L+TDG E + +R+ T F + + Sbjct: 403 TTG-------------IILVTDGVANVGVTEKKQFLKLMQRYDVRLYT--FIMGNSANTP 447 Query: 415 ARYFLSNCASPNSFFEANSTHELN 438 ++ S +++ ++ Sbjct: 448 LLEPMTQ-VSNGFATSISNSDDIL 470 >gi|320106177|ref|YP_004181767.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924698|gb|ADV81773.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 370 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/231 (12%), Positives = 63/231 (27%), Gaps = 32/231 (13%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 L ++ + ++ K R AL+ +R+ Sbjct: 119 GLERDNFYLYENNQPQTIKTFSTDDAPVSIGIVFDLSGSMMSKFGRARKALSEFMRTSNP 178 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D FNDR ++ V + +V TA+ DA Sbjct: 179 QD------EFFVVGFNDRPAVIVDYTSNVDDVDARMV-----MLRPERRTALIDAAYLGL 227 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA--- 404 + + + +K +++++DG + + + + +I +I Sbjct: 228 NKLKDA-----------KYERKALLIISDGGDNRSRYVESELRRAVRESDTQIYSIGIFD 276 Query: 405 -FSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 ++ ++ + C + F EL I I + Sbjct: 277 VYAATPEEKSGPTLLMDICEMTGGRMFRVTDADELGD-----IAARISAEL 322 >gi|218438801|ref|YP_002377130.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218171529|gb|ACK70262.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 573 Score = 48.1 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 44/130 (33%), Gaps = 23/130 (17%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + DR +L + G+TA+ D M ++ Sbjct: 437 LVTYGDRAAEVVPLGL-FDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQK---- 491 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS----QGIRIMTIAFSVNKTQQEK 414 KNN + + Y++LLTDG+ ++ K G+R+ IA+ + Sbjct: 492 ---KNNPDGRFYLLLLTDGQANMGVTF-----DEVKEVIEYSGVRVYPIAYG------DV 537 Query: 415 ARYFLSNCAS 424 + L AS Sbjct: 538 NQEELEAIAS 547 >gi|319649593|ref|ZP_08003749.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] gi|317398755|gb|EFV79437.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] Length = 461 Score = 47.7 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 44/113 (38%), Gaps = 19/113 (16%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G T I +++ AY + + +K+I+LLTDG+ + N + + Sbjct: 6 SITPGGGTEIFTSLEQAYSEL-----------EELQLQRKHIILLTDGQ-SATNGDYELL 53 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 K + I + T+A Q+ R L + A F++ + + I Sbjct: 54 IEGGKEKNITLSTVALG-----QDADRGLLEDLAGMGLGRFYDVTDSSIIPSI 101 >gi|291295671|ref|YP_003507069.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470630|gb|ADD28049.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 744 Score = 47.7 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 43/451 (9%), Positives = 108/451 (23%), Gaps = 65/451 (14%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 + + V + + L ++L + + + + Sbjct: 80 VAFASGAIQVAGPTARR------LD---LGEGTRLGEALAKARELKADRIVLVSDGLFQD 130 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + + V T + + +V + F G+ Sbjct: 131 RAEPIGIPIYSLYQPPSPNLSVVLTGPALPAKGETVEVRAVLESTTAVRAQLTFNGPAGV 190 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS-----RSMLDYQRDSEGQPLNCFGQPA 208 +Q + +V + + ++ ++ + DS + A Sbjct: 191 VERAVQLEPGRSSVGYRFRLDGPSTVTVRVESPLGLEQGRLELSPTDSTRVWVLGDAALA 250 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYM-----------------------------VSCNKS 239 S + I + + + + Sbjct: 251 RYLRAQGFSVEERKEITLPIRAEVVALGVGARDLSAAELDALQSFLNQGGSLLWTATPQG 310 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS------VIRSIKKIDNVND 293 L++ + V+ ++ + L V S++ I + Sbjct: 311 LFFGGWERTSLADSLPVEPVEEPGGVGIVLVLDVSGSMLEDDKLGLAVTGSLELIRSARP 370 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 +G F+DR + + G T I A A + + Sbjct: 371 QDYIGVVVFSDRPRWLFRPRPMTEQGRKEAESLLLST-QAGGGTMIRRAYLEALEAL--- 426 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K ++ LTDG + +A S I+ T+A + + Sbjct: 427 --------EQVPTESKQVIALTDGLAADVTPDLFDAAREA-SPRIKTNTVAIGADADGRF 477 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDR 444 A ++++ +L + F + Sbjct: 478 LRELAQ---AGDGTYWDVPRPEDLPRFFLEE 505 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 47.7 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 46/152 (30%), Gaps = 22/152 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ R ++ + G T+I ++TA+ I Sbjct: 344 WVGMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSA-GGTSICSGLRTAFTVI----- 397 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C + K G I T+A K Sbjct: 398 -----KKKYPTDGSEIVLLTDGEDN-----TISSCFDLVKQSGAIIHTVALG---PAAAK 444 Query: 415 ARYFLSNCASPNSFFEANSTH--ELNKIFRDR 444 LS + ++ L F Sbjct: 445 ELEQLSKMTGGLQTYSSDQVQNNGLVDAFAAL 476 >gi|332253805|ref|XP_003276022.1| PREDICTED: matrilin-3 [Nomascus leucogenys] Length = 488 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 51/168 (30%), Gaps = 22/168 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAM 343 I +D R+ + V + + ++ T A+ Sbjct: 112 IDTLDIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLRQAVGRITPLSTGTMSGLAI 168 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 QTA D + E + K +++TDG + ++ + +A++ GI + + Sbjct: 169 QTAMDEAFTV---EAGARDPSSNIPKVAIIVTDG---RPQDQVNEVAARARASGIELYAV 222 Query: 404 AFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 L AS F + +L+ F++ Sbjct: 223 GV------DRADLKSLKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 264 >gi|189238317|ref|XP_972173.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 604 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 38/124 (30%), Gaps = 6/124 (4%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I + + + + + + I+ LTDG + I Sbjct: 364 GCTNIIGGLVVGLYLVR-RTLQKFYEKNVETKHQPMIIFLTDGLPNEGISNPDKITKIVT 422 Query: 395 --SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 +QG I +++F + + + N +EA + F + + + Sbjct: 423 KINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRTVSSPL 482 Query: 450 FERV 453 V Sbjct: 483 LRDV 486 >gi|326426687|gb|EGD72257.1| hypothetical protein PTSG_00277 [Salpingoeca sp. ATCC 50818] Length = 2847 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 64/203 (31%), Gaps = 19/203 (9%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + ++ S V R L+ + ++ + VR+ ++ Sbjct: 819 ECVGRAIDVFYLIDGSGSIVSSDFERERTFLSDLTDALFAFP--GNDVRISIAEYSTTYT 876 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 T T + G+TA A+ A D I ++ +A Sbjct: 877 QVLVP---YTADETTAQNTISNVVQSGGATATGTALGLAADDIGANAR---------PDA 924 Query: 368 KKYIVLLTDGENTQDNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + +VLLTDG + +++ I + S G+ I I N L P Sbjct: 925 ARVLVLLTDGATSDGDQQNIDPSVSDLNSIGVSITAIGIGDNAD----ETELLQIAGDPT 980 Query: 427 SFFEANSTHELNKIFRDRIGNEI 449 F + +L + IG+ + Sbjct: 981 RVFNNIAFVDLGDFIDEIIGSVL 1003 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 43/137 (31%), Gaps = 14/137 (10%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + TA A+ A I++ N + Sbjct: 1295 TFCATQTLLTTFTGVTTDALNALDSAVNTVSCGTATGAALDFAIANILNDN--------S 1346 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 N A++ ++++TDG + +D+ A ++ ++ + I L A Sbjct: 1347 NPGARRVVIVVTDGASQEDSSFVTAASDRLHAEVDEVFAIGVG-----PASNLAELQTIA 1401 Query: 424 -SPNSFFEANSTHELNK 439 S ++ F L Sbjct: 1402 TSNDNAFIETDFDALRD 1418 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 78/231 (33%), Gaps = 19/231 (8%) Query: 223 GIRDEKLSPYMVSCNKSLYYM----LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 G R +P ++ M G S+ F+ SS + + L++ Sbjct: 421 GDRGAPGNPGPTGLKGNVGPMGLPGPAGGECVYGASDIAFILDSSGSILPEDYSLLKQFA 480 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V+ + ++ + R+ F+D FS+ TIV T + TA Sbjct: 481 ENVVDELVISESES---RIAVEIFSDVQEVSLQFSYD----KPTIVDTIRNLRHIQRGTA 533 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ A I SS++ NL V++TDG++ ++ + + G+ Sbjct: 534 TGAALELARRDIFSSSD----PNYRNLAVPAIAVVITDGQSQEEPAFVAEQALRLRELGV 589 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 ++ + E S + + +L + F I Sbjct: 590 TVLALGIGDGTDPDELLAIA----GSADRVEQVEQFGDLVQEFVRAFLQGI 636 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 55/202 (27%), Gaps = 20/202 (9%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S K+ ++ A + + +N + + + Sbjct: 336 SGSMNAETPKRIERLKVAAKNFVS---LAENGTELGIVSYASDAAVASGRTEVAIAPLGA 392 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R T I +Q A D I ++ + YIVL++DG N Sbjct: 393 NRAAWNNAIDGLGPSTRTNIGAGLQKARDLITAAGGVTANT---------YIVLMSDGLN 443 Query: 380 TQDNEEGIAICNK------AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + A + + GI + + + + + ++ Sbjct: 444 NEPAPQANADADLNGKIAMLLADGIPVYVTCTGSDLGLASQCSEI--GTGTGGHYVDSAD 501 Query: 434 THELNKIFRDRIGNEIFERVIR 455 + L + F D + +R Sbjct: 502 SARLPEAFADFHERIVAHEPVR 523 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/220 (9%), Positives = 56/220 (25%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + +A ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLVAQLVDKM-------QNDKVGMIVFAGDAFTQLPITND 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ + TAI A+ A + + I+++T Sbjct: 150 YISA-KMFLESISPSLISKQGTAIGAAISLATRSFTPQ-----------EGIGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A +GI++ + + Sbjct: 198 DGENHEGGV--AEAAKTATEKGIQVNVLGVGMPDGAPIPVEGTNDYRRDREGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + + ++T+ K + V Sbjct: 256 EEMCQEIAKAGNGIYVRVDNTNSAQKAISQEMNKMAKADV 295 >gi|311897983|dbj|BAJ30391.1| hypothetical protein KSE_46100 [Kitasatospora setae KM-6054] Length = 455 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 59/177 (33%), Gaps = 21/177 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + IR++ +D D + V K + KT Sbjct: 85 IDALPNETEFGIRTLGATYPGSDQ---------KEGCKDTQQLYRVGKTNKVEAKTAVAT 135 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I A++ A + + + IVL+TDGE+T + + Sbjct: 136 VRPTGWTPIGIALRAAAQDLGTGP------------TTRRIVLITDGEDTCAPPDPCDVA 183 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + SQGI ++ + + + + A+ +F + + +L K + + Sbjct: 184 RELASQGIHLVVDTLGLAHDDKTRQQLICIANATGGTFTDVRTQEQLTKRVKQLVNR 240 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 45/153 (29%), Gaps = 23/153 (15%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + ++A S + + ++G F+ + L + Sbjct: 283 GDSNRLGVAKNATLSALELLNPQS------QIGVIVFDTEETTVVPL----STLDIPAAQ 332 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G TAI + AY + + K+I+++TDG + + Sbjct: 333 IALDRVDTGGGTAIYPGLVAAYRELQ-----------RSESPAKHIIVMTDGLSQPGD-- 379 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 I + + G + +A V Sbjct: 380 WEGILRQITADGTTVSAVAIGVGADTGAAENIA 412 >gi|326916308|ref|XP_003204450.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 902 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 50/150 (33%), Gaps = 22/150 (14%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 FS + + I++ GST AM + + +++ + + + Sbjct: 491 FSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTSKGSRPNV------PRVM 544 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SF 428 +L+TDG+++ +E K + + I + ++ R L ASP Sbjct: 545 ILITDGKSSDAFKEPAI---KLRDADVEIFAVGV------KDAVRTELEAIASPPAETHV 595 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRITK 458 + F+ +RI + Sbjct: 596 YTVED----FDAFQRISFELTQSVCLRIEQ 621 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 23 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAV 77 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 78 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 131 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 132 ARAKEEGIAMYAVGVG------KAVEAELREIASEP 161 >gi|307318081|ref|ZP_07597517.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306896122|gb|EFN26872.1| TadE family protein [Sinorhizobium meliloti AK83] Length = 141 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 44/134 (32%), Gaps = 6/134 (4%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 F ++L +S +G + AL+ +L + +++ R Y + L A A + + Sbjct: 6 FILRRLFRSQSGATAVEFALVCLPLLLLVFGIIEFGRAFYVRNELSHAVDVA----ARRV 61 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + + T + E L T V I P Sbjct: 62 LIGQIARDATDSEALTKLAGAVRESFHSGDPTLLTIAVTKETVDGIAFRVLSIRYP--FT 119 Query: 129 YQVVLSSRYDLLLN 142 + V ++ + LN Sbjct: 120 FVVPGLAQSPVSLN 133 >gi|149638912|ref|XP_001511941.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Ornithorhynchus anatinus] Length = 3176 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 57/186 (30%), Gaps = 20/186 (10%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + L + + D + R+G ++ ++ + + ++ + Sbjct: 208 FKYILDFIAALVSAFDVGEEKTRVGVVQYSSDTRTEFNL--NQYYQRNELLSAIKKIPYK 265 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T DA+ + K +++TDG++ + E + Sbjct: 266 GGNTMTGDAIDYLIKNTFTDAAGARIGF------PKVAIVITDGKSQDEVEIP---AREL 316 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIF 450 +++G+ + ++ + L AS F + + I + I Sbjct: 317 RNRGVEVFSLGI------KAADAKELKQIASTPSLQHVFNVANFDAIVDIQNEIISQVCS 370 Query: 451 ERVIRI 456 ++ Sbjct: 371 GVDEQL 376 Score = 43.1 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 +K + +++ A + G+T A+ + + A+K VL+ Sbjct: 1306 NAYKDKKNLLEAVANLPYKGGNTLTGMALNFIRQ------NNFRPQAGMRPRARKIGVLI 1359 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG++ D E K + +G+ + + + L A+ + Sbjct: 1360 TDGKSQDDVEAPS---KKLRDEGVELFAVGI------KNADENELKAVATDPDDTHAYNV 1410 Query: 432 NSTHELNKIFRDRIGNEIFE 451 L+ I D N Sbjct: 1411 ADFSFLSSIVEDLTANLCNS 1430 >gi|87311196|ref|ZP_01093319.1| hypothetical protein DSM3645_16245 [Blastopirellula marina DSM 3645] gi|87286104|gb|EAQ78015.1| hypothetical protein DSM3645_16245 [Blastopirellula marina DSM 3645] Length = 179 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 16/139 (11%), Positives = 34/139 (24%), Gaps = 5/139 (3%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 K + G + A P++ V+ R + A A A+ Sbjct: 41 KIRPTKRSRSTQRHGAAIVEFAFAAPILFLFILASVEFGRLTMIR----HTADNAAYEAA 96 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + S + + + + + + + T + VR A + Sbjct: 97 RYAMVPGATSSEAKEKATQLLATIGARKVEVDVDPAVLQPETK-TITVTVRVPAAGNSWV 155 Query: 126 KSAYQVVLSSRYDLLLNPL 144 Y + L Sbjct: 156 VPKYFSSYAIEAKSHLATE 174 >gi|328882566|emb|CCA55805.1| putative exported protein [Streptomyces venezuelae ATCC 10712] Length = 543 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/261 (13%), Positives = 76/261 (29%), Gaps = 26/261 (9%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + D + +N F R + + + + + + Sbjct: 108 GRLPDPATVRPEEFVNSFRPDYPRP--ADNGFSVTLDGARAGSDGWSLVRVGLATRAADR 165 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + VD S + LV+++L + ++ D++ F+D Sbjct: 166 TGERPPAALTFVVDISGSMAEPGRLDLVKESLGLLADELRDDDSIA------LVTFSDEA 219 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + R V+ ST + ++T YD + + + Sbjct: 220 ETRLPMT--RVGEARGRVREVVNSLATTSSTNVEAGVRTGYDVAVDGHRKDATNR----- 272 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 +VLL+D E AI + K+ GI + + + R Sbjct: 273 ----VVLLSDALANTGATEAGAILERIEEERKAYGITLFGVGVGSDYGDAFMERLADR-- 326 Query: 423 ASPNSFFEANSTHELNKIFRD 443 +++ + K+F D Sbjct: 327 -GDGQTTYVSTSAQARKVFVD 346 >gi|308473880|ref|XP_003099163.1| CRE-CLEC-65 protein [Caenorhabditis remanei] gi|308267636|gb|EFP11589.1| CRE-CLEC-65 protein [Caenorhabditis remanei] Length = 376 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 51/160 (31%), Gaps = 11/160 (6%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D + V +N ++D L I + + ++ A+ Sbjct: 69 TGYTDPRSTRVGFITYNYNATDVADFYKLQSWADLDSQIQRLKMTPLARTTLSKMDTALY 128 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + I S+ KK +++ T + + K++GI ++T+ Sbjct: 129 AAINMINSTAGF-------RDNYKKMVIVFTSVHGSYQKNPPKDVSKILKARGIPVVTV- 180 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 N + +L N AS N + + +I + Sbjct: 181 ---NTGSSSDTQSYLKNIASDNMAYAMADGNATQEILKAM 217 >gi|95930867|ref|ZP_01313598.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95133109|gb|EAT14777.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 698 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 46/172 (26%), Gaps = 28/172 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + D ++ + + D R FN S + G V + Sbjct: 324 KIATLADGVSQTLGKLNSND------RFRIITFN---QSAADLTRGFVTATPEAVGQWIN 374 Query: 330 ---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 GST + ++TA + IVL+TDG E Sbjct: 375 RVKTIAAGGSTNLFAGLETACRRLDDDRTTS-------------IVLVTDGVANVGRTEQ 421 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 +R+ T + + R + S + + ++ Sbjct: 422 REFLQLLTEYDVRLFTFVIGNSANRPLLDR-LAKD--SGGFAMQISDVDDIQ 470 >gi|301767378|ref|XP_002919104.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Ailuropoda melanoleuca] Length = 1011 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 69/221 (31%), Gaps = 35/221 (15%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV----RDALASVIRSIKKIDNVNDTVRMG 298 P D ++ +D+S + + + + I ++ + Sbjct: 34 NSCPEKADCPINVYFVLDTSESVTMQSPIDSLLYHMKQFVRQFISQLQDETYLEQVALSW 93 Query: 299 AT---FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 F+D V +++ +I T + A+ I Sbjct: 94 RYGGLHFSDVVRVFSPPDSDRASFTKSLESIVSIR----KGTFTDCALANMTQEI----- 144 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEK 414 R + + V++TDG T GI + +A+ +GIRI T+A Q Sbjct: 145 ----RQLKSKGGVHFAVVITDGYVTGSPCGGIKLQAERAREEGIRIFTVA-----PDQVP 195 Query: 415 ARYFLSNCAS------PNSFFEANSTHELNKIFRDRIGNEI 449 L + AS N++ E++ +D I I Sbjct: 196 NEQGLRDMASMPLELYRNNYATVRPDLEID---QDTINRII 233 >gi|256261598|gb|ACU65921.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 50/145 (34%), Gaps = 13/145 (8%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+DR +F + G T A++ D ++S+ Sbjct: 190 LVQFSDRFQEHFTFKDFATSSDPLNLLNSVWQL--GGWTFTASAIRFVTDRLLSAA---- 243 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +A K ++++TDGE T+ + + +A++ GI I + + Sbjct: 244 --YGARKDASKILIVITDGEKTEK-VDYKEVIPRAEAAGIIRYAIGVGSAFQYRNSLQEL 300 Query: 419 LSNCASPN---SFFEANSTHELNKI 440 + + AS F+ + L I Sbjct: 301 I-DIASTPSKEHVFQVENFDALRDI 324 >gi|194016356|ref|ZP_03054970.1| YwmC [Bacillus pumilus ATCC 7061] gi|194011829|gb|EDW21397.1| YwmC [Bacillus pumilus ATCC 7061] Length = 233 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 49/145 (33%), Gaps = 23/145 (15%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +GV + + + G T I A++ A N H + L Sbjct: 108 YGVQPYEKESFENSLNELGPNGWTPIARALEHAKQADEQLNNGTKH----------IVYL 157 Query: 374 LTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +TDGE T + + + + + + I N + + + A +++A Sbjct: 158 ITDGEETCG-GDPVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAK--AGKGHYYQA 214 Query: 432 NSTHELNKIFRDRIGNEIFERVIRI 456 ++ E+ G+ + +++ Sbjct: 215 STGKEM--------GSILSAESLKL 231 >gi|327538080|gb|EGF24770.1| conserved hypothetical protein, secreted [Rhodopirellula baltica WH47] Length = 359 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 7/53 (13%), Positives = 23/53 (43%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 I+ +L+ + + G+L+D+ + ++ + A + L ++ + Sbjct: 2 ILVVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGADQTT 54 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/394 (9%), Positives = 108/394 (27%), Gaps = 36/394 (9%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 P + ++ + K + + ++ E+ Sbjct: 683 AGVFPDGSAWDDSKVSEYFNGNTLKYITTHNFKMFYMERGAGASNLKMKFNLEVIPTYEV 742 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS---------WLIQTKAEAETVSRSYHK 173 N K Q + + + S + + S T ET + + + Sbjct: 743 NFNKVTTQGAALAGAEFEIRDDSDSSKVYNVTSDSDGRVSVRLHEGTYTMTETQTPTGYL 802 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + ++ + + + + ++ + K++ Y Sbjct: 803 AASGTWKITVNADGTYTITKNGRPIDKGSDSVYKIVNKGQHEDAEANLTTSKTVKVTDYN 862 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH---LVRDALASVIRSIKKIDN 290 + G + ++ V R + +A + I ++K Sbjct: 863 KREYEITLGASTSGREAGTEAKAASVVLVLDRSGSMGADGMTALVNAADTFIDTLKTASP 922 Query: 291 VND----TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + ++ S + +++I + + G T + DA++ A Sbjct: 923 DSQVAVVYFNGTQDEDDNTTTSKNFTKLNTDEKVKSIKDFLSNNGYSYGGTPMGDALEKA 982 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNKAKS------QGI 398 + + +KY++ TDG ++ D+ + N A + Sbjct: 983 KGLLDADQTGN----------QKYVLFFTDGLPGHSSDDAFNCMVANSAVNCATDIKANA 1032 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS--FFE 430 I T+ + ++ T ++ A + ++ Sbjct: 1033 TIYTVGYHLSGTLYWHKGDSATSSAGRDHGGYYN 1066 >gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella oneidensis MR-1] gi|24348130|gb|AAN55237.1|AE015661_7 inter-alpha-trypsin inhibitor domain protein [Shewanella oneidensis MR-1] Length = 760 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 40/159 (25%), Gaps = 15/159 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 FN V I + F G T ++ A++ A Sbjct: 410 KAQDSFNIIEFNSDVSLLSPVPLPATAENLAIARQFVNRLQADGGTEMSLALEAALPKQR 469 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S + + ++ +TDG + I ++ R+ T+ Sbjct: 470 PSRAASENNVLQQ------VIFMTDGSVGNEEALFELIRHQIGDN--RLFTVGIGSAPNS 521 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 F+ A +F E+ + + Sbjct: 522 H-----FMQRAAELGRGTFTYIGDVDEVEQKINQLLTKI 555 >gi|291239280|ref|XP_002739551.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like, partial [Saccoglossus kowalevskii] Length = 289 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 26/87 (29%), Gaps = 3/87 (3%) Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +++LTDG+ T I + K+ + + F N + N Sbjct: 1 MLIMLTDGQPTSGEIHTDTIMDNVKAHSKDHYSLFCLGFGDNVDFLFLEKLSFENNGVAR 60 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERV 453 +E F D + + V Sbjct: 61 KIYEDTDASLQLSGFFDEVATPLLVDV 87 >gi|32472885|ref|NP_865879.1| hypothetical protein RB4059 [Rhodopirellula baltica SH 1] gi|32444122|emb|CAD73564.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 156 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 43/146 (29%), Gaps = 5/146 (3%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ + + G + A++ V+L M +++ R + + + A A Sbjct: 14 SRPCRVPARRAATRDGATLVEFAIVCNVLLLTIFMCMELARMNMARNLAQ---DAAYYAA 70 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ + A+ Y + N V + D V + Sbjct: 71 RTAIVPGATADEAIAEAETIMESLFASGYDVECTPINDDTEEVTVTVSLSLDD--VALFA 128 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRS 150 + L+S + + F R Sbjct: 129 PMFLGNLELTSSATMQTERYNGFFRV 154 >gi|149634622|ref|XP_001513644.1| PREDICTED: similar to AMACO [Ornithorhynchus anatinus] Length = 801 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 61/170 (35%), Gaps = 21/170 (12%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D ++G F ++ + + H +++ + G + A+ Y Sbjct: 576 FDINRDVTQIGLVVFGRQIRTVFAL--DTHPTGSGVLEAVSQMPFVGGVGSAGTALLHVY 633 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D +++ + + K +VL+TDG +D K +S G+ + I Sbjct: 634 DEVMTVQKGARPGV------SKAVVLITDGTGIEDAVVP---AQKLRSNGVSVFVIRVGP 684 Query: 408 NKTQQEKARYFLSNCASPNSFF-EANSTHEL---NKIFRDRIGNEIFERV 453 + L A S+ +A+S +L F DRI E + V Sbjct: 685 F------QKEALLRIAGSPSYLVQASSYKDLKSYEDSFIDRICEEAKQPV 728 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 56/179 (31%), Gaps = 21/179 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW-GVHKLIRTIVKTFAIDENEM 334 A V++ +D D VR+GA F+ + + + +K Sbjct: 83 TAKYFVVKLCDALDINPDRVRVGAVQFSVASWLEFPLDSCLTRQEAKDKIKKIVFR---G 139 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 GST A++ + N + ++L+TDG++ + + K Sbjct: 140 GSTETGLALKYIL--------WKGFPGGRNASVPQILILITDGKSQGNVTVP---AQQLK 188 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 +GI + + + L AS + +++ + + V Sbjct: 189 DRGITVFAVGV------RFPRWEELHLLASEPNEQHVLFAEDVDDAYNGLMSTLTSSTV 241 >gi|115372062|ref|ZP_01459374.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] gi|115371027|gb|EAU69950.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] Length = 540 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 20/184 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +DAL + + D F+ V + F Sbjct: 85 SRMQIAKDALKYCVTRLNPQDT------FNVVRFSTDVEALFPALKSAQPENIQKAVAFV 138 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +G TAI++A+ A ++ +TDG+ T + A Sbjct: 139 EQLEAIGGTAIDEALVRGLQ-----------DNDGKSSAPHLLMFITDGQPTIGETDEGA 187 Query: 389 ICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I AK R+ T + + R + + + F D++ Sbjct: 188 IAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDKV 247 Query: 446 GNEI 449 N + Sbjct: 248 SNPV 251 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/280 (12%), Positives = 82/280 (29%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + N F + S G+ S + S Sbjct: 91 VTLSNVSERKDNMRLGLSLATNPKDNSFLACSPLWSHECGSSYYTTGMCSRANSNFRFSK 150 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 151 TVAPALQRCQTYMDIVIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 200 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + + + + + I++ T ++ A Sbjct: 201 VGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEFA--------RS 251 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE + D+ + + +++ + +A ++ Sbjct: 252 EAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINP 310 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G+ I Sbjct: 311 ETFLNEIKYIASDPDDKHFFNVTDEAALKD-IVDALGDRI 349 >gi|319652226|ref|ZP_08006344.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] gi|317396049|gb|EFV76769.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] Length = 463 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 69/238 (28%), Gaps = 19/238 (7%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + + PY+ KS + + Sbjct: 116 ASPNYKNIIEKSNSYKPDFEEPYLPDPTKSEPGEEKKASEQAIILLDASSSMLLQAGGKM 175 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS----WGVHKLIRTIVK 325 K + + A+ S ++I + V+ V D + + K + Sbjct: 176 KMDIAKSAVKSFAQTIGQSSEVSLVVYGHKGSEADADKEISCSGVEEVYPMGKYSKKEFH 235 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T + A+Q A + + + I +++DG T D + Sbjct: 236 EAVDSFESKGWTPLAGAIQKAAEM------------SSGYDGSTTIYIVSDGAETCDGDP 283 Query: 386 GIAICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 A N K+ + I F V+ + + + A + +A++ EL + Sbjct: 284 VSASKNLVKNNSSNSVNIIGFGVDGKAENQLKAVAE--AGNGEYLKADNPDELKNTIQ 339 >gi|312139254|ref|YP_004006590.1| lipoprotein [Rhodococcus equi 103S] gi|311888593|emb|CBH47905.1| putative lipoprotein [Rhodococcus equi 103S] Length = 314 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 11/42 (26%), Positives = 19/42 (45%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 G ++ A+LM V+LG + VD+ + L+ A Sbjct: 4 DERGVVAVLVAILMVVLLGCAAISVDIGANYVVKRQLQNGAD 45 >gi|297698645|ref|XP_002826428.1| PREDICTED: integrin alpha-X-like, partial [Pongo abelii] Length = 836 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHFTFEKFRSSSNPLSLLDSVHQLR--GLTHTATAIQNVVHRLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D + + A + GI I Sbjct: 244 ------YGARRDAAKILIVITDGKKEGDRLDYKDVIPMADAAGIIRYAIGVGSAFEYINS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFEALKD-IQNQLKEKI 333 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 59/180 (32%), Gaps = 18/180 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + AL + ++ ID+ FN V + + F Sbjct: 361 ITQAKQALQFALAGLRDIDS------FNIIEFNSDVTMLSATPLSANSRNIGKANRFIQS 414 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G T + A+QTA ++ S + + + + E + ++ +TDG ++E I Sbjct: 415 LDADGGTEMRSALQTA---LVDSVQQDSDQTDAHSEMLRQVIFMTDGAVGNEHELYQLIN 471 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 ++ R+ T+ F+ A +F + E+ + + Sbjct: 472 DQLGDS--RLFTVGIGS-----APNSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKI 524 >gi|67922256|ref|ZP_00515770.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] gi|67855959|gb|EAM51204.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] Length = 416 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 42/134 (31%), Gaps = 18/134 (13%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQD 382 VK G T+I++ M+ + ED V + I LLTDGEN D Sbjct: 103 VKDAIASLKAEGGTSIDEGMKLGIKQVALGKEDRVSQ----------IFLLTDGENEHGD 152 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKI 440 NE + + A I + T+ F + L + A + + Sbjct: 153 NERCLKLAQVAGEYNITLNTLGFG-----NHWNQDVLESIADSVGGTLCYIEQPEQALTE 207 Query: 441 FRDRIGNEIFERVI 454 F + Sbjct: 208 FSRLFTRMQSVGLT 221 >gi|301778757|ref|XP_002924796.1| PREDICTED: integrin alpha-M-like [Ailuropoda melanoleuca] Length = 1153 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/241 (12%), Positives = 64/241 (26%), Gaps = 20/241 (8%) Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 ++ G+ P + +S +D S Sbjct: 102 RLLACGPTVHQTCKENTYVNGLCFL-FGPNLWQQPQSF-PEKIRECPRQDSDIAFLIDGS 159 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + I +++ +++V+ K + ++ F++ K Sbjct: 160 GSINPI-DFQRMKEFVSTVMDRFKNSKTLFSLMQF-------SEDFQTHFTFNEFKANPN 211 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T + + S+ K +V++TDGE D Sbjct: 212 PRFLVNAIIQLYGRTHTATGILKVVRELFHSSSGARENAL------KILVVITDGEKFGD 265 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNK 439 + + +A +GI I + L+ AS + F N+ L Sbjct: 266 PLDYKDVIPEADREGIIRYVIGVGEAFDNPKHREE-LNTIASKPARDHVFRVNNFEALKT 324 Query: 440 I 440 I Sbjct: 325 I 325 >gi|281352695|gb|EFB28279.1| hypothetical protein PANDA_014199 [Ailuropoda melanoleuca] Length = 1110 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/241 (12%), Positives = 64/241 (26%), Gaps = 20/241 (8%) Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 ++ G+ P + +S +D S Sbjct: 57 RLLACGPTVHQTCKENTYVNGLCFL-FGPNLWQQPQSF-PEKIRECPRQDSDIAFLIDGS 114 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + I +++ +++V+ K + ++ F++ K Sbjct: 115 GSINPI-DFQRMKEFVSTVMDRFKNSKTLFSLMQF-------SEDFQTHFTFNEFKANPN 166 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T + + S+ K +V++TDGE D Sbjct: 167 PRFLVNAIIQLYGRTHTATGILKVVRELFHSSSGARENAL------KILVVITDGEKFGD 220 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNK 439 + + +A +GI I + L+ AS + F N+ L Sbjct: 221 PLDYKDVIPEADREGIIRYVIGVGEAFDNPKHREE-LNTIASKPARDHVFRVNNFEALKT 279 Query: 440 I 440 I Sbjct: 280 I 280 >gi|254304217|ref|ZP_04971575.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324409|gb|EDK89659.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 529 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 44/139 (31%), Gaps = 18/139 (12%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + + L ++ G T+I +++ + + + + Sbjct: 236 DESCGANELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDL---------KALD 286 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSN 421 + + ++TDG T + I + K + I + I F+V+ Q L Sbjct: 287 GEKTLNILYIITDGIETCG-GNPVEIAKQLKGENTNIVLGIIGFNVDANQN----RLLKQ 341 Query: 422 CAS--PNSFFEANSTHELN 438 A + N +L Sbjct: 342 IADAAGGYYSSVNDADKLT 360 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 57/189 (30%), Gaps = 29/189 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I ++ +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQAGQLFLLQIIEL-----RSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I ++ A+ I K IVLLTDGE+ Sbjct: 367 DRDTLTKRLPT-AASGGTSICSGLRLAFTVI----------KKKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + ++ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAARELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDR 444 L F Sbjct: 467 GLIDAFGAL 475 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 47/160 (29%), Gaps = 22/160 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ R ++ + G T+I ++TA+ I Sbjct: 344 WVGMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSA-GGTSICSGLRTAFTVI----- 397 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C + K G I T+A K Sbjct: 398 -----KKKYPTDGSEIVLLTDGEDN-----TISSCFDLVKQSGAIIHTVALG---PAAAK 444 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 LS + +S N F D Sbjct: 445 ELEQLSKMTGGLQTY--SSDQVQNNGFVDAFAALSSGNAA 482 >gi|270007556|gb|EFA04004.1| hypothetical protein TcasGA2_TC014153 [Tribolium castaneum] Length = 813 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 45/134 (33%), Gaps = 9/134 (6%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ D++ +G + A++ + +D + + I+ LTD T Sbjct: 387 AARSTIYDKSGLGLSNPVYALEVGLF-LAKRIQDNLPN-----RYQPMIIFLTDSYPTVG 440 Query: 383 NEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I N K + I I +++F + + + N +EA Sbjct: 441 MTSQNEIINTVTKVNNNRIPIFSLSFGEDVDKNFMRQLAAKNLGFSGHIYEALDASVQIL 500 Query: 440 IFRDRIGNEIFERV 453 F I + + +V Sbjct: 501 NFYRSISSPVLSQV 514 >gi|238060066|ref|ZP_04604775.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881877|gb|EEP70705.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 316 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 60/195 (30%), Gaps = 29/195 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A + + T +G F + + Sbjct: 107 NRLEAAQEAAKQFVAEL------PQTYNLGLVSFAKSANVLVPPTKDRDAVTTA-----I 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TA +A+ T + I S IVLL+DG T Sbjct: 156 DGLVLAEATATGEAVFTCLEAIRSV-----PADGAAGIPPARIVLLSDGFRTSGRSV-EE 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS----NCA-------SPNSFFEANSTHEL 437 A++ + + TIAF + Q + A + F+EA S EL Sbjct: 210 AAAAAQAANVPVSTIAFGTDAGQVDIGGQLQRVPVDRMALAELAETTEGYFYEAASVSEL 269 Query: 438 NKIFRDRIGNEIFER 452 ++++D G+ I R Sbjct: 270 KQVYQDM-GSSIGFR 283 >gi|224054051|ref|XP_002190865.1| PREDICTED: collagen, type VI, alpha 1 [Taeniopygia guttata] Length = 1023 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 64/173 (36%), Gaps = 24/173 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+D I + + D + A ++D V+ S + + Sbjct: 63 VTQVKDFTNQFIDKLTQRYYRCDRNLVWNAGALHYSDEVVLIKSLTPMPSGQSELKNRVS 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 AI+ T + A++ + ++ ++ + KY++++TDG + +E Sbjct: 123 AINYI-GKGTYTDCAIKRGIEELL--------IGGSHHKENKYLIVVTDGHPLEGYKEPC 173 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANS 433 N+AK GI++ ++A L+ A+ +++ F A S Sbjct: 174 GGLDDAANEAKLLGIKVFSVAI-----SPNHLDQRLNIIATDHAYRRNFTATS 221 >gi|189237279|ref|XP_973594.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 45/134 (33%), Gaps = 9/134 (6%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ D++ +G + A++ + +D + + I+ LTD T Sbjct: 387 AARSTIYDKSGLGLSNPVYALEVGLF-LAKRIQDNLPN-----RYQPMIIFLTDSYPTVG 440 Query: 383 NEEGIAICN---KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I N K + I I +++F + + + N +EA Sbjct: 441 MTSQNEIINTVTKVNNNRIPIFSLSFGEDVDKNFMRQLAAKNLGFSGHIYEALDASVQIL 500 Query: 440 IFRDRIGNEIFERV 453 F I + + +V Sbjct: 501 NFYRSISSPVLSQV 514 >gi|159900441|ref|YP_001546688.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893480|gb|ABX06560.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 978 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 46/152 (30%), Gaps = 15/152 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D + F+ V + G + + A G ++D++ A + + Sbjct: 445 RDFDEITVLPFDSAVQNQYGPVAG-SEREVAQGEIIARGVTGGGGINVHDSLVAAGNVL- 502 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 K ++I+LL DG ++Q E + + ++ + GI TIA Sbjct: 503 ----------KGRNAPIRHIILLADGSDSQQQENAVRLTDEHRRLGITTSTIAIGNGGDV 552 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 A F L I Sbjct: 553 GFLNNV---AVAGGGRHFLVEDALSLPDIVLQ 581 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 LV+ + ++ + + R+G F+ RV ++ G + + + Sbjct: 23 FELVKRFVNQIVDFLDVS---PEGTRVGLVQFSSRVRTEFPL--GRYGTAVEVKQAVLAM 77 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T A++ + S + R N + ++ TDG +QD+ Sbjct: 78 EYMERGTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WA 131 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +AK +GI + + + L AS Sbjct: 132 ARAKEEGIVMYAVGVG------KAVEAELREIASEP 161 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 38/123 (30%), Gaps = 20/123 (16%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + V G T I A+ AY + +K+I+LLTDG++ Sbjct: 463 NKDEVIETIRSTALGGGTDIFPALNQAYQQLNEM-----------DLKRKHIILLTDGQS 511 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 I + + + + T+A + L A F+E + Sbjct: 512 NDGPY--EEIIEEGLTNNVTLSTVAIGGDAD-----TSLLEELAEIGTGRFYEVYEASAV 564 Query: 438 NKI 440 I Sbjct: 565 PSI 567 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 27/200 (13%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF----NDRVISDPSFSWGVH 317 + + K + AL V+ ++ V V +D + Sbjct: 51 AKDPSGLTKIEAAKRALTGVVGALPDTAQVGLRVYGAKVDGKGKPTPAACADTQLVHPIA 110 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 L + + + +G T I ++ A + K+ IVL++DG Sbjct: 111 TLDKPKLTSTIAAIKALGETPIAHSLTEALKDL-------------GTSGKRNIVLVSDG 157 Query: 378 ENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 E + + K + G+ +I T+ F VN + + + A ++++A Sbjct: 158 EESC-VPDPCPAITKLTAAGVDLQIDTVGFGVNTKARAQLQCI--AAAGKGTYYDAKDAS 214 Query: 436 ELNKIFRDRIGNEIFERVIR 455 L +++ +R +R Sbjct: 215 ALTTSL-----SKLSQRALR 229 >gi|262171234|ref|ZP_06038912.1| hypothetical protein VII_002050 [Vibrio mimicus MB-451] gi|261892310|gb|EEY38296.1| hypothetical protein VII_002050 [Vibrio mimicus MB-451] Length = 406 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/169 (11%), Positives = 50/169 (29%), Gaps = 6/169 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I + + G ++ M ++ + + +DV L+ A A + A++ Sbjct: 3 ITQHRNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMV 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ T + + + NF D + + Sbjct: 63 ADATHDTATATDVAKSTLTS--MHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGD 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 Y V + L F++++G+ + + S + Sbjct: 121 IYVRVSVADLSLT----EFFMQALGMAKQVSAAAVAGPSSSINTIGNVV 165 >gi|258621449|ref|ZP_05716483.1| hypothetical protein VMD_15290 [Vibrio mimicus VM573] gi|258586837|gb|EEW11552.1| hypothetical protein VMD_15290 [Vibrio mimicus VM573] Length = 406 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/169 (11%), Positives = 50/169 (29%), Gaps = 6/169 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I + + G ++ M ++ + + +DV L+ A A + A++ Sbjct: 3 ITQHRNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMV 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ T + + + NF D + + Sbjct: 63 ADATHDTATATDVAKSTLAS--MHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGD 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 Y V + L F++++G+ + + S + Sbjct: 121 IYVRVSVADLSLT----EFFMQALGMAKQVSAAAVAGPSSSINTIGNVV 165 >gi|258627502|ref|ZP_05722283.1| hypothetical protein VMB_35840 [Vibrio mimicus VM603] gi|258580308|gb|EEW05276.1| hypothetical protein VMB_35840 [Vibrio mimicus VM603] Length = 406 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/296 (9%), Positives = 75/296 (25%), Gaps = 9/296 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I + + G ++ M ++ + + +DV L+ A A + A++ Sbjct: 3 ITQHRNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMV 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + ++ T + + + NF D + + Sbjct: 63 ADATHDTATATDVAKSTLTS--MHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGD 120 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 Y V + L F++++G+ + + S + S Sbjct: 121 IYVRVSVADLSLT----EFFMQALGMAKQVSAAAVAGPSSSINTIGNVVPIGVCKGSESG 176 Query: 188 SMLDYQRDSE---GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + Y + +G +R+ Y + Sbjct: 177 GIYGYHSSDVYALKVGDSSMTAMGSGNYNLLDFGSGADTVREALGGGYEGTVEIGGDIDT 236 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 G + + + + D + + + +D+ V A Sbjct: 237 QTGVTTGPVGQGINTRLGIYQGGVSSSDYPPDYVTELPATPATLDSTGHVVYDYAN 292 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 63/170 (37%), Gaps = 32/170 (18%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F++ ++ L +++ D T + A++ AY+ +++S + Sbjct: 118 RIGIIIFSNDAYIHVPLTYDAAAL-ELFIQSLQTDLLPTNGTNVCGAIEMAYNKLMNSAD 176 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT----- 410 K +VL TDGEN+ + N + GI + ++A Sbjct: 177 PTSRA--------KMMVLFTDGENSSSCTNALF--NNLRRFGIGVYSVAVGTKVGISIQE 226 Query: 411 -------------QQEKARYFLSNCA--SPNSFFEANST-HELNKIFRDR 444 + FL A S S++E N++ +++ K+ D Sbjct: 227 NGKPLKDKNDKLVISKLDENFLRGIANSSRGSYYELNNSKNDIQKLISDI 276 >gi|209886335|ref|YP_002290192.1| hypothetical protein OCAR_7223 [Oligotropha carboxidovorans OM5] gi|209874531|gb|ACI94327.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 610 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/169 (10%), Positives = 49/169 (28%), Gaps = 10/169 (5%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + ++ + G+ I A M ++ G + VDV + + Sbjct: 59 SGARGQGLLRRFAANERGNVAFIAAACMVLVTGCAALGVDVGAVFTDKRRAQS------- 111 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + ++ S ++ S + + + ++ + Sbjct: 112 ---AADLAAIVAASDLSRASRAAAATVAKNHYPPDSLVAVETGVYTANTSLGPQQRFSAG 168 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 SA +V + +R L + ++ ++ + T + T + Sbjct: 169 TTPASAVRVTMQTRTPLFFGKVLTGDPTVNLRVTSVATTTQLATFAIGS 217 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/126 (23%), Positives = 50/126 (39%), Gaps = 20/126 (15%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG------ENTQDNEEGIAIC 390 T I A++ A D +MK + + + +++LLTDG +N + +E I Sbjct: 105 TNIGAALEKA-------AYDHKQQMKADQDYQTHVILLTDGMVDIDRDNRLNKKERQRIL 157 Query: 391 NKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 N + GI + TIA S N ++ + L A+ A + EL +F Sbjct: 158 NNVLPMYQQSGITLHTIALSDNADKKLLNKLAL---ATDGKVSVAKNAEELMNVFLRVFN 214 Query: 447 NEIFER 452 + Sbjct: 215 QAVPLE 220 >gi|296190554|ref|XP_002806559.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Callithrix jacchus] Length = 3582 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 37/126 (29%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTSRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVLFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 58/169 (34%), Gaps = 18/169 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN--EMGSTA 338 + IDN+ R+ F+D I F K + +VK + G T Sbjct: 135 FKATSSLIDNLEGNRRIAFMTFDDSPILQFDFMEATTKEQKEVVKAKIASYQQNDDGQTG 194 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI-AICNKAKSQG 397 + D + AY+ I + N+ +++++DG + D+ I A+ + Sbjct: 195 VRDMINEAYELIQN----------NSKNHSGSLIMISDGAPSDDSASNIPALVSNYVQNN 244 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 I I TI +L + A + + + T + F Sbjct: 245 IPIYTIGMMYGDNSA---EQYLIDIANLTGGQHYSTSDTTMIAGAFGQI 290 >gi|84996511|ref|XP_952977.1| thrombospondin-related protein [Theileria annulata strain Ankara] gi|71532874|emb|CAJ20069.1| thrombospondin-related protein, putative [Theileria annulata] Length = 606 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 39/314 (12%), Positives = 81/314 (25%), Gaps = 24/314 (7%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + P+ R + + E + S + I S+ Sbjct: 99 SVLPMETLDRLSEAITKNSEHPVTFEGLDGGSVVVTNNSDTFSIKLYPSLPGLNLTPGML 158 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 P + T K + D S Y L V Sbjct: 159 PTTKPNSHINFTGNHNEHALVKHALHDLSSSNYDKGLY-PDGIKKPSSYCHRELDLTILV 217 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D SS + + + ++ I+ + V + F+ + SF Sbjct: 218 DESSSIY----IEEWNKLIPFLKSLVRSINISPNYVHLSMVTFSTSIRWLISF-LDPASK 272 Query: 320 IRTIVKTFAIDENEM----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + G T A+ + + + + K I+++T Sbjct: 273 DEQLALAVLDKLKNSKPVFGYTFTGQALNFISEAV--------YMFGARRNSPKGIIIIT 324 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANS 433 DG +TQ N + G+ I+ + K + + C++ F + Sbjct: 325 DGSSTQTN-VTSQASALLRDAGVTILVVGVGKAKESECRGIVG---CSTKGECPLFFMTN 380 Query: 434 THELNKIFRDRIGN 447 E+ + + + Sbjct: 381 WDEIIRKVGELMAE 394 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/367 (11%), Positives = 101/367 (27%), Gaps = 14/367 (3%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR--YDLLLN 142 F K + E + ++ ++ + + N + V L++ + Sbjct: 518 FGKYTVTETVPMEYKLESTESGYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFK 577 Query: 143 PLSLFLRSMGIKSWLIQT-KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + S I + +S + + ++D S SM + S Sbjct: 578 ATDEKTNYLPSGSETINNYEYNVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGSSRLTH 637 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 T + + +V + S N ++ + + ++ Sbjct: 638 LKNNVIKFITKLYEHNPDSRVSVITFAYSADGSITNN--NFVKLSDIKSGNETWYTYLTK 695 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ K + ++ + F D + F+ + Sbjct: 696 NNGGIKNIKASGGTQIDLGLYEVRNQLSSATGENNRSVIVFTDGQPGNKGFNTSYNDYDD 755 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + A N+ + + + I SSN + + KN+ K N Sbjct: 756 NGYRVGAEALNQADFIKFSGNLTGINNYIESSNGSKYYGHKNDDITKNRS------NNNS 809 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 ++ K G I TI + FL+ AS + +AN++ + F Sbjct: 810 NDAGNRTN-RSGKGLGKTIFTIGLNS--NNSSLFDSFLTRLASEGHYTKANNSSAMENAF 866 Query: 442 RDRIGNE 448 + Sbjct: 867 NSIFTSI 873 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 36/289 (12%), Positives = 79/289 (27%), Gaps = 18/289 (6%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 A ++ + G I V D S SM + + + + + + Sbjct: 73 ATGKAQDSVEAIGNDIVLVFDISNSMAEDEHGNSTSSNDKKRL--TKAKNAAIEFLNNSK 130 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 I K + Y + + Y + + + + L + ++ L Sbjct: 131 ISGNKKNRYSI-VTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKART 189 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID--ENEMGSTAIND 341 +K + N + + + + + + T T + Sbjct: 190 VLKNAKSENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGEN 249 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ N+ + NN N AK G I Sbjct: 250 AINYDSVIKGGRNDFTLDLYLNNSHYS---------LNNAAATLAENEALLAKKSGNTIF 300 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 TI ++ + FL N A+ + +S+ +L+ I+ + + Sbjct: 301 TIGYTTGSSVNS----FLKNVATQGEGYAYSSSSDLSGIYENIANEIVT 345 >gi|288928459|ref|ZP_06422306.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331293|gb|EFC69877.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] Length = 554 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/196 (10%), Positives = 52/196 (26%), Gaps = 29/196 (14%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + V + + + +++ + R+G F + Sbjct: 97 DISNSMMAQDVVPSRLEKSKLLIENLVDNFT-------HDRIGLVVFAGDAFVQLPITTD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++ T I A+ + + + K I+++T Sbjct: 150 YVSA-KMFLQNIDPALIATQGTDIAKAINLSMRSFSQQ-----------KDIGKAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DGE+ + A +GI + + K S + S Sbjct: 198 DGEDHEGGAL--EAAKAANERGIHVFILGIGSTKGSPIPT--------SEGGYLTDRSGQ 247 Query: 436 ELNKIFRDRIGNEIFE 451 + + + +I + Sbjct: 248 TVLTALNESMCKQIAQ 263 >gi|194218991|ref|XP_001915421.1| PREDICTED: similar to integrin, alpha D [Equus caballus] Length = 1160 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 60/169 (35%), Gaps = 17/169 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D + +V+ + ++ + + +FS L + + Sbjct: 170 KDFVRAVMGQFEGT-----NTLFSLMQYSSGLKTHFTFSKFRSSLSPLRLVDPIVQLQ-- 222 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + + + S AKK ++++TDG+ +D E + +A+ Sbjct: 223 GLTFTATGILAVVNELFHSKNGAR------RSAKKILIVITDGQKYKDPWEYRDVIPQAE 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 GI I + Q+ AR L+ S + F+ ++ L+ I Sbjct: 277 RAGIIRYAIGVG-DAFQEPIARQELNTIGSAPSQDHVFKVDNFAALSSI 324 >gi|296448099|ref|ZP_06890001.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296254413|gb|EFH01538.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 618 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/370 (9%), Positives = 90/370 (24%), Gaps = 18/370 (4%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA----IITASVPLIQSLEEVSSRAK 80 + A ++P+++GV ++V+ + ++ A A + S + +S A Sbjct: 1 MFAFVLPLVIGVCSLVVEFGNALLIKARYQRVADIASFSGALAYSGTSSTTTMTNASVAV 60 Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLL 140 + + + + + + + Q+ S+ + Sbjct: 61 AALNQVPAAAVSANLTVSPASSSNSAVLVAISAAHPLFLTPVLNGPRSLQIAASAYSQIA 120 Query: 141 LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP 200 ++ S L KS + + S S+ S + S P Sbjct: 121 VSASSCLLALDSSKS-GVTLSGGTSVTATSCVVASNSSVTVPCGTSIAAKAVDYYSSSAP 179 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 + VK+Y++ P + + + V Sbjct: 180 SAPCSGISGSIVKAYTTD------------PVAGTSGLTTATARLTTVAAQTAPSAPSVS 227 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + S + + + I+ + Sbjct: 228 TGSNVSFGYTAASMSIGNGCTATTSSAYSGKWTVTCPAGGTYTFGAITTSATLSFATSGT 287 Query: 321 RTIVKTFAIDENEMGSTAINDAM-QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 F+ G+ + + A + S + + ++G Sbjct: 288 AATTYNFSGVVTLSGTNSFGPGIYNFAKGIVTSGGSVTTFAAGSTFNIGQSTTYCSNGYY 347 Query: 380 TQDNEEGIAI 389 + N + I Sbjct: 348 SICNTATLTI 357 >gi|296232325|ref|XP_002761549.1| PREDICTED: hypothetical protein LOC100408376 [Callithrix jacchus] Length = 912 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDSLKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK GI++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|219684622|ref|ZP_03539565.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219685812|ref|ZP_03540621.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] gi|219671984|gb|EED29038.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219672645|gb|EED29675.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] Length = 333 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 39/117 (33%), Gaps = 13/117 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F + + + + +D +A+ + A + Sbjct: 129 RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDDIYIMDL--GNGSALGLGISIALSHL- 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 K++ K+ IV+LTDG D + N A+ ++I +I + Sbjct: 186 ----------KHSEALKRSIVVLTDGVVNSDEIYKDQVINLAQGLNVKIYSIGIGSS 232 >gi|3182939|sp|Q91145|COCA1_NOTVI RecName: Full=Collagen alpha-1(XII) chain gi|632648|gb|AAA80217.1| type XII collagen alpha-1 chain [Notophthalmus viridescens] Length = 929 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 55/160 (34%), Gaps = 20/160 (12%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D +D V++ + ++ ++ HK ++++ A + G+T A++ Sbjct: 663 FDIGSDRVQIAVSQYSGDPRTEWQL--NTHKTKKSLMDAVANLPYKGGNTNTGSALKFIL 720 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + + +A+K +LLTDG++ D + +GI + + Sbjct: 721 E------NNFRPGVGMREKARKIAILLTDGKSQDDIVAPS---KRYADEGIELYAVGI-- 769 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + L AS + L I D Sbjct: 770 ----KNADENELKEIASDPDELYMYNVADFSLLTNIVNDL 805 >gi|332970883|gb|EGK09860.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 448 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 58/181 (32%), Gaps = 18/181 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS----WGVHKLIRTIV 324 K +L + A+ S+ + V+ V + + S + + Sbjct: 158 MKMNLAKAAVERFASSLPENAKVSLWVYGHKGSNSKKDKPVSCKSTEEVYPLGTYQEEKF 217 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G T I +M+ A + + + + +A + +++DG T Sbjct: 218 SQSLDQFRATGWTPIAASMKAAREELQKN---------SGEDATHMLYIVSDGVETCG-G 267 Query: 385 EGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + +A K I+ + I F V+ Q+ + A + S +L F Sbjct: 268 DPVAEAKKLNQSKIKAVVNIIGFDVDDAGQQALQKVAE--AGGGEYETVESEQDLRSYFD 325 Query: 443 D 443 + Sbjct: 326 E 326 >gi|227114885|ref|ZP_03828541.1| hypothetical protein PcarbP_18070 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 539 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 48/475 (10%), Positives = 113/475 (23%), Gaps = 63/475 (13%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I+ G AL +L +VD + +K+A + Sbjct: 22 RERLSAFIQEEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRAT------DAAA 75 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L + + + + S + +Y+ N N + V + + Sbjct: 76 LAVAHQATINGEEYSQEETNKLAYDYVKNNLGMNKALSDKLEVGDVAVTEGRNSETRKTY 135 Query: 128 AYQVVLS----------------SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 V S +++ P + ++VSRS+ Sbjct: 136 TVTVAFETKPSLLNLGARQQEVFSTAEVINRPTEVAFVMPVTGDMSEGDIRSLKSVSRSF 195 Query: 172 HKEHGVSIQWVIDFSR-SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLS 230 + S D S++ Y + + + + + + Sbjct: 196 VERMLSSADGKRDNLWLSLVPYSQSVSVYDAEDANRIRRWSAPGALNPPELRSLFASGVV 255 Query: 231 PYMVSCNKSLYYMLYPGPLDP-SLSEEHFVDSSSLRHV--IKKKHLVRDALASVIR-SIK 286 + E F D + + L ++ S + Sbjct: 256 SSLADRRFPDRRANLLCMYRGLGREENFFWDEPPVGQFKVYYRHDLPQNGSPGAPPISWR 315 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + D T + P+ + + + + + M A Sbjct: 316 GPNPDFDDTDAVDTRWIVADRGCPNAALMPLTNEESKLNQRIDQFSARFNVNYAIGMSWA 375 Query: 347 YDTIISSNEDEVH----------RMKNNLEAKKYIVLL---------------------- 374 + + + N + +K IV+L Sbjct: 376 GAALSPNMRGSDGWGDATLPLDFNLDGNGDGQKVIVMLANTIGNWFDTDSYNFNRNAFSG 435 Query: 375 ---TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +D T + +C+ +++ I+ + ++ R A+P Sbjct: 436 QTGSDPARTFAAQRFRDLCSSFRARNIKFYFVGI-RPGDPEDFGRNLFDAEATPG 489 >gi|303246180|ref|ZP_07332461.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302492576|gb|EFL52447.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 329 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 44/158 (27%), Gaps = 32/158 (20%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R + L + + + TA+ DA+ A + Sbjct: 128 DRIGLVAFGSRAYVVLPPTDDRAALTQALSRLSVGA--AGRRTAMGDAVGLAVKQLD--- 182 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +V+ DG + + A ++GI + T+ + Sbjct: 183 --------RAPGLARLVVVFGDGLSNAGEVRPVEAAKAAAARGIAVFTVGVGGDGPAPFL 234 Query: 415 ARYFL-------SNCA------------SPNSFFEANS 433 + L N A S +FF A Sbjct: 235 VNHPLLGQEIVRENAAVDTAALTELAALSGGAFFRAED 272 >gi|224534421|ref|ZP_03674999.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] gi|224514523|gb|EEF84839.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] Length = 333 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 39/117 (33%), Gaps = 13/117 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F + + + + +D +A+ + A + Sbjct: 129 RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDDIYIMDL--GNGSALGLGISIALSHL- 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 K++ K+ IV+LTDG D + N A+ ++I +I + Sbjct: 186 ----------KHSEALKRSIVVLTDGVVNSDEIYKDQVINLAQGLNVKIYSIGIGSS 232 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 47.7 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 45/422 (10%), Positives = 107/422 (25%), Gaps = 58/422 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S + K L + T + + E+ Sbjct: 412 SASTGADHQVQLIRGLKVYSVNKLSQLDLVKSMNKDTYPDADSHTFNVGDEVPYTFVVRN 471 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQ--TKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 ++ ++ +N ++ S G + T + S + Sbjct: 472 SGTTTLNNVAVNDPNITNVSCGTDTLAANQQTTCSGTLTLTEDMVDSEGHFTNTATASGT 531 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG- 247 + + Q + K ++ + S Sbjct: 532 DDEGNAVNSPQASVTIKAIKPLGAPEKHKRIKKNSDNTYTVNVDVTGAANSSTITTTQSV 591 Query: 248 --PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFF- 302 L +S + S + IKK ++ A+ + + +I + ++ +R+G F Sbjct: 592 DFTLVLDVSSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFA 651 Query: 303 -------NDRVISDPSFSWGVH------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + ++ F + + +K G+T + + A Sbjct: 652 GKESSKVGNETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALRHNGATRADLGFKHASTV 711 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMT 402 + +AK+ ++ TDG T+ ++ + N A K G + + Sbjct: 712 MS----------GARTDAKRVVIFFTDGTPTKVSDFDKDVANSAVTYAKSLKDSGATVYS 761 Query: 403 IAFSVNKTQQEKARY----FLSNCAS----------------PNSFFEANSTHELNKIFR 442 I F++ +S + ++ +L IF Sbjct: 762 IGVFDGANPSSIEEDQKNQFMNAVSSNYPHATAYDKLGTGSNAGYYKVVSNVSDLKSIFE 821 Query: 443 DR 444 Sbjct: 822 KI 823 >gi|327402260|ref|YP_004343098.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327317768|gb|AEA42260.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 473 Score = 47.7 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 64/190 (33%), Gaps = 28/190 (14%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + ++ +D SS +K +L++ +L ++ ++ D MG + Sbjct: 281 DNFDESYFKDVNVIFVIDISSSMKNGEKMNLMKYSLNQLVSKLRPND------DMGMVTY 334 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + G + +K+ G TA ++ Y + Sbjct: 335 ANTADVFQAPTSGS---NKESLKSSITSLKPSGMTAGGKGIKLGYKEV-----------M 380 Query: 363 NNLEAKK--YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 N + K ++++TDG +D+++ K +G+ + +E+ L Sbjct: 381 KNYDPAKANMVIIITDGAFNKDSDDYQKTVQKYAKKGVVFSVLGIET----RERDAKLLQ 436 Query: 421 NCA--SPNSF 428 A + Sbjct: 437 EAAAFGNGRY 446 >gi|13475416|ref|NP_106980.1| hypothetical protein mll6478 [Mesorhizobium loti MAFF303099] gi|14026168|dbj|BAB52766.1| mll6478 [Mesorhizobium loti MAFF303099] Length = 500 Score = 47.7 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 50/181 (27%), Gaps = 27/181 (14%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA-----------AQ- 58 + G I+ ++ +P ++G + +D+ R + + L++A Sbjct: 5 IRAFWHDQRGIALILVSVTLPALIGFSLLAIDMSRVNNLHNDLQKAADAFALAGAAELDG 64 Query: 59 ------TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK--------- 103 A + + + FT + N N+ Sbjct: 65 LPGSWARAERAMATLVTNQAAFSTVGTNGRFTLTSGQPGGTARCNSAGNISWCFLKTIPA 124 Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + T + T +V ++ + P S + S+ + KA Sbjct: 125 SDATPVSSANYANATQSIGEDETVFVEVTVAPTGFAAIFPASFLTGNAASNSFNVAAKAT 184 Query: 164 A 164 A Sbjct: 185 A 185 >gi|116619435|ref|YP_821591.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222597|gb|ABJ81306.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 377 Score = 47.7 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 43/148 (29%), Gaps = 15/148 (10%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + +++ + + + + + + +W K + Sbjct: 117 WKTYQDAILELVWNLLPGDKRYTGYLISYGNTADIAVNTTWDSDK-----IADKVRKMKP 171 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G A+ DA+ A E ++ IV++ DG + + A Sbjct: 172 GGGAALYDAIYLACTR-------RELVKGEPYEPRRVIVVIGDGHDNASKHNLEEVLELA 224 Query: 394 KSQGIRIM---TIAFSVNKTQQEKARYF 418 + + I T+AF + QE Sbjct: 225 QRNLVTIYAVSTMAFGFSNEDQEVLERL 252 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 49/348 (14%), Positives = 96/348 (27%), Gaps = 35/348 (10%) Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 D E R T N + +V++S L + M Sbjct: 410 LDGEERRSQTFTHKVENEDCAVTEVIVSMEGTGNLQKTAKIKSVMNKDKMCTDVVGLVGE 469 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 +S + + + + V+ + + + I Sbjct: 470 PFSIKISSEYKKATLSFKIDQS-KLGDTEFDNLMFLWYDEKKKEFVELDTFYDSENSIVS 528 Query: 227 EKLSPY--MVSCNKSLYYMLYPGPLDP------SLSEEHFVDSSSLRHVIKKKHLVRDAL 278 + + + +K+ ++ + + ++ + + +D Sbjct: 529 IETEHFSKYMVVDKTEWFNNWRKIYNSYAAIFSAIPSYTAICVDCSGSMSTNDKSFKDDN 588 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVIS-DPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + V + V F N +I+ + S S + G T Sbjct: 589 GVLTCYRNIA--VQNYVESMFVFDNASIITFESSASEECEMTNNKRTLSGKASFYNRGGT 646 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 N A+ A D + KK I+LL+DG+ + + I C K+ G Sbjct: 647 NANSAIDIAIDELNHV------------YGKKNIILLSDGDVNVSD-DNIKYC---KNNG 690 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 IRI T+A L A + + A + L KI+ Sbjct: 691 IRIHTVALGSGAN-----SQLLKQYANDTGGTPLTATTAEGLTKIYES 733 >gi|291398581|ref|XP_002715573.1| PREDICTED: chloride channel accessory 4 [Oryctolagus cuniculus] Length = 874 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 55/193 (28%), Gaps = 26/193 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V + +G F+ Sbjct: 261 DKSGSMASSDRLNRMNQAAKYFLLQV-----VENGSWVGMVHFDSTASVKSQLIQIKSNS 315 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T I ++ A+ N + IVLLTDGE+ Sbjct: 316 ERNQLLQSLPTV-AGGGTQICLGIRAAFQVFKQQNSEIDGSE---------IVLLTDGED 365 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH--E 436 + + C N+ G I IA + +SN + ++ +++ Sbjct: 366 S-----TASSCVNEVIESGTIIHFIALGPSADASVIQ---MSNLTGGSHYYASDNAQNNG 417 Query: 437 LNKIFRDRIGNEI 449 L F Sbjct: 418 LIDAFGALTSENA 430 >gi|212693196|ref|ZP_03301324.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|237709938|ref|ZP_04540419.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725395|ref|ZP_04555876.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753590|ref|ZP_06088945.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664301|gb|EEB24873.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|229436082|gb|EEO46159.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456031|gb|EEO61752.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235304|gb|EEZ20828.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 340 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/221 (10%), Positives = 57/221 (25%), Gaps = 44/221 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAEDVTPSRLDKSKKLISRLVDTFN-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++T T I A++ A + + I+++T Sbjct: 150 YVSA-KMFLETINPSLITTQGTDIGTAIRLAMKSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----------------- 418 DGEN + A K G+++ + A Sbjct: 198 DGENHEGGAVEAAQEAAEK--GMQVFVLGVGSPDGSPIPAENGSNNFRRDKDGNVIVTRL 255 Query: 419 -LSNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 C A + ++T+ K+ + I V Sbjct: 256 NEQMCQEIAKAGNGMYIRVDNTNSAEKVLNNEIAKLSKADV 296 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 54/187 (28%), Gaps = 27/187 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++A ++ S+ D R+ F+ + V Sbjct: 59 IKTVKEAAIRLVESLGSGD------RLSVVAFDHKAKVIVP---NQPIDDIKTVNQQIQR 109 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 G T I++ M+ + +D V + I LLTDGEN DNE + + Sbjct: 110 LEPAGGTCIDEGMKLGIKEVALGKDDRVSQ----------IFLLTDGENEHGDNERCLKL 159 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGN 447 A I + T+ F + L + A + + F Sbjct: 160 AQVAAEYNITLNTLGFG-----NHWNQDVLESIADAVGGTLCYIEQPEQAITEFIRLFTR 214 Query: 448 EIFERVI 454 + Sbjct: 215 IQSVGLT 221 >gi|332262934|ref|XP_003280513.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Nomascus leucogenys] Length = 1164 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 62/179 (34%), Gaps = 14/179 (7%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISD---PSFSWGVHKLIRTIVKTFAIDENE 333 + + + + V + +T F+ S+ F++ + + A Sbjct: 163 SSGNFATMMNFVRAVISQFQRPSTQFSLMQFSNRFETHFTFEKFRRSSNPLSLLASVRQL 222 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A+Q + ++ +A K ++++TDG+ D+ + + A Sbjct: 223 GGLTYTATAIQKVVHQLFHAS------YGARRDATKILIVITDGKKEGDSLDYKHVIPMA 276 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + GI I Q + L++ AS F+ L ++++ +I Sbjct: 277 DAAGIIRYAIGVGS-AFQNTNSWKELNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 333 >gi|327351891|gb|EGE80748.1| U-box domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 756 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/221 (10%), Positives = 56/221 (25%), Gaps = 29/221 (13%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + + + L + A ++I ++ D R+G F Sbjct: 81 DISYSMSSSAPLPTTDDSGKPEDTGLSILDLTKHAARTIIETLNDND------RLGVVAF 134 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S + +K + ST + ++ + + + + Sbjct: 135 STDAEVVYKISNMNEDNKKAALKA-VEALWPLSSTNLWHGLKLSLEALEEVTPIPQNV-- 191 Query: 363 NNLEAKKYIVLLTDGENT------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + + +LTDG + +I + K + I T F Sbjct: 192 ------QALYILTDGMPNHMCPRQGYVPKLRSILQQ-KDRLPMIHTFGFGYYI-----RS 239 Query: 417 YFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L ++ + +F I N + Sbjct: 240 GLLQAISEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFATQ 280 >gi|198434614|ref|XP_002123557.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1105 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 49/158 (31%), Gaps = 18/158 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D +R+G FN D G + I + + T +A+ Sbjct: 888 PDDMRVGVARFNRHFDRDSEILIGNYSNISELRQKLRRMPYRGRGTLTGNALWHM----- 942 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N +H N IV++TDG + + N K Q +++ + + Sbjct: 943 --NNHSLHAPGNRPGVPDVIVVITDGLASDEVLR---AANALKEQDVKMYVVGLINRMNR 997 Query: 412 QEKARYFLSNCASPNSFFEANS------THELNKIFRD 443 A+ L + +S + + EL+ Sbjct: 998 MNLAQ--LQDISSGTEYLQIIDNGYERLADELSDTLTQ 1033 >gi|195941904|ref|ZP_03087286.1| hypothetical protein Bbur8_03396 [Borrelia burgdorferi 80a] gi|312149118|gb|ADQ29189.1| von Willebrand factor type A domain protein [Borrelia burgdorferi N40] Length = 333 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 45/143 (31%), Gaps = 20/143 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + S I + +G F + + + Sbjct: 110 SSKNRLEFSKELIRSFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D Sbjct: 163 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIY 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I +I + Sbjct: 210 KDQVINLAQGLNVKIYSIGIGSS 232 >gi|323137420|ref|ZP_08072498.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322397407|gb|EFX99930.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 208 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 20/58 (34%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 I G + A++ LG+ G + + + L+ A +TA + Sbjct: 24 FIADREGTTAVEFAMIAVPFLGLIGAIFETGTIYFRTAQLQMATETASRAVLTHSTAA 81 >gi|304394408|ref|ZP_07376331.1| von Willebrand factor, type A [Ahrensia sp. R2A130] gi|303293848|gb|EFL88225.1| von Willebrand factor, type A [Ahrensia sp. R2A130] Length = 689 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 51/136 (37%), Gaps = 17/136 (12%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G + A G T ++DA++ A +++ E + +V Sbjct: 92 APGPAASTVPEMIARANAIKPKGKTPLSDAVRKAAESL------------RYTENEATVV 139 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 L+TDG T N + A+ + + G+ + F ++K + + +N + F Sbjct: 140 LVTDGIETC-NADPCALATELEESGVDFTTHVVGFGLSKDEGRQVACLAAN--TGGKFIS 196 Query: 431 ANSTHELNKIFRDRIG 446 A+ EL + + Sbjct: 197 ADDADELKAALDETLS 212 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 54/177 (30%), Gaps = 23/177 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI- 320 R + L++ A ++ ++ D R+ FNDR + + S Sbjct: 101 PERRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLYMSADG 154 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R G TA+ A + A + R+ + +IVLLTDGE+T Sbjct: 155 RRKAMKSVDQLEARGGTALVPAFEEAVKVLD-------GRVGDGRNRLGFIVLLTDGEDT 207 Query: 381 QDNEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + + T+ + L A S ++ + Sbjct: 208 SGFTLSERRREVIRGALGRYPVHTLGLG-----RAHDPEVLLYLAQESHGTYSFVDD 259 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 54/177 (30%), Gaps = 23/177 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI- 320 R + L++ A ++ ++ D R+ FNDR + + S Sbjct: 93 PERRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLYMSADG 146 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 R G TA+ A + A + R+ + +IVLLTDGE+T Sbjct: 147 RRKAMKSVDQLEARGGTALVPAFEEAVKVLD-------GRVGDGRNRLGFIVLLTDGEDT 199 Query: 381 QDNEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANS 433 + + T+ + L A S ++ + Sbjct: 200 SGFTLSERRREVIRGALGRYPVHTLGLG-----RAHDPEVLLYLAQESHGTYSFVDD 251 >gi|291229678|ref|XP_002734799.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 1003 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 64/194 (32%), Gaps = 22/194 (11%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D+S + + A+++ I + +D+ + F+ ++ Sbjct: 292 DTSGSMEENGRIDKLHQAVSNYI--LNTLDDGEEVG---VVTFSTTATIQSHLVLINNES 346 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ ++ T+I + A+D + + IV+++DGE Sbjct: 347 RTELLSRVPSMQSVGRWTSIGSGLLKAFDVLEEGERNAAGG---------VIVVISDGEE 397 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 + + I +G+ + +I + + + A+ + E ++++ Sbjct: 398 NR-DPLIADIIPMVLEKGVTVDSIGIGTDASTNLEVLPA----ATDGMTFYYSEDSNSNG 452 Query: 437 LNKIFRDRIGNEIF 450 LN+ E Sbjct: 453 LNEALAASASREKS 466 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 59/187 (31%), Gaps = 24/187 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L + A + + D V + + V R ++T Sbjct: 434 SKLDLAKQAAINALPQFGPHDKVGLWMFSTKRDGEKDHLELAPLDT-VDAAQRKTLRTRL 492 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 G T + D AY + + E ++ LTDG+N +N Sbjct: 493 DGLTPDGGTGLYDTALAAYQHVRDRHSGEAINA---------VIFLTDGKNEDNNSLSLE 543 Query: 385 --EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 A + +R+ TIA+ + L + + + +++ + ++++ Sbjct: 544 NLLPDLRAESA-EESVRMFTIAYGQDADLG-----VLKQISETTNAAAYDSRESGSIDQV 597 Query: 441 FRDRIGN 447 F + N Sbjct: 598 FTAVVSN 604 >gi|218249312|ref|YP_002374701.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223889245|ref|ZP_03623833.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226320920|ref|ZP_03796471.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|226321491|ref|ZP_03797017.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|218164500|gb|ACK74561.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223885278|gb|EEF56380.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226232680|gb|EEH31433.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|226233692|gb|EEH32422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|312147800|gb|ADQ30459.1| von Willebrand factor type A domain protein [Borrelia burgdorferi JD1] Length = 333 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 45/143 (31%), Gaps = 20/143 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + S I + +G F + + + Sbjct: 110 SSKNRLEFSKELIRSFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D Sbjct: 163 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIY 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I +I + Sbjct: 210 KDQVINLAQGLNVKIYSIGIGSS 232 >gi|216264497|ref|ZP_03436489.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221217546|ref|ZP_03589016.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224532807|ref|ZP_03673422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224534090|ref|ZP_03674673.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225548563|ref|ZP_03769611.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] gi|225549807|ref|ZP_03770771.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|215980970|gb|EEC21777.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221192609|gb|EEE18826.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224512196|gb|EEF82582.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224512789|gb|EEF83157.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225369615|gb|EEG99064.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|225370826|gb|EEH00261.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] Length = 333 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 44/143 (30%), Gaps = 20/143 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + I + +G F + + + Sbjct: 110 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D Sbjct: 163 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIY 209 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I +I + Sbjct: 210 KDQVINLAQGLNVKIYSIGIGSS 232 >gi|310657870|ref|YP_003935591.1| hypothetical protein CLOST_0560 [Clostridium sticklandii DSM 519] gi|308824648|emb|CBH20686.1| exported protein of unknown function [Clostridium sticklandii] Length = 873 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 66/238 (27%), Gaps = 38/238 (15%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 Y V + P + + + + + + A + SI + Sbjct: 385 YQVDFEITGTPPEKPVDVILVIDTSGSMGTRIPGDSKAPLYYAKLAAINFANSIIDENPD 444 Query: 292 NDT----VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + G + + + + L + T I + AY Sbjct: 445 SRVGVIEFSGGYYGYASDASTVINLTNNKANLASS-----INGLTTHNMTNIQAGFRLAY 499 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------------NEEGIAICNKAKS 395 + I + + + ++ K +V LTDG N IA + +S Sbjct: 500 NKISA--------ISSTRDSVKSVVFLTDGVANVSIGNWSSSNPVVHNTHTIAAYTEGQS 551 Query: 396 Q----GIRIMTIA-FSVNKTQQ--EKARYFLSNCASPN--SFFEANSTHELNKIFRDR 444 + TI F AR L + ++EA+S +L ++ Sbjct: 552 LYSYINGNLFTIGLFGAISNSSVKSIARDTLQKAVYDDLEKYYEASSAVDLGPVYETI 609 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICRGLRSAFTVI----------KKKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISECFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|47220812|emb|CAG00019.1| unnamed protein product [Tetraodon nigroviridis] Length = 1557 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 55/159 (34%), Gaps = 20/159 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D ++ F+D ++ S +++ + G+T A++ ++I Sbjct: 925 GTDGTQVAIAQFSDDARTEFQLSSH--SNKEALLEAIQKISYKGGNTKTGRAIKHVKESI 982 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S K +V+LTDG +QD+ ++ + + G I I F+ Sbjct: 983 FS------LEAGARRGVPKVLVVLTDGR-SQDDVNKVS--KEMQMDGYIIFAIGFADADY 1033 Query: 411 QQEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIG 446 + L N A S F + + KI I Sbjct: 1034 GE------LVNIASKPSDRHVFFVDDLDAVKKIEEQLIT 1066 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 44/144 (30%), Gaps = 21/144 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ + G+T A+ + + K +L+ Sbjct: 101 NAYSTKDAVIDAVRNLPYKGGNTLTGLALNFILE------NCFKPESGSREGLPKIGILI 154 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ ++ GI + I + L + ASP + Sbjct: 155 TDGK-SQDDVVPP--AESLRNAGIELFAIGV------KNADENELQSIASPPEDTHVYNV 205 Query: 432 NS---THELNKIFRDRIGNEIFER 452 + + + + ++ ++ Sbjct: 206 ADFSVMNSIVEALTRTVCEQVVQQ 229 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 67/191 (35%), Gaps = 31/191 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R A+A+ + S D FNDR F +++ Sbjct: 125 KLAKSRAAVAAFLSSANPED------EFSLVLFNDRAQLVSGF-----NRQTDELQSKLF 173 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TA+ DA+ A D + + +K +++++DG + + Sbjct: 174 YAQSKGRTALLDAIYLAMDQMKHAKHS-----------RKAVLVISDGGDNCSRYSMREV 222 Query: 390 CNKAKSQGIRIMTI------AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 N+ K +I +I F ++ L + A S FE ++ +EL+ + Sbjct: 223 KNRVKEGDAQIYSIGILEAMGFRGRSAEELAGPALLDDIASQSGGRLFEIDNLNELSDV- 281 Query: 442 RDRIGNEIFER 452 +IG + + Sbjct: 282 ASKIGMALRNQ 292 >gi|71983551|ref|NP_509176.2| hypothetical protein C16E9.1 [Caenorhabditis elegans] gi|34366001|gb|AAC47957.2| Hypothetical protein C16E9.1 [Caenorhabditis elegans] Length = 565 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 47/124 (37%), Gaps = 11/124 (8%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ +KT T + + ++ ++K ++L TDG Sbjct: 447 SQNKSQLKTIIDRSPFYSGTTFTNQALKKMAALYEESKRPNAKLK--------LMLFTDG 498 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + +D EG KSQG+ + T+ S +K+ + S +++++ +L Sbjct: 499 YSAEDTSEGE---EALKSQGVVVYTVGISTDKSAGLNMKELRGMATSSEHYYDSSDFADL 555 Query: 438 NKIF 441 K F Sbjct: 556 LKHF 559 >gi|242016552|ref|XP_002428850.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513586|gb|EEB16112.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1945 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 41/126 (32%), Gaps = 13/126 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAM 343 I ++ R+G ++ + + S + + +++ +T +A+ Sbjct: 61 ISDVEVGPKAFRLGVIIYSTQAVDHLSVTDELDLEKCSLLHVQIPKIKYPGKNTNTKEAL 120 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + A + + + A K I L+TDG + N + K Q I T Sbjct: 121 EKAEKILTRARKG----------AVKIIFLITDGFSNMGNPLP--MAQILKDQDTIIYTF 168 Query: 404 AFSVNK 409 Sbjct: 169 GIINGN 174 >gi|310694574|gb|ADP05359.1| collagen type VI alpha 1 protein [Bubalus bubalis] Length = 1027 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 65/191 (34%), Gaps = 25/191 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I ++K D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLKDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY+V++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEELL--------VGGSHLKENKYLVVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK GI++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFE 451 + I I Sbjct: 225 -EEVISQTIET 234 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/290 (12%), Positives = 80/290 (27%), Gaps = 11/290 (3%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 + +S + + ++D S SM + S T + Sbjct: 595 NYEYNVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGSSRLTHLKNNVIKFITKLYEHNP 654 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 + +V + S N ++ + + ++ ++ K Sbjct: 655 DSRVSVITFAYSADGSITNN--NFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQID 712 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + ++ + F D + F+ + + A N+ Sbjct: 713 LGLYEVRNQLSSATGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIK 772 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + + + I SSN + + KN+ K N ++ K G Sbjct: 773 FSGNLTGINNYIESSNGSKYYGHKNDDITKNRS------NNNSNDAGNRTN-RSGKGLGK 825 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 I TI + FL+ AS + +AN++ + F + Sbjct: 826 TIFTIGLNS--NNSSLFDSFLTRLASEGHYTKANNSSAMENAFNSIFTSI 873 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 36/289 (12%), Positives = 79/289 (27%), Gaps = 18/289 (6%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 A ++ + G I V D S SM + + + + + + Sbjct: 73 ATGKAQDSVEAIGNDIVLVFDISNSMAEDEHGNSTSSNDKKRL--TKAKNAAIEFLNNSK 130 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 I K + Y + + Y + + + + L + ++ L Sbjct: 131 ISGNKKNRYSI-VTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKART 189 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID--ENEMGSTAIND 341 +K + N + + + + + + T T + Sbjct: 190 VLKNAKSENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGEN 249 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ N+ + NN N AK G I Sbjct: 250 AINYDSVIKGGRNDFTLDLYLNNSHYS---------LNNAAATLAENEALLAKKSGNTIF 300 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 TI ++ + FL N A+ + +S+ +L+ I+ + + Sbjct: 301 TIGYTTGSSVNS----FLKNVATQGEGYAYSSSSDLSGIYENIANEIVT 345 >gi|78357412|ref|YP_388861.1| hypothetical protein Dde_2369 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219817|gb|ABB39166.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 163 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/146 (10%), Positives = 39/146 (26%), Gaps = 5/146 (3%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 F + +L +G + AL +PV+L + L++ + A + Sbjct: 2 HAFRLVTGRLRHDTSGLSSLELALTLPVLLMMVFGLIEFGYNLFART----TVDKAALIG 57 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD-TAVEMN 123 + + +++ L + ++ + Sbjct: 58 ARYAVTGQGFDDGTRHARIVQEARRLTGVLAGSSPQSVTVTIGSIAAGAGDDALIEGDAG 117 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLR 149 QV + RY + + L Sbjct: 118 LPCDRVQVRVEYRYTPVTPVVGSLLG 143 >gi|327542166|gb|EGF28659.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1014 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 45/138 (32%), Gaps = 24/138 (17%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + AM+ A ++ + K++++++DG+ Sbjct: 505 NRRAMLAAVGRMTPGDMPEFDPAMRMAVTGLV-----------RTDASVKHLIIISDGDP 553 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + ++ K I I T+A + + L + A + ++ S L Sbjct: 554 GPPSN---SVIQAFKDNSITISTVAVESHGLSDSRR---LQDIARATGGKYYAVKSGRAL 607 Query: 438 NKIFRDRIGNEIFERVIR 455 IF+ RV R Sbjct: 608 PGIFQRE-----ARRVTR 620 >gi|309791847|ref|ZP_07686333.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226108|gb|EFO79850.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 542 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 53/152 (34%), Gaps = 15/152 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G F+ + + R ++ + + G TA+ DA++ A ++ + Sbjct: 401 DDRVGMITFSSSATEVVAPAALSE--NRMQLQMAISEMSATGKTAVFDAVELARQSLEA- 457 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K IVLL+DG + + G+ I +A+ + Sbjct: 458 ------LPSTGEDRMKAIVLLSDGADNASRITLADLERNFDETGVSIFPVAYGADAD--- 508 Query: 414 KARYFLSNCAS-PNSFFEANSTHELNKIFRDR 444 R L A + T ++ +IF + Sbjct: 509 --RSILDAIAEFSRTIVVVGDTGDIAQIFENL 538 >gi|115665362|ref|XP_001180845.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] gi|115941435|ref|XP_001179810.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] Length = 763 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 51/162 (31%), Gaps = 23/162 (14%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 V + + FN I D + + + R + G+T I +QTA Sbjct: 198 VPNNSYVAIVEFNYGAIVDSNMTELTSAISR-KDLASLLPTYADGATCIGCGIQTAIQVA 256 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + D Y++LL+DG+ + + + G+ + +IAF Sbjct: 257 QYNGMDSRGV---------YLILLSDGQENSGTLIADTL-DDIEDSGVIVHSIAF-YEAD 305 Query: 411 QQEKARYFL-----SNCASPNSFFEANSTHELNKIFRDRIGN 447 Q + + + CA S + F I Sbjct: 306 TQLEDLAQMTGGISATCADGG------SAQCVISAFESIIAQ 341 >gi|330918891|ref|XP_003298384.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] gi|311328422|gb|EFQ93524.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] Length = 1367 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 38/361 (10%), Positives = 96/361 (26%), Gaps = 24/361 (6%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q ++ + + D E+ + V ++ + +V + Sbjct: 775 YELQHRNAHIPASEDAIESVINPDHEIFINPVCSTVAVSAVYQEWCLVKVYACAYAQPVV 834 Query: 145 SLFLRSMGIKSW-LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 S + K+ I K + + S+ + I + Sbjct: 835 SYWEPKATTKTMQSIVFKYYRQKFATMSSTSLQASLVFWIKMRDRGDRHITGEWSDAHWE 894 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS 263 P + PL + +S Sbjct: 895 ---PISPYFNREFCTGTLSEEFCFTKEQDDEDSKNTYDADKVSQPLVFKVYLGLDSSASK 951 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + + + +++ + I + N +G F+ + + + V Sbjct: 952 QKSHMTRLDVLKQMFDAYINRLLA---YNFHSHVGLVTFSTKALVAQKITNAVENFR--- 1005 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + G TAI D++ A D I E I+ ++DGE+ + Sbjct: 1006 --HKLNNLKASGDTAIWDSIALAQDQIQQYAEQYPGSKLR-------IICISDGEDNKSQ 1056 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + + ++ I + + F ++ ++ + C+ + A T + + Sbjct: 1057 NTAVDLASRLIRDDITVDS--FCLDDHSNKELQTL---CSLSGGYSFAPKTLDEAMAICE 1111 Query: 444 R 444 Sbjct: 1112 M 1112 >gi|311268857|ref|XP_003132237.1| PREDICTED: collagen alpha-1(VII) chain-like [Sus scrofa] Length = 2945 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 46/142 (32%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ S +++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFSLDTLGSGGD--VIRAIRELSYKGGNTRTGAAILHVADQVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + A + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDAAAQRLKEQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRIASQPTSDFFFFVND 200 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 50/470 (10%), Positives = 110/470 (23%), Gaps = 88/470 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + S+ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGSQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 -KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 N + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 ADSGITNNININMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALR 359 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE------------------- 385 A D + + N KK IVLLTDG T Sbjct: 360 DAGDMLATPN-----------GHKKVIVLLTDGVPTFSYTVSRVQTEADGRFYGTQFTNR 408 Query: 386 --------------------------------GIAICNKAKSQGIRIMTIAFSVNKTQQE 413 I K +GI I + + + Sbjct: 409 QDQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRA 468 Query: 414 KAR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 469 NLSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 518 >gi|325688744|gb|EGD30753.1| von Willebrand factor type A [Streptococcus sanguinis SK115] Length = 551 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/342 (11%), Positives = 97/342 (28%), Gaps = 24/342 (7%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 K + + + + +T ++ + + Sbjct: 190 DEIKKAYGDVTVENLAKAVKDGKLLGYTNPYTSSTGLSLLSQLLFTFDKENPISDKAKTE 249 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG-------VSIQWVIDFSRS 188 N S F+ + ++ A+ ++S + V D Sbjct: 250 FQQFQNNIPSTFVTTTQLREAAKNGSADILSISYQTYINTPEFSDYEYVPFGIRQDSPLY 309 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + +S ++ G + D N + M Sbjct: 310 ATTNDATKQEVLKKFSSYVLEGNNQSKATSYGFNKLDDYSFEEQTTDGNLLMS-MQNLWK 368 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + S+ + + + S+ + + +N+ ++G ++D V Sbjct: 369 KNKNNSQPIVGVFVTDVSGSMDGEPMNNLKKSL---LNSLQYINEENQIGLVSYSDDVTI 425 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + ++ ++ + G TA D A I+ +MK N A+ Sbjct: 426 NVPIDT-MNSTQKSYFTSAIKGLTPSGGTATYDGTLVAVKMILD-------KMKENPGAR 477 Query: 369 KYIVLLTDGENTQDNE--EGIAICNKAKSQGIRIMTIAFSVN 408 I +L+DG+ E I K+ GI + TI ++ + Sbjct: 478 PVIFVLSDGQTNGGYEFERVEPI---IKALGITVNTIGYNAD 516 >gi|284989271|ref|YP_003407825.1| hypothetical protein Gobs_0677 [Geodermatophilus obscurus DSM 43160] gi|284062516|gb|ADB73454.1| conserved hypothetical protein [Geodermatophilus obscurus DSM 43160] Length = 146 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 34/124 (27%), Gaps = 2/124 (1%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA--SVPLIQSLEEV 75 G + A+L+P +L + G+ VD A A ++ L +L Sbjct: 13 ERGAIGVFLAVLVPGLLLIVGLAVDGGAKVAATQRANAIADEAARAGGQALDLSAALTGQ 72 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + Q E + T + I T + V + Sbjct: 73 VRVDPAAAVAAVQNYLERSGVQGAVTVVDGDTLQVSTTISEPTTFLGLIGITTLTVEGTG 132 Query: 136 RYDL 139 DL Sbjct: 133 TADL 136 >gi|308476046|ref|XP_003100240.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] gi|308265764|gb|EFP09717.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] Length = 879 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 51/147 (34%), Gaps = 14/147 (9%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR+G ++D ++ + S + I + T A+ A + + + Sbjct: 68 DAVRVGLIQYSDAAKTEFNLS-RYSERNDIITHLETLTFMPGEDTRTGVALDKADEEMFN 126 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + L+A + I+L TDG + + +G++I TI+ + Sbjct: 127 ------YIGGARLKATRLIILFTDGLSMDK---PTKSAKTLRRKGVKIYTISVNSIGFVP 177 Query: 413 EKARYFLSNCASPNSFFEANSTHELNK 439 E L ++ F + + Sbjct: 178 E----MLGIVGDADNVFGPTDEDRIEE 200 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 38/118 (32%), Gaps = 15/118 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R + R I+ G T I A++ A + Sbjct: 134 HRIGLVAFAGRASVLAPLTPDY-GFFRMILDGVDTKSVSRGGTEIGQALRKAVRSFDPGP 192 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K I+L+TDGE+ +A G+R++ I F + Q Sbjct: 193 GA------------KMILLITDGEDHGGYA--EDAAREALEAGVRVVAIGFGSEQGSQ 236 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 62/203 (30%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I ++ A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRLAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|90419742|ref|ZP_01227651.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335783|gb|EAS49531.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 448 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 51/170 (30%), Gaps = 13/170 (7%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 S G+ I ALL P+++ + G +V++ L+ AA +A + A+ L + Sbjct: 8 FAASTQGNVATIVALLAPLLIFIMGSVVNLSTAHSAHSRLQSAADSAALAAARELYMANS 67 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ--- 130 +++F + + +D + T V + K Sbjct: 68 RPEVLKSVAYSFAMTNLGNQAEGVSIDVKIGGRSDATAAESAISTEVTVTLNKDFGTSLP 127 Query: 131 ----------VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + S+ + + G I A+ + Sbjct: 128 MPDLTGAIGALTASATARIAGGGRICMIGLTGEGKRAIDISGNAQVQAAD 177 >gi|296474257|gb|DAA16372.1| complement component 2 precursor [Bos taurus] Length = 750 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 56/204 (27%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +D+ + ++ I + V + F + S + + + + Sbjct: 267 SKDDFEIFKDSASRMVDRIFSFE---IKVSVAIITFASKPKIIMSVLEDRSRDVTEVENS 323 Query: 327 FAI----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 D T I +A+ Y + + E + I+LLTDG++ Sbjct: 324 LRNINYKDHENGTGTNIYEALHAVYIMMNNQMNRPHMNPGAWQEIRHAIILLTDGKSNMG 383 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + K I I I ++ S F Sbjct: 384 GS-PKVAVDNIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKDGERHAFILKD 442 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L+++F V ++T Sbjct: 443 VQALSQVFEHM------LDVSQLT 460 >gi|291299992|ref|YP_003511270.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290569212|gb|ADD42177.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 316 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 67/208 (32%), Gaps = 29/208 (13%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + + ++ A + + +G F S + Sbjct: 96 SLSMKAKDVSPDRFSAMKKASLEFVDEL------PKNYNLGLVTFAKSASVAVSPT---- 145 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R VK+ +TAI + + +A I S D I+LL+DG Sbjct: 146 -KDRNQVKSAIKSMKLDRATAIGEGIFSALQAIQSVPPDGASEPAPAR-----ILLLSDG 199 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 T AK+ + + TIAF + E R LS A + Sbjct: 200 YRTSGRLVEDGA-KAAKAAKVPVSTIAFGTDTGTVEIEGETQEVPVDRETLSQTAETTGG 258 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 F+EA S +L ++ D G+ I R + Sbjct: 259 KFYEAASVDDLKGVYEDM-GSSIGHRTV 285 >gi|224012789|ref|XP_002295047.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969486|gb|EED87827.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 818 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 21/161 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F S + GST A+ +++ +S Sbjct: 673 NTSFSVVSFASDAEILSGLSS-----ADKTINVLDQLIYSGGSTNHGQAINACQESLFTS 727 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA----ICNKAKSQGIRIMTIAFSVNK 409 +++ KK+I+L+TDG + D+ A AKS GI I I ++ Sbjct: 728 DQNINR--------KKFIMLITDGVSATDDLNPEADAIDAAETAKSSGITI--IPIFISP 777 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 A F+S+ ++ + + L +D++ ++ Sbjct: 778 YNDIDAVSFMSSLSNDGEVY-VTNFDSL-DSLKDQLVEQVS 816 >gi|111120280|gb|ABH06325.1| complement component 2 precursor [Bos taurus] Length = 787 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 56/204 (27%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +D+ + ++ I + V + F + S + + + + Sbjct: 267 SKDDFEIFKDSASRMVDRIFSFE---IKVSVAIITFASKPKIIMSVLEDRSRDVTEVENS 323 Query: 327 FAI----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 D T I +A+ Y + + E + I+LLTDG++ Sbjct: 324 LRNINYKDHENGTGTNIYEALHAVYIMMNNQMNRPHMNPGAWQEIRHAIILLTDGKSNMG 383 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + K I I I ++ S F Sbjct: 384 GS-PKVAVDNIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKDGERHAFILKD 442 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L+++F V ++T Sbjct: 443 VQALSQVFEHM------LDVSQLT 460 >gi|77735935|ref|NP_001029664.1| complement C2 precursor [Bos taurus] gi|115311857|sp|Q3SYW2|CO2_BOVIN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|74267667|gb|AAI03358.1| Complement component 2 [Bos taurus] Length = 750 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 56/204 (27%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +D+ + ++ I + V + F + S + + + + Sbjct: 267 SKDDFEIFKDSASRMVDRIFSFE---IKVSVAIITFASKPKIIMSVLEDRSRDVTEVENS 323 Query: 327 FAI----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 D T I +A+ Y + + E + I+LLTDG++ Sbjct: 324 LRNINYKDHENGTGTNIYEALHAVYIMMNNQMNRPHMNPGAWQEIRHAIILLTDGKSNMG 383 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + K I I I ++ S F Sbjct: 384 GS-PKVAVDNIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKDGERHAFILKD 442 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L+++F V ++T Sbjct: 443 VQALSQVFEHM------LDVSQLT 460 >gi|301618735|ref|XP_002938765.1| PREDICTED: hypothetical protein LOC100488728 [Xenopus (Silurana) tropicalis] Length = 672 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 48/136 (35%), Gaps = 14/136 (10%) Query: 275 RDALASVIRSI---KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +D + + I K +M F+ V + SF+ K Sbjct: 76 KDFVLNFTDQISHLKLAKPWKTKTKMAIIQFSSSVRIEQSFNEWTGVEN---FKRIVNSM 132 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T I T Y + ++N + H+ N K +L+TDG + + + + Sbjct: 133 -----TYIGQGTYTYYAIMNATNIFKAHKSAGN---VKVAILMTDGIDHPKSPDARQASD 184 Query: 392 KAKSQGIRIMTIAFSV 407 A++ GI ++I S Sbjct: 185 FARAAGINFISIGLST 200 >gi|300869833|ref|YP_003784704.1| hypothetical protein BP951000_0196 [Brachyspira pilosicoli 95/1000] gi|300687532|gb|ADK30203.1| putative membrane protein containing von Willebrand factor (vWA) type A domain, BatB [Brachyspira pilosicoli 95/1000] Length = 338 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 40/123 (32%), Gaps = 14/123 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I+ D + + F F+ + I+ + T I DA+ Sbjct: 119 IEDFVKNTDNLSVALVGFAGTSFVASPFTQDMETF-SYILNELNTKSVTLQGTRIADALV 177 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA +T + KK I+L+TDGE+ I + K I + T+ Sbjct: 178 TAKNTFNVNIPG-----------KKSIILITDGEDHAGY--FDNILKELKDNDISVYTVG 224 Query: 405 FSV 407 Sbjct: 225 VGS 227 >gi|70734188|ref|YP_257828.1| hypothetical protein PFL_0686 [Pseudomonas fluorescens Pf-5] gi|68348487|gb|AAY96093.1| conserved hypothetical protein [Pseudomonas fluorescens Pf-5] Length = 664 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 45/149 (30%), Gaps = 6/149 (4%) Query: 11 SKKLI--KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K+ G ++ A+ + L + ++VD R + L++ A A + A Sbjct: 4 LKRFYGPARQRGAIGLMAAVTFGLALLLMLLVVDSGRLYMEQRKLQRVADNAALEAVSRG 63 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 +++ A + ++L + T + A Sbjct: 64 GTCQAGLTAAAYAGQNATR----NGFTVATGSSLSTSCGSLTTGANGLRTFTANPAQAVA 119 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 +V+ + + + L + G + Sbjct: 120 IRVIATHTVPISVASGVAALFTPGPINLT 148 >gi|332832625|ref|XP_520182.3| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Pan troglodytes] Length = 3571 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K I L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVIFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|221104611|ref|XP_002170515.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Hydra magnipapillata] Length = 1137 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 67/216 (31%), Gaps = 17/216 (7%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S + + ++ S V + Y + + +D+S+ Sbjct: 680 STNCISEDFKRCNDSSSETCVGKTNTNYIYSFVKECTKQIEIGILLDASTSV----TLSN 735 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + V K+ VR F+D I S S T + + E Sbjct: 736 WKKTIDFVQDFSKQFKMGPTGVRFALIDFSDDAILQISISDPRFWDQETFGEKVSSIEYS 795 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T + A++ A + + + +++LTDG++T Sbjct: 796 QGKTRTDLALEVARKHVFCN------ECGLRHNTPRLLIVLTDGQSTFPK-LTQFEAQLI 848 Query: 394 KS-QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 K+ + I+++ S +E L + A+ Sbjct: 849 KAENNLTIISVGVSDQVDIEE-----LKSLATDRDH 879 >gi|251799254|ref|YP_003013985.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247546880|gb|ACT03899.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 562 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 43/349 (12%), Positives = 85/349 (24%), Gaps = 23/349 (6%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S + + + N + K V + Sbjct: 211 SVTEAIANNELSMGYTDPFASSTGLNFLVTALHTFDSTDLLSDKAVQEFEKFQSNVPFIA 270 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + + G + S G + + + D Sbjct: 271 STTIQMRDAAKSGALDGFVLEYQTYVNATDLKSGYVFTPFGFRHDSPLYALGDLPQEKLD 330 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + Q NG +L+P S + Sbjct: 331 ILKKFAEFVTQDKYMKTAQEKGFNGL-ETYKSELAPVDGDLLTSAQKVWKEKKNGNKPIT 389 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 FV S + + ++++L +K + +G ++D V + Sbjct: 390 AVFVTDVSGSMDGEPLNRLKESL------LKGQKYLGRDNSIGLVSYSDGVTIKLPIA-K 442 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 +++ G TA D + A + + + K I +L+ Sbjct: 443 YDTNQQSLFVGAVNSLQSGGGTATFDGIVVALKLLEDQ-------LATDPNTKPVIFVLS 495 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 DGE Q I ++ I I TI ++ N L N +S Sbjct: 496 DGETNQGYT-LQDIKGLIETYKIPIYTIGYNANI-------QALQNISS 536 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 42/427 (9%), Positives = 108/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + ++ V + +S L L Sbjct: 101 YHSQANKLIVDNKEYDIIDETENKKDTDVSVPKPDEIEEESSKENENSVS---PFTLPTL 157 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 158 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 214 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 215 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 274 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 275 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 333 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 334 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVIVLLTDCVPTFSYKVQRV 393 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 394 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 453 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 454 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 512 Query: 448 EIFERVI 454 +I V Sbjct: 513 QISATVS 519 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 58/181 (32%), Gaps = 15/181 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +D+L + K+ D VR+G + ++ + H + Sbjct: 207 FSNFQDSLRFLASLTKRFTIGPDDVRVGFSVYSSTSTIHSH--FNQHMNNSALEAEILGT 264 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T+ A+ D + + + + + +V++TDG++ + Sbjct: 265 SYTGGGTSTGRAI---NDVLNNGFVERNGARPASEGVPRILVVMTDGQSGDSVKTPS--- 318 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNE 448 + K+ GI + + E AS + + + L + R+ +E Sbjct: 319 DNVKAAGITVFGVGIGSGIDIAEVNE-----IASNPDSRYAYELTGFNLLNVLSQRLTDE 373 Query: 449 I 449 Sbjct: 374 S 374 >gi|229188238|ref|ZP_04315314.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] gi|228595258|gb|EEK53002.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] Length = 425 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 57/187 (30%), Gaps = 24/187 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + K + A+ + + I + + +D S S G ++ Sbjct: 136 SMAGKVNGEVKMEAAKKAIYNYLDKIPD---NANVMLRVYGHKGSNNENDKSLSCGSSEV 192 Query: 320 -------IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T + A+++ D + E + Sbjct: 193 MYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDF---------KEYTGEENLNVVY 243 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +++DGE T + + + I F V ++Q++ +Y A ++ Sbjct: 244 IVSDGEETCG-GDPVNAAKNLNQSNTHAVVNIIGFDVKNSEQQQLKYTAE--AGKGNYAT 300 Query: 431 ANSTHEL 437 +S EL Sbjct: 301 VSSADEL 307 >gi|156383257|ref|XP_001632751.1| predicted protein [Nematostella vectensis] gi|156219811|gb|EDO40688.1| predicted protein [Nematostella vectensis] Length = 161 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 54/151 (35%), Gaps = 16/151 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +I +K ++ +G ++ + F++ + +IVK + G T Sbjct: 27 NFMINIVKSFKISSERTHVGLVLYSS--FTQLKFNFDKYSDSASIVKAINTTDYPKGGTR 84 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A++ A + + + K +++LTDG ++ E K +G+ Sbjct: 85 TGEALKMAKSQLFGA---------SMRSVPKVLIVLTDGRSSDKVEAPS---KALKDEGV 132 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 I + E +++ + + F Sbjct: 133 VIFAVGVGDQIDPSELN--VMASDSKSDHVF 161 >gi|153873859|ref|ZP_02002297.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152069676|gb|EDN67702.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 367 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 50/163 (30%), Gaps = 17/163 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++ + + G + + GST + + AY+ Sbjct: 37 TEKDKVSLVVYAGAAGVVLEPTPGHQSVK---INGALERLTAGGSTHGSAGIHLAYNL-- 91 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---GIAICNKAKSQGIRIMTIAFSVN 408 E + N I+L TDG+ + + + + GI + T+ F Sbjct: 92 --AEQAFIKNGINR-----ILLATDGDFNVGTVDFEALKNLVEEKRKSGISLTTLGFGRG 144 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + A ++ ++ +E K+ D + + + Sbjct: 145 NYNDQLMEQLAD--AGNGNYAYIDTLNEAQKVLVDEMSSTLNT 185 >gi|301166645|emb|CBW26221.1| putative membrane protein [Bacteriovorax marinus SJ] Length = 287 Score = 47.3 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 68/219 (31%), Gaps = 28/219 (12%) Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + K + + G + +D S + K V+ L + Sbjct: 84 IETPPVKKEVVTEAVQTKGILDVDVGFKFKGKNIVFLIDVSGSMKTLDKMGQVKAGLKML 143 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I S+ + V R S S++ + + + V F N G+T Sbjct: 144 ITSM--PSDYQIDVIHFPGKRGARYYSLWSYTQKLGERQKKDVYRFLNRLNPKGATPTRS 201 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----EGIAICNKAKS-- 395 A++ A + +VLL+DG T+ N + I ++ K Sbjct: 202 ALKYALTKYPDLTD---------------VVLLSDGAPTKMNSSEYDDIKDILSEVKKDN 246 Query: 396 -QGIRIMTIAFSV--NKTQQEKARYFLSNCA--SPNSFF 429 + I+I TI + A FL A S F+ Sbjct: 247 FKNIQINTIGVGAAFSLQSTTPASVFLKELAKQSGGFFY 285 >gi|320010752|gb|ADW05602.1| Protein of unknown function DUF3520 [Streptomyces flavogriseus ATCC 33331] Length = 528 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 32/298 (10%), Positives = 74/298 (24%), Gaps = 29/298 (9%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 S E+ + Y + + + + + Sbjct: 67 SREEGDGPARESAAPDYLSTFALDVDTAGYGYARRTLGDGQLPAADTVRPEEFVNSFRQG 126 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE-----HFVDSSSLRHVIK 269 Y G + + + L + VD S Sbjct: 127 YERPKGNGFAVSVDGARSDAAGWSLVRVGLATRAASNTGERPPAALTFVVDISGSMAEPG 186 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + L + +L + ++ D+V+ F++ + + RT ++ Sbjct: 187 RLDLAKTSLGILADELRDDDSVS------LVTFSEEAETRLPMTRLR--GNRTKLRDAIE 238 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + ST + ++ Y+ + + +VLL+D E AI Sbjct: 239 EMEPADSTNVAAGVERGYEEAVEGHRKGATNR---------VVLLSDALANTGETEADAI 289 Query: 390 CNKAKSQ----GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + GI + + + + R + + K+F D Sbjct: 290 LERVGDARQEYGITLFGVGVGSDYGDELMERLTNK---GDGNTTYIADEAQARKVFVD 344 >gi|32477499|ref|NP_870493.1| hypothetical protein RB12409 [Rhodopirellula baltica SH 1] gi|32448053|emb|CAD77570.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 1032 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 45/138 (32%), Gaps = 24/138 (17%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + AM+ A ++ + K++++++DG+ Sbjct: 523 NRRAMLAAVGRMTPGDMPEFDPAMRMAVTGLV-----------RTDASVKHLIIISDGDP 571 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + ++ K I I T+A + + L + A + ++ S L Sbjct: 572 GPPSN---SVIQAFKDNSITISTVAVESHGLSDSRR---LQDIARATGGKYYAVKSGRAL 625 Query: 438 NKIFRDRIGNEIFERVIR 455 IF+ RV R Sbjct: 626 PGIFQRE-----ARRVTR 638 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 42/427 (9%), Positives = 108/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + ++ V + +S L L Sbjct: 123 YHSQANKLIVDNKEYDIIDETENKKDTDVSVPKPDEIEEESSKENENSVS---PFTLPTL 179 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 180 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 236 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 237 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 296 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 297 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 355 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 356 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 415 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 416 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 475 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 476 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 534 Query: 448 EIFERVI 454 +I V Sbjct: 535 QISATVS 541 >gi|51598434|ref|YP_072622.1| hypothetical protein BG0172 [Borrelia garinii PBi] gi|51573005|gb|AAU07030.1| hypothetical protein BG0172 [Borrelia garinii PBi] Length = 333 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 39/117 (33%), Gaps = 13/117 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F + + + + +D +A+ + A + Sbjct: 129 RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDDIYIMDL--GNGSALGLGISIALSHL- 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 K++ K+ IV+LTDG D + N A+ ++I +I + Sbjct: 186 ----------KHSEALKRSIVVLTDGVVNSDEIYKDQVINLAQGLNVKIYSIGIGSS 232 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|330982726|gb|EGH80829.1| hypothetical protein PSYAP_30018 [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 162 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 17/151 (11%), Positives = 41/151 (27%), Gaps = 4/151 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + ++ F Q + + V V + A + + + Sbjct: 72 A----SAPDFANQSATRNGFVPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + + + + A Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATR 158 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 45/422 (10%), Positives = 107/422 (25%), Gaps = 58/422 (13%) Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S + K L + T + + E+ Sbjct: 412 SASTGADHQVQLIRGLKVYSVNKLSQLDLVKSMNKDTYPDADSHTFNVGDEVPYTFVVRN 471 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQ--TKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 ++ ++ +N ++ S G + T + S + Sbjct: 472 SGTTTLNNVAVNDPNITNVSCGTDTLAANQQTTCSGTLTLTEDMVDSEGHFTNTATASGT 531 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG- 247 + + Q + K ++ + S Sbjct: 532 DDEGNAVNSPQASVTIKAIKPLGAPEKHKRIKKNSDNTYTVNVDVTGAANSSTITTTQSV 591 Query: 248 --PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFF- 302 L +S + S + IKK ++ A+ + + +I + ++ +R+G F Sbjct: 592 DFTLVLDVSSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFA 651 Query: 303 -------NDRVISDPSFSWGVH------KLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + ++ F + + +K G+T + + A Sbjct: 652 GKESSKVGNETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALRHNGATRADLGFKHASTV 711 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMT 402 + +AK+ ++ TDG T+ ++ + N A K G + + Sbjct: 712 MS----------GARTDAKRVVIFFTDGTPTKVSDFDKDVANSAVTYAKSLKDSGATVYS 761 Query: 403 IAFSVNKTQQEKAR----YFLSNCAS----------------PNSFFEANSTHELNKIFR 442 I F++ +S + ++ +L IF Sbjct: 762 IGVFDGANPSSIEENQKNQFMNAVSSNYPHATAYDKLGTGSNAGYYKVVSNVSDLKSIFE 821 Query: 443 DR 444 Sbjct: 822 KI 823 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 49/443 (11%), Positives = 122/443 (27%), Gaps = 66/443 (14%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F+ +L G F I++A+L+ L + ++ D + L+ + A Sbjct: 7 FLSVFSRLNHCTNGSFLIVSAVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYA------ 60 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEY-LIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L + E+S +K Y L N++ L + + + + E + Sbjct: 61 -LHNNPNELSVGTIKQREMLIKKRIGYFLDSNYKGTLLTEEQIKLIVNQSTVSITERSFY 119 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + ++ L L L + K + I + + + Sbjct: 120 PQQFHINIELHKNIQLKSLILHMAMNPKKDFNISQRKSSLYKKNVALMVVPFTWTGEWIP 179 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S+ + + + ++ +N + K+ + Sbjct: 180 PSLFTTQFTVSQDLLPSDLKTEHFKKTEYFNKRNQFFKMFLSKIKENNLCIA-------- 231 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P + + S L + RD +I + N F Sbjct: 232 PYHYSAIVYWSEGIFSYKLPFSTTFLYSFRDIYVKQYSTIWDMKPSN----YILDLFAGA 287 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + R +++ + G+ Sbjct: 288 ELHSNRLTPADPCFRRGVIQKKFMLIIAAGNQ---------------------------- 319 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKA-KSQG------IRIMTIAFSVNKTQQEKARYF 418 ++D +N+ + + C K+ G I + ++ S + + R Sbjct: 320 --------ISDRKNSAEYFKMKHGCTLMGKNMGKNPQEEITVYSLGISPD---PDTKRDL 368 Query: 419 LSNCASPNSFFEANSTHELNKIF 441 + P+ ++E S ++ + Sbjct: 369 IQCTRHPDRYYEIQSYKDIAPVI 391 >gi|119776240|ref|YP_928980.1| von Willebrand factor type A (vWA) domain-containing protein [Shewanella amazonensis SB2B] gi|119768740|gb|ABM01311.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Shewanella amazonensis SB2B] Length = 713 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 22/214 (10%), Positives = 55/214 (25%), Gaps = 24/214 (11%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + GP+ +D S + K + + + + Sbjct: 302 TYRDPKVSTKGTVKLTFTPGDDLGPVTQGRDWVFVLDKSGSMN--GKYATLVEGVRQGLG 359 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + D R F++ V + + T + + Sbjct: 360 KLPAQD------RFRIILFDESTQEFSKGFVPVDSNNINQALAWVEGISPGNGTDLYQGL 413 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + A + + +VL+TDG E + Q +R+ T Sbjct: 414 KRALTPLDADRSTG-------------VVLITDGVANVGVTEKRRFLELMQQQDVRLFT- 459 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 F + + ++ S ++ ++ Sbjct: 460 -FIMGNSANTPLLVPMTRL-SNGVATSVSNADDI 491 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 42/427 (9%), Positives = 108/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + ++ V + +S L L Sbjct: 101 YHSQANKLIVDNKEYDIIDETENKKDTDVSVPKPDEIEEESSKENENSVS---PFTLPTL 157 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 158 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 214 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 215 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 274 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 275 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 333 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 334 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 393 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 394 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 453 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 454 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 512 Query: 448 EIFERVI 454 +I V Sbjct: 513 QISATVS 519 >gi|153812898|ref|ZP_01965566.1| hypothetical protein RUMOBE_03305 [Ruminococcus obeum ATCC 29174] gi|149830976|gb|EDM86066.1| hypothetical protein RUMOBE_03305 [Ruminococcus obeum ATCC 29174] Length = 838 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 13/100 (13%), Positives = 33/100 (33%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 G + L++ +L G++VD+ R + + A + A + L+ Sbjct: 4 FCGKNRGAISVFLTLILVPVLIFSGIIVDISRLYAAKTVISGAGDLTMNAALARYDKQLK 63 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + + ++ YL ++F + + Sbjct: 64 DSYGLITMADDPSSPSMKTYLEQSFLESCNASALKDTKST 103 >gi|116626778|ref|YP_828934.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] gi|116229940|gb|ABJ88649.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] Length = 466 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 31/253 (12%), Positives = 66/253 (26%), Gaps = 4/253 (1%) Query: 18 CTGHFFI-ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I + +++PV+ G+ G +D+ R L AA+ A I A+ LI + + Sbjct: 33 REGAISIQLAVIMVPVLFGMMGFALDLGRLYLVRGELNHAAEAAAIAAASHLIGTTGALD 92 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + + + +Y + + D + V Sbjct: 93 AATTAAQQTLT--LNKYNFGSLTPGEGSGNLTSTITDPAYFSTVAGATGNDPNGSQADGT 150 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 + + S L ++ ++ V + + + S Sbjct: 151 TARHVQVSLTADAPLLFWSLLSAGQSRKTPIAAQALAGISVPLCTACGIEPFAIAAKDAS 210 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + FG PAD ++ S + Y + + Sbjct: 211 DLTDF-GFGAPADDVHYTFYYNCTGTAPAFLPNSGQSAAYTIINRYDAGNTTVTDESDQL 269 Query: 257 HFVDSSSLRHVIK 269 + L Sbjct: 270 FRYGAGGLLSSTT 282 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLRTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|330798620|ref|XP_003287349.1| hypothetical protein DICPUDRAFT_94383 [Dictyostelium purpureum] gi|325082616|gb|EGC36092.1| hypothetical protein DICPUDRAFT_94383 [Dictyostelium purpureum] Length = 559 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 57/174 (32%), Gaps = 11/174 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + I + + F + + + A Sbjct: 165 TRIEFAQAIFQTFIDKMVSYELSAACG---LVCFGSSAQLTFGITRNFDSFSNELGEIQA 221 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 MG+T + +A+ A TI+ + ++ + + L+DGE+ ++ + Sbjct: 222 N----MGNTHLWEAIILAAKTIVDFRNNPNIKLAAPEKLLCRVFCLSDGEDNSNSSTMLD 277 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + K + + I + + A A+ S F A+S+ E ++F Sbjct: 278 AYDYLKKNNVVLDCIPIGLEGRSRLSALST----ATGGSCFIADSSQEGVELFE 327 >gi|297199802|ref|ZP_06917199.1| lipoprotein [Streptomyces sviceus ATCC 29083] gi|197710264|gb|EDY54298.1| lipoprotein [Streptomyces sviceus ATCC 29083] Length = 506 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 38/269 (14%), Positives = 73/269 (27%), Gaps = 26/269 (9%) Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 D + +N F Q DR G R K +V + Sbjct: 96 GRRPDPSTIRPEEFVNSFRQDYDRP-DGNGFTVTVDGARTSKDDWSLVRVGLATRSAGEN 154 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 P + +D S + L + +L ++ ++ D+V F+DR Sbjct: 155 ADERPPAALTFVIDISGSMSEPGRLDLAQRSLDTMTERLRDDDSVA------LVTFSDRA 208 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R + ST + ++T Y T + Sbjct: 209 RKVLPMTRLGGHRDR--IHEAIDGLEPTYSTNLGAGVETGYKTAVEGLRKGATNR----- 261 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 +VL++D + AI + + GI + + + R Sbjct: 262 ----VVLISDALANDGETDPDAILERIDTARREHGITLFGVGVGSDYGDALMERLADK-- 315 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 S + K+F +++ I Sbjct: 316 -GDGHTVYV-SGDDAEKVFCEQLPQNIDL 342 >gi|167535479|ref|XP_001749413.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772041|gb|EDQ85698.1| predicted protein [Monosiga brevicollis MX1] Length = 2014 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 38/125 (30%), Gaps = 8/125 (6%) Query: 314 WGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + + G+T ++ + + +N + ++ Sbjct: 398 DLIETTDQYALNDLLDNVEFPAGATHLSWGLDFIDREMF---RLAAGMRSSNNSIPRVLI 454 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 +LTDG + E K +GI I I +++ FE + Sbjct: 455 VLTDGRSNPG-FEPDEYSTALKDKGIEIYAIGVG---DYYSIEVQEMASEPKDRHAFELS 510 Query: 433 STHEL 437 + +L Sbjct: 511 NQDDL 515 >gi|83589209|ref|YP_429218.1| hypothetical protein Moth_0341 [Moorella thermoacetica ATCC 39073] gi|83572123|gb|ABC18675.1| hypothetical protein Moth_0341 [Moorella thermoacetica ATCC 39073] Length = 212 Score = 47.3 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 56/186 (30%), Gaps = 10/186 (5%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEH-ALKQAAQTAIITASVPL 68 K+L+K G I + +PV++ L D + + +++A + A+ A++ + Sbjct: 1 MLKRLLKDQRGPALIWFLIFLPVLMLGMAYLADYTQATTESDIDVQRALEMAVRAAAMQV 60 Query: 69 IQS---------LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 ++ A ++ + + + K D + D Sbjct: 61 TPDSQAAGHPRINIVAANIAFRRELASNLGLDANTLAPLKGSAMKTRPDYTLVVYNGDDT 120 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 AY+ V S + + F + GI S + + I Sbjct: 121 YAAGGALEAYKYVFSGGLTGGMMAAAGFPYTFGITSSDVLPGGGGTLQTSLDMPGVVAVI 180 Query: 180 QWVIDF 185 + Sbjct: 181 STSVTK 186 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/234 (13%), Positives = 69/234 (29%), Gaps = 20/234 (8%) Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + + + S+ G + + S + + +D S Sbjct: 141 NYQNLPSLKWQYFGSEQGVTTLFPSLRATDCGSFDNRCRPWYVQANVPKPKQIVIVIDKS 200 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDN------VNDTVRMGATFFNDRVISDPSFSWGV 316 V +L ++A SV+ ++ D + V +T +D+ + Sbjct: 201 GSMGV-TNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPFQSTVASDQCFATT--FADA 257 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN--NLEAKKYIVLL 374 + V+ F + G T A+Q A+ +K E + I+ + Sbjct: 258 SPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDFNIKKIDPSEIDRVILFM 317 Query: 375 TDGENTQDNEEGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 +DG ++ N+ + + I+T L N A+ Sbjct: 318 SDGIPNDPGSTILSAQIRANEQLNNSVIILTYGLGNADFG------VLRNMATN 365 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/280 (12%), Positives = 82/280 (29%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + N F + S G+ S + S Sbjct: 91 VTLSNVSERKDNMRLGLSLATNPKDNSFLACSPLWSHECGSSYYTTGMCSRVNSNFRFSK 150 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 151 TVAPALQRCQTYMDIVIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 200 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + + + + + I++ T ++ A Sbjct: 201 VGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEFA--------RS 251 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE + D+ + + +++ + +A ++ Sbjct: 252 EAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINP 310 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G+ I Sbjct: 311 ETFLNEIKYIASDPDDKHFFNVTDEAALKD-IVDALGDRI 349 >gi|2826780|emb|CAA73140.1| thrombospondin related adhesive protein [Plasmodium cynomolgi] Length = 545 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 15/165 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + S+ +D S V L+ +I ++ D + + + F + Sbjct: 9 CNESVDLYLLIDGSGSIGYPNWITRVIPMLSGLIGNLSLS---RDAINLYMSLFANHTTE 65 Query: 309 DPSFSWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 G + + + + G+T ++ A+ + N Sbjct: 66 LIRLGSGPSVDKKLALDSSSELRKTYVPYGATNMSSALAEVEMHLKDRV--------NRE 117 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +A + ++LLTDG + + + K + +++ I Sbjct: 118 KAIQLVILLTDGVPN-NKFRVVELSKALKEKNVKLAVIGIGHGIN 161 >gi|299132281|ref|ZP_07025476.1| TadE family protein [Afipia sp. 1NLS2] gi|298592418|gb|EFI52618.1| TadE family protein [Afipia sp. 1NLS2] Length = 181 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 47/166 (28%), Gaps = 9/166 (5%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITAS 65 ++ + G + A++ P+ + +V+V + L+ TA + + Sbjct: 11 LRDVFRRFGMNRRGSAAVQFAMVAPLFFALLFAIVEVAMMFFATQVLETGTQDTARLLLT 70 Query: 66 VPLIQSLEEVSSRA-------KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + T ++ + D T Sbjct: 71 HQAQDQQMTAEQIHDNLCGRVQFLLTCSGIYLDVRAYPAGDAFTVPTLFDGAGNATNNFT 130 Query: 119 AVEMNPRKSAYQVVLS-SRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 ++ VV + ++ LL+ L L ++G LI + Sbjct: 131 YQPPAAGSASIVVVRTFYKWPLLITNLGFSLANIGSDKRLISSTTA 176 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLRTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|283807143|pdb|3K6S|A Chain A, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807145|pdb|3K6S|C Chain C, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807147|pdb|3K6S|E Chain E, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807149|pdb|3K6S|G Chain G, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807156|pdb|3K71|A Chain A, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807158|pdb|3K71|C Chain C, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807160|pdb|3K71|E Chain E, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807162|pdb|3K71|G Chain G, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807164|pdb|3K72|A Chain A, Structure Of Integrin Alphax Beta2 gi|283807166|pdb|3K72|C Chain C, Structure Of Integrin Alphax Beta2 Length = 1095 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 54/158 (34%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 167 TQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 224 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 225 ------YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 278 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 279 WKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 314 >gi|34452173|ref|NP_000878.2| integrin alpha-X precursor [Homo sapiens] gi|146345441|sp|P20702|ITAX_HUMAN RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leu M5; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor Length = 1163 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 54/158 (34%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 244 ------YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 333 >gi|23398603|gb|AAH38237.1| ITGAX protein [Homo sapiens] gi|123983342|gb|ABM83412.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] gi|123998043|gb|ABM86623.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] Length = 1169 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 54/158 (34%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 244 ------YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 333 >gi|60654537|gb|AAX29959.1| integrin alpha X [synthetic construct] Length = 1170 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 54/158 (34%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 244 ------YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 333 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 41/159 (25%), Gaps = 27/159 (16%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + V G T + I + ++ Sbjct: 151 YGQQNQATTDLDLSSDLRAVTDSINRITAFGGTNTEAGFEQGRSVIEGTTSNQNPNSN-- 208 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR----------------IMTIAFSVN 408 K +++LTDG T N A I+ + TI + Sbjct: 209 ----KVVIMLTDGLPTASNGNPYAETTDINHVHIQRAINAGKNIYQNDIADVFTIGLTTG 264 Query: 409 KTQQEKARYFLSNC---ASPNSFFEANSTHELNKIFRDR 444 + N A ++ A S +L+ IF + Sbjct: 265 MN--ATEKALADNILTQAQNKGYYPAPSATDLDAIFEEI 301 >gi|288959068|ref|YP_003449409.1| hypothetical protein AZL_022270 [Azospirillum sp. B510] gi|288911376|dbj|BAI72865.1| hypothetical protein AZL_022270 [Azospirillum sp. B510] Length = 149 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 26/99 (26%) Query: 37 GGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIR 96 G VD +R E L QA A + + S + R+ FP + +L Sbjct: 38 AGCTVDALRSYAVEARLSQAVDAAALAGGRVMFDSQRDGHIRSFFDKAFPNGFLGSHLSP 97 Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 T + K V S Sbjct: 98 LTIAEDAAAGTLTVSAHASVNAIFLRLFGKKEVTVEAQS 136 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/280 (12%), Positives = 82/280 (29%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + N F + S G+ S + S Sbjct: 91 VTLSNVSERKDNMRLGLSLATNPKDNSFLACSPLWSHECGSSYYTTGMCSRVNSNFRFSK 150 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 151 TVAPALQRCQTYMDIVIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 200 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + + + + + I++ T ++ A Sbjct: 201 VGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEFA--------RS 251 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE + D+ + + +++ + +A ++ Sbjct: 252 EAFQKGGRKGAKKVMIVITDGE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINP 310 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G+ I Sbjct: 311 ETFLNEIKYIASDPDDKHFFNVTDEAALKD-IVDALGDRI 349 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 38/125 (30%), Gaps = 17/125 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ + + + R T + A+ + D I + Sbjct: 126 TRIGLVVFSGFAELAVAPTTDRDAITRA-----LDGLTTGRGTTVGSAILKSVDAISEID 180 Query: 355 EDEVHRMKN-----------NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D A + +VLLTDG NT A +G+R+ I Sbjct: 181 PDVAPSDPAPGTVPPPPRAPGTYAPEIVVLLTDGANTTGVT-PEDAAKTAAERGVRVYPI 239 Query: 404 AFSVN 408 F + Sbjct: 240 GFGTD 244 >gi|269926137|ref|YP_003322760.1| hypothetical protein Tter_1022 [Thermobaculum terrenum ATCC BAA-798] gi|269789797|gb|ACZ41938.1| hypothetical protein Tter_1022 [Thermobaculum terrenum ATCC BAA-798] Length = 490 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/292 (12%), Positives = 73/292 (25%), Gaps = 17/292 (5%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS---VPLIQSLEE 74 +G ++ ALL V++G M +D+ ++ A A + A+ +Q + Sbjct: 21 RSGQSLVLFALLSVVLIGFIAMGIDLGMAYSQRRFMQNAVDAAALAATNQLADNLQGTSD 80 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S + + + Y+ N + +E + S Sbjct: 81 GSWTFLVNDQNIRDTVRRYIDANSGITPPGSSYGAAGGCSSGPFCIEYLNVNKSLLAR-S 139 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + + + +R +++ S I V S Y Sbjct: 140 PTSNGQVPSGTAIVRVNIKRTYSTFLATVLGRNKMSVGATAAAQIFPV-----SKPKYPP 194 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + + A + + V S V K L+Y+ Sbjct: 195 TGVWPMVRKYTGNALTEFPTNGACPPPVVFWSPNDSGSTVGDFKGLFYVGKYSAY----- 249 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 ++ V + + D DT A V Sbjct: 250 -VQELNYDPSNKVAGTIPNHVQMITEFLSDND--DPYGDTEGPPANLNGSDV 298 >gi|197105074|ref|YP_002130451.1| hypothetical protein PHZ_c1611 [Phenylobacterium zucineum HLK1] gi|196478494|gb|ACG78022.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 164 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 9/69 (13%), Positives = 27/69 (39%), Gaps = 4/69 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I G I AL+ P+++ + +++ + + A+ T + + Sbjct: 4 RRFIDDVRGGAAIEFALISPLLIFLIFATIEMAVMAM----MSAGLDNAVATTARMIRTG 59 Query: 72 LEEVSSRAK 80 ++ + A Sbjct: 60 QDDGPASAA 68 >gi|170041024|ref|XP_001848278.1| sushi [Culex quinquefasciatus] gi|167864620|gb|EDS28003.1| sushi [Culex quinquefasciatus] Length = 2239 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 43/147 (29%), Gaps = 22/147 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + + + R+ F+ + + + G T Sbjct: 163 LSDFNVSYNYTRVAVITFSSQKKIFRHIDQISQSVEDNDKCLLLNYQVPRIAFSGGGTYT 222 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A++ A + ++ L++KK I L+TDG + + +A K + Sbjct: 223 YGALKEAEEIFKNA----------RLDSKKIIFLITDGFSNGRDPIPLAG-RLKKDNNVV 271 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN 426 I +I + L AS Sbjct: 272 IYSIGIQSGNYAE------LHAIASAP 292 >gi|303248311|ref|ZP_07334573.1| TadE family protein [Desulfovibrio fructosovorans JJ] gi|302490336|gb|EFL50248.1| TadE family protein [Desulfovibrio fructosovorans JJ] Length = 165 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 62/171 (36%), Gaps = 7/171 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 T+ + + + +G + AL++PV++ + +++V ++ ++A Sbjct: 1 MRTQPLRNQRVSRRRESGATAVEFALVLPVLVFMLLGIIEVANIL----RIQFTLESAAT 56 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE- 121 T + + Q+ + A + K + Y + + +D + T Sbjct: 57 TVAHDISQNPNITNQSA--AQNLFDGKQDSYAPLVQQGRDTSDPSDPPALAMSPTTRPTC 114 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 + + + + ++ Y + P+ F + + + ++ + + + Sbjct: 115 NSSSCTPFLITITYTYKAMTAPMQPFFDGLTLSASARKSPEPNTSTALAAQ 165 >gi|110636839|ref|YP_677046.1| outer membrane protein [Cytophaga hutchinsonii ATCC 33406] gi|110279520|gb|ABG57706.1| possible outer membrane protein [Cytophaga hutchinsonii ATCC 33406] Length = 1313 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 43/129 (33%), Gaps = 20/129 (15%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T N + + +K ++ LTDG Sbjct: 146 DRFKLLQSIQTLVPAGGTDYNKGFI--------KSNAGGLDILKKGLHEKVLIFLTDG-- 195 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFE-ANSTHE 436 + I +AKS G ++ I ++ ++ L A+ S++E S E Sbjct: 196 -YGDVNPTEIIQQAKSIGAKVYVITLGMSAPEE------LKRIVTATNGSYYENVISEQE 248 Query: 437 LNKIFRDRI 445 +N ++ + Sbjct: 249 INAVYMSIL 257 >gi|487830|gb|AAA59180.1| eukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1163 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 54/158 (34%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHFTFEEFRRTSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 244 ------YGARRDATKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 333 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|123444851|ref|XP_001311192.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121892992|gb|EAX98262.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 698 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 46/355 (12%), Positives = 96/355 (27%), Gaps = 32/355 (9%) Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF--LRSMGIKSWLIQTKAEA 164 ++ + + + P ++ L + LN F T + Sbjct: 98 LGEDIGNGLSSFQLGNLPAGKTAEIHLKVSFLADLNENGYFYKFPLTHKYQKGSVTNDYS 157 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + + + D S+ + ++ A + Sbjct: 158 DKPETFHFATTIKTQKEIQDVKVSVQGNKNLNDPHNATFVTNEAPTK---DAIIIEAQIK 214 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 ++K ++ + S SE +FV S K+ + I+S Sbjct: 215 DEDKNVAVSSDGYIAVTTYPFFEGSIESNSEFYFVVDCSGSMAGKRIENAVKCMRLFIQS 274 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R F + + + + +K + +MG T I Sbjct: 275 LPVG------CRFAILKFGSQFQTVLEPCDYTDENVARAMKLLDTIKADMGGTDI----- 323 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ--GIRIMT 402 + + K K I LTDGE + IC A+ G RI + Sbjct: 324 -----LSPLQHVSELKAKEG--FVKQIFFLTDGEVHNPDM----ICATAQKNRSGNRIFS 372 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 I + S ++ +N+ + + + I + IT Sbjct: 373 IGLGSGAD-PGLIKGMARK--SGGNYSIIGDDDNMNEKVIEMLSSAISPALKDIT 424 >gi|322433071|ref|YP_004210320.1| hypothetical protein AciX9_4225 [Acidobacterium sp. MP5ACTX9] gi|321165298|gb|ADW71002.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 457 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 56/154 (36%), Gaps = 2/154 (1%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L+ + +G + A+ + V+LG G+ +DV + L+ A TA A++ + Sbjct: 7 LVSNESGQVLVAAAVSLLVILGFLGISIDVGHRQLSKLRLQSATDTAATAAALEIRVCGS 66 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 VS A S Y N + D + A+ + + Sbjct: 67 LVSCPAMQSAVQSSFIENGYPATPLLLNCATSSDDLTLVLNSPPCAMGSADPNYGKRGYV 126 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + + + F++ +GI + I ++EA Sbjct: 127 EVQVAQHVP--TYFMKMLGISQFNISARSEAARN 158 >gi|294997271|ref|NP_001171103.1| integrin alpha-D isoform 2 [Mus musculus] Length = 1169 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 62/189 (32%), Gaps = 17/189 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++D + +++ + +++ + + +F+ L + Sbjct: 171 SDFTQMKDFVKALMGQLASTSTS-----FSLMQYSNILKTHFTFTEFKSSLSPQSLVDAI 225 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T +Q + S AKK ++++TDG+ +D E Sbjct: 226 VQLQ--GLTYTASGIQKVVKELFHSKNGAR------KSAKKILIVITDGQKFRDPLEYRH 277 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRI 445 + +A+ GI I + ++ A L+ S + F+ + L I R Sbjct: 278 VIPEAEKAGIIRYAIGVG-DAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQ 336 Query: 446 GNEIFERVI 454 V+ Sbjct: 337 EKIFAIEVL 345 >gi|151555227|gb|AAI48416.1| Integrin, alpha D [synthetic construct] Length = 1164 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 62/189 (32%), Gaps = 17/189 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++D + +++ + +++ + + +F+ L + Sbjct: 166 SDFTQMKDFVKALMGQLASTSTS-----FSLMQYSNILKTHFTFTEFKSSLSPQSLVDAI 220 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T +Q + S AKK ++++TDG+ +D E Sbjct: 221 VQLQ--GLTYTASGIQKVVKELFHSKNGAR------KSAKKILIVITDGQKFRDPLEYRH 272 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRI 445 + +A+ GI I + ++ A L+ S + F+ + L I R Sbjct: 273 VIPEAEKAGIIRYAIGVG-DAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQ 331 Query: 446 GNEIFERVI 454 V+ Sbjct: 332 EKIFAIEVL 340 >gi|21675085|ref|NP_663150.1| hypothetical protein CT2279 [Chlorobium tepidum TLS] gi|21648325|gb|AAM73492.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 335 Score = 46.9 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 21/146 (14%), Positives = 42/146 (28%), Gaps = 15/146 (10%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S+S+ + A +++ I R F + + Sbjct: 98 SNSMLARDTAPDRLTHAKTELLQ----ISRRLGDGRKALLLFAGTPVVQCPLTDDEEDF- 152 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ A + T A A S E + +VL +DGE+ Sbjct: 153 EILLDMAAPELITTQGTDYRRAFDAALKLTNSGGELS--------SNETVLVLASDGEDH 204 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFS 406 ++ I K++G+ + I Sbjct: 205 GND--LGDIATAMKTRGVHLHVIGVG 228 >gi|160837835|ref|NP_001104272.1| integrin, alpha D [Canis lupus familiaris] Length = 1168 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 62/178 (34%), Gaps = 18/178 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D + +V+ + ++ + +F+ + + Sbjct: 178 KDFVRAVMGQFEGT-----NTLFSLIQYSHLLKIHFTFTQFQSSWNPLSLVDPIVQL--D 230 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + S AKK ++++TDG+ +D E + +A+ Sbjct: 231 GLTYTATGIRKVVEELFHSKNGAR------KSAKKILIVITDGQKYKDPLEYSDVIPQAE 284 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 GI I + + A+ L N S + F ++ L+ ++++ +I Sbjct: 285 RAGIIRYAIGVG-DAFWKPSAKQELDNIGSEPAQDHVFRVDNFAALSS-IQEQLQEKI 340 >gi|301780200|ref|XP_002925521.1| PREDICTED: collagen alpha-1(XII) chain-like [Ailuropoda melanoleuca] Length = 3172 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEDELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|126334857|ref|XP_001374633.1| PREDICTED: similar to leukocyte immune-type receptor TS32.15 L1.1a [Monodelphis domestica] Length = 3609 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 35/126 (27%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + E G T Sbjct: 137 LSDFPVVPSATRVAIVTFSSKNNVVPRVDYISSSRAHQHKCSLLNREIPNITYRGGGTYT 196 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 197 KGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 244 Query: 400 IMTIAF 405 I T Sbjct: 245 IFTFGI 250 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 50/151 (33%), Gaps = 23/151 (15%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRT-------IVKTFAIDENEMGSTAINDAMQTAYDTII 351 ++ + + G + K E G+T A + A + + Sbjct: 458 VVVYSKKATTSTVVDLGFSDYDSDGHTKKQEMTKILKKLAYEGGTTYTGYAFKLANEMLT 517 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-EGIAICNKAKSQGIRIMTIAFSVNKT 410 K+ +AKK I+LLTDG T N + + +++ + I+ + Sbjct: 518 --------GNKSRPDAKKMIILLTDGATTAANTLQLKEELDVSRAANVMILAVGVGKF-- 567 Query: 411 QQEKARYFLSNCASPN-SFFEANSTHELNKI 440 + L A +FF EL K+ Sbjct: 568 ----NQTELIQIAGDRKNFFAVTKFSELEKV 594 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 45/138 (32%), Gaps = 10/138 (7%) Query: 295 VRMGATFFNDRVISDPSF---SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R+G +N + ++ ++ + + I + A + + Sbjct: 78 TRVGFVTYNSQATTNADLNKFKSYGDLQQGVYNSYNDMNLSPEKTPYIGTGLIAAGELLQ 137 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + H K I+ N + +++ N KS GI I+TIA + Sbjct: 138 IQGSADGHV-----NHPKVIIAYATALNGTGLLDPLSVANTLKSAGITIITIAVDTDDNG 192 Query: 412 QEKARYFLSNCASPNSFF 429 L+ ASP + F Sbjct: 193 --VIEKQLAPLASPGAAF 208 >gi|167526046|ref|XP_001747357.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774192|gb|EDQ87824.1| predicted protein [Monosiga brevicollis MX1] Length = 889 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 50/151 (33%), Gaps = 14/151 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++MGA F+DRV F+ V+ T ++ A + Sbjct: 590 SGVIQMGAFSFDDRVQPISRFTS---------VEVLVDTLQSAKWTGGASSLAEA---LS 637 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + + ++ +V+L DG + + + G+ I IA + Sbjct: 638 FTGARYFQDLSLPADHRRILVILLDGAADDTVAQVARQATQLRQMGVTIFAIAVQDSNDV 697 Query: 412 QEKARYFLSNCASPNSFF--EANSTHELNKI 440 Q++ ++ +S + + L Sbjct: 698 QQQEAQLMAAVSSTVEYHLMRVPNMEALMDA 728 >gi|125602048|gb|EAZ41373.1| hypothetical protein OsJ_25890 [Oryza sativa Japonica Group] Length = 757 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 49/176 (27%), Gaps = 32/176 (18%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ F+ + + K+ + G T I + + A Sbjct: 354 DRLAVVSFSYSARRVIRLTRMSEGGKASA-KSAVESLHADGCTNILEGLVEAAKVFD--- 409 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--------------SQGIRI 400 + A ++LL+DG++ + G + +K + + + Sbjct: 410 ------GRRYRNAVASVILLSDGQDNYNVNGGWG-ASNSKNYSVLVPPSFKRSGDRRLPV 462 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 T F + + A + +F + + F IG + V Sbjct: 463 HTFGFGTDHD-----ASAMHTIAEETGGTFSFIENQAVVQDAFAQCIGGLLSVPVQ 513 >gi|327261831|ref|XP_003215731.1| PREDICTED: collagen alpha-1(XII) chain-like [Anolis carolinensis] Length = 3118 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 62/186 (33%), Gaps = 20/186 (10%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + L ++ + D D R+G ++ ++ + + R +++ + Sbjct: 157 FKYILDFIVALVSAFDIGEDKTRVGIVQYSSDTRTEFNL--NQYYRQRDLIEAIKNIPYK 214 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T +A++ + + + K +++TDG++ + E + Sbjct: 215 GGNTMTGEAIEYLMRNTFVES------AGSRKDFPKVAIIITDGKSQDEVEIP---AREL 265 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEIF 450 ++ G+ + ++ + L AS F + + I + + Sbjct: 266 RAAGVEVFSLGI------KAADAKELKLIASQPSLTHVFNVANFDGIVDIQNEIVSQVCS 319 Query: 451 ERVIRI 456 ++ Sbjct: 320 GVDEQL 325 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 50/160 (31%), Gaps = 20/160 (12%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D V++G ++ ++ HK ++++ A + G+T A+ Sbjct: 1229 FDIGPDKVQIGLAQYSGDPRTEWQL--NSHKTKQSLMDAVANLPYKGGNTLTGMALNFIL 1286 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + A+K VL+TDG++ D + K G+ + + Sbjct: 1287 R------NNFKPEAGMRPGARKIGVLITDGKSQDDIVAPS---QRLKDLGVELYAVGI-- 1335 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + L AS + L I D Sbjct: 1336 ----KNADENELKQIASDPDETHAYNVGDFTLLVNIVDDL 1371 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 30/237 (12%), Positives = 74/237 (31%), Gaps = 27/237 (11%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 K + + + C + + +D S + D + Sbjct: 2 TKENLWSVATTATLFFCLCFSSFKAETPSCHGAYDLYFVLDRSGSVS--TDWSEIYDFVK 59 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ V+ +R+ F+ R + ++ + + + N G T + Sbjct: 60 NLTERF-----VSPNLRVSFIVFSSRAEIVLPLTGDRSEINKGLK--TLSEVNPAGETYM 112 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGI 398 ++ ++ A + + + IV LTDG+ T ++ I + A+ G Sbjct: 113 HEGIKLATEQMKKEPKKS----------SSIIVALTDGKLETYIHQLTIDEADSARKYGA 162 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 R+ + ++ L++ A S F + K + I + ++ Sbjct: 163 RVYCVGV------KDFDEEQLADVADSKEQVFPVKGGFQALKGIVNSILKQSCTEIL 213 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 30/237 (12%), Positives = 74/237 (31%), Gaps = 27/237 (11%) Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 K + + + C + + +D S + D + Sbjct: 2 TKENLWSVATTATLFFCLCFSSFKAETPSCHGAYDLYFVLDRSGSVS--TDWSEIYDFVK 59 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 ++ V+ +R+ F+ R + ++ + + + N G T + Sbjct: 60 NLTERF-----VSPNLRVSFIVFSSRAEIVLPLTGDRSEINKGLK--TLSEVNPAGETYM 112 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGI 398 ++ ++ A + + + IV LTDG+ T ++ I + A+ G Sbjct: 113 HEGIKLATEQMKKEPKKS----------SSIIVALTDGKLETYIHQLTIDEADSARKYGA 162 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 R+ + ++ L++ A S F + K + I + ++ Sbjct: 163 RVYCVGV------KDFDEEQLADVADSKEQVFPVKGGFQALKGIVNSILKQSCTEIL 213 >gi|323444971|gb|EGB01833.1| hypothetical protein AURANDRAFT_69450 [Aureococcus anophagefferens] Length = 376 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 57/206 (27%), Gaps = 30/206 (14%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 SE + RH + + A V S+ DT R+G ++ + + Sbjct: 5 SEVQDAAGNVQRHGFSTLDVCKHAARCVACSLD------DTCRLGLVVYDAQARVVVGLA 58 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + V ST + ++ D ++ D + ++L Sbjct: 59 RVTPAHV-AKVHAALEKLAPGTSTNLWGGLELGVDELVGGAGDNA----------RAVLL 107 Query: 374 LTDGENTQDNEEGIAICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 LTDG EG +AK S+ + + F L + A Sbjct: 108 LTDGVPNNSPPEGEVAALRAKRLTKDGSETVAVFAAGFGY-----ALKSDLLLSLAREGG 162 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 F + F + + Sbjct: 163 GLFSFVPDAGMVGTSFNHLVASLRSS 188 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 55/189 (29%), Gaps = 29/189 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + + +G F+ Sbjct: 312 DKSGSMSSDNRLNRLNQAGKLFL-----LQTIEQGSWVGMVTFDSAAQVRSELRQIKSGT 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I ++ A+ I K IVLLTDGE+ Sbjct: 367 DREALTKSLPTV-PSGGTSICSGLRVAFSVI----------KKKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I +C ++ + G I T+A + + + + + A+ + Sbjct: 416 N-----TIKVCFDEVRQSGAIIHTVALGPSAALELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDR 444 L F Sbjct: 467 GLIDAFGAL 475 >gi|123491786|ref|XP_001325914.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908821|gb|EAY13691.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 753 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/312 (12%), Positives = 90/312 (28%), Gaps = 34/312 (10%) Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + S+ + ++ F+ + ++ + + + S + + ++ I Sbjct: 115 VGSKKTVEVHLKVSFMAQINENGYIYKFPLINKNLEISLLDDIPDNFEFSIAIKTKKEIQ 174 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI------RDEKLSPYMVSCNKSLYYMLYP 246 + + P + T + + N + I +K S+ + Sbjct: 175 EIIVSAKGTMNVIDPHNVTFVTKTYPNDRSIIIEARIKDKDKSIAISSDGYISISTYTFF 234 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + +E +F+ S + + L ++ S+ R F + Sbjct: 235 EGKVQANTEFYFIIDCSGSMYGSRIKNAKSCLNVLLHSL------PIGCRFSIIKFGTKF 288 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + + I+ M D I H K + Sbjct: 289 EVALEPCDYTDENMSKAMHQLDL---------IDADMC-GNDMISPLKYISEHPQK--KD 336 Query: 367 AKKYIVLLTDGENTQDNEEGIAICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K + LLTDGE+ + I+IC ++ R+ TI + + N Sbjct: 337 YIKQVFLLTDGEDDR-----ISICAMVQANRDNFRVFTIGIGSDAD-RNLIIDVARN--G 388 Query: 425 PNSFFEANSTHE 436 + + E Sbjct: 389 SGRYIFIDDEDE 400 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 60/466 (12%), Positives = 126/466 (27%), Gaps = 88/466 (18%) Query: 52 ALKQAAQTAIITASVP--LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 L A + + + + P L+ E + + +E + KN + Sbjct: 120 QLSLAVEQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTVPENTKETNKNDSAP 179 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW-LIQTKAEAETVS 168 E + T NP A + R +L L ++ + ++ + Sbjct: 180 EKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNV 239 Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 ++ QW S + R + G AD ++ Y+ + G+ D Sbjct: 240 LNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVY 299 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 L+ L VD S + + V+ + + ++ Sbjct: 300 LN-----------VRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLAD- 347 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 + + + MG ++ ++ + G ++ +K + G T A++ A D Sbjct: 348 SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRDAGD 406 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN--------------- 391 + + N KK IVLLTDG T + Sbjct: 407 MLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQP 455 Query: 392 ----------------------------------KAKSQGIRIMTIAFSVNKTQQEKAR- 416 K +GI I + + + Sbjct: 456 GSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSK 515 Query: 417 ----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 516 QQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 561 >gi|194226345|ref|XP_001488401.2| PREDICTED: similar to Collagen alpha-1(VI) chain [Equus caballus] Length = 1027 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 65/189 (34%), Gaps = 25/189 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKAFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEELL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK GI++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEI 449 + I I Sbjct: 225 -EEIISQTI 232 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 62/200 (31%), Gaps = 30/200 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + I I+ + T F+ + Sbjct: 312 DKSGSMAGGDRLNRLNQASQLFLLQI--IEKGSWTG---MVTFDSSATIQSALIQIETDA 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + + G T+I ++TA+ I + IVLLTDGE+ Sbjct: 367 QRNSLISRLPT-AAGGGTSICSGLRTAFTVIKNKF----------STDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 + I+ C ++ K G I T+A + + + + A + Sbjct: 416 S-----TISSCFDEVKQSGAIIHTVALGPSADPGLEELAKM----TGGMKTSATDNAQNN 466 Query: 437 -LNKIFRDRIGNEIFERVIR 455 L F + +E R Sbjct: 467 GLIDAF-SALSSENGAITQR 485 >gi|386831|gb|AAA51620.1|AAA51620 integrin alpha subunit precursor [Homo sapiens] Length = 1163 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 48/149 (32%), Gaps = 12/149 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHLTFEEFRRTSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D + + A + GI I + + Sbjct: 244 ------YGARRDATKILIVITDGKKEGDTLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKI 440 + L++ AS F+ L I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFDALKDI 325 >gi|212636466|ref|YP_002312991.1| type IV pilin biogenesis protein [Shewanella piezotolerans WP3] gi|212557950|gb|ACJ30404.1| Type IV pilin biogenesis protein, putative [Shewanella piezotolerans WP3] Length = 1175 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 58/489 (11%), Positives = 116/489 (23%), Gaps = 79/489 (16%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G F II A ++ ++ G + L + +S + + Sbjct: 4 GQFLIIVATVLIMISGAS---------YSDDTELY------VFESSARSGSRPQVLIIFD 48 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + ++ E + R N + V + S S Y Sbjct: 49 NSGSMRTQEYSENFYSRGETVNDGTKLYYAKGGGSVPEPTSNNYFYNSVNGCKSSKEYLA 108 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + F+R G +T+ + + D + D+ Sbjct: 109 DYGVFTGFVREYGFVGENGTWLELPDTLGSAVSQVDCFEDFETNDDIKFENATGVDNGLP 168 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 + + A SS I + + ++ LY Sbjct: 169 VDSLGSKAAPIPYTHISSSASSSDIENAIDAAFLTKFGTGKSVTLYTERYVNWYHGYKSS 228 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + ++ K+ + I N N + R + + Sbjct: 229 SWQTRMNIAKRVMEDTVVTTPSVDFGLAIFNYNTSKRTDGGRIVSGINRLSTG------- 281 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTII----------------------SSNEDE 357 + + D N +T + + + AY S+ Sbjct: 282 DKISLVDKINDLNAETNTPLCETLYEAYRYFSGSAVHFAKEAGSMTPTRDKSIESNGNYV 341 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGI--------------------AICNKAKSQG 397 + + Y+V +TDG TQD+ A+ + Q Sbjct: 342 SPFKSSQCSNRSYVVYITDGSPTQDSAANTKVSKMSGYKSSERVDGSYLPALASIMHRQD 401 Query: 398 I----------RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 + TI F A L A S ++F A + +L + Sbjct: 402 VNPSLAGEQFVTTYTIGF---SDGAADAEPILKKTAELSGGTYFPAKNATQLQSALQQVF 458 Query: 446 GNEIFERVI 454 + Sbjct: 459 SQILEVNAS 467 >gi|268530440|ref|XP_002630346.1| C. briggsae CBR-CLEC-62 protein [Caenorhabditis briggsae] Length = 386 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 52/168 (30%), Gaps = 8/168 (4%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDEN 332 + R + TVR+G +N++ + + V + Sbjct: 54 IVTIFGGGTRIGTIAYSDKRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLS 113 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 + + A + + + + ++ +D + E+ + Sbjct: 114 ASDEVYLAKGLDAAESVLSAG----RKNATRSNYKQLVLIYASDYRDD-GEEDPRPTAER 168 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 KS G+ I T+AF ++T E + ASP F + +I Sbjct: 169 MKSSGVSIATVAF--DQTGNEGVVKAIGEIASPGFNFTNEDADLVREI 214 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/372 (10%), Positives = 92/372 (24%), Gaps = 28/372 (7%) Query: 56 AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A S+ + ++T +E N +D+ Sbjct: 1886 TLDAAGNAVYHSNPNSVSPPGATDTFTYTIRDSDGDESTTTITVNVADSKLVASTDQDVT 1945 Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM------GIKSWLIQTKAEAETVSR 169 + + N ++ G + A Sbjct: 1946 VYEKALDLTQDGQDLAPGTVTGSDPGNTGETATGTLVGSVSGGSGAITYTLVGSATGTYG 2005 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 + + + + D F A + + ++ V I D+ Sbjct: 2006 QIQLNADGTYTYTLTSAPKTTPNANDGPNTLSESFTYKATDALGNSTTSTIVVNIVDD-- 2063 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 P V+ +S+ + + L + +S + + L + A+++++ + Sbjct: 2064 VPKAVASERSVAAVEIDSNILIVLDISGSMADASGVPGLSRLELAKQAISALLDKYDDL- 2122 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 V++ F+ S + KT + G T + A+ T Y+ Sbjct: 2123 ---GDVKVQLVTFSSNATDRTSV-----WVDVATAKTLLAGLSAGGGTNYDAAVATMYNA 2174 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK---AKSQGIRIMTIAFS 406 +S A+ +DG+ + + + I+ I Sbjct: 2175 FNTS--------GKLTGAQNVGYFFSDGKPNEGDIGTADEATLKAFLDANNIKNYAIGLG 2226 Query: 407 VNKTQQEKARYF 418 + Sbjct: 2227 SGVSNANLDPLA 2238 >gi|239945536|ref|ZP_04697473.1| hypothetical protein SrosN15_31402 [Streptomyces roseosporus NRRL 15998] gi|239992004|ref|ZP_04712668.1| hypothetical protein SrosN1_32194 [Streptomyces roseosporus NRRL 11379] Length = 416 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 54/192 (28%), Gaps = 22/192 (11%) Query: 269 KKKHLVRDALASVIR------SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + A V+ + D L RT Sbjct: 50 SRMAAAKQAFNDVLDAVPEQVQLGIRTLGADYPGEDRKVGCKDTRQLYPVGP----LDRT 105 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 KT G T I A+ A D + E + IVL+TDGE+T Sbjct: 106 EAKTAVATLAPTGFTPIGPALLGAADDL------------EGGEGSRRIVLITDGEDTCG 153 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + + ++G ++ + + + + A+ ++ EL+ + Sbjct: 154 PLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELSGRVK 213 Query: 443 DRIGNEIFERVI 454 + V Sbjct: 214 QLVDRAAEPVVT 225 >gi|198283210|ref|YP_002219531.1| hypothetical protein Lferr_1082 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218665578|ref|YP_002425441.1| type IV pilin biogenesis protein, putative [Acidithiobacillus ferrooxidans ATCC 23270] gi|198247731|gb|ACH83324.1| hypothetical protein Lferr_1082 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218517791|gb|ACK78377.1| type IV pilin biogenesis protein, putative [Acidithiobacillus ferrooxidans ATCC 23270] Length = 1137 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 41/442 (9%), Positives = 102/442 (23%), Gaps = 62/442 (14%) Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP---R 125 S ++++ + Y+ + + + Sbjct: 113 NTSGITAAAQSACASVGASGASSAYIDNSQSMLNVAEQAIGTIINTPEYNNNMQFGLMDY 172 Query: 126 KSAYQVVLSSRYDLLLNPLSLF---LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + +V L + + ++ + F + G S + + +I Sbjct: 173 ALSGRVSLYNTWAYYMSDNNGFSFGTSATGAPSGDVTVANPCYQSGSNSCINLQQNIFGQ 232 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + S S + + Q +PY ++ + Sbjct: 233 GNGSISDPYLYVQASSDAPVINDVLYAIAAQW---QPDATNAVGWGPNPYGLNLQSYMTG 289 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + S + +S + + + + + N + AT Sbjct: 290 NSGVNFSNYSNGLVSGMTPTSAGYFPLSQQVWESQRGYAFNANTSYNKGNIVSPISATNS 349 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + V + ++I + A+ T +S E + Sbjct: 350 ANATNIANAILPEVFRTDKSINYAAEGPITASAGYSPTAG---AFSTALSYYEGSLPNPP 406 Query: 363 NNLEAKKYIVLLTDGENTQD-----------------------------------NEEGI 387 KY++ +TDG+ TQ E I Sbjct: 407 PATCGSKYVIFITDGQPTQGMEHGYVYPPLGSASAQMFGVTSITASTWSSTNNNAVVETI 466 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKAR---------------YFLSNCASPNSFFEAN 432 +GI+ + ++ ++F+ A Sbjct: 467 NEIQALAQKGIKTYVLGVGSAVNPNVPGASAADQAEALQGQAVLTAMAQAGGTSNFYAAT 526 Query: 433 STHELNKIFRDRIGNEIFERVI 454 S ++ I N + + V Sbjct: 527 SASDVQSAMNSIIANILGKSVS 548 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 66/211 (31%), Gaps = 32/211 (15%) Query: 269 KKKHLVRDALASVIRSIKKID--------------NVNDTVRMGATFFNDRVI------- 307 +K + +A+ +++ +K+ D +V+ FN+ Sbjct: 317 RKIDQLIEAMQNILTDLKETDLFNIVRFGDLAMVWDVSQNQFTQLPNFNEYGNLEPHLRE 376 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + + K D++ +G T + ++ I + E + + Sbjct: 377 INLPRAVNGTEENIEAAKKIIEDKSRLGMTNMMYGLEVGLFLIKRTQE------ETPDKY 430 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNC 422 + IV LTDG I N S + I +++F ++ + N Sbjct: 431 QPMIVFLTDGHPNAGMSGRDEITNTVTSLNSGKKKASIFSLSFGDFADKRFLRKISSKNS 490 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 +E++ + F I + V Sbjct: 491 GFSRHIYESSDASLQLQDFYRAISAPLLSNV 521 >gi|332845803|ref|XP_510951.3| PREDICTED: integrin alpha-X [Pan troglodytes] Length = 1209 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 54/158 (34%), Gaps = 13/158 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 186 TQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTATAIQNVVHQLFHAS 243 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 244 ------YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 297 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 + L++ AS F+ L ++++ +I Sbjct: 298 WKE-LNDIASKPSQEHIFKVEDFDALKD-IQNQLKEKI 333 >gi|332222722|ref|XP_003260519.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Nomascus leucogenys] Length = 3535 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|297685094|ref|XP_002820135.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Pongo abelii] Length = 3553 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 95 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 154 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 155 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 202 Query: 400 IMTIAF 405 I T Sbjct: 203 IFTFGI 208 >gi|297270401|ref|XP_002800091.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Macaca mulatta] Length = 3386 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|193783708|dbj|BAG53619.1| unnamed protein product [Homo sapiens] Length = 868 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|157273368|gb|ABV27267.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 324 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 48/135 (35%), Gaps = 20/135 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ + G TA+ DA+Q + K +K IV+ +DG++ Sbjct: 150 RPADIEEAVDNIVWGGGTALLDAIQL----------SAEYADKEGKNRRKAIVVFSDGDD 199 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFS--------VNKTQQEKARYFLSNCA--SPNSFF 429 + + + +++ + F ++ +++A + + A + F Sbjct: 200 RDSYYDRRQLIKLLQEYQVQVYIVGFPDDDDDGGLFGRSTRKRAVQLIKDIANETGGRAF 259 Query: 430 EANSTHELNKIFRDR 444 S EL +I R Sbjct: 260 FPKSVDELPEIVRTI 274 >gi|55662683|emb|CAH74138.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665761|emb|CAH73557.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 845 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 87 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 146 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 147 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 194 Query: 400 IMTIAF 405 I T Sbjct: 195 IFTFGI 200 >gi|55662684|emb|CAH74139.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665762|emb|CAH73558.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55957947|emb|CAI14068.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 3548 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 87 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 146 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 147 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 194 Query: 400 IMTIAF 405 I T Sbjct: 195 IFTFGI 200 >gi|148886654|ref|NP_699197.3| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Homo sapiens] gi|296452942|sp|Q4LDE5|SVEP1_HUMAN RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=CCP module-containing protein 22; AltName: Full=Polydom; AltName: Full=Selectin-like osteoblast-derived protein; Short=SEL-OB; AltName: Full=Serologically defined breast cancer antigen NY-BR-38; Flags: Precursor Length = 3571 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|119579467|gb|EAW59063.1| hCG1794476, isoform CRA_b [Homo sapiens] Length = 1196 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|112180424|gb|AAH30816.1| SVEP1 protein [Homo sapiens] Length = 868 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|37222213|gb|AAQ89957.1| selectin-like protein [Homo sapiens] gi|68655017|emb|CAF04067.1| SEL-OB protein [Homo sapiens] Length = 3574 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 38/126 (30%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V R+ F+ + P + + R + E G T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A ++ + E+ K + L+TDG + + I + G+ Sbjct: 170 KGAFQQAAQILLHARENST----------KVVFLITDGYSNGGDPRP--IAASLRDSGVE 217 Query: 400 IMTIAF 405 I T Sbjct: 218 IFTFGI 223 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 69/212 (32%), Gaps = 21/212 (9%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + VS L G +D ++ + + S H ++ L+ V+ Sbjct: 410 ITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAP 469 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + VR+GA + D F + + + K G+T A+ Sbjct: 470 HK---VRVGAVQYADSWD--LEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSL 524 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + + + ++V+LT+G + E N+ + + IR+ I Sbjct: 525 LQKAKK------QRGNKVPCHLVVLTNGMSKDSILEP---ANRLREEHIRVYAIGI---- 571 Query: 410 TQQEKARYFLSNCASPN-SFFEANSTHELNKI 440 +E + L A + + L I Sbjct: 572 --KEANQTQLREIAGEEKRVYYVHDFDALKDI 601 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 21/182 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + S+ +D + +R+G +++ S S G+ ++ V + + A Sbjct: 249 GFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGI---NKSEVLQHIQNLSPRTGKA 305 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A A + + + N + VL+T + + + +G+ Sbjct: 306 YTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRD---SEDNVTKAAVNLRREGV 359 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNK---IFRDRIGNEIFER 452 I T+ Q L AS + + + +L F ++ N+I Sbjct: 360 TIFTLGIEGASDTQ------LEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 413 Query: 453 VI 454 V Sbjct: 414 VS 415 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 54/188 (28%), Gaps = 18/188 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ + D T++ G + + + D Sbjct: 195 FDIYKECSVYLVDELASFD---MTIQFGIISYATVPKVIIPIYDENSDNDAHVFEVIEND 251 Query: 331 EN-----EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + T I A++ Y+ + E + N I+LLTDG+ Sbjct: 252 LKYSDHKDKTGTNIKTALEEVYNMMSFQKETYKNESVWNSIHH-IIILLTDGKANIGGRP 310 Query: 386 GIAI--------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 I K + + + T + + + + F S +E+ Sbjct: 311 ADTIKHIEEFLDIKKKREDYLDVYTFGIGPDVDMADLSEIASKKD-GESHVFRMESANEM 369 Query: 438 NKIFRDRI 445 +F+ + Sbjct: 370 KTVFQKIV 377 >gi|128485808|ref|NP_001076062.1| integrin alpha-M [Ovis aries] gi|124127039|gb|ABM92271.1| CD11b [Ovis aries] Length = 1152 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 61/177 (34%), Gaps = 17/177 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 ++ +++V+ +K + ++D + +F+ ++ Sbjct: 162 DPVDFDRMKKFVSTVMSRFQKSKTL-----FALMQYSDDFRTHFTFNDFKRNSDPELLVR 216 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T ++ + S+ + A K ++++TDGE D E Sbjct: 217 PIGQLF--GRTHTATGIRKVVRELFHSSSGARN------HALKIMIVITDGEKYLDPLEY 268 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 + +A +GI I + +K+R L AS + F+ N+ L I Sbjct: 269 RDVIPEADRKGIIRYVIGVG-DAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTI 324 >gi|307298147|ref|ZP_07577951.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] gi|306916233|gb|EFN46616.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] Length = 704 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 55/182 (30%), Gaps = 25/182 (13%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K + AL V++ ++ D R F+ RV + + + Sbjct: 279 YGEKIEQAKRALEQVLQMLRPGD------RFAIVTFDGRVHNLTG--SLLDASEKAEWIE 330 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+QT+ D + K ++ LTDGE T+ + Sbjct: 331 KVRRIQADGMTNIYGALQTSIDMFSKYDTGRF----------KALLFLTDGEPTEGITDI 380 Query: 387 IAICN----KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 I + +A+++ + + + + R N + Sbjct: 381 GRIISDATPEARARNVHLFSFGVGTGVVAELLDRLVQEN---AGRVSYIIEGESIEAKVT 437 Query: 443 DR 444 D Sbjct: 438 DL 439 >gi|302343421|ref|YP_003807950.1| TadE family protein [Desulfarculus baarsii DSM 2075] gi|301640034|gb|ADK85356.1| TadE family protein [Desulfarculus baarsii DSM 2075] Length = 138 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 39/153 (25%), Gaps = 22/153 (14%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L G + AL +PV L V ++++ Y + L A++ ++ Sbjct: 4 FRRLAADGRGSVAVEFALFLPVFLLVIFSIIELGAAWYQKQMLVNASREGARLGALFSTS 63 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + S Sbjct: 64 GGLTAQEVQERVNQYLSD---------------------SGFPSQAVVQAVGVDGASGDP 102 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 V ++ D LS F+ ++ + + Sbjct: 103 VTVNVSADYEFPVLSAFIGAVP-GTISLSATTV 134 >gi|16263307|ref|NP_436100.1| hypothetical protein SMa1572 [Sinorhizobium meliloti 1021] gi|307304374|ref|ZP_07584125.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|14523986|gb|AAK65512.1| conserved hypothetical protein [Sinorhizobium meliloti 1021] gi|306902576|gb|EFN33170.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 141 Score = 46.9 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 43/132 (32%), Gaps = 6/132 (4%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L +S +G + AL+ +L + +++ R Y + L A A + ++ Sbjct: 8 LRRLFRSQSGATAVEFALVCLPLLLLVFGIIEFGRAFYVRNELSHAVDVA----ARRVLI 63 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + T + E L T V I P + Sbjct: 64 GQIARDATDSEALTKLAGAVRESFHSGDPTLLTIAVTKETVDGIAFRVLSIRYP--FTFV 121 Query: 131 VVLSSRYDLLLN 142 V ++ + LN Sbjct: 122 VPGLAQSPVSLN 133 >gi|126334042|ref|XP_001370610.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1154 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 48/154 (31%), Gaps = 13/154 (8%) Query: 293 DTVRMGATFFNDRVISD---PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R T F+ S+ F++ + K E G T A+Q Sbjct: 178 GQFRGTNTLFSLMQYSNDFRIHFTFNIFKNNPDPGILVNRIEQLGGLTFTATAIQKVVKE 237 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 I S +A K ++++TDG D E + A+ GI I Sbjct: 238 IFQSWNGAR------KDAVKILIVITDGRKESDRLEYEDVIPLAEKAGIIRYAIGVGNTF 291 Query: 410 TQQEKARYFLSNCASPN---SFFEANSTHELNKI 440 T + L AS F+ N+ L I Sbjct: 292 TTPSALQE-LRTIASQPSQEHVFQVNNFAALRNI 324 >gi|45384490|ref|NP_990665.1| collagen alpha-1(XIV) chain precursor [Gallus gallus] gi|1705533|sp|P32018|COEA1_CHICK RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|288873|emb|CAA50064.1| collagen XIV [Gallus gallus] Length = 1888 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 32/122 (26%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ + G+T A+ + K +L+ Sbjct: 213 NAYGTKDAVLDAVRNLPYKGGNTLTGLALTYILE------NSFKPEAGARPGVSKIGILI 266 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ + GI + I + L AS + Sbjct: 267 TDGK-SQDDVIPP--AKNLRDAGIELFAIGV------KNADINELKEIASEPDSTHVYNV 317 Query: 432 NS 433 Sbjct: 318 AD 319 >gi|170589747|ref|XP_001899635.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158593848|gb|EDP32443.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 634 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 46/139 (33%), Gaps = 17/139 (12%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D VR+ + FS+G ++ + I G T A+ A Sbjct: 102 TDKDDVRIAMIQYA--ETPIVEFSFGTYRDLPDITNHIMTINLHSGGTRTGKALLAAKGE 159 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVN 408 + S A K IVL TDG ++ I + + + I+I + + Sbjct: 160 LFSE------EKGARKNASKIIVLFTDGL---SVDDPIKHAQQLREIEKIKIYVVYVGSD 210 Query: 409 KTQQEKARYFLSNCASPNS 427 +QE ++ A S Sbjct: 211 GFEQE-----MNRIAGGRS 224 >gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790] gi|109625454|emb|CAJ51881.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790] Length = 799 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 52/159 (32%), Gaps = 25/159 (15%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + + D+ +G FN + R + G T I + ++ Sbjct: 410 LNALGQLGDSTSVGVVGFNRQAYEVVGLEQLTE--NRDTTRQRIRQLRAGGGTNIANGLR 467 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + ++L++DG + + +G+R++T+ Sbjct: 468 GAEEML-DGQRGT-------------VILISDGVD--ARSRATVVAESLGRRGVRVITVG 511 Query: 405 FSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + L A S ++F+AN T L +F Sbjct: 512 AGQRVNEP-----LLEQIADISGGTYFQANETDRLRILF 545 >gi|104779911|ref|YP_606409.1| hypothetical protein PSEEN0657 [Pseudomonas entomophila L48] gi|95108898|emb|CAK13594.1| conserved hypothetical protein [Pseudomonas entomophila L48] Length = 635 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 39/145 (26%), Gaps = 5/145 (3%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + AL M + L + VD R + L++ A A + A+ + Sbjct: 1 MAALTMGLALLFMLLSVDSGRLYLEQRKLQRIADMAALEAAEHSASCNGSGPQAFALARN 60 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + + L + + V N R A +V +S+ + Sbjct: 61 AATRN-----GLAVTDPLIASCGYLRTGNDNLRRFVADNTRNEAIKVEVSNVVVTSVAAG 115 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSR 169 + A A S Sbjct: 116 VFAMVQGNTVPRTTTLHASAVAASP 140 >gi|224046671|ref|XP_002199336.1| PREDICTED: collagen, type XIV, alpha 1 (undulin) [Taeniopygia guttata] Length = 1883 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 32/122 (26%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ + G+T A+ + K +L+ Sbjct: 213 NAYGTKDAVLDAVRNLPYKGGNTLTGLALTYILE------NSFKPEAGARPGVSKIGILI 266 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ + GI + I + L AS + Sbjct: 267 TDGK-SQDDVIPP--AKNLRDAGIELFAIGV------KNADINELKEIASEPDSTHVYNV 317 Query: 432 NS 433 Sbjct: 318 AD 319 >gi|218296567|ref|ZP_03497295.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] gi|218243109|gb|EED09641.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] Length = 706 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 43/146 (29%), Gaps = 18/146 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + + ++ + G T + A + A + Sbjct: 346 VVLFSSSPRVLFPPRPMTAQGKKEA-ESLLLSLRAGGGTVLGGAFREALRLLQDV----- 399 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +K +++L+DG E +A+ A G+ + +A + Sbjct: 400 ------PVERKALLVLSDGIIFDPKEPILALAATA---GVEVSALALGPDADAAFLEALA 450 Query: 419 LSNCASPNSFFEANSTHELNKIFRDR 444 F+ A + EL ++F Sbjct: 451 QR---GGGRFYRAATPKELPRLFLKE 473 >gi|73538303|ref|YP_298670.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121640|gb|AAZ63826.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 358 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 49/165 (29%), Gaps = 39/165 (23%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN------------- 364 + V T G TA+ + + A T++ ++ R+ N Sbjct: 154 SHSKDDVATAIEGLKPQGGTALGNGLLIALTTLLPQTTNDAERLMNGGDVAQPGKPGKAA 213 Query: 365 -------------LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT- 410 A IVL +DGE+ + A + G+R+ T+ + Sbjct: 214 PGELDNGEPVRPGSYASGAIVLFSDGESNSGPG-AVQAAQLAATYGVRVYTVGVGTTEGV 272 Query: 411 ---------QQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 + L A + +F T L K++R Sbjct: 273 VLSADGWSARVRLDEKVLKQVADTTGAEYFRLEDTAALKKVYRAL 317 >gi|260832994|ref|XP_002611442.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] gi|229296813|gb|EEN67452.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] Length = 1121 Score = 46.9 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 61/221 (27%), Gaps = 25/221 (11%) Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + S +V + P + + V + VR+ Sbjct: 902 CTFTCNHGYHLQGSSEVVCGADGQWRGTPPSCAACDSAADVIFSIDVSGSVSGQFDTVRN 961 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + V+ + + F S + + + + + Sbjct: 962 FIRGVVNCLTIG--GSHARVGVIKFAGSNANRQISLT---DYNNKNDLLVRIRNLDRSLG 1016 Query: 337 TAIND--AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 +A+ + + +S A+K ++LTDG +T ++ I + Sbjct: 1017 SAVGSVAGLSVMRNEFSTSGR---------PTARKIGIVLTDGRDTSSSDAVIPDAETLR 1067 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEAN 432 ++G I ++ R L N AS + F A Sbjct: 1068 NEGTTIFSVGV------ANARRETLENMASRPVNENVFTAT 1102 >gi|332185423|ref|ZP_08387171.1| tadE-like family protein [Sphingomonas sp. S17] gi|332014401|gb|EGI56458.1| tadE-like family protein [Sphingomonas sp. S17] Length = 193 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 38/126 (30%), Gaps = 4/126 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G + AL+ L + ++ ++A ++A+ A+ ++ + Sbjct: 18 LSDRRGATIVEFALVATPFLLLLIAIIQTSLAYLA----QEALESAVQVAARGVVTGQAQ 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S +S ++ E N L + + V+ A +A + Sbjct: 74 ASDVKGSSTGMTSAQLAERFRINGCAALPSFMSCSRLYVDVKSAATGAGLGSNAMPLTFD 133 Query: 135 SRYDLL 140 Sbjct: 134 GNGKPS 139 >gi|297287373|ref|XP_001118050.2| PREDICTED: hypothetical protein LOC721855 [Macaca mulatta] Length = 2077 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 69/194 (35%), Gaps = 27/194 (13%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + R +K+ Sbjct: 1126 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDDVEIIQGLT--RMPGDRDTLKSR 1183 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 1184 VDAIKYFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEP 1235 Query: 387 ----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNK 439 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 1236 CGGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRD 1290 Query: 440 IFRDRIGNEIFERV 453 + I I V Sbjct: 1291 A-EEVISQTIDTIV 1303 >gi|156402981|ref|XP_001639868.1| predicted protein [Nematostella vectensis] gi|156226999|gb|EDO47805.1| predicted protein [Nematostella vectensis] Length = 240 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 54/161 (33%), Gaps = 26/161 (16%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + R+G ++ +G + ++ V+ T A++ + Sbjct: 75 SSQTRVGVGLYSTFASVPIP--FGKYTSLQETVEGIKKLRYPGEGTRTGRALKLMKTHLF 132 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S + + H K +++LTDG ++ A + G+ + + + Sbjct: 133 SQSRPKAH---------KVLIVLTDG---TSVDDVKAPAKALRESGVEVFAVGIGEHYRP 180 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 +E L + A+ + H L FRD + E R Sbjct: 181 RE-----LKDIATD-------TGHVLTAGFRDLMSVERKLR 209 >gi|260810969|ref|XP_002600195.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] gi|229285481|gb|EEN56207.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] Length = 323 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 18/106 (16%), Positives = 40/106 (37%), Gaps = 15/106 (14%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T ++A+ A T+ +S + ++LLTDG+ +D+ + + ++ Sbjct: 208 TDTHEALDYARTTMFTSRTGLRPGSL------REVLLLTDGQPNEDD-LTVQAAERLRNS 260 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNK 439 GI + + + + L S + F N+ +L Sbjct: 261 GITVFALGIADGVDNEH-----LEQLVSDPEYKHIFHLNTFEDLAD 301 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 46/167 (27%), Gaps = 17/167 (10%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V + + + D +++G F+ + + K+ + T Sbjct: 1047 VASMVNQSNIGTDGIQIGLLQFSSIPQEEFRLNQYSSKVDIYSAIFDVQQMRD--GTRTG 1104 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ ++Y++++TDG + + + I I Sbjct: 1105 KALNFTL-------PFFDSSKGGRPSVQQYLIVITDGVAQDNVIIP---AKALRDKNIII 1154 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 I + K L + + + L + ++ + Sbjct: 1155 FAIGVG-----EAKKSQLLEITNDEDKVYHDVNFEALQNLEKEILSK 1196 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 54/167 (32%), Gaps = 17/167 (10%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D +D VR+G +ND + H L T+++ G T A++ Sbjct: 266 DISSDHVRVGLAQYNDNIYPAFQL--NQHPLKSTVLEQIQNLPYRTGGTNTGSALEFIRT 323 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++ + + ++L+TDGE N+E + ++ K G+ Sbjct: 324 NYLTEESGSRAK----DRVPQIVILVTDGE---SNDEVQEVADRLKEDGVV------VYV 370 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 + L AS F + + F I + V Sbjct: 371 VGVNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAV 417 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 33/99 (33%), Gaps = 10/99 (10%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + +Y++++TDG+++ + G+ I I +E L Sbjct: 934 ADTGRINVARYLIVITDGKSSDP---VAEAAEGLRENGVIIYAIGV------REANIDEL 984 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A + F L I ++ + + + +K Sbjct: 985 KEIA-KDKIFFVYEFDLLKDIQKEVVQDICSSEACKTSK 1022 >gi|84685161|ref|ZP_01013060.1| hypothetical protein 1099457000257_RB2654_09849 [Maritimibacter alkaliphilus HTCC2654] gi|84666893|gb|EAQ13364.1| hypothetical protein RB2654_09849 [Rhodobacterales bacterium HTCC2654] Length = 164 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 49/169 (28%), Gaps = 6/169 (3%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + ++ G L++P+ML + G+ ++ R + QA A + + Sbjct: 2 RAFRHIRRFAHRDEGAVLAEFGLVLPLMLILFGVTIEAARTFWA----YQAT-IAGVRDA 56 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + +E + + + I ++ F V + Sbjct: 57 TRYVARVETPTICDEVGADLDDWQATVTDIVRNASDGTLIFPASITVSSVTAALTCASGD 116 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 V + + +LN F R + + T + + Sbjct: 117 YRTGTVPV-ATVTAVLNITFPFARVFELAGMTLPTASTQVSDQGRIFGT 164 >gi|299067814|emb|CBJ39025.1| conserved protein of unknown function [Ralstonia solanacearum CMR15] Length = 348 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/263 (12%), Positives = 75/263 (28%), Gaps = 3/263 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G +I ++ + VG + VD+ R + L+ AA A + + L + Sbjct: 11 ARRQRGAVGLIMPAMLIAIFSVGALAVDIARLIVVRNELQNAADAAALAGAAGLYPTNPT 70 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ-VVL 133 + + K+ + + + + + P + V + Sbjct: 71 PNWSNGVTQGTSAVKLNKSSNVQLVSGTVQAGYWNLTGTPAGLQSQSITPGANDVPAVQV 130 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD-Y 192 + N + I L + + G + I S+ + D Y Sbjct: 131 TISRSAGNNGGPVATLLAPIFGALSANSSATAVAVIAAPGSAGPGALFPIAISKCLYDLY 190 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK-LSPYMVSCNKSLYYMLYPGPLDP 251 S GQP + +S + + + S L Sbjct: 191 WNYSTGQPKIDPSTGKPYVFQINTSYPTSSSSCTSGEWTGFNGPTDASTLKGLVQNGNTS 250 Query: 252 SLSEEHFVDSSSLRHVIKKKHLV 274 +LS + ++++ V ++ Sbjct: 251 TLSIGNMINTALATGVKSSVYMA 273 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/427 (9%), Positives = 108/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + ++ V + +S L L Sbjct: 123 YHSQANKLIVDNKEYDIIDETENKKDTDVSVPKPDEIEEESSKENENSVS---PFTLPTL 179 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 180 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 236 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 237 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 296 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 297 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 355 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 356 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 415 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 416 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 475 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 476 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 534 Query: 448 EIFERVI 454 +I V Sbjct: 535 QISATVS 541 >gi|288875|emb|CAA50063.1| collagen XIV [Gallus gallus] Length = 1857 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 32/122 (26%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ + G+T A+ + K +L+ Sbjct: 213 NAYGTKDAVLDAVRNLPYKGGNTLTGLALTYILE------NSFKPEAGARPGVSKIGILI 266 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ + GI + I + L AS + Sbjct: 267 TDGK-SQDDVIPP--AKNLRDAGIELFAIGV------KNADINELKEIASEPDSTHVYNV 317 Query: 432 NS 433 Sbjct: 318 AD 319 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 47/161 (29%), Gaps = 20/161 (12%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++G F+D + V G T + M AY+ + S Sbjct: 458 DDKLGVVTFDDSAHWTIELDKVPSQDD---VVAALAPVPPSGQTNVVSGMNAAYEQLRQS 514 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + K+ +LLTDG + +I GI + +A Sbjct: 515 DAKI-----------KHAILLTDG--WGHATDIGSIAENMNKDGITLSVVAAGNGSDNAL 561 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L ++ A E+ +IF + ++ Sbjct: 562 QRYAEL----GGGRYYPARVMEEVPQIFLQETIQAVGTYIV 598 >gi|91201647|emb|CAJ74707.1| hypothetical protein kuste3944 [Candidatus Kuenenia stuttgartiensis] Length = 339 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/217 (10%), Positives = 58/217 (26%), Gaps = 40/217 (18%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + A+ ++ + +G + F + L +++ + Sbjct: 87 CIDTSFSTTGTAIETIKEVVAAFIKKRSNDIIGLSIFGTDAALITMPTMETELLEKSLER 146 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE------------------- 366 + T+I + + T+ + Sbjct: 147 VHSSQM--GYQTSIGEGIFTSITALFEKEMGSRFTFTELRNSINKQYLDDYAVSFVKEMK 204 Query: 367 -----AKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTIAF------SVNKTQQE 413 K I+L TDG + + + + GI+ +A ++ Sbjct: 205 KRDLLKNKLIILFTDGIYNIGISPDRPLRLLQRM---GIKAYVVAVKASDVYGIDPDIAA 261 Query: 414 KARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNE 448 + L ++ ++ A++ E+ K F D I Sbjct: 262 QHIQELQEAIESTGGKYYHADNFEEVAK-FYDEIDRI 297 >gi|284922261|emb|CBG35346.1| putative lipoprotein [Escherichia coli 042] Length = 588 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/427 (8%), Positives = 119/427 (27%), Gaps = 30/427 (7%) Query: 28 LLMPVM--LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 ++M +M L + G + + +++E R+ + Sbjct: 6 IIMLLMSSLILSGCGPESENKESLQQQPSTPTDQ---QVLAAQQAAIKEAEQRSVAAKAT 62 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 K + + + K ++ + S ++RY + Sbjct: 63 ADAKAKALAQQEAQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPV 122 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + ++ + + R + + + + + D + Sbjct: 123 KQVAQNPLATFSLDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSNNK 182 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 +P + + + + + P+ + +D+S Sbjct: 183 EPVPASKP--IPFAMRCELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 240 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 ++ L++ +L +++ +++ DN+ + G H + + Sbjct: 241 ISDERLPLIQSSLKLLVKELREQDNIA------IVTYAGDSRIALPSISGSH---KAEIN 291 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + GST ++ AY ++ + I+L TDG+ ++ Sbjct: 292 AAIDSLDAEGSTNGGAGLELAYQ--QAAKGFIKGGINR-------ILLATDGDFNVGIDD 342 Query: 386 GI---AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 ++ K + G+ + T + + ++ ++ E K+ Sbjct: 343 PKSIESMVKKQRESGVSLSTFGVGDSNYNEAMMVRI--ADVGNGNYSYIDTLAEAQKVLN 400 Query: 443 DRIGNEI 449 + + Sbjct: 401 SEMRQTL 407 >gi|198421549|ref|XP_002127942.1| PREDICTED: similar to calcium activated chloride channel 4 [Ciona intestinalis] Length = 1075 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/212 (9%), Positives = 60/212 (28%), Gaps = 15/212 (7%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVN 292 N + + P+++ + + + + ++++ + + Sbjct: 296 DFNAVNHPNPTLENILPTINVRQQPTNRYVLVLDTSGSMSGSNYEYMMQAATDFIMTYIP 355 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 G F+ + +K R + + GST I + + + + Sbjct: 356 KGAEAGIVEFSYTATTLSQLVSIENKADREYLASRL-PGQPDGSTCIGCGILNGIEVLSN 414 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 D +++LTDGE + + A + + +I F + Sbjct: 415 QGRDPAGGQ---------LIVLTDGEENYSPYVND-VRDNAIEAHVVVDSIFFGASGNGA 464 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + + + L + F+ Sbjct: 465 --LQQLTEDTKGTMYYNDVTDITGLKETFKQL 494 >gi|156358436|ref|XP_001624525.1| predicted protein [Nematostella vectensis] gi|156211311|gb|EDO32425.1| predicted protein [Nematostella vectensis] Length = 1323 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/347 (9%), Positives = 91/347 (26%), Gaps = 17/347 (4%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + D + + + V R + + + + +T Sbjct: 979 SIKDSQFSASSEYSVTLKAGNARLHLVPRDGRGGGWAAKNNDNKQWLQLDFDSYKTVTRI 1038 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 T R + S S+ + + + + +V +S + Sbjct: 1039 ATQGRQDANQWVTSYSVKYSQDGSVFSPYQHNGKDVIFTGNRD-RGSVVLHSFDPPITNV 1097 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGP-----LDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 R +L P + SL Y P L + + + R A Sbjct: 1098 RMLRLLPLTWHSHISLRLEFYSCPGEIPHLAKPCPKSLDIGIALDRSTSVGPTNFNIAKT 1157 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + ++++ + G ++ SF + G T Sbjct: 1158 FLKILVERMKISTNGSHFGLIAYSSSASRVISFRFSQKAADINRQIDAIE--FTGGKTRT 1215 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + A+Q A + +++ + ++++T+G +Q + + K + + Sbjct: 1216 DFALQVAITDLFTNS------AGDRENVTDVLIVMTNGRTSQGSLPYKDVMKPLKEKKVD 1269 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ + + + E + + L + Sbjct: 1270 VIAVGIGPDVNEAELLEIAE---GGLDHVIRVDDYEALATKLNSILA 1313 >gi|148685685|gb|EDL17632.1| mCG133512, isoform CRA_a [Mus musculus] Length = 1164 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 60/179 (33%), Gaps = 17/179 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++D + +++ + +++ + + +F+ L + Sbjct: 171 SDFTQMKDFVKALMGQLASTSTS-----FSLMQYSNILKTHFTFTEFKSSLSPQSLVDAI 225 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T +Q + S AKK ++++TDG+ +D E Sbjct: 226 VQLQ--GLTYTASGIQKVVKELFHSKNGAR------KSAKKILIVITDGQKFRDPLEYRH 277 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 + +A+ GI I + ++ A L+ S + F+ + L I R Sbjct: 278 VIPEAEKAGIIRYAIGVG-DAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQI 335 >gi|148685686|gb|EDL17633.1| mCG133512, isoform CRA_b [Mus musculus] Length = 1168 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 60/179 (33%), Gaps = 17/179 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++D + +++ + +++ + + +F+ L + Sbjct: 171 SDFTQMKDFVKALMGQLASTSTS-----FSLMQYSNILKTHFTFTEFKSSLSPQSLVDAI 225 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T +Q + S AKK ++++TDG+ +D E Sbjct: 226 VQLQ--GLTYTASGIQKVVKELFHSKNGAR------KSAKKILIVITDGQKFRDPLEYRH 277 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 + +A+ GI I + ++ A L+ S + F+ + L I R Sbjct: 278 VIPEAEKAGIIRYAIGVG-DAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQI 335 >gi|88911344|sp|Q3V0T4|ITAD_MOUSE RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|74215609|dbj|BAE21419.1| unnamed protein product [Mus musculus] Length = 1168 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 60/179 (33%), Gaps = 17/179 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++D + +++ + +++ + + +F+ L + Sbjct: 173 SDFTQMKDFVKALMGQLASTSTS-----FSLMQYSNILKTHFTFTEFKSSLSPQSLVDAI 227 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T +Q + S AKK ++++TDG+ +D E Sbjct: 228 VQLQ--GLTYTASGIQKVVKELFHSKNGAR------KSAKKILIVITDGQKFRDPLEYRH 279 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDR 444 + +A+ GI I + ++ A L+ S + F+ + L I R Sbjct: 280 VIPEAEKAGIIRYAIGVG-DAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQI 337 >gi|292490950|ref|YP_003526389.1| von Willebrand factor A [Nitrosococcus halophilus Nc4] gi|291579545|gb|ADE14002.1| Forkhead-associated protein [Nitrosococcus halophilus Nc4] Length = 510 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 52/171 (30%), Gaps = 25/171 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + I+ + ++ F+ K +V M+ Sbjct: 70 LNFINGLRGDSQIAIILFDGGTSVAMPLVPVSQKTRDALVTNLDKLTYTGRFRNSAAGME 129 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE---------GIAICNKAKS 395 A + + EA+K I+LLT+G +E+ + ++A Sbjct: 130 RALYEL---------KNFGRPEAEKAIILLTNGPIETGDEKRDRDFSRWMQEYLAHEAAE 180 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 GI++ IAF+ L A + +++ A +L F Sbjct: 181 AGIKVFGIAFTEAADFH-----LLQILAHTTGGTYYRAPQAVDLQSAFNRI 226 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 38/129 (29%), Gaps = 17/129 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TIVKTF 327 K + AL S + K N ND + + F G + + + + T Sbjct: 98 KLDSAKYALQSAVDYFKANANPNDRFALVPFSDGVQSDKVVPFPSGTYDVKQHLNWIATV 157 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNE 384 A G T A+Q A KKYI+ LTDG T Sbjct: 158 ANSLRANGGTNYTQALQQAQSFFNDPAR------------KKYIIFLTDGMPTVSIAKEP 205 Query: 385 EGIAICNKA 393 +C Sbjct: 206 ITYKVCEGI 214 >gi|170748500|ref|YP_001754760.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170655022|gb|ACB24077.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 195 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 40/134 (29%), Gaps = 5/134 (3%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + +LI G I +++ P++L + +++ R L AA Sbjct: 1 MLSGRPRACLIRLIGDREGVSAIEFSVIAPILLLILMGSIELPRAYMIGKRLDNAA---- 56 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 LI + + T + + ++D V ++ E Sbjct: 57 -ATMADLISRGSYADLKPVFAATGAISNPYDVSRASIVLTAAGTYSDGSVATTKVCSSAE 115 Query: 122 MNPRKSAYQVVLSS 135 N + L + Sbjct: 116 SNGQARTAGSSLGA 129 >gi|118593261|ref|ZP_01550646.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] gi|118434152|gb|EAV40808.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] Length = 772 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 41/146 (28%), Gaps = 19/146 (13%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 I + F++ + + + F D + G T IN A+ Sbjct: 388 MTAAIKALRPDDYFRILHFSNDTSQFAGQAVLATERNKQKALKFVADLSAGGGTEINQAV 447 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A+D + +V LTDG + +I N+ RI Sbjct: 448 NAAFDQAQP------------DNTTRIVVFLTDGYIGDEATVIKSIANRIGKA--RIYAF 493 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFF 429 + R+ L A+ + Sbjct: 494 GVGNSVN-----RFLLDAMATEGRGY 514 >gi|291395815|ref|XP_002714336.1| PREDICTED: complement component 2 isoform 2 [Oryctolagus cuniculus] Length = 613 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 16/152 (10%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++I ++ D T I A+ + + + E + I+LL Sbjct: 183 DMTEVINSLENAKYTDHENGTGTNIYKALNAVNIMMNNQMQRLGMETTGWQEIRHAIILL 242 Query: 375 TDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASP 425 TDG++ + K I+ I I + S Sbjct: 243 TDGKSNMGGS-PKPAVDSIKDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELASKKDGE 301 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F L ++F V ++T Sbjct: 302 RHAFILQDARALQQVFEHM------LDVSKLT 327 >gi|291395813|ref|XP_002714335.1| PREDICTED: complement component 2 isoform 1 [Oryctolagus cuniculus] Length = 744 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 42/152 (27%), Gaps = 16/152 (10%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++I ++ D T I A+ + + + E + I+LL Sbjct: 314 DMTEVINSLENAKYTDHENGTGTNIYKALNAVNIMMNNQMQRLGMETTGWQEIRHAIILL 373 Query: 375 TDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASP 425 TDG++ + K I+ I I + S Sbjct: 374 TDGKSNMGGS-PKPAVDSIKDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELASKKDGE 432 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F L ++F V ++T Sbjct: 433 RHAFILQDARALQQVFEHM------LDVSKLT 458 >gi|297473612|ref|XP_002686715.1| PREDICTED: collagen, type XXVIII-like, partial [Bos taurus] gi|296488704|gb|DAA30817.1| collagen, type XXVIII-like [Bos taurus] Length = 327 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 54/178 (30%), Gaps = 20/178 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++++ + ++ + T R+G ++ +V + K + + Sbjct: 27 FQIIKNFVKTLTDRVAL---DLATTRVGIINYSHKVEEVAHLTQFSSKDD--LKRAVDNM 81 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAI 389 + T A+ A ++ KK +++TDG+ +T+D + + Sbjct: 82 QYLGEGTYTATALHAANRMFEAAKPG----------VKKVALVITDGQTDTRDEKNLTEV 131 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKA-RYFLSNCASPN---SFFEANSTHELNKIFRD 443 A + I I ++ A+ + + L + Sbjct: 132 VKNASDINVEIFVIGVVKRNDPNFHIFHQEMNLIATDPDSEHVYLFDDFITLEDTLKQ 189 >gi|149184582|ref|ZP_01862900.1| TadE-like protein [Erythrobacter sp. SD-21] gi|148831902|gb|EDL50335.1| TadE-like protein [Erythrobacter sp. SD-21] Length = 153 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 4/63 (6%) Query: 8 IFYSKKL---IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA-AQTAIIT 63 I +KL ++ G I TA+++PV++ + + R E L+ A A+ A + Sbjct: 2 IRMLRKLAAIRRNENGSVVIETAIVLPVLVLMALGGFETSRIVSRESELQAAIAEGAAVV 61 Query: 64 ASV 66 + Sbjct: 62 LAT 64 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 59/226 (26%), Gaps = 21/226 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFS 406 IVLLTDGE+ C G I TIA Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG 441 >gi|148699893|gb|EDL31840.1| procollagen, type VI, alpha 1, isoform CRA_b [Mus musculus] Length = 643 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 27/192 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 90 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVD 149 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 150 AVK-YFGKGTYTDCAIKKGLEELL--------IGGSHLKENKYLIVVTDGHPLEGYKEPC 200 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANS---THEL 437 N+AK GI++ ++A + LS A+ +++ F A + + Sbjct: 201 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGHSRDA 255 Query: 438 NKIFRDRIGNEI 449 ++ I + Sbjct: 256 EEVISQTIDTIV 267 >gi|26343093|dbj|BAC35203.1| unnamed protein product [Mus musculus] Length = 266 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 27/192 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 58 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVD 117 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 118 AVK-YFGKGTYTDCAIKKGLEELL--------IGGSHLKENKYLIVVTDGHPLEGYKEPC 168 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANS---THEL 437 N+AK GI++ ++A + LS A+ +++ F A + + Sbjct: 169 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGHSRDA 223 Query: 438 NKIFRDRIGNEI 449 ++ I + Sbjct: 224 EEVISQTIDTIV 235 >gi|6753484|ref|NP_034063.1| collagen alpha-1(VI) chain precursor [Mus musculus] gi|543913|sp|Q04857|CO6A1_MOUSE RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|50479|emb|CAA47032.1| collagen alpha1 type VI-precursor [Mus musculus] gi|148699892|gb|EDL31839.1| procollagen, type VI, alpha 1, isoform CRA_a [Mus musculus] gi|162318378|gb|AAI56501.1| Collagen, type VI, alpha 1 [synthetic construct] gi|225000678|gb|AAI72708.1| Collagen, type VI, alpha 1 [synthetic construct] Length = 1025 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 27/192 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 58 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVD 117 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 118 AVK-YFGKGTYTDCAIKKGLEELL--------IGGSHLKENKYLIVVTDGHPLEGYKEPC 168 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANS---THEL 437 N+AK GI++ ++A + LS A+ +++ F A + + Sbjct: 169 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGHSRDA 223 Query: 438 NKIFRDRIGNEI 449 ++ I + Sbjct: 224 EEVISQTIDTIV 235 >gi|330465087|ref|YP_004402830.1| hypothetical protein VAB18032_05520 [Verrucosispora maris AB-18-032] gi|328808058|gb|AEB42230.1| hypothetical protein VAB18032_05520 [Verrucosispora maris AB-18-032] Length = 141 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 34/121 (28%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I A+ + +L + GM D + A A + + Sbjct: 9 GRVSIFLAIALVGVLAIIGMAFDGAGQLRTLQRAENLAAEAARAGGQAIDLATAIEGGPK 68 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + ++ + +YL + +F + +R R S+ + Sbjct: 69 QINRRQARRAVADYLAAAGASGHTVSFPVVDGETQIRVRVSVTYRRAMLGLFGFSNTVTV 128 Query: 140 L 140 Sbjct: 129 S 129 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 ADSS-ITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|291448990|ref|ZP_06588380.1| secreted protein [Streptomyces roseosporus NRRL 15998] gi|291351937|gb|EFE78841.1| secreted protein [Streptomyces roseosporus NRRL 15998] Length = 420 Score = 46.5 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 54/192 (28%), Gaps = 22/192 (11%) Query: 269 KKKHLVRDALASVIR------SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + A V+ + D L RT Sbjct: 54 SRMAAAKQAFNDVLDAVPEQVQLGIRTLGADYPGEDRKVGCKDTRQLYPVGP----LDRT 109 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 KT G T I A+ A D + E + IVL+TDGE+T Sbjct: 110 EAKTAVATLAPTGFTPIGPALLGAADDL------------EGGEGSRRIVLITDGEDTCG 157 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + + + ++G ++ + + + + A+ ++ EL+ + Sbjct: 158 PLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELSGRVK 217 Query: 443 DRIGNEIFERVI 454 + V Sbjct: 218 QLVDRAAEPVVT 229 >gi|281348096|gb|EFB23680.1| hypothetical protein PANDA_015036 [Ailuropoda melanoleuca] Length = 3047 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1243 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1296 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1297 QDDVEAPS---KKLKDEGVELFAIGI------KNADEDELKMIATDPDDTHAYNVADFES 1347 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1348 LSKIVDDL 1355 >gi|229116005|ref|ZP_04245400.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] gi|228667499|gb|EEL22946.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] Length = 452 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 57/180 (31%), Gaps = 26/180 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGAT-FFNDRVISDPSFSWGVHKL-------IR 321 K + A+ + + I N V + +D S S G ++ + Sbjct: 173 KMEAAKKAIYNYLDKI----PNNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKK 228 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + G T + A+++ + + E + +++DGE T Sbjct: 229 EQFNAALSNFGPKGWTPLASAIESVNNDF---------KEYTGEENLNVVYIVSDGEETC 279 Query: 382 DNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + I F V ++Q++ + A ++ ++ EL + Sbjct: 280 G-GDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAE--AGKGNYATVSNAEELYQ 336 >gi|118093660|ref|XP_001235190.1| PREDICTED: similar to Bardet-Biedl syndrome 5 [Gallus gallus] Length = 1556 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 31/232 (13%), Positives = 74/232 (31%), Gaps = 25/232 (10%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE-----EHFVDSSSLRHVIKKKH 272 G G+ E L K ++++ E +DSS Sbjct: 704 DVGDPGLSGEPLVNKNFDSVKFIHHLKETNGCGIKCKEIPMELVFVIDSSESVGPEN-FE 762 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +++D + +++ + + R+G ++ V + H + + + + Sbjct: 763 IIKDFVTALVDRVTV---GRNATRIGLVLYSLEVQLEFGL--NKHTTQQDVKRAIRKMQY 817 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-N 391 T A++ A + +K ++LTDG+ + + I Sbjct: 818 MGEGTYTGTAIRKATQ---------EGFLGARTGVRKVAIVLTDGQADKREAVKLDIVVR 868 Query: 392 KAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 +A + I + I + + Q + + L+ AS + + + L Sbjct: 869 EAHAANIEMYAIGIVNTSDPTQAEFVHELNLIASDPDREHMYLIDDFNTLPA 920 >gi|110636421|ref|YP_676629.1| TadE-like [Mesorhizobium sp. BNC1] gi|110287405|gb|ABG65464.1| TadE-like protein [Chelativorans sp. BNC1] Length = 189 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 17/51 (33%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 ++ + G I ALL L V +++ + L AA Sbjct: 17 TALLRRFGREKKGATAIEFALLAFPFLLVLFAIIESAVAFVAQELLTNAAD 67 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 62/188 (32%), Gaps = 24/188 (12%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVN-DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K L + A + D V D+ + + + + + Sbjct: 430 KSKIDLAKQAAIDSLGQFVPRDQVGLWQFATHLD--GDKDYQELLPVQPLGSNGKETLAS 487 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEE 385 T + D+ AY+ + + + +V+LTDG N + Sbjct: 488 RLSGLTPQSGTGLYDSSLAAYEYLKAHLDPSAINA---------VVVLTDGRNEDPGGVD 538 Query: 386 GIAICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 + + + +G +R+ TIA+ + Q L A+ S ++++ +N+ Sbjct: 539 LDHLVPQLRPEGNAESVRLFTIAYGGDADQN-----VLKQIAEATAGSEYDSSKPDSINQ 593 Query: 440 IFRDRIGN 447 +F I N Sbjct: 594 VFTSVISN 601 >gi|297559546|ref|YP_003678520.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843994|gb|ADH66014.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 587 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 52/188 (27%), Gaps = 10/188 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + A + + Sbjct: 396 TRMQVTSAAATQGLEMFTPSSELGLWEFSTNVNNELHYQEIAPIRELQAAADDGTAHRDV 455 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGI 387 + + Y+T +++ ++ + + I++LTDG+N Sbjct: 456 LAGALASLQPLPQGDTALYETYLAAYQEMSRTYQPDRTN--VILMLTDGDNDNPGGLGLD 513 Query: 388 AICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + ++ +S + I I+TIAF + E + A+ + + E+ +IF Sbjct: 514 ELMSQIESLASPSRPIPIITIAFGPDVQNLEPLQEI--AAATGGAAYMTEDPTEIGEIFL 571 Query: 443 DRIGNEIF 450 I Sbjct: 572 QAFSLRIS 579 >gi|194228584|ref|XP_001915384.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Equus caballus] Length = 1313 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 43/158 (27%), Gaps = 10/158 (6%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 F+D V + + K + G T IN A+ A + Sbjct: 315 NDYFNIISFSDTVSVWKAGCSIQATIQNVYDAKDYLGHMEAGGWTDINTALLAAASVLYP 374 Query: 353 SNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGIAICNKAKS---QGIRIMTIAFSV 407 SN + I+ LTDGE T D I + + + ++ F Sbjct: 375 SN----XEPGRGPSVGRISLIIFLTDGEPTADMMTPSVILSNILQALGNRVNLFSLVFWD 430 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L N + ++ + I Sbjct: 431 DADFPLLRHLSLENWGAAWHIYKDTDAALHLEGLYKEI 468 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 52/148 (35%), Gaps = 9/148 (6%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++ + F+ ++++ ++ G+T+ + A+ AY + ++ + Sbjct: 3 LITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTDTDNEN- 61 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 KK I++ TDG + + K +G+ I I ++ Sbjct: 62 ---GVREGVKKMIIIFTDG---HSQRSPQDMALRLKDKGVEIFAITL-TPAPYADEGE-L 113 Query: 419 LSNCASPNSFFEANSTHELNKIFRDRIG 446 LS + + F + + F IG Sbjct: 114 LSITQNTDHIFTPVNLKDFEIKFLPYIG 141 >gi|148656642|ref|YP_001276847.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568752|gb|ABQ90897.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 452 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 58/184 (31%), Gaps = 23/184 (12%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + V AL +V+ + + D R+ F D + G ++ Sbjct: 99 RKAASSALDHVVHALHTVVERLDRND------RLSLVVFADHALLLIPGMVGSDRVTLVR 152 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + T + D + A + I ++ + ++LLTDG T+D Sbjct: 153 AIERLPGLDLGDGTNLADGIALALNQIRANRDARRANR---------VLLLTDG-FTRDP 202 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIF 441 + + ++A + I I TI E L+ A S + + + Sbjct: 203 AACLTLADQAADEHIAITTIGLGG-----EFQDDLLTGIADRSGGNALFLKRASAIPRAI 257 Query: 442 RDRI 445 + Sbjct: 258 SAEL 261 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 42/427 (9%), Positives = 108/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + ++ V + +S L L Sbjct: 123 YHSQANKLIVDNKEYDIIDETENKKDTDVSVPKPDEIEEESSKENENSVS---PFTLPTL 179 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 180 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 236 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 237 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 296 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 297 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 355 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 356 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSRLSVPNGHKKVIVLLTDGVPTFSYKVQRV 415 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 416 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 475 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 476 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 534 Query: 448 EIFERVI 454 +I V Sbjct: 535 QISATVS 541 >gi|309364363|emb|CAP25017.2| CBR-CLEC-62 protein [Caenorhabditis briggsae AF16] Length = 393 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 52/168 (30%), Gaps = 8/168 (4%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDEN 332 + R + TVR+G +N++ + + V + Sbjct: 61 IVTIFGGGTRIGTIAYSDKRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLS 120 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 + + A + + + + ++ +D + E+ + Sbjct: 121 ASDEVYLAKGLDAAESVLSAG----RKNATRSNYKQLVLIYASDYRDD-GEEDPRPTAER 175 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 KS G+ I T+AF ++T E + ASP F + +I Sbjct: 176 MKSSGVSIATVAF--DQTGNEGVVKAIGEIASPGFNFTNEDADLVREI 221 >gi|332256817|ref|XP_003277513.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Nomascus leucogenys] Length = 3172 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 51/170 (30%), Gaps = 20/170 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA-I 339 + +K + + FN ++ + + V + + + +G T Sbjct: 59 LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYR---TKQEVLSHISNMSYIGGTNQT 115 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 ++ + ++ + IV+LTDG + A + KS + Sbjct: 116 GKGLEYIMQSHLTKAAGSRAGDG----VPQVIVVLTDGHSKDGLALPSA---ELKSADVN 168 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIG 446 + +I + L AS F + L+ I + + Sbjct: 169 VFSIGVE------DADEGALKEIASEPLNMHVFNLENFTSLHDIVGNLVS 212 >gi|291232650|ref|XP_002736273.1| PREDICTED: mind bomb 2-like [Saccoglossus kowalevskii] Length = 847 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 66/238 (27%), Gaps = 22/238 (9%) Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 SS++G G S + L +D+S + L Sbjct: 403 QGSSKDGASGASSPSGETQGGQSGTSQACIELKYLYIMPLDTVLCLDTSGSMAGRGLREL 462 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + ++ + + F + + + + K Sbjct: 463 KKACTEFLLGIQQTATQTGLRENVAVVEFGSKTRIVRNLTDNYR-----LTKNAIDSLQA 517 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN-- 391 G+T + + + A +I + + + I+L+TDG ++ A + Sbjct: 518 GGTTPMFEGLMEAMKEVIQNGGVLTLPGGKKMTPR--IILMTDGRPDDKDQVLKAAMSFG 575 Query: 392 -KAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 K+ G I I + + + L+ A + F +L FR Sbjct: 576 PLWKAVGLPFPIPIACVG-----CGPDVDKELLAVIAKVTNGM-FVVGDISQLGDFFR 627 >gi|58176651|pdb|1RRK|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B gi|58176654|pdb|1RS0|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B Complexed With Di-Isopropyl-Phosphate (Dip) gi|58176655|pdb|1RTK|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B Complexed With 4-Guanidinobenzoic Acid Length = 497 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 41/154 (26%), Gaps = 18/154 (11%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S + + + + D T A+Q Y + D+V N I+ Sbjct: 62 SSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--WPDDVPPEGWNRTRH-VII 118 Query: 373 LLTDGENTQDNEEGIAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFL 419 L+TDG + + I + ++ + + R + Q Sbjct: 119 LMTDGLHNMG-GDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALAS 177 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + L +F I + Sbjct: 178 KKD-NEQHVCKVKDMECLEDVFYQMIDESQSLSL 210 >gi|313886050|ref|ZP_07819788.1| double-transmembrane N-terminal domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924580|gb|EFR35351.1| double-transmembrane N-terminal domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 342 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 15/155 (9%), Positives = 43/155 (27%), Gaps = 21/155 (13%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ + + + ++ + R+ F + + Sbjct: 101 SNSMAARDVKPDRIGFAKQIVTHTMQELAGS-------RVAMVVFAGGAYIRLPLTPDLP 153 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + T + A++ + + + + A K ++LLTDG Sbjct: 154 TART-FLADIQPGMVSNQGTNLGQALERSAQALSAPSR-----------AGKAVILLTDG 201 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 E+ + ++ K Q I+ + + Sbjct: 202 EDHEG--GLEEGIDRLKKQEIKAYVVTIGLPDGAN 234 >gi|149921504|ref|ZP_01909956.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] gi|149817707|gb|EDM77174.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] Length = 560 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 56/178 (31%), Gaps = 20/178 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 L+R+ ++ +K D V+ + + + ++ Sbjct: 234 TPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWTLAGYAVTGP-------NDELLLEKI 286 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D G T + +++ Y+ + + + +VL++DG + Sbjct: 287 NDVVHGGGTNLYGGLESGYEL--------AQMVYDPDAINR-LVLISDGGANAGITDLDL 337 Query: 389 ICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I A S GI ++ + ++ +++ S F S E+ F D Sbjct: 338 IAENAAYGGSDGIYLVGVGVDDPDDYNDELMDAVTDAGKGASVFM-PSEEEVWTTFGD 394 >gi|324515146|gb|ADY46105.1| C-type lectin protein 160 [Ascaris suum] Length = 309 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 51/152 (33%), Gaps = 16/152 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 VR+ F+ + F + + + I A+QTA + Sbjct: 44 QFVRVALVTFSGHATIAGDLRT-FGNYSSLVDALFRMPFHGDSTIDIVKALQTASSILED 102 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + K ++L + ++ + AI N+ K G I+T+AF Sbjct: 103 ----------SRKYVKTVVLLYSSAFSSGGFTDPTAIANQLKESGTTIVTVAFRQQPEGS 152 Query: 413 EKARYFLSNCASPNSFFEANSTH---ELNKIF 441 + L + ASP+ F + + EL F Sbjct: 153 LVEK--LGSIASPHFAFNSMDSDIIGELLDAF 182 >gi|114576315|ref|XP_515709.2| PREDICTED: matrilin-3 [Pan troglodytes] Length = 486 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 70/232 (30%), Gaps = 27/232 (11%) Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G R +P + + G + F+ SS + V+ ++ Sbjct: 49 GRRPSPAAPDGTPASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSR 108 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAI 339 +I ++ D R+ + V + + +K T Sbjct: 109 IIDTL---DIGPADTRVAVVNYASTVKIEFQL---QAYTDKQSLKQAVGRITPLSTGTMS 162 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+QTA D + K +++TDG + ++ + +A++ GI Sbjct: 163 GLAIQTAMDEAFTVEAGAREPS---SNIPKVAIIVTDG---RPQDQVNEVVARAQASGIE 216 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS---THELNKIFRDRI 445 + + + + L AS F + +L+ F++ Sbjct: 217 LYAVGV----DRADMES--LKMMASEPLEEHVFYVETYGVIEKLSSRFQETF 262 >gi|111114995|ref|YP_709613.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|216263812|ref|ZP_03435806.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] gi|110890269|gb|ABH01437.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|215979856|gb|EEC20678.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] Length = 333 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 39/117 (33%), Gaps = 13/117 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +G F + + + + +D +A+ + A + Sbjct: 129 RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDDIYIMDL--GNGSALGLGISIALSHL- 185 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 K++ K+ IV+LTDG D + N A+ +RI +I + Sbjct: 186 ----------KHSEALKRSIVVLTDGVVNSDEIYKDQVINLAQGLNVRIYSIGIGSS 232 >gi|166031603|ref|ZP_02234432.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] gi|166028580|gb|EDR47337.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] Length = 685 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 39/127 (30%), Gaps = 16/127 (12%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + T++ DA+ TA + + + N + ++ TDG Sbjct: 398 SDDAATLTNDINNLVTGDMTSLYDALYTAVERVAAQNGA------------RCVIAFTDG 445 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + N + N A + + I + + ++ + + Sbjct: 446 NDNYSNCTKEDVVNVANRYHVPVFIIGIGS-IDYADVNDIATQ---TGGMYYNVSDVTSM 501 Query: 438 NKIFRDR 444 +KI+ + Sbjct: 502 DKIYEEI 508 >gi|197117451|ref|YP_002137878.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197086811|gb|ACH38082.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 357 Score = 46.5 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 28/233 (12%), Positives = 57/233 (24%), Gaps = 19/233 (8%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 VD R + L A A A+ L Q +E + K S + F Sbjct: 20 VDAGRAYGVKAKLHAAVDAASYEAAKALAQGEDEDDMKEKASEAAFDYFRANFPSDYFGA 79 Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQT 160 + R + + A V ++R ++F +G KS + Sbjct: 80 QCSGPELELSERKSEQ--------KMRALSVSATARLP------NIFAGILGWKSIDLPA 125 Query: 161 KAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 ++ A + + + S + + + + + + Sbjct: 126 RSRAVR--KDADVVLVLESSDALRDSFPQVKQRVATFSDRFSQHDDRMSLVTFAAGADPV 183 Query: 221 KVGI---RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 K P + N Y +++ K Sbjct: 184 ISICGVYNPAKDRPGAGTFNCGSGYQKSNFAKAFLELSPQMAAAAAAPEDAMK 236 >gi|187735658|ref|YP_001877770.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187425710|gb|ACD04989.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 859 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 57/187 (30%), Gaps = 23/187 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S S+ K L I + D ++ +++ S K+ Sbjct: 401 MSCSVPGGKTKMDLANAGTCQTISLLSDQDLISVHAVDSEPH---PIVTLSSLGPNRKKM 457 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I ++ + ++ G I ++ + + S H ++L D ++ Sbjct: 458 ISSVSRIASM----GGGIFIGAGLKAGWQELQRSVAGTRH-----------LLLFADADD 502 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 +++ + + +G+ + IA K+ L A F + ++ Sbjct: 503 SEEPADYRETLKEMVKEGVTVSVIALGTEKSADAG---LLREIAELGRGRIFFCDRPGDI 559 Query: 438 NKIFRDR 444 IF Sbjct: 560 PSIFAQE 566 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 59/226 (26%), Gaps = 21/226 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 236 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 295 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 296 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 350 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 351 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 409 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFS 406 IVLLTDGE+ C G I TIA Sbjct: 410 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG 441 >gi|13471071|ref|NP_102640.1| hypothetical protein mll0948 [Mesorhizobium loti MAFF303099] gi|14021815|dbj|BAB48426.1| mll0948 [Mesorhizobium loti MAFF303099] Length = 408 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 87/341 (25%), Gaps = 11/341 (3%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + + +S + I AL +P+++G G V+ W Y L+ A A Sbjct: 7 TRIRLVATGIRRSTRANVATIFALSLPIVVGAAGFGVETSYWYYNSLRLQATADAAAYAG 66 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ I ++ + A + L + + V N Sbjct: 67 ALEQISGSDKPTIIAAA----TQSAATNGLGSGTIVVNTPPASGPNTAKKAVEVIVAQNI 122 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + + ++L + + A + + + + Sbjct: 123 NRMFTSIFTQGQVPEQARAVALITNASKACVLALDPSASQAALFSGNTSVKLIGCSVMSN 182 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE-KLSPYMVSCNKSLYYM 243 S + S G +C ++ + + I + Sbjct: 183 SIASDAIKLQGSAGLQADCLISGGGVSLSNPVTTVCAAPITQALPAADPFADLPAPAASN 242 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG----- 298 +LS + +L+ + V ++ + + Sbjct: 243 PCQNDNKATLSPGTYCSGMTLKGNVTLSPGVYVVQGNLKINANAAITGSGVTIFMSGSSS 302 Query: 299 -ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + N V S ++ +T + TA Sbjct: 303 VSINGNASVKLSAPTSGAYSGVLFYGDRTGNSASSTFNGTA 343 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 37/280 (13%), Positives = 82/280 (29%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M N F + S G+ S + S Sbjct: 94 VTLSNVSERKDNMRLGMSLVTNPKDNSFVACSPLWSHECGSSYYTTGMCSRVNSNFRFSR 153 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + P +D + + + + + +I ++K ++ Sbjct: 154 IVAPALQRCPTFMDIVIVLDGSNS----------IYPWVEVQSFLISILQKFYIAPGQIQ 203 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + V+ + + + I++ T ++ A Sbjct: 204 VGVLQYGETVVHEFYL-NNYRSVTDVVEAAKRIEQRGGTETRTALGIEKAVT-------- 254 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AKK ++++TDGE + D+ + + ++ I +A ++ Sbjct: 255 EAFQRGGRKGAKKVMIVITDGE-SHDSPDLQRVIESSEKDNITRYAVAVLGYYNRRGINP 313 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G I Sbjct: 314 EAFLNEIKYIASDPDDKHFFNVTDEAALKD-IVDALGERI 352 >gi|162312016|gb|ABX84114.1| hedgling [Nematostella vectensis] Length = 3480 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 58/197 (29%), Gaps = 22/197 (11%) Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 S+ +D+S ++ + +V+ + ++ Sbjct: 172 DVCQTSVDLVFILDTSGSVGSYN-FEKMKTFVKNVVDFFNIGPKGTH---VAVITYSTWA 227 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + + +K G T DA+ A I Sbjct: 228 QVEFNLKAHHSS--KAALKNAVNAIYYRSGWTYTADALDLAGRNIFQVANGMRPDKGIP- 284 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K VLLTDG + +N + N ++ G+ + + L++ A+ Sbjct: 285 ---KIAVLLTDGYSNGNN--PLGPANDLRAAGVNVFCVGIG------NYYERELNDIATD 333 Query: 426 ---NSFFEANSTHELNK 439 + F+ + ++LN Sbjct: 334 PDKDHVFKLENFNDLNS 350 >gi|110833087|ref|YP_691946.1| hypothetical protein ABO_0226 [Alcanivorax borkumensis SK2] gi|110646198|emb|CAL15674.1| hypothetical protein ABO_0226 [Alcanivorax borkumensis SK2] Length = 556 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 39/126 (30%), Gaps = 4/126 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI--ITASVPLIQSLE 73 K G + + + + ++VD + L+ AA + AS E Sbjct: 8 KKQHGSALVWFV-SLTFIFLMVALVVDGSNLYNEKRRLQAAANAVASELAASGQTCFGSE 66 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 S S +Y R+F + T V + + V + + + V + Sbjct: 67 LASDATDLSNNAETILARDYPGRDFVVTSAQVVTVDGVDGRYQVSPV-VFDARESNGVAV 125 Query: 134 SSRYDL 139 S + Sbjct: 126 SLSSPV 131 >gi|119717474|ref|YP_924439.1| membrane protein-like [Nocardioides sp. JS614] gi|119538135|gb|ABL82752.1| membrane protein-like protein [Nocardioides sp. JS614] Length = 573 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 14/173 (8%), Positives = 44/173 (25%), Gaps = 3/173 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G A+ + +++ + VD+ ++ A + S + Sbjct: 2 RRQRDERGAVVPFVAVTLTLLIAMAAFAVDLGMQRVVRRDMQALADVVALDLSRLVNGRT 61 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFE---NNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 N F ++ + RN + + + + K+ Sbjct: 62 AAQIEAGYNGFPTISTELGRSVARNRDTLGDAPVVTYKLVHLDAATGLLETSGGAVKAYS 121 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + + + F + G + A+ + + + Sbjct: 122 GTEVPNAVWVQAEGSVDFAFATGSGGAVRTAVAQPSPSACFRLGSYAAGVSTE 174 >gi|296284151|ref|ZP_06862149.1| TadE-like protein [Citromicrobium bathyomarinum JL354] Length = 257 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 44/144 (30%), Gaps = 13/144 (9%) Query: 6 KFIFY---SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ----AAQ 58 + + K+L + G I A+ +P+ LG+G +++ ++ + Q A Sbjct: 11 RLRQFGRRLKRLRRDNRGLALIEFAISLPIFLGLGMFGIELANYAVTSMNVSQISLTIAD 70 Query: 59 TAIITASVPLIQS------LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 A + + S A + + E+ + D+++ Sbjct: 71 NAARMGQTSSSSTTKTIYRSDVNSIFAGAAKQGENIDLTEHGRVVLSSLETVGVLDKQLI 130 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSR 136 R T + ++ Sbjct: 131 RWQRCTGSAAYASRYGPELTSEVT 154 >gi|15674025|ref|NP_268200.1| hypothetical protein L107379 [Lactococcus lactis subsp. lactis Il1403] gi|12725093|gb|AAK06141.1|AE006434_3 unknown protein [Lactococcus lactis subsp. lactis Il1403] Length = 1450 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 45/395 (11%), Positives = 104/395 (26%), Gaps = 23/395 (5%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 S EE S+ K + I+ N+L +N+ + +T + N Sbjct: 188 SSQEETVSQLKKDSQLAFSYPSNFGIKASFNDLAQNYENISPEYRQDETGISPNHSWIP- 246 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++ S + G+ SW + + + V + Sbjct: 247 --TGNTTVVNHQGWNSFSSQWDGVNSWNGEATNLENSYIEYAGVNNPVDFALRKYAKETE 304 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D + Q + V + ++ K + + L + Sbjct: 305 TPGLYDVYLN-VRGNVQNPIKPVDIVLVIDMSGSMQGAKETAVRQGVSDFLSTIQNTAYA 363 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D V SS + + + I + +V + A F+ + Sbjct: 364 DYVN--VGIVGYSSPGNYVTG---ASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQ 418 Query: 310 PSFSWGVHKL-IRTIVKTFAIDENEMGSTAINDAMQTAY---DTIISSNEDEVHRMKNNL 365 G L + + G + + +A + I + E N Sbjct: 419 LGLRKGTEMLEQDSSDNQKMMILMTDGVPTFSYKVNSASKVDNVIYGQSFAESRDEPGNT 478 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI------RIMTIAFSVNKTQQEKARYFL 419 + + D + E I I T+ + ++ + Sbjct: 479 SKIQSPYYVKDINGGSNIEIRDTWAATLGEAEISKQEISEIHTLGIQLGNDGSYLSQEEV 538 Query: 420 SN----CASPNSFFEANSTHELNKIFRDRIGNEIF 450 + A+ + +ANS +++ +++ + Sbjct: 539 KSRTSLIATTGLYQDANSANDITDYLKNQANVVLS 573 >gi|146343645|ref|YP_001208693.1| hypothetical protein BRADO6887 [Bradyrhizobium sp. ORS278] gi|146196451|emb|CAL80478.1| conserved hypothetical protein; protein containing a von Willebrand factor type A (VWA) domain; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 755 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 39/304 (12%), Positives = 79/304 (25%), Gaps = 24/304 (7%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 ++ + ++ ++ L PADR + Sbjct: 256 AKNDPVNPTKITVRLQAGFALGEVKSHHHQVTVESTDAETRVITLADGVVPADRDFELTW 315 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYY-MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + + + L + + V +R Sbjct: 316 KPASENMPSVGLFHEQVGDADYLLAFVTPPAVATATQRPQPRDVIFVIDNSGSMGGTSIR 375 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A AS++ ++ ++ R F+D + S F + G Sbjct: 376 QAKASLLYALGRLQPN---DRFNVIRFDDTMTVLFPSSVPADAEHVGNATRFVSSLDARG 432 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + AM+ A ++ + + +V LTDG D + I + Sbjct: 433 GTEMVPAMRAALT-----------DDGSDSDRMRQVVFLTDGAIGNDQQLFETI-TAMRG 480 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + RI + Y +S A +F S ++ + RD V Sbjct: 481 RS-RIFMVGIGSAPN-----TYLMSRAAELGRGAFTHIGSVEQVEERMRDLFAKLENPVV 534 Query: 454 IRIT 457 +T Sbjct: 535 TGLT 538 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 39/129 (30%), Gaps = 17/129 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TIVKTF 327 K + AL S + K N ND + + + F G + + + + T Sbjct: 98 KLDSAKYALQSAVDYFKANANPNDRFALVPFSDDVQYSKVVPFPSGTYDVKQHLNWIATV 157 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---QDNE 384 A G T A+Q A KKYI+ LTDG T Sbjct: 158 ANSLRANGGTNYTQALQQAQSFFNDPAR------------KKYIIFLTDGMPTVSIAKEP 205 Query: 385 EGIAICNKA 393 +C Sbjct: 206 ITYKVCEGI 214 >gi|283787709|ref|YP_003367574.1| tight adherence protein TadG [Citrobacter rodentium ICC168] gi|282951163|emb|CBG90854.1| putative tight adherence protein TadG [Citrobacter rodentium ICC168] Length = 647 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 59/216 (27%), Gaps = 12/216 (5%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + S G I+ A++ P +L + + +D + L A+ AI+ Sbjct: 19 LSSENGAITIVYAIIFPALLAMVALALDGALMINRKARLADASSEAILAI---------- 68 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S+ + + ++F N N +++ +V + Y++ +S Sbjct: 69 -SAVDNRLVDSVAIDNNKQIAKDFVNYYLPNNQAEQLKVVVTSFDRTIEKGYIDYKIAIS 127 Query: 135 SRYDLLLNPL-SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + LL F + +V+DFS SM Sbjct: 128 ATLPTLLPLGHLGFSAFDRSVTVGNFDNNSGNARKFVTVISDPADYVFVVDFSDSMNSSY 187 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 D ++N Sbjct: 188 IDQGRVTTRLAMLKQVVREVISGNKNPDSQFAIVPF 223 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 42/164 (25%), Gaps = 40/164 (24%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 +T + + + G T + + + KK I++++ Sbjct: 471 TSSEEKTRLISDFAGMSAGGGTDSTIGLLRSV-----------PVVAAGTNPKKVIIVIS 519 Query: 376 DGENTQDNE------EGIAICNKAKS------------QGIRIMTIAFSVNKTQQEKARY 417 DGE+ D +C+ K+ G+ T+ Sbjct: 520 DGEDEIDPAIVADQFHSRGMCDVIKAGLKDEGLFASGKTGVVANTVEIHYVSINDNNNES 579 Query: 418 FL----SNCASPN-------SFFEANSTHELNKIFRDRIGNEIF 450 + CA + A + L + IG E Sbjct: 580 RMAFWGKYCARGDDTTDQKQYTHTATNYSSLIETLVSIIGVETG 623 >gi|241672104|ref|XP_002411442.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] gi|215504093|gb|EEC13587.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] Length = 1021 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 11/72 (15%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+T +Q A++ ++ AKK ++L+TDG T + + K K Sbjct: 127 GATNTGAGLQAAWEVFQ----------RSRPTAKKLLILVTDGMATMG-PDPVKKAEKLK 175 Query: 395 SQGIRIMTIAFS 406 + G+ I Sbjct: 176 NMGVDIFVFGIG 187 >gi|222084462|ref|YP_002542991.1| hypothetical protein Arad_0354 [Agrobacterium radiobacter K84] gi|221721910|gb|ACM25066.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 192 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 43/171 (25%), Gaps = 4/171 (2%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + + G I A+L PV+L + ++ E A Sbjct: 6 MAARIYTRLRHFRRDERGIGAIEFAILFPVLLMLYLGAFEITVGLSVEKRTS----RAAG 61 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + L Q + + Y + D V + + Sbjct: 62 SIADILTQKTSTTKAELATMPSVAGAIFTPYATTGLTLKVTGIQIDAGSSATVAWSWAQD 121 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + ++ DL L L + I ++ ++ + Sbjct: 122 GSKPYTAGSAVTVPSDLNLPSSFLVRTELAIPYQILSFGSDFLPAGSNQIT 172 >gi|83814223|ref|YP_446667.1| putative batB protein [Salinibacter ruber DSM 13855] gi|83755617|gb|ABC43730.1| putative batB protein [Salinibacter ruber DSM 13855] Length = 350 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 15/153 (9%), Positives = 47/153 (30%), Gaps = 10/153 (6%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++ + +++ + R+G F + Sbjct: 92 SMRAQDVPPSRLRRAKNEIRTLVDDLSG-------DRVGLVLFAGSGFVQSPLTTDY-GA 143 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + A D+ T ++ A+ + + + +++++DGEN Sbjct: 144 FRLFLDAAAPDQISTPGTDVSAAVDAGLQAFGAPRPTDDTTAAPEEPRPRALLIVSDGEN 203 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +A+ G+ ++T + Sbjct: 204 HAGDLDAAR--QRAEEAGVTLLTAGVGTEDGAR 234 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 29/320 (9%), Positives = 81/320 (25%), Gaps = 25/320 (7%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 N + R + S + + + + ++ + Sbjct: 22 TFNMDTRNPRVIAGPSAAFFGYTVQQH---DISGKKWLVVGAPMETNGHQKTGDVYKCPV 78 Query: 200 P---LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 + + + + + Sbjct: 79 TQGNCTKLNLAGSLCPMW-ACSPLWSHECGSSYYTTGMCSRVNSNFRFSKTVAPALQRCQ 137 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 ++D + + + +I +KK +++G + + + + Sbjct: 138 TYMDIVIVLDGSNSIYPWVEVQHFLINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDY 196 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + I++ T ++ A E + AKK ++++TD Sbjct: 197 RSVKDVVEAASHIEQRGGTETRTAFGIEFA--------RSEAFQKGGRKGAKKVMIVITD 248 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL----SNCASPN---SFF 429 GE + D+ + + +++ + +A ++ AS FF Sbjct: 249 GE-SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFF 307 Query: 430 EANSTHELNKIFRDRIGNEI 449 L D +G+ I Sbjct: 308 NVTDEAALKD-IVDALGDRI 326 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 59/226 (26%), Gaps = 21/226 (9%) Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I F +++ + N + + S + + + + ++ Sbjct: 250 SIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDVIKTSADFQNAPPMRGTEAPP 309 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S + + A + I V +G Sbjct: 310 PPTFSLLKSRRRVVCLVLDKSGSMDKEDRLIRMNQAAELYLTQI-----VEKESMVGLVT 364 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ + + G T+I +Q + I SS++ Sbjct: 365 FDSAAHIQNYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE 423 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFS 406 IVLLTDGE+ C G I TIA Sbjct: 424 ---------IVLLTDGEDNGIRS-----CFEAVSRSGAIIHTIALG 455 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 59/187 (31%), Gaps = 23/187 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VDSS K + + + + DNV F+ + Sbjct: 382 VDSSGSMRDDDKMDQAKLGVEVFLNRLPSKDNVG------MIGFSSSPAVLVPLA--TRS 433 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ G+T++ DA+ A + + + + IV+L+DG Sbjct: 434 ENMANLQMQTQGLVPDGNTSLYDAIDLARQELENLKQPDRINA---------IVVLSDGA 484 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN-STHEL 437 +T + I+I IA+ + L A + ST ++ Sbjct: 485 DTASQLSIDQMLGNFGESSIQIFPIAYGADA-----ETSILQQIADFSRTELVQGSTGDI 539 Query: 438 NKIFRDR 444 +KIF + Sbjct: 540 DKIFENL 546 >gi|85374479|ref|YP_458541.1| hypothetical protein ELI_08260 [Erythrobacter litoralis HTCC2594] gi|84787562|gb|ABC63744.1| hypothetical protein ELI_08260 [Erythrobacter litoralis HTCC2594] Length = 202 Score = 46.5 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 37/125 (29%), Gaps = 3/125 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + K++ + G I A MPV + L D+ Y + ++ A Q A S Sbjct: 2 RRTSLLKRIARREDGVTIIEFAFAMPVFAVILMALFDLGFQIYAQSIVQGAVQEAARA-S 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + L F NF D + + DT+ + Sbjct: 61 TLESGGSNSAALDDTVRKNVQTVIPGATLT--FTRKNYANFEDVGIPEDFTDTSGSEDGI 118 Query: 126 KSAYQ 130 + + Sbjct: 119 CNNGE 123 >gi|294508602|ref|YP_003572661.1| aerotolerance-related membrane protein [Salinibacter ruber M8] gi|294344931|emb|CBH25709.1| aerotolerance-related membrane protein [Salinibacter ruber M8] Length = 358 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 15/153 (9%), Positives = 47/153 (30%), Gaps = 10/153 (6%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + ++ + +++ + R+G F + Sbjct: 100 SMRAQDVPPSRLRRAKNEIRTLVDDLSG-------DRVGLVLFAGSGFVQSPLTTDY-GA 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + A D+ T ++ A+ + + + +++++DGEN Sbjct: 152 FRLFLDAAAPDQISTPGTDVSAAVDAGLQAFGAPRPTDDTTAAPEEPRPRALLIVSDGEN 211 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +A+ G+ ++T + Sbjct: 212 HAGDLDAAR--QRAEEAGVTLLTAGVGTEDGAR 242 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 43/450 (9%), Positives = 109/450 (24%), Gaps = 63/450 (14%) Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + T ++ + E+ K+ + + + P Sbjct: 143 QATKEKPTDTQDNTSRAETDAVKETAATDTHSAESEESKSINPSNTNETLAENRETEGPE 202 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMG-IKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S ++ +++ + S + T + + + Sbjct: 203 ASEVSERSTAADEVIKPFSLPSVASDPVFDRSHKAILPQYTTDASGTYPTASWQPTGNQN 262 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + ++ + SY G D + Y + + Sbjct: 263 VRNHQGNKDGAAQWDGQTEWNGEPTNKTNSYIEYGGTGSQADYAIRKYAKETTTPGLFDV 322 Query: 245 YPGPLDP------SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 Y L VD S + + V+ + + ++ + D + MG Sbjct: 323 YLNIRGNVQKEIAPLDLVLVVDWSGSMNDNNRIGEVQKGVDRFVDTL-AESGITDNIHMG 381 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII------- 351 ++ + S + G ++ +KT + G T A++ A + + Sbjct: 382 YVGYSSDGYKNDSVAMGPFDSVKNAIKT-ITPSSTTGGTFTQKALRDAGNMLATPNGHKK 440 Query: 352 ---------SSNEDEVHRMKNNLEAKKYIVLLT---------------DGENTQDNEE-- 385 + +V +++ Y T Q N Sbjct: 441 VIVLLTDGVPTFSYQVSKVQTETNGSYYGTQFTNRQDQPGSTSRISNSYYAPDQRNTNKL 500 Query: 386 -------GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----LSNCAS----PNSFF 429 I K +GI I + + + S + ++ Sbjct: 501 INSTFIATIGEAMALKQRGIEIHGLGIQLQSDTNAGLSKQEVENKMRQMVSADENGDLYY 560 Query: 430 EANS-THELNKIFR----DRIGNEIFERVI 454 E+ +++ G +V Sbjct: 561 ESADHAPDISDYLARKAVQISGTVANGKVT 590 >gi|254784280|ref|YP_003071708.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237684173|gb|ACR11437.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 593 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 55/186 (29%), Gaps = 17/186 (9%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + +L S K + ++ F+ + ++ ++ L I Sbjct: 257 TAAKASLISYENDTKTSRLLPGQQQVAIYSFDSEITLLTDYTSDINLLEAAIDTIPNSVL 316 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 ST + AM+ A + + Y VLLTDGE+ D+ I Sbjct: 317 ERGNSTNLLGAMEIAAERWNDQIDLIAVERG-------YAVLLTDGEHNFDSRSPADIEA 369 Query: 392 KAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STHELNKIFRDRI 445 + ++ IA N + AS N S EL +F + Sbjct: 370 DLTNFFGTRKKVYAIAVGNNVNLENLEAIT----ASSEQVLTVNSFESAEELEAVFTEVA 425 Query: 446 GNEIFE 451 E Sbjct: 426 TTEAKA 431 >gi|198425808|ref|XP_002121992.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 431 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/324 (11%), Positives = 82/324 (25%), Gaps = 36/324 (11%) Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 V + + + N + + L+ + L P FL A +++ S Sbjct: 12 VTNNPKYIIDDSNNATNYFGYTLNVKATLTQPPEVTFLVGSPKSGSTTNPIGAAPSLTPS 71 Query: 171 YHKEH--------GVSIQWVIDFSRSMLDYQRDSE------GQPLNCFGQPADRTVKSYS 216 + ++ + ++ S+ L + Sbjct: 72 SSEIVGPGGILSCPITFNNITPPCYPVVPLLNQSDAKSLPGRDSLGLSKILSPPGPNLTV 131 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE----------HFVDSSSLRH 266 ++ Y V + + + + S+ S Sbjct: 132 CSPLRMRNCSSGYVTYGVCYSSNDKGRTWSTDTKQADSKCPVEDLDLIFLLDGSGSVTVP 191 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV---HKLIRTI 323 V+ + +V I + + + + + P + + Sbjct: 192 DPLNFDRVKQWVKNVTDRFD-ISTFANVGVIQYSHYYETRTVQPYMKVEIGLGQYKTQAE 250 Query: 324 VKTFAIDENEMGSTAIND-AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + G T A+ + ++S +K IVLLTDG++ D Sbjct: 251 FQMAVDSIQFQGFTTFTAHALNRTVEEFMNSTRYSDP------TTRKVIVLLTDGQSN-D 303 Query: 383 NEEGIAICNKAKSQGIRIMTIAFS 406 E A+ GI I + Sbjct: 304 REFLEETSAYARGLGITIFAVGVE 327 >gi|307323133|ref|ZP_07602343.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890622|gb|EFN21598.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 422 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 18/193 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV--ISDPSFSWGVHKLIRTIVKT 326 + + A V+ + + + DRV D + V ++ RT K Sbjct: 59 TRMAAAKQAFNEVLDATPEEVRLGIRTLGANYPGKDRVAGCRDSEQLYPVGQVDRTEAKA 118 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A++ A + S E + IVL+TDGE++ + Sbjct: 119 AVATLRPTGWTPIGLALRGASKDLSSG------------EGTRRIVLITDGEDSCGQPDP 166 Query: 387 IAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + +QG + T+ ++++ +E+ A+ ++ +L+ + Sbjct: 167 CDVARELAAQGTHLVVDTLGLTLDRKVREQLSCIAE--ATGGTYTAIQHRDQLSSRIKQL 224 Query: 445 IGNEIFERVIRIT 457 + V T Sbjct: 225 VRRSADTPVQTPT 237 >gi|326434435|gb|EGD80005.1| hypothetical protein PTSG_10281 [Salpingoeca sp. ATCC 50818] Length = 736 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 44/139 (31%), Gaps = 12/139 (8%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D DT R+ + F++ + ++ + G T A+ A Sbjct: 212 FDVSADTTRIAVMVYASYNYLIFDFNYILTHTKDELLDAISAINYPYGGTRTGGALDYAR 271 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + +++ + + +++TDG + D + + +G+ + I Sbjct: 272 TVMFTADRGVRPSSEGIP---RVAMVITDGASADDVAAP---AQQLRDEGVTLYAIGI-- 323 Query: 408 NKTQQEKARYFLSNCASPN 426 L+ ASP Sbjct: 324 ----AGANENELNEIASPP 338 >gi|326330152|ref|ZP_08196463.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] gi|325951965|gb|EGD43994.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] Length = 141 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 32/129 (24%), Gaps = 1/129 (0%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+ G VM + G+ VD+ + + A A L + Sbjct: 2 KQRTHDERGSVSTWLVTASLVMAMLVGLAVDLGGQIHAQQRAHDLAAQAARVGGEQLQAA 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA-YQ 130 ++ YL + T V VRDT + Sbjct: 62 PAIQGRYVAVDTAAARRAATSYLRSAGVSGSVAVRTGDTVAVTVRDTYETRFLGIIGISR 121 Query: 131 VVLSSRYDL 139 V ++ Sbjct: 122 VRVTGTATA 130 >gi|167911128|ref|ZP_02498219.1| hypothetical protein Bpse112_11583 [Burkholderia pseudomallei 112] Length = 579 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 17/135 (12%), Positives = 36/135 (26%), Gaps = 14/135 (10%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSLEEVS 76 G F ++ A+ M V + V G VD+ + L++ A A + + + + Sbjct: 1 RGSFALVAAIWMLVAIAVLG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAA 59 Query: 77 SRAKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + A + + T + +V Sbjct: 60 AAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFL 119 Query: 126 KSAYQVVLSSRYDLL 140 + V +S Sbjct: 120 GPSRTVSATSTAKAT 134 >gi|149632103|ref|XP_001514474.1| PREDICTED: similar to collagen type VI alpha 4 [Ornithorhynchus anatinus] Length = 1844 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 41/155 (26%), Gaps = 17/155 (10%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + + + VR+G F+ + + K + + T Sbjct: 648 MVNMSNIGPENVRIGVLQFSSSPREEFMLNKYTTKEDLSRAISDIKQIKAGTQTG----- 702 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + +Y++++TDGE + + +GI I I Sbjct: 703 ----QALTFTLPYFDTSRWGRPTEPQYLIVITDGEAQDSVKGP---AKALRDKGISIFAI 755 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + L + + F N L Sbjct: 756 GV-----LEANKTQLLEITGTEDQVFYENDFDSLI 785 >gi|114612128|ref|XP_518969.2| PREDICTED: collagen alpha-1(XXVIII) chain [Pan troglodytes] Length = 1125 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 52/145 (35%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K ++L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVVLLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TI S + Sbjct: 173 SISEDARISGISFITIGLSTVVNEA 197 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 834 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 891 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 892 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 941 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 942 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 980 >gi|83423290|emb|CAI67595.1| collagen, type XXVIII [Homo sapiens] gi|223462744|gb|AAI36893.1| Collagen, type XXVIII, alpha 1 [Homo sapiens] Length = 1125 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 52/145 (35%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K ++L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVVLLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TI S + Sbjct: 173 SISEDARISGISFITIGLSTVVNEA 197 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 834 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 891 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 892 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 941 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 942 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 980 >gi|311251228|ref|XP_003124501.1| PREDICTED: integrin alpha-D-like [Sus scrofa] Length = 230 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 60/169 (35%), Gaps = 17/169 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ + +V+ + +++ + + +FS + + Sbjct: 35 KNFVRAVMDQFMDTS-----ILFSLMQYSNLLKTHFTFSQFQTHRSPQSLVDPIVQLK-- 87 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T +QT + S AKK ++++TDG+ +D E + +A+ Sbjct: 88 GLTYTATGIQTVVKELFHSKNGAR------SSAKKILLVITDGQKYKDPLEYEDVIPQAE 141 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHELNKI 440 + I + Q+ A+ LS S + F+ ++ L+ I Sbjct: 142 KANVIRYAIGVG-DAFQEHSAKQELSIIGSLPSKDHVFKVDNFAALSSI 189 >gi|198420236|ref|XP_002121660.1| PREDICTED: similar to collagen type VI alpha 6 [Ciona intestinalis] Length = 1638 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 53/170 (31%), Gaps = 16/170 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ + S +R + + ++G ++ RV G + ++ + Sbjct: 810 FETTKEWIGSFVREFEIGEYNT---KIGVVQYSSRVR--SEIDIGDYDSKADLLAAISSI 864 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E G+T A++ S + + K +++LTDG + Sbjct: 865 EFAAGNTNTGSALEYVRTVGFSGRHGARNGV------PKVLIVLTDGNAQDG---VLDAA 915 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 +K G+ + I Q A + + + + + I Sbjct: 916 SKLHRDGVAVYAIGVGRPNMGQLNAVTS--EPITSPNIYHVRNFDAIQTI 963 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 46/121 (38%), Gaps = 9/121 (7%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +K D + R+ ++ V + + +G + ++ + MG T A+ Sbjct: 254 VKAFDVGPEETRVAVVQYSTSVQEEFN--FGHLLSKQQVLDAIDNMDYIMGDTHTGAALT 311 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 D I SS + + + +++TDG+ + + + N+ G+ + T+ Sbjct: 312 YMLDEIYSSANGDRPDVPDLA------IVMTDGKAQEPDLV-VEAANRVHEAGVTVYTVG 364 Query: 405 F 405 Sbjct: 365 V 365 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 56/169 (33%), Gaps = 23/169 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D DT R+ ++ F + +++ + + G+T A++ A Sbjct: 64 FDISRDTTRVAVVQYSS--YPRTEFDLDTYSSAVGVLRGIDLIQYMSGNTQTGLALRYAI 121 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 +++ S ++ K ++L+DG + ++ +S GI + Sbjct: 122 ESVFSRAR---------EDSAKVAIVLSDG---RSQDQVNEAATSLRSSGIATFAVGIGD 169 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + L + + + + +F + I + R+ Sbjct: 170 EMSHERMEE--LRQIS-------VAAAEDQSSVFMAKDFRSIGQLQERL 209 Score = 37.3 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 60/151 (39%), Gaps = 12/151 (7%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +K D VR+G ++ R ++ + G + ++ + F E ++G T A Sbjct: 1010 NMTQKFQIGPDAVRVGMIQYSTRPKTNI--AIGQYNDKESLQEAFGQVEWQLGDTYTARA 1067 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 ++ S + NL A K ++++TDG+ QD E S+G RI Sbjct: 1068 LRYV------SKSYARATTRENLHATKLLIIITDGQP-QDRNEVKQAVRNLHSEGWRIFA 1120 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 I + + L++ + F AN+ Sbjct: 1121 IGVGQSDISELG---ILASNPDSDHVFYANN 1148 >gi|114568964|ref|YP_755644.1| hypothetical protein Mmar10_0413 [Maricaulis maris MCS10] gi|114339426|gb|ABI64706.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 183 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 33/106 (31%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + ++ G + AL+ P M+ + V+V + + + + Sbjct: 9 LTGFLRRFGGDRRGVSAVEFALIAPFMILLYLGSVEVSLALSIDRKITSISSALADLVAQ 68 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 + + +E++ L + L + D EV+ Sbjct: 69 DDVITDDEITDILNAGAVIVAPFDPTPLEIRITSILMDSGGDVEVQ 114 >gi|229577024|ref|NP_001153314.1| collagen type XXVIII alpha 1 a [Danio rerio] gi|228007387|emb|CAQ51228.1| collagen type XXVIII alpha 1 a [Danio rerio] Length = 1208 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 53/166 (31%), Gaps = 15/166 (9%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R ++ VI + + K + A T A+ + Sbjct: 103 WRAALLQYSSHVIIEQTLKQW--KGTENFKSSIAPMAYIGHGTYTTYAITNMTKIFVEE- 159 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-- 412 ++ E K +LLTDG N + + AK+QG+++ TI + Sbjct: 160 --------SSPERIKIALLLTDGFFHPRNPDIFSAMADAKNQGVKVFTIGITRTANDPVN 211 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 LS+ + + T+ + K+ ++ +++ Sbjct: 212 AANLRLLSSTPASRFLYNLQDTNVMEKVITQIA--QLANDGCPLSQ 255 >gi|72007460|ref|XP_780292.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] gi|115939674|ref|XP_001195885.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 1500 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 42/131 (32%), Gaps = 19/131 (14%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++++ + I + + + ++ G Sbjct: 74 SVSASTTRVAVISYSSCNQIHIRVNYISSPENKNKCTFDNDLTSVN---------YHPGG 124 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A++ A ++S A++ ++LLTDG + A K KS+ Sbjct: 125 TCTAGALEAAGRDVLSHGR---------PGAQRVVMLLTDGASNDGGP-PHANAQKLKSE 174 Query: 397 GIRIMTIAFSV 407 G++I TI Sbjct: 175 GVKIFTIGIGS 185 >gi|282898869|ref|ZP_06306855.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] gi|281196242|gb|EFA71153.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] Length = 1499 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 97/371 (26%), Gaps = 42/371 (11%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 L A A IT + + + + + + N + Sbjct: 680 LSNATDGATITTATAVGTIQNDDVVAPGLA-----------IAPTNAIQTEGNIGTKAFT 728 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 V + + + + V SS + + F ++ + ++T+ Sbjct: 729 FTVTRSGDTTSSSSANWAVTGSSTNQADV---TDFGGTLPTGTVNFTAGETSKTIIVDVL 785 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR------- 225 + V + S + + D T + + Sbjct: 786 GDTTVEPDEGFTVTLSNPTNATITTATAVGTITNDDDNTGSIQGFKWKDINGNGVREDLI 845 Query: 226 --DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 D +++ + S + G ++++ ++ + L + Sbjct: 846 QGDSPNIVFVIDVSGS-TRGPFQGIPVGDVNKDGIQNTILDAEIAGFIALNNSLVRKGFG 904 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 S K+ V+ + + + + +K V+ I G T A+ Sbjct: 905 SRAKVSIVSFAS-------DAKTLLTTNPETDSNKNGTKDVEEKLISLKSGGETNFEIAL 957 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q A T+ + ++ ++DG+ Q N + + G+++ Sbjct: 958 QEAAKTLRDIGTTAGNGN---------VIFMSDGQPNQGNYTDEVL--DLQKAGVKLSAF 1006 Query: 404 AFSVNKTQQEK 414 + Sbjct: 1007 GVGTGASIDSL 1017 >gi|170748744|ref|YP_001755004.1| hypothetical protein Mrad2831_2326 [Methylobacterium radiotolerans JCM 2831] gi|170655266|gb|ACB24321.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 546 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 13/159 (8%), Positives = 42/159 (26%), Gaps = 10/159 (6%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G+ I+ AL +++G G + +D+ + I ++ Sbjct: 10 RFGAARGGNVAILVALGSAMLMGAGAVGIDLGLVFQARRKAQ----------GAADIAAM 59 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ + + ++ +A +V Sbjct: 60 LAAVDPSQADTVARRSLSDNGYGAATATVAPGSYDASTRGVAPGGRFSAGGSPANAVRVG 119 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 LS+ + ++ ++ A+ + Sbjct: 120 LSTSVPVTFGRAIGLPAAVPLRVTGTAASAQFAAFTIGS 158 >gi|150024647|ref|YP_001295473.1| outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] gi|149771188|emb|CAL42655.1| Probable outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] Length = 631 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 41/345 (11%), Positives = 96/345 (27%), Gaps = 36/345 (10%) Query: 95 IRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 L + + V V P + V + Y+ + ++ + Sbjct: 127 KAQNSQELSEKIYNVPVNSNVAVCKNPSIPNSTKTPVPNNEDYNSWIENPFESPKTAPLS 186 Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 ++ I + T R + + +N F + Sbjct: 187 TFSIDVDNASYTNIRRLINN------------GQKVPKDAVRVEEMINFFKYQYPQPKD- 233 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + D + L P P+ + +D S + K L+ Sbjct: 234 NHPFSITTQYSDCPWNSKRKLVQIGLQGKNIPTNDLPASNLVFLIDVSGSMNADNKLPLL 293 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++L ++ ++K D V + + G + + N Sbjct: 294 IESLKILVEQLRKQDKVA------IVVYAGAAGLVLPPTAGNE---KQTIINALEKLNAG 344 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA- 393 GSTA ++ AY + ++ + NN ++L TDG+ + A+ Sbjct: 345 GSTAGGAGIELAYK----TAQENFIKDGNNR-----VILATDGDFNVGSTSDSAMQTLIE 395 Query: 394 --KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + G+ + + + + + K ++ ++ E Sbjct: 396 DKRESGVFLTCLGYGMGNYKDSKMEILADK--GNGNYAYIDNIQE 438 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 62/203 (30%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSASTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|161524907|ref|YP_001579919.1| TadE family protein [Burkholderia multivorans ATCC 17616] gi|189350343|ref|YP_001945971.1| Flp pilus assembly protein [Burkholderia multivorans ATCC 17616] gi|160342336|gb|ABX15422.1| TadE family protein [Burkholderia multivorans ATCC 17616] gi|189334365|dbj|BAG43435.1| Flp pilus assembly protein [Burkholderia multivorans ATCC 17616] Length = 167 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 44/153 (28%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G I A + P+ + +V + +L AA A Sbjct: 13 WRRQRGVAAIEFAFVFPLFFLIFYGIVTFAMIFVIQQSLTFAASEGARAALNYASTPCNR 72 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 +S A+ + T F + + + V + + S + V+++ Sbjct: 73 LSVNAQKAVTQALAGAPWSTNVTFAAQVSVSAPTPKSTAGVTCDSTFASTSSSTFNVMVT 132 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + Y NPL ++ + + Sbjct: 133 TTYSYAANPLIPWIFLFKAPQLQSSATVQVQPN 165 >gi|83941161|ref|ZP_00953623.1| hypothetical protein EE36_02993 [Sulfitobacter sp. EE-36] gi|83955720|ref|ZP_00964300.1| hypothetical protein NAS141_07935 [Sulfitobacter sp. NAS-14.1] gi|83840014|gb|EAP79190.1| hypothetical protein NAS141_07935 [Sulfitobacter sp. NAS-14.1] gi|83846981|gb|EAP84856.1| hypothetical protein EE36_02993 [Sulfitobacter sp. EE-36] Length = 179 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 1/60 (1%) Query: 3 FDTKFIFYSKKLIKSCTGHFFII-TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + ++ + G +I +L PV+ G+ V+V +S + L + +TA+ Sbjct: 1 MIARISQALRRFRRGEDGAVVLIEFVILFPVIFGMFLASVEVSLYSLRQFHLNRGLETAV 60 >gi|258515700|ref|YP_003191922.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] gi|257779405|gb|ACV63299.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] Length = 140 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 43/140 (30%), Gaps = 1/140 (0%) Query: 33 MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT-FPKQKIE 91 ML + +D R +H L+ A A + S + L + + + I Sbjct: 1 MLMFSALGLDYGRAYVLKHQLQAACDAASLAGSSAVSAKLITDGTGSVTNKKLLLDPIIA 60 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 E + N + + + DT+ + + + + ++ + S + Sbjct: 61 EARATDVWNQNVSSMKFIDKGVTIVDTSNHSALDEDSDGYLDAYKWGVTAKIQSYIAGPI 120 Query: 152 GIKSWLIQTKAEAETVSRSY 171 I A + ++ Sbjct: 121 SGMGNHITVTRVAISKAKDT 140 >gi|254481635|ref|ZP_05094879.1| hypothetical protein GPB2148_2134 [marine gamma proteobacterium HTCC2148] gi|214038263|gb|EEB78926.1| hypothetical protein GPB2148_2134 [marine gamma proteobacterium HTCC2148] Length = 1306 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 78/341 (22%), Gaps = 74/341 (21%) Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + V S ++ NG S M Sbjct: 273 VEICEPDTPVPGNWAALSNSVHTPDHVECLDDVNSLNAGNGSGVGDGYPQSNAMDGNEYG 332 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + + + + +A+ + + N + + Sbjct: 333 GSVDGDISWGNTAYTFYTSHYMNWWHDDSLVTPRPKIDIAADVITTIIDTNTSVDFGLLE 392 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + + + RT + G T + ++ AY + + Sbjct: 393 FNYREGGRVTQRIIQNMTAANRTNLINLVDALEPAGWTPLCESTYEAYRYLAGESPVYSG 452 Query: 360 RMKNNLEAKK---------------------------YIVLLTDG-------ENTQDNEE 385 + + A YI+++TDG N Sbjct: 453 SASDGIYAGWRYDIAASDSAAMSGGNYVSPNTDCAYTYIIIMTDGLPGRDGGANDAIETL 512 Query: 386 GIAICN-------KAKSQ-----------------------GIRIMTIAFSVNKTQQEKA 415 C+ +K+ GI TI F Sbjct: 513 TQKDCDMYDSEEGNSKNCMPQLAEYMATNDLDGDTTNGSQFGIT-YTIGF-------ATD 564 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + L + A ++ A+S EL + F+ I + + Sbjct: 565 QQLLQDTAEKGKGEYYTADSAAELTEAFQGAIVSILSRDTT 605 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 43/326 (13%), Positives = 82/326 (25%), Gaps = 43/326 (13%) Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF---LRSMGIKS 155 ++ N D D + ++ K D N + F ++ G Sbjct: 16 MASIWPNTADVVYADPKKGAITSVSEEKIEGWATAPGEGDGKGNSIRFFEVRIQLKGQNG 75 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF---------GQ 206 + + EA V + E + + + ++ + Sbjct: 76 TAVSYRTEAVEVEPDRNGEKKYTFSLDMRGKWPSAPGTTATYEIVVDAYRVLGNGQEEVY 135 Query: 207 PADRTVKSYSSQNGKVGIRDE--------------KLSPYMVSCNKSLYYMLYPGPLDPS 252 + ++ + + +L Sbjct: 136 FSFPQPPYEYTRQTETSTAKLDFSLSFSQPEYAKPPDGDAQGRLDVTLIPQGGVPAPVRP 195 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + FV S K + AL + + K N + R F+D V + Sbjct: 196 PIDVVFVMDVSGSMTTMKLQSAKSALQAAVNYFKT--NYHPNDRFALIPFSDDVKATSVV 253 Query: 313 SWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +G + + + G T + A+ A N+ E KK Sbjct: 254 PFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLA------------QSYFNDPERKK 301 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKS 395 YI+ LTDG T N + K Sbjct: 302 YIIFLTDGMPTVLNTTSSITHKEIKK 327 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 32/346 (9%), Positives = 79/346 (22%), Gaps = 20/346 (5%) Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 + + + + + + L + Sbjct: 109 SAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYTRQTETSTAKLDFSLSFSQPEYA 168 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG---QPLNCF 204 G + + + +V+D S SM + S Q + Sbjct: 169 KPPDGDAQGRLDVTLI-PQGGVPAPVRPPIDVVFVMDVSGSMTTMKLQSAKSALQAAVNY 227 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP-SLSEEHFVDSS- 262 + ++ ++ + P+ N + + + ++ Sbjct: 228 FKTNYHPNDRFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAAL 287 Query: 263 SLRHVIKKKHLVRDALASVIR--SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 SL + + + + T + F D + ++ L Sbjct: 288 SLAQSYFNDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKITASLPLIYGL- 346 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ T+I + YD + +N + Y GE Sbjct: 347 ----------YSDGKMTSIIFTDKDGYDRLFYNNHIDYVNGWLFSNDNGYSFTYAWGE-G 395 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + + + + I + +I F N LS A Sbjct: 396 KAYGDAVNVAKTLVMNNITLYSIGFGNNHEVDMDYLRALSTTAGGE 441 >gi|296331310|ref|ZP_06873782.1| hypothetical protein BSU6633_09411 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676296|ref|YP_003867968.1| hypothetical protein BSUW23_18120 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151425|gb|EFG92302.1| hypothetical protein BSU6633_09411 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414540|gb|ADM39659.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 281 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 44/129 (34%), Gaps = 15/129 (11%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G+H ++ + G T I A+ + + + K + L Sbjct: 156 YGLHPYEGSLFDNSLSEIKPTGWTPIAKALSDTRKEFEA----------FDADGKNVVYL 205 Query: 374 LTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +TDGE T + I K + + + I F+ + E+ + A + A Sbjct: 206 ITDGEETCGGDPAAEI-EKLRESNVDTIVNIIGFNFDIKGNEEMKQA--AVAGGGEYISA 262 Query: 432 NSTHELNKI 440 NS E + Sbjct: 263 NSADEFEQA 271 >gi|294012237|ref|YP_003545697.1| hypothetical protein SJA_C1-22510 [Sphingobium japonicum UT26S] gi|292675567|dbj|BAI97085.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 543 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/274 (8%), Positives = 71/274 (25%), Gaps = 19/274 (6%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT-----------A 60 K L++S G AL + ++ GG+ D R + + L+ AA Sbjct: 6 KGLLQSTGGAVAPTVALSLFGLIAAGGIAFDYARMAGLDTELQNAADQAALAAASQLDGT 65 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYL--IRNFENNLKKNFTDREVRDIVRDT 118 + + + ++ + + + + + + + + T Sbjct: 66 STAITRATSAAQTLIVNQTRFANDGNASGLSVTVPTLTFYSSYDQDADTGTVTTSATAAK 125 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 V+++ L+ L + + G+ S + + + Sbjct: 126 FVQVSVGTREAVYALTPVVQLFRSGNLNAFATAGLGSAICKVPPLMLCIPSETFPSSADI 185 Query: 179 IQWVIDFSRSMLDYQRDSEG------QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + + ++ + + G++ E + Sbjct: 186 GDGLRLQPGPQTGAWVPGDYGYLDFGNGASGLAINLGANNQAANCMDNSGGLQTEPGNKA 245 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 V+ + + L G + + Sbjct: 246 SVTEALNTRFDLDAGKKYSCSTSTGDYCPAENVG 279 >gi|73967405|ref|XP_537778.2| PREDICTED: similar to integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Canis familiaris] Length = 1178 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 47/153 (30%), Gaps = 15/153 (9%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + + + ++ + + T AMQ D I + Sbjct: 247 LVQYGEVIQTEFDLRDSQDVMTS--LAKVQNITQVGNVTKTASAMQHVLDNIFT------ 298 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + A K +V+LTDG+ D + N K QG+ I N ++ K + Sbjct: 299 PNHGSRKNASKVMVVLTDGDIFGDPLNLTTVINSPKMQGVERFAIGVG-NAFEKNKTYHE 357 Query: 419 LSNCASPN---SFFEANS---THELNKIFRDRI 445 L AS F+ + L + I Sbjct: 358 LKLIASDPDDRYAFKVTNYTALDGLLSKLQQTI 390 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQNELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|156405834|ref|XP_001640936.1| predicted protein [Nematostella vectensis] gi|156228073|gb|EDO48873.1| predicted protein [Nematostella vectensis] Length = 250 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 69/214 (32%), Gaps = 20/214 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D+S + V + ++I S + + V + ++ Sbjct: 25 CSKPSDVALLIDASGSIGRRRW-PKVVEFTQAIINSFNVSEEGSH-VGIILYSTKTELLV 82 Query: 309 DPSFSWGVHKLIRTIVK--TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + G I + G T I+ A++ A + + S M+ + + Sbjct: 83 KFNTFQGSELTADNINAKVAAVNYRDWGGLTYIDRALKLANEQLFS----PEGGMRASKD 138 Query: 367 AKKYIVLLTDGENTQDN---EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K V+ TDG+ T+D E K + +++ + + + A Sbjct: 139 ILKVAVVFTDGKQTKDKGPFTELQIASQPLKDKDVQVYGLGIG---DETTIDVQEMQEMA 195 Query: 424 S-PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + P + A + EL + +I + V I Sbjct: 196 NKPENVLTAKTFEELKN-----LAAQITQGVCEI 224 >gi|152990153|ref|YP_001355875.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422014|dbj|BAF69518.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 549 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 54/175 (30%), Gaps = 25/175 (14%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + ++ + + + + + + I + + + F D Sbjct: 84 SANVGIALDISNSMKATDYYPDRLQFAKKKIEEFI-------KASKNLNIALLAFADEAY 136 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S L+ + + T A+ +A + E Sbjct: 137 IVSPPSSDKEALLYMLKHLDTESL-ALQGTNFLAALMSADMLL-------------GKEG 182 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY-FLSN 421 +K +VL TDG N +D + IA AK +GI++ I K + ++ + Sbjct: 183 QKSVVLFTDGGNKEDFSKEIAF---AKKRGIQVHIIGIGTQKGAPIPTKNGYMQD 234 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|225551939|ref|ZP_03772879.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] gi|225370937|gb|EEH00367.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] Length = 332 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 18/143 (12%), Positives = 44/143 (30%), Gaps = 20/143 (13%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + I + +G F + + + Sbjct: 109 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 161 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D Sbjct: 162 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIY 208 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I ++ + Sbjct: 209 KDQVINLAQGLNVKIYSVGIGSS 231 >gi|58429455|gb|AAW78131.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 42/138 (30%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLK 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|300021847|ref|YP_003754458.1| hypothetical protein Hden_0312 [Hyphomicrobium denitrificans ATCC 51888] gi|299523668|gb|ADJ22137.1| hypothetical protein Hden_0312 [Hyphomicrobium denitrificans ATCC 51888] Length = 210 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 45/149 (30%), Gaps = 7/149 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + Y + + + AL+ P+++ + +V R ++A Sbjct: 7 LAARIRLYFRTFASDTSAVAAVEFALIAPLLMLMTFGTFEVSRALVAHKRFQRAT----- 61 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKI--EEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 L+ +++ S + T + ++ + F + + +A Sbjct: 62 AMVGDLVSREKQIGSSLSTANTALDGMLVSAQHAMEPFSATPLQIAITQLRASATDASAT 121 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ S + + S D P + Sbjct: 122 KVEWSWSYHSAPIKSCGDTKSMPDENMIS 150 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/280 (11%), Positives = 81/280 (28%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + + F + S G+ + + + Sbjct: 94 VTLSNVSERKDNMRLGLSLATNPKDSSFLACSPLWSHECGSSYYTTGMCSRVNANFRFAK 153 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 154 TVAPALQRCQTYMDIIIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 203 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + V+ + + + I++ T ++ A Sbjct: 204 VGVVQYGEDVVHEFHL-NDYRSVKDVVAAASHIEQRGGTETRTAYGIEFA--------RS 254 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AK+ ++++TDGE + D+ + + ++ + +A ++ Sbjct: 255 EAFQKGGRKGAKRVMIVITDGE-SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINP 313 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G I Sbjct: 314 EAFLNEIKFIASDPDDKHFFNVTDEAALKD-IVDALGERI 352 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 43/326 (13%), Positives = 82/326 (25%), Gaps = 43/326 (13%) Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF---LRSMGIKS 155 ++ N D D + ++ K D N + F ++ G Sbjct: 16 MASIWPNTADVVYADPKKGAITSVSEEKIEGWATAPGEGDGKGNSIRFFEVRIQLKGQNG 75 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF---------GQ 206 + + EA V + E + + + ++ + Sbjct: 76 TAVSYRTEAVEVEPDRNGEKKYTFSLDMRGKWPSAPGTTATYEIVVDAYRVLGNGQEEVY 135 Query: 207 PADRTVKSYSSQNGKVGIRDE--------------KLSPYMVSCNKSLYYMLYPGPLDPS 252 + ++ + + +L Sbjct: 136 FSFPQPPYEYTRQTETSTAKLDFSLSFSQPEYAKPPDGDAQGRLDVTLIPQGGVPAPVRP 195 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + FV S K + AL + + K N + R F+D V + Sbjct: 196 PIDVVFVMDVSGSMTTMKLQSAKSALQAAVNYFKT--NYHPNDRFALIPFSDDVKATSVV 253 Query: 313 SWGVHKLIRTIVKTFA---IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +G + + + G T + A+ A N+ E KK Sbjct: 254 PFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLA------------QSYFNDPERKK 301 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKS 395 YI+ LTDG T N + K Sbjct: 302 YIIFLTDGMPTVLNTTSSITHKEIKK 327 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 32/346 (9%), Positives = 80/346 (23%), Gaps = 20/346 (5%) Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 + + + + + + L + Sbjct: 109 SAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYTRQTETSTAKLDFSLSFSQPEYA 168 Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG---QPLNCF 204 G + + + +V+D S SM + S Q + Sbjct: 169 KPPDGDAQGRLDVTLI-PQGGVPAPVRPPIDVVFVMDVSGSMTTMKLQSAKSALQAAVNY 227 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP-SLSEEHFVDSS- 262 + ++ ++ + P+ N + + + ++ Sbjct: 228 FKTNYHPNDRFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAAL 287 Query: 263 SLRHVIKKKHLVRDALASVIR--SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 SL + + + + T + F D + ++ L Sbjct: 288 SLAQSYFNDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKITASLPLIYGL- 346 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ T+I+ + YD + +N + Y GE Sbjct: 347 ----------YSDGKMTSISFTDKDGYDRLFYNNHIDYVNGWLFSNDNGYSFTYAWGE-G 395 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + + + + I + +I F N LS A Sbjct: 396 KAYGDAVNVAKTLVMNNITLYSIGFGNNHEVDMDYLRALSTTAGGE 441 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/280 (11%), Positives = 81/280 (28%), Gaps = 28/280 (10%) Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 V++ V + +M + + F + S G+ + + + Sbjct: 94 VTLSNVSERKDNMRLGLSLATNPKDSSFLACSPLWSHECGSSYYTTGMCSRVNANFRFAK 153 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 + +D + + + + +I +KK ++ Sbjct: 154 TVAPALQRCQTYMDIIIVLDGSNS----------IYPWVEVQHFLINILKKFYIGPGQIQ 203 Query: 297 MGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 +G + + V+ + + + I++ T ++ A Sbjct: 204 VGVVQYGEDVVHEFHL-NDYRSVKDVVAAASHIEQRGGTETRTAYGIEFA--------RS 254 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E + AK+ ++++TDGE + D+ + + ++ + +A ++ Sbjct: 255 EAFQKGGRKGAKRVMIVITDGE-SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINP 313 Query: 417 YFL----SNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 AS FF L D +G I Sbjct: 314 EAFLNEIKFIASDPDDKHFFNVTDEAALKD-IVDALGERI 352 >gi|255263037|ref|ZP_05342379.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255105372|gb|EET48046.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 203 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 18/170 (10%), Positives = 54/170 (31%), Gaps = 4/170 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + K+ G + +L PV + + ++V + L++ A+ + Sbjct: 10 RFRKNEDGSSSVEFVILFPVFIVLVATSIEVGLVMTRQVMLERGTDLAVRAIRLGTTSPG 69 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +++ N I + + + + + V D +P A +V Sbjct: 70 PVGAAQITNMICSTASIIPDCVNQVKVEMRPIDPRSLTLIPTVADCRDRDDPAVPAREVT 129 Query: 133 LSSRYDLLLNPL----SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 ++ L++ + F ++G+ + + + + + Sbjct: 130 SGTQNQLMVLRVCALIDPFFPTVGLGKRFLVDENGKLVPRTNENGDIMRD 179 >gi|167845909|ref|ZP_02471417.1| hypothetical protein BpseB_11520 [Burkholderia pseudomallei B7210] Length = 579 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 46/165 (27%), Gaps = 5/165 (3%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G F ++ A+ M V + G VD+ + L++ A A + + + ++ + Sbjct: 1 RGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQCAQP 56 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 + + + L D + P +A QV + Sbjct: 57 NAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTATQSVP 115 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 116 YFFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 160 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 31/203 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 312 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQNELIQINSGS 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 367 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 415 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 416 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 466 Query: 437 -LNKIFRDRI--GNEIFERVIRI 456 L F + +R I++ Sbjct: 467 GLIDAFGALSSGNGAVSQRSIQL 489 >gi|327278404|ref|XP_003223952.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Anolis carolinensis] Length = 1162 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 40/109 (36%), Gaps = 11/109 (10%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A++ + +S + + + ++++TDG T D + + Sbjct: 213 QVGGWTETATAIRRVVRELFTSQKGSRNGAT------RILIVITDGVKT-DRLQYSQVIP 265 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHEL 437 +AK GI I + + A+ L + AS F + + L Sbjct: 266 EAKQAGIIRYAIGVGDAFSSPD-AKRELDDIASEPKAEHIFTVYNFNAL 313 >gi|326675074|ref|XP_003200270.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 164 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 42/135 (31%), Gaps = 15/135 (11%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 +NDR ++ I+ I G TA A++ A + + + Sbjct: 1 MVLYNDRPSAEFYL--DTFANKNDIMNYIKIIPYRGGGTATGAALKFAQNNLFTQKRGSR 58 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + K+ +++TDGE+ D + + G+ + + + + Sbjct: 59 KALG----VKQIAIVMTDGESEDDVTTT---AAELRRSGVTVYALGV------KNASVEE 105 Query: 419 LSNCASPNSFFEANS 433 L S + Sbjct: 106 LKKIGSYPEHEFVFN 120 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 49/469 (10%), Positives = 111/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + S+ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGSQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-------------------- 385 A D + + N KK IVLLTDG T Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYTVSRVQTEADGRFYGTQFTNRQ 407 Query: 386 -------------------------------GIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 I K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|221127586|ref|XP_002157796.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 449 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 15/130 (11%) Query: 319 LIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + G TA +DA++ A + S + + + ++L+ DG Sbjct: 92 FEKQDIFDRIDKIPYPGYRTATDDALRVANKYMFSLSGGARQGV------PQVLILVNDG 145 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANST 434 T +E + K +G+ I + + + L + AS + FF + Sbjct: 146 RCTVCSENVSSASAPLKEKGVSIFCVGI-----SKTVDKKELLSIASEPAEDHFFYVETI 200 Query: 435 HELNKIFRDR 444 EL Sbjct: 201 DELPTFISKL 210 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 51/166 (30%), Gaps = 13/166 (7%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + R A + + +F L +T I+ A++ Sbjct: 297 VDSFSISQQNARFAALVYGSNASVEFNFVRYDSALEIKQAIQSLSYLKS--NTRIDKALE 354 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + S ++++ Y+ DG T+ + ++ K G++I+ I Sbjct: 355 VAKSDLFSLQ----GKVRSRRPMILYVFF--DGTVTRSMSDLESVVQPLKDYGVKIIAIG 408 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 E RY L + N+ F S EL + + Sbjct: 409 VG-----PEVNRYQLKKISEDNAIFSGKSFKELAPLLYSIVEQSCS 449 >gi|19703911|ref|NP_603473.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714079|gb|AAL94772.1| hypothetical protein FN0576 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 369 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 45/139 (32%), Gaps = 18/139 (12%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + + L ++ G T+I +++ + + + + Sbjct: 75 DESCGANELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDL---------KALD 125 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSN 421 + + ++TDG T + I + K + I + I F+V+ Q L Sbjct: 126 GEKTLNILYIITDGIETCG-GNPVEIAKQLKGENTNIVLGIIGFNVDANQN----RLLKQ 180 Query: 422 CAS--PNSFFEANSTHELN 438 A + N ++L Sbjct: 181 IADAAGGYYSSVNDANKLT 199 >gi|261876471|dbj|BAI47561.1| collagen type VI alpha 1 subunit [Mesocricetus auratus] Length = 1026 Score = 46.1 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 58 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVD 117 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 118 AVK-YFGKGTYTDCAIKKGLEELL--------IGGSHLKENKYLIVVTDGHPLEGYKEPC 168 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 169 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 223 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 224 -EETISQTIDTIV 235 >gi|156741667|ref|YP_001431796.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232995|gb|ABU57778.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 826 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 54/165 (32%), Gaps = 5/165 (3%) Query: 12 KKLIKSCT-GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 ++L+ T G ++ A ++ +++G+ + VD+ + + + A + LI Sbjct: 3 RRLLHHRTEGQNIVLLAGILALLVGMAALAVDLGVTYAEQRNIVRGTNAASLAGMNRLIS 62 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNF----ENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + I+ + + + + + I + Sbjct: 63 GGRDADVALAIYESLRSNGIQVTVPGESPQPGDRAFEALYLGSDGAPIPGACSRVGACGS 122 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 Q V + DL N + F R G + + A A + + Sbjct: 123 QRPQGVKYIQIDLKGNVDTYFARLFGQSTLPVGATAYASVGACAT 167 >gi|329851852|ref|ZP_08266533.1| hypothetical protein ABI_46220 [Asticcacaulis biprosthecum C19] gi|328839701|gb|EGF89274.1| hypothetical protein ABI_46220 [Asticcacaulis biprosthecum C19] Length = 172 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 13/92 (14%), Positives = 35/92 (38%), Gaps = 11/92 (11%) Query: 5 TKF--IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI- 61 ++ + + + G + AL++PV+LGV + D+ + E A++ + +A Sbjct: 6 SRLNVKTFLSRFARERHGAAIVEFALIVPVLLGVLAAVDDLSAVTMKEKAMRSSVSSAAQ 65 Query: 62 --------ITASVPLIQSLEEVSSRAKNSFTF 85 ++ + ++ S + Sbjct: 66 YVMRGGSDLSVARQIVLSSWTPETEDATVSAV 97 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 111/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNISKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|296327481|ref|ZP_06870027.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296155307|gb|EFG96078.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 530 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 45/139 (32%), Gaps = 18/139 (12%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + + L ++ G T+I +++ + + + + Sbjct: 236 DESCGSNELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDL---------KALD 286 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSN 421 + + ++TDG T + I + K + I + I F+V+ Q L Sbjct: 287 GEKTLNILYIITDGIETCG-GNPVEIAKQLKGENTNIVLGIIGFNVDANQN----RLLKQ 341 Query: 422 CAS--PNSFFEANSTHELN 438 A + N ++L Sbjct: 342 IADAAGGYYSSVNDANKLT 360 >gi|124783268|ref|NP_073725.2| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 precursor [Mus musculus] gi|171769535|sp|A2AVA0|SVEP1_MOUSE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=Polydom; Flags: Precursor gi|123210319|emb|CAM21214.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] gi|123229801|emb|CAM23597.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] Length = 3567 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 KGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|26330612|dbj|BAC29036.1| unnamed protein product [Mus musculus] Length = 440 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 KGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|26342893|dbj|BAC35103.1| unnamed protein product [Mus musculus] Length = 848 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 KGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|11177164|gb|AAG32160.1|AF206329_1 polydom protein [Mus musculus] Length = 3567 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 KGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|119716454|ref|YP_923419.1| TadE family protein [Nocardioides sp. JS614] gi|119537115|gb|ABL81732.1| TadE family protein [Nocardioides sp. JS614] Length = 131 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 40/133 (30%), Gaps = 9/133 (6%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G I LMP + + + + + + + + A A + Sbjct: 7 RRSRDERGSVTIQMVFLMPALFLLMFLGLQGALYYHAK---QVAL-AAAQEGAREAGSET 62 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + TF + ++ + + + T + + +V V Sbjct: 63 GTRDAGVATANTFLQDAGGSDVMTSTSVSGSRTTTTATITVTGKSMSVIPGWH-----VT 117 Query: 133 LSSRYDLLLNPLS 145 ++ + + L+ Sbjct: 118 VTQSASVPVERLT 130 >gi|316941492|gb|ADU75526.1| hypothetical protein Clo1313_2521 [Clostridium thermocellum DSM 1313] Length = 1112 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 9/52 (17%), Positives = 23/52 (44%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 + K G I +++ +M+ + +LV+ R + ++ A + +A Sbjct: 2 RFFKKTEGAISIFLCIVLLIMIAIPAILVEGTRIKAAQTQIESALDISAKSA 53 >gi|182434868|ref|YP_001822587.1| hypothetical protein SGR_1075 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178463384|dbj|BAG17904.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 424 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 60/188 (31%), Gaps = 14/188 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKT 326 + + A V+ + + ++ DR D + V L RT K Sbjct: 60 SRMAAAKQAFNDVLDAAPEEVHLGIRTLGADYPGEDRKVGCKDTKQLYPVGPLDRTEAKA 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ A D + E + IVL+TDGE+T + Sbjct: 120 AVATLAPTGFTPIGPALLGAADDL------------EGGEGSRRIVLITDGEDTCGPLDP 167 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + ++G ++ + + + + A+ ++ EL+ + + Sbjct: 168 CEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELSGRVKQLVD 227 Query: 447 NEIFERVI 454 V Sbjct: 228 RAAEPVVT 235 >gi|126334038|ref|XP_001370553.1| PREDICTED: similar to Integrin alpha-X precursor (Leukocyte adhesion glycoprotein p150,95 alpha chain) (Leukocyte adhesion receptor p150,95) (Leu M5) (CD11c antigen) [Monodelphis domestica] Length = 1224 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 55/178 (30%), Gaps = 18/178 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ + ++I +K + F +F + + Sbjct: 228 KNFVKAMISQFEKPST-----QFSLMQFASNFKIHFTFEKFKNSHDPRRLVDEITQL--S 280 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + A K ++++TDGE D E + A+ Sbjct: 281 GVTKTASGIKKVINELFQKTRGARQ------YATKILIVITDGEKYDDPLEYSQVIPTAE 334 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 GI I + + L AS + F ++ L+ ++++ +I Sbjct: 335 KAGIIRYAIGVGEAFERPSSRQE-LEEIASEPSKDHIFWVDNFGALSN-IQNQLKEKI 390 >gi|116625580|ref|YP_827736.1| hypothetical protein Acid_6529 [Candidatus Solibacter usitatus Ellin6076] gi|116228742|gb|ABJ87451.1| hypothetical protein Acid_6529 [Candidatus Solibacter usitatus Ellin6076] Length = 376 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 30/299 (10%), Positives = 66/299 (22%), Gaps = 11/299 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + KS G + AL +LGV G+ VDV R ++ ++ + + + Sbjct: 3 RRPKSRRGFVLVTMALSTVAVLGVVGLAVDVGRIFIAKNEVQVYCDS-----AAVAAAAA 57 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ + L T + + V Sbjct: 58 LDGTTYGIAAAAGAAAASPNKWNFGTTAILSPLVTFAQAPGGPWLANPNSAAGYTYAHVA 117 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S L PL + + + + + + R VS + Sbjct: 118 ASVTVPLYFLPLVVGQSTSAVTAAAAAGQISLAALPRGLAPYTAVSTNTAGPTFGFVAGN 177 Query: 193 QRDSEGQPLNCFGQ-PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 N + G LS + + + Sbjct: 178 SYTIHWPTYNANRSHCRVDNPDKCFNSPPCAGDSAASLSAVVANWGSKYHGYWGSNSNSD 237 Query: 252 SLSEEHFVDSSSLRHVIKKKHLV-----RDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + V + + + A + D + + Sbjct: 238 IAAAVMDTIQLTPLAVGDNVDPLLTPGNKQSEAGYLDQRASQDTDTTDNTVAGYLASAG 296 >gi|262091908|gb|ACY25457.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 667 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 43/154 (27%), Gaps = 17/154 (11%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D R+ F+ V + + + ST + + + + Sbjct: 332 RDQDRLAVVHFSTDVFAVV----DHRRPNARQLSNSIGSLRPRNSTNVQAGLNRGVE-LA 386 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG----IRIMTIAFSV 407 E N YI+L++DG D + AI A+ + +R++TI + Sbjct: 387 DEVRRERPNAYN------YIILMSDGVANVDATDPFAILASAEDRDSSNPLRLITIGVGI 440 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + F Sbjct: 441 QNYNDVLLEQLAQY--GNGWYRYLSDVAQARSTF 472 >gi|326775386|ref|ZP_08234651.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326655719|gb|EGE40565.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 424 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 60/188 (31%), Gaps = 14/188 (7%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR--VISDPSFSWGVHKLIRTIVKT 326 + + A V+ + + ++ DR D + V L RT K Sbjct: 60 SRMAAAKQAFNDVLDAAPEEVHLGIRTLGADYPGEDRKVGCKDTKQLYPVGPLDRTEAKA 119 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T I A+ A D + E + IVL+TDGE+T + Sbjct: 120 AVATLAPTGFTPIGPALLGAADDL------------EGGEGSRRIVLITDGEDTCGPLDP 167 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + ++G ++ + + + + A+ ++ EL+ + + Sbjct: 168 CEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELSGRVKQLVD 227 Query: 447 NEIFERVI 454 V Sbjct: 228 RAAEPVVT 235 >gi|282879638|ref|ZP_06288369.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306586|gb|EFA98615.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 346 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 21/213 (9%), Positives = 59/213 (27%), Gaps = 42/213 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + V + + + +++ + ++G F + Sbjct: 102 DISNSMLAEDVVPSRLAKSKLLVENLVDNFT-------NDKIGLVIFAGDAFIQLPITSD 154 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++ T I A++ + + + + I+L+T Sbjct: 155 YVSA-KMFLQNTDPSLITTQGTDIARAIRLSMSSFTQQ-----------DKVGRAIILIT 202 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFL----- 419 DGE+ + A +GI + + + E + + Sbjct: 203 DGEDHEGGAL--EAAADANKKGINVFILGVGNTQGAPIPVAEGGYMKDENGQTVMTALNE 260 Query: 420 SNC-----ASPNSFFEANSTHELNKIFRDRIGN 447 C A ++ ++T++ K D + Sbjct: 261 KMCQDIARAGKGTYIHVDNTNDAQKKLNDELAK 293 >gi|17231904|ref|NP_488452.1| hypothetical protein alr4412 [Nostoc sp. PCC 7120] gi|17133548|dbj|BAB76111.1| alr4412 [Nostoc sp. PCC 7120] Length = 820 Score = 46.1 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 48/167 (28%), Gaps = 22/167 (13%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F+D + RT + + G T + ++ + Sbjct: 331 NPDDTFSIVDFSDTTRQLSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLN--- 387 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + IVLLTDG +N+ + K G R+ + + Sbjct: 388 --------FPVTDPGRLRSIVLLTDGYIGNENQILAEVQQHLK-SGNRLYSFGAGSSVN- 437 Query: 412 QEKARYFLSNCASPNS-----FFEANSTHELNKIFRDRIGNEIFERV 453 R+ L+ A T E+ F +I N + + Sbjct: 438 ----RFLLNRIAELGRGIAQIIRHDEPTDEIVDKFYRQINNPVLANI 480 >gi|322437151|ref|YP_004219363.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164878|gb|ADW70583.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 324 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 56/155 (36%), Gaps = 15/155 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + R F+ + ++ + L + G TA+ +D++ Sbjct: 131 HQNDRAFVEGFDVQTYLPQDYTNNIDLLDQ-----GIRKLRPGGGTAL-------FDSLY 178 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + D++ ++ + E +K ++L++DG++ + + TI+ + T+ Sbjct: 179 KTCRDQMLALQADNEVRKALILVSDGDDNYSRASMTDAIKMCQRADTIVYTISTDTSPTR 238 Query: 412 QEKARYFLSNC--ASPNSFFEANSTHELNKIFRDR 444 + L + A+ F ++ F++ Sbjct: 239 GKGGA-VLESISGATGGRTFFPIKLEDVAIGFKNI 272 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 54/355 (15%), Positives = 111/355 (31%), Gaps = 57/355 (16%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID--FSRSMLDYQRD 195 + +S + K+ + ET + + + ++ S+ Sbjct: 947 SITYVAVSESDGKVTFKNLRVGEYTLTETAAAKGYVRETAEWKVKVERGKDGSIAAALYQ 1006 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP----YMVSCNKSLYYMLYPGPLDP 251 S+G L + + T K + +N + ++ + + + S Sbjct: 1007 SDGTTLTEGQKIVNYTEKEEAVKNLESDKTASVVNEADRVFQIELSASTKGRDEGIAAQA 1066 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + S+S++ KK ++DA + + + K+ +++ + S Sbjct: 1067 ASIVLVLDASASMQENGKKLKDIQDAAKAFVNTTKEKSPISE--IAVIWYQGSEGSSSTI 1124 Query: 312 FSWGVHKLIRTIVKTFAIDE----NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 G + L + N G T + DA++ A + + Sbjct: 1125 TDSGFYTLDTSDNVDAINRFISNKNASGGTPMGDALEEANSILS-----------GRPNS 1173 Query: 368 KKYIVLLTDGENTQDNEEGIAIC-------NKAKSQG--IRIMTIAFSVNKTQ------- 411 KY +L TDG ++ C N+AK ++ TI + ++ + Sbjct: 1174 SKYALLFTDGMPGYNSSNNSFNCMVANHANNEAKEIKEYAKLYTIGYKLSGSFKWEEGHS 1233 Query: 412 ------------QEKARYFLSN-CASPN-----SFFEANSTHELNKIFRDRIGNE 448 + KA FL N AS + ++T L KIF D G Sbjct: 1234 QDSTNNHGSHKTETKAADFLKNYLASSPEGDRTYAYTTDNTDGLTKIFEDIAGQI 1288 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/312 (10%), Positives = 81/312 (25%), Gaps = 16/312 (5%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + + + ++ + E V+ T V Sbjct: 372 VEDKLKDKAKNENYSIAGFLAYDISFVDEDGKEVEPDGNVKVTMEYKKDVIPKEVEVTEK 431 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR- 194 + + L + K + ++ + ++V D + + Sbjct: 432 DLGVTVMHLEENEKGQVKKVVDMVADDGSKASVETTDNGKVKKAEFVTDSFSTFTLAWQE 491 Query: 195 -----DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + +D L+ + + + Sbjct: 492 YEPLLTDYANGSVRTSDNSLGAPEHNKRIKYNEKDKDYTLTLDVTGKRGKKAGVDVLLVI 551 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 D S S + + + ++ + +++ +I D+ + +F +D + Sbjct: 552 DKSGSMGLNDNGRTDSNYFNLMPTLKKTVPTLVDTI-LPDSDSVNRVAAISFSSDDYTGN 610 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 + V ++ G T AM+ A + E + KK Sbjct: 611 DISTDWVDYNGKSGFNRKIEGLGTKGGTNWQLAMRNADKKLKPRAE---------SQNKK 661 Query: 370 YIVLLTDGENTQ 381 +V L+DGE T Sbjct: 662 VVVFLSDGEPTY 673 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 90/335 (26%), Gaps = 27/335 (8%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 A D + + N KK IVLLTDG T Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPT 382 >gi|186683831|ref|YP_001867027.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186466283|gb|ACC82084.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 615 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 55/174 (31%), Gaps = 22/174 (12%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + + ++D V S + +K G T ++ + Sbjct: 68 DQLEPDDILSVVVYDDEVDSVVPP---QAVTNKATLKDSIRKVRAGGITNLSGGWLKGCE 124 Query: 349 TIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGIAICNKA---KSQGIRIMTI 403 + K L+ +K ++LLTDG ++ + + +GI T+ Sbjct: 125 HV-----------KTRLDPQKINRVLLLTDGHANMGIQDPKVLTATSGQKAEEGITTTTL 173 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F+ + A+ +F+ S E ++F + + +T Sbjct: 174 GFAQGFNEDLLIGM---ARAARGNFYFIQSIDEATEVFSIELDSLRAVVGQNLT 224 >gi|239833242|ref|ZP_04681571.1| TadE family protein [Ochrobactrum intermedium LMG 3301] gi|239825509|gb|EEQ97077.1| TadE family protein [Ochrobactrum intermedium LMG 3301] Length = 215 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 16/159 (10%), Positives = 46/159 (28%), Gaps = 5/159 (3%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII---TASVP 67 ++ ++ G + LL+ L + +++ + A +T + Sbjct: 39 LRRFPRAQNGVAAVEFVLLIVPFLLIVFATIEIGVSFAARQVIANATETVARKLQMSGRI 98 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 +++ Q+++ + N T + DT ++ + + Sbjct: 99 GGIQIKDAPVSVDALRNELCQQMQFMVASGCPNLSFNLGTYEGFGQVPTDTILDEEGKLT 158 Query: 128 AYQVVLSSRYDL--LLNPLSLFLRSMGIKSWLIQTKAEA 164 + +S LN + + I + A Sbjct: 159 RTGITGTSGTSTINQLNVVYAWPVLTNILYLMQSPHAAG 197 >gi|282900974|ref|ZP_06308907.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] gi|281194065|gb|EFA69029.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] Length = 487 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 67/200 (33%), Gaps = 32/200 (16%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+SS K V+ A + ++ + ++ F V + + + Sbjct: 56 IDTSSSMSD-GKLTEVKTAASQFMQRRNL-----EVDQIAVVNFGSEVATPAPLTNDI-- 107 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 I+ E GST + + + TA + + K I+L TDG Sbjct: 108 ---NILNNAINQLLENGSTPMGEGIDTAQGQLQA------------TTLNKNIILFTDGI 152 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANSTHEL 437 N + + ++ GI+++ +A +L+ S F ANS + Sbjct: 153 PDDPNFAYNSALS-VRNAGIKLIAVATGG------ADTNYLTQITGDRSLVFYANSG-QF 204 Query: 438 NKIFRDRIGNEIFERVIRIT 457 ++ F + + T Sbjct: 205 DQAFSQAEAVIYKQLIESNT 224 >gi|225575062|ref|ZP_03783672.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] gi|225037732|gb|EEG47978.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] Length = 393 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 71/225 (31%), Gaps = 30/225 (13%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + V YML P ++ +D S KK A +++ + Sbjct: 86 TGFAVGAGGQFLYMLPPMENHSAVDMVLLLDGSGSMQ--GKKEPCVQATEALLEQMD--- 140 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + R A F V+ + + + R + F + +G T + A ++ Sbjct: 141 ---EQSRAQAVAFASCVLGNTEL-LPLDEEGRETLIKFVEGTDIIGGTEFGQPLTFALNS 196 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + E + ++LL+DGE K + + + TI + Sbjct: 197 LEEKKETGRIQA---------VILLSDGEGPFPETLEEEY----KEKDVVLYTIRMDAGE 243 Query: 410 TQQEKARYFLSNCASPNSF---FEAN-----STHELNKIFRDRIG 446 + E AR + F + ST EL K FR Sbjct: 244 QETETARQLVQFAQKTGGFDTKIPVDEKGQISTDELTKAFRQAFS 288 >gi|148253063|ref|YP_001237648.1| hypothetical protein BBta_1525 [Bradyrhizobium sp. BTAi1] gi|146405236|gb|ABQ33742.1| putative membrane protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 174 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 40/134 (29%), Gaps = 2/134 (1%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 T+F S + +S +G I AL +P + + VD+ Y + A + ++ Sbjct: 19 KTRFGKLSGR--RSESGTAAIEFALFIPFLFILLVGTVDLGFAMYEAMQVSNAVEAGMLY 76 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ S +S S +P T + + Sbjct: 77 AAKNGWDSAGITNSVLNASSVYPGGTPALTATPAPSQFCGCPQATGIAVATCSSTCPDGS 136 Query: 124 PRKSAYQVVLSSRY 137 QV + + Sbjct: 137 AVSQYVQVNAALNH 150 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPRYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQE- 413 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDSRAN 518 Query: 414 ----KARYFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + + S + ++E A+ +++ + V Sbjct: 519 LSKQQIEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|293570439|ref|ZP_06681494.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291609385|gb|EFF38652.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1042 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 37/378 (9%), Positives = 93/378 (24%), Gaps = 62/378 (16%) Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 +S L LSL ++ + + + V+D + Sbjct: 169 NSVSPFTLPTLSLPAMTVSANQTIPTEYTTDDQGTYPKANWQPTGNTNVLDHQGNKNGSN 228 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + N + + SY G D + + + +Y Sbjct: 229 QWDGI---NSWDGDPNDRTHSYIEYGGTGNQADYAIRKFAKETTTPGLFDVYLNARGNVQ 285 Query: 254 SEEHF------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + VD S + + V+ + + ++ + D + MG ++ Sbjct: 286 KDITPLDLVLVVDWSGSMNDNDRIGEVKIGVDRFVDTLSD-SGITDKINMGYVGYSSDGY 344 Query: 308 SDPSFSWGVHKLIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII--------------- 351 + + + + + G T ++ A D + Sbjct: 345 NYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGHKKVIVLLTDG 404 Query: 352 -SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE------------------------EG 386 + +V R+ Y ++ ++ N Sbjct: 405 VPTFSYKVQRVHAQPSNDYYGTQFSNTQDQPGNTSRIARSYYAPDQNNQSRRIDSTFIAT 464 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----LSNCAS----PNSFFEANS-THE 436 I K +GI I + + + S + ++E+ + Sbjct: 465 IGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSADEKGDLYYESADHATD 524 Query: 437 LNKIFRDRIGNEIFERVI 454 +++ + +I V Sbjct: 525 ISEYLAKK-AVQISATVS 541 >gi|145595544|ref|YP_001159841.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145304881|gb|ABP55463.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 316 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 61/172 (35%), Gaps = 23/172 (13%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D+ +G F + R V STA +A+ T + I Sbjct: 124 PDSYNLGLVSFAKAANVLVPPT-----KDRQAVLAAINGLALAESTATGEAVFTCLEAIR 178 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S IVLL+DG T A A++ + + TIAF + Q Sbjct: 179 SV-----PADGAAGIPPARIVLLSDGYRTSGRSVEQAA-AAAQAANVAVSTIAFGTDGGQ 232 Query: 412 QE---------KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + R L++ A + F+EA S EL ++++D G+ I R Sbjct: 233 VDIGGQRQRVPVDRLALADLAETTDGYFYEAASVSELKQVYQDM-GSSIGFR 283 >gi|297680998|ref|XP_002818254.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Pongo abelii] Length = 1125 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 834 TARIGVINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 891 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 892 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 941 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 942 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 980 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 51/145 (35%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K +L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVALLMTDGIDHPRNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TI S + Sbjct: 173 SISEDARISGISFITIGLSTVVNEA 197 >gi|291405306|ref|XP_002719067.1| PREDICTED: integrin, alpha E [Oryctolagus cuniculus] Length = 1187 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 53/172 (30%), Gaps = 12/172 (6%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A + + + + ++ + + + + Sbjct: 230 DPPDFQRAKDFIANMMTNFSEKCFECSFALVQYGRVIQTEFNLQD--SQNMTASLAKVQN 287 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T A+Q D I + R + +A K IV+LTDG+ +D + Sbjct: 288 ITQVRNVTRTASAIQHVLDDIFT------QRHGSRKKASKVIVVLTDGDTFEDPLNLTVV 341 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 N K QGI I + ++ + L AS F+ + L+ Sbjct: 342 INSPKMQGIERFAIGVG-DAFKKHQTEQELKLIASDPDETHAFKVTNYSALD 392 >gi|260824339|ref|XP_002607125.1| hypothetical protein BRAFLDRAFT_68081 [Branchiostoma floridae] gi|229292471|gb|EEN63135.1| hypothetical protein BRAFLDRAFT_68081 [Branchiostoma floridae] Length = 152 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 27/83 (32%), Gaps = 3/83 (3%) Query: 370 YIVLLTDGENTQDNEEGIAICNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 IVLLTDG+ T I A+ + + F + + + R L N Sbjct: 1 MIVLLTDGQPTVGVTNTNQIIANARDFLAGNHALFCLGFGYDVSFEFLERLALQNGGFAR 60 Query: 427 SFFEANSTHELNKIFRDRIGNEI 449 + + F D + + Sbjct: 61 RIYPDDDGELQLTSFFDEVAKPL 83 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 57/171 (33%), Gaps = 17/171 (9%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D++ D R+G FN S K +K ++ + T ++ MQ A + Sbjct: 1002 DHLEDDDRLGLVLFNTGAELAEPVSLIGAK-NMQKLKGDVLEISATDGTRLSAGMQMATE 1060 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---EGIAICNKAKSQGIRIMTIAF 405 + N E + I+ LTD + + ++ + I Sbjct: 1061 L------YDEFLEVNQSEYENRIIFLTDAMPNLGQTSEESLLGMTEANANKNVYTTFIGI 1114 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 V+ + A +++ +S K F++R+ +E V + Sbjct: 1115 GVDFNTELVEYITKIRGA---NYYSVHSA----KQFKERMDDEFEYMVTPL 1158 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 46/457 (10%), Positives = 106/457 (23%), Gaps = 80/457 (17%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A + + + E + V + ++T + Sbjct: 124 AVKQSSLQTAQPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITVPENTKETNKNDS 183 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 Q + + ++ L + T + + Sbjct: 184 APDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRPTGNQ 243 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + + ++ + SY G D + Y + Sbjct: 244 NVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYASETTTPGLFD 303 Query: 244 LYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 +Y E VD S + + V+ + + ++ + + + M Sbjct: 304 VYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLAD-SGITNNINM 362 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 G ++ ++ + G ++ +K + G T A++ A D + + N Sbjct: 363 GYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRDAGDMLATPN--- 418 Query: 358 VHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------------------ 391 KK IVLLTDG T + Sbjct: 419 --------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGS 470 Query: 392 -------------------------KAKSQGIRIMTIAFSVNKTQQEKAR-----YFLSN 421 K +GI I + + + + Sbjct: 471 YNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMRE 530 Query: 422 CAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 S + ++E A+ +++ + V Sbjct: 531 MVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|157738379|ref|YP_001491063.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] gi|157700233|gb|ABV68393.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] Length = 1866 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/295 (12%), Positives = 76/295 (25%), Gaps = 12/295 (4%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 + + TD V V + + + +S D + Sbjct: 1233 TTVYKYDISLSAALTDTDGSETLSVIIKNVPASATLESSKYEVSKNSDGSYTVKVPQGET 1292 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 ++ E + + D ++ P G D Sbjct: 1293 SISDKLTMKVPQEDAKNINLQIEAKATEARDNEDGQNFKTATDSTTDKTPTLVVGSNKDS 1352 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + ++ +G N Y + +E ++ I + Sbjct: 1353 VINGGAGKDILIGDTGGTQLNVQAGKN----YNIALVVDTSGSMKEASGSKTAWGTTISR 1408 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L++DAL ++ S+K D + + F+ +F+ K I + T Sbjct: 1409 IDLLKDALKNLADSLKGHD---GKINVSIIDFDTNAKEPITFNDLTSKNISD-LITKIDA 1464 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T DA + + + LTDG+ T N Sbjct: 1465 LKAEGGTNYEDAFLKTTSWFDTQS----VTYGKAQGYENLTYFLTDGDPTFSNRN 1515 >gi|194387934|dbj|BAG61380.1| unnamed protein product [Homo sapiens] Length = 589 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|194386506|dbj|BAG61063.1| unnamed protein product [Homo sapiens] Length = 404 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|182676519|sp|P0C6B8|SVEP1_RAT RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; Flags: Precursor Length = 3564 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 MGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|119629725|gb|EAX09320.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] gi|119629726|gb|EAX09321.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] Length = 726 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|114684807|ref|XP_001158390.1| PREDICTED: collagen, type VI, alpha 1 isoform 1 [Pan troglodytes] gi|114684809|ref|XP_001158445.1| PREDICTED: collagen alpha-1(VI) chain isoform 2 [Pan troglodytes] Length = 1028 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|293347389|ref|XP_002726583.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Rattus norvegicus] Length = 3578 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 MGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|109474969|ref|XP_001065678.1| PREDICTED: polydom [Rattus norvegicus] Length = 3583 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 34/126 (26%), Gaps = 17/126 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAI 339 + V+ R+ F+ + + E G T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYT 170 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A Q A + S E+ K I L+TDG + + I + G+ Sbjct: 171 MGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASLRDFGVE 218 Query: 400 IMTIAF 405 I T Sbjct: 219 IFTFGI 224 >gi|87196339|ref|NP_001839.2| collagen alpha-1(VI) chain precursor [Homo sapiens] gi|125987811|sp|P12109|CO6A1_HUMAN RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|30851190|gb|AAH52575.1| Collagen, type VI, alpha 1 [Homo sapiens] gi|119629727|gb|EAX09322.1| collagen, type VI, alpha 1, isoform CRA_b [Homo sapiens] Length = 1028 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|30032|emb|CAA33888.1| precursor polypeptide (AA -19 to 237) [Homo sapiens] Length = 256 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEAISQTIDTIV 236 >gi|227833260|ref|YP_002834967.1| hypothetical protein cauri_1436 [Corynebacterium aurimucosum ATCC 700975] gi|262184244|ref|ZP_06043665.1| hypothetical protein CaurA7_09649 [Corynebacterium aurimucosum ATCC 700975] gi|227454276|gb|ACP33029.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 688 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 50/166 (30%), Gaps = 22/166 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDN----VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + +DA + + A + D G + + Sbjct: 88 TRLDAAKDATKKFVSELGGTIPLGLVTYGGTVDEAPENQEAGCQDIHVVSGPKEDVGDSF 147 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G T I D+++ A + + IVL++DG +T Sbjct: 148 TGPIDALQAKGYTPIGDSLKKAAEEL-GGQHGT-------------IVLVSDGIDTCAPP 193 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 + + QGI I TI F+V++ +++ A+ + Sbjct: 194 PVCEVAKELHEQGIDLVINTIGFNVDEEARKELSCIAE--AAGGEY 237 >gi|91084771|ref|XP_972223.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 606 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 53/189 (28%), Gaps = 13/189 (6%) Query: 271 KHLVRDALASVIRSIK-KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 V + + IK + + FN + + K + Sbjct: 295 IVSVWNFEKNRFDQIKFAQKPEYENLDSFLAEFNLG-DAAQVSEDNIKKAKEIKDHDVDM 353 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T I + + + E + + + + I+ LTDG I Sbjct: 354 D-----CTNIIGGLVVGLYLVRQTLE-KFYEKNIETKHQPMIIFLTDGLPNVGLIIRDEI 407 Query: 390 CNKAK--SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + +QG I +++F + + + N +EA + F Sbjct: 408 TDVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYRT 467 Query: 445 IGNEIFERV 453 + + + V Sbjct: 468 VSSPLLRDV 476 >gi|107022601|ref|YP_620928.1| membrane protein [Burkholderia cenocepacia AU 1054] gi|116689550|ref|YP_835173.1| membrane protein [Burkholderia cenocepacia HI2424] gi|105892790|gb|ABF75955.1| membrane protein [Burkholderia cenocepacia AU 1054] gi|116647639|gb|ABK08280.1| membrane protein [Burkholderia cenocepacia HI2424] Length = 603 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 14/156 (8%), Positives = 43/156 (27%), Gaps = 1/156 (0%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G F ++ + + + G + V + +++ A A + A + + Sbjct: 11 RRASRQRGAFSVMAIIATLIAITTLG-AIGVGNLFFQRRDVQRIADMAALAAVQRMDDAC 69 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ A ++ E + T+ + V Sbjct: 70 SQPTATATSNAQSNGLDASNGDTITIECGRWDTSVNPAPSYYAAATSGTTQLNAAKVVVT 129 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 + + P ++ S + + A + Sbjct: 130 RQVPFFFVGPPQTVSAVSTARSTNIDTFSVGATLAA 165 >gi|301767380|ref|XP_002919113.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VI) chain-like [Ailuropoda melanoleuca] Length = 1059 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 67/208 (32%), Gaps = 37/208 (17%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V R +K Sbjct: 59 VDKVKAFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIISPLRPMPS--DRDALKAS 116 Query: 328 AIDENEMG-STAINDAMQTAYDTI--------------ISSNEDEVH----RMKNNLEAK 368 G T + A++ + + + + ++L+ Sbjct: 117 VDAVKYFGKGTYTDCAIKKGLEGLXXXXXXXXXXXXXASAGGRRPARSRGCQGGSHLKEN 176 Query: 369 KYIVLLTDGENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 KY++++TDG + +E N+AK G+++ ++A + LS A+ Sbjct: 177 KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIAT 231 Query: 425 PNSF---FEANSTHELNKIFRDRIGNEI 449 +++ F A + + I I Sbjct: 232 DHTYRRNFTAADWGQTRDA-EEIITQTI 258 >gi|150377243|ref|YP_001313838.1| hypothetical protein Smed_5128 [Sinorhizobium medicae WSM419] gi|150031790|gb|ABR63905.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 436 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 56/184 (30%), Gaps = 4/184 (2%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 +++ ++ +K+ G +I A+ PV++G G+ + W + L+ AA + Sbjct: 14 SSRWQKPLRRFLKAEGGAVAVIAAVAFPVLVGAMGLGAETGYWYLEKRKLQHAADVSAYA 73 Query: 64 ASVPLIQSLEEVSSRAKNSF----TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A+V ++ + A + + N T R + Sbjct: 74 AAVRHRAGDQQSALEAAARRVAGGSGFSPGDLTLSTASAAAGGSNNVTVELTETHPRLFS 133 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + ++ L L + + + E + S S Sbjct: 134 SVFGTGTITIKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVQLSGCSVVSNSSASD 193 Query: 180 QWVI 183 +++ Sbjct: 194 AFLM 197 >gi|221315959|ref|ZP_03597764.1| hypothetical protein BsubsN3_19772 [Bacillus subtilis subsp. subtilis str. NCIB 3610] Length = 235 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 45/129 (34%), Gaps = 15/129 (11%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G+H ++ + G T I A+ + + + K + L Sbjct: 110 YGLHPYEGSLFDNSLSELKPTGWTPIAKALADTRKEFEA----------FDADGKNVVYL 159 Query: 374 LTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +TDGE T + I K ++ + + I F+ + E+ + A + A Sbjct: 160 ITDGEETCGGDPAAEI-EKLRASNVDTIVNIIGFNFDVKGNEEMKQA--AVAGGGEYISA 216 Query: 432 NSTHELNKI 440 NS E + Sbjct: 217 NSADEFEQA 225 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 113/469 (24%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + +++T + + Q + + ++ L + T + Sbjct: 172 PENMKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWRPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNISKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|149019071|gb|EDL77712.1| procollagen, type XII, alpha 1, isoform CRA_c [Rattus norvegicus] Length = 1721 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDIHAYNVADFES 199 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 200 LSKIVDDL 207 >gi|144898055|emb|CAM74919.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 174 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 51/170 (30%), Gaps = 6/170 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 +L G + AL+ + +V+ ++ ++ A + A Sbjct: 1 MRFPLTKILHRLRGDSRGSTAVEFALVALPFFLMIAGMVETGYVAFKAAVMEGATREAAR 60 Query: 63 TASVPLIQSLEEVSSRAKNSF-----TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 ++Q + ++R + F + + +R+F + Sbjct: 61 QVRTGVVQGAGDAAARFQQEFCPNLIGLFPCQDFYFDVRSFADFATIALPAPVFDAAGIP 120 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLL-NPLSLFLRSMGIKSWLIQTKAEAET 166 T V+ +P + V + + PL L G + + + T Sbjct: 121 TNVQFSPGGANTVVTVRVIHVHSFITPLIGSLMGGGDGTLPLISTTVMRT 170 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 113/469 (24%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + ++ Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPENTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGTQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDSYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|166366808|ref|YP_001659081.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089181|dbj|BAG03889.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 456 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/284 (10%), Positives = 79/284 (27%), Gaps = 36/284 (12%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + ++ ++ + ++ ID ++ E +P A+ + Sbjct: 27 LSLSMIMLTALITPAWGIKPEITDIEDIDDKVTLQIQVTGEESKP---IMGLAESDFQLK 83 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYML--------YPGPLDPSLSEEHFVDSSSLRHV 267 K P+ + + G ++ S D+ ++ Sbjct: 84 VLDKKNNKTYQGKQLPFDWKSPRETTPPDAWIVVLIDFSGSMNCSQDLNTKCDAKAVAKG 143 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +K +AL + I K + F +D + + + + Sbjct: 144 KRKLDAAINALGTFI---KLASERKGNTYLSIVPFGVEGKNDKPGACDYYPKVTSETLDN 200 Query: 328 AI---------------DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY-- 370 D+ +T A++ + E + + K Sbjct: 201 FNLVQDVKLTNFLGSLADKTPCATTNFYQALKETVKFFKNDKEGRFYPKDKEGKPLKPQP 260 Query: 371 ---IVLLTDGENTQDNEE--GIAICNKAKSQGIRIMTIAFSVNK 409 I+LL+DG + N + + N ++ I + T+ + + Sbjct: 261 RLSIILLSDGFDNNSNYQEVQKTLANLQNNKDIVVHTLGYGLTP 304 >gi|152993598|ref|YP_001359319.1| hypothetical protein SUN_2020 [Sulfurovum sp. NBC37-1] gi|151425459|dbj|BAF72962.1| hypothetical protein [Sulfurovum sp. NBC37-1] Length = 940 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 20/166 (12%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + K + RDAL +V+R + + +D + K+ + V Sbjct: 42 NGVTKIEIARDALKNVVREWN---PNVELGLTVYGHRSKGDCNDIEVVIPIGKVDKKRVI 98 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + G T I+ +++ A + E K I+L++DG+ T + + Sbjct: 99 DTVMKIKPKGKTPISRSLRKAAGEL------------KYTEEKATIILISDGKETC-DPD 145 Query: 386 GIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 A + K +GI + F+V+K ++ A+ +F Sbjct: 146 PCATAKELKKEGIDFVAHVVGFNVDKKTDKQLECI--ANATGGEYF 189 >gi|222055527|ref|YP_002537889.1| von Willebrand factor A [Geobacter sp. FRC-32] gi|221564816|gb|ACM20788.1| von Willebrand factor type A [Geobacter sp. FRC-32] Length = 569 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 34/266 (12%), Positives = 71/266 (26%), Gaps = 27/266 (10%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 S++ D N PA++ + + K ++ + + Sbjct: 321 SVVANFGDVVRVEGNDIVMPANKNIDNALFYKLKTVLQLVAQRNIAYNRGLPSGKIDRTR 380 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT--FFNDR 305 S F + + L+ D S+ K + +N Sbjct: 381 LYRAPTSGAIFNFKKTNYQLHNDIVLLVDCTGSMAEPNKWTKTEVIYQTLFMALHAYNRN 440 Query: 306 VIS-DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + ++ G TA +A+ + S+ Sbjct: 441 ARIFAYNERNDTCRITEIYKNGQFYSVMPHGRTASGEAIIATALNLKQSSRKP------- 493 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAI--CNKAKSQGIRIMTIAFSVNK-TQQEKARYFLSN 421 +I+ LTDG AI C K++ +R++TI + + A Sbjct: 494 -----FIIHLTDGAANWGCGVSDAIRYC---KARNVRLLTIGLECDPMNKTALAEEH--- 542 Query: 422 CASPNSFFEANSTHELNKIFRDRIGN 447 N +L +F+ + + Sbjct: 543 ---GEMIQFVNDVDQLPTVFKQLLSS 565 >gi|83745131|ref|NP_001032954.1| collagen alpha-1(XXVIII) chain precursor [Mus musculus] gi|123789585|sp|Q2UY11|COSA1_MOUSE RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor gi|83423286|emb|CAI67593.1| collagen, type XXVIII [Mus musculus] gi|189442117|gb|AAI67245.1| Collagen, type XXVIII, alpha 1 [synthetic construct] Length = 1141 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 55/161 (34%), Gaps = 17/161 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTII 351 T R+G ++ +V S K K + +G T A+Q A D Sbjct: 833 GTARIGIINYSHKVEKVASLKQFSSKDD---FKLVVDNMQYLGEGTYTATALQAANDMFK 889 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + KK +++TDG+ +++D ++ + A + I I Sbjct: 890 EARPG----------VKKVALVITDGQTDSRDKKKLADVVKDANDSNVEIFVIGVVKKDD 939 Query: 411 QQEKA-RYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ ++ + L + ++ +I Sbjct: 940 PNFEIFHKEMNLIATDAEHVYQFDDFFTLQDTLKQKLSKKI 980 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 40/129 (31%), Gaps = 11/129 (8%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + ++ +++ A F+ V DP S T A+ Sbjct: 80 QLTPGRSLKYDIKLAALQFSSSVQIDPPLSSWKDLRTFKQRVKSLNLI--GQGTFSYYAI 137 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + + K +L+TDG + + + +I A+ GI +T+ Sbjct: 138 SNATRLLKREGR---------KDGVKVALLMTDGIDHPKSPDVQSISEDARILGISFITV 188 Query: 404 AFSVNKTQQ 412 S + Sbjct: 189 GLSTVVNEA 197 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 113/469 (24%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY+ G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYTEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNISKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDSRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|167526012|ref|XP_001747340.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774175|gb|EDQ87807.1| predicted protein [Monosiga brevicollis MX1] Length = 1632 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 50/149 (33%), Gaps = 11/149 (7%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L V + + D + R+G F+ + + + + K ++ ST Sbjct: 360 LDFVKQVVSYFDISANATRVGVITFSSSAVINFNLNSFYDKSD--MLDAIDNINYPASST 417 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 I+ A+ + ++ + K +V+LTDG+ + E K G Sbjct: 418 RISLALASVRQNMLKEYNGMRPESEGVP---KVVVVLTDGQASSGYEPAYEAA-LLKDMG 473 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + + +I + L + A+ Sbjct: 474 VNMFSIGIGSSID-----TDQLEDMATAP 497 >gi|296484270|gb|DAA26385.1| collagen, type XII, alpha 1 [Bos taurus] Length = 3115 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 46/140 (32%), Gaps = 18/140 (12%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 HK +++++ A + G+T A+ + R +K VL+ Sbjct: 1254 NAHKDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLI 1307 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG++ D E K K +G+ + I + L A+ + Sbjct: 1308 TDGKSQDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDIHAYNV 1358 Query: 432 NSTHELNKIFRDRIGNEIFE 451 L++I D N Sbjct: 1359 ADFDSLSRIVDDLTNNLCNS 1378 >gi|299136327|ref|ZP_07029511.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298602451|gb|EFI58605.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 376 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 55/133 (41%), Gaps = 11/133 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ T G TA+ +D++ + +D++ ++ + E +K IVL++DG++ Sbjct: 212 DMLDTGIRRLRPGGGTAL-------FDSLYRTCKDQMLTLQQDAEVRKAIVLVSDGDDDY 264 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNK 439 + + TI+ +V + ++K L A+ F ++ Sbjct: 265 SRVLETEAIKMCQRAETIVYTISTNVGPS-RDKGDDVLQQISDATGGQSFYPQRIDDVAI 323 Query: 440 IFRDRIGNEIFER 452 FR+ I E+ + Sbjct: 324 GFRN-IEEELRSQ 335 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 37/128 (28%), Gaps = 15/128 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D R+G ++DR + G + V + G T A++ Sbjct: 22 VNTFDISPTATRVGVVQYSDRNSLVFNL--GNKVNKPSTVSAINGISYQSGGTNTGAALK 79 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + K I++LTDG+++ + G+ + I Sbjct: 80 YVRQ----------YAAWREGNVPKVIIVLTDGKSSDSVSGPS---RDLVAAGVEVYAIG 126 Query: 405 FSVNKTQQ 412 Q Sbjct: 127 VGKFDHGQ 134 >gi|124127041|gb|ABM92272.1| CD11b [Ovis canadensis] Length = 1152 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 61/177 (34%), Gaps = 17/177 (9%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 ++ +++V+ +K + ++D + F++ K + Sbjct: 162 DPVDFDRMKKFVSTVMSRFQKSKTL-----FALMQYSDDFRT--HFTFNDFKRNSDLELL 214 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G T ++ + S+ + A K ++++TDGE D E Sbjct: 215 VRPIGQLFGRTHTATGIRKVVRELFHSSNGARN------HALKIMIVITDGEKYLDPLEY 268 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 + +A +GI I + +K+R L AS + F+ N+ L I Sbjct: 269 RDVIPEADRKGIIRYVIGVG-DAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTI 324 >gi|73976419|ref|XP_852853.1| PREDICTED: hypothetical protein XP_847760 [Canis familiaris] Length = 642 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 52/146 (35%), Gaps = 16/146 (10%) Query: 271 KHLVRDALASVIRSIKKIDNV---NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ + ++ V +++ A F+ V DP FS K Sbjct: 64 FDKQKDFVNSLSDKVFQLTPVGFLKYDIKLAALQFSSSVQIDPPFSSWKDL---QTFKQK 120 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 N +G T A+ A + + K +L+TDG + N + Sbjct: 121 VKSMNFIGQGTFSYYAISNATMLLKREGR---------KDGVKVALLMTDGIDHPKNPDV 171 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A++ GI +TI S + Sbjct: 172 QSISEDARTAGILFITIGLSTVVNEA 197 >gi|330469087|ref|YP_004406830.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812058|gb|AEB46230.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 316 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 67/203 (33%), Gaps = 31/203 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++A +R + +T +G F + + Sbjct: 107 TRLEGAQEAAKQFVREL------PETYNVGLVSFAKSANVLVPPTKDRPAVT-----NAI 155 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 +TA +A+ T + I S IVLL+DG T A Sbjct: 156 DGLVLAEATATGEAVFTCLEAIRSV-----PADGAAGIPPARIVLLSDGFRTAGRSVEEA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEANSTHEL 437 A++ + + TIAF + R L+ A + F+EA S EL Sbjct: 211 A-AAAQAANVPVSTIAFGTDAGHVAIGGQLQRVPVDRMALAALAETTEGYFYEAASVSEL 269 Query: 438 NKIFRDRIGNEIFERVI--RITK 458 ++++D G+ I R IT+ Sbjct: 270 KQVYQDM-GSSIGFRTEAREITQ 291 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 111/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYEDNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|73958268|ref|XP_848527.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 2 [Canis familiaris] Length = 1153 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 18/195 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + +++ +++V+ K + F++ Sbjct: 146 PRQDSDIAFLIDGSGSINP-TDFQRMKEFVSTVMDQFKNSKTL-----FSLMQFSEDFQI 199 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 +F+ + + G T ++ + S+ Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLL--GRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKI 440 + F N+ L I Sbjct: 311 DHVFRVNNFEALKTI 325 >gi|73958266|ref|XP_856286.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 3 [Canis familiaris] Length = 789 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 18/195 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + +++ +++V+ K + F++ Sbjct: 146 PRQDSDIAFLIDGSGSINP-TDFQRMKEFVSTVMDQFKNSKTL-----FSLMQFSEDFQI 199 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 +F+ + + G T ++ + S+ Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLL--GRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKI 440 + F N+ L I Sbjct: 311 DHVFRVNNFEALKTI 325 >gi|73958270|ref|XP_856370.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 4 [Canis familiaris] Length = 1036 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 18/195 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + +++ +++V+ K + F++ Sbjct: 146 PRQDSDIAFLIDGSGSINP-TDFQRMKEFVSTVMDQFKNSKTL-----FSLMQFSEDFQI 199 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 +F+ + + G T ++ + S+ Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLL--GRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKI 440 + F N+ L I Sbjct: 311 DHVFRVNNFEALKTI 325 >gi|73958264|ref|XP_547048.2| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 1 [Canis familiaris] Length = 1165 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 18/195 (9%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +D S + +++ +++V+ K + F++ Sbjct: 146 PRQDSDIAFLIDGSGSINP-TDFQRMKEFVSTVMDQFKNSKTL-----FSLMQFSEDFQI 199 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 +F+ + + G T ++ + S+ Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLL--GRTHTATGIRKVVRELFHSSSGARENAL------ 251 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--- 425 K +V++TDGE D + + +A +GI I + K R L+ AS Sbjct: 252 KILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVG-DAFNHLKNREELNIIASKPPR 310 Query: 426 NSFFEANSTHELNKI 440 + F N+ L I Sbjct: 311 DHVFRVNNFEALKTI 325 >gi|268580761|ref|XP_002645363.1| Hypothetical protein CBG15420 [Caenorhabditis briggsae] Length = 862 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 55/164 (33%), Gaps = 17/164 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR+G ++D ++ + S + I + T A+ A D I Sbjct: 68 DAVRVGIVQYSDEAKTEFNLS-RYSERNDIITHLETLKFMPGEDTRTGVALSKADDEIFD 126 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + L+A + I++ TDG + + +G++I TI+ + Sbjct: 127 ------YDGGARLKATRLIIVFTDGLSMDKPTL---AAKALRRKGVKIYTISVNSIGFVP 177 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 E L ++ F + + + +G R I Sbjct: 178 E----MLGIVGDADNVFGPTDENRIEE---RLLGEVESSRSCEI 214 >gi|225873423|ref|YP_002754882.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] gi|225793805|gb|ACO33895.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] Length = 339 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 46/129 (35%), Gaps = 11/129 (8%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTA-YDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + D + G TA+ DA+ +A D ++++ + ++ I+L++D Sbjct: 169 SNNLDTLSSAIQDLHPGGGTALYDAVYSACRDKLLNAASGPI-------YVRRAIILVSD 221 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST 434 G++ Q + + + I ++ + T + L A + F Sbjct: 222 GDDNQSHAYLTDAIKECQRAQTAIYAVSTDTDPT-PDPGDDILRKMAEETGGRAFFPRVI 280 Query: 435 HELNKIFRD 443 L F Sbjct: 281 TNLPASFNS 289 >gi|73973310|ref|XP_867431.1| PREDICTED: similar to alpha 1 type XII collagen short isoform precursor isoform 2 [Canis familiaris] Length = 1901 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEDELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 200 LSKIVDDL 207 >gi|116623354|ref|YP_825510.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226516|gb|ABJ85225.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 328 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 51/183 (27%), Gaps = 27/183 (14%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + + A+ +R+ D F+D+ +F+ Sbjct: 121 DASSSMKNRMDKSVAAIQQFLRTNMAGDEY------FLVRFSDKPTMVTTFT-----HNP 169 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + +G TA++DA+ + + ++ + +LTDG + Sbjct: 170 AEIASELTLIKPLGWTALHDAIYLGTQQMRKAKNS-----------RRALFVLTDGGDNN 218 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + N +R+ I E+ + A S +L Sbjct: 219 SRYTEAEVKNFVVESDVRVYAIGLFERPKFLERLAAL-----TGGEALWAKSLKDLPDAI 273 Query: 442 RDR 444 Sbjct: 274 ERI 276 >gi|332206625|ref|XP_003252399.1| PREDICTED: collagen alpha-1(XXVIII) chain [Nomascus leucogenys] Length = 1129 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 867 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 924 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 925 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 974 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 975 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 1013 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 51/145 (35%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K +L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVALLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TI S + Sbjct: 173 SISEDARISGISFITIGLSTVVNEA 197 >gi|260837282|ref|XP_002613634.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] gi|229299020|gb|EEN69643.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] Length = 1460 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 31/242 (12%), Positives = 65/242 (26%), Gaps = 26/242 (10%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV----GIRDEKLSPYMVSCNKS 239 D + D+ +P ++ S + + N Sbjct: 1091 DSCATFTAGNGDNTVRPNVYDIMKSNIHEMSGVTWQYYGAKEGEYHQFPKNDRSCEGNDH 1150 Query: 240 LYYMLYPGPLDPSLSE-------EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + Y P + + +L + A +V+ ++ D Sbjct: 1151 RFRNWYVSAASPKKKNVVIVMDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRD--- 1207 Query: 293 DTVRMGATFFNDRVISDPS-FSWGVHKLIRT---IVKTFAIDENEMGSTAINDAMQTAYD 348 G F+ R + + + T I+K F T + A+D Sbjct: 1208 ---WGGVVSFSARAETPEGCLGDSLGEANPTNIGIMKDFINQRVPETITMYGVGFRKAFD 1264 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC--NKAKSQGIRIMTIAFS 406 + + ++ + I+ L+DG T + AI + + + I T Sbjct: 1265 -MFAEARNKKPEQFEDCYN--IIIFLSDGSPTDKDFALNAITQGQELMDRSVYIFTYGLG 1321 Query: 407 VN 408 N Sbjct: 1322 AN 1323 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 48/146 (32%), Gaps = 15/146 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS-FSWGVHKLIRT---IV 324 + +L + A +V+ ++ D G F+ R + + + T I+ Sbjct: 231 NRLNLAKQAALTVLDTLTPRD------WAGVVSFSARAKAPEGCLGDSLGEANPTNIGIM 284 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K F T + + A++ S + + ++ I+ LTDG+ T Sbjct: 285 KDFINQRVPETITVYAEGFKKAFNMFFESKNKKPEQFEDCQN---IIIFLTDGQPTDTYF 341 Query: 385 EGIAIC--NKAKSQGIRIMTIAFSVN 408 I + + I T N Sbjct: 342 TLDDIVKGQDLMERSVHIFTYGLGAN 367 >gi|154759255|ref|NP_001032852.2| collagen alpha-1(XXVIII) chain precursor [Homo sapiens] gi|167009138|sp|Q2UY09|COSA1_HUMAN RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor Length = 1125 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 834 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 891 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 892 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 941 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 942 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 980 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 53/145 (36%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K ++L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVVLLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TIA S + Sbjct: 173 SISEDARISGISFITIALSTVVNEA 197 >gi|119613999|gb|EAW93593.1| hCG19532, isoform CRA_a [Homo sapiens] Length = 302 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 11 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 68 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 69 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 118 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 119 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 157 >gi|51095063|gb|EAL24307.1| similar to matrilin 2 precursor [Homo sapiens] Length = 448 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 157 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 214 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 215 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 264 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 265 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 303 >gi|257887439|ref|ZP_05667092.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] gi|257823493|gb|EEV50425.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] Length = 1107 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 41/427 (9%), Positives = 105/427 (24%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + +E V + +S L L Sbjct: 101 YHSQANKLIVDNKEYDIIDETENKKETDVSVPKADEIEEESSKENENSVS---PFTLPTL 157 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + + N + Sbjct: 158 SLPAVSVPSNQTISTEYTTDDQGTYPKANWQPTGNTNVLDHQGNKNGSNQWDGI---NSW 214 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + + + +Y + Sbjct: 215 DGDPNDRTHSYIEYGGTGNQADYAIRKFAKETTTPGLFDVYLNARGNVQKDITPLDLVLV 274 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ + + + + Sbjct: 275 VDWSGSMNNNDRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGHNYSNGTVQMGS 333 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T ++ A D + + +V R+ Sbjct: 334 FDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 393 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 394 RAQSSNDYYGTQFSNTQDQPGNTSRIARSYYAPDQNNQSRRIDSTFIATIGEAMALKERG 453 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCAS----PNSFFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 454 IEIHGLGIQLQSDSAAGLSKAEVESRMRKMVSADEKGDLYYESADHATDISEYLAKK-AV 512 Query: 448 EIFERVI 454 +I V Sbjct: 513 QISATVS 519 >gi|206559892|ref|YP_002230656.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] gi|198035933|emb|CAR51825.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] Length = 164 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 38/140 (27%), Gaps = 7/140 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + + G I AL++PV + ++ + L + A+ +L Sbjct: 10 MRRRERGATAIEFALMLPVFFLILYAIITYGMIFAAQQNL-------TLAATEGARAALN 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A S +Q + T + + + V + Sbjct: 63 YQQVGAAASVQAAQQAALAARAQAACTAATNLTTWLKGSTCSPTQQGNCSYDPTMLCVQI 122 Query: 134 SSRYDLLLNPLSLFLRSMGI 153 + Y +PL +G Sbjct: 123 TLTYPYSQSPLIPSFPLLGS 142 >gi|254447588|ref|ZP_05061054.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] gi|198262931|gb|EDY87210.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] Length = 1197 Score = 45.7 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 46/334 (13%), Positives = 88/334 (26%), Gaps = 76/334 (22%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP---YMVSCNKSL 240 ++ ++ +C +V Y++ NG P Y S + Sbjct: 166 PSRWETIEENHSNDEVAYDCERDEVGNSVTEYAAINGSFDNSTGGDLPSAAYTSSEDFED 225 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + ++++ + K ++ + I +DNVN G Sbjct: 226 RLNWSSEDYSARVVTGNYLNWQENHNDEKTVTRLKAVQNVLHDVIGGMDNVN----FGLM 281 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM---------------QT 345 F + S R + D G T + +A+ + Sbjct: 282 DFYGKGRVIYPVSLVND--DRVQLLKTVDDLEANGGTPLQEALHESGRYLMGDTYYYGER 339 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG--------------IAICN 391 + S+ N +VLLTDGE T + C Sbjct: 340 GVSSAFESDGKTYDSPILNECQANAVVLLTDGEPTDYGTDNVIHDMIKPVHEKNGTRRCT 399 Query: 392 KAKSQG-----------------------------IRIMTIAFSVNKTQQEKARYFLSNC 422 + G + T+ F + + L + Sbjct: 400 SYRDNGNDYTRYCLPEIAQFMRDFDLSSGLDGDQTAQTFTVGFKSD-------QELLQDT 452 Query: 423 ASPN--SFFEANSTHELNKIFRDRIGNEIFERVI 454 AS ++F+AN + EL + +D + E Sbjct: 453 ASKGQGNYFQANDSVELTQALKDIVRKIKSESST 486 >gi|291223809|ref|XP_002731900.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 992 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 61/183 (33%), Gaps = 21/183 (11%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + +R A ++ I + + + +G F++ + V+ + Sbjct: 337 RLTKLRQAASAFI-----RNTICEGSYLGIVEFSEFAQELAPLTL-VNGSDSREGLIRRL 390 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T+I + + + ++ ++ I+ ++DG + A+ Sbjct: 391 PHSVGGWTSIGAGIMKGIEVLSTNGQNPEGG---------LIMAISDGGENRAPTLSEAL 441 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE--ANSTHELNKIFRDRIGN 447 G+ I TIA+ Q ++ L+ SFF + + L F I + Sbjct: 442 -QAVDESGVTIDTIAY---SEQADENLASLAARTGGMSFFYSGDDDSTVLEDAFATSITS 497 Query: 448 EIF 450 Sbjct: 498 RAA 500 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 50/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E +I V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKIIVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + S+ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGSQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-------------------- 385 A D + + N KK IVLLTDG T Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYTVSRVQTEADGRFYGTQFTNRQ 407 Query: 386 -------------------------------GIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 I K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|258543794|ref|ZP_05704028.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] gi|258520969|gb|EEV89828.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] Length = 1128 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 39/318 (12%), Positives = 78/318 (24%), Gaps = 52/318 (16%) Query: 178 SIQWVIDFSRSMLDY-QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 ++ +D S SML+ S P + T + + + Sbjct: 62 NVMLFLDDSGSMLENANTGSYRIPDLWNTYLGNCTYNQNNLLISRNLLPGNGQQTNPSMG 121 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--DT 294 YY + +A+ V+ N + Sbjct: 122 RCLYYYNADIIYYRNKGLPPGPKSFL-RPAAQVRMDANINAINEVLTKTGDSVNWHLLTL 180 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII--- 351 N + L + + + + T + A + + Sbjct: 181 WGSEFRHLNGGFYKFDNIGTASVGLSAEKARKLVNNMSPIAGTPATERYLKAANVLDHQI 240 Query: 352 -----------------SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----- 389 + + + + + G N+ D +G+++ Sbjct: 241 KYRCQKNYIVFMSDGEANGGGFPWPGGVYGTQPQWWSQHDSSGPNSGDPGKGVSVFSSKL 300 Query: 390 ----------------CNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 + K Q I +TI + T Q R +L N A S ++F Sbjct: 301 YNMDMRVGGTDVEGGSWDDPKYPKQNIETITIGYGNGLTPQ--GRNYLKNAAQPSEGAYF 358 Query: 430 EANSTHELNKIFRDRIGN 447 AN+ EL F + Sbjct: 359 -ANNASELADAFLKALAK 375 >gi|13475415|ref|NP_106979.1| hypothetical protein mll6477 [Mesorhizobium loti MAFF303099] gi|14026167|dbj|BAB52765.1| mll6477 [Mesorhizobium loti MAFF303099] Length = 183 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 36/164 (21%), Gaps = 16/164 (9%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITAS 65 Y + G + L+ P+ML + + + + + ++ A A Sbjct: 8 LSRYLGRFRHDQRGAVMVEMTLITPLMLVLSAGVFEFGNLIHDKLLMEAGLTDAARYAAR 67 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + T + N + T N Sbjct: 68 CNSQLYTDSGLAAINCTTTAANIAVYGKPSVTVVNGAVTDTPRVSGWQTSNVTVTTNNSC 127 Query: 126 KSAY---------------QVVLSSRYDLLLNPLSLFLRSMGIK 154 + V S Y + F+ I Sbjct: 128 QDTVVGGVTQYRSTTAQVCTVRASGTYPYTGVGMLSFIGIGPIT 171 >gi|327310820|ref|YP_004337717.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] gi|326947299|gb|AEA12405.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] Length = 509 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 67/213 (31%), Gaps = 18/213 (8%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + + +D S K +A ++ S+ D V V F+++V Sbjct: 35 APETNTAFAILIDKSKSMADFDKLAHAIEAAKGLVESMAPEDIVAVYV------FDEKVK 88 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKNNL 365 + + + T + A+ D + R Sbjct: 89 ALVPPTP---VEKARKMLGKIEKIKPGTYTLLYQALLQVIDDLRGIKRGLPLMPRRAVPE 145 Query: 366 EAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K IV++TDGE + A GI I I EK Y L+N Sbjct: 146 NIPKRIVVITDGEPWPYYTEERWYEHLGKAAARYGITISAIGIG--DDYNEKILYALAN- 202 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 +S +++ + +++++ + + V R Sbjct: 203 SSGGAWYHISQIRDISQVLANEL-RRAKTVVAR 234 >gi|148685681|gb|EDL17628.1| integrin alpha M, isoform CRA_c [Mus musculus] Length = 1037 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 150 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 201 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 202 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 255 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 256 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 314 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 315 QVDNFEALNTI 325 >gi|187923652|ref|YP_001895294.1| cyclic nucleotide-binding protein [Burkholderia phytofirmans PsJN] gi|187714846|gb|ACD16070.1| cyclic nucleotide-binding protein [Burkholderia phytofirmans PsJN] Length = 596 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 50/171 (29%), Gaps = 10/171 (5%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ---- 70 + G ++ + + V + G +DV + L++ A A + A + Sbjct: 25 ARRQHGAVAVVAVIWLSVAIAALG-ALDVGNVFFARRQLQRTADLAALAAVQMIGSTGGC 83 Query: 71 ----SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++ A I R + T + V+ T ++ P Sbjct: 84 ATATTAAQQNAAANGFTAGGTTSINTTCGRWDTSKSTYFGTSGNPLNAVQVTTTQVVPYF 143 Query: 127 SAYQVV-LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +S+ + + F GI S Q A + K Sbjct: 144 FLGPSRNVSATATAFASNIDAFSLGTGIASINTQQSALLNAILGGLLKTSV 194 >gi|73958316|ref|XP_848776.1| PREDICTED: similar to integrin, alpha D precursor [Canis familiaris] Length = 1166 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 56/159 (35%), Gaps = 13/159 (8%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++ + +F+ + + + G T ++ + + S Sbjct: 194 NTLFSLIQYSHLLKIHFTFTQFQNSWNPLSLVDPIVQLK--GLTYTATGIRKVVEELFHS 251 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 AKK ++++TDG+ +D E + +A+ GI I + + Sbjct: 252 KNGAR------KSAKKILIVITDGQKYKDPLEYSDVIPQAERAGIIRYAIGVG-DAFWKP 304 Query: 414 KARYFLSNCASP---NSFFEANSTHELNKIFRDRIGNEI 449 A+ L N S + F ++ L+ ++++ +I Sbjct: 305 SAKQELDNIGSEPAQDHVFRVDNFAALSS-IQEQLQEKI 342 >gi|311268149|ref|XP_003131915.1| PREDICTED: integrin alpha-E-like, partial [Sus scrofa] Length = 1032 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 44/153 (28%), Gaps = 15/153 (9%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 + + ++ + T AMQ D I + Sbjct: 246 LVQYGSVIQTELDLQDSQDVAAS--LDRVQNITQVGSVTKTASAMQHVLDNIFT------ 297 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +A K +V+LTDG+ D + N K QG+ I + Sbjct: 298 PSHGSKAKASKVMVVLTDGDIFDDPLNLTTVINSPKMQGVERFAIGVG-GAFNKSNTYNE 356 Query: 419 LSNCASP---NSFFEANS---THELNKIFRDRI 445 L AS + F+ + L + RI Sbjct: 357 LKLIASDPDEDHAFKVTNYMALDGLLSKLQQRI 389 >gi|227114884|ref|ZP_03828540.1| hypothetical protein PcarbP_18065 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 204 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 37/93 (39%), Gaps = 2/93 (2%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 ++ ++F+ ++ S + TAL P++L +G + D+ ++Q Sbjct: 5 LLKTSRFLSGMRRFWLSRRASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQRV--G 62 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 I + + + Q L+E + P + Y Sbjct: 63 AIASVLAMQQKLDESGLQGLLDTMLPVEGAGNY 95 >gi|222055940|ref|YP_002538302.1| hypothetical protein Geob_2856 [Geobacter sp. FRC-32] gi|221565229|gb|ACM21201.1| conserved hypothetical protein [Geobacter sp. FRC-32] Length = 391 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 25/259 (9%), Positives = 65/259 (25%), Gaps = 15/259 (5%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G +++ V L + G+ +++ E L AA+ + + + + Q Sbjct: 4 QEEQGMDTTYVGIMLVVFLVIAGLAINIGYMYVSEDDLHNAAELSALAGAQAIGQ----- 58 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ T + E + N + L++ Sbjct: 59 -QMQLSARTGTGKLKETIYDQVQPAARAAAIDHVSGHHQASALIEIRNSNIN----RLTT 113 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 DL + ++ R+ + + ++ ++ Sbjct: 114 ENDLTMGFWNISSRTYTPGGTPVNAVQVRTRRTAESESAGLGTLGSILSKISGSQKLNYT 173 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 E A+ +V + + +S + S M P+ S Sbjct: 174 PEAVAAIPALADANFSVCVDACGTECTYPNICTIQERKLSPDSSDPRMD-----SPTKSR 228 Query: 256 EHFVDSSSLRHVIKKKHLV 274 + S + Sbjct: 229 YAYTSLSYPPGDTMSLSNL 247 >gi|26333265|dbj|BAC30350.1| unnamed protein product [Mus musculus] Length = 1036 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|261854814|ref|YP_003262097.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261835283|gb|ACX95050.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 339 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 55/160 (34%), Gaps = 13/160 (8%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM---GSTAINDAMQTAYDTIISSNED 356 F+ V+++ + + ++ + TA+ D + A +I + ++ Sbjct: 152 DRFSVIVVTNSAGTLVPMTTDHAVLDYWIRQLRAGINGSDTALGDGLAMAIRSIAAQSQA 211 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +V+ TDG +T A++ GI++ T+ + + ++ + Sbjct: 212 GQPAP--------LLVVWTDGFSTGGLMTPAEALALARAYGIKLFTVNLAPKGSPPDQGQ 263 Query: 417 YFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 L+ A + A+ +N + + + Sbjct: 264 PSLAQLADLTGGKPILASDLAAMNAVTDQIAASVAPKSAT 303 >gi|119614000|gb|EAW93594.1| hCG19532, isoform CRA_b [Homo sapiens] Length = 304 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 11 TARIGIINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFEDA 68 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + A + I I Sbjct: 69 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPN 118 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 119 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 157 >gi|126178503|ref|YP_001046468.1| von Willebrand factor, type A [Methanoculleus marisnigri JR1] gi|125861297|gb|ABN56486.1| von Willebrand factor, type A [Methanoculleus marisnigri JR1] Length = 1002 Score = 45.7 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 21/197 (10%), Positives = 51/197 (25%), Gaps = 28/197 (14%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + R N + G+ ++ + T Sbjct: 558 SQNVGGWLGRDSGTWDDLSY-RTANYGNNRYYTEYATLDLGLSD-DQSAINTTIAGMIPD 615 Query: 335 GSTAINDAMQTAYDTIISSNE--------------DEVHRMKNNLEAKKYIVLLTDGENT 380 T + + A + + + + D NT Sbjct: 616 SGTPMRYGIYKAITELKDNGRSDAVKAVVVLGDGEYNYYGDPLARGTGRTTKF--DWSNT 673 Query: 381 QDN--------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFE 430 Q++ + A + +R+ I+FS + + AS + + Sbjct: 674 QNDYTYFSDLGAAEQRMSTYANNHNVRLYMISFSDDIVNGSHTWNTMETLASETGGTHYH 733 Query: 431 ANSTHELNKIFRDRIGN 447 A S +L +++ + G Sbjct: 734 APSDSDLAQVYTEIAGE 750 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 55/151 (36%), Gaps = 9/151 (5%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D VR+ ++ + F+ ++++ ++ G+T+ + A+ AY + Sbjct: 226 PDAVRLALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLT 285 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++ + KK I++ TDG + + K +G+ I I Sbjct: 286 DTDNEN----GVREGVKKMIIIFTDG---HSQRSPQDMALRLKDKGVEIFAITL-TPAPY 337 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFR 442 ++ LS + + F + L ++ Sbjct: 338 ADEGE-LLSITQNTDHIFTPVNLKVLITTYK 367 >gi|148685680|gb|EDL17627.1| integrin alpha M, isoform CRA_b [Mus musculus] Length = 1168 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 150 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 201 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 202 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 255 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 256 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 314 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 315 QVDNFEALNTI 325 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 113/469 (24%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQE- 413 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDSRAN 467 Query: 414 ----KARYFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + + S + ++E A+ +++ + V Sbjct: 468 LSKQQIEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|254443409|ref|ZP_05056885.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257717|gb|EDY82025.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 257 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 43/134 (32%), Gaps = 15/134 (11%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R G T + +++ T Y + + + + +V++TDG + Sbjct: 132 NRQAFIGSVSQIRAKGRTPLVESIVTGYRVLTEQAQRQSG------YGRYVLVIVTDGAS 185 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + N G+A+ + I + TI F V + A+S L Sbjct: 186 SDGNPAGVAM-EVTRESPIEVQTIGFGVADHALNLP--------GVTQYVTASSPKALID 236 Query: 440 IFRDRIGNEIFERV 453 I +E + Sbjct: 237 ALNQVIASESESFI 250 >gi|124956|sp|P05555|ITAM_MOUSE RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: CD_antigen=CD11b; Flags: Precursor gi|52983|emb|CAA30479.1| unnamed protein product [Mus musculus] Length = 1153 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|293361231|ref|XP_243912.5| PREDICTED: collagen alpha-1(XII) chain [Rattus norvegicus] Length = 3119 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDIHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|149732356|ref|XP_001492551.1| PREDICTED: similar to complement component 2 [Equus caballus] Length = 751 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 45/152 (29%), Gaps = 16/152 (10%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 V ++I ++ D T +A+ + Y + + + E + ++LL Sbjct: 315 DVMEVINSLDNIHYKDHENGTGTNTYEALNSVYIMMNNQMQRLGMNTVAWQEIRHAVILL 374 Query: 375 TDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASP 425 TDG++ + K I+ I I + S Sbjct: 375 TDGKSNMGGS-PKLAVDNIKELLNIKQKRNDYLDIYAIGVGNLDVDWRELNELGSKKDGE 433 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F L+++F V ++T Sbjct: 434 RHAFILKDAEALSQVFEHM------LDVSQLT 459 >gi|553646|gb|AAA60557.1| inter-alpha-trypsin inhibitor heavy chain old gene name 'ITI' [Homo sapiens] Length = 325 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 12/85 (14%), Positives = 29/85 (34%), Gaps = 3/85 (3%) Query: 372 VLLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 ++LTDG+ T+ + I ++ + + F N + N Sbjct: 1 IMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRI 60 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 +E + + + F ++ + V Sbjct: 61 YEDHDATQQLQGFYSQVAKPLLVDV 85 >gi|149914293|ref|ZP_01902824.1| hypothetical protein RAZWK3B_19871 [Roseobacter sp. AzwK-3b] gi|149811812|gb|EDM71645.1| hypothetical protein RAZWK3B_19871 [Roseobacter sp. AzwK-3b] Length = 177 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 5/48 (10%), Positives = 19/48 (39%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 ++ ++ G+ I ++ PV + +++ + L++ Sbjct: 9 LRRFRRAEDGNASIEFVIIFPVFMMFLIFALELSFITLRHAMLERGLD 56 >gi|330417948|ref|NP_001193426.1| collagen alpha-1(XII) chain [Bos taurus] Length = 3065 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 46/140 (32%), Gaps = 18/140 (12%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 HK +++++ A + G+T A+ + R +K VL+ Sbjct: 1254 NAHKDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLI 1307 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG++ D E K K +G+ + I + L A+ + Sbjct: 1308 TDGKSQDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDIHAYNV 1358 Query: 432 NSTHELNKIFRDRIGNEIFE 451 L++I D N Sbjct: 1359 ADFDSLSRIVDDLTNNLCNS 1378 >gi|114046070|ref|YP_736620.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] gi|113887512|gb|ABI41563.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] Length = 1215 Score = 45.4 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 38/124 (30%), Gaps = 8/124 (6%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +K + GSTA+ D A D +I+ + +L DG++ Sbjct: 378 KQALKAAIDNIYASGSTALFDGSSLALDNLITYQTAAASGAPG------VVFVLADGDDN 431 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + ++ + I + F ++N F + E+ Sbjct: 432 SSIKNEATVITAYQNANVPIFS--FGYGSASPTGPLVTMANATGGKYFSSPTTLSEIIDA 489 Query: 441 FRDR 444 F Sbjct: 490 FLQA 493 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|132626289|ref|NP_001076429.1| integrin alpha-M isoform 1 precursor [Mus musculus] Length = 1154 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|78189842|ref|YP_380180.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172041|gb|ABB29137.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 334 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 44/156 (28%), Gaps = 8/156 (5%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + L + I I R G F + + T++ A + Sbjct: 104 TDVAPNRLMRAKQEIAAISQNVQGGRRGLLIFAASPLLHCPLTTDRDGF-ATLLNMAAPE 162 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 E T + A A +NE + IVLL+DGE+ + Sbjct: 163 LIEEQGTRLQPAFALASTIFDVANESNAASTRGVQ----VIVLLSDGEDH--DSNVQRAA 216 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARY-FLSNCASP 425 + Q +++ I K AS Sbjct: 217 QQLAKQSVQLFVIGVGSLKPSPIPLADGSFKRDASG 252 >gi|74213162|dbj|BAE41718.1| unnamed protein product [Mus musculus] Length = 1153 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|74191759|dbj|BAE32836.1| unnamed protein product [Mus musculus] Length = 1166 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|229125374|ref|ZP_04254461.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] gi|228658081|gb|EEL13834.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] Length = 452 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 56/179 (31%), Gaps = 24/179 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-------IRT 322 K + A+ + + I + + +D S S G ++ + Sbjct: 173 KMEAAKKAIYNYLDKIPD---NANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKKE 229 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + G T + A+++ D + E + +++DGE T Sbjct: 230 QFNAALSNFGPKGWTPLASAIESVNDDF---------KEYTGEENLNVVYIVSDGEETCG 280 Query: 383 NEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + I F V ++Q++ + A ++ ++ EL + Sbjct: 281 -GDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAE--AGKGNYATVSNADELYQ 336 >gi|146325834|sp|Q60847|COCA1_MOUSE RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor Length = 3120 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|132626295|ref|NP_032427.2| integrin alpha-M isoform 2 precursor [Mus musculus] gi|148685679|gb|EDL17626.1| integrin alpha M, isoform CRA_a [Mus musculus] gi|162318464|gb|AAI56095.1| Integrin alpha M [synthetic construct] gi|162319590|gb|AAI56992.1| Integrin alpha M [synthetic construct] Length = 1153 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|74178358|dbj|BAE32446.1| unnamed protein product [Mus musculus] Length = 1153 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|299143633|ref|ZP_07036713.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] gi|298518118|gb|EFI41857.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] Length = 1217 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 40/324 (12%), Positives = 80/324 (24%), Gaps = 38/324 (11%) Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 L + + + + + +R K I VID S SM D Sbjct: 158 TLDPDLVNEGSVKAYKPVVEEDPNIANKWTVKMLVAARDSVKTS--KIVLVIDTSGSMKD 215 Query: 192 YQRDSE-GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 + R N F +S + +S + + K + Sbjct: 216 FGRMKGAKNAANAFVDNVLDGSQSTQIGIVRFASNVSIVSDFTSNKAKLHSAINALSAEG 275 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + ++ + ++ A + + + + + D Sbjct: 276 GTFTQAGVKQARTMLAG-------SGADKKYMVVLSDGVPTFNYKISHPNNY----LIDG 324 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 H+ +T ++M Y+ + Sbjct: 325 GPGSHTHEKQTGKQLPENEFLYNESATGSGNSMWKQYERVKIGG---------------- 368 Query: 371 IVLLTDGENTQDNEEGIAICNK--AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNS 427 + T E N AI AK+ +R+ T+ + L + AS P Sbjct: 369 GIFPTY-EYHYYNSGNCAIAESGFAKAANMRVFTVGLQTD----ATGSGVLRDIASAPGD 423 Query: 428 FFEANSTHELNKIFRDRIGNEIFE 451 F E +L +F + Sbjct: 424 FTEVTDVSQLTPVFEQIAAQILKT 447 >gi|260836190|ref|XP_002613089.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] gi|229298473|gb|EEN69098.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] Length = 267 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 18/143 (12%), Positives = 41/143 (28%), Gaps = 16/143 (11%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + ++G + D + + + ++ G T A Sbjct: 29 NTTSDFQIGTAHTQVGVVQYEDNPYDEFPL--NQYATLDELLTAIRNITYRGGGTQTGKA 86 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + D ++ + + K ++++TDG++ +A GI ++ Sbjct: 87 IDHVVDNSLTESHGARPGV------PKVVIVVTDGQSWDSVVAP---AQRANHSGIIMVA 137 Query: 403 IAFSVNKTQQEKARYFLSNCASP 425 I E L AS Sbjct: 138 IGVGSGYDINE-----LMEIASS 155 >gi|148694465|gb|EDL26412.1| procollagen, type XII, alpha 1, isoform CRA_a [Mus musculus] Length = 1722 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 200 LSKIVDDL 207 >gi|55962355|emb|CAI11852.1| novel protein (zgc:56119) [Danio rerio] gi|56207242|emb|CAI21015.1| novel protein (zgc:56119) [Danio rerio] Length = 754 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 12/88 (13%), Positives = 30/88 (34%), Gaps = 3/88 (3%) Query: 370 YIVLLTDGENTQDNEEGIAICNKAK---SQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 I+L++DG+ T + I K + + ++ + R + N Sbjct: 297 MIILVSDGDPTVGEIKLSTIQKNVKLRMKEEFSLFSLGIGFDVDFDFLERIAMDNRGIAQ 356 Query: 427 SFFEANSTHELNKIFRDRIGNEIFERVI 454 + + E K F ++ + + + Sbjct: 357 RIYANQNAAEQLKTFYSQVSSPLLRTIT 384 >gi|16080726|ref|NP_391554.1| hypothetical protein BSU36730 [Bacillus subtilis subsp. subtilis str. 168] gi|221311632|ref|ZP_03593479.1| hypothetical protein Bsubs1_19856 [Bacillus subtilis subsp. subtilis str. 168] gi|221320871|ref|ZP_03602165.1| hypothetical protein BsubsJ_19720 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325156|ref|ZP_03606450.1| hypothetical protein BsubsS_19886 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313222|ref|YP_004205509.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] gi|8928526|sp|P70961|YWMD_BACSU RecName: Full=Uncharacterized protein ywmD; Flags: Precursor gi|1648854|emb|CAB03681.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636198|emb|CAB15690.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019496|gb|ADV94482.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] Length = 224 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 45/129 (34%), Gaps = 15/129 (11%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G+H ++ + G T I A+ + + + K + L Sbjct: 99 YGLHPYEGSLFDNSLSELKPTGWTPIAKALADTRKEFEA----------FDADGKNVVYL 148 Query: 374 LTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +TDGE T + I K ++ + + I F+ + E+ + A + A Sbjct: 149 ITDGEETCGGDPAAEI-EKLRASNVDTIVNIIGFNFDVKGNEEMKQA--AVAGGGEYISA 205 Query: 432 NSTHELNKI 440 NS E + Sbjct: 206 NSADEFEQA 214 >gi|301609304|ref|XP_002934186.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 934 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 40/132 (30%), Gaps = 28/132 (21%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC- 390 G T I ++ ++ + IVLLTDGE+ D ++C Sbjct: 376 TAGGGTNICAGIRAGIAL---------NKNFDGSSYGTEIVLLTDGEDNLD----TSLCF 422 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS---THELNKIFR----- 442 G I IA N ++ + + + F A +EL F Sbjct: 423 KDITDSGAIIHVIALGPNAAKELETIANM----TGGLRFNALDKVEANELIDAFSGLHSG 478 Query: 443 --DRIGNEIFER 452 + I I Sbjct: 479 NGEIIQQAIQLE 490 >gi|299134237|ref|ZP_07027430.1| Protein of unknown function DUF2134, membrane [Afipia sp. 1NLS2] gi|298590984|gb|EFI51186.1| Protein of unknown function DUF2134, membrane [Afipia sp. 1NLS2] Length = 547 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 27/267 (10%), Positives = 73/267 (27%), Gaps = 13/267 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ + G+ I A M ++ G + VDV + + Sbjct: 9 RHLGRQFVSDERGNIAFIAAACMLLVTGCAALGVDVGSVFTDKRRAQS----------AA 58 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + ++ S + + + ++ + + + + Sbjct: 59 DLAAIVAASDLTRAPRAAAATVAKNNYPPDSLVAVETGVYKADTSLVPQQRFTAGATPAN 118 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGI--KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 A +V + ++ L + + + S T+ + +G + ++ Sbjct: 119 AVRVTMQTKTPLFFGKVLTGDPYINLRVSSVASTTQLATFAIGSRLASLNGGLLNQLLGQ 178 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 + Q L + + +++ G+ + L + + + Sbjct: 179 MLGTSLSLSAMDYQALADAKVNLFDFMSALATRANLTGVSYDSLLQSNLKVTDIIAALQA 238 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKH 272 G + S V SSL V K Sbjct: 239 SGATGAASSALSNV-GSSLAGVTTKVT 264 >gi|149636044|ref|XP_001506552.1| PREDICTED: similar to collagen type XX alpha 1 [Ornithorhynchus anatinus] Length = 1500 Score = 45.4 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 35/114 (30%), Gaps = 15/114 (13%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 +++ + G+T A+ + ++ EA K ++ Sbjct: 289 DLNKFASKDKVLEAVRNLRYKGGNTFTGLALTHVLE------QNLKLEAGPRPEADKIVI 342 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 LLTDG + +E A K GI I I + L AS Sbjct: 343 LLTDG---KSQDEANAAAQALKDLGISIFAIGV------KNADEAELRQVASHP 387 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 2/48 (4%) Query: 411 QQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 E + L CAS +F+A +L F+ IG + +V R+T Sbjct: 2 APEGGQALLQYCASDASHYFQAEKMEDLFAAFK-AIGAKASTQVTRLT 48 >gi|3982895|gb|AAC83698.1| complement factor Bf-2 [Oncorhynchus mykiss gairdneri] Length = 749 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 50/192 (26%), Gaps = 21/192 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + R+A+ +I + + + +D + K +D Sbjct: 255 FNKARNAVKKLITKVSSFAVSPNYEILFLA--SDVFEVVNILDFLGEKRKTLEDVLADLD 312 Query: 331 ENEMG-----STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G T +N A +T + + + K ++ TDG Sbjct: 313 NFNYGDRQNVGTNLNLAFKTILERMAIQKQRNETLFKEVHH---VLIFFTDGAFNMGGRP 369 Query: 386 GIAIC----------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + + + + I ++ + + +F+ Sbjct: 370 DDTVAKVREMVYMNQKEERDKYLDIYVFGVGREIFDEDIQPLVTKRD-NEDHYFKLKDGT 428 Query: 436 ELNKIFRDRIGN 447 EL + F I Sbjct: 429 ELEETFDKIIDE 440 >gi|74222716|dbj|BAE42227.1| unnamed protein product [Mus musculus] Length = 1064 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 59 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 110 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 111 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 164 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 165 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 223 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 224 QVDNFEALNTI 234 >gi|315639746|ref|ZP_07894885.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] gi|315484523|gb|EFU74980.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] Length = 1228 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 50/441 (11%), Positives = 111/441 (25%), Gaps = 71/441 (16%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 S S + +Q++ + + K T + TA K + +V Sbjct: 169 EATTVSSEATSQSSSEQQVTQSMTEVSTETAKSYVTQEKEFTNESTTASSKAKDKESSKV 228 Query: 132 VLS--SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 S + + ++S+ + E T + + + S Sbjct: 229 QADRASVANFSAKSVLDSIQSLSATTSYSNVSPEYTTDATGIYPKSYWSPTGNTTVRNHQ 288 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + + A SY G G + + Y + Y +Y Sbjct: 289 GKLYGATSWDGVTTWNGDASNKTNSYIEYGGTGGNANFAIRKYAKETSTPGLYDVYLNVR 348 Query: 250 DPSLSEEHFVDSS-----SLRHVIKKKHLVRDALASVIRSIKKI----DNVNDTVRMGAT 300 + + +D S + VR + + I+ V + Sbjct: 349 GNEIKDIKPIDIVLVIDMSGSMEPSRAPAVRQGIKDFFKYIQDAGVSQYVNVGFVGYSSI 408 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 ++ V + T ++ +G T ++T D + S Sbjct: 409 NQSNYVGLIQEGIQPISNTTHTNNINTKLNRTFIGGTFTQLGIKTGADMLTSY------- 461 Query: 361 MKNNLEAKKYIVLLTDGENTQDN------------------------------------- 383 N +K ++LLTDG T Sbjct: 462 ---NSTNEKMMILLTDGVPTYSYKVNESTTINGTLYGTKFNTSLDNPGDTSKLNYPYIDN 518 Query: 384 ---------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN----CASPNSFFE 430 + K G ++ + + + + + AS + + Sbjct: 519 SNHKINDTFAATLGQAKITKDGGTQLHVLGIQLGPDKNYLTQNQVRAYTSLMASDGHYLD 578 Query: 431 ANSTHELNKIFRDRIGNEIFE 451 AN+ ++ ++ N + Sbjct: 579 ANTADDIKTYLENQAANVVSA 599 >gi|229113208|ref|ZP_04242703.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] gi|228670234|gb|EEL25582.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] Length = 425 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 56/187 (29%), Gaps = 24/187 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + K + A+ + + I + + +D S S G ++ Sbjct: 136 SMAGKVNGEVKMEAAKKAIYNYLDKIPD---NANVMLRVYGHKGSNNENDKSLSCGSSEV 192 Query: 320 -------IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T + A+++ D + E + Sbjct: 193 MYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDF---------KEYTGEENLNVVY 243 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +++DGE T + + + I F V ++Q++ + A ++ Sbjct: 244 IVSDGEETCG-GDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAE--AGKGNYAT 300 Query: 431 ANSTHEL 437 +S EL Sbjct: 301 VSSADEL 307 >gi|149018695|gb|EDL77336.1| similar to procollagen, type VI, alpha 3 isoform 4 (predicted) [Rattus norvegicus] Length = 719 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 21/250 (8%), Positives = 61/250 (24%), Gaps = 13/250 (5%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + A + + + + S + D + +N + + + Sbjct: 64 ALSKMALVFRNTARTNVAQYLIVITDGQSSDPVADAAQGLRDTGINIYAIGVRDA--NTT 121 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 R + + + + S++ + Sbjct: 122 ELEEIANNRVFFTDDFHFLKSIHQEVVRDICSFENCRSQKADIIFLIDGSESISSEDFEK 181 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 V R + + + D +++G F+ + + + + + Sbjct: 182 IKDFVKRMVNQSNIGADKIQIGLLQFSSTPREEFTLKNNYSSKDE-MCRAISNVTQINSG 240 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A+ +Y++++TDG++ D + + Sbjct: 241 TETGKALNFTL-------PFFDISQGGRPGVHQYLIVITDGDSHDDIVSP---AKALRDR 290 Query: 397 GIRIMTIAFS 406 I I I Sbjct: 291 NIIIFAIGVG 300 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 13/73 (17%), Positives = 26/73 (35%), Gaps = 9/73 (12%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 R +Y++++TDG+++ + GI I I ++ L Sbjct: 73 RNTARTNVAQYLIVITDGQSSDP---VADAAQGLRDTGINIYAIGV------RDANTTEL 123 Query: 420 SNCASPNSFFEAN 432 A+ FF + Sbjct: 124 EEIANNRVFFTDD 136 >gi|74212905|dbj|BAE33399.1| unnamed protein product [Mus musculus] Length = 1232 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|120554526|ref|YP_958877.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324375|gb|ABM18690.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 715 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 45/132 (34%), Gaps = 27/132 (20%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN----- 391 T + AM+ A + + ++++L+DG+ + + Sbjct: 111 TNLGLAMEKAAN--------DWLSGGTLENT--HLIVLSDGKVDVPGGDDASQAEEKRIV 160 Query: 392 -----KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 K +G I T+ ++ FL N A + SF A S LN F + Sbjct: 161 DSLLPALKDKGATIHTVGL-----SEKADIRFLRNLARETGGSFQLAQSAEALNLAFANA 215 Query: 445 IGNEIFERVIRI 456 + + + + I Sbjct: 216 LNTAVPQEQVPI 227 >gi|291299307|ref|YP_003510585.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290568527|gb|ADD41492.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 626 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 26/126 (20%), Positives = 49/126 (38%), Gaps = 17/126 (13%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + K+ + T A + G T I +++ A + K I+L++ Sbjct: 104 IDKIDADAITTAATELKPEGDTPIAYSLEKAAGDFTEAKG------------PKTILLVS 151 Query: 376 DGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 DGE T + + + K SQG+ R+ I F V+ +++ A S+++A Sbjct: 152 DGEETCG-GDPVKVAEKIASQGVDLRVHVIGFQVDDATRKQLTEIAK--AGKGSYYDAQD 208 Query: 434 THELNK 439 L Sbjct: 209 GPALAS 214 >gi|220918102|ref|YP_002493406.1| TadE family protein [Anaeromyxobacter dehalogenans 2CP-1] gi|219955956|gb|ACL66340.1| TadE family protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 140 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 8/118 (6%), Positives = 33/118 (27%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +S G + AL++P++L + ++ + + + +A+ ++ Sbjct: 2 RRARSERGAAAVEFALVLPLLLTIVFGTIEWGYYFFNRQVVINSAREGARAGTLQYASGT 61 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + + + T + + + Sbjct: 62 SASAVAVTTAENYLTSAGLPLTRDSISVARIEFKTADGGTCPSGSSCIRVQYDLPTLT 119 >gi|162453507|ref|YP_001615874.1| hypothetical protein sce5231 [Sorangium cellulosum 'So ce 56'] gi|161164089|emb|CAN95394.1| hypothetical protein sce5231 [Sorangium cellulosum 'So ce 56'] Length = 490 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 44/160 (27%), Gaps = 18/160 (11%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + +S + G T A+ A + + Sbjct: 247 YATPAVSIAPL----DAAWADELIGSIDRTFPDGGTPTAGALSGAIAQARAHAQA----- 297 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG------IRIMTIA-FSVNKTQQEK 414 N + VL TDG T+ N + ++G I I F + + Sbjct: 298 --NPTHRVVTVLATDGMPTECNPTSADGIAEIAAEGLGGSPSISTFVIGVFGRDDDGAQA 355 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ S + +++ ++ + F D + + Sbjct: 356 TMNRIAEGGGTKSAYFIDTSSDVTQAFLDALSEIRGTSLA 395 >gi|113476689|ref|YP_722750.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167737|gb|ABG52277.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 420 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 61/196 (31%), Gaps = 28/196 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K ++A +V+ ++ D ++ ++ RV S + Sbjct: 61 KLQRAKEACLAVVSHLRDPDYLS------LAGYSTRVTPLLE-SLAGGGAAAGFAEGAIA 113 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA- 388 D G T I+ A+ ++++ VL+TDG T + Sbjct: 114 DLQARGVTRIDLALDWIEESLLPE-----------KSPPLVGVLITDGHATNAGGTPLDD 162 Query: 389 ------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 KS GI + + LS+ +F A++ +L + Sbjct: 163 MKPFIVKARNMKSCGIILCAVGLGDAANFNTSFLTDLSD-QGGGAFIYADTPDKLLSDLQ 221 Query: 443 DRI--GNEIFERVIRI 456 +R+ EI ++ Sbjct: 222 NRLKADQEIAIVDAKL 237 >gi|291299883|ref|YP_003511161.1| Vault protein inter-alpha-trypsin domain-containing protein [Stackebrandtia nassauensis DSM 44728] gi|290569103|gb|ADD42068.1| Vault protein inter-alpha-trypsin domain protein [Stackebrandtia nassauensis DSM 44728] Length = 831 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 49/173 (28%), Gaps = 19/173 (10%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWG--VHKLIRTIVKTF--AIDENEMGSTAINDAMQ 344 D ++ R F+ + S R + G T I + Sbjct: 333 DTLSSADRFAVRCFDTAMTSPEGLDPNGLSAGTDRNRFRAVEHLAGTETRGGTDILKPLS 392 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 TA D + + + + I+L+TDG+ +++ + + G+R+ + Sbjct: 393 TAVDLLTAGEKGR----------DRVIILVTDGQVGNEDQILRELTGRL--SGMRVHVVG 440 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 R L S L++ + V +T Sbjct: 441 IDKAVNAGFLHRLAL---VGRGRCELVESEDRLDEATAHIHRRIVAPVVTDLT 490 >gi|254456154|ref|ZP_05069583.1| type II Secretion PilY1 [Candidatus Pelagibacter sp. HTCC7211] gi|207083156|gb|EDZ60582.1| type II Secretion PilY1 [Candidatus Pelagibacter sp. HTCC7211] Length = 1384 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 46/152 (30%), Gaps = 5/152 (3%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + VHK + N G T M+ A D K + Sbjct: 403 CDTQNCLKVRVHKDGAAQINKIISSVNARGGTDPYTFMKIAQDYYNHG--SLSPIDKKSP 460 Query: 366 EAKKYIVLLTDGE--NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR-YFLSNC 422 K Y++++ DG+ + I + ++GI+ +AF + R L+ Sbjct: 461 CQKSYVIVIGDGDFFPSMGWNTHINMAKALNNKGIKTFGVAFGTGISPGGLKRFNDLAKA 520 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A + L + I I ++ Sbjct: 521 GGTTKAIIAPTAAALKTQLQAAISQIIASKLS 552 >gi|197106455|ref|YP_002131832.1| hypothetical protein PHZ_c2994 [Phenylobacterium zucineum HLK1] gi|196479875|gb|ACG79403.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 416 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 11/49 (22%), Positives = 19/49 (38%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK 54 + ++L +G + T L M V+LG G+ D W + Sbjct: 2 RHAPLFRRLRDDRSGASALTTGLSMVVILGFVGLGTDAGAWYAARRDAQ 50 >gi|254418896|ref|ZP_05032620.1| hypothetical protein BBAL3_1206 [Brevundimonas sp. BAL3] gi|196185073|gb|EDX80049.1| hypothetical protein BBAL3_1206 [Brevundimonas sp. BAL3] Length = 186 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 39/127 (30%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++L +G + AL+ PVML +VD+ + AA S Sbjct: 3 RSLIRRLAGDESGVSAVEFALIAPVMLLFYAGMVDLCQGYMALKRTSHAASAVADLVSQS 62 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + +++S + + + + + + + T K Sbjct: 63 RTITKADINSIFEVGPAIMAPFASTSMEQRISSVTRVSANKYTLNWSRSWTPDGGAGTKM 122 Query: 128 AYQVVLS 134 +V++ Sbjct: 123 NKPLVIA 129 >gi|156400007|ref|XP_001638792.1| predicted protein [Nematostella vectensis] gi|156225915|gb|EDO46729.1| predicted protein [Nematostella vectensis] Length = 974 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 56/160 (35%), Gaps = 21/160 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + S++R++ R+G ++ R I F +I + T Sbjct: 88 RDYQACANYATSIVRNLDMPKTGA---RVGTVIYSKRSIVLFDFKDNKTDII-LQLNTIK 143 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 E + + A++ A + + +KK +L+T GE +QD+ + Sbjct: 144 YREKPNRALSTGQALELAQKRLFQNAR---------RNSKKIAILVT-GEKSQDDVIIPS 193 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 K G+ I T+ + L + AS S+ Sbjct: 194 --KLMKDSGVIIYTVGVGEGYFLPQ-----LESIASSPSY 226 >gi|162450402|ref|YP_001612769.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] gi|161160984|emb|CAN92289.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] Length = 865 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 47/162 (29%), Gaps = 17/162 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R G F+ + + F G T + + + A + + Sbjct: 326 DRFGVIAFSSDFRALEPSLAPFTQATLKAADAFVDGLRADGGTEMLNPLLAAVGMLGDAE 385 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 D + +VLLTDG+ + + + + +G+RI T N + Sbjct: 386 RD------------RVVVLLTDGQVGNEAQIVDRVVQ--RGKGVRIYTFGIGTNVSDVLV 431 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + + +++ + RV + Sbjct: 432 NDLARR---TEGAAEFIHPGERIDEKVTAQFARATAVRVTGL 470 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 45.4 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 47/417 (11%), Positives = 104/417 (24%), Gaps = 32/417 (7%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 A + I+ + + AK + +Q + + + N +V +I Sbjct: 115 AAVDHLRLWIGERFIEGDIQPKAIAKKQYEAAQQAG---VKASLVEQHRPNIFSTQVANI 171 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D +V + + + L + + + E R + Sbjct: 172 APDESVTVEIEYQEAVLYRDGEFSLRFPTVVAPRYIPVVPLNKVPDVNEITPFLRDLQDD 231 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + ID + + ++ D + S + P Sbjct: 232 PTLPFSLNIDLNAGLPIAVINTPSHAFTQQKLSEDHYILSLIQPDIADRDVVLSWRPKAT 291 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + L + + L ++ V+ + + Sbjct: 292 DLPSTALFTQHVEGQGYGLLLTMPPQVNHQVNSTTSSALFHQSVTFVLDISGSMYGESIE 351 Query: 295 ----------------VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 FN + V T F + G T Sbjct: 352 QAKQALRYGLQQLQPEDSFNIVTFNHEAMLYSEQLLPVTSSTITRALRFVDGLDADGGTE 411 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + A++ A+ + H N+ IV +TDG ++ I + + Sbjct: 412 MAAALKAAFSI-------KTHDQLNSTRWLNQIVFITDGSVGNESALFDLIEQQLVDR-- 462 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 R+ T+ R + ++ E+N R I + V+R Sbjct: 463 RLFTVGIGSAPNSYFMTRAAMK---GKGTYTYIGDVKEVNTKMRLLFSK-ISQPVMR 515 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|326789665|ref|YP_004307486.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540429|gb|ADZ82288.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 564 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 40/354 (11%), Positives = 92/354 (25%), Gaps = 24/354 (6%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + NF + F + + E Sbjct: 208 AINLKTITDAVANNEIAMGYTNPFASSTGLNFLLSTLSTFDSQNPFSETAISGFEKFQEN 267 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + +L M ++++ T+ + + V + H I + D S Sbjct: 268 IPFVAYTTIQMRESAKTGALDGFIMEYQTYINDTELKGDYVFTPFGIRHDSPIYAIGDLS 327 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 R+ + K ++ + Sbjct: 328 AEKQAILREFVQFATSETYAAKAVEYGFNGLDEYKSEMKSVDKQQ--------ITKAQKL 379 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 + +++ + + + S+I + + +G ++D V Sbjct: 380 WKEKKNGTKKISAVFVADISGSMDGEPLNNLKKSLIT---GAEYIGKDNSIGLVTYSDDV 436 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + R++ +D G TA D M A ++ N + Sbjct: 437 NINLPVG-KFDLNHRSLFTGAVMDLEASGGTATFDGMIVALKMLMEEKAA-------NPD 488 Query: 367 AKKYIVLLTDGENTQDNEEG--IAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 AK + +L+DGE + + I K I + TI ++ N + Sbjct: 489 AKLMLFVLSDGETNRGHSLNDIEGILRTLK---IPVYTIGYNANISALNNISSI 539 >gi|293349337|ref|XP_001060689.2| PREDICTED: collagen, type XII, alpha 1 [Rattus norvegicus] Length = 3064 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDIHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 14/159 (8%), Positives = 45/159 (28%), Gaps = 21/159 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R +++ V + R + G T + ++ + + Sbjct: 127 QDRFSLVTYSNSVNGGDGLHYLTAD-KRNSLNWMVDSIPAGGGTNLGGGLEKGVGVLRAY 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC--NKAKSQG--IRIMTIAFSVNK 409 + ++L++DG+ Q + + + G + T+ Sbjct: 186 GAPDRMGK---------VILISDGQANQGVTDPNQLAAMAALRDDGLVYSVTTVGIG--- 233 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 Q+ ++ A ++ + + +F++ Sbjct: 234 --QDFNEQLMATVADGGRGRYYYLENPGDFLAVFQEEAN 270 >gi|15963888|ref|NP_384241.1| hypothetical protein SMc04117 [Sinorhizobium meliloti 1021] gi|307315736|ref|ZP_07595255.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307320421|ref|ZP_07599838.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15073063|emb|CAC41522.1| Conserved hypothetical transmembrane protein [Sinorhizobium meliloti 1021] gi|306893987|gb|EFN24756.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306898627|gb|EFN29295.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 204 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 11/69 (15%), Positives = 18/69 (26%), Gaps = 4/69 (5%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++LI G I A+L V ++ E L A T + Sbjct: 16 RGLFRRLIGDRKGATAIEFAILALPFFIVVFASIETFIAFAGEQLLANATDT----LARK 71 Query: 68 LIQSLEEVS 76 + Sbjct: 72 IRTGEITTD 80 >gi|271964702|ref|YP_003338898.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] gi|270507877|gb|ACZ86155.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] Length = 514 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 59/191 (30%), Gaps = 29/191 (15%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMG----ATFFNDRVISDPSFSWGVHKLIRT--I 323 + +R AL ++ + R F F Sbjct: 341 RIEALRQALVTLTGADTSASGTFSRFRSRENVIMIPFGGSAGLPQPFILPERDPQPALAQ 400 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 ++ +A G TAI D ++ AY + D IVL+TDGENT + Sbjct: 401 IRAYAERLRAAGGTAIYDGLRAAYGQAGDAGRDHYTS----------IVLMTDGENTDGS 450 Query: 384 E--EGIAICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + A + +R + F E + A + + F+A + L Sbjct: 451 SYEDFEAYYRSLPEARRQVRTFVVLFG------ESDADEMERIATLTRGAVFDARTGS-L 503 Query: 438 NKIFRDRIGNE 448 F++ G + Sbjct: 504 ASAFKEIRGYQ 514 >gi|239817420|ref|YP_002946330.1| hypothetical protein Vapar_4453 [Variovorax paradoxus S110] gi|239803997|gb|ACS21064.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 589 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 54/185 (29%), Gaps = 2/185 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + I + + G + TA+ + ++ + + ++ Y + L++A A Sbjct: 1 MSTKPVISRRRNAGRQIRGSILVNTAIALS-LIVITLIGTELGYLFYMKRELQKATDLAA 59 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + + + S+ + RN + + E Sbjct: 60 LAGAKEISYAGSCPSA-KTAAKLSANGTGSTDRNRNLPISFSLEDAEIECGQWDPAKTTS 118 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + + R L P +L G ++ + A + ++ S+ Sbjct: 119 DHFDSAPPDQQNAIRITLNRTPATLLSFFEGNRTIQTKAVATNDPIAAFSIGTGVASLDE 178 Query: 182 VIDFS 186 S Sbjct: 179 GAVNS 183 >gi|149635474|ref|XP_001506111.1| PREDICTED: similar to alpha-1 type VII collagen [Ornithorhynchus anatinus] Length = 2993 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 48/150 (32%), Gaps = 22/150 (14%) Query: 290 NVNDTVRMGATFFNDRVISDPSFS---WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + VR F SD + G H + +++ + G+T ++ Sbjct: 67 PFVNVVRDTGVRFGAVQYSDDPRTEFALGTHASGQEVMRAVRELNYKRGNTRTGAGLRHV 126 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + H K +L+TDG +QD+ E A+ K K+Q +++ + Sbjct: 127 AEHFF-------HPQLARPGVPKVCILITDGR-SQDDVEQGAL--KLKNQNVKVFAVGI- 175 Query: 407 VNKTQQEKARYFLSNCASPN---SFFEANS 433 + L AS F Sbjct: 176 -----KNAHEEELRKVASSPVEEYHFFIPD 200 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 63/202 (31%), Gaps = 28/202 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + +R A + I + G FN + ++ Sbjct: 308 DISGSMNGFDRIYRLRQAGEQFLLQILET-----GSWAGIVVFNSQALTKTYLKQITGDS 362 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +R + + G T I ++ + + IVLLTDGE+ Sbjct: 363 VRQTLSAYLPT-AAGGGTNICSGIREGFQVFLKKYPSTEGCE---------IVLLTDGED 412 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE--- 436 + + + G I TIA + ++ + + + F A + + Sbjct: 413 AGVSSCFAEV----QRSGSIIHTIALGPSAAKELEMLADM----TGGLKFSATDSLDSNG 464 Query: 437 LNKIFRDRIGNE--IFERVIRI 456 L F I ++ I++ Sbjct: 465 LIDAFSGISSGSGDISQQSIQL 486 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 42/427 (9%), Positives = 109/427 (25%), Gaps = 65/427 (15%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + Q + + + + + ++ V + +S L L Sbjct: 123 YHSQANKLIVDNKEYDIIDETENKKDTDVSVPKPDEIEEESSKENENSVS---PFTLPTL 179 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 SL S+ + + + V+D + ++ +N + Sbjct: 180 SLPAVSVPSNQTIPTEYTTDDQGTYPKASWQPTGNTNVLD---HQGNKNGTNQWDGINSW 236 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------ 258 + SY G D + Y + + +Y + Sbjct: 237 NGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGLFDVYLNARGNVQKDITPLDLVLV 296 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 297 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 355 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + + +V R+ Sbjct: 356 FDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 415 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 416 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 475 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 476 IEIHGLGIQLQSDSAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 534 Query: 448 EIFERVI 454 +I V Sbjct: 535 QISATVS 541 >gi|296198593|ref|XP_002806760.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XII) chain-like [Callithrix jacchus] Length = 3113 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 45/133 (33%), Gaps = 18/133 (13%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 H+ +++++ A + G+T A+ + R +K VL+ Sbjct: 1254 NTHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFRTQAGMRPRA------RKIGVLI 1307 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG++ D E K K +G+ + I + L A+ + Sbjct: 1308 TDGKSQDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNV 1358 Query: 432 NSTHELNKIFRDR 444 L++I D Sbjct: 1359 ADFESLSRIVDDL 1371 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMEPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|218661390|ref|ZP_03517320.1| von Willebrand factor type A [Rhizobium etli IE4771] Length = 370 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 63/200 (31%), Gaps = 23/200 (11%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +D S + K L++ + ++ +K D V + + Sbjct: 2 PNLVFLIDVSGSMDELDKLPLLKSSFRLLVNRLKADDTVA------IVTYAGNAGTVLEP 55 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + ++ + + GST + ++ AYD + + + Sbjct: 56 T---RVSEKSKILSAIDRLEAGGSTGGAEGIEAAYDL--------AQKAFVKDGVNRVM- 103 Query: 373 LLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 L TDG+ +E+ I + + +GI + + F + N S Sbjct: 104 LATDGDFNVGPSSDEDLKRIIEEKRKEGIFLTVLGFGRGNLNDSLMQTLAQN--GNGSAA 161 Query: 430 EANSTHELNKIFRDRIGNEI 449 ++ E K D G+ + Sbjct: 162 YIDTLAEAQKTLVDEAGSTL 181 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 48/144 (33%), Gaps = 19/144 (13%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 V+ +R+ F+ + + +++ + + G T +++ + A + I Sbjct: 72 VSPRMRLSFIVFSTQAKIILPLTGDRYEITKGLK--DLSSVIPAGETYMHEGFKLANEQI 129 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + I+ LTDG+ Q N A+ +G R+ + Sbjct: 130 VKAG---------GKSTASVIIALTDGKLADQIPVLTEKEANIARGRGARVYCVGVLDFN 180 Query: 410 TQQEKARYFLSNC-ASPNSFFEAN 432 Q L A+P + F Sbjct: 181 FDQ------LKRIAAAPENVFRVE 198 >gi|153002167|ref|YP_001367848.1| putative outer membrane adhesin-like protein [Shewanella baltica OS185] gi|151366785|gb|ABS09785.1| putative outer membrane adhesin like proteiin [Shewanella baltica OS185] Length = 1215 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 39/124 (31%), Gaps = 8/124 (6%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +K + GSTA+ D Q A D + + + +L DG++ Sbjct: 378 KQALKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASGAPG------VVFVLADGDDN 431 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + ++ ++ + I + F ++N F + E+ Sbjct: 432 NSIKSESSVITAYQNANVPIFS--FGYGSASPTGPLVTMANATGGKYFSSPTTLAEIIDA 489 Query: 441 FRDR 444 F Sbjct: 490 FLQA 493 >gi|330506652|ref|YP_004383080.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] gi|328927460|gb|AEB67262.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] Length = 551 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 40/111 (36%), Gaps = 9/111 (8%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D V + S R+ G+TA D + A + Sbjct: 415 LVSYSDDVYINLPIS-RFDLNNRSYFVGAVGGLQAGGATATFDGIAVAMKML-------E 466 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 ++ + + K I +L+DGE + + I + GI I TI ++ + Sbjct: 467 EQLALDPKLKPKIFVLSDGETNRGHSLND-IRKLVEESGIPIYTIGYNADI 516 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 111/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPDKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNISKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|282897345|ref|ZP_06305347.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] gi|281197997|gb|EFA72891.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] Length = 449 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 56/151 (37%), Gaps = 18/151 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT---------F 327 A+ ++ K + ++ F + + P ++ L + + + + Sbjct: 152 AINAIREFTKVSSDRGGDTQISVVPFGEAGKNCPEYTVNKDTLDKFLSASDFKLQNSLEY 211 Query: 328 AIDENEMGSTAINDAMQTAYDTIIS--SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 N GST + ++ A + + + + + + I+LL+DG + NE Sbjct: 212 LSGLNPCGSTNLYQPLKKALEFLGNPEDPRFTLPENSSEPNPRLSIILLSDGYHNAMNEF 271 Query: 386 GIAICNKAKS-----QGIRIMTIAFSVNKTQ 411 N+ KS + I + T+ + + +Q Sbjct: 272 QDF--NELKSLLQSYENITVHTLGYGLTPSQ 300 >gi|221116226|ref|XP_002166419.1| PREDICTED: similar to collagen, type XXVIII [Hydra magnipapillata] Length = 680 Score = 45.4 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 40/146 (27%), Gaps = 26/146 (17%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G + + R + G T + A++ A D S Sbjct: 162 TTRVGIIRYASDAEIALHLTRFNDTTSRDTAIDNI--FYKTGGTRTDIALKKAADVFQFS 219 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQ----------DNEEGIAICNKAKSQGIRIMTI 403 + + ++L+TDG + ++ K G+ I I Sbjct: 220 EQKN-----------QVLILVTDGPTNSLEINKDHFVEGKDLVAGPVDRLKDAGVAIFCI 268 Query: 404 AFSVNKTQQEKARYF---LSNCASPN 426 + ++ + AS Sbjct: 269 GIVPDSETPDEIETMKEEMRVIASEP 294 >gi|322434933|ref|YP_004217145.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162660|gb|ADW68365.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 347 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 30/256 (11%), Positives = 77/256 (30%), Gaps = 34/256 (13%) Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + ++ + + + + + V N + + D ++ D S Sbjct: 73 RLEANLVLVPMTVTTTDTNRLVTGLEATNFEVFDNNTGQVIKTFSTQDAPVTIGIVFDLS 132 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 K R AL+ +R+ D FND+ ++ V + Sbjct: 133 GSM--TSKFGRARKALSEFLRTSNPAD------EFFVVGFNDKPAVIVDYTSDVEDVEAR 184 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 +V TA+ DA+ + + K +K +++++DG + + Sbjct: 185 MV-----MLKPENRTALIDAVYLGVNKL-----------KEAKYDRKALLIVSDGGDNRS 228 Query: 383 NEEGIAICNKAKSQGIRIMTIAF---SVNKTQQEKARYFLSNCA--SPNSFFEANSTHEL 437 + + ++I +I +++ L + + + F ++ Sbjct: 229 RYTEGELRRVVRESDVQIYSIGIYDAYAPTEEEQLGPVLLKDISEMTGGRMFPVTDIADM 288 Query: 438 NKIFRDRIGNEIFERV 453 I + I + Sbjct: 289 AD-----IASRISAEL 299 >gi|157694069|ref|YP_001488531.1| hypothetical protein BPUM_3318 [Bacillus pumilus SAFR-032] gi|157682827|gb|ABV63971.1| hypothetical protein YwmD [Bacillus pumilus SAFR-032] Length = 225 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 53/184 (28%), Gaps = 19/184 (10%) Query: 269 KKKHLVRDALASVIRSI-KKIDNVNDTVRMGATFFNDR---VISDPSFSWGVHKLIRTIV 324 KK + + ++ S + + T N + +G + Sbjct: 51 KKIDIAKRSIQSFASILSDDTQVLLRVFGHEGTNKNAGKAISCASSEAVYGFGSYESSTF 110 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + G T + A+ E K + +++DG+ T Sbjct: 111 QQALNVYKPTGWTPLAKALTDTKQDFEDHQ----------AEGKNIVYVVSDGKETCGGS 160 Query: 385 EGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 + GI + I F VN+ + + + A + A + ELN I + Sbjct: 161 -PSQAAKELHEDGIDTIVNIIGFDVNEKEAKSLKSVAK--AGGGQYQPAANAEELNHILQ 217 Query: 443 DRIG 446 + Sbjct: 218 NEAS 221 >gi|94969533|ref|YP_591581.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551583|gb|ABF41507.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 362 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 39/329 (11%), Positives = 94/329 (28%), Gaps = 36/329 (10%) Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + +S+ L L + I + + A+ ++ Sbjct: 20 SRAGVSAPSRFRLTLLGWMMLLAVIVGFAVPGLAQVDSNEVHVQPREAPKP----PTPPQ 75 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 ++ +P+ V N V + + V + + Sbjct: 76 GDPADVNTHTRPMRVDVNIVLVPVTVTDPDNRLV--TGLEKENFEVLDQNIPQQIRHFSS 133 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D +S D S K R+A+ ++ D FND+ Sbjct: 134 EDAPVSIGVIFDMSGSMS--NKIDKSREAIVEFFKTANPDD------EFFVVAFNDKPEV 185 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F+ + ++ T++ DA+ + + + + Sbjct: 186 LQDFTNRIED-----IQEKLTILQPKDRTSLLDAIYLGMNKMRQA-----------KYER 229 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV---NKTQQEKARYFLSNCA-- 423 K +++++DG + I + + ++I I T++ L + Sbjct: 230 KALLIISDGGDNHSRYTENEIKSMVREADVQIYAIGIYDLAPTTTEEMAGPALLGEISDW 289 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + F ++ +EL + +IG E+ + Sbjct: 290 TGGRMFPIDNVNELADV-ATKIGVELRNQ 317 >gi|126310411|ref|XP_001373784.1| PREDICTED: similar to collagen type XII alpha 1 [Monodelphis domestica] Length = 3116 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 45/133 (33%), Gaps = 18/133 (13%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 HK R+++ + A + G+T A+ + + A+K VL+ Sbjct: 1246 NAHKDRRSLLDSVANLPYKGGNTLTGMALNFIRQ------NNFRPQAGMRPRARKIGVLI 1299 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG++ D E K K G+ + I + L A+ + Sbjct: 1300 TDGKSQDDVEAPS---KKLKDDGVELFAIGI------KNADESELKMIATDPDDTHAYNV 1350 Query: 432 NSTHELNKIFRDR 444 L++I D Sbjct: 1351 ADFDSLSRIVDDL 1363 >gi|160876887|ref|YP_001556203.1| outer membrane adhesin-like protein [Shewanella baltica OS195] gi|160862409|gb|ABX50943.1| outer membrane adhesin like proteiin [Shewanella baltica OS195] gi|315269091|gb|ADT95944.1| outer membrane adhesin like proteiin [Shewanella baltica OS678] Length = 1215 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 39/124 (31%), Gaps = 8/124 (6%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +K + GSTA+ D Q A D + + + +L DG++ Sbjct: 378 KQALKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASGAPG------VVFVLADGDDN 431 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + ++ ++ + I + F ++N F + E+ Sbjct: 432 NSIKSESSVITAYQNANVPIFS--FGYGSASPTGPLVTMANATGGKYFSSPTTLAEIIDA 489 Query: 441 FRDR 444 F Sbjct: 490 FLQA 493 >gi|113971723|ref|YP_735516.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] gi|113886407|gb|ABI40459.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] Length = 1215 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 38/124 (30%), Gaps = 8/124 (6%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +K + GSTA+ D A D +I+ + +L DG++ Sbjct: 378 KQALKAAIDNIYASGSTALFDGSSLALDNLITYQTAAASGAPG------VVFVLADGDDN 431 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + ++ + I + F ++N F + E+ Sbjct: 432 SSIKNESTVITAYQNANVPIFS--FGYGSASPTGPLVTMANATGGKYFSSPTTLSEIIDA 489 Query: 441 FRDR 444 F Sbjct: 490 FLQA 493 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 52/159 (32%), Gaps = 8/159 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSF-SWGVHKLIRTIVKTFAIDENEMGSTAIND 341 I D T R+G +N ++ + + V + + + Sbjct: 85 TRIGVRDYEPKTTRVGLVSYNADAKILAGLDTYQSYDDLANGVFDSLNSVSATDESYLAK 144 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A N + KK +++ + + ++ K+ G++I+ Sbjct: 145 GLSAAEKVFEEGKSTA-----NRTQYKKVVIVYASSYKGTGELNPVPVADRMKTAGVKII 199 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 T+AFS N + LS AS + F + +I Sbjct: 200 TVAFSQNND--DGLLKDLSEIASLDFDFANTDKGVVEEI 236 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 69/193 (35%), Gaps = 30/193 (15%) Query: 270 KKHLVRDALASVIRSIKKID----NVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TI 323 + +R AL + + + + FN + +F+ Sbjct: 359 RIDSLRSALVGLTGADTSLTGRFRRFRSREEVTMLPFNTGPGAPRTFTVPEENPAAELAQ 418 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 +KTFA G TAI D++ AY + + R + IVL+TDGEN + Sbjct: 419 IKTFAEGLVARGGTAIYDSLSRAYQVLEPLMAADPDRFTS-------IVLMTDGENANGS 471 Query: 384 EEGIAICN------KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 + + K + + T+ F L+ A + F+A + Sbjct: 472 SLPDFLTSLASLPPAMKQ--VPVFTVLFGEGS------SDELTQVATRTGGKVFDARNV- 522 Query: 436 ELNKIFRDRIGNE 448 +L+++F++ G + Sbjct: 523 QLSRVFQEIRGYQ 535 >gi|156523281|ref|NP_001096038.1| anthrax toxin receptor 2b [Danio rerio] gi|151335854|gb|ABS00409.1| capillary morphogenesis protein 2B [Danio rerio] Length = 487 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 55/168 (32%), Gaps = 20/168 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 V+ +R+ F+ + + + + G T +++ ++ A + + Sbjct: 69 VSPKMRVSFIVFSSSAEIILPLTGDRVDIDSGLQQ--LSKIRPAGDTYMHEGLKKAIEQM 126 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICNKAKSQGIRIMTIAFSVNK 409 S A I+ LTDG+ NE I + A+ G R+ + Sbjct: 127 TSQ----------GARASSIIIALTDGKLEVFMNELAIKEADLARQYGARVYCVGV---- 172 Query: 410 TQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 ++ L+ A + F + K + I + V ++ Sbjct: 173 --KDFDANQLTEIADNKDQVFPVVDGFQALKNIVNSILQKSCVEVFKL 218 >gi|217974748|ref|YP_002359499.1| outer membrane adhesin-like protein [Shewanella baltica OS223] gi|217499883|gb|ACK48076.1| outer membrane adhesin like proteiin [Shewanella baltica OS223] Length = 1215 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 39/124 (31%), Gaps = 8/124 (6%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + +K + GSTA+ D Q A D + + + +L DG++ Sbjct: 378 KQALKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASGAPG------VVFVLADGDDN 431 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + ++ ++ + I + F ++N F + E+ Sbjct: 432 NSIKSESSVITAYQNANVPIFS--FGYGSASPTGPLVTMANATGGKYFSSPTTLAEIIDA 489 Query: 441 FRDR 444 F Sbjct: 490 FLQA 493 >gi|73973308|ref|XP_539002.2| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 1 [Canis familiaris] Length = 3065 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEDELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|77454616|ref|YP_345484.1| hypothetical protein pREL1_0049 [Rhodococcus erythropolis PR4] gi|77019616|dbj|BAE45992.1| conserved hypothetical protein [Rhodococcus erythropolis PR4] Length = 151 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 33/122 (27%), Gaps = 1/122 (0%) Query: 17 SCTGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++T +L P+++ G++VD A+ A + ++ SL Sbjct: 17 HDKGAVTLMTVILAPILIFFVWGLIVDGGGMHTAAQRADNVAEDAARASGQQIVGSLSAR 76 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + YL R + R + + S+ Sbjct: 77 GIDTVVDPVRAVAAAKRYLADAGVEGEVIPTGPRTLLITTRIVYDNKLLPLPSKLITGSA 136 Query: 136 RY 137 Sbjct: 137 TV 138 >gi|134298678|ref|YP_001112174.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] gi|134051378|gb|ABO49349.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] Length = 183 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 20/55 (36%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 K++K G FI+ + +++ +D + L+ + + + Sbjct: 4 KVLKGKKGFTFILFVPVFLIIMLFMARGIDWGMATVARGKLQTISDAGSLAGASA 58 >gi|74212839|dbj|BAE33379.1| unnamed protein product [Mus musculus] Length = 967 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 18/191 (9%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D S + +++ +++V+ KK + ++D F Sbjct: 149 SDIVFLIDGSGSIN-NIDFQKMKEFVSTVMEQFKKSKTL-----FSLMQYSDEFR--IHF 200 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ K + + + G T ++ + K +V Sbjct: 201 TFNDFKRNPSPRSHVSPIKQLNGRTKTASGIRKVVRELFHKTNGARENAA------KILV 254 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 ++TDGE D + + +A G+ I + + R L AS F Sbjct: 255 VITDGEKFGDPLDYKDVIPEADRAGVIRYVIGVGNAFNKPQSRRE-LDTIASKPAGEHVF 313 Query: 430 EANSTHELNKI 440 + ++ LN I Sbjct: 314 QVDNFEALNTI 324 >gi|289675735|ref|ZP_06496625.1| hypothetical protein PsyrpsF_20856 [Pseudomonas syringae pv. syringae FF5] Length = 328 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 49/177 (27%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + ++ F Q + + V V + A + + + Sbjct: 72 A----SAPDFANQSATRNGFVPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASAAPLAALTIRSAGVTVDSTGAA 184 >gi|282853088|ref|ZP_06262425.1| conserved hypothetical protein [Propionibacterium acnes J139] gi|282582541|gb|EFB87921.1| conserved hypothetical protein [Propionibacterium acnes J139] gi|314922731|gb|EFS86562.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1] gi|314965815|gb|EFT09914.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2] gi|314982958|gb|EFT27050.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3] gi|315091262|gb|EFT63238.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4] gi|315094498|gb|EFT66474.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1] gi|315105219|gb|EFT77195.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2] gi|327328992|gb|EGE70752.1| putative membrane protein [Propionibacterium acnes HL103PA1] Length = 169 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 21/58 (36%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P +L + + R S + + AAQ A SV Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGESIAV 93 >gi|260462604|ref|ZP_05810810.1| TadE family protein [Mesorhizobium opportunistum WSM2075] gi|259031510|gb|EEW32780.1| TadE family protein [Mesorhizobium opportunistum WSM2075] Length = 181 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 12/142 (8%), Positives = 38/142 (26%), Gaps = 1/142 (0%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + T F + + + G + L+ P+ML + + + + + ++ Sbjct: 3 MIRTLFR-HLDRFRRDHRGSVLVEMTLITPLMLILSAGVFEFGNLIHDKLLMEAGLTDGA 61 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 A+ Q + + + + ++ + + T Sbjct: 62 RYAARCNSQLYTDAGLAGIDCTGYAANIAVFGKPIVTQADVPRVSGWQTSGATSFATVTM 121 Query: 122 MNPRKSAYQVVLSSRYDLLLNP 143 N + A ++ Sbjct: 122 ANSCQDAVSAGVTLYRSTTAQV 143 >gi|223670960|dbj|BAH22727.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 53/177 (29%), Gaps = 21/177 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 ++I + T R+ F+D + V+ + + G Sbjct: 274 QFARTIIDEFGISATPSGT-RVAVIVFSDAAQVIFNLKSNRIVDKEEAVRRLENLQFQGG 332 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TA A+Q DT+ + KK + L+TDG++ + ++ Sbjct: 333 GTATKLALQAVIDTVTPELRNNS---------KKALFLITDGKSNKGGS-PDRPAKVLRA 382 Query: 396 Q-GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNE 448 I I + L + AS ++ L K ++ I + Sbjct: 383 GFNFEIFAIGV-----SDSVDKDELKSIASEPFRTHVYQIKDYATLVK-LKELITTK 433 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 45.4 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 44/128 (34%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A N+ G+T A+ + R +K VL+TDG++ Sbjct: 1263 KKSLLQAVANLPNKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1316 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1317 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1367 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1368 LSKIVDDL 1375 >gi|320450000|ref|YP_004202096.1| hypothetical protein TSC_c09220 [Thermus scotoductus SA-01] gi|320150169|gb|ADW21547.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 691 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 45/146 (30%), Gaps = 18/146 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F+ + + ++ + G T + A + A + Sbjct: 331 VVLFSSTHRVLFPPRPMTEQGKKEA-ESLLLSVRAGGGTVLGSAFREAVRLLQ------- 382 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +K I++L+DG + + +A+ G+ + +A + + Sbjct: 383 ----GVPVERKGILVLSDGLISDPQDPILALAEA---SGLEVSAMALGADADRAFLKVLA 435 Query: 419 LSNCASPNSFFEANSTHELNKIFRDR 444 +++A + EL ++F Sbjct: 436 ER---GGGRYYQAATAQELPRLFLKE 458 >gi|297567245|ref|YP_003686217.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851694|gb|ADH64709.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 467 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 55/190 (28%), Gaps = 20/190 (10%) Query: 224 IRDEKLSPYMVSCNKSLY-YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 R + + ++V + S+ + P D +R K LV +AL +++ Sbjct: 43 SRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKIDLVIEALQNLL 102 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 S + + R+ F+D F+ K G T + Sbjct: 103 SSPQLQPS----DRLAIVKFDDVAEVVQPFTPANEKARLVAAAERLT--QYSGGTQMGAG 156 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 M+ + ++LLTDG+ T D + + I + Sbjct: 157 MREGMRLLEREAGSRR------------LILLTDGQ-TFDEPLVETVAAQLAQARIPVTA 203 Query: 403 IAFSVNKTQQ 412 I Sbjct: 204 IGVGDEWNDD 213 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 59/215 (27%), Gaps = 20/215 (9%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 M + P D +L +D S + AL + + ++ Sbjct: 163 VMYDKKGYARMSVIPDACDTNLDVVFVLDQSGSIGYYNH----QLALNFLSKVVEFFKIG 218 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++G ++ + + TI+ + G TA + A + Sbjct: 219 ANKTQVGLITYSTHAYVQFDLND--YHSKSTILNRISRIYYTGGWTATALGLFQAGVILN 276 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + ++LLTDG + + GI++ T+ Sbjct: 277 PQQMRGARPISQG--VPRVVILLTDGRSN--RVPIDEVAPSLHDFGIQVYTVGVG----- 327 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 L AS + L F D G Sbjct: 328 -NIYLPELKFIASDPDPYHIF----LLDSFSDASG 357 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 48/153 (31%), Gaps = 24/153 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ R + G T+I +++A+ I Sbjct: 343 WVGMVTFDSAAQVQSELIQINSGTERDALTKSLPTV-ATGGTSICSGLRSAFAVI----- 396 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C N+ K G I T+A + ++ + Sbjct: 397 -----KKKYPTDGAEIVLLTDGEDN-----TISSCFNEVKQSGAVIHTVALGPSAAKELE 446 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDR 444 + + A+ + L F Sbjct: 447 ELSKM----TGGLQTYASDQAQNNGLIDAFGAL 475 >gi|326674791|ref|XP_001922046.3| PREDICTED: collagen alpha-1(XIV) chain [Danio rerio] Length = 1852 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 39/152 (25%), Gaps = 20/152 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D D R+G ++ + ++ + G+T A+ Sbjct: 185 VNAFDVGIDKTRIGLAQYSGDPRIEWHL--NGFSTKEAVIDAVKNLPYKGGNTLTGLALT 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + K +L+TDG++ D +S G+ + I Sbjct: 243 YVLE------NSFKPESGARDNIPKIGILITDGKSQDDVISP---AQTLRSSGVELFAIG 293 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + L AS + Sbjct: 294 V------KNADENELKAIASEPEDTHVYNVAD 319 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 49/147 (33%), Gaps = 24/147 (16%) Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+D ++ + ++ + G+T A++ + I S Sbjct: 1064 QFSDDARTEFKL--NSYSDKEALLDAVQRISYKGGNTKTGRAIKHVKEAIFSE------D 1115 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K +V+LTDG +QD+ I+ + + +G I I F+ + L Sbjct: 1116 AGVRRGIPKVLVVLTDGR-SQDDVNKIS--KEMQMEGYIIFAIGFADADYGE------LV 1166 Query: 421 NCA---SPNSFFEANSTHELNKIFRDR 444 N A S F + F+ Sbjct: 1167 NIASKPSERHVFFVDD----LDAFKKI 1189 >gi|111074529|ref|NP_031756.2| collagen alpha-1(XII) chain [Mus musculus] Length = 3061 Score = 45.0 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 37/369 (10%), Positives = 98/369 (26%), Gaps = 19/369 (5%) Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 + L+ S K ++ + + +T +I+ Sbjct: 177 TDRIAGNTAGILMSKEVYDSYTEKYGEVTMDGVLKAAIAGDMVFAYTNPYTSATGMNILC 236 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + + N + + +K + +A + + Sbjct: 237 SMLHAFDNENPLSDNAVDQLVQYQNNAPTAAYTTSILKESASKGIIDAMVMEEQTYINTP 296 Query: 177 -------VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 D + + E + F + + + ++ + Sbjct: 297 ELKNYVYTPAGVRHDHPVYAFPWTDEDELEAAKMFVDFCLTSDSQSMATERGFNLHEDYV 356 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 S + + + + ++ S++ +++ ID Sbjct: 357 SEDYGMTGSDFLSAQKVWKTNKNGTRPTIAVFVTDISGSMNGTRIKSLKNSLLSTMQYID 416 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + + +G ++D+V + + R D + G TA DA+ Sbjct: 417 SSS---YIGLVSYSDKVYINLPIAQ-FDNKQRAYFSGAVKDLDVGGQTATYDAVLVGMQM 472 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 ++ + K +A + +L+DG E I GI + TI + + Sbjct: 473 LME-------KSKEVPDANMMLFVLSDGAQNAG-FELKRITPIVGGLGISVYTIGYEMTD 524 Query: 410 TQQEKARYF 418 + +E + Sbjct: 525 SDKEDLKAL 533 >gi|268610218|ref|ZP_06143945.1| von Willebrand factor, type A [Ruminococcus flavefaciens FD-1] Length = 565 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 52/186 (27%), Gaps = 14/186 (7%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI----- 285 M + + +S + + + S+ Sbjct: 357 NQMNDYKGDNIKVNGNSWTQMQKLWKTNKNSGKPIAAVFVLDTSGSMSGAPLNSLKASLR 416 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 I +N + +G ++ V D + + G+TA A+ Sbjct: 417 NSIKYINSSNYIGVVSYSSNVNVDLELA-KFDLNQQAYFMGAVDSLTASGNTATFSALSQ 475 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + +D N + LL+DG++ E I + I I TI + Sbjct: 476 AMIMLRDFTKD-------NPNVSPMVFLLSDGQSNSG-SEFSDIDGAIATAQIPIYTIGY 527 Query: 406 SVNKTQ 411 + N + Sbjct: 528 NANLNE 533 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 35/374 (9%), Positives = 93/374 (24%), Gaps = 23/374 (6%) Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + + + + + + + ++ P + + ++ +++ + Sbjct: 235 AKALSDEIDSSVKPAEYAQYYAKKQLSSFDIHDQVSAPERIFDEAPMAEQHNNRVALKVA 294 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQ 206 F M I+ ++ S ++ + + + F Q Sbjct: 295 FDSEMPIEHIRSPYHQINVDMAESGGAIVSLNRDAIANKDFVLTWKPIQGNEPTAAVFSQ 354 Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY-YMLYPGPLDPSLSEEHFVDSSSLR 265 + P S L M G S + Sbjct: 355 IGKTHTSQA-------SSSEASTEPQTASEKYGLVMLMPPQGAEQQPSSIHRELILVIDT 407 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + A ++ ++ + T + FN V + + Sbjct: 408 SGSMSGDAIIQAKTALKYALAGLRP---TDKFNIVQFNSDVDKWSGMAMSATPYNLAQAQ 464 Query: 326 TFAIDENEMGSTAINDAMQTAYDT---IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + G T ++ A+ A + E+ + ++ +TDG + + Sbjct: 465 NYINRLEANGGTEMSIAINAALNIETVTDKETGTELDNNDLGSNLLRQVLFITDGAVSNE 524 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + I + R+ TI F+ A ++ E+N+ Sbjct: 525 SMLFELIEAQLGDS--RLFTIGIGSAPNAH-----FMQRAAQLGRGTYTYIGKLDEVNQK 577 Query: 441 FRDRIGNEIFERVI 454 + +V Sbjct: 578 VVSLLKKIEKPQVT 591 >gi|219847012|ref|YP_002461445.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541271|gb|ACL23009.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 847 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 2/51 (3%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHA 52 + D + Y +L + G + AL++ V++G+ + VDV R E Sbjct: 1 MLDRRIRTY--RLSRRTRGQSIPLLALMIVVLIGMVALSVDVGRTFSEERR 49 >gi|330938298|gb|EGH41949.1| hypothetical protein PSYPI_05803 [Pseudomonas syringae pv. pisi str. 1704B] Length = 514 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 49/177 (27%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ + F Q + + V V + A + + + Sbjct: 72 AT----APDFANQSATRNGFTPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASAAPLAALTIRSAGVTVDSTGAA 184 >gi|320108233|ref|YP_004183823.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319926754|gb|ADV83829.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 342 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 41/166 (24%), Gaps = 8/166 (4%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F+ V ++ +L R + K + + + + Sbjct: 141 FLIKFDINVDMLADYTNSPRELKRALDKAEINTG-AGTGSVTGNGTPRG-TLLYDAVYLG 198 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF------SVNKTQ 411 H +K +V+LTDG + + A+ I I Sbjct: 199 AHDKLRQEAGRKVLVVLTDGGDQGSQLKLQDAIEAAQKANAIIYVILIADRGGFGNFSFG 258 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 LS N+ +L F + ++ T Sbjct: 259 GSGDMDRLSRETGGRMINVGNNGKKLEDAFAQISDELRTQYLVSYT 304 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 29/189 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 75 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 129 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 130 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 178 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 179 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 229 Query: 437 -LNKIFRDR 444 L F Sbjct: 230 GLIDAFGAL 238 >gi|149019069|gb|EDL77710.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] gi|149019070|gb|EDL77711.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] Length = 2827 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1201 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1254 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1255 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDIHAYNVADFES 1305 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1306 LSKIVDDL 1313 >gi|73973312|ref|XP_867438.1| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 3 [Canis familiaris] Length = 2989 Score = 45.0 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEDELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|328722084|ref|XP_001948722.2| PREDICTED: hypothetical protein LOC100159668 [Acyrthosiphon pisum] Length = 1500 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/288 (7%), Positives = 48/288 (16%), Gaps = 4/288 (1%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 +QA A + + + S+ A T + + + Sbjct: 515 QQATSAAANQQTTSAAANQQTTSAAANQQTTSASANQQTTSAAANQQTTSAAANQQTTSA 574 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 + + + ++ + + Sbjct: 575 AANQQTTSAAANQQTTSAAANQQTTSAAANQQTTSSGANQQTTSAGFNQQTTGAGSNQQT 634 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 S Q F + S Q N S G + S Sbjct: 635 TGAGSNQQTTGFGSNQQTTSEGSNQQTTNVGSNQQTTGFGSNQQTTGGGSNQQINNSGNK 694 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + S S++ + K + Sbjct: 695 PTSPMSSTLKPTTPIYTTKKPPV----STTPKPTTNKPTTSKPTTLKPTTPKPTTSKPTT 750 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + + + T K + +T Sbjct: 751 QQPTTPKPTTSQPTTQKPTTSKPTTQKPTTQKPIISSTTQKVTTTSPS 798 >gi|317502941|ref|ZP_07961033.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] gi|315665940|gb|EFV05515.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] Length = 340 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 17/216 (7%), Positives = 52/216 (24%), Gaps = 42/216 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + + + ++ ++G F + Sbjct: 97 DISNSMMAQDVIPSRLEKSKLLIEDLVDHFT-------NDKVGLVVFAGDAFVQLPITND 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + ++ T + A+ + + I+++T Sbjct: 150 YVSA-KMFLQNINPSLITTQGTDLARAISLSQSCFT-----------QREHIGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFL----- 419 DGE+ + +A +GI + + +K + L Sbjct: 198 DGEDHEGGAL--EAAREAYKKGINVFILGVGTSKGAPIPDGNGGYLKDNSGQTVLSTLNE 255 Query: 420 SNC-----ASPNSFFEANSTHELNKIFRDRIGNEIF 450 C A + ++T + + + Sbjct: 256 QMCQQVAKAGNGVYIHVDNTSDAQEKLNKELSKLQS 291 >gi|223670958|dbj|BAH22726.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 53/177 (29%), Gaps = 21/177 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 ++I + T R+ F+D + V+ + + G Sbjct: 274 QFARTIIDEFGISATPSGT-RVAVIVFSDAAQVIFNLKSNRIVDKEEAVRRLENLQFQGG 332 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TA A+Q DT+ + KK + L+TDG++ + ++ Sbjct: 333 GTATKLALQAVIDTVNPELRNNS---------KKALFLITDGKSNKGGS-PDRPAKVLRA 382 Query: 396 Q-GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNE 448 I I + L + AS ++ L K ++ I + Sbjct: 383 GFNFEIFAIGV-----SDSVDKDELKSIASEPFRTHVYQIKDYATLVK-LKELITTK 433 >gi|121534348|ref|ZP_01666172.1| hypothetical protein TcarDRAFT_1914 [Thermosinus carboxydivorans Nor1] gi|121307118|gb|EAX48036.1| hypothetical protein TcarDRAFT_1914 [Thermosinus carboxydivorans Nor1] Length = 307 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 1/47 (2%) Query: 5 TKFI-FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYE 50 ++F KL+++ G ++TAL + MLG ++VDV Y Sbjct: 2 SRFCVKLVGKLLRNQQGAVAVLTALALTAMLGFAAIVVDVGLLYYNR 48 >gi|89069886|ref|ZP_01157220.1| hypothetical protein OG2516_06277 [Oceanicola granulosus HTCC2516] gi|89044562|gb|EAR50681.1| hypothetical protein OG2516_06277 [Oceanicola granulosus HTCC2516] Length = 178 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 6/62 (9%), Positives = 20/62 (32%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + ++L + G+ + L+ P+ L + + L++ ++ Sbjct: 1 MTRQARSALRRLWRDEDGNPTVEFVLVFPLFLTLMISAFESGILMTRHMMLERGLDISVR 60 Query: 63 TA 64 Sbjct: 61 AI 62 >gi|326434685|gb|EGD80255.1| hypothetical protein PTSG_10931 [Salpingoeca sp. ATCC 50818] Length = 706 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 40/398 (10%), Positives = 105/398 (26%), Gaps = 44/398 (11%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A I A+ L +S + + + + + V + Sbjct: 5 AATIVATALLATMASALSPGQTSPSNTDEADVSGVAVDYIDIGSAAYEALNNFDTFVGTS 64 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY-HKEHGV 177 + ++S ++ LL L + Q++ + T + S + Sbjct: 65 HGIVFDKESHKL--MTQYMAQLLQALDNAIGEDIAVVRGFQSRPQVPTSAPSLLDVGRAL 122 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 +++ D S S +N + + V Sbjct: 123 RLRYKADGSASSDARLSTLAQACINAGFDFVAVPADTTTDNYVYVSTPK----MDCGDAV 178 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 L ++L S + + ++ + + +A+++ Sbjct: 179 ADLLFILDGSGSVGSGNFQTMLNFAQEVVSFFDVAPDKTRVAAMV--------------- 223 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 ++ F + + ++ F G T A+ A ++ ++ Sbjct: 224 ----YDSSNYRKFDFDYIQSVSKQQLINYFDTFAYPDGGTETGSALSFALSSMFVTSRGA 279 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + + +++TDG++ D + G+ + + S Sbjct: 280 RDLSEGVP---RVAIVITDGKSGDDVSAP---AQALRDAGVTLYAVGISG------ADVS 327 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L+ ASP F + + + ++ R Sbjct: 328 ELNQIASPP----VEDNVVFIDTFSEF--SALASKISR 359 >gi|325285570|ref|YP_004261360.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321024|gb|ADY28489.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 235 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 35/123 (28%), Gaps = 16/123 (13%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 FN+ V + RT F + G T + + A T Sbjct: 123 FNNNVTKQATEFRVASNTTRTSSNIFVQNLKASGGTNTLEGLLEALSTADVQE------- 175 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 IVL++DG + + K + I I TIAF + N Sbjct: 176 ---------IVLMSDGLPNSGPKAVLEEIKKVNTSNIIIHTIAFGEDADLDFMRTLAQEN 226 Query: 422 CAS 424 + Sbjct: 227 NGT 229 >gi|73972306|ref|XP_860269.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 3 [Canis familiaris] Length = 549 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 49/191 (25%), Gaps = 22/191 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAI 329 A + I+K+ + + G + S + + + + Sbjct: 286 FTRAKNCLRDFIEKVASYGVKPKYGLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYE 345 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A++ Y + E N I+L+TDG + +++ Sbjct: 346 DHMLKSGTNTKKALEAVYSMMN--WPGETPPADWNRTRH-VIILMTDGLYNMG-GDPVSV 401 Query: 390 CNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + ++ + R + Q+ F+ Sbjct: 402 IHNIRNFLDIGRDHKNPREDYLDVYVFGVGPLVNQENINALASKKD-KEQHVFKVKDMEN 460 Query: 437 LNKIFRDRIGN 447 L +F + Sbjct: 461 LEDVFIQMLDE 471 >gi|73972310|ref|XP_849985.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 2 [Canis familiaris] Length = 1112 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 49/191 (25%), Gaps = 22/191 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAI 329 A + I+K+ + + G + S + + + + Sbjct: 631 FTRAKNCLRDFIEKVASYGVKPKYGLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYE 690 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A++ Y + E N I+L+TDG + +++ Sbjct: 691 DHMLKSGTNTKKALEAVYSMMN--WPGETPPADWNRTRH-VIILMTDGLYNMG-GDPVSV 746 Query: 390 CNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + ++ + R + Q+ F+ Sbjct: 747 IHNIRNFLDIGRDHKNPREDYLDVYVFGVGPLVNQENINALASKKD-KEQHVFKVKDMEN 805 Query: 437 LNKIFRDRIGN 447 L +F + Sbjct: 806 LEDVFIQMLDE 816 >gi|73972308|ref|XP_532086.2| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 1 [Canis familiaris] Length = 767 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 49/191 (25%), Gaps = 22/191 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAI 329 A + I+K+ + + G + S + + + + Sbjct: 286 FTRAKNCLRDFIEKVASYGVKPKYGLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYE 345 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A++ Y + E N I+L+TDG + +++ Sbjct: 346 DHMLKSGTNTKKALEAVYSMMN--WPGETPPADWNRTRH-VIILMTDGLYNMG-GDPVSV 401 Query: 390 CNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + ++ + R + Q+ F+ Sbjct: 402 IHNIRNFLDIGRDHKNPREDYLDVYVFGVGPLVNQENINALASKKD-KEQHVFKVKDMEN 460 Query: 437 LNKIFRDRIGN 447 L +F + Sbjct: 461 LEDVFIQMLDE 471 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D ++G ++ + + + H + ++ + + T A+ A Sbjct: 34 FDIGPTATQVGVVQYSWFIRQECAL--NAHSSLASLQQAISNITVLGLGTHTGAALTFAR 91 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 +T +++ + K +V++TDG + D ++ G+ I F++ Sbjct: 92 NTALTAANGARPGV------PKIVVVMTDGASEDDVTLPS---QNLRNDGV----ITFAI 138 Query: 408 NKTQQEKARYFLSNCA-SPNSFFEANSTHELN 438 + + L + A SP+ F A L+ Sbjct: 139 SVSWSLPNDRLLQDIAGSPDRIFAATDFDALD 170 >gi|224004082|ref|XP_002295692.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209585724|gb|ACI64409.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 1007 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 44/380 (11%), Positives = 100/380 (26%), Gaps = 40/380 (10%) Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 ++L +N ++ A + + + S + + + Sbjct: 630 SSLPRNAVSMSPSEVPSFHASDAPSLFPSLKTSQPSTHPSSSPVTPNPTQRPSSRPVTSS 689 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 S +S S+ + PL Q + + Sbjct: 690 PSPSPSASPLSPTATPSLSPILSPTIPPSIEPIISNPVPTPLPSREQSQPTNMPGVVQEV 749 Query: 220 GKVGIRDEKLS-PYMVSCNKSLYYMLYPGPLDPSLSEE-HFVDSSSLRHVIKKKHLVRDA 277 + P + + + E + V KK + Sbjct: 750 TLTESPVSSEACPLSGDFSLCFAIDMSGSVCNNGRGECLGCSPLDTCNSVGVKKTMCCSN 809 Query: 278 LASVIRSIKKIDNVND----TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 S++ K + + F +V + R +K Sbjct: 810 FLSLMEFTKDVTTSFNEVATDHDFSVVHFATKVNTAIGLVS-----SRQALKVLNQLIYT 864 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG--------------EN 379 G T + + + +T S++ + K I+L+TDG + Sbjct: 865 GGMTNLASGIASGQETFSSTS--------GTSKRKNMILLITDGKLFIVYNGKRCSMYQP 916 Query: 380 TQDNEEGIA----ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + AKS+G I+ + + ++ F+ +S + + + Sbjct: 917 SLPSYNPEGNAHEAATNAKSKGTVIVPV-LMEAASDNAQSLMFMQKISSNGNVY-VSDFK 974 Query: 436 ELNKIFRDRIGNEIFERVIR 455 LN +D I +EI ++ Sbjct: 975 GLND-LKDTILSEISCGAVK 993 >gi|119614001|gb|EAW93595.1| hCG2042895 [Homo sapiens] Length = 713 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 53/145 (36%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K ++L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVVLLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TIA S + Sbjct: 173 SISEDARISGISFITIALSTVVNEA 197 >gi|51095061|gb|EAL24305.1| similar to Matn2-prov protein [Homo sapiens] Length = 651 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 53/145 (36%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K ++L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVVLLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TIA S + Sbjct: 173 SISEDARISGISFITIALSTVVNEA 197 >gi|51095062|gb|EAL24306.1| similar to Matn2-prov protein [Homo sapiens] Length = 668 Score = 45.0 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 53/145 (36%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + + K ++L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATRLLKREGR---------KDGVKVVLLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A+ GI +TIA S + Sbjct: 173 SISEDARISGISFITIALSTVVNEA 197 >gi|300776752|ref|ZP_07086610.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300502262|gb|EFK33402.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 335 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 56/191 (29%), Gaps = 39/191 (20%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + ++G F +S + + I G T MQ Sbjct: 119 MMATMKKMNNDKIGIVIFAGNAMSIMPLTTDYNSAETYISGIETSSMQIQG-TDFLKGMQ 177 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + + ++ + +VLL+DGE+ + N+ A A +G+ I ++ Sbjct: 178 AAVEKFKNVSKGS-----------RKVVLLSDGEDNEGNDN--AAIRLANKEGVSITSVG 224 Query: 405 FSVNKTQQEKA-----------------------RYFLSNCA--SPNSFFEANSTHELNK 439 ++ L A + ++ + N+ +E Sbjct: 225 IGTDEGAPVPEYVFGQLMGYKTDVNGGTVISKRQTEALKKMAESTDGTYIDGNNINEAPD 284 Query: 440 IFRDRIGNEIF 450 + + + Sbjct: 285 RIAEAVNKKSA 295 >gi|193208855|ref|NP_505147.3| C-type LECtin family member (clec-218) [Caenorhabditis elegans] gi|163644494|gb|AAY43994.2| C-type lectin protein 218 [Caenorhabditis elegans] Length = 389 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 52/161 (32%), Gaps = 10/161 (6%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + R+ ++D + S + + ST+ + + Sbjct: 71 RVGSNYPDKRGARVSVLTYSDSPTVHANLSDFKST---DELTSMIYALKP--STSYDSNL 125 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q++ + + + N I+ D + D ++ K+ G++I+T+ Sbjct: 126 QSSLKLVKNMMNYKDINAPRNNTQTVIIIYAGDYVD-YDEPTIAQFGDQLKADGVKIITV 184 Query: 404 A-FSVNKTQQEKARYFLSNCASPNSFFEANS---THELNKI 440 A S Q +L AS + F N + E+ K Sbjct: 185 ADISNTDHQHVSKLKWLKELASEGNGFNINDDYVSEEVQKA 225 >gi|296124235|ref|YP_003632013.1| hypothetical protein Plim_4003 [Planctomyces limnophilus DSM 3776] gi|296016575|gb|ADG69814.1| protein of unknown function DUF1355 [Planctomyces limnophilus DSM 3776] Length = 1023 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 53/151 (35%), Gaps = 22/151 (14%) Query: 293 DTVRMGATFFNDRVISDPSFSWG--VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D V + A +ND + +L I D T MQ D + Sbjct: 491 DEVGVLAYDYNDGEKWIFELTPAGKYEELSLLINSAEIGDMPSFQQT-----MQMGIDGL 545 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +S+ K++++++DG+ + + + + + + I T+A + Sbjct: 546 EASDASS-----------KHMIIISDGDPSPASPDLL---KRFIDAKVTISTVAVFPHGD 591 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIF 441 ++ + +++ + ++L IF Sbjct: 592 VDTPTMTSIAQI-TGGRYYKPTNPNQLPAIF 621 >gi|218680613|ref|ZP_03528510.1| TadE family protein [Rhizobium etli CIAT 894] Length = 176 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 16/131 (12%), Positives = 42/131 (32%), Gaps = 7/131 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + K ++L+ G I A+L + + +++V + A ++ Sbjct: 1 MTRKKIFAPLRRLVGDHKGAAAIEFAILALPLFIILFGIIEVSLMFFVNS----ALDASV 56 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 S + + +K + K KI + ++ F+ + + D+ + + Sbjct: 57 HKISRMIRTGE---VASSKITLAGFKAKICDDMLLTFDCSSGLVVKVNVLSDMSSAASTD 113 Query: 122 MNPRKSAYQVV 132 V Sbjct: 114 PIDNSGKLTVT 124 >gi|307544656|ref|YP_003897135.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] gi|307216680|emb|CBV41950.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] Length = 612 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 44/172 (25%), Gaps = 31/172 (18%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F +RV + + + + T I A++ A Sbjct: 78 VWTFGERVDNPLPLGEVSEQWRERALALPPALRDYQQYTDIETALEQA------------ 125 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEE--------------GIAICNKAKSQGIRIMTIA 404 ++++L+TDG ++ + +G+ + IA Sbjct: 126 --ASAEANGWRHLILMTDGVIDLPPSRGNKPGIDEASRQRLVESMAPRFADRGVVVHAIA 183 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 FS + + A S L F D I + + Sbjct: 184 FSDEADLALVEQLAQR---TGGLASVAKSPESLLGAFLDIIERIFPADQLPL 232 >gi|284031227|ref|YP_003381158.1| hypothetical protein Kfla_3298 [Kribbella flavida DSM 17836] gi|283810520|gb|ADB32359.1| hypothetical protein Kfla_3298 [Kribbella flavida DSM 17836] Length = 279 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 16/150 (10%), Positives = 38/150 (25%), Gaps = 22/150 (14%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 + G +L ++ + G+++D R + + + Sbjct: 44 NERGALSPAVVILALMIFTLAGLVIDGGRQLGAKSR----------------AVGYAQEA 87 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 +RA + + D + +V + Sbjct: 88 ARAGVGTIDFNSAQARIDVAKAGEAVGNFCAKVTENDPAVTSCATSELDPEHLKVQV--- 144 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + + FL +GI+S + EA Sbjct: 145 ---QIANKTSFLGMIGIQSLTANGEGEAHA 171 >gi|156342098|ref|XP_001620878.1| hypothetical protein NEMVEDRAFT_v1g222619 [Nematostella vectensis] gi|156206294|gb|EDO28778.1| predicted protein [Nematostella vectensis] Length = 252 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 29/87 (33%), Gaps = 6/87 (6%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T DA++ A+ +S ++++TDG + + Sbjct: 116 RSGGKTNTQDALELAFQMFTTSKYGSTPGGLAR------VLVVTDGRSNIEKHRTERKAF 169 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYF 418 K K+ GI + IA ++ Sbjct: 170 KLKANGIEVFVIAIGDYLEGMDELARM 196 >gi|114586161|ref|XP_001172548.1| PREDICTED: hypothetical protein [Pan troglodytes] Length = 543 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 16/165 (9%), Positives = 46/165 (27%), Gaps = 17/165 (10%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + + D +++G F+ + + + K+ + T A Sbjct: 121 SMVNQSNIGTDGIQIGLLQFSSIPLEEFRLNQYSSKVDIYRATFDVQQMRD--GTRTGKA 178 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + ++Y++++TDG + + + I I Sbjct: 179 LNFTL-------PFFDSSKGGRPSVQQYLIVITDGVAQDNVIIP---AKALRDKNIIIFA 228 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 I + K L + + + L + ++ + Sbjct: 229 IGVG-----EAKKSQLLEITNDEDKVYHDVNFEALQNLEKEILSK 268 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 35/99 (35%), Gaps = 10/99 (10%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + + +Y++++TDG+++ ++ G+ I I +E L Sbjct: 6 ADTSRINVARYLIVITDGKSSDS---VAEAAEGLRANGVNIYAIGI------REANIDEL 56 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A + F L I ++ + + + +K Sbjct: 57 KEIA-KDKIFFVYEFDLLKDIQKEVVQDICSSEACKNSK 94 >gi|92116019|ref|YP_575748.1| TadE-like [Nitrobacter hamburgensis X14] gi|91798913|gb|ABE61288.1| TadE-like protein [Nitrobacter hamburgensis X14] Length = 204 Score = 45.0 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 15/170 (8%), Positives = 43/170 (25%), Gaps = 8/170 (4%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + + G I A+++P+ML + +DV + + + Sbjct: 8 IRVSAAAMRRDSRGVAAIEFAMIVPLMLVMLFGTIDVSSGVAVKRK--------VTLVAR 59 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L + T + ++ ++ + V+ + Sbjct: 60 TLSDLTSQSKVVGDADITNFLAASYGIMWPYPSAPVQATISELYIDPATSVARVQWSQGS 119 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 S V + L +++ + + + Y Sbjct: 120 SPRGVGSTVSISSDLIGKDSSDKTLPGQYLIYSEVSYLYQPIVGYVMAKA 169 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|148694466|gb|EDL26413.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] gi|148694467|gb|EDL26414.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] Length = 2886 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 111/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYEDNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|255068089|ref|ZP_05319944.1| neisseria PilC protein [Neisseria sicca ATCC 29256] gi|255047687|gb|EET43151.1| neisseria PilC protein [Neisseria sicca ATCC 29256] Length = 1097 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 35/390 (8%), Positives = 104/390 (26%), Gaps = 24/390 (6%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I +++ L+ + + ++ + + + + + ++N+ + T Sbjct: 4 SIASALALMGAASQAQAQFAKTPLYLQNESQIIEQPKIKHNIMFFIDNSGSMSRNAITGG 63 Query: 121 EMNPRKSA-YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 P + + V ++ +L F + G+++ + + + K++ + Sbjct: 64 YNFPGQESRMDVTKTALNKVLDKYQDKF--NWGLQALVNTENNFSNPDPTKHTKKNYTNS 121 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + S + D+ G V + + + M + + Sbjct: 122 EETFNTSWEQMKTLVDNMGASGYTPATRRYYEVVASAVMPNIKYRCQKSYVVMMSDGDAN 181 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + + + + + K + Sbjct: 182 FSCNHDSTSFHYRFNYDGNNGRRNSNY-----PKAYLNYEADLYRTKPFITAYNYFGESP 236 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + D + + + A + + + A I ++ D V Sbjct: 237 MKYFDPYELGNTCRDKRENVHYSKFWDTANRYRGVEGGMAFFSRKLAVKDIKTAAVDRVD 296 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + DG+ D + Q ++ T+ F + +Y + Sbjct: 297 AAGVSW----------DGDPNIDPKGVDY-----SKQTVQTFTVGFGNGIS-PAGEQYLI 340 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + +F A +L K F I Sbjct: 341 RGASHDGWYFNAAKPDDLYKAFEKIISQIS 370 >gi|221111394|ref|XP_002160866.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 419 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 52/154 (33%), Gaps = 16/154 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + V + + + + + + K ++ T I+ A++ A Sbjct: 49 ISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFT----TRIDRALRLAQ 104 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAF 405 + +S + K I+LLTDG T E+ I ++ ++ G+ I+ + Sbjct: 105 KDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGI 158 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E + + + A + L Sbjct: 159 GSAVNETELSHIT----GGKKNAYTAATFDSLTD 188 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 52/154 (33%), Gaps = 16/154 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + V + + + + + + K ++ T I+ A++ A Sbjct: 265 ISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFT----TRIDRALRLAQ 320 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAF 405 + +S + K I+LLTDG T E+ I ++ ++ G+ I+ + Sbjct: 321 KDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGI 374 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E + + + A + L Sbjct: 375 GSAVNETELSHIT----GGKKNAYTAATFDSLTD 404 >gi|109088926|ref|XP_001104056.1| PREDICTED: anthrax toxin receptor-like isoform 1 [Macaca mulatta] Length = 557 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 21/168 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +RM ++ + + +++ + + G T + + A I Sbjct: 105 SSDIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQ--LRKIVPDGHTFMQAGFRKAIQQIE 162 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + N I+ +TDGE ++ + KA+ G + T+ Sbjct: 163 TFNSGNKVPS--------MIIAMTDGELVAHAFQDTLREAQKARKLGANVYTVGV----- 209 Query: 411 QQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + ++ A SP F + K RD + + + +T Sbjct: 210 -ADYKLDQITAIADSPEHVFAVENG---FKAMRDTVDALTSKVCLDVT 253 >gi|109088928|ref|XP_001104141.1| PREDICTED: anthrax toxin receptor-like isoform 2 [Macaca mulatta] Length = 564 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 21/168 (12%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + +RM ++ + + +++ + + G T + + A I Sbjct: 105 SSDIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQ--LRKIVPDGHTFMQAGFRKAIQQIE 162 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + N I+ +TDGE ++ + KA+ G + T+ Sbjct: 163 TFNSGNKVPS--------MIIAMTDGELVAHAFQDTLREAQKARKLGANVYTVGV----- 209 Query: 411 QQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + ++ A SP F + K RD + + + +T Sbjct: 210 -ADYKLDQITAIADSPEHVFAVENG---FKAMRDTVDALTSKVCLDVT 253 >gi|239906055|ref|YP_002952794.1| hypothetical protein DMR_14170 [Desulfovibrio magneticus RS-1] gi|239795919|dbj|BAH74908.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 137 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 33/98 (33%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P++L V ++D R+ + + A A A +P + Sbjct: 9 DQRGASAVEMALILPLLLTVVFAIIDYSRFFFLRSTVTAAVADATRLAVLPGTTDAMIAA 68 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 + ++ Q + + + + V Sbjct: 69 AISQALLDPINQADGQTPNVSVTPSQRSAGQPVTVTAS 106 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|296506525|ref|YP_003667759.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] gi|296327112|gb|ADH10039.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] Length = 452 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 56/187 (29%), Gaps = 24/187 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + K + A+ + + I + + +D S S G ++ Sbjct: 163 SMAGKVNGEVKMEAAKKAIYNYLDKIPD---NANVMLRVYGHKGSNNENDKSLSCGSSEV 219 Query: 320 -------IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T + A+++ D + E + Sbjct: 220 MYPLQPYNKEQFNAALSKFGPKGWTPLASAIESINDDF---------KEYTGEENLNVVY 270 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +++DGE T + + + I F V ++Q++ + A ++ Sbjct: 271 IVSDGEETCG-GDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAE--AGKGNYAT 327 Query: 431 ANSTHEL 437 +S EL Sbjct: 328 VSSADEL 334 >gi|302527162|ref|ZP_07279504.1| von Willebrand factor [Streptomyces sp. AA4] gi|302436057|gb|EFL07873.1| von Willebrand factor [Streptomyces sp. AA4] Length = 326 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 31/201 (15%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +DA N+ V +G F + + + + Sbjct: 109 TRLKAAQDAAKQF------AQNMTPGVNLGLISFAGTATVLVNPTTDRAGVTKA-----I 157 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE--G 386 + STA + + A +I S + IVL++DG+ T + Sbjct: 158 DNLKLAQSTATGEGIYAAMQSIQS---FSAVVGGADGPPPARIVLMSDGKQTVPEDLYAP 214 Query: 387 IAICNKAKSQG---IRIMTIAFSVNKTQQE---------KARYFLSNCA--SPNSFFEAN 432 A++ + I +I+F + L A S F++A Sbjct: 215 RGAYTAAQAAKQAQMPISSISFGTEHGSVDIEGKQQDVRVDDESLREIARLSGGEFYKAA 274 Query: 433 STHELNKIFRDRIGNEIFERV 453 S EL +++ D G +I + Sbjct: 275 SADELKRVYADL-GEQIGYEL 294 >gi|126322632|ref|XP_001381040.1| PREDICTED: similar to collagen, type XIV, alpha 1 [Monodelphis domestica] Length = 1892 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 33/122 (27%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + G+T A+ ++ K +L+ Sbjct: 360 NTFSTKDAVIDAVRNLPYKGGNTLTGLALNYIFE------NSFKPEAGARTGVSKIGILI 413 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ + + G+ + I + L AS + Sbjct: 414 TDGK-SQDDIIPAS--KTLRDSGVELFAIGV------KNADVAELQEIASEPDSTHVYNV 464 Query: 432 NS 433 Sbjct: 465 AE 466 >gi|46362531|gb|AAH68979.1| Slc35c2 protein [Danio rerio] Length = 1816 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 44/122 (36%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 +++ + G+T A+ A + + + ++++LL Sbjct: 631 NNFTSKEPLLEAVRNFRYKGGNTFTGQALIHALE------NNLKEEVGARPNTPQFLLLL 684 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG + ++ IA N+ K+ G+ I+ I + L AS + ++ Sbjct: 685 TDG---KSQDDAIAAANRLKNAGVEIIAIGV------KNADEAELRQVASEPLELNVYKV 735 Query: 432 NS 433 N Sbjct: 736 ND 737 >gi|24431113|gb|AAN61407.1|AF486289_1 matrilin [Biomphalaria glabrata] Length = 394 Score = 45.0 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 66/183 (36%), Gaps = 17/183 (9%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 DSSS K ++ + + D VR+ + + + F+ + Sbjct: 60 DSSSSLRNRDFKTAIKFLQEFLSQYEISSDPNG--VRVSIISYGKGIYPEIGFNLTTYDT 117 Query: 320 IRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +++ ++ G T A+Q ++ +++ K +++TDG Sbjct: 118 KDEVIEAIGRIPHKAGLRTDTGRAIQYMHEAQLANGVVRPGV-------TKVSIVITDG- 169 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHEL 437 N+Q+ + +A+ I + I + E L N A + + ++ ++L Sbjct: 170 NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSE-----LLNIAGDQSRVTKVDNYNQL 224 Query: 438 NKI 440 + I Sbjct: 225 SSI 227 >gi|306828192|ref|ZP_07461454.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] gi|304429615|gb|EFM32662.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] Length = 1036 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 29/295 (9%), Positives = 77/295 (26%), Gaps = 42/295 (14%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + I V+D S SM + E + + + + P + Sbjct: 433 PLDILVVVDRSGSMQEGIGSIEKYKYWKYKYDEYYHIWRNAGTIYFDNYLGPRYQPDTYT 492 Query: 236 CNKSLYYMLYPGPLDPS-LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 P + ++ + S+ L + V+ + Sbjct: 493 YYDYQSKESVPFGIKRDQAVKDALIGSTGLLQKFLDINPQNQLA--VVGFQGSVAYRYYD 550 Query: 295 VRMGATFFNDRVISDPSFSW-----GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + T +N + + + + ++ + T + A+ A + Sbjct: 551 EKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSLSYQDRNGTNYHAALLKADEM 610 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---------------------NEEGIA 388 + + +K +V ++DG T + Sbjct: 611 LQKVANNG---------HRKIMVFISDGVPTFYFGADNYRSGNGTVSDGNIINSQKGSKL 661 Query: 389 ICNKAKSQ--GIRIMTIAFSVNKTQQEKARY--FLSNCASPNSFFEANSTHELNK 439 ++ K++ + I ++ S + + L + + + T +L K Sbjct: 662 AIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYLSGDDYYSGITDTEQLEK 716 >gi|298711243|emb|CBJ26488.1| similar to collagen, partial [Ectocarpus siliculosus] Length = 356 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 42/133 (31%), Gaps = 21/133 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + D G T I D + + + +S ++++ Sbjct: 130 GTFYSLEDFNAFVDADVKHSGGTDIIDGIAKGRELLSASPAATS-----------FMIVT 178 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANS 433 TDG + + A+++G + + +Q+ L + F+ ++ Sbjct: 179 TDGAA----PDPQDEADAARAEGTILYAVGVGSGPSQEN-----LLAIGGDEANVFDVDN 229 Query: 434 THELNKIFRDRIG 446 EL+ D + Sbjct: 230 FEELDLALDDIVS 242 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|241672093|ref|XP_002411438.1| neurogenic locus notch, putative [Ixodes scapularis] gi|215504089|gb|EEC13583.1| neurogenic locus notch, putative [Ixodes scapularis] Length = 1597 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 33/95 (34%), Gaps = 17/95 (17%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T +A+ A + + EA + +VL++DG+ E I Sbjct: 121 SGGTRTREALGYAGEIL----------WNARQEANRIVVLISDGQANSG-SEPSEIARLL 169 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 + +GI I + + L + AS + Sbjct: 170 RVKGIVIFGVGV------AHINKDELLDVASSPAH 198 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 53/139 (38%), Gaps = 21/139 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 T + A TAI A++ AY + S + IVL+TDG Sbjct: 462 SADLTAISAAADGLTLGSGTAIYSALEAAYRYVADSAAAPADGVAPLTS----IVLMTDG 517 Query: 378 ENTQDNEEGI------AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 EN Q A+ + A+S +R T+ F + + A + + F Sbjct: 518 ENNQGTTADAFHSSYLALPDAARS--VRTFTVVFG------DARVDEMRTIADWTGGAMF 569 Query: 430 EANSTHELNKIFRDRIGNE 448 +A ++ L++ FR+ G + Sbjct: 570 DARTSS-LSEAFREIRGYQ 587 >gi|333030668|ref|ZP_08458729.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741265|gb|EGJ71747.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 342 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 57/222 (25%), Gaps = 45/222 (20%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVSPSRLAKSKLLISKLVDEL-------HNDKVGLILFAGDAFTQLPITND 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + + + E T+I A+ A + S + I+L+T Sbjct: 150 FVSA-KMFLSSISPNLIERQGTSIGKAVDLATRSFTSQ-----------EGVGRTIILIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ----------------------E 413 DGE+ + +A GI++ + + Sbjct: 198 DGEDHEPGAL--EAVKRAVDAGIQVNVMGVGSPDGAPIPVTERNGSEYRKDNEGNVVVTK 255 Query: 414 KARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A + ++T+ K I + Sbjct: 256 LNEEMAQEIAKAGNGLYVRVDNTNAAQKAITSEINKLTKTNI 297 >gi|322436659|ref|YP_004218871.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164386|gb|ADW70091.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 316 Score = 45.0 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 42/156 (26%), Gaps = 12/156 (7%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY-DTIISSNED 356 F+ V ++ ++ R+I K T T + + Sbjct: 112 FLISFDINVDLLSDYTNSPREIKRSIDKATIN---TGAGTGSVTGNSTPKGTLLYDAVYL 168 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 H +K +V+LTDG + E A+ + I + Sbjct: 169 AAHDKLRQEAGRKILVMLTDGGDQGSQETLKTATEAAQKANAIVYVILIADRGFYSGGGF 228 Query: 417 YF-----LSNCA--SPNSFFEA-NSTHELNKIFRDR 444 F + + A + N+ +L F Sbjct: 229 SFGGDRDMESLAHDTGGRVINVGNNGRKLEDAFDQI 264 >gi|332256727|ref|XP_003277467.1| PREDICTED: collagen alpha-1(VI) chain, partial [Nomascus leucogenys] Length = 1104 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 59 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVD 118 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 119 AVK-YFGKGTYTDCAIKKGLEQLL--------VGGSHLKENKYLIVVTDGHPLEGYKEPC 169 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKI 440 N+AK G+++ ++A + LS A+ +++ F A + Sbjct: 170 GGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGQSRDA 224 Query: 441 FRDRIGNEIFERV 453 + I I V Sbjct: 225 -EEVISQTIDTIV 236 >gi|148681993|gb|EDL13940.1| mCG48880 [Mus musculus] Length = 249 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 55/161 (34%), Gaps = 17/161 (10%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTII 351 T R+G ++ +V S K K + +G T A+Q A D Sbjct: 46 GTARIGIINYSHKVEKVASLKQFSSKDD---FKLVVDNMQYLGEGTYTATALQAANDMFK 102 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + KK +++TDG+ +++D ++ + A + I I Sbjct: 103 EARPG----------VKKVALVITDGQTDSRDKKKLADVVKDANDSNVEIFVIGVVKKDD 152 Query: 411 QQEKA-RYFLSNCASP-NSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ ++ + L + ++ +I Sbjct: 153 PNFEIFHKEMNLIATDAEHVYQFDDFFTLQDTLKQKLSKKI 193 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|148976671|ref|ZP_01813358.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] gi|145964022|gb|EDK29280.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] Length = 303 Score = 45.0 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 25/253 (9%), Positives = 57/253 (22%), Gaps = 11/253 (4%) Query: 21 HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE--VSSR 78 + +L V G+ +D R + L A A I A+ + + +++ Sbjct: 21 VIALPF------ILLVVGLSIDAGRAYIVKSKLFAAVDAASIAAARAVANGEDAGRAAAQ 74 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 + P N +F + + + + Sbjct: 75 KYFAANIPADFYSATPNLGDVNFAYDSFGNISIDISATAQVPTVFLPLIGLD-TFNPGVS 133 Query: 139 LLL--NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS 196 P+ L L S + + + K + D + Sbjct: 134 AQSIRRPVDLVLVIDNTTSLRLGSIGDVTQDVIDRSKSFIENFHEGFDRISLVKFAFGSE 193 Query: 197 EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE 256 N + T+KS + + + ++ + Sbjct: 194 VPVGFNATRGHSRSTIKSEIDSFNFGSTSNAQYTNASEGMYRAFNELRTVTDPANLKVIV 253 Query: 257 HFVDSSSLRHVIK 269 F D + Sbjct: 254 FFTDGAPNTFAST 266 >gi|260462605|ref|ZP_05810811.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259031511|gb|EEW32781.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 495 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 7/43 (16%), Positives = 20/43 (46%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK 54 + G I+ ++ +P ++G + +D+ R + + L+ Sbjct: 6 RAFWHDQRGIALILVSVTLPALIGFSLLAIDMSRVNNLHNDLQ 48 >gi|315644678|ref|ZP_07897809.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315279937|gb|EFU43236.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 562 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 40/121 (33%), Gaps = 16/121 (13%) Query: 305 RVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + G + L + ++ G+TA D + A + Sbjct: 431 STDVAINLPIGKYDLNQQSMFVGAINSLEASGNTATFDGIVVAMKMLHDEMA-------V 483 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 N + K I +L+DGE + I K+ I + TI ++ N L + + Sbjct: 484 NPDVKPLIFVLSDGETNVGHS-LDDIRELIKAFKIPVYTIGYNANI-------QALQSIS 535 Query: 424 S 424 S Sbjct: 536 S 536 >gi|270008950|gb|EFA05398.1| hypothetical protein TcasGA2_TC015570 [Tribolium castaneum] Length = 873 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 42/134 (31%), Gaps = 6/134 (4%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K+ D +M T I + + + + + + I+ LTDG Sbjct: 354 KSIKDDIVDMACTNIIGGLVVGLYLVR-RTLQKFYEKNIETKHQPMIIFLTDGLPNVGLI 412 Query: 385 EGIAICNKAK--SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I N +QG I +++F + + + N +EA + Sbjct: 413 IRDEITNVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQ 472 Query: 440 IFRDRIGNEIFERV 453 F + + + V Sbjct: 473 NFYRTVSSLLLRDV 486 >gi|91084773|ref|XP_972278.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 698 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 42/134 (31%), Gaps = 6/134 (4%) Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K+ D +M T I + + + + + + I+ LTDG Sbjct: 354 KSIKDDIVDMACTNIIGGLVVGLYLVR-RTLQKFYEKNIETKHQPMIIFLTDGLPNVGLI 412 Query: 385 EGIAICNKAK--SQGIR---IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I N +QG I +++F + + + N +EA + Sbjct: 413 IRDEITNVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQ 472 Query: 440 IFRDRIGNEIFERV 453 F + + + V Sbjct: 473 NFYRTVSSLLLRDV 486 >gi|148655419|ref|YP_001275624.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567529|gb|ABQ89674.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 824 Score = 44.6 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 32/303 (10%), Positives = 74/303 (24%), Gaps = 15/303 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G ++ A ++ +++G+ + VD+ + + + A + LI Sbjct: 3 QRFYHKTEGQNVVLIAGILALLIGMAALAVDLGVTYAEQRNIVRGTNAASLAGMNRLISG 62 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNF----ENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + I + + + + + I + Sbjct: 63 GRDTDVARAIYESLRSNGIPVTPPGEAPQPGDRSFEAIYLGSDGAPIPGACSRVGACGSQ 122 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY-----HKEHGVSIQWV 182 + V + L + F R G + A A + + V Q V Sbjct: 123 RPEGVKYIQIRLDGEVETYFARLFGQSQLPVGATAYASVGACATGYYPFGVRSTVGGQPV 182 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN----- 237 D + Y E + + G +R +P + Sbjct: 183 FDEYGFVN-YDGFYEDETYAQLRYQRLYLRDGNNPNGGFSLLRWRNNTPAGNAGALAEML 241 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 P D + + +A V ++ + N V Sbjct: 242 TGDGTYSQGYAEAPWPEIIQGSDGPTRPDSYPFEPNAINAGDWVYGNVFSGNPFNQQVLN 301 Query: 298 GAT 300 Sbjct: 302 QLD 304 >gi|307304370|ref|ZP_07584121.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|306902572|gb|EFN33166.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 431 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 21/184 (11%), Positives = 52/184 (28%), Gaps = 4/184 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +++ + + G +I A+ PV++G G+ V+ W + L+ AA + A+ Sbjct: 16 RRQRLARRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAA 75 Query: 66 VPLIQSLEEVSSRAKNSF----TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 V ++ + A + T R + Sbjct: 76 VRHRAGDQQSALEAAARRVAGGSGFSPGGLTVSTAPGSAGGSNKVTVELTETHPRMFSSV 135 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + ++ L L + + + E S + + Sbjct: 136 FGTGTITMKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCSVVSNSNAADAF 195 Query: 182 VIDF 185 ++ Sbjct: 196 LMKN 199 >gi|288574994|ref|ZP_06393351.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Dethiosulfovibrio peptidovorans DSM 11002] gi|288570735|gb|EFC92292.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Dethiosulfovibrio peptidovorans DSM 11002] Length = 1057 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 51/404 (12%), Positives = 96/404 (23%), Gaps = 89/404 (22%) Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +Y L L L + + + V Y ++ +ID S SML Sbjct: 6 TFKCKYPALYLVLLLIALFSCTIAMANDLPSVFKPVPEEYSTPVPPNVLLLIDTSGSMLF 65 Query: 192 Y----QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 +G S ++ + Y+ S Sbjct: 66 DLEGDTTHGDGSKPFKNQAYYGDDTDSSNNDPDDGSLSYYPPVTYLSDEEVSDLRYDTLL 125 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKK-----------HLVRDALASVIRSIKKIDNVNDTVR 296 L + + S +++KK ++ L + + K + V Sbjct: 126 GLMGRKGHRYLHPNDSRMYILKKVLWSIFTDPSMVEGLKIGLCTYHQREKYGVPGSGYVS 185 Query: 297 MGATFFND--RVISDPSFSWGVHKLIRTIVKTFAIDENE---------MG------STAI 339 + SW + + + + G T+ Sbjct: 186 YEFPSYAWLGWYWKRQKLSWQPTGENKAVKRLSLDVIDPFFYAPSSFSGGVPDDKLGTSH 245 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKY-------------------------IVLL 374 + D + +S DE+ + K +++L Sbjct: 246 WYDLLALIDGVETSKNDELRAVGATPLEKSIYSKGARDCAYEFIKEEIDYPCQDNWLIVL 305 Query: 375 TDGENTQDNEEGIAICNKA---------------KSQGIRIMTIAFSVNKTQQEKARYFL 419 TDGE++ + + A K K+Q +R I + L Sbjct: 306 TDGEDSSSDADPPAAVKKLYEANLDDTWPKPYGKKAQPVRTFVIGL------VDSQSDTL 359 Query: 420 SNCASPNSF-----------FEANSTHELNKIFRDRIGNEIFER 452 A + A T L + FR R Sbjct: 360 DAMADEGRAWEVDESIKKTAYYATDTESLLEAFRTIFRTIQKNR 403 >gi|330953032|gb|EGH53292.1| hypothetical protein PSYCIT7_16979 [Pseudomonas syringae Cit 7] Length = 677 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 49/177 (27%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ + F Q + + V V + A + + + Sbjct: 72 AT----APDFANQSATRNGFTPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASAAPLAALTIRSAAVTVDSTGAA 184 >gi|297467492|ref|XP_002705103.1| PREDICTED: polydom [Bos taurus] Length = 3396 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 24/72 (33%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + S E+ K I L+TDG + + I Sbjct: 166 GGGTYTKGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASL 213 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 214 RDFGVEIFTFGI 225 >gi|291382819|ref|XP_002708118.1| PREDICTED: polydom [Oryctolagus cuniculus] Length = 3569 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 24/72 (33%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + S E+ K I L+TDG + + I Sbjct: 164 GGGTYTKGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--IAASL 211 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 212 RDFGVEIFTFGI 223 >gi|149176499|ref|ZP_01855112.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] gi|148844612|gb|EDL58962.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] Length = 598 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 28/260 (10%), Positives = 60/260 (23%), Gaps = 17/260 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL------------KQAAQT---- 59 +S G F ++ A + +G +D+ + + + +Q Sbjct: 18 QSRRGAFMVMAAPFLVATMGFMAFGIDIAVITMTKTRMRNAVEAAALAAAQQITDAVQTT 77 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + + ++ + L + + + +D Sbjct: 78 ADGIGGSDNVSGDVQDANSIAIDTARAVAEKVARLNGVYIDPETDVEFGKRYQDSGGTFH 137 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + Y VV + R + + K A T S E I Sbjct: 138 MVWGENAKPYNVVKVTARKDNATEGQPDSRLQLFFAGFMSEKTAAVTTSAIAFIE-ARDI 196 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 V+D+S SM + + G Sbjct: 197 VLVLDYSGSMSYDSEFDAMSSYRLGKSAVEANLDDIWETLVDSGATYSDSGKLKFPATGY 256 Query: 240 LYYMLYPGPLDPSLSEEHFV 259 G S ++++ Sbjct: 257 GRINSEVGTYISSTNDDYIY 276 >gi|315081597|gb|EFT53573.1| TadE-like protein [Propionibacterium acnes HL078PA1] Length = 169 Score = 44.6 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 20/58 (34%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P +L + R S + + AAQ A SV Sbjct: 36 RGAVAVEAALILPALLMIAAAATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAV 93 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|123468942|ref|XP_001317686.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121900426|gb|EAY05463.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 36/345 (10%), Positives = 85/345 (24%), Gaps = 26/345 (7%) Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + + + + P+ ++ L + ++ F + + + Sbjct: 98 FGEDTGNGLNSFKLGNLPKDKTVEIHLKVSFLADIDENGYFFKFPLSTKYQKGIMTNEYS 157 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + + N + + K + D Sbjct: 158 DKPDSFHFSLKVNTQKELSDFKVSVDGTKNVIDSHNATFETNEAPKKDAIFIETPIKDED 217 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + ++ + G ++ + VD S + + V+ + I+S+ Sbjct: 218 KSIAVSSDGYIAISTNPSFSGKIESNSEFYFVVDCSGSMSGARIINAVK-CMRLFIQSLP 276 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 R F + + + + +MG T I +Q Sbjct: 277 LG------CRFSIIKFGTSFETVLQPCDYSDENVDKALNLLKSVNAKMGGTDILSPLQH- 329 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-IRIMTIAF 405 + K I LLTDGE + I K++ RI +I Sbjct: 330 IAGLKPQPGFV-----------KQIFLLTDGEVNNPD---ITCATALKNRNENRIFSIGL 375 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + S ++ +N+ + + I Sbjct: 376 GSGAD-PGLIKGLAKK--SGGNYIMIADEDNMNEKVITLLSSAIA 417 >gi|296209627|ref|XP_002751626.1| PREDICTED: collagen alpha-1(XXVIII) chain [Callithrix jacchus] Length = 1125 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 50/145 (34%), Gaps = 14/145 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVND---TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + + I ++ +++ A F+ V DP FS L K Sbjct: 64 FDKQKDFVDGLSDKIFRLTPRRSVEYDIKLAALQFSSSVQIDPPFSS-WKDLQTFKQKVK 122 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 +++ T A+ A + K +L+TDG + N + Sbjct: 123 SMNLI-GQGTFSYYAISNATKLLKREGRKGS---------VKVALLMTDGIDHPKNPDVQ 172 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQ 412 +I + A+ GI +TI S + Sbjct: 173 SISDDARISGISFITIGLSTVVNEA 197 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 52/159 (32%), Gaps = 15/159 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T R+G ++ +V + K + T A+Q A D + Sbjct: 834 TARIGVINYSHKVEKVANLKQFSSKDD--FKLAVDNMQYLGEGTYTATALQAANDMFKDA 891 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 KK +++TDG+ +++D E+ + + I I Sbjct: 892 RPG----------VKKVALVITDGQTDSRDKEKLTEVVKNVSDTNVEIFVIGVVKKNDPN 941 Query: 413 EKA-RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEI 449 + ++ A+ P ++ + L + ++ +I Sbjct: 942 FEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKI 980 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|313764974|gb|EFS36338.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1] gi|314916261|gb|EFS80092.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4] gi|314917531|gb|EFS81362.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1] gi|314921865|gb|EFS85696.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3] gi|314955334|gb|EFS99739.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1] gi|315102367|gb|EFT74343.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1] gi|315109816|gb|EFT81792.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2] gi|327334607|gb|EGE76318.1| putative membrane protein [Propionibacterium acnes HL097PA1] gi|327454298|gb|EGF00953.1| hypothetical protein HMPREF9581_00471 [Propionibacterium acnes HL087PA3] gi|327456363|gb|EGF03018.1| hypothetical protein HMPREF9586_00740 [Propionibacterium acnes HL083PA2] gi|328756057|gb|EGF69673.1| hypothetical protein HMPREF9579_00544 [Propionibacterium acnes HL087PA1] gi|328758902|gb|EGF72518.1| hypothetical protein HMPREF9588_00650 [Propionibacterium acnes HL025PA2] Length = 169 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 21/58 (36%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P +L + + R S + + AAQ A SV Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAV 93 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 49/400 (12%), Positives = 100/400 (25%), Gaps = 50/400 (12%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L Q + A + + Y N+ + K N + + +T + Sbjct: 224 LYQKTTVSQAFASIEPDYTTDQSGTYPSANWTIDEKSNVLNHQGNKDAGETWDGVTTWNG 283 Query: 128 AYQVVLSSRYDLLLNPLS--LFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVID 184 + + +S + LR ++ V + ++ + + V+D Sbjct: 284 DPENLTNSYIEYGGVGDEADFALRKFAKETNTPGLFDVYLNVRGNVQRQIDPIDVVLVVD 343 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 +S SM + R +E + GI+D Y+ + Y Sbjct: 344 WSGSMNEMGRITEVKKGVDRFLNQIEG----------SGIQDSVYMGYVGYSSDGNNYQN 393 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 L + + A + N Sbjct: 394 KTCQLGKFSEVKE----------TIRTMTPETAAGGTFTQ-----RGLRQAGDMLSTQNG 438 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + GV + K + TA + + +T N N Sbjct: 439 HKKVIVLLTDGVPTYSYHVSKVHTQADGSYYGTAFSLSQDQPMNTSHLYNGYFASDQYGN 498 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----L 419 + + T GE K +GI I + + Q + Sbjct: 499 SKWINNTFVATIGEAM-----------ALKERGIEIHGLGIQLQGDQTAGYTKADVENKM 547 Query: 420 SNCASPNS----FFE-ANSTHELNKIFRDRIGNEIFERVI 454 S + ++E AN ++ + ++ I V Sbjct: 548 RQMVSADEEGHLYYESANEAADIAD-YLEKKALHISATVT 586 >gi|323492793|ref|ZP_08097935.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] gi|323312864|gb|EGA65986.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] Length = 694 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 23/235 (9%), Positives = 59/235 (25%), Gaps = 25/235 (10%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + + + + + +D S K + + + + Sbjct: 283 AYRDPETSKRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMS--GKYSTLVEGVRQGLG 340 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + D R F++ V+ + T + + M Sbjct: 341 KLPAED------RFRVVMFDNNTYDLTGGFVAVNPTNVSKALQSVEQVEPSNGTNLYEGM 394 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + IVL+TDG E + +R+ T Sbjct: 395 SAAIRRLDDDRPTG-------------IVLVTDGVANVGVTEKRRFFELMEKHDVRLFT- 440 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 F + + ++ S ++ ++ + I +++ + R + Sbjct: 441 -FIMGNSANTPLLVPMTKL-SNGVATSVSNADDIIGHLMN-ITSKLTYQAYRNIQ 492 >gi|298704728|emb|CBJ28324.1| conserved unknown protein [Ectocarpus siliculosus] Length = 877 Score = 44.6 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 66/234 (28%), Gaps = 37/234 (15%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 G + + + + LD S+ + R + + + + + Sbjct: 246 NAGCPPVPDAVASPTASAADGGSEGLCVLDLQRSDPYR-SRKQSRLSLTRMDVSKQLFHA 304 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 I + D V +G F D V + G+T + Sbjct: 305 FINRQQAYD---LPVEVGLVSFGDDVDVPCEPTPLFENFR-----DEVDTLTPAGNTKLF 356 Query: 341 DAMQTAYDTIIS------------------------SNEDEVHRMKNNLEAKKYIVLLTD 376 DA+ A + ++ + + +V+L+D Sbjct: 357 DAISEACTLLEKWQTEWVEKADKRKEEENRKRKAAGGDQANHGPVPDEKRPVLRVVVLSD 416 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 G++T+ A+C + + G+ + +I K Q K A+ F Sbjct: 417 GKDTKSTISAHAVCGRLQKVGVIVDSITVGTEKNNQLKCLSL----ATGGYAFH 466 >gi|301785912|ref|XP_002928373.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Ailuropoda melanoleuca] Length = 1188 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 49/172 (28%), Gaps = 12/172 (6%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A + ++ + + + ++ + + Sbjct: 224 DPPDFQRAKDFISNMMRNFYEKCFQCSFALVQYGEVIQTEFDLRDSQDAMAS--LARVQN 281 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T AMQ D I + + A K +V+LTDG+ +D + Sbjct: 282 ITQVGKVTKTASAMQHVLDNIFT------PSHGSRKNASKVMVVLTDGDIFEDPLNLTTV 335 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 + K QG+ I + K L AS F+ + L+ Sbjct: 336 ISSPKMQGVERFAIGVGK-AFENNKTYNELKLIASDPDDRYAFKVTNYTALD 386 >gi|281346139|gb|EFB21723.1| hypothetical protein PANDA_018300 [Ailuropoda melanoleuca] Length = 1151 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 49/172 (28%), Gaps = 12/172 (6%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A + ++ + + + ++ + + Sbjct: 190 DPPDFQRAKDFISNMMRNFYEKCFQCSFALVQYGEVIQTEFDLRDSQDAMAS--LARVQN 247 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T AMQ D I + + A K +V+LTDG+ +D + Sbjct: 248 ITQVGKVTKTASAMQHVLDNIFT------PSHGSRKNASKVMVVLTDGDIFEDPLNLTTV 301 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 + K QG+ I + K L AS F+ + L+ Sbjct: 302 ISSPKMQGVERFAIGVGK-AFENNKTYNELKLIASDPDDRYAFKVTNYTALD 352 >gi|238923487|ref|YP_002937003.1| hypothetical protein EUBREC_1107 [Eubacterium rectale ATCC 33656] gi|238875162|gb|ACR74869.1| Hypothetical protein EUBREC_1107 [Eubacterium rectale ATCC 33656] Length = 1082 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 33/365 (9%), Positives = 86/365 (23%), Gaps = 16/365 (4%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 K G + +++ + V + VDV R + + AA A+ + L Sbjct: 6 FSKRERGAVSVFLVIILVPCMLVASIFVDVGRVYLSKSMAESAADMALNSLMTHYDADLN 65 Query: 74 E-----------VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + + + L ++ N L F+ D + + Sbjct: 66 DWYGMVASCQNIDEFYDASIKCYKNALKSQNLSKDEMNTLVGEFSAMIGADSKASDYLRV 125 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 D + L+S + + + K + Sbjct: 126 TDDGDDS--TTIKAVDGANLANATMLKSQIVDFMKYRAPIAITQTAIDKIKNKSIPGIDD 183 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 + S + D + + + + ++ Sbjct: 184 VLKSDENKPLVEKKQDYCKADEKLMRDSYNTYKYLFDNYSYGNPQPSNSLLTGTRDAMQT 243 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR--DALASVIRSIKKIDNVNDTVRMGAT 300 L + ++S L + + + + S + K + + Sbjct: 244 AREQYRELNKLMITNLYNTSGLVVFNRAQVPLNQYNYTKSSTKCYKAKYDGKTYYVNCHS 303 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 +D + + K A D+ + T + + + + + H Sbjct: 304 RKDDDTYYIDG-TELTKQFDDISKKIKAFDKAKNDFTNAVNNSISYSSGVTNDIQYWKHA 362 Query: 361 MKNNL 365 Sbjct: 363 ADAYS 367 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTAAEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGGAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 58/171 (33%), Gaps = 13/171 (7%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISD-PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 I +D + T R+GA F+ + + + + + T+ T A Sbjct: 424 IAALDVLPKTTRLGAWAFSSNLQKNHVDYLPLTNGEQPILDDTYRNGLIAKAHTLPGLAA 483 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNE--EGIAICNKAK----- 394 + + + + + + +V+LTDG N N + + K Sbjct: 484 KNGDTALYDTIAAAYKSVTDTYDPNYVNSVVVLTDGTNDDPNGGLALDQLLARLKSQYSA 543 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + ++I+TI+ R A+ ++ + +++ +F D Sbjct: 544 DKPVKIVTISLGTGTDPDALKRI---AKATDGLSYQTKTPEQISGVFVDAF 591 >gi|34525892|emb|CAE46626.1| trombospondin-related protein [Plasmodium falciparum] Length = 331 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + IV+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENASQLIVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I + Sbjct: 180 DRGVKIAVVGIGQGINVA 197 >gi|321460551|gb|EFX71592.1| hypothetical protein DAPPUDRAFT_326968 [Daphnia pulex] Length = 950 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 62/184 (33%), Gaps = 26/184 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + +++ + I++ N ++G F+ + Sbjct: 300 DVSGSMKEFNRIGKLGESVRAWIKTDFPSGN-----QLGMVQFSSNAEILSDLRMIADEK 354 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + +T I +Q A + IVL+TDG+N Sbjct: 355 SREEMMAKVPK-EVFVATCIGCGLQLAMQMLKDGG---------------IIVLVTDGKN 398 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF--EANSTHEL 437 + + + IR++TIA+ + + L++ S+F + +S+ L Sbjct: 399 SPGYHDISDVKKDIVDAKIRVITIAYGSEADKNVEH---LADVTGGKSYFIKDDDSSEAL 455 Query: 438 NKIF 441 + F Sbjct: 456 QQAF 459 >gi|261415412|ref|YP_003249095.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371868|gb|ACX74613.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326806|gb|ADL26007.1| BatB protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 342 Score = 44.6 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 51/180 (28%), Gaps = 39/180 (21%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F+ + + +V M T + A++ +S Sbjct: 128 DRVGLVAFSGEAQVMVPLTLDYGTV--QMVLRELNPGWLMPGTNLESAIRKGMTLFKNSG 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 H ++L++DGE + A +A GI+I TI + Sbjct: 186 GASQHS---------VMILMSDGEELEAAAVNAAK--EAAEFGIKIYTIGIGSREGVPIP 234 Query: 415 ARY-----------------------FLSNCA--SPNSFFEANSTH-ELNKIFRDRIGNE 448 + L A + +F A+ +L K+ + E Sbjct: 235 LKDKNGGSVYKKDMQGNIVTTRLEEGTLQEIANVTGALYFYASPGEFQLQKVLTEIATLE 294 >gi|221109526|ref|XP_002169886.1| PREDICTED: similar to coagulation factor C homolog, cochlin [Hydra magnipapillata] Length = 336 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 52/154 (33%), Gaps = 16/154 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + V + + + + + + K ++ T I+ A++ A Sbjct: 49 ISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFT----TRIDRALRLAQ 104 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAF 405 + +S + K I+LLTDG T E+ I ++ ++ G+ I+ + Sbjct: 105 KDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGI 158 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E + + + A + L Sbjct: 159 GSAVNETELSHIT----GGKKNAYTAATFDSLTD 188 >gi|148555257|ref|YP_001262839.1| TadE family protein [Sphingomonas wittichii RW1] gi|148500447|gb|ABQ68701.1| TadE family protein [Sphingomonas wittichii RW1] Length = 135 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 15/132 (11%), Positives = 35/132 (26%), Gaps = 3/132 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ-AAQTAIITASVPLIQS 71 +L++ G + AL+ P L + +D R ++ L++ A +A A + Sbjct: 3 RLLRDARGVTAVEFALVAPAFLMFMFLTIDGARMAWTYQTLQEVAVNSARCAA--LGVTG 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + + V + Sbjct: 61 CKTAGEVQSYAVARAAAGGVKLTAAAVTLTPAATCSSVAGMTRVTIASTYQGASTKLLPS 120 Query: 132 VLSSRYDLLLNP 143 L++ P Sbjct: 121 KLTALSTESCFP 132 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 42/137 (30%), Gaps = 16/137 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 N +R+ ++ + ++ ++++ G T + ++ A + I Sbjct: 106 TNPNLRISIITYSTEAEVILPLTSDSKEINKSLL--VLKSIVPQGLTHMQKGLRKANEQI 163 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-EEGIAICNKAKSQGIRIMTIAFSVNK 409 S I+ LTDG + + KA+ G + T+ Sbjct: 164 RKST-------LGGRIVNSVIIALTDGLLLLKPYLDTMEEAKKARRMGAIVYTVGV---- 212 Query: 410 TQQEKARYFLSNCASPN 426 ++ L N A Sbjct: 213 --FMYSKQQLVNIAGDP 227 >gi|223936328|ref|ZP_03628240.1| von Willebrand factor type A [bacterium Ellin514] gi|223894846|gb|EEF61295.1| von Willebrand factor type A [bacterium Ellin514] Length = 657 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 14/128 (10%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + T RMG F + ++I G TA+ +A+ TA +T Sbjct: 124 HRARTDRMGLVAFAGTAFLQCPLTLDDAAFSQSIDSLDTRTI-SEGGTALAEAINTARET 182 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 KN + K +VL TDGE+ + ++ KA ++G+ I TI Sbjct: 183 F-----------KNEKDNHKVLVLFTDGED--QDMGAVSAAEKAAAEGMLIFTIGIGTPD 229 Query: 410 TQQEKARY 417 + + + Sbjct: 230 GELLRIKD 237 >gi|319779653|ref|YP_004130566.1| hypothetical protein TEQUI_1510 [Taylorella equigenitalis MCE9] gi|317109677|gb|ADU92423.1| hypothetical protein TEQUI_1510 [Taylorella equigenitalis MCE9] Length = 424 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 26/316 (8%), Positives = 88/316 (27%), Gaps = 23/316 (7%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDV--VRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ L + ++ + + + + L +A ++ ++L Sbjct: 9 RKGQVLVL-GLALIAIVIISFLGM-YRNSQIISKRTKLTHVVDSAAYAGAIVQARALNMQ 66 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + T + + + + +N + + + ++ + L Sbjct: 67 AYI-NLAQTANQIALANLITQGSWTQWGQNMGKQVTKRNPQAELIKRFFGSKYSRAYLKG 125 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + +G S + K + E+ + ++ + + + D M Sbjct: 126 KRAQ-----------LGASSHIADLKNQFESHQKIINEVLFETSKAIHDTQLDMRSKAVF 174 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK---LSPYMVSCNKSLYYMLYPGPLDPS 252 Q + +N ++ + ++ ++ ++ L S Sbjct: 175 ELIQSNYTKDDLKTISWNIEDIENPEISVIYNPKGKYISFIRDVSEKYKFLQKRYFLKSS 234 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +LRH + ++ + SI + +R+ + + Sbjct: 235 FLTPIYKSCPTLRHRLIRRGETKLNKDGDWESIDTQSFHSLRLRLKVMCY----YREYPM 290 Query: 313 SWGVHKLIRTIVKTFA 328 WG + + Sbjct: 291 GWGYTNHDSSYIGESI 306 >gi|229051629|ref|ZP_04195099.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] gi|228721740|gb|EEL73214.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] Length = 452 Score = 44.6 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 57/189 (30%), Gaps = 24/189 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + K + A+ + + I + + +D S S G ++ Sbjct: 163 SMAGKVNGEVKMEAAKKAIYNYLDKIPD---NANVMLRVYGHKGSNNENDKSLSCGSSEV 219 Query: 320 -------IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G T + A+++ + + E + Sbjct: 220 MYPLQPYNKEQFNAALSKFGPKGWTPLASAIESV---------NADFKEYTGEENLNVVY 270 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +++DGE T + + + I F V ++Q++ + A ++ Sbjct: 271 IVSDGEETCG-GDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAE--AGKGNYAT 327 Query: 431 ANSTHELNK 439 ++ EL + Sbjct: 328 VSTADELYQ 336 >gi|260837292|ref|XP_002613639.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] gi|229299025|gb|EEN69648.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] Length = 240 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 49/153 (32%), Gaps = 15/153 (9%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D ++G ++ + + + + ++ + T A+ A Sbjct: 34 FDIGPTATQVGVVQYSSSPQQEFALNAHSSLV--SLQQAITNIIIIGRGTNTGSALTFAR 91 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D +++ + K +V +TDG +++D ++ G+ I + Sbjct: 92 DVALTAANGARPGL------PKIVVTMTDGASSEDVLTPS---QNLRNDGVITFAIGVTS 142 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + S + F A+ L+ I Sbjct: 143 RASDWQVEEIA----GSLDRVFTASDFDALDNI 171 >gi|314969033|gb|EFT13131.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1] Length = 169 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 21/58 (36%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P +L + + R S + + AAQ A SV Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAV 93 >gi|313225343|emb|CBY06817.1| unnamed protein product [Oikopleura dioica] Length = 321 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 66/212 (31%), Gaps = 18/212 (8%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 ++ + S Y + + + + Sbjct: 92 TASANRNQAASTPQSTYGYQIVDKSKFDPDATVIKECKRIDQTDLVILIDGSWSVTPTNF 151 Query: 276 DALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + ++ K +D +G ++D + + ++ + + Sbjct: 152 ERVKIFLSALLKHFSIGHDASMIGIAQYSDNPRLEFGLNEHYD--FPSLNAAVNRMKYKG 209 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G+TA A+ A D + + + A+K ++++TDGE+ QD + + Sbjct: 210 GNTATGKALTFALDHVFGRS--------SRPNAQKVVLIITDGESLQDTV--TEPARRLR 259 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 G+ I +I +E L + A+ Sbjct: 260 ENGVEIFSIGVGDEINLEE-----LKDMATDP 286 >gi|320105612|ref|YP_004181202.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924133|gb|ADV81208.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 335 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 51/138 (36%), Gaps = 11/138 (7%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + ++ G TA +D + ++ D++ +K + ++ +++++D Sbjct: 164 YTNSVDLLNQAIHKLRPGGGTAF-------FDALYTTCRDQMLTLKESNTVRRALIVVSD 216 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANST 434 G + Q + + + TI+ +++ T+ L A+ F N Sbjct: 217 GHDNQSRAQENDAIKMCQRAETIVYTISTNISPTKDAADE-VLRRIADATGGRVFFPNRI 275 Query: 435 HELNKIFRDRIGNEIFER 452 ++ F I E+ + Sbjct: 276 EDVANGFHS-IEEELRSQ 292 >gi|253582981|ref|ZP_04860199.1| magnesium chelatase [Fusobacterium varium ATCC 27725] gi|251835187|gb|EES63730.1| magnesium chelatase [Fusobacterium varium ATCC 27725] Length = 632 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 77/217 (35%), Gaps = 28/217 (12%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 K + + S VDSS V K+ V+ A+ S++ D R+ Sbjct: 435 KKEHIRVKVREKRTGASILFVVDSSGSMGVKKRMEAVKGAVMSLL-----KDAYEKRDRV 489 Query: 298 GATFFN-DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 G F D+ + + + + K G T + + + AY I + Sbjct: 490 GMVSFRRDKAEELLPITRSIDLAQKKLEK-----LATGGKTPLAEGIAKAYTIIKNEMR- 543 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNE------EGIAICNKAKSQGIRIMTIAFSVNKT 410 + E IV L+DG+ E + + K K++GIR + I Sbjct: 544 ------KDKEVVPLIVFLSDGKGNFSASGKDPVKESLEMAEKIKNEGIRAIVIDTEEGFI 597 Query: 411 QQEKARYFLSNCASPNSFFEANS--THELNKIFRDRI 445 + E A+ A +++ + + ++ K+ +D I Sbjct: 598 KLEMAKTLSE--AMKAEYYKLENLRSEDMLKLIKDNI 632 >gi|156396520|ref|XP_001637441.1| predicted protein [Nematostella vectensis] gi|156224553|gb|EDO45378.1| predicted protein [Nematostella vectensis] Length = 177 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 47/154 (30%), Gaps = 18/154 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+G ++ R F+ +K G T A+ A + Sbjct: 34 GPHNTRIGIVRYSTRPSGIFRFTSYRNKHSTKHRVNRIR--YTGGWTRTGAAINYARRYL 91 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ +K ++++TDG + + + K GI + I Sbjct: 92 --------YQHNRRRGVRKVLIVMTDG---KSQDSVVGASRSVKRMGIEVFAIGIGRGYR 140 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + E L+ A+ + +L+KI Sbjct: 141 RSE-----LNQMATDRNHVLTARFRDLHKIIGKI 169 >gi|50841536|ref|YP_054763.1| hypothetical protein PPA0046 [Propionibacterium acnes KPA171202] gi|289424429|ref|ZP_06426212.1| conserved hypothetical protein [Propionibacterium acnes SK187] gi|289427366|ref|ZP_06429079.1| conserved hypothetical protein [Propionibacterium acnes J165] gi|295129575|ref|YP_003580238.1| hypothetical protein HMPREF0675_3046 [Propionibacterium acnes SK137] gi|50839138|gb|AAT81805.1| putative membrane protein [Propionibacterium acnes KPA171202] gi|289155126|gb|EFD03808.1| conserved hypothetical protein [Propionibacterium acnes SK187] gi|289159296|gb|EFD07487.1| conserved hypothetical protein [Propionibacterium acnes J165] gi|291376464|gb|ADE00319.1| conserved hypothetical protein [Propionibacterium acnes SK137] gi|313771156|gb|EFS37122.1| conserved hypothetical protein [Propionibacterium acnes HL074PA1] gi|313792519|gb|EFS40605.1| conserved hypothetical protein [Propionibacterium acnes HL110PA1] gi|313803520|gb|EFS44702.1| conserved hypothetical protein [Propionibacterium acnes HL110PA2] gi|313806906|gb|EFS45404.1| conserved hypothetical protein [Propionibacterium acnes HL087PA2] gi|313811818|gb|EFS49532.1| conserved hypothetical protein [Propionibacterium acnes HL083PA1] gi|313817689|gb|EFS55403.1| conserved hypothetical protein [Propionibacterium acnes HL046PA2] gi|313821484|gb|EFS59198.1| conserved hypothetical protein [Propionibacterium acnes HL036PA1] gi|313824571|gb|EFS62285.1| conserved hypothetical protein [Propionibacterium acnes HL036PA2] gi|313826242|gb|EFS63956.1| conserved hypothetical protein [Propionibacterium acnes HL063PA1] gi|313832354|gb|EFS70068.1| conserved hypothetical protein [Propionibacterium acnes HL007PA1] gi|313832813|gb|EFS70527.1| conserved hypothetical protein [Propionibacterium acnes HL056PA1] gi|313839673|gb|EFS77387.1| conserved hypothetical protein [Propionibacterium acnes HL086PA1] gi|314926286|gb|EFS90117.1| conserved hypothetical protein [Propionibacterium acnes HL036PA3] gi|314961709|gb|EFT05810.1| conserved hypothetical protein [Propionibacterium acnes HL002PA2] gi|314964231|gb|EFT08331.1| conserved hypothetical protein [Propionibacterium acnes HL082PA1] gi|314975246|gb|EFT19341.1| conserved hypothetical protein [Propionibacterium acnes HL053PA1] gi|314977661|gb|EFT21756.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1] gi|314980207|gb|EFT24301.1| conserved hypothetical protein [Propionibacterium acnes HL072PA2] gi|314985154|gb|EFT29246.1| conserved hypothetical protein [Propionibacterium acnes HL005PA1] gi|314987063|gb|EFT31155.1| conserved hypothetical protein [Propionibacterium acnes HL005PA2] gi|314990444|gb|EFT34535.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3] gi|315078862|gb|EFT50880.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2] gi|315083131|gb|EFT55107.1| conserved hypothetical protein [Propionibacterium acnes HL027PA2] gi|315086659|gb|EFT58635.1| conserved hypothetical protein [Propionibacterium acnes HL002PA3] gi|315088063|gb|EFT60039.1| conserved hypothetical protein [Propionibacterium acnes HL072PA1] gi|315097114|gb|EFT69090.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1] gi|315107450|gb|EFT79426.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1] gi|327332549|gb|EGE74284.1| putative membrane protein [Propionibacterium acnes HL096PA2] gi|327333722|gb|EGE75439.1| putative membrane protein [Propionibacterium acnes HL096PA3] gi|327444419|gb|EGE91073.1| hypothetical protein HMPREF9568_01683 [Propionibacterium acnes HL013PA2] gi|327446672|gb|EGE93326.1| hypothetical protein HMPREF9571_01238 [Propionibacterium acnes HL043PA2] gi|327448885|gb|EGE95539.1| hypothetical protein HMPREF9570_00450 [Propionibacterium acnes HL043PA1] gi|327457365|gb|EGF04020.1| hypothetical protein HMPREF9584_00563 [Propionibacterium acnes HL092PA1] gi|328757926|gb|EGF71542.1| hypothetical protein HMPREF9563_00560 [Propionibacterium acnes HL020PA1] gi|328759748|gb|EGF73344.1| putative membrane protein [Propionibacterium acnes HL099PA1] gi|332674444|gb|AEE71260.1| hypothetical protein PAZ_c00470 [Propionibacterium acnes 266] Length = 169 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 21/58 (36%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P +L + + R S + + AAQ A SV Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAV 93 >gi|296135239|ref|YP_003642481.1| von Willebrand factor type A [Thiomonas intermedia K12] gi|295795361|gb|ADG30151.1| von Willebrand factor type A [Thiomonas intermedia K12] Length = 336 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 36/147 (24%), Gaps = 24/147 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F + ++ + TA+ DA+ Sbjct: 143 DRFALVAFGSHAATLLPPTFDARAAGQMAGLLAVGQLGPD--TALGDAIALGLR------ 194 Query: 355 EDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 + K ++L TDG ++ A+ G+RI T+ Sbjct: 195 -----QAGALHGLKPVLILYTDGGQSNTGAISPADAVALARHLGVRIYTVEVGTTPDPGR 249 Query: 413 -------EKARYFLSNC--ASPNSFFE 430 + L A+ F+ Sbjct: 250 PYTVPAYAGPQPDLRLIAEATGGRFYF 276 >gi|239613390|gb|EEQ90377.1| U-box domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 766 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 26/228 (11%), Positives = 57/228 (25%), Gaps = 35/228 (15%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + + + L + A ++I ++ D R+G F Sbjct: 81 DISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDND------RLGVVAF 134 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S + +K + ST + ++ + + + + Sbjct: 135 STDAEVVYKISNMNEDNKKAALKA-VEALWPLSSTNLWHGLKLSLEALEEVTPIPQNV-- 191 Query: 363 NNLEAKKYIVLLTDG-----------ENTQDNEEGIAICNKA--KSQGIRIMTIAFSVNK 409 + + +LTDG N + +KA K + I T F Sbjct: 192 ------QALYILTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGYYI 245 Query: 410 TQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 L ++ + +F I N + Sbjct: 246 -----RSGLLQAISEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFATQ 288 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 43/137 (31%), Gaps = 16/137 (11%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 N +R+ ++ + ++ ++++ + G T + ++ A + I Sbjct: 106 TNPNLRISIITYSTEAEVILPLTSDSKEINKSLL--VLKNIVPQGLTHMQKGLRKANEQI 163 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN-EEGIAICNKAKSQGIRIMTIAFSVNK 409 S I+ LTDG + + KA+ G + T+ Sbjct: 164 RKST-------LGGRIVNSVIIALTDGLLLLKPYLDTMEEAKKARRMGAIVYTVGV---- 212 Query: 410 TQQEKARYFLSNCASPN 426 ++ L N A Sbjct: 213 --FMYSKQQLVNIAGDP 227 >gi|118096709|ref|XP_001233876.1| PREDICTED: similar to tumor suppressor candidate 4 [Gallus gallus] Length = 1208 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 60/195 (30%), Gaps = 20/195 (10%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 YP + ++ D I+ +D + V S ++ D Sbjct: 56 YPATPWRAPNKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLD 115 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + + + K G+T + A+D + +SN Sbjct: 116 TLS-----DDDYVNVASKVFKEDVQGMVVKGTTDYKAGFEYAFDQLQNSN-------ITR 163 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K I++ TDG + + K ++ +R+ T FSV + + C Sbjct: 164 ANCNKMIMMFTDG----GEDRVQDVFEKYNWPNKTVRVFT--FSVGQHNYDVTPLQWMAC 217 Query: 423 ASPNSFFEANSTHEL 437 A+ +FE S + Sbjct: 218 ANKGYYFEIPSIGAI 232 >gi|66805993|ref|XP_636718.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] gi|60465117|gb|EAL63216.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] Length = 549 Score = 44.6 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 50/144 (34%), Gaps = 11/144 (7%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F ++ + S + + + A GST + +A+ A + I Sbjct: 177 LVLFGQKIDTAFEISKNFDSFSQELGEVVANQ----GSTRLYEAIYHAANEI--EKYRNN 230 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + K + I LLTDG++T N + K I + I + Sbjct: 231 PKEKLAPDVCCRIFLLTDGQDTS-NINPYNVYQYLKPLNIILDAIPIGRDDN--STLSTL 287 Query: 419 LSNCASPNSFFEANSTHELNKIFR 442 A+ S F ANST E ++F Sbjct: 288 TK--ATGGSCFMANSTQEGVELFE 309 >gi|313207256|ref|YP_004046433.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446572|gb|ADQ82927.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023480|gb|EFT36486.1| BatB [Riemerella anatipestifer RA-YM] gi|325335297|gb|ADZ11571.1| von Willebrand factor type A [Riemerella anatipestifer RA-GD] Length = 335 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 54/157 (34%), Gaps = 21/157 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + ++ + L ++ + S ++ + R+G F S + Sbjct: 97 DISNSMNAQDVAPDRLSLAKNIVISSMQKMT-------NDRVGLAVFAGEAFSVMPLTTD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + G T AMQ A + ++ IVL++ Sbjct: 150 YLAAESFVSGLETSVVSTQG-TDFYKAMQVAVSKFKAVSKGSGR-----------IVLIS 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 DGE+ + NE A +A+S GI+++T+ + Sbjct: 198 DGEDNEGNE--AAAIKEAQSNGIQVITVGVGTEEGAP 232 >gi|241191500|ref|YP_002968894.1| hypothetical protein Balac_1485 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241196905|ref|YP_002970460.1| hypothetical protein Balat_1485 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|240249892|gb|ACS46832.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis Bl-04] gi|240251459|gb|ACS48398.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295794492|gb|ADG34027.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis V9] Length = 671 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 60/186 (32%), Gaps = 38/186 (20%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 NDT +N + S +W L + + G+T + +Q Sbjct: 40 NSDKIGNDTYNEDGYNYN-YSQTVHSLAWTPEDLQK--EQAAVNSLKAGGATRADFGLQH 96 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE-------EGIAICNKAKSQGI 398 A + S A+K V +DG T + I + K+ Sbjct: 97 AVKQLNSGRPG----------AQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHS 146 Query: 399 RIMTIAF--SVNKTQQEKARYFLSNCAS---------------PNSFFEANST-HELNKI 440 ++++I + + + A F++ +S +++ A S +L I Sbjct: 147 QVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTI 206 Query: 441 FRDRIG 446 F++ I Sbjct: 207 FKEIIS 212 >gi|167738778|ref|ZP_02411552.1| hypothetical protein Bpse14_11973 [Burkholderia pseudomallei 14] Length = 578 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 46/164 (28%), Gaps = 5/164 (3%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G F ++ A+ M V + G VD+ + L++ A A + + + ++ + Sbjct: 1 GSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQCAQPN 56 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + + L D + P +A QV + Sbjct: 57 AAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTATQSVPY 115 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 S + + + T S ++ + Sbjct: 116 FFLGPSRTVSATSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 159 >gi|254508395|ref|ZP_05120516.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] gi|219548708|gb|EED25712.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] Length = 696 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 22/215 (10%), Positives = 51/215 (23%), Gaps = 24/215 (11%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + S + + + +D S K + + + + Sbjct: 283 AYRDPESSKRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMS--GKYSTLVEGVRQGLG 340 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + D R F+ V++ + T + + M Sbjct: 341 KLPSED------RFRVVMFDSNTYDLTGGFVAVNQSNISKALQAVEQVEPSNGTNLYEGM 394 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + IVL+TDG E + +R+ T Sbjct: 395 AAAIRKLDDDRPTG-------------IVLVTDGVANVGVTEKRRFFELMEKHDVRLFT- 440 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 F + + ++ S ++ ++ Sbjct: 441 -FIMGNSANTPLLVPMTKL-SNGVATSVSNADDII 473 >gi|56477525|ref|YP_159114.1| Flp pilus assembly protein TadG [Aromatoleum aromaticum EbN1] gi|56313568|emb|CAI08213.1| hypothetical protein, similarity to Flp pilus assembly protein TadG [Aromatoleum aromaticum EbN1] Length = 148 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 13/149 (8%), Positives = 44/149 (29%), Gaps = 2/149 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G + A L+ ++ +G + + R Y + L +A + A + E Sbjct: 1 MRHMRGVVAVEFAFLLIPLVMLGFGITEFGRAIYSYNTLAKAVRDAARHLTAKTPGDPVE 60 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + L + + ++ + T + V + Sbjct: 61 HAIAKCMAVHGNPDCNGPALAPDLKTSMVQ--TCDTILSCPGVENTVTTGTGMINTVTVR 118 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + + + + + + +++ Sbjct: 119 ISGYPYNSVVEYVMPDITFNNIAVTMRSQ 147 >gi|146292839|ref|YP_001183263.1| cell wall anchor domain-containing protein [Shewanella putrefaciens CN-32] gi|145564529|gb|ABP75464.1| LPXTG-motif cell wall anchor domain [Shewanella putrefaciens CN-32] Length = 757 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 49/407 (12%), Positives = 106/407 (26%), Gaps = 36/407 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + + +V++ A N + +E + + R I Sbjct: 165 RASLVSQERPNMFTTDVANLAPNEQLVVEISYQENIKYEDGLFSLRFPLVVAPRYIPGFI 224 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + +A ++S + LR K + +++ K S Sbjct: 225 HDKASANDNAANSRVTSSEVFDAERIVAPLRRANSKQDPVLNANIEVRLAKGVDKSTIAS 284 Query: 179 IQ--WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS----SQNGKVGIRDEKLSPY 232 +D + + N R + S N + + S Sbjct: 285 PYHQIKLDEPHHGIINVSLTNSVVANRDFVLQWRAKQGMSPMALVFNQQGKTHGDGASED 344 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI---- 288 VS N+ + P ++EH + + +I I K Sbjct: 345 NVSENRQSDDHYSLVMVLPPKTDEHALSTLPRE-LILVIDTSGSMAGDSIVQAKSALLYA 403 Query: 289 -DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST----AINDAM 343 + + FN + S ++ + F G T A+N A+ Sbjct: 404 LNGLKAEDSFNIIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMSLALNAAL 463 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + S++ ++ +TDG + I + R+ T+ Sbjct: 464 PRGINRLSESSQSLRQ-----------VIFMTDGSVGNEQALFDLIRYQIGES--RLFTV 510 Query: 404 AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 F+ A +F + E+ + + Sbjct: 511 GIGSAPNSH-----FMQRAAELGRGTFTYIGNVDEVEQKISKLLSKI 552 >gi|67078187|ref|YP_245807.1| D-amino acid dehydrogenase, large subunit [Bacillus cereus E33L] gi|66970493|gb|AAY60469.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 452 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 54/179 (30%), Gaps = 24/179 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-------IRT 322 K + A+ + + I + + +D S S G ++ + Sbjct: 173 KMEAAKKAIYNYLDKIPD---NANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKKE 229 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T + A+++ D + E + +++DGE T Sbjct: 230 QFNAALSKFGPKGWTPLASAIESVNDDF---------KEYTGEENLNVVYIVSDGEETCG 280 Query: 383 NEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + + + I F V ++Q++ A ++ ++ EL + Sbjct: 281 -GDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLMNTAE--AGKGNYATVSNADELYQ 336 >gi|120599090|ref|YP_963664.1| cell wall anchor domain-containing protein [Shewanella sp. W3-18-1] gi|120559183|gb|ABM25110.1| LPXTG-motif cell wall anchor domain [Shewanella sp. W3-18-1] Length = 757 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 49/407 (12%), Positives = 106/407 (26%), Gaps = 36/407 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + + +V++ A N + +E + + R I Sbjct: 165 RASLVSQERPNMFTTDVANLAPNEQLVVEISYQENIKYEDGLFSLRFPLVVAPRYIPGFI 224 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + +A ++S + LR K + +++ K S Sbjct: 225 HDKASANDNAANSRVTSSEVFDAERIVAPLRRANSKQDPVLNANIEVRLAKGVDKSTIAS 284 Query: 179 IQ--WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS----SQNGKVGIRDEKLSPY 232 +D + + N R + S N + + S Sbjct: 285 PYHQIKLDEPHHGIINVSLTNSVVANRDFVLQWRAKQGMSPMALVFNQQGKTHGDGASED 344 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI---- 288 VS N+ + P ++EH + + +I I K Sbjct: 345 NVSENRQSDDHYSLVMVLPPKTDEHALSTLPRE-LILVIDTSGSMAGDSIVQAKSALLYA 403 Query: 289 -DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST----AINDAM 343 + + FN + S ++ + F G T A+N A+ Sbjct: 404 LNGLKAEDSFNIIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMALALNAAL 463 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + S++ ++ +TDG + I + R+ T+ Sbjct: 464 PRGINRLSESSQSLRQ-----------VIFMTDGSVGNEQALFDLIRYQIGES--RLFTV 510 Query: 404 AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 F+ A +F + E+ + + Sbjct: 511 GIGSAPNSH-----FMQRAAELGRGTFTYIGNVDEVEQKISKLLSKI 552 >gi|19554206|ref|NP_602208.1| hypothetical protein NCgl2910 [Corynebacterium glutamicum ATCC 13032] gi|62391861|ref|YP_227263.1| hypothetical protein cg3341 [Corynebacterium glutamicum ATCC 13032] gi|21325786|dbj|BAC00407.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41223008|emb|CAF18953.1| putative membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 501 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 32/81 (39%), Gaps = 4/81 (4%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL++DG T + + QG+ I T+ F+V+++ + + A ++ Sbjct: 9 IVLVSDGIATCTPPPVCEVAAELADQGVDLVINTVGFNVDESARAELECIAQ--AGNGTY 66 Query: 429 FEANSTHELNKIFRDRIGNEI 449 +A+ L + Sbjct: 67 ADASDADSLVAELKRAATRTA 87 >gi|260062899|ref|YP_003195979.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] gi|88784467|gb|EAR15637.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] Length = 378 Score = 44.6 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 58/186 (31%), Gaps = 16/186 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFN--DRVISDPSFSWGVHKLIRTIVKTFA 328 ++ A AS I ++ ++ +M +F+ D + + +LI + Sbjct: 156 LTELKSASASFIDNVMPAVP-AESFQMAIYWFDGEDVLHELNPLTSSREELIAAVESID- 213 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D + ST + A+ + D D + IVL TDG + Sbjct: 214 SDFSNDPSTDLYGAVIKSTDLATDLLRDSEQNNTIGAAS---IVLFTDGTDQASRYSESQ 270 Query: 389 ICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 +K + I TI L S F A + EL F Sbjct: 271 ALDKVEKANSNISFFTIGLGAEID-----SEVLEEIGKTFSVF-AGNKEELEVTFNQL-S 323 Query: 447 NEIFER 452 ++ ER Sbjct: 324 QKVSER 329 >gi|303241714|ref|ZP_07328211.1| hypothetical protein AceceDRAFT_3559 [Acetivibrio cellulolyticus CD2] gi|302590715|gb|EFL60466.1| hypothetical protein AceceDRAFT_3559 [Acetivibrio cellulolyticus CD2] Length = 318 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 16/135 (11%), Positives = 42/135 (31%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 L+K+ +G +I +++ + + D+ + + + + + LI Sbjct: 9 NLLKNRSGSTSVIFLCSAVIIILLSAIATDIGYIAIERYKMDRVLDQIAKIGATALIVDK 68 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + K + I + I +NN + + D + + + +V Sbjct: 69 NECTKVIKENAVKRIDNITKLEINVSDNNREMSINIERKLDYIFLKYIGFKDKNINSRVT 128 Query: 133 LSSRYDLLLNPLSLF 147 + F Sbjct: 129 AKVSNVTSFKGIRPF 143 >gi|301768895|ref|XP_002919864.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Ailuropoda melanoleuca] Length = 1127 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 53/146 (36%), Gaps = 16/146 (10%) Query: 271 KHLVRDALASVIRSIKKIDNV---NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ + ++ V +++ A F+ V DPSFS K Sbjct: 64 FDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSSWKDL---QTFKQR 120 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 N +G T A+ A + + K +L+TDG + N + Sbjct: 121 VKSMNFIGQGTFSYYAISNATGLLKREGR---------KDGVKVALLMTDGIDHPKNPDV 171 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A++ GI +TI S + Sbjct: 172 QSISEDARTAGILFITIGLSTVVNEA 197 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 48/154 (31%), Gaps = 19/154 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSN 354 R+G ++ +V + + K +G T A+ A ++ Sbjct: 836 RVGIINYSHKVEKVAHLTQFS---TKDDFKLAVDRMQYLGEGTYTASALHEANHMFEAAR 892 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 KK +++TDG+ +T+D + + KA + I I Sbjct: 893 PG----------VKKVALVITDGQTDTRDEKNLTEVVKKASDINVEIFVIGVVKKNDPNF 942 Query: 413 EKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 E ++ A+ ++ + L + Sbjct: 943 EVFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 976 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 53/146 (36%), Gaps = 16/146 (10%) Query: 271 KHLVRDALASVIRSIKKIDNV---NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 +D + S+ + ++ V +++ A F+ V DPSFS K Sbjct: 60 FDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSSWKDL---QTFKQR 116 Query: 328 AIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 N +G T A+ A + + K +L+TDG + N + Sbjct: 117 VKSMNFIGQGTFSYYAISNATGLLKREGR---------KDGVKVALLMTDGIDHPKNPDV 167 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ 412 +I A++ GI +TI S + Sbjct: 168 QSISEDARTAGILFITIGLSTVVNEA 193 Score = 40.7 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 43/134 (32%), Gaps = 16/134 (11%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSN 354 R+G ++ +V + + K +G T A+ A ++ Sbjct: 665 RVGIINYSHKVEKVAHLTQFS---TKDDFKLAVDRMQYLGEGTYTASALHEANHMFEAAR 721 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 KK +++TDG+ +T+D + + KA + I I Sbjct: 722 PG----------VKKVALVITDGQTDTRDEKNLTEVVKKASDINVEIFVIGVVKKNDPNF 771 Query: 413 EKARYFLSNCASPN 426 E ++ A+ Sbjct: 772 EVFHKEMNLIATDP 785 >gi|1399245|gb|AAB03226.1| integrin alpha-M [Rattus norvegicus] Length = 205 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 41/134 (30%), Gaps = 10/134 (7%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F++ K G T ++ + A K Sbjct: 26 THFTFNXFKRNPDPKSHVRPIRQLNGRTKTASGIRKVVRELFQKINGAR------DNAAK 79 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PN 426 +V++TDGE D + +A+ GI I + + R L AS + Sbjct: 80 ILVVITDGEKFGDPLNYEDVIPEAEEAGIIRYVIGVXNAFHKPQSRRE-LDTIASKPAGD 138 Query: 427 SFFEANSTHELNKI 440 F+ ++ LN I Sbjct: 139 HVFQVDNFEALNTI 152 >gi|294339336|emb|CAZ87692.1| putative Von Willebrand factor, type A [Thiomonas sp. 3As] Length = 336 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 37/147 (25%), Gaps = 24/147 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F + ++ + TA+ DA+ A Sbjct: 143 DRFALVAFGSHAATLLPPTFDARAAGQMAGLLAVGQLGPD--TALGDAIALALR------ 194 Query: 355 EDEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ- 412 + K ++L TDG ++ A+ G+RI T+ Sbjct: 195 -----QAGALHGLKPVLILYTDGGQSNTGAISPADAVALARHMGVRIYTVEVGTTPDPGR 249 Query: 413 -------EKARYFLSNC--ASPNSFFE 430 + L A+ F+ Sbjct: 250 PYTVPAYAGPQPDLRLIAEATGGRFYF 276 >gi|291225695|ref|XP_002732834.1| PREDICTED: MUscle Positioning family member (mup-4)-like [Saccoglossus kowalevskii] Length = 317 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 44/125 (35%), Gaps = 18/125 (14%) Query: 316 VHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 +T +K+ + E GST A+ A D S + + K + L+ Sbjct: 175 SSSASKTCIKSEIDKIDFEGGSTHTTKALVKARDEAFKSFH------GSRTNSHKVLFLV 228 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-----SFF 429 TDG + N + N K+ + I + + + + E L + S ++ Sbjct: 229 TDGRSN-GNGPLVETANSLKNDDVEIYALGVTSDVVEAE-----LRSIVSDPIPDHLFYY 282 Query: 430 EANST 434 + + Sbjct: 283 DTFNA 287 >gi|198422516|ref|XP_002123195.1| PREDICTED: similar to EGF-like domain-containing protein [Ciona intestinalis] Length = 2053 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 58/173 (33%), Gaps = 16/173 (9%) Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 CN + P ++ V SS ++ +S+ +S K D D V Sbjct: 1508 CNFNEKPCQAPQACHIRKADVAIVVDSSSSVKYNNFQKMKSFTSSLFQSFKLGD---DRV 1564 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSN 354 R +N RV F+ + V+ G T A+Q D + + Sbjct: 1565 RAALVRYNRRVDQRFGFA---DTNTKEEVQDGVDAMPYRGGGTLTGQALQHVIDNTLKTE 1621 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 +A+K ++ +TDG +QD+ + ++ G+ I Sbjct: 1622 NGAR------DDAQKIVITITDGR-SQDDVLTPS--RALRAAGVITFGIGIGA 1665 >gi|298705152|emb|CBJ28595.1| von Willebrand A domain containing protein [Ectocarpus siliculosus] Length = 382 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 16/133 (12%), Positives = 39/133 (29%), Gaps = 21/133 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + D G T I + + + +S ++++ Sbjct: 156 GTFYSLEDFNAFVDADTEYSGGTDIIAGIAKGRELLSAS-----------PTTTSFMIVT 204 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANS 433 TDG + + A+++G + + + L + F+ ++ Sbjct: 205 TDGAA----PDPQDEADAARAEGTILYAVGVGSGPS-----EEILLAIGGDEANVFDIDN 255 Query: 434 THELNKIFRDRIG 446 EL+ D + Sbjct: 256 FDELDVALDDIVS 268 >gi|103486591|ref|YP_616152.1| hypothetical protein Sala_1102 [Sphingopyxis alaskensis RB2256] gi|98976668|gb|ABF52819.1| conserved hypothetical protein [Sphingopyxis alaskensis RB2256] Length = 542 Score = 44.6 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 22/58 (37%), Gaps = 4/58 (6%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 M ++ LI+ I A+ MP+++G + VDV L+ A Sbjct: 1 MAIRSRLAS----LIRCRRAGISIAAAIGMPMLIGAAALAVDVGSLYLDRRKLQGIAD 54 >gi|326675078|ref|XP_692457.5| PREDICTED: collagen alpha-6(VI) chain [Danio rerio] Length = 1605 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 39/121 (32%), Gaps = 14/121 (11%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTII 351 D+V +G F++ + ++ G T A+ Sbjct: 233 DSVHVGVVQFSNNPQEQFPL---NRYFDQNELEEAIDGIEQLTGDTYTGKALSFISKYFD 289 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 +SN + +++V++TDGE + +G+ I +I + T Sbjct: 290 ASN-------GGRPDVPQFLVVITDGEAHDAVAVP---AKAIRDKGVTIFSIGVASVNTT 339 Query: 412 Q 412 Q Sbjct: 340 Q 340 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 66/183 (36%), Gaps = 22/183 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + + A+ + I + +G F++R I + + Sbjct: 312 DKSGSMAAMDRLNRMNQAVKLFLLQITE-----KGSWVGIVLFDERAIIRNPLIQIISED 366 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + T + G T+I +Q A+ I + IVLLTDGE+ Sbjct: 367 DRNYLMTRLPE-AAGGGTSICSGVQAAFQAIKQKFQTTDGSE---------IVLLTDGED 416 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 ++ C + K G I T+A + Q+ + ++ S EA + + L Sbjct: 417 -----VTVSSCFEEVKQSGATIHTVALGTSAAQELERLSDMTGGISTAPSDEAQN-NGLI 470 Query: 439 KIF 441 F Sbjct: 471 DAF 473 >gi|21226477|ref|NP_632399.1| hypothetical protein MM_0375 [Methanosarcina mazei Go1] gi|20904741|gb|AAM30071.1| conserved protein [Methanosarcina mazei Go1] Length = 829 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 41/381 (10%), Positives = 101/381 (26%), Gaps = 34/381 (8%) Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + S +E ++ + + R + + + ++ S DL Sbjct: 186 QASEQSVSYFLEVFVDGKISQSRQFTQDTRNNTIPINQAFTTLGAHNISVKISDISGGDL 245 Query: 140 LLNPLSLF--LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 F + + + T +++ + VS+ + ++ D + Sbjct: 246 KEINNEFFKSIYVIPKPEITLVTNETGSPLAKVLSNLYEVSVANGYPGAGNITDSKVLVL 305 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM------LYPGPLDP 251 A+ NG I Y + P Sbjct: 306 DNQYINSFSDAEIKEIKNYVSNGGGLIVVGGDRAYNYGNYLNSSLEEVLPVISKPSEFRG 365 Query: 252 SLSEEHFVDSSSLR-HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + +D S + ++ +A+ + + DN+ D F N Sbjct: 366 GRNLILLLDVSPSTVAHKTQGDILSNAVRIL-----EEDNLKDANVAVIAFGNAAYDVSG 420 Query: 311 SFSWGVHKLIRTIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 F + I+K T+++ + A + S + + Sbjct: 421 GFVFMGLPHNVAILKEKVSKLTPTNETQTSLDQGLGIAKQMLESE------------DGE 468 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE------KARYFLSNC 422 ++++DG + + + I + K G+ + I + Q A + Sbjct: 469 LDAIVISDGGIEESYGQSVQIAQELKDMGVNLYFIHIRSSAPSQADRTRNYYAEMLMQEI 528 Query: 423 ASPNSFFEANSTHELNKIFRD 443 ++ N +F + Sbjct: 529 GLEGNYQHIEMGERANIVFEE 549 >gi|163852925|ref|YP_001640968.1| TadE family protein [Methylobacterium extorquens PA1] gi|163664530|gb|ABY31897.1| TadE family protein [Methylobacterium extorquens PA1] Length = 202 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 33/106 (31%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + ++ G I A + PV++ + +++ R + L QA AS Sbjct: 12 RAAAATHAFGRAEGGVSAIEFAFIAPVLVILFIAAIEIPRAIATNNRLAQATIAMADLAS 71 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + +V + A+ ++ N V Sbjct: 72 KNDYADINDVYAAAQVVAAPYSLAGTGIVLTAGGVYQVGNDFVARV 117 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 44.6 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 56/167 (33%), Gaps = 26/167 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ ++ T ++ G T+I +++A+ I Sbjct: 343 WVGMVTFDSAAHVQSELVQ-INGATERDALTKSLPTVASGGTSICSGLRSAFAVIR---- 397 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C N+ K G I T+A + ++ + Sbjct: 398 ------KKFSTDGSEIVLLTDGEDN-----TISSCFNEVKQSGAVIHTVALGPSAAKELE 446 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDRI--GNEIFERVIRI 456 + + A+ + L F +R I++ Sbjct: 447 ELSKM----TGGLQTYASDQAQNNGLIDAFGALSSGNGAASQRAIQL 489 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 56/201 (27%), Gaps = 30/201 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + ++A + + +G F S + Sbjct: 108 SMAATDVPPDRLTAAKEAGKQFADELTPA------INLGLVEFAANASLLVSPT-----T 156 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R VK TA + + TA I + V + + IVL +DG Sbjct: 157 NRAAVKAAIDSLKPAPKTATGEGLFTALQAI--ATVGSVMGGGDGPPPAR-IVLESDGAE 213 Query: 380 TQD-----NEEGIAICNKAKSQGIRIMTIAFSVNKT----------QQEKARYFLSNC-A 423 + AK++G++I TI+F + C Sbjct: 214 NVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVEYEGATIPVPVDDQTLQKICEI 273 Query: 424 SPNSFFEANSTHELNKIFRDR 444 + F A+S L ++ Sbjct: 274 TDGQAFHADSLESLKNVYSTL 294 >gi|291411005|ref|XP_002721795.1| PREDICTED: integrin alpha M [Oryctolagus cuniculus] Length = 1155 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 38/333 (11%), Positives = 87/333 (26%), Gaps = 28/333 (8%) Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + + + N + + + A E + + Sbjct: 8 LTAVALCHGFNLETENTMTFSRNEKGFGQSVVQFEGSRVVVGAPQEITAPNQTGGLYQCD 67 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 L ++ L + + + G + K + Y+ Sbjct: 68 YSSNSCDPIQLQVPPEAVNMSLGLSLAASTSPFQLLAC--GPTMHQFCKENTYVNGLCFL 125 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRH---------VIKKKHLVRDALASVIRSIKKIDN 290 L P + S +++ +++V+ Sbjct: 126 FGSNLLQSPQRVPETLRGCPQQESDIAFLIDGSGSIDSTDFQRMKEFVSTVMEQFT---- 181 Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 +++ + +FS + G T + + Sbjct: 182 -KSNSLFALMQYSEEFRTHFTFSDFKRNPNPRALVKPIRQLL--GRTHTATGILKVVTEL 238 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 S+ A+K +V++TDGE D E + +A+ +G+ + + Sbjct: 239 FHSSSGAR------ANARKVLVVITDGEKFGDTLEYEDVIPRAEREGVIRYVVGVG-DAF 291 Query: 411 QQEKARYFLSNCASPN---SFFEANSTHELNKI 440 E++R L+ AS F N+ LN I Sbjct: 292 NSEQSRQELNTIASKPSREHVFRVNNFEALNTI 324 >gi|254416017|ref|ZP_05029773.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196177192|gb|EDX72200.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 744 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 49/162 (30%), Gaps = 12/162 (7%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F+D + R T+ G T + + ++ + Sbjct: 287 NPQDTFTILDFSDITTQLSAKPLANTPQNRIKALTYINQLKANGGTYLLNGIRAVLN--F 344 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + + IVL+TDG ++E + K G R+ + + + Sbjct: 345 PAAPEGR---------LRSIVLITDGYIGNESEILAEVKQYLK-SGNRLYSFGVGSSPNR 394 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R + + ST E+ + F I N + + Sbjct: 395 FLLNRMAELGRGTSRIVRQDESTQEVTEKFLRHINNPVLTNI 436 >gi|91788415|ref|YP_549367.1| TadE-like protein [Polaromonas sp. JS666] gi|91697640|gb|ABE44469.1| TadE-like protein [Polaromonas sp. JS666] Length = 179 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 17/152 (11%), Positives = 39/152 (25%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 K G + A++ + + +++ R Y + +++ + A A V Sbjct: 17 FKRQGGVAAVEFAIISLLFFTILFAILEFGRMLYVYNTMQEVTRRAAREAVVRWTNQESA 76 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 V S A T E + L K+ + + + Sbjct: 77 VKSLALFGGTSLPAGAEVTSGNISISYLTKSGAAVNPPPLSPGDNISACGDADRAPISCI 136 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 + + ++ I T Sbjct: 137 YSVRVSIENVTYSPMVSLFSFLNIGLPTSVVT 168 >gi|7463254|pir||E70121 hypothetical protein BB0173 - Lyme disease spirochete Length = 340 Score = 44.2 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 21/143 (14%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + ++ + I + +G F + + + Sbjct: 118 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 170 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + +D +A+ + A + K++ K+ IV+LTDG D + Sbjct: 171 IYIMDL--GNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIK 217 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + N A+ ++I +I + Sbjct: 218 -DQVINLAQGLNVKIYSIGIGSS 239 >gi|288554674|ref|YP_003426609.1| hypothetical protein BpOF4_08295 [Bacillus pseudofirmus OF4] gi|288545834|gb|ADC49717.1| hypothetical protein BpOF4_08295 [Bacillus pseudofirmus OF4] Length = 246 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K I+LLTDG + Q + +AI AK QGI + I ++ + A Sbjct: 7 KQILLLTDGHSNQGE-DPVAIAALAKEQGITVNVIGVVDENHLNKQGIDEIEAIALAG 63 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 56/167 (33%), Gaps = 26/167 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G F+ ++ T ++ G T+I +++A+ I Sbjct: 343 WVGMVTFDSAAHVQSELVQ-INGATERDALTKSLPTVASGGTSICSGLRSAFAVIR---- 397 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLTDGE+ I+ C N+ K G I T+A + ++ + Sbjct: 398 ------KKFSTDGSEIVLLTDGEDN-----TISSCFNEVKQSGAVIHTVALGPSAAKELE 446 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIFRDRI--GNEIFERVIRI 456 + + A+ + L F +R I++ Sbjct: 447 ELSKM----TGGLQTYASDQAQNNGLIDAFGALSSGNGAASQRAIQL 489 >gi|218192066|gb|EEC74493.1| hypothetical protein OsI_09963 [Oryza sativa Indica Group] Length = 641 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 38/111 (34%), Gaps = 16/111 (14%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L++ A++ VI+++ D R+ F+ + ++ Sbjct: 265 KLSLLKRAMSFVIQTLGPND------RLSVVAFSSTAQRLFPLRRMTLTGRQQALQA-IS 317 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 G T I DA++ + I+LL+DG++T Sbjct: 318 SLVASGGTNIADALKKGAKVVKDR---------RRKNPVSSIILLSDGQDT 359 >gi|193782688|ref|NP_436104.2| hypothetical protein SMa1579 [Sinorhizobium meliloti 1021] gi|193073144|gb|AAK65516.2| conserved hypothetical protein [Sinorhizobium meliloti 1021] Length = 431 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 53/184 (28%), Gaps = 4/184 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +++ + + G +I A+ PV++G G+ V+ W + L+ AA + A+ Sbjct: 16 RRQRLARRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAA 75 Query: 66 ----VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 QS E ++R + T R + Sbjct: 76 VRHRAGDQQSALETAARRVAGGSGFSPGGLTVSTAPGSAGGSNKVTVELTETHPRMFSSV 135 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + ++ L L + + + E S + + Sbjct: 136 FGTGTITMKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCSVVSNSNAADAF 195 Query: 182 VIDF 185 ++ Sbjct: 196 LMKN 199 >gi|157838288|pdb|1BHQ|1 Chain 1, Mac-1 I Domain Cadmium Complex gi|157838289|pdb|1BHQ|2 Chain 2, Mac-1 I Domain Cadmium Complex Length = 189 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 75 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 128 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 129 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 176 >gi|157831431|pdb|1IDO|A Chain A, I-Domain From Integrin Cr3, Mg2+ Bound Length = 189 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 81 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 134 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 135 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 182 >gi|119572528|gb|EAW52143.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_a [Homo sapiens] Length = 1153 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|119572529|gb|EAW52144.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_b [Homo sapiens] Length = 1152 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|64654539|gb|AAH96347.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|64654595|gb|AAH96346.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|253722212|pdb|1IDN|1 Chain 1, Mac-1 I Domain Metal Free gi|253722213|pdb|1IDN|2 Chain 2, Mac-1 I Domain Metal Free gi|313507148|pdb|1BHO|1 Chain 1, Mac-1 I Domain Magnesium Complex gi|313507149|pdb|1BHO|2 Chain 2, Mac-1 I Domain Magnesium Complex Length = 190 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 76 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 129 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 130 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 177 >gi|224831239|ref|NP_001139280.1| integrin alpha-M isoform 1 precursor [Homo sapiens] gi|307148|gb|AAA59544.1| glycoprotein Mac-1 [Homo sapiens] Length = 1153 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|386975|gb|AAA59903.1| neutrophil adherence receptor alpha-M subunit [Homo sapiens] Length = 1145 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 215 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 268 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 269 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 316 >gi|307114|gb|AAA59491.1| leukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1152 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|88501734|ref|NP_000623.2| integrin alpha-M isoform 2 precursor [Homo sapiens] gi|1708572|sp|P11215|ITAM_HUMAN RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: Full=Neutrophil adherence receptor; AltName: CD_antigen=CD11b; Flags: Precursor gi|263049|gb|AAB24821.1| leukocyte integrin alpha chain [Homo sapiens] gi|64653358|gb|AAH96348.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|68563402|gb|AAH99660.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|168275740|dbj|BAG10590.1| integrin alpha-M precursor [synthetic construct] Length = 1152 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 223 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 277 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 324 >gi|31615654|pdb|1NA5|A Chain A, Integrin Alpha M I Domain Length = 197 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 80 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 133 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 134 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 181 >gi|31615649|pdb|1N9Z|A Chain A, Integrin Alpha M I Domain Mutant Length = 192 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 80 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 133 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 134 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 181 >gi|31615583|pdb|1MF7|A Chain A, Integrin Alpha M I Domain Length = 194 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 80 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 133 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 134 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 181 >gi|22219356|pdb|1M1U|A Chain A, An Isoleucine-Based Allosteric Switch Controls Affinity And Shape Shifting In Integrin Cd11b A-Domain Length = 195 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 87 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 140 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 141 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 188 >gi|157831557|pdb|1JLM|A Chain A, I-Domain From Integrin Cr3, Mn2+ Bound Length = 192 Score = 44.2 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + + K +V++TDGE D + +A Sbjct: 81 GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEKFGDPLGYEDVIPEAD 134 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 +G+ I + + EK+R L+ AS + F+ N+ L I Sbjct: 135 REGVIRYVIGVG-DAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTI 182 >gi|323450885|gb|EGB06764.1| hypothetical protein AURANDRAFT_71955 [Aureococcus anophagefferens] Length = 1008 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 57/206 (27%), Gaps = 30/206 (14%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 SE + RH + + A V S+ DT R+G ++ + + Sbjct: 373 SEVQDAAGNVQRHGFSTLDVCKHAARCVACSLD------DTCRLGLVAYDAQARVVVGLA 426 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + V ST + ++ D ++ D + ++L Sbjct: 427 RVTPAHV-AKVHAALEKLAPGTSTNLWGGLELGVDELVGGAGDNA----------RAVLL 475 Query: 374 LTDGENTQDNEEGIAICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 LTDG EG +AK S+ + + F L + A Sbjct: 476 LTDGVPNNSPPEGEVAALRAKRLTKDGSETVAVFAAGFGY-----ALRSDLLLSLAREGG 530 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFE 451 F + F + + Sbjct: 531 GLFSFVPDAGMVGTSFNHLVASLRSS 556 >gi|323498500|ref|ZP_08103493.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] gi|323316389|gb|EGA69407.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] Length = 697 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/215 (10%), Positives = 51/215 (23%), Gaps = 24/215 (11%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + S + + + +D S K + + + + Sbjct: 283 AYRDPASSKRGTVKLTFTPGDDLTRVTQGRDWVFVLDKSGSMS--GKYSTLVEGVRQGLG 340 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + D R F+ V+ T T + + M Sbjct: 341 KLPSED------RFRVVMFDSNTYDLTGGFVAVNAANVTKALQSVEQVEPSNGTNLYEGM 394 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + IVL+TDG E + + +R+ T Sbjct: 395 AAATRKLDDDRPTG-------------IVLVTDGVANVGVTEKRRFFDLMEKHDVRLFT- 440 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 F + + ++ S ++ ++ Sbjct: 441 -FIMGNSANTPLLVPMTKL-SNGIATSVSNADDII 473 >gi|295690805|ref|YP_003594498.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295432708|gb|ADG11880.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 185 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 8/67 (11%), Positives = 21/67 (31%), Gaps = 2/67 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGV-GGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G + AL+ PV++ + GM + + + L + + Sbjct: 10 RFWRDRRGASAVEFALIAPVLIVMYCGMA-EFTQAMMAQRRLTNITSSIGDLTAQASQTG 68 Query: 72 LEEVSSR 78 + Sbjct: 69 PARTTDI 75 >gi|220922038|ref|YP_002497339.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219946644|gb|ACL57036.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 138 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 19/141 (13%), Positives = 45/141 (31%), Gaps = 8/141 (5%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + + + G + ALL PV++ +G +++ Y AA++A + L Sbjct: 2 RHRRSFPHNQDGTNSVEFALLAPVLIVLGFGIIEFGIMIYT----LNAAESAARDVTRRL 57 Query: 69 IQSLEEVSSRAKNSFT----FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + + + + + N EV + Sbjct: 58 ATNRISAAQASSAVIQQLPSWVAAGTTVNVTQTAPTDPSSNRFTTEVAFSAKVATPTTLL 117 Query: 125 RKSAYQVVLSSRYDLLLNPLS 145 + VVL ++ + P + Sbjct: 118 SWAYGGVVLHAKVSMQQEPGT 138 >gi|3182932|sp|Q28902|COCA1_RABIT RecName: Full=Collagen alpha-1(XII) chain gi|13195730|gb|AAB34889.2| type XII collagen [Oryctolagus cuniculus] Length = 639 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 43/129 (33%), Gaps = 18/129 (13%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +++++ A + G+T A+ + R +K VL+TDG+ Sbjct: 1 CRKSLLQAVANLPYKGGNTLTGMALNFIRQQNFKTQAGMRPRA------RKIGVLITDGK 54 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTH 435 + D E K K +G+ + I + L A+ + Sbjct: 55 SQDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFD 105 Query: 436 ELNKIFRDR 444 L+KI D Sbjct: 106 SLSKIVDDL 114 >gi|320155491|ref|YP_004187870.1| von Willebrand factor type A domain-containing protein, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319930803|gb|ADV85667.1| von Willebrand factor type A domain protein, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 406 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 25/292 (8%), Positives = 71/292 (24%), Gaps = 7/292 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ + M + + + +DV L+ A A+ + + Sbjct: 11 RKQQGLVLVLVTVAMLSFVIMAALAIDVTHQVVNRTKLQNAVD-----AAALAAAMVADA 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + K + + + L + + + S + + Sbjct: 66 THDTPTATAAAKTTLNSMHSASGNSELDIDSATFSIDYSNDPLTFPDSSFNSDEDIYVRI 125 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 D L+ F++++G+ + + + S + D S + Y Sbjct: 126 SID-DLSLSEFFMQALGLSKTVSASAVAGPSSSINTISNVVPIGVCKGDESGGIYGYNPA 184 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 +Y + G + ++ G + Sbjct: 185 EVYVLKVGDSTMTTMGSGNYHLLDFGSGADTVR-EALGGGYEGTVQIGGDIGTQTGVAAG 243 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +++ + + D I DT + + Sbjct: 244 PVGQGANTRLGIYQGGVSSSDYPPDFITEQPDTPATVDTDGNVVYDYATDLT 295 >gi|307609426|emb|CBW98915.1| hypothetical protein LPW_07021 [Legionella pneumophila 130b] Length = 1169 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 43/402 (10%), Positives = 93/402 (23%), Gaps = 25/402 (6%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A I A + + + ++ Y + ++ T V + Sbjct: 141 AGIQAIIENYMPTTDFALGTYSTSNISSYNTWVYYMSPPGSDFVFTNTPVAGNRYVTNPC 200 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 S SS +L L + + I + S VS Sbjct: 201 YNYGSASSTVSSNCSSIGNLYGTTLVSSSQYLQIGDSSDDPDINDVLYAGSGFPGIFVSY 260 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 S +Y + Q A+ + + S ++ + Sbjct: 261 NGPTPSSPFPPNYTISNYNQGNIRISY-ANTRPSIGNFSSSPTNAGFVPFSQQVMYVQRG 319 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y + + + V +A+ + + +K ++ Sbjct: 320 FGYYSNQSYATG-----NMLVNMQTAGTNPTTTSVNNAINAFLPHLK---PETNSTATTE 371 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++T+ T + I+D + T Sbjct: 372 IKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIILISDGLPT-----QDLQSRYWP 426 Query: 360 RMKNNLEAKKYIV--LLTDGENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQ-- 411 + + + DG N + ++ K+ G+ I I Sbjct: 427 PLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEIKALKNDGVLIFIIGMGAGVDPAV 486 Query: 412 QEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEIF 450 +A L A +++ A S L + N Sbjct: 487 NPEAAATLRAMAVAGGTENYYPATSPETLVSSLNSILSNIQN 528 >gi|229492908|ref|ZP_04386704.1| conserved hypothetical protein [Rhodococcus erythropolis SK121] gi|229320182|gb|EEN86007.1| conserved hypothetical protein [Rhodococcus erythropolis SK121] Length = 151 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 14/137 (10%), Positives = 39/137 (28%), Gaps = 6/137 (4%) Query: 8 IFYSKKLIKSC-TGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +L + G ++T +L P+++ G++VD A+ A + Sbjct: 7 KRLTSRLPEERDKGSVTLMTVILAPILIFFVWGLIVDGGGMHTAAQRADNVAEDAARASG 66 Query: 66 VP----LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 L + + K+ + + + R I + + Sbjct: 67 QQIVGTLSARGIDTVVDPVRAVAAAKRYLADAGVEGEVIPTGPRTLLITTRIIYDNKLLP 126 Query: 122 MNPRKSAYQVVLSSRYD 138 + + ++ Sbjct: 127 LGSKVITGSATVNLNRT 143 >gi|27365108|ref|NP_760636.1| hypothetical protein VV1_1751 [Vibrio vulnificus CMCP6] gi|37680842|ref|NP_935451.1| hypothetical protein VV2658 [Vibrio vulnificus YJ016] gi|27361254|gb|AAO10163.1| hypothetical protein VV1_1751 [Vibrio vulnificus CMCP6] gi|37199591|dbj|BAC95422.1| hypothetical protein [Vibrio vulnificus YJ016] Length = 406 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 25/292 (8%), Positives = 71/292 (24%), Gaps = 7/292 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G ++ + M + + + +DV L+ A A+ + + Sbjct: 11 RKQQGLVLVLVTVAMLSFVIMAALAIDVTHQVVNRTKLQNAVD-----AAALAAAMVADA 65 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + K + + + L + + + S + + Sbjct: 66 THDTPTATAAAKTTLNSMHSASGNSELDIDSATFSIDYSNDPLTFPDSSFNSDEDIYVRI 125 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 D L+ F++++G+ + + + S + D S + Y Sbjct: 126 SID-DLSLSEFFMQALGLSKTVSASAVAGPSSSINTISNVVPIGVCKGDESGGIYGYNPA 184 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 +Y + G + ++ G + Sbjct: 185 EVYVLKVGDSTMTTMGSGNYHLLDFGSGADTVR-EALGGGYEGTVQIGGDIGTQTGVAAG 243 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 +++ + + D I DT + + Sbjct: 244 PVGQGANTRLGIYQGGVSSSDYPPDFITEQPDTPATVDTDGNVVYDYATDLT 295 >gi|291396486|ref|XP_002714579.1| PREDICTED: collagen, type XII, alpha 1 [Oryctolagus cuniculus] Length = 3117 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ + R +K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQNFKTQAGMRPRA------RKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFDS 1363 Query: 437 LNKIFRDR 444 L+KI D Sbjct: 1364 LSKIVDDL 1371 >gi|148725577|emb|CAN88158.1| novel protein containing collagen triple helix repeat domain [Danio rerio] Length = 312 Score = 44.2 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 46/134 (34%), Gaps = 16/134 (11%) Query: 316 VHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++ +G T A+ A + + +LL Sbjct: 107 RDWQDIDVFQSRVDSMTYIGHGTYSAYAISNATQLFTQE---------TAPTSFRAALLL 157 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCAS--PNSF-FE 430 TDG++ + + + AK I + I ++ +Q+ R L + AS P + F Sbjct: 158 TDGDDHPRSPSAVTAASDAKIHDIHMFVIGLPGSSRDEQDYGR--LRSIASAPPQQYLFS 215 Query: 431 ANSTHELNKIFRDR 444 +K+FR+ Sbjct: 216 LTDPQLDDKLFREL 229 >gi|327405047|ref|YP_004345885.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327320555|gb|AEA45047.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 541 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 67/208 (32%), Gaps = 22/208 (10%) Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + + +D+SS + K +++ + +++ + ID ++ Sbjct: 355 HPDALLISKAQPHHLVYLLDASSSMNESGKLTHLKENASYLVQIQRAIDQIS------IV 408 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F+ + + + + + G T I +T + SS + Sbjct: 409 TFSGKSQV---LMQAIPCDQKKHILEKIDHIHAFGQTNIQSGFETVKTLLSSSKLQKGVN 465 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 ++LLTDGE Q EE +AI N+ K+ I I + ++ L Sbjct: 466 S---------VLLLTDGEF-QLKEETLAIINQLKANEIGICFVYLGEPLKKKTT--KALE 513 Query: 421 NCASP-NSFFEANSTHELNKIFRDRIGN 447 S F + +L + Sbjct: 514 KKYSDLGVIFYDTNRIDLKEALLKIATE 541 >gi|169627579|ref|YP_001701228.1| hypothetical protein MAB_0475 [Mycobacterium abscessus ATCC 19977] gi|169239546|emb|CAM60574.1| Hypothetical protein MAB_0475 [Mycobacterium abscessus] Length = 129 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 10/76 (13%), Positives = 24/76 (31%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ G II ++ V+L + V H + AA + + + + + Sbjct: 14 RADGGAATIIAVAMIAVLLALTAGGAAVGSAVVARHRAQAAADLSALAGAQHALYGVTPA 73 Query: 76 SSRAKNSFTFPKQKIE 91 + A + Sbjct: 74 CAEAGAVARRMGAVVA 89 >gi|269969412|sp|C7G0B5|PIF_PINFU RecName: Full=PIF; Contains: RecName: Full=Pif97; Contains: RecName: Full=Pif80; AltName: Full=Aragonite-binding protein; Flags: Precursor gi|256252203|dbj|BAH97338.1| Pif177 [Pinctada fucata] Length = 1007 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 54/203 (26%), Gaps = 29/203 (14%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK------------IDNVNDTVRMGATF 301 + V S + DA V + ++ +R+G Sbjct: 13 TAVFCVGVKSDECKTADVVVNVDASDDVSDQDFDKLKRAMLMMVRGLSIDDNQIRLGMVT 72 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + V L RTI G + M A S V + Sbjct: 73 YGSEVCDSIPLQGDRLDLARTIRYMK----KPTGPSKPFKGMGEARRMFSSRGRYNVPHI 128 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 N L D +T+ + +KA+ + I++M I + E Sbjct: 129 TMN--------LGGDIVDTEVKDLMDET-DKARDEDIKVMAIGLGAKVDRDEIESIAYDR 179 Query: 422 CASPNSFFEANSTHELNKIFRDR 444 + + + +L + ++ Sbjct: 180 ----DQAYFMDDEDDLIRKVKEI 198 >gi|12957162|dbj|BAB32650.1| complement factor B/C2-A3 [Cyprinus carpio] Length = 754 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 18/209 (8%), Positives = 54/209 (25%), Gaps = 28/209 (13%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + ++I I + + F V S + + Sbjct: 263 DEEDFEKAKGVIKTLIEKISYYEVSPNYEI---LIFATDVARIVSMR-DFKSAQKNNLLE 318 Query: 327 FAIDENEM--------GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + T I A ++ +++ K +++ TDG+ Sbjct: 319 ILKRLKDYEYNSKGDRTGTNIAQAYRSILESMQIEQMTNKEEFKTTQH---IVIMFTDGQ 375 Query: 379 NTQDNEEGIAICNKAKS-----------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++ K + + + + ++ ++ + Sbjct: 376 ANMG-GNPRPWVDQIKDLVKKNSPSEEEENLDLYVFGMGDDVNAEDIN-DLKTDRGNEKF 433 Query: 428 FFEANSTHELNKIFRDRIGNEIFERVIRI 456 FF+ + +L + F I + + Sbjct: 434 FFKLKNLEDLQETFDSMIDEGTSVELCGL 462 >gi|94309589|ref|YP_582799.1| TadE-like protein [Cupriavidus metallidurans CH34] gi|93353441|gb|ABF07530.1| flp pilus assembly protein (TadG-like) [Cupriavidus metallidurans CH34] Length = 176 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 12/135 (8%), Positives = 31/135 (22%), Gaps = 5/135 (3%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + ++ G + +++ ML + + + R Y A +A+ Sbjct: 3 RLTNAHRQSRTRQRGVAAVEFGIMLVPMLLMACGVAEFGRAIYQ----YDTLTKATRSAA 58 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L Q + + + + D V + + Sbjct: 59 RYLSQYSPDDVAYPTAATKCLAAY-GNTGCSGQPLAPGLTTAMVIICDRVDSSGCPGATQ 117 Query: 126 KSAYQVVLSSRYDLL 140 + S Sbjct: 118 TFSNVATYDSTGGGS 132 >gi|94971019|ref|YP_593067.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94553069|gb|ABF42993.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 391 Score = 44.2 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 41/144 (28%), Gaps = 4/144 (2%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + V + + A + + + F+ V + KL R Sbjct: 108 DTSYSVRDELPAEKTASEKFLDDMLAQPK----DQAFLIHFDREVELMTDLTSSKDKLHR 163 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + S+ + + + + + +K IV+LTDGE+ Sbjct: 164 GIGELETSGPPSQSSSDDGQRHRRGGTQLYDAIYLAASEILQKQQGRKAIVVLTDGEDRG 223 Query: 382 DNEEGIAICNKAKSQGIRIMTIAF 405 E A+ + I F Sbjct: 224 SKETLTDAVEAAQRADAIVYAIYF 247 >gi|264680611|ref|YP_003280521.1| membrane protein-like protein [Comamonas testosteroni CNB-2] gi|262211127|gb|ACY35225.1| membrane protein-like protein [Comamonas testosteroni CNB-2] Length = 521 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + S+ I+ +G + AL LM ++ + +D+ + + L++ A A Sbjct: 1 MQSFAKDLSRVRIRRQSGSVATLGALWLMIAVICLA--TIDIGNVFWQKRELQKMADLAA 58 Query: 62 ITASVPLIQSLEEVSSRAKN 81 + + +S S A + Sbjct: 59 LAGASGTPRSGACQSVAADS 78 >gi|198417199|ref|XP_002122571.1| PREDICTED: similar to MGC81791 protein, partial [Ciona intestinalis] Length = 847 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 53/171 (30%), Gaps = 16/171 (9%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 V + D D R+G + + + + ++++ G+T Sbjct: 58 VSDLVATFDIGPDYTRVGVVVYAEEPEMAIALNQYTD--RDSLIQAVGNITYLNGNTRTG 115 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ + S + ++LTDG + +A++ GI++ Sbjct: 116 KAIRFMNEESFSIANGARDIEFG---YNRLAIVLTDGRAQDNVFNPSL---EAQNNGIQL 169 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI--FRDRIGNEI 449 + S ++ L+ AS + I R+ + I Sbjct: 170 YAVGVSTAVVEE------LNEIASDPDSRHVMQVDDFQAIERIRELLRQII 214 >gi|146338130|ref|YP_001203178.1| hypothetical protein BRADO1027 [Bradyrhizobium sp. ORS278] gi|146190936|emb|CAL74941.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 192 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 14/170 (8%), Positives = 43/170 (25%), Gaps = 8/170 (4%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++ +G A ++P+ML + V+ A + + L Sbjct: 14 RLLRRFRADHSGVAATEFAFIVPLMLVMFFGTVEFCSAI--------AVDRKVTLMARTL 65 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + +S + + + + + ++ V + V + Sbjct: 66 SDLTSQSTSVGDSDMSNFFAASTGIMYPYSTSPVNATISEIVVDSTGKQATVVWSKGSVP 125 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + L + +L + + T + + + Sbjct: 126 RTTGTTVGIPADLLVANTYLIFSEVSYQYVPTIGYVMAKTGINLSDVAYT 175 >gi|223462563|gb|AAI50654.1| Von Willebrand factor A domain containing 3B [Homo sapiens] Length = 1294 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 495 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 554 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 555 AWREQLAEVNEDNLEQAQS-----WIRDIKIGSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETQAIYLLTDGRPDQP---PETVIDQVKRFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 VAALTGGEFHFY 659 >gi|221040994|dbj|BAH12174.1| unnamed protein product [Homo sapiens] Length = 951 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 152 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 211 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 212 AWREQLAEVNEDNLEQAQS-----WIRDIKIGSSTNTLSALKTAF--------------- 251 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 252 -ADKETQAIYLLTDGRPDQP---PETVIDQVKRFQEIPIYTISFNYNDEIANR---FLKE 304 Query: 422 CA---SPNSFFE 430 A F Sbjct: 305 VAALTGGEFHFY 316 >gi|149200158|ref|ZP_01877182.1| batB protein [Lentisphaera araneosa HTCC2155] gi|149136799|gb|EDM25228.1| batB protein [Lentisphaera araneosa HTCC2155] Length = 621 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 17/114 (14%), Positives = 33/114 (28%), Gaps = 14/114 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R G F+ + + ++ + G T I A+ A + Sbjct: 132 DRFGLITFSRIANIECPLTSEPDMVLLY-LSDLNSSLLPGGGTNIAAALDHAQKQFKENE 190 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 D + +VLL+DGE + + + I + I+ Sbjct: 191 RDS-----------RVVVLLSDGETDGNKWR--ESLEALQKKKIPVNVISLGDP 231 >gi|119622317|gb|EAX01912.1| hypothetical protein MGC26733, isoform CRA_a [Homo sapiens] Length = 1080 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 495 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 554 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 555 AWREQLAEVNEDNLEQAQS-----WIRDIKIGSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETQAIYLLTDGRPDQP---PETVIDQVKRFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 VAALTGGEFHFY 659 >gi|118918435|ref|NP_659429.4| von Willebrand factor A domain-containing protein 3B [Homo sapiens] gi|296439299|sp|Q502W6|VWA3B_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 3B Length = 1294 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 495 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 554 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 555 AWREQLAEVNEDNLEQAQS-----WIRDIKIGSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETQAIYLLTDGRPDQP---PETVIDQVKRFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 VAALTGGEFHFY 659 >gi|30268323|emb|CAD89964.1| hypothetical protein [Homo sapiens] Length = 1060 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 495 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 554 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 555 AWREQLAEVNEDNLEQAQS-----WIRDIKIGSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETQAIYLLTDGRPDQP---PETVIDQVKRFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 VAALTGGEFHFY 659 >gi|21740064|emb|CAD39048.1| hypothetical protein [Homo sapiens] Length = 803 Score = 44.2 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 4 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 63 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 64 AWREQLAEVNEDNLEQAQS-----WIRDIKIGSSTNTLSALKTAF--------------- 103 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 104 -ADKETQAIYLLTDGRPDQP---PETVIDQVKRFQEIPIYTISFNYNDEIANR---FLKE 156 Query: 422 CA---SPNSFFE 430 A F Sbjct: 157 VAALTGGEFHFY 168 >gi|332262981|ref|XP_003280535.1| PREDICTED: integrin alpha-M-like [Nomascus leucogenys] Length = 997 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 55/184 (29%), Gaps = 20/184 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + +K + ++ K + ++ T F + D Sbjct: 158 SGSINPNDFQKMKEFVSTVMEQLKKSKTLFSLMQYSEEFWTHFTFKEFQDNP-------- 209 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + + G T ++ + + + K +V++TDGE Sbjct: 210 NPRSLVKPIMQLF--GRTHTATGIRKVVRELFNITQGARKNAF------KILVVITDGEK 261 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHE 436 D + +A +G+ I + +R L+ AS + F+ N+ Sbjct: 262 FGDPLGYEDVIPEADREGVIRYVIGVG-DAFHSMNSRQELNTIASKPPRDHVFQVNNFEA 320 Query: 437 LNKI 440 L I Sbjct: 321 LKTI 324 >gi|329928399|ref|ZP_08282269.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937835|gb|EGG34241.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 562 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 44/140 (31%), Gaps = 16/140 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + +G ++ V + +++ G+TA D + Sbjct: 413 LTGQKYLGRDNSIGFVSYSTDVTINLPIG-KYDLNQQSMFVGAINSLEASGNTATFDGIV 471 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + N + K I +L+DGE + I ++ I + TI Sbjct: 472 VAMKMLQDEMAA-------NPDVKPLIFVLSDGETNVGHS-LDDIRGLIQAFKIPVYTIG 523 Query: 405 FSVNKTQQEKARYFLSNCAS 424 ++ + L + +S Sbjct: 524 YNADI-------QALQSISS 536 >gi|220897446|emb|CAX15330.1| complement component 2 (within H-2S) [Mus musculus] Length = 483 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 48/185 (25%), Gaps = 18/185 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAIDENEMGSTAIND 341 + + ++ + ++ + + L + + T + Sbjct: 12 QSNGVWSGSEPICRQPYSYDFPEDVASALDTSLTNLLGATNPTQNLLTNHENATGTNTYE 71 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR- 399 + Y + S + E + I+LLTDG++ + + + I Sbjct: 72 VLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDS-PKKAVTRIRELLSIEQ 130 Query: 400 -------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 I I ++ S F L +IF Sbjct: 131 NRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEHM------LD 184 Query: 453 VIRIT 457 V ++T Sbjct: 185 VSKLT 189 >gi|261404225|ref|YP_003240466.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261280688|gb|ACX62659.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 562 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 44/140 (31%), Gaps = 16/140 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + +G ++ V + +++ G+TA D + Sbjct: 413 LTGQKYLGRDNSIGFVSYSTDVTINLPIG-KYDLNQQSMFVGAINSLEASGNTATFDGIV 471 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + N + K I +L+DGE + I ++ I + TI Sbjct: 472 VAMKMLQDEMAA-------NPDVKPLIFVLSDGETNVGHS-LDDIRGLIQAFKIPVYTIG 523 Query: 405 FSVNKTQQEKARYFLSNCAS 424 ++ + L + +S Sbjct: 524 YNADI-------QALQSISS 536 >gi|190889875|ref|YP_001976417.1| hypothetical protein RHECIAT_CH0000244 [Rhizobium etli CIAT 652] gi|190695154|gb|ACE89239.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 193 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 44/148 (29%), Gaps = 11/148 (7%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ +++LI+ G I A+L PV++ + ++ A Sbjct: 8 TRLALTARRLIRERKGAGAIEFAILFPVLIMLYIGAFEITIGLSVSKR--------ATRA 59 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV---E 121 + + + + S K++ T + + +L T V + Sbjct: 60 AGSIADLVTQQQSITKSTLTEMRSVATSIFVPYNSTSLTLKITGVTVDASANAKVLWSWA 119 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + + S + + FL Sbjct: 120 QDGSAPYAKNTAVSDIPADMKTANSFLV 147 >gi|167842405|ref|ZP_02469089.1| hypothetical protein Bpse38_37423 [Burkholderia thailandensis MSMB43] Length = 142 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 32/120 (26%), Gaps = 9/120 (7%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 +L++ G + L+ P M+ V +VD + + A++ A Sbjct: 5 SRLVRDERGVVSLEFVLVFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGV------ 58 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 V + + T Y + + V + +Y Sbjct: 59 ---VVRVPQLAATDIANVALSYAQGSLVSGGTVGAPVVNVDQSAGTSPGSPLKVTVSYTY 115 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 44.2 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 49/171 (28%), Gaps = 17/171 (9%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 ID +++ R FN+ S + + F N G T + A+ A Sbjct: 341 AIDRLSEKDRFNVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAA 400 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + K +V +TDG + I N+ R+ T+ Sbjct: 401 LKRTTT------------KDFIKQVVFITDGAVGNEAALFSQIKNELGDA--RLFTVGIG 446 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 R S+ +T ++ + + + ++ Sbjct: 447 SAPNSYFMTRAAQ---FGLGSYVFVRNTADIKQQMDSLLYKLESPVLSDLS 494 >gi|326666584|ref|XP_687953.4| PREDICTED: collagen alpha-1(VII) chain [Danio rerio] Length = 2001 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 20/159 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D+ + + + HK ++++ G+T + + + Sbjct: 1002 VVHYSDQPRIEFN--FNTHKDRNSVLRALREVRYGGGNTKTGRGISYVLREMFQESLGMR 1059 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +VLLTDG D E I + G+ ++ I Sbjct: 1060 QEAPH------VLVLLTDGRAQDDVEPPSRIAHAL---GVSVLVIGI------AHADMEE 1104 Query: 419 LSNCASPNSF---FEANSTHELNKIFRDRIGNEIFERVI 454 + ASP ++ F A+ +L I R+ IG+ E + Sbjct: 1105 VRTIASPTTYKNIFYASDFDDLPSIEREFIGSICSEALQ 1143 >gi|94732541|emb|CAK05117.1| novel protein similar to vertebrate collagen family [Danio rerio] Length = 1721 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 20/159 (12%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D+ + + + HK ++++ G+T + + + Sbjct: 891 VVHYSDQPRIEFN--FNTHKDRNSVLRALREVRYGGGNTKTGRGISYVLREMFQESLGMR 948 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + +VLLTDG D E I + G+ ++ I Sbjct: 949 QEAPH------VLVLLTDGRAQDDVEPPSRIAHAL---GVSVLVIGI------AHADMEE 993 Query: 419 LSNCASPNSF---FEANSTHELNKIFRDRIGNEIFERVI 454 + ASP ++ F A+ +L I R+ IG+ E + Sbjct: 994 VRTIASPTTYKNIFYASDFDDLPSIEREFIGSICSEALQ 1032 >gi|296158802|ref|ZP_06841631.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. Ch1-1] gi|295891007|gb|EFG70796.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. Ch1-1] Length = 597 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 50/171 (29%), Gaps = 10/171 (5%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ---- 70 + G ++ A+ + V + G +DV + L++ A A + A + Sbjct: 25 ARRQHGAIAMLAAIWLSVAIAALG-ALDVGNVYFARRQLQRTADLAAMAAVQMIGSTGGC 83 Query: 71 ----SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++ A I R ++ T + V+ ++ P Sbjct: 84 ATATTAAQQNAAANGFTAGSTTTISTTCGRWDTSSSTYFGTSGNPLNAVQVKTTQVVPYF 143 Query: 127 SAYQVV-LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +S+ + + F GI S Q A + Sbjct: 144 FVGPSRNVSATATAFASNIDTFSLGTGIASINTQQSALLNAILGGLLNTSV 194 >gi|156602970|ref|XP_001618750.1| hypothetical protein NEMVEDRAFT_v1g153509 [Nematostella vectensis] gi|156200182|gb|EDO26650.1| predicted protein [Nematostella vectensis] Length = 133 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 42/124 (33%), Gaps = 16/124 (12%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 F++G + + +++ GST A+Q D I +R + + Sbjct: 5 FTFGKYNNKKDVLEAVDKMPYPKGSTYTGRALQYMNDEI--------YRKATRVGVPNIL 56 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 ++LTDG+ E + GI I +I + + E L A+ Sbjct: 57 IVLTDGKAHDSVAEP---AKALRDIGIEIYSIGVGESYDKAE-----LDAIATDPDASHV 108 Query: 432 NSTH 435 S Sbjct: 109 FSVD 112 >gi|296284150|ref|ZP_06862148.1| hypothetical protein CbatJ_11026 [Citromicrobium bathyomarinum JL354] Length = 198 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 48/183 (26%), Gaps = 11/183 (6%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +L + G + A++ P + + + D+ + Y + A Q A Sbjct: 2 LARALNRLRRDTRGSTIVEFAIIAPSFMILLMGVFDLGQAVYLRAVMNGAMQEA--ARDS 59 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 L ++ T + + + + I D A Sbjct: 60 TLESGPTAEAAIDGMVETRVQHVLRSAELSFDRKSYYDFTDIERAEAINDDNANGECDAG 119 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 ++ N + +G + TV+ +Y K + + Sbjct: 120 ETFEDE---------NGNGSWDSDVGSGGFGGARDITMYTVTATYDKLFPLYGLLGLPQE 170 Query: 187 RSM 189 S+ Sbjct: 171 ASI 173 >gi|163816539|ref|ZP_02207903.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] gi|158448239|gb|EDP25234.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] Length = 465 Score = 44.2 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 67/193 (34%), Gaps = 25/193 (12%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D SS K A ++ I V F+ + K Sbjct: 151 IDDSSSMKTSDKNDRRLTAANELLEHIDGNRKVG------LIRFSKDIHCYIPM--DYLK 202 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + ++ + ++ + G T INDA+ + + ++LLTDG+ Sbjct: 203 VNKSTLNHELENKAKEGGTDINDALYAVLNAFDKVG---------TATGSRSVILLTDGK 253 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 +T N + + N+A S I+I I+ + F+ S A ++ + Sbjct: 254 STT-NVDEEYLINRANSMNIQINVISLG-----NHTDKAFIKRITSSTGGKAAKTSSDFY 307 Query: 437 LNKIFRDRIGNEI 449 L+ + +G+ I Sbjct: 308 LDAAYGVFLGSHI 320 >gi|326916561|ref|XP_003204575.1| PREDICTED: matrilin-3-like [Meleagris gallopavo] Length = 363 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 29/88 (32%), Gaps = 15/88 (17%) Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K ++++TDG + ++ + A++ GI I + L A Sbjct: 66 RTYFDKVVIVVTDG---RPQDQVENVAANARTAGIEIYAVGVGR------ADMQSLRIMA 116 Query: 424 S---PNSFFEANS---THELNKIFRDRI 445 S F + +L FR+ Sbjct: 117 SEPLDEHVFYVETYGVIEKLTAKFRETF 144 >gi|89054212|ref|YP_509663.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88863761|gb|ABD54638.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 1356 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 56/173 (32%), Gaps = 23/173 (13%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 + + + + + + G + N G T Sbjct: 56 AEMLADMADDVSLGLTVYGHRQRGSCTDIETIVAPAPGTQGR----ILDAVNAINPRGRT 111 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + DA+ A ++ S+ E ++L++DG N + AI + ++ G Sbjct: 112 PMTDAVIAAAQSLRSTEEAAT------------VILVSDGIENC-NPDPCAIAAELEATG 158 Query: 398 I--RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 + I F V + +A+ + A + F A++ EL++ + Sbjct: 159 VDFTAHVIGFDVASEPEARAQ--MQCIADNTGGQFLTADNATELSQALEQVVA 209 >gi|327285304|ref|XP_003227374.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Anolis carolinensis] Length = 3587 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 24/72 (33%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A ++ + A K I L+TDG + + I Sbjct: 168 GGGTYTKGAFQQAAQILL----------HSRANATKVIFLITDGYSNGGDPRP--IAASL 215 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 216 REFGVEIFTFGI 227 >gi|325963511|ref|YP_004241417.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469598|gb|ADX73283.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 622 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 56/184 (30%), Gaps = 19/184 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K +DA+ + D + + K + + Sbjct: 452 TKLQRAKDAVLKALDHFTAEDEIGLAAFSQVGDGPLTPGVVSPVA--PFKTNKEDLIAKL 509 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGI 387 + + +T + +A+ S + IVLL+DG+N T + Sbjct: 510 NELKAVDATPLFEAV---------SRFAGDQAKEYKDNFINAIVLLSDGKNDTTHPGDLG 560 Query: 388 AICNKAKSQG----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + + Q + + T+A+ + AS +++A + L ++ + Sbjct: 561 GLSEQLGHQNHSTPVLVFTLAYGPDADVPTLREI---ARASGAHYYDATDPNRLEEVLGE 617 Query: 444 RIGN 447 + + Sbjct: 618 LVTS 621 >gi|240140255|ref|YP_002964733.1| hypothetical protein MexAM1_META1p3746 [Methylobacterium extorquens AM1] gi|240010230|gb|ACS41456.1| hypothetical protein MexAM1_META1p3746 [Methylobacterium extorquens AM1] Length = 204 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 17/166 (10%), Positives = 46/166 (27%), Gaps = 8/166 (4%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT-AIITA 64 + ++ G I A + P+++ + +++ R + L QA A + + Sbjct: 14 RAAATLHAFGRAEGGVSAIEFAFIAPILVILFIAAIEIPRAIATNNRLAQATIAMADLAS 73 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + ++ + + L + +F R + + Sbjct: 74 KNDYADINDVFAAAQVVAAPYSLAGAGIVLTAGGVYQVGNDFVARVCSSVQSGDKARIVG 133 Query: 125 RKSAYQVVLSSR-------YDLLLNPLSLFLRSMGIKSWLIQTKAE 163 ++ + L+ LF + + KA Sbjct: 134 SDIGPPPAGTASKGDRFVMAETRLSYRPLFSFFPVLNTLTFTGKAA 179 >gi|78066129|ref|YP_368898.1| Flp pilus assembly protein TadG [Burkholderia sp. 383] gi|77966874|gb|ABB08254.1| Flp pilus assembly protein TadG [Burkholderia sp. 383] Length = 164 Score = 44.2 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 16/146 (10%), Positives = 40/146 (27%), Gaps = 7/146 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + + + G I AL++P+ + ++ + L + A+ Sbjct: 4 RRVAAGMRRRERGTTAIEFALMLPMFFLILYAIITYGMIFAAQQNL-------TLAATEG 56 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 +L + S +Q + T + + + + Sbjct: 57 ARAALNYQQVGSAASVQAAQQAALALRAQAACTAATNLTTWLKGATCAQTQQGTCSYDAT 116 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGI 153 V ++ Y +PL +G Sbjct: 117 MLCVQITLTYPYSQSPLIPAFPLLGS 142 >gi|326797335|ref|YP_004315155.1| hypothetical protein Marme_4119 [Marinomonas mediterranea MMB-1] gi|326548099|gb|ADZ93319.1| hypothetical protein Marme_4119 [Marinomonas mediterranea MMB-1] Length = 187 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 55/178 (30%), Gaps = 3/178 (1%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + + +G I AL +P+M G+ + D+ + ++QA + + + + Q Sbjct: 3 RRFLANESGATVIEFALSLPIMFGILLVSTDLYNINRMRGDMEQA--SHNLASILANQQE 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 S T EEY + + N+ ++ +R Y Sbjct: 61 WNADSFDYLIEHTIDNSVGEEYELIVSKVNIDRSMDWSPIRRGEISDVCAEKSSGKYYSD 120 Query: 132 VLSSRYDLLLNPLSLFLRSMGI-KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + L ++ +I + S + V+D S Sbjct: 121 QMPEEDPDSNTASFLVIQLCRYNDDLIINSGLLGSKKMESTSINRLLYHSVVLDKRLS 178 >gi|311028997|ref|ZP_07707087.1| hypothetical protein Bm3-1_00293 [Bacillus sp. m3-13] gi|311032266|ref|ZP_07710356.1| hypothetical protein Bm3-1_17297 [Bacillus sp. m3-13] Length = 245 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K I+L+TDG + + + IA+ AK QGI + I T E+ + A Sbjct: 7 KQILLITDGCSNSGD-DPIAMAALAKEQGITVNVIGVMDEDTIDERGMQEIEGIAMSG 63 >gi|301109920|ref|XP_002904040.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262096166|gb|EEY54218.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 2146 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 24/209 (11%), Positives = 50/209 (23%), Gaps = 30/209 (14%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK-- 318 S H + + R+ + ++ Sbjct: 1898 SGGKMHHVFVLDCSGSMNGQPWNDLMAAWKEYVYNRIAD---GATLDLVSVVTFDNSAQI 1954 Query: 319 ---LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 T A + G T +++A + + N D IV + Sbjct: 1955 VYEARSITTVTNARIQYRGGGTNYAAGLRSANEVLSRVNFDMFKPA---------IVFFS 2005 Query: 376 DGENTQDNEEGIAICNKAK----SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFF 429 DG D +G + + G++ + F L A ++ Sbjct: 2006 DGHP-CDPLQGEELATHIRGCYERNGLQAFAVGFG------SINLNMLERVAEKLGGTYH 2058 Query: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 + +EL F + + + K Sbjct: 2059 HVLTGNELKATFFSISASLSTRAGLALAK 2087 >gi|293569033|ref|ZP_06680345.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] gi|291588214|gb|EFF20050.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] Length = 1502 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 38/413 (9%), Positives = 82/413 (19%), Gaps = 49/413 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A + N F + +KK D E Sbjct: 305 AGEEDRNVPAQTVQLWGNERNFENSYL-----DYNGAYIKKWVEPVLPSSTASDLHPEDA 359 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++P+ + S T + E + + Sbjct: 360 TTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDL 419 Query: 184 DFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S+ ++ Q + + + S+N + +P Sbjct: 420 LSDPSLDIRIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPL 479 Query: 233 MVSCNKS--LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV--RDALASVIRSIKKI 288 + Y G V + + Sbjct: 480 TLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVME 539 Query: 289 DNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D + + + F N R + N A Sbjct: 540 DKIAEDNDVLFRNFEEFSGNTSNAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKAT 599 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D ++N + + A + + E + I +I Sbjct: 600 HWVGDGSSNNNTNGSPTSQEKSSAINTVAY-HHWLKNKYQENPPS-----------IFSI 647 Query: 404 AFSVNKTQ------QEKARYFLSNCAS------PNSFFEANSTHELNKIFRDR 444 ++ R L N A ++ AN+ +++ D Sbjct: 648 GLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKNDIVTALEDI 700 >gi|317133199|ref|YP_004092513.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] gi|315471178|gb|ADU27782.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] Length = 535 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 61/208 (29%), Gaps = 24/208 (11%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK---IDNVNDTVRMGA 299 LY L + +D S + V+ A+ ++ ++ Sbjct: 342 NLYQTSLRKPSYTVYCLDYSGSMGDNGGETGVKKAMDMILNQSTAKLYFLQATPQDKIAV 401 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D V ++ G + + G T I + TA + Sbjct: 402 IAFSDSVK-AEWYATGGDLSSMSTLDQNIQKLQAGGGTDIYTPVMTALQQL-------AG 453 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS--QGIRIMTIAFSVNKTQQEKARY 417 + ++L+TDG++ + + KS + I I +I F Q K Sbjct: 454 ADVSQCNP--AVILMTDGQSNTGRTFT-NVQSTYKSIGKDIPIFSIEFGAADPTQLKQFG 510 Query: 418 FLSNCAS-PNSFFEANSTHELNKIFRDR 444 LS A +L F+ Sbjct: 511 TLSKAALFDGR-------KDLVAAFKQA 531 >gi|254460998|ref|ZP_05074414.1| von Willebrand factor, type A [Rhodobacterales bacterium HTCC2083] gi|206677587|gb|EDZ42074.1| von Willebrand factor, type A [Rhodobacteraceae bacterium HTCC2083] Length = 627 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 36/342 (10%), Positives = 88/342 (25%), Gaps = 33/342 (9%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 + + + L + + + + VV + + S Sbjct: 123 ADMAVGAIASELAPSSVPVLRMSAKSQSRTLVAGSIADASVVA-------VEVDTEAFAS 175 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + + T S ++ + + + +N F Sbjct: 176 ASENTLKVTRETSVSTFSVDVDTASYAIVRSSLMN-GQLPPAAAVRIEEMVNYFPYDYAA 234 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + + + + +L + P L+ +D+S + K Sbjct: 235 PTQGVFATSLAMMAT--PWNENTQLLRIALQGQMPEVEARPPLNLVFLIDTSGSMNQPNK 292 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L++ +L ++ ++ D V + + R + + Sbjct: 293 LPLLKQSLRLLLGQLRAEDQVA------IVAYAGSAGQVLEPT---AAHERDTILGALDN 343 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-- 388 G T +Q AY E+ ++L TDG+ A Sbjct: 344 LAAGGETNGQAGLQQAYALAKQMQEEGEVSR---------VLLATDGDFNVGLSNAEALK 394 Query: 389 --ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I +K + G + + F E + N ++ Sbjct: 395 GYIADK-RDSGTFLSVLGFGRGNLDDETMQALAQNGNGQANY 435 >gi|198436180|ref|XP_002124514.1| PREDICTED: similar to von Willebrand factor A domain containing 3A [Ciona intestinalis] Length = 1107 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 43/141 (30%), Gaps = 21/141 (14%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-----TFAIDENEMGSTAINDAMQTAY 347 + + T FN S+ S W + G +A A+Q A Sbjct: 894 EQLNGNKTAFNIVAFSNTSTKWQDSITESNQSACHDAVQWVSALTAHGGSATLKAIQVAL 953 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + I LLTDG+ + ++ + + I I TI+F+ Sbjct: 954 ADEEAEA----------------IYLLTDGKPDSSIKLTLSEASNLNKKNIPIHTISFNC 997 Query: 408 NKTQQEKARYFLSNCASPNSF 428 + + LS+ + Sbjct: 998 DNREANDFLKSLSSNSGGRFH 1018 >gi|119485135|ref|ZP_01619520.1| hypothetical protein L8106_06794 [Lyngbya sp. PCC 8106] gi|119457363|gb|EAW38488.1| hypothetical protein L8106_06794 [Lyngbya sp. PCC 8106] Length = 564 Score = 44.2 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 48/156 (30%), Gaps = 21/156 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+ F+ + + ++ F + G T + DA +A + + + Sbjct: 419 RIALIDFDSEIRPPVVVDGTSEGRNQGML--FINNLKAEGGTKLYDATLSAQTWLQQNQQ 476 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK------SQGIRIMTIAFSVNK 409 + +++LTDGE++ + + + + I TI + Sbjct: 477 PDAINA---------VIILTDGEDSASQISLDKLQQELQKTGFNTDKRIAFFTIGYGKEG 527 Query: 410 TQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDR 444 L A + ++ ++ + D Sbjct: 528 EFNS---TVLKKIADLNSGYYRQGDPETISTLMADL 560 >gi|221111396|ref|XP_002160892.1| PREDICTED: similar to proximal thread matrix protein 1 [Hydra magnipapillata] Length = 315 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 52/154 (33%), Gaps = 16/154 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + V + + + + + + K ++ T I+ A++ A Sbjct: 159 ISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFT----TRIDRALRLAQ 214 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAF 405 + +S + K I+LLTDG T E+ I ++ ++ G+ I+ + Sbjct: 215 KDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGI 268 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E + + + A + L Sbjct: 269 GSAVNETELSHIT----GGKKNAYTAATFDSLTD 298 Score = 37.3 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 30/82 (36%), Gaps = 6/82 (7%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + K I+LLTDG T E+ I ++ ++ G+ I+ + + E + Sbjct: 5 ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHI 64 Query: 418 FLSNCASPNSFFEANSTHELNK 439 + + A + L Sbjct: 65 T----GGKKNAYTAATFDSLTD 82 >gi|154687788|ref|YP_001422949.1| YwmD [Bacillus amyloliquefaciens FZB42] gi|154353639|gb|ABS75718.1| YwmD [Bacillus amyloliquefaciens FZB42] Length = 226 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 54/185 (29%), Gaps = 17/185 (9%) Query: 265 RHVIKKKHLVRDALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 +K + ++++ S + + + + N + + ++ L Sbjct: 47 TGGERKIDVAKESVTSFAEVLPEDTNLMLRVFGHKGNNKNSGKAVSCNATETLYGLQPYA 106 Query: 324 V---KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 V + G T I A+ + + K + L+TDGE T Sbjct: 107 VTPFEQSLSKIKPTGWTPIAKALSDTREEFERA----------GATGKNVVYLITDGEET 156 Query: 381 QDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 I K+ + I F+ E A ++ A+S E + Sbjct: 157 CGGNPQAEIQKLRKANVNTIVNIIGFNFGMKGSESLEKAAE--AGGGTYVSADSADEFKQ 214 Query: 440 IFRDR 444 + D Sbjct: 215 AWEDA 219 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 14/142 (9%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G ++ R F ++ +R++++ G T AM+ A + S Sbjct: 42 RIGLIVYSSRSYLVGGFR--RYRNLRSVLQAIKRIRYIRGGTYTGKAMKYALRKLFSRRA 99 Query: 356 DEVHRM-----KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 H + A K +V++TDG +QD A+ + K G+ I ++ Sbjct: 100 GYHHARVRLFRSSRKGAAKILVMITDGI-SQDRVTTPAL--RLKKMGVVIFSVGVG---- 152 Query: 411 QQEKARYFLSNCASPNSFFEAN 432 ++ + + + + P F A Sbjct: 153 KRYRLKQLMQIASRPRLVFTAP 174 >gi|149927035|ref|ZP_01915293.1| hypothetical protein LMED105_09277 [Limnobacter sp. MED105] gi|149824256|gb|EDM83476.1| hypothetical protein LMED105_09277 [Limnobacter sp. MED105] Length = 164 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 33/90 (36%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M T+ ++ G + AL++PV+L V +V +H L A A Sbjct: 1 MTTATRAHSAFVNRAENQRGAQLVELALMLPVVLTVIFAIVGYSLLFMVQHTLSSAVSQA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 + +V + E ++R + P Sbjct: 61 ARSVAVAGSTADPEAAARQLLLNSLPSAMY 90 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 18/147 (12%), Positives = 52/147 (35%), Gaps = 11/147 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +++ +++ + D + + F V + + R Sbjct: 37 ITELQEGVSTFFAQLLADDVAEYSAEVAVVTFGGNVD----MAVDFAAVTRQT----VPS 88 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + +A++TA + + + E+ + + +V++TDG T + + + Sbjct: 89 LTAGGMTPMGEAVETALELLHTRKEEYKRAGVDYYQPW--LVIMTDGAPTDNISKASRLV 146 Query: 391 NKA-KSQGIRIMTIAFSVNKTQQEKAR 416 + + + + + I + E A+ Sbjct: 147 DDLVREKKLAVFAIGIGKDADMNELAK 173 >gi|294140776|ref|YP_003556754.1| hypothetical protein SVI_2005 [Shewanella violacea DSS12] gi|293327245|dbj|BAJ01976.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 438 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 11/152 (7%), Positives = 43/152 (28%), Gaps = 2/152 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ + + +L V + +D + L+ A ++++ A+ L + Sbjct: 18 KQGGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSVLYAAKILQNDGS-LF 76 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + + Q + + ++ + D + + +E + + Sbjct: 77 QAREAATLILIQNLGFEENSELDTSIDLSSPDYNSTLVTSNIYIE-FSEWPDPFIPVFDE 135 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 + + + + + Sbjct: 136 TSKYVRVRIENVGLDNFFAQIFNFNKTVRASA 167 >gi|225873376|ref|YP_002754835.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] gi|225792625|gb|ACO32715.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] Length = 363 Score = 44.2 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 36/113 (31%), Gaps = 9/113 (7%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---- 403 + + + +K ++LLTDGE+ + + A+ + I Sbjct: 208 TVLYDAVAQAANDKLREQTGRKALILLTDGEDLGSATKPLQAIADAQKANTIVYVILIAD 267 Query: 404 -AFSVNKTQQEKARYFLS--NCASPNSFFEA-NSTHELNKIFRDRIGNEIFER 452 F T + A+ N+ +L F++ I E+ + Sbjct: 268 RGFYGGYTFGYTGDAQMRRLAEATGGRMINVGNNGAKLTAAFKE-IARELRTQ 319 >gi|291393629|ref|XP_002713393.1| PREDICTED: alpha 1 type VII collagen [Oryctolagus cuniculus] Length = 2937 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 47/142 (33%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ +++ + G+T A++ D I Sbjct: 77 VRFAAVQYSDDPRTEFGLDTLASG--GEVIRAIRELSYKGGNTRTGAAIRHVADHIF--- 131 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + ++ A + K QG+++ + + Sbjct: 132 ----LPQLTRPGIPKVCILITDG---KSQDQVDAAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTGDFFFFVND 200 >gi|260892685|ref|YP_003238782.1| hypothetical protein Adeg_0789 [Ammonifex degensii KC4] gi|260864826|gb|ACX51932.1| hypothetical protein Adeg_0789 [Ammonifex degensii KC4] Length = 209 Score = 44.2 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 10/41 (24%), Positives = 20/41 (48%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 +++K G+ + A L+PV++ V +D R + L Sbjct: 2 RVLKGEKGYALLYFAFLLPVLVIVAAWALDFTRIRLVKDQL 42 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 69 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKIIV 120 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 121 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPANLRANLALPLIAPQYTTDNSGT 180 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 181 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 240 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 241 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 300 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 301 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 358 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 359 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 407 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 408 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 467 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 468 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|282900951|ref|ZP_06308884.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] gi|281194042|gb|EFA69006.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] Length = 575 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 62/194 (31%), Gaps = 16/194 (8%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 P + + ++ +D+S + V+D L + I Sbjct: 374 VPPVPSGKVLSLGQTFWKTQKDTGKTVYLMAVIDTSGSM-YGGPLNAVKDGLRIASQQIN 432 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + V + D+ ++ + L + G+TA+ D M A Sbjct: 433 PGNYVG------LVTYGDQPVNLVKLAP-FDDLQHKRFLAAIDNLQADGATAMYDGMMVA 485 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 ++ + K N K Y++LLTDG+ Q + + G+R+ IA+ Sbjct: 486 LSELV-------QQKKTNPNGKFYLLLLTDGQTNQG-FNFEEVKEIIQYSGVRVYPIAYG 537 Query: 407 VNKTQQEKARYFLS 420 + A L Sbjct: 538 EVNEAELNAIAALR 551 >gi|218460899|ref|ZP_03500990.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 373 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 29/294 (9%), Positives = 75/294 (25%), Gaps = 20/294 (6%) Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL 148 + + + E + N + + + + + P + Sbjct: 85 AARKSDLNSDEISALLNRPGGTMSEGAKRSVAPTAAAPQQQLAEPMAVAPSPAPPTESRM 144 Query: 149 RSMGIKSWLIQTKAEAE----------TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 + S A A + + S +M D Q Sbjct: 145 QVQVDSSRERFANAAANPIKSVATDPVSTFSADVDSASYSFVRRSLTGGAMPDPQSVRVE 204 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 + +N F R + V + + + ++ P + Sbjct: 205 EMINYFPYDWPRPENAEQPFKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHANLVFL 264 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D S K L++ + ++ +K D V + + + Sbjct: 265 IDVSGSMDEPDKLPLLKSSFRLLVNRLKADDTVA------IVTYAGNAGTVLEPT---RV 315 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ + + GST + ++ AYD + + + +++ V Sbjct: 316 SEKSKILSAIDRLEAGGSTGGAEGIEAAYD-LAKRLSSRTASSGDARDGRRFYV 368 >gi|228931399|ref|ZP_04094324.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228828205|gb|EEM73915.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 448 Score = 44.2 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 58/182 (31%), Gaps = 30/182 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-------IRT 322 K + A+ + + I ++ + +D S S G ++ + Sbjct: 169 KMEAAKKAIYNYLDKIPD---NSNVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKE 225 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T + A+++ D + E + +++DGE T Sbjct: 226 QFNAALSKFGPKGWTPLASAIESVNDDF---------KEYTGEENLNVVYIVSDGEETCG 276 Query: 383 NEEGIAICNKAKSQG-----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 E N AK+ + I F V ++Q + + A ++ ++ EL Sbjct: 277 GEPV----NAAKNLNQSSTHAVVNIIGFDVKNSEQLQLKNTAE--AGKGNYATVSTADEL 330 Query: 438 NK 439 + Sbjct: 331 YQ 332 >gi|332823606|ref|XP_003311226.1| PREDICTED: complement C2 isoform 3 [Pan troglodytes] Length = 538 Score = 43.8 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 51/184 (27%), Gaps = 17/184 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 + + + + ++ P+ ++ T ++E G T A Sbjct: 69 QGNGVWSGTEPICRQPYSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAA 128 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR-- 399 + + Y + + E + I+LLTDG++ + + I Sbjct: 129 LNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTAVDHIREILNINQK 187 Query: 400 ------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I + S F T L+++F V Sbjct: 188 RNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEHM------LDV 241 Query: 454 IRIT 457 ++T Sbjct: 242 SKLT 245 >gi|317123666|ref|YP_004097778.1| type II secretion system F domain [Intrasporangium calvum DSM 43043] gi|315587754|gb|ADU47051.1| Type II secretion system F domain [Intrasporangium calvum DSM 43043] Length = 652 Score = 43.8 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 75/241 (31%), Gaps = 22/241 (9%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 +++ + + PG + S+ S K+ ++ + + + Sbjct: 39 SSSGTVTGVLTVRSANPVQVDPGSVKASVDGVIVKSFVSEMTHTKRTAMLVIDTSGSMGT 98 Query: 285 IKKIDNVNDT-----VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 T ++ + + R + + G T++ Sbjct: 99 DGMATVRAATRAYLKEAPEDVLIGVVTFANTAGVDLKPTVDRAAAQRVVNGLDARGDTSL 158 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK---AKSQ 396 A+++A + + + +VLL+DG +T ++ + K + Sbjct: 159 YAAVRSAARAMPGDGD-------------RSMVLLSDGADTVSDDRQGDLAEANRELKRR 205 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 G+R+ + F+ + A ++ AS S A + ++ F+ + + Sbjct: 206 GVRVDVVRFNTDDPDAVVALRSFAS-ASGGSVIPATNASDVGAAFKSAARALRSQAQFTL 264 Query: 457 T 457 T Sbjct: 265 T 265 >gi|295849303|ref|NP_001171534.1| complement C2 isoform 3 [Homo sapiens] gi|194390502|dbj|BAG60565.1| unnamed protein product [Homo sapiens] Length = 538 Score = 43.8 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 51/184 (27%), Gaps = 17/184 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 + + + + ++ P+ ++ T ++E G T A Sbjct: 69 QGNGVWSGTEPICRQPYSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAA 128 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR-- 399 + + Y + + E + I+LLTDG++ + + I Sbjct: 129 LNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTAVDHIREILNINQK 187 Query: 400 ------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I + S F T L+++F V Sbjct: 188 RNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEHM------LDV 241 Query: 454 IRIT 457 ++T Sbjct: 242 SKLT 245 >gi|123283203|emb|CAI17449.2| complement component 2 [Homo sapiens] gi|123857991|emb|CAM25861.1| complement component 2 [Homo sapiens] gi|168983783|emb|CAQ06834.1| complement component 2 [Homo sapiens] gi|168984349|emb|CAI41857.2| complement component 2 [Homo sapiens] gi|168984417|emb|CAQ09273.1| complement component 2 [Homo sapiens] gi|168985078|emb|CAQ07482.1| complement component 2 [Homo sapiens] gi|168985956|emb|CAQ07112.1| complement component 2 [Homo sapiens] Length = 525 Score = 43.8 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 51/184 (27%), Gaps = 17/184 (9%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDA 342 + + + + ++ P+ ++ T ++E G T A Sbjct: 56 QGNGVWSGTEPICRQPYSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAA 115 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR-- 399 + + Y + + E + I+LLTDG++ + + I Sbjct: 116 LNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTAVDHIREILNINQK 174 Query: 400 ------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 I I + S F T L+++F V Sbjct: 175 RNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEHM------LDV 228 Query: 454 IRIT 457 ++T Sbjct: 229 SKLT 232 >gi|118081930|ref|XP_414992.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 1794 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 12/114 (10%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G+T + + + + ++L+TDG +QD+ Sbjct: 552 YAGGNTKTGRGIAFMLKELFQPSRGMRPEFPH------VLMLVTDGR-SQDDVLPP--AR 602 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A + GIRI+ + E L + + F ++ E +I R+ I Sbjct: 603 AAHALGIRIIAVGV-SGADPAELNDILLQQ--NLQNVFYVSTFDEFPQILRELI 653 >gi|156257452|gb|ABU63134.1| microneme 1 precursor [Eimeria tenella] Length = 675 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 34/115 (29%), Gaps = 7/115 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + + VR+G F R + S + G T + Sbjct: 39 DFVNSMPISPEDVRVGLITFATRSKVRWNLSDPKATNPSLAISAARSLSYSTGVTYTHYG 98 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +Q A + +N K ++++TDG + + + + G Sbjct: 99 LQDAKKLLYDTN------AGARNNVPKLVLVMTDGASNLP-SQTRSSAAALRDAG 146 >gi|2707733|gb|AAD03350.1| microneme protein precursor Etmic-1 [Eimeria tenella] Length = 712 Score = 43.8 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 34/115 (29%), Gaps = 7/115 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + + + VR+G F R + S + G T + Sbjct: 75 DFVNSMPISPEDVRVGLITFATRSKVRWNLSDPKATNPSLAISAARSLSYSTGVTYTHYG 134 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +Q A + +N K ++++TDG + + + + G Sbjct: 135 LQDAKKLLYDTN------AGARNNVPKLVLVMTDGASNLP-SQTRSSAAALRDAG 182 >gi|313221459|emb|CBY32209.1| unnamed protein product [Oikopleura dioica] Length = 1138 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 65/207 (31%), Gaps = 21/207 (10%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + G S + V + + ++ +DSS + ++ R+A Sbjct: 545 DVEQTGCSLFTDSQFFVQRCRPVNLFPAKLRSSFGVNAIFLIDSSGSM-MGERMEQTREA 603 Query: 278 LASVIRSIKKIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 +I +K D N F+D + + L F G+ Sbjct: 604 FKFMIEGLKPGDTFNIVSFESVNKVFSDNRMVPVNDRSMFAALK------FMDQIQAGGA 657 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN----K 392 T + A+ A + + K + I LTDG T +I + Sbjct: 658 TDVYAALVKASLLLS--------QNKRTSNQENIIYFLTDGAPTAGVTNLNSILDMVDFI 709 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFL 419 A+ I + TIA+ +K + FL Sbjct: 710 AQKSEIVMNTIAYGEEAN-DDKMKDFL 735 >gi|269960460|ref|ZP_06174832.1| hypothetical protein VME_12160 [Vibrio harveyi 1DA3] gi|269834537|gb|EEZ88624.1| hypothetical protein VME_12160 [Vibrio harveyi 1DA3] Length = 154 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 39/137 (28%), Gaps = 11/137 (8%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 D++ I + + G I + +PV+L + +++DV R + +A Sbjct: 1 MDSQIIQLRIRAFQEQKGLAIIEFIIALPVLLMLSVLIIDVCRAFIQYTEVNKAL----- 55 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + F E + + I Sbjct: 56 ------QNGARYALVDTYGTLDFSAIADEASIKNVVVYGKPAGGGTPVIDHIEVGDITVT 109 Query: 123 NPRKSAYQVVLSSRYDL 139 P + QV LS+ YD Sbjct: 110 QPTAATKQVTLSATYDY 126 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 51/161 (31%), Gaps = 16/161 (9%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+G + D ++ + + + + G + A+ Sbjct: 281 GSQFTRVGMMQYGDEPHTEFDL--NTFQNGSQVFEAISNVTQIGGESGPYAAILQVLRRS 338 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +++ + + I+ +TDG D+EE I N+ + G + TI Sbjct: 339 LTAQ------YGSRENVSQIIIFVTDGGVVDDSEESQTILNELRFSGALVYTIGVG---- 388 Query: 411 QQEKARYFLSNCASPNSFFEANST---HELNKIFRDRIGNE 448 + +R L AS + + EL+ I Sbjct: 389 -RMVSRPQLRMIASRPASHHVTTIASYSELSATKSQIIDRI 428 >gi|47229797|emb|CAG06993.1| unnamed protein product [Tetraodon nigroviridis] Length = 1160 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 54/154 (35%), Gaps = 17/154 (11%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLI---RTIVKTFAIDENEMGSTAINDAMQT 345 D ++D + FN++ + + + + I K + G+T Sbjct: 190 DTLSDDDYVNVARFNEKADAVVPCFRTLVQANVRNKKIFKEAVMHMQAKGTTDYKSGFTF 249 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTI 403 A++ +++ + K I++ TDG + I K ++ +R+ T Sbjct: 250 AFEQLLNESSAPRANCN------KMIMMFTDG----GEDRAQEIFEKYNWPNKTVRVFTF 299 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + ++ C++ +FE S + Sbjct: 300 SVGQHNYDVTPLQWI--ACSNKGYYFEIPSIGAI 331 >gi|225420217|ref|ZP_03762520.1| hypothetical protein CLOSTASPAR_06560 [Clostridium asparagiforme DSM 15981] gi|225041138|gb|EEG51384.1| hypothetical protein CLOSTASPAR_06560 [Clostridium asparagiforme DSM 15981] Length = 231 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 51/171 (29%), Gaps = 22/171 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + L ++ + ++ + FN V + F A Sbjct: 40 INELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVVPFCP--------AANYSAPT 91 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA-- 388 + G T++N+A+ D I + + + LLTDGE T N EG A Sbjct: 92 LSAGGLTSMNEAVIAGLDAIEERKQLYRQLGCSYYRPW--MFLLTDGEPTDQNMEGEAKN 149 Query: 389 -ICNKAKSQGIRIMTIAFSVNKTQQEKAR-------YFLSNCASPNSFFEA 431 + + + + L AS + F EA Sbjct: 150 RLQQALNDKKVNFFPMGIGSGANYAHLKSYTKGGNGAVLK--ASASQFKEA 198 >gi|221133174|ref|XP_002171310.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 221 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 53/154 (34%), Gaps = 16/154 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + V + + ++ + + + K ++ T I+ A++ A Sbjct: 49 ISSNGSRVGVITFSYRAKLSVKLNSFTDLSSFNEAVDKIPLMNFT----TRIDRALRLAQ 104 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAF 405 + +S + K I+LLTDG T E+ I ++ ++ G+ I+ + Sbjct: 105 KDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGI 158 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E + + + A + L Sbjct: 159 GSAVNETELSHIT----GGKKNAYTAATFDSLTD 188 >gi|167518794|ref|XP_001743737.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777699|gb|EDQ91315.1| predicted protein [Monosiga brevicollis MX1] Length = 874 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 57/171 (33%), Gaps = 12/171 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDEN 332 A VI ++ ++D +R+ F+ + F + V F N Sbjct: 287 FNLAQQFVISAVSQLDVGLAAIRVAGMMFHAEALPQFDFDDYTSAAQVQNAVANFNYPVN 346 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 E TA +A+ + ++ ++ + + +TDG + + + Sbjct: 347 ENWGTATGNALDSIRTNLLQASAGYRGG-------EVVVYFITDGVSQESPSVVESAAQA 399 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 ++ G ++M I + + + S ++ LN+ RD Sbjct: 400 LRATGAQVMAIGITDQIDETQLEVIA----GSADNVITVADFANLNEAVRD 446 >gi|85706702|ref|ZP_01037794.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] gi|85668760|gb|EAQ23629.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] Length = 240 Score = 43.8 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 50/195 (25%), Gaps = 27/195 (13%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 + R AL V+ I + S + +G Sbjct: 44 DPTAPTRIIEARVALRRVMPEIA---PYRRIGLLSYGAGGSHPCSGITRHFGPMPDAGAA 100 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QD 382 V G T I ++ A + + +VL+TDG T Sbjct: 101 VVAGIEALTPGGLTPIAASVAAAAEVL------------GYRTHPGIVVLVTDGNETCGG 148 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS--------NC---ASPNSFFEA 431 + A+++ + + I F V C A+ F Sbjct: 149 TPCALGTALAAEARDLTVHVIGFRVVHDPFSWNSPEAKGYDGQTVAKCLADATGGLFVST 208 Query: 432 NSTHELNKIFRDRIG 446 + EL R+ +G Sbjct: 209 ETVDELVAALRETLG 223 >gi|218887818|ref|YP_002437139.1| TadE family protein [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758772|gb|ACL09671.1| TadE family protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 162 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 16/143 (11%), Positives = 38/143 (26%), Gaps = 4/143 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L + G + A ++PV+L + LV+ + + A + I Sbjct: 10 RLWRGEGGVGSLEVAFMLPVLLAMLFGLVEFGYNLFARS----TVEKAAQVGARFAITGE 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + + L + + +V Sbjct: 66 GFDTGNRLALIKEAARPLTQVLSGSSGTGVTILVRSYPNGTGAAAVENSGGDPCQTVEVQ 125 Query: 133 LSSRYDLLLNPLSLFLRSMGIKS 155 + RY L + L + + Sbjct: 126 VDYRYAPLTPLVGSLLPAQITVT 148 >gi|16127181|ref|NP_421745.1| hypothetical protein CC_2951 [Caulobacter crescentus CB15] gi|221235982|ref|YP_002518419.1| TadE-like pilus assembly protein [Caulobacter crescentus NA1000] gi|13424579|gb|AAK24913.1| hypothetical protein CC_2951 [Caulobacter crescentus CB15] gi|220965155|gb|ACL96511.1| TadE-related pilus assembly protein [Caulobacter crescentus NA1000] Length = 183 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 10/71 (14%), Positives = 23/71 (32%), Gaps = 2/71 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGV-GGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + G + AL+ PVM+ + G+ +V + + L A + Sbjct: 4 RRPLSSFWRDRRGVSAVEFALIAPVMIVMYCGLA-EVTQAMMAQRRLSNIASQIGDLVAQ 62 Query: 67 PLIQSLEEVSS 77 +++ Sbjct: 63 SNQTGPTKMAD 73 >gi|301762312|ref|XP_002916580.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 3529 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 24/72 (33%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + S E+ K I L+TDG + + + Sbjct: 122 GGGTYTKGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--VAASL 169 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 170 RDFGVEIFTFGI 181 >gi|194225621|ref|XP_001916184.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Equus caballus] Length = 3570 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 24/72 (33%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + S E+ K I L+TDG + + + Sbjct: 164 GGGTYTKGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--VAASL 211 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 212 RDFGVEIFTFGI 223 >gi|149918084|ref|ZP_01906577.1| hypothetical protein PPSIR1_41829 [Plesiocystis pacifica SIR-1] gi|149821089|gb|EDM80495.1| hypothetical protein PPSIR1_41829 [Plesiocystis pacifica SIR-1] Length = 719 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 16/116 (13%), Positives = 36/116 (31%), Gaps = 16/116 (13%) Query: 349 TIISSNEDEVHRMKNNLEAK--------KYIVLLTDGENTQD--NEEGIAICNKAKSQGI 398 +I++N + H +A+ +L+TDG + + A ++G+ Sbjct: 441 ELIAANRSQYHLDGQAPDAEFPTHAQTPYVNILITDGAYAAYSTDAQVQAALEAMFNEGV 500 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCA-----SPNSFFEANSTHELNKIFRDRIGNEI 449 + F + LS A + + ++ EL + Sbjct: 501 TTHVVGFGEGADTPQALIE-LSAMAAWGSGGEGAPYHVDTQQELQSALAQIAASIA 555 >gi|73971950|ref|XP_532030.2| PREDICTED: similar to sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Canis familiaris] Length = 3569 Score = 43.8 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 24/72 (33%), Gaps = 12/72 (16%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A Q A + S E+ K I L+TDG + + + Sbjct: 164 GGGTYTKGAFQQAAQILRHSRENST----------KVIFLITDGYSNGGDPRP--VAASL 211 Query: 394 KSQGIRIMTIAF 405 + G+ I T Sbjct: 212 RDFGVEIFTFGI 223 >gi|330969427|gb|EGH69493.1| hypothetical protein PSYAR_02909 [Pseudomonas syringae pv. aceris str. M302273PT] Length = 678 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 49/177 (27%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + ++ F Q + + V V + A + + + Sbjct: 72 A----SAPDFASQSATRNGFTPNTDGRTLVTRCGTLAVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASASPLAALTIRSAAVTVDSTGAA 184 >gi|220897449|emb|CAX15333.1| complement component 2 (within H-2S) [Mus musculus] Length = 623 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 58/204 (28%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRT 322 K + + + ++ I + V + F + + S S V ++I + Sbjct: 136 TEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITS 192 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + D T + + Y + S + E + I+LLTDG++ Sbjct: 193 LDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMG 252 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + I I I ++ S F Sbjct: 253 DS-PKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQD 311 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L +IF V ++T Sbjct: 312 AKALQQIFEHM------LDVSKLT 329 >gi|148694788|gb|EDL26735.1| complement component 2 (within H-2S), isoform CRA_b [Mus musculus] Length = 753 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 58/204 (28%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRT 322 K + + + ++ I + V + F + + S S V ++I + Sbjct: 273 TEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITS 329 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + D T + + Y + S + E + I+LLTDG++ Sbjct: 330 LDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMG 389 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + I I I ++ S F Sbjct: 390 DS-PKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQD 448 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L +IF V ++T Sbjct: 449 AKALQQIFEHM------LDVSKLT 466 >gi|148694789|gb|EDL26736.1| complement component 2 (within H-2S), isoform CRA_c [Mus musculus] Length = 809 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 58/204 (28%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRT 322 K + + + ++ I + V + F + + S S V ++I + Sbjct: 273 TEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITS 329 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + D T + + Y + S + E + I+LLTDG++ Sbjct: 330 LDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMG 389 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + I I I ++ S F Sbjct: 390 DS-PKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQD 448 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L +IF V ++T Sbjct: 449 AKALQQIFEHM------LDVSKLT 466 >gi|148694790|gb|EDL26737.1| complement component 2 (within H-2S), isoform CRA_d [Mus musculus] Length = 755 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 58/204 (28%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRT 322 K + + + ++ I + V + F + + S S V ++I + Sbjct: 273 TEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITS 329 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + D T + + Y + S + E + I+LLTDG++ Sbjct: 330 LDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMG 389 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + I I I ++ S F Sbjct: 390 DS-PKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQD 448 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L +IF V ++T Sbjct: 449 AKALQQIFEHM------LDVSKLT 466 >gi|157951694|ref|NP_038512.2| complement C2 precursor [Mus musculus] gi|3986766|gb|AAC84162.1| C2 [Mus musculus] gi|15029737|gb|AAH11086.1| Complement component 2 (within H-2S) [Mus musculus] gi|74147034|dbj|BAE27452.1| unnamed protein product [Mus musculus] gi|148694787|gb|EDL26734.1| complement component 2 (within H-2S), isoform CRA_a [Mus musculus] gi|220897448|emb|CAX15332.1| complement component 2 (within H-2S) [Mus musculus] Length = 760 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 58/204 (28%), Gaps = 23/204 (11%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRT 322 K + + + ++ I + V + F + + S S V ++I + Sbjct: 273 TEKDFDIFKKSAELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITS 329 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + D T + + Y + S + E + I+LLTDG++ Sbjct: 330 LDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMG 389 Query: 383 NEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + I I I ++ S F Sbjct: 390 DS-PKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQD 448 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 L +IF V ++T Sbjct: 449 AKALQQIFEHM------LDVSKLT 466 >gi|86132310|ref|ZP_01050905.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85817229|gb|EAQ38412.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 351 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 55/180 (30%), Gaps = 11/180 (6%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++ A S I ++ F + V+ + ++ + D + Sbjct: 132 LKQASTSFITNVMPAVPTESFKMAIYWFDGEDVLHELQPLTTSATQLQEAIDGVTDDISN 191 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 ST + A+ A ++ +L A +V+ TDG + + Sbjct: 192 DPSTDLYGAVIKAA---TNAENIVETLENEDLFAAASVVIFTDGTDQAARYSEQEALDAV 248 Query: 394 KSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + G I TI L+ S F A +++EL +F D + Sbjct: 249 SNAGEEISFFTIGLGSEID-----EEVLTAIGKTGSAF-AENSNELEAVFNDISNGVAGQ 302 >gi|291394751|ref|XP_002713732.1| PREDICTED: collagen, type XXVIII [Oryctolagus cuniculus] Length = 1132 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 50/171 (29%), Gaps = 14/171 (8%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV---NDTVRMGATF 301 + F+ SS I +D + + I ++ +R+ A Sbjct: 38 NDFWASICFIDIVFIVDSSESSKIVHFDKQKDFVERLSDKIFQVTPGHSLKYDIRLAALQ 97 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+ V DP FS T A+ Sbjct: 98 FSSSVQIDPPFSSWKDLKTFKQRAKSLNLI--GQGTFSYYAISNVTRLFKREGR------ 149 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K +L+TDG + N + +I A++ GI +TI S + + Sbjct: 150 ---KNGVKVALLMTDGIDHPKNPDVKSISEDARTSGISFITIGHSTDVNEA 197 >gi|257883753|ref|ZP_05663406.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] gi|257819591|gb|EEV46739.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] Length = 1475 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 38/413 (9%), Positives = 82/413 (19%), Gaps = 49/413 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A + N F + +KK D E Sbjct: 278 AGEEDRNVPAQTVQLWGNERNFENSYL-----DYNGAYIKKWVEPVLPSSTASDLHPEDA 332 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++P+ + S T + E + + Sbjct: 333 TTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDL 392 Query: 184 DFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S+ ++ Q + + + S+N + +P Sbjct: 393 LSDPSLDIRIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPL 452 Query: 233 MVSCNKS--LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV--RDALASVIRSIKKI 288 + Y G V + + Sbjct: 453 TLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVME 512 Query: 289 DNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D + + + F N R + N A Sbjct: 513 DKIAEDNGVLFRNFEEFSGNTSNAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKAT 572 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D ++N + + A + + E + I +I Sbjct: 573 HWVGDGSSNNNTNGSPTSQEKSSAINTVAY-HHWLKNKYQENPPS-----------IFSI 620 Query: 404 AFSVNKTQ------QEKARYFLSNCAS------PNSFFEANSTHELNKIFRDR 444 ++ R L N A ++ AN+ +++ D Sbjct: 621 GLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKNDIVTALEDI 673 >gi|91783018|ref|YP_558224.1| hypothetical protein Bxe_A2791 [Burkholderia xenovorans LB400] gi|91686972|gb|ABE30172.1| Predicted membrane protein [Burkholderia xenovorans LB400] Length = 595 Score = 43.8 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 50/171 (29%), Gaps = 10/171 (5%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ---- 70 + G ++ A+ + V + G +DV + L++ A A + A + Sbjct: 25 ARRQHGAIAMLAAIWLSVAIAALG-ALDVGNVYFARRQLQRTADLAAMAAVQLIGSTGGC 83 Query: 71 ----SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++ A I R ++ T + V+ ++ P Sbjct: 84 ATATTAAQQNAAANGFTAGSTTTISTTCGRWDTSSNTYFGTSGNPLNAVQVKTTQVVPYF 143 Query: 127 SAYQVV-LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 +S+ + + F GI S Q A + Sbjct: 144 FVGPSRNVSATATAFASNIDAFSLGTGIASINTQQSALLNAILGGLLNTSV 194 >gi|313225210|emb|CBY21004.1| unnamed protein product [Oikopleura dioica] Length = 1138 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 44/336 (13%), Positives = 91/336 (27%), Gaps = 26/336 (7%) Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM--GIKSWL 157 N + + V++ V S L + ++ + S+ Sbjct: 410 NQKLQEALVVDSTKNGESRIVQVKQAGQHLMVRASVEAFSNLTFVMEYVEVLERTRGSYR 469 Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + E S + + + + I F + + + F + + ++ Sbjct: 470 FGVHFDPEFPSENLKVQIMIQDENRIKFLQPIPRQSTLVRKETKRQFLEDYHKAAIEFTP 529 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF---VDSSSLRHVIKKKHLV 274 + R + V + + F + SS + I Sbjct: 530 SDNYESSRYWFGIEFDVEQTGCSLFTDSVFFVQRCRPVNLFPAKLRSSFGVNAIFLIDSS 589 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFN-------DRVISDPSFSWGVHKLIRTIVKTF 327 + + ++ FN ++V SD + +K F Sbjct: 590 GSMMGERMEQTREAFKFMIEGLKPGDTFNIVSFESVNKVFSDNRMVPVSDRSTFAALK-F 648 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G+T A+ A + + K + I LTDG T Sbjct: 649 MDQIQAGGATDAYAALVKASLLLS--------QNKRTSSQENIIYFLTDGAPTAGVTNLN 700 Query: 388 AICN----KAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +I + A+ I + TIA+ +K + FL Sbjct: 701 SILDMVDFIAQKSEIVMNTIAYGEEAN-DDKMKDFL 735 >gi|269958194|ref|YP_003327982.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] gi|269306875|gb|ACZ32424.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] Length = 145 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 32/117 (27%), Gaps = 4/117 (3%) Query: 8 IFYSKKLIKS-CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++L + G A+ M G+ VD + + +QA A A Sbjct: 2 RRLRERLARDPERGSASAWAAVTTIAMFLFVGIAVDFGGQLHAQ---QQARDVATQAARA 58 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 Q + R + P + + + R IV +A Sbjct: 59 GGQQINAPQAIRGQGVTAQPGDAYSAAASYLAGSGVSGSVQVAGARVIVDTSATYNT 115 >gi|194016210|ref|ZP_03054824.1| YwmD [Bacillus pumilus ATCC 7061] gi|194011683|gb|EDW21251.1| YwmD [Bacillus pumilus ATCC 7061] Length = 225 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 40/135 (29%), Gaps = 15/135 (11%) Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 +G + + G T + A+ E K + + Sbjct: 100 FGFGSYESSTFQQALNVYKPTGWTPLAKALTDTKQDFEDHQ----------AEGKNIVYV 149 Query: 374 LTDGENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 ++DG+ T + GI + I F VN+ + + A + A Sbjct: 150 VSDGQETCGGS-PSQAAKELHEGGIDTIVNIIGFDVNEKEARSLKSVAK--AGGGQYQPA 206 Query: 432 NSTHELNKIFRDRIG 446 + ELN I ++ Sbjct: 207 ANAEELNYILQNAAS 221 >gi|7959925|gb|AAF71133.1|AF116721_113 PRO2769 [Homo sapiens] Length = 494 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 12/84 (14%), Positives = 28/84 (33%), Gaps = 3/84 (3%) Query: 373 LLTDGENTQDNEEGIAICNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 +LTDG+ T+ + I ++ + + F N + N + Sbjct: 1 MLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIY 60 Query: 430 EANSTHELNKIFRDRIGNEIFERV 453 E + + + F ++ + V Sbjct: 61 EDHDATQQLQGFYSQVAKPLLVDV 84 >gi|54293612|ref|YP_126027.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] gi|53753444|emb|CAH14899.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] Length = 1169 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 29/276 (10%), Positives = 64/276 (23%), Gaps = 24/276 (8%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S S N A+ + + S ++ + Y Sbjct: 266 SSPFPPNYTISNYNQGNIRISYANTRPSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSN 325 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + + + V +A+ + + +K ++ Sbjct: 326 QSYATG-----NMLVNMQTAGTNPTTTSVNNAINAFLPHLK---PETNSTATTEIKAAAV 377 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++T+ T + I+D + T + + Sbjct: 378 QSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIILISDGLPT-----QDLQSRYWPPLGSAA 432 Query: 366 EAKKYIV--LLTDGE-NTQDNEEGIAICN---KAKSQGIRIMTIAFSVNKTQ--QEKARY 417 + DG N +++ N K+ G+ I I +A Sbjct: 433 ATGYGVTATFNADGSLNNTNSQALSDAINEIKALKNDGVLIFIIGMGAGVDPAVNPEAAA 492 Query: 418 FLSNCA---SPNSFFEANSTHELNKIFRDRIGNEIF 450 L A +++ A S L + N Sbjct: 493 TLRAMAVAGGTENYYPATSPEALVSSLNSILSNIQN 528 >gi|327299330|ref|XP_003234358.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] gi|326463252|gb|EGD88705.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] Length = 741 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 48/192 (25%), Gaps = 32/192 (16%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 L + A ++I ++ + D R+ F V ++K ++ V Sbjct: 103 LDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDS-MNKENKSTVLGAIDK 155 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT----QDNEEG 386 ST + M+ + + +++LTDG Sbjct: 156 LYGKSSTNLWHGMKKGLNIL-----ATNPAQGKIQS----LLVLTDGAPNHMCPAQGYVP 206 Query: 387 IAICNKAKSQGIR-----IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNK 439 + I T F L + A +F + Sbjct: 207 KLRQTLLDHHNLTGTLPLIHTFGFGYY-----LRSPLLQSIAEIGGGTFAFIPDAGMIGT 261 Query: 440 IFRDRIGNEIFE 451 +F + N Sbjct: 262 VFVHAVANLYST 273 >gi|269836152|ref|YP_003318380.1| hypothetical protein Sthe_0119 [Sphaerobacter thermophilus DSM 20745] gi|269785415|gb|ACZ37558.1| hypothetical protein Sthe_0119 [Sphaerobacter thermophilus DSM 20745] Length = 598 Score = 43.8 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 33/348 (9%), Positives = 85/348 (24%), Gaps = 9/348 (2%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G +I A +++ VDV ++ AA A + + + + ++ Sbjct: 11 RGQILVIFAAATVLLIAFLAAAVDVGYLLSQRRGVQNAADAAALAVAHAVHSGVTNTATL 70 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 + + + ++ + + K ++ + + P + +R Sbjct: 71 EQIAAYYVEENGYDATPEITFSPDMKQVRVTVNAEVPKFFVGLVYPG----DWKVGARAA 126 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEG 198 L P+ + + K T + S S++ Q+ + Sbjct: 127 ASLEPVPFDAALLALNPSSGGIKTSGNTGIEVTGGSVMSNYTIETSGSTSIIADQQVNAN 186 Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIR-DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 G + +V +K+ P + + G + Sbjct: 187 DGFRESGASRIEGGMGTNPYAPEVPDPLKDKIQPPTMPPFPNNPVPNVGGSPQQCMEYPQ 246 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + G FN+ Sbjct: 247 PWGVNIYTVPAGTYS--GGGGNCLTIQ-NVQSGNTLEFAHGNYRFNNGAGISIGGGNNGP 303 Query: 318 KLIRTIVKTFAID-ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 ++R TF +G + + M+ ++ + N Sbjct: 304 IVLRGGNYTFNGGPGISVGGSTPDFRMERGNYAFLNGARLSIGGSANG 351 >gi|330895596|gb|EGH27904.1| hypothetical protein PSYJA_02284 [Pseudomonas syringae pv. japonica str. M301072PT] Length = 677 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 49/177 (27%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + ++ F Q + + V V + A + + + Sbjct: 72 A----SAPDFASQSATRNGFTPNTDGRTLVTRCGTLAVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASASPLAALTIRSAAVTVDSTGAA 184 >gi|328952507|ref|YP_004369841.1| hypothetical protein Desac_0781 [Desulfobacca acetoxidans DSM 11109] gi|328452831|gb|AEB08660.1| hypothetical protein Desac_0781 [Desulfobacca acetoxidans DSM 11109] Length = 465 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 32/346 (9%), Positives = 84/346 (24%), Gaps = 21/346 (6%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS-- 71 ++ G ++ A+ + ++GV G +D ++ + A + A + + L Sbjct: 1 MLSDEKGSVAVVVAVCLFALVGVLGFSLDTAFLYVEKNHTQNAVEAAALAGATNLCAGDQ 60 Query: 72 -LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 E R + E + + ++ D V + + Sbjct: 61 EDIEAIVRNVAAENGLTGANHEVFVTFGFYDEMNDYGDFPVYKDFVEKSGMPAGEYVNAV 120 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 V ++ L L+ + + + + ++ H Q +S + Sbjct: 121 AVACLTNNVTLTGLNKSGTVAAM-AVAFLRRFDMVSLDGDILLGHNSQWQNCTFYSNGDI 179 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYS-----SQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 Y + + + Sbjct: 180 KYPASASMSGKSYGPPTFSDCRLLCAGQMLECPVTVKTSWSGSKMTINWDGGSPHALAGA 239 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++E VD L+ + ++ + ++ + + F+ Sbjct: 240 QTGVS-PITEIRPVDEDYLKIWKARADVIYTPADAGKDNVFYSTSSFLYLGKYHYNFDLG 298 Query: 306 VISDP-----------SFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ LI T + G T N Sbjct: 299 GLAGSERRVIFFDAGNPPPDDYRVLIGPKPSTTDMSHTPNGYTISN 344 >gi|239622779|ref|ZP_04665810.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239514776|gb|EEQ54643.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 401 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 111/378 (29%), Gaps = 33/378 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + TF + + + + ++ + V + Sbjct: 54 DGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGS 113 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++ + +++ S S + S S I + I S+++ Sbjct: 114 LEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFGIAKSKAV 168 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 D G V + + + + + + ++ Sbjct: 169 KLGWADDTGAAKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPSWTIWVVD 228 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+S E + + + I+ + V F Sbjct: 229 YSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIPFETEAHR 278 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + A + G T I + + +A D + S +E + Sbjct: 279 PVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT------- 327 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL+TDG + D+++ K++ + + I +I F Q K+ L S Sbjct: 328 -AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKV 382 Query: 429 FEANSTHELNKIFRDRIG 446 F+ S +L +FR G Sbjct: 383 FDGRSG-DLAAVFRQAKG 399 >gi|225872598|ref|YP_002754053.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793914|gb|ACO34004.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 313 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 54/153 (35%), Gaps = 23/153 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ FN RV F+ + K+ R +E +TA+ A+ + + Sbjct: 106 DRVEIVNFNTRVHEVVPFTNNLKKIDR-----GLNRLSEGPATALYAAIAYGSEELAQRP 160 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---AFSVNKTQ 411 +K +V+++DG+NT N ++A I ++ + + Sbjct: 161 G------------RKVLVVISDGDNTVANSSYQQALDRAVRAETMIFSVIDLPVINDAGR 208 Query: 412 QEKARYFLSNC--ASPNSFFEANSTHELNKIFR 442 + + A+ ++ + L +F+ Sbjct: 209 DVGGEHAMIALSEATGGEYYYEADGN-LQGVFK 240 >gi|301612325|ref|XP_002935678.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Xenopus (Silurana) tropicalis] Length = 524 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 73/228 (32%), Gaps = 28/228 (12%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV----- 281 + + V + + YP + S+ D I+ +D + V Sbjct: 172 DPTLLWQVFGSATGVTRYYPATPWRAPSKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGS 231 Query: 282 ----------IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL--IRTIVKTFAI 329 ++ +D ++D + F+++ F V + ++K Sbjct: 232 VSGLTLKLMKTSVMEMLDTLSDDDYVTVASFHEKADPVSCFRQLVQANVRNKKVIKEAVQ 291 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G+T + A+ + ++ K I++ TDG + + Sbjct: 292 EMVARGTTDYKAGFEYAFSQLQNT-------SITRANCNKMIMMFTDGGEDRVQDVFEKY 344 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ +R+ T FSV + + CA+ +FE S + Sbjct: 345 --NWPNKTVRVFT--FSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 388 >gi|262199272|ref|YP_003270481.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082619|gb|ACY18588.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 430 Score = 43.8 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 53/186 (28%), Gaps = 22/186 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K V+ A ++ +++ D + ++ V + + Sbjct: 66 DKLSDVKTAALELLETLQPEDTIT------LVSYSSDVSMHL-MRTRADDAGQREARRAL 118 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G TA+ + A + + +++ +++L +DG Sbjct: 119 LALQARGGTALGPGLFRALEALEGASDRTRMS---------HLMLFSDGIANAGEVRPSV 169 Query: 389 ICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + +A + + T+ V+ + R + + + I D + Sbjct: 170 LGARAAGAFGAGVSVSTMGVGVDYNEDLMTRLADQ---GGGRYHFIQDSEAIASILDDEM 226 Query: 446 GNEIFE 451 + Sbjct: 227 KGLVAT 232 >gi|332214177|ref|XP_003256207.1| PREDICTED: collagen alpha-1(XIV) chain [Nomascus leucogenys] Length = 1796 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|313814173|gb|EFS51887.1| conserved hypothetical protein [Propionibacterium acnes HL025PA1] Length = 169 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 21/58 (36%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++P +L + + R S + + AAQ A SV Sbjct: 36 RGTVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAV 93 >gi|297683547|ref|XP_002819437.1| PREDICTED: collagen alpha-1(XIV) chain-like [Pongo abelii] Length = 1761 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|297300028|ref|XP_002805526.1| PREDICTED: collagen alpha-1(XIV) chain-like [Macaca mulatta] Length = 1717 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|296227292|ref|XP_002759309.1| PREDICTED: collagen alpha-1(XIV) chain [Callithrix jacchus] Length = 1796 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|114621484|ref|XP_001143906.1| PREDICTED: collagen, type XIV, alpha 1 isoform 1 [Pan troglodytes] Length = 1800 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|114621480|ref|XP_519927.2| PREDICTED: collagen alpha-1(XIV) chain isoform 4 [Pan troglodytes] Length = 1796 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|58429499|gb|AAW78153.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 47/167 (28%), Gaps = 15/167 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S + +I+ + D + + + F++ Sbjct: 43 CNDEVDLYLLMDCSGSIRRNNWVNHAVPLAMKLIQQLNLND---NAIHLYVNVFSNNARE 99 Query: 309 DPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 K +I+K+ G T + DA+ + N Sbjct: 100 IIRLHSDASKNKEKALSIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI--------NRE 151 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 A + +V+LTDG + K +G++I Sbjct: 152 NANQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQGINVA 197 >gi|55743096|ref|NP_066933.1| collagen alpha-1(XIV) chain precursor [Homo sapiens] gi|125987815|sp|Q05707|COEA1_HUMAN RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|187954653|gb|AAI40894.1| Collagen, type XIV, alpha 1 [Homo sapiens] Length = 1796 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|302632554|ref|NP_001181863.1| integrin alpha-M [Pan troglodytes] Length = 1153 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 54/184 (29%), Gaps = 20/184 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + ++ + ++ K + ++ F + + Sbjct: 158 SGSINPYDFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQNNP-------- 209 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G T ++ + + K +V++TDGE Sbjct: 210 NPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEK 261 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHE 436 D + +A +G+ I + EK+R L+ AS + F+ N+ Sbjct: 262 FGDPLGYEDVIPEADREGVIRYVIGVG-DAFHSEKSRQELNTIASKPPRDHVFQVNNFEA 320 Query: 437 LNKI 440 L I Sbjct: 321 LKTI 324 >gi|282897675|ref|ZP_06305674.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197354|gb|EFA72251.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 464 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 66/203 (32%), Gaps = 21/203 (10%) Query: 231 PYMVSCNKSLYYMLYPGPLDPSLSEEHF-VDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 +++ + S+Y ++ D ++ + + + K +V ++L ++ S K Sbjct: 47 TFVIDTSGSMYEVVAGDVEDTGVTYQQDGKEYKQVTGGKSKIDIVIESLLRLVNSGKL-- 104 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 R+ F+D + D G T + ++ A+D Sbjct: 105 --KQQDRVSIVQFDDSASQIIGLTSATETKQIETAIKKLRDF--SGGTRMGLGLRRAFDI 160 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K +L TDG+ T D ++ +I N ++ I I + Sbjct: 161 LSEQ-----------EMTVKRALLFTDGQ-TFDEDQCQSIANHFATRNIPITALGVGEEF 208 Query: 410 TQQEKARYFLSNCASPNSFFEAN 432 + LS+ F+ Sbjct: 209 NEDLLTH--LSDYTGGKLFYVVP 229 >gi|33340728|gb|AAQ14925.1| Mac-1 alpha subunit [Pan troglodytes] Length = 1144 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 54/184 (29%), Gaps = 20/184 (10%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 S + ++ + ++ K + ++ F + + Sbjct: 149 SGSINPYDFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQNNP-------- 200 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G T ++ + + K +V++TDGE Sbjct: 201 NPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF------KILVVITDGEK 252 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHE 436 D + +A +G+ I + EK+R L+ AS + F+ N+ Sbjct: 253 FGDPLGYEDVIPEADREGVIRYVIGVG-DAFHSEKSRQELNTIASKPPRDHVFQVNNFEA 311 Query: 437 LNKI 440 L I Sbjct: 312 LKTI 315 >gi|58429519|gb|AAW78163.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 565 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPNSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429497|gb|AAW78152.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPNSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429469|gb|AAW78138.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 43.8 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPNSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|212702323|ref|ZP_03310451.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] gi|212674201|gb|EEB34684.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] Length = 1151 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 34/308 (11%), Positives = 77/308 (25%), Gaps = 13/308 (4%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G + +++ V+D S SM + + L Sbjct: 488 GNSGHDNFVGDPGGATESTQTTYTDLNVALVVDTSGSMDGTRMSETKEALKDLCDQLKEH 547 Query: 212 VKSYSSQNGKVGIRDEKLS---PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + N + L+ P Y + ++ + + S +V Sbjct: 548 ADEGADVNLSLIGFSGALNINLPVEDITGIEENYRKFTLRDGDTIIGKVGTEFSVTSYVP 607 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ I + + + + V Sbjct: 608 YDDWSGGQISSTTTYRITEDGTLQRWESGFMGWGGSWEDVRTFSTE--SLTGYEAVLAQI 665 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D +G T +D + A D N + ++ +TDGE + Sbjct: 666 DDMEAIGGTVYSDGYEAAKDWF---GGKTSPDSLQNNGGENIVIFVTDGEPN-NEWSAKN 721 Query: 389 ICNKAKSQ--GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 N+ + I + T+ ++ L+ + F +L +F + + Sbjct: 722 AYNQLVAAVDNITVETVGIAITDKDATDLLNGLTTNNNGAHF--IEDASKLGDVFGEIVS 779 Query: 447 NEIFERVI 454 + V Sbjct: 780 DITTSTVT 787 >gi|115889695|ref|XP_786483.2| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115970407|ref|XP_001195936.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 2064 Score = 43.8 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 40/139 (28%), Gaps = 18/139 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F + + V + + GST A++TA Sbjct: 615 TKFNLVKFGSKASGWKDRAVEVSESSLQSAWQWVRGLEVAGSTNTLMALKTAL------- 667 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 A + I LLTDG +A KS I I I+F+ N Sbjct: 668 ---------ADTATQAIYLLTDGRPDHPPPTILAQVQLRKS--IPIHCISFNCNDRDAND 716 Query: 415 ARYFLSNCASPNSFFEANS 433 LS+ + A+ Sbjct: 717 FMALLSSDTGGRYHYYADD 735 >gi|307318077|ref|ZP_07597513.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306896118|gb|EFN26868.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] Length = 431 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 22/218 (10%), Positives = 58/218 (26%), Gaps = 28/218 (12%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +++ + + G +I A+ PV++G G+ V+ W + L+ Sbjct: 16 RRQRLARRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQH---------- 65 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + ++ + + T Sbjct: 66 ---------AADVSAYAAAVRHRAGDQQSALETAARRVAGGSGFSPGGLTVSTVPGSAGG 116 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--QWVI 183 + V L+ + +F G + ++ +A A+ S +S + Sbjct: 117 SNKVTVELTETHP------RMFSSVFGTGTITMKARAVAQVTGGSKACVLALSNSASGAV 170 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK 221 + S N A + + Sbjct: 171 TVTGSTEVLLSGCSVVS-NSNAADAFLMKNGSALMSTD 207 >gi|119470035|ref|ZP_01612840.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] gi|119446745|gb|EAW28018.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] Length = 1090 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 41/361 (11%), Positives = 88/361 (24%), Gaps = 77/361 (21%) Query: 155 SWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL-NCFGQPADRTVK 213 S + V+ + + + + D S SM + + Q ++ Sbjct: 18 SATASAEDIELYVNHNVETDENPRVLMIFDTSGSMDWSVINGDNQVCYIKKITQSNGNGN 77 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 + NG + N + + + ++ + Sbjct: 78 GNGNGNGNGNGNGNGGGGQVYYENVTCFASKDEYAEFNEQCYIGVNGVAVADCHDRRIDV 137 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 ++A+ ++ ID + N +T + + Sbjct: 138 AKNAMTQLVNDNSDID-----FGLMRFRSNVGGYVVAKL-----GTDKTSLISDINSLPA 187 Query: 334 MGSTAINDAMQTAYDTII----------SSNEDEVHRMKNNLEAKKY------------- 370 GST + + + AY I S + K Sbjct: 188 SGSTPMTETLWEAYRYITGQSLDYAFNVSDRDKSADNSVVYTSPFKPNNGDPLRCDNSIN 247 Query: 371 IVLLTDGENTQDNEEGIAICNK------------------------AK------SQGI-- 398 ++L+TDG+ T D+ +I AK + Sbjct: 248 VILMTDGDPTNDDGRDTSIAQTHNSYFNDDIPFDDGTYDDSYLVAMAKILHGTSDTKVDL 307 Query: 399 -----------RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R+ TI F +Q + + EA + L++ ++ I Sbjct: 308 YTPSTDVLDTGRLYTIGFGTGMSQGGIDLLDKAARVGGGQYIEARTAEALSEALKNTISR 367 Query: 448 E 448 Sbjct: 368 I 368 >gi|47219688|emb|CAG12610.1| unnamed protein product [Tetraodon nigroviridis] Length = 717 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 48/158 (30%), Gaps = 17/158 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D ++ ++ R + + H ++ G + A+ + Sbjct: 487 FDINRDVAQLALVAYSRRATTVFNL--DTHDSGSAVLTAIGEASYMGGVASTGTALLHVH 544 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 +++ ++ + K +V++TDG D K + G+ + + Sbjct: 545 SDVLTVDKGARLGVN------KAVVVVTDGSGGTDAVVP---AQKLRDNGVSVFVVGIG- 594 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + L S S +L K F D + Sbjct: 595 ----DMQREKLLQIAGSEEHLILVPSYEDL-KYFEDVL 627 >gi|313126713|ref|YP_004036983.1| mg-chelatase subunit chld [Halogeometricum borinquense DSM 11551] gi|312293078|gb|ADQ67538.1| Mg-chelatase subunit ChlD [Halogeometricum borinquense DSM 11551] Length = 785 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 45/348 (12%), Positives = 91/348 (26%), Gaps = 34/348 (9%) Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 + T + D T ++ +F +++ Sbjct: 216 AVDGEQVLETTIQNATDAREFTHTFSETGTH--RITAEMESTDKFEQNDVFRKTVRAVDR 273 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + + +++ + Y D + Sbjct: 274 PKILYVSERGYPFESYLRNVYNVEKRSSVPEDLSPYYAVVVQDMPASRIGNVDALQEYVI 333 Query: 217 SQNG---KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 NG G + Y S S+ + S + +D S + Sbjct: 334 DGNGLLVVGGPNSFENGNYESSSLASMLPVTTGDGRGQSTNIVLSIDVSGSSKGGMRVQK 393 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + + D +G FN R S R + Sbjct: 394 AVS--------LSALKQLGDENEVGIVGFNHRTYSVAERQPL--GPNREALADRIRRLQA 443 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G+T I ++ A + ++L++DG + + I+ + Sbjct: 444 GGATDIAGGLRGAGKMLGDDPGT--------------VILISDGHDRVEES--ISYAKQL 487 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +S+G RI+ I N ++ AS S+F A T+ LN +F Sbjct: 488 RSEGKRIIAIGAGKNPNEKNLRTI---ARASGGSYFRATETNRLNILF 532 >gi|85859137|ref|YP_461339.1| flp pilus assembly protein family protein [Syntrophus aciditrophicus SB] gi|85722228|gb|ABC77171.1| flp pilus assembly protein family [Syntrophus aciditrophicus SB] Length = 146 Score = 43.8 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 19/136 (13%), Positives = 47/136 (34%), Gaps = 7/136 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ G + A+++P++L + +++ Y + L A++ V I Sbjct: 8 RKLREHKGAAVVEFAIVLPILLVLVFGIIEFGILIYNKQVLTNASREGARAGIV-YIDGT 66 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 V + ++ K + +Y + +V E V Sbjct: 67 SRVPAGNASNSDTIKGIVNDYANDYLITFGSSIPLNTDVEF------PEGQDSGDPLIVT 120 Query: 133 LSSRYDLLLNPLSLFL 148 +S Y +L P++ + Sbjct: 121 VSYGYSFILFPVTDYP 136 >gi|114621482|ref|XP_001144037.1| PREDICTED: collagen, type XIV, alpha 1 isoform 3 [Pan troglodytes] Length = 1780 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 47/153 (30%), Gaps = 22/153 (14%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 F-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + + + L AS + Sbjct: 294 VKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|75906479|ref|YP_320775.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75700204|gb|ABA19880.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 615 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 48/149 (32%), Gaps = 24/149 (16%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D V + S + +K G T ++ + + Sbjct: 78 VVVYDDAVDTVVSPQPVT---DKPALKKSIRQVRAGGITNLSGGWLKGCEYV-------- 126 Query: 359 HRMKNNLEAKKY--IVLLTDGENTQD----NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K+ L+ +K ++LLTDG KA +GI T+ F+ + Sbjct: 127 ---KHQLDPQKINRVLLLTDGHANMGIQDPKILTATSAQKA-EEGITTTTLGFAQGFNED 182 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIF 441 A+ +F+ S E ++F Sbjct: 183 LLIGM---ARAANGNFYFIQSIDEAAEVF 208 >gi|13928960|ref|NP_113879.1| integrin alpha-D precursor [Rattus norvegicus] gi|48428189|sp|Q9QYE7|ITAD_RAT RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|6648592|gb|AAF21241.1|AF021334_1 alpha D integrin [Rattus norvegicus] Length = 1161 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 60/169 (35%), Gaps = 17/169 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D + +++ + +++ + + +F+ + L + + Sbjct: 172 KDFVKALMGEFASTSTL-----FSLMQYSNILKTHFTFTEFKNILDPQSLVDPIVQLQ-- 224 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++T + + S + AKK ++++TDG+ +D E + A Sbjct: 225 GLTYTATGIRTVMEELFHSKN------GSRKSAKKILLVITDGQKYRDPLEYSDVIPAAD 278 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 GI I + Q+ A L+ S + F+ + L I Sbjct: 279 KAGIIRYAIGVG-DAFQEPTALKELNTIGSAPPQDHVFKVGNFAALRSI 326 >gi|60681980|ref|YP_212124.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|60493414|emb|CAH08200.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] Length = 341 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 22/220 (10%), Positives = 58/220 (26%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISKLVD-------GMENDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ + TAI A+ A + + IV++T Sbjct: 150 YISA-KMFLESISPSLISKQGTAIGAAINLAARSFTPQ-----------EGVGRAIVVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A +A +GI++ + + Sbjct: 198 DGENHEGGAVEAAK--EAAKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + + ++++ K I V Sbjct: 256 EAMCQEIAKEGNGIYIRVDNSNSAQKAINQEINKMAKSDV 295 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 67/201 (33%), Gaps = 19/201 (9%) Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 + S+++ + + D + ++I +K+ + ++ F + + Sbjct: 22 DTSSSMSTNMNGGQTRIGCLNDCVQTMIDLLKEEAKRENVSKLAVITFGAGGVKLQTPLS 81 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + G+T + A++ D I + + Y+V++ Sbjct: 82 KIESIQ-------FSPLGTGGNTPLGMALELTRDYIQNKDTFP------GKFYTPYVVMV 128 Query: 375 TDGENTQDNEEGIA--ICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 +DGE D + + I NK +S ++ + + F SPN + A Sbjct: 129 SDGEPNDDWQGPLHDFIHNKENRSSKSVRYSVFIGNEGEEPQAVHDF---SGSPNQVYYA 185 Query: 432 NSTHELNKIFRDRIGNEIFER 452 N L F+ + R Sbjct: 186 NDVQSLINCFKAITASVTQGR 206 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 31/299 (10%), Positives = 85/299 (28%), Gaps = 4/299 (1%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 K F++K L S G+F +I+A+++P + + G+++ + ++++++ A++ A+ Sbjct: 22 KIHFFNKLLFFSKKGNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEALSHGM 81 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + R + I NF DI N + Sbjct: 82 SLICYQND--IERDNLAKIILNDLIVSLKKNNFTKQEADLVAKNSKIDITTLINDSTNVK 139 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + + + L + + + I + + + T D Sbjct: 140 SYHFYIKSVYKMPLNKITKIFYPKDLTIVTNVNKIVPCPYTSYVMLSNPRARQFNSDWDL 199 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 +S N + ++ ++ + + + N ++ + Sbjct: 200 IHRRTVNAINSIITDKNIKYMIINGSMTNFDPSHYYTAEVK-QFNNVYRHLNVPIFRSIG 258 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIK-KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + D + + D +I K + + Sbjct: 259 TRDYVDNNGICRDGDVLTNFSTYSCAFAALNDLSWRIINEYKYKLPEINYDVKRWIDYC 317 >gi|308462088|ref|XP_003093330.1| hypothetical protein CRE_03438 [Caenorhabditis remanei] gi|308250341|gb|EFO94293.1| hypothetical protein CRE_03438 [Caenorhabditis remanei] Length = 382 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 51/154 (33%), Gaps = 10/154 (6%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGS 336 S I + +D T R+ +N ++ + V + + Sbjct: 58 AQSRIGTTNYLDPR--TTRVALVTYNSDSYTNADLNQFQSTGDLFNNVFSALATLSSTDQ 115 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + + + TA + + N K+ +++ + + + ++ K Sbjct: 116 SYLETGLSTAEQLLKAGK-----NQFNRAHFKRVVIVYASAYEGEGERAPMPVADRLKGD 170 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 G++I+T+A+ + L+ ASP F Sbjct: 171 GVKIITVAYDQRGDGALLDQ--LAKIASPRMNFT 202 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 31/298 (10%), Positives = 75/298 (25%), Gaps = 18/298 (6%) Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + + + + + I V+D S SM ++ + N F Sbjct: 67 TYDIELKVKGSTEVVQNNKILDIVLVMDTSGSMEGKSLENAKKAANNFVDKLLP-----Q 121 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + N +GI + S L + ++ L + Sbjct: 122 NNNVNIGIVSFAEKGEIKSGLTRNVTTLK-NAIKGLKADGGTYTQQGLEKAATVLNGAPA 180 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 V+ I + F+ + + + +G Sbjct: 181 EHKKVMVVIGDGEPTYANGEH--PNFDKGGFYRI---YNPATKKEGYEQWYGNAFKWLGK 235 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 + I + K L+ D + + K+ Sbjct: 236 GYNSAHRNYLVKLINGGFGNGTKERKGWGHVSWPFNLMDDYFENA----TLKAADTIKN- 290 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 I+T+ + + A+ ++ A+ + +A + D I ++ ++V Sbjct: 291 NTEIITVGI--DIENNDLAKSIMNKLATSGKYLKAGAVAGELDKILDDIAEKLQKKVS 346 >gi|167574006|ref|ZP_02366880.1| hypothetical protein BoklC_29490 [Burkholderia oklahomensis C6786] Length = 395 Score = 43.8 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 29/361 (8%), Positives = 85/361 (23%), Gaps = 6/361 (1%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 ++ V++G G+ +D+ + L+ +A A A+ + +S T Sbjct: 1 MLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAAGITSGHL 59 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL 148 + + + T + ++ S V + ++N L Sbjct: 60 NYALFQGFPVQMQTDLSVTFSDSVSGPFQPKSAISSPSSIKYVKCKTSMTGIVNWFIQAL 119 Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA 208 + S + + + + ++ + + Sbjct: 120 NMVPGVSVANASVSATAVATIGAAQTTCAIPVFICKAGTQTSPPVAGATYNVGDWLSAKT 179 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + G + + + + Y L +++ Sbjct: 180 GSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGSKAATTNAYNTRF 239 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + T + ++D V +F+ + + T Sbjct: 240 GIYNNPYKDPSYGTPDFTGYAYDATTWPAQSNAYSDFVSKRRTFTSY-----QGDLITGI 294 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ++ + + + VL+ D T + I Sbjct: 295 NTGGSYSASYYQAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPMGTGGSASPIH 354 Query: 389 I 389 + Sbjct: 355 L 355 >gi|291223817|ref|XP_002731904.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 978 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T+I + + + N YI+L++DGE T I Sbjct: 398 YAGGRTSIGGGVLKGIEVLS--------NFGNEDTTGGYIILVSDGEETS-EPIIDDIWG 448 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA--NSTHELNKIFRDRIGN 447 + G+ I T+AF + +K L+ SFF + + T L F I Sbjct: 449 DIELAGVVIDTVAF---SDKADKKLESLATKTGGLSFFFSGNDDTTALYDAFTSTITK 503 >gi|29826731|ref|NP_821365.1| hypothetical protein SAV_191 [Streptomyces avermitilis MA-4680] gi|29603827|dbj|BAC67900.1| putative membrane protein [Streptomyces avermitilis MA-4680] Length = 150 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 42/155 (27%), Gaps = 23/155 (14%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + G + A+ + ++G+ G+ VD A A + Sbjct: 8 LSRARWDDRGGVTVFVAVCVLALIGIIGVAVDGGSKMRATERADYIAGEAARAGGQAIDP 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + + YL V ++ Sbjct: 68 AEAISGTAIVVDLQDAQAAAQAYLRSAGATG-----------------TVSVSGDGKTLN 110 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V ++ YD + FL +GI S + + +A Sbjct: 111 VNVTGSYD------TKFLSVVGIGSLSVTGQGKAT 139 >gi|320104266|ref|YP_004179857.1| hypothetical protein Isop_2740 [Isosphaera pallida ATCC 43644] gi|319751548|gb|ADV63308.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 1239 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 49/162 (30%), Gaps = 20/162 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I +++ G F+ R R + A M A Sbjct: 464 AISTLSNYDYAGVLFWTGRDQWLFPL--ITVGPNRNRMLALIDQMIPGDMPAFGPGMTVA 521 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI--- 403 ++++ + K++++++DG+ + N+ I + T+ Sbjct: 522 SNSLLQKTDAIT----------KHMIIISDGD---PAPPPPGLINQLVRGKITVTTVLTA 568 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 A + + A+ F+ + L +I++ I Sbjct: 569 AHGNDPGSFNTMQSIAQ--ATKGRFYNVTNPRALPRIYQKEI 608 >gi|302185170|ref|ZP_07261843.1| hypothetical protein Psyrps6_02464 [Pseudomonas syringae pv. syringae 642] Length = 677 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 50/177 (28%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGR----RGKC 67 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S A + F Q + + V V + A + + + Sbjct: 68 SGTAATAPDFANQSATRNGFTPNTDGRTLMTRCGTLTVDVAGPRVFVADSTQALAIQVVA 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASAAPLAALTIRSAGVTVDSTGAA 184 >gi|297463635|ref|XP_002702824.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] Length = 2933 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 44/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFDLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADRVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDMVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRIASQPTSDFFFFVND 200 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 39/133 (29%), Gaps = 17/133 (12%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K + AL + + K N N R F+D V +G + + + + Sbjct: 211 TAMKLQSAKSALQAAVNYFKS--NYNQNDRFALVPFSDGVREASVVPFGKYSNVASQLDA 268 Query: 327 FAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T + A+ A KKYI+ LTDG T N Sbjct: 269 ILNTGNSLTAGGGTNYSAALSLAKSYFTDPTR------------KKYIIFLTDGMPTVLN 316 Query: 384 EEGIAICNKAKSQ 396 + K Sbjct: 317 TVDTITYREVKQN 329 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 46/371 (12%), Positives = 88/371 (23%), Gaps = 17/371 (4%) Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + +V T + + A L + Sbjct: 117 TYQITVDAYRVLGNGKEDVYFSFPQTPYQYTRKTEASTAKLDFSLSFSQPEYAKPPNGDA 176 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + + + S + + +V+D S SM + S L Sbjct: 177 QGRLDVTLIPQG---AVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNY 233 Query: 213 KSYSSQ---NGKVGIRDEKLSPYMVSCNKSLYYM--LYPGPLDPSLSEEHFVDSSSLRHV 267 G+R+ + P+ N + L G + ++ + SL Sbjct: 234 NQNDRFALVPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKS 293 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + + + + V +TV V + + T + Sbjct: 294 YFTDPTRKKYIIFLTD---GMPTVLNTVDTITYR---EVKQNFWGGYSYTGNRVTDSLSV 347 Query: 328 AIDENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G TA I Y S+ + Y + E + + Sbjct: 348 TYELYSDGRTAGIRFTDNKGYSRRFYSDGQDYVNGWRVSWDNGYSFTYSSIE-GKIRADA 406 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF-EANSTHELNKIFRDRI 445 A+ I + +I F N LS A + + L + F Sbjct: 407 TAVAKTLGMNNITLYSIGFGDNDEVDMDYLRSLSATAGGEARQGTTQNLTALFQQFSQLA 466 Query: 446 GNEIFERVIRI 456 IRI Sbjct: 467 TTPAITGTIRI 477 >gi|297488708|ref|XP_002697097.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] gi|296474920|gb|DAA17035.1| collagen, type VII, alpha 1 [Bos taurus] Length = 2932 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 44/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFDLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADRVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDMVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRIASQPTSDFFFFVND 200 >gi|223462031|gb|AAI46869.1| Collagen, type XII, alpha 1 [Homo sapiens] Length = 1899 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L++I D Sbjct: 200 LSRIVDDL 207 >gi|198429401|ref|XP_002121222.1| PREDICTED: similar to integrin alpha 9 [Ciona intestinalis] Length = 1242 Score = 43.8 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 43/138 (31%), Gaps = 14/138 (10%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 N + + + + + G T I A+ + V Sbjct: 201 NIDTKVRIRLQAWSYTSLTSQINNMVNVRSLNGLTYIGYAINLTITEFDDHGRESVP--- 257 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 K ++LLTDG T+ +A++ GI +++ L+ Sbjct: 258 ------KEMILLTDGAATKPENV-KPAAERARANGIVTVSVGVGSRVD----ETQLLTIA 306 Query: 423 ASPNSFFEANSTHELNKI 440 + + F+A + L+ + Sbjct: 307 GNASRVFKATNYDNLDSV 324 >gi|260434112|ref|ZP_05788083.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417940|gb|EEX11199.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 187 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 20/53 (37%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 + ++ ++ G I P + V VD+ S++ L++A Sbjct: 4 RLREAARLFRRTEDGSATIELLFWFPFFMWVTYSGVDLGMMSFHHANLERALD 56 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 59/190 (31%), Gaps = 27/190 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +L + ++ D R F+D + S + + F Sbjct: 394 TSIEQAKASLDYALSHLQPGD------RFNVIRFDDTLTRFFEVSVEASQQNIASARHFV 447 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G TA+ A+ A D + + IV LTDGE + + + A Sbjct: 448 MSLEAQGGTAMLPALHAALD------------DSHQGNGLRQIVFLTDGEISNEQQLLDA 495 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 I A+ RI + Y +++ A +F S E+++ R Sbjct: 496 I--AARRGRSRIFMVGIGT-----APNSYLMNHAAELGRGTFTHIGSAAEVDERMRALFD 548 Query: 447 NEIFERVIRI 456 V + Sbjct: 549 KLENPAVTDL 558 >gi|209527393|ref|ZP_03275900.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492184|gb|EDZ92532.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 463 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 66/218 (30%), Gaps = 26/218 (11%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF--VDSSSLRHVIKKKHLVRDAL 278 R +++ + S+Y +L P+ + + + K V ++L Sbjct: 34 VSASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESL 93 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGST 337 ++ S + R+ F+D + +K N G T Sbjct: 94 EGLVSSGQADS----RDRIALVRFDDSASVLLPLT---ASTDTASLKNAIGQLRNFSGGT 146 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + M+ A + + + + ++ TDG+ T D + + + G Sbjct: 147 RMALGMEEALNILKNCDLSSRRT-----------LIFTDGQ-TFDESDCRDLATQFAEAG 194 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 I I + E +LS+ + F T Sbjct: 195 IPITALGVG---EYNEDLLLYLSDR-TGGRVFNVVETQ 228 >gi|150398536|ref|YP_001329003.1| TadE family protein [Sinorhizobium medicae WSM419] gi|150030051|gb|ABR62168.1| TadE family protein [Sinorhizobium medicae WSM419] Length = 204 Score = 43.8 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 11/73 (15%), Positives = 19/73 (26%), Gaps = 4/73 (5%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++LI G I A+L V ++ E L A T + Sbjct: 16 RDPFRRLIGDREGATAIEFAILALPFFIVVFASIETFVAFAGEQLLANATDT----LARR 71 Query: 68 LIQSLEEVSSRAK 80 + + Sbjct: 72 IRTGDITTEAGKD 84 >gi|297183644|gb|ADI19770.1| hypothetical protein [uncultured gamma proteobacterium EB000_37F04] Length = 1181 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 34/334 (10%), Positives = 82/334 (24%), Gaps = 59/334 (17%) Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + + + + + F Q + + L Sbjct: 165 NSVIDCKADLQAVNPNNAPMASRDNYQQSGFPQNRESDTPYNAIS---ASADQSALEAAA 221 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + S + + + ++ + ++ + RD + S+I + +D Sbjct: 222 EAALNSRIFGQNDSATLLTENYLTYLHHFRTQTSRQRIDIARDTIISLINATPGVDFGLQ 281 Query: 294 TVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD---- 348 ++ ND + + R + T + T + +++ AY Sbjct: 282 VFNFNSSQGNDDHGGRIIAGVREMSAANRAALVTTINELTATTWTPLCESLFEAYRYFSG 341 Query: 349 ----------TIISSNEDEVHRMKNNLEAKK------YIVLLTDG----ENTQDNEEGIA 388 ++ + + + Y+V++TDG +N D+ Sbjct: 342 GGVLGGFNGGSLAPPADTSIMNNNRYQSPMRSCQKQSYLVVITDGVPFYDNDYDSLLRSE 401 Query: 389 IC--------------------------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + N Q I TI FS L+ Sbjct: 402 LALKTGDRFDDSYLPGVAEWMQTRDVNPNLLGQQNIVTYTIGFSQGANDAA---DLLAET 458 Query: 423 A--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 A ++ A+ L + + Sbjct: 459 ATRGGGQYYAASDALALQGSLQQIFSEILAVNAT 492 >gi|254414399|ref|ZP_05028165.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196178629|gb|EDX73627.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 801 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 47/162 (29%), Gaps = 12/162 (7%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F + RT + G T ++ Sbjct: 357 NPDDTFTIIDFANTTTQLSPQPLANTAPNRTQALNYINGLQANGGT-----------ELM 405 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++ + + +VLLTDG +++ + + K G R+ + + + Sbjct: 406 NGIRAVLNFPPSAPNRLRSVVLLTDGYIGNESQVIAEVQRQLK-PGNRLYSFGVGSSVNR 464 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 R + + T E+ + F +I N + + Sbjct: 465 FLLNRLAEVGRGTAKIIRQDEPTQEVAETFARQINNPVLTDI 506 >gi|154250522|ref|YP_001411346.1| hypothetical protein Plav_0066 [Parvibaculum lavamentivorans DS-1] gi|154154472|gb|ABS61689.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] Length = 566 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 20/200 (10%), Positives = 50/200 (25%), Gaps = 2/200 (1%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + G ++ + ++L V +++D Y L+ A A + A Sbjct: 10 LTGLLGRYRRDERGVISVMAVGALFLVLAVAMVVIDTGSMLYARRDLQAATDAAALGAVR 69 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + E ++R+ ++ + + T Sbjct: 70 QI--GNAENAARSILDLNGYSPGDAPQVVTGIYSADPSLAPRDRFVEADGATEASQINAV 127 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + Y L R + + +R G++ Q + Sbjct: 128 RVIKYAEAPTYFASLFGFENLTRINAVSTAAYTKTVSFSAGTRVAELNSGLANQLLGGLL 187 Query: 187 RSMLDYQRDSEGQPLNCFGQ 206 + L+ N Sbjct: 188 GTTLNLSLVDYNGLANANID 207 >gi|58429481|gb|AAW78144.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|323345326|ref|ZP_08085549.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] gi|323093440|gb|EFZ36018.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] Length = 340 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 53/202 (26%), Gaps = 42/202 (20%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + +++ + ++G F + + ++ Sbjct: 108 TPSRLAKSKLLVENLVDNFT-------NDKIGLIVFAGDAFVQLPITSDYVSA-KMFLQN 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T I A+ A + + K I+++TDGE+ + Sbjct: 160 IDPSLIATQGTDIAGAINLASKSFTQQ-----------DKVGKAIIVITDGEDHEGGAIE 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQ-----------EKARYFL-----SNC-----ASP 425 A +AK G I + + + C A Sbjct: 209 AAKAARAK--GYNIFILGIGSTNGAPIPMANGGYLQDASGQTVMTKLNEQMCKEIAQAGN 266 Query: 426 NSFFEANSTHELNKIFRDRIGN 447 ++ ++T + + D + Sbjct: 267 GTYIHVDNTSDAQEKLNDELTK 288 >gi|299138149|ref|ZP_07031329.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600079|gb|EFI56237.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 349 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 44/164 (26%), Gaps = 24/164 (14%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS----------TAINDAMQTAY 347 F+ V ++ H+L R I K + T+ Sbjct: 141 FLISFDVNVDLLSDYTNSAHELKRAIDKASINAASSSAGVPGIGGGPFPTSHPRG----- 195 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---- 403 + + H + +K +VLLTDG + E + A+ + I Sbjct: 196 TLLYDAVYLAAHDKLQSQTGRKILVLLTDGGDQGSQETLKSATEAAQKANAILYVILIAD 255 Query: 404 --AFSVNKTQQEKARYFLSNCASPNSFFEA-NSTHELNKIFRDR 444 FS + + N+ +L + F Sbjct: 256 RANFSYGFNADGQMEQLAHE--TGGRVINVGNNGKKLEEAFDQI 297 >gi|156393752|ref|XP_001636491.1| predicted protein [Nematostella vectensis] gi|156223595|gb|EDO44428.1| predicted protein [Nematostella vectensis] Length = 419 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 49/156 (31%), Gaps = 9/156 (5%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAINDA 342 + I D R+ F V F H+ T + FA +E G T + A Sbjct: 72 LNVIAVGFDATRVEIITFGSDVNRRVPFISEAHEKDTKCTFNEKFANVVHEWGMTNMRGA 131 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CNKAKSQGIRI 400 + AY+ + + ++L+TDG + + + +G+ I Sbjct: 132 FEKAYEVCKGTWSGKKRLNIKTT-----VILITDGHWNWPWQNPDPVPKAQQLIREGVEI 186 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + + A F+ ++ E Sbjct: 187 LAFGVGYGISLSNLQTITANQRAGHTYAFQISNFDE 222 >gi|58429531|gb|AAW78169.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429491|gb|AAW78149.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|42521785|ref|NP_967165.1| hypothetical protein Bd0148 [Bdellovibrio bacteriovorus HD100] gi|39574315|emb|CAE77819.1| hypothetical protein predicted by Glimmer/Critica [Bdellovibrio bacteriovorus HD100] Length = 739 Score = 43.8 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 8/83 (9%), Positives = 32/83 (38%), Gaps = 2/83 (2%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + +++ F + + G + AL+ ++ M+++V +++ L+ + A Sbjct: 1 MLNSRSTFIQN--LNNKRGQIALFVALIFQILFLFFAMVINVGLLVHHKINLQNSVDLAA 58 Query: 62 ITASVPLIQSLEEVSSRAKNSFT 84 ++ + + ++ Sbjct: 59 YYGAMKQAEGMNVIAHTNYQIRQ 81 >gi|319956804|ref|YP_004168067.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319419208|gb|ADV46318.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 398 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 69/192 (35%), Gaps = 25/192 (13%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 +K + R A+ ++R + TV + + G L R + Sbjct: 41 RKIDIARRAMRDLLRDWNPQIPLGLTVYGHRRKGDCN-DIETPIPVGP--LDRQRMIRAV 97 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T I A++ S + I+L++DG+ + + + A Sbjct: 98 EGIRPKGKTPIARALKRVAAQFRGSEDPAT------------IILISDGKESC-DADPCA 144 Query: 389 ICNKAKSQGIRI--MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + GIR I F V++T + A+ S+F A + LN+ Sbjct: 145 TARELRKSGIRFVAHVIGFHVDRTTDRQLACIAR--ATGGSYFSARNAAALNRAI----- 197 Query: 447 NEIFERVIRITK 458 ++ +++ R+TK Sbjct: 198 TQVAKKITRVTK 209 >gi|297678516|ref|XP_002817116.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 2 [Pongo abelii] Length = 1899 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L++I D Sbjct: 200 LSRIVDDL 207 >gi|150024242|ref|YP_001295068.1| BatB protein [Flavobacterium psychrophilum JIP02/86] gi|149770783|emb|CAL42248.1| BatB protein [Flavobacterium psychrophilum JIP02/86] Length = 346 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 52/149 (34%), Gaps = 21/149 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + ++ +I ++ R+G + + + + +++ Sbjct: 109 NRLEKSKQVVSQIINNLGS-------DRIGIIAYAGSAFPVLPITTDY-NVAKMFLQSMT 160 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D + +++A++ + + K +++LTDGE+ + E A Sbjct: 161 SDMVSSQGSNLDEAIKL-----------SAKYFEGSPNTSKLMIMLTDGEDHSEGAESAA 209 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 +AK G++I+TI + Sbjct: 210 E--EAKKIGMKIITIGIGTTAGGPIPNKK 236 >gi|269104660|ref|ZP_06157356.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268161300|gb|EEZ39797.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3986 Score = 43.8 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 46/360 (12%), Positives = 101/360 (28%), Gaps = 32/360 (8%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + ++ IV + + + Y + + + + Sbjct: 3382 SALLKDSDRIVEASVTTTDGAHHSASDTAEKSYQID-GVVIQADNGDNTIVGTVGSDLLI 3440 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDS------EGQPLNCFGQPADRTVKSYSSQ 218 + + + ++ +VID S SM + + T+ + Sbjct: 3441 GDLDPAKIVDVPTNVNFVIDTSGSMYYGRLLNLDSIHMNSAEKYKVFVNYGATLTAADGT 3500 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR--HVIKKKHLVRD 276 G K+ DP + + + + + + Sbjct: 3501 QLYNGSSQSGWVTVTYDQMKAGLQYDGYRAEDPIYIKSSIGEDQTYKLTDFPSVFDMTKQ 3560 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF-----SWGVHKLIRTIVKTFAIDE 331 A ++ I N ++ FN V D SF S T + + Sbjct: 3561 AYQVLVDEILTNTNDKSSLNFNVVTFNSTVGGDSSFHYDAESNSFVNSRGTDIHNYLNSL 3620 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T ++T D I++ + + LTDG++ + Sbjct: 3621 IAGGGTEFEAPLKTISDHIVTDG-----------NTRNVVYFLTDGKDNTGFSNSANNSD 3669 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFL------SNCASPNSFFEA-NSTHELNKIFRDR 444 A + +++IA + + + +N S S+ + +T+EL IF+D Sbjct: 3670 YAALKHAEVISIAVGPSGDADQVNQIAQLGEGYNNNNDSEPSYSKVITNTNELTDIFKDI 3729 >gi|115689612|ref|XP_787039.2| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] gi|115969505|ref|XP_001184190.1| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] Length = 717 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 61/191 (31%), Gaps = 17/191 (8%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K + A+ +++ + + D N ++ + + S + +K Sbjct: 75 YPRKLRQTKAAMDAILDQMSENDQFNIMPFSDDIYW-WKENAMIPASPDSITEAKNYIKR 133 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T +NDAM + + ++ + + LLTDG+ + + Sbjct: 134 ----------TNLNDAMLEGFGLLEATGS---MDSSSENPMVCILFLLTDGKPSTGTTDI 180 Query: 387 IAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 I ++ ++T+ F R L N +E +S + + Sbjct: 181 NVIERNVRAANRKRCSLVTLGFGKLVDFNFLGRLALQNRGIARKIYETSSATVQLQGVYN 240 Query: 444 RIGNEIFERVI 454 + + ++ Sbjct: 241 EVATPLLFDIV 251 >gi|148656025|ref|YP_001276230.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568135|gb|ABQ90280.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 932 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 38/121 (31%), Gaps = 14/121 (11%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 I+DA+ A + +S+ ++I+ +TDG +T E + I Sbjct: 487 GSGINIHDALVAAARYVRASDR-----------PIRHIITITDGNDTVQQEGALDIVRAL 535 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + + + + +IA F +L + D I + Sbjct: 536 RDERVTLTSIAVGQGSHVPFIRDM---AAVGGGRTFLTERAADLPDLLLDEAEMIIQPSI 592 Query: 454 I 454 I Sbjct: 593 I 593 >gi|32475534|ref|NP_868528.1| BatB [Rhodopirellula baltica SH 1] gi|32446076|emb|CAD75905.1| BatB [Rhodopirellula baltica SH 1] Length = 747 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 49/135 (36%), Gaps = 13/135 (9%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + L + IK + + R+G F + V +T+ Sbjct: 143 DVSPNRLGRAKQQIKDMVDEMPGDRVGLVVFAGETRQTLPLTRHVEDFKQTLDSVGIHSV 202 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G + + DA++ A D + D K +V+LTDGE+ + + A Sbjct: 203 R-RGGSRLGDAIRVASDAFLDKTTD-----------HKAMVILTDGEDQESDPVSEAK-R 249 Query: 392 KAKSQGIRIMTIAFS 406 + QGIRI TI Sbjct: 250 AYEEQGIRIFTIGLG 264 >gi|323484869|ref|ZP_08090225.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] gi|323401865|gb|EGA94207.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] Length = 2032 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 53/429 (12%), Positives = 103/429 (24%), Gaps = 85/429 (19%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 S + A S T + +Y + + EV T +++ Sbjct: 954 SSENANITIQSVKLAGVSETTVDIPVGKYFYVDEVQSQGAKAVKWEVSSSGTGTYPQVSG 1013 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMG-------IKSWLIQTKAEAETVSRSYHKEHG- 176 K+ V LL +++ V+ + Sbjct: 1014 SKTEIYEVTKESQGYLLQCTNIYGELNPQVAKRAWKDYTADDEYDVTLEVTGDSIQTTIG 1073 Query: 177 ---VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 I +VID S SM D+ + N DR + + I + Sbjct: 1074 GGTADIVFVIDKSSSMNDWDDSLDDYRWNKLESTVDRFINKLKITSPNSKISFIEYQSSD 1133 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + + Y + + + S + ++ + L++ ++ I V Sbjct: 1134 LGTYQETYDDIRKVAVADTESADKNLEGNKLKYFRT------------LQKWTAISEVGS 1181 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 A +N T A + Sbjct: 1182 KPYGSAPGYN-------------------------------QGTHSAGGYLGAERALD-- 1208 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-------NKAKSQGI-------- 398 + + KYI+ L DG A+ I Sbjct: 1209 -RLKTYNPDEYNSNSKYIIYLADGTAGYYVNNNGTQAGSGSGGNANARRAAITQSGELKK 1267 Query: 399 -----RIMTIAFSVNKTQQEKARYFLSNCASPNS--------FFEANSTHELNKIFRDRI 445 I T+AF + + + + ++ F+ A +T EL + F + Sbjct: 1268 KHPDATIYTVAFGSDSSANMNWMKPGAYNGNSDNPYNPNVTAFYSAANTKELEETFDNLA 1327 Query: 446 GNEIFERVI 454 V Sbjct: 1328 AQVGSSAVT 1336 >gi|114608142|ref|XP_001142833.1| PREDICTED: collagen alpha-1(XII) chain isoform 1 [Pan troglodytes] Length = 1899 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L++I D Sbjct: 200 LSRIVDDL 207 >gi|24375029|ref|NP_719072.1| type IV pilin biogenesis protein, putative [Shewanella oneidensis MR-1] gi|24349772|gb|AAN56516.1|AE015789_3 type IV pilin biogenesis protein, putative [Shewanella oneidensis MR-1] Length = 1168 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 35/319 (10%), Positives = 80/319 (25%), Gaps = 27/319 (8%) Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA 208 ++ + A ++ + + GV + Y EG+ Sbjct: 189 KAFPYNTSSSPGTTWANALAAAKNTGFGVGQPVTFYTDNYLRWYWLSKEGRLPTIKVPRI 248 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML-----------YPGPLDPSLSEEH 257 D K+ S+ D L+ + + + + Sbjct: 249 DIAKKAISNIIRSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDTSRASLLSTIDG 308 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ R + D + +++ V ++ ++ Sbjct: 309 LPAKTNTPLCETMYEAYRYFAGKGVV-FGHADTDYGSYVGNRPPYDNSVETNGTYVSPFK 367 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV----L 373 + D + + N+ ++T S + + + Sbjct: 368 VCTDIAYVIYVTDGSPTVDGSANNNVKTLTAAASKSGNYSSFSQGLSTPSYLPALASYMF 427 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEA 431 D N D+ + Q +R TI FS L+ A +F A Sbjct: 428 NNDLINKPDSSNTEQM------QNVRTYTIGFSKGADDAAP---LLAETAKRGGGLYFAA 478 Query: 432 NSTHELNKIFRDRIGNEIF 450 ++ EL D + N + Sbjct: 479 QNSLELQNALNDALSNILN 497 >gi|307591429|ref|YP_003900228.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986283|gb|ADN18162.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 426 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 23/225 (10%), Positives = 71/225 (31%), Gaps = 17/225 (7%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 AD+ + + + + D + + + P + + Sbjct: 57 VKVTDTADKPMIQLQDTDFNLEVVDIENNKTYNDVTFNWKSPEETIPPPAWIIVLLDMSG 116 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND------RVISDPSFSWG 315 S + + + A+ ++ + + ++ F D D + Sbjct: 117 SMAKEDSRGTTKIEGAIKAIREFTEIAKDRGGNTQVSIVPFGDPGKNCAGYPIDSNTLDN 176 Query: 316 VHKLIRTIVKTFAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY-- 370 ++ ++ F + ST + + + A + N+ + + ++ + Sbjct: 177 FSRVDDAKLQIFLDNLASLSPCASTNLYEPLSKAVRFLGKKNDSRFYPLDSSGNPIEPQP 236 Query: 371 ---IVLLTDGENTQDN--EEGIAICNKAKSQG-IRIMTIAFSVNK 409 I+LL+DG + + N ++ ++ K I + T+ + + Sbjct: 237 RLSIILLSDGYHNKPNEAQDFQSLNQLLKKNNQIIVHTLGYGLTA 281 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 46/132 (34%), Gaps = 7/132 (5%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + T I A+ A + + +D + ++L+TDG + + Sbjct: 233 NMTLDGLTNRTGKVGTNITRAVTLAVQILGPAVQDRKLGDSTGP---RQMILITDGRDRR 289 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 N I + ++G+ I TIA E+ LS + FF + + Sbjct: 290 LNNSVIFMLQNDTAKGVVIDTIALG---DGAEEGLPLLSE-VTGGQFFFSPDSDAGGSAL 345 Query: 442 RDRIGNEIFERV 453 D + + + V Sbjct: 346 DDALTATVLKYV 357 >gi|69244819|ref|ZP_00603043.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257882064|ref|ZP_05661717.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257889959|ref|ZP_05669612.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260560224|ref|ZP_05832401.1| von Willebrand factor [Enterococcus faecium C68] gi|314947791|ref|ZP_07851198.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] gi|68196173|gb|EAN10603.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257817722|gb|EEV45050.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257826319|gb|EEV52945.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260073791|gb|EEW62116.1| von Willebrand factor [Enterococcus faecium C68] gi|313645771|gb|EFS10351.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] Length = 1345 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 38/413 (9%), Positives = 82/413 (19%), Gaps = 49/413 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A + N F + +KK D E Sbjct: 148 AGEEDRNVPAQTVQLWGNERNFENSYL-----DYNGAYIKKWVEPVLPSSTASDLHPEDA 202 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++P+ + S T + E + + Sbjct: 203 TTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDL 262 Query: 184 DFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S+ ++ Q + + + S+N + +P Sbjct: 263 LSDPSLDIRIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPL 322 Query: 233 MVSCNKS--LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV--RDALASVIRSIKKI 288 + Y G V + + Sbjct: 323 TLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVME 382 Query: 289 DNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D + + + F N R + N A Sbjct: 383 DKIAEDNDVLFRNFEEFSGNTSNAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKAT 442 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D ++N + + A + + E + I +I Sbjct: 443 HWVGDGSSNNNTNGSPTSQEKSSAINTVAY-HHWLKNKYQENPPS-----------IFSI 490 Query: 404 AFSVNKTQ------QEKARYFLSNCAS------PNSFFEANSTHELNKIFRDR 444 ++ R L N A ++ AN+ +++ D Sbjct: 491 GLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKNDIVTALEDI 543 >gi|293571291|ref|ZP_06682325.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608698|gb|EFF37986.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1364 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 45/416 (10%), Positives = 99/416 (23%), Gaps = 38/416 (9%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 AA AS + + + F + +KK + Sbjct: 159 YAADNGS-AASEEDVNVPAQTVQLWGDERNFENSYL-----DYNGAYIKKWVEPVPSNNP 212 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D E + + ++P+ + S T + E Sbjct: 213 TADLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIE 272 Query: 175 HGVSIQWVIDFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 I + +M +++ Q + + + S+N + Sbjct: 273 AVNEISENLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALN 332 Query: 224 IRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +P + G + + + V R + Sbjct: 333 RTPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIQTRSRTSIWG 392 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAIND 341 + + A D + +FS T T+A D + T + Sbjct: 393 AW----SSWSVMGDKIAIDRGDLFKNFETFSGNTSNAGFTYPVTYASDFDRPVNGTNVQY 448 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRI 400 + + +N + NT + NK + I Sbjct: 449 RYGEVKEGDDKATHWVGTGSASNDTNGSPTSQEKSSAINTVAYHHWLK--NKYQENPPSI 506 Query: 401 MTIAFSVNKTQQ------EKARYFLSNCA------SPNSFFEANSTHELNKIFRDR 444 +I ++ + R L N A + +++AN+ +++ D Sbjct: 507 FSIGLGIDGSIAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDI 562 >gi|258615515|ref|ZP_05713285.1| hypothetical protein EfaeD_07377 [Enterococcus faecium DO] gi|293563519|ref|ZP_06677967.1| Bee1, putative [Enterococcus faecium E1162] gi|294622786|ref|ZP_06701740.1| Bee1, putative [Enterococcus faecium U0317] gi|291597744|gb|EFF28882.1| Bee1, putative [Enterococcus faecium U0317] gi|291604521|gb|EFF34007.1| Bee1, putative [Enterococcus faecium E1162] Length = 1344 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 38/413 (9%), Positives = 82/413 (19%), Gaps = 49/413 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A + N F + +KK D E Sbjct: 147 AGEEDRNVPAQTVQLWGNERNFENSYL-----DYNGAYIKKWVEPVLPSSTASDLHPEDA 201 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++P+ + S T + E + + Sbjct: 202 TTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDL 261 Query: 184 DFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S+ ++ Q + + + S+N + +P Sbjct: 262 LSDPSLDIRIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPL 321 Query: 233 MVSCNKS--LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV--RDALASVIRSIKKI 288 + Y G V + + Sbjct: 322 TLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVME 381 Query: 289 DNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D + + + F N R + N A Sbjct: 382 DKIAEDNDVLFRNFEEFSGNTSNAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKAT 441 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D ++N + + A + + E + I +I Sbjct: 442 HWVGDGSSNNNTNGSPTSQEKSSAINTVAY-HHWLKNKYQENPPS-----------IFSI 489 Query: 404 AFSVNKTQ------QEKARYFLSNCAS------PNSFFEANSTHELNKIFRDR 444 ++ R L N A ++ AN+ +++ D Sbjct: 490 GLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKNDIVTALEDI 542 >gi|213965183|ref|ZP_03393381.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213952297|gb|EEB63681.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 922 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 52/195 (26%), Gaps = 31/195 (15%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVIS---DPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + D+ + + F D + + + D + T Sbjct: 94 MARYADDNKAAINIKLGGFADDYLPHGGWQKLAIEGNGTNVDRDIEAFKDRTDGEYTDYA 153 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT----------QDNEEGIAIC 390 + A + + D K I+ +DG T QD Sbjct: 154 KGLDGAAEAVSDGTSDC-----------KAILFFSDGFPTREAGGAEAVMQDVCRPEGAV 202 Query: 391 NKAKSQGIRIMTIAFS---VNKTQQEKARYFLSNC---ASPNSFFEANSTHELNKIFRDR 444 + ++ I T+ + + R +C A AN+ EL FR Sbjct: 203 GRLRAADTLIFTVGLAPEKSATDTPDLRRISEGDCSNSAPSGQHLFANNPGELLAAFRQM 262 Query: 445 IGNEIFERVIR-ITK 458 + N R +T+ Sbjct: 263 VPNGGSVEAERSLTE 277 >gi|93141049|ref|NP_542376.2| collagen alpha-1(XII) chain short isoform precursor [Homo sapiens] Length = 1899 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L++I D Sbjct: 200 LSRIVDDL 207 >gi|268325023|emb|CBH38611.1| hypothetical protein, containing PKD domain [uncultured archaeon] Length = 1152 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 68/243 (27%), Gaps = 24/243 (9%) Query: 224 IRDEKLSPYMVSCNKSLY-YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 D +V+ S SL +D++ V+ + ++++ Sbjct: 279 CHDNYFLNNVVADPWSDGKCTQIFYGTGLSLDLIFVIDTTGSMGD--DIANVKASASTIV 336 Query: 283 RSIKKIDNVN-----DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 I+ I D + + + + + Sbjct: 337 NEIEAIIPDYQVAVVDYRDFPVDPYGGDGDYPFN-DVLPFSTDKAAIISAIQGLTLGWGG 395 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG------IAICN 391 +++ +A + S + + + +A K I+L+ D +AI Sbjct: 396 DWEESVYSA--LMHSIDAGSLGGWRGEDQALKAIILMGDAPPHDPEPFTGYILTSVAIAA 453 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 + + I TI + + + F A + E+ + I EI + Sbjct: 454 ELADP-VHIYTIQIGGPVGKFAELASQ-----TGGEVFTAENAEEVVDAILEAI-EEITK 506 Query: 452 RVI 454 R I Sbjct: 507 RPI 509 >gi|194217500|ref|XP_001502733.2| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Equus caballus] Length = 1163 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 39/120 (32%), Gaps = 10/120 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D I + + +A K +V+LTDG+ Sbjct: 251 ASLARVQNITQVGNVTKTASAMQHVLDHIFT------PSHSSRKKASKIMVVLTDGDIFG 304 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 D + N K QG+ I Q+ K L AS F+ + L+ Sbjct: 305 DPLNLTTVINSPKMQGVERFAIGVG-EAFQKAKTDKELKLIASDPDETHAFKVTNYAALD 363 >gi|119569134|gb|EAW48749.1| collagen, type XII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1899 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 95 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 148 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 149 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 199 Query: 437 LNKIFRDR 444 L++I D Sbjct: 200 LSRIVDDL 207 >gi|62087470|dbj|BAD92182.1| PREDICTED: integrin, alpha D variant [Homo sapiens] Length = 1177 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 61/175 (34%), Gaps = 17/175 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + +V+ + D + +++ + +F+ + + Sbjct: 180 NDFNQMKGFVQAVMGQFEGTDTL-----FALMQYSNLLKIHFTFTQFRTSPSQQSLVDPI 234 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T + T + H+ AKK ++++TDG+ +D E Sbjct: 235 VQLK--GLTFTATGILTVVTQLF------HHKNGARKSAKKILIVITDGQKYKDPLEYSD 286 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 + +A+ GI I AR L+ +S + F+ ++ L I Sbjct: 287 VIPQAEKAGIIRYAIGVGHAFQGP-TARQELNTISSAPPQDHVFKVDNFAALGSI 340 >gi|1167550|gb|AAB38547.1| leukointegrin alpha d chain [Homo sapiens] Length = 1162 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 61/175 (34%), Gaps = 17/175 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + +V+ + D + +++ + +F+ + + Sbjct: 164 NDFNQMKGFVQAVMGQFEGTDTL-----FALMQYSNLLKIHFTFTQFRTSPSQQSLVDPI 218 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T + T + H+ AKK ++++TDG+ +D E Sbjct: 219 VQLK--GLTFTATGILTVVTQLF------HHKNGARKSAKKILIVITDGQKYKDPLEYSD 270 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 + +A+ GI I AR L+ +S + F+ ++ L I Sbjct: 271 VIPQAEKAGIIRYAIGVGHAFQGP-TARQELNTISSAPPQDHVFKVDNFAALGSI 324 >gi|62548866|ref|NP_005344.2| integrin alpha-D precursor [Homo sapiens] gi|296434544|sp|Q13349|ITAD_HUMAN RecName: Full=Integrin alpha-D; AltName: Full=ADB2; AltName: Full=CD11 antigen-like family member D; AltName: Full=Leukointegrin alpha D; AltName: CD_antigen=CD11d; Flags: Precursor gi|162317970|gb|AAI56096.1| Integrin, alpha D [synthetic construct] gi|168275856|dbj|BAG10648.1| integrin alpha-D precursor [synthetic construct] Length = 1161 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 61/175 (34%), Gaps = 17/175 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + +V+ + D + +++ + +F+ + + Sbjct: 164 NDFNQMKGFVQAVMGQFEGTDTL-----FALMQYSNLLKIHFTFTQFRTSPSQQSLVDPI 218 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T + T + H+ AKK ++++TDG+ +D E Sbjct: 219 VQLK--GLTFTATGILTVVTQLF------HHKNGARKSAKKILIVITDGQKYKDPLEYSD 270 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 + +A+ GI I AR L+ +S + F+ ++ L I Sbjct: 271 VIPQAEKAGIIRYAIGVGHAFQGP-TARQELNTISSAPPQDHVFKVDNFAALGSI 324 >gi|268556154|ref|XP_002636066.1| Hypothetical protein CBG01306 [Caenorhabditis briggsae] gi|187038428|emb|CAP22594.1| CBR-CLEC-218 protein [Caenorhabditis briggsae AF16] Length = 381 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 51/198 (25%), Gaps = 16/198 (8%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV-----RMGATFFNDR 305 + V + +SV ++ +D R+ +N+ Sbjct: 30 NLNNLWLDVYVVIDDSTKMGTSGLSQVASSVFSTLANSRVGSDYSDKRGARVAVITYNEN 89 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S ++ S N +QT I + N Sbjct: 90 AYIRSNLSDLTSNQD---LENVISSLQVSKSDISN--IQTPLKLINEMMGYKDGNGPKNN 144 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 I+ D + D + K G+ I+T+A + + K L AS Sbjct: 145 TMSVIIMYAADYID-YDQPTANQLAYYLKENGVTIITVANMDDSNKIMK----LKALASE 199 Query: 426 NSFFEANSTHELNKIFRD 443 F + L + Sbjct: 200 GYGFSLSD-ENLTAGIQK 216 >gi|328545284|ref|YP_004305393.1| TadE-like protein [polymorphum gilvum SL003B-26A1] gi|326415026|gb|ADZ72089.1| TadE-like protein [Polymorphum gilvum SL003B-26A1] Length = 196 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 14/172 (8%), Positives = 52/172 (30%), Gaps = 13/172 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ G I A++ L + +++ + + + A++ ++ Sbjct: 24 RGLFGSFSRNRRGTTAIEFAIIAGPFLVLMFGIIEFGLAFFV-NRI---LDHAVMESTRL 79 Query: 68 LIQSLEEVSSRAKNS---------FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + + ++ K + F + + T ++ D + Sbjct: 80 IRTGQAQKANFDKAAFKAEVCTHLTDFLCDNARFDVDVRTFSTFSSIGTLPDLVDADGNF 139 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + + S ++ +R + L++ + ++ + V R+ Sbjct: 140 SNNLAYVNSKAGDIVVARVIYRWPMFTSLLQTDPADTGNMERLLVSTAVFRN 191 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 69/200 (34%), Gaps = 27/200 (13%) Query: 262 SSLRHVIKKKHLVRDA----LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + K + A + ++ + + V G +D + Sbjct: 53 AKDPSGLTKIEAAKKALTGVVGALPDTAQVGLRVYGATVDGKGKPTPAACADTQLIHPIA 112 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 L +T + T +G T I ++ A + K+ IVL++DG Sbjct: 113 ALDKTKLTTTIAAIKALGETPIAHSLTEALKDL-------------GTSGKRNIVLVSDG 159 Query: 378 ENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 E + + I K + G+ +I T+ F VN + + + A ++++A Sbjct: 160 EESC-VPDPCPIVKKLTAAGVDLQIDTVGFGVNAKARTQLQCIAD--AGKGTYYDAKDAG 216 Query: 436 ELNKIFRDRIGNEIFERVIR 455 L N++ +R +R Sbjct: 217 ALATSL-----NKLSQRALR 231 >gi|326922791|ref|XP_003207628.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1224 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 22/174 (12%), Positives = 59/174 (33%), Gaps = 19/174 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +++D + +++ + + R+G ++ V + H + + + Sbjct: 789 FEIIKDFVTALVDRVTV---GRNATRIGLVLYSLEVQLEFGL--NKHTTQQDVKRAIRKM 843 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + T A++ A + +K V+LTDG+ + + + Sbjct: 844 QYMGEGTYTGTAIRKATQ---------EGFLGARTGVRKVAVVLTDGQADKREAVKLDVV 894 Query: 391 -NKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 +A + I + I + + Q + + L+ AS + + + L Sbjct: 895 VREAHAANIEMYAIGIVNTSDPTQAEFVHELNLIASDPDREHMYLIDDFNTLPA 948 >gi|297678514|ref|XP_002817115.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 1 [Pongo abelii] Length = 3115 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1256 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1309 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1310 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1360 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1361 LSRIVDDL 1368 >gi|260793652|ref|XP_002591825.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] gi|229277036|gb|EEN47836.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] Length = 2660 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 42/355 (11%), Positives = 87/355 (24%), Gaps = 30/355 (8%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + K + + + V + V + R V Sbjct: 1828 DASGSVGSDNFKIGLNFATALIQKVGVSAEPGGTRVGAFSFASDVTPHFRPHDPLVTTEQ 1887 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 D + + + + + + S + Sbjct: 1888 VVDA-MQQIPYTKVLCEVPEAPFDGEVHILQTKKDLRNTRANQTDTAGNSSVPLATVNIT 1946 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL--YYMLYPGPLDPSL 253 + +PL D V S +VG + Y + Sbjct: 1947 TTEEPLFENFSTPDTNVTSAERILFRVGSTVKYSCAYGFELSGPRQRTCQQNKEWSGVHP 2006 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + E + S + K + R+ + + Sbjct: 2007 TCEEIICSDPGSPENGR---------------KVGSSYRFGERVQFRCQTGYTLHGSAER 2051 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 W T D G TA A++ +++I + E+KK + L Sbjct: 2052 WCSAGGRWNGTLTXCDD--AKGGTATRLALKFLRESVIPEAVA---ELDRPDESKKALFL 2106 Query: 374 LTDGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +TDG++ + K + + G+ I TI + ++ L++ AS Sbjct: 2107 ITDGKSNTG-GDPSEEARKLREELGLEIYTIGI-----SNDVSKTELASVASSPK 2155 >gi|312070072|ref|XP_003137977.1| hypothetical protein LOAG_02391 [Loa loa] gi|307766862|gb|EFO26096.1| hypothetical protein LOAG_02391 [Loa loa] Length = 647 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 42/139 (30%), Gaps = 17/139 (12%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D V + + + + S ++ R I G T A+ A Sbjct: 103 TDKDQVHIAMIQYAETPTIEFSL--DTYRNPRDITNHIMTINFHSGGTRTGKALLAAKVE 160 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVN 408 + S + A K IVL TDG ++ + + + + ++I + + Sbjct: 161 LFSEKKGAR------KNASKIIVLFTDGL---SVDDPVKHAQQLREVEKVKIYVVYVGSD 211 Query: 409 KTQQEKARYFLSNCASPNS 427 Y + A S Sbjct: 212 GF-----EYEMDRIAGGKS 225 >gi|149183229|ref|ZP_01861674.1| hypothetical protein BSG1_17256 [Bacillus sp. SG-1] gi|148849062|gb|EDL63267.1| hypothetical protein BSG1_17256 [Bacillus sp. SG-1] Length = 245 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 2/72 (2%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 + ++L+TDG + Q + +A+ AK QGI + I N + E+ + A S Sbjct: 7 RQVLLITDGCSNQGE-DPVAMAALAKEQGITVNVIGVMENDSIDERGMEEIEGIAMSGGG 65 Query: 428 FFEANSTHELNK 439 + +L++ Sbjct: 66 VSQIVYAQQLSQ 77 >gi|327278400|ref|XP_003223950.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 1160 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 40/107 (37%), Gaps = 4/107 (3%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + A TA E V + A K ++++TDGE + D + + +A+ Sbjct: 223 TQLRGATLTATYIQRVVREQFVTEKGSRPGASKVLIVITDGEKSGDPLQYSDVIPEAERA 282 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 GI I A+ L + AS + F ++ L I Sbjct: 283 GIIRFAIGVGK-AFSGGTAKQELISIASQPEDDHVFPVDNFDALKDI 328 >gi|156742722|ref|YP_001432851.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234050|gb|ABU58833.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 452 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 56/179 (31%), Gaps = 23/179 (12%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 V AL +++ + D R+ F D + G ++ Sbjct: 99 RDTASSALDYVVRALHTIVERLDHHD------RLALVVFADHALLLIPGMVGADRVTLVR 152 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 N T + D + A + I ++ + I+LLTDG T+D+ Sbjct: 153 AIERLPGLNLGDGTNLADGIALALNQIRANRDGRCADR---------IILLTDG-FTRDS 202 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 +A+ ++A + I I TI E L+ A S + + Sbjct: 203 AACLALADQAADEHIAITTIGLGG-----EFQDDLLTAIADRSGGHALFLKRVSAIPRA 256 >gi|327542236|gb|EGF28725.1| BatB protein [Rhodopirellula baltica WH47] Length = 700 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 29/136 (21%), Positives = 51/136 (37%), Gaps = 15/136 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + L + IK + + R+G F + V +T+ Sbjct: 96 DVSPNRLGRAKQQIKDMVDEMPGDRVGLVVFAGETRQTLPLTRHVEDFKQTLDSVGIHSV 155 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G + + DA++ A D + D K +V+LTDGE+ + + ++ Sbjct: 156 R-RGGSRLGDAIRVASDAFLDKTTD-----------HKAMVILTDGEDQESD--PVSEAK 201 Query: 392 KAKS-QGIRIMTIAFS 406 +A QGIRI TI Sbjct: 202 RAHEEQGIRIFTIGLG 217 >gi|47218289|emb|CAG04121.1| unnamed protein product [Tetraodon nigroviridis] Length = 979 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 43/121 (35%), Gaps = 8/121 (6%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T IN+A+ A ++ ++ + ++ I+L++DGE + + AI K Sbjct: 412 GTNINEALMRAVQMLVRASHQGLVDPRSVS----MIILVSDGEPHR-EIKLSAIQKNVKR 466 Query: 396 Q---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + ++ + R N + T + + F ++ + R Sbjct: 467 AMREEFSLFSLGIGFDVDFDFLERIATENRGVAQRIYANQDTADQLRSFYRQVAAPLLRR 526 Query: 453 V 453 + Sbjct: 527 I 527 >gi|82593966|ref|XP_725227.1| ookinete protein [Plasmodium yoelii yoelii str. 17XNL] gi|23480151|gb|EAA16792.1| ookinete protein-related [Plasmodium yoelii yoelii] gi|110681630|dbj|BAE98270.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium yoelii] Length = 2401 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 51/173 (29%), Gaps = 16/173 (9%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + +D S K+ V I +++ ++ M Sbjct: 804 CIGNDCFCHDIYDLTVILDESGSIGSYNWKNQVYPFTEQFINNLEISEDKVHVGIMLFAQ 863 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDEN----EMGSTAINDAMQTAYDTIISSNEDE 357 FN D + + G T I +A+ E+ Sbjct: 864 FN----RDFVMFSDKESYDKEHMMKLIKGLKDSYKSGGYTYIIEALNYGL-------ENY 912 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIMTIAFSVNK 409 H + + K +L TDG NT ++ ++ + K + ++++ + + Sbjct: 913 THHKDSRSDVPKVTMLFTDGNNTNPGDKLLSDASLLYKEENVKLLVVGVGAST 965 >gi|298375542|ref|ZP_06985499.1| BatB protein [Bacteroides sp. 3_1_19] gi|298268042|gb|EFI09698.1| BatB protein [Bacteroides sp. 3_1_19] Length = 574 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 55/185 (29%), Gaps = 28/185 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L+ + ++G F + + + + + Sbjct: 108 SPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSDYISA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + + K I+L+TDGEN +D+ G Sbjct: 160 INPSMVSTQGTAIGAAINLAARSFTP-----------DETTDKAIILITDGENHEDDAIG 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A K + I + K + N++ + + + +++G Sbjct: 209 AAKAAAEKGIHVNI--VGMGDPKGSPIPIQ-------GSNNYMKDKDGNVVITKLNEQMG 259 Query: 447 NEIFE 451 EI Sbjct: 260 QEIAA 264 >gi|221121786|ref|XP_002165500.1| PREDICTED: similar to procollagen, type XIV, alpha 1 [Hydra magnipapillata] Length = 3126 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 38/121 (31%), Gaps = 19/121 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + A+ A+ A + + +A ++L+ Sbjct: 811 HFNDPDTLTEARDAVRVKPHTGKYTGQALSLAKEGLFDKGH--------RSDALDVLILM 862 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG ++ D E + G++I+ + + R L++ AS F A Sbjct: 863 TDGPSSDDVTEPS---RALRDMGVKIIAVGIG-----NQIDRKQLNDIASDPDDEHVFTA 914 Query: 432 N 432 + Sbjct: 915 D 915 >gi|308068037|ref|YP_003869642.1| hypothetical protein PPE_01262 [Paenibacillus polymyxa E681] gi|305857316|gb|ADM69104.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 738 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 42/153 (27%), Gaps = 2/153 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 +G + L++ VM G + +D R + ++ A+ + +L++ Sbjct: 11 RESGAVTVFLILILAVMFGFIAVFIDYARIAALHVQTERLTHAAVRSVMSAYDPTLQQEY 70 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR--DIVRDTAVEMNPRKSAYQVVLS 134 +Q + + L + + D + + + + Sbjct: 71 GLFAYGEGGGEQIMVKVLNDSARRTARAETLPLMNMKLDSSSLQMERELGKYAIFNEQIR 130 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 F +K K + TV Sbjct: 131 EEMKYKAPVDFTFEVLEKLKPLSQNMKEASHTV 163 >gi|195999034|ref|XP_002109385.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] gi|190587509|gb|EDV27551.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] Length = 356 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 41/314 (13%), Positives = 89/314 (28%), Gaps = 30/314 (9%) Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 +Q + + + + N + ++ V+ + S+ Q S+ Y L NP++ Sbjct: 67 YQQNLTSAVAIVEDLRRQLNASLADIIKFVQSAKSNIESGYSSKQAQSSTSYQLCCNPIN 126 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 L + + V + + + + ++ D Sbjct: 127 LRFNKELNDKINLSSPCITFPV---ETTQSYIPNSLKLAYRQNFADNLSVKWQYFAGADN 183 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + + Y +Y+ P L S Sbjct: 184 IFYQYPTTQRYCK-----------TNYTTETKFKQWYVNAASPSSKRLVLVLDRSGSMSG 232 Query: 266 HVIKKKHLVRDALASVIRSIKKID--NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 K V++A +V+ S+ D V +V + + ++ Sbjct: 233 DRFLK---VKEAATAVLDSLGPNDEIGVIAFDDEIRIHGGCKVTTVSPATPQSIIFLKDF 289 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + E GST A++ A+D + ++ + IV LTDG + Sbjct: 290 INNKIQ--PEFGSTGYVPALKHAFDMLSTNMTSKA------KTKTNLIVFLTDGHPDEPE 341 Query: 384 EEGIAICNKAKSQG 397 I + K++ Sbjct: 342 S---QILDVIKNRN 352 >gi|241207151|ref|YP_002978247.1| hypothetical protein Rleg_4470 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861041|gb|ACS58708.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 193 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 12/134 (8%), Positives = 41/134 (30%), Gaps = 8/134 (5%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + ++L + G I A+L PV++ + ++ + +A Sbjct: 8 TRLVLTVRRLARDRRGAGAIEFAILFPVLVMLYIGAFEITVGLSVSKRVTRA------AG 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 +V + + ++ +++ + + +L T + + Sbjct: 62 TVADLVTQQQSVTKSALAQ--MPSVATAIFVPYNSTSLTLKITGITIDAGANAKVLWSWA 119 Query: 125 RKSAYQVVLSSRYD 138 + ++ Sbjct: 120 KDGTVPYAKNTTVS 133 >gi|153834035|ref|ZP_01986702.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148869590|gb|EDL68580.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 423 Score = 43.4 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 30/275 (10%), Positives = 74/275 (26%), Gaps = 2/275 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 +S G ++ ++++ ++LGV +D+ + L+ A A + +V ++ + Sbjct: 10 RRSQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDAAALAGAVVADETSDV 69 Query: 75 VSSRAKNSFTFPK--QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 ++ A T + + N F+ + + Y V Sbjct: 70 AAAEAAAVTTLANISASAGNSELTFTDGNTAVTFSSDRATFVDAASFSTPAGEYDIYVRV 129 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S L S+F + I + + ++ A S + + Sbjct: 130 AVSDIGLTQFLSSIFGINKNISASAVAGRSAAINYSCNLTPIAMCGDPLGDASNAWGYKP 189 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + ++ G + + +L G + Sbjct: 190 PGYDPNVDTDPSLVHELKVGDQDNTDMGPGNFQLLDFGQAGGNSGAALVRDALSGAYNGC 249 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + V + + L IK Sbjct: 250 AAIGDTVITKPGNSAGPVAQGLNTRLNDFSGPIKD 284 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 33/396 (8%), Positives = 91/396 (22%), Gaps = 31/396 (7%) Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + A+ + + + LI + V + Sbjct: 142 ATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNK 201 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + +P + + Q +R + + Sbjct: 202 DGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNV---- 257 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 R + + + ++N ++G + ++ ++ + S Sbjct: 258 ------RGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINM 311 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S+ + ++ + K+ +++ S R N Sbjct: 312 GYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKK 371 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + GV + + + T + T S N + Sbjct: 372 VIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKR 431 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR-----YFLSNC 422 + T GE K +GI I + + + + Sbjct: 432 INSTFIATIGEAM-----------ALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREM 480 Query: 423 AS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 S + ++E A+ +++ + V Sbjct: 481 VSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 516 >gi|198426244|ref|XP_002124558.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 430 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 31/267 (11%), Positives = 70/267 (26%), Gaps = 31/267 (11%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 P ++ + +NG D + +L+P + S Sbjct: 163 VADETPPPHICQALNFKNGSASCTDGNNLSSNCTFECDADLVLHPANVSSSQCTPSGWIE 222 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKID-NVNDTVRMGATF------------------F 302 R V+ S + D + + Sbjct: 223 PPPCCTKLCPPFARTDAIIVVDSSSSVKRPNWDKMIAFIVNMLRQFTIDQSSLRIGAFRY 282 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 N RV +D + + G T A+ + ++ + Sbjct: 283 NRRVHTDTQILLNQFNNDKDGLLAAIQSLPYNGGGTNTGRAIAHVTNVMLKAENGNRP-- 340 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + + +VL+TDG QD + ++ ++ G + +A + + + L Sbjct: 341 ----DVQDLVVLITDGRA-QDRVDLVSA--DLRATGAVVFVVAVILPGS-TIRLSQMLEI 392 Query: 422 CASPNSFFEANST-HELNKIFRDRIGN 447 + + +S L+ F + Sbjct: 393 SGTNETLLIVDSGFDGLDTAFSSMLTK 419 >gi|149067641|gb|EDM17193.1| rCG39970 [Rattus norvegicus] Length = 1163 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 60/169 (35%), Gaps = 17/169 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +D + +++ + +++ + + +F+ + L + + Sbjct: 172 KDFVKALMGEFASTSTL-----FSLMQYSNILKTHFTFTEFKNILDPQSLVDPIVQLQ-- 224 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++T + + S + AKK ++++TDG+ +D E + A Sbjct: 225 GLTYTATGIRTVVEELFHSKN------GSRKSAKKILLVITDGQKYRDPLEYSDVIPAAD 278 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 GI I + Q+ A L+ S + F+ + L I Sbjct: 279 KAGIIRYAIGVG-DAFQEPTALKELNTIGSAPPQDHVFKVGNFAALRSI 326 >gi|189190514|ref|XP_001931596.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187973202|gb|EDU40701.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 1331 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 35/322 (10%), Positives = 97/322 (30%), Gaps = 34/322 (10%) Query: 121 EMNPRKSAYQVVLSSRYDLLLNPL--SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + A +++++ +++ +N + + + ++ K+ + + + + + Sbjct: 932 PVPASEHAIELIVNPDHEIFINDVVSTAAVPAVNQKTEELCLYRQKFSTLSRIPSQVPLR 991 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + Y + + + + + +P ++ ++ Sbjct: 992 FWTKLRNEG--DGYCTGTNYDTHWKTMYSCFNREFCTGILDEEPCVHKVQNAP--INASQ 1047 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 L + L L +E + + + +++ + I + + +G Sbjct: 1048 PLVFKLLLDTLSTPSNE---------TNHLTRLDVLKQMFDAYINRVLA---YSFQPHIG 1095 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 FN + + V + G TAI D++ A D I Sbjct: 1096 LVTFNTKTQVAQKITNAVENSR-----HKLNNLAAYGDTAIWDSVALAQDQIQ------- 1143 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 K AK I+ ++DGE+ + + GI + + + + Sbjct: 1144 QHAKQYPNAKLRIICISDGEDNTSLNTVEDVAKRLTRCGIVVDSFCLGNTGNLRLQTLSS 1203 Query: 419 LSNCASPNSFFEANSTHELNKI 440 L + F + E I Sbjct: 1204 L----TEGYVFAPKTLDEAMAI 1221 >gi|114569600|ref|YP_756280.1| von Willebrand factor, type A [Maricaulis maris MCS10] gi|114340062|gb|ABI65342.1| von Willebrand factor, type A [Maricaulis maris MCS10] Length = 555 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 33/281 (11%), Positives = 71/281 (25%), Gaps = 23/281 (8%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 T E + S+ + + + +N F Sbjct: 93 STADEPVSTFSIDVDTASYSLVRNSLEAGRLPPTDAVRIEEMVNYFDYDYALPPGPDEPF 152 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 V + + + + P + +D S + K L A+ Sbjct: 153 ATHVTVTPTPWNADTQLMHIGIQGYEIIPDERPRANLVFLIDVSGSMNSPDKLPLAVQAM 212 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ + D V + + + + GSTA Sbjct: 213 HLLVDELHPDDTVA------LVVYASASGVVLPPT---EARNAREIHRALDSLSAGGSTA 263 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKS 395 + AYD + +++ ++LLTDG+ +E + + Sbjct: 264 GGAGLALAYDLAEQNFDEDAVNR---------VMLLTDGDFNVGVTQDERLEDFVARKRD 314 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 GI + + F + + A + +S E Sbjct: 315 SGIYLSVMGFGRGNYNDQMMQTIAQ--AGNGTAAYIDSRQE 353 >gi|325275094|ref|ZP_08141074.1| hypothetical protein G1E_17455 [Pseudomonas sp. TJI-51] gi|324099771|gb|EGB97637.1| hypothetical protein G1E_17455 [Pseudomonas sp. TJI-51] Length = 651 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 16/176 (9%), Positives = 41/176 (23%), Gaps = 3/176 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 G ++ + + + L +++D R + L++ A A + A+ Sbjct: 4 SFAARQRGAIGLMAVVTLGLALLFMLLVIDSGRLYLEQRKLQRIADMAALEAAGQYAVCT 63 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + + T V Sbjct: 64 GSGPQATVVA---RAAATRNGHAPGNPLAASCGYLQTGANSLRTFTTDNNRTEAIRVDVS 120 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + SL + + + A A + + +D +S Sbjct: 121 NTVPTSVAAGVYSLAQGTGVPLTTTLHAHAVASSPVPPQAMLSIRTTLATVDSRQS 176 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 43/416 (10%), Positives = 104/416 (25%), Gaps = 60/416 (14%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + KN + + +T+ + + L LF + K Sbjct: 150 SFSFSIDNKNQETAQTNEKTSETSTSYENHEISDDSSLDKDEYRPFEQPQLFNAQLKPKG 209 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + E T + + + + Q + LN + SY Sbjct: 210 LA-TIEPEYTTDEQGTYPKAMWQPDNSQYVRNHQGNRQGQQQWDGLNGWDGNPTNRNNSY 268 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIK 269 G+ D + + + LY + VD S + Sbjct: 269 IEYGGEKEDADYAIRKFAKETATPGLFDLYLNVRGNTQKNITPLDLVLVVDWSGSMNDNN 328 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-A 328 + V+ + + ++ + D + MG ++ S + + + + Sbjct: 329 RIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSIEGYSYSNGAVQMGSFDSVKNQVKSI 387 Query: 329 IDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRMKNNLEAKKYIV 372 G T A++ A + + + +V R+ + Y Sbjct: 388 TPSWTNGGTFTQKALRDAGNMLSVPNGHKKVIVLLTDGVPTFSYKVQRVHAQSSSNYYGT 447 Query: 373 LLTDGENTQDNE------------------------EGIAICNKAKSQGIRIMTIAFSVN 408 ++ ++ N I K +GI I + + Sbjct: 448 QFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQ 507 Query: 409 KTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGNEIFERVI 454 + S + ++E+ ++++ + +I V Sbjct: 508 SDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AVQISATVS 562 >gi|239906054|ref|YP_002952793.1| hypothetical protein DMR_14160 [Desulfovibrio magneticus RS-1] gi|239795918|dbj|BAH74907.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 133 Score = 43.4 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 14/126 (11%), Positives = 35/126 (27%), Gaps = 6/126 (4%) Query: 15 IKSCTGHFFIITALLMPVMLG--VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 I G + AL++ + + G+ VD + + + + +AA+ ++ A+ Sbjct: 10 IDRQDGSISVEFALMLVLFFMPLLIGI-VDFGQILHAQSVVTRAAREGVVAAAR---NQD 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + A + + + QVV Sbjct: 66 IPTAVDAYIQNAGYDTSLAHIATAGSRVAGEPVMVTVRYDTSAMVIIPWQGISPNMTQVV 125 Query: 133 LSSRYD 138 ++ Sbjct: 126 ATATAQ 131 >gi|222101614|gb|ACM44012.1| thrombospondin-related anonymous protein [Babesia bovis] gi|222101618|gb|ACM44014.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 16/120 (13%), Positives = 39/120 (32%), Gaps = 7/120 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D N +R+ T ++ +F + + + M T M Sbjct: 75 TVDLDNTDIRLSLTTYSTPTRQIFTFLD----AAASSTRLALTKLDWMAGTKARSGMTYT 130 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K ++L+TDG ++ + + + +G+ +M I Sbjct: 131 GRALNYVRKAILP--YGRKNVPKALLLITDGVSSDGSY-TAQVAAMLRDEGVNVMVIGVG 187 >gi|198426622|ref|XP_002122673.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 721 Score = 43.4 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 62/177 (35%), Gaps = 14/177 (7%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + S+I + + + RM +N +V + H +T Sbjct: 541 KQFVRSIITTFNFGE---NEARMAVFRYNRQVDTRNQILLSDHINNKTTFLEAYDKLPYN 597 Query: 335 G-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ A + I+++ +K+ I+ +TDG +QDN I+ + Sbjct: 598 GFGTFTGRALRHAKNVILANRNGNRPNVKD------VILTITDGR-SQDNVATIS--TEL 648 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST-HELNKIFRDRIGNEI 449 + G+ I L+ +P + A L+ F +R+ I Sbjct: 649 REMGVTTFVIGIQPGNGAGLDQDQLLAMGGTPENTILAVGGFGGLDATFLNRLSASI 705 >gi|332244061|ref|XP_003271190.1| PREDICTED: collagen alpha-1(XII) chain [Nomascus leucogenys] Length = 3100 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1364 LSRIVDDL 1371 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 59/204 (28%), Gaps = 29/204 (14%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + + + + A+ + + ++ R+G FN Sbjct: 98 SGSMWDALDGTPTEDPDRMRITHAKRAIREFLPLL------SERDRVGLAVFNRTYRMIQ 151 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 ++ + E T + +M+ A + ++ Sbjct: 152 PIVDDPALVLEKLDAIERPS-REQAYTELYRSMEEALTSFEEEGR------------RRV 198 Query: 371 IVLLTDGEN-----TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 +V+L+DGEN + + A GI I F K + + Sbjct: 199 LVVLSDGENFPVDPEKSPATPGTAVDLAHRYGITCYVIHFGTEKDRLIGDLASE----TG 254 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 F+A + EL ++ I ++ Sbjct: 255 GRVFDARNALELASVY-TAIQEQV 277 >gi|209886526|ref|YP_002290383.1| TadE family protein [Oligotropha carboxidovorans OM5] gi|209874722|gb|ACI94518.1| TadE family protein [Oligotropha carboxidovorans OM5] Length = 187 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 6/39 (15%), Positives = 16/39 (41%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVV 44 + ++ + G + AL+ P+ + +V+V Sbjct: 10 RLSNALRRFGANRKGSAAVQFALVAPLFFALIFAIVEVA 48 >gi|85859128|ref|YP_461330.1| hypothetical protein SYN_01503 [Syntrophus aciditrophicus SB] gi|85722219|gb|ABC77162.1| hypothetical membrane protein [Syntrophus aciditrophicus SB] Length = 173 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 18/145 (12%), Positives = 42/145 (28%), Gaps = 4/145 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA---QTAIITA 64 SK+++ G + AL+MP + ++D + Y +H ++ A + Sbjct: 13 KKISKRILSDSRGVTAVEFALIMPFFFLLLFAIIDFGWYFYSQHTIQFATREGTRLALVG 72 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + E+S T + ++ + ++ Sbjct: 73 VQLKDKDGNEMSREDSIIKTIQDNAALAVDPAALQISIYPVAAGYSDPEGWEESQNP-GS 131 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLR 149 +V + Y L + F Sbjct: 132 GGDYMRVRVRYTYHFLTPLIGNFFP 156 >gi|66775067|gb|AAY56127.1| complement component Bf/C2-B [Ginglymostoma cirratum] Length = 757 Score = 43.4 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 54/187 (28%), Gaps = 18/187 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRT-----IVKTF 327 + A+ ++ + I V G F R + + + V +K Sbjct: 278 FKKAINALTTFVTMIKQFEVGVNYGLVMFGSRSCVEVNIAHDDVSDSDSVLQILPTLKYE 337 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 I T + A++T ++ ++ + E + I++ TDG N Sbjct: 338 DITRYSDAGTNMTGALKTIFEMMVLKKASMKDKQAEWREVRHAIMIFTDGRNNMG-GNPK 396 Query: 388 AICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + ++ ++ I + + E + F +L Sbjct: 397 PMMDRIRNFLDIEKVGEEFLDVYVFGLGNDVDTDEMNSVASKKN-NEKHMFFIQ-MEDLT 454 Query: 439 KIFRDRI 445 +F + Sbjct: 455 TVFNSML 461 >gi|326916310|ref|XP_003204451.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 2040 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 58/171 (33%), Gaps = 21/171 (12%) Query: 277 ALASVIRSI-KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 + + I I + D D V++G ++ ++ + ++ +++ + G Sbjct: 133 TVRNFISRIVEVFDIGPDKVQIGLAQYSGDPRTEWNL--NAYRTKEALLEAVTNLPYKGG 190 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 +T A+ + A+K VL+TDG+ +QD+ + + + Sbjct: 191 NTLTGMALDFILK------NNFKQDAGLRPRARKIGVLITDGK-SQDDVVTPS--RRLRD 241 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 +G+ + I + L A+ + L I D Sbjct: 242 EGVELYAIGI------KNADENELKQIATDPDDIHAYNVADFSFLASIVED 286 >gi|284052943|ref|ZP_06383153.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569121|dbj|BAI91393.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 463 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 66/218 (30%), Gaps = 26/218 (11%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF--VDSSSLRHVIKKKHLVRDAL 278 R +++ + S+Y +L P+ + + + K V ++L Sbjct: 34 VSASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESL 93 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGST 337 ++ S + R+ F+D + +K N G T Sbjct: 94 ERLVSSGQADS----RDRIALVRFDDSASVLLPLT---ASTDTASLKNAIGQLRNFSGGT 146 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 + M+ A + + + + ++ TDG+ T D + + + G Sbjct: 147 RMALGMEEALNILKNCDLSSRRT-----------LIFTDGQ-TFDESDCRDLATQFAEAG 194 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 I I + E +LS+ + F T Sbjct: 195 IPITALGVG---EYNEDLLLYLSDR-TGGRVFNVVETQ 228 >gi|114579064|ref|XP_515640.2| PREDICTED: hypothetical protein LOC459430 isoform 2 [Pan troglodytes] Length = 1294 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 58/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 495 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAV 554 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + I ST A++TA+ Sbjct: 555 AWREQLAEVNEDNLKQAQSWIRHIKI-----GSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETRAIYLLTDGRPDQP---PETVIDQVKLFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 VAALTGGEFHFY 659 >gi|114579066|ref|XP_001156305.1| PREDICTED: hypothetical protein isoform 1 [Pan troglodytes] Length = 1059 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 58/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 495 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAV 554 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + I ST A++TA+ Sbjct: 555 AWREQLAEVNEDNLKQAQSWIRHIKI-----GSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETRAIYLLTDGRPDQP---PETVIDQVKLFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 VAALTGGEFHFY 659 >gi|66047599|ref|YP_237440.1| hypothetical protein Psyr_4372 [Pseudomonas syringae pv. syringae B728a] gi|63258306|gb|AAY39402.1| conserved hypothetical protein [Pseudomonas syringae pv. syringae B728a] Length = 678 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 48/177 (27%), Gaps = 4/177 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G ++ AL M + L ++D R + +L++ A A + A+ Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + ++ F Q + + V V + A + + Sbjct: 72 A----SAPDFASQSATRNGFTPNTDGRTLVTRCGTLAVDVAGPRVFVADSTQALAIQVVV 127 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + + + A V+ S ++I+ S Sbjct: 128 AHPVPRSIAAGIGALFEKTPSPPNVTISATAVAASASPLAALTIRSAAVTVDSTGAA 184 >gi|332215868|ref|XP_003257064.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VII) chain-like [Nomascus leucogenys] Length = 2944 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 45/150 (30%), Gaps = 22/150 (14%) Query: 290 NVNDTVRMGATFFNDRVISDPSFS-WGVHKLIRTIVKTFAID--ENEMGSTAINDAMQTA 346 + V F SD + +G+ L AI + G+T A+ Sbjct: 67 PFSGAVSAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHV 126 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 D + K +L+TDG + + + K QG+++ + Sbjct: 127 ADRVFLPQLARPGIP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI- 175 Query: 407 VNKTQQEKARYFLSNCAS---PNSFFEANS 433 + L AS + FF N Sbjct: 176 -----KNADPEELKRVASQPTSDFFFFVND 200 >gi|218531749|ref|YP_002422565.1| TadE family protein [Methylobacterium chloromethanicum CM4] gi|254562693|ref|YP_003069788.1| hypothetical protein METDI4318 [Methylobacterium extorquens DM4] gi|218524052|gb|ACK84637.1| TadE family protein [Methylobacterium chloromethanicum CM4] gi|254269971|emb|CAX25949.1| hypothetical protein METDI4318 [Methylobacterium extorquens DM4] Length = 202 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 31/98 (31%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++ G I A + PV++ + +++ R + L QA AS + Sbjct: 20 FGRAEGGVSAIEFAFIAPVLVILFIAAIEIPRAIATNNRLAQATIAMADLASKNDYADIN 79 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 +V + A+ ++ N V Sbjct: 80 DVFAAAQVVAAPYSLAGTGIVLTAGGVYQVGNDFVARV 117 >gi|47215752|emb|CAG05763.1| unnamed protein product [Tetraodon nigroviridis] Length = 565 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 24/65 (36%), Gaps = 8/65 (12%) Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + A K I L+TDG + + + + +G+ I T+ + L + Sbjct: 177 SRTNATKVIFLITDGYSNGGDPRP--VAAALRERGVEIFTLGI------WQGNIKELHDM 228 Query: 423 ASPNS 427 AS Sbjct: 229 ASEPK 233 >gi|307294185|ref|ZP_07574029.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] gi|306880336|gb|EFN11553.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] Length = 157 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 17/157 (10%), Positives = 44/157 (28%), Gaps = 5/157 (3%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT-AIITASVPL 68 +++ G + AL+MP+++ + M DV + AL++AA+ + + Sbjct: 1 MTRRFFADPMGTSTVELALIMPILVLLACMAGDVAMAFKAKIALQRAAERTGQLATAGGY 60 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + ++ + N + V + Sbjct: 61 TNDTSKT----QAAYNNLAADAAAAAGVSTNNVTVTPTLLCDATVQTASPEVPCADGQQT 116 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 + V + + + + I A Sbjct: 117 KRYVAITISGSYTPMFAKLMPGSNWSTQGIPITGSAS 153 >gi|198417365|gb|ACH87900.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1042 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 30/295 (10%), Positives = 77/295 (26%), Gaps = 42/295 (14%) Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + I V+D S SM D E + + + + P + Sbjct: 439 PLDILVVVDRSGSMQDGIGSIEKYKYWKYKYDEYYHIWRNAGTIYFDNYLGPRYQPDTYT 498 Query: 236 CNKSLYYMLYPGPLDPS-LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 P + ++ + S+ L + V+ + Sbjct: 499 YYDYQSKESVPFGIKRDQAVKDALIGSTGLLQKFLDINPQNQLA--VVGFQGSVAYRYYD 556 Query: 295 VRMGATFFNDRVISDPSFSW-----GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + T +N + + + + ++ + T + A+ A + Sbjct: 557 EKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSLSYEDRNGTNYHAALLKADEK 616 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---------------------NEEGIA 388 + + +K +V ++DG T + Sbjct: 617 LQKVANNG---------HRKIMVFISDGVPTFYFGADNYRSGNGTVSDSNIINSQKGSKL 667 Query: 389 ICNKAKSQ--GIRIMTIAFSVNKTQQEKARY--FLSNCASPNSFFEANSTHELNK 439 ++ K++ + I ++ S + + L + + + T +L K Sbjct: 668 AIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYLSGDDYYSGITDTEQLEK 722 >gi|28948724|pdb|1N3Y|A Chain A, Crystal Structure Of The Alpha-X Beta2 Integrin I Domain Length = 198 Score = 43.4 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 49/149 (32%), Gaps = 12/149 (8%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + F+++ + +F + G T A+Q + ++ Sbjct: 46 TQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTATAIQNVVHRLFHAS 103 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 +A K ++++TDG+ D+ + + A + GI I + + Sbjct: 104 ------YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNS 157 Query: 415 ARYFLSNCASPN---SFFEANSTHELNKI 440 + L++ AS F+ L I Sbjct: 158 WKE-LNDIASKPSQEHIFKVEDFDALKDI 185 >gi|115666324|ref|XP_798926.2| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] gi|115975269|ref|XP_001180487.1| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] Length = 870 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 63/193 (32%), Gaps = 14/193 (7%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS--DPSFSWGVHKLIRTIV 324 K ++A+ +++ + D R F+D V S + V+ Sbjct: 340 YGNKIKQTKEAMYTILDEMNDSD------RFNVLPFSDYVYSGWNSGQMVDVNPYNIRDA 393 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 K F + T +NDA+ + S+ ++ + +LTDG+ ++ Sbjct: 394 KDFIRQLDIQRGTNLNDALLGGLSLLESTGS---MNSTSSNPMVCILFVLTDGKPSEGVT 450 Query: 385 EGIAI---CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 I A +Q I+T+ F R L N +E +S + Sbjct: 451 SLSEIERNVRNANNQRCSIVTLGFGRLVNYNFLVRLALQNRGMARKIYEDSSAAGQLRGV 510 Query: 442 RDRIGNEIFERVI 454 + + ++ Sbjct: 511 YSEVATPLLFNIV 523 >gi|114608138|ref|XP_518589.2| PREDICTED: collagen, type XII, alpha 1 isoform 3 [Pan troglodytes] Length = 3119 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1364 LSRIVDDL 1371 >gi|162138242|gb|ABX82825.1| complement factor B [Sus scrofa] Length = 765 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 46/194 (23%), Gaps = 22/194 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW----GVHKLIRTIVKTFAIDEN 332 A + I+K+ + + G + S + + K D Sbjct: 289 AKNCLKDFIEKVASYGVKPKYGLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHK 348 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+ Y+ + N IVLLTDG + + + + + Sbjct: 349 LKAGTNTKKALLEVYNMMSWGVN-NFPDNWNRTRH--VIVLLTDGLHNMG-GDPVTVIHD 404 Query: 393 AKS-QGIR------------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + I I Q+ F+ L Sbjct: 405 IRDLLNIGRNRKNPREDYLDIYVFGVGPLVNQENINALASKKD-KEQHVFKLKDVDNLED 463 Query: 440 IFRDRIGNEIFERV 453 +F + + Sbjct: 464 VFFQMLDESRSLGL 477 >gi|156120152|ref|NP_001095294.1| complement factor B [Sus scrofa] gi|148724909|emb|CAN87697.1| B-factor, properdin [Sus scrofa] Length = 765 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 46/194 (23%), Gaps = 22/194 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW----GVHKLIRTIVKTFAIDEN 332 A + I+K+ + + G + S + + K D Sbjct: 289 AKNCLKDFIEKVASYGVKPKYGLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHK 348 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+ Y+ + N IVLLTDG + + + + + Sbjct: 349 LKAGTNTKKALLEVYNMMSWGVN-NFPDNWNRTRH--VIVLLTDGLHNMG-GDPVTVIHD 404 Query: 393 AKS-QGIR------------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + I I Q+ F+ L Sbjct: 405 IRDLLNIGRNRKNPREDYLDIYVFGVGPLVNQENINALASKKD-KEQHVFKLKDVDNLED 463 Query: 440 IFRDRIGNEIFERV 453 +F + + Sbjct: 464 VFFQMLDESRSLGL 477 >gi|148724910|emb|CAN87698.1| B-factor, properdin [Sus scrofa] Length = 549 Score = 43.4 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 46/194 (23%), Gaps = 22/194 (11%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSW----GVHKLIRTIVKTFAIDEN 332 A + I+K+ + + G + S + + K D Sbjct: 289 AKNCLKDFIEKVASYGVKPKYGLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHK 348 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A+ Y+ + N IVLLTDG + + + + + Sbjct: 349 LKAGTNTKKALLEVYNMMSWGVN-NFPDNWNRTRH--VIVLLTDGLHNMG-GDPVTVIHD 404 Query: 393 AKS-QGIR------------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + I I Q+ F+ L Sbjct: 405 IRDLLNIGRNRKNPREDYLDIYVFGVGPLVNQENINALASKKD-KEQHVFKLKDVDNLED 463 Query: 440 IFRDRIGNEIFERV 453 +F + + Sbjct: 464 VFFQMLDESRSLGL 477 >gi|311251244|ref|XP_003124541.1| PREDICTED: integrin alpha-M-like [Sus scrofa] Length = 448 Score = 43.4 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 44/124 (35%), Gaps = 4/124 (3%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +I + E ST + ++ + E + A K +V++TDGE Sbjct: 158 SGSINRLDFQRMKEFVSTVMGQFQKSKTLVRTNGRELFHSKSGARENALKILVVITDGEK 217 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA---SPNSFFEANSTHE 436 D + +A +G+ I + K+R L+ A S + F+ + Sbjct: 218 FGDPLGYEDVIPEADRKGVIRYVIGVG-DAFNSWKSREELNTIASKPSGDHVFQVTNFEA 276 Query: 437 LNKI 440 L I Sbjct: 277 LKTI 280 >gi|76808650|ref|YP_333444.1| hypothetical protein BURPS1710b_2049 [Burkholderia pseudomallei 1710b] gi|126452631|ref|YP_001066157.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|167738664|ref|ZP_02411438.1| TadE family protein [Burkholderia pseudomallei 14] gi|167824258|ref|ZP_02455729.1| TadE family protein [Burkholderia pseudomallei 9] gi|167894376|ref|ZP_02481778.1| TadE family protein [Burkholderia pseudomallei 7894] gi|167919038|ref|ZP_02506129.1| TadE family protein [Burkholderia pseudomallei BCC215] gi|226196406|ref|ZP_03791988.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|237812172|ref|YP_002896623.1| TadE family protein [Burkholderia pseudomallei MSHR346] gi|242315614|ref|ZP_04814630.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254188727|ref|ZP_04895238.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|254197381|ref|ZP_04903803.1| TadE family protein [Burkholderia pseudomallei S13] gi|254262100|ref|ZP_04953154.1| TadE family protein [Burkholderia pseudomallei 1710a] gi|254297708|ref|ZP_04965161.1| TadE family protein [Burkholderia pseudomallei 406e] gi|76578103|gb|ABA47578.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|126226273|gb|ABN89813.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|157807056|gb|EDO84226.1| TadE family protein [Burkholderia pseudomallei 406e] gi|157936406|gb|EDO92076.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|169654122|gb|EDS86815.1| TadE family protein [Burkholderia pseudomallei S13] gi|225931623|gb|EEH27628.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|237504336|gb|ACQ96654.1| TadE family protein [Burkholderia pseudomallei MSHR346] gi|242138853|gb|EES25255.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254220789|gb|EET10173.1| TadE family protein [Burkholderia pseudomallei 1710a] Length = 142 Score = 43.4 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 30/121 (24%), Gaps = 9/121 (7%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +L+ G + L P M+ V +VD + + A++ A Sbjct: 4 LARLVHDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGV----- 58 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 V + + T Y + + V + +Y Sbjct: 59 ----VVRVPQLAATDITNIALSYAQGSLVSGGTVGTPVVNVDQSAGTSPGSPLKVTVSYT 114 Query: 131 V 131 Sbjct: 115 Y 115 >gi|320106178|ref|YP_004181768.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924699|gb|ADV81774.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 365 Score = 43.4 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 62/207 (29%), Gaps = 32/207 (15%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + K+ V A +R D F+D D F+ V KL Sbjct: 125 DSSGSMYDKREAVGKASLDFVRLSNPKD------EAFLVDFSDEAFIDQDFTSDVKKL-- 176 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + G TAI DA+ + D + KN K+ ++++TDG++ Sbjct: 177 ---EDGLGYVKASGGTAIYDAVVASADYL----------AKNAKLPKQVLLIVTDGDDNA 223 Query: 382 DNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKAR----YFLSNCA--SPNSFFEANST 434 + + G I + + L A + + Sbjct: 224 SGSTLEDAIRRVQELDGPVIYCVGLLFGPDSNKSESRHARRVLETLAAQTGGLAYFPRKL 283 Query: 435 HELNKIFRDRIGNEIFERVI---RITK 458 E+ + +I ++ R TK Sbjct: 284 SEV-DSIATEVAQDIRQQYTISYRSTK 309 >gi|297685993|ref|XP_002820555.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Pongo abelii] Length = 150 Score = 43.4 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 44/149 (29%), Gaps = 17/149 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNV----------NDTVRMGATFFNDRVISDPSFSWGVHK 318 ++ +A S+ F++R+ V Sbjct: 2 RRPEAPWQDVAFSTDSLVSTTTKDALFTILHDLRPQDHFSIIGFSNRIKVWKDHLISVTP 61 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 K + + G T IN A+Q A + N+ H + IV LTDG+ Sbjct: 62 DSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGK 117 Query: 379 NTQDNEEGIAICNKAKSQG---IRIMTIA 404 T + I N + + I T+ Sbjct: 118 PTVGETHTLKILNNTREAARGQVCIFTLP 146 >gi|153006807|ref|YP_001381132.1| TadE family protein [Anaeromyxobacter sp. Fw109-5] gi|152030380|gb|ABS28148.1| TadE family protein [Anaeromyxobacter sp. Fw109-5] Length = 134 Score = 43.4 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 12/127 (9%), Positives = 38/127 (29%), Gaps = 2/127 (1%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA--SVPLIQ 70 +++ G + AL++PV++ + ++ + + AA+ + + Sbjct: 6 SMVRHARGAAAVEFALVLPVLMLLCLGAIEWGFHFFQREIIVNAAREGARAGSIADADAE 65 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 ++ E + A + R D + + + Sbjct: 66 TVAEDRALAYLGVAGLDGTTCDVAPTLTNTGSPPTPAIRVSIDCTGGSFTLLFKDFMPER 125 Query: 131 VVLSSRY 137 +V ++ Sbjct: 126 IVATAEM 132 >gi|92116018|ref|YP_575747.1| TadE-like [Nitrobacter hamburgensis X14] gi|91798912|gb|ABE61287.1| TadE-like protein [Nitrobacter hamburgensis X14] Length = 189 Score = 43.4 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 14/152 (9%), Positives = 44/152 (28%), Gaps = 8/152 (5%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAI 61 F T + + + G + AL+ P+ + +++ + +L+ +A Sbjct: 7 FRTVAARVAGRFRGNQRGSAAVEFALVAPLFFALLFAIIETGMVFFASQSLETTLQDSA- 65 Query: 62 ITASVPLIQSLEEVS--SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + +++ +A + + + D + Sbjct: 66 ----RMIQTGQAQIAKYDKASFKKDVVCANASVLFDCENGIFVDVHSYPSGFGSVEIDDS 121 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + +P V +++Y + +R Sbjct: 122 ITADPITGDRNFVDNTQYSAGGPGDIVVVRLF 153 >gi|330469792|ref|YP_004407535.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812763|gb|AEB46935.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 565 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 49/400 (12%), Positives = 109/400 (27%), Gaps = 24/400 (6%) Query: 58 QTAIITAS-VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A ++ + + + A+ + + ++ L K T + + Sbjct: 173 DAAGLSGLLSLMTAASSAGGATAEQDRIGALRALASGASALRQDLLAKFPTASDGTTLAS 232 Query: 117 DTAVEMNPRKSAYQVVLS-SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 + Q L L + ++ A+ + E Sbjct: 233 SLGAAALSEEDVIQYNAKKPPVPLAALYLKPSPLPLDYPYAVLPGIEPAKDSAARVLFEV 292 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNC-FGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + + + L + G G P+ ++ + GI L+P V Sbjct: 293 LTTPSFKDKLAAQALRGPDGNWGSGFEPPKGAPSPAGGDPSAAPPPQGGIAAGGLAPDAV 352 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSS--SLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 S + + + + + ++ + +A + ++ Sbjct: 353 ERAVSSWSIATLSGRMLCIIDVSGSMKKAVPTANGATRQQVTAEAARRGLNLFDDSWSIG 412 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 V R + + + T+ ++ + G T + D + AY + Sbjct: 413 LWVFSTRLD-GSRDYRQVVPTGPLSRQRSTLERSLDTITSSSGDTGLYDTLLAAYKDVQQ 471 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAK-----SQGIRIMTIAFS 406 + E IVL TDG+N D + + K Q I+++ I Sbjct: 472 NWEPGKVNS---------IVLFTDGKNEDADGISQRQLLAELKRIKDPDQPIQVIIIGIG 522 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ E S F A ++ IF I Sbjct: 523 TEVSKAELDTIAQSA---GGGAFVAADPTKIGDIFLRAIA 559 >gi|316969306|gb|EFV53424.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 412 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 46/164 (28%), Gaps = 15/164 (9%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D M ++ FS + + +TA M A Sbjct: 77 GYDRTLMSILQYSKDAHVLLPFSADQRPEQLNEIVEQIQFLGSITATAEAVQMGLA---- 132 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 +A K +L+TDG + + + N +S G + +AF + Sbjct: 133 -------QFGCGTRSDASKVFILITDGNSNNKWPDVVNAANALQSSGATVAVVAFGDSIY 185 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 E Y S + + + L + D G E+ Sbjct: 186 WPEIDLYA----GSSTNAYTEQNVEHLYGLLMDLTGRVCEEQPT 225 >gi|3273269|dbj|BAA31178.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273273|dbj|BAA31180.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273295|dbj|BAA31191.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 43.4 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|160716|gb|AAA29773.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 568 Score = 43.4 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RNLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|260837139|ref|XP_002613563.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] gi|229298948|gb|EEN69572.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] Length = 195 Score = 43.4 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 48/140 (34%), Gaps = 11/140 (7%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V + + ++G + + + + S +K + ++ G T Sbjct: 66 NFVATTTSDFQIGPNNAQVGIVQYANWLYEEVSL--NQYKTLDELLPAIYNISYWGGGTY 123 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ + ++ + + K ++++TDG+ + D+ A+ +AK GI Sbjct: 124 TGWAIDYVVNATLTESRGARQ------DVPKVVIVVTDGQ-SADDVRQPAL--RAKQSGI 174 Query: 399 RIMTIAFSVNKTQQEKARYF 418 ++ I E Sbjct: 175 IMVAIGVGSIYDGTELVEIA 194 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 43.1 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 45/364 (12%), Positives = 94/364 (25%), Gaps = 20/364 (5%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I+ + A EE L+ +N + I D + Sbjct: 1062 TISIGGDTGTGSILDNEVAPIVTNQSVSLSEEGLMNGISDNNGLPTDSTDAYSIQGDLVI 1121 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI-KSWLIQTKAEAETVSRSYHKEHGVSI 179 V + + L S G +W + S + ++ Sbjct: 1122 TDANNNLHDTVFSETSTINITTSTGQPLTSSGSLVTWNVSNGGHTLVGSANGQNVIEATL 1181 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + + K S G + Sbjct: 1182 TSDGHYEIHLKGPVDHTNVNGEDSLVLQIPVIAKDTSGLTSTGGQISVSIEDDQPVAKNI 1241 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + ++ S K +++ A++ ++ + VR+ Sbjct: 1242 DISVSPETKSNTNVQLILDTSGSMSNSSNGKLAIMKAAVSKMLDQYHDM----GDVRVQL 1297 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 FN R G + + K G T +DA++ A S + + H Sbjct: 1298 IDFNSRSTRLE--FNGRAWMTVSEAKYLVNRLTAGGGTDYDDAVKKARQ----SWDHDEH 1351 Query: 360 RMKNNLEAKKYIVLLTDG-------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +N Y ++DG + T + E N S GI +I + + + Sbjct: 1352 LQLDNANNVSY--FISDGKPQDGHDDATISDNEETKWANHLISNGITSQSIGINSSGSLN 1409 Query: 413 EKAR 416 + + Sbjct: 1410 DSGQ 1413 >gi|58429529|gb|AAW78168.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 43.1 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RNLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|222101622|gb|ACM44016.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 43.1 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 38/120 (31%), Gaps = 7/120 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D N +R+ T ++ +F R + M T M Sbjct: 75 TVDLDNTDIRLSLTTYSTPTRQIFTFLDAAASSTR----LALTKLDWMAGTKARSGMTYT 130 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K ++L+TDG ++ + + + +G+ +M I Sbjct: 131 GRALNYVRKAILP--YGRKNVPKALLLITDGVSSDGSY-TAQVAAMLRDEGVNVMVIGVG 187 >gi|293344916|ref|XP_001079629.2| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|293356747|ref|XP_215375.5| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|149043683|gb|EDL97134.1| procollagen, type VI, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1034 Score = 43.1 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 65/192 (33%), Gaps = 27/192 (14%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V Sbjct: 67 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLMRMPSGRDELKASID 126 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 127 AVK-YFGKGTYTDCAIKKGLEELL--------IGGSHLKENKYLIVVTDGHPLEGYKEPC 177 Query: 387 ---IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANS---THEL 437 N+AK GI++ ++A + LS A+ +++ F A + + Sbjct: 178 GGLEDAVNEAKHLGIKVFSVAI-----TPDHLEPRLSIIATDHTYRRNFTAADWGHSRDA 232 Query: 438 NKIFRDRIGNEI 449 + I + Sbjct: 233 EETISQTIDTIV 244 >gi|45332244|gb|AAS58046.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 655 Score = 43.1 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 38/120 (31%), Gaps = 7/120 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D N +R+ T ++ +F R + M T M Sbjct: 75 TVDLDNTDIRLSLTTYSTPTRQIFTFLDAAASSTR----LALTKLDWMNGTKARYGMTYT 130 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K ++L+TDG ++ + + + +G+ +M I Sbjct: 131 GRALNYVRKAILP--YGRKNVPKALLLITDGVSSDGSY-TAQVAAMLRDEGVNVMVIGVG 187 >gi|17229274|ref|NP_485822.1| hypothetical protein all1782 [Nostoc sp. PCC 7120] gi|17130872|dbj|BAB73481.1| all1782 [Nostoc sp. PCC 7120] Length = 615 Score = 43.1 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 49/148 (33%), Gaps = 22/148 (14%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++D V + + +K G T ++ + + Sbjct: 78 VVVYDDAVDTVVPPQPVT---DKPALKKSIRQVRAGGITNLSGGWLKGCEYV-------- 126 Query: 359 HRMKNNLEAKKY--IVLLTDGENTQDNEEGI---AICNKAKSQGIRIMTIAFSVNKTQQE 413 K+ L+ +K ++LLTDG ++ A + +GI T+ F+ + Sbjct: 127 ---KHQLDPQKINRVLLLTDGHANMGIQDPKILTATSTQKAEEGITTTTLGFAQGFNEDL 183 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIF 441 A+ +F+ S E ++F Sbjct: 184 LIGM---ARAANGNFYFIQSIDEAAEVF 208 >gi|320101968|ref|YP_004177559.1| hypothetical protein Isop_0415 [Isosphaera pallida ATCC 43644] gi|319749250|gb|ADV61010.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 783 Score = 43.1 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 10/59 (16%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 +V+++DG + + + + K++GI I ++ ++T E + + + F Sbjct: 212 MVVVSDGASNSGRP-PLTVAEQFKARGIPITSVGLGSDRTGPEGRDIAVREIVTAPTVF 269 >gi|294139780|ref|YP_003555758.1| type IV pilin biogenesis protein [Shewanella violacea DSS12] gi|293326249|dbj|BAJ00980.1| type IV pilin biogenesis protein, putative [Shewanella violacea DSS12] Length = 1195 Score = 43.1 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 18/196 (9%), Positives = 44/196 (22%), Gaps = 13/196 (6%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S++ R + N ++ R + + + + + Sbjct: 310 STNTPLCETLYEAYRYFSGKGVYYGNDDSNYSNWYRANRPPRDTSIEDNGKYISPFKECQ 369 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + D + N ++ S + + T+ + Sbjct: 370 NRAYVVYITDGVPTVDQSANSKVKALPGVSSSDKFVNSSPNFTSYLPALAGWMNTN-DVN 428 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 + Q + TI F + A L+ A ++ A S +L Sbjct: 429 TNAAFPE-------EQNVTTFTIGF---SDGADDAAPLLTRTAELGGGEYYSAKSATQLQ 478 Query: 439 KIFRDRIGNEIFERVI 454 + Sbjct: 479 AALSQVFSQILEVNAS 494 >gi|198430141|ref|XP_002124276.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 606 Score = 43.1 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 18/204 (8%), Positives = 53/204 (25%), Gaps = 14/204 (6%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + + D+ +++ D + ++ Sbjct: 354 VCSDGNNVNSDCLATCDDPAHVIYPPSFETVTCQANTTWTDDAPCCVLPCPPHVPTDLVI 413 Query: 270 KKHLVRDALASVIRSIKKIDN-------VNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 R IK I +++ + +N+R Sbjct: 414 ILDASSSVNEDNWRRIKSITVQLIIKFMSSESTQFAVFRYNNRPDVATQILLQNTNDAAA 473 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ T A+ A D I+ ++ + + ++++TDG+ + Sbjct: 474 LLNAIENIPYGGVGTRTGIALGHAADVILHTDNGNRPKAAD------IVLIMTDGQVNVN 527 Query: 383 NEEGIAICNKAKSQGIRIMTIAFS 406 + + + G +M + Sbjct: 528 D-NTQVPADALHNMGATVMAVGIE 550 >gi|3273249|dbj|BAA31168.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273271|dbj|BAA31179.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 43.1 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 53/184 (28%), Gaps = 12/184 (6%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ + E ++ + V+ A+ ++ I++++ Sbjct: 23 VNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWVKHAVPLAMKLIQQLNLN 82 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYD 348 + + + F++ K I+K+ G T + DA+ Sbjct: 83 ENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRK 142 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + N A + +V+LTDG + K +G++I Sbjct: 143 HLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQG 193 Query: 409 KTQQ 412 Sbjct: 194 INVA 197 >gi|158337332|ref|YP_001518507.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158307573|gb|ABW29190.1| von Willebrand factor, type A domain protein [Acaryochloris marina MBIC11017] Length = 686 Score = 43.1 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 35/279 (12%), Positives = 83/279 (29%), Gaps = 27/279 (9%) Query: 158 IQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 + + T S +++ I + L + + L + + K Sbjct: 219 LPQRTPLSTFSIDVDTASYSNVRRFI--RQGQLPPKDAVRLEELINYFDYGYASPKGDQP 276 Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDA 277 + + + + L + + +D S K LV+ + Sbjct: 277 FSVSTEVATAPWNNQHKLVHIGLK--GKELEKEQPSNLVFLIDVSGSMKRPNKLALVKKS 334 Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST 337 L ++ +K D V+ + R + G + + GST Sbjct: 335 LCLLVHQLKPEDRVS------LVVYAGRAGIVLPSTPGT---QKATIMNAIDRLEAGGST 385 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAK 394 A ++ AYD E + NN ++L TDG+ + E + + + Sbjct: 386 AGAAGIKMAYDM----AERHFLKNGNNR-----VILATDGDFNVGQSSDAELERLIEQKR 436 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +G+ + + + + K + ++ ++ Sbjct: 437 DRGVFLTVLGYGTGNYKDNKMELLANK--GNGNYAYIDT 473 >gi|114608140|ref|XP_001142912.1| PREDICTED: collagen alpha-1(XII) chain isoform 2 [Pan troglodytes] Length = 3063 Score = 43.1 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1364 LSRIVDDL 1371 >gi|302869502|ref|YP_003838139.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504036|ref|YP_004082923.1| von willebrand factor type a [Micromonospora sp. L5] gi|302572361|gb|ADL48563.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315410655|gb|ADU08772.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 572 Score = 43.1 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 48/398 (12%), Positives = 103/398 (25%), Gaps = 31/398 (7%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + +E A + + + + L+ F + + Sbjct: 189 AAASSTGESGSPKAQEAMVGALRALATNRSSLRQDLLARFPRSSDPTAIANGLGAAALSE 248 Query: 119 AVEMNPRKSAYQVVLSSRY--DLLLNPLSLFLRSMGIKSWLIQTKAEAETV--SRSYHKE 174 + + + L++ Y + F GI+ V + + Sbjct: 249 EDVIAYNSTKPPIRLAALYLEPAPIPLDYPFAVLPGIEPTKASAARVLFEVLRTPGFKDR 308 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 D + P N + + ++ + +S Sbjct: 309 LASQALRAPDGNWGRGFQAPTGAPSPANGGASQVPPSGQGGAAD-----LDPGAISTATT 363 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + + +D S S + V +++ + + I + Sbjct: 364 TWSVATQSGRMLCVIDVSGSMKKPVATANGASREQVTVAAASQGLGLFDDSWSIGLWTFS 423 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + + ++ S L R + G+T + D M AY + Sbjct: 424 TNLQGSQDWSELVGIKPLSSNRGSLQRGLASIK----PSSGNTGLYDTMLAAYKKVQQDW 479 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC----NKAKS--QGIRIMTIAFSVN 408 E IVL TDG+N D+ NK K Q ++++ I Sbjct: 480 EPGKVNS---------IVLFTDGKNEDDDGISQKALLDQLNKLKDDEQPVQVIIIGIGTE 530 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + E + F ++ +IF I Sbjct: 531 VNRAELESIT---KVTGGGAFVTTDPSKIGEIFLRAIA 565 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 43.1 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 44/132 (33%), Gaps = 16/132 (12%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D+ R+G ++ V + + ++ + G T AM+ A + Sbjct: 256 DSTRVGVVQYSSIVRQEFAL--DTFSNLQGLESGIQSIPYMAGGTRTGAAMEYAIQNSFT 313 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 S + + IVL+TDG + D + KAK GI + + Sbjct: 314 SANGARPDVGH------VIVLVTDGRSYDDVSQAS---QKAKQAGIVVFAVGIGDGA--- 361 Query: 413 EKARYFLSNCAS 424 L+ AS Sbjct: 362 --VESQLNQIAS 371 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 43.1 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 44/341 (12%), Positives = 87/341 (25%), Gaps = 27/341 (7%) Query: 112 RDIVRDTAVEMNPRKSAYQVVLSS----RYDLLLNPLSLFLRSM-GIKSWLIQTKAEAET 166 + T +N + V S D NP+ L + G +E Sbjct: 164 NATDKSTVTTVNTQGWGQSVTAISQQIKTADEPANPIRLSVELDSGFALTADDITSEHHP 223 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 ++ S E + + L+ A + + G V + Sbjct: 224 INISQQGEKNSGYHIELAQEHIANQDFALTWQPALSDAPSAAHFSETQGKYRYGLVMLTP 283 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 Y + PS +D+S + A+ + ++ Sbjct: 284 PVQDAYHSTGG-------AVAQQMPSREVVFLLDTSGSMAGES-IVQAKRAVDFALTQLR 335 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 DNVN FND + + + + + G T + A+ A Sbjct: 336 PEDNVN------IIQFNDAPQALWKRAMPATAKHIQRARNWVASLHADGGTEMAPALTLA 389 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + +V +TDG + ++ I +K R+ TI Sbjct: 390 ---LNKPSLHRDDSDLLGSHKLRQVVFITDGSVSNEDALMSLIESKLADN--RLFTIGIG 444 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + A +F ++ Sbjct: 445 SAPNSYFMTQAAQ---AGRGTFTYIGDIQQVQHKMTALFNK 482 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 43.1 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 39/133 (29%), Gaps = 17/133 (12%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K + AL + + K N N R F+D V +G + + + + Sbjct: 211 TAMKLQSAKSALQAAVNYFKS--NYNQNDRFALIPFSDGVREASVVPFGKYSNVASQLDA 268 Query: 327 FAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 G T + A+ A KKYI+ LTDG T N Sbjct: 269 ILNTGNSLTAGGGTNYSAALSLAKSYFTDPTR------------KKYIIFLTDGMPTVLN 316 Query: 384 EEGIAICNKAKSQ 396 + K Sbjct: 317 AVDTITYREVKQN 329 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 39/370 (10%), Positives = 77/370 (20%), Gaps = 15/370 (4%) Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + +V T + + L + Sbjct: 117 TYQITVDAYRVLGNGKEDVYFSFPQTPYQYTRQTGVSTAKLDFSLSFSQPEYAKPPNGDA 176 Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 + + + S + + +V+D S SM + S L Sbjct: 177 QGRLDVTLVPQG---AVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNY 233 Query: 213 KSY---SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP-LDPSLSEEHFVDSSSLRHVI 268 + G+R+ + P+ N + + ++ Sbjct: 234 NQNDRFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKS 293 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 R + V + + T + Sbjct: 294 YFTDPTRKKYIIFLTDGMPTVLNAVDTITYR-----EVKQNFWGGYSYTGNRVTDSLSVT 348 Query: 329 IDENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + G TA I Y S+ + Y + E + + Sbjct: 349 YELYSDGRTAGIRFTDNKGYSRRFYSDGQDYVNGWRVSWDNGYSFTYSSIE-GKIRADAT 407 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF-EANSTHELNKIFRDRIG 446 A+ I + +I F N LS A + + L + F Sbjct: 408 AVAKTLGMNNITLYSIGFGDNDEVDMDYLRSLSATAGGEARQGTTQNLTALFQQFSQLAT 467 Query: 447 NEIFERVIRI 456 IRI Sbjct: 468 TPAITGTIRI 477 >gi|160858155|emb|CAP19997.1| collagen type VI alpha 5 precursor [Homo sapiens] Length = 591 Score = 43.1 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 45/134 (33%), Gaps = 17/134 (12%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 D VR+G ++D + + + + + K + G T A+ I Sbjct: 475 GPDKVRVGVVQYSDDTEVEFYITDYSNDID--LRKAIFNIKQLTGGTYTGKALDYILQII 532 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +D + Y+++LTDG +T E + +++ I + + Sbjct: 533 KNGMKD------RMSKVPCYLIVLTDGMSTDRVVEP---AKRLRAEQITVHAVGIGA--- 580 Query: 411 QQEKARYFLSNCAS 424 + L A Sbjct: 581 ---ANKIELQEIAG 591 >gi|154496349|ref|ZP_02035045.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] gi|150274432|gb|EDN01509.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] Length = 1896 Score = 43.1 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 36/309 (11%), Positives = 87/309 (28%), Gaps = 53/309 (17%) Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG----------QPADRTVK 213 + V+ ++ + + +V+D S SM + + + + Sbjct: 77 STKVNDIKIEQQPLEVTFVLDVSGSMAWCAEEHRHSADCGYNCGLEEHRHGDDCYTKCTR 136 Query: 214 SYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS---------- 262 G+ S + YY L + + S + Sbjct: 137 QNHPDHWENWGLVWVHKLGTDCSLSWGSYYYLTCTKAEHTHSSSCYACGKVEHNHDRASG 196 Query: 263 -------SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + A+ +++ +++ + + + F+ + Sbjct: 197 GEPCSMVQSGKAQSRLKIAVSAVKNMVGTLR--EQLGGKLTAKFVVFSSEGYKN---GVD 251 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + +G T ++ + D SS A++ +V++ Sbjct: 252 KRASAKVITEAQLDQLTAVGGTDLSAGVALGVDQFKSS------------SARQVLVVVA 299 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG+ D+ N K++ I T+ F+ + + ANS Sbjct: 300 DGD--SDDGYPNRTANNFKNKDGIIYTVGFTFSSDSFNNLAT------DADHALLANSDT 351 Query: 436 ELNKIFRDR 444 EL + D Sbjct: 352 ELGEAMEDI 360 >gi|33333550|gb|AAQ11891.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 43.1 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + +T+ + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENTIHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|83951472|ref|ZP_00960204.1| hypothetical protein ISM_12955 [Roseovarius nubinhibens ISM] gi|83836478|gb|EAP75775.1| hypothetical protein ISM_12955 [Roseovarius nubinhibens ISM] Length = 178 Score = 43.1 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 6/51 (11%), Positives = 20/51 (39%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + G+ + ++ PV L + +++ + L++ A+ Sbjct: 9 LARFWRGQAGNATVEFVIVFPVFLTLFLFSLELAIITLRHTMLERGLDIAV 59 >gi|330904068|gb|EGH34640.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 84 Score = 43.1 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 27/75 (36%), Gaps = 11/75 (14%) Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 +TA+ DA+ A + + +VL+TDG N + I Sbjct: 21 AGKNTALGDAIGLALKRLR-----------MRPATSRALVLVTDGANNAGQIDPITAARL 69 Query: 393 AKSQGIRIMTIAFSV 407 A +G++I I Sbjct: 70 AAEEGVKIYPIGIGS 84 >gi|119569135|gb|EAW48750.1| collagen, type XII, alpha 1, isoform CRA_c [Homo sapiens] Length = 3063 Score = 43.1 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1364 LSRIVDDL 1371 >gi|93141047|ref|NP_004361.3| collagen alpha-1(XII) chain long isoform precursor [Homo sapiens] gi|146345397|sp|Q99715|COCA1_HUMAN RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor gi|55662663|emb|CAH71310.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203512|emb|CAI19898.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203521|emb|CAI19908.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 3063 Score = 43.1 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFES 1363 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1364 LSRIVDDL 1371 >gi|224823582|ref|ZP_03696691.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] gi|224604037|gb|EEG10211.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] Length = 995 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 36/332 (10%), Positives = 82/332 (24%), Gaps = 37/332 (11%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGV-SIQWVIDFSRSMLDYQRDSEGQPLN---- 202 + + S + + +I D S SM + GQ Sbjct: 18 MPTYPANSCTLPAYNSPYPSAPLQLSVKVTPNILLFFDNSGSMAWDVYGNTGQKRERHCN 77 Query: 203 ---CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS-EEHF 258 C+ ++ + + + + L+ D E+ Sbjct: 78 WRGCWYGDVLDPLQGETRLS-IAKDVTKSMIDAYPDMRWGLFSFDPSSSKDYVNDREDLR 136 Query: 259 VDSSSLR-----HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF-FNDRVISDPSF 312 + + V+ A+ S+ + + Sbjct: 137 YNRAGRLVAEVGGGKDHLDRVKSAIDSLGADTNTPLAEAYYEMTRYYRGMSSAYYKPVNP 196 Query: 313 SWGVHKLIRTI-------VKTFAIDENEMGSTAINDAMQTA---YDTIISSNEDEVHRMK 362 + G R F + + T N+ + + + ++ + Sbjct: 197 ASGSRGSGRYASPVQYRCQANFVLMMTDGEPTEDNEFPLSGDPLFAGLSLGKDNPLPSFA 256 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A +DGE + E A Q I+ TI F+V+ + + Sbjct: 257 GRAAAGDLFTSGSDGEGQGWDAETFA------RQSIKTYTIGFTVDN--TLLQKTAMQ-- 306 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +F A++ +L F + + + Sbjct: 307 -GGGRYFTASNREQLKASFTSAMDDIYRQNST 337 >gi|260837284|ref|XP_002613635.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] gi|229299021|gb|EEN69644.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] Length = 2411 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 55/179 (30%), Gaps = 19/179 (10%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSS----SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 +Y+ P ++ V S + +L + A +V+ ++ D Sbjct: 128 NWYVSAASPKKKNVVIVIDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRD------ 181 Query: 296 RMGATFFNDRVISDPS-FSWGVHKLIRT---IVKTFAIDENEMGSTAINDAMQTAYDTII 351 G F+ R + + + T I++ F T + A+D + Sbjct: 182 WGGVVSFSARAETPEGCLGDSLGEANPTNIGIMQDFINQRVPETITMYGVGFRKAFD-MF 240 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC--NKAKSQGIRIMTIAFSVN 408 + ++ + I+ L+DG T I + + + I T N Sbjct: 241 AEARNKKPEQFEDCYN--IIIFLSDGSPTDKAFALDEITKGQELMDRSVYIFTYGLGAN 297 >gi|116671475|ref|YP_832408.1| TadE family protein [Arthrobacter sp. FB24] gi|116611584|gb|ABK04308.1| TadE family protein [Arthrobacter sp. FB24] Length = 129 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 39/123 (31%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 S G + A+L PV++ + +++ R + +L AA+ + ++ S Sbjct: 3 RASERGAVAVEFAILAPVLVMLLLGIMEFSRAYNAQASLSAAAREGVRVMAISNKISDAN 62 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++R P K + +N + + + M + + Sbjct: 63 KAARDTAVSLQPGLKDTNIVFKNLDTGTTSCAPGNRMTITISYNLSTMTGIAGPFPMTGK 122 Query: 135 SRY 137 Sbjct: 123 GAM 125 >gi|121600676|ref|YP_992999.1| hypothetical protein BMASAVP1_A1673 [Burkholderia mallei SAVP1] gi|124385110|ref|YP_001026331.1| TadE-like protein [Burkholderia mallei NCTC 10229] gi|126440021|ref|YP_001058915.1| TadE family protein [Burkholderia pseudomallei 668] gi|126448781|ref|YP_001080395.1| hypothetical protein BMA10247_0832 [Burkholderia mallei NCTC 10247] gi|167815888|ref|ZP_02447568.1| hypothetical protein Bpse9_12144 [Burkholderia pseudomallei 91] gi|167845805|ref|ZP_02471313.1| hypothetical protein BpseB_10993 [Burkholderia pseudomallei B7210] gi|167902778|ref|ZP_02489983.1| hypothetical protein BpseN_11002 [Burkholderia pseudomallei NCTC 13177] gi|167911023|ref|ZP_02498114.1| hypothetical protein Bpse112_11050 [Burkholderia pseudomallei 112] gi|254177893|ref|ZP_04884548.1| putative membrane protein [Burkholderia mallei ATCC 10399] gi|254179868|ref|ZP_04886467.1| TadE family protein [Burkholderia pseudomallei 1655] gi|254358537|ref|ZP_04974810.1| putative membrane protein [Burkholderia mallei 2002721280] gi|121229486|gb|ABM52004.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293130|gb|ABN02399.1| TadE-like protein [Burkholderia mallei NCTC 10229] gi|126219514|gb|ABN83020.1| TadE family protein [Burkholderia pseudomallei 668] gi|126241651|gb|ABO04744.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|148027664|gb|EDK85685.1| putative membrane protein [Burkholderia mallei 2002721280] gi|160698932|gb|EDP88902.1| putative membrane protein [Burkholderia mallei ATCC 10399] gi|184210408|gb|EDU07451.1| TadE family protein [Burkholderia pseudomallei 1655] Length = 142 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 30/121 (24%), Gaps = 9/121 (7%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +L+ G + L P M+ V +VD + + A++ A Sbjct: 4 LARLVHDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGV----- 58 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 V + + T Y + + V + +Y Sbjct: 59 ----VVRVPQLAATDITNIALSYAQGSLVSGGTVGAPVVNVDQSAGTSPGSPLKVTVSYT 114 Query: 131 V 131 Sbjct: 115 Y 115 >gi|46201041|ref|ZP_00207942.1| hypothetical protein Magn03010639 [Magnetospirillum magnetotacticum MS-1] Length = 142 Score = 43.1 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 38/142 (26%), Gaps = 14/142 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVM-LGVGGMLVDVVRWSYYEHALKQAA------------Q 58 + I+ G + AL +P++ L + G+ D+ + L A Sbjct: 2 RHWIRDERGVSAVEFALALPILALMMVGLA-DMGLAVNEKMRLTSAVRAGAQSGYGNWND 60 Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 +A I ++V + + + + N + Sbjct: 61 SAAIISAVKDASGVSPTALTVTTATSCACADGSAMACGNTCSGGATRTYLTVTATERYSL 120 Query: 119 AVEMNPRKSAYQVVLSSRYDLL 140 V+ + ++ + Sbjct: 121 LVDYPGLSGPASLSATAVLRVQ 142 >gi|300022610|ref|YP_003755221.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524431|gb|ADJ22900.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 638 Score = 43.1 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 61/190 (32%), Gaps = 21/190 (11%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + K +VR+ LA+++++ + + V S Sbjct: 38 SGSMWGRLAPDNKPKIDVVREKLATILQTPSSTRVGLVSFGHRRRGDCNDVELIASPDS- 96 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 R + N G + A++ A D I +S + I+++ Sbjct: 97 ----ERAALLGPLAKLNPRGPGPVTAALEIAADAIGTSRPAQ-------------IIIVG 139 Query: 376 DGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 DG + + A + AKS G+ + I V T++ + A+ +++ Sbjct: 140 DGADNCQQDSCAAANDFAKSAPGVAVQVIGIGVPATERPRIACVAQ--ATGGRYYDVTDA 197 Query: 435 HELNKIFRDR 444 LN + Sbjct: 198 AGLNAALDEA 207 >gi|225874357|ref|YP_002755816.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] gi|225791485|gb|ACO31575.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] Length = 333 Score = 43.1 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 37/115 (32%), Gaps = 4/115 (3%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI----DENEMGSTAINDAMQTAYDTI 350 R+ FN V S K + + + D N + + + + Sbjct: 83 PRVFVVQFNRDVDLLEDPSASASKAQQALSQVGVAQFHGDSNADSTANKGRHAKASGAAL 142 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + H + N +K IV+L+DG + A+ G+ + I F Sbjct: 143 YDAIYLATHNVLNTTAGRKVIVVLSDGIDQGSKTTLNGAVEAAQRAGVAVYAIYF 197 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 43.1 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 44/161 (27%), Gaps = 26/161 (16%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D R F D + + R + G+T +NDA+ + + Sbjct: 129 DGDRAHILNFGDSINVAVGMT-----ADRNRLDQAISGLRAWGATRLNDAVFASAGALAG 183 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN------KAKSQGIRIMTIAFS 406 + IVLL++G + + +++ + G + + Sbjct: 184 AEGRGA------------IVLLSEGPDADPSGAPLSVVDSEAALAAVVESGAPVYAVGLG 231 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 R A+ ++F L F D Sbjct: 232 PGADAALLRRLAE---ATGGAYFPVADAAALPATFSDIATR 269 >gi|282601472|ref|ZP_05981787.2| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] gi|282569002|gb|EFB74537.1| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] Length = 246 Score = 43.1 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 51/160 (31%), Gaps = 13/160 (8%) Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 + ++ + S+++ + + F+ L R Sbjct: 49 TGGTSRLDELQKGIKLFYNSVREDEVARYAAEICIVTFDSEAKCRM----DFANLDR--- 101 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 ++ + G TA+ + + A D + S ++ K + ++VL+TDG + Sbjct: 102 QSDLPELTATGDTAMGEGVNLALDLLESRKR--EYQDKGVDYFQPWLVLMTDGVPNGNEG 159 Query: 385 ---EGIAICNKA-KSQGIRIMTIAFSVNKTQQEKARYFLS 420 + C + + + IA Q A++ Sbjct: 160 EFERAVQRCRDMEAQKKLTVFPIAIGDEGDQTALAKFSAK 199 >gi|119572524|gb|EAW52139.1| integrin, alpha D [Homo sapiens] Length = 366 Score = 43.1 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 61/175 (34%), Gaps = 17/175 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + ++ + +V+ + D + +++ + +F+ + + Sbjct: 164 NDFNQMKGFVQAVMGQFEGTDTL-----FALMQYSNLLKIHFTFTQFRTSPSQQSLVDPI 218 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G T + T + H+ AKK ++++TDG+ +D E Sbjct: 219 VQLK--GLTFTATGILTVVTQLF------HHKNGARKSAKKILIVITDGQKYKDPLEYSD 270 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHELNKI 440 + +A+ GI I AR L+ +S + F+ ++ L I Sbjct: 271 VIPQAEKAGIIRYAIGVGHAFQGP-TARQELNTISSAPPQDHVFKVDNFAALGSI 324 >gi|167648158|ref|YP_001685821.1| hypothetical protein Caul_4199 [Caulobacter sp. K31] gi|167350588|gb|ABZ73323.1| conserved hypothetical protein [Caulobacter sp. K31] Length = 188 Score = 43.1 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 11/134 (8%), Positives = 33/134 (24%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + I + + + G + A + PV++ + ++ + L A + Sbjct: 7 RPRPIKFWVQFWRDRRGASAVEFAFIAPVLVLFYCGMSELTEAMIAQRRLSHIASSIGDV 66 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + + K L + + D + N Sbjct: 67 VARDTQLTDARRTDVFKVGSVLMAPFPTTGLRMCIVSITSNAAGTIDTVDWSEPSNSPTN 126 Query: 124 PRKSAYQVVLSSRY 137 + + + Sbjct: 127 CPAKGAVINIPASV 140 >gi|39997259|ref|NP_953210.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] gi|39984149|gb|AAR35537.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] Length = 575 Score = 43.1 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 16/120 (13%), Positives = 40/120 (33%), Gaps = 22/120 (18%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G T + A+ A+ + E +K + +DG+ T D A Sbjct: 476 NITGSGGTRLGHALWWAWGELS-----------LRRETRKICIAFSDGD-TGDGPVTQAA 523 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + +GI + I + + +L P++ + + + + ++ Sbjct: 524 IKRMREEGIEV--IGIGIQDNS---IKQYL-----PDNHRIIKNLDQFTPALLELLREKL 573 >gi|125973851|ref|YP_001037761.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|256004304|ref|ZP_05429286.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281418012|ref|ZP_06249032.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|125714076|gb|ABN52568.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|255991738|gb|EEU01838.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281409414|gb|EFB39672.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|316939952|gb|ADU73986.1| hypothetical protein Clo1313_0918 [Clostridium thermocellum DSM 1313] Length = 233 Score = 43.1 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 61/191 (31%), Gaps = 18/191 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS------ 65 K L++ G F + A+++P ++ L+ V + Y + ++ A + Sbjct: 11 KGLVRDRKGSFTVEAAIIIPAVILTMFALILVSEFLYQKSCIQAIADRTAQRGAEIWNSP 70 Query: 66 -VPLIQSLEEVSSRAKNSFTFPKQKIE------EYLIRNFENNLKKNFTDREVRDIVRDT 118 +I + + + ++ E I + L + + Sbjct: 71 SKDMIYGQITLDNMDDIDLYWRIWEMSKRKKQKEEKIEKYAGYLLSDSPILGEPIELEIE 130 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK-SWLIQTKAEAETVSRSYHKEH-- 175 A + ++ +S + NP S LR GI + I+ ++A Sbjct: 131 AGIVEDYIVYKKLRVSVKAKYK-NPFSSLLRVFGIGKTITIKAHSDAVINEPVEFIRTTD 189 Query: 176 -GVSIQWVIDF 185 + + +D Sbjct: 190 FVIDVVKEVDN 200 >gi|299139640|ref|ZP_07032813.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598264|gb|EFI54429.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 488 Score = 43.1 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 58/191 (30%), Gaps = 24/191 (12%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + ++ A+A + +++ R+ F + V R Sbjct: 119 PMAGGSTRFQSLKSAIAQFLAGMQEGS-----DRVAIVPFESH-NVVSTIRSAVFTTHRA 172 Query: 323 IVKTFAIDENEMG---STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 G +TA+ A+ + D++ H E + +++++TDG+N Sbjct: 173 DALAQLNALPAPGPKNNTALYQAVFSGVDSMKGELASLQHEGATLAELQPHLIVMTDGKN 232 Query: 380 T----------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 + + ++ + TI + + L FF Sbjct: 233 EVAPGDDPQLLNGDLGLQQAVAQVQTSN--LDTIGIGFG-DKNDIDAGALQK--LTKRFF 287 Query: 430 EANSTHELNKI 440 A+ ++L Sbjct: 288 YASDANQLLAA 298 >gi|255522879|ref|NP_001157342.1| collagen alpha-1(XIV) chain [Equus caballus] Length = 1796 Score = 43.1 bits (99), Expect = 0.099, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 43/152 (28%), Gaps = 20/152 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + ++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFNTKDEVIDAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGARSGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + L AS + Sbjct: 294 V------KNADENELQEIASEPDSTHVYNVAE 319 >gi|329901495|ref|ZP_08272833.1| Type IV fimbrial biogenesis protein PilY1 [Oxalobacteraceae bacterium IMCC9480] gi|327549103|gb|EGF33703.1| Type IV fimbrial biogenesis protein PilY1 [Oxalobacteraceae bacterium IMCC9480] Length = 1132 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 45/397 (11%), Positives = 88/397 (22%), Gaps = 55/397 (13%) Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + N ++ IV+ + Y +S Y + +S S + Sbjct: 93 NGASRLNVAKAGIKAIVQAYMQNTDFALETYSTSGTSLYTTWVYYMSPAGSGFVFTSTKL 152 Query: 159 QTKAEAE------TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 ++ S K + I ++ + + + D Sbjct: 153 AGNRYVANPCLNYALASSTVKSNCTGIAGAGLYAALAVSGSAYMQIGAASDDANINDVLY 212 Query: 213 KSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 S SPY S + + Y PS + Sbjct: 213 ASGQPGVYINYGNTSPASPYPPSYSLANYNNGSVLITYPSSVPSANRATGPTNAGYVPYS 272 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS---WGVHKLIRTIVKTFAI 329 + V + T + + + L Sbjct: 273 PQVMYAQRGFGYGGSQSPSSGNVLVPMTSAGTVPTASSVTTAVNTFLPPLAPETNTLQTT 332 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG------------ 377 + + A + + ++S N K+Y+VL++DG Sbjct: 333 EIKAVAGQAPTAGLLAKAQSYLASVLSTA-----NCAPKQYVVLISDGLPTQDLAGKYWP 387 Query: 378 ------------------------ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ-- 411 N + I K K+ GI I Sbjct: 388 PLGSAAAAGYGLTASFKADGSLNVTNDTALTDTITNLAKLKAAGINTYVIGLGAGVDPSV 447 Query: 412 QEKARYFLSNCA---SPNSFFEANSTHELNKIFRDRI 445 +A L A +++ A+S L + Sbjct: 448 NPQAAATLQAMAIAGGTANYYPASSPAALVDGLNSIL 484 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 69/187 (36%), Gaps = 21/187 (11%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 V S K L S++ + + D R+G F+ Sbjct: 341 DVSGSMADGNPSKLALAVAGALSLVETARPED------RLGIVTFSSGPRWLFPPRPMTA 394 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + KT GST + +A + A + + + K I++LTDG Sbjct: 395 R-GKLEAKTLLDRLRPGGSTRMLEAYRQAIEALEA-----------LELETKQILVLTDG 442 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + +D +A+ A++QGIR ++A + + AR + F++ + +L Sbjct: 443 QVEEDPAALVALAEAARAQGIRTNSVALGGDADRALLARM---SRVGEGRFWDVPTPEDL 499 Query: 438 NKIFRDR 444 ++F + Sbjct: 500 PRLFLEE 506 >gi|221132796|ref|XP_002166108.1| PREDICTED: similar to fibrillar collagen [Hydra magnipapillata] Length = 2213 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 28/255 (10%), Positives = 66/255 (25%), Gaps = 21/255 (8%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + + P C +++ + + + Sbjct: 615 DNCDVKYNFCESDFDCPGYQEC-VFDGCRRKCAYCGKKCTDKLDILLVLDDSIKTGQENF 673 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFND----RVISDPSFSWGVHKLIRTIVKTFAID 330 + + ++ + + + ++D + S S + + + Sbjct: 674 KKSKDFSKAMLEWLSIDQNNTNVAVISYSDIAELHISFPVSGSDDPLQSLYDLQGKIDSI 733 Query: 331 ENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + GST+ ++ A+ A + K AKK I+L TDG +E Sbjct: 734 PYKGGSTSRLDRALSLASTRVFPEG-------KRTRNAKKVIILFTDGSTDVSSERLDVA 786 Query: 390 C-----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIF 441 K + + I + L+N SP F A ++ Sbjct: 787 SWPLRKQKKRDGENEVNAIRIMSVTVSNKTNSNGLANVLSPPFIENTFTAADYDDIFNSI 846 Query: 442 RDRIGNEIFERVIRI 456 + ++ Sbjct: 847 QQIAEESCKFSAGKL 861 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 31/78 (39%), Gaps = 11/78 (14%) Query: 373 LLTDGENTQDNEEGIAI-----CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN- 426 +LTDG T+ + A+ K G++I ++ + E L + AS + Sbjct: 91 VLTDGRQTRGSSAPDAVDLHVASRPLKDIGVQIYSLGIGRDYDIGE-----LLDIASDDA 145 Query: 427 SFFEANSTHELNKIFRDR 444 S F ++ EL I Sbjct: 146 SVFRSSDVDELVSIVASI 163 >gi|110629868|gb|ABG80450.1| fibrillar collagen [Hydra vulgaris] Length = 1883 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 28/255 (10%), Positives = 66/255 (25%), Gaps = 21/255 (8%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + + P C +++ + + + Sbjct: 285 DNCDVKYNFCESDFDCPGYQEC-VFDGCRRKCAYCGKKCTDKLDILLVLDDSIKTGQENF 343 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFND----RVISDPSFSWGVHKLIRTIVKTFAID 330 + + ++ + + + ++D + S S + + + Sbjct: 344 KKSKDFSKAMLEWLSIDQNNTNVAVISYSDIAELHISFPVSGSDDPLQSLYDLQGKIDSI 403 Query: 331 ENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + GST+ ++ A+ A + K AKK I+L TDG +E Sbjct: 404 PYKGGSTSRLDRALSLASTRVFPEG-------KRTRNAKKVIILFTDGSTDVSSERLDVA 456 Query: 390 C-----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELNKIF 441 K + + I + L+N SP F A ++ Sbjct: 457 SWPLRKQKKRDGENEVNAIRIMSVTVSNKTNSNGLANVLSPPFIENTFTAADYDDIFNSI 516 Query: 442 RDRIGNEIFERVIRI 456 + ++ Sbjct: 517 QQIAEESCKFSAGKL 531 >gi|167463706|ref|ZP_02328795.1| hypothetical protein Plarl_14314 [Paenibacillus larvae subsp. larvae BRL-230010] Length = 695 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 8/62 (12%), Positives = 22/62 (35%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 ++ G + LL+ + +L+D R + +QA + + + + Sbjct: 3 SFWQNRRGSVSVYIILLLVPVFFFQAVLIDFARVKAAQKESEQALKAGLRSVLSAFQPDV 62 Query: 73 EE 74 + Sbjct: 63 QT 64 >gi|149919202|ref|ZP_01907685.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149819916|gb|EDM79338.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 877 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 38/427 (8%), Positives = 108/427 (25%), Gaps = 36/427 (8%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSLEEVSSRAKNSFTFPKQKIEEYLIR 96 VD +R E ++ +T ++ + + Q + Sbjct: 157 AAVDDMRMIIGERTIESEVETKAKAVERFADAREAGHTAALLEQERPNVFTQSVTNVAPG 216 Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 + + M S+ ++P +G + Sbjct: 217 ESVEVEVQYVQTLTQDGGNYEFVFPMVVGPRFSPPGTSAEAHAAVSP-----PIVGEGTR 271 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + + TV+ + S + S + + Q + Sbjct: 272 TGHDVSLSMTVAAGGKVQRWDSPTHTVVGSETSDGFALRLADQKTLPNRD--FVVRWQST 329 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + K V+ + ++ L P + V + VI + + Sbjct: 330 AAQAKATAYFGPQLSQSVAGAQPGHFTLVVEPP--QSDLDSLVGQREMIFVIDRSGSMSG 387 Query: 277 ALASVIRSI--KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ + + + ++ F + ++ + F Sbjct: 388 VPLALAKQTLREALSHLRPVDTFNVISFESSTAMLYEAAVPANEQNLVHAERFIDGLQAG 447 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ A+ A + H +Y+ +TDG + ++E + Sbjct: 448 GGTMMSGAVDAA---LSPEIGLGRH---------RYVFFVTDGFISNEDEIARQASALVR 495 Query: 395 SQG-----IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH---ELNKIFRDRIG 446 + R+ + + ++ A + A + + E + + + Sbjct: 496 AADKAGQRARVFGMGIGSSPNRELLASL---SKAGKGRYLAVGNREHPREAVEAYTRMVD 552 Query: 447 NEIFERV 453 + + + Sbjct: 553 SAVLTDI 559 >gi|85374105|ref|YP_458167.1| hypothetical protein ELI_06390 [Erythrobacter litoralis HTCC2594] gi|84787188|gb|ABC63370.1| hypothetical protein ELI_06390 [Erythrobacter litoralis HTCC2594] Length = 195 Score = 43.1 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 39/123 (31%), Gaps = 4/123 (3%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA----IITA 64 + +L+ G + ++ P ++ + + D+ Y L+ Q A + Sbjct: 4 SFLPRLLDDKRGVTIVEFGMVAPTLIVLLLGVFDITYNMYSSSMLQGTVQQAARNSALEG 63 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + Q+ ++ + + E ++F D I D + + Sbjct: 64 ADSMAQNAAVETAVRQVAPNATFTHKRIAYTTFTEVGSAESFDDVNDDGICADGELFEDA 123 Query: 125 RKS 127 + Sbjct: 124 NGN 126 >gi|123233471|emb|CAM28080.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 637 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 45/127 (35%), Gaps = 18/127 (14%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1 KSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKSQ 54 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHEL 437 D E K K +G+ + I + L A+ + L Sbjct: 55 DDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHAYNVADFESL 105 Query: 438 NKIFRDR 444 ++I D Sbjct: 106 SRIVDDL 112 >gi|226326039|ref|ZP_03801557.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] gi|225205581|gb|EEG87935.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] Length = 823 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 39/336 (11%), Positives = 95/336 (28%), Gaps = 29/336 (8%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ + + + A T K+ + + N + + ++ + T ++ Sbjct: 250 VKEVVDMVADETKEATVETTETKEVEKAAFVTNSFSTFTITW-KKDSATYFKLTLHHVDE 308 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + LSS L N F + I A+ + +S + Sbjct: 309 KGNELPTTLSSDATLNYNSDISFSDYVEIIDNYDYKYAKVDKISGKTVTSAVTNGSSGKR 368 Query: 185 FSRSMLDYQRDS----EGQPLNCFGQPADRTVKSYSSQNGKVGI---------RDEKLSP 231 ++ + + + + S+ +G + +K Sbjct: 369 AKLTLYNGKDNKIETLSYSTTTKNIYLVYTAKSTGSTGSGTIEDTAPQLSHQKYIKKKDG 428 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ +D+S+ K + A+ +I ++ V Sbjct: 429 NNYDLTLNVSSKKGTTSSKKKFDIVLIMDTSTSMSNNNKWRNSKTAVNKLIDTLSSQTTV 488 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + R F + W + +++ + + E T D + + + Sbjct: 489 DVNYR--LVTFGTTAQIQTN--WTTGETVKSTLSNY--SIKEDQGTNYEDGLVKTKEALS 542 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 S +A+K IV LTDG+ T Sbjct: 543 SGTR---------ADAEKIIVFLTDGQPTFYKSGTS 569 >gi|194374891|dbj|BAG62560.1| unnamed protein product [Homo sapiens] Length = 723 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 244 FKESASPMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 300 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 301 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 359 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 360 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 419 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 420 FEHM------LDVSKLT 430 >gi|58429471|gb|AAW78139.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429479|gb|AAW78143.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429539|gb|AAW78173.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K +I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 37/293 (12%), Positives = 70/293 (23%), Gaps = 29/293 (9%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDY--QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 SI V+D S SM ++ + S + Sbjct: 97 IKTIGGKSGSIVLVLDNSGSMGWGSSPTPADYARDALKEFANEFLKNGNSGNKLGLVTYS 156 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + + Y+ + S VD + ++ + Sbjct: 157 SGSGVPIYDAYDEIKYVQGYSMTENSDIFGQVVDGLQTPSGETDVQMGIKTARDILAADT 216 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + V N + G + D T D Sbjct: 217 SGNPQFILVFSDGAT-NRSARPTSAGELGGANISPCTFGDKIYDM-----TFKFDGFDYG 270 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N+ V T+G ++ AK QGI I ++ + Sbjct: 271 AQGSAVYNDG---------------VYTTNGNVNAHTVAAVSEALLAKDQGIDIYSVFYH 315 Query: 407 VNKTQQEK---ARYFLSNCASPNSFFEAN--STHELNKIFRDRIGNEIFERVI 454 + + + N AS + E + + +IF + I +I E + Sbjct: 316 NPALNDLEYGAGVFVMKNSASSGQYTEISPGNAGAFAEIFTE-IEKQIQESIA 367 >gi|167566937|ref|ZP_02359853.1| hypothetical protein BoklE_30536 [Burkholderia oklahomensis EO147] Length = 389 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 30/361 (8%), Positives = 85/361 (23%), Gaps = 6/361 (1%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 ++ V++G G+ +D+ + L+ +A A A+ + +S T Sbjct: 1 MLAVLIGFVGLALDLGKLYVTRSELQNSA-DACALAAARDLTGAINLSVPEAAGITSGHL 59 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFL 148 + + + T + ++ S V + ++N L Sbjct: 60 NYALFQGFPVQMQTDLSVTFSDSVSGPFQPKSAISSPSSIKYVKCKTSMTGIVNWFIQAL 119 Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA 208 + S + + + + ++ + + Sbjct: 120 NMVPGVSVANASVSATAVATIGAAQTTCAIPVFICKAGTQTSPPVAGATYNVGDWLSAKT 179 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + G + + + + Y L +++ Sbjct: 180 GSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYNTRF 239 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 H + T + ++D V +F+ + + T Sbjct: 240 GIYHNPYKDPSYGTPDFTGYAYDATTWPAQSNAYSDFVSKRRTFTSY-----QGDLITGI 294 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ++ + + + VL+ D T + I Sbjct: 295 NTGGSYSASYYQAGADRRLALAPEVDCSVLLSGHSAPVLSWDCVLMLDPMGTGGSASPIH 354 Query: 389 I 389 + Sbjct: 355 L 355 >gi|47228041|emb|CAF97670.1| unnamed protein product [Tetraodon nigroviridis] Length = 1009 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 41/116 (35%), Gaps = 16/116 (13%) Query: 320 IRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + +G T I+ A+ + S + + ++ V++TDG Sbjct: 119 TKAQFINGISGIRYLGKGTYIDCALTNMTQEMTQSP--------SPFKPLRFAVVITDGH 170 Query: 379 NTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEAN 432 T + GI + +A+ GIRI +A + + A SP + + Sbjct: 171 VTGNPCGGIKVSAERARDAGIRIFAVA-----ASRNIDETGMREIANSPAMVYRDD 221 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 56/187 (29%), Gaps = 28/187 (14%) Query: 250 DPSLSEEHFVDSSSLRH-VIKKKHLVRDALASVIRSIK-------KIDNVNDTVRMGATF 301 D +S +D+S + D + I + V+ T G Sbjct: 42 DCPISVYFVIDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQNQVSITWMFGGLH 101 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++D V + K T + A+ S +V Sbjct: 102 YSDVVEIYSPLT--RSKDTYLTKLRAINYL--GRGTFTDCAISNMTQQFQSQTARDV--- 154 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K+ V++TDG T G+ + +A+ GI++ +A ++ L Sbjct: 155 -------KFAVVITDGHVTGSPCGGMKMQAERARDMGIKLFAVA-----PSEDVYEQGLR 202 Query: 421 NCASPNS 427 ASP Sbjct: 203 EIASPPH 209 >gi|312092300|ref|XP_003147289.1| hypothetical protein LOAG_11723 [Loa loa] gi|307757546|gb|EFO16780.1| hypothetical protein LOAG_11723 [Loa loa] Length = 422 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 57/158 (36%), Gaps = 11/158 (6%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+ F+ S F+ + + IV E+ G+TAI + + + Sbjct: 265 GRRFSRIALITFSSVGKSRTQFNLDRYFDGKDIVTAIRRLESSGGTTAIGEGI-----RL 319 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS---- 406 + +D+ H + AKK +++ TDG + + + + AK G + TI + Sbjct: 320 GTEQKDKQHGGRPVEIAKKIMLVFTDGWSNKG-PDVEEMTRNAKGAGFTLYTIVYEGNGR 378 Query: 407 VNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRD 443 V+ Y + + + +L + R Sbjct: 379 VDANSPGLNLYTIETMVDDHKHVYSERNFTQLIQELRQ 416 >gi|307292638|ref|ZP_07572484.1| hypothetical protein SphchDRAFT_0110 [Sphingobium chlorophenolicum L-1] gi|306880704|gb|EFN11920.1| hypothetical protein SphchDRAFT_0110 [Sphingobium chlorophenolicum L-1] Length = 198 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 11/128 (8%), Positives = 34/128 (26%), Gaps = 1/128 (0%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + +L G A+++ + + +++ Y A + A Sbjct: 1 MRMRL-SLHHRLAADQRGASVPEFAMILMPLCLILFGGLEMGYQIYVRSVTLGALERASR 59 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 +++ + S + +F N +R +D+ + ++ Sbjct: 60 LSTIQTVNSTAVEADIEATIKRIVPSATISTSKSSFYQYSNINAMERLTKDVNNNGTLDS 119 Query: 123 NPRKSAYQ 130 Sbjct: 120 GDCWEDVD 127 >gi|34481898|emb|CAE46497.1| trap [Plasmodium falciparum] Length = 331 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N AK+ +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENAKQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|53719424|ref|YP_108410.1| hypothetical protein BPSL1811 [Burkholderia pseudomallei K96243] gi|134282410|ref|ZP_01769115.1| TadE family protein [Burkholderia pseudomallei 305] gi|217421983|ref|ZP_03453487.1| TadE family protein [Burkholderia pseudomallei 576] gi|52209838|emb|CAH35810.1| putative membrane protein [Burkholderia pseudomallei K96243] gi|134246448|gb|EBA46537.1| TadE family protein [Burkholderia pseudomallei 305] gi|217395725|gb|EEC35743.1| TadE family protein [Burkholderia pseudomallei 576] Length = 142 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 30/121 (24%), Gaps = 9/121 (7%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +L+ G + L P M+ V +VD + + A++ A Sbjct: 4 LARLVHDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGV----- 58 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 V + + T Y + + V + +Y Sbjct: 59 ----VVRVPQLAATDITNIALSYARGSLVSGGTVGAPVVNVDQSAGTSPGSPLKVTVSYT 114 Query: 131 V 131 Sbjct: 115 Y 115 >gi|333028697|ref|ZP_08456761.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] gi|332748549|gb|EGJ78990.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] Length = 418 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 46/119 (38%), Gaps = 13/119 (10%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 RT K + G T I A+ A D + EA K IVL+TDGE+ Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAADDL------------RGGEASKRIVLITDGED 154 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T + + + ++G+ ++ + + + + A+ ++ T +L+ Sbjct: 155 TCHR-DPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTKDLS 212 >gi|291541798|emb|CBL14908.1| hypothetical protein RBR_05200 [Ruminococcus bromii L2-63] Length = 1244 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 34/313 (10%), Positives = 83/313 (26%), Gaps = 23/313 (7%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 K G I +++ M+ V + VD + S + + A + TA L++ Sbjct: 4 FKKTKGAVSIFLVIILVPMMTVSALFVDASKVSLAKSVSESAGDLTLNTALTDYDTKLKD 63 Query: 75 VSS--RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + +E+Y ++ N D + + V+ Sbjct: 64 MYGLFATAQNTDELYASLEDYYRSCITSSGVSNEDADSYVDQIMAQLGLVGDSGDVSDVL 123 Query: 133 LSSRYDLLLNPLS---------------LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 D +N S F++ + + + +T + V Sbjct: 124 NMELVDFSVNKNSDATLSNASVMKKQIVDFMKYRAPINTGLSFLSSIQTFKTLSKQSELV 183 Query: 178 SIQWVIDF-SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + S++ + + P +S + + Y S Sbjct: 184 DKRQDYYKEEESVMKTLKSAWEYINKYNSLPLITDENYFSDMQKNLKDSAGWENDYKFSV 243 Query: 237 NKSL----YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 N+ Y ++ + + V+S + + L R + + + Sbjct: 244 NRKTIMDLYDTQNYIDYYCNVQKVNNVESKNAAGDNVRVDLWRFTYSE-TGVAAPLQDYT 302 Query: 293 DTVRMGATFFNDR 305 ++ Sbjct: 303 QYYPKRVGGYDKN 315 >gi|282899257|ref|ZP_06307228.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] gi|281195826|gb|EFA70752.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] Length = 431 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 54/150 (36%), Gaps = 14/150 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT------- 326 + A+ ++ + K ++ F + + P + L + I + Sbjct: 131 ITGAINAIGQFAKVSSERGGDTQISIVPFGEAGKNCPEYPVDKDTLDKFISASDFKLQNS 190 Query: 327 --FAIDENEMGSTAINDAMQTAYDTIIS--SNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + N GST + ++ + + + + ++ + + I+LL+DG + Sbjct: 191 LEYLSTLNPCGSTNLYQPLKKSLQFLGNPEDPRFTLPENSSDPQPRLSIILLSDGYHNAM 250 Query: 383 N--EEGIAICNKAKS-QGIRIMTIAFSVNK 409 N + + + K + I + T+ + + Sbjct: 251 NEYRDFSELKSLLKDYENITVHTLGYGLTP 280 >gi|219883044|ref|YP_002478208.1| TadE family protein [Arthrobacter chlorophenolicus A6] gi|219862050|gb|ACL42391.1| TadE family protein [Arthrobacter chlorophenolicus A6] Length = 126 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 37/134 (27%), Gaps = 27/134 (20%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++S G + AL++P+++ V +++ R + A + + Sbjct: 4 LRSERGSVAVEFALILPILIAVLLGIMEFGRAYNA----QITVTAAAREGARVMSIQGSP 59 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ + N + V T ++ Sbjct: 60 ALAKTAV----------------QAASPALNPQLSTGQIQVSPTTCTAGAN-------VT 96 Query: 135 SRYDLLLNPLSLFL 148 LN ++ FL Sbjct: 97 VTVTYRLNFITGFL 110 >gi|162448738|ref|YP_001611105.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] gi|161159320|emb|CAN90625.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] Length = 1041 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 50/185 (27%), Gaps = 22/185 (11%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ K L +A A + D + A ++ G Sbjct: 453 SMAATAGAHTKLELANEAAARSASLLGAGDRLGVAHVDTAVRWS--------VPLGPVAD 504 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I K G ++ ++ AY + + + K+++L DG + Sbjct: 505 GAAIEKAIRAVGPGGGGIYVDITLEAAYQALD-----------RDTSSLKHVLLFADGSD 553 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 ++ A+ A +G +A E + F+ L Sbjct: 554 AENMGPCRAMVEAAMRRGTTTSVVALGQGSDVPELEAL---SRLGGGRFYLVEDATRLPA 610 Query: 440 IFRDR 444 +F Sbjct: 611 VFTQE 615 >gi|308472935|ref|XP_003098694.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] gi|308268294|gb|EFP12247.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] Length = 405 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 48/152 (31%), Gaps = 8/152 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-ND 341 I T R+G +N + + A+ + + Sbjct: 76 TRIGSSSAEPRTTRLGLVTYNSVATQNADLNQYQSIEDAYYGIYGALSTTVNTTESYLTT 135 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A + + N +K I++ N + + + I N+ K+ G+ I+ Sbjct: 136 GLNAAVELFSRQSFRS-----NRQHYRKVIIVYASEYNGRGEFDPVPIANRLKASGVNII 190 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 TIA+ LS ASP F ++ Sbjct: 191 TIAYEQP--GSAGLLQGLSQIASPGFSFSGDN 220 >gi|253582503|ref|ZP_04859725.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] gi|251835648|gb|EES64187.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] Length = 376 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 49/172 (28%), Gaps = 22/172 (12%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 K + + + K V + + V S + + + Sbjct: 58 GAKTKIWSIVNEVM----QTHKDSKVKIGLVAYRDRGDVYVTKVTQLSENLDE-----IY 108 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENTQD 382 + + G + ++ A + + + I L+ D ++ D Sbjct: 109 SVLMGYKAQGGGDDPEDVRKALHESLEVIQWSTP----RENLSQIIFLVGDAPPHDDYND 164 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 + + KAKS+GI I TI K Y+ A +F + Sbjct: 165 SPDTSDTAKKAKSRGIIINTIQCGDMP----KTDYYWKAIAQFGGGEYFHIS 212 >gi|225310541|emb|CAQ19231.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 806 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 56/174 (32%), Gaps = 19/174 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +++D +A+++ + + R+G ++ V + + + + + I + Sbjct: 458 FEIIKDFVAALVDRLTI---GRNATRVGLVLYSLEVQLEFNLA--RYTTKQDIKEAIRRV 512 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A+ A S KK +++TDG+ + + I Sbjct: 513 HYIGEGTYTGSAIHNATHEAFYSAR---------TGVKKVAIVITDGQTDKREPVKLEIA 563 Query: 391 -NKAKSQGIRIMTIAFSVNKTQ-QEKARYFLSNCAS---PNSFFEANSTHELNK 439 +A + I + + Q + LS AS + + L Sbjct: 564 VREAHAANIEMYALGIMNISNPTQNEFLQELSLIASHPDSEHMYYIKDFNTLPA 617 >gi|170746809|ref|YP_001753069.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170653331|gb|ACB22386.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 204 Score = 43.1 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 16/150 (10%), Positives = 39/150 (26%), Gaps = 18/150 (12%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHA----------LKQAAQTA 60 + + G + AL++P+++ + ++ R L A Sbjct: 8 LSRFRRDERGIAAVEFALVLPLLIILYFGTAELTRVVDATRKLTLFARTLGDLSGRMDNA 67 Query: 61 I--------ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 + I + I + S + I L ++ +N T R Sbjct: 68 LATQDGMTKIAGAATAILRPLDASGLQIVVNAMGVESINGTLKGFVCSSWPQNATKRPAN 127 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 + P + ++ + Sbjct: 128 QANGSNGLPATPAAYQFDGARYILAEVTMP 157 >gi|327191361|gb|EGE58388.1| hypothetical protein RHECNPAF_310002 [Rhizobium etli CNPAF512] Length = 193 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 44/148 (29%), Gaps = 11/148 (7%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ +++LI+ G I A+L PV++ + ++ A Sbjct: 8 TRLALTARRLIRERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKR--------ATRA 59 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV---E 121 + + + + S K++ T + + +L T V + Sbjct: 60 AGSIADLVTQQQSITKSTLTEMRSVATAIFVPYNSTSLTLKITGITVDASANPKVLWSWA 119 Query: 122 MNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + + S + + FL Sbjct: 120 QDGSAPYAKNTAVSNIPADMKTANSFLV 147 >gi|3273261|dbj|BAA31174.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 39/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|327277464|ref|XP_003223484.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like, partial [Anolis carolinensis] Length = 750 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 52/168 (30%), Gaps = 24/168 (14%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAINDAMQ 344 +D + V++GA F++ + S + +K G T A++ Sbjct: 79 DALDISPEKVKVGAIEFSNTAYLEFSLD---AYFTKHQIKDKLKRIVFKGGRTETGLALK 135 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + N K +++LTDG++ + + K GI + + Sbjct: 136 Y---ILRKGFHGSR-----NSTVPKILIILTDGKSQGNIAAP---AKQLKEMGITVFVVG 184 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDRIGNEI 449 S + ++ L AS A T + F + Sbjct: 185 VSFPRWEE------LHILASDPTEWHLLFAEDTDDAVNGFYTTLTRSA 226 >gi|222101616|gb|ACM44013.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 38/120 (31%), Gaps = 7/120 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D N +R+ T ++ +F R + M T M Sbjct: 75 TVDLDNTDIRLSLTTYSTPTRQIFTFLDAAASSTR----LALTKLDWMAGTKARSGMTYT 130 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K ++L+TDG ++ + + + +G+ +M I Sbjct: 131 GRALNYVRKAILP--YGRKNVPKALLLITDGVSSDGSY-TAQVAAMLRDEGVNVMVIGVG 187 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 58/204 (28%), Gaps = 29/204 (14%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + + + + A+ + + D R+G FN Sbjct: 98 SGSMWDALDGTPTEDPDRMRITHAKRAIREFLPLLSGRD------RVGLAVFNRTYRVIQ 151 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 ++ + E T + +M+ A ++ Sbjct: 152 PIVGDPSLVLEKLDAIERPS-REQAYTELYRSMEEALTDFGEEGR------------RRV 198 Query: 371 IVLLTDGEN-----TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 +V+L+DGEN ++ + A GI I F K + + Sbjct: 199 LVVLSDGENFPVDPSESPSTPGTAIDLAHRYGITCYVIHFGTEKDRLIGDLASE----TG 254 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 F+A + EL ++ I ++ Sbjct: 255 GRVFDARNALELASVY-TAIQEQV 277 >gi|301613500|ref|XP_002936246.1| PREDICTED: collagen alpha-1(XII) chain-like [Xenopus (Silurana) tropicalis] Length = 3127 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 50/160 (31%), Gaps = 20/160 (12%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D +D V++ ++ S+ H ++++ A + G+T A+ Sbjct: 1229 FDIGSDRVQIALAQYSGDPRSEWQL--NAHSTKKSLMDAVANLPYKGGNTLTGMALNFIL 1286 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + ++KK VL+TDG++ D K + QGI + I Sbjct: 1287 Q------NNFKAEAGMRAKSKKIGVLITDGKSQDDIVAPS---KKLRDQGIDLFAIGI-- 1335 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANSTHELNKIFRDR 444 + L S F L I D Sbjct: 1336 ----KNADENELRQIGSDPDETYVFNVADFSLLVNIVDDL 1371 >gi|257879128|ref|ZP_05658781.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257813356|gb|EEV42114.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1258 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 38/413 (9%), Positives = 82/413 (19%), Gaps = 49/413 (11%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A + N F + +KK D E Sbjct: 61 AGEEDRNVPAQTVQLWGNERNFENSYL-----DYNGAYIKKWVEPVLPSSTASDLHPEDA 115 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++P+ + S T + E + + Sbjct: 116 TTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDL 175 Query: 184 DFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 S+ ++ Q + + + S+N + +P Sbjct: 176 LSDPSLDIRIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPL 235 Query: 233 MVSCNKS--LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV--RDALASVIRSIKKI 288 + Y G V + + Sbjct: 236 TLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVME 295 Query: 289 DNVNDTVRMGATFF-----NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 D + + + F N R + N A Sbjct: 296 DKIAEDNDVLFRNFEEFSGNTSNAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKAT 355 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 D ++N + + A + + E + I +I Sbjct: 356 HWVGDGSSNNNTNGSPTSQEKSSAINTVAY-HHWLKNKYQENPPS-----------IFSI 403 Query: 404 AFSVNKTQ------QEKARYFLSNCAS------PNSFFEANSTHELNKIFRDR 444 ++ R L N A ++ AN+ +++ D Sbjct: 404 GLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKNDIVTALEDI 456 >gi|194216197|ref|XP_001914777.1| PREDICTED: collagen, type XII, alpha 1 [Equus caballus] Length = 3120 Score = 42.7 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 45/133 (33%), Gaps = 18/133 (13%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 HK +++++ A + G+T A+ + R +K VL+ Sbjct: 1254 NAHKDKQSLLEAVANLPYKGGNTLTGMALNFIRQQSFKTQAGMRPRA------RKIGVLI 1307 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG++ D E K K +G+ + + + L A+ + Sbjct: 1308 TDGKSQDDVEAPS---KKLKDEGVELFAVGI------KNADEVELKMIATDPDDTHAYNV 1358 Query: 432 NSTHELNKIFRDR 444 L++I D Sbjct: 1359 ADFESLSRIVDDL 1371 >gi|167719780|ref|ZP_02403016.1| hypothetical protein BpseD_12240 [Burkholderia pseudomallei DM98] Length = 577 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 15/133 (11%), Positives = 34/133 (25%), Gaps = 14/133 (10%) Query: 21 HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV--PLIQSLEEVSSR 78 F ++ A+ M V + G VD+ + L++ A A + + + ++ Sbjct: 1 SFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAA 59 Query: 79 AKNSFTFPKQKIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 A + + T + +V Sbjct: 60 ANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFLGP 119 Query: 128 AYQVVLSSRYDLL 140 + V +S Sbjct: 120 SRTVSATSTAKAT 132 >gi|156364713|ref|XP_001626490.1| predicted protein [Nematostella vectensis] gi|156213368|gb|EDO34390.1| predicted protein [Nematostella vectensis] Length = 1043 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 40/107 (37%), Gaps = 9/107 (8%) Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQ 396 ++ + + R + KK +L+T G T+D + + + ++ Sbjct: 172 GPGLRIDKALKAADKDLFNRRFGMREDQKKVCLLVTSGAQTKDQGPSTQLGTVTALLSAR 231 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFR 442 G+ I + E L N AS + + A+S E+NK+ Sbjct: 232 GVDIYAVGVGDGVDSSE-----LRNIASTEDFIYTASSFEEINKVLE 273 >gi|156084610|ref|XP_001609788.1| thrombospondin-related anonymous protein [Babesia bovis] gi|154797040|gb|EDO06220.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 660 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 38/120 (31%), Gaps = 7/120 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D N +R+ T ++ +F R + M T M Sbjct: 75 TVDLDNTDIRLSLTTYSTPTRQIFTFLDAAASSTR----LALTKLDWMAGTKARSGMTYT 130 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K ++L+TDG ++ + + + +G+ +M I Sbjct: 131 GRALNYVRKAILP--YGRKNVPKALLLITDGVSSDGSY-TAQVAAMLRDEGVNVMVIGVG 187 >gi|186686512|ref|YP_001869708.1| vault protein inter-alpha-trypsin subunit [Nostoc punctiforme PCC 73102] gi|186468964|gb|ACC84765.1| Vault protein inter-alpha-trypsin domain protein [Nostoc punctiforme PCC 73102] Length = 818 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 49/165 (29%), Gaps = 16/165 (9%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F+D R + + N G T + ++T + + Sbjct: 330 NPDDTFNIIDFSDTTQQLSPVPLPNTSANRLLAINYINRLNAGGGTEMLGGIRTVLN-LK 388 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 ++N + IVLLTDG +N+ + + G R+ + + + Sbjct: 389 ATNPGRLRN----------IVLLTDGYIGNENQILAEVKQRL-QPGTRLHSFGAGSSVNR 437 Query: 412 --QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + A E+ + F +I N + + Sbjct: 438 FLLNRIAELGRGIA--RIIRHDEPVDEVVEKFFRQINNPVLANIQ 480 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 50/390 (12%), Positives = 101/390 (25%), Gaps = 49/390 (12%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-MNPR 125 L Q + A + + Y + N++ + N + + +T Sbjct: 242 HLYQKTTVSQTFASIEPDYTTDQSGTYPVANWKTDENSNVLNHQGNKDAGETWDGITTWN 301 Query: 126 KSAYQVVLS-SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVI 183 + S Y + + LR ++ V + ++ + + V+ Sbjct: 302 GDPANLTHSYIEYGGVGDEADFALRKFAKETNTPGLFDVYLNVRGNVQRQIDPIDVVLVV 361 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D+S SM + R +E + GI+D Y+ + Y Sbjct: 362 DWSGSMNEMGRIAEVKKGVDRFLNQIEG----------SGIQDSVYMGYVGYSSDGSNYQ 411 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 S ++ I+ A + + + N Sbjct: 412 NKTC---------QLGKFSEVKETIRSMTPETAAGGTFTQR------GLRQAGDMLSTQN 456 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + GV + K + TA + +T N +N Sbjct: 457 GHKKVIVLLTDGVPTYSYHVSKVHTQADGSYYGTAFSLTQDQPMNTSFLYNGYFAFDQQN 516 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF----- 418 N + NT G A+ K +GI I + + Q Sbjct: 517 NYKWI---------NNTFIATIGEAM--ALKERGIEIHGLGIQLQGDQTAGYTKADVEKK 565 Query: 419 LSNCASPNS----FFE-ANSTHELNKIFRD 443 + S + ++E AN ++ Sbjct: 566 MRQMVSADEDGHLYYESANEAADIADYLEK 595 >gi|196232430|ref|ZP_03131283.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196223502|gb|EDY18019.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 879 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 41/147 (27%), Gaps = 12/147 (8%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + + K G I +M A+ + H + Sbjct: 469 DTKPHTVVPLAPISAKGAAEQKILSITAG--GGGIYIYTSMVEAFQQLRDIPARVKHLLL 526 Query: 363 NNLEAKKYIVL---LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + A ++DG T N + + + I + +Q+K FL Sbjct: 527 FSDAADAEEKAAGEMSDGIRTGGNSL--DLASAMLAAKITTSVVGLGT---EQDKDTPFL 581 Query: 420 SNCA--SPNSFFEANSTHELNKIFRDR 444 A F+ + L +IF Sbjct: 582 RQLAERGSGRFYLTDDATTLPQIFSTE 608 >gi|58429543|gb|AAW78175.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|309364927|emb|CAP23537.2| CBR-CLEC-143 protein [Caenorhabditis briggsae AF16] Length = 666 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 79/285 (27%), Gaps = 22/285 (7%) Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 V D + S+ LN F + + Sbjct: 231 DQDPVSVYEFSRVSDPNSPHSSSNSFSDSFYLNNFDVISSAANSQIFEASLCAFCSPIPS 290 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 V + + PG + + V S + VR+ L V+ SI KI Sbjct: 291 ISDDVYSD-----PICPGNIKNLWLDVVVVVDKSQLMTNAQLWQVRNTLTQVLGSISKIG 345 Query: 290 NVNDTVRMGATF-----FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 V +T ++D + K + + +T + + Sbjct: 346 PVKYPADPRSTCVGIVTYDDNATTQSQL--DASKSFSDLYNVIQSSLISVDNTNTSY-LS 402 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + +D +R K I D + + + I N+ K + I+T+A Sbjct: 403 LALLAAEKALKDGRNRTYRFNYKKVIIAFAADYQGHGTALDAMPIANRLKDNAVTIITVA 462 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHE---LNKIFRDRIG 446 T + + ASP F+ + L F + G Sbjct: 463 ----CTSNSDKQTAIQGIASPG--FDLVDEMDTPKLPTAFAQKSG 501 >gi|148657455|ref|YP_001277660.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569565|gb|ABQ91710.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 429 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 57/180 (31%), Gaps = 19/180 (10%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + R AL +++ + NV SD + R + Sbjct: 191 KTVIARQALIALVERLPATTNVALRTYGHRR---ADDCSDTELVQAPAPIQRADLINRIN 247 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIA 388 + A ++ D + IVL++DG+ T + A Sbjct: 248 AIRPVNGGRTPIA-----QSLEDMARDLAGVDG-----EVLIVLVSDGDETCGGDPVATA 297 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 + +R+ I F++ +QE+ R L A ++F+A + +L + Sbjct: 298 AALHTANPRLRVSVIGFNI---EQEEWRRRLEGIAAYGGGAYFDAANAVQLADALEQAVA 354 >gi|58429467|gb|AAW78137.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429537|gb|AAW78172.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429535|gb|AAW78171.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429461|gb|AAW78134.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|308472851|ref|XP_003098652.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] gi|308268252|gb|EFP12205.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] Length = 396 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 59/157 (37%), Gaps = 13/157 (8%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND- 341 I + T R+G +N ++ I ++ D + + +TA + Sbjct: 71 TRIGSNSSEPRTTRVGLVTYNSNATQKADL--NKYQSIDDVLNEIYDDISTVVNTADSYL 128 Query: 342 --AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 +Q A +I +E+ N K+ +++ + + + + N+ K I Sbjct: 129 ATGLQLAEKMLIDQSENT-----NRAHYKRVVIVYASEYKGEGELDPLNVANRLKLSDIN 183 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 I+T+A+ + + LS ASP F N+ + Sbjct: 184 IITVAYEQKGD--DGLFHDLSQIASPG-FSFVNNASD 217 >gi|226487570|emb|CAX74655.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 837 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 42/160 (26%), Gaps = 17/160 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIIS 352 R F + S + + D N G T +A+++A Sbjct: 334 NCRFQIIGFGSNFAALFSEPTDYSEDSLNAAMNYQKDLNADMGGTEAYNALKSAL----- 388 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 H + K I+ LTDG+ +E + + R+ TI + Sbjct: 389 ------HSSPSGEGWFKQIIFLTDGDVGNADEVIGLV--RMNVDKARVFTIGLGQGVSTA 440 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + +L + + + R Sbjct: 441 LIGGV---ARVGNGTAAYVRDASQLQSAVMETLSAALQPR 477 >gi|226487566|emb|CAX74653.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 42/160 (26%), Gaps = 17/160 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIIS 352 R F + S + + D N G T +A+++A Sbjct: 329 NCRFQIIGFGSNFAALFSEPTDYSEDSLNAAMNYQKDLNADMGGTEAYNALKSAL----- 383 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 H + K I+ LTDG+ +E + + R+ TI + Sbjct: 384 ------HSSPSGEGWFKQIIFLTDGDVGNADEVIGLV--RMNVDKARVFTIGLGQGVSTA 435 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + +L + + + R Sbjct: 436 LIGGV---ARVGNGTAAYVRDASQLQSAVMETLSAALQPR 472 >gi|222101620|gb|ACM44015.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 38/120 (31%), Gaps = 7/120 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 +D N +R+ T ++ +F R + M T M Sbjct: 75 TVDLDNTDIRLSLTTYSTPTRQIFTFLDAAASSTR----LALTKLDWMAGTKARSGMTYT 130 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + K ++L+TDG ++ + + + +G+ +M I Sbjct: 131 GRALNYVRKAILP--YGRKNVPKALLLITDGVSSDGSY-TAQVAAMLRDEGVNVMVIGVG 187 >gi|326922309|ref|XP_003207392.1| PREDICTED: collagen alpha-1(VI) chain-like [Meleagris gallopavo] Length = 998 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKI-DNVNDTVRMGA---------TFFNDRVISDPS 311 + V + D +A V + D + + ++D V+ Sbjct: 43 DTSESVALRVKPFGDLVAQVKDFTNRFIDKLTERYFRCDRFLAWNAGALHYSDSVVIIKD 102 Query: 312 FSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + R +K N +G T + A++ + ++ ++L+ KY Sbjct: 103 LTAMPSG--RAELKNSVSAINYIGKGTHTDCAIKQGIERLL--------VGGSHLKENKY 152 Query: 371 IVLLTDGENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 ++++TDG + +E N+AK GI++ ++A L+ A+ + Sbjct: 153 LIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAI-----SPHHLDQRLNIIATDH 207 Query: 427 SF---FEANSTH-----ELNKIFRDRI 445 ++ F A S ++ + + I Sbjct: 208 AYRRNFTATSLKPTRDLDVEETINNII 234 >gi|301781320|ref|XP_002926070.1| PREDICTED: collagen alpha-1(VII) chain-like [Ailuropoda melanoleuca] Length = 2994 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 44/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|281343115|gb|EFB18699.1| hypothetical protein PANDA_015680 [Ailuropoda melanoleuca] Length = 2904 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 44/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ +++ + G+T A+ D + Sbjct: 49 VRFAAVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 106 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 107 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 150 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 151 DPEELKRVASQPTSDFFFFVND 172 >gi|241888979|ref|ZP_04776283.1| G5 domain-containing protein [Gemella haemolysans ATCC 10379] gi|241864228|gb|EER68606.1| G5 domain-containing protein [Gemella haemolysans ATCC 10379] Length = 2200 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 28/304 (9%), Positives = 71/304 (23%), Gaps = 6/304 (1%) Query: 34 LGVGGMLVDVVRWSYYEHALKQA----AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 +G + + + + A +QTA + S + + K+K Sbjct: 29 VGAASVAILSGLFFLGGADAQAAETGSSQTAAQAVATESGSSATANTWEKAITTVVEKEK 88 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 EN + + V + T E + SS + Sbjct: 89 STNVAATGTENKPVETVAPKAVEA--KTTPKETVATAPKAETKESSVVTSKAETREASVE 146 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + ++ +AE + + G + +S ++ Sbjct: 147 TPKAETREATPAPKAEVATANNTANVGSTRSVDNGYSGFRAAPAPTNQPVTNLGNNTVFT 206 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + S G R + V+ + + + + + Sbjct: 207 DISRGSHSFRGNGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFILDKGNGFG 266 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + K I + V + V ++ + + I ++ Sbjct: 267 EPSRATVTTNRLRDQTKDIRKGANFVSHSSYSLTSAVETNTNQTIRFSLPIANPNGDLSV 326 Query: 330 DENE 333 Sbjct: 327 RLRP 330 >gi|2944425|gb|AAC05284.1| complement factor C2 [Mus musculus] Length = 470 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 44/154 (28%), Gaps = 16/154 (10%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V ++I ++ D T + + Y + S + E + I+ Sbjct: 30 SQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTII 89 Query: 373 LLTDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCA 423 LLTDG++ + + + I I I ++ S Sbjct: 90 LLTDGKSNMGDS-PKKAVTRIRELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKD 148 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F L +IF V ++T Sbjct: 149 GERHAFILQDAKALQQIFEHM------LDVSKLT 176 >gi|160720|gb|AAA29775.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 574 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 42/138 (30%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + A+ F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GKTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|124513464|ref|XP_001350088.1| Thrombospondin-related anonymous protein, TRAP [Plasmodium falciparum 3D7] gi|160691|gb|AAA29767.1| sporozoite surface protein 2 [Plasmodium falciparum] gi|23615505|emb|CAD52497.1| Thrombospondin-related anonymous protein, TRAP [Plasmodium falciparum 3D7] Length = 574 Score = 42.7 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 42/138 (30%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + A+ F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GKTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|160858159|emb|CAP19999.1| collagen type VI alpha 5 [Homo sapiens] Length = 527 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 8/86 (9%) Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 E H + K+ ++++TDGE+ ++ +++GI I + + + Sbjct: 13 EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTALE-LRNKGITIFAVGVG------KANQ 65 Query: 417 YFLSNCASP-NSFFEANSTHELNKIF 441 L A N+ ++ +L +F Sbjct: 66 KELEGMAGNKNNTIYVDNFDKLKDVF 91 >gi|317483399|ref|ZP_07942390.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915154|gb|EFV36585.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 401 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 49/385 (12%), Positives = 112/385 (29%), Gaps = 33/385 (8%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + + TF + + + + ++ + V + Sbjct: 47 AAFNNSDDGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAV 106 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 ++++ + +++ S S + S S I + Sbjct: 107 TMHYMGSLEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFG 161 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLY 241 I S+++ D G V + + + + + Sbjct: 162 IAKSKAVKLGWADDTGAAKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPS 221 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + ++ S+S E + + + I+ + V Sbjct: 222 WTIWVVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIP 271 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F + + + A + G T I + + +A D + S +E + Sbjct: 272 FETEAHRPVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT 327 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 IVL+TDG + D+++ K++ + + I +I F Q K+ L Sbjct: 328 --------AIVLMTDGRSNSDHQDEFESSYKSRGRDLPIFSIMFGDADPSQLKSLATL-- 377 Query: 422 CASPNSFFEANSTHELNKIFRDRIG 446 S F+ S +L +FR G Sbjct: 378 --SNAKVFDGRSG-DLAAVFRQAKG 399 >gi|156371145|ref|XP_001628626.1| predicted protein [Nematostella vectensis] gi|156215607|gb|EDO36563.1| predicted protein [Nematostella vectensis] Length = 484 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 44/154 (28%), Gaps = 15/154 (9%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + ++I S K F+ R +F+ Sbjct: 290 AKRFVKALIGSFKVSQKGTHVGI---IRFSTRAKVMFTFTEHFTHEDVNYAIDDIE--YT 344 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A++ A + S + K VL+TDG + + Sbjct: 345 EGGTKTELALRLARTELFSKQGGSR----TSPLIFKLFVLMTDGRSEYFHAVARQ-AKML 399 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 K G+ +M + Q+E L AS S Sbjct: 400 KRSGVHVMAVGIGKYTNQRE-----LEVIASSKS 428 >gi|47214246|emb|CAG12465.1| unnamed protein product [Tetraodon nigroviridis] Length = 467 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 42/116 (36%), Gaps = 18/116 (15%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A++TA + ++ E N+ + + V++TDG + E A A+ Sbjct: 106 GTMTGLAIRTAVEKAFAA---EAGARLNSTKVARVAVVVTDGRPQDEVERVSA---AARE 159 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS---THELNKIFRDRI 445 GI I + R L AS + F + +L FR+ + Sbjct: 160 SGIEIYAVGV------DRADRTSLRLMASQPHEDHVFYVETYGVIEKLTSRFRETL 209 >gi|317125812|ref|YP_004099924.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589900|gb|ADU49197.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 577 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/399 (8%), Positives = 101/399 (25%), Gaps = 22/399 (5%) Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A+ A + A ++ + S + + + ++ Sbjct: 180 AEAAKLDAQPAWLELVASDSPIRMSDPRSTTAGMLTLSTTLPLLSPEQGRVIIPKLAQQM 239 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + + + +S + L+ R + AE Sbjct: 240 AASDDQLFATYTSEPAKASAFPTSEAALAEHNRVNPQNKMVAVIPAEGTPSFEYTLINVS 299 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + + + L FG + + + G +G +P Sbjct: 300 TNPAKAQGIEALRAFLRSEEAAKTLASFGLRSTAVPVTLPTPPGSIGEIKPGSTPNAAQV 359 Query: 237 NKSLYYMLY-PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 ++ + + + + ++A + ++ + + V Sbjct: 360 TAAIDVWQAATTSFQLLSVFDVSGSMNEKVGNTTRVRITQEAAGIALNALPRSTKLGLWV 419 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R + A ++ + + + Sbjct: 420 FSSDKGGGRDYKELVPLGLLSDDAQRARMAAAAASLSKE---------VDGWTGLYDTIW 470 Query: 356 DEVHRMKNNLEAKKY--IVLLTDGENT--QDNEEGIAICNKAKSQG-----IRIMTIAFS 406 ++K + + ++ +V+LTDG+N + K K I I TI Sbjct: 471 AAYSKVKASYDPQRVNAVVILTDGKNEDPGGGLSLEQLLAKIKDATDPKRPIAITTIGIG 530 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + + +S + ++ A + ++ + + Sbjct: 531 PGVDAESLRKI---SRSSYSDYYGAENPADMTTVLAKAL 566 >gi|239908149|ref|YP_002954890.1| hypothetical protein DMR_35130 [Desulfovibrio magneticus RS-1] gi|239798015|dbj|BAH77004.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 328 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 34/118 (28%), Gaps = 13/118 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F R + L++ + TA+ DA+ A + S Sbjct: 128 DRIGLVAFGARAYLVVPPTTDRAALVQAL--ASLDTGAAGRKTAMGDAVGLAAKRLDESP 185 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 K +V+ DG + + + A G+ + + + Sbjct: 186 GQA-----------KAVVVFGDGRSNAGETDPVPAAQAAVRHGVAVFAVGVGGDGPAP 232 >gi|114705523|ref|ZP_01438426.1| hypothetical protein FP2506_13694 [Fulvimarina pelagi HTCC2506] gi|114538369|gb|EAU41490.1| hypothetical protein FP2506_13694 [Fulvimarina pelagi HTCC2506] Length = 180 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 17/43 (39%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 + G I AL+ PV++ + D + + L AA Sbjct: 10 RRDDRGAAGIEFALIFPVLILLAISAADAIHAVSIKARLNNAA 52 >gi|71051532|gb|AAH36192.1| COL14A1 protein [Homo sapiens] Length = 534 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 45/152 (29%), Gaps = 20/152 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + L AS + Sbjct: 294 V------KNADVNELQEIASEPDSTHVYNVAE 319 >gi|301609302|ref|XP_002934202.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 904 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 75/259 (28%), Gaps = 27/259 (10%) Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + +R S S+ + + D K +P N + Sbjct: 238 LPSVTEFCDKNHNTEAPTLQNRICNSRSTWDVIMNSTDIKSTPPQADSNIPVPSFSLLQS 297 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 D ++ +D S + + A + I V + +G F+ + Sbjct: 298 SDRVVTLV--LDVSGSMASGGRIERLYQAAEVFLMQI-----VEEGSYVGILTFSTSISL 350 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + R +K+ N G T I ++ +R + Sbjct: 351 LSNLVQVIENTQRKQLKSLLPT-NAFGGTDICLGIREGIKI---------NRQYDGSSYG 400 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +VLLTDGE+ D + + G I IA N + + + + Sbjct: 401 TELVLLTDGEDNYDTSKCFP---DITNSGAIIHVIALGPNAAKALETIVDM----TEGLR 453 Query: 429 FEANSTHE---LNKIFRDR 444 F A + L + F Sbjct: 454 FLATDKVDTQGLIEAFISL 472 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 44/137 (32%), Gaps = 19/137 (13%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + G T + A++++ + + + +VLLTD Sbjct: 492 TGTREEALVAAVNGLKAKGGTGLYATALAAFESLSAQYQPDKPNQ---------VVLLTD 542 Query: 377 GENTQDNEEGI--AICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 G+N + K+ + I+TI + + + + A+ + + Sbjct: 543 GQNDDPTSSMTLTQLIATLKAEYNPKAPVHIITIGYGADADMDALRQI---SAATGSKTY 599 Query: 430 EANSTHELNKIFRDRIG 446 A + + ++ + + Sbjct: 600 PAQDPNSIFQVMVNALT 616 >gi|170015985|ref|NP_001116166.1| complement component 2 [Xenopus laevis] gi|169642443|gb|AAI60743.1| LOC734198 protein [Xenopus laevis] Length = 662 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 49/189 (25%), Gaps = 20/189 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + ++ + D T++ G + ++ Sbjct: 179 FDIYKACSEYLVDELALFD---MTIQFGIISYATVPKVIIPIYDEESDNNDHVLTLIRNG 235 Query: 331 EN-----EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + T A++ Y + S E + N I+LLTDG+ Sbjct: 236 LKYSDHKDKTGTNTKAALEEIYSMMSSQKETYKNESVWNSIHH-IIILLTDGKANLG-GR 293 Query: 386 GIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + I+ + T + + F S +E Sbjct: 294 PAHTIKRIEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKD-GETHVFRMESANE 352 Query: 437 LNKIFRDRI 445 + +F+ + Sbjct: 353 MKTVFQKIV 361 >gi|82619290|gb|ABB85337.1| complement C2 [Xenopus laevis] Length = 753 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 49/189 (25%), Gaps = 20/189 (10%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + ++ + D T++ G + ++ Sbjct: 270 FDIYKACSEYLVDELALFD---MTIQFGIISYATVPKVIIPIYDEESDNNDHVLTLIRNG 326 Query: 331 EN-----EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + T A++ Y + S E + N I+LLTDG+ Sbjct: 327 LKYSDHKDKTGTNTKAALEEIYSMMSSQKETYKNESVWNSIHH-IIILLTDGKANLG-GR 384 Query: 386 GIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + + I+ + T + + F S +E Sbjct: 385 PAHTIKRIEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKD-GETHVFRMESANE 443 Query: 437 LNKIFRDRI 445 + +F+ + Sbjct: 444 MKTVFQKIV 452 >gi|90425191|ref|YP_533561.1| TadE-like [Rhodopseudomonas palustris BisB18] gi|90107205|gb|ABD89242.1| TadE-like [Rhodopseudomonas palustris BisB18] Length = 176 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 46/162 (28%), Gaps = 3/162 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITASVP 67 + + G + AL+ P+ + + +++V + L+ A + + Sbjct: 12 RALRGFRRHRRGSAAVEFALVAPIFIALLFAIIEVALVFFAGQLLETGTQDAARVFLTQT 71 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIE-EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 E+ N +++F + + V + D+ V P Sbjct: 72 NPALAEDFKKLVCNRVDMLLSCATLRVDVQSFAPGAAIDIKNPIVNGSLVDSFVYQLPPH 131 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 + R L + + T+ A T + Sbjct: 132 LNSNYTVVVRTFYQWPLFVTKLG-FNLSNVGEGTRLLAATAA 172 >gi|310822679|ref|YP_003955037.1| hypothetical protein STAUR_5440 [Stigmatella aurantiaca DW4/3-1] gi|309395751|gb|ADO73210.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 697 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 8/50 (16%), Positives = 19/50 (38%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 F ++ + G ++ AL++ VM V++ + L+ Sbjct: 2 FTRVLRQSFQRQEGQALVMAALMVLVMSIAVITTVNIGHTVHERVRLQNT 51 >gi|119493487|ref|ZP_01624154.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452670|gb|EAW33850.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 843 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 55/167 (32%), Gaps = 12/167 (7%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I ++N F+D + + + + G T + + +QT Sbjct: 370 IQSLNPDDTFSIIDFSDTTTALSATPLTNTVTNQQKAIAYINQLEANGGTELLNGIQTVM 429 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 D +K + IVL+TDG +NE + N+ K G R+ + Sbjct: 430 D-------FPSPPVKRL----RSIVLITDGYIGNENEVLSVVKNQLK-SGNRLYSFGVGS 477 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + + R + T + ++F ++I N + + Sbjct: 478 SVNRFLLNRLAEIGRGTTQITRPDEPTEDQVELFVNQINNPVLTNLQ 524 >gi|300692338|ref|YP_003753333.1| hypothetical protein RPSI07_2704 [Ralstonia solanacearum PSI07] gi|299079398|emb|CBM10247.1| conserved hypothethical protein [Ralstonia solanacearum PSI07] Length = 361 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 32/344 (9%), Positives = 77/344 (22%), Gaps = 11/344 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQSL 72 + G ++ L+ ++ +G + +D+ R + L+ AA A + + L + Sbjct: 12 SNQRGAVGVMMPFLLIIIFAIGVLAIDIARLLVVRNELQNAADAAALAGANYLYRPGTTP 71 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + Y + V Sbjct: 72 GSSYDWSSAQSNALSALRQNYAAGAKLIDADVQTGYWNTASNTFHVQPYTPSSNDIPAVK 131 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + N + L + L T + A + + S + + Sbjct: 132 VYTEKLSGKNGGPVRLYLAPVIQLLAPTVSAAAVAVVASPTTITKGGLLPLVVSSCIYNA 191 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 D P ++ + +S G G S + + Sbjct: 192 SWDFTKSPPAPKK-DSNGSPVIWSFGPGMSGGSSCGDQSLYTSFDDPQKANSAGMQDLFA 250 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI-DNVNDTVRMGATFFNDRVISDPS 311 + + +V+ + V L I D + + D + Sbjct: 251 NHNPNPMSIGGNTYVLDQADSVSKVLLPKINQCSAAGDKTCEYSTVTIVNKTDGSAGNQQ 310 Query: 312 ------FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + + + + M T Sbjct: 311 IVSFACLHILSAHADSGGNYYITAQLSTLCQNSYSGGMTTPSTV 354 >gi|170751927|ref|YP_001758187.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170658449|gb|ACB27504.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 215 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 12/135 (8%), Positives = 44/135 (32%), Gaps = 4/135 (2%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 ++L + G + AL+ L + G ++ ++ + +A Q A+ + Sbjct: 18 LLRRLARDSGGAAAVEFALVALPFLALCGAILQIMFQMWATQNFDRALQNAV----RTIF 73 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ + + + ++ + + + F + V+ V ++ + + Sbjct: 74 TGQFQLDTAGQTNPATVLATLKTRMCGPSSSPVPTVFNCQNVKIDVSTSSSFASATAAKP 133 Query: 130 QVVLSSRYDLLLNPL 144 + + Sbjct: 134 IDTATGTWSTSFGSN 148 >gi|149925131|ref|ZP_01913438.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] gi|149813994|gb|EDM73632.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] Length = 413 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 41/313 (13%), Positives = 85/313 (27%), Gaps = 51/313 (16%) Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D S D +E + + + T +V + S+ Sbjct: 36 DGSGDATDSTTTAEEDSGSTTAEEDESTTGPDCGTVEITPEYVPPNVMLVVDASGSMVNN 95 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + LDP+ + ++ V + + ++ D D + +N Sbjct: 96 SWDHDLDPNTPQVTRWNTL-HGVVDNVMTNFGSTMYAGVQRFPSADACPDATPQSSNCYN 154 Query: 304 DRVISDPS------FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 + + ++ I A + +G T + +A D +++ NE Sbjct: 155 LGSCIVGTQPEVGVGADNGDAILAAIPGPGAGNTEIVGGTPATLGINSAVDHLLAQNE-- 212 Query: 358 VHRMKNNLEAKKYIVLLTDGENT-------------QDNEEGIAICNKAKSQGIRIMTIA 404 +YI+ +TDG D + + +GI + Sbjct: 213 --------TNPRYILFITDGAANCNTDLPYPEYIESYDETLPTTVEAAFEDEGITTYVVG 264 Query: 405 F---------SVNKTQQEKARYFLSNCASPN-----------SFFEANSTHELNKIFRDR 444 + + + L++ A FF + EL Sbjct: 265 IDIVDMLQGAGTDGSPEANPFVRLNDVALAGGAPKNEGMDAEKFFNTTNQQELLDA-LQV 323 Query: 445 IGNEIFERVIRIT 457 I + E VI +T Sbjct: 324 ILEGVTECVIDLT 336 >gi|170732845|ref|YP_001764792.1| TadE family protein [Burkholderia cenocepacia MC0-3] gi|169816087|gb|ACA90670.1| TadE family protein [Burkholderia cenocepacia MC0-3] Length = 164 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 38/140 (27%), Gaps = 7/140 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + + G I A+++PV + ++ + L + A+ +L Sbjct: 10 MRRRQRGATAIEFAMVLPVFFLILYAIITYGMIFAAQQNL-------TLAATEGARAALN 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A S +Q + T + + + V + Sbjct: 63 YQQVGAAASVQAAQQAALAARAQAACTAATNLTTWLKGSTCSPTQQGSCSYDPTMLCVQI 122 Query: 134 SSRYDLLLNPLSLFLRSMGI 153 + Y +PL +G Sbjct: 123 TLTYPYSQSPLIPPFPLLGS 142 >gi|1616950|gb|AAB16869.1| CD11b [Sus scrofa] Length = 920 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 38/109 (34%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + S K +V++TDGE D + +A Sbjct: 90 GRTHTATGIRKVVRELFHSKSGARENAL------KILVVITDGEKFGDPLGYEDVIPEAD 143 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 +G+ I + K+R L+ AS + F+ N+ + I Sbjct: 144 RKGVIRYVIGVG-DAFNSWKSREELNTIASKPCGDHVFQVNNFEAVKTI 191 >gi|58429507|gb|AAW78157.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPF 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RNLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|47207527|emb|CAF87062.1| unnamed protein product [Tetraodon nigroviridis] Length = 409 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 70/218 (32%), Gaps = 27/218 (12%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + ++ +V S + + L ++ + ++D Sbjct: 5 CSTAANDLVYVLDGSWSVGVSDFDTAKQWLINITSQFDISSHYTQVG---VVQYSDAPRL 61 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN-------DAMQTAYDTIISSNEDEVHRM 361 + G H+ +++ G+T A++ A D + SS++ Sbjct: 62 EIPL--GKHQGQDELIRAIQSISYLGGNTQARRRSERHLPAIKFAVDHVFSSSQRASQVK 119 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + V++TDG+ +QD+ ++ +A++QG+ + + T E N Sbjct: 120 N------RIAVVVTDGK-SQDDVVDASM--EARTQGVTVFAVGVGSEITTSELIAIA--N 168 Query: 422 CASPNSFFEANS---THELNKIFRDRIGNEIFERVIRI 456 S A H + ++ E RI Sbjct: 169 KPSSTYVLYAEDYTTIHHIRDAMEQKLCEE-SVCPTRI 205 >gi|48427894|sp|Q8SQ75|CO2_PONPY RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110309|gb|AAL82820.1| complement C2 [Pongo pygmaeus] Length = 752 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 53/197 (26%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S +I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTDVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|84502750|ref|ZP_01000869.1| hypothetical protein OB2597_00960 [Oceanicola batsensis HTCC2597] gi|84389145|gb|EAQ01943.1| hypothetical protein OB2597_00960 [Oceanicola batsensis HTCC2597] Length = 181 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 22/58 (37%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 T + + TG A+++PV V++ ++ + L++ A+ Sbjct: 5 KTSLRNRIRHWLGDETGAVVADFAIMLPVFTMFMLSSVEMGLMTFRQTMLERGLDMAV 62 >gi|304437812|ref|ZP_07397761.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369169|gb|EFM22845.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 255 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 34/100 (34%), Gaps = 6/100 (6%) Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 I + + G TA+ + + A D + + + ++ K + ++VL++DGE Sbjct: 106 ANIERQTVPQLHANGLTAMGEGVNLALDLL--AQRKKEYQDKGVDYYQPWLVLMSDGEPN 163 Query: 381 QDNE----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 D + + + + I A Sbjct: 164 GDPTELRRATQRVTELVNAGKLTVFPIGIGSEPGMDALAA 203 >gi|288932694|ref|YP_003436754.1| hypothetical protein Ferp_2362 [Ferroglobus placidus DSM 10642] gi|288894942|gb|ADC66479.1| conserved hypothetical protein [Ferroglobus placidus DSM 10642] Length = 403 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 44/135 (32%), Gaps = 20/135 (14%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + ++ + G T I A++TA + + + KY++L+T Sbjct: 277 FNHRTEKIREGDIVNVDVEGRTDIALALKTARNAL------------RGKDGAKYVILIT 324 Query: 376 DGENTQDNEEGI---AIC----NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 DGE T I + K K + I + + + + L A S Sbjct: 325 DGEPTASYNPLIPPWKMAVIESAKLKDEDINLNVLMLNDDSRFYALCERMLKA-AGKGSI 383 Query: 429 FEANSTHELNKIFRD 443 F + L Sbjct: 384 FYFPNPLNLKNYIYS 398 >gi|50119742|ref|YP_048909.1| hypothetical protein ECA0797 [Pectobacterium atrosepticum SCRI1043] gi|49610268|emb|CAG73711.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 210 Score = 42.7 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 34/88 (38%), Gaps = 2/88 (2%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +F+ + S + TAL P++L +G + D+ ++Q + I + Sbjct: 16 RFLSGLCRFWFSRRASIAVETALAFPIVLAIGSLCADLYTVGLERTRMEQ--RAGAIASI 73 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 + + Q L+E + P + Y Sbjct: 74 LAMQQKLDENGLQGLLDTVLPTEGAGNY 101 >gi|239934065|ref|ZP_04691018.1| hypothetical protein SghaA1_38057 [Streptomyces ghanaensis ATCC 14672] gi|291442504|ref|ZP_06581894.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291345399|gb|EFE72355.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 141 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 39/137 (28%), Gaps = 6/137 (4%) Query: 1 MVFDTKFIFYSK-KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 M + + + + G I A++ PV+L V ++ W Y + A Sbjct: 1 MHHRARLKAWWRGRRWADDRGDAAIQMAIVYPVVLLVAIAVIQASMWYYAR---QIALTA 57 Query: 60 A--IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 A + A+ S + ++RA+ + + ++ Sbjct: 58 AREGVAAARAYQASPADGAARAREVLGRTAGDSLHSVSVTAGSTGERVRVQVTGTAQSMI 117 Query: 118 TAVEMNPRKSAYQVVLS 134 + + + Sbjct: 118 PGIAGLTITQSASGPVE 134 >gi|107022592|ref|YP_620919.1| TadE-like [Burkholderia cenocepacia AU 1054] gi|116689541|ref|YP_835164.1| TadE family protein [Burkholderia cenocepacia HI2424] gi|105892781|gb|ABF75946.1| TadE-like protein [Burkholderia cenocepacia AU 1054] gi|116647630|gb|ABK08271.1| TadE family protein [Burkholderia cenocepacia HI2424] Length = 164 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 39/143 (27%), Gaps = 7/143 (4%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + + + G I A+++PV + ++ + L + A+ Sbjct: 7 ASGMRRRQRGATAIEFAMVLPVFFLILYAIITYGMIFAAQQNL-------TLAATEGARA 59 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +L A S +Q + T + + + Sbjct: 60 ALNYQQVGAAGSVQAAQQAALAARAQAACTAATNLTTWLKGSTCSPTQQGSCSYDPTMLC 119 Query: 131 VVLSSRYDLLLNPLSLFLRSMGI 153 V ++ Y +PL +G Sbjct: 120 VQITLTYPYSQSPLIPPFPLLGS 142 >gi|288925757|ref|ZP_06419688.1| BatB protein [Prevotella buccae D17] gi|288337412|gb|EFC75767.1| BatB protein [Prevotella buccae D17] Length = 342 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 20/225 (8%), Positives = 60/225 (26%), Gaps = 50/225 (22%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + + + + S++ ++G F Sbjct: 94 ICLDISNSMLAQDVAPSRLDKSKLLVESLVDRFT-------NDKIGLIVFAGDAYVQLPI 146 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + ++ + T I A+ + + + + I+ Sbjct: 147 TSDYVSA-KMFLQNIDPSLIQTQGTDIAQAINLGLHSFTQA-----------DKIGRAII 194 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-----------ARYFL-- 419 ++TDGE+ + A +A+ +G+ + + K + + Sbjct: 195 VITDGEDHEGGAVEAAA--EARKKGVNVFILGVGDTKGAPIPTGDGGYMKDRSGQTVMTA 252 Query: 420 ---SNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 C A + ++T + E+ + R+ Sbjct: 253 LNEQMCREVAQAGSGKYIHVDNTGD--------AQTELNNDLARL 289 >gi|297746279|emb|CBI16335.3| unnamed protein product [Vitis vinifera] Length = 602 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 34/149 (22%), Gaps = 19/149 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + N TV + + + K G T I + ++ Sbjct: 200 STSHNNFTVLIHLKAPLTSGRQNSGTKLALLKRAMGFALQAVNSLVSNGGTNIAEGLRKG 259 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK----------AKSQ 396 ++ I+LL+DG++T ++ Sbjct: 260 AKVMLDRKW---------KNPVSSIILLSDGQDTYTVCSPGGAHSRTDYSLLLPFSIHRN 310 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASP 425 G I + + + C Sbjct: 311 GGTGFQIPVHAFGFAEGVIQDAFAQCIGG 339 >gi|171912852|ref|ZP_02928322.1| hypothetical protein VspiD_16770 [Verrucomicrobium spinosum DSM 4136] Length = 779 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 19/45 (42%), Gaps = 1/45 (2%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +V+LTDG++ + K GI + T+ + + A Sbjct: 209 LVILTDGQHN-GPGSPEEMAAGLKESGIPVFTVGYGSEVPPPDLA 252 >gi|148252253|ref|YP_001236838.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] gi|146404426|gb|ABQ32932.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] Length = 755 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 59/188 (31%), Gaps = 23/188 (12%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +R A AS++ ++ ++ R F+D + S +F Sbjct: 372 TSIRQAKASLLYALGRLQPN---DRFNVIRFDDTMTVLFPSSVPADAEHVGSATSFVSAL 428 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T + AM+ A ++ + + +V LTDG + + I Sbjct: 429 EARGGTEMVPAMRAALT-----------DDGSDSDRVRQVVFLTDGAIGNEQQLFETI-T 476 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 + + RI + Y ++ A +F S ++ + RD Sbjct: 477 AMRGRS-RIFMVGIGS-----APNTYLMTRAAELGRGAFTPIGSVEQVEERMRDLFAKLE 530 Query: 450 FERVIRIT 457 V +T Sbjct: 531 NPVVTGLT 538 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 29/189 (15%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + + A + + V +G F+ Sbjct: 25 DKSGSMATGNRLNRLNQA-----GQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGS 79 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T+I +++A+ I K IVLLTDGE+ Sbjct: 80 DRDTLAKRLPA-AASGGTSICSGLRSAFTVIR----------KKYPTDGSEIVLLTDGED 128 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE-- 436 I+ C N+ K G I T+A + Q+ + + + A+ + Sbjct: 129 N-----TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKM----TGGLQTYASDQVQNN 179 Query: 437 -LNKIFRDR 444 L F Sbjct: 180 GLIDAFGAL 188 >gi|189347157|ref|YP_001943686.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] gi|189341304|gb|ACD90707.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] Length = 2825 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 86/361 (23%), Gaps = 21/361 (5%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 ++ + S L F + Sbjct: 1977 ALDGGNDGDSDVSGIDGDSDGDNAVWSGTLSSLFGVGADDPGRYGLNGTPTGLPVLTSAG 2036 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Y V R ++ S G + + + + ++ V Q Sbjct: 2037 ETVVYDVTGD-RLTAYVDSGSSTGALDGTDRVVFTFEITPIRPTCAAYEFTPVLTQIDHP 2095 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + + + + +G D + + ++D+ P +VS + + + Sbjct: 2096 DAAADNIETLKTIDLSGSVYGYDNDGDRIGLGTVTLVITVQDDVPIPTVVSESFTAKP-I 2154 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 D + ++A+AS+I + V++ F+ Sbjct: 2155 DTNLTIILDVSGSMNDVIPNSGGKTRLQFAKEAIASLINQYDAL----GDVKVQIITFSA 2210 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 G L T + GST +DA+ T ++ + + +N Sbjct: 2211 SAQPII--VSGQVWLDPGTAITQINGLSASGSTNYDDALVDVMTTYNNAGKLSTPQSQN- 2267 Query: 365 LEAKKYIVLLTDGENTQDNEEG-------IAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 L+DG + + I I TQ Sbjct: 2268 -----VGYFLSDGVPNTPATSPGINSPEEQQWESFLTTNNINNYAIGVGAGATQATLDPV 2322 Query: 418 F 418 Sbjct: 2323 A 2323 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 12/78 (15%), Positives = 26/78 (33%), Gaps = 9/78 (11%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 S +++ + + G T + ++ A + ++ N Sbjct: 133 TYSTSGYTYLNNGNTEEDLLKIINSIKADGGTNTGEGLRKANYILD---------LEKNK 183 Query: 366 EAKKYIVLLTDGENTQDN 383 A K IV ++DG T + Sbjct: 184 NADKSIVFMSDGMPTYYS 201 >gi|53713709|ref|YP_099701.1| hypothetical protein BF2418 [Bacteroides fragilis YCH46] gi|253565657|ref|ZP_04843112.1| BatB [Bacteroides sp. 3_2_5] gi|265764033|ref|ZP_06092601.1| BatB [Bacteroides sp. 2_1_16] gi|4838139|gb|AAD30859.1|AF116251_2 BatB [Bacteroides fragilis] gi|52216574|dbj|BAD49167.1| conserved hypothetical protein BatB [Bacteroides fragilis YCH46] gi|251945936|gb|EES86343.1| BatB [Bacteroides sp. 3_2_5] gi|263256641|gb|EEZ27987.1| BatB [Bacteroides sp. 2_1_16] gi|301163418|emb|CBW22969.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 341 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 22/220 (10%), Positives = 58/220 (26%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISKLVD-------GMENDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ + TAI A+ A + + IV++T Sbjct: 150 YISA-KMFLESISPSLISKQGTAIGAAINLAARSFTPQ-----------EGVGRAIVVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A +A +GI++ + + Sbjct: 198 DGENHEGGAVEAAK--EAAKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + + ++++ K I V Sbjct: 256 EAMCQEIAKEGNGIYVRVDNSNSAQKAINQEINKMAKSDV 295 >gi|167566517|ref|ZP_02359433.1| hypothetical protein BoklE_28426 [Burkholderia oklahomensis EO147] gi|167573598|ref|ZP_02366472.1| hypothetical protein BoklC_27443 [Burkholderia oklahomensis C6786] Length = 598 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 26/275 (9%), Positives = 69/275 (25%), Gaps = 5/275 (1%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++ G F I+ A+ + V++ G VD+ Y AL+ A A + A+ + Sbjct: 25 ARRSPARQRGSFAIVAAIAIGVVIAALG-AVDLGNLFYQRRALQGIADMAALAAAQTMDD 83 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFEN----NLKKNFTDREVRDIVRDTAVEMNPRK 126 + ++ A+++ ++K N +A + + Sbjct: 84 GCVKPAATAQSAALGNGFDSSAAGQSMTVVCGRWDVKDNTGPSFFAGSAAGSAAGSDAQL 143 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 +A QV ++ + + G T ++ + + Sbjct: 144 NAVQVTVTRTVPYYFLGSQRTIAATGTAQATNVGAYSIGTTLAQLQGGVVNALLNGLLGA 203 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 L + + + S + Sbjct: 204 NLNLSVLSYQGLANSRIRIKDLMAAANVGTVNALLNTQTTVPQLASWMLTALSRTSVANT 263 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV 281 + + V ++ ++ Sbjct: 264 DLQTSIGALQTIVSANIPGGQTFTIGNTANSTGIF 298 >gi|160714|gb|AAA29772.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 565 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429459|gb|AAW78133.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429495|gb|AAW78151.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429503|gb|AAW78155.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429541|gb|AAW78174.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|226137|prf||1411304A thrombospondin related protein TRAP Length = 559 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|136153|sp|P16893|TRAP_PLAFA RecName: Full=Thrombospondin-related anonymous protein; Flags: Precursor gi|9978|emb|CAA31440.1| unnamed protein product [Plasmodium falciparum] Length = 559 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 59/187 (31%), Gaps = 28/187 (14%) Query: 250 DPSLSEEHFVDSSSLRH-VIKKKHLVRDALASVIRSIK-------KIDNVNDTVRMGATF 301 D +S +D+S + D + I + V+ T G Sbjct: 42 DCPISVYFVIDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQNQVSITWMFGGLH 101 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++D V + + K AI T + A+ S +V Sbjct: 102 YSDVVEIYSPLT---RSKDTYLTKLRAIR-YLGRGTFTDCAISNMTQQFQSQTARDV--- 154 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K+ V++TDG T G+ + +A+ GI++ +A ++ L Sbjct: 155 -------KFAVVITDGHVTGSPCGGMKMQAERARDMGIKLFAVA-----PSEDVYEQGLR 202 Query: 421 NCASPNS 427 ASP Sbjct: 203 EIASPPH 209 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 59/187 (31%), Gaps = 28/187 (14%) Query: 250 DPSLSEEHFVDSSSLRH-VIKKKHLVRDALASVIRSIK-------KIDNVNDTVRMGATF 301 D +S +D+S + D + I + V+ T G Sbjct: 42 DCPISVYFVIDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQNQVSITWMFGGLH 101 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 ++D V + + K AI T + A+ S +V Sbjct: 102 YSDVVEIYSPLT---RSKDTYLTKLRAIR-YLGRGTFTDCAISNMTQQFQSQTARDV--- 154 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 K+ V++TDG T G+ + +A+ GI++ +A ++ L Sbjct: 155 -------KFAVVITDGHVTGSPCGGMKMQAERARDMGIKLFAVA-----PSEDVYEQGLR 202 Query: 421 NCASPNS 427 ASP Sbjct: 203 EIASPPH 209 >gi|227661|prf||1708291A thrombospondin related protein Length = 559 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + F++ K I+++ Sbjct: 72 AMKLIQQLNLND---NAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|226487568|emb|CAX74654.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 42/160 (26%), Gaps = 17/160 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIIS 352 R F + S + + D N G T +A+++A Sbjct: 329 NCRFQIIGFGSNFAALFSEPTDYSEDSLNAAMNYQKDLNADMGGTEAYNALKSAL----- 383 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 H + K I+ LTDG+ +E + + R+ TI + Sbjct: 384 ------HSSPSGEGWFKQIIFLTDGDVGNADEVIGLV--RMNVDKARVFTIGLGQGVSTA 435 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + +L + + + R Sbjct: 436 LIGGV---ARVGNGTAAYVRDASQLQSAVMETLSAALQPR 472 >gi|254460888|ref|ZP_05074304.1| TadE-like protein [Rhodobacterales bacterium HTCC2083] gi|206677477|gb|EDZ41964.1| TadE-like protein [Rhodobacteraceae bacterium HTCC2083] Length = 181 Score = 42.7 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 21/56 (37%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 K + + G+ + AL+ PV + +++ + + L++ A Sbjct: 2 KKLLNTVRNFRSGERGNVTVEFALVFPVFAMILTSSIEMGIITVRQTLLERGLDIA 57 >gi|296453244|ref|YP_003660387.1| von Willebrand factor type A (vWA) domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296182675|gb|ADG99556.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 565 Score = 42.7 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 51/397 (12%), Positives = 111/397 (27%), Gaps = 18/397 (4%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 + A Q+ ++ + +++ + + + + + ++ Sbjct: 182 MTSATQSNSGASAYLAFMTALRGGDGPLTEADLADDQLKTSVTKLLSGVDRSSGSSDWLK 241 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 D+V + + +V+ + L + L + Sbjct: 242 DMVVANPDRFDAMVNYESLVIQADKALDKDGHDPLLTVYPSDGIAVSDSPFGYVDRGQKL 301 Query: 173 KEHGVSIQWVI---DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 K S + D R + + + + K Sbjct: 302 KSAFDSFSKALGSKDAKLEFERAGRRTGLGGTLAYATDSQVKQSFRAEWGVNTDASALKT 361 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 P + +Y L VD S K +V+ A++ + Sbjct: 362 IPLPAATVIDSALNVYQTALRKPSWTIWVVDYSGSMSGEGKNGVVKGLNAALDP--DQAK 419 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 G ++ + + A + G T I + + +A D Sbjct: 420 KSYIEPASGDVNILIPFETEAHRPVKATGTSTSDLLHEADATDASGGTDIYEGLLSALDE 479 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + S +E + IVL+TDG + D+++ K++ + + I +I F Sbjct: 480 LPSESEASQYTT--------AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDAD 531 Query: 410 TQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 Q K+ L S F+ S +L +FR G Sbjct: 532 PSQLKSLATL----SNAKVFDGRSG-DLAAVFRQVKG 563 >gi|156367148|ref|XP_001627281.1| predicted protein [Nematostella vectensis] gi|156214186|gb|EDO35181.1| predicted protein [Nematostella vectensis] Length = 166 Score = 42.7 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 41/125 (32%), Gaps = 12/125 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 VR G + R G +N + I A++ AY +++ Sbjct: 45 NVRFGLVVYGTRPRVVFGL-NGFRNNGGLFNALNRPIKNPQTGSRIGLALRAAYTRVLAR 103 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + A K IV+L DG + D N +++G++I I ++ Sbjct: 104 SP--------RRGATKIIVVLADGRSEDDVRRPS---NALQARGVKIFAIGIGRYINGRQ 152 Query: 414 KARYF 418 + Sbjct: 153 LDQLA 157 >gi|148657452|ref|YP_001277657.1| hypothetical protein RoseRS_3346 [Roseiflexus sp. RS-1] gi|148569562|gb|ABQ91707.1| hypothetical protein RoseRS_3346 [Roseiflexus sp. RS-1] Length = 392 Score = 42.7 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 54/173 (31%), Gaps = 13/173 (7%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS--SRAKNSFTFPKQKIE 91 + G+ +D + + +A A + A+ LI V+ + T+ + ++ Sbjct: 26 IAFAGLAIDGALAFAWRRNVMNSADGAALIATRALIVDRGSVNGIAITNAVRTYLQAELG 85 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + N + L + V + +N R V + + + + Sbjct: 86 ---VDNPDFELTYVNGAGQSMGTVDSSPAPVNARGIGIVVRHT--------FDTYLMGIL 134 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 G + ++ + A + ++ ++ + S L+ F Sbjct: 135 GQPTLTVRGVSAARFGNGGMLIGDLIAPMGLLPEAAEFFRSNPTSALVDLSGF 187 >gi|254485826|ref|ZP_05099031.1| TadE-like protein [Roseobacter sp. GAI101] gi|214042695|gb|EEB83333.1| TadE-like protein [Roseobacter sp. GAI101] Length = 143 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 42/133 (31%), Gaps = 1/133 (0%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY-EHALKQAAQTAI 61 K + + + +G I L+ P++ + ++ + + + A A Sbjct: 1 MRHKTLRHLNNFRRDTSGAVAIEFVLIAPLLFALLFGIMTIGYYVGVSHSVSQLATGAAR 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + +Q E++ + + + + + N +R Sbjct: 61 ASVAGLDMQERVELAEAYLSRASVNYPLLTQSAVTPDIRTETTNPPGITIRVTYAVDGTV 120 Query: 122 MNPRKSAYQVVLS 134 ++ S ++ +S Sbjct: 121 LDLANSLLKLNIS 133 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 53/167 (31%), Gaps = 12/167 (7%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 V+D ++ I + R F+ F+ + + T+ Sbjct: 112 VKDYKNGILALQTLITRAKEDTRYAGITFSTEANITFYFTDPLDAMKGLGGITY-----A 166 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T A+ + + + K I+++TDG++ + E + + Sbjct: 167 PGMTNTQAALDICRTQL-----WLNKKSGFRRLSFKRILIVTDGQSNINMERTLYNAFQL 221 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 K+ GI I +A + S+ + + + L ++ Sbjct: 222 KNMGIEIFVVAVGKYLRGIAEIVGLASS--TDAHLYRVRNLRGLLEV 266 >gi|116249975|ref|YP_765813.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115254623|emb|CAK05697.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 193 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 17/168 (10%), Positives = 46/168 (27%), Gaps = 13/168 (7%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + ++L + G I A+L PV++ + ++ + +AA T Sbjct: 8 TRLVLTMRRLARDRKGAGAIEFAILFPVLVMLYIGAFEITVGLSVSKRVTRAAGTVADLV 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL-------------KKNFTDREV 111 + + ++ + L K+ T Sbjct: 68 TQQQSVTKSALAQMPSVATAIFVPYNSTSLTLKITGITIDAGANAKVLWSWAKDGTVPYA 127 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 ++ +++ V + + G+++ I Sbjct: 128 KNTAVTNVPADMKTANSFLVRTELSIPYTMFLFAPNFMPDGMRTITIS 175 >gi|42524203|ref|NP_969583.1| hypothetical protein Bd2793 [Bdellovibrio bacteriovorus HD100] gi|39576411|emb|CAE80576.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 354 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 49/179 (27%), Gaps = 33/179 (18%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + + + + G T +A++ + + Sbjct: 134 VAFAGSAALLSPLTNDPGAIKMYLESLEPSSVSSQG-TNFTEALKISKEAFERGGVSTDE 192 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 +K + I++ +DGE+ + K +G+RI ++A+ K R + Sbjct: 193 TVK----VTRVILIASDGEDHEQGALDE--AKKMAGEGVRIFSLAYGTEKGGAIPVRDGM 246 Query: 420 S--------------------------NCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 A SF+ A E K+ + I + Sbjct: 247 GFLKGYKKDRQGQTILTTVKGDALRALAEAGQGSFYFATFGGEQTKLLVEDISKLEKAQ 305 >gi|304390827|ref|ZP_07372779.1| hypothetical protein HMPREF0574_1557 [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|304325710|gb|EFL92956.1| hypothetical protein HMPREF0574_1557 [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 171 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 14/136 (10%), Positives = 40/136 (29%), Gaps = 6/136 (4%) Query: 8 IFYSKKLIK--SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + G ++ ++ + M + V + L++ A A+ Sbjct: 14 RRFVARFERLGDEDGRIMLLGLAACVLLCVIILMSMAVSGVYLEQRRLQRLADQTASMAA 73 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 ++ + + +IE Y + + V+++ Sbjct: 74 ----ANMADTAYYQNGIVDGVPLEIEPYHASERAAEYLSGASISANSGLDGIDLVDVDVA 129 Query: 126 KSAYQVVLSSRYDLLL 141 + QV L + + L Sbjct: 130 STRVQVTLRATGKIPL 145 >gi|257892717|ref|ZP_05672370.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257829096|gb|EEV55703.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1347 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/419 (8%), Positives = 93/419 (22%), Gaps = 42/419 (10%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 AA + + + F + +KK ++ Sbjct: 140 YAADNGSTAG-EEDVNVPAQTVQLWGDERNFENSYL-----DYNGAYIKKWVEPVLSNNL 193 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D E + + ++P+ + S T + E Sbjct: 194 TSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIE 253 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 I + +M + + + + + + ++ Sbjct: 254 AVNEISENLLSDPNM-------DIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTG 306 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVD-SSSLRHVIKKKHLVRDALASVIRSIKKID-NVN 292 S N +L G + + + + + + + Sbjct: 307 SENTALNRTPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIQTR 366 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA---INDAMQTAYDT 349 G + + V+ D + G T + Sbjct: 367 SRTSGGGAWSSWSVMEDKIAVDDGDLFRNFEDFDDDDNTTNAGFTNPVTYASDFDRPVNG 426 Query: 350 IISSNEDEVHRMKNNLEAKKYIV-LLTDGENTQDNEEGIAIC-----------NKAKSQG 397 + ++ ++G + + + + NK + Sbjct: 427 TNVQYRYGEVKEGDDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAYHHWLKNKYQENP 486 Query: 398 IRIMTIAFSVNKTQQ------EKARYFLSNCA------SPNSFFEANSTHELNKIFRDR 444 I +I ++ + R L N A + +++AN+ +++ D Sbjct: 487 PSIFSIGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDI 545 >gi|49225581|ref|NP_990438.1| collagen alpha-1(VI) chain precursor [Gallus gallus] gi|115314|sp|P20785|CO6A1_CHICK RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|62875|emb|CAA41062.1| collagen alpha 1 type VI [Gallus gallus] gi|63302|emb|CAA45788.1| collagen type VI alpha 1 subunit [Gallus gallus] gi|211354|gb|AAB59954.1| alpha-1 type VI collagen precursor [Gallus gallus] Length = 1019 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKI-DNVNDTVRMGA---------TFFNDRVISDPS 311 + V + D +A V + D + + ++D V+ Sbjct: 43 DTSESVALRVKPFGDLVAQVKDFTNRFIDKLTERYFRCDRFLAWNAGALHYSDSVVIIKD 102 Query: 312 FSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + R +K N +G T + A++ + ++ ++L+ KY Sbjct: 103 LTAMPSG--RAELKNSVSAINYIGKGTHTDCAIKQGIERLL--------LGGSHLKENKY 152 Query: 371 IVLLTDGENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 ++++TDG + +E N+AK GI++ ++A L+ A+ + Sbjct: 153 LIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAI-----SPHHLDQRLNIIATDH 207 Query: 427 SF---FEANSTH-----ELNKIFRDRI 445 ++ F A S ++ + + I Sbjct: 208 AYRRNFTATSLKPTRDLDVEETINNII 234 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 36/316 (11%), Positives = 86/316 (27%), Gaps = 30/316 (9%) Query: 153 IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 S + S + ++ + S+ + A + Sbjct: 716 FTSNNNVAIVITDGRSDTLRDRTPLTSLCEVTPVVSLGIGDIFRNNPNPDQLNDIACLGM 775 Query: 213 K-----SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 S N + D L + Y P+ + + + S V Sbjct: 776 PRRQGLSIQRDNYAELLDDSFLQNITSYVCREKKCPDYTCPITFANPADIMLLVDSSTSV 835 Query: 268 IKK-KHLVRDALASVIRS-IKKIDNVNDTVRMGATFFN--DRVISDPSFSWGVHKLIRTI 323 K ++ + + ++ D+VR+ ++ ++ + F + + + Sbjct: 836 GSKNFDTTKNFVKRLAERFLEASKPAEDSVRVSVVQYSGRNQQKVEVPFQRNYTVIAKAV 895 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN 383 + +T +N A+Q S+ KK +++ +DG + Sbjct: 896 DNMEFM----NEATDVNAALQYIMGLYQRSSRSGA---------KKKVLVFSDGNSQGIT 942 Query: 384 EEGIA-ICNKAKSQGIRIMTIAFSVNKTQQE-----KARYFLSNCASPNSF-FEANS-TH 435 I + + GI + +A + + + A F T Sbjct: 943 ARAIERTVQEVQQAGIEVYVLAVGSQVNEPNVRVLVTGKSTNYDVAYGERHLFRVPDYTS 1002 Query: 436 ELNKIFRDRIGNEIFE 451 L +F + +I Sbjct: 1003 LLRGVFYQTVSRKIAV 1018 >gi|167581951|ref|ZP_02374825.1| hypothetical protein BthaT_27659 [Burkholderia thailandensis TXDOH] Length = 142 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 38/134 (28%), Gaps = 1/134 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L++ G + L P M+ V +VD + + A++ A V + L Sbjct: 7 LVRDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGVVVRVPQLA 66 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + ++ + + ++ T + Sbjct: 67 -AADIENVALSYAQGSLVSGGTVGAPVVYVDQSAGTSPGSPLKVTVSYTYQGLVLGSALS 125 Query: 134 SSRYDLLLNPLSLF 147 S + L +++ Sbjct: 126 SLTGPITLTAVTVM 139 >gi|114706777|ref|ZP_01439677.1| TadE-like protein [Fulvimarina pelagi HTCC2506] gi|114537725|gb|EAU40849.1| TadE-like protein [Fulvimarina pelagi HTCC2506] Length = 178 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 12/161 (7%), Positives = 38/161 (23%), Gaps = 4/161 (2%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K+L K G I L + +++ ++ + + + Sbjct: 16 SLFKRLRKDRGGVAAIEFVLCAFPFFALFLAIIETAILFTA----GIVLESGVQGVARQI 71 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + + + KQ + + + + + D + + + Sbjct: 72 LTGQLQSAGDEAPTKEEFKQLVCDRIDYFLACDKIQVDLKTFDDYSAIDLSYDPDGFGYD 131 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + + S K +V+ Sbjct: 132 LGGSEDINVLRVFYEWEWQTSMLHALSGDDNGKLTFASVAA 172 >gi|332185399|ref|ZP_08387147.1| tadE-like family protein [Sphingomonas sp. S17] gi|332014377|gb|EGI56434.1| tadE-like family protein [Sphingomonas sp. S17] Length = 257 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 27/263 (10%), Positives = 66/263 (25%), Gaps = 17/263 (6%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + G I AL PV+L +G V++ + + Q I + Sbjct: 2 RLTKGLESVTHDRRGVAMIEFALAAPVILLLGLGGVEIGNYVIANLRVSQ------IAMA 55 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V + + ++ E + + D+ + T Sbjct: 56 VADNAGRIRTTIDEADVTEIMIGAMK-----MGEPLSLASKGRIILSDLEQRTTTTGAGG 110 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH----GVSIQW 181 K + + Y S + T ++ K + V Sbjct: 111 KGSVSASNPNGYRQWFRWQRCAGALSVPSSLGVPTNDAGAPITNLDDKTNDDHGAVETAS 170 Query: 182 VIDFSRSMLD--YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 +ID + + + V + +R +++ + V + Sbjct: 171 IIDGVGKAGNQIAAQGGTAVMVAEVVYDYQPIVPVNLITSAIRSLRIKRVVAFNVRERNA 230 Query: 240 LYYMLYPGPLDPSLSEEHFVDSS 262 ++ D++ Sbjct: 231 FSLHNDGNLTGAKRADCQIYDAA 253 >gi|284029817|ref|YP_003379748.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809110|gb|ADB30949.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 550 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 41/130 (31%), Gaps = 14/130 (10%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R I + A G T + D A + SS ++L TDG+N Sbjct: 428 QRMIGQLKAQSARVGGGTGLYDTAIAAVRAVRSSYNSGAVNT---------VLLFTDGKN 478 Query: 380 TQDNE----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + I N K + IA + L+ A+ + A + Sbjct: 479 DDPGSLSLDRTLQILNGLKDPARPVRIIALGIGPDADADELKRLAQ-ATGGQAYVARNPT 537 Query: 436 ELNKIFRDRI 445 +L +F D + Sbjct: 538 DLKDVFIDAL 547 >gi|221066731|ref|ZP_03542836.1| membrane protein-like protein [Comamonas testosteroni KF-1] gi|220711754|gb|EED67122.1| membrane protein-like protein [Comamonas testosteroni KF-1] Length = 723 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 1/65 (1%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 + AL+ V+ + G+ VD+ Y + L++ A A + A+ + + S Sbjct: 1 MQFALVAGVIFSMLGV-VDLGYMYYAKRNLQRIADLAALEAAQSINAEQDNYSECQTTGE 59 Query: 84 TFPKQ 88 T + Sbjct: 60 TSLDR 64 >gi|149641794|ref|XP_001509032.1| PREDICTED: similar to integrin alpha E2 [Ornithorhynchus anatinus] Length = 1679 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 59/214 (27%), Gaps = 17/214 (7%) Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + ++++ + N + D +D S + Sbjct: 121 NCILYTADLQEKASFNFSSIENNNNNGFEQEMDEDTGTEIAIVLDGSGSIN----PEDFE 176 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A + + + + + ++ + + ++G Sbjct: 177 KAKDFIFNIMTRFYKKCFECEFAVVQYGSVIQTEFDL---LASRDANSSLQKVKNIKQVG 233 Query: 336 S-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 + T A+ I + + + K I++LTDG+ QD + + N + Sbjct: 234 NVTKTASAIYHVLKEIFNV------EKGSRRQNSKIIIVLTDGDVFQDPKNLATVMNLPE 287 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 +GI I + ++ L AS Sbjct: 288 MKGIERFAIGVG---NEFSASKKTLELIASEPHH 318 >gi|118390982|ref|XP_001028302.1| hypothetical protein TTHERM_02533180 [Tetrahymena thermophila] gi|89281272|gb|EAR80639.1| hypothetical protein TTHERM_02533180 [Tetrahymena thermophila SB210] Length = 202 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 10/68 (14%), Positives = 22/68 (32%), Gaps = 7/68 (10%) Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 K + + I + F + + A SF+ ++++ F D +G Sbjct: 18 KLQEESFTIHSFGFGNDHDGP-----LMQKIAQIKDGSFYFVEKNDQVDEFFIDALGGLF 72 Query: 450 FERVIRIT 457 +T Sbjct: 73 SVVAQDLT 80 >gi|115535038|ref|NP_509469.2| hypothetical protein K09E2.1 [Caenorhabditis elegans] gi|90568060|gb|AAC46572.2| Hypothetical protein K09E2.1 [Caenorhabditis elegans] Length = 915 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 52/147 (35%), Gaps = 14/147 (9%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 D VR+G +++ ++ + S + I + ++ T A+ A + I Sbjct: 68 DAVRVGLVQYSESAKTEFNLS-KYSERNDIIAHMETLTFMQVEDTRTGVALNKADEEIFD 126 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 L+A + I++ TDG + + +G++I TI+ + Sbjct: 127 ------FNGGARLKATRLIIIFTDGLSMDK---PSKAAKALRRKGVKIYTISVNSIGFIP 177 Query: 413 EKARYFLSNCASPNSFFEANSTHELNK 439 E L ++ F N + + Sbjct: 178 E----MLGIVGDADNVFGPNDEERIEE 200 >gi|89068016|ref|ZP_01155433.1| CpaB family protein [Oceanicola granulosus HTCC2516] gi|89046255|gb|EAR52312.1| CpaB family protein [Oceanicola granulosus HTCC2516] Length = 281 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 22/59 (37%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSF 83 + L++ V +G+ G V + + + + A + A+ + P ++ Sbjct: 4 VFGLVLIVGVGLAGFAVYMAQGYLQDTQAQLARERAVAAQATPTVEVYAVTRQIGYGEQ 62 >gi|281354485|gb|EFB30069.1| hypothetical protein PANDA_020540 [Ailuropoda melanoleuca] Length = 1096 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 21/191 (10%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +D+SS + VR +A+++ + + + R+G ++D+ + Sbjct: 549 YDLVFLLDTSSSVG-KEDFEKVRQWVANLVDTF---EVGPERTRVGVVRYSDQPTTAFEL 604 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 G+ + G+T DA++ S + K+ + Sbjct: 605 --GLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFSPQAGGRP---GDRAFKQVAI 659 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 LL + + + A GIRI + + L AS F Sbjct: 660 LLP---AGRSQDLVLDAAAAAHRAGIRIFAVGVGA------ALKEELEEIASEPKSAHVF 710 Query: 430 EANSTHELNKI 440 + + ++KI Sbjct: 711 HVSDFNAIDKI 721 >gi|308472999|ref|XP_003098726.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] gi|308268326|gb|EFP12279.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] Length = 380 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 49/165 (29%), Gaps = 22/165 (13%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGST---AI 339 I T R+G +N + + + +T + Sbjct: 68 TRIGSNSVEPRTTRVGLVTYNSAATLNADL--NQFQSFSDLRNGVISFLKVAANTKDSYL 125 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 + A + +K I++ + + + I ++ K G++ Sbjct: 126 ATGLAMAAQVLN--------VQGLRDHYQKVIIVYASKYSGYGDLDPQPIADRLKGSGVK 177 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH-ELNKIFRD 443 I+T+A+ L + +SP F + S + ++ + Sbjct: 178 IITVAYG--------DETVLESLSSPRFGFNSASGYPQIQNALLE 214 >gi|301788518|ref|XP_002929675.1| PREDICTED: complement factor B-like [Ailuropoda melanoleuca] gi|281345621|gb|EFB21205.1| hypothetical protein PANDA_019913 [Ailuropoda melanoleuca] Length = 768 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 48/194 (24%), Gaps = 22/194 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK----LIRTIVKT 326 A + I+K+ + + G + + R + + Sbjct: 283 IGNFTRAKNCLRDFIEKVASYGVKPKYGLVTYATIPKVWVKLRDDNSSDADWVTRILNQI 342 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 D T A+Q Y+ + N I+L+TDG + + Sbjct: 343 SYEDHKLKAGTNTKKALQEVYNMMS--WPGNALLEGWNHTRH-VIILMTDGLHNMG-GDP 398 Query: 387 IAICNKAKS--------QGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +++ ++ + + R + Q+ F Sbjct: 399 VSVIHEIRDFLYIGRDRKNPREDYLDVYVFGVGPLVNQENINALASKKD-KEQHVFRVKD 457 Query: 434 THELNKIFRDRIGN 447 L +F + Sbjct: 458 MENLEDVFIQMLDE 471 >gi|119585300|gb|EAW64896.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_d [Homo sapiens] Length = 2978 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFATVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|119585298|gb|EAW64894.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_b [Homo sapiens] Length = 2609 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFATVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|119585297|gb|EAW64893.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_a [Homo sapiens] Length = 2944 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFATVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|119585301|gb|EAW64897.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_e [Homo sapiens] Length = 2597 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFATVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|495866|gb|AAA58965.1| collagen type VII [Homo sapiens] Length = 2912 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFATVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|4502961|ref|NP_000085.1| collagen alpha-1(VII) chain precursor [Homo sapiens] gi|1345650|sp|Q02388|CO7A1_HUMAN RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor gi|987125|gb|AAA75438.1| alpha-1 type VII collagen [Homo sapiens] Length = 2944 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ +++ + G+T A+ D + Sbjct: 77 VRFATVQYSDDPRTEFGLDALGSGGD--VIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|62286486|sp|Q863A0|CO2_GORGO RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|29690202|gb|AAM10001.1| complement C2 [Gorilla gorilla] Length = 752 Score = 42.3 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 56/197 (28%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F + S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASKPKVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRILGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 ++ + I I I + S F T L+++ Sbjct: 389 VDRIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|116625492|ref|YP_827648.1| hypothetical protein Acid_6438 [Candidatus Solibacter usitatus Ellin6076] gi|116228654|gb|ABJ87363.1| hypothetical protein Acid_6438 [Candidatus Solibacter usitatus Ellin6076] Length = 754 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 51/148 (34%), Gaps = 11/148 (7%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++A++ + + D R A N + + L++ + F + + G Sbjct: 83 SIATLKPQQNVVAVLVDDSRSMAINDNSGTREAAAQAALNSGLLKGLTDKFQVRLYKFG- 141 Query: 337 TAINDAMQTAYDTIISSNEDEVHR-------MKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + +Q A D + ++ +++ IVLL+DG + + I Sbjct: 142 -KDPERIQKA-DQLTAAAPATRIGDTLERVLAESSSLPLGAIVLLSDGADNAGGIDLATI 199 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + Q I I T+ F ++ Sbjct: 200 A-AIRRQRIPIHTVGFGKEHPDRDIEIT 226 >gi|255557524|ref|XP_002519792.1| protein binding protein, putative [Ricinus communis] gi|223541031|gb|EEF42588.1| protein binding protein, putative [Ricinus communis] Length = 436 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 40/132 (30%), Gaps = 34/132 (25%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + ++ + +T I +QT + + + I+L++ Sbjct: 77 ITENSQNDLEILINGLHADAATNITAGLQTGLRVLNDRSLSGGRVVD--------IMLMS 128 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEAN 432 DGE + + A+ I L A +F + Sbjct: 129 DGEQNAGD-------DAAQ--------IPVG--------NMPVLKAIADNSMGGTFSDVQ 165 Query: 433 STHELNKIFRDR 444 +T L+K F D Sbjct: 166 NTDNLSKAFSDL 177 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 40/132 (30%), Gaps = 17/132 (12%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI--V 324 K + AL S + K N ND + + + F G + + + + + Sbjct: 93 TPLKLQSAKYALQSAVDYFKANANPNDRFALIPFSSDVQYNKVVPFPTGAYDVKQHLERI 152 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT---Q 381 A D G T ++Q A KKYI+ LTDG T Sbjct: 153 ANVANDLRAYGGTNYTQSLQQAQSFFNDPTR------------KKYIIFLTDGMPTVSIA 200 Query: 382 DNEEGIAICNKA 393 +C Sbjct: 201 KEPITYKVCEGI 212 >gi|189501233|ref|YP_001960703.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496674|gb|ACE05222.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 339 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 40/119 (33%), Gaps = 9/119 (7%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + R G F + + T++ + D E T I+ AM A Sbjct: 124 ISRGIERGRRGLVAFAGSGVVQCPLTTDQQAF-ETMLGIASPDLIEAQGTDISAAMDVAQ 182 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 S +E + VL++DGE + A K K + +R++ + Sbjct: 183 KMFSGSKTEEKVKAGG------VAVLVSDGEAHEK--GFSAAARKLKEKDVRLIVVGVG 233 >gi|304392393|ref|ZP_07374334.1| putative TadE family protein [Ahrensia sp. R2A130] gi|303295497|gb|EFL89856.1| putative TadE family protein [Ahrensia sp. R2A130] Length = 204 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 11/183 (6%), Positives = 39/183 (21%), Gaps = 11/183 (6%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ K G + AL+ P +L + + + A Sbjct: 15 FLRRFRKDERGISMVEFALISPALLSMYLGAIVATHM-----------EHASTAVGKVTG 63 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + ++ + + + N + Sbjct: 64 TVADIIAQSPVVDRSIIDGAFAAGEAMMSQQYADDLEIVLTGVIVEPVPGDNSNNPQRRG 123 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 +V ++ + + ++ + W + + + + M Sbjct: 124 RVAWTASHQRVSLAKPSRGQTYPLPDWATKRNGFYVVAKGRLKQTPLYGDYFNVGGDGKM 183 Query: 190 LDY 192 Sbjct: 184 TYN 186 >gi|299134236|ref|ZP_07027429.1| TadE family protein [Afipia sp. 1NLS2] gi|298590983|gb|EFI51185.1| TadE family protein [Afipia sp. 1NLS2] Length = 145 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 45/126 (35%), Gaps = 3/126 (2%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ-AAQTAIITASVPLI 69 + ++C+G + AL++PV + + +V + H ++Q AA+ A + + Sbjct: 8 FRDFRRACSGASAVEFALVLPVFMLLVFGIVMFGAYLAMVHDVQQLAAEAARTSVAGLNE 67 Query: 70 QSLE--EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + +N+ ++P + + N V +T + P Sbjct: 68 TERKSLATNYVTQNAASYPLIVPAHLSVNAATSGADPNVFIVTVNYDASNTFIYSLPSFV 127 Query: 128 AYQVVL 133 + Sbjct: 128 PAPPPV 133 >gi|293377912|ref|ZP_06624093.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] gi|292643459|gb|EFF61588.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] Length = 1498 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 37/418 (8%), Positives = 92/418 (22%), Gaps = 46/418 (11%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 AA + + + F + +KK + Sbjct: 297 YAADNGSTAG-EEDVNVPAQTVQLWGDERNFENSYL-----DYNGAYIKKWVEPVLSNNP 350 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D E + + ++P+ + S T + E Sbjct: 351 TSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIE 410 Query: 175 HGVSIQWVIDFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 I + +M +++ Q + + + S+N + Sbjct: 411 AVNEISENLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALN 470 Query: 224 IRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHV----IKKKHLVRDAL 278 +P + G + + + V I++ + Sbjct: 471 RTPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTN 530 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V+ ++ D + N + R Sbjct: 531 WAVMNNMIARDTGDLFKNFETYSGNTSGAGFSYPVVYPSEFNRPEDTWDYNYRYGEVKEG 590 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + A S+ + A + + E + Sbjct: 591 DDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAY-HHWLKNKYQENPPS---------- 639 Query: 399 RIMTIAFSVNKTQQ------EKARYFLSNCA------SPNSFFEANSTHELNKIFRDR 444 I +I ++ + R L N A + +++AN+ +++ D Sbjct: 640 -IFSIGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDI 696 >gi|302517652|ref|ZP_07269994.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426547|gb|EFK98362.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 418 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 46/119 (38%), Gaps = 13/119 (10%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 RT K + G T I A+ A D + EA K IVL+TDGE+ Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAADDL------------RGGEASKRIVLITDGED 154 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T + + + ++G+ ++ + + + + A+ ++ T +L+ Sbjct: 155 TCHR-DPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTKDLS 212 >gi|148360718|ref|YP_001251925.1| type IV fimbrial biogenesis PilY1-like protein [Legionella pneumophila str. Corby] gi|296106216|ref|YP_003617916.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] gi|148282491|gb|ABQ56579.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila str. Corby] gi|295648117|gb|ADG23964.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] Length = 1169 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 41/402 (10%), Positives = 89/402 (22%), Gaps = 25/402 (6%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A I A + + + ++ Y + ++ T V + Sbjct: 141 AGIKAIIENYMPTTDFALGTYSTSNISSYNTWVYYMSPPGSDFVFTNTPVAGNRYVTNPC 200 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 S SS L L + + I + S VS Sbjct: 201 YNYGSASSTVSSNCSSIGSLYGTTLVSSSQYLQIGDSSDDPDINDVLYAGSGFPGIFVSY 260 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 S +Y + Q A+ + + S ++ + Sbjct: 261 NGPTPSSPFPPNYTISNYNQGNIRISY-ANTRPSIGNFSSSPTNAGFVPFSQQVMYVQRG 319 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y + + + V +A+ + + +K ++ Sbjct: 320 FGYYSNQSYATG-----NMLVNMQTAGTNPTTTSVNNAINAFLPHLK---PETNSTATTE 371 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++T+ T + I+D + T Sbjct: 372 IKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIILISDGLPT-----QDLQSRYWP 426 Query: 360 RMKNNLEAKKYIV--LLTDGENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQQE 413 + + + DG N + ++ K+ G+ I Sbjct: 427 PLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEIKALKNDGVLTFIIGMGAGVDPAV 486 Query: 414 KARYF--LSNCA---SPNSFFEANSTHELNKIFRDRIGNEIF 450 L A +++ A S L + N Sbjct: 487 NPEAAATLRAMAVAGGTENYYPATSPEALVSSLNSILSNIQN 528 >gi|91215374|ref|ZP_01252345.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] gi|91186326|gb|EAS72698.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] Length = 689 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 56/169 (33%), Gaps = 22/169 (13%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +DSS K ++A ++ ++ DN F++ ++ + Sbjct: 282 IDSSGSMRGGNKMAQAKEASEFIVNNLNIGDN------FNVIDFDNNIVLFQPELVEYNI 335 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 F + +G+T I++++ TA + + ED+ + IV TDG Sbjct: 336 QNSNAALDFIENIVALGATNISESLVTAINQFEAGAEDKAN----------IIVFFTDGG 385 Query: 379 NTQDNEEGIAICNKAKS------QGIRIMTIAFSVNKTQQEKARYFLSN 421 T+ I A+ I + T + T + N Sbjct: 386 ATEGETNTQNILQLAEDTVNQIETEIFLFTFGIGEDVTTDLLTLLAVQN 434 >gi|318058741|ref|ZP_07977464.1| hypothetical protein SSA3_12405 [Streptomyces sp. SA3_actG] Length = 418 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 46/119 (38%), Gaps = 13/119 (10%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 RT K + G T I A+ A D + EA K IVL+TDGE+ Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAADDL------------RGGEASKRIVLITDGED 154 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T + + + ++G+ ++ + + + + A+ ++ T +L+ Sbjct: 155 TCHR-DPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTKDLS 212 >gi|254458848|ref|ZP_05072272.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084614|gb|EDZ61902.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 629 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 58/201 (28%), Gaps = 44/201 (21%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + L + +I K R+G F S+ + + K Sbjct: 104 YPNRLELAKKKALELINKATK-------DRVGVIAFAKNSYLVSPISFDTKTVSFLLSKL 156 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 T I + T T N KKY+++L+DG D + Sbjct: 157 DTSSIT-QKGTNILTMLGTVEKT-------------NTSTDKKYLLILSDG---GDETDF 199 Query: 387 IAICNKAKSQGIRIMTIAFS--VNKTQQEKARYFLS-----------------NCASPNS 427 A + AK +GI + + V + + K + + Sbjct: 200 SAEIDFAKEKGIIVFVLGIGTEVGASIKNKDGSLIKYNDKVVISKLNESISELAIKTGGV 259 Query: 428 F-FEANSTHELNKIFRDRIGN 447 + S+ ++ +F + I N Sbjct: 260 YIQNTTSSKDIEAMFIEIINN 280 >gi|156404157|ref|XP_001640274.1| predicted protein [Nematostella vectensis] gi|156227407|gb|EDO48211.1| predicted protein [Nematostella vectensis] Length = 476 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 49/395 (12%), Positives = 103/395 (26%), Gaps = 50/395 (12%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 L+ A + I L + K L N N + Sbjct: 24 TTLQTAPGELAKKLAELAINGLGTSEMQGYYDKLTFK---SLDLNGNSILNDLATRFANK 80 Query: 111 VRDIVRDTAVEMNPRKSAYQVVLSSRY--------DLLLNPLSLFLRSMGIKSWLIQTKA 162 ++ V + + +Y + L S F + + + Sbjct: 81 LQTKVTIARKIKDAVEVSYAKSATVTSRTECCKADTRWLKYDSRFRTKVNLDEMCVIISG 140 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 A + + +++ I+ + ++ S S Sbjct: 141 AASSNPKQLQDNVLQTMKQNIENNPTL----TWQYFGSEEGLYTNYPMIRDSSSCS---- 192 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +Y+ P + S + + ++A +V+ Sbjct: 193 ----------SYDPRYRPWYVEAASPQPKDVILVVDYSGSMGGS---RLPIAKEAAKTVL 239 Query: 283 RSIKKIDNV------NDTVRMGATFFNDRVISDPSFSWGVHK-LIRTIVKTFAIDENEMG 335 ++ D V + R+ T + + S + I+K F E G Sbjct: 240 DTLNPRDRVAFLAFESGVRRVKVTSGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASG 299 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A A+D + + + N + I+ +TDG D + K ++ Sbjct: 300 GTMYAVAFNAAFDILDKYYK------EKNTTRRPVILFMTDGAPNDDPGTILNTV-KMRN 352 Query: 396 QGIR----IMTIAFSVNKTQQEKARYFLSNCASPN 426 QG+ I+T + T + A + Sbjct: 353 QGLSTKADILTFGLLTSITFPCYHQGKFIGVAGTD 387 >gi|17537919|ref|NP_496258.1| C-type LECtin family member (clec-59) [Caenorhabditis elegans] gi|3881709|emb|CAA88984.1| C. elegans protein ZK666.5, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 396 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 43/144 (29%), Gaps = 8/144 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGSTAIND 341 I T R+ +N + + + + + + + + Sbjct: 66 TQIGTNPTSPKTTRLALVTYNVDATTAADLNKFQSIDDIYSGINSALATISSSEESYLAR 125 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A + N +K +++ + + + + K+ G+ I+ Sbjct: 126 GLSQAEKVFQAGKHG-----FNRAHYQKVVIVYASTYKGSGDLNPVPVAQRLKTSGVTII 180 Query: 402 TIAFSVNKTQQEKARYFLSNCASP 425 T+A+ NK L A+P Sbjct: 181 TVAYDQNKDGDILVD--LEKIATP 202 >gi|15897953|ref|NP_342558.1| hypothetical protein SSO1089 [Sulfolobus solfataricus P2] gi|13814278|gb|AAK41348.1| Hypothetical protein SSO1089 [Sulfolobus solfataricus P2] Length = 436 Score = 42.3 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 42/156 (26%), Gaps = 36/156 (23%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS--TAINDAMQTAYDTIISSNED 356 N+ ++ I + T +++A+ A + Sbjct: 147 LYSLNEGDYVTLILFSNHPEIKYQGPAKGIITFDVGKGYTTRLHEAVNFALNL------- 199 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRIMTIAFSVNK 409 K + K I++LTDG+ T K +I+TI + Sbjct: 200 ----AKQSQVPNK-IIMLTDGKPTDKR--------NVKDYEKFDIPPNTQIITIGIGSDY 246 Query: 410 TQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 L A S F+ EL +F Sbjct: 247 N-----ERILKKLADKSSGKFYHLKDISELPDVFES 277 >gi|305680383|ref|ZP_07403191.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305659914|gb|EFM49413.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 880 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 45/132 (34%), Gaps = 34/132 (25%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------QDNEEGIAIC 390 T A+ A + S + E+ K I+ TDG+ T D AIC Sbjct: 155 TDYGTALNGAMADLASRPD---------PESCKAILFFTDGKLTVQGDQKADIVAQKAIC 205 Query: 391 ------NKAKSQGIRIMTIAFSVNKTQQEKARYFLS------NCASP----NSFFEA-NS 433 K + I++ T+ + +E L +CA +FF A ++ Sbjct: 206 SADGQVKKLRDANIQLFTVGLI--PSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESN 263 Query: 434 THELNKIFRDRI 445 L FR + Sbjct: 264 AASLFSAFRSIL 275 >gi|75812639|ref|YP_320257.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] gi|75705395|gb|ABA25068.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] Length = 405 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 49/156 (31%), Gaps = 16/156 (10%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR------ 321 KK A+ + +K D +T F + L + Sbjct: 100 TKKIEGAIKAIRQLTSVLK--DRGENTQVAIVPFGEAGANCPQGYPVNKDTLDKFFAAND 157 Query: 322 ---TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV--HRMKNNLEAKKYIVLLTD 376 + ST + + ++ A + ++++ + + I+LL+D Sbjct: 158 FKLQNNLDYLASLTPCASTNLYEPLKKAVKFLANTSDSRFYLPKDSPQTPPRLSIILLSD 217 Query: 377 GENTQDN--EEGIAICNKAKSQ-GIRIMTIAFSVNK 409 G + N ++ ++ K I + T+ + + Sbjct: 218 GYHNFANEAQDFQSLTTLLKRNTNIIVHTLGYGLTP 253 >gi|308472975|ref|XP_003098714.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] gi|308268314|gb|EFP12267.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] Length = 412 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 50/182 (27%), Gaps = 10/182 (5%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + S + ++ I + T R+G +N Sbjct: 53 NGLNYVSIFSLTMPSSLFTKFQIASNIASVFGFGTRIGLNASEPRTTRLGLVTYNSVATQ 112 Query: 309 DPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + + + + ++ + A + Sbjct: 113 MADLNQYQSLHDAFNRIFDDLSNTVDTTESYLSTGLTLAEKMFNDQS-----VNSTRAHY 167 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +K +++ T +I ++ K G++I+T+A+ LS ASP Sbjct: 168 QKVVIVYASKYQTNGESNPESIADRLKLSGVKIITVAYG----NAYGLMKSLSIIASPGF 223 Query: 428 FF 429 F Sbjct: 224 AF 225 >gi|225022540|ref|ZP_03711732.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] gi|224944663|gb|EEG25872.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] Length = 880 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 45/132 (34%), Gaps = 34/132 (25%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT------QDNEEGIAIC 390 T A+ A + S + E+ K I+ TDG+ T D AIC Sbjct: 155 TDYGTALNGAMADLASRPD---------PESCKAILFFTDGKLTVQGDQKADIVAQKAIC 205 Query: 391 ------NKAKSQGIRIMTIAFSVNKTQQEKARYFLS------NCASP----NSFFEA-NS 433 K + I++ T+ + +E L +CA +FF A ++ Sbjct: 206 SADGQVKKLRDANIQLFTVGLI--PSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESN 263 Query: 434 THELNKIFRDRI 445 L FR + Sbjct: 264 AASLFSAFRSIL 275 >gi|50950211|ref|NP_001002980.1| collagen alpha-1(VII) chain [Canis lupus familiaris] gi|33149359|gb|AAO64414.1| type VII collagen [Canis lupus familiaris] Length = 2936 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 45/142 (31%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ G +++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFGL--GALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|3850207|gb|AAC72024.1| alpha-1 type VII collagen non-collagenous domain [Canis lupus familiaris] Length = 1253 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 45/142 (31%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ G +++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFGL--GALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DPEELKRVASQPTSDFFFFVND 200 >gi|73976417|ref|XP_539432.2| PREDICTED: similar to Matrilin-2 precursor [Canis familiaris] Length = 524 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 54/178 (30%), Gaps = 20/178 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +++ + ++ + T R+G ++ +V + Sbjct: 195 FEIIKSLVKTLSDQVAL---DLATARIGIINYSHKVEKVAHLTQ--FSNKDDFKLAVDNM 249 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-NTQDNEEGIAI 389 + T A+ A ++ KK +++TDG+ +++D + + Sbjct: 250 QYLGEGTYTATALHEANHMFEAARPG----------VKKVALVITDGQTDSRDEKNLTEV 299 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ-EKARYFLSNCASPN---SFFEANSTHELNKIFRD 443 +A + I I + E ++ A+ ++ + L + Sbjct: 300 VKRASDINVEIFVIGVAKKNDPNFEMFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 357 >gi|225543438|ref|NP_001139375.1| complement C2 isoform 2 preproprotein [Homo sapiens] Length = 620 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 141 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 197 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 198 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 256 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 316 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 317 FEHM------LDVSKLT 327 >gi|261855659|ref|YP_003262942.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836128|gb|ACX95895.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 625 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 17/94 (18%) Query: 364 NLEAKKYIVLLTDGENT----------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 N + I+LL+DG E + + +++ IR+ TIA ++ Sbjct: 131 NDGTDRNILLLSDGMVDISPEKAINVRAQEELLQKLVPQLRAEHIRVHTIAL-----SKD 185 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 LS A + F EA+S L + F Sbjct: 186 ADSKLLSQIAADTGGIFVEADSADALQRAFLKIF 219 >gi|194374835|dbj|BAG62532.1| unnamed protein product [Homo sapiens] Length = 620 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 141 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 197 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 198 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 256 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 316 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 317 FEHM------LDVSKLT 327 >gi|126031547|pdb|2ODP|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement gi|126031548|pdb|2ODQ|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement Length = 509 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 30 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 87 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 145 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 146 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 205 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 206 FEHM------LDVSKLT 216 >gi|118138230|pdb|2I6Q|A Chain A, Complement Component C2a gi|118138231|pdb|2I6S|A Chain A, Complement Component C2a Length = 517 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 38 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 94 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 95 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 153 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 154 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 213 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 214 FEHM------LDVSKLT 224 >gi|62896633|dbj|BAD96257.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|197098872|ref|NP_001126081.1| complement C2 [Pongo abelii] gi|55730287|emb|CAH91866.1| hypothetical protein [Pongo abelii] Length = 752 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDITEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIRELLNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|207079949|ref|NP_001128930.1| DKFZP469A1324 protein [Pongo abelii] gi|55726315|emb|CAH89929.1| hypothetical protein [Pongo abelii] Length = 740 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDITEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|34628|emb|CAA28169.1| unnamed protein product [Homo sapiens] Length = 752 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|268561224|ref|XP_002646394.1| Hypothetical protein CBG15363 [Caenorhabditis briggsae] gi|187027190|emb|CAP33690.1| hypothetical protein CBG_15363 [Caenorhabditis briggsae AF16] Length = 400 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 46/155 (29%), Gaps = 8/155 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 I T R+G +N + + F +T A Sbjct: 70 TRIGTNATEPRTTRVGLVTYNSVAKVNADL-NTFQSINDVYNGVFNYLSAVTDATDSYLA 128 Query: 343 M-QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A + + +S KK +++ + + + + ++ K G+ I+ Sbjct: 129 TGLQAANALFASQSFN----STRNHYKKVVIVYASEYKSYGELDPVKVADEMKGSGVYIV 184 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 T+A+ Q L+ A+P F + Sbjct: 185 TVAYDQGGNGQLLKD--LAGIATPGYSFSNTDDSD 217 >gi|15277207|dbj|BAB63292.1| C2 [Homo sapiens] Length = 577 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|14550407|ref|NP_000054.2| complement C2 isoform 1 preproprotein [Homo sapiens] gi|3915642|sp|P06681|CO2_HUMAN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|298124|gb|AAB97607.1| complement component C2 [Homo sapiens] gi|2347131|gb|AAB67975.1| complement component C2 [Homo sapiens] gi|28175369|gb|AAH43484.1| Complement component 2 [Homo sapiens] gi|33346923|gb|AAQ15273.1| complement component 2 [Homo sapiens] gi|55961814|emb|CAI17451.1| complement component 2 [Homo sapiens] gi|57209923|emb|CAI41858.1| complement component 2 [Homo sapiens] gi|119623954|gb|EAX03549.1| complement component 2, isoform CRA_b [Homo sapiens] gi|123857990|emb|CAM25860.1| complement component 2 [Homo sapiens] gi|168983782|emb|CAQ06833.1| complement component 2 [Homo sapiens] gi|168984416|emb|CAQ09272.1| complement component 2 [Homo sapiens] gi|168985077|emb|CAQ07481.1| complement component 2 [Homo sapiens] gi|168985955|emb|CAQ07111.1| complement component 2 [Homo sapiens] gi|189069137|dbj|BAG35475.1| unnamed protein product [Homo sapiens] Length = 752 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|156383259|ref|XP_001632752.1| predicted protein [Nematostella vectensis] gi|156219812|gb|EDO40689.1| predicted protein [Nematostella vectensis] Length = 157 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 43/153 (28%), Gaps = 18/153 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + L V R D +G F + F + N Sbjct: 23 FKRCLDFVKRIALSFDISASGTHIGIVTFATDPTVELEFDQSFDNTSIATIIDNIR--NP 80 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 T A++T + ++ + +++LTDG + +E + Sbjct: 81 DALTFTGKALETVKKDLFEKSQ--------RANVHRMLIVLTDGRSWDAVQEP---AKQL 129 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K G+ + + Q+ L + AS Sbjct: 130 KESGVTLYAVGVG-----QDYDLEQLKDIASNP 157 >gi|113953124|ref|YP_730426.1| structural toxin protein RtxA [Synechococcus sp. CC9311] gi|113880475|gb|ABI45433.1| structural toxin protein RtxA [Synechococcus sp. CC9311] Length = 2154 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/334 (8%), Positives = 80/334 (23%), Gaps = 19/334 (5%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + L + + S TF + + N + +++ T Sbjct: 849 VTETLNPTALRPGADELGSLTFTGINNGDDALIKSGGNTVSYYLINNGTELIASTKTLAE 908 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++L T + +Y + + Sbjct: 909 WNSLSPTDQADPNAADYVFTITLDPSQSNYTVKEYATIDGTTDTTLNYDVKLTDEDLDFV 968 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 D + + + A T + + + Sbjct: 969 DGDFEVTWAPSPAVLKVGENISDVASSTTPYKVDYDLVSKSDIIPTGSGTIEGTNGEDIL 1028 Query: 244 LYPGPLDPSLSEEHFVDS-------------SSLRHVIKKKHLVRDALASVIRSIKKIDN 290 + +++ + + + + ++ A ++ I + Sbjct: 1029 IGDVGGGSLVNQSINLSLVLDVSRSMILSNINFNNASVTRFSALQTATKDLLSEIAQS-- 1086 Query: 291 VNDTVRMGATFFNDRVISDP--SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 T ++ ++ +F+ G + + D G T + TA + Sbjct: 1087 -GATAKVQIVKYSTEGSDVGYYNFTSGDDQTVLNQAFQDIDDLQAGGGTNYEAGLVTALN 1145 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 I N + K ++ ++DGE + Sbjct: 1146 WISGGITSNTPLNVNQTDKDK-VIFISDGEPSFY 1178 >gi|73962941|ref|XP_547762.2| PREDICTED: similar to coagulation factor C homolog, cochlin precursor [Canis familiaris] Length = 847 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 43/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 709 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDSP------ 762 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG+ + D+ A GI I ++ L + AS Sbjct: 763 --NKNFLVIVTDGQ-SYDDVR--GPAAAAHDAGITIFSVGV------AWAPLDDLKDMAS 811 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 812 KPKESHAFFTREFTGLEPIVSDVI 835 >gi|116618630|ref|YP_819001.1| von Willebrand factor domain-containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] gi|116097477|gb|ABJ62628.1| von Willebrand factor (vWF) domain containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] Length = 920 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/306 (12%), Positives = 82/306 (26%), Gaps = 25/306 (8%) Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA----DRTVKSYSSQNGK 221 + I V+D S SM S + SQ Sbjct: 131 AKGNEVKNIKPIDIVLVVDMSGSMEPANNPSGSNRAQATRDGVKQFLQAIKDAGISQYVN 190 Query: 222 VGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL-VRDALAS 280 VG+ V+ + + +E+ S L +R Sbjct: 191 VGLVGFSSPGNYVTGSNGYLEVGMQSLSTTGQTEQINKTLSPTFSGGTFTQLGIRRGQKM 250 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + + G F+ +V + S + + D+++ GST+ Sbjct: 251 LDDDKNDHKKMMILLTDGVPTFSYKVTQATTISGTTYGTAFSS-----SDQDQPGSTSQF 305 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN---TQDNEEGIAICNKAKSQG 397 + + +++ + A+ Y + G N + ++ G Sbjct: 306 VSGRNNWESRKNRYISGTDPNGATTSAQNYGLYYNTGNNVKIWNTWAATLGEAKISRDSG 365 Query: 398 IRIMTIAFSVNKTQQEK------------ARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 I + + R S AS + +ANS ++ +++ Sbjct: 366 TEIHALGIQLGDDNNNNYWDSPLYLSSNNVRSRASLIASSGLYQDANSASDVETYLKNQA 425 Query: 446 GNEIFE 451 + + + Sbjct: 426 KDVLSQ 431 >gi|310119165|ref|XP_003118918.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] gi|310126588|ref|XP_003120448.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 535 Score = 42.3 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 34/99 (34%), Gaps = 10/99 (10%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + +Y++++TDG+++ ++ G+ I I +E L Sbjct: 6 ADTGRINVARYVIVITDGKSSDS---VAEAAEGLRANGVNIYAIGI------REANIDEL 56 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A + F L I ++ + + + +K Sbjct: 57 KEIA-KDKIFFVYEFDLLKDIQKEVVQDICSSEACKNSK 94 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 12/126 (9%), Positives = 32/126 (25%), Gaps = 15/126 (11%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + + T A+ ++Y++++TDG Sbjct: 124 DIYRAIFDVQQMRDGTRTGKALNFTL-------PFFDSSKGGRPSVQQYLIVITDGVAQD 176 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + I I + K L + + + L + Sbjct: 177 NVIIP---AKALRDKNTIIFAIGVG-----EAKKSQLLEITNDEDKVYHDVNFEALQNLE 228 Query: 442 RDRIGN 447 ++ + Sbjct: 229 KEILSK 234 >gi|332823604|ref|XP_003311225.1| PREDICTED: complement C2 isoform 2 [Pan troglodytes] Length = 620 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 141 FKESASLMVDRIFSFE---INVSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYK 197 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 198 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 256 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 316 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 317 FEHM------LDVSKLT 327 >gi|332823602|ref|XP_003311224.1| PREDICTED: complement C2 isoform 1 [Pan troglodytes] Length = 752 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|62897125|dbj|BAD96503.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|38257345|sp|Q8SQ74|CO2_PANTR RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110330|gb|AAL82821.1| complement C2 [Pan troglodytes] Length = 752 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 273 FKESASLMVDRIFSFE---INVSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYK 329 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 330 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 388 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 448 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 449 FEHM------LDVSKLT 459 >gi|291279259|ref|YP_003496094.1| hypothetical protein DEFDS_0862 [Deferribacter desulfuricans SSM1] gi|290753961|dbj|BAI80338.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1] Length = 1216 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/394 (8%), Positives = 93/394 (23%), Gaps = 41/394 (10%) Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + + +Y + NF + + ++ + Sbjct: 109 NKKYSYDDDNDYFQEKEDGEWSGNFLNWLTTRRIDAVK-KVLTGGKYIIEDGNILLTATD 167 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N F + + + T +K +F + + L Sbjct: 168 NFDRDFRKILENNVLDLTGLNNGYT---DNNKGAYFYFYNDNNFVYFDVRPATWNNNCWL 224 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPY---MVSCNKSLYYMLYPGPLDPSLSEEHF 258 R +Q + + Y + + + + G + + Sbjct: 225 IDNNPTTFRVALKVETQPTGIIQKTWDKVRYGLFTFNHDDGGHLVSDIGSKYDNSTFILD 284 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +++ + + +A K ++ + + G Sbjct: 285 INNITPSTWTPLGETLYEATLYFKG--DKTYYNGTFTYDSPIQYSCQQNYVLILTDGEST 342 Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + + + + + + N + + T Sbjct: 343 MDKNFPS-KLDSNDPNSTIDNLLDIIEGNEGFTDNELHNDLNDSNGSYSLAAVSYWTHTT 401 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTI-AFSVNKTQQEKARYFLSNCA-------------- 423 + +++ G Q + I TI AF ++ ++ + + Sbjct: 402 DLREDLTGT--------QKLNIFTIFAFDDSENAKDLLKLSAKYGSFADNNFNNLPDNSN 453 Query: 424 --------SPNSFFEANSTHELNKIFRDRIGNEI 449 P+++FEAN+ +L I + Sbjct: 454 EWDTNNDNIPDNYFEANNATDLETSLSTTIARIL 487 >gi|264679151|ref|YP_003279058.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] gi|262209664|gb|ACY33762.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] Length = 1405 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 44/364 (12%), Positives = 92/364 (25%), Gaps = 19/364 (5%) Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 N E+ V + + + SA +L+ + R + + Sbjct: 546 NIEDYNLAAPGAISVSEALASGSGTFTITASAGLKLLTLDGPGNADASLTLDRLADLVNN 605 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 I T++ + + + + S D+ Q K+ Sbjct: 606 PITLTTSKGTLTLTGYDATTGKVSYTYQTSGQQAHTGDDTNVQDHFQITVEDKFGGKATG 665 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH-VIKKKHLVR 275 + L P S S + LD S S ++ + + +++ Sbjct: 666 DLGVLITDTAPSLKPIAESSALSSHGTNIMLTLDTSGSMAWSSGVNNSNGWSLSRLDVLK 725 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 ++ ++ + VR+ FN G + KTF G Sbjct: 726 SSVNGLLDKYGEA----GDVRVLILEFNSSATQK-----GSGWMSLAEAKTFVNGLYADG 776 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK--- 392 T DA+ A ++ + + TDGE + + Sbjct: 777 GTNYQDALTKAM----AAWNNSGTGKLEGNNVQNISYFFTDGEPDSNRSVSSSQQTTWEK 832 Query: 393 -AKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNEIF 450 I I T + + ++ + + L+ I I Sbjct: 833 FLADNHINSYGIGLGTGATGGYIDPISYNPERPADSNTILVKNLNGLDAAIDGTIAPPIN 892 Query: 451 ERVI 454 + Sbjct: 893 GDIA 896 >gi|25244377|gb|AAN72414.1| complement component C2 [Rattus norvegicus] Length = 758 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 46/154 (29%), Gaps = 16/154 (10%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V +++ ++ D T +A+ + Y + + + E + I+ Sbjct: 319 SQNVMEVMDSLDSVCYKDHENATGTNTYEALNSVYLMMNNQMDRLGMETSAWQEIRHAII 378 Query: 373 LLTDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCA 423 LLTDG++ + + GI I I ++ S Sbjct: 379 LLTDGKSNMGGS-PKPAVDNIREILGISRNRNDYLDIYAIGVGKLDVDWKELNELGSKKD 437 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F + ++F V ++T Sbjct: 438 GERHAFILQDAKAVQQVFEHI------LDVSKLT 465 >gi|73972314|ref|XP_860410.1| PREDICTED: similar to complement component 2 precursor isoform 5 [Canis familiaris] Length = 748 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 57/198 (28%), Gaps = 23/198 (11%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFA 328 + +++ ++ I + V + F + S ++I ++ K Sbjct: 271 VFKESAILMVDRIFSFE---INVSVAIITFASKPKIIMSVLNDNSRDATEVINSLNKVNY 327 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D T A+ + + + + + + E + I+LLTDG++ Sbjct: 328 KDHENGTGTNTYAALNSVHIMMNNQMDRLGMKTAAWQEIRHAIILLTDGKSNMGGS-PKL 386 Query: 389 ICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + K I I I + S F T L + Sbjct: 387 AVDNIKEILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTEALYQ 446 Query: 440 IFRDRIGNEIFERVIRIT 457 +F + ++T Sbjct: 447 VFEHM------LDISQLT 458 >gi|73972312|ref|XP_860371.1| PREDICTED: similar to complement component 2 precursor isoform 4 [Canis familiaris] Length = 682 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 57/198 (28%), Gaps = 23/198 (11%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFA 328 + +++ ++ I + V + F + S ++I ++ K Sbjct: 205 VFKESAILMVDRIFSFE---INVSVAIITFASKPKIIMSVLNDNSRDATEVINSLNKVNY 261 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D T A+ + + + + + + E + I+LLTDG++ Sbjct: 262 KDHENGTGTNTYAALNSVHIMMNNQMDRLGMKTAAWQEIRHAIILLTDGKSNMGGS-PKL 320 Query: 389 ICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + K I I I + S F T L + Sbjct: 321 AVDNIKEILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTEALYQ 380 Query: 440 IFRDRIGNEIFERVIRIT 457 +F + ++T Sbjct: 381 VFEHM------LDISQLT 392 >gi|47575877|ref|NP_757376.2| complement C2 [Rattus norvegicus] gi|46237595|emb|CAE83973.1| complement component 2 [Rattus norvegicus] gi|47477793|gb|AAH70923.1| Complement component 2 [Rattus norvegicus] Length = 758 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 46/154 (29%), Gaps = 16/154 (10%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 S V +++ ++ D T +A+ + Y + + + E + I+ Sbjct: 319 SQNVMEVMDSLDSVCYKDHENATGTNTYEALNSVYLMMNNQMDRLGMETSAWQEIRHAII 378 Query: 373 LLTDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCA 423 LLTDG++ + + GI I I ++ S Sbjct: 379 LLTDGKSNMGGS-PKPAVDNIREILGISRNRNDYLDIYAIGVGKLDVDWKELNELGSKKD 437 Query: 424 SPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F + ++F V ++T Sbjct: 438 GERHAFILQDAKAVQQVFEHI------LDVSKLT 465 >gi|301788516|ref|XP_002929674.1| PREDICTED: complement C2-like isoform 3 [Ailuropoda melanoleuca] Length = 617 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 57/202 (28%), Gaps = 23/202 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ ++ I + V + F + S + + + ++ + Sbjct: 136 DDFQIFKESAILMVDRIFSFE---INVSVAIITFASKPQVVMSVLYDNSRDVTEVINSLN 192 Query: 329 I----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D T A+ + + + + + + E + I+LLTDG++ Sbjct: 193 NINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAIILLTDGKSNMGGS 252 Query: 385 EGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + I I I + S F T Sbjct: 253 -PKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTE 311 Query: 436 ELNKIFRDRIGNEIFERVIRIT 457 L ++F V ++T Sbjct: 312 ALYQVFEHM------LDVSQLT 327 >gi|301788514|ref|XP_002929673.1| PREDICTED: complement C2-like isoform 2 [Ailuropoda melanoleuca] Length = 749 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 57/202 (28%), Gaps = 23/202 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ ++ I + V + F + S + + + ++ + Sbjct: 268 DDFQIFKESAILMVDRIFSFE---INVSVAIITFASKPQVVMSVLYDNSRDVTEVINSLN 324 Query: 329 I----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D T A+ + + + + + + E + I+LLTDG++ Sbjct: 325 NINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAIILLTDGKSNMGGS 384 Query: 385 EGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + I I I + S F T Sbjct: 385 -PKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTE 443 Query: 436 ELNKIFRDRIGNEIFERVIRIT 457 L ++F V ++T Sbjct: 444 ALYQVFEHM------LDVSQLT 459 >gi|301788512|ref|XP_002929672.1| PREDICTED: complement C2-like isoform 1 [Ailuropoda melanoleuca] gi|281345620|gb|EFB21204.1| hypothetical protein PANDA_019912 [Ailuropoda melanoleuca] Length = 748 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 57/202 (28%), Gaps = 23/202 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +++ ++ I + V + F + S + + + ++ + Sbjct: 267 DDFQIFKESAILMVDRIFSFE---INVSVAIITFASKPQVVMSVLYDNSRDVTEVINSLN 323 Query: 329 I----DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 D T A+ + + + + + + E + I+LLTDG++ Sbjct: 324 NINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAIILLTDGKSNMGGS 383 Query: 385 EGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + I I I + S F T Sbjct: 384 -PKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTE 442 Query: 436 ELNKIFRDRIGNEIFERVIRIT 457 L ++F V ++T Sbjct: 443 ALYQVFEHM------LDVSQLT 458 >gi|256071156|ref|XP_002571907.1| dihydropyridine-sensitive l-type calcium channel [Schistosoma mansoni] gi|238657056|emb|CAZ28137.1| dihydropyridine-sensitive l-type calcium channel, putative [Schistosoma mansoni] Length = 421 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 55/213 (25%), Gaps = 15/213 (7%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 V + +D+S L + +I ++ Sbjct: 195 FPWRQQNVDMFDVRRRSWFIQGSSVPKDLFILLDTSGSMTGQS-LKLANLSAQKLIEALD 253 Query: 287 KID-------NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK--TFAIDENEMGST 337 D + F+ V R ++ G + Sbjct: 254 VDDYFTVAHFPGAKDHVAPMIVTANNESEPICFNSFVQATRRNKLRLFYDLSTLKARGYS 313 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +++ AY+ + E L K +VLLTD D + K Q Sbjct: 314 DFPASLKFAYEMFRNLTESARGDRGKELRN-KILVLLTDNAFVFDESVLSQL----KQQK 368 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 I T +S+ + + + CA+ + + Sbjct: 369 SNITTFIYSLGEPVGAAYEHKMKACATNDYYQY 401 >gi|219125320|ref|XP_002182931.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405725|gb|EEC45667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 523 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 17/202 (8%), Positives = 47/202 (23%), Gaps = 57/202 (28%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R G F + + + K + G T ++ A+ A + Sbjct: 102 QDRFGLISFGSDARVEFP-AQAMSKQNKASALQKIQSLTTRGCTNMSAALGLAVQEL--- 157 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS------------------ 395 ++ + + LTDG + + + + ++ Sbjct: 158 ------KIIEKSNPVRSLFFLTDGLANEGISDLDGLVSLTRNCLLPSDNPSNVLNSEVMI 211 Query: 396 ---------------------------QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 I + T + + L++ ++ Sbjct: 212 AECLDDLATSQHQITRLPVAEIESVCRAPITLHTFGYGRDHNAALLES--LADTTQGGAY 269 Query: 429 FEANSTHELNKIFRDRIGNEIF 450 + + F + +G + Sbjct: 270 YFIEDDSNVGSAFGNALGGIMS 291 >gi|184199785|ref|YP_001853992.1| hypothetical protein KRH_01390 [Kocuria rhizophila DC2201] gi|183580015|dbj|BAG28486.1| hypothetical protein [Kocuria rhizophila DC2201] Length = 455 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 29/259 (11%), Positives = 73/259 (28%), Gaps = 33/259 (12%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP-----SLSEEHFVDS 261 + G R +P + + +P ++ + + Sbjct: 209 DYRSRHADTTLVASTPGDRPASEAPTADDAALTQALGQWQHLAEPFHALVAIDVSGSMGT 268 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 +L + L + A + + + D + + S + Sbjct: 269 KALPDGSTRMDLTKAAATTAVGLFPEHDALGLWTFERHLDGDKDYRSVTPVRELSASVDG 328 Query: 322 TIVKTFAI------DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + G T + D AY ++ + + +++L+ Sbjct: 329 GTQRDQLSQDVQSLTFSPDGYTGLYDTTLAAYRQVL---------HDDAPGHLRTVIVLS 379 Query: 376 DGENTQDNE-EGIAICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNS 427 DG N + + + K+ +RI+T+ ++ L A + S Sbjct: 380 DGMNHDPDSIALDELLSTLKAEQDAENPVRIITVGV-----SKDADATVLRQIAEATGGS 434 Query: 428 FFEANSTHELNKIFRDRIG 446 A + ++ K+F D + Sbjct: 435 SHVARTPQDIQKVFVDALT 453 >gi|38639545|ref|NP_943314.1| TerY [Klebsiella pneumoniae] gi|38016643|gb|AAR07664.1| TerY [Klebsiella pneumoniae] Length = 212 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 54/152 (35%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + + + Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTARQAVPLTDLLSFNLPS-------- 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ + I + + K + + L+TDG T D +G+ Sbjct: 73 FSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LVFLMTDGGPTDDWRKGLNEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 AK +G+ + + L Sbjct: 131 KAAK-KGVVVAC------AAGHDADTAVLKEI 155 >gi|319654932|ref|ZP_08009006.1| hypothetical protein HMPREF1013_05628 [Bacillus sp. 2_A_57_CT2] gi|317393357|gb|EFV74121.1| hypothetical protein HMPREF1013_05628 [Bacillus sp. 2_A_57_CT2] Length = 245 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 32/72 (44%), Gaps = 2/72 (2%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 K I+L+TDG + Q + +A+ AK QGI + I E+ + A S Sbjct: 7 KQILLITDGCSNQGE-DPVAMAALAKEQGITVNVIGVMEQDVIDEQGMNEIEGIAMSGGG 65 Query: 428 FFEANSTHELNK 439 + + +L++ Sbjct: 66 VSQVVYSQQLSQ 77 >gi|75674505|ref|YP_316926.1| Flp pilus assembly protein TadG [Nitrobacter winogradskyi Nb-255] gi|74419375|gb|ABA03574.1| Flp pilus assembly protein TadG [Nitrobacter winogradskyi Nb-255] Length = 242 Score = 42.3 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 14/174 (8%), Positives = 42/174 (24%), Gaps = 8/174 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + L + G A+++P+ML + ++V A + Sbjct: 46 MWSCLKDMASALRRDTRGLAATEFAMIVPLMLVMLFGTIEVSSGV--------AVNRKVT 97 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + L + T + ++ ++ + V+ Sbjct: 98 LVARTLSDLTSQSKVVNDADVTNFLAASYGIMWPYSSAPVQATISELYIDPATSVARVQW 157 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + K+ + L + + + + + Y Sbjct: 158 SKGKAPRGAGSTVSIPSGLIGRDSSGKVLPNQYLIFSEVSYIYEPVLGYVMSKA 211 >gi|253576546|ref|ZP_04853874.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251843960|gb|EES71980.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 737 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 8/52 (15%), Positives = 20/52 (38%), Gaps = 3/52 (5%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHA---LKQAAQTAIITA 64 +S G + ++ + + +D R + L AA ++++A Sbjct: 8 RSEQGAVTVFFIVIFAAVFAFVALFIDFARMFALQAQVETLAHAASRSVLSA 59 >gi|198436525|ref|XP_002124218.1| PREDICTED: similar to laminin, beta 2 [Ciona intestinalis] Length = 671 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 24/54 (44%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +AK+ ++LLTDG+ T + I + + +G+ I + + + Sbjct: 140 GPKNRYPDAKRALILLTDGKATDSSYLPDVIQHAERDRGVIIYAVGVGDFDSSE 193 >gi|17537921|ref|NP_496259.1| C-type LECtin family member (clec-60) [Caenorhabditis elegans] gi|3881710|emb|CAA88985.1| C. elegans protein ZK666.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 406 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 56/177 (31%), Gaps = 10/177 (5%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK-TF 327 ++ S I + T R+G +N + Sbjct: 60 TSIAANIASVVSSGTRIGTNPSEPRTTRLGLVTYNKAAAIQADLNQYQSLDDVYDNVFRA 119 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + + + + A D + + + N ++ +++ + + Sbjct: 120 LSSVSTSEESYLANGLARAEDVLEAGKQG-----YNRTHYQRVVIVYASAYKGSGALDPV 174 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + K+ G+ ++T+A+ + A L+ ASP F +T + ++ + Sbjct: 175 PVAERLKTSGVTVITVAYDQDGDGALLAD--LAKIASPPYNF--TNTEDNGQVIGEI 227 >gi|260837103|ref|XP_002613545.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] gi|229298930|gb|EEN69554.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] Length = 184 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 41/128 (32%), Gaps = 11/128 (8%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 V + D ++ + + + + ++ + G T Sbjct: 68 NFVATTTSDFQIGPDNAQVSVVQYESSPTEEFPL--DRYATLEDLLSAINLIPYRGGGTR 125 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ T ++ + + K I+++TDG++ D E +AK GI Sbjct: 126 TGRAIDYVVTTTLTVSRGARQGV------PKVIIVVTDGQSGDDVREP---ARRAKQSGI 176 Query: 399 RIMTIAFS 406 ++ I Sbjct: 177 IMVAIGVG 184 >gi|254443704|ref|ZP_05057180.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258012|gb|EDY82320.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 344 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 51/155 (32%), Gaps = 22/155 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + + S++ +++ F S L R +K Sbjct: 113 SRLDRAKLVVESMLDTLEGESVG-------LVVFAGTAFLQSPMSPDYQIL-RGFLKEIN 164 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T + TA D+ +++ A +++++++DGE + A Sbjct: 165 PSFIPQGGTNYEAMLTTALDSF----------EQSDGMADRFLIIISDGE--SLDSNWKA 212 Query: 389 ICNKAKSQGIRIMTIAFSV--NKTQQEKARYFLSN 421 + K Q +R + + F + A +L + Sbjct: 213 KAEELKEQNVRAICLGFGTKEGSFIPDGAGGYLKD 247 >gi|3273285|dbj|BAA31186.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273297|dbj|BAA31192.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273289|dbj|BAA31188.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273259|dbj|BAA31173.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273255|dbj|BAA31171.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|328949411|ref|YP_004366747.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328449735|gb|AEB15450.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 385 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 10/114 (8%), Positives = 42/114 (36%), Gaps = 1/114 (0%) Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + + S + + +++ +K +++L+DG + + + G+ + T Sbjct: 159 LSEGQPRLYDSIMNLIRKVEQKKGKRKVVIILSDGRDQNSRFSKDQLIETLSNSGLPVYT 218 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + V Q +S + +++ + ++ + + ++++ Sbjct: 219 VGMKVLSNQSLSNLDEISQL-TGGTYYYSTRFKDIPDNLKKVVDCIKQSYIVKL 271 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 34/336 (10%), Positives = 73/336 (21%), Gaps = 17/336 (5%) Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + +L + P + + + + + +V D Sbjct: 22 STNLASASNNVTVNLSVTPSQSVVILPTTSNAKASLNLMLTPTGNPQTERDPIDLVFVFD 81 Query: 185 FSRSM----LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 S SM A ++ + + + V Sbjct: 82 KSGSMDFKVASNSSVKRIDSAKSAMTNALMFFDGQNTSDRFGFVPFSSNANTDVVSLTDS 141 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + L H D + + S D + G Sbjct: 142 SGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTEALDVASKLFDS-SSKDKNIIFLTDGTP 200 Query: 301 FFNDRVISDPSFS-WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+ S WG + D T + + Sbjct: 201 TFSFSDEKVYYKSYWGSSSGNDRVEYQSYTDL-----TYQFTETTKYWYKMGDYFYKNNT 255 Query: 360 RMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + N T N + + + S GI++ T+ F + A Sbjct: 256 KYNMNTTYNPNGNATTYKNNFEKLVKQNDKKVVQSLSSSGIKLYTLGFGDSIDGDYLAEL 315 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 + S+ A ++N++ + +V Sbjct: 316 ---ANLTGGSYKNAI-GQDINEVLMNISEEVAAPKV 347 >gi|156349150|ref|XP_001621939.1| predicted protein [Nematostella vectensis] gi|156208297|gb|EDO29839.1| predicted protein [Nematostella vectensis] Length = 147 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 18/110 (16%), Positives = 36/110 (32%), Gaps = 18/110 (16%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 L AI G T I +A++ + ++ V + ++++T Sbjct: 32 YSDLNSAKFAVDAIQI-SNGGTRIGEALKLVKTDMFNTARSNVP---------RMLLVMT 81 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 DG ++ D + G+ I+T+ + L AS Sbjct: 82 DGRSSDDVVAPS---RALRDIGVTILTLGLGSDYD-----LDQLKMIASG 123 >gi|149773093|emb|CAO01896.1| collagen type VI alpha 6 [Mus musculus] Length = 1120 Score = 42.3 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ D + VR+G F+D S+ + + Sbjct: 24 PDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQ 83 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T I DA++ + N + +++LTDG + +E Sbjct: 84 IF--GYTHIGDALRKVKYYFQP-----DMGSRINAGTPQVLLVLTDG---RSQDEVAQAA 133 Query: 391 NKAKSQGIRIMTIAFSVNKTQQ 412 + + +G+ I ++ Q+ Sbjct: 134 EELRHKGVDIYSVGIGDVDDQE 155 >gi|326407579|gb|ADZ64650.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 1433 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 42/457 (9%), Positives = 118/457 (25%), Gaps = 86/457 (18%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A + S+ ++ + + + + + F + +N + Sbjct: 121 RANNSISLLNLEIQADAKLIENDQEVISEDILAKNESTIFSLYIPENNKADSKE--KDNK 178 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 E + Q S+ + + GIK+ A+ +++ Sbjct: 179 NTEEVLNNESSQEETVSQLKKDSQLAFSYPSNFGIKA-SFNDLAQNYENISPEYRQDETG 237 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS---YSSQNGKVGIRDEKLSPYMVS 235 I FS + V + ++ + L ++ Sbjct: 238 ISRWNSFSSQWDGVNSWNGEATNLQNSYIEYAGVNNPVDFALRKYAKETETPGLYDVYLN 297 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 ++ + P + + + + K+ VR ++ + +I+ D V Sbjct: 298 VRGNVQNPIKPVDIVLVIDMSGSMQGA-------KETAVRQGVSDFLSTIQNTA-YADYV 349 Query: 296 RMGATFFNDRVISDPSFS-------WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 +G ++ S V A+ G T ++ + Sbjct: 350 NVGIVGYSSPGNYVTGASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQLGLRKGTE 409 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------------------------- 383 + +++ + +K ++L+TDG T Sbjct: 410 ML----------EQDSSDNQKMMILMTDGVPTFSYKVNSASKVDNVIYGQSFAESRDEPG 459 Query: 384 --------------------------EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 + +K + I T+ + ++ Sbjct: 460 NTSKIQSPYSVKDINGGSNIEIRDTWAATLGEAEISKQEISEIHTLGIQLGNDGSYLSQE 519 Query: 418 FLSN----CASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + A+ + +ANS +++ +++ + Sbjct: 520 EVKSRTSLIATTGLYQDANSANDITDYLKNQANVVLS 556 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 63/195 (32%), Gaps = 28/195 (14%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + S+S + ++++ + +++D ++R +K I NV+ + + + Sbjct: 200 SFVFDKSGSMSWDLNGNNTNYWGPKSRMSILKDKATIMMRDLKDIGNVSVNLVSFSILGS 259 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 G T ++ G T D ++ ++ N Sbjct: 260 YVQKDFSELDKGT-----TTIEASINALQTGGVTNPGDGLRYGMMSLQ-----------N 303 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM------TIA----FSVNKTQQE 413 + KY+VLLTDG + +++ ++ T+ F+ Sbjct: 304 HSAQLKYVVLLTDGIPNAYTVDTND--TSWRNRNVQPYYNRWRETVGELVTFNNGPYDVT 361 Query: 414 KARYFLSNCASPNSF 428 N S +S+ Sbjct: 362 TNLTTDQNRVSYDSY 376 >gi|126730252|ref|ZP_01746063.1| hypothetical protein SSE37_10869 [Sagittula stellata E-37] gi|126708985|gb|EBA08040.1| hypothetical protein SSE37_10869 [Sagittula stellata E-37] Length = 177 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 10/60 (16%), Positives = 24/60 (40%), Gaps = 1/60 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 +F + + ++ G + AL MP+ L + V++ + L++A + Sbjct: 1 MFRPQL-SFLRRFRAEDDGSMVVPIALWMPIFLLLIISSVELGTITVRSTVLERALDQTV 59 >gi|145502983|ref|XP_001437469.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404619|emb|CAK70072.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 38/113 (33%), Gaps = 14/113 (12%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + +F + F+ +K I+ D N GST DA+ A + ++ Sbjct: 162 NHIVKLVWFGSTLFDKCEFTSDFNKFIK-----LVDDANPGGSTKCYDAIDYAINKLLEV 216 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI--MTIA 404 + + I+ LTDGE+ + + + I I + Sbjct: 217 KQKYPDIVLR-------ILALTDGEDNASASKPNTLVQRIFDHKIIIDSFVVG 262 >gi|33333558|gb|AAQ11895.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|160718|gb|AAA29774.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|160723|gb|AAA29776.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|226823202|ref|NP_032425.2| integrin alpha-E isoform 1 [Mus musculus] gi|56206400|emb|CAI24788.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|148680762|gb|EDL12709.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|88702489|gb|ABD49099.1| integrin alpha E [Mus musculus] Length = 1167 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|108758893|ref|YP_632824.1| pilus biogenesis protein [Myxococcus xanthus DK 1622] gi|108462773|gb|ABF87958.1| pilus biogenesis protein, TadE family [Myxococcus xanthus DK 1622] Length = 397 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 17/147 (11%), Positives = 40/147 (27%), Gaps = 18/147 (12%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + ++ G + AL +P+++ + + + + L++AA+ A+ Sbjct: 1 MRARL-----QMRSRSRGAATVEFALSVPLLVMILMFSMYLTELVRAKLKLQEAARYAV- 54 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + F K ++ K+ + D V Sbjct: 55 ------------WEMTSYALSDFANGKHDDAFEDARREAHKEFVERYKDMDSVEPNGTGG 102 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLR 149 N V + + S L Sbjct: 103 NFIARYTDVTATISNKEIALLESGMLS 129 >gi|25742632|ref|NP_113956.1| integrin, alpha E [Rattus norvegicus] gi|3236342|gb|AAC23662.1| integrin alpha E1 [Rattus norvegicus] gi|187466179|emb|CAQ51889.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|74150598|dbj|BAE32321.1| unnamed protein product [Mus musculus] Length = 1166 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 271 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 324 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 325 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 368 >gi|3273247|dbj|BAA31167.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 55/184 (29%), Gaps = 12/184 (6%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ + E ++ + V+ A+ ++ I++++ Sbjct: 23 VNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWVKHAVPLAMKLIQQLNLN 82 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYD 348 + + + A F++ K I+K+ G T ++DA+ Sbjct: 83 ENAIHLYANDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRK 142 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + N A + +V+LTDG + K +G++I Sbjct: 143 HLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQG 193 Query: 409 KTQQ 412 Sbjct: 194 INVA 197 >gi|195978918|ref|YP_002124162.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975623|gb|ACG63149.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 967 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 50/166 (30%), Gaps = 34/166 (20%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 + + ++ + + + + + T + A+ A + + +D Sbjct: 491 IIYQPSINNNKDADVLKNWESSSALNRDDLSYKDKNGTNYHAALVKADEMLNKVADDG-- 548 Query: 360 RMKNNLEAKKYIVLLTDGENTQD-------NEEGIAICNKAKSQ---------------- 396 +K +V ++DG T G + + KS Sbjct: 549 -------HRKIMVFVSDGVPTFYFGSDHYRAGNGTSDASNIKSSQDGTRAAIDDFKKKHP 601 Query: 397 GIRIMTIAFSVNKTQQEKARY--FLSNCASPNSFFEANSTHELNKI 440 + I ++ S + + L + + ++ +T EL KI Sbjct: 602 NLSIYSLGVSKDINSDTASSSPVVLKYLSGEDHYYGITNTVELEKI 647 >gi|194221347|ref|XP_001494879.2| PREDICTED: similar to alpha 1 type VII collagen [Equus caballus] Length = 3065 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 43/142 (30%), Gaps = 21/142 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR A ++D ++ ++ + G+T A+ D + Sbjct: 77 VRFAAVQYSDDPRTEFGLDALGSGGD--VIHAIRELSYKGGNTRTGAAILHVADHVFLPQ 134 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + + K QG+++ + + Sbjct: 135 LARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI------KNA 178 Query: 415 ARYFLSNCAS---PNSFFEANS 433 L AS + FF N Sbjct: 179 DHEELKRVASQPTSDFFFFVND 200 >gi|73695956|gb|AAZ80784.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 66/183 (36%), Gaps = 17/183 (9%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 DSSS K ++ + + D VR+ + + + F+ + Sbjct: 54 DSSSSLRNRDFKTAIKFLQEFLSQYEISSDPNG--VRVSIISYGKGIYPEIGFNLTTYDT 111 Query: 320 IRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +++ ++ G T A+Q ++ +++ K +++TDG Sbjct: 112 KDEVIEAIGKIPHKAGLRTDTGRAIQYMHEAQLANGVVRPGV-------TKVSIVITDG- 163 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHEL 437 N+Q+ + +A+ I + I + E L N A + + ++ ++L Sbjct: 164 NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSE-----LLNIAGDQSRVTKVDNYNQL 218 Query: 438 NKI 440 + I Sbjct: 219 SSI 221 >gi|198421589|ref|XP_002123523.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 80/296 (27%), Gaps = 12/296 (4%) Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 + VS S + + S + + + + C Sbjct: 107 NGVTFVHRKNCSIVSTSQETSVAGNPLQALTMSM-VGASWDEGQSNLVVCNNMWTRNCKD 165 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 +Y ++ + + VD S + Sbjct: 166 AYKTEGVCYVTNTTSELNVLNRPKWKQLSPCKRECYRKMIDFMFVVDGSRSVGNES-FEV 224 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS-DPSFSWGVHKLIRTIVKTFAIDEN 332 V+ + V + V++G ++ + D Sbjct: 225 VKHWIQQVTSGFDISSS----VQVGVVQYSTYQYRKVVQPFIKTEIRLGEYKDHILFDAA 280 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 + + TAY + NED M ++++ +VLLTDG+ + D E+ + + Sbjct: 281 VDKIKYHDRSTFTAYAIRKTVNEDFKGNMSRYPDSRRVMVLLTDGQ-STDKEDLSSAAAE 339 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 AK +G+ + E SP+ N +EL I G+ Sbjct: 340 AKQEGVETFAVGVGSKIILSELVLIA----GSPDKVITVNDFNELLGIVNQLQGDI 391 >gi|194385074|dbj|BAG60943.1| unnamed protein product [Homo sapiens] Length = 506 Score = 41.9 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 16/152 (10%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++I ++ D T A+ + Y + + E + I+LL Sbjct: 69 DMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILL 128 Query: 375 TDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASP 425 TDG++ + + I I I + S Sbjct: 129 TDGKSNMGGS-PKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGE 187 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F T L+++F V ++T Sbjct: 188 RHAFILQDTKALHQVFEHM------LDVSKLT 213 >gi|332705169|ref|ZP_08425251.1| hypothetical protein LYNGBM3L_03330 [Lyngbya majuscula 3L] gi|332356119|gb|EGJ35577.1| hypothetical protein LYNGBM3L_03330 [Lyngbya majuscula 3L] Length = 571 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 85/305 (27%), Gaps = 32/305 (10%) Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV-SRSYHKEHGVSIQWVID 184 S Y+ + + S + ++ +A S ++ Sbjct: 266 GSVYESSVIAANSHQQAGQSRYQAVYPQATFTSNMRAILPKAPWVSRQEKAAGQQVIDYL 325 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 S + + +P K + R + LS ++ Sbjct: 326 RSPQVQKIATELGLRPGV---PEVPLGAKFSAQFGVDSKARYDSLSSPQPEVVAAMVKSW 382 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 PS S S+ K V+ L + I S+ + ++ F+ Sbjct: 383 QEFAKKPSQVVVLVNSSGSMSG--NKLAAVQKTLRAYIESLGPTE------QIALIDFDS 434 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 + K R + F ++ N G + DA A + + + ++ Sbjct: 435 EIRPPVLVDSTPQKRDRAM--AFILNLNPDGGNRLYDAALEARNWLQKNYQERAINA--- 489 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG------IRIMTIAFSVNKTQQEKARYF 418 +V+LTDG+++ + + + G I TI + + E Sbjct: 490 ------VVILTDGDDSDSKLRLEQLSQELEKSGFSSDNRIAFFTIGYG---NEGEFNPKV 540 Query: 419 LSNCA 423 L A Sbjct: 541 LEQIA 545 >gi|310640817|ref|YP_003945575.1| protein [Paenibacillus polymyxa SC2] gi|309245767|gb|ADO55334.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 738 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 16/152 (10%), Positives = 41/152 (26%), Gaps = 2/152 (1%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 +G + L++ VM G + +D R + ++ A+ + +L++ Sbjct: 12 ESGAVTVFLILILAVMFGFIAVFIDYARIAALHVQTERLTHAAVRSVMSAYDPALQQEYG 71 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR--DIVRDTAVEMNPRKSAYQVVLSS 135 +Q + + L + + D + + + + Sbjct: 72 LFAYGEGGGEQIMVKVLNDSARRTARAETLPLMNMKLDSSSLQMERELGKYAIFNEQIRE 131 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 +K K + TV Sbjct: 132 EMKYKAPVDFTMEVLEKLKPLSQNMKEASHTV 163 >gi|156147106|gb|ABU53697.1| CnPolydom [Hydractinia symbiolongicarpus] Length = 551 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 49/157 (31%), Gaps = 17/157 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRT-IVKTFAIDENEMGS-TAINDAMQTAYDT 349 ++ R+ F+ + ++ G T I ++Q AY+ Sbjct: 35 SNATRIAIGTFSSDHRINFNYILNPSYANTKCKFNDDFKKIKFDGFMTNIKGSLQDAYNV 94 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKA------KSQG-IRIM 401 S + D V + K ++LLTDG + + A K G + + Sbjct: 95 FRSLDSDPVTHSRRPRSN-KVVILLTDGVGNMVGNRVDSAGADGAPEALRLKQTGYVELY 153 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFF-EANSTHEL 437 T+ + L A+ S F + +L Sbjct: 154 TVGV-----THATDQNMLKKIATDPSLFLYSKDFTDL 185 >gi|160712|gb|AAA29771.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 42/138 (30%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + D + + + A+ F++ K I+K+ Sbjct: 72 AMKLIQQLNLND---NAIHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLEVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 69/200 (34%), Gaps = 27/200 (13%) Query: 262 SSLRHVIKKKHLVRDA----LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + + K + A + ++ + + V G +D + Sbjct: 50 AKDPSGLTKIEAAKKALTGVVGALPDTAQVGLRVYGATVDGKGKPTPAACADTQLITPIG 109 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + + + N +G T I ++ A + + K+ IVL++DG Sbjct: 110 TIDKPALTSAISAINALGETPIAHSLTEALKDLGPTG-------------KRNIVLVSDG 156 Query: 378 ENTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 E + + K + G+ +I T+ F VN + + + A ++++A Sbjct: 157 EESC-TPDPCPAVKKLTAGGVDLQIDTVGFGVNAKARSQLQCIAD--AGKGTYYDAKDAP 213 Query: 436 ELNKIFRDRIGNEIFERVIR 455 L +++ +R +R Sbjct: 214 ALAASL-----SKLSQRALR 228 >gi|311253435|ref|XP_001924360.2| PREDICTED: collagen alpha-1(XIV) chain [Sus scrofa] Length = 1795 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +++ + G+T A+ ++ K +L+TDG Sbjct: 215 STKDEVIEAVRNLPYKGGNTLTGLALNYIFE------NSFKPEAGARTGVSKIGILITDG 268 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 269 K-SQDDVIPPS--RNLRESGVELFAIGV------KNADENELREIASEPDNTHVYNVAE 318 >gi|288956975|ref|YP_003447316.1| hypothetical protein AZL_001340 [Azospirillum sp. B510] gi|288909283|dbj|BAI70772.1| hypothetical protein AZL_001340 [Azospirillum sp. B510] Length = 196 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 14/162 (8%), Positives = 41/162 (25%), Gaps = 2/162 (1%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G + A++ P+++ V + ++ + + L+ AA+ A + Sbjct: 24 RKGSVAVEFAIVAPMIILVFIAVFELGMLEFSRNCLELAARQASRAGVT--GVLPTGYKT 81 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 R Y ++ ++ T + N + + Sbjct: 82 REDAIQALVTSLTAGYFDPGKVFVTMWVYSSFDLPAPEPWTDLNHNGTWDSGEPYTDINK 141 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + + G ++ + K Sbjct: 142 NGKWDGNMAASGAGGSGDIVLYRITASRPYLTPVWKGLAGDD 183 >gi|123414681|ref|XP_001304538.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121885997|gb|EAX91608.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 709 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 53/165 (32%), Gaps = 21/165 (12%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F D S ++ +R + A +MG T I ++ + Sbjct: 276 CRFSIIQFGDSYKETVSICDYSNRNVRNAMSAIAGINADMGGTDILSPLEYVFKK----- 330 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQ 412 K + I LLTDGE + IC+KA+ + RI I Sbjct: 331 -------KLEKGFIRKIFLLTDGEVNNSD----EICSKAQLERENNRIFAIGLGSGADPG 379 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + N ++ + +N + + + + I + I+ Sbjct: 380 LIKNVSIKN---GGNYVLIADEYNMNNMIVELMKSAISPSLTNIS 421 >gi|90425190|ref|YP_533560.1| TadE-like [Rhodopseudomonas palustris BisB18] gi|90107204|gb|ABD89241.1| TadE-like [Rhodopseudomonas palustris BisB18] Length = 214 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 47/149 (31%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F +L + +G + +++P+ML + +++ + AQT S Sbjct: 18 FERQLGRLRRDTSGLAAVEFVMIVPLMLVMFFGTIELSSGFAAHRKVSIVAQTISDLTSR 77 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + +VS+ + + F+ + + + D + R Sbjct: 78 GKSAAATDVSNFLSIADAIMTPYPAVHSADQFQTTITEVYIDPATGIGHAQWSRGDAVRN 137 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + V + + + ++ I S Sbjct: 138 AGSVVAVPADLVAKDSSNNVIAGQYLIFS 166 >gi|308472925|ref|XP_003098689.1| hypothetical protein CRE_04171 [Caenorhabditis remanei] gi|308268289|gb|EFP12242.1| hypothetical protein CRE_04171 [Caenorhabditis remanei] Length = 396 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 68/221 (30%), Gaps = 12/221 (5%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI-----RS 284 L Y+ P +D S V + L A Sbjct: 12 VSVYADTYSPLSYVDRPCGVDLSNLWLDVVAVVDNSQGMTNDGLNNVASDIFTVFSSGTR 71 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + T R+G +N ++ I + + TA + + Sbjct: 72 IGSDSSEPRTTRLGLVTYNSNATQQADL--NKYQSIDDAFYGIFDSLSTVVYTADSY-LA 128 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 T + +E K+ +++ N + + + N+ K ++I+T+A Sbjct: 129 TGL-ILAEKMFNEQSVNNVRSNYKRVVIVYASEYNEDGELDPLPVANRLKLSNVKIITVA 187 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + LS ASP F +N+ +++ ++ + Sbjct: 188 YEQPGSV--GLENGLSQIASPG-FSFSNNVVNISQEIQNAL 225 >gi|296223008|ref|XP_002757441.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Callithrix jacchus] Length = 1289 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 64/192 (33%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ + + R F Sbjct: 495 DGSQSLFGQLHNDCVYILIDTSHSMKSKLDLVKDKIIKFIQ-----EQLKYKSRFNFVKF 549 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + ++ V++ +++ D ST A++TA+ Sbjct: 550 DGQAVAWREQLAEVNEENLEQAQSWIRDMKIGSSTNTLSALKTAF--------------- 594 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 595 -ADKETQAIYLLTDGRPDQP---PEMVIDQVKVFQEIPIYTISFNYNDEIANR---FLKE 647 Query: 422 CA---SPNSFFE 430 A F Sbjct: 648 IAALTGGEFHFY 659 >gi|293604074|ref|ZP_06686485.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292817556|gb|EFF76626.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 243 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 22/63 (34%), Gaps = 4/63 (6%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T F F + + G I A+ ++L VG + ++ RW + A A Sbjct: 2 TAFFFAASR----QRGTAAIEFAVAGAMVLLVGLLGIEAARWQAVRQMAQVALMQAARAG 57 Query: 65 SVP 67 Sbjct: 58 ITA 60 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 47/469 (10%), Positives = 112/469 (23%), Gaps = 88/469 (18%) Query: 52 ALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 L A + + + + + + + E + V Sbjct: 120 QLSLAVEQSSLQTA--------QPPKLLYENNEYDVSVTSEKITVEDSAKESTEPEKITV 171 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 + ++T + + Q + + ++ L + T + Sbjct: 172 PENTKETNKNDSAPEKTEQPTATEEVTNPFAEARMAPATLRANLALPLIAPQYTTDNSGT 231 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + ++ + SY G D + Sbjct: 232 YPTANWQPTGNQNVLNHQGNKDGSAQWDGQTSWNGDPTNRTNSYIEYGGTGDQADYAIRK 291 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDALASVIRSI 285 Y + +Y E VD S + + V+ + + ++ Sbjct: 292 YARETTTPGLFDVYLNVRGNVQKEITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTL 351 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + + + MG ++ ++ + G ++ +K + G T A++ Sbjct: 352 AD-SGITNNINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKN-ITPSSTRGGTFTQKALRD 409 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICN------------ 391 A D + + N KK IVLLTDG T + Sbjct: 410 AGDMLATPN-----------GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQ 458 Query: 392 -------------------------------------KAKSQGIRIMTIAFSVNKTQQEK 414 K +GI I + + + Sbjct: 459 DQPGSTSYISGSYNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRAN 518 Query: 415 AR-----YFLSNCAS----PNSFFE-ANSTHELNKIFRDRIGNEIFERV 453 + S + ++E A+ +++ + V Sbjct: 519 LSKQQVEDKMREMVSADENGDLYYESADYAPDISDYLAKKAVQISGTVV 567 >gi|123283202|emb|CAM24859.1| complement component 2 [Homo sapiens] gi|123857989|emb|CAM25859.1| complement component 2 [Homo sapiens] gi|168983781|emb|CAQ06832.1| complement component 2 [Homo sapiens] gi|168984348|emb|CAQ08707.1| complement component 2 [Homo sapiens] gi|168984415|emb|CAQ09271.1| complement component 2 [Homo sapiens] gi|168985076|emb|CAQ07480.1| complement component 2 [Homo sapiens] gi|168985954|emb|CAQ07110.1| complement component 2 [Homo sapiens] Length = 353 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH----KLIRTIVKTFAI 329 +++ + ++ I + V + F S ++I ++ Sbjct: 141 FKESASLMVDRIFSFE---INVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYK 197 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D T A+ + Y + + E + I+LLTDG++ Sbjct: 198 DHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS-PKTA 256 Query: 390 CNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + I I I + S F T L+++ Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQV 316 Query: 441 FRDRIGNEIFERVIRIT 457 F V ++T Sbjct: 317 FEHM------LDVSKLT 327 >gi|296473248|gb|DAA15363.1| integrin alpha M [Bos taurus] Length = 1152 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 41/109 (37%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + S+ + A K ++++TDGE D E + +A Sbjct: 223 GRTHTATGIRKVVRELFHSSSGARN------HAIKIMIVITDGEKYLDPLEYSDVIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 + I I + + K+R L AS + F+ N+ L I Sbjct: 277 RKKIIRYVIGVG-DAFRSRKSRQELDTIASKPPADHVFQVNNFEALKTI 324 >gi|169344506|ref|ZP_02865475.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens C str. JGS1495] gi|169297426|gb|EDS79535.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens C str. JGS1495] Length = 929 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 37/392 (9%), Positives = 89/392 (22%), Gaps = 28/392 (7%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E S + + ++ NN + E + ++ Sbjct: 35 AEEKSDSMSVEKVLDSEMYNKFNNEILNNKSSIKYTTDNNKGTWPVNWEYGNVSNKNKIN 94 Query: 133 LSSRYDLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S L +L + + + +++D S SM Sbjct: 95 KSVYGKNQPLEYNEGYLTKKAYTTDEDNVFNINLKIRGKKFHGKKRDVVFLLDNSNSMRT 154 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 R + + + + Y +KS+ Y Sbjct: 155 NNRVGKIKTQIKRVMD-NLNNDDNTRYALVTYASTILDGRYYHLIDKSIGNDQYENYKGY 213 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVR------DALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ + +S+++ + K D + ++ Sbjct: 214 TSNQCYLNFTSNIQEIYNKIPENVPYERGHDYVGGTFTQEGLLEAKK-----LLKNSTAD 268 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + GV I + + + + + + ++ Sbjct: 269 EKIIIHLTDGVPTYSFLIEEFGGNKPTTFNYNTAYNGIGVRGLGVSYFFDSKYNKPYIFS 328 Query: 366 EAKKYIVLLTDG----ENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFL 419 Y L E + K + I+I TI + K + Sbjct: 329 REMVYSALNRSTNQREEIWNNGIATALEAENIKRENPYIKIYTIGVELTKDVSKWNDA-- 386 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 N ++ A T L + +D + Sbjct: 387 ------NKYYNAEGTLNLPE-IKDLLAKISSS 411 >gi|158259621|dbj|BAF85769.1| unnamed protein product [Homo sapiens] Length = 1105 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 45/152 (29%), Gaps = 20/152 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D ++ R+G ++ + +++ + G+T A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALN 242 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K +L+TDG+ +QD+ + + G+ + I Sbjct: 243 YIFE------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIG 293 Query: 405 FSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + L AS + Sbjct: 294 V------KNADVNELQEIASEPDSTHVYNVAE 319 >gi|170724845|ref|YP_001758871.1| outer membrane adhesin-like protein [Shewanella woodyi ATCC 51908] gi|169810192|gb|ACA84776.1| outer membrane adhesin like proteiin [Shewanella woodyi ATCC 51908] Length = 3259 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 36/380 (9%), Positives = 81/380 (21%), Gaps = 10/380 (2%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 AS +I A + + N + + + V+ Sbjct: 2647 ASTGIITDQPIDIEFAYGATDNDGDAVTFTDTNNGHFTVTLIPENYVPNALNNTYNVDQG 2706 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + + + L I + + Sbjct: 2707 HSVTGNIIEDDTGNGRDRDADGDTLTVTHINGDALVFDNNGNAHIAISGGMLSIKADGSY 2766 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 F+ + + +N D + + + Sbjct: 2767 IFTHNGSANLSNGFSYTVNDGNGDTDTADVEILVFAHQTLNPGDDSLNITGDTQDLVIGD 2826 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI-DNVNDTVRMGATFF 302 + + +D+S + L + + N + + F Sbjct: 2827 TTGIIPGQNYNIAFIIDTSGSMGSTAVNTAEQQLLTVYSQLLSYAEQNNSGVINTLLVDF 2886 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S ++ G+T DA TAYD + + Sbjct: 2887 DTDASLLISI-NLTDPNALQLLSNALATMTSGGATNYYDAFSTAYDWFQNGVPTTNNGNN 2945 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI-AFSVNKTQQEKARYFLSN 421 +TDG+ T DN A + + T+ + L N Sbjct: 2946 ITY-------FITDGQPTTDNGHPGNYYQNALNAFALLNTLSTVQAIGLGGDIDTNVLVN 2998 Query: 422 CASPNSFFEANSTHELNKIF 441 + + ++L Sbjct: 2999 FDTDGNILNNVDVNDLADSI 3018 >gi|3273275|dbj|BAA31181.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 12/184 (6%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ + E ++ + V+ A+ ++ I++++ Sbjct: 23 VNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWVKHAVPLAMKLIQQLNLN 82 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYD 348 + + + F++ K I+K+ G T ++DA+ Sbjct: 83 ENAIHLYVNDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRK 142 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + N A + +V+LTDG + K +G++I Sbjct: 143 HLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQG 193 Query: 409 KTQQ 412 Sbjct: 194 INVA 197 >gi|3273291|dbj|BAA31189.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 12/184 (6%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ + E ++ + V+ A+ ++ I++++ Sbjct: 23 VNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWVKHAVPLAMKLIQQLNLN 82 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYD 348 + + + F++ K I+K+ G T ++DA+ Sbjct: 83 ENAIHLYVNDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRK 142 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + N A + +V+LTDG + K +G++I Sbjct: 143 HLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQG 193 Query: 409 KTQQ 412 Sbjct: 194 INVA 197 >gi|3273287|dbj|BAA31187.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + A F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYANDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|257791506|ref|YP_003182112.1| hypothetical protein Elen_1758 [Eggerthella lenta DSM 2243] gi|257475403|gb|ACV55723.1| hypothetical protein Elen_1758 [Eggerthella lenta DSM 2243] Length = 213 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 50/153 (32%), Gaps = 10/153 (6%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G + +L ++LG VD ++ + L+ + + S+ Sbjct: 17 CRRERGDLNLTFLMLFFLVLGFLAFAVDASQFLTKKTLLENTLN---LAREERMAPSVTL 73 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 V+ ++ + + L + + + + R AYQVVL+ Sbjct: 74 VAKNSQEPDAVIARSVAGTLRDANYQGDIDVYFYEVPEAQLPPSKRDTE-RVYAYQVVLT 132 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 + ++F G+ +++ A + Sbjct: 133 EQVK------TVFASIFGVYDIPLKSTVVAVSN 159 >gi|209545604|ref|YP_002277833.1| TadE family protein [Gluconacetobacter diazotrophicus PAl 5] gi|209533281|gb|ACI53218.1| TadE family protein [Gluconacetobacter diazotrophicus PAl 5] Length = 163 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 35/139 (25%), Gaps = 6/139 (4%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITASVPLIQSL 72 + G + LL PV++G+ M+++V E AL+ A + Sbjct: 6 FRRDRRGVTAVEFGLLTPVIIGLLLMVLEVGWQMATEIALQHGVHDAA-----RFAMTGQ 60 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 V + + + N + + + Sbjct: 61 STVPGLDGSPTCRAQAIVWLATAEAPGILSPSNLSVMASANGGTAVGSSQSGFGGNATQT 120 Query: 133 LSSRYDLLLNPLSLFLRSM 151 + + L+ + Sbjct: 121 IVYVFTYSQPFLTPLGSMV 139 >gi|170751926|ref|YP_001758186.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170658448|gb|ACB27503.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 219 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 21/218 (9%), Positives = 59/218 (27%), Gaps = 7/218 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL-KQAAQTAIITAS 65 + +L + G + A+++P++L + + +V L + A +TA Sbjct: 7 LTRRAARLGQDRRGGAAVEFAVILPILLAIWAGMTEVGHAIDEWRKLTQLGRTVADLTAQ 66 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + +K + + + Sbjct: 67 GDTQNPISRTVMNDILAS------ATAVMRPFDTSKVKIVVSAMGIDAKNPVGPPVVCSS 120 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + S + R G++ L + + S + + I F Sbjct: 121 VANANGTARSLGSAAGLTVPDGYRMPGMRYVLAEVSISYTPMIGSALVKLAKGVSSTITF 180 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + S R + + + + + + K+ Sbjct: 181 TSSAPWPTRGGQAYGTSTYTEVVLPDDEHANCDGTKID 218 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 43/144 (29%), Gaps = 28/144 (19%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + + ++ + R+G F + +++ Sbjct: 109 SPSRLDRAKQQIKDMVDEMSG-------DRVGLVVFAGETRQSVPLTSHYEDFKQSLDAV 161 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 G + + DA+++A I D K IV+ TDGE+ + Sbjct: 162 GPHSVR-RGGSLLGDAIRSATAGFIDKTND-----------HKAIVVFTDGEDQESKPV- 208 Query: 387 IAICNKAKS----QGIRIMTIAFS 406 AK GIRI T+ Sbjct: 209 ----EAAKEAFTKNGIRIFTVGLG 228 >gi|146306837|ref|YP_001187302.1| hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] gi|145575038|gb|ABP84570.1| Hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] Length = 3184 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 37/392 (9%), Positives = 82/392 (20%), Gaps = 32/392 (8%) Query: 39 MLVDVVRWSYYEHALKQAA-QTAII--TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI 95 + D + +L A + + S ++ + + + L Sbjct: 2534 IASDGSVINSTPASLSATLLDLADLGGSVSGDVLANDSLGADGLGRIESVSVDGQTFSLN 2593 Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + L + +N ++ L + + S G Sbjct: 2594 DAGDGILVSGVGSVTWSFNAMTAELTLNTATGTLKIHLDGVNAGQFDFAAKSGLSFGPSG 2653 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 L Q+ + + I D S ++ D E P+ Sbjct: 2654 QLTQSFTYVVSDGDGDRASANLDICIRGDRSVLVVGSNADDEQGSNVLHHVPSQFDDGKG 2713 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + + + D + L++ Sbjct: 2714 AIEGTFGNDVLVGDLGGAADPVVKPAENYNIALILDRSGSMAD-DPDGSGGYGSRLALLK 2772 Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 DA+ + I + + + + ++S + G Sbjct: 2773 DAVNAFIGKLGTHTGQINIALISFSSSASLLLSGTLAQIQTALAAPN---NVLMALTASG 2829 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN------------ 383 +T AMQ A + LTDG+ T N Sbjct: 2830 ATNYEAAMQQA---------NAWFGGVEVNGYNNLAYFLTDGDPTTYNGDNSNSGSTVNF 2880 Query: 384 ---EEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + ++ + I Sbjct: 2881 NDVNRALDDATTLMAR-AEVHAIGIGTGVNSN 2911 >gi|114586712|ref|XP_001158576.1| PREDICTED: alpha 1 type VII collagen isoform 1 [Pan troglodytes] Length = 2944 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 41/150 (27%), Gaps = 22/150 (14%) Query: 290 NVNDTVRMGATFFNDRVISDPSFS---WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + F SD + +++ + G+T A+ Sbjct: 67 PFSGAASAQGVRFATVQYSDDPRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHV 126 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 D + K +L+TDG + + + K QG+++ + Sbjct: 127 ADHVFLPQLARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI- 175 Query: 407 VNKTQQEKARYFLSNCAS---PNSFFEANS 433 + L AS + FF N Sbjct: 176 -----KNADPEELKRVASQPTSDFFFFVND 200 >gi|114586714|ref|XP_516439.2| PREDICTED: alpha 1 type VII collagen isoform 2 [Pan troglodytes] Length = 2912 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 41/150 (27%), Gaps = 22/150 (14%) Query: 290 NVNDTVRMGATFFNDRVISDPSFS---WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + F SD + +++ + G+T A+ Sbjct: 67 PFSGAASAQGVRFATVQYSDDPRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHV 126 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 D + K +L+TDG + + + K QG+++ + Sbjct: 127 ADHVFLPQLARPGVP-------KVCILITDG---KSQDLVDTAAQRLKGQGVKLFAVGI- 175 Query: 407 VNKTQQEKARYFLSNCAS---PNSFFEANS 433 + L AS + FF N Sbjct: 176 -----KNADPEELKRVASQPTSDFFFFVND 200 >gi|160710|gb|AAA29770.1| thrombospondin related anonymous protein [Plasmodium falciparum] gi|33333554|gb|AAQ11893.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + A F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYANIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3142286|gb|AAC18657.1| thrombospondin related adhesive protein [Plasmodium falciparum] Length = 562 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 41/138 (29%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + A F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYANIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273257|dbj|BAA31172.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 12/184 (6%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ + E ++ + V+ A+ ++ I++++ Sbjct: 23 VNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWVKHAVPLAMKLIQQLNLN 82 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYD 348 + + + F++ K I+K+ G T ++DA+ Sbjct: 83 ENAIHLYVNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRK 142 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + N A + +V+LTDG + K +G++I Sbjct: 143 HLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQG 193 Query: 409 KTQQ 412 Sbjct: 194 INVA 197 >gi|301614659|ref|XP_002936803.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 720 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 22/174 (12%), Positives = 58/174 (33%), Gaps = 16/174 (9%) Query: 275 RDALASVIRSIKKIDNVNDT---VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + + ++ +K++ + RM ++ V+ + +F Sbjct: 77 KKFVLQMVDKLKEVKPNSGRSFSWRMALLQYSSTVVIEQTFRDWKGPENFKSRIAPIAYI 136 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 T + A+ ++ ++ K +L+TDG + N + A + Sbjct: 137 --GHGTYTSYAITNLTQIYMNEG---------THKSVKVAILITDGVDHPRNPDIFAATS 185 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCASPNSFFEANSTHELNKIFRD 443 AK I++ T+ + + L++ + F ++KI ++ Sbjct: 186 NAKHHDIKLFTVGMTSVAKETANNAKLRLLASVPATRFVFHMLDPDVVDKILKE 239 >gi|3273277|dbj|BAA31182.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273281|dbj|BAA31184.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273283|dbj|BAA31185.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 39/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|332260454|ref|XP_003279304.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Nomascus leucogenys] Length = 1160 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 409 DGSQSLFGKLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 468 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 469 AWREQLAEVNEDNLEQAQS-----WIRDMKIGSSTNTLSALKTAF--------------- 508 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 509 -ADKETQAIYLLTDGRPDQP---PEMVIDQVKVFQEIPIYTISFNYNDEIANR---FLKE 561 Query: 422 CA---SPNSFFE 430 A F Sbjct: 562 VAALTGGEFHFY 573 >gi|315608293|ref|ZP_07883283.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] gi|315250074|gb|EFU30073.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] Length = 342 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/225 (8%), Positives = 60/225 (26%), Gaps = 50/225 (22%) Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + + + + + + S++ ++G F Sbjct: 94 ICLDISNSMLAQDVAPSRLDKSKLLVESLVDRFT-------NDKIGLIVFAGDAYVQLPI 146 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + ++ + T I A+ + + + + I+ Sbjct: 147 TSDYVSA-KMFLQNIDPSLIQTQGTDIAQAINLGLHSFTQA-----------DKIGRAII 194 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-----------ARYFL-- 419 ++TDGE+ + A +A+ +G+ + + K + + Sbjct: 195 VITDGEDHEGGAVEAAA--EARKKGVNVFILGVGDTKGAPIPTGDGGYMKDRSGQTVMTA 252 Query: 420 ---SNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 C A + ++T + E+ + R+ Sbjct: 253 LNEQMCREVAQAGSGKYIHVDNTGD--------AQTELNNDLARL 289 >gi|313827551|gb|EFS65265.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2] gi|314918056|gb|EFS81887.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1] gi|314967895|gb|EFT11994.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1] gi|315080579|gb|EFT52555.1| conserved hypothetical protein [Propionibacterium acnes HL078PA1] Length = 169 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 24/60 (40%), Gaps = 1/60 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVV-RWSYYEHALKQAAQTA 60 + +F ++ ++ +L+P++L G+ VD R A AA+ A Sbjct: 27 LSRWRFPVSPQRRRDDRGVSMSVLVTVLLPILLISAGVAVDGAERSRAVRVAHTVAAEAA 86 >gi|118367749|ref|XP_001017084.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89298851|gb|EAR96839.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1085 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 56/217 (25%), Gaps = 32/217 (14%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 R ++ + ++ S Y + + ++ Sbjct: 394 REKSAFWDKKNSSQEARLSQYSQSINSLNSKYPLGKMCSIVEKKYFHYYFIQDESGSFSN 453 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 +A + IK D + F+ D L + + Sbjct: 454 DHQYAIQGVAQLFNRIKPNDYIT------YIKFDSSSHVDIP-KTLKSSLSQGDFISKIQ 506 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD----NEE 385 G T A QT I S + + + ++ +TDG++ D + Sbjct: 507 KCR-GGGTNFQSAFQTLLQQIQSKYDQQEYP---------VVIFITDGQDNTDLDSIISQ 556 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 ++C Q I TI + +L N Sbjct: 557 ITSLC-----QDIVFYTIGYG------SVNEKYLKNI 582 >gi|50842569|ref|YP_055796.1| hypothetical protein PPA1091 [Propionibacterium acnes KPA171202] gi|50840171|gb|AAT82838.1| hypothetical membrane associated protein [Propionibacterium acnes KPA171202] gi|314976099|gb|EFT20194.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1] gi|314989870|gb|EFT33961.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3] gi|315077955|gb|EFT50006.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2] gi|315095966|gb|EFT67942.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1] gi|315101403|gb|EFT73379.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1] gi|315108619|gb|EFT80595.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2] gi|327331877|gb|EGE73614.1| hypothetical membrane associated protein [Propionibacterium acnes HL096PA3] gi|327445861|gb|EGE92515.1| hypothetical protein HMPREF9571_01597 [Propionibacterium acnes HL043PA2] gi|327448157|gb|EGE94811.1| hypothetical protein HMPREF9570_00996 [Propionibacterium acnes HL043PA1] gi|327453937|gb|EGF00592.1| hypothetical protein HMPREF9586_01901 [Propionibacterium acnes HL083PA2] gi|328754611|gb|EGF68227.1| hypothetical protein HMPREF9588_02098 [Propionibacterium acnes HL025PA2] gi|328760793|gb|EGF74359.1| hypothetical membrane associated protein [Propionibacterium acnes HL099PA1] Length = 182 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 24/60 (40%), Gaps = 1/60 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVV-RWSYYEHALKQAAQTA 60 + +F ++ ++ +L+P++L G+ VD R A AA+ A Sbjct: 40 LSRWRFPVSPQRRRDDRGVSMSVLVTVLLPILLISAGVAVDGAERSRAVRVAHTVAAEAA 99 >gi|46395320|dbj|BAD16597.1| DEC-1 [Lymnaea stagnalis] Length = 919 Score = 41.9 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 50/298 (16%), Positives = 95/298 (31%), Gaps = 22/298 (7%) Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + T + H ++ D ++++++ R + ++ Sbjct: 330 LQVFTPGSYGARANENHVAIVLTDGLSTDRNKTLIEAARLRNANVTILGIRIGQYQIQEL 389 Query: 216 SSQNGKVGIRD-----EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 G V + + K++ + S + FV SS Sbjct: 390 IDITGDVKKVFSVEGFDAIHNLTYDITKAICKQVSNINCTQSYGDIIFVMDSSSSITYPN 449 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 +A+V R+ D VR GA F V + + + Sbjct: 450 YVKQLSFVANVTRNFLI---GKDDVRYGALIFGSNVEKLFDLKKYDSPVD--VEQHIMEA 504 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 ST A+Q D +A K I++LTDGE+T + A Sbjct: 505 TYLASSTDTAAALQYILD-----QRMFADEQGGRPDAVKIIIVLTDGESTYPE-KTRAEA 558 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDRIGN 447 K +S G +M+I + E L+ AS ++ F+A S L+++ ++ + Sbjct: 559 TKLQSLGYHMMSIGIGNEINELE-----LNALASNTSNIFKAASYQVLDQLHKEVVTR 611 >gi|297666856|ref|XP_002811720.1| PREDICTED: von Willebrand factor A domain-containing protein 3B-like [Pongo abelii] Length = 968 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 ++ ++ + + K LV+D + I+ K + + V+ Sbjct: 355 DGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAV 414 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 R + + + D ST A++TA+ Sbjct: 415 AWREQLAEVNEDSLEQAQS-----WIRDMKIGSSTNTLSALKTAF--------------- 454 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 + + I LLTDG Q + ++ K Q I I TI+F+ N + FL Sbjct: 455 -ADKETQAIYLLTDGRPDQP---PETVIDQVKLFQEIPIYTISFNYNDEIANR---FLKE 507 Query: 422 CA---SPNSFFE 430 A F Sbjct: 508 VAALTGGEFHFY 519 >gi|227547429|ref|ZP_03977478.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212076|gb|EEI79972.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 362 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 111/378 (29%), Gaps = 33/378 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + TF + + + + ++ + V + Sbjct: 15 DGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGS 74 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++ + +++ S S + S S I + I S+++ Sbjct: 75 LEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFGIAKSKAV 129 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 D G V + + + + + + ++ Sbjct: 130 KLGWADDTGATKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPSWTIWVVD 189 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+S E + + + I+ + V F Sbjct: 190 YSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIPFETEAHR 239 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + A + G T I + + +A D + S +E + Sbjct: 240 PVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT------- 288 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL+TDG + D+++ K++ + + I +I F Q K+ L S Sbjct: 289 -AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKV 343 Query: 429 FEANSTHELNKIFRDRIG 446 F+ S +L +FR G Sbjct: 344 FDGRSG-DLAAVFRQVKG 360 >gi|29826729|ref|NP_821363.1| hypothetical protein SAV_189 [Streptomyces avermitilis MA-4680] gi|29603825|dbj|BAC67898.1| putative membrane protein [Streptomyces avermitilis MA-4680] Length = 142 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 46/129 (35%), Gaps = 11/129 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA--IITASVPLIQ 70 + G I A++ P +L ++ V W Y + A A +TA+ Sbjct: 15 RRWGDDRGDTSIQMAIIFPFVLIATVAVIQVSMWYYAR---QIALTAAREGLTAARAYES 71 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + +++ + + + L ++ + + VR V TA+ M P Q Sbjct: 72 GPADGAAQ---AREVLGRVAGDSLRGY---SVSASSNGQRVRVQVSGTAMSMIPGVPGLQ 125 Query: 131 VVLSSRYDL 139 V S+ + Sbjct: 126 VTQSASGPV 134 >gi|83859352|ref|ZP_00952873.1| hypothetical protein OA2633_13145 [Oceanicaulis alexandrii HTCC2633] gi|83852799|gb|EAP90652.1| hypothetical protein OA2633_13145 [Oceanicaulis alexandrii HTCC2633] Length = 178 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 8/178 (4%), Positives = 43/178 (24%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M + + ++ G + AL+ + ++V + ++ A Sbjct: 1 MSASARIRDRITRFHRAREGAAAVEFALIAAPFFLLIFATLEVALFFLGSTIIENGVNEA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + +Q + + + + + + +F+ + + Sbjct: 61 ARSIRTGQLQQSGQTVEDFRGAICERISSVADCSRIQLDVRTFDSFSSSSMTSPLNADGE 120 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + + + + + + + + + + Sbjct: 121 IDDTNFTFDPGGGGDVVVVRVFYDWQLMAPGVVSGMTNMSGNKRLISATAVFRNEPFE 178 >gi|310114429|ref|XP_003119946.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 481 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 34/99 (34%), Gaps = 10/99 (10%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + +Y++++TDG+++ ++ G+ I I +E L Sbjct: 6 ADTGRINVARYLIVITDGKSSDS---VAEAAEGLRANGVNIYAIGI------REANIDEL 56 Query: 420 SNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 A + F L I ++ + + + +K Sbjct: 57 KEIA-KDKIFFVYEFDLLKDIQKEVVQDICSSEACKNSK 94 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 12/126 (9%), Positives = 33/126 (26%), Gaps = 15/126 (11%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + + T A+ ++Y++++TDG Sbjct: 124 DIYRAIFDVQQMRDGTRTGKALNFTL-------PFFDSSKGGRPSVQQYLIVITDGVAQD 176 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + + + I I + K L ++ + + L + Sbjct: 177 NVIIP---AKALRDKNTIIFAIGVG-----EAKKSQLLEITNDEDNVYHDVNFEALQNLE 228 Query: 442 RDRIGN 447 ++ + Sbjct: 229 KEILSK 234 >gi|297559736|ref|YP_003678710.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296844184|gb|ADH66204.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 547 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 45/131 (34%), Gaps = 16/131 (12%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + G TA DA+ AY+ + S ++ + +VL+TDGE + Sbjct: 429 ADLSAAVEGLEAEGDTAAYDALVRAYELLESDT-------GSDGDPLMSVVLMTDGEVNR 481 Query: 382 DNE----EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + T+ F + + L + F+A +L Sbjct: 482 GVGLEGFRESLAARSEPVARVPVFTVLFGESDVPEMTELAEL----TGGRVFDARE-QDL 536 Query: 438 NKIFRDRIGNE 448 ++FR+ G + Sbjct: 537 EQVFREIRGYQ 547 >gi|256028718|ref|ZP_05442552.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289766627|ref|ZP_06526005.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289718182|gb|EFD82194.1| von Willebrand factor type A [Fusobacterium sp. D11] Length = 218 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 64/183 (34%), Gaps = 20/183 (10%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + + + A+ ++ ++K+ +++ + + F + +H + Sbjct: 18 DTSSSMREWMRELNTAIRDMLGTLKEQESLKAEIHISFITFGNGG-------ANLHTALT 70 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + D E G T + A++ A + + E + + I+LL+DG Sbjct: 71 PVSNIEFNDFTEGGMTPLGGALRIAKEMV------ENREIIPSKSYAPIILLLSDGAPND 124 Query: 382 D--NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + E N +S+ M++ + L +S +EA + + Sbjct: 125 NGWENEMYRFINDGRSKKCMRMSLGIGRDYDY-----DVLKGFSSNGEVYEAKDSMNIID 179 Query: 440 IFR 442 F+ Sbjct: 180 FFK 182 >gi|297606054|ref|NP_001057930.2| Os06g0578100 [Oryza sativa Japonica Group] gi|255677166|dbj|BAF19844.2| Os06g0578100 [Oryza sativa Japonica Group] Length = 622 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 56/170 (32%), Gaps = 20/170 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ +IR + D R+ FND V S V R+I Sbjct: 97 TRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 150 Query: 328 AIDENEMGSTAIN--DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G + +Q A + + +R+ +I+LLTDG++T Sbjct: 151 IDRLQARGGSGSALMPELQEAVKILDERQGNSRNRVG-------FILLLTDGDDTTGFRW 203 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + A + + T A + S ++ + + Sbjct: 204 SRDVIHGAVGK-YPVHTFALGAAHDPEALLHIAQE---SRGTYSFVDDGN 249 >gi|256084540|ref|XP_002578486.1| hypothetical protein [Schistosoma mansoni] gi|238663861|emb|CAZ34724.1| loss of heterozygosity 11 chromosomal region 2 gene a protein homolog (mast cell surface antigen 1) (masa-1), putative [Schistosoma mansoni] Length = 652 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 37/147 (25%), Gaps = 17/147 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISS 353 R F + + + D N G T +A++ A Sbjct: 330 CRFQIIGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGGTEAYNALKAAL------ 383 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 H + K I+ LTDG+ +E + + R+ TI + Sbjct: 384 -----HSTPSGEGWFKQIIFLTDGDVGNADEVIGLV--RMNVDKARVFTIGLGQGVSTAL 436 Query: 414 KARYFLSNCASPNSFFEANSTHELNKI 440 A + +L Sbjct: 437 IGGV---ARAGNGTAEFVRDPSQLQSA 460 >gi|256084538|ref|XP_002578485.1| hypothetical protein [Schistosoma mansoni] gi|238663860|emb|CAZ34723.1| loss of heterozygosity 11 chromosomal region 2 gene a protein homolog (mast cell surface antigen 1) (masa-1), putative [Schistosoma mansoni] Length = 828 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 37/147 (25%), Gaps = 17/147 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISS 353 R F + + + D N G T +A++ A Sbjct: 330 CRFQIIGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGGTEAYNALKAAL------ 383 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 H + K I+ LTDG+ +E + + R+ TI + Sbjct: 384 -----HSTPSGEGWFKQIIFLTDGDVGNADEVIGLV--RMNVDKARVFTIGLGQGVSTAL 436 Query: 414 KARYFLSNCASPNSFFEANSTHELNKI 440 A + +L Sbjct: 437 IGGV---ARAGNGTAEFVRDPSQLQSA 460 >gi|256084536|ref|XP_002578484.1| vacuolar protein sorting 26 vps26 [Schistosoma mansoni] gi|238663859|emb|CAZ34722.1| vacuolar protein sorting 26, vps26, putative [Schistosoma mansoni] Length = 1295 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 37/147 (25%), Gaps = 17/147 (11%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISS 353 R F + + + D N G T +A++ A Sbjct: 330 CRFQIIGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGGTEAYNALKAAL------ 383 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 H + K I+ LTDG+ +E + + R+ TI + Sbjct: 384 -----HSTPSGEGWFKQIIFLTDGDVGNADEVIGLV--RMNVDKARVFTIGLGQGVSTAL 436 Query: 414 KARYFLSNCASPNSFFEANSTHELNKI 440 A + +L Sbjct: 437 IGGV---ARAGNGTAEFVRDPSQLQSA 460 >gi|254381319|ref|ZP_04996684.1| hypothetical protein SSAG_00986 [Streptomyces sp. Mg1] gi|194340229|gb|EDX21195.1| hypothetical protein SSAG_00986 [Streptomyces sp. Mg1] Length = 148 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 45/153 (29%), Gaps = 23/153 (15%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + ++ G + TA++ +LG+ G+ +D A + A A + Sbjct: 9 RSRLREDRGGIAVYTAIVTIALLGIIGLALDGAGKLR-------ATERADAVAMEAARAA 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + + E + +V ++ + V Sbjct: 62 GQAIDPTQAVTGNGLRADPEAAAAVARAYLARAGTQG----------SVSLSADGTQLTV 111 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 +S Y + FL GI S + A Sbjct: 112 TVSDTYA------TKFLSVAGINSMGVSGHGSA 138 >gi|54290564|dbj|BAD61973.1| zinc finger-like [Oryza sativa Japonica Group] gi|54291279|dbj|BAD62048.1| zinc finger-like [Oryza sativa Japonica Group] Length = 598 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 56/170 (32%), Gaps = 20/170 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ +IR + D R+ FND V S V R+I Sbjct: 73 TRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 126 Query: 328 AIDENEMGSTAIN--DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 G + +Q A + + +R+ +I+LLTDG++T Sbjct: 127 IDRLQARGGSGSALMPELQEAVKILDERQGNSRNRVG-------FILLLTDGDDTTGFRW 179 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + A + + T A + S ++ + + Sbjct: 180 SRDVIHGAVGK-YPVHTFALGAAHDPEALLHIAQE---SRGTYSFVDDGN 225 >gi|3273299|dbj|BAA31193.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYVNDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|326672458|ref|XP_695815.4| PREDICTED: von Willebrand factor A domain-containing protein 2 [Danio rerio] Length = 795 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 45/151 (29%), Gaps = 16/151 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D ++G + R ++ + +++ G ++ A+ Sbjct: 562 FDINRDVAQVGLVVYGRRPVTVFDL--DKYNSGSAVLRAVGDAAFLGGKASVGSALLHVL 619 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ + + K +V+LTDG +D K + G+ I I Sbjct: 620 SQSLTVGKGARPGVN------KAVVVLTDGTGVEDAAVP---AQKIRDSGVSIFLIGVG- 669 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + L S + S +L Sbjct: 670 ----DIQQELLLRISGSEDHMITVPSYDDLK 696 >gi|319785613|ref|YP_004145089.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171501|gb|ADV15039.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 193 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 6/77 (7%), Positives = 20/77 (25%), Gaps = 4/77 (5%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + ++ G I A+L + +++ + + + Sbjct: 14 RCPRFFARFLRDRRGATAIEFAILSVPFALLVFAILESCISLAAQEVMANITDD----VA 69 Query: 66 VPLIQSLEEVSSRAKNS 82 L A + Sbjct: 70 RKLRTGQLRAVDVAGTN 86 >gi|272938026|gb|ACZ96962.1| vwa2 protein [Danio rerio] Length = 260 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 45/151 (29%), Gaps = 16/151 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D D ++G + R ++ + +++ G ++ A+ Sbjct: 35 FDINRDVAQVGLVVYGRRPVTVFDL--DKYNSGSAVLRAVGDAAFLGGKASVGSALLHVL 92 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 ++ + + K +V+LTDG +D K + G+ I I Sbjct: 93 SQSLTVGKGARPGVN------KAVVVLTDGTGVEDAAVP---AQKIRDSGVSIFLIGVG- 142 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELN 438 + L S + S +L Sbjct: 143 ----DIQQELLLRISGSEDHMITVPSYDDLK 169 >gi|149375210|ref|ZP_01892982.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] gi|149360574|gb|EDM49026.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] Length = 658 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 66/223 (29%), Gaps = 36/223 (16%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 P D + + + ++ VR + Sbjct: 29 PDSADVRIIVDISGSMKTNDPNNLRRPAVRLLARMLPGQANAGVWTFGQYVNMLVPHGKV 88 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + I ++ T + +A+Q A D + ++ + Sbjct: 89 TDDWRGLAVERSDEINSVALR----------TNLGEAIQVASDDYLLGSDSLDNTDF--- 135 Query: 366 EAKKYIVLLTDG-----ENTQDNEE-----GIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +LLTDG +N N+ A+ ++ S+G + T+A +E Sbjct: 136 ------ILLTDGKVDISDNESANDRERERILGALLDELSSRGATLHTVAL-----SEEAD 184 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 L + A + + A+S L F + + + ++ I I Sbjct: 185 LALLKSLAERTGGRYALASSADALTLAFLEALNTAVPQQQIPI 227 >gi|160874992|ref|YP_001554308.1| cell wall anchor domain-containing protein [Shewanella baltica OS195] gi|160860514|gb|ABX49048.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS195] gi|315267224|gb|ADT94077.1| Vault protein inter-alpha-trypsin domain-containing protein [Shewanella baltica OS678] Length = 771 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 54/180 (30%), Gaps = 20/180 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A +++ ++K + FN + + S + + F Sbjct: 406 DSIVQAKNALLYALKGLKPE---DSFNIIEFNSSLSQFSATSLPATSSNLSRARQFVSRL 462 Query: 332 NEMGSTAINDAMQTAY-DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + A+ A ++ S++ D V + ++ +TDG + I Sbjct: 463 QADGGTEMALALDAALPKSLGSASPDAVQ-------PLRQVIFMTDGSVGNEQALFDLIR 515 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + R+ T+ F+ A +F E+++ + Sbjct: 516 YQIGES--RLFTVGIGSAPNSH-----FMQRAAELGRGTFTYIGKVDEVDEKISALLSKI 568 >gi|29830947|ref|NP_825581.1| hypothetical protein SAV_4404 [Streptomyces avermitilis MA-4680] gi|29608060|dbj|BAC72116.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 582 Score = 41.9 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 45/145 (31%), Gaps = 16/145 (11%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 FF+ + + H+ + T G T+ + A++ Sbjct: 440 HVVFFSTELDGTGELTLTEHENKIDELHTALGRM---GRTSYHAAVEAVL---------A 487 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT-IAFSVNKTQQEKAR 416 H + +A +V TDG A+ + AK+ + +AF + + Sbjct: 488 HHEKSADPKAPALVVFQTDGAPDAKTPATQALTDAAKNHPTVFFSFVAFGEHDNKAFDYL 547 Query: 417 YFLSNCASPNSFFEA-NSTHELNKI 440 L +FF A + EL Sbjct: 548 RKLKT--GNTAFFHAGPTPRELTDA 570 >gi|33333556|gb|AAQ11894.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|255009406|ref|ZP_05281532.1| hypothetical protein Bfra3_09717 [Bacteroides fragilis 3_1_12] gi|313147165|ref|ZP_07809358.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135932|gb|EFR53292.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 341 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 22/220 (10%), Positives = 56/220 (25%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISKLVD-------GMENDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ TAI A+ A + + IV++T Sbjct: 150 YISA-KMFLESINPSLISKQGTAIGAAISLAARSFTPQ-----------EGVGRAIVVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A +A +GI++ + + Sbjct: 198 DGENHEGGAVEAAK--EAAKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREGNVIVTRLN 255 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A + ++++ K I V Sbjct: 256 EAMCQEIAKVGNGIYVRVDNSNSAQKAINQEINKMAKSDV 295 >gi|297567411|ref|YP_003686383.1| hypothetical protein Mesil_3037 [Meiothermus silvanus DSM 9946] gi|296851860|gb|ADH64875.1| conserved hypothetical protein [Meiothermus silvanus DSM 9946] Length = 351 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 22/204 (10%), Positives = 57/204 (27%), Gaps = 47/204 (23%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 T ++G F+D + + R +++ + + +T++ A+ T + Sbjct: 125 TTQIGLITFSDSASVVVAPT-----TDRAVLQEALDNVKPVQNTSLPSAIVTGVRLLPGR 179 Query: 354 NEDEVH----------------------RMKNNLEAKKYIVLLTDGENTQDN-------E 384 E + +++++DG ++ Sbjct: 180 KEVQPPKELQPQNPQNPQPQNPLVQPDTPPIPREFPPGSLLVISDGATNVNSNPRLPNQT 239 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEK----------ARYFLSNCA--SPNSFFEAN 432 A A+ G++I A + L A + + Sbjct: 240 ALEAAAKFAQDNGVKIYAFAVGKEGGAVVRIEGQDYFVPFEPRSLQQLAERTGGKYVYPP 299 Query: 433 STHELNKIFRDRIGNEIFERVIRI 456 + L ++R+ G I ++ Sbjct: 300 TEEALRAVYREL-GTVIRWEATKL 322 >gi|218507575|ref|ZP_03505453.1| hypothetical protein RetlB5_08145 [Rhizobium etli Brasil 5] Length = 161 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 20/151 (13%), Positives = 43/151 (28%), Gaps = 2/151 (1%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ +++LI+ G I A+L PV++ + ++ +AA + Sbjct: 8 TRLALTARRLIRERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKRATRAAGSIADLI 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + ++ + L + V T P Sbjct: 68 TQQQSITKSTLTEMRSVATAIFVPYNSTSLTLKITGITVDASANPNVLWSWAQTGARPMP 127 Query: 125 RKSAYQVVLSSRYD--LLLNPLSLFLRSMGI 153 R + + P S FL + + Sbjct: 128 RHRRQHIPADMKTPTASWSAPNSAFLYDVPV 158 >gi|149412375|ref|XP_001507696.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 691 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 30/316 (9%), Positives = 67/316 (21%), Gaps = 16/316 (5%) Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 EY T + V + + + L P S+ Sbjct: 143 EYSSSKGSTVKAAETTRVYQKATVVSPVTQPTTLAQETTPISTEPTSSTLKPAPPTAPSI 202 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVS-IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 I +R+ S + + + A Sbjct: 203 TIGKTTTPKLQAVIHRTRNIDSWTSNSAWRTTGQYRPHKSPSLIHRPEAGASFQKPIAVS 262 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + N + G + +D S ++ Sbjct: 263 VHLGETDAWKPELGPFQAGFGPKDELNTRSLEPVSQGDPKCKIDLSFLIDGSWSIG-KRR 321 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + + ++ +D +G + D ++ + + Sbjct: 322 FQIQKQ---FLKTMVQALDVGPAGPLLGIVQYGDDPTTEFNLKTHTNSRDLKAAIEKIT- 377 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G + + A+ S N V++ DG + Sbjct: 378 -QKGGLSNVGRALSFVNKNFFSDANGNRGGAAN------VAVVMVDG---WPTDRVEESS 427 Query: 391 NKAKSQGIRIMTIAFS 406 A+ GI I I Sbjct: 428 RLARESGINIFFITIE 443 >gi|68535931|ref|YP_250636.1| putative secreted protein [Corynebacterium jeikeium K411] gi|68263530|emb|CAI37018.1| putative secreted protein [Corynebacterium jeikeium K411] Length = 550 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 47/404 (11%), Positives = 96/404 (23%), Gaps = 47/404 (11%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + ++R +F + + + + R + Sbjct: 161 ADIADAAARGDLTFGMTDPQESNSGFLALLSVAAEFSDFPQGVAFDSSQGTMDERRMEDF 220 Query: 130 QVVLSSRYDLLLNPLSLFLR----SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + F++ + GI ++ ++ E + D+ Sbjct: 221 FSGQTITSGSSGWLRETFVKNPGKADGIFNYQAVLQSMKENDGVDIEIFVPGNAAGAADY 280 Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S L D + A S Y Y Sbjct: 281 PISALRNPGDQKADEDAQAKVEALADWFREHSVEVGEKTHLNVWGED-AENKGIQRYYEY 339 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIK-------------KKHLVRDALASVIRSIKKIDNVN 292 P D LS R + ++ L +I N Sbjct: 340 PKSEDEILSLVSLYRDKLRRPADSNFLLDTSGSMRGNRLGDLKSILNKLIDGTAGEGNNP 399 Query: 293 D----TVRMGATFFNDRV---ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 + F+ +V + ++ + + ++ + G TAI DA+ Sbjct: 400 KGFARRETITLMPFSSKVADGYTQENYDPD-NAQQKQDLRGYVNGLQPRGETAIYDAVLR 458 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK-----SQGIRI 400 AYD + IVL+TDG + I + + I + Sbjct: 459 AYDRVGEGGGSLN-----------SIVLMTDGASNSGTSRKDFITKMTRMMDETNHKIPV 507 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + + + + F +L K F + Sbjct: 508 FVILYG----EASEEEMRFLANFTGGKVFNVR-GGDLAKAFEEI 546 >gi|23010405|ref|ZP_00051105.1| COG1109: Phosphomannomutase [Magnetospirillum magnetotacticum MS-1] Length = 134 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 29/103 (28%), Gaps = 20/103 (19%) Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV------------------NKTQQEKA 415 + + +Q + C K+ G I + Sbjct: 34 YAEWDGSQPAQIDPNKCTDLKNAGATISVLYIPYKLVKNYTNTSYIVWENDRVNGFSPTL 93 Query: 416 RYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 L CASP F+ AN+ ++ +V RIT+ Sbjct: 94 AEPLRKCASPGFFYTANTAADITASLGAMFDQ--ALKVPRITR 134 >gi|319782170|ref|YP_004141646.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168058|gb|ADV11596.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 179 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 12/142 (8%), Positives = 35/142 (24%), Gaps = 1/142 (0%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + + + G + AL+ P+ML + + + + + ++ Sbjct: 1 MMIATLGHHLNRFQRDGRGTALVEMALIAPLMLMLSAGVFEFGNLIHDKLLMEAGLTDGA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 A+ + + A I + + + T Sbjct: 61 RFAARCNSKLYTDAG-LAAIDCAGVATNIAVFGKTAVGVSGVPDVPRISGWRTADVTVTM 119 Query: 122 MNPRKSAYQVVLSSRYDLLLNP 143 N + A ++ Sbjct: 120 NNTCQEAVVAGVTKYRSTTAQV 141 >gi|261252915|ref|ZP_05945488.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] gi|260936306|gb|EEX92295.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] Length = 3332 Score = 41.9 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 29/309 (9%), Positives = 76/309 (24%), Gaps = 14/309 (4%) Query: 108 DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 D V V++ + Y + L+ L + + Sbjct: 2605 DESVTLSGVPDGVQLVIDGVVQTPNSNGTYTVTLDVAENELTGRVSAELVAPHDYDGSLD 2664 Query: 168 SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDE 227 + ++ + S + + + Sbjct: 2665 FDLSVSATSEEKVGADTETTTVSEPVTLSGSDLVVGNNVDNTLEGHGGNDILIGDQGGYK 2724 Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 V+ N +L D + + V + + + ++++AL +++ S+ Sbjct: 2725 TNVTPGVNYNIALVVDASGSMGDYVYNTDGTVMRNPDGSAMTRMDMMQEALTNLVESLVT 2784 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI--RTIVKTFAIDENEMGSTAINDAMQT 345 D ++ + F+D + + ++ + G T + Sbjct: 2785 HD---GSINIKLIGFDDNIDVTFEALDITNSSDVVAELLSKIENNLPVGGGTDYGVGFEE 2841 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGIAICNKAKSQGIRIMTI 403 A N ++ + LTDGE N + ++M + Sbjct: 2842 A-------NNWYASSSISSNGYENMTFFLTDGEPNSGTLNNGLTEYNELVSTHNAKVMAV 2894 Query: 404 AFSVNKTQQ 412 + Sbjct: 2895 GMGNDIDDS 2903 >gi|3273265|dbj|BAA31176.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NESAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273301|dbj|BAA31194.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NESAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273303|dbj|BAA31195.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NESAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|149188838|ref|ZP_01867128.1| hypothetical protein VSAK1_05795 [Vibrio shilonii AK1] gi|148837258|gb|EDL54205.1| hypothetical protein VSAK1_05795 [Vibrio shilonii AK1] Length = 174 Score = 41.9 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 30/114 (26%), Gaps = 7/114 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL--- 72 K+ G F I L++ V++G+ D+ L A + L + Sbjct: 6 KNQRGTFIIELTLVVIVLIGIFLFTTDISHKLLVRAQL----DRASFALANILKERARFY 61 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + L + K + T+V+ Sbjct: 62 NDRLVLSDGDRDDMHALASRMLDTAEADIAIKIEALHNTATVDSYTSVQYGAMG 115 >gi|303242740|ref|ZP_07329210.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302589715|gb|EFL59493.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 429 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 55/186 (29%), Gaps = 26/186 (13%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 +A+ ++ I +D D+ F++ + + D G Sbjct: 64 NAIKTLHTCIDLLD---DSCHFCILTFSESTTTIFPME-KSSIEAKEKAHILVADIKVYG 119 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T ++ + TA + K +TDG N+Q + + + K Sbjct: 120 GTKLSAPLNTASLVFA-----------GYDDYLKIAYFVTDGNNSQSDINNLNL--ALKR 166 Query: 396 QGIRIMTIAFSVN--KTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRDRIGNEIFE 451 + F ++ + L A + + E+ F+ + + Sbjct: 167 NKGK-----FHLSCWGLGTDWNPKELKRIADNFAGTAEAITAPDEMASKFQKSLSELFSK 221 Query: 452 RVIRIT 457 +++ T Sbjct: 222 SILKAT 227 >gi|295840348|ref|ZP_06827281.1| secreted protein [Streptomyces sp. SPB74] gi|295827934|gb|EFG65721.1| secreted protein [Streptomyces sp. SPB74] Length = 418 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 13/119 (10%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 RT K + G T I A+ A + + EA K IVL+TDGE+ Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAAEDL------------RGGEAAKRIVLITDGED 154 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELN 438 T + + + ++G+ ++ + + + + A+ ++ T +L+ Sbjct: 155 TC-RRDPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTKDLS 212 >gi|153008053|ref|YP_001369268.1| hypothetical protein Oant_0717 [Ochrobactrum anthropi ATCC 49188] gi|151559941|gb|ABS13439.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 182 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 41/144 (28%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +K + G + AL+ PV+L + VD+ + ++A + + Sbjct: 2 RTCLRKFLNDRRGLGAVEFALIAPVLLLIYLGSVDLADGVDTNKKVSRSASSLADLVARQ 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L + +++ S D + + Sbjct: 62 LSVTKNDLNDMFNISRASLLPYGRSTPKIRITAIRIDGTARASNLTPEVDWSYANIADFA 121 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSM 151 A + + LL+ S F++ Sbjct: 122 AKKGDVGDIPSSLLDEGSYFIKVD 145 >gi|83718955|ref|YP_442973.1| hypothetical protein BTH_I2452 [Burkholderia thailandensis E264] gi|167620114|ref|ZP_02388745.1| hypothetical protein BthaB_27657 [Burkholderia thailandensis Bt4] gi|257139197|ref|ZP_05587459.1| hypothetical protein BthaA_08316 [Burkholderia thailandensis E264] gi|83652780|gb|ABC36843.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 142 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 14/134 (10%), Positives = 37/134 (27%), Gaps = 1/134 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L++ G + L P M+ V +VD + + A++ A V + Sbjct: 7 LVRDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGVVVRVP-QL 65 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + ++ + + ++ T + Sbjct: 66 AAADIENIALSYAQGSLVSGGTVGAPVVYVDQSAGTSPGSALKVTVSYTYQGLVLGSALS 125 Query: 134 SSRYDLLLNPLSLF 147 S + L +++ Sbjct: 126 SLTGPITLTAVTVM 139 >gi|23465165|ref|NP_695768.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] gi|23325787|gb|AAN24404.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] Length = 383 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 111/378 (29%), Gaps = 33/378 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + TF + + + + ++ + V + Sbjct: 36 DGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGS 95 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++ + +++ S S + S S I + I S+++ Sbjct: 96 LEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFGIAKSKAV 150 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 D G V + + + + + + ++ Sbjct: 151 KLGWADDTGAAKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPSWTIWVVD 210 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+S E + + + I+ + V F Sbjct: 211 YSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIPFETEAHR 260 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + A + G T I + + +A D + S +E + Sbjct: 261 PVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT------- 309 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL+TDG + D+++ K++ + + I +I F Q K+ L S Sbjct: 310 -AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKV 364 Query: 429 FEANSTHELNKIFRDRIG 446 F+ S +L +FR G Sbjct: 365 FDGRSG-DLAAVFRQAKG 381 >gi|312068806|ref|XP_003137386.1| CUTiclin-Like family member [Loa loa] gi|307767445|gb|EFO26679.1| CUTiclin-Like family member [Loa loa] Length = 691 Score = 41.5 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 41/120 (34%), Gaps = 8/120 (6%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++ + F+ ++ ++ G+T+ + A+ AY ++ ++ Sbjct: 6 LITYSGQAYVHFKFNDPQIGNNTAVIGHLNALKSIKGTTSTHIALHQAYKLLMDTDSGN- 64 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 KK I++ TDG + + + K++ + I I ++ Sbjct: 65 ---GAREGVKKMIIIFTDG---HSQQSPQDMALRLKNESVEIFAITL-TPAPYADEGELL 117 >gi|126727881|ref|ZP_01743709.1| hypothetical protein RB2150_00472 [Rhodobacterales bacterium HTCC2150] gi|126702822|gb|EBA01927.1| hypothetical protein RB2150_00472 [Rhodobacterales bacterium HTCC2150] Length = 176 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 49/170 (28%), Gaps = 5/170 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI----I 62 + + K +G+ + + MP+++ + V+ + L++A + + Sbjct: 6 LKRFLTRTAKDESGNATLEFVMTMPLVITLMFSTVESGILLVQQMMLERALDVNVRQLRL 65 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A++ + + + E + + + T V Sbjct: 66 GANMTQSELAASICDDIAVISNCDQSMTLELTVLSKTTWNVPATSAVCYNRQEDITPVTS 125 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 ++L R ++++PL L + S + Sbjct: 126 YTGGIDSDLML-VRACVIVDPLFPLLGLGVAMTETNNGDFTISARSAFVN 174 >gi|160725|gb|AAA29777.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + A F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYANIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 48/142 (33%), Gaps = 20/142 (14%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + V+ T G ++D V + + K AI T + A+ Sbjct: 11 YQNQVSITWMFGGLHYSDVVEIYSPLT---RSKDTYLTKLRAIR-YLGRGTFTDCAISNM 66 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI-CNKAKSQGIRIMTIAF 405 S +V K+ V++TDG T G+ + +A+ GI++ +A Sbjct: 67 TQQFQSQTARDV----------KFAVVITDGHVTGSPCGGMKMQAERARDMGIKLFAVA- 115 Query: 406 SVNKTQQEKARYFLSNCASPNS 427 ++ L ASP Sbjct: 116 ----PSEDVYEQGLREIASPPH 133 >gi|51467747|ref|NP_001003823.1| cochlin [Danio rerio] gi|26788036|emb|CAD58748.1| novel protein similar to human coagulation factor C homolog (cochlin, COCH) [Danio rerio] Length = 553 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 45/130 (34%), Gaps = 22/130 (16%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L ++ G TA DA+ A ++ + +K+++++TDG+ Sbjct: 431 LKDNALRALQKIPYMSGGTATGDAINFAVRSLFKP---------RSSSNRKFLIIITDGQ 481 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTH 435 + D+ A+ A+ +GI + + L AS F Sbjct: 482 -SYDDVRVPAM--AAQREGITVYAVGV------AWAPMEDLKAMASEPKESHVFFTREFT 532 Query: 436 ELNKIFRDRI 445 L + F+ I Sbjct: 533 GLGQ-FQQPI 541 Score = 40.0 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 26/248 (10%), Positives = 63/248 (25%), Gaps = 18/248 (7%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S Q S Q F ++ S + Sbjct: 102 SSYAHGVQSQSLSQWSASFTVARTISLPLEVSSQTSSSAIVASGAAKKPVKKIVKKPPPA 161 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 D + +DSS ++ +L ++ ++ + +K ++ Sbjct: 162 TAHKDCPVDMALLLDSSYNIGQ-RRFNLQKNFVSKLATMLKVGTQGPHVG---VVQTSET 217 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ + T G+T A+ + + Sbjct: 218 PRTEFYLTNYTTAKDVTFAIKEIPYI--GGNTNTGKAI------LHTVRNFFSPDFGVRR 269 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---KARYFLSNC 422 + IV+ DG ++ A+ GI I ++ + ++ + F+ Sbjct: 270 GYPRVIVVFVDG---WPSDNVEEAAILARESGINIFFVSVAKPSPEEASLVSDQDFMRKA 326 Query: 423 ASPNSFFE 430 ++ F Sbjct: 327 VCKDNEFF 334 >gi|27370456|ref|NP_766532.1| integrin alpha-E isoform 2 [Mus musculus] gi|26334103|dbj|BAC30769.1| unnamed protein product [Mus musculus] gi|56206399|emb|CAI24787.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|163848376|ref|YP_001636420.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526300|ref|YP_002570771.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669665|gb|ABY36031.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450179|gb|ACM54445.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 446 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 39/106 (36%), Gaps = 14/106 (13%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENE---MGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + G+ R + +T + +Q A +++++ V R Sbjct: 125 ASDAIVLASGIPGSRRAELVAAIARLPALRLGETTNLAQGLQLALAQFVAADDATVRR-- 182 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 IVL+TDG T D + + +A ++GI + TI + Sbjct: 183 --------IVLITDG-FTTDQTLCLTLAREAAARGISLSTIGLGGS 219 >gi|294011438|ref|YP_003544898.1| tight adherence protein TadE [Sphingobium japonicum UT26S] gi|292674768|dbj|BAI96286.1| tight adherence protein TadE [Sphingobium japonicum UT26S] Length = 201 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 10/78 (12%), Positives = 23/78 (29%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K L + +G + + L + DV Y + L+ Q A ++ Sbjct: 3 IKHLHRDESGVAAVEFGITASAFLALLLGGFDVGHTLYMQTVLQGTVQKAGRDLTLSSGA 62 Query: 71 SLEEVSSRAKNSFTFPKQ 88 + + ++ Sbjct: 63 EAAQQLAIDGRVRDAVRR 80 >gi|220913380|ref|YP_002488689.1| TadE family protein [Arthrobacter chlorophenolicus A6] gi|219860258|gb|ACL40600.1| TadE family protein [Arthrobacter chlorophenolicus A6] Length = 148 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 32/128 (25%), Gaps = 4/128 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + K G + AL+ PV+L + +V+ + + A Sbjct: 16 LGDQLKKTRKDSNHKSRGAVAVEFALVAPVLLTLVVGIVEFANAYNA----QISVTQAAR 71 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A+ + + ++A P + + T + T + Sbjct: 72 EAARTMAVKNSQADAKAAAVAGAPGINSAAFAYAFSPAACTPDTTAKVTITYPAATLTGL 131 Query: 123 NPRKSAYQ 130 Sbjct: 132 FGTSVTVT 139 >gi|260818212|ref|XP_002604277.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] gi|229289603|gb|EEN60288.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] Length = 1119 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 47/152 (30%), Gaps = 18/152 (11%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+G ++D + + + V + + G T+ A++ A +I++ Sbjct: 788 DTRVGVLQYSDGSTLECNLGDHPDW--SSFVNSMNTMARQGGGTSTGAALEFA--RLIAA 843 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + +++LTDG++ ++ + + I + Sbjct: 844 WR-------PAPVVPRIMIVLTDGDSEDSVVTP---AQALATEQVTVFAIGVGSFNRSEL 893 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + + FE + + I I Sbjct: 894 LQITNNNQ----DRVFELADFNAIANIMNRII 921 >gi|168998772|ref|YP_001688040.1| TerY1 [Klebsiella pneumoniae NTUH-K2044] gi|238549793|dbj|BAH66144.1| tellurite resistance protein [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 212 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 54/152 (35%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + + + Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTARQAVPLTDLLSFNLPS-------- 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ + I + + K + + L+TDG T D +G+ Sbjct: 73 FSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LVFLMTDGGPTDDWRKGVNEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 AK +G+ + + L Sbjct: 131 KAAK-KGVVVAC------AAGHDADTAVLKEI 155 >gi|149033988|gb|EDL88771.1| similar to Protein KIAA1510 precursor (predicted) [Rattus norvegicus] Length = 1320 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 51/178 (28%), Gaps = 15/178 (8%) Query: 228 KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 + V L + VD S + + I Sbjct: 151 SHTAVSVGKRGKLDDPQFQCTPPTPADIIFLVDGSWSIGHNN----FQQVKDFLASIITH 206 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D V++G T ++ ++ ++ + G+T A+ Sbjct: 207 FAIGPDKVQVGLTQYSGDPQTEWDL--NSFHTKEQVLAAVHHLHYKGGNTFTGLALTHVL 264 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 ++ EA K ++L+TDG+ +QD+ K Q I + T+ Sbjct: 265 ------GQNLKPAAGVRPEAAKVLILVTDGK-SQDDVRT--AARVLKDQDIDVFTVGV 313 >gi|126631586|gb|AAI34052.1| Col6a2 protein [Danio rerio] Length = 310 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 53/166 (31%), Gaps = 21/166 (12%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + +++ + + + + + + F + Sbjct: 57 SLVESIKDFTIRFAEKLQNVNYRGSVNITWAIGGLHFSQLQEFFSTITT--KEKFISNLR 114 Query: 333 E----MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I+ A+ ++ R + +AK++ V++TDG T + GI Sbjct: 115 PIRYLGRGTHIDCAITNMTKQLV--------RFPSRPDAKRFAVVITDGHVTANPCGGIK 166 Query: 389 IC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEAN 432 + +A+ + IRI +A + + L A SP + Sbjct: 167 VAAERARDENIRIFAVA-----SSRNLEETGLREIANSPAGVYRDK 207 >gi|125829706|ref|XP_696164.2| PREDICTED: collagen alpha-2(VI) chain [Danio rerio] Length = 1015 Score = 41.5 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 53/166 (31%), Gaps = 21/166 (12%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + +++ + + + + + + F + Sbjct: 57 SLVESIKDFTIRFAEKLQNVNYRGSVNITWAIGGLHFSQLQEFFSTITT--KEKFISNLR 114 Query: 333 E----MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 T I+ A+ ++ R + +AK++ V++TDG T + GI Sbjct: 115 PIRYLGRGTHIDCAITNMTKQLV--------RFPSRPDAKRFAVVITDGHVTANPCGGIK 166 Query: 389 IC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEAN 432 + +A+ + IRI +A + + L A SP + Sbjct: 167 VAAERARDENIRIFAVA-----SSRNLEETGLREIANSPAGVYRDK 207 >gi|325673437|ref|ZP_08153128.1| type II secretion system protein [Rhodococcus equi ATCC 33707] gi|325555458|gb|EGD25129.1| type II secretion system protein [Rhodococcus equi ATCC 33707] Length = 623 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 51/185 (27%), Gaps = 13/185 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + +K+ + IS + Sbjct: 91 IDVSGGMSGPALDDVKRAASDFVRQAPTGAHIGIVAISSTPQVLSELTTDSEDLLRRIDG 150 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G++AI D++ TA + + + A ++LLTDG +T + Sbjct: 151 LKAGGNSAIADSVVTAAEML-----------ERGEAANNILLLLTDGADTSSAHSMSELP 199 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + + S +T + S + A T L I++ Sbjct: 200 SVLSRSRASLYAVQMSTPETNSALLQQVARE--SRGQYASAGDTAALGAIYQSAARALGN 257 Query: 451 ERVIR 455 V+R Sbjct: 258 LYVVR 262 >gi|330831165|ref|YP_004394117.1| TapY1 [Aeromonas veronii B565] gi|328806301|gb|AEB51500.1| TapY1 [Aeromonas veronii B565] Length = 1214 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 22/224 (9%), Positives = 58/224 (25%), Gaps = 12/224 (5%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 N+ + ++ S + K + + + + + Sbjct: 261 NYNDGNEPGSRNGGRIIRNIVGNDAILTSGKSGAQDLIDK-VNTLSSDTWTPLCETLYES 319 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 +F D ++ ++ I KT + G + D Sbjct: 320 YRYFGGLGVYFGDDDVNRTPKRDSNAEVTSGIYKTPYDKCSNNGYIIYMTDGEPTNDKAA 379 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI----------RIM 401 S + + + Y L++ G + + A+ K+ + Sbjct: 380 DSLVANLISTLSTEDKVAYGGLVSYGS-SGEKSYLAALAGYMKNNDVNTSAIGKQTVTTF 438 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 T+ F + A + ++ A +LN+ + + Sbjct: 439 TVGFGDEAIKGAGALLAETARRGGGKYYPATDATQLNEALKSSL 482 >gi|327260894|ref|XP_003215268.1| PREDICTED: collagen alpha-3(VI) chain-like [Anolis carolinensis] Length = 3053 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 70/198 (35%), Gaps = 23/198 (11%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 ++ + V+ S+ D VR+ +++ ++ + + Sbjct: 1035 DGSDATRSSFPELKSFVQRVVDSL---DVGPGKVRVAVVQYSNDANTEFNL---NEYSDK 1088 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V T MG A+N A + +IS+ + ++++LLT E ++ Sbjct: 1089 ASVITAVQRMTAMGGYAVNTG--AALNYLISNVFTREAGSRVQEGVPQFVILLT-AERSR 1145 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKI 440 D+ A+ + K++G + I F L + P + EL + Sbjct: 1146 DDVRRPAL--ELKTRGAVPLGIGFG------NADITQLQTISFVPEFAVFVSGVSELGR- 1196 Query: 441 FRDRIGNEIFERVIRITK 458 I I ERV R+TK Sbjct: 1197 ----IQQLIAERVTRLTK 1210 >gi|308509664|ref|XP_003117015.1| CRE-CLEC-60 protein [Caenorhabditis remanei] gi|308241929|gb|EFO85881.1| CRE-CLEC-60 protein [Caenorhabditis remanei] Length = 399 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 17/154 (11%), Positives = 45/154 (29%), Gaps = 8/154 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI-VKTFAIDENEMGSTAIND 341 I N T R+G +N + + + + + Sbjct: 71 TRIGTNPNEPRTTRLGLVTYNAVANTVADLDQFQSLDDVYDGIFGGLAQVSSSDESYLAH 130 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A + + N ++ +++ + + I + ++ K+ G+ I+ Sbjct: 131 GLAQAEIILEAGQ-----TAVNRSHYERVVIVYASTYKGSGSLDPIPVADRLKTAGVTII 185 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 T+A+ + L ++P F Sbjct: 186 TVAYDQDGDGALLHD--LQKISTPPYNFANTDQA 217 >gi|75907575|ref|YP_321871.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701300|gb|ABA20976.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 819 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 48/167 (28%), Gaps = 22/167 (13%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F+D + RT + G T + ++ + Sbjct: 331 NPDDTFSIIDFSDTTRQLSPVPLANNSQNRTRAINYINRLTANGGTEMLRGIRAVLN--F 388 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + IVLLTDG +N+ + ++ G R+ + + Sbjct: 389 PVTDSGR---------LRSIVLLTDGYIGNENQILAEVQQHLQA-GNRLYSFGAGSSVN- 437 Query: 412 QEKARYFLSNCASPNS-----FFEANSTHELNKIFRDRIGNEIFERV 453 R+ L+ A T E+ F +I N + + Sbjct: 438 ----RFLLNRIAELGRGIAQIIRHDEPTDEVVDKFYRQINNPVLANI 480 >gi|328885837|emb|CCA59076.1| hypothetical protein SVEN_5790 [Streptomyces venezuelae ATCC 10712] Length = 865 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 39/149 (26%), Gaps = 25/149 (16%) Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE---- 384 T A++ A + + K + LLTDG+ + Sbjct: 146 DAARMGPGTDFPAAIRQAVSRLTADGTAGGTGAAGKPAP-KVVFLLTDGKLDVKDSPEYG 204 Query: 385 ------------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL----SNC----AS 424 +A++ G++I + F + C S Sbjct: 205 TDPESRQSNGEKRLTEELARARAAGVQIWPLGFGSEIDRAALTAMAEGGYRGACSEIPGS 264 Query: 425 PNSFFEANSTHELNKIFRDRIGNEIFERV 453 ++ E++K ++ R+ Sbjct: 265 VPHMRVVGTSAEIDKALQETFAAARCARI 293 >gi|187956243|gb|AAI50691.1| Integrin alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|223940571|ref|ZP_03632417.1| von Willebrand factor type A [bacterium Ellin514] gi|223890763|gb|EEF57278.1| von Willebrand factor type A [bacterium Ellin514] Length = 965 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 56/182 (30%), Gaps = 26/182 (14%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + RD V+ ++ D MG ++ + + + Sbjct: 425 QVARDCAQGVLAALGPND------EMGVVLWDGNEHWLFPMT---KVKDKKALGRQIAGM 475 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 N+ + M A++ + K + K+I++ +DG+ + + N Sbjct: 476 NQGDLPSFQGVMTLAHEGL-----------KKSHSNLKHIIVFSDGDPNAPSPSLM---N 521 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 S I + T+ E + + F+ S +EL +IF + Sbjct: 522 DIVSDRITVSTV-LIAGHAGPETMIWIADH--GKGRFYPVTSPNELPQIFIKEAAVILKS 578 Query: 452 RV 453 + Sbjct: 579 AI 580 >gi|73695954|gb|AAZ80783.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 66/183 (36%), Gaps = 17/183 (9%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 DSSS K ++ + + D VR+ + + + F+ + Sbjct: 54 DSSSSLRNRDFKTAIKFLQEFLSQYEISSDPNG--VRVSIISYGKGIYPEIGFNLTTYDT 111 Query: 320 IRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 +++ ++ G T A+Q ++ +++ K +++TDG Sbjct: 112 KDEVIEAIGRIPHKAGLRTDTGRAIQYMHEAQLANGVVRPGV-------TKVSIVITDG- 163 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANSTHEL 437 N+Q+ + +A+ I + I + E L N A + + ++ ++L Sbjct: 164 NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSE-----LLNIAGDQSRVTKVDNYNQL 218 Query: 438 NKI 440 + I Sbjct: 219 SSI 221 >gi|315656396|ref|ZP_07909285.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315492955|gb|EFU82557.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 171 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 13/135 (9%), Positives = 41/135 (30%), Gaps = 6/135 (4%) Query: 9 FYSKKLIK--SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ + + G ++ ++ + M + + + L++ A A+ Sbjct: 15 RFAARFERLGDEDGRIMLLGLAACVLLCVIILMSMALSGVYLEQRRLQRLADQTASMAA- 73 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 ++ + + +IE Y + + V+++ Sbjct: 74 ---ANMADTAYYQNGIVDGVPLEIEPYHASERAAEYLSGASISANSSLDGIDLVDVDVAS 130 Query: 127 SAYQVVLSSRYDLLL 141 + QV L + + L Sbjct: 131 TRVQVTLRATGKIPL 145 >gi|212635869|ref|YP_002312394.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] gi|212557353|gb|ACJ29807.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] Length = 710 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 35/127 (27%), Gaps = 13/127 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + FN V + + + G T ++ A+ A Sbjct: 381 SAQDSFNILQFNSNVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALS--- 437 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + N E + ++ +TDG + + I N+ + R+ TI Sbjct: 438 --------QQDANRERLRQVLFITDGAVGNEPQLFTQIRNQLQQS--RLFTIGIGDAPNA 487 Query: 412 QEKARYF 418 R Sbjct: 488 HFMQRAA 494 >gi|149920875|ref|ZP_01909337.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] gi|149818274|gb|EDM77727.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] Length = 367 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 40/287 (13%), Positives = 69/287 (24%), Gaps = 51/287 (17%) Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 A + + G +P + + ++ V+ + Sbjct: 29 NDEASSITTDGGATEDEAGDDGSTTTPMYFDLPGEE----SQPTAEGATTDCANVEVDTS 84 Query: 265 RHVIKKKHLVRDALASVIR----------SIKKIDNVNDTVRM--GATFFNDRVISDPSF 312 LV + + D VN V+ F + S + Sbjct: 85 PTTPTVVLLVDQSGSMWDDFGGQPRWVALENTLFDPVNGVVKPLEDQVRFGLALYSSMNG 144 Query: 313 SWG----------VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 S+G + + T D+++ +T+ + ED Sbjct: 145 SFGGECPLITEFAPSFGNHASLAATFASAMPLDDTPTGDSIKAVAETLAAFPEDGP---- 200 Query: 363 NNLEAKKYIVLLTDGENTQ--------DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K IVL TDGE + A GIR I+ T Sbjct: 201 ------KIIVLATDGEPDTCAVPDPQEGQPLSLEATQAAFDDGIRTFVISVGNQVTDAHL 254 Query: 415 ARYFLSNCASPNS-------FFEANSTHELNKIFRDRIGNEIFERVI 454 + P F++ + EL F I I Sbjct: 255 QELANAGVGLPTQGAVENAPFYKTLNPAELVSAFEAVINGFIGCEFT 301 >gi|116625363|ref|YP_827519.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228525|gb|ABJ87234.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 317 Score = 41.5 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 14/105 (13%), Positives = 34/105 (32%), Gaps = 8/105 (7%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI---- 403 + + + +K IV++TDG + A+ I +I Sbjct: 162 TVLYDAVYLAANEKLKGEVGRKVIVVITDGVDQGSRMSRNQAIEAAQKSDCVIYSIDYSD 221 Query: 404 --AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 A+ L + + ++ + H L+++F++ Sbjct: 222 PRAYGPFNMVGGGGEGELRKMSDETGGRVYKVDRRHTLDQVFKEL 266 >gi|187466178|emb|CAQ51888.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|218671458|ref|ZP_03521128.1| hypothetical protein RetlG_07263 [Rhizobium etli GR56] Length = 94 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 11/85 (12%), Positives = 29/85 (34%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ +++LI+ G I A+L PV++ + ++ +AA + Sbjct: 8 TRLALTARRLIRERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKRATRAAGSIADLV 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQK 89 + + ++ + Sbjct: 68 TQQQSVTKSTLAEMRSVATAIFVPY 92 >gi|270014558|gb|EFA11006.1| hypothetical protein TcasGA2_TC004591 [Tribolium castaneum] Length = 1235 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 36/301 (11%), Positives = 80/301 (26%), Gaps = 30/301 (9%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +D SRS + + + + + Sbjct: 143 LAVDTSRSSVHVPTNIFDRHEEAAYAIQWSEKLDEIFVRNYNSDPALSWQYFGSTSGIMR 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV-----------------RDALASVIR 283 +Y P + V + + K ++ + +++ Sbjct: 203 HYPAKKWPNIEKDEFDCRVRTWYIEAATCTKDVIILVDNSGSMDGMGRHIASLTVNTILD 262 Query: 284 SIKKIDNVND-TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + D +N T + + + K G T A Sbjct: 263 TFSNNDYINILYYSNQTTNYTIPCFRN--LLVQATPENIVLFKEAIRHLGPSGKTDFPQA 320 Query: 343 MQTAYDTIISS------NEDEVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNKAK 394 +Q A+D + + N +E+ + + I+L+TDG N D + K Sbjct: 321 LQMAFDILENYREIRGCNNEEIDEEGKSKACNQAIMLITDGISRNFSDIVMRNNQLDGGK 380 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +RI T + + K CA+ + + + ++ I V+ Sbjct: 381 TIPVRIFT--YLIGKEVTNVEEIRWMACANRGFYTQVQTLEQVTSAVLQYINVIARPLVL 438 Query: 455 R 455 + Sbjct: 439 Q 439 >gi|119620823|gb|EAX00418.1| vitrin, isoform CRA_c [Homo sapiens] Length = 700 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 44/343 (12%), Positives = 84/343 (24%), Gaps = 17/343 (4%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 S S + +++A + K E + + P Sbjct: 126 SALTYSSSKSPAAQAGKCSRVIESKPSESMNTRRVLGDSGEINILTGETTKAYQRPPI-P 184 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 +A V L + + + S T +E + Sbjct: 185 GTTAQPVTLMQLLAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYT 244 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM-VSCNKSLYYM 243 S++ + Q + K + + L P S Sbjct: 245 SSQNRPRADPGIQRQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPRGSDGER 304 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 G D + +D S+ ++ + + LA V + +D MG + Sbjct: 305 GTRGCPDCKIDLSFLIDGSTSIG-KRRFRIQKQLLADV---AQALDIGPAGPLMGVVQYG 360 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 D + + + G + + A+ S N Sbjct: 361 DNPATHFNLKTHTNSRDLKTAIEKIT--QRGGLSNVGRAISFVTKNFFSKANGNRSGAPN 418 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 +V++ DG T EE + A+ GI I I Sbjct: 419 ------VVVVMVDGWPTDKVEEASRL---ARESGINIFFITIE 452 >gi|257895102|ref|ZP_05674755.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] gi|257831667|gb|EEV58088.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] Length = 1341 Score = 41.5 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 37/418 (8%), Positives = 92/418 (22%), Gaps = 46/418 (11%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 AA + + + F + +KK + Sbjct: 140 YAADNGSTAG-EEDVNVPAQTVQLWGDERNFENSYL-----DYNGAYIKKWVEPVLSNNP 193 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D E + + ++P+ + S T + E Sbjct: 194 TSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIE 253 Query: 175 HGVSIQWVIDFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 I + +M +++ Q + + + S+N + Sbjct: 254 AVNEISENLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALN 313 Query: 224 IRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHV----IKKKHLVRDAL 278 +P + G + + + V I++ + Sbjct: 314 RTPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTN 373 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V+ ++ D + N + R Sbjct: 374 WAVMNNMIARDTGDLFKNFETYSGNTSGAGFSYPVVYPSEFNRPEDTWDYNYRYGEVKEG 433 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + A S+ + A + + E + Sbjct: 434 DDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAY-HHWLKNKYQENPPS---------- 482 Query: 399 RIMTIAFSVNKTQQ------EKARYFLSNCA------SPNSFFEANSTHELNKIFRDR 444 I +I ++ + R L N A + +++AN+ +++ D Sbjct: 483 -IFSIGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDI 539 >gi|322688246|ref|YP_004207980.1| hypothetical protein BLIF_0055 [Bifidobacterium longum subsp. infantis 157F] gi|320459582|dbj|BAJ70202.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 362 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 111/378 (29%), Gaps = 33/378 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + TF + + + + ++ + V + Sbjct: 15 DGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGS 74 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++ + +++ S S + S S I + I S+++ Sbjct: 75 LEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFGIAKSKAV 129 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 D G V + + + + + + ++ Sbjct: 130 KLGWADDTGAAKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPSWTIWVVD 189 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+S E + + + I+ + V F Sbjct: 190 YSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIPFETEAHR 239 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + A + G T I + + +A D + S +E + Sbjct: 240 PVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT------- 288 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL+TDG + D+++ K++ + + I +I F Q K+ L S Sbjct: 289 -AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKV 343 Query: 429 FEANSTHELNKIFRDRIG 446 F+ S +L +FR G Sbjct: 344 FDGRSG-DLAAVFRQVKG 360 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 70/199 (35%), Gaps = 39/199 (19%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMG----ATFFNDRVISDPSFS---WGVHKLIRT 322 + ++ AL + + + R FND+V + F+ Sbjct: 381 RLAALQQALTGLTGADDSLSGRFARFRAREQVTIITFNDKVTATRQFTVSDPTPGSADLK 440 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + G+TAI A+ AY T + + + + + IVL+TDGEN + Sbjct: 441 AISDYGAALRAGGNTAIYSALDAAYTTAAAGMKADPSALTS-------IVLMTDGENNRG 493 Query: 383 ----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFE 430 N + +G+R + F + R L+ A + + F+ Sbjct: 494 LDSAGFLARYNTRPPDV------RGVRTFAVDFG------DADRAALTQIATSTGGAVFD 541 Query: 431 AN-STHELNKIFRDRIGNE 448 A L+ +FR+ G + Sbjct: 542 ATAPGVSLSDVFREIRGYQ 560 >gi|319937918|ref|ZP_08012318.1| hypothetical protein HMPREF9488_03154 [Coprobacillus sp. 29_1] gi|319806824|gb|EFW03463.1| hypothetical protein HMPREF9488_03154 [Coprobacillus sp. 29_1] Length = 134 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 42/125 (33%), Gaps = 1/125 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 I+ +G + ALL PV+LG+ + V + + A A AI A+ Sbjct: 3 FIRDESGETTVEMALLFPVILGLIMFAIGVNMFYEGKIATSVGANEAIRYAATQTSYENA 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + + I+ + + + D + S Y L Sbjct: 63 KREANERLKAIYTQHNIKLT-KFSLTHIDVNRDGKYSIGDKLVLNIETKKGIWSEYTYEL 121 Query: 134 SSRYD 138 + R + Sbjct: 122 TVRVE 126 >gi|293346541|ref|XP_001058131.2| PREDICTED: alpha 1 type VII collagen-like [Rattus norvegicus] gi|293358308|ref|XP_230973.5| PREDICTED: collagen, type XX, alpha 1 [Rattus norvegicus] Length = 1320 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 31/93 (33%), Gaps = 9/93 (9%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ + G+T A+ ++ EA K ++ Sbjct: 230 DLNSFHTKEQVLAAVHHLHYKGGNTFTGLALTHVL------GQNLKPAAGVRPEAAKVLI 283 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 L+TDG+ +QD+ K Q I + T+ Sbjct: 284 LVTDGK-SQDDVRT--AARVLKDQDIDVFTVGV 313 >gi|87200511|ref|YP_497768.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87136192|gb|ABD26934.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 193 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 36/112 (32%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 KL+ G I AL++PV L +D+ + Y L A + A +A++ + Sbjct: 6 KLVAEDCGVTTIEFALVLPVFLLAIVGCLDLGQMVYAVGVLDGAVEKAARSAALETGDTT 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + +++ N N +R T E Sbjct: 66 AADAEVEDVMSRILPGSTLATSRKSYANYSDINRPERWNDADNNGTCSEGET 117 >gi|108757686|ref|YP_629407.1| hypothetical protein MXAN_1147 [Myxococcus xanthus DK 1622] gi|108461566|gb|ABF86751.1| hypothetical protein MXAN_1147 [Myxococcus xanthus DK 1622] Length = 525 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 30/91 (32%), Gaps = 2/91 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI--ITASVPLIQSL 72 ++ G ++ AL V+ + + ++ + L+Q + + + Sbjct: 19 RRAQRGQSIVLGALSFLVLALMVTLSFNLAHALRQKMGLQQHSDALAYSMAVLEARSLNY 78 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 VS+RA + I Y++ Sbjct: 79 YAVSNRAIAASYVAMNSIHAYMVAASVTGSM 109 >gi|239817419|ref|YP_002946329.1| TadE family protein [Variovorax paradoxus S110] gi|239803996|gb|ACS21063.1| TadE family protein [Variovorax paradoxus S110] Length = 156 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 11/84 (13%), Positives = 28/84 (33%), Gaps = 3/84 (3%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ---TAIITASVPLIQSLEE 74 G + I A++ V + ++ + L+ AA+ A + L + Sbjct: 13 QRGVYAIEFAVVFMVFFSILYAVICYGMLFAFRLGLQNAAEDGARAALQYQSSLNDRKKR 72 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNF 98 + A + + Q ++ + Sbjct: 73 AADVATSQSDWMPQVVKPTVSAQI 96 >gi|198426251|ref|XP_002120495.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 1059 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 77/258 (29%), Gaps = 19/258 (7%) Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEE- 256 ++C + ++ + S + P P L+ Sbjct: 550 NGQVSCTDSNTYDSTCTFQCDQSNGYSLYPSGHSRTTCLSNSSWSWASPCCSRPCLNHSV 609 Query: 257 ----HFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 +DSSS ++++ + VI + + R+ +N R Sbjct: 610 IDMIFILDSSSSVGSDNW-DVMKNFVRDVINLLSITETGT---RVSIFRYNRRPDRQSQI 665 Query: 313 SWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 H + + + G T I +A+ A D I+ + + + + Sbjct: 666 LLNDHIGDKQGLLSALDQMPYNGRGTWIGNALNHAKDFILQLRNGDRPDVID------VV 719 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 + +TDG ++D+ ++ + + QG + + + L S + Sbjct: 720 LTITDGR-SKDDVSVVS--EELRKQGALTYAVGVIPPNGKGPREEELLKIAGSTDRVKIT 776 Query: 432 NSTHELNKIFRDRIGNEI 449 S F + +++ Sbjct: 777 TSFEGTKDHFEKLMSDDL 794 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 14/165 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPS-FSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + D R +N V +D +++ + + T A+ Sbjct: 900 LSSFDISLTATRAAVFRYNSMVYNDSQILLSDFPTNKSGLLEAYDAIVHNGRGTMTGQAL 959 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q DTI++ + ++ +TDG + ++ AI + + +G I Sbjct: 960 QYVNDTILADVAGNRPEAVD------VVLTVTDG---RSVDDVTAISDSLRQRGALTYAI 1010 Query: 404 AFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGN 447 ++ R L + A + F A + + +F IG Sbjct: 1011 GVI-PPNKKGADRDQLLSIAGEANNIFLAINNFD--GVFDKFIGK 1052 >gi|149909169|ref|ZP_01897826.1| hypothetical protein PE36_09161 [Moritella sp. PE36] gi|149807693|gb|EDM67639.1| hypothetical protein PE36_09161 [Moritella sp. PE36] Length = 162 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 9/71 (12%), Positives = 25/71 (35%), Gaps = 4/71 (5%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + ++ +S G I A+ + + L+++ R+ A+ Sbjct: 1 MTIRNKMNCSQVKRSQRGSVSIEFAVTILPFFVLLLALIEISRFMMV----SSVIDVALT 56 Query: 63 TASVPLIQSLE 73 +A+ L+ + Sbjct: 57 SAARQLVVTSA 67 >gi|115358178|ref|YP_775316.1| TadE family protein [Burkholderia ambifaria AMMD] gi|170703648|ref|ZP_02894385.1| TadE family protein [Burkholderia ambifaria IOP40-10] gi|115283466|gb|ABI88982.1| TadE family protein [Burkholderia ambifaria AMMD] gi|170131447|gb|EDT00038.1| TadE family protein [Burkholderia ambifaria IOP40-10] Length = 142 Score = 41.5 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 41/136 (30%), Gaps = 1/136 (0%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G + AL++P +L V ++DV + + A++ A V + L Sbjct: 6 RGARHARGVVSLEFALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPML 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + A + ++ + + + T + Sbjct: 66 -TPTQIANVALSYTQNSLVSGGTGTVPTVAVTQANGTTAGTALTVTVTYTYSGLVLGTAL 124 Query: 133 LSSRYDLLLNPLSLFL 148 + ++ S+ L Sbjct: 125 SVLTGPITISASSVML 140 >gi|271964249|ref|YP_003338445.1| hypothetical protein Sros_2742 [Streptosporangium roseum DSM 43021] gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 774 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 49/172 (28%), Gaps = 23/172 (13%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKL---IRTIVKTFAIDENEMGSTAINDAMQT 345 D + R F+ V G+ + R G T + ++ Sbjct: 322 DTLTGGDRFAVLSFDGVVEQPEGLGEGLSEASDRNRYRAVEHLARLEARGGTEMLAPLEQ 381 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A + S D + +VL+TDG+ +++ I A+ G+R+ T+ Sbjct: 382 AVALLSESGRD------------RVLVLVTDGQVGNEDQILERI--GARLAGVRVHTVGI 427 Query: 406 SVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR---IGNEIFERVI 454 R S L++ IG + + Sbjct: 428 DRAVNAGFLGRL---AGLGSGRCELVESEDRLDEAMEHIHRRIGAPLVTDLS 476 >gi|227823966|ref|YP_002827939.1| Flp pilus assembly protein TadG [Sinorhizobium fredii NGR234] gi|227342968|gb|ACP27186.1| Flp pilus assembly protein TadG [Sinorhizobium fredii NGR234] Length = 201 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 11/83 (13%), Positives = 19/83 (22%), Gaps = 4/83 (4%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + L+ G I A+L V ++ E L A T + + Sbjct: 17 QLFRSLLGDRRGATAIEFAILALPFFIVVFASIETFVAFAGEQLLANATDT----MARKI 72 Query: 69 IQSLEEVSSRAKNSFTFPKQKIE 91 +I Sbjct: 73 RTGEITKDMSEAEFRQAFCDEIA 95 >gi|218693343|gb|ACL01175.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGETPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|218693335|gb|ACL01171.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|190336734|gb|AAI62194.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] gi|190339304|gb|AAI62181.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] Length = 553 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 45/130 (34%), Gaps = 22/130 (16%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 L ++ G TA DA+ A ++ + +K+++++TDG+ Sbjct: 431 LKDNALRALQKIPYMSGGTATGDAINFAVRSLFKP---------RSSSNRKFLIIITDGQ 481 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTH 435 + D+ A+ A+ +GI + + L AS F Sbjct: 482 -SYDDVRVPAM--AAQREGITVYAVGV------AWAPMEDLKAMASEPKESHVFFTREFT 532 Query: 436 ELNKIFRDRI 445 L + F+ I Sbjct: 533 GLGQ-FQQPI 541 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 26/248 (10%), Positives = 63/248 (25%), Gaps = 18/248 (7%) Query: 186 SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLY 245 S Q S Q F ++ S + Sbjct: 102 SSYAHGVQSQSLSQWSASFTVARTISLPLEVSSQTSSSATVASGAAKKPVKKIVKKPPPA 161 Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 D + +DSS ++ +L ++ ++ + +K ++ Sbjct: 162 TAHKDCPVDMALLLDSSYNIGQ-RRFNLQKNFVSKLATMLKVGTQGPHVG---VVQTSET 217 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 ++ + T G+T A+ + + Sbjct: 218 PRTEFYLTNYTTAKDVTFAIKEIPYI--GGNTNTGKAI------LHTVRNFFSPDFGVRR 269 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---KARYFLSNC 422 + IV+ DG ++ A+ GI I ++ + ++ + F+ Sbjct: 270 GYPRVIVVFVDG---WPSDNVEEAAILARESGINIFFVSVAKPSPEEASLVSDQDFMRKA 326 Query: 423 ASPNSFFE 430 ++ F Sbjct: 327 VCKDNEFF 334 >gi|149913214|ref|ZP_01901748.1| hypothetical protein RAZWK3B_04460 [Roseobacter sp. AzwK-3b] gi|149813620|gb|EDM73446.1| hypothetical protein RAZWK3B_04460 [Roseobacter sp. AzwK-3b] Length = 136 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 33/114 (28%), Gaps = 2/114 (1%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL-KQAAQT 59 M F + + K G + L MP+ L + + +DV + + T Sbjct: 1 MNFLPRLARCMRAYRKKSDGTVSVEFVLWMPLFLVILALAIDVSLLFMSQSNYWSVSRDT 60 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 A + + + S + +F Q + + R + Sbjct: 61 ARL-VARHAMDGTTAKSYAEMRAGSFFGQPQATVEYGPSTVTVTLSAPARSIMI 113 >gi|94989622|ref|YP_597722.1| collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|94543130|gb|ABF33178.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|198417325|gb|ACH87880.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417327|gb|ACH87881.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417329|gb|ACH87882.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417331|gb|ACH87883.1| ancillary protein 1 [Streptococcus pyogenes] Length = 890 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 60/219 (27%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFKSFLNKTPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F G + + + + G AIN Y Sbjct: 440 ESFKLFSDRKVPVPGGTTQAAYQVPQNQLSVMSNEGY-AINRGYIYLYWR---DYNWVYP 495 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + T GE T G K +G I T+ VN +A+ Sbjct: 496 FDPKTKTVSATKQIKTHGEPTTLYFNG-----NIKPKGYDIFTVGIGVNGDPGATPLEAK 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 EFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|13474654|ref|NP_106223.1| hypothetical protein mll5590 [Mesorhizobium loti MAFF303099] gi|14025409|dbj|BAB52009.1| mll5590 [Mesorhizobium loti MAFF303099] Length = 421 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 15/179 (8%), Positives = 51/179 (28%), Gaps = 9/179 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 G + AL++P++L + M ++ + + + + + Sbjct: 234 FWSDRKGVAAVEFALIVPILLIMYFMTMEASQAIETSKKVSR--------IGSMVADLVT 285 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + K + + + T +V + + R+ A V Sbjct: 286 QQPTIVKADLDAIMKIGTSTIQPYNRSTPNITITAIQVTTDTPPKVLVVWSRQVANGVYS 345 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR-SYHKEHGVSIQWVIDFSRSMLD 191 ++ + L+ G + + + + + + + + + M + Sbjct: 346 AAAAAGTTTTVPATLKVAGTFLIRVDSNLSYTPIIGWTTDTQQKLGLTKSLTTTIPMGE 404 >gi|255535988|ref|YP_003096359.1| BatB [Flavobacteriaceae bacterium 3519-10] gi|255342184|gb|ACU08297.1| BatB [Flavobacteriaceae bacterium 3519-10] Length = 335 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 43/119 (36%), Gaps = 14/119 (11%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 ++G F S + + + +M T AMQTA D + Sbjct: 128 NDKVGIIVFAGEASSIMPLTTDFTAVETYVGGVE-TSIVKMQGTDFLKAMQTAADKFRNV 186 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + +VLL+DGE+ + NE A A +GIR++++ + Sbjct: 187 AKGS-----------RKVVLLSDGEDNEGNE--KAAAKLANREGIRVISVGIGSEEGAP 232 >gi|307155059|ref|YP_003890443.1| Vault protein inter-alpha-trypsin domain-containing protein [Cyanothece sp. PCC 7822] gi|306985287|gb|ADN17168.1| Vault protein inter-alpha-trypsin domain protein [Cyanothece sp. PCC 7822] Length = 796 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 33/119 (27%), Gaps = 12/119 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F+D R++ + G T + ++ + Sbjct: 331 NPDDTFSIIDFSDTTQQLSPVPLANTSQNRSLALNYINRLTAGGGTELMRGIRAVLN--- 387 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + IVLLTDG +N+ + K G R+ + + Sbjct: 388 --------FPITDAGRLRSIVLLTDGYIGNENQILAEVQQHLK-PGNRLYSFGAGSSVN 437 >gi|33333552|gb|AAQ11892.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + A F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|3273251|dbj|BAA31169.1| thrombospondin-related protein [Plasmodium falciparum] Length = 574 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + A F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NENAIHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|268324283|emb|CBH37871.1| hypothetical secreted protein, CARDB family [uncultured archaeon] Length = 1149 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 44/140 (31%), Gaps = 21/140 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 GV ++ K++ N G + + T+ + + +A K IVL+ Sbjct: 156 GVLSNSSSMFKSWIDSLNPSGGGDLPE------STLAALMHTVKDQKWRGGDASKIIVLI 209 Query: 375 TDGEN-------TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +D Q+ + + G+ + + ++A + ++ Sbjct: 210 SDAYPHSDEHCCNQEKNTFDGVISALTRSGMTVYVVG-------PDEASMEMIANSTGGK 262 Query: 428 FFEA-NSTHELNKIFRDRIG 446 FF L + + G Sbjct: 263 FFHIRAEGVSLKPVLEEIAG 282 >gi|262371043|ref|ZP_06064366.1| predicted protein [Acinetobacter johnsonii SH046] gi|262314119|gb|EEY95163.1| predicted protein [Acinetobacter johnsonii SH046] Length = 535 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 27/321 (8%), Positives = 64/321 (19%), Gaps = 15/321 (4%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + T + + N + T + + V Sbjct: 206 NTPPVASDDTYNTAFNTPVALNPLANDSDADGNPLTITAINGVTLTPGTAQQINTPNGVV 265 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + ++ + P + F + + + T + + + + V Sbjct: 266 NIDAQGTITFTPNNGFTGQESFNYSISDGQGGSSTATETINVAAAPNTPPVASDDTYNTA 325 Query: 192 YQRDSEGQPLNCFGQ-----PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 Y PL + + G + + ++ + Sbjct: 326 YNTPVTLTPLANDSDADGGTLTITAINGVTLTPGTAQQINTPNGVVNIDAQGTITFTPNA 385 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 G D A ++ S + T FN V Sbjct: 386 GFSGQETFNYSISDGQGG---------SSTATGTINVSAAPNTPPVASDDTYNTAFNTPV 436 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 + + + T TA I + Sbjct: 437 -TLNPLANDTDADGNPLTITAINGVTLTPGTAQQINTPNGVVNIDAQGTITFTPNTGFSG 495 Query: 367 AKKYIVLLTDGENTQDNEEGI 387 + + ++DG+ I Sbjct: 496 QESFNYSISDGQGGTSTANQI 516 >gi|225181999|ref|ZP_03735431.1| flp pilus assembly protein TadD [Dethiobacter alkaliphilus AHT 1] gi|225167284|gb|EEG76103.1| flp pilus assembly protein TadD [Dethiobacter alkaliphilus AHT 1] Length = 319 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 44/145 (30%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K ++ + G ++ A+ M V+LG ++ DV + L AA A + + Sbjct: 7 LKNILNNQRGSVLVLVAVGMTVLLGAAALVADVGVNYVTQARLSVAADAAALAGGTLFDE 66 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + V A + E+ + +N + + M + Sbjct: 67 GRDAVMLAAVETAEKNGVPAEQVFVEVDDNARGVTVRTQAPVQLFFGRIFGMEGGAMEQR 126 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKS 155 + + + GI+ Sbjct: 127 ARAARTRPIAFYGVFPLGVEEGIEF 151 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 22/220 (10%), Positives = 57/220 (25%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKIGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ + TAI +A+ A + + IV++T Sbjct: 150 YISA-KMFLESISPSLISKQGTAIGEAINLAVRSFTPQ-----------EGVGRAIVVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A A +GI++ + + Sbjct: 198 DGENHEGGAVEAAK--VAAEKGIQVSVLGVGMPDGAPIPIEGTNDYRRDREGNVIVTRLN 255 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A + ++T+ K + V Sbjct: 256 EAMCQEIAKEGKGIYVRVDNTNSAQKAINQEVNKMAKSDV 295 >gi|172062966|ref|YP_001810617.1| TadE family protein [Burkholderia ambifaria MC40-6] gi|171995483|gb|ACB66401.1| TadE family protein [Burkholderia ambifaria MC40-6] Length = 142 Score = 41.5 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 41/136 (30%), Gaps = 1/136 (0%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G + AL++P +L V ++DV + + A++ A V + L Sbjct: 6 RGARHARGVVSLEFALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPML 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + A + ++ + + + T + Sbjct: 66 -TPTQIANVALSYTQNSLVSGGTGTVPTVAVTQANGTTSGTALTVTVTYTYSGLVLGTAL 124 Query: 133 LSSRYDLLLNPLSLFL 148 + ++ S+ L Sbjct: 125 SVLTGPITISASSVML 140 >gi|312139258|ref|YP_004006594.1| type ii secretion system integral membrane subunit [Rhodococcus equi 103S] gi|311888597|emb|CBH47909.1| putative type II secretion system integral membrane subunit [Rhodococcus equi 103S] Length = 622 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 51/185 (27%), Gaps = 13/185 (7%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + +K+ + IS + Sbjct: 90 IDVSGGMSGPALDDVKRAASDFVRQAPAGAHIGIVAISSTPQVLSELTTDSEDLLRRIDG 149 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G++AI D++ TA + + + A ++LLTDG +T + Sbjct: 150 LKAGGNSAIADSVVTAAEML-----------ERGEAANNILLLLTDGADTSSAHSMSELP 198 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + + + S +T + S + A T L I++ Sbjct: 199 SVLSRSRASLYAVQMSTPETNSALLQQVARE--SRGQYASAGDTAALGAIYQSAARALGN 256 Query: 451 ERVIR 455 V+R Sbjct: 257 LYVVR 261 >gi|291520528|emb|CBK75749.1| Gram positive anchor./von Willebrand factor type A domain [Butyrivibrio fibrisolvens 16/4] Length = 605 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 37/119 (31%), Gaps = 9/119 (7%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + L++ A+ ++I +I + + + F R + + + Sbjct: 6 GDKTRIQLLKSAVDNMIDNI--AEKEDVDAKWEVIDFATRAAVRGGGWLNTSNVKQYVTT 63 Query: 326 TFA--IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + T M A + + AKK ++ LTDG+ T Sbjct: 64 AINEDNNVDIGRGTNYQAGMDLAQKEFEKKQPES-----DRPNAKKIVLFLTDGQPTYY 117 >gi|148699894|gb|EDL31841.1| procollagen, type VI, alpha 1, isoform CRA_c [Mus musculus] Length = 227 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 51/142 (35%), Gaps = 16/142 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMG---ATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 V+ I +++ D + A ++D V + Sbjct: 58 VDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVD 117 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG- 386 A+ T + A++ + ++ ++L+ KY++++TDG + +E Sbjct: 118 AVK-YFGKGTYTDCAIKKGLEELL--------IGGSHLKENKYLIVVTDGHPLEGYKEPC 168 Query: 387 ---IAICNKAKSQGIRIMTIAF 405 N+AK GI++ ++A Sbjct: 169 GGLEDAVNEAKHLGIKVFSVAI 190 >gi|116622522|ref|YP_824678.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225684|gb|ABJ84393.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 324 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 58/191 (30%), Gaps = 32/191 (16%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K R A A ++ D FND+ + V ++ Sbjct: 111 KLQKSRMAAAEFFKTANPDD------EFFLVEFNDQPKMVVPLTRDVE-----QIQNQLT 159 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TA+ DA+ A + K + + +K +++++DG + + Sbjct: 160 WAQSKGRTALLDAIFLAMSEL-----------KKSTKNRKALLIISDGGDNSSRYTESEV 208 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQ------EKARYFLSNCA--SPNSFFEANSTHELNKIF 441 N + + I I + L+ + + A +EL Sbjct: 209 RNLVRENDVLIYAIGVYEFAGGRMRTPEEAGGPGLLTELSEQTGGRHLPA-DANELPD-I 266 Query: 442 RDRIGNEIFER 452 +IG E+ R Sbjct: 267 AAKIGVELRNR 277 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 48/384 (12%), Positives = 101/384 (26%), Gaps = 50/384 (13%) Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE-MNPRKSAYQVVLS-SRYDLLL 141 + + Y + N++ + N + + +T + S Y + Sbjct: 255 DYTTDQSGTYPVANWKTDENSNVLNHQGNKDAGETWDGITTWNGDPANLTHSYIEYGGVG 314 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQWVIDFSRSMLDYQRDSEGQP 200 + LR ++ V + ++ + + V+D+S SM + R +E + Sbjct: 315 DEADFALRKFAKETNTPGLFDVYLNVRGNVQRQIDPIDVVLVVDWSGSMNEMGRIAEVKK 374 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 GI+D Y+ + Y Sbjct: 375 GVDRFLNQIEG----------SGIQDSVYMGYVGYSSDGSNYQNKTC---------QLGK 415 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S ++ I+ A + + + N + GV Sbjct: 416 FSEVKETIRSMTPETAAGGTFTQR------GLRQAGDMLSTQNGHKKVIVLLTDGVPTYS 469 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + K + T + +T N +NN + NT Sbjct: 470 YHVSKVHTQADGSYYGTVFSLTQDQPMNTSFLYNGYFAFDQQNNYKWI---------NNT 520 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFE- 430 G A+ K +GI I + + Q + S + ++E Sbjct: 521 FIATIGEAM--ALKERGIEIHGLGIQLQGDQTAGYTKADVEKKMRQMVSADEDGHLYYES 578 Query: 431 ANSTHELNKIFRDRIGNEIFERVI 454 AN ++ + ++ I V Sbjct: 579 ANEAADIAD-YLEKKALHISATVT 601 >gi|227552253|ref|ZP_03982302.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895164|ref|ZP_05674817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293378024|ref|ZP_06624201.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227178583|gb|EEI59555.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257831729|gb|EEV58150.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292643342|gb|EFF61475.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 498 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 41/292 (14%), Positives = 80/292 (27%), Gaps = 20/292 (6%) Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + KN + + +T+ + + L LF + K Sbjct: 150 SFSFSIDNKNQETAQTNEKTSETSTSYENHEISDDSSLDKDEYRPFEQPQLFNAQLKPKG 209 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + E T + + + + Q + LN + SY Sbjct: 210 LA-TIEPEYTTDEQGTYPKAMWQPDNSQYVRNHQGNRQGQQQWDGLNGWDGNPTNRNNSY 268 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIK 269 G+ D + + + LY + VD S + Sbjct: 269 IEYGGEKEDADYAIRKFAKETATPGLFDLYLNVRGNTQKNITPLDLVLVVDWSGSMNDNN 328 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-A 328 + V+ + + ++ + D + MG ++ S + + + + Sbjct: 329 RIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSIEGYSYSNGAVQMGSFDSVKNQVKSI 387 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 G T A++ A + + N KK IVLLTDG T Sbjct: 388 TPSWTNGGTFTQKALRDAGNMLSVPN-----------GHKKVIVLLTDGVPT 428 >gi|320105608|ref|YP_004181198.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924129|gb|ADV81204.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 373 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 47/153 (30%), Gaps = 49/153 (32%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 TA+ DA+ A D +I +K ++LLTDGE+ E + A+ Sbjct: 189 TALYDAIFLASDEVIHKQPF-----------RKALILLTDGEDNGSKESLSSAIEAAQRA 237 Query: 397 GIRIMTIAFSVNKTQQEK-----------------------------------ARYFLSN 421 + +I F + + L Sbjct: 238 DTAVYSIYFKGEEHNDTSSRRPSFGGGGFPGGGGRHGGGGGQGGGQPQRTHVDGKKILQR 297 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGNEIFER 452 + + FFE + L +I++ I E+ + Sbjct: 298 ISDETGGRFFEVSKKEPLAEIYKK-IAQELRSQ 329 >gi|297190882|ref|ZP_06908280.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197722677|gb|EDY66585.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 518 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 66/186 (35%), Gaps = 23/186 (12%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRM----GATFFNDRVISDPSFSWGVHKLIRTIVK 325 + +R+ + + F RV+ + + ++ + + + Sbjct: 344 RITALRETIDGLAGGDDSPSGKFVRFYRGETLTVMRFGGRVLEERNITYDGPRDLDRLRG 403 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 A D+ GSTAI + ++ + + + E + IVL+TDGEN + Sbjct: 404 VVASDDFA-GSTAI-------WSSLDHAYRAVARDLVDRPERRVSIVLMTDGENNAGMDV 455 Query: 386 GIAICNKAK----SQGIRIMTIAFS-VNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + A+ ++ +R TI + + + ++ A+ +A L Sbjct: 456 DAFVRAHARLPEDARRVRTYTIRYGEADTRELDRGAR-----ATGGRMVDATDRS-LLSA 509 Query: 441 FRDRIG 446 F++ G Sbjct: 510 FKEIRG 515 >gi|118096903|ref|XP_425157.2| PREDICTED: similar to type VII collagen [Gallus gallus] Length = 1645 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 35/274 (12%), Positives = 74/274 (27%), Gaps = 21/274 (7%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA-----DRTVK 213 T + S + + V S S+ + + + PA R V Sbjct: 759 SAPVTVTTATASPSHATSLRVTEVQRDSVSLSWTPVPGASEYVLSWSPPAAGGEARRVVP 818 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 +S G+R + + + + + + D L H + Sbjct: 819 GTASSLQVPGLRLGQRYTFTIRPLLGSTPGAESSISERPVCRDARGDIVFLVHGTRDSSS 878 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-AIDEN 332 D + +++ + S P + +++ + Sbjct: 879 DADTVRTLLSNTVTAMGRLGPDGTQVGLATYSYRSLPWLLLNRSSELPAVLEQIRTMRYE 938 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 E AI A+ A ++S +V+L DG ++ I Sbjct: 939 EPSGNAIGAAITFARTYLLS------PGAGRRPSVPAVLVVLADG---PSGDDAITAARD 989 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 AK+ G+R++ + + L S Sbjct: 990 AKAAGVRVLAVGLEG------ADQEQLRRMVSGE 1017 >gi|4809045|gb|AAD30063.1| Itgae protein [Mus sp.] Length = 895 Score = 41.5 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 259 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 312 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I + + L AS F+ + L+ Sbjct: 313 GVVRFAIGVG-DAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 356 >gi|218693363|gb|ACL01185.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|87312294|ref|ZP_01094392.1| hypothetical protein DSM3645_06434 [Blastopirellula marina DSM 3645] gi|87284998|gb|EAQ76934.1| hypothetical protein DSM3645_06434 [Blastopirellula marina DSM 3645] Length = 135 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 29/116 (25%), Gaps = 1/116 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITASVPLIQSLE 73 TG + A++ P+ + +++ R + + A+ + A S Sbjct: 3 RSKRTGAAVVEFAIVAPLFFLLVFGMIEYGRMVMVQQVITNASREGARRAVLDGATTSEV 62 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + E + N V V + V P Sbjct: 63 VAAVEEFLEQASVSGGNLEIRVSPDPPGSASNGDPVGVTISVPFSDVSWMPSPMYL 118 >gi|218693409|gb|ACL01208.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|218693353|gb|ACL01180.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|46190338|ref|ZP_00121620.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440236|ref|YP_001955317.1| hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] gi|189428671|gb|ACD98819.1| Hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] Length = 380 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 111/378 (29%), Gaps = 33/378 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + TF + + + + ++ + V + Sbjct: 33 DGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGS 92 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++ + +++ S S + S S I + I S+++ Sbjct: 93 LEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFGIAKSKAV 147 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 D G V + + + + + + ++ Sbjct: 148 KLGWADDTGATKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPSWTIWVVD 207 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+S E + + + I+ + V F Sbjct: 208 YSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIPFETEAHR 257 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + A + G T I + + +A D + S +E + Sbjct: 258 PVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT------- 306 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL+TDG + D+++ K++ + + I +I F Q K+ L S Sbjct: 307 -AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKV 361 Query: 429 FEANSTHELNKIFRDRIG 446 F+ S +L +FR G Sbjct: 362 FDGRSG-DLAAVFRQVKG 378 >gi|218693373|gb|ACL01190.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|218693287|gb|ACL01147.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 17/149 (11%), Positives = 43/149 (28%), Gaps = 8/149 (5%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V+D + + + +++G F V + + +++ Sbjct: 50 FQTVKDYVKNFTDIFEAFGPN--DMQVGVIQFGSGVREEILL--NQFYVRHELMEAIDNI 105 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAI 389 T A++ ++ +V++TDG++ Sbjct: 106 RYMETGTMTGLALRKLVTETLTVEHGARVDNPIVHT---VVVIITDGKSQDYSRGGVTKW 162 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 +AK++G I I ++E Sbjct: 163 TKEAKARGFEIFAIGIGRKANRKELLEMA 191 >gi|84501599|ref|ZP_00999771.1| hypothetical protein OB2597_15395 [Oceanicola batsensis HTCC2597] gi|84390220|gb|EAQ02779.1| hypothetical protein OB2597_15395 [Oceanicola batsensis HTCC2597] Length = 432 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 6/48 (12%), Positives = 15/48 (31%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ 55 + + G I++ + +L V +D + L+ Sbjct: 2 TGHFRSFAARAEGSMTILSLYFLAGVLAVSAFAIDFGYLMSARNQLQV 49 >gi|22536818|ref|NP_687669.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|25010692|ref|NP_735087.1| hypothetical protein gbs0632 [Streptococcus agalactiae NEM316] gi|76788219|ref|YP_329407.1| Cna B domain-containing protein [Streptococcus agalactiae A909] gi|77411235|ref|ZP_00787585.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|22533664|gb|AAM99541.1|AE014220_13 cell wall surface anchor family protein, putative [Streptococcus agalactiae 2603V/R] gi|23095046|emb|CAD46276.1| Unknown [Streptococcus agalactiae NEM316] gi|76563276|gb|ABA45860.1| cna B-type domain protein [Streptococcus agalactiae A909] gi|77162661|gb|EAO73622.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|218693223|gb|ACL01115.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693225|gb|ACL01116.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693227|gb|ACL01117.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693229|gb|ACL01118.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693231|gb|ACL01119.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693233|gb|ACL01120.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693237|gb|ACL01122.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693239|gb|ACL01123.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693243|gb|ACL01125.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693245|gb|ACL01126.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693249|gb|ACL01128.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693251|gb|ACL01129.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693253|gb|ACL01130.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693255|gb|ACL01131.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693257|gb|ACL01132.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693259|gb|ACL01133.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693261|gb|ACL01134.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693263|gb|ACL01135.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693265|gb|ACL01136.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693267|gb|ACL01137.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693269|gb|ACL01138.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693271|gb|ACL01139.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693273|gb|ACL01140.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693275|gb|ACL01141.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693277|gb|ACL01142.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693279|gb|ACL01143.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693281|gb|ACL01144.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693283|gb|ACL01145.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693285|gb|ACL01146.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693289|gb|ACL01148.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693291|gb|ACL01149.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693293|gb|ACL01150.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693295|gb|ACL01151.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693297|gb|ACL01152.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693299|gb|ACL01153.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693301|gb|ACL01154.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693303|gb|ACL01155.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693305|gb|ACL01156.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693307|gb|ACL01157.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693309|gb|ACL01158.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693311|gb|ACL01159.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693313|gb|ACL01160.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693315|gb|ACL01161.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693317|gb|ACL01162.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693319|gb|ACL01163.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693321|gb|ACL01164.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693323|gb|ACL01165.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693325|gb|ACL01166.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693327|gb|ACL01167.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693331|gb|ACL01169.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693333|gb|ACL01170.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693337|gb|ACL01172.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693339|gb|ACL01173.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693341|gb|ACL01174.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693345|gb|ACL01176.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693347|gb|ACL01177.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693349|gb|ACL01178.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693355|gb|ACL01181.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693359|gb|ACL01183.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693369|gb|ACL01188.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693371|gb|ACL01189.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693377|gb|ACL01192.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693379|gb|ACL01193.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693381|gb|ACL01194.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693383|gb|ACL01195.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693387|gb|ACL01197.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693389|gb|ACL01198.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693391|gb|ACL01199.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693393|gb|ACL01200.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693395|gb|ACL01201.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693399|gb|ACL01203.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693401|gb|ACL01204.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693403|gb|ACL01205.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693405|gb|ACL01206.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693407|gb|ACL01207.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693411|gb|ACL01209.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693415|gb|ACL01211.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693419|gb|ACL01213.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693421|gb|ACL01214.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693433|gb|ACL01220.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693435|gb|ACL01221.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693439|gb|ACL01223.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693447|gb|ACL01227.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693453|gb|ACL01230.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693455|gb|ACL01231.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693457|gb|ACL01232.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693459|gb|ACL01233.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693461|gb|ACL01234.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693463|gb|ACL01235.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693465|gb|ACL01236.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693467|gb|ACL01237.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693471|gb|ACL01239.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693473|gb|ACL01240.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693475|gb|ACL01241.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.5 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|319918961|gb|ADV78143.1| PilY1 [Pseudomonas aeruginosa] Length = 1163 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 30/341 (8%), Positives = 81/341 (23%), Gaps = 28/341 (8%) Query: 50 EHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 + +L A AI+ ++ + VS + + L + Sbjct: 10 KTSLAAALSGAILLSAQTTQAAALSVSQQPLMLVQGVAPNLLFTLDDSGSMAWAYVPDGI 69 Query: 110 EVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 + + N L L+ + + + + + Sbjct: 70 SGNSGRAGRSSDYNALYYNPDYAYQVPKKLTLSGDQIIVSDYPVPRFTAAWQDGYAQ--- 126 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 S + + Q + D + + + + Sbjct: 127 -----------------GSTTNLSNNYRPQWGTGWLGCIDSSCNTGRAYYYTYKVSASCP 169 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + S N Y P + + + + + + A S+ + Sbjct: 170 AQPVSSSNSCYTYNALPTSQESNFAIWYSY-------YRNRILATKTAANLAFYSLPENV 222 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + + + + +K + + + G T ++ A+ A Sbjct: 223 RLTWGALNTCSIGANSRSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDRAGRF 282 Query: 350 I-ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + + K Y +++TDG N + Sbjct: 283 LQTNGTAYTTEDGKTYSCRASYHIMMTDGIWNGRNVTPGNL 323 >gi|309358766|emb|CAP33788.2| hypothetical protein CBG_15420 [Caenorhabditis briggsae AF16] Length = 876 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 43/141 (30%), Gaps = 16/141 (11%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + I + T A+ A D I + L+A + I++ T Sbjct: 77 YSERNDIITHLETLKFMPGEDTRTGVALSKADDEIFD------YDGGARLKATRLIIVFT 130 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG + + +G++I TI+ + E L ++ F + Sbjct: 131 DGLSMDKPTL---AAKALRRKGVKIYTISVNSIGFVPE----MLGIVGDADNVFGPTDEN 183 Query: 436 ELNKIFRDRIGNEIFERVIRI 456 + + +G R I Sbjct: 184 RIEE---RLLGEVESSRSCEI 201 >gi|291525766|emb|CBK91353.1| hypothetical protein EUR_23650 [Eubacterium rectale DSM 17629] Length = 182 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 11/77 (14%), Positives = 27/77 (35%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 K + L+K G+ + + ML + + + +R +K A ++A+I+ Sbjct: 3 KLRKKTAALLKDRKGNMMPLVVAVTICMLIIILGVAEYMRLVITAAGIKDAMESAVISTV 62 Query: 66 VPLIQSLEEVSSRAKNS 82 + + Sbjct: 63 NDNYNEVYHSVREGYAA 79 >gi|227326309|ref|ZP_03830333.1| hypothetical protein PcarcW_02949 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 210 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 37/90 (41%), Gaps = 2/90 (2%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 ++F+ ++ S + TAL P++L +G + D+ ++Q + I+ Sbjct: 14 TSRFLSVMRRFWFSHRASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQ--RAGAIS 71 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 + + + Q L+E + P + Y Sbjct: 72 SILAMQQKLDEKGLQGLLDTMLPTEGAGNY 101 >gi|325291589|ref|YP_004277453.1| hypothetical protein AGROH133_03083 [Agrobacterium sp. H13-3] gi|325059442|gb|ADY63133.1| hypothetical protein AGROH133_03083 [Agrobacterium sp. H13-3] Length = 197 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 17/135 (12%), Positives = 41/135 (30%), Gaps = 8/135 (5%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 +FI + + G + A++ P++L + ++ A S Sbjct: 15 RFIALVARFARDRRGVGAVEFAIVFPILLALYLTSFELTIGYNTYKRASSA--------S 66 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + +S K T + L+ + ++ + T Sbjct: 67 ATINDLISKTNSVDKAYLTSMQDVTAAVFAPYSTKGLQLKISGIKIDKQKQATIAWSWNE 126 Query: 126 KSAYQVVLSSRYDLL 140 K+A V+ S + Sbjct: 127 KNARPYVVGSPVSVP 141 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 68/449 (15%), Positives = 129/449 (28%), Gaps = 63/449 (14%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L K G F +ITA+L+ + + ++VD V L++ +I + P Sbjct: 13 RLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDH-VIYRTSPKNLYD 71 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + R ++ + Y R+ N + ++ V P + + Sbjct: 72 LREAGRDNFIRHQIEKALNTYNSRDLSN--TGSIESIVKDAVILTKNVNSLPLQFTVDIA 129 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID---FSRSM 189 LS+ L + L +F +S G + K + W+ + S Sbjct: 130 LSTTVQLRGSLLQMFSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLASRGKVADSK 189 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + + E + VK PY C +Y Sbjct: 190 VHPPKYLEYSHYYQQYLNRNTLVK-----------NFLSQIPYKNFCMAPYHYSSILYWA 238 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 +L+ +++ + D T F V Sbjct: 239 VGTLTYSVDNKTTTREYYK--------------------DPYYATWDHFPYSFIKNVFDM 278 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 S +G ++ S M + + ++++E +K Sbjct: 279 TSNQFG----DGQVLTNTNHCFPHGASQNKYMLMLAIGNQLS----------RSSVEKEK 324 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-F 428 +L D + I I ++ FS ++ RY L CAS S + Sbjct: 325 IEKVLQDCHYMHKRHRTG------RDA-ITIFSVGFSPDQD----TRYTLRQCASDPSKY 373 Query: 429 FEANSTHELNKIFRDRIGNEIFERVIRIT 457 +E NS + I + N I + T Sbjct: 374 YEINSDENVMPIAKSLARNVITNWFSQFT 402 >gi|239995187|ref|ZP_04715711.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii ATCC 27126] Length = 1356 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/432 (7%), Positives = 88/432 (20%), Gaps = 46/432 (10%) Query: 55 QAAQTAI---ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 + +A + AS ++ +S + E + + + Sbjct: 256 SSTDSASSREVRASDVGQSGAPQLVVTYDDSTATGCVQGEAIYQVASSRDNLEENVNGYP 315 Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 T + + +L + + Sbjct: 316 NTGSELTFHPSYNDYIGVRFR-NVNIPQGAVVTEAYLEFTAYSTVTYGNPSMRIRGVADD 374 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + S N + + G R Sbjct: 375 DASDFHPDRRNRLRNLPKTSGITWSMPDFYNNYTYRTPDVSNIVKQIVDRSGWRSGNDMA 434 Query: 232 YMVS---CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI------KKKHLVRDALASVI 282 +++ + + P L + ++ K + + Sbjct: 435 FVLDDFVSYRGAHTYNNSPSKAPKLIVKFNGSATPGASATVREHLISKIDELS--ANGLT 492 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID---ENEMGSTAI 339 + + + + + S + ++ S Sbjct: 493 PIVDTLLEAANYYGGRDVDYGRKRGESDVSSSVRRSTRVSHRSSYIGADSILPSGCSEDN 552 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI------------ 387 + I + + +IVLL+DGE ++ Sbjct: 553 LSDSDCITEQIPTPASYISPVSDLQCQTNNHIVLLSDGEANNNHSVSKIQTLLGKSCTGS 612 Query: 388 -------AICNKAKSQGIRI-------MTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + TI F+ N S F++A++ Sbjct: 613 SGEKCGLDLVRNISEASTSVIGPRVITHTIGFAANNNANNFLNQLALQ--SGGGFYQADN 670 Query: 434 THELNKIFRDRI 445 + +L + F + Sbjct: 671 STDLLEAFNTIL 682 >gi|126174069|ref|YP_001050218.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS155] gi|125997274|gb|ABN61349.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS155] Length = 771 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 49/179 (27%), Gaps = 18/179 (10%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A +++ ++K + FN + + + + F Sbjct: 406 DSIVQAKNALLYALKGLKPE---DSFNIIEFNSSLSQFSATPLPATSSNLSRARQFVSRL 462 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T + A+ A + S + + + ++ +TDG + I Sbjct: 463 QADGGTEMALALDAALPKSLGSAPSDAVQ------PLRQVIFMTDGSVGNEQALFDLIRY 516 Query: 392 KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + R+ T+ F+ A +F E+ + + Sbjct: 517 QIGES--RLFTVGIGSAPNSH-----FMQRAAELGRGTFTYIGKVDEVGEKISALLSKI 568 >gi|218693397|gb|ACL01202.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|241113141|ref|YP_002972976.1| hypothetical protein Rleg_4786 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861349|gb|ACS59015.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 194 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 12/122 (9%), Positives = 37/122 (30%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ L++ + + AL++P+++ + VD+ + + A + S Sbjct: 14 RSRTRHLVRDRSAASGVEFALVLPILVMLLFGTVDLGHALTVSRKIDEIASSTGDMISQQ 73 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + +V+ + + L + V +N + Sbjct: 74 GSWTKSDVAKLLSGASFILQPYETTGLTITVAVDDIAKSGSATVNWSAALNTSALNSGAA 133 Query: 128 AY 129 + Sbjct: 134 SA 135 >gi|227552322|ref|ZP_03982371.1| von Willebrand factor, type A [Enterococcus faecium TX1330] gi|227178545|gb|EEI59517.1| von Willebrand factor, type A [Enterococcus faecium TX1330] Length = 1518 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 37/418 (8%), Positives = 91/418 (21%), Gaps = 46/418 (11%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 AA + + + F + +KK + Sbjct: 317 YAADNGSTAG-EEDVNVPAQTVQLWGDERNFENSYL-----DYNGAYIKKWVEPVLSNNP 370 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D E + + ++P+ + S T + E Sbjct: 371 TSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIE 430 Query: 175 HGVSIQWVIDFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 I + +M +++ Q + + + S+N + Sbjct: 431 AVNEISENLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALN 490 Query: 224 IRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHV----IKKKHLVRDAL 278 +P + G + + + V I++ + Sbjct: 491 RTPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTN 550 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 +V+ ++ D + N + R Sbjct: 551 WAVMNNMIARDTGDLFKNFETYSGNTSGAGFSYPVVYPSEFNRPEDTWDYNYRYGEVKEG 610 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + A S+ + A + E + Sbjct: 611 DDKAFHWVGTGAASNGTTGEPDTQEKSSAINTAAY-HHWLKNKYQENPPS---------- 659 Query: 399 RIMTIAFSVNKTQQ------EKARYFLSNCA------SPNSFFEANSTHELNKIFRDR 444 I +I ++ + R L N A + +++AN+ +++ D Sbjct: 660 -IFSIGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDI 716 >gi|294140611|ref|YP_003556589.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327080|dbj|BAJ01811.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 747 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 38/128 (29%), Gaps = 9/128 (7%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 FN V + + S F G T + A+ A + + N Sbjct: 373 DSFNVIAFNSDVHALSAQSLAATAKNIGRANQFIRTLKADGGTEMGPALTRA---LDNGN 429 Query: 355 EDEVHRMKNNLEAK----KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 H+ + + ++ K ++ +TDG + I +K R+ TI Sbjct: 430 HSTSHQDEEDFDSDGVRLKQVLFMTDGAVANERSLFNLIEDKIGHS--RLFTIGIGAAPN 487 Query: 411 QQEKARYF 418 R Sbjct: 488 SHFMERAA 495 >gi|296475339|gb|DAA17454.1| cochlin precursor [Bos taurus] Length = 550 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 412 TYDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDSP------ 465 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 466 --NKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|115305395|gb|AAI23842.1| COCH protein [Bos taurus] Length = 550 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 412 TYDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDSP------ 465 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 466 --NKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|58429477|gb|AAW78142.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429505|gb|AAW78156.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 581 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 39/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+++ Sbjct: 72 AMKLIQQLNL---NENAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GKTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|58429527|gb|AAW78167.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 575 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 39/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + + F++ K I+++ Sbjct: 72 AMKLIQQLNL---NENAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GKTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLS 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|73974565|ref|XP_532319.2| PREDICTED: similar to collagen, type XIV, alpha 1 [Canis familiaris] Length = 1796 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +++ + G+T A+ ++ K +L+TDG Sbjct: 216 NTKDEVIEAVRNLPYKGGNTLTGLALNYIFE------NSFKPEAGARTGVSKIGILITDG 269 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 270 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADENELREIASEPDSTHVYNVAE 319 >gi|218693367|gb|ACL01187.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693469|gb|ACL01238.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|77407764|ref|ZP_00784518.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|77173630|gb|EAO76745.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|218693241|gb|ACL01124.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693247|gb|ACL01127.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693329|gb|ACL01168.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693351|gb|ACL01179.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693361|gb|ACL01184.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693365|gb|ACL01186.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693375|gb|ACL01191.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693413|gb|ACL01210.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693417|gb|ACL01212.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693437|gb|ACL01222.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693441|gb|ACL01224.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693443|gb|ACL01225.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693445|gb|ACL01226.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693449|gb|ACL01228.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693451|gb|ACL01229.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|330901495|gb|EGH32914.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 218 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 15/96 (15%), Positives = 30/96 (31%), Gaps = 13/96 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G F + ++ + + + +TA+ DA+ A + Sbjct: 134 DRVGLILFGSQAFVQAPLTYDRRTVRVWLDEARI--GIAGKNTALGDAIGLALKRLR--- 188 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +VL+TDG N + I Sbjct: 189 --------MRPATSRALVLVTDGANNAGQIDPITAA 216 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 45/393 (11%), Positives = 109/393 (27%), Gaps = 49/393 (12%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 +I + L +L ++ + T + +++ N T + ++ Sbjct: 982 VINTAGQLSGTLTNLTQVIDGNVTANDSAGADGFATPKLVSVEYNGTTYTFNNSNTSFSI 1041 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + K + Y P + ++ I+ + ++ + + Sbjct: 1042 ALGTNKGTLYIENDGDYRFT-PPSGGAEGAPVEVTYKIKDGDGDTSSAKLTIINPVLVVG 1100 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + + S D +P D + + +G Sbjct: 1101 SNANDTGSGASTAIDDHVRPNPLLAPDVDGAIVGGVGADVLIGDVGGVT----------- 1149 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 S + +D S + L++DA+ +++ I +++ Sbjct: 1150 ---------SGSYNLTFMIDMSGSISG-TEFQLMKDAINNLLAKFSGIS----QLQVEIG 1195 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 F D G + + + + G T A+ T + + Sbjct: 1196 TFADNSNVV-----GTYSSVTAAQQAVSNLTRSGGGTNYQAALTTLNTMMTVDPVAD--- 1247 Query: 361 MKNNLEAKKYIVLLTDGENTQDN-------EEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 KY+ LTDGE T + G+A N + G+ I + V Sbjct: 1248 -------HKYVYFLTDGEPTVGSWTNSTQIANGMAALNALTAPGVVINAVGIGVPSGASF 1300 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + +P+++ ++ +L+ Sbjct: 1301 GNNLNAIDN-TPDNYLAVDNFDDLSSGLGSLFT 1332 >gi|118150796|ref|NP_001071310.1| cochlin precursor [Bos taurus] gi|75057908|sp|Q5EA64|COCH_BOVIN RecName: Full=Cochlin; Flags: Precursor gi|59857775|gb|AAX08722.1| coagulation factor C homolog, cochlin precursor [Bos taurus] Length = 550 Score = 41.1 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 412 TYDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDSP------ 465 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 466 --NKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|218693385|gb|ACL01196.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 64/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPHNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|148980400|ref|ZP_01816043.1| Large exoprotein [Vibrionales bacterium SWAT-3] gi|145961272|gb|EDK26584.1| Large exoprotein [Vibrionales bacterium SWAT-3] Length = 1361 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 31/315 (9%), Positives = 70/315 (22%), Gaps = 8/315 (2%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E + +A +S + + + T T + Sbjct: 549 TETIDYKADDSSDLESSTAQVTIETDSIVKPSPEVTSVSRDLSEDATTTQTVSGNVDVLY 608 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S + +S+ S + S S Sbjct: 609 AESVTLQEPSESYTSGGQSITWVSSNGGQSLVGSANGVEIVTAT-IDDSGNYSVSLSGPI 667 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + N +Q+G + + + P + L ++ G Sbjct: 668 DHLNGSTPVNNMSIDIGVVATNDTGTQSGTISLTIDDDVPVAQAKVHDLAPVVKDGANVQ 727 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + S + V+K+ + ++ K V ++ + Sbjct: 728 LILDVSGSMSGNAMTVMKQSAIALLQGYQLLGETKVQVTVFESDANVVNENGSWSSDNSL 787 Query: 312 FSWGVH-KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 S + G T +DA+ A ++ Sbjct: 788 PSNLSTIWMDVDTAIGVINSLTAGGGTDYDDAVWLA------GSDSIWGNSDMVSGGSNI 841 Query: 371 IVLLTDGENTQDNEE 385 L+DG+ + N Sbjct: 842 SYFLSDGDPSSANHR 856 >gi|283779853|ref|YP_003370608.1| TadE family protein [Pirellula staleyi DSM 6068] gi|283438306|gb|ADB16748.1| TadE family protein [Pirellula staleyi DSM 6068] Length = 156 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 10/121 (8%), Positives = 30/121 (24%), Gaps = 12/121 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + ++ + A++ P+ + +++ R + + A + Sbjct: 23 RSYRRNRRAAATVEFAVVAPIFFLLVLGMIEYGRMVMVQQVITNA------------SRE 70 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + YL + T + V + QV Sbjct: 71 GARTAVLDGTTTAEVTATVNSYLASGSISGATVVVTPNPPTNADGGDPVTVTVSVPFSQV 130 Query: 132 V 132 Sbjct: 131 S 131 >gi|282896561|ref|ZP_06304580.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] gi|281198552|gb|EFA73434.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] Length = 587 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 59/194 (30%), Gaps = 16/194 (8%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 P + + ++ +D+S V++ L + I Sbjct: 386 VPPIPSGEVLSLGQTFWKTQKDAGKTVYLMTVIDTSGSMSGGP-LEAVKNGLRIASQQIN 444 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + V + D+ I+ + L G+TA+ D + Sbjct: 445 PGNYVG------LVSYGDQPINLVKLAP-FDDLQHKRFLAGIDGLEADGATAMYDGVMVG 497 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 ++ + K N K Y++LLTDG+ Q + + G+R+ IA+ Sbjct: 498 LSELL-------QQRKTNPNGKFYLLLLTDGQTNQG-FNFEQVKEIIEYSGVRVYPIAYG 549 Query: 407 VNKTQQEKARYFLS 420 + A L Sbjct: 550 EVNEAELNAIAALR 563 >gi|218693357|gb|ACL01182.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693425|gb|ACL01216.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693427|gb|ACL01217.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|260559816|ref|ZP_05831995.1| von Willebrand factor [Enterococcus faecium C68] gi|260074040|gb|EEW62363.1| von Willebrand factor [Enterococcus faecium C68] Length = 857 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 25/247 (10%), Positives = 65/247 (26%), Gaps = 53/247 (21%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 25 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 83 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 84 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVIVLLTDCVPTFSYKVQRV 143 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 144 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 203 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 204 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 262 Query: 448 EIFERVI 454 +I V Sbjct: 263 QISATVS 269 >gi|218693423|gb|ACL01215.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693429|gb|ACL01218.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693431|gb|ACL01219.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|189233997|ref|XP_971945.2| PREDICTED: similar to AGAP009579-PA [Tribolium castaneum] Length = 1056 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 36/301 (11%), Positives = 80/301 (26%), Gaps = 30/301 (9%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 +D SRS + + + + + Sbjct: 143 LAVDTSRSSVHVPTNIFDRHEEAAYAIQWSEKLDEIFVRNYNSDPALSWQYFGSTSGIMR 202 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV-----------------RDALASVIR 283 +Y P + V + + K ++ + +++ Sbjct: 203 HYPAKKWPNIEKDEFDCRVRTWYIEAATCTKDVIILVDNSGSMDGMGRHIASLTVNTILD 262 Query: 284 SIKKIDNVND-TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + D +N T + + + K G T A Sbjct: 263 TFSNNDYINILYYSNQTTNYTIPCFRN--LLVQATPENIVLFKEAIRHLGPSGKTDFPQA 320 Query: 343 MQTAYDTIISS------NEDEVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNKAK 394 +Q A+D + + N +E+ + + I+L+TDG N D + K Sbjct: 321 LQMAFDILENYREIRGCNNEEIDEEGKSKACNQAIMLITDGISRNFSDIVMRNNQLDGGK 380 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +RI T + + K CA+ + + + ++ I V+ Sbjct: 381 TIPVRIFT--YLIGKEVTNVEEIRWMACANRGFYTQVQTLEQVTSAVLQYINVIARPLVL 438 Query: 455 R 455 + Sbjct: 439 Q 439 >gi|17533683|ref|NP_496743.1| C-type LECtin family member (clec-63) [Caenorhabditis elegans] gi|3876684|emb|CAB03058.1| C. elegans protein F35C5.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 411 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 55/147 (37%), Gaps = 8/147 (5%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D + + + +++D + + L +++ T + + + + A Sbjct: 97 YSDPRSTRLGLVTYNGRSTIVADLNLLQSIDDLYQSVFST-LNQVSNSDDSFLAKGIGAA 155 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + S K+ +V+ + + I + ++ KS G+ + T+AF Sbjct: 156 ENVLQSG-----RTNGVRSNYKRLVVVYASAYKGEGELDPIPVADRLKSSGVVVSTVAFD 210 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANS 433 + + A L+N ASPN F + Sbjct: 211 QDGDEALLA--GLTNIASPNYAFTSKD 235 >gi|86355858|ref|YP_467750.1| hypothetical protein RHE_CH00199 [Rhizobium etli CFN 42] gi|86279960|gb|ABC89023.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 193 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 13/134 (9%), Positives = 38/134 (28%), Gaps = 8/134 (5%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + +++L + G I A+L PV++ + ++ A Sbjct: 8 TRLVLTARRLARERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKR--------ATRA 59 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + + S K++ + +L T + + Sbjct: 60 AGSIADLVTQQQSVTKSALAQMPSVATAMFVPYNSTSLTLKITGISIDAGANAKVLWSWA 119 Query: 125 RKSAYQVVLSSRYD 138 + ++ Sbjct: 120 QDGTTPYAKNATVS 133 >gi|94498566|ref|ZP_01305121.1| hypothetical protein SKA58_08334 [Sphingomonas sp. SKA58] gi|94422009|gb|EAT07055.1| hypothetical protein SKA58_08334 [Sphingomonas sp. SKA58] Length = 199 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 9/86 (10%), Positives = 27/86 (31%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +L + G + + P ++ + D+ +Y+ + + A AS Sbjct: 4 RIASKLMRLRRDARGVTLLEFGFVAPPLMLTIMAIGDLGYQAYWHAVARGVLEKAARAAS 63 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIE 91 V + + + + + Sbjct: 64 VGTLNGAQFDAYITQQMSAITARNAS 89 >gi|312139253|ref|YP_004006589.1| tade-like protein [Rhodococcus equi 103S] gi|311888592|emb|CBH47904.1| putative TadE-like protein [Rhodococcus equi 103S] Length = 133 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 13/109 (11%), Positives = 33/109 (30%), Gaps = 5/109 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L +G + AL++P+++ + +V+ R + A A + + Sbjct: 2 KRLTSD-SGVAAVEFALVVPILITLVLGIVEFGRGYNV----QNAVSAAAREGARTMAIK 56 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + ++RA + + + T Sbjct: 57 KDPAAARAAVKGAGVFSPAITDAEICISTSGTQGCSATSCPSGSTVTLT 105 >gi|291388471|ref|XP_002710799.1| PREDICTED: collagen, type XIV, alpha 1 [Oryctolagus cuniculus] Length = 1796 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 37/120 (30%), Gaps = 20/120 (16%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +++ + G+T A+ ++ + K +L+TDG Sbjct: 216 STKDEVIEAVRNLPYKGGNTLTGLALNYIFE------NSFKPEAGSRTGVPKIGILITDG 269 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAF-SVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + + + L AS + Sbjct: 270 K-SQDDIIPPS--RNLRESGVELFAIGVKNADVNE-------LQEIASEPDSTHVYNVAE 319 >gi|257889535|ref|ZP_05669188.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|257825895|gb|EEV52521.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] Length = 857 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 25/247 (10%), Positives = 65/247 (26%), Gaps = 53/247 (21%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 VD S + + V+ + + ++ + D + MG ++ S + + + Sbjct: 25 VDWSGSMNDNNRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSEGYSYSNGAVQMGS 83 Query: 319 LIRTIVKTF-AIDENEMGSTAINDAMQTAYDTII----------------SSNEDEVHRM 361 + G T A++ A + + +V R+ Sbjct: 84 FDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVIVLLTDGVPTFSYKVQRV 143 Query: 362 KNNLEAKKYIVLLTDGENTQDNE------------------------EGIAICNKAKSQG 397 + Y ++ ++ N I K +G Sbjct: 144 HAQSSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERG 203 Query: 398 IRIMTIAFSVNKTQQEKARYF-----LSNCASPNS----FFEANS-THELNKIFRDRIGN 447 I I + + + S + ++E+ ++++ + Sbjct: 204 IEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKK-AV 262 Query: 448 EIFERVI 454 +I V Sbjct: 263 QISATVS 269 >gi|149919617|ref|ZP_01908096.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] gi|149819560|gb|EDM78988.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] Length = 1168 Score = 41.1 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 5/92 (5%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 ++ T ++ + D GSTAI DA+ A +++ ++ + + NN I L+TD Sbjct: 656 TEIDSTAIEEYLEDIEPDGSTAIGDAIDAAVAALMAHDDLDPNSSNNNA-----IFLITD 710 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 GE T +++ A + + Sbjct: 711 GEQTSGDKDVCDALEDAAKDDVPVYIAPVGSF 742 >gi|325673442|ref|ZP_08153133.1| TadE family protein [Rhodococcus equi ATCC 33707] gi|325555463|gb|EGD25134.1| TadE family protein [Rhodococcus equi ATCC 33707] Length = 133 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 13/109 (11%), Positives = 33/109 (30%), Gaps = 5/109 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L +G + AL++P+++ + +V+ R + A A + + Sbjct: 2 KRLTSD-SGVAAVEFALVVPILITLVLGIVEFGRGYNV----QNAVSAAAREGARTMAIK 56 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + ++RA + + + T Sbjct: 57 KDPAAARAAVKGAGVFSPAITDAEICISTSGSQGCSATSCPSGSTVTLT 105 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 68/197 (34%), Gaps = 35/197 (17%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVR----MGATFFNDRVISDPSFSWG---VHKLIR 321 + ++ AL + + + R + F R F+ Sbjct: 356 SRIAALQAALRGLTGADDTLSGRFARFRGREKITMITFAGRANDPVDFAVNDPRPGSADL 415 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 V TF TAI A++ Y ++ E + + + IVL+TDGEN Sbjct: 416 AGVNTFVDGLRLQDGTAIYSALEAGYRAAGAAVEADPGYLTS-------IVLMTDGENNS 468 Query: 382 DNEEGIAICN--------KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEA 431 I+ + A ++ +R TIAF E L + + + + F+A Sbjct: 469 G----ISAADFRSSYQRLPAAARAVRTFTIAFG------EADPAALRDISADTGGAVFDA 518 Query: 432 NSTHELNKIFRDRIGNE 448 ++ L F+D G + Sbjct: 519 RTSS-LADAFKDIRGYQ 534 >gi|297559547|ref|YP_003678521.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843995|gb|ADH66015.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 588 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 45/396 (11%), Positives = 101/396 (25%), Gaps = 22/396 (5%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 L ++ Q + + T + + Sbjct: 199 AVRRHLGTGDGADTAMTDFVRDV--QLDSAFGEIDLATFYSSGRTGGGSGEGGGGRVDPL 256 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 V ++ PL + G S +T S + Sbjct: 257 IAVPEQAVVSYNADRAESAPPLEAHYPTEGTVSLDYPYVTTTDTASLRSAAADLHEVLRR 316 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN-GKVGIRDEKLSPYMVSCNKSLY 241 + + + L+ + + + + L + N+ Sbjct: 317 DSYRARLRELGFRDPDGTLSGTAGADPDGFGVTAEEPPTHDDLTGDALLASVTDWNRLSM 376 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 D S + +D + + + A + D MG Sbjct: 377 PSRTLVLADTSANMAEDLDGGPS-----RMEVAQQAALMGLSLF------PDETDMGLWL 425 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 +D S + +H L A E+ A A++ + + R+ Sbjct: 426 MSDENASGRVEAADMHPLGGAEQGDTATRRRELIGVAEEIAVRGGGSRLYDNILAAYDRV 485 Query: 362 KNNLEAKKY--IVLLTDGEN-TQDNEEGIAICNKAKSQ-----GIRIMTIAFSVNKTQQE 413 +++ + K ++LLT G++ + + + + + + IAF + Q Sbjct: 486 QDDYDEDKINSVILLTAGQDEGSSDIAHADLVAALQDRFDPERPVSMFIIAFGSREQQVA 545 Query: 414 KARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + A+ S F + E+ IF I + Sbjct: 546 EEELRRIAAATSGSLFVTDDPDEIGDIFLSSISRRL 581 >gi|153953278|ref|YP_001394043.1| hypothetical protein CKL_0641 [Clostridium kluyveri DSM 555] gi|219853911|ref|YP_002471033.1| hypothetical protein CKR_0568 [Clostridium kluyveri NBRC 12016] gi|189040573|sp|A5N5W4|Y641_CLOK5 RecName: Full=UPF0229 protein CKL_0641 gi|254806510|sp|B9DZE4|Y568_CLOK1 RecName: Full=UPF0229 protein CKR_0568 gi|146346159|gb|EDK32695.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219567635|dbj|BAH05619.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 403 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 62/188 (32%), Gaps = 19/188 (10%) Query: 274 VRDALASVIRSIKKIDNVNDT----VRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFA 328 + A+ V+ + +D+ F ++ + H + + + Sbjct: 226 LNAAIICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEF 285 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + E G T I+ ++ A + I N Y ++DG+N ++ Sbjct: 286 FHKVESGGTYISSGLKKALEVI--------EENYNPAYWNVYTFYVSDGDNWSEDNSLAL 337 Query: 389 ICNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN--STHELNKIFRDR 444 C AK + + + A + ++ N + N+F +L K + Sbjct: 338 KC--AKDLCKVCNLFSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKI 395 Query: 445 IGNEIFER 452 + E+ ER Sbjct: 396 LNKELEER 403 >gi|221108025|ref|XP_002157834.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 299 Score = 41.1 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 12/105 (11%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAK 394 T I+ A++ A + +S + K I+LLTDG T E+ I ++ + Sbjct: 5 TRIDRALRLAQKDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELR 58 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + G+ I+ + + E + + + A + L Sbjct: 59 NDGVVILGVGIGSAVNETELSHIT----GGKKNAYTAATFDSLTD 99 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 46/129 (35%), Gaps = 12/129 (9%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + V + + + + + + K ++ T I+ A++ A Sbjct: 176 ISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFT----TRIDRALRLAQ 231 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAF 405 + +S + K I+LLTDG T E+ I ++ ++ G+ I+ + Sbjct: 232 KDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGI 285 Query: 406 SVNKTQQEK 414 + E Sbjct: 286 GSAVNETEL 294 >gi|228982551|ref|ZP_04142810.1| hypothetical protein bthur0002_56840 [Bacillus thuringiensis Bt407] gi|228776734|gb|EEM25042.1| hypothetical protein bthur0002_56840 [Bacillus thuringiensis Bt407] Length = 690 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 31/366 (8%), Positives = 104/366 (28%), Gaps = 35/366 (9%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + +S E+ + ++ Q+ +E + N D ++ Sbjct: 296 ANSTSESNEDSTEGEDSAKNEETQETQEGTEEESDVNNTSKQDDSTEDSNEDESKSNSIS 355 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 S + ++ + + L++ + + +++ + + V+ Sbjct: 356 DDSIVENIIENVRKDAMKDAEFALKNAKKEEDKQKEESKKKAKIDKDVTKKEVNDVQEAY 415 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 + P+ + +K + + I + + +L Sbjct: 416 SDNKYGKQKPTLTISPITKRKRELTEDIKRDGKR-FRKEIEEIFKNKKTYDSRHRRKGIL 474 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHL--VRDALASVIRSIKKIDNV----------- 291 + ++++ V + + D S++ + + +N Sbjct: 475 DTSSIWKVGTKDYNVFLKKGQPQTTDYVAYMLVDGSGSMMDRVDENNNKVSCSQKACAVI 534 Query: 292 ----NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + +T F + + +++ G +++ A Sbjct: 535 EEGIKGFIPFKSTIFRASGRNVMHTTVSEFSNDSKFNESWNAYFEANGGNMDGFSIRIAI 594 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN------KAKSQGIRIM 401 ++ E + +V+L+DG + + + + +A+ GI+++ Sbjct: 595 KELLQRPERN-----------RLLVILSDGLPSAYPSQKLGQADVKDAVKEARKNGIKVV 643 Query: 402 TIAFSV 407 +I F Sbjct: 644 SICFGS 649 >gi|218693235|gb|ACL01121.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 65/219 (29%), Gaps = 17/219 (7%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + + +++ ++ +S + + D + + V+ Sbjct: 380 FHVTDGVPTMSYAINFNPYISTSYQNQFNSFFNKIPDRSGILQEDFIINGDDYQIVKGDG 439 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + G + + + + G AIN Y + Sbjct: 440 ESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGY-AINSGYIYLYWR-DYNWVYPFD 497 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ---QEKAR 416 + A K I T GE T G + +G I T+ VN +A Sbjct: 498 PKTKKVSATKQIK--THGEPTTLYFNG-----NIRPKGYDIFTVGIGVNGDPGATPLEAE 550 Query: 417 YFLSNCAS-PNSFFEANST----HELNKIFRDRIGNEIF 450 F+ + +S ++ + T ELNK F+ + + Sbjct: 551 KFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHS 589 >gi|110679845|ref|YP_682852.1| hypothetical protein RD1_2616 [Roseobacter denitrificans OCh 114] gi|109455961|gb|ABG32166.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 207 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 7/56 (12%), Positives = 18/56 (32%), Gaps = 3/56 (5%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVML---GVGGMLVDVVRWSYYEHALKQAA 57 T ++ + G+ I +++P+M+ + D R + Sbjct: 6 TAITARLRRFRRDQHGNVAIEAVIILPIMIWAYLAMFTIFDTYRQYTAQQKAAYTI 61 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 62/169 (36%), Gaps = 14/169 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 A V+ + ++ + + +GA FN +V +F+ I+ + ++ Sbjct: 514 PDNFALAQEFVMDLVDQMTISSTAINVGAFLFNSQVQMLTAFTDD-KTAIQAAISGYSYP 572 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + TA A+ A DT++ S + ++TDG + +D+ + Sbjct: 573 PSNTAGTATGAALNFAVDTMLQSGAGYRGGSV-------LVYVITDGRSQEDSSFVASAA 625 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFEANSTHELN 438 + G ++++ + + L A+ + + F +LN Sbjct: 626 ANLHATGAEVVSVGITSSVD-----ETQLRTIATSDSNVFVLEDFSDLN 669 >gi|218198427|gb|EEC80854.1| hypothetical protein OsI_23472 [Oryza sativa Indica Group] Length = 604 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 55/168 (32%), Gaps = 16/168 (9%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR-VISDPSFSWGVHKLIRTIVKTF 327 + +++ ++ +IR + D R+ FND V S V R+I Sbjct: 73 TRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 126 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 G A+ + + + R N+ +I+LLTDG++T Sbjct: 127 IDRLQARGG--SGSALML---ELQEAVKILDERQGNSRNRVGFILLLTDGDDTTGFRWSR 181 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + A + + T A + S ++ + + Sbjct: 182 DVIHGAVGK-YPVHTFALGAAHDPEALLHIAQE---SRGTYSFVDDGN 225 >gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] Length = 686 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 50/175 (28%), Gaps = 18/175 (10%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 ++ A S+ R+++ + R FN + + + + Sbjct: 332 SIKQAKGSLTRALRHLGPN---DRFNVIEFNSSHRALFQHAVPASHHNLQLASEYVRHLE 388 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 G T + A+Q A A + ++ +TDG ++ I + Sbjct: 389 ASGGTEMMPALQLALKL------PGAQDELRPEPALRQVIFITDGAVGNESALFEHIVDS 442 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRI 445 G R+ T+ +F+ A +F E+ + Sbjct: 443 L--GGSRLFTVGIGS-----APNAWFMRKAAEYGRGTFTYIGDVAEVGEKMDALF 490 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 52/154 (33%), Gaps = 18/154 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 +D V +KK N+ +R+ ++ + + + +K+ + + Sbjct: 68 DNWKDIYIFVGDVVKKFPNLK--MRVSFITYSTQGHTLMELTSDRNKI--HNSLSRLKNI 123 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE-GIAIC 390 G+T +++ + A + I N + I+ LT G T + + Sbjct: 124 KPTGATNMHEGFKKANEQIEQENAGGNNAAS-------LIIALTTGPLTPKALQETKSEA 176 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 KA+ G ++ + ++ + L Sbjct: 177 EKAREMGAKVYCVGV------KDYRKDQLDAIVG 204 >gi|150007596|ref|YP_001302339.1| hypothetical protein BDI_0949 [Parabacteroides distasonis ATCC 8503] gi|255013875|ref|ZP_05286001.1| hypothetical protein B2_08202 [Bacteroides sp. 2_1_7] gi|149936020|gb|ABR42717.1| conserved hypothetical protein BatB [Parabacteroides distasonis ATCC 8503] Length = 339 Score = 41.1 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 55/185 (29%), Gaps = 28/185 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L+ + ++G F + + + + + Sbjct: 108 SPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSDYISA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + + K I+L+TDGEN +D+ G Sbjct: 160 INPSMVSTQGTAIGAAINLAARSFTP-----------DETTDKAIILITDGENHEDDAIG 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A K + I + K + N++ + + + +++G Sbjct: 209 AAKAAAEKGIHVNI--VGMGDPKGSPIPIQ-------GSNNYMKDKDGNVVITKLNEQMG 259 Query: 447 NEIFE 451 EI Sbjct: 260 QEIAA 264 >gi|319783910|ref|YP_004143386.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169798|gb|ADV13336.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 393 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 54/172 (31%), Gaps = 1/172 (0%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + +S + I AL +PV++G G V+ W Y L+ A A ++ I + Sbjct: 1 MRRSTGANVATIFALTLPVVVGAAGFGVETSYWYYNSLRLQATADAAAYAGALEQISGSD 60 Query: 74 EVSSRAKNSFTFPKQKIEE-YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + A + + + ++ N N + V IV M V Sbjct: 61 KPTIVAAATQSAASNGLGSGTIVVNTPPASGPNTAKKAVEVIVGQKLDRMFTLIFTQDKV 120 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + + + + + Q + + S +S D Sbjct: 121 PEQARAVAVITDASSACMLALDPSASQAVLFSGSTSVKLTGCSVMSNSIASD 172 >gi|190410234|ref|YP_001965735.1| terY1 [Klebsiella pneumoniae] gi|146151027|gb|ABQ02793.1| terY1 [Klebsiella pneumoniae] Length = 239 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 49/152 (32%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 48 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLTDLLS--------FQMPA 99 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 100 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 157 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A++ G+ + + L Sbjct: 158 KAART-GVVVAC------AAGHDADTSVLKEI 182 >gi|256839783|ref|ZP_05545292.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256738713|gb|EEU52038.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 339 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 55/185 (29%), Gaps = 28/185 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L+ + ++G F + + + + + Sbjct: 108 SPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSDYISA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + + K I+L+TDGEN +D+ G Sbjct: 160 INPSMVSTQGTAIGAAINLAARSFTP-----------DETTDKAIILITDGENHEDDAIG 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A K + I + K + N++ + + + +++G Sbjct: 209 AAKAAAEKGIHVNI--VGMGDPKGSPIPIQ-------GSNNYMKDKDGNVVITKLNEQMG 259 Query: 447 NEIFE 451 EI Sbjct: 260 QEIAA 264 >gi|149053317|gb|EDM05134.1| rCG33209 [Rattus norvegicus] Length = 1169 Score = 41.1 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 39/105 (37%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ +D + + +K Q Sbjct: 274 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFRDPLNLTTVISSSKMQ 327 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELN 438 G+ I N + L AS F+ + L+ Sbjct: 328 GVVRFAIGVG-NAFENNNTYRELKLIASDPKAAHTFKVTNYSALD 371 >gi|257470753|ref|ZP_05634843.1| von Willebrand factor type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] gi|317064958|ref|ZP_07929443.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313690634|gb|EFS27469.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 376 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 19/114 (16%), Positives = 38/114 (33%), Gaps = 13/114 (11%) Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG---ENT 380 + + +D G + ++ A + + + I L+ D ++ Sbjct: 107 IYSVLMDYKAQGGGDDPEDVRKALHESLEIIQWSAP----RENLSQIIFLVGDAPPHDDY 162 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN 432 D+ + + KAKS+GI I TI R + A +F + Sbjct: 163 NDSPDTVVTAKKAKSKGIIINTIQCG---NMPSTDR-YWKAIAQFGGGEYFHIS 212 >gi|340082|gb|AAA36794.1| undulin 1 [Homo sapiens] Length = 843 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 44/149 (29%), Gaps = 20/149 (13%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 D ++ R+G ++ + +++ + G+T A+ + Sbjct: 1 FDVGSEKTRIGLAQYSGDPRIEWHL--NAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 58 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + K +L+TDG+ +QD+ + + G+ + I Sbjct: 59 E------NSFKPEAGSRTGVSKIGILITDGK-SQDDIIPPS--RNLRESGVELFAIGV-- 107 Query: 408 NKTQQEKARYFLSNCASPN---SFFEANS 433 + L AS + Sbjct: 108 ----KNADVNELQEIASEPDSTHVYNVAE 132 >gi|28871878|ref|NP_794497.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28855131|gb|AAO58192.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] Length = 224 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 I + G+T++ + A D + D + + +V++TDGE T Sbjct: 80 AIERQDVPALIAEGTTSMGQGVNLALDLLEVRKGDYQRAGVDYYQPW--MVVMTDGEPTD 137 Query: 382 DNEEG----IAICNKAKSQGIRIMTIAFSVNKT 410 D +C S+ + + IA Sbjct: 138 DITRASERIREMCE---SKKLTVFPIAIGTAAN 167 >gi|322690259|ref|YP_004219829.1| hypothetical protein BLLJ_0067 [Bifidobacterium longum subsp. longum JCM 1217] gi|320455115|dbj|BAJ65737.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 380 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 111/378 (29%), Gaps = 33/378 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + + TF + + + + ++ + V + Sbjct: 33 DGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGS 92 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 ++++ + +++ S S + S S I + I S+++ Sbjct: 93 LEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTST-----TPIVFGIAKSKAV 147 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 D G V + + + + + + ++ Sbjct: 148 KLGWADDTGAAKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPSWTIWVVD 207 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 S+S E + + + I+ + V F Sbjct: 208 YSGSMSGEGKNGVVKGLNAALDPDQAKKSY---------IEPASGDV-NILIPFETEAHR 257 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + + A + G T I + + +A D + S +E + Sbjct: 258 PVKAT----GTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTT------- 306 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF 428 IVL+TDG + D+++ K++ + + I +I F Q K+ L S Sbjct: 307 -AIVLMTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKV 361 Query: 429 FEANSTHELNKIFRDRIG 446 F+ S +L +FR G Sbjct: 362 FDGRSG-DLAAVFRQVKG 378 >gi|319795786|ref|YP_004157426.1| tade family protein [Variovorax paradoxus EPS] gi|315598249|gb|ADU39315.1| TadE family protein [Variovorax paradoxus EPS] Length = 163 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 43/138 (31%), Gaps = 7/138 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G I AL+ V+ G+ + ++QA A + + Sbjct: 13 VRRLQRGLAAIEFALVFLVLFLFIYGLATVGSVFY-----VQQAVSRAAEDGARAALLVS 67 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 +++S T + + L+ +R V + E+N QV Sbjct: 68 HDIASNDSRVQTVIYESLASSLVTPASAGTTPASRLAWLRTKVTPASFEVNISNP-AQVT 126 Query: 133 LSSRYDLLLNPLSLFLRS 150 + Y NPL + Sbjct: 127 VRVTYPYRANPLLPPMPM 144 >gi|307947552|ref|ZP_07662881.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769249|gb|EFO28481.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 212 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 27/210 (12%), Positives = 61/210 (29%), Gaps = 16/210 (7%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 T + L + G I ALL ++L + +D+ + A + Sbjct: 1 MYTGIKNRIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSL 60 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A + + + S+ + E R F K+ + ++ V+ Sbjct: 61 FAVTTFRKYVADGMSKNQARKR------AETDARKFLTARTKSLDGTTEKFSIKINIVDR 114 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + V +S +++ + F A++ S + ++ Sbjct: 115 EAKVVKANVNISGKHESYMTHAMGF----------DNIDYTADSESTISFGQGKYEFIFL 164 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTV 212 +D S SM + + Q + Sbjct: 165 VDVSPSMGIGASNRDRQIMQRAIGCQFACH 194 >gi|310824705|ref|YP_003957063.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397777|gb|ADO75236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 881 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 44/164 (26%), Gaps = 25/164 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F+ S + + G T + M A Sbjct: 320 DRFNIIAFDTSFQSFAPQPAVFTQKTLEQADRWVAALRANGGTELLQPMLAAVQ------ 373 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +VLLTDG+ + E A+ K+ RI + N + Sbjct: 374 ----------AAPEGVVVLLTDGQVGNEAEILQAVLRARKTA--RIYSFGIGTNVSDA-- 419 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 L + A + + + ++ + + R+ + Sbjct: 420 ---LLKDMARQTDGAVEFIHPGERIDDKVVAQFSRALAPRITEL 460 >gi|260814488|ref|XP_002601947.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] gi|229287250|gb|EEN57959.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] Length = 1386 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 59/222 (26%), Gaps = 22/222 (9%) Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + + P P + +D S V V+ + +V+ + Sbjct: 458 TTEDPMPTFPTPEPCHLTPDLFFVLDGSGSVSV-SDFETVKQFVVAVVSAFTI---GLAE 513 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G ++ + + + + + GST A++ A Sbjct: 514 TRVGVLQYSTSSTLACNLGDHPDEA-SFVSAINTMTYQKGGSTYTGAALEFARQ------ 566 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + +++LTDG++ + + I + Sbjct: 567 ----NAAWRPAPVSRIMIVLTDGQSHDSVVAAAQALAA---DQVTVFAIGVGSFDHSELL 619 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 FE + + + + + + IR+ Sbjct: 620 EITSNKL----GHVFELDDFNAMAQNITQIVRAVCKDTTIRL 657 >gi|115372024|ref|ZP_01459336.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Stigmatella aurantiaca DW4/3-1] gi|115370989|gb|EAU69912.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Stigmatella aurantiaca DW4/3-1] Length = 843 Score = 41.1 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 44/164 (26%), Gaps = 25/164 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F+ S + + G T + M A Sbjct: 282 DRFNIIAFDTSFQSFAPQPAVFTQKTLEQADRWVAALRANGGTELLQPMLAAVQ------ 335 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +VLLTDG+ + E A+ K+ RI + N + Sbjct: 336 ----------AAPEGVVVLLTDGQVGNEAEILQAVLRARKTA--RIYSFGIGTNVSDA-- 381 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 L + A + + + ++ + + R+ + Sbjct: 382 ---LLKDMARQTDGAVEFIHPGERIDDKVVAQFSRALAPRITEL 422 >gi|322436386|ref|YP_004218598.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164113|gb|ADW69818.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 384 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 41/144 (28%), Gaps = 4/144 (2%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + R A + + + K V F + KL Sbjct: 103 DTSQSQRAALDDERAASTTFLDQMLKGKQDKAFV----IQFAHETELLQDLTDSRPKLQA 158 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + + S+ + + + + + + +K I+LLTDG++ Sbjct: 159 ALKDLDSSPQRSSDSSDSDGHRDRGGTVLYDAAFLAADEILSKPKGRKAIILLTDGDDRG 218 Query: 382 DNEEGIAICNKAKSQGIRIMTIAF 405 E A+ + I F Sbjct: 219 SKETITKAIEAAQRADTTVYAIYF 242 >gi|241662158|ref|YP_002980518.1| hypothetical protein Rpic12D_0541 [Ralstonia pickettii 12D] gi|240864185|gb|ACS61846.1| putative transmembrane protein [Ralstonia pickettii 12D] Length = 344 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 29/285 (10%), Positives = 69/285 (24%), Gaps = 31/285 (10%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA-------------- 60 + G ++ +LM V+L +G + +D+ + L+ AA A Sbjct: 10 ARRQRGVVGVMAPVLMLVILSIGAIAIDLGYLYVIRNELQNAADAAALAGAAGLYPASPK 69 Query: 61 -----------------IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 + + S++ S + + + Sbjct: 70 PNWSNGVSQGTSAIKLNASSGATLKDGSVQAGYWNLTGSPAGLQAQTITPGANDVPAVQV 129 Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 T + +A V ++ + + + Sbjct: 130 TISRSSGKNGGPVGTWLASIFNGAAASVQTTAVAVIAAPGTANAGALFPVALNKCLFDLY 189 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 + + + +VID + S FG D + + +G Sbjct: 190 WNSSTGQPVNDPSTGQPFVIDINSSYPPGSATCASGEWTGFGGATDASTEKSLVTSGNPT 249 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + + K+ Y P ++ V + V Sbjct: 250 PVSIGSNINISNGVKASVYDAISYPTTVTMPVVTTVSPGATTPVY 294 >gi|327270796|ref|XP_003220174.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 948 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 50/151 (33%), Gaps = 15/151 (9%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS--NEDEVHRMKNNLEAKKY 370 ++ +T+++ D T+ I +V K Sbjct: 336 TFNNEAATKTLLQKIVNDGVRQTLTSYLPTTAAGESKICDGVLAGFQVFLNKYPSSEGCE 395 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 IVLLT GE+ I ++ G I TIAF + + + + + F Sbjct: 396 IVLLTHGEDPAIRSCFPQI----QNSGSIIHTIAFGSGTSNELEKLADM----TGGLAFY 447 Query: 431 ANSTHE---LNKIFRDRI--GNEIFERVIRI 456 A + + L F G +I ++ I++ Sbjct: 448 ATDSLDSNGLMDAFSGISSGGGDISQQSIQL 478 >gi|206575582|ref|YP_002235851.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] gi|206570426|gb|ACI12072.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] Length = 212 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 53/152 (34%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + ++ Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTARQTVPLTD--------LLNFNLPS 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + G+TA+ +A+ + I + + K + + L+TDG T D +G+ Sbjct: 73 FSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LVFLMTDGGPTDDWRKGLNEF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 AK +G+ + + L Sbjct: 131 KAAK-KGVVVAC------AAGHDADTGVLKEI 155 >gi|163849427|ref|YP_001637471.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527431|ref|YP_002571902.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670716|gb|ABY37082.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451310|gb|ACM55576.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 851 Score = 41.1 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 39/150 (26%), Gaps = 17/150 (11%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK-----KIDNVNDTVRM 297 +P S ++ ++ ++A+ +I+ D +M Sbjct: 444 WPFPDDPSCDGSPNFAWTPRE----ERRIYVAKEAVRLLIKQTNMPGNPGYDPTRPIDQM 499 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN-------EMGSTAINDAMQTAYDTI 350 +FN V G + + G T A + Sbjct: 500 AIVWFNQDVPESRIMLGGFSSDPAALDRAVLDAGKVNNDPYLSQGGTNTAGAFYRVGQLL 559 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ ++ I+L+TDG Sbjct: 560 -GRAPTGTTQLGREWTYRRAIILVTDGLAN 588 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 42/332 (12%), Positives = 95/332 (28%), Gaps = 20/332 (6%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ + +K G + AL++ V++G+ + VDV R E Sbjct: 4 RTHTYRPLKRVRGQSIPLLALMIVVLIGMVALSVDVGRTFSEERR--------------A 49 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + S P + + N+L+ N D +R A +N R Sbjct: 50 VAAANAASLSAMNTYIRRPAGTTNTVIYDSIVNSLRSNGIDLANNPNIRMEAYYLNARGE 109 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + D + P ++ + ++ + A V+++ + + Sbjct: 110 PIEGGAQITRDGTVAPDNVAYIQVNLEGEVDTF--FARVVNQNQLPIAATAYAGTCPPTD 167 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + D++ N F P D+ + + L PG Sbjct: 168 GVYPIAIDNDYISGNEFRNPGDQ--NGDGKIDNNWQKLTSGTYKGYTTMRLYLTENNVPG 225 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 S + + ++ L S+ + ++ T + ++ Sbjct: 226 GFGWLRWLNGRGASGANANSAQELELALSGTGSLSKGFMEVTPWPAPNLPQPTGYPEQPG 285 Query: 308 SDPSFSW--GVHKLIRTIVKTFAIDENEMGST 337 W G + + A+D + T Sbjct: 286 QLNVGDWVYGSSGFTASNDRRAALDAHISNGT 317 >gi|332992330|gb|AEF02385.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas sp. SN2] Length = 1360 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 45/397 (11%), Positives = 97/397 (24%), Gaps = 37/397 (9%) Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S + ++ + + + ++ + + + L++ Sbjct: 296 SVYQVSRSNDNIEENNNGYENTGSELTFSSNYNSYIGVRFQNVNIPQGAEITEAYIELTA 355 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 N S+ +R + + + R+ K GV+ S + Sbjct: 356 YSSYNQNNSSMLIRGVAAND-ADDFSSGNRYLLRNIGKTSGVTWSMGSFSSGNDYASPDL 414 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 S +S ++ Y S ++S L + Sbjct: 415 SGVVKEIVD----RAGWQSGNAMAFVFSDFTGYRGAYTYSGSQSSAAKLVVKFNGSATPG 470 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + S+ H+I K + + + I R + Sbjct: 471 QT---STVREHLISKVDELTASGYTPIVDTLYEAVNYYGGRDVDYGLTRGNYKVSNTVRK 527 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE-DEVHRMKNNLEAKKYIVLL 374 ++ G + N + + I S + +IVLL Sbjct: 528 NTRVSHRSAYLGGDAVQPNGCSDDNLSDSDCINEYIPSGATYISPVNDLQCQTNNHIVLL 587 Query: 375 TDGENTQD------NEEGIAICN--------------------KAKSQGIRIMTIAFSVN 408 +DGE + C + + TI F+ N Sbjct: 588 SDGEANNNHSVSLIESLLGDSCTGSGGEKCGLNLVKNVSDSDTSVIDRRVITHTIGFAAN 647 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 T S F++A+ + EL + F + Sbjct: 648 STANNFLNQLAVQ--SGGGFYQADDSTELLEAFNTIL 682 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 53/167 (31%), Gaps = 17/167 (10%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D +D VR+G +ND + H L +++ G T A++ Sbjct: 266 DISSDRVRVGLAQYNDNIYPAFQL--NQHPLKSMVLEQIQNLPYRPGGTNTGSALEFIRT 323 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 ++ + + ++L+TDGE N+E + ++ K G+ Sbjct: 324 NYLTEESGSRAK----DRVPQIVILVTDGE---SNDEVQEVADRLKEDGVV------VYV 370 Query: 409 KTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 + L AS F + + F I + V Sbjct: 371 VGVNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAV 417 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 62/226 (27%), Gaps = 32/226 (14%) Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK 286 + Y + + + VD +DA++S Sbjct: 40 DSDGQYSFTMEGKTTVTANTVNVAVIIDSSGSVDDDEWD---MSMAFAKDAVSSFADQNL 96 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 + + ++ F+ + + L + G T I D + Sbjct: 97 FTNGGSASI----AQFSSSASEGGT----FYSLEDFNAFVDGNTKYSSGGTDIIDGIAKG 148 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + +S ++++ TDG+++ A + A+ +G + + Sbjct: 149 RELLKAS-----------PATTSFMIVTTDGQSSS----PKAEADAARDEGTIVYAVGVG 193 Query: 407 VNKTQQEKARYFLSNCASPN-SFFEANSTHELNKIFRDRIGNEIFE 451 T + L + + F+ + EL+ + Sbjct: 194 TGPT-----QEILLDIGGEEANVFDVDGFDELDVALAGILSASGSS 234 >gi|256072623|ref|XP_002572634.1| tyrosine kinase [Schistosoma mansoni] gi|238657796|emb|CAZ28866.1| tyrosine kinase [Schistosoma mansoni] Length = 847 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 24/288 (8%), Positives = 68/288 (23%), Gaps = 17/288 (5%) Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS--WLIQTKAEAETVSR 169 + + + +Y ++ Y + S+ + + + + Sbjct: 547 NNSKDLKPSTFHTKSRSYAETVTRGYIVTPGVESISMNKFNYRESLMEKRNYTSSTAPLA 606 Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 ++ + + + + + ++ + + + Sbjct: 607 GAYECSMSDPSTEQTTLATTSNGTESNRQASVKLKSRSIQGNTENRVDKVTNSTNANLQH 666 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + S + L D + ++ + + + I Sbjct: 667 RQNQIQPPSSNFC-LDTLLRDQNGTKANSYKTLGYH------DKSSNHNNKFIALSTTNA 719 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N +D FN ++ + + GST TAY Sbjct: 720 NNSDQKGSHLVPFNYNHAHSKPLHS-KSFNSKSTSMVETMPYSGSGSTNNRS--LTAYWL 776 Query: 350 IISSNE-----DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 + + E + + + + GENT + CN Sbjct: 777 METGGGVDSLGYERPSLIRTSFPSQTNSVNSHGENTNISTFIDQNCNN 824 >gi|268529744|ref|XP_002629998.1| Hypothetical protein CBG13361 [Caenorhabditis briggsae] gi|187028722|emb|CAP32249.1| hypothetical protein CBG_13361 [Caenorhabditis briggsae AF16] Length = 359 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D + V + ND ++ ++T + D + ++ + +A Sbjct: 67 YEDPRSSRVAIITYN-NDATTVAQFPTFKSLDQLKTELTVLKNDGTSGNESYLDTGLSSA 125 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + KK +++ + + I + KS G+ ++T+ Sbjct: 126 QAILNKTED--------RMNYKKVVLVFASTYDLLYD-RPDYIAHALKSNGVTVITVYTG 176 Query: 407 VNKTQQEKARYFLSNCASPNSFFE--ANSTHELNKI 440 + + + L + AS F+ N+T +L Sbjct: 177 ND----KTVKGQLKSVASDGFAFQMSVNTTTDLQNA 208 >gi|33600818|ref|NP_888378.1| hypothetical protein BB1833 [Bordetella bronchiseptica RB50] gi|33568418|emb|CAE32330.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 747 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 46/166 (27%), Gaps = 1/166 (0%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I AL + + + + G + + Y + L++ A A +T L E + A Sbjct: 15 GSILIPAALAILIGVALLGAA-QLGYFFYMKRELQKTADLAALTGVQVLSPGDEAACAAA 73 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + + + + V S ++ + Sbjct: 74 GQAVRASVRANLGSRADEGLSIEPTCWRWARDEAELAPRYVREPADASERYNAVAVVLGM 133 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + L FL ++ I + + S + + Sbjct: 134 PVRALFPFLGGRTLEVEAIAARPADPIAAFSVGTGLLATGSGELTR 179 >gi|195121042|ref|XP_002005030.1| GI20245 [Drosophila mojavensis] gi|193910098|gb|EDW08965.1| GI20245 [Drosophila mojavensis] Length = 576 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 7/171 (4%), Positives = 27/171 (15%), Gaps = 5/171 (2%) Query: 211 TVKSYSSQNGKVGIRDEKLS-----PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 + + + P + + P + + + ++ ++ Sbjct: 403 SWNWTTPNWNWTTPTPPSWNWTTPTPPSWNWTTPNWNWTTPATPNWNWTTPNWNWTTPTP 462 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + + + Sbjct: 463 PNWNWTTPTPPSWNWTTPNWNWTTPNWNWTTPNWNWTTPNWNWTTPNWNWTTPNWNWTTP 522 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 T T + + S + L+ D Sbjct: 523 TPPSWNWTTPGTPSWNLTTPGLNASTPSWNLTTTETPATTTLSWWDKLMND 573 >gi|168209268|ref|ZP_02634893.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens B str. ATCC 3626] gi|170712663|gb|EDT24845.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens B str. ATCC 3626] Length = 928 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 37/390 (9%), Positives = 92/390 (23%), Gaps = 30/390 (7%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E S + + ++ NN + E R S ++ Sbjct: 35 AEEKSDSMSVEKVLDSEMYNKFNNEILNNKSSIKYTTDNNKGTWPVNWEYGNRSSKNKIN 94 Query: 133 LSSRYDLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S L +L + + ++ + +++D S SML Sbjct: 95 NSVYGKNQPLEYDEGYLTKKAYTTDEDNVFNINLKIQGKKNEALKKDVVFLLDNSNSMLT 154 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 R + + + + + + + S+ Y Sbjct: 155 NNRALKIKEEIKNVMDKLKAKNTRYALVTYAST--ILDGRHYHLMDGSIGNNQYTVYDLY 212 Query: 252 SLSEEHFVDSSSLRHVIKKKHL------VRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ + +S+++ + K + N Sbjct: 213 TSNQCYLNFTSNIQDIYNKIPSTVPNQRNNSYAGGTFTQQGLLK-----AIELLKNSNAD 267 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + G+ + + ++ + + + + Sbjct: 268 EKIIIHLTDGLPTFSFLLKEFGGNEKAIFDYNTQYNGIGVRGFGTSYFFNTKTQKPYIYS 327 Query: 366 EAKKYIVLL---TDGENTQDNEEGIAI-CNKAKSQG--IRIMTIAFSVNKTQQEKARYFL 419 + Y L E +N + K + I I TI + Sbjct: 328 REEVYSALNRSKNKYEAIWNNGFPTTLEAENIKKENPDINIYTIGIE-------LKKRVY 380 Query: 420 SNCASPNSFFEANSTHELNKI--FRDRIGN 447 +++A EL +I F + I + Sbjct: 381 KW-TDNRQYYDAEGIVELPEIRKFLESISS 409 >gi|167623667|ref|YP_001673961.1| cell wall anchor domain-containing protein [Shewanella halifaxensis HAW-EB4] gi|167353689|gb|ABZ76302.1| LPXTG-motif cell wall anchor domain [Shewanella halifaxensis HAW-EB4] Length = 850 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 19/175 (10%), Positives = 43/175 (24%), Gaps = 19/175 (10%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS- 352 FN V + + + G T ++ A+ A + + Sbjct: 489 QDSFNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDND 548 Query: 353 -------------SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 ++ + ++ +TDG ++ I N+ R Sbjct: 549 RGHNSKPVHDDDRYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKNQLGES--R 606 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + TI R ++ E+N+ + +V Sbjct: 607 LFTIGIGSAPNAHFMQRAAE---VGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVT 658 >gi|227818615|ref|YP_002822586.1| CpaE2 pilus assembly protein [Sinorhizobium fredii NGR234] gi|36958871|gb|AAQ87296.1| CpaE [Sinorhizobium fredii NGR234] gi|227337614|gb|ACP21833.1| CpaE2 pilus assembly protein [Sinorhizobium fredii NGR234] Length = 580 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 29/86 (33%), Gaps = 2/86 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVM-LGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 ++ L KS G + AL PV+ LG+ M DV + + + A Sbjct: 426 LRNKAQDLKKSEAGASAVEFALFAPVLALGLVAMA-DVALALHERMTIDHVLRAGAQAAM 484 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIE 91 + + ++ + + + Sbjct: 485 ADPGATQVDKVLQSTLAQSAKPANVT 510 >gi|296268733|ref|YP_003651365.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833] gi|296091520|gb|ADG87472.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833] Length = 587 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 9/84 (10%), Positives = 28/84 (33%), Gaps = 8/84 (9%) Query: 371 IVLLTDGENTQDNEEGIAICNKAK-----SQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 +++LT G++ + + + ++I+ + F + + + Sbjct: 499 LLVLTYGQDDGRGISRQELAEALRKEWNPDRPVQIVVVMFGAGRDRAALEEA---AAITN 555 Query: 426 NSFFEANSTHELNKIFRDRIGNEI 449 + A E+ +F I + Sbjct: 556 GEVYVARQPGEIIDVFLSAIARRL 579 >gi|156382210|ref|XP_001632447.1| predicted protein [Nematostella vectensis] gi|156219503|gb|EDO40384.1| predicted protein [Nematostella vectensis] Length = 8745 Score = 41.1 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 43/409 (10%), Positives = 108/409 (26%), Gaps = 38/409 (9%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + L ++ + T + ++D ++ Sbjct: 6620 ITDTAQPKKIAEKIIKNAFNDSINAPKLDLLFAVSSSSPDADQTFQHMQDTLQSVVETYG 6679 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 Y +V+ + F L + A + + ++ Sbjct: 6680 MHSIRYGLVVFGSAP---DVKFTFDEYFPNTEVLRKRMLTANNTAGKPDIVKVLEKAVLM 6736 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSS------QNGKVGIRDEKLSPYMVSCN 237 S S + ++ + + + VG + ++ + Sbjct: 6737 FASASSRPMAKKVLVVMIDAKSDNSADELLEAAKPLKNKKIKAPVGSKAVPDELTTITTD 6796 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL----------VRDALASVIRSIKK 287 K + +L + + + + + L +D A + IK Sbjct: 6797 KGNVIVTNTSQPSETLVNKLINKARTDKRDVPVLDLGFVISATSRDAKDVYAKMKDVIKS 6856 Query: 288 I--DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 +R F D SFS + I +++ A++ Sbjct: 6857 FINRYGTKNIRYSLITFGDDPRVKLSFSSNLGSDRELRSFIDVIPRTSDSGRSLDKALEQ 6916 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 A S + +AKK +V++ DG++T D E + G+ + I Sbjct: 6917 ARQLFTSE---------SRPDAKKVLVVMLDGKSTSDPRELPKKATALEKAGVTV--IPL 6965 Query: 406 SVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNEIFER 452 + + + +P ++ AN + + + + + Sbjct: 6966 VFEPSGSKDEMALI----TPYKDNIISANKSSDPMDVAEQILDKSLALE 7010 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 46/148 (31%), Gaps = 13/148 (8%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + + + D + SF + A+ G + + A+ A Sbjct: 1463 KYGIDRIHLSLIVYGDLPYTRVSFGQDIPDARTLKNLVEALS-RVSGGSDLGKALAEATK 1521 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 AKK +V++TD +++ + + GIR++ +A + Sbjct: 1522 VF--------RNDDVRPNAKKVLVVITDSKSSYGVASLVKAAEPLEHAGIRVLAVAIGAD 1573 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHE 436 E +A+S+ + Sbjct: 1574 ADASELRSIVQHQ----GDVIKASSSED 1597 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 50/164 (30%), Gaps = 18/164 (10%) Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + +R G + D V + F+ G+ + + + GS ++ A+ Sbjct: 1648 TRYGAERMRYGLMSYGDDVTTTIFFTDGITNPNSLLPFIDILQKTPSGS-SLEKALVGGK 1706 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 K++V++TD ++T D + G+ ++ IA Sbjct: 1707 KLFEHG--------GTRPGVDKFLVVITDRKSTGDAAQTENAKADLAKAGVNVLVIAVGS 1758 Query: 408 NKTQQEKARYFLSNCASPNSFFEAN----STHELNKIFRDRIGN 447 +E A+ S S E+ D I Sbjct: 1759 EVDHKELNET-----ATTPSHKLVTAPGESPAEVADRVVDIINK 1797 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 34/380 (8%), Positives = 92/380 (24%), Gaps = 35/380 (9%) Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 I+ + + + + V+ S+ L+ Sbjct: 491 IDLVFAISASASDPDTTLALMKKTAQHIVDTYGTSKIRYSLVIFGSQTSTQLSFDRKMPD 550 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + S + + + + + Q Sbjct: 551 RDSLSSMIAGMPRIPGRPAIDQAFLEADKLFQLQGTRPNARKVVVMMVDQRSAATDDDLR 610 Query: 210 RTVKSYSSQNGKVGIRDEKLSPY----MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 R+ + + V + + + + + P+ + Sbjct: 611 RSSRRLHQRKVDVIPVAIGSESDPDQLLHTTSDNTNLIKAAKSELPNSLANQIMQKIVQG 670 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + L A+ + + ++ + I + L + + Sbjct: 671 RKVPELDLAVAVSATATDADTTFMLAKNALKSIMDRYGVGAIRLSFIVYSDMALKKFTFQ 730 Query: 326 TFAI----------DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 G +A++ A++ + + A K +VL+T Sbjct: 731 DKYDEEELKRSIESAMRMDGGSALDKALEAGSEVLQ--------GPGARENAHKVLVLIT 782 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY------FLSNCASPNSFF 429 D + + E A + + I ++ IA + T +E + ++ + Sbjct: 783 DMTSGLSDAEIEAAAKPLQQKQIAVVPIAVGNDVTTRELEKTTPYRENVMTA-------Y 835 Query: 430 EANSTHELNKIFRDRIGNEI 449 A + L D+I I Sbjct: 836 RAENPEVLGAKIIDKILQAI 855 Score = 36.5 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 43/135 (31%), Gaps = 19/135 (14%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----------MG 335 + + D ++ + + ++G R + G Sbjct: 2416 TFEYMKDVIKSVINKYGAYKLRYALITYGNSAAERISFDDQISNTTALKHMLNQTERSRG 2475 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 + I A+ S A+K +V++TD + T D A + ++ Sbjct: 2476 GSNIGVAIAALKQLFASDA--------VRSNARKVLVIMTDSQATGDLNAAKAAAKELEN 2527 Query: 396 QGIRIMTIAFSVNKT 410 GI+++T+A N Sbjct: 2528 DGIKVITLALGPNAD 2542 >gi|325473817|gb|EGC77005.1| BatB protein [Treponema denticola F0402] Length = 286 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 43/148 (29%), Gaps = 36/148 (24%) Query: 318 KLIRTIVKTFAIDENE----MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + + T + + A D+ N K IVL Sbjct: 107 SFEKNALSSAINALSPLILSSTGTNLEAGVLRALDSF-----------GENRGNSKIIVL 155 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV---------NKTQQEKARYF------ 418 TDG T + + K K ++ + F ++ + + Sbjct: 156 CTDGGETSGS--LLHAAEKIKKTDAILIIVGFGTLEETKIKVLDEKGNTQLKDARLEEAF 213 Query: 419 LSNCAS----PNSFFEANSTHELNKIFR 442 L AS +++ A S+ + I + Sbjct: 214 LKKIASIAGGESTYISALSSGSIETILK 241 >gi|149173297|ref|ZP_01851928.1| hypothetical protein PM8797T_28944 [Planctomyces maris DSM 8797] gi|148848103|gb|EDL62435.1| hypothetical protein PM8797T_28944 [Planctomyces maris DSM 8797] Length = 1020 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 49/132 (37%), Gaps = 24/132 (18%) Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + K++ I D +T MQ + +I S+ K++ Sbjct: 518 PAGDYEKMVPKINGAQIGDMPSFANT-----MQLGLNGLIKSDAST-----------KHM 561 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFF 429 ++++DG+ + I K++ + + +A ++ ++ + + A ++ Sbjct: 562 IIISDGDPQPPTPQLIG--QFLKNK-VSVSMVAI-FPHGGEDISK--MRDIAGVTGGRYY 615 Query: 430 EANSTHELNKIF 441 + ++L IF Sbjct: 616 FPSDPNQLPSIF 627 >gi|172060502|ref|YP_001808154.1| hypothetical protein BamMC406_1450 [Burkholderia ambifaria MC40-6] gi|171993019|gb|ACB63938.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 648 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 44/170 (25%), Gaps = 7/170 (4%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE-VSSRAK 80 I ++ V+LGV +D+ + L++ A + A P+ +S Sbjct: 1 MAAIWVMVAIVVLGV----IDIANLYLQKRDLQRVVDLAALAAVQPMTSDPSGCLSDAKN 56 Query: 81 NSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA--YQVVLSSRYD 138 N + + Y N T + V + +++ Sbjct: 57 NVTSSANINDKGYAFTLISATATANPTAGNDQIAVSCGRWDSATAYVTPASASANAAQVT 116 Query: 139 LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 FL + S + T + V + S Sbjct: 117 AYRQVNYFFLGLLSQLSGRQAVVSATATARAAAIDTFSVGSTLANLNTSS 166 >gi|262381905|ref|ZP_06075043.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|301310438|ref|ZP_07216377.1| BatB protein [Bacteroides sp. 20_3] gi|262297082|gb|EEY85012.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|300832012|gb|EFK62643.1| BatB protein [Bacteroides sp. 20_3] Length = 339 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 55/185 (29%), Gaps = 28/185 (15%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + + L+ + ++G F + + + + + Sbjct: 108 SPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSDYISA-KMFLSS 159 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 TAI A+ A + + K I+L+TDGEN +D+ G Sbjct: 160 INPSMVSTQGTAIGAAINLAARSFTP-----------DETTDKAIILITDGENHEDDAIG 208 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 A K + I + K + N++ + + + +++G Sbjct: 209 AAKAAAEKGIHVNI--VGMGDPKGSPIPIQ-------GSNNYMKDKDGNVVITKLNEQMG 259 Query: 447 NEIFE 451 EI Sbjct: 260 QEIAA 264 >gi|94986631|ref|YP_594564.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] gi|94730880|emb|CAJ54243.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] Length = 530 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 13/73 (17%), Positives = 23/73 (31%), Gaps = 1/73 (1%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T A + + + + E +K + ++TDG I AK Sbjct: 425 TNSFAAEAHGSTPMTEALWWVLGMLSTRPEHRKIVFVVTDGYPDDPETAKETIA-VAKRM 483 Query: 397 GIRIMTIAFSVNK 409 GI ++ I Sbjct: 484 GIEVLGIGIDAPA 496 >gi|66805723|ref|XP_636583.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] gi|60464969|gb|EAL63080.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] Length = 585 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 64/188 (34%), Gaps = 16/188 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + L + + ++ V +G F +R+ + + + A Sbjct: 190 SRIELAQTLFQTFTDKAISLEV---PVAVGLVTFGERIELTFDLTRNFDSFSTELGEVVA 246 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN----LEAKKYIVLLTDGENTQDNE 384 T + +A+Q A +T++ E + LTDGE+ N Sbjct: 247 NQCK----TRLFEAIQLAAETLVKFKESCDAAATGGLKLSSNPMLRVFCLTDGEDNS-NF 301 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + K I + +I ++ +E+ F A+ S F A+S+ E ++F Sbjct: 302 DPYPVYQYMKKHNIILDSIPIGLD--GRERLSSFSQ--ATGGSCFIADSSLEGVELFERE 357 Query: 445 IGNEIFER 452 + R Sbjct: 358 ALLSLSAR 365 >gi|320105056|ref|YP_004180647.1| hypothetical protein Isop_3541 [Isosphaera pallida ATCC 43644] gi|319752338|gb|ADV64098.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 818 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 23/50 (46%), Gaps = 1/50 (2%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 I++ TDG+ T+ + + AK +G+ I+T+ ++ L Sbjct: 238 IIMFTDGQTTEGE-DLASAAELAKRKGVPIITVGLGDPDAPRDLELLELQ 286 >gi|319795785|ref|YP_004157425.1| tade family protein [Variovorax paradoxus EPS] gi|315598248|gb|ADU39314.1| TadE family protein [Variovorax paradoxus EPS] Length = 166 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 40/150 (26%), Gaps = 13/150 (8%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G + I A++ + + +V L+ AA+ A + Sbjct: 22 QRGIYAIEFAIVFLLFFALLYAIVCYGLVVALRFGLQNAAEDGARAALRYQLNLPARQVE 81 Query: 78 RAKNS---FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + N N T V R + + Sbjct: 82 AEAVALLRSNWMPAVVTRDADANVCQVATNNCTSP-------VCGVAWEDRCQMVVTITA 134 Query: 135 SRYDLLLNPLSLFL---RSMGIKSWLIQTK 161 + LL PL F + +G S L+ + Sbjct: 135 TNMRQLLPPLPSFAVPSQLIGQASMLLDGR 164 >gi|301780322|ref|XP_002925578.1| PREDICTED: collagen alpha-1(XIV) chain-like [Ailuropoda melanoleuca] Length = 1796 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +++ + G+T A+ ++ K +L+TDG Sbjct: 216 NTKDEVIEAVRNLPYKGGNTLTGLALNYIFE------NSFKPEAGARAGVSKIGILITDG 269 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 270 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADENELREIASEPDSTHVYNVAE 319 >gi|163752614|ref|ZP_02159791.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] gi|161327493|gb|EDP98702.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] Length = 564 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 46/196 (23%), Gaps = 13/196 (6%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S++ R + N ++ R + + + + + Sbjct: 126 STNTPLCETLYEAYRYFAGKGVYYGNDDSNYSNWYRANRPPRDTSIEDNGKYISPFKECQ 185 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + D + N ++ S + + T+ NT Sbjct: 186 NRAYVVYITDGVPTVDQSANSKIKGLPGVSSSDKFVNTSPNFTSYLPALAGWMNTNDVNT 245 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 N + Q + TI F + A L+ A ++ A S +L Sbjct: 246 --NAAFPEM------QSVTTFTIGF---SDGADDAAPLLTKTAELGGGEYYSAKSATQLQ 294 Query: 439 KIFRDRIGNEIFERVI 454 + Sbjct: 295 AALSQVFSQILEVNAS 310 >gi|126666680|ref|ZP_01737657.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] gi|126628725|gb|EAZ99345.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] Length = 1318 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 49/388 (12%), Positives = 100/388 (25%), Gaps = 34/388 (8%) Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + S+ + T ++ D R + + + Sbjct: 62 DNSNSMDEAPTGRAVGSANSGSKSEIARTAVKAIVNNFGDRTRIGLMAYKQSGINKRYLH 121 Query: 134 SSRYDLLLNPLSLFLRSMG-IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 S+YD NP + G S + + T S +Y ++ S ++ + Sbjct: 122 DSQYDASFNPNNYDATFTGDRASVTKRNRVSNPTNSGAY---IYYNVALPFYSSSNLGNA 178 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 S + + A +Y K D + + S + Sbjct: 179 FCYSSTADFDNGSETASGPWDNYDCYQRKTNTSDSGAAGFSQYLFSSRFEASDSDFAQNI 238 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 L ++ + + +I +D + T N S S Sbjct: 239 LDFGTYLTWQYVSLSWFSNSSPGQ--GMLHVAIDDVDAAHKT--RLLNKLNTSQFSSSSD 294 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + + + T G T N + T+ + + Y+V Sbjct: 295 TPLRNAGLTPLAGTLESASKYFGGTLSNGEAASGVSTLAPTPNF--------CGTQDYVV 346 Query: 373 LLTDGENTQDNEEGIAI------------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 L+TDG + + SQG+R + F + + L+ Sbjct: 347 LVTDGLPSVNKSGSKTTDVTSAVDAVADEAAALLSQGVRTYVVGFGLPTG---VDKSLLN 403 Query: 421 NCA---SPNSFFEANSTHELNKIFRDRI 445 A + AN + L+ Sbjct: 404 KIAIAGGTKATLFANDSTTLDAALGSIF 431 >gi|118364357|ref|XP_001015400.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|89297167|gb|EAR95155.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 2720 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 42/408 (10%), Positives = 107/408 (26%), Gaps = 41/408 (10%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + + ++KI + + L R + V DT + N Q Sbjct: 2311 NSQIQIMTEENQIEERKKIIKDATYSGAITLLTKIFGRGIDFQVNDTIIFQNYGVHVIQT 2370 Query: 132 VLSSRYD--LLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 S + + G S ++Q + + ++ + + I Sbjct: 2371 FFSEDKSEETQIKGRTARQGEEGSFSMVLQ-QDTLQAFLKTEEDQKKIENLDEIYEVLDR 2429 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 N S++ ++D + Sbjct: 2430 NRQVNFENDFKSNIEYNQKTSITSHKQSEDFLNFLKDCNQQEVKKYLLQLNQGPKENTFK 2489 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 + S+ + + ++ + +N + Sbjct: 2490 TLIAMDVTGSMSNLITQTKNTIQTTFEQTRDILTEKGYDPQCFQIMISCFRSYNSKWEEI 2549 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 S + +++F G T ++++ +N+ + Sbjct: 2550 FQTS--TWENNPDKLRSFLQKITASGGTWPGESVEVGLW---------WANKQNDEDPIS 2598 Query: 370 YIVLLTD----------------GENTQDNEEGIAI------CNKAKSQGIRIMTIAFSV 407 +++L D G++ D + C K K++ I + T Sbjct: 2599 QVIILGDQPAHLMNEAQKFRNQFGQSYWDTTPLKGLTYYVPECQKLKNKNIPVNTFYLKQ 2658 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTH---ELNKIFRDRIGNEIFER 452 + L+N S + + NS +L +F ++I +I ++ Sbjct: 2659 GAKSTFENIAKLTNGIS--QYLDINSGQSSKQLTTLFVEQILKDIGKQ 2704 >gi|3236344|gb|AAC23663.1| integrin alpha E2 [Rattus norvegicus] Length = 1167 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 39/105 (37%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ +D + + +K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFRDPLNLTTVISSSKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSF---FEANSTHELN 438 G+ I N + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-NAFENNNTYRELKLIASDPKAAHTFKVTNYSALD 369 >gi|320161534|ref|YP_004174758.1| hypothetical protein ANT_21320 [Anaerolinea thermophila UNI-1] gi|320161798|ref|YP_004175023.1| hypothetical protein ANT_23970 [Anaerolinea thermophila UNI-1] gi|319995387|dbj|BAJ64158.1| hypothetical protein ANT_21320 [Anaerolinea thermophila UNI-1] gi|319995652|dbj|BAJ64423.1| hypothetical protein ANT_23970 [Anaerolinea thermophila UNI-1] Length = 150 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 17/48 (35%), Gaps = 7/48 (14%) Query: 17 SCTGHFFIITA----LLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 G TA L++ +L + G D R Y L QA A Sbjct: 9 RERGSSLAWTAAFLGLVLIPLLMLVG---DGARLLYVRGRLAQATDAA 53 >gi|308472823|ref|XP_003098638.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] gi|308268238|gb|EFP12191.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] Length = 417 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 48/149 (32%), Gaps = 10/149 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN-- 340 I T R+G +N D L F + + +T Sbjct: 72 TRIGSNSTEPRTTRVGLVTYNSGAKLDADL-NKFQDLDGLYNGVFKDLSDVVDTTDSYLA 130 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + A + + S + + KK I++ + + + N+ K G+ I Sbjct: 131 TGLNAAEELLQSQSLNT-----TRDHYKKVIIVYASEYKGSGELDPVPVANRLKGSGVVI 185 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFF 429 +T+A+ + L+N ASP + Sbjct: 186 VTVAYDQGGDEGLLRD--LANIASPGFAY 212 >gi|308509926|ref|XP_003117146.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] gi|308242060|gb|EFO86012.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] Length = 713 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 48/165 (29%), Gaps = 16/165 (9%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-ND 341 I + R+ +N+ + F T G+ + + Sbjct: 412 SQIGTGYTDPRSSRVAIVTYNNYATTVADFRTIKSLKQLKTQLTALDQAGNSGNQSYLDQ 471 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + +A + + + KK +++ T + + KS G+ ++ Sbjct: 472 GLISAQSILT--------KFDDRENYKKVVLIFT-SSFDFIDSRPDRLAETMKSNGVTVI 522 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFE--ANSTHELNKIFRDR 444 T+ N + + L AS F+ N+T L Sbjct: 523 TV----NTGKDKTVEKQLEGVASAGFAFKMSTNTTETLQNALTTI 563 >gi|262195149|ref|YP_003266358.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078496|gb|ACY14465.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 412 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 16/161 (9%), Positives = 46/161 (28%), Gaps = 17/161 (10%) Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + + ++ +R + D G T +++ + + ++ + + Sbjct: 186 TQVAPAFNNLTALRADIGGPLNDPPNAGDTPTGESIDAIAENFPDNGPNDKPLIVLATDG 245 Query: 368 KKYIVLLTDGENTQDNEEGIA------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + TD + D +A GI + ++ + + + + Sbjct: 246 EPDS--CTDPDPNTDPGRAATRRLSEEATQRAFEAGIELYVLSVGNDV-GADHLQRVANA 302 Query: 422 CAS--------PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 P + + N+ EL F + I + Sbjct: 303 GVGKALDESNDPATVYIGNNQQELVDAFSEIIRSARSCEFT 343 >gi|33596976|ref|NP_884619.1| hypothetical protein BPP2383 [Bordetella parapertussis 12822] gi|33566427|emb|CAE37680.1| putative exported protein [Bordetella parapertussis] Length = 749 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 46/166 (27%), Gaps = 1/166 (0%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I AL + + + + G + + Y + L++ A A +T L E + A Sbjct: 17 GSILIPAALAILIGVALLGAA-QLGYFFYMKRELQKTADLAALTGVQVLSPGDEAACAAA 75 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + + + + V S ++ + Sbjct: 76 GQAVRASVRANLGSRADEGLSIEPTCWRWARDEAELAPRYVREPADASERYNAVAVVLGM 135 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + L FL ++ I + + S + + Sbjct: 136 PVRALFPFLGGRTLEVEAIAARPADPIAAFSVGTGLLATGSGELTR 181 >gi|91977979|ref|YP_570638.1| TadE-like [Rhodopseudomonas palustris BisB5] gi|91684435|gb|ABE40737.1| TadE-like [Rhodopseudomonas palustris BisB5] Length = 208 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 17/151 (11%), Positives = 44/151 (29%), Gaps = 12/151 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++L + +G A ++P+ML + V++ W A + S Sbjct: 12 RRCGERLARDRSGLAATEFAFIVPLMLLMFFATVEISTWV--------AVDRKVTLVSRT 63 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD----TAVEMN 123 L + + +S L + + ++ V + + + Sbjct: 64 LSDLVSQATSVTDKDLPNVFLASYGVLAPYPTDTAEATISEIYVNNAGVAKVQWSKLAKV 123 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK 154 + + V + ++ G+K Sbjct: 124 AKSGNNKPVATIEVSSRKPGDTVTTLPDGLK 154 >gi|311899781|dbj|BAJ32189.1| hypothetical protein KSE_64300 [Kitasatospora setae KM-6054] Length = 154 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 13/107 (12%), Positives = 27/107 (25%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ A+ ++ V G+++D AQ A A L + Sbjct: 17 DRGSISVMVAITAVSLVAVVGLVLDFGGQLRSTERADALAQEAARVAGQQLDITKLRAGG 76 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + YL + ++ +I Sbjct: 77 GYQVDPKQAVSAGRAYLASQHVSGSVDFPRYPDLSEISVTVDTTYPT 123 >gi|325108017|ref|YP_004269085.1| TadE family protein [Planctomyces brasiliensis DSM 5305] gi|324968285|gb|ADY59063.1| TadE family protein [Planctomyces brasiliensis DSM 5305] Length = 162 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 11/104 (10%), Positives = 34/104 (32%), Gaps = 5/104 (4%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-----TASVPLIQS 71 G + A+++PV +V+ + L+ AA+ A + + + Sbjct: 16 KRRGAALVEFAIVVPVFGLFLAAMVEFGHVYMVQTTLRGAAKKAARLGIGDGVTSADVSA 75 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 ++ I++ + + + + ++ +I Sbjct: 76 EATRIVQSACRTDGLTVLIKDAGLFDTSTEVPDDLDFDDLPNIE 119 >gi|323525753|ref|YP_004227906.1| hypothetical protein BC1001_1410 [Burkholderia sp. CCGE1001] gi|323382755|gb|ADX54846.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 613 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 9/77 (11%), Positives = 27/77 (35%), Gaps = 1/77 (1%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + + G ++ A+ + + + G VD+ + Y L++ A A + + Sbjct: 39 VARRQRGAIALLAAVWVSLAVIALG-AVDIGHFYYARRDLQRTADLAAAAGVQLIGSAGG 97 Query: 74 EVSSRAKNSFTFPKQKI 90 ++ + + Sbjct: 98 CAAAASSAQLNATANGL 114 >gi|163750668|ref|ZP_02157905.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] gi|161329663|gb|EDQ00654.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] Length = 1180 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 32/318 (10%), Positives = 72/318 (22%), Gaps = 61/318 (19%) Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLS 254 + F + T S + + L + Sbjct: 169 QNGNHIARGFPVDSIGTSSSPVPYTYITSSASSGDRAAALDKAFLTNFGLGKSVTLYTER 228 Query: 255 EEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID----NVNDTVRMGATFFNDRVISDP 310 ++ + + + + + V+ + +D N T + + Sbjct: 229 YVNWYHGRKPKEWKTRMEIAKRVMEDVVVTTPSVDFGLAVFNYNTSNYNTTYRNDGGRII 288 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY----------------------D 348 S ++ + + + + + T + + + AY Sbjct: 289 SGIQRLNAIEKKSLVQTINNLSAETWTPLCETLYEAYLYFGAKAPLYALEAGIASPPRDT 348 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI------------------- 389 +I + ++ + YIV +TDG T D+ + Sbjct: 349 SIEREGKYLSPFKESQCSNRSYIVYITDGAPTNDSAANYTVSQLPGYDKNNIVNGSYLPA 408 Query: 390 -----------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHE 436 A Q + TI F A L A ++ A + Sbjct: 409 LTSIMNNEDINTKLAGDQFVSTFTIGF---SDGATSAAPILKKAAELGGGDYYAAKDAIK 465 Query: 437 LNKIFRDRIGNEIFERVI 454 L D + Sbjct: 466 LQSALSDIFSQILEVNAS 483 >gi|156394499|ref|XP_001636863.1| predicted protein [Nematostella vectensis] gi|156223970|gb|EDO44800.1| predicted protein [Nematostella vectensis] Length = 175 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 39/123 (31%), Gaps = 14/123 (11%) Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 D R ++D GV + I E G T I+ A+ Sbjct: 46 DIFDISPGQSRASLIIYSDFPKLIFDLEDGV--TNQNITSVLKNLEYLRGRTRIDKALMM 103 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIAIC-NKAKSQGIRIMTI 403 A + + + +LTDG+ TQD + + + + K G++I I Sbjct: 104 AEEVFADARP----------TVPRIAFILTDGKQTQDYDAIPLDVSSQRLKKMGVKIYVI 153 Query: 404 AFS 406 Sbjct: 154 GVG 156 >gi|159901411|ref|YP_001547658.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159894450|gb|ABX07530.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 337 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 51/165 (30%), Gaps = 21/165 (12%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 ++ + + + +++ ++ F + + Sbjct: 99 SMAAQDVRPSRIDASKRMVLALLDRLEGNQVG-------MLMFGSSSYVQFPLTSDLAAA 151 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R++V+ +G T + + + + + ++L+TDG + Sbjct: 152 -RSLVEPINPRGLSLGGTDVEEVITEGLRSFP-----------IGQIEGRTMILITDGGD 199 Query: 380 T--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + Q + E +A +A G+ I TI + Q L N Sbjct: 200 SDEQSDGEAVAAAREAAKMGLTIHTIGMATEAGGQIPIYDDLGNI 244 >gi|33593029|ref|NP_880673.1| hypothetical protein BP2003 [Bordetella pertussis Tohama I] gi|33563404|emb|CAE42283.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382442|gb|AEE67289.1| hypothetical protein BPTD_1973 [Bordetella pertussis CS] Length = 747 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 47/166 (28%), Gaps = 1/166 (0%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I AL + + + + G + + Y + L++ A A +T L E + A Sbjct: 15 GSILIPAALAILIGVALLGAA-QLGYFFYMKRELQKTADLAALTGVQVLSPGDEAACAAA 73 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + + + + + + V S ++ + Sbjct: 74 GQAVRASVRANLGSRADDGLSIEPTCWRWARDEAELAPRYVREPADPSERYNAVAVVLGM 133 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + L FL ++ I + + S + + Sbjct: 134 PVRALFPFLGGRTLEVEAIAARPADPIAAFSVGTGLLATGSGELTR 179 >gi|118431306|ref|NP_147675.2| hypothetical protein APE_1031.1 [Aeropyrum pernix K1] gi|116062628|dbj|BAA80016.2| conserved hypothetical protein [Aeropyrum pernix K1] Length = 463 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 51/168 (30%), Gaps = 18/168 (10%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI------DNVNDTV 295 Y L +DSS + + + I + + ++ + Sbjct: 274 YFWASFSSSKLLLYRKVLDSSRGPIYVL-LDKSGSMVGAKIDWARAVAVALFRRSLAENR 332 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R A FF+ + + G T I A++TA D I + Sbjct: 333 RFSARFFDSVTYPAIHLRPRSKPRDFLELVKYLAAVKAGGGTDITAAIKTAADDISRTPR 392 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 E IVL+TDGE+ + + + K R+ T+ Sbjct: 393 GEQRISD--------IVLITDGEDRLNIDVVE---DSLKRSDARLHTV 429 >gi|157412073|ref|YP_001481413.1| TerY1 [Escherichia coli APEC O1] gi|99867098|gb|ABF67743.1| TerY1 [Escherichia coli APEC O1] Length = 239 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 49/152 (32%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 48 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLTDLLS--------FQMPA 99 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 100 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 157 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A++ G+ + + L Sbjct: 158 KAART-GVVVAC------AAGHDADTSVLKEI 182 >gi|329847247|ref|ZP_08262275.1| tadE-like family protein [Asticcacaulis biprosthecum C19] gi|328842310|gb|EGF91879.1| tadE-like family protein [Asticcacaulis biprosthecum C19] Length = 177 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 6/68 (8%), Positives = 19/68 (27%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + G + AL+ ++ + +++ +L A A Sbjct: 6 VRNRLQSFCRDHKGAAAVEFALIAGPLVFLICACIELALVILLSVSLDNATDVASRQIRT 65 Query: 67 PLIQSLEE 74 + + Sbjct: 66 GIATTGNT 73 >gi|313247257|emb|CBY15545.1| unnamed protein product [Oikopleura dioica] Length = 409 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 41/108 (37%), Gaps = 16/108 (14%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T ++ ++ A+ + R + N +KK IV++TDGE T+ + AK Sbjct: 163 TNVHFGLRKAFTEFSTC-----PRDRRNQNSKKAIVIITDGEFTEPKIAYEDL-KIAKKN 216 Query: 397 GIRIM--TIA-FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 + I I F +N + + + A+ EL + Sbjct: 217 EVEIFILAIGDFDLNLSHDDIEDTI-------DYIIHADDFEELLEKI 257 >gi|313216071|emb|CBY37449.1| unnamed protein product [Oikopleura dioica] Length = 107 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 29/81 (35%), Gaps = 12/81 (14%) Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + KK++VLLTDG++ D AK+ IR I + L Sbjct: 8 EAGSRPYVKKHMVLLTDGQSQDDVGAP---ARAAKNFNIRTFAIGVG------DAIEDEL 58 Query: 420 SNCASP---NSFFEANSTHEL 437 A+P ++ + + Sbjct: 59 KLVATPPFSDTLYHVEDYDGI 79 >gi|283769328|ref|ZP_06342227.1| TadE-like protein [Bulleidia extructa W1219] gi|283103985|gb|EFC05369.1| TadE-like protein [Bulleidia extructa W1219] Length = 189 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 25/51 (49%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 + + +K G FI AL++P+++ V ++ DVVR + L A Sbjct: 16 LLKKITRFLKREDGQSFIEFALVLPILITVLSVVFDVVRIVDAKMVLNNVA 66 >gi|148704833|gb|EDL36780.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_a [Mus musculus] Length = 608 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 43/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ A G TA DA+ + + Sbjct: 470 TYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVF--------GPIRD 521 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 522 SPNKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLRDMAS 572 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 573 KPKESHAFFTREFTGLEPIVSDVI 596 >gi|251791983|ref|YP_003006703.1| TadF [Aggregatibacter aphrophilus NJ8700] gi|247533370|gb|ACS96616.1| TadF [Aggregatibacter aphrophilus NJ8700] Length = 200 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 10/56 (17%), Positives = 18/56 (32%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 M F +SK + G I ++ ++ + L D + L A Sbjct: 1 MNITDFFSSHSKNFRTNKRGSVTIEFLFMLMLLTFIFAFLADFAIIRITQGKLDNA 56 >gi|1846005|gb|AAC51244.1| collagen type XII alpha-1 [Homo sapiens] Length = 3063 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 18/128 (14%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 +++++ A + G+T A+ ++ + A+K VL+TDG++ Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQ------QNFRTQAGMRPRARKIGVLITDGKS 1312 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHE 436 D E K K +G+ + I + L A+ + Sbjct: 1313 QDDVEAPS---KKLKDEGVELFAIGI------KNADEVELKMIATDPDDTHDYNVADFES 1363 Query: 437 LNKIFRDR 444 L++I D Sbjct: 1364 LSRIVDDL 1371 >gi|325108016|ref|YP_004269084.1| TadE family protein [Planctomyces brasiliensis DSM 5305] gi|324968284|gb|ADY59062.1| TadE family protein [Planctomyces brasiliensis DSM 5305] Length = 166 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 13/136 (9%), Positives = 36/136 (26%), Gaps = 3/136 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITASVPLIQSLE 73 S G + A + PV L + +++ L A + + E Sbjct: 14 ADSRRGAAMVEFAFVAPVFLLLIIGTIEMGNALEASTQLSSALREGGRLAGMDWEGLISE 73 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + K + + +V + + +++ Sbjct: 74 NETPNQKVIRDIKNFLTAAGYPGDSVTVTITSAEGTDVGETFNLGNPDNE--MRLFKLEA 131 Query: 134 SSRYDLLLNPLSLFLR 149 + Y+ + + F++ Sbjct: 132 TIPYEQISSFPHGFMK 147 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 42/125 (33%), Gaps = 17/125 (13%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 G F+ + R + + G T+I ++TA+ I + Sbjct: 99 WTGMVTFDSSATIQSALIQIETDAQRNSLISRLPT-AAGGGTSICSGLRTAFTVIKNKF- 156 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 IVLLTDGE++ I+ C ++ K G I T+A + + Sbjct: 157 ---------STDGSEIVLLTDGEDS-----TISSCFDEVKQSGAIIHTVALGPSADPGLE 202 Query: 415 ARYFL 419 + Sbjct: 203 ELAKM 207 >gi|114764814|ref|ZP_01443996.1| hypothetical protein 1100011001322_R2601_10479 [Pelagibaca bermudensis HTCC2601] gi|114542700|gb|EAU45723.1| hypothetical protein R2601_10479 [Roseovarius sp. HTCC2601] Length = 249 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 19/60 (31%), Gaps = 4/60 (6%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGML---VDVVRWSYYEHALKQAA 57 M T +K + G+ I ++PV+ + G DV R A Sbjct: 1 MNMTT-IKTLLRKFRRDNEGYVTIEVMFMLPVLFVLFGAAWVYFDVFRQQSVNQKANYAI 59 >gi|73745523|emb|CAI61969.2| putative TerY1 protein [Escherichia coli] Length = 239 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 49/152 (32%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 48 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLTDLLS--------FQMPA 99 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 100 LTASGTTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 157 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A++ G+ + + L Sbjct: 158 KAART-GVVVAC------AAGHDADTSVLKEI 182 >gi|301623011|ref|XP_002940816.1| PREDICTED: integrin alpha-M-like [Xenopus (Silurana) tropicalis] Length = 907 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 35/276 (12%), Positives = 76/276 (27%), Gaps = 22/276 (7%) Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVK-SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 SM + + + F DR + ++ + + + + Sbjct: 147 SMFQGGKLNVLPSADGFFVDTDRPITFQNPDRSFGYQVVQLNQRVIVSAPLQQDGVNKTG 206 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLV-------RDALASVIRSIKKIDNVNDTVRMGA 299 S ++ +V + + V + + Sbjct: 207 TLYSCDPSTSRCQRVPINECPVRSLDIVFLIDGSGSIYPDQFQEMLTFVSKVIEDFKGTD 266 Query: 300 TFFNDRVIS---DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 T F S F + + G+T A+Q + Sbjct: 267 TLFAIMQYSSWFVLHFDFNKFLATKDHNSLVRNINQLKGATWTATAIQKVLSQLF----- 321 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 + ++K ++++TDGE D+ + +A+ +GI I ++ A Sbjct: 322 -IPSRGARDGSQKLLIVITDGEKYGDSLDYSIPIAEAERKGIVRFAIGVG-RAFSEDTAY 379 Query: 417 YFLSNCA---SPNSFFEANSTHELNKIFRDRIGNEI 449 L A S + L+K FR + ++I Sbjct: 380 SELITIASQPSDKYVYRVGDFSALSK-FRKDLQDKI 414 >gi|270159275|ref|ZP_06187931.1| type IV fimbrial biogenesis PilY1-related protein [Legionella longbeachae D-4968] gi|289165901|ref|YP_003456039.1| type 4 fimbrial biogenesis protein PilY1 [Legionella longbeachae NSW150] gi|269987614|gb|EEZ93869.1| type IV fimbrial biogenesis PilY1-related protein [Legionella longbeachae D-4968] gi|288859074|emb|CBJ13003.1| putative type 4 fimbrial biogenesis protein PilY1 [Legionella longbeachae NSW150] Length = 1159 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 42/384 (10%), Positives = 90/384 (23%), Gaps = 24/384 (6%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + + + + Y + +N T V + S Q Sbjct: 145 STDFALATYKTSSVGIYNTWVYHMSPQGSNFSFTNTPVAGNTYVNNPCYNYTSASSTIQS 204 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 SS + + M I + + S ++ + Sbjct: 205 NCSSIASFYGSSVLSSNAYMQIGATSDNPAINDVLYAGSGFPGVFLTYSGPSPATP---- 260 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 Y + N + + + PY S Y G Sbjct: 261 YPPNFSISNYNNGNVLISYSKSLPNIGGFGTSPTNAGFVPYSPQVLYSQRGFGYSGTQSA 320 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + +S+ + + +A+ ++ N T + + + S Sbjct: 321 TSGNVLVPMTSAGTNPTT--TSINNAIGIFTPYLQPETNSTSTKEI-------KASAVQS 371 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 G+ +T + + + + N + + I Sbjct: 372 PVAGLMTTAKTYLTSLGNTSGNGCPQKKYVILISDGLPTQDLNGKLWPPLGSAAATGYGI 431 Query: 372 V--LLTDG----ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ--QEKARYFLSNCA 423 DG N Q + I+ K QGI I +A L+ A Sbjct: 432 TATFNPDGSLQSTNCQALTDAISAITALKQQGIPTYIIGLGAGVNPSLNPQAAASLTAMA 491 Query: 424 SPN---SFFEANSTHELNKIFRDR 444 +++ A ++ +L Sbjct: 492 VAGGTTNYYPATNSTDLVNDLNAI 515 >gi|163846487|ref|YP_001634531.1| hypothetical protein Caur_0910 [Chloroflexus aurantiacus J-10-fl] gi|222524270|ref|YP_002568741.1| hypothetical protein Chy400_0993 [Chloroflexus sp. Y-400-fl] gi|163667776|gb|ABY34142.1| conserved hypothetical protein [Chloroflexus aurantiacus J-10-fl] gi|222448149|gb|ACM52415.1| conserved hypothetical protein [Chloroflexus sp. Y-400-fl] Length = 170 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 40/163 (24%), Gaps = 11/163 (6%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL----- 68 + G I AL++P + ++++V L+ A Q A +A L Sbjct: 1 MWHRHHGQALPILALMLPALTLFILVVIEVANLWLDVALLEDALQQATRSAVQHLDYAML 60 Query: 69 -----IQSLEEVSSRAKNSFTFPKQKIEEYLIRN-FENNLKKNFTDREVRDIVRDTAVEM 122 + I + N + + + Sbjct: 61 ARNTQSLRGSRTCQAVTVTNPGHCAAIVNVAHQFLLVNLRTARLDGHDPDKLAAAVRWTV 120 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 P+ V SS L G+ + +A Sbjct: 121 LPQGGTCAYVTSSTPLLCAEVQPTLHGLFGLGEIRPRIRAADT 163 >gi|300788919|ref|YP_003769210.1| hypothetical protein AMED_7090 [Amycolatopsis mediterranei U32] gi|299798433|gb|ADJ48808.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 134 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 15/108 (13%), Positives = 27/108 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ G ++ L +L G VD R A+ A L Sbjct: 2 RSVRGERGSASVLVVGLSLAVLIGVGFAVDGSRKGLAYSEATSIAEEAARAGDQALRVPA 61 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + A + Y++ + R V D Sbjct: 62 LAAGTDADVVPDLAAAEARRYVVSAGATGTVQVDNGRIVVDTTIVRQT 109 >gi|78186536|ref|YP_374579.1| hypothetical protein Plut_0658 [Chlorobium luteolum DSM 273] gi|78166438|gb|ABB23536.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 140 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 9/57 (15%), Positives = 19/57 (33%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + G + A ++PV+L + +V Y + L A + + Sbjct: 19 ARDQKGSVLVEFAFILPVLLMLLFGVVYFSVALYNKTVLTMATREGARAGVLYDADG 75 >gi|2497428|sp|Q60677|ITAE_MOUSE RecName: Full=Integrin alpha-E; AltName: Full=Integrin alpha M290; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor gi|535477|gb|AAC52142.1| alpha M290 integrin [Mus musculus] Length = 1167 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 38/105 (36%), Gaps = 10/105 (9%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T AMQ D I + + +A K +V+LTDG+ D + N K Q Sbjct: 272 TKTASAMQHVLDNIF------IPSRGSRKKALKVMVVLTDGDIFGDPLNLTTVINSPKMQ 325 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 G+ I ++ + L AS F+ + L+ Sbjct: 326 GVVRFAIGVG-DRFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 >gi|88857524|ref|ZP_01132167.1| type IV pilin biogenesis protein, putative [Pseudoalteromonas tunicata D2] gi|88820721|gb|EAR30533.1| type IV pilin biogenesis protein, putative [Pseudoalteromonas tunicata D2] Length = 1220 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 47/180 (26%), Gaps = 20/180 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 + + + V + GV + A Sbjct: 352 SYTGNTPPYDASTMSGSSYVSPFKEGCGSNAYIIL-ITDGVPTVDSDANSYIATLIATK- 409 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T N+ +++ + N Y+ L N D ++ + Sbjct: 410 -TDPNN---------QTTSSSPFYFKVGNKTNSSYLPNLAHWMNNNDVNTNVSG-----N 454 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 Q + + TI F ++ L +F A+S+ EL++ F + + Sbjct: 455 QNVSLYTIGFGEGA--EDAEELLLEAATKGGGKYFGASSSIELSQAFSKALSKILEVNTT 512 >gi|297671393|ref|XP_002813835.1| PREDICTED: collagen alpha-1(VII) chain-like [Pongo abelii] Length = 2889 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 41/124 (33%), Gaps = 21/124 (16%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + R + A D E +IN M A + ++ D V K + Sbjct: 81 TVQYSDDPRGQWASDASDSFEGTGLSINVLMIRAILSSVTQARDCVP---------KVCI 131 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFF 429 L+TDG + + + K QGI++ + + L AS + FF Sbjct: 132 LITDG---KSQDLVDTAAQRLKGQGIKLFAVGI------KNADPEELKRVASQPTSDFFF 182 Query: 430 EANS 433 N Sbjct: 183 FVND 186 >gi|189350352|ref|YP_001945980.1| hypothetical protein BMULJ_01515 [Burkholderia multivorans ATCC 17616] gi|189334374|dbj|BAG43444.1| putative membrane protein [Burkholderia multivorans ATCC 17616] Length = 599 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 38/142 (26%), Gaps = 5/142 (3%) Query: 32 VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 ++ V +DV + L++ A A I + + + +S A + Sbjct: 1 MIAIVVLGAIDVGNLYFQRRNLQRIADMAAIASVESMTDQCSQQNSPAMMAAQSNALANG 60 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 + + + T + V +S + L F Sbjct: 61 FDYRASGQTLSIECGRWDT-----SATPYFNSTFTPLNAVSVSVTQQVPYIFLGRFFGKS 115 Query: 152 GIKSWLIQTKAEAETVSRSYHK 173 G + + A+ ++ Sbjct: 116 GSTGATVAAFSTAKAINIDSFT 137 >gi|91228647|ref|ZP_01262563.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] gi|91187798|gb|EAS74114.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] Length = 2510 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 35/347 (10%), Positives = 92/347 (26%), Gaps = 22/347 (6%) Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 I + S S I E + N N + D + + Sbjct: 1886 ITDNDGDSKTGYFSVNPYAPPIVEGVTLNVSEEGLSNAIADDSALNGFDDTTDSSTDVEQ 1945 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + L P + + +W + + S + + +S+ S Sbjct: 1946 LTLGQTVDSVTLSEPSTALTSNGVSVTWTLSNYDQMLIGSANGEEVIKISVNDTGVVSTE 2005 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 +L S + V S + + + ++ ++ Sbjct: 2006 LLGPIDHSNPSGEDTLNIEVPVLV-SNARGLTNSTTVNVIVEDDSPDSASIIHDVVAETK 2064 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + + +D S + +++D+ ++ + I R+ ++ + Sbjct: 2065 ES--ANVQLIMDVSGSMRTDNRLQIMKDSATQLLNQYESI----GQTRVQIITYSSTAST 2118 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + K + G+T N+A+ A + A Sbjct: 2119 YAIGAATWLTVEEA--KAYIETLTAGGATNYNNALNEAKQ--------SWDDVGKLTSAS 2168 Query: 369 KYIVLLTDGENTQ-----DNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 L+DG+ ++ + + +I +A+ + Sbjct: 2169 NVSYFLSDGQPNPASSFINDAREQSWIDHLTDPDNQITALAYGMGVN 2215 >gi|308510002|ref|XP_003117184.1| CRE-CLEC-143 protein [Caenorhabditis remanei] gi|308242098|gb|EFO86050.1| CRE-CLEC-143 protein [Caenorhabditis remanei] Length = 441 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 31/255 (12%), Positives = 79/255 (30%), Gaps = 10/255 (3%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 A S+ +G G + S + Sbjct: 23 PAFTETIAVSVALVLIAIYVYVSFFIYSGVFGSPIPQNSDVYADPLCPGNIKNLWLDVVV 82 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + + +S L V V A++++ + D + V + + V ++ Sbjct: 83 VVDKSQLMTNSQLWQVRNTISQVFGAVSNIGPTKWPADPRSTCVGVVTYDNSSTVNAEV- 141 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 K + + +T + + + + +D R + I Sbjct: 142 ---DASKSFSDLYNVIQSSLVAVDTTNTSY-LSSGLLAAEKAFKDSRQRTYRYKFQQVVI 197 Query: 372 VLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN-SFFE 430 D + + ++ + N+ K+ G+ I+T++ + + + + ASP +F + Sbjct: 198 AFAADYQGSGTIKDAKPVANRLKNNGVSIITVSCTNDPNASNEIQSL----ASPGFNFID 253 Query: 431 ANSTHELNKIFRDRI 445 +T +L K + + Sbjct: 254 EMNTAKLVKQLTNAL 268 >gi|217973614|ref|YP_002358365.1| LPXTG-motif cell wall anchor domain-containing protein [Shewanella baltica OS223] gi|217498749|gb|ACK46942.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS223] Length = 772 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 52/180 (28%), Gaps = 20/180 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A +++ ++K + FN + + S + + F Sbjct: 407 DSIVQAKNALLYALKGLKPE---DSFNIIEFNSSLSQFSATSLPATSSNLSRARQFVSRL 463 Query: 332 NEMGSTAINDAMQTAY-DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + A+ A ++ S + D V + ++ +TDG + I Sbjct: 464 QADGGTEMALALDAALPKSLGSVSPDAVQ-------PLRQVIFMTDGSVGNEQALFDLIR 516 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + R+ T+ F+ A +F E++ + Sbjct: 517 YQIGES--RLFTVGIGSAPNSH-----FMQRAAELGRGTFTYIGKVDEVDAKISALLSKI 569 >gi|90403616|ref|NP_001035046.1| integrin alpha M [Bos taurus] gi|74229861|gb|AAX46797.1| integrin alpha M [Bos taurus] Length = 1152 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++ + S+ + A K ++++TDGE D E +A Sbjct: 223 GRTHTATGIRKVVRELFHSSSGARN------HAIKIMIVITDGEKYLDPLEYSDAIPEAD 276 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANSTHELNKI 440 + I I + + K+R L AS + F+ N+ L I Sbjct: 277 REKIIRYVIGVG-DAFRGRKSRQELDTIASKPPADHVFQVNNFEALKTI 324 >gi|325267978|ref|ZP_08134626.1| hypothetical protein HMPREF9098_2354 [Kingella denitrificans ATCC 33394] gi|324980581|gb|EGC16245.1| hypothetical protein HMPREF9098_2354 [Kingella denitrificans ATCC 33394] Length = 1106 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 38/374 (10%), Positives = 82/374 (21%), Gaps = 40/374 (10%) Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 ++ + T P + + + V N ++ VV + Sbjct: 22 QSAAAATQPYPSVPLIWQSGTAAIKPNILLFLDTSGSMTTGGVRYNGGTASRIVVAKA-- 79 Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + ++ + T SR G+ + ++ + S D Sbjct: 80 -VARDVITATRENNNWGLMTFNGSMRDATHSRDIANGTGILVYDALNPTASEQRIWSDQW 138 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 QP + + N I P + Y Sbjct: 139 YSIGGKLKQPVLDVSSTTGAANYNQLISTINGLPANTNTPIPSAYYEAIRYFRGMPEGTT 198 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ K ++ + V + + + Sbjct: 199 ANSPNTQYTSPIKYRCQKNYIIFVSD---------GEPTGFPVRYQLGRLMRQDYILRSD 249 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +R + A + + A ++ S D + N+ G Sbjct: 250 ATLRQAINNGANGSWDSAYSTARIAYLAHNWDMMDSGRDAEGKSFNDATGSPV------G 303 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTH 435 ++ I T+ F N LS A +F AN Sbjct: 304 QDFSKQT-------------ITTYTVGFQANV-------QLLSEMARQGGGQYFIANEGD 343 Query: 436 ELNKIFRDRIGNEI 449 +L + Sbjct: 344 QLKAALTSALNAIA 357 >gi|209546010|ref|YP_002277900.1| hypothetical protein Rleg2_5625 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538867|gb|ACI58800.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 546 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 51/198 (25%), Gaps = 2/198 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K+L + G I + + + +++ + Y + + A A + + Sbjct: 4 TLIKRLHRDERGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAALVHADWE 63 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + L ++ S + L T + P Sbjct: 64 ARYLNIMAMNNVASSQATVVMATSVAFQLTTAELALRSTAILAKLAEYSFTEGFGPASLL 123 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK--EHGVSIQWVIDFS 186 + + G +A T++ +Y + +I+ S Sbjct: 124 PPMPPMPYCPGWQKVPIVGGIIYGACLAFQGFRATEATIAIAYTVKAQIDYDPWGLINKS 183 Query: 187 RSMLDYQRDSEGQPLNCF 204 ++D D + F Sbjct: 184 SDIIDGMNDLNDYLVESF 201 >gi|153830633|ref|ZP_01983300.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] gi|148873874|gb|EDL72009.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] Length = 1426 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 40/304 (13%), Positives = 80/304 (26%), Gaps = 18/304 (5%) Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSM 151 L + K+ E + T + + + + + L + Sbjct: 678 ADLATLKVDVDLKDTDGSERFTDLTITGLPTGAVVQGGSLQDDGSWKVPASMLDKDGKVS 737 Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 + ++ TV + V + + + +P N G Sbjct: 738 LQVELPVGSQNIKVTVVAGSQEYGMDDQPIVGEVKYT--EADTGFMVKPSNPNGDNTVEG 795 Query: 212 VKSYSSQNGKVGIRDEKLSP-------YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSL 264 + G +G + P +V + S+ Y L + Sbjct: 796 GRGDDVLLGDIGGIKTSVEPGKNYNIALIVDTSGSMRYDLAGNQNATYDGWNDSWSQTPE 855 Query: 265 RHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIV 324 ++ + L DAL + + D TV + FN F + + + Sbjct: 856 QYAASRMKLTIDALKVLATQLADHD---GTVNITLIGFNGTAADALPF-NNLSAANLSDL 911 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 G T DA A + + +L+ + LTDG+ T N Sbjct: 912 IGKINLLIADGGTNYEDAFIEATK-----WFNTQPGSETDLKFENLTYFLTDGDPTVHNG 966 Query: 385 EGIA 388 + Sbjct: 967 DSST 970 >gi|148704834|gb|EDL36781.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_b [Mus musculus] Length = 574 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 43/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ A G TA DA+ + + Sbjct: 436 TYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVF--------GPIRD 487 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 488 SPNKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLRDMAS 538 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 539 KPKESHAFFTREFTGLEPIVSDVI 562 >gi|34481888|emb|CAE46492.1| trap [Plasmodium falciparum] Length = 331 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+K+ Sbjct: 72 AMKLIQQLNL---NESAIHLYVNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENASQLVVMLTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|227551457|ref|ZP_03981506.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895933|ref|ZP_05675586.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293377027|ref|ZP_06623237.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227179401|gb|EEI60373.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257832498|gb|EEV58919.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292644243|gb|EFF62343.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 477 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 36/283 (12%), Positives = 77/283 (27%), Gaps = 22/283 (7%) Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 ++ T+ ++ + +S L LSL ++ + Sbjct: 140 TDAKEHKKETEVSTSNTDETKEGVSEENENSVSPFTLPRLSLPAVTVSTNQTISTEYTTD 199 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + V+D + + N + + SY G Sbjct: 200 DQGTYPKANWQPTGNTNVLDHQGNKNGSNQWDGI---NSWDGDPNDRTHSYIEYGGTGNQ 256 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF------VDSSSLRHVIKKKHLVRDAL 278 D + + + +Y + VD S + + V+ + Sbjct: 257 ADYAIRKFAKETTTPGLFDVYLNARGNVQKDITPLDLVLVVDWSGSMNDNDRIGEVKIGV 316 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF-AIDENEMGST 337 + ++ + D + MG ++ + + + + + G T Sbjct: 317 DRFVDTLAD-SGITDKINMGYVGYSSEGHNYSNGTVQMGSFDSVKNQVKSITPSWTNGGT 375 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 ++ A D + N KK IVLLTDG T Sbjct: 376 FTQKGLRDAGDMLSVPN-----------GHKKVIVLLTDGVPT 407 >gi|149377596|ref|ZP_01895335.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] gi|149358138|gb|EDM46621.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] Length = 718 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 37/304 (12%), Positives = 81/304 (26%), Gaps = 26/304 (8%) Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL---NCFG 205 + ++ + + G+ + V S + + + Sbjct: 247 QISPFTVRPEDVGPDSHRATIALEINAGLPVSRVSSPSHRVTSTWNGNTVEVAPESRAVL 306 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 D V+ ++ + + +L G +D+S Sbjct: 307 MDRDLIVRWAPARGMEPAAAVFQEHWQGKDYLLALVVPGLNGNRSLPRELVFVIDTSGSM 366 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 R AL + ++ D R FN + S S + Sbjct: 367 AGES-IRQARQALLRGLGTLDADD------RFNVIQFNSQTHSLFMESVPASGNNIARAR 419 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + N G T + A+ A +T E R+ + +V +TDG ++ Sbjct: 420 RYVKGLNADGGTEMAPALDAALETNGDGGEASRARV-------RQVVFITDGAVGNESAL 472 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 I + S R+ T+ F+ A ++ + ++ + + Sbjct: 473 FGKIRDGLGSS--RLFTVGIGSAPNMH-----FMREAARYGRGTYTAISDLSDVARPLDE 525 Query: 444 RIGN 447 G Sbjct: 526 LFGK 529 >gi|320169699|gb|EFW46598.1| hypothetical protein CAOG_04556 [Capsaspora owczarzaki ATCC 30864] Length = 501 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 42/121 (34%), Gaps = 14/121 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K LV+ L +++ ++ D R+ ++ V D + S + R+ Sbjct: 58 TKLELVKKTLETLVAQLRACD------RLALVTYDTEVTLDLALSP-MDDKGRSKATQVV 110 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + ST ++ + + + + D + + ++LLTDG Sbjct: 111 NGIRDGSSTNLSGGLLEGLNILRNRPTDSRREVSS-------VLLLTDGLANVGISTTEG 163 Query: 389 I 389 I Sbjct: 164 I 164 Score = 40.0 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 7/58 (12%), Positives = 18/58 (31%), Gaps = 7/58 (12%) Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGNEIFE 451 +G + T + + L A +++ + + F D +G + Sbjct: 228 EGCTVYTFGYGSDADSN-----MLKAISDAGSGTYYFIENKDTVATSFGDCLGGLVSV 280 >gi|319648589|ref|ZP_08002803.1| YabS protein [Bacillus sp. BT1B_CT2] gi|317389356|gb|EFV70169.1| YabS protein [Bacillus sp. BT1B_CT2] Length = 245 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 24/63 (38%), Gaps = 9/63 (14%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-----ASP 425 I+LLTDG + + + A+ AK QGI + I + E + + A Sbjct: 9 ILLLTDGCSNRGE-DPQAMAAFAKEQGITVNVIGIM---DEHEMDQEAMKEVEGIALAGG 64 Query: 426 NSF 428 Sbjct: 65 GVH 67 >gi|254426967|ref|ZP_05040674.1| TadE-like protein [Alcanivorax sp. DG881] gi|196193136|gb|EDX88095.1| TadE-like protein [Alcanivorax sp. DG881] Length = 150 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 9/81 (11%), Positives = 28/81 (34%), Gaps = 5/81 (6%) Query: 16 KSCTGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTA---IITASVPLIQS 71 + G + +L P ++ + G V + ++ ++ A A + +Q Sbjct: 5 RKQAGAVALEFLMLFPFVVAMLYGAAV-YGLTFFAQYRMQDAVDNAVSTALYIDRSALQG 63 Query: 72 LEEVSSRAKNSFTFPKQKIEE 92 +S + + + + + Sbjct: 64 AALGASVTQRANSALAGLVTQ 84 >gi|193788347|dbj|BAG53241.1| unnamed protein product [Homo sapiens] Length = 325 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 16/152 (10%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + ++I ++ D T A+ + Y + + E + I+LL Sbjct: 102 DMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILL 161 Query: 375 TDGENTQDNEEGIAICNKAKS-QGIR--------IMTIAFSVNKTQQEKARYFLSNCASP 425 TDG++ + + I I I + S Sbjct: 162 TDGKSNMGGS-PKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGE 220 Query: 426 NSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 F T L+++F V ++T Sbjct: 221 RHAFILQDTKALHQVFEHM------LDVSKLT 246 >gi|52078560|ref|YP_077351.1| YabS [Bacillus licheniformis ATCC 14580] gi|52783922|ref|YP_089751.1| YabS [Bacillus licheniformis ATCC 14580] gi|52001771|gb|AAU21713.1| YabS [Bacillus licheniformis ATCC 14580] gi|52346424|gb|AAU39058.1| YabS [Bacillus licheniformis ATCC 14580] Length = 245 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 24/63 (38%), Gaps = 9/63 (14%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-----ASP 425 I+LLTDG + + + A+ AK QGI + I + E + + A Sbjct: 9 ILLLTDGCSNRGE-DPQAMAAFAKEQGITVNVIGIM---DEHEMDQEAMKEVEGIALAGG 64 Query: 426 NSF 428 Sbjct: 65 GVH 67 >gi|291443250|ref|ZP_06582640.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291346197|gb|EFE73101.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 597 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 44/432 (10%), Positives = 102/432 (23%), Gaps = 44/432 (10%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 L G + D VR AA A + + + S+R + + Sbjct: 178 LVAGALGSDGVRL--------GAADPARSATGLLALAGIGASSARQGGDSDTRVAQTAKV 229 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP---LSLFLRS 150 L + + + + + ++ L LF Sbjct: 230 LAERMSDGDAQVLETLARSTSGAEEGNPKRNQAVLISEQAAFTHNAEATGAGKLDLFYPE 289 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 G E + + ++ + + Sbjct: 290 DGTPLLDYPYTLVNEPQLSTTESRAALRFMNLLGDREARATFAEHGFRAGDGSAEDSLVA 349 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 Q + + + + L + + + + Sbjct: 350 AAGGRKPQPYAEPAAEAPSAEALQETLGMWTITVQSARLTTVVDASGSMATLVPGRNQSR 409 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + +++L + D +G F + G R + D Sbjct: 410 MDVTKESLIQALDQFTPND------EIGLWEFATTL-------DGEKDYRRLMETKRLGD 456 Query: 331 ENEMGST------AINDAMQ---TAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGEN 379 G T A +Q + + ++ K+ +V+LTDG N Sbjct: 457 PAAGGGTHREKLTAAFAGLQPVPGGATGLYDTTLASYKEARSTYVKGKFNALVILTDGSN 516 Query: 380 -TQDNEEGIAICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + K + + ++ IA + + E A + +E + Sbjct: 517 QDTNGISRSGLITELKELVDPERPVPVIAIAVGPDADRDEVAEI---ARITGGDGYEVSD 573 Query: 434 THELNKIFRDRI 445 E+ + I Sbjct: 574 PAEIQAVILQAI 585 >gi|260789946|ref|XP_002590005.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] gi|229275192|gb|EEN46016.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] Length = 159 Score = 40.7 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 17/142 (11%), Positives = 41/142 (28%), Gaps = 12/142 (8%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + R+G F+DR + + H ++ + G T + Sbjct: 30 VNYFTLGENDTRVGVVTFSDRDRQNTRVTLNEHYTRVELLTEIRDIPYDRGHTYTGLGLD 89 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 N + +++++TD E+ D + + GI + + Sbjct: 90 HV------RNNSFLEVNGRRNNTLDFLIVITDDESEDDIVRPAQL---IRQMGITVFVVG 140 Query: 405 FSVNKTQQEKARYFLSNCASPN 426 + ++ L A Sbjct: 141 VGQES---DISQPTLETIAGDP 159 >gi|254780914|ref|YP_003065327.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254780929|ref|YP_003065342.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040591|gb|ACT57387.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254040606|gb|ACT57402.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 65/446 (14%), Positives = 128/446 (28%), Gaps = 57/446 (12%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L K G F +ITA+L+ + + ++VD V L++ +I + P Sbjct: 13 RLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDH-VIYRTSPKNLYD 71 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + R ++ + Y R+ N + ++ V P + + Sbjct: 72 LREAGRDNFIRHQIEKALNTYNSRDLSN--IGSIESIVKDAVILTKNVNSLPLQFTVDIA 129 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 LS+ L + L +F +S G + K + W+ Sbjct: 130 LSTTVQLRGSLLQMFSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLASR-----GK 184 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 DS+ P + K + + ++ +Y + Sbjct: 185 VADSKVHPPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFCMA---PYHYSSILYWAVGT 241 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 L+ +++ + D T F V S Sbjct: 242 LTYSVDNKTTTREYYK--------------------DPYYATWDHFPYSFIKNVFDMTSN 281 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 +G ++ S M + + ++++E +K Sbjct: 282 QFG----DGQVLTNTNHCFPHGASQNKYMLMLAIGNQLS----------RSSVEKEKIEK 327 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEA 431 +L D + I I ++ FS ++ RY L CAS S ++E Sbjct: 328 VLQDCHYMHKRHRTG------RDA-ITIFSVGFSPDQD----TRYTLRQCASDPSKYYEI 376 Query: 432 NSTHELNKIFRDRIGNEIFERVIRIT 457 NS + I + N I + T Sbjct: 377 NSDENVMPIAKSLARNVITNWFSQFT 402 >gi|114652511|ref|XP_509886.2| PREDICTED: coagulation factor C homolog, cochlin isoform 7 [Pan troglodytes] Length = 540 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 402 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 453 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 454 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 504 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 505 KPKESHAFFTREFTGLEPIVSDVI 528 >gi|227827418|ref|YP_002829197.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584633|ref|YP_002843134.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619574|ref|YP_002914399.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459213|gb|ACP37899.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019682|gb|ACP55089.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380643|gb|ACR41731.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 356 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 32/104 (30%), Gaps = 23/104 (22%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRI 400 + + ++ K + K I++LTDG+ T K +I Sbjct: 107 TRLHEAVNFTINLAKQSQVPTK-IIMLTDGKPTDKR--------NVKDYEKLDIPPNTQI 157 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +TI + L A S F+ EL IF Sbjct: 158 ITIGIGNDYN-----ERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|209527388|ref|ZP_03275895.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492179|gb|EDZ92527.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 396 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 38/98 (38%), Gaps = 5/98 (5%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--DEVHRMKNNLEAKKYIVLL 374 + + +T + + +T I + A + + + V E + ++LL Sbjct: 147 NDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSGRPEPRLSVILL 206 Query: 375 TDGENTQDNEEGI--AICNKA-KSQGIRIMTIAFSVNK 409 +DG + Q NE+ + ++ I + T+ + + Sbjct: 207 SDGFHNQPNEQQDFDNLITLLERNNNIIVHTLGYGLTP 244 >gi|198426626|ref|XP_002122822.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1823 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 90/363 (24%), Gaps = 14/363 (3%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 + +I EN + + D + ++ + + Y + + P + + Sbjct: 742 AKNVILAEENGNRPSVVDVVLTITDGRAQDDVGNISRELRANGALTYVIGVQPGNK-KKL 800 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + I + E + S + S R Sbjct: 801 DENELLEIAGRKENMFLVTSGFAALDQQFAEKLSQSICTSQCGRPPVPIVSAEIQCLTSL 860 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH---FVDSSSLRHV 267 + + S G + S + L LSE+ + + Sbjct: 861 SAPTNGSIVCSDGYNLGSRCEFACEVTPSQSFTLVGNTESVCLSEDGLTASWSHPAPTCI 920 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + V + + + T V + + + Sbjct: 921 RTQCSQVPPIPNGRMTCLDGNFVNSVCTFHCETSNGYSVYPETNNEITCLSNGTWDKQVP 980 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + T A++ A D + S + + ++ +TDG + + Sbjct: 981 CCSKEYHLGTLTGQALRHAKDVMFSEENGNRPGVVD------VVLTITDG---RSQDHVR 1031 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST-HELNKIFRDRIG 446 + ++ G+ I ++ L S + F L++ F ++ Sbjct: 1032 NVSRALRANGVLTYVIGIQPGNNKKLDKNELLDIAGSRENMFLVTGGFSALDQQFAYKLS 1091 Query: 447 NEI 449 I Sbjct: 1092 RSI 1094 >gi|156409369|ref|XP_001642142.1| predicted protein [Nematostella vectensis] gi|156229283|gb|EDO50079.1| predicted protein [Nematostella vectensis] Length = 182 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 53/139 (38%), Gaps = 20/139 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+ A ++ R F + + R G T+ A++ A + Sbjct: 44 TRVAAIVYSSRPRVA--FDFNRYTSARRAAHAVKRLRFLRGGTSTGRALRLASSRLFRR- 100 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + +K ++L+TDG+++ D + K +G++I + ++ ++ E Sbjct: 101 --------YGRKRRKVLMLITDGKSSDDVLKPS---KALKRKGVQIFAVGVGMSVSRNEL 149 Query: 415 ARYFLSNCAS-PNSFFEAN 432 AS P+ ++A+ Sbjct: 150 IL-----IASHPSQVYQAS 163 >gi|116255769|ref|YP_771602.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115260417|emb|CAK03521.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 545 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 16/198 (8%), Positives = 50/198 (25%), Gaps = 2/198 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 K+L + G I + + + +++ + Y + + A A + + Sbjct: 4 TLIKRLHRDERGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAALVHADWE 63 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + L ++ S + L + P Sbjct: 64 ARYLNIMAMNNVASSQATVVMATSVAFQLTTAELALRSGVILAKLAEYSFTDGFGPASLL 123 Query: 129 YQVVLSSRYDLL--LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + + + + + ++ + ++ + + +I S Sbjct: 124 PPMPPMPHCPGWQKIPVIGGIIYGACLAFQGVRAIGATKAIAYTVKAQIDYDPWGLIVKS 183 Query: 187 RSMLDYQRDSEGQPLNCF 204 ++D D + F Sbjct: 184 SDIIDAMNDLNDYLVESF 201 >gi|73970088|ref|XP_531792.2| PREDICTED: hypothetical protein XP_531792 [Canis familiaris] Length = 1465 Score = 40.7 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 54/183 (29%), Gaps = 25/183 (13%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 + ++ + + K LV+D + I+ K + V+ A + Sbjct: 702 FGKVHDDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQLKYKRKFNFVQFDAQAVAWQEKL 761 Query: 309 DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAK 368 + + D ST A+Q A+ + Sbjct: 762 VEINEDNLRGAQS-----WIRDIQIGSSTNTLHALQIAF----------------ADKET 800 Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + I LLTDG Q + + K Q I I TI+F+ N L++ Sbjct: 801 QVIYLLTDGRPDQP---PEMVIEQVKVFQKIPIYTISFNYNDEIANGFLKELASLTGGEF 857 Query: 428 FFE 430 F Sbjct: 858 HFY 860 >gi|317124425|ref|YP_004098537.1| hypothetical protein Intca_1292 [Intrasporangium calvum DSM 43043] gi|315588513|gb|ADU47810.1| hypothetical protein Intca_1292 [Intrasporangium calvum DSM 43043] Length = 159 Score = 40.7 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 41/164 (25%), Gaps = 11/164 (6%) Query: 5 TKFIFYSKK--LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 ++ S + ++ G ++ L ++L VDV L + Sbjct: 2 SRLFSSSSRTRMVSGEDGQIAVLILGLFVIVLVFILGAVDVTAAQLARMRL--------L 53 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + + + + I + V Sbjct: 54 DTADAVALDAADALDATAVYRGGVAGPLVLTDASVRAAAQEHLARTPTPPGITEWSLVAA 113 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIK-SWLIQTKAEAE 165 + ++ R L L S+G + + ++A A Sbjct: 114 TGSRDGRTAEVTLRGRAALPMTGWILESLGGGVTVTVTSRARAP 157 >gi|299529294|ref|ZP_07042734.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] gi|298722738|gb|EFI63655.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] Length = 1449 Score = 40.7 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 42/363 (11%), Positives = 90/363 (24%), Gaps = 18/363 (4%) Query: 97 NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSW 156 N E+ V + + + SA L+ + R + + Sbjct: 595 NIEDYNLAAPGAISVSEALASGSGTFTITASAGLKSLTLDGPDNADATLTLDRLADLVNN 654 Query: 157 LIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYS 216 + T++ + + + + S D+ Q K+ Sbjct: 655 PVTLTTSKGTLTLTGYDATTGKVSYTYQTSGQQAHTGDDTNVQDHFQITVEDKFGGKATG 714 Query: 217 SQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 + L P S S + LD S S + + + + +++ Sbjct: 715 DLGVLITDTAPSLKPIAESSALSSHGTNIMLTLDTSGSMNYGSGVYNGWTQLSRLAVLKS 774 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 ++ +++ + VR+ FN G + K G Sbjct: 775 SVNNLLDKYGEA----GDVRVMIVEFNTSASQK-----GGGWMSLAEAKALVSGLGYGGG 825 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK---- 392 T A+ TA + + + + TDGE + A Sbjct: 826 TNYQTALDTAMN----AWNNSGTGKLEGGNVQNISYFFTDGEPDSNRSVNSAQQATWEKF 881 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLS-NCASPNSFFEANSTHELNKIFRDRIGNEIFE 451 I I T + + + ++ + L+ I I Sbjct: 882 LADNHINSYGIGLGTGATGGYIDPISYNPDRPADSNTILVKDLNGLDTAIEGTIAPPING 941 Query: 452 RVI 454 + Sbjct: 942 DIA 944 >gi|321466601|gb|EFX77596.1| hypothetical protein DAPPUDRAFT_30003 [Daphnia pulex] Length = 3316 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 28/80 (35%), Gaps = 12/80 (15%) Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 EA K + L+TDG + + + + QG+RI T + L + A Sbjct: 6 RSEAVKAVFLITDGYSNGGDPRP--AAKRLREQGVRIFTFGI------RNGNVKELYDMA 57 Query: 424 SPNSFFEANSTHELNKIFRD 443 S A + F + Sbjct: 58 SEP----AQEHSYIVDSFEE 73 >gi|332223236|ref|XP_003260773.1| PREDICTED: cochlin [Nomascus leucogenys] Length = 550 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 412 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 463 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 464 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|325678275|ref|ZP_08157903.1| hypothetical protein CUS_5520 [Ruminococcus albus 8] gi|324110038|gb|EGC04226.1| hypothetical protein CUS_5520 [Ruminococcus albus 8] Length = 1201 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 30/304 (9%), Positives = 77/304 (25%), Gaps = 24/304 (7%) Query: 14 LIKSCT--GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ- 70 K G I+ LL+ + V ++++ R+ + L + + + Sbjct: 6 FPKHRRVSGTISIMLTLLLLPIYSVLSIIIEGARYQSAKQQLDELTYLGQLAILADYMDF 65 Query: 71 ----------SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + + +E + K N D + Sbjct: 66 LEENYDLYSFYSASDAEGKQMLDESFQYYVESATSAGGIDTTKLNKLFNLPMDCCTVEGM 125 Query: 121 EMNPRKSAYQVVLS--SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + +Y + + LS F M + A+ +S + Sbjct: 126 YSLADPEILKYQIKQYGKYRMPADILSDFSFKMIFDLLESKLGGVAKKLSVVNAASKQMK 185 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD---------RTVKSYSSQNGKVGIRDEKL 229 I + + +S ++ F D + + + Sbjct: 186 DASSIMNNAKAVFSAEESLTSNISTFSGKYDEFQNKKSEVTDWNIDQATYDIIKNYTDAD 245 Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + +LY L + +++ I+ + A V I+ Sbjct: 246 MNAQTNTANNLYSRLKSCMSSYQSTANTYLNYVEQIKDIESQSNWSSDSAKVAEHDSLIE 305 Query: 290 NVND 293 + ++ Sbjct: 306 SRDN 309 >gi|281350861|gb|EFB26445.1| hypothetical protein PANDA_015099 [Ailuropoda melanoleuca] Length = 1741 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 +++ + G+T A+ ++ K +L+TDG Sbjct: 188 NTKDEVIEAVRNLPYKGGNTLTGLALNYIFE------NSFKPEAGARAGVSKIGILITDG 241 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 242 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADENELREIASEPDSTHVYNVAE 291 >gi|239832734|ref|ZP_04681063.1| TadE family protein [Ochrobactrum intermedium LMG 3301] gi|239825001|gb|EEQ96569.1| TadE family protein [Ochrobactrum intermedium LMG 3301] Length = 180 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 43/124 (34%), Gaps = 12/124 (9%) Query: 1 MVFDTKFIFYSKKLIK---------SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEH 51 ++ +F F + + + + +G + A+L PV L + ++ + + Sbjct: 24 IMGRKRFSFVTDRFRRIPLLLFFNRNKSGTAAVEFAILAPVFLLILMGMIAFGLYLGVAN 83 Query: 52 ALKQ-AAQT--AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 A++Q AA + P + S KN+ + E+ + N Sbjct: 84 AVQQLAADATRTALAGIDPPERLALATSYIRKNAAKYSLIDPEKMQVNVDNAQSDPNQFT 143 Query: 109 REVR 112 ++ Sbjct: 144 VTIQ 147 >gi|227326308|ref|ZP_03830332.1| hypothetical protein PcarcW_02944 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 190 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 39/145 (26%), Gaps = 4/145 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +S G A L+PV+L ML ++ R A A A+ + Sbjct: 29 RHWRSTRGVIATEVAFLVPVVLVGVMMLFELARIGLVIAAGSAALDKAVQAF-RLDNLAS 87 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLK---KNFTDREVRDIVRDTAVEMNPRKSAY 129 + + YL + N + + N ++ Sbjct: 88 DSAEQMGARLKARMVEAGYGYLKEDDLTVSVLHFDNLSQLGGLTNGNGSQDNPNGEETTT 147 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIK 154 V S + ++ + + Sbjct: 148 LPVWSVTVQIKKAFITPLPEVLTLG 172 >gi|3273293|dbj|BAA31190.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 52/184 (28%), Gaps = 12/184 (6%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 ++ + E ++ + V+ A+ ++ I++++ Sbjct: 23 VNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWVKHAVPLAMKLIQQLNLN 82 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEMGSTAINDAMQTAYD 348 + + F++ K I+K+ G T + DA+ Sbjct: 83 ESAIHLYVNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRK 142 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 + N A + +V+LTDG + K +G++I Sbjct: 143 HLNDRI--------NRENASQLVVILTDGIPDSIQDSLKES-RKLNDRGVKIAVFGIGQG 193 Query: 409 KTQQ 412 Sbjct: 194 INVA 197 >gi|332664650|ref|YP_004447438.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333464|gb|AEE50565.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 345 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 43/116 (37%), Gaps = 15/116 (12%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 R+G F + + ++++ D T+I +A+ A SN Sbjct: 130 RLGVILFAGEAYMQVPLTTDYEAV-SLLLQSANPDMISSQGTSIGEALAIAQTNTSKSNG 188 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + ++++TDGE+ + E A +A G++I TI + Sbjct: 189 N------------RVVLVITDGEDHEARAEAQAR--QAARAGMKIFTIGIGSEEGG 230 >gi|126667415|ref|ZP_01738387.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] gi|126628171|gb|EAZ98796.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] Length = 774 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 42/132 (31%), Gaps = 27/132 (20%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK---- 392 T + A++TA D + + ++LTDG+ Sbjct: 134 TNLGAAIETASDGYFTGGVLSNT----------HFIVLTDGKVDISRNPSANKAEANRIL 183 Query: 393 ------AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDR 444 K QG R +A S FL AS SF A + ++L++ F D Sbjct: 184 DTLVPPLKQQGARFHAVALSAEAD-----TEFLRKLASDSNGSFHVAENANDLSRAFLDA 238 Query: 445 IGNEIFERVIRI 456 + + I Sbjct: 239 FNAAVPQEQTPI 250 >gi|111026119|ref|YP_708402.1| hypothetical protein RHA1_ro10051 [Rhodococcus jostii RHA1] gi|110824962|gb|ABH00244.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 153 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 14/89 (15%), Positives = 27/89 (30%), Gaps = 1/89 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G ++T + PV++ G++VD + A+ A A +I S+ Sbjct: 17 HRDKGAVTLMTVVFAPVLIFFVWGLIVDGGGMLTADQRADNVAEDAARAAGQQIIGSVSA 76 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 + YL + Sbjct: 77 RGIDTVVDPVRATAAAKRYLFEAGVDGDV 105 >gi|109090611|ref|XP_001091779.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 3 [Macaca mulatta] Length = 781 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 48/178 (26%), Gaps = 25/178 (14%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAIN 340 I +D + VR+GA F+ + + VK G T Sbjct: 75 ITVCDALDISTERVRVGAFQFSSTPHLEFPLD---SFSTQQEVKAKIKRMIFKGGHTDTG 131 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ + N + +V++TDG++ + K +G+ + Sbjct: 132 LALKYLLNRGFPGGR--------NASVPQILVIVTDGKSQGHVALP---AKQLKEKGVTV 180 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI----FRDRIGNEIFERVI 454 + + L AS ++ F + I Sbjct: 181 FAVGV------RFPRWEELHALASEPREQHVLLAEQVEDATNGLFSTLSNSAICSSTT 232 >gi|109090613|ref|XP_001091550.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 1 [Macaca mulatta] Length = 725 Score = 40.7 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 48/178 (26%), Gaps = 25/178 (14%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS-TAIN 340 I +D + VR+GA F+ + + VK G T Sbjct: 75 ITVCDALDISTERVRVGAFQFSSTPHLEFPLD---SFSTQQEVKAKIKRMIFKGGHTDTG 131 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ + N + +V++TDG++ + K +G+ + Sbjct: 132 LALKYLLNRGFPGGR--------NASVPQILVIVTDGKSQGHVALP---AKQLKEKGVTV 180 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI----FRDRIGNEIFERVI 454 + + L AS ++ F + I Sbjct: 181 FAVGV------RFPRWEELHALASEPREQHVLLAEQVEDATNGLFSTLSNSAICSSTT 232 >gi|154250681|ref|YP_001411505.1| TadE family protein [Parvibaculum lavamentivorans DS-1] gi|154154631|gb|ABS61848.1| TadE family protein [Parvibaculum lavamentivorans DS-1] Length = 187 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 37/131 (28%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+ +++C G + AL+ PVM+ ++ + A TA + Sbjct: 16 LKRFLRNCAGIAAVEFALIFPVMIAFYFGSIETTNMLTANRRVTSVAYTAADITAQATSI 75 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 S +++ S L + + + +V + + Sbjct: 76 SNSDLADIFAASSAILAPFSTTPLKVRITSVVAYSSNIAKVAWSDGLNIAPRSTGSTVSL 135 Query: 131 VVLSSRYDLLL 141 + + Sbjct: 136 PSGLTTAGSSV 146 >gi|217424378|ref|ZP_03455877.1| TadE family protein [Burkholderia pseudomallei 576] gi|217392843|gb|EEC32866.1| TadE family protein [Burkholderia pseudomallei 576] Length = 594 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 428 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 487 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 488 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 547 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 548 PDPASLAGSMNLV 560 >gi|76818230|ref|YP_336470.1| hypothetical protein BURPS1710b_A1313 [Burkholderia pseudomallei 1710b] gi|76582703|gb|ABA52177.1| putative membrane protein [Burkholderia pseudomallei 1710b] Length = 722 Score = 40.7 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 556 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 615 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 616 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 675 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 676 PDPASLAGSMNLV 688 >gi|254475237|ref|ZP_05088623.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214029480|gb|EEB70315.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 199 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 14/37 (37%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGM 39 T ++ + G + A MP++LGV Sbjct: 1 MLTSLRTLFRRYRRDTDGSVSVEFAFYMPLLLGVFAA 37 >gi|194207263|ref|XP_001489838.2| PREDICTED: coagulation factor C homolog, cochlin (Limulus polyphemus) [Equus caballus] Length = 549 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + + Sbjct: 411 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPMRD 462 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 463 SPNKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 513 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 514 KPKESHAFFTREFTGLEPIVSDVI 537 >gi|261820222|ref|YP_003258328.1| hypothetical protein Pecwa_0903 [Pectobacterium wasabiae WPP163] gi|261604235|gb|ACX86721.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 212 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 36/89 (40%), Gaps = 2/89 (2%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++F+ + S + TAL P++L +G + D+ ++Q I + Sbjct: 15 SRFLSRLRGFWFSRRASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQRV--GAIAS 72 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 + + Q+L+E + PK+ Y Sbjct: 73 ILAMQQALDEQGLQGLLDTVLPKEGTGNY 101 >gi|284173928|ref|ZP_06387897.1| hypothetical protein Ssol98_04615 [Sulfolobus solfataricus 98/2] gi|261602665|gb|ACX92268.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 356 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 32/105 (30%), Gaps = 23/105 (21%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRI 400 + + ++ K + K I++LTDG+ T K +I Sbjct: 107 TRLHEAVNFALNLAKQSQVPNK-IIMLTDGKPTDKR--------NVKDYEKFDIPPNTQI 157 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +TI + L A S F+ EL +F Sbjct: 158 ITIGIGSDYN-----ERILKKLADKSSGKFYHLKDISELPDVFES 197 >gi|145614348|ref|XP_365937.2| hypothetical protein MGG_10157 [Magnaporthe oryzae 70-15] gi|145021285|gb|EDK05414.1| hypothetical protein MGG_10157 [Magnaporthe oryzae 70-15] Length = 754 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 53/173 (30%), Gaps = 9/173 (5%) Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 ++Y + + + D + K A + + + D Sbjct: 7 TVYIIDLGSSMGDCNNGRTESDLDWSMRYVWDKLCTTTAASRKTWQVGVVGVRTDVTDNP 66 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN--DAMQTAYDTIISSNED 356 ++ ++ S V + + +K ++ + A+ A I + Sbjct: 67 LDDAGEKGYNNISVLQNVAPMTLSALKLLQDKLVPSSTSDGDVVSAIVVAIAMIDEAAP- 125 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 L+ K+ IVL+TDG+ D ++ I + I I I + Sbjct: 126 ------QRLKYKRKIVLVTDGQGAIDGDDVGDIARRINELNIEITIIGVDFDD 172 >gi|110681634|dbj|BAE98272.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium gallinaceum] Length = 1875 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 56/166 (33%), Gaps = 8/166 (4%) Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 +D S+ I K V +++++K +N Sbjct: 720 CKGDECFCHNYYDLTLILDESASIGHINWKEQVAPFTEEIVKNLKVSENEVHIGIFLFAQ 779 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 FN + ++ V+ D E G T I +A++ Sbjct: 780 FNRDFVKFSEEESYKKNILTQKVENLKNDYKEGGYTYIVEALEYGLQNYTK-------DP 832 Query: 362 KNNLEAKKYIVLLTDGENTQDNEE-GIAICNKAKSQGIRIMTIAFS 406 + +A K +L TDG NT +E+ I I + K++ ++++ + Sbjct: 833 SSRSDASKVTILFTDGNNTNPSEDILINISSMYKNEQVKLLVLGVG 878 >gi|56675030|gb|AAW19657.1| matrilin-3 [Cervus elaphus] Length = 146 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 41/136 (30%), Gaps = 19/136 (13%) Query: 317 HKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +K T A+QTA D + K +++T Sbjct: 18 THSDKQSLKRAVARITPLSTGTMSGLAIQTAMDEAFTVEAGARGPS---SNIPKVAIIVT 74 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEAN 432 DG + ++ + +A++ GI + + L AS F Sbjct: 75 DG---RPQDQVNEVAARARASGIELYAVGV------DRADMESLKMMASEPLDEHVFYVE 125 Query: 433 S---THELNKIFRDRI 445 + +L+ F++ Sbjct: 126 TYGVIEKLSSRFQETF 141 >gi|323474472|gb|ADX85078.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477209|gb|ADX82447.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 356 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 32/105 (30%), Gaps = 23/105 (21%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRI 400 + + ++ K + K I++LTDG+ T K +I Sbjct: 107 TRLHEAVSFTINLAKQSQVPTK-IIMLTDGKPTDKR--------NVKDYEKLDIPPNTQI 157 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRD 443 +TI + L A S F+ EL IF Sbjct: 158 ITIGIGNDYN-----ERILKKLADRSSGKFYHIKDISELPNIFES 197 >gi|145493674|ref|XP_001432832.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124399947|emb|CAK65435.1| unnamed protein product [Paramecium tetraurelia] Length = 618 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 40/129 (31%), Gaps = 14/129 (10%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 A +I S+ + + F ++ S + + + +V Sbjct: 452 DRFNQAQNGLISSLSSAKDNQNIRVTIIIFNDNARCVVDSQTINMQTIKNAVVCN----- 506 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T+ A Q AY I + E I TDG ++ + N Sbjct: 507 --GGGTSFQSAFQLAYQKIAAVKNFEQFNKH-------VIFFYTDGGDSYPTQALNQFAN 557 Query: 392 KAKSQGIRI 400 ++Q ++I Sbjct: 558 LPQAQRMKI 566 >gi|152988351|ref|YP_001350200.1| hypothetical protein PSPA7_4864 [Pseudomonas aeruginosa PA7] gi|150963509|gb|ABR85534.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7] Length = 556 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 56/172 (32%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G I+ A + + L ++VD R + L++ A A + ++ ++ S Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQTS 66 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 ++A + L + + + + R ++ N V + Sbjct: 67 AQATSFAKASASLNGFDANAAGSGLLAEVGGVQSLGGLRRFSSSASNVAVDNEAVRVLVS 126 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 + + ++ G + + +AEA + ++ S+S Sbjct: 127 KSVPGSLVANLAGLFGGANANVNLRAEAVARRLPSATISAGTGLASLNSSQS 178 >gi|115525743|ref|YP_782654.1| TadE-like protein [Rhodopseudomonas palustris BisA53] gi|115519690|gb|ABJ07674.1| TadE-like protein [Rhodopseudomonas palustris BisA53] Length = 213 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 13/100 (13%), Positives = 28/100 (28%), Gaps = 8/100 (8%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L + G + AL++P+ML + ++ + A + + + Sbjct: 19 RLRRDTRGLGAVEFALIVPLMLAMIFGVIQISSGI--------AIDRKVSMVTQTTSDLV 70 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 A+ L LK T + Sbjct: 71 SRYKEVAEVDLDGIITIANAILTPYDSTPLKAKITQVYIN 110 >gi|307943135|ref|ZP_07658480.1| TadE family protein [Roseibium sp. TrichSKD4] gi|307773931|gb|EFO33147.1| TadE family protein [Roseibium sp. TrichSKD4] Length = 178 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 5/74 (6%), Positives = 25/74 (33%), Gaps = 4/74 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 +F + ++ ++ G I A++ + +++ + + + A+ Sbjct: 1 MFYSVIGRLLRRYTRNDQGVTAIEFAIVGTPFFMLIFGILEFGLAFFV-NRI---VDNAV 56 Query: 62 ITASVPLIQSLEEV 75 + + + + Sbjct: 57 LETARLVRTGQAKD 70 >gi|226360795|ref|YP_002778573.1| hypothetical protein ROP_13810 [Rhodococcus opacus B4] gi|226239280|dbj|BAH49628.1| hypothetical protein [Rhodococcus opacus B4] Length = 891 Score = 40.4 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 12/70 (17%), Positives = 22/70 (31%), Gaps = 14/70 (20%) Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-------------SPNSFFEA 431 I ++ +S GI + + Q + + A SP F+ A Sbjct: 203 RPAGIADQLRSSGIVTFAVGLAAGTAQPS-DFDLMRSIATGGDGACGKTTSPSPGDFYLA 261 Query: 432 NSTHELNKIF 441 + +L F Sbjct: 262 QNIDDLLFAF 271 >gi|315613111|ref|ZP_07888021.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] gi|315314673|gb|EFU62715.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] Length = 863 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 37/325 (11%), Positives = 75/325 (23%), Gaps = 67/325 (20%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + + + V D S SM + S T++ + + G + Sbjct: 66 TSKLGTETQTDPLDVVLVADLSGSMQNQDVQSFDGRTISRIDALKNTLRGTNGRKGLIDT 125 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF---VDSSSLRHVIKKKHLVRDALASV 281 + + ++ D ++ Sbjct: 126 ILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFRPYWPYERMTKYYDGVSPWDDANTI 185 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + V + + + ++ T I Sbjct: 186 LGWSNNARAAKTAVYNMSIAGGNSIGTESGI---------------------GTGTNIGA 224 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICN-KAKSQGIR 399 + A + S+ AKK ++LL+DG + G I N + I Sbjct: 225 GLTLANQLMGSA----------RSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIE 274 Query: 400 I---------------------------MTIAFSVNKTQQEKARYFL---SNCAS-PNSF 428 +I F + + AS PN Sbjct: 275 TAPQWFWDRLNNNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYMRQHNASIPNEI 334 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 F AN +L F+D + + Sbjct: 335 FSANDEDQLRDSFKDITDKILPLGI 359 >gi|257790640|ref|YP_003181246.1| hypothetical protein Elen_0884 [Eggerthella lenta DSM 2243] gi|257474537|gb|ACV54857.1| hypothetical protein Elen_0884 [Eggerthella lenta DSM 2243] Length = 210 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 42/144 (29%), Gaps = 16/144 (11%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G +I + + +L G+ VDV + Q A++ A + + Sbjct: 14 GQVAVIFVVAISCLLIAVGLGVDVSTAYSAKT-----GQEAVLEAVRQSSLGMSNAVKYS 68 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 +N ++++ E L N + + + L Sbjct: 69 ENPGREAREEVVELLSDNGYTGTAEIWYAELPE-----------SESGSNDRYAGVYVRL 117 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAE 163 + + FL+ G ++ A Sbjct: 118 SNDTETTFLKLAGRDKLTARSTAV 141 >gi|156390493|ref|XP_001635305.1| predicted protein [Nematostella vectensis] gi|156222397|gb|EDO43242.1| predicted protein [Nematostella vectensis] Length = 229 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 61/186 (32%), Gaps = 8/186 (4%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI--VKTFAIDE 331 +R A VI + I VR+ + D D SF VH + +K + Sbjct: 32 IRQAQKHVISISETISRTAYNVRLALVEYRDHPPQDKSFVTRVHDFTSDVKEMKVWVDKM 91 Query: 332 NEMGSTAINDAMQTAYDT---IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ--DNEEG 386 + G +++ A + + + + DG DN + Sbjct: 92 SASGGGDCPESVADAIFKACKLGYREDATKMCVLIADAPPHGLGFAHDGFPNGCPDNHDP 151 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 +A C+ +GI + TI + ++ + + +S + ++K+ D Sbjct: 152 LASCHVMAEKGITLYTIGCEPSVDSFRDFLVGMAAI-TGGQYVPLHSANVMSKMIIDGAL 210 Query: 447 NEIFER 452 E+ Sbjct: 211 EEVSLE 216 >gi|149051203|gb|EDM03376.1| coagulation factor C homolog (Limulus polyphemus) (predicted), isoform CRA_a [Rattus norvegicus] gi|169642483|gb|AAI60874.1| Coch protein [Rattus norvegicus] Length = 552 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 43/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ A G TA DA+ + D Sbjct: 414 TYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAISFTVRNVFGPIRDSP------ 467 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 468 --NKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 516 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 517 KPKESHAFFTREFTGLEPIVSDVI 540 >gi|293365338|ref|ZP_06612055.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307703880|ref|ZP_07640821.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] gi|291316788|gb|EFE57224.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307622715|gb|EFO01711.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] Length = 863 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 37/325 (11%), Positives = 75/325 (23%), Gaps = 67/325 (20%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + + + V D S SM + S T++ + + G + Sbjct: 66 TSKLGTETQTDPLDVVLVADLSGSMQNQDVQSFDGRTISRIDALKNTLRGTNGRKGLIDT 125 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF---VDSSSLRHVIKKKHLVRDALASV 281 + + ++ D ++ Sbjct: 126 ILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFRPYWPYERMTKYYDGVSPWDDANTI 185 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + V + + + ++ T I Sbjct: 186 LGWSNNARAAKTAVYNMSIAGGNSIGTESGI---------------------GTGTNIGA 224 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICN-KAKSQGIR 399 + A + S+ AKK ++LL+DG + G I N + I Sbjct: 225 GLTLANQLMGSA----------RSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIE 274 Query: 400 I---------------------------MTIAFSVNKTQQEKARYFL---SNCAS-PNSF 428 +I F + + AS PN Sbjct: 275 TAPQWFWDRLNNNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYMRQHNASIPNEI 334 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 F AN +L F+D + + Sbjct: 335 FSANDEDQLRDSFKDITDKILPLGI 359 >gi|153008591|ref|YP_001369806.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] gi|151560479|gb|ABS13977.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] Length = 165 Score = 40.4 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 15/118 (12%), Positives = 38/118 (32%), Gaps = 9/118 (7%) Query: 3 FDTKFIFYSKK------LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ- 55 +F ++ ++ +G + A+L PV L + ++ + +A++Q Sbjct: 14 ISDRFSNLHRRIPSLLFFKRNKSGTAAVEFAILAPVFLLILMGMIAFGLYLGVANAVQQL 73 Query: 56 AAQT--AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 AA + P + + KN+ + + + N + Sbjct: 74 AADATRTALAGIDPPERLTLATTYIQKNAAKYSLIDPAKMQVSVDNAQSDPNQFTVTI 131 >gi|239939820|ref|ZP_04691757.1| hypothetical protein SrosN15_02385 [Streptomyces roseosporus NRRL 15998] gi|239986306|ref|ZP_04706970.1| hypothetical protein SrosN1_03262 [Streptomyces roseosporus NRRL 11379] Length = 592 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 44/432 (10%), Positives = 102/432 (23%), Gaps = 44/432 (10%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 L G + D VR AA A + + + S+R + + Sbjct: 173 LVAGALGSDGVRL--------GAADPARSATGLLALAGIGASSARQGGDSDTRVAQTAKV 224 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP---LSLFLRS 150 L + + + + + ++ L LF Sbjct: 225 LAERMSDGDAQVLETLARSTSGAEEGNPKRNQAVLISEQAAFTHNAEATGAGKLDLFYPE 284 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 G E + + ++ + + Sbjct: 285 DGTPLLDYPYTLVNEPQLSTTESRAALRFMNLLGDREARATFAEHGFRAGDGSAEDSLVA 344 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 Q + + + + L + + + + Sbjct: 345 AAGGRKPQPYAEPAAEAPSAEALQETLGMWTITVQSARLTTVVDASGSMATLVPGRNQSR 404 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + +++L + D +G F + G R + D Sbjct: 405 MDVTKESLIQALDQFTPND------EIGLWEFATTL-------DGEKDYRRLMETKRLGD 451 Query: 331 ENEMGST------AINDAMQ---TAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGEN 379 G T A +Q + + ++ K+ +V+LTDG N Sbjct: 452 PAAGGGTHREKLTAAFAGLQPVPGGATGLYDTTLASYKEARSTYVKGKFNALVILTDGSN 511 Query: 380 -TQDNEEGIAICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 + + + K + + ++ IA + + E A + +E + Sbjct: 512 QDTNGISRSGLITELKELVDPERPVPVIAIAVGPDADRDEVAEI---ARITGGDGYEVSD 568 Query: 434 THELNKIFRDRI 445 E+ + I Sbjct: 569 PAEIQAVILQAI 580 >gi|118357550|ref|XP_001012024.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293791|gb|EAR91779.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 853 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 30/128 (23%), Gaps = 19/128 (14%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIIS 352 F S + + G T I + ++ Sbjct: 348 DSYFNVVSFGSSFEKLFPSSQKYNSQNLEQAVQIINQYSANLGGTEIYQPLHNVFNE--- 404 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKT 410 K K I LLTDG+ + N K RI +I F + Sbjct: 405 ---------KKIEGYNKQIFLLTDGQVD----NPKQVVNLIKKNNKFSRIHSIGFGNDAD 451 Query: 411 QQEKARYF 418 +Q Sbjct: 452 KQLIQETA 459 >gi|308473938|ref|XP_003099192.1| CRE-CLEC-62 protein [Caenorhabditis remanei] gi|308267665|gb|EFP11618.1| CRE-CLEC-62 protein [Caenorhabditis remanei] Length = 352 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 30/94 (31%), Gaps = 7/94 (7%) Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A + + + K+++++ N + I + K+ G+ I Sbjct: 90 GIAAAEQVLTAG-----RKNNTRSNYKQFVLIYASAYNDDGMGDPRPIAERLKTSGVNIA 144 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 T+AF Q + ASP F Sbjct: 145 TVAFDQTGDQD--ILKAIGEIASPGFNFTNKDED 176 >gi|307729999|ref|YP_003907223.1| hypothetical protein BC1003_1969 [Burkholderia sp. CCGE1003] gi|307584534|gb|ADN57932.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1003] Length = 602 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 18/165 (10%), Positives = 45/165 (27%), Gaps = 6/165 (3%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + + G ++ A+ + + + G VD+ + Y L++ A A + + Sbjct: 28 VARRQRGAIALLAAVWVSLAVIALG-AVDIGHFYYARRDLQRTADLAAAAGVQLIASAGG 86 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 ++ + + +A QVV+ Sbjct: 87 CAAAANSAQLNASANGLPADGTVQSSCGRWDPSANAGQSYFAAS-----GAPINALQVVV 141 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 S L S + +A + + S + ++ Sbjct: 142 SRPVPFLFLIGQTRELSASAVAQATNVRAFSLSTSIAGLSGGALN 186 >gi|289524039|ref|ZP_06440893.1| type IV pilus assembly protein PilY1 [Anaerobaculum hydrogeniformans ATCC BAA-1850] gi|289502695|gb|EFD23859.1| type IV pilus assembly protein PilY1 [Anaerobaculum hydrogeniformans ATCC BAA-1850] Length = 1071 Score = 40.4 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 37/111 (33%), Gaps = 28/111 (25%) Query: 368 KKYIVLLTDGENTQ---DNEEGIAICNKA---------------KSQGIRIMTIA-FSVN 408 + ++++LTDG +TQ N + K+Q +R M I + N Sbjct: 301 QNWLIVLTDGADTQTWPYNTSPVTAVKNLYDEHAKTSWPSFYGKKAQPVRTMVIGLINPN 360 Query: 409 KTQQEKARYFLSNCA---------SPNSFFEANSTHELNKIFRDRIGNEIF 450 + + L+ A + + A EL + F+D Sbjct: 361 ASGVSTLKNTLNRMADMGDDGQENGSSHAYFATDVDELMQAFKDIFKQIQS 411 >gi|330834639|ref|YP_004409367.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566778|gb|AEB94883.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 383 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 44/123 (35%), Gaps = 20/123 (16%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + + ++ N G TA+ A+ TA++ N Y++LLTDG Sbjct: 91 VDPEDLTNEIVNLNAGGQTALFTALLTAFNL------------HNKHGVPSYVILLTDGN 138 Query: 379 NTQDNEEGIAICNKAKSQGI--RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 T D + I + ++F + E L++ S F+ + E Sbjct: 139 PTDDTNI-----ETYRRLQIPSSVQAVSFGLGDDYNESILKSLADK-SGGVFYHVSDAME 192 Query: 437 LNK 439 + + Sbjct: 193 IPE 195 >gi|260803822|ref|XP_002596788.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] gi|229282048|gb|EEN52800.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] Length = 547 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 30/115 (26%), Gaps = 20/115 (17%) Query: 336 STAINDAMQTAYDTIISSNEDEVHR---------MKNNLEAKKYIVLLTDGENTQDNEEG 386 T N + A + K + +LL+DG Q + Sbjct: 98 YTISNPGLPFAISNLTQEGGLSRTGHALSFMTDTSKFRTGIPRTAILLSDGFP-QSDANA 156 Query: 387 IAI------CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANST 434 A+ A+ GI + + L S + F +++ Sbjct: 157 QAMDDYEAQAEAARDAGIDLYAVGVGAAGLVNW---DVLETITGSSDRVFRSDNP 208 >gi|186472639|ref|YP_001859981.1| von Willebrand factor type A [Burkholderia phymatum STM815] gi|184194971|gb|ACC72935.1| von Willebrand factor type A [Burkholderia phymatum STM815] Length = 332 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 31/230 (13%), Positives = 66/230 (28%), Gaps = 33/230 (14%) Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + L M G ++ ++S + + K + RDA+ + + Sbjct: 80 TGTGAEILILMDGSGSMNQAISSGSMNVADAPTAGETKNQMARDAITAFVAQ-------R 132 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 R+ F + F+ + I T T ++ +Q+A + Sbjct: 133 ANDRLAFMLFGTHPMLAVPFTRNRTVIDAAIAATGVGRGTPD--TLLDRGIQSAVELF-- 188 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + + IVL++DG D+ I G+ + I Sbjct: 189 --------DGRPRTSSRAIVLVSDGGARLDDVAREHIRAGLSRNGVALYFIYLRSGIYSP 240 Query: 413 EKARYFLSNCASPN--------------SFFEANSTHELNKIFRDRIGNE 448 + SP ++A+S ++ + D E Sbjct: 241 DLHVRLADADHSPEAELHRFFLSLPTAYRLYQADSPQQVARAMSDIARTE 290 >gi|296134299|ref|YP_003641546.1| TadE family protein [Thermincola sp. JR] gi|296032877|gb|ADG83645.1| TadE family protein [Thermincola potens JR] Length = 133 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 34/130 (26%), Gaps = 7/130 (5%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + + G + AL++PV++ + +V+ R A + + Sbjct: 5 FHQFRQRENGQALVEMALVLPVLILIIFGIVEFGRIMNT----YLIVTNAAREGARHGVV 60 Query: 71 SLEEVSSRAKNSFTFPK---QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + R KI + T + + + + Sbjct: 61 GGTDADIRDAVINNAYTLDPAKITVTISPAGSRTRGAPLTVQVSYSLDIIAPLIGVITGN 120 Query: 128 AYQVVLSSRY 137 Y V S+ Sbjct: 121 PYIVTASTTM 130 >gi|119510959|ref|ZP_01630081.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119464398|gb|EAW45313.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 464 Score = 40.4 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 63/197 (31%), Gaps = 21/197 (10%) Query: 218 QNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP-LDPSLSEEHFVDSSSLRHVIKKKHLVRD 276 I +++ + S+Y ++ + + + K +V + Sbjct: 34 TKDVAAIPPSTTFTFVIDTSGSMYEIVAGDTTPTGKTYTVDGKEYTQVTGGKSKIDIVSE 93 Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MG 335 +L ++IRS + + R+ F+D + ++ G Sbjct: 94 SLLALIRSGRLG----ASDRIAIVQFDDTASQIIGLTSATEI---NKLEDAIAQLRTFSG 146 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T + ++ A + + + +L TDG+ T D+++ AI + + Sbjct: 147 GTRMGLGLRRALEMLNNQQMTVRRT-----------LLFTDGQ-TFDDDQCRAIASDFAT 194 Query: 396 QGIRIMTIAFSVNKTQQ 412 I I + + + Sbjct: 195 NNIPITALGVGEDFNED 211 >gi|167836681|ref|ZP_02463564.1| hypothetical protein Bpse38_09356 [Burkholderia thailandensis MSMB43] Length = 323 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 29/273 (10%), Positives = 72/273 (26%), Gaps = 8/273 (2%) Query: 27 ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ----TAIITASVPLIQSLEEVSSRAKNS 82 L + V+L G +D+ R + L+ A + P + + + S Sbjct: 1 MLFVTVLLIFGAFAIDLPRVITVRNELQNDADAAALAGAGALTTPGVSGPAWTQAASATS 60 Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 ++ + + + + T A QV ++ + Sbjct: 61 AAILLNASDDQTLTSGIVQTGYWNLTGQPSTLQPTTITPGPYDMPAVQVTVTRAANQNGG 120 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 + L + + S + ++ G VID + Y QP Sbjct: 121 AIPLLMGNFLGVSTANGSATAVAIIASPSTVGAGGVFPMVIDQCV-LSQYWNAQTNQPKI 179 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML---YPGPLDPSLSEEHFV 259 ++ + Q + + ++ + N + P ++ + +V Sbjct: 180 DPSTGQPYDIQIGNGQTYGGSCTGGQWTSFLTNANDAPTVRGLITNGNPSPLNIGDNIWV 239 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + + + V + V Sbjct: 240 QPGAKTTLYSSVPTGVTIVMPVATQVSSKTYVP 272 >gi|152994355|ref|YP_001339190.1| hypothetical protein Mmwyl1_0314 [Marinomonas sp. MWYL1] gi|150835279|gb|ABR69255.1| conserved hypothetical protein [Marinomonas sp. MWYL1] Length = 186 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 21/45 (46%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 + + + + AL +PVM+G+ + D+ + ++Q A Sbjct: 4 RFFRDENANAVLEFALFVPVMIGIILVSADLYNINRMRGVMEQTA 48 >gi|127512111|ref|YP_001093308.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] gi|126637406|gb|ABO23049.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] Length = 1204 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 44/191 (23%), Gaps = 51/191 (26%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + F D SWG R + + T Sbjct: 347 YFAGKSVEFGDDDKDYTYESWGRWVTYRANKNDSLDKDALVKGTN--------------- 391 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------------------- 393 + YIV +TDGE T D+ + + Sbjct: 392 ---TYQSPFKKCQNNAYIVFITDGEPTLDSAADVEVTKLTGGVDKHSTSYMSALSSWMYR 448 Query: 394 KSQG--------IRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELNKIFRD 443 K + TI F + A L AS S+F+A +L + Sbjct: 449 KDVNPNMDGDQHVITHTIGF---SEGAKNAEDLLKKVASKGGGSYFDATDATKLQGSIQQ 505 Query: 444 RIGNEIFERVI 454 + + Sbjct: 506 AVNEVLANSAS 516 >gi|332290703|ref|YP_004429312.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332168789|gb|AEE18044.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 351 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 54/178 (30%), Gaps = 11/178 (6%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 ++ A +S I ++ + F + V+ + +++ + Sbjct: 127 TSLQELKQASSSFISNVMPAIPTDSFKMAIYWFDGEDVLHELQPLTTSAAMLQEAIDGIT 186 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 D + ST + A+ A + + + + + +V+ TDG + Sbjct: 187 EDISNDPSTDLYGAVIKAAEIADNIIDVSENEDLFAAAS---VVIFTDGTDQAARYTEEE 243 Query: 389 ICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 + + TI LS S F A++ EL +F D Sbjct: 244 AVDAVSNADDDVSFFTIGLGSEID-----EGILSKIGQTESVF-ASNAAELESVFNDI 295 >gi|291403690|ref|XP_002718170.1| PREDICTED: cochlin [Oryctolagus cuniculus] Length = 551 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 413 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDSP------ 466 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 467 --NKNFLVIITDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 515 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 516 KPKESHAFFTREFTGLEPIVSDVI 539 >gi|198426777|ref|XP_002120162.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 367 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 40/312 (12%), Positives = 82/312 (26%), Gaps = 31/312 (9%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 + + L S+ + A S V D + Sbjct: 45 KIHFHSDHSLLVSVPYDTHGSNDVKSASGGLSSCPIRFNVEQHSQCDDVTPTSPNLSARD 104 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 G L+ P + + + + + S Sbjct: 105 GFGLDFEVDP---RGNVTACAPLRSQTCRGNVLKLGYCYTGTGH--------GTVWSPLS 153 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 V+ + + L+ + SI + D N + S Sbjct: 154 IVEGFNSDCPHDQVDLL--FVLDGSTSINEADPNNFNTVKNWVKNITKRFDITSSGSAAV 211 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + + ++ G+T A+ A +S+ KK +VLLTDG Sbjct: 212 AMDQIKLRL--------GATFTAAALSKATTVFKNSSRFNDPL------TKKVLVLLTDG 257 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN-CASPNSFFEANSTHE 436 ++ D E A + ++ I TIA V ++ + + + + ++ Sbjct: 258 QSN-DREGLNASATQVRNLNIT--TIAVGVKADVLQELQIIANGVIGNNDRVYQLRDFSN 314 Query: 437 LNKIFRDRIGNE 448 L+ I + Sbjct: 315 LDSIVQSIFQEI 326 >gi|219883045|ref|YP_002478209.1| hypothetical protein Achl_4441 [Arthrobacter chlorophenolicus A6] gi|219862051|gb|ACL42392.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 327 Score = 40.4 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 7/35 (20%), Positives = 11/35 (31%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYE 50 G +I A M +LG + +D Sbjct: 11 SDERGAVAVIVAFAMVGILGFAAVAIDTGALYSER 45 >gi|153000354|ref|YP_001366035.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS185] gi|151364972|gb|ABS07972.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS185] Length = 772 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 51/180 (28%), Gaps = 20/180 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A +++ ++K + FN + + + + F Sbjct: 407 DSIVQAKNALLYALKGLKPE---DSFNIIEFNSSLSLLSATPLPATSSNLSRARQFVSRL 463 Query: 332 NEMGSTAINDAMQTAY-DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + A+ A ++ S + D V + ++ +TDG + I Sbjct: 464 QADGGTEMALALDAALPKSLGSVSPDAVQ-------PLRQVIFMTDGSVGNEQALFDLIR 516 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + R+ T+ F+ A +F E++ + Sbjct: 517 YQIGES--RLFTVGIGSAPNSH-----FMQRAAELGRGTFTYIGKVDEVDAKISALLSKI 569 >gi|114652507|ref|XP_001170996.1| PREDICTED: coagulation factor C homolog, cochlin isoform 3 [Pan troglodytes] Length = 534 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 396 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 447 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 448 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 498 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 499 KPKESHAFFTREFTGLEPIVSDVI 522 >gi|332798630|ref|YP_004460129.1| pilin isopeptide linkage domain-containing protein [Tepidanaerobacter sp. Re1] gi|332696365|gb|AEE90822.1| pilin isopeptide linkage domain protein [Tepidanaerobacter sp. Re1] Length = 925 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 38/407 (9%), Positives = 95/407 (23%), Gaps = 48/407 (11%) Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + Y + N T + +V + Sbjct: 38 TTVIDNIDEYPGLKNASFYYDAKRIIEMTSNNTYPALYPSNSTIKTFSWKHGQTAKVSWN 97 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR-------SYHKEHGVSIQWVIDFSR 187 L F + S + E + ++ + Sbjct: 98 KSNGSSNCDLLWFGANSKTDSHISLASVEYNANGNLKQYTYDDNGRAKAITSYNWLQVIN 157 Query: 188 SMLDYQRDSEGQPLNCFGQ----PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 S + + N D + + + + + +Y Sbjct: 158 SSKGEAKVTLHFRYNPDIDEVEPDPDPEPEYAKTIDYLGDGIPNPDTSANGLNDYRIYLD 217 Query: 244 LYPGPLDPSLSEEH------FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 + + + + + +++++ + S + S+ + + R+ Sbjct: 218 VTTEASETETDRDIIFVLDVSNSMDTALGNTTRFNVLKNTVKSAVNSLVQNPSN----RI 273 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F R + KLI + G T ++M A + S Sbjct: 274 SIITFGTRAQIVTTRETDRTKLINCVNSLSLPGGTA-GGTNYYESMLHAAQIVNGSINGS 332 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI---------------MT 402 +K I ++DGE T A+ A ++ I + Sbjct: 333 ---------HEKVIFFVSDGEPTASLPAANAMGYAAYAEVATIYAYHAAQEFQNVDRFYS 383 Query: 403 IAFSVNKTQQEKARYFLSNC-ASPNSFF-EANSTHELNKIFRDRIGN 447 + + + + + +A+S +L F + Sbjct: 384 VFIGDDSGSASTLQTITQMVEVNNEKYMVQASSAEQLTSAFNRFVSK 430 >gi|315127492|ref|YP_004069495.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] gi|315016006|gb|ADT69344.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] Length = 666 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 44/158 (27%), Gaps = 21/158 (13%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 FN++V + + F + G T I A+ + Sbjct: 357 DDSFNIIGFNNQVTAMSDTPLVASDFNLRRARRFIYNLQADGGTEIQGALDAVLNGAQFE 416 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + +V LTDG + ++E +I R+ T+ Sbjct: 417 GF------------VRQVVFLTDGSVSNEDELFKSIARTLGDS--RLFTVGIGS-----A 457 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 R+F+ A S+ ST ++ + Sbjct: 458 PNRFFMRRAADIGKGSYTFIGSTFDVQPKMQQLFDKLA 495 >gi|307292637|ref|ZP_07572483.1| TadE family protein [Sphingobium chlorophenolicum L-1] gi|306880703|gb|EFN11919.1| TadE family protein [Sphingobium chlorophenolicum L-1] Length = 209 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 51/144 (35%), Gaps = 4/144 (2%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +L + +G + AL +P++LG+ ++ +Y L A + + + + Sbjct: 6 LHRLWPNRSGVAAVEFALSLPILLGLTMYSMEAANMAYTSQKLG---DIATL-TADSVSR 61 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 +S+ + + + + + + + T ++ ++ Sbjct: 62 IRLSISNGDLTDALGGMKILGDSIDLRNRGRIIVSSVQPVLDSSGNVTNQKVRWQRCTGA 121 Query: 131 VVLSSRYDLLLNPLSLFLRSMGIK 154 ++ S Y + N + + + G K Sbjct: 122 LIKDSPYVVNANLGTAGIGATGRK 145 >gi|149573051|ref|XP_001520334.1| PREDICTED: hypothetical protein, partial [Ornithorhynchus anatinus] Length = 581 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 4/86 (4%) Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 +A K +V++TDG+ D+ + A+++G+ I A Sbjct: 5 NPISGAREDANKILVVITDGQKYGDSLKYSEAIPLAEAKGVIRYAIGVGDAFDFSSTAEE 64 Query: 418 FLSNCASP---NSFFEANSTHELNKI 440 + AS + F ++ L+ I Sbjct: 65 LI-TIASSPEQDHVFRVDNFGALDNI 89 >gi|111018687|ref|YP_701659.1| hypothetical protein RHA1_ro01688 [Rhodococcus jostii RHA1] gi|110818217|gb|ABG93501.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 904 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 12/70 (17%), Positives = 22/70 (31%), Gaps = 14/70 (20%) Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-------------SPNSFFEA 431 I ++ +S GI + + Q + + A SP F+ A Sbjct: 203 RPAGIADQLRSSGIVTFAVGLAAGTAQPS-DFDLMRSIATGGDGACGKTTSPSPGDFYLA 261 Query: 432 NSTHELNKIF 441 + +L F Sbjct: 262 QNIDDLLFAF 271 >gi|27379054|ref|NP_770583.1| hypothetical protein blr3943 [Bradyrhizobium japonicum USDA 110] gi|27352204|dbj|BAC49208.1| blr3943 [Bradyrhizobium japonicum USDA 110] Length = 219 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 57/211 (27%), Gaps = 8/211 (3%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 F ++ L G A++ P ML + V++ + + Sbjct: 5 LSFRARDLWTDARGVAATEFAIVSPFMLLLYIGGVELGNGLAMNVKVS--------ATAH 56 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + + + T + + + V + D+ + Sbjct: 57 SVADMVSQNTQVTASQMTGILAASTAIMAPYAVKSGSTSLMTITVSGVSTDSKGNATVQW 116 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 S ++R LS F + VS Y G +I + + Sbjct: 117 STSTKSGAARTVGQQMTLSQFTATDPKNPNNANISLILSEVSYDYTPNLGYTIAGTVQLT 176 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSS 217 S + R S P N D + ++ Sbjct: 177 DSYYLFPRCSTNSPANSSFPYYDVKYPATTT 207 >gi|699607|dbj|BAA02763.1| complement factor B [Lethenteron japonicum] Length = 763 Score = 40.4 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 58/191 (30%), Gaps = 15/191 (7%) Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKL 319 K L V I +I +R + S S + + Sbjct: 263 DASYSVGKEDFDTGLNFVKDLINRIGMYVRNIRYSIVMYATNPSLKLSVRDSWSNDPNAV 322 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 I+ + + ++ T A++ + + +V + ++ I+LLTDG + Sbjct: 323 IKILDDLDYYEFDDTPGTNTAMALKM---VLDTMALYKVANQNTFKDIRQAIILLTDGRS 379 Query: 380 TQDNEEGIAICNKAKSQGIR-----IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANST 434 G + + I + F + +++ S + F Sbjct: 380 NVGPPPGKFLMDNI-DLDIPKEHMDVYV--FGMGDVYKDEIETIASQKPNEQHSFILRDY 436 Query: 435 HELNKIFRDRI 445 +LN++F + Sbjct: 437 DDLNEVFEKML 447 >gi|326328639|ref|ZP_08194979.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325953600|gb|EGD45600.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 871 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 60/182 (32%), Gaps = 31/182 (17%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K V+ A + I + D + F V + + V+ Sbjct: 671 KIDRVQSAANAAIGLLAPKD------ELAVWTFGSSVHKT---ALAPMGNRISQVRAEIG 721 Query: 330 DENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-------- 380 G+T + A+Q A+D + +N+ + + K +VLLTDG Sbjct: 722 AIEAGGTTTQLPAAVQAAHDALAQTNDPDNPK-------TKAVVLLTDGATNLTPDGADE 774 Query: 381 -QDNEEGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 ++ A+ + +RI TI + + + + AS ++ A + L Sbjct: 775 EENKAANDALVADIRGSESHVRIYTIPYGNSADKCLLEKV---AAASGARYYGAGARESL 831 Query: 438 NK 439 Sbjct: 832 IN 833 >gi|301614661|ref|XP_002936793.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 344 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 17/114 (14%), Positives = 35/114 (30%), Gaps = 14/114 (12%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 T A++ A +K ++LTDG+ + + I Sbjct: 2 MYMGEGTYTGTAIRKATQ---------EGFFGARTGVRKVAIVLTDGQTDKRESVKLDIA 52 Query: 391 -NKAKSQGIRIMTIA-FSVNKTQQEKARYFLSNCASPN---SFFEANSTHELNK 439 +A++ I + I + + Q L+ AS + + + L Sbjct: 53 VREAQAANIEMYAIGIVNASDPTQVDFLRELNLIASDPDSEHMYLIDDFNTLPA 106 >gi|254506780|ref|ZP_05118920.1| conserved hypothetical protein [Vibrio parahaemolyticus 16] gi|219550361|gb|EED27346.1| conserved hypothetical protein [Vibrio parahaemolyticus 16] Length = 385 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 42/328 (12%), Positives = 84/328 (25%), Gaps = 24/328 (7%) Query: 30 MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 M ++GV + +DV + L+ + A + A++ L E + A T Sbjct: 1 MLALIGVAALSIDVNHAMLNKTRLQNSVDAAALAAAIVLDNGGSEAQATAAAKTTLTN-- 58 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + + + D +P + +Y V S + L + F Sbjct: 59 VANATGNTEMDFTSAQVVVQFSNDPATFPFSGFDPDEDSYARVAVSNFALS----NFFAH 114 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + L T + S D S S D + + Sbjct: 115 VFSVDKNLASTAVAGPSPSVIDSNNIVPMAVCEGDSSGSAGYVPGDLYALKIADQKLGSM 174 Query: 210 RTVKS------YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV---- 259 + + + + + V + GP+ L+ V Sbjct: 175 GSGNYQLLDFGSGASTVRTALAGDYAISVGVGDQVTTKPGNTIGPVGQGLNTRFGVYKGG 234 Query: 260 --DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF------FNDRVISDPS 311 S+ I K + +I D + + + SD + Sbjct: 235 GLSSTDYAPDIYVKEPNKSPSIDSSGNITYSDTSGAGGQPWSYSDYLAALPDCTGDSDCN 294 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAI 339 G + R ++ G T+ Sbjct: 295 IGSGGQQNRRVLIVPIVDCSGASGGTSS 322 >gi|171320625|ref|ZP_02909645.1| TadE family protein [Burkholderia ambifaria MEX-5] gi|171094138|gb|EDT39225.1| TadE family protein [Burkholderia ambifaria MEX-5] Length = 142 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 36/131 (27%), Gaps = 1/131 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G + AL++P +L V ++DV + + A++ A V + L Sbjct: 8 ARHARGVVSLEFALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPMLTT 67 Query: 75 VSSRAKNSFTFPKQKIE-EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + N T V T VL Sbjct: 68 TQIANVALSYTQNSLVSGGSATAPTVAVTQANGTTAGTALTVTVTYTYSGLVLGTALSVL 127 Query: 134 SSRYDLLLNPL 144 + + + + Sbjct: 128 TGPITISASSV 138 >gi|48477361|ref|YP_023067.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] gi|48430009|gb|AAT42874.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] Length = 379 Score = 40.4 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 53/186 (28%), Gaps = 34/186 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L ++ ++I + + + F D V T Sbjct: 54 KLDLAKEGAMNLI------EKIPRDNIVSLIAFGDTAKVI------VEGKEPTFALEAIP 101 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G+TA+ A+ TA N I+LLTDG T + Sbjct: 102 SLKVAGNTAMYTALLTATKLADKYNMPGR------------IILLTDGMPT--DVSMNES 147 Query: 390 CNKAK-SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIG 446 + +G I I L A + F+ + EL K+ + Sbjct: 148 YENLQVPEGFTIDCIGIG-----DNYRDDLLKLLADKGNSIFYHLENPEELPKVMESTVS 202 Query: 447 NEIFER 452 ++I + Sbjct: 203 SDISAK 208 >gi|308483657|ref|XP_003104030.1| hypothetical protein CRE_02342 [Caenorhabditis remanei] gi|308258687|gb|EFP02640.1| hypothetical protein CRE_02342 [Caenorhabditis remanei] Length = 1471 Score = 40.4 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 27/270 (10%), Positives = 77/270 (28%), Gaps = 13/270 (4%) Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + + + P +++ + I+ + S+ + ++ Sbjct: 751 TVEEKIEVRPHTIYTEEPYFSTSSIEWNPTTSESEDYSNPTTPKSVIREDEDYDYTNEFA 810 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSL 253 + + + + +D + Sbjct: 811 AFPPTVTTTIASTINSTNADDKVTTSKPKHVSSSEEIQPNFVIPSGQCPTPNQPNIDRNR 870 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 ++ F+ SS ++ H + + D V++ +ND + S Sbjct: 871 TDILFLLDSSDNFNEQRFHRAIKLIGETVSKFNNF--GADGVQVSLVQYNDEPYLEFSLR 928 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 H + ++ A E G + +N A++ + + +N +++ Sbjct: 929 --KHNCKKHLLDDIADTEFMTGGSQLNKALEKVSQFAFTKKRGDRPDAEN------ILII 980 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +TDG+ +E + AK + ++ I Sbjct: 981 VTDGQYNGRIQEPTRL---AKDNNVTVLVI 1007 >gi|309789715|ref|ZP_07684295.1| magnesium chelatase [Oscillochloris trichoides DG6] gi|308228201|gb|EFO81849.1| magnesium chelatase [Oscillochloris trichoides DG6] Length = 603 Score = 40.4 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 61/191 (31%), Gaps = 32/191 (16%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-NDRVISDPSFSW 314 VD+S ++ + A+ S++ D ++G F D + Sbjct: 422 CFVVDASWSMAAEERMQATKAAVLSLL-----RDAYQRRDQVGLVSFQRDYARVLLPLTN 476 Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 V R G T + M TAY+ + + + E +VLL Sbjct: 477 SVELAQRR-----LQTMPTGGKTPLARGMLTAYELL-------ERARRQDHEVVPLMVLL 524 Query: 375 TDGENT------QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF---LSNCASP 425 TDG+ +E AI + ++ IR + I ++ +R L Sbjct: 525 TDGQANVAIGNAPPQQEAYAIADLIAARDIRAIVIDTEHPNFERGLSRRLSEHLK----- 579 Query: 426 NSFFEANSTHE 436 ++ H+ Sbjct: 580 GRYYRLEDLHD 590 >gi|300697744|ref|YP_003748405.1| hypothetical protein RCFBP_mp20592 [Ralstonia solanacearum CFBP2957] gi|299074468|emb|CBJ54018.1| conserved protein of unknown function, putative TadE-like [Ralstonia solanacearum CFBP2957] Length = 159 Score = 40.4 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 14/108 (12%), Positives = 32/108 (29%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I A+++PV+L + +V + L AA+ A + + V++ Sbjct: 24 GVASIEFAVVVPVLLLMMLGIVYYGVIFAMQQMLTLAAEEGARAALRYQSTNTQRVAAAY 83 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + I + N + + + + Sbjct: 84 SAVSSVLPSFISGRVQTNQSSTPLVTCQNVAGMQCLSVVLTMPLTTGN 131 >gi|156409365|ref|XP_001642140.1| predicted protein [Nematostella vectensis] gi|156229281|gb|EDO50077.1| predicted protein [Nematostella vectensis] Length = 156 Score = 40.4 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 39/130 (30%), Gaps = 21/130 (16%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G +++ + + +K T A+ + Sbjct: 48 TRVGLVLYSNNPYKIFG--FNKYTNKNAAMKATGRIPYPRRGTKTGRALAYTGRYLF--- 102 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + ++ ++LLTDG + + A K + GI I + N Sbjct: 103 --------RSSKRRRVLILLTDG---RSYDRVSAPARKLRQAGIHIYAVGVGRNYN---- 147 Query: 415 ARYFLSNCAS 424 L + AS Sbjct: 148 -IKQLRSIAS 156 >gi|92113788|ref|YP_573716.1| membrane protein-like protein [Chromohalobacter salexigens DSM 3043] gi|91796878|gb|ABE59017.1| membrane protein-like protein [Chromohalobacter salexigens DSM 3043] Length = 604 Score = 40.4 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 47/155 (30%), Gaps = 9/155 (5%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G ++ ++ +++ + +DV R Y + L++ A +++ Sbjct: 10 ARAQRGAIGLLAMGVLGLVILCLVLALDVGRLYYEKARLQK-------LADTAAMEAAAA 62 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 ++ S L V D + E P A +VVL Sbjct: 63 LARTQDMSTLREMAAANLARSEGDGAALTLQRGTIAVDDGINVFVAEEAPTSDAVRVVLR 122 Query: 135 SRYDLL--LNPLSLFLRSMGIKSWLIQTKAEAETV 167 +N LF + + T + A Sbjct: 123 QSVPTSLIVNVERLFSDRDIEDTRTLSTASVARRT 157 >gi|256393183|ref|YP_003114747.1| hypothetical protein Caci_4038 [Catenulispora acidiphila DSM 44928] gi|256359409|gb|ACU72906.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 125 Score = 40.4 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 12/124 (9%), Positives = 25/124 (20%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 T +++ ++ G+ D R AQ A + L A Sbjct: 2 VTAFTVVIVIALMVFVGLAYDGGRALDGRVKALNEAQEAARAGAQALNLGALRAGGTAVL 61 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + YL + + A + + Sbjct: 62 DPEAAVGAAKAYLAGTGDAGTVDVAGTTVTVSVTHVQATKFLGLVGVGSITAHVTASARA 121 Query: 142 NPLS 145 + Sbjct: 122 EQGT 125 >gi|304310230|ref|YP_003809828.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] gi|301795963|emb|CBL44164.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] Length = 371 Score = 40.4 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 36/107 (33%), Gaps = 3/107 (2%) Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 Q Y + + + N + I++L+DG++ + N+ G + I Sbjct: 186 QRGYSKLYDAISKAIEVSNCNPKLVCSIIVLSDGDDVGSALPLADLHNQLYQAGTAVFPI 245 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 + N + + S +++ + + + + ++ Sbjct: 246 GYGDNINVNKLQELATN---SGGAYYTSEDSSQFSAVYERIWARLSN 289 >gi|221117275|ref|XP_002154690.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 895 Score = 40.4 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 35/83 (42%), Gaps = 8/83 (9%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G++AI+ A++ Y+ + + + ++LLT G++T D + Sbjct: 5 NGTSAIDVALKFVYNNM--------PKTFKEDGSPYVVLLLTYGKSTVDGKTLYKAAKPI 56 Query: 394 KSQGIRIMTIAFSVNKTQQEKAR 416 + GI ++T+AF E Sbjct: 57 RQSGIYVVTVAFGQQSNFSELLS 79 >gi|258651507|ref|YP_003200663.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554732|gb|ACV77674.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 593 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 63/196 (32%), Gaps = 18/196 (9%) Query: 263 SLRHVIKKKHLVRDALASVIR-SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + L +DA A + D + +D P S G Sbjct: 402 PGTNGADRIDLAKDAAARGLGLYRADSDIGLWEFSTRLSPTSDHRELIPISSLGPDGQGS 461 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGEN 379 T A A+ + + D ++ + + ++LLTDG N Sbjct: 462 TGAARLAAAL------NGLQAIPDGGTGLYDTVLDATRTVRAGYDPDRVNVVLLLTDGMN 515 Query: 380 TQDNEEGIA------ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 N + + ++ + +++IAF + + + A+ + + + Sbjct: 516 DDVNSITMDQLLSTLAAEQDPARPVPVISIAFGPDSDVAALQQI---SRATGGATYLSQD 572 Query: 434 THELNKIFRDRIGNEI 449 ++ +IF D +G + Sbjct: 573 PRQIGEIFLDAVGQRL 588 >gi|34527040|dbj|BAC85316.1| unnamed protein product [Homo sapiens] Length = 401 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 263 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 314 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 315 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 365 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 366 KPKESHAFFTREFTGLEPIVSDVI 389 >gi|308472877|ref|XP_003098665.1| hypothetical protein CRE_04175 [Caenorhabditis remanei] gi|308268265|gb|EFP12218.1| hypothetical protein CRE_04175 [Caenorhabditis remanei] Length = 396 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 56/173 (32%), Gaps = 16/173 (9%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA---I 339 I + T R+G +N ++ I + + + T + Sbjct: 70 TRIGLNASEPRTTRLGLVTYNSVATQKADL--NQYQSIGDVFHGIFDALSNIVDTNESYL 127 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 ++ A +I + +K +++ + + + I + K G++ Sbjct: 128 ATGLELAERMLIDQS-----VNSTRAHYQKVVIVYASEYDGNGELDPLPIAERLKLSGVK 182 Query: 400 IMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 I+T+A+ SN ASP F +++ K+ G+ + Sbjct: 183 IITVAYG----NAYGLTKSFSNIASPGFAF--SNSDNQGKLIGQIQGSLLQAN 229 >gi|307102442|gb|EFN50717.1| hypothetical protein CHLNCDRAFT_142575 [Chlorella variabilis] Length = 575 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 44/130 (33%), Gaps = 13/130 (10%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D + T + + A + + + + IVL+TD + Q + + Sbjct: 197 DIADTSGTNMQAGLDAATGEMRA---CRTCMEADRAATENRIVLITDAQPNQGDISDEGL 253 Query: 390 CNKAKSQ---GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + K+ GI I ++ + + ++F +S E FR R+ Sbjct: 254 LARLKANAADGIHTTIIGVGLDFNTELVEGI---SKVRGANYFSVHSPGE----FRRRLT 306 Query: 447 NEIFERVIRI 456 +E V + Sbjct: 307 DEFDYAVSPL 316 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 22/182 (12%) Query: 270 KKHLVRDALASVIRSIKKID----NVNDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIV 324 + L++D ++ +I D ++ ++ F+ S + G R + Sbjct: 184 RITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFSFGPHEVISETLGAVGSPSRIDL 243 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + G T I DA+ AY + IVL+TDGE T Sbjct: 244 QQRVEALQADGGTGIYDAVLAAY------------AESAGGDYIPSIVLMTDGELTAGRT 291 Query: 385 EGIAICN-KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFR 442 + A IR +I V + A + F+A + +L++ F+ Sbjct: 292 YDQFLTEWNALPSNIR--SIPVFVILYGEANVADMEQLAATTGGKTFDAING-DLDEAFK 348 Query: 443 DR 444 + Sbjct: 349 EI 350 >gi|119510964|ref|ZP_01630086.1| hypothetical protein N9414_01285 [Nodularia spumigena CCY9414] gi|119464403|gb|EAW45318.1| hypothetical protein N9414_01285 [Nodularia spumigena CCY9414] Length = 412 Score = 40.4 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 52/156 (33%), Gaps = 17/156 (10%) Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT- 326 KK +A+ I+ + ++ F + + L + Sbjct: 108 TKKITGAINAVRQFIQVL---SQRGGNTQISIVPFGESGSRCQGYPVNKETLDKFFAAND 164 Query: 327 --------FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK--YIVLLTD 376 + ST + + + A + +S E+ + +N+ + + I+LL+D Sbjct: 165 FKLQNHLNYLAKSTPCASTNLYEPLNQAVRFLATSEEERFYVAENSDQPQPRLSIILLSD 224 Query: 377 GENTQDNEEGI-AICNKAKSQG--IRIMTIAFSVNK 409 G + + NE+ + I + T+ + + Sbjct: 225 GYHNKANEQQDFTALTNLFRRNPLITVHTLGYGLTP 260 >gi|254787962|ref|YP_003075391.1| PKD domain-containing protein [Teredinibacter turnerae T7901] gi|237687416|gb|ACR14680.1| PKD domain protein [Teredinibacter turnerae T7901] Length = 1083 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 49/352 (13%), Positives = 106/352 (30%), Gaps = 56/352 (15%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQP 207 I +Q + + + E M+ YQ++ N Sbjct: 296 PVIGPIDGIKLQPRNQYDWARIGATIETINGSAGWKPIFLGMMPYQQNPTNSVDNEVLAA 355 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 T S ++ + + +Y + LD +++ FVD + + Sbjct: 356 LTGTSGSGATIDNNGDNIFVS----SDRVAREVYGESWTASLDKVKAQQQFVDEAPRTYS 411 Query: 268 IKKKHLVRDALASVIRSI----------KKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 L+ D+ S+ + ++ ++G F+ F+ Sbjct: 412 ELDSLLIIDSSGSMRTTDPQSRRLEAANTYVNISAAEDKIGIIDFDGSARVVKPFTLLGQ 471 Query: 318 --KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV----------------- 358 R ++ + +G T I ++ A T+++S +D++ Sbjct: 472 QGSTERIQLENAINGIDAVGGTEIAGSLGLACSTLMASWQDDIIEQQHMVNDIKALVEEF 531 Query: 359 ---------------HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + + + +KK +LLTDG+ T + C G + TI Sbjct: 532 PNNTISHSNARRAAIPGIAHAINSKKIAILLTDGD-TPSSYSQANQC--FIENGWMLYTI 588 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANST-HELNKIFRDRIGNEIFERVI 454 F + AS + A+S+ +LN ++ + E+ Sbjct: 589 GFG----GANGEKLAPLAEASGGYYIAADSSLLDLNCAYQQIRSHAAGEQAS 636 >gi|170091398|ref|XP_001876921.1| predicted protein [Laccaria bicolor S238N-H82] gi|164648414|gb|EDR12657.1| predicted protein [Laccaria bicolor S238N-H82] Length = 1228 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 32/112 (28%), Gaps = 11/112 (9%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 + G T ++ + I +D + + I++LTDG Sbjct: 210 ANPADMLDIFDAVQPRGWTYTGAKLKFLLNRCIKRFDDAAGKPEYADIKPVDIIVLTDGA 269 Query: 379 NTQDNEEGIAICNKAKS--------QGIRIMTIAFSVNKTQQEKARYFLSNC 422 T D I + + I I + ++ A L +C Sbjct: 270 PTDDPAVV--IADAIRRLDGAKHHLNAIGIQFVQIG-DEDGAAAALQLLKDC 318 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 32/139 (23%), Positives = 54/139 (38%), Gaps = 22/139 (15%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTD 376 V +A G TAI DA+ A R + + +++ IVLLTD Sbjct: 411 AASAQVLAYADSLVADGGTAIYDALTLAQQ---------QARQELRADPERFVSIVLLTD 461 Query: 377 GENTQDNEEG--IAICNKAKSQG---IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEA 431 G NT + A+ G +R+ I F ++ + +A L + F+A Sbjct: 462 GANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFGEAQSGEMQALAAL----TGGRAFDA 517 Query: 432 NST--HELNKIFRDRIGNE 448 +T L +F++ G + Sbjct: 518 RNTGKSGLPLVFKEIRGYQ 536 >gi|110799635|ref|YP_695225.1| von Willebrand factor type A/Cna B-type domain-containing protein [Clostridium perfringens ATCC 13124] gi|110674282|gb|ABG83269.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens ATCC 13124] Length = 928 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 30/322 (9%), Positives = 79/322 (24%), Gaps = 29/322 (9%) Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 L +L + + ++ + +++D S SM R + + Sbjct: 103 PLEYNEGYLTKKAYTTDEDNVFDINLKIQGKKNQSLKKDVVFLLDNSNSMTTNNRAIKIK 162 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 T + + Y ++S+ Y + ++ + Sbjct: 163 EQIKNVMDKLNT--NNTRYALVTYASTILDGRYYHLIDRSIGDNKYTVYKGYTSNQCYLN 220 Query: 260 DSSSLRHVIKKKHL------VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 +S+++ + K + + + + Sbjct: 221 FTSNIQEIYNKIPTTVPNQRNNGYVGGTFTQEGLLK-----AIELLKNSDADEKIIIHLT 275 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G+ + + ++ + + + + + Y L Sbjct: 276 DGLPTFSFLLKEFGGNEKAIFDYNTQYNGIGVRGFGTSYFFNTKTQKPYIYSREEVYSAL 335 Query: 374 LTDGENT---QDNEEGIAI-CNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 +N + K + I I TI + K + Y Sbjct: 336 NRSINKNESIWNNGFPTTLEAENIKKENPDINIYTIGIELKKEVYKWDDY--------RK 387 Query: 428 FFEANSTHELNKI--FRDRIGN 447 ++ A EL +I F + I + Sbjct: 388 YYNAEGVVELPEIKKFLESISS 409 >gi|86360430|ref|YP_472318.1| hypothetical protein RHE_PE00154 [Rhizobium etli CFN 42] gi|86284532|gb|ABC93591.1| hypothetical protein RHE_PE00154 [Rhizobium etli CFN 42] Length = 547 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 6/54 (11%), Positives = 19/54 (35%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + +L + G I + + + +++ + Y + + A A + Sbjct: 4 SFINRLHRDQRGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAAL 57 >gi|4758022|ref|NP_004077.1| cochlin precursor [Homo sapiens] gi|205277471|ref|NP_001128530.1| cochlin precursor [Homo sapiens] gi|114652503|ref|XP_001171057.1| PREDICTED: cochlin isoform 6 [Pan troglodytes] gi|7387582|sp|O43405|COCH_HUMAN RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801413|gb|AAC39545.1| Coch-5B2 gene product [Homo sapiens] gi|37182918|gb|AAQ89259.1| COCH [Homo sapiens] gi|58802453|gb|AAW82432.1| coagulation factor C homolog, cochlin (Limulus polyphemus) [Homo sapiens] gi|119586367|gb|EAW65963.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|119586369|gb|EAW65965.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|158258885|dbj|BAF85413.1| unnamed protein product [Homo sapiens] Length = 550 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 412 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 463 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 464 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|6680956|ref|NP_031754.1| cochlin precursor [Mus musculus] gi|311771523|ref|NP_001185764.1| cochlin precursor [Mus musculus] gi|12644458|sp|Q62507|COCH_MOUSE RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801415|gb|AAC39949.1| Coch-5B2 gene product [Mus musculus] gi|26324626|dbj|BAC26067.1| unnamed protein product [Mus musculus] gi|28277390|gb|AAH45137.1| Coagulation factor C homolog (Limulus polyphemus) [Mus musculus] gi|74178965|dbj|BAE42713.1| unnamed protein product [Mus musculus] gi|74209551|dbj|BAE23310.1| unnamed protein product [Mus musculus] Length = 552 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 43/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ A G TA DA+ + + Sbjct: 414 TYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVF--------GPIRD 465 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 466 SPNKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLRDMAS 516 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 517 KPKESHAFFTREFTGLEPIVSDVI 540 >gi|307592357|ref|YP_003899948.1| YD repeat-containing protein [Cyanothece sp. PCC 7822] gi|306986002|gb|ADN17882.1| YD repeat protein [Cyanothece sp. PCC 7822] Length = 4762 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 16/280 (5%), Positives = 54/280 (19%), Gaps = 11/280 (3%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + S + P + + N + + + + V Sbjct: 3661 TVTDTQGETAQQSKSVTVNNVAPSVNAGLDFASSEGETVSFNGSYTDPGILDTHSIVWNF 3720 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + L+ + N G + + + ++ + Sbjct: 3721 GDGTTTSENLTPTHTYTNN---------GTYTVTLSVIDSDGDIGTDTLTVTVNNLAPTL 3771 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 S P + + + ++ + N Sbjct: 3772 TNVNGQTTLNEGSSANFSATATDPGNDALTYTWDFGDNSDPVNAASVNHIWANNGEYTVS 3831 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKK--KHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 L + + + + + + + + ++ + D Sbjct: 3832 LTVTDAEGASTSQTLLVTVNNVAPVVSAGLDQTVSEGQAITLNGSFTDPGILDTHTIIWD 3891 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 F D + + + G T + Sbjct: 3892 FGDGTTLSNQLNPTHVYANSGVYVATLQVTDSDGGTDSDT 3931 >gi|227830111|ref|YP_002831890.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|227456558|gb|ACP35245.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] Length = 356 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 32/104 (30%), Gaps = 23/104 (22%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRI 400 + + ++ K + K I++LTDG+ T K +I Sbjct: 107 TRLHEAVSFTINLAKQSQVPTK-IIMLTDGKPTDKR--------NVKDYEKLDIPPNTQI 157 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +TI + L A S F+ EL IF Sbjct: 158 ITIGIGNDYN-----ERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|270308522|ref|YP_003330580.1| hypothetical protein DhcVS_1138 [Dehalococcoides sp. VS] gi|270154414|gb|ACZ62252.1| hypothetical protein DhcVS_1138 [Dehalococcoides sp. VS] Length = 538 Score = 40.4 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 24/314 (7%), Positives = 74/314 (23%), Gaps = 13/314 (4%) Query: 11 SKKLIKSCTGH---FFIIT----ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 KKLI+ G +I A+++P +L + G + + ++ + Sbjct: 4 IKKLIRGEKGASLAVALIFLAIGAIMIPPLLMLVGSGLKQGAAIENRTDVLYSSDAGVEW 63 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 L E + + + P + + N D+ V Sbjct: 64 VINILKTG-SEGVTDSHGNIGLPNPQSPTRTYNMSDLNGSAVMVTLTYHDLGSYYDVVST 122 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + ++ ++ S+F ++ ++ + +S + + Sbjct: 123 ASLNGRSITTNATLRYQMSGGSVFDNAITSLDGDVKLTGSSSVISDPRNPPGAIVYSGGE 182 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + + G T + + + + Sbjct: 183 LILTGSSFIEGNVYADDGIELGWSTPITGDAVTPASVNRPGNISGTVTTGAAPQLPSILT 242 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 S + ++ + + D + + Sbjct: 243 DAEIAAIVSDIQNETAFTAFSPGPVTRADNKGDWSIGYWPV-----PASIYYTAERIQRD 297 Query: 304 DRVISDPSFSWGVH 317 + +D ++ Sbjct: 298 LNISTDTPLTFKSS 311 >gi|72009654|ref|XP_784730.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 580 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 47/139 (33%), Gaps = 20/139 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + + +T GST I +++ + S + Y Sbjct: 23 ADLTDITDTALRQTLVNALPPSPTGSTCIGCGIESGLKVLGSYAQGG------------Y 70 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 I+LL+DG + + K+ G+ I TI S + QQ + LS S S F Sbjct: 71 ILLLSDGVENNKPYI-RYMYDDIKNSGVIIDTITISNSADQQMED---LSTNTSGISSFC 126 Query: 431 ANSTHE--LNKIFRDRIGN 447 +++ L F I Sbjct: 127 SDTGTGPCLINAFHRTIAE 145 >gi|260827134|ref|XP_002608520.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] gi|229293871|gb|EEN64530.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] Length = 1236 Score = 40.4 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 49/154 (31%), Gaps = 24/154 (15%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 D +MG F + + + ++ + + G T ++D + +A Sbjct: 669 KDEDQMGIVTFANDAKVVLPMTRMDSSGRDSALE-KIQNISTRGQTNLSDGLLSAISMFK 727 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN--KAKSQG--------IRIM 401 S+ + H I+L TDG+ Q + + +K G I Sbjct: 728 GSSGSDFHNG---------IILFTDGQANQGIIDAAELVQEYNSKMAGLGEGVCLPISTF 778 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 TI + + N S F+ ++ T Sbjct: 779 TIG----DYRPKLLCEVAQNLGSDAFFWLSDDTD 808 >gi|310830437|ref|YP_003965538.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] gi|309249904|gb|ADO59470.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] Length = 338 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 11/107 (10%), Positives = 33/107 (30%), Gaps = 4/107 (3%) Query: 15 IKSCTGHFFIITALLMPVML-GVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL---IQ 70 I+ G + + +L +++ + + A ++ L + Sbjct: 8 IRDEKGFVSVFVIFAVSFILPFFIFHTIEMTYLYGMKDKFQNFNDAAASAGAMQLEKKVV 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 S ++ K+ + E + N ++N +V ++ Sbjct: 68 SKGDLKFHEDEVKNVVKRILSENYGLDAHLNPQENSYITDVPNVKVS 114 >gi|298674641|ref|YP_003726391.1| von Willebrand factor type A [Methanohalobium evestigatum Z-7303] gi|298287629|gb|ADI73595.1| von Willebrand factor type A [Methanohalobium evestigatum Z-7303] Length = 825 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 39/392 (9%), Positives = 107/392 (27%), Gaps = 37/392 (9%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + + + E N FT E + + + Sbjct: 182 NNENQFKVIVSQAKDETINYDLEVYA-NGSLIRSGTFTQSEREKTIDVKNTFESLGANNL 240 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + V++ + + + + F KS + K + + + + + D + S Sbjct: 241 RAVITPHREDIDSINNRF-----YKSVYVIPKPNIQLTANNIDSPLSRILYNLYDVTTSN 295 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 D + + Q + ++ + L + S L Sbjct: 296 NFTNID-NKKAVILDNQHINTLSQNEVEKLNNYVTDGNGLVVVGGDQSYSFGGYLNSTFE 354 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN-------DTVRMGATFF 302 + + R+++ + + A +S + VN +G F Sbjct: 355 ELLPVISKPTKWTGGRNIVLVLDVSQSTAAHGTQSDILGNAVNLIQNENLKDANLGVIAF 414 Query: 303 NDRVISDPS-FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + R ++ + T++N + + + + Sbjct: 415 GTEGEDVSNGLVYLGIPSNRERLENKIRNLKTETDTSLNLGLSVSQEWLQGETGKLE--- 471 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS---------VNKTQQ 412 +++++DG ++ + I + K +G+ + + Sbjct: 472 ---------VIVISDGAIGSSYQKSLDIADDMKKKGVNFYFVNIEPSDPSIEGRYDNRGN 522 Query: 413 EKARYFLSNC-ASPNSFFEANSTHELNKIFRD 443 A+ F+ S +++F+ N +F D Sbjct: 523 LYAKRFIQEIEGSTDTYFQIQRGQRANLVFED 554 >gi|160880107|ref|YP_001559075.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160428773|gb|ABX42336.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 513 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 45/365 (12%), Positives = 98/365 (26%), Gaps = 30/365 (8%) Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 + ++E L F+ ++ + +D + DI + + + + L+++ Sbjct: 176 EDLNSEALQESLKTLFKGVVRNSGSDEYLIDIFNEGGYDALVNYESSLIELNNQLIKSNK 235 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 F + A + + K + S + Sbjct: 236 EPLRF--IYPSDGVSVSDSPFAYIDNNNEKKLEIFNKLQSFLLSSDTQQKLESMGRRTTY 293 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 D K + + K Y S LY +D S Sbjct: 294 GGLVSNDEVFKESYGIDKNAYLSPIK---YPASSVIKNALNLYQDLFRKPSVVVFCLDYS 350 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + A+ ++ +++ +M F + + Sbjct: 351 GSMFGEGN-EQLVAAMEKILDHKLASEDMIQFSKMDKIFVIPFSSELKWVDSAISGIDTA 409 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + + D G T I ++ A + + + + K IVL+TDGE+ + Sbjct: 410 NLISRIKDTEAHGKTNIYAPVEHAIEIL---------KDFDADVYTKSIVLMTDGESAGN 460 Query: 383 NEEGIAICNKAKSQGIRIMTIAFS-VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 +G + I + +I F N Q ++ L F Sbjct: 461 FRKGTSY-------DIPVFSIMFGEANPKQLDEISRLTKGKTFDGRI-------GLINAF 506 Query: 442 RDRIG 446 ++ G Sbjct: 507 KEIRG 511 >gi|34481890|emb|CAE46493.1| trap [Plasmodium falciparum] Length = 331 Score = 40.4 bits (92), Expect = 0.71, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+++ Sbjct: 72 AMKLIQQLNL---NESAIHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENASQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|326798146|ref|YP_004315965.1| LuxR family transcriptional regulator [Sphingobacterium sp. 21] gi|326548910|gb|ADZ77295.1| two component transcriptional regulator, LuxR family [Sphingobacterium sp. 21] Length = 211 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 12/72 (16%), Positives = 24/72 (33%), Gaps = 2/72 (2%) Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL 437 + + GI +C + K I I N ++ L N A + S E+ Sbjct: 58 DINLPDISGIDLCKQIKQLHPHIHIIGI-SNYNERSMITKMLQNGA-NGYVLKNASAEEI 115 Query: 438 NKIFRDRIGNEI 449 + + ++ Sbjct: 116 INAIQSVVNKKL 127 >gi|308472901|ref|XP_003098677.1| hypothetical protein CRE_04172 [Caenorhabditis remanei] gi|308268277|gb|EFP12230.1| hypothetical protein CRE_04172 [Caenorhabditis remanei] Length = 392 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 65/223 (29%), Gaps = 16/223 (7%) Query: 230 SPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV-----IRS 284 L Y+ P +D S V + L A + Sbjct: 12 VSVYADTYSPLSYVDRPCGMDLSNLWLDVVVVVDNSQGMTNNGLDNVAANILNVFSSGTR 71 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I + T R+G +N + + ST + A Sbjct: 72 IGSNSSEPRTTRLGLVTYNSNATQQADL------NKYQSINDAGNGIFDSLSTVVYTADS 125 Query: 345 TAYDTII--SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + E K+ +++ N + +++ N+ K ++I+T Sbjct: 126 YLATGLTLAEKMFIEQSVNTIRGHYKRVVIVYASEYNEDGELDPLSVANRLKLSNVKIIT 185 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 +A+ + LS ASP F +N+ +++ ++ + Sbjct: 186 VAYEQPGSV--GLENGLSQIASPG-FSFSNNVVNISQEIQNAL 225 >gi|297699669|ref|XP_002826908.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Pongo abelii] Length = 1273 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 40/115 (34%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D+I +S+ + K +V+LTDG + Sbjct: 356 ASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------SKVMVVLTDGGIFE 409 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + + N K QG+ I + + + L+ AS F+ + Sbjct: 410 DPLDLTTVINSPKMQGVERFAIGVG-EEFKSARTERELNLIASDPDETHAFKVTN 463 >gi|88857994|ref|ZP_01132636.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] gi|88819611|gb|EAR29424.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] Length = 974 Score = 40.4 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 47/176 (26%), Gaps = 23/176 (13%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + ++ K I++ + +D ++ D E Sbjct: 166 VNYAKNIADRDKDIEDNSSYDSPFKPKKDD-----SGLDILRCDNSINMIIMTDGDPTED 220 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G N ++ +S + K+ + N + + A+ Sbjct: 221 GGQNGN------IKSLYNSKYGAYPKSKSGSYLNSLAKYM---LNVDLFPTTSGVTDIAR 271 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASP--NSFFEANSTHELNKIFRDRIGNE 448 + TI F + + L AS + A++ +L + + I Sbjct: 272 T-----FTIGFGSGMS--DDGLDLLKQTASDGGGEYLLASTAEQLTEALKKTITKI 320 >gi|269128868|ref|YP_003302238.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268313826|gb|ACZ00201.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 515 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 42/131 (32%), Gaps = 16/131 (12%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 R + + + + +S N ++L+TD Sbjct: 382 SGTRRQQILSSLASLQPKP--DGDTGLYDTVLAALSYMRKTYKPDMVNT-----VLLMTD 434 Query: 377 GENTQDNEEGI-AICNKAKS-----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 G N D+ + K ++ + ++++ I F ++E + A+ S Sbjct: 435 GRNDDDDGPTLRQTLAKLRAGHDPERPVQLVIIGFGDEVDRKELQQLAE---ATGGSVHF 491 Query: 431 ANSTHELNKIF 441 A + ++ IF Sbjct: 492 AKTAEDMRNIF 502 >gi|115680671|ref|XP_001197987.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] gi|115719718|ref|XP_001196973.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] Length = 528 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 48/150 (32%), Gaps = 19/150 (12%) Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F + R + + G+T I +++ + S + Sbjct: 148 VVFESASREIAELTDITDTASRQRLVNALNT-SPKGTTCIGCGIKSGLKVLGSYAQGG-- 204 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 YI+LL+DG +D + + + G+ I TI + ++ L Sbjct: 205 ----------YILLLSDGVENEDPSITD-MYDDINNSGVIIDTITI---SNEADQQMEDL 250 Query: 420 SNCASPNSFFEANSTHE--LNKIFRDRIGN 447 S S S F +++ L F+ I Sbjct: 251 SKNTSGKSSFCSDTGIGPCLIYAFQSTITE 280 >gi|114652501|ref|XP_001171019.1| PREDICTED: coagulation factor C homolog, cochlin isoform 4 [Pan troglodytes] Length = 569 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 431 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 482 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 483 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 533 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 534 KPKESHAFFTREFTGLEPIVSDVI 557 >gi|296273948|ref|YP_003656579.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296098122|gb|ADG94072.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 592 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 68/215 (31%), Gaps = 20/215 (9%) Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 N S ++ + + L +++ + + + +D+ Sbjct: 35 NQDVLNRLTIPNSNLTKKSRNILLFFALISMVIALARPVTNEKEQSINQEVIPVVVAIDA 94 Query: 262 SSLR-HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 S ++ A V+ IK + + +G F + L Sbjct: 95 SKSMLAQDVFPSRLKMAKKKVLDLIKA----PNQLSVGVIIFGQSSFILSPLTNDFTSLD 150 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +++ F + N + I A++ + + + K I+ LTDG N Sbjct: 151 -FLLQNFDYNLNINNGSNIFSALEASNKLLKN-------------YKSKNIIFLTDGGNE 196 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 D ++ I NK + I I+TIA + K Sbjct: 197 GDYKKEIEYANK-NNLNIYIITIATNKPTPIPSKD 230 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 69/200 (34%), Gaps = 25/200 (12%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 + + K+ V A +++++ D + F++ F+ V +L Sbjct: 93 DTSASMSNKRDRVNSAALAMVKASNPED------EVFVISFSEEAFITQDFTSDVKQLES 146 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ K G TA+ DA+ D + + + KK +V++TDGE+ Sbjct: 147 SLRK-----LGSKGETAMRDALSLGLDHLRAPARKD----------KKVLVVITDGEDNS 191 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS----NCASPNSFFEANSTHEL 437 ++ + A + I I + R S A+ + + ++ Sbjct: 192 SIQKQENLIRAAHLSNVIIYGIGLLAAEAPASAQRAKASLDVLTLATGGRSWYPENVADI 251 Query: 438 NKIFRDRIGNEIFERVIRIT 457 KI + + VI T Sbjct: 252 EKITPEIAHEIRNQYVIGYT 271 >gi|114652499|ref|XP_001171038.1| PREDICTED: coagulation factor C homolog, cochlin isoform 5 [Pan troglodytes] Length = 594 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 456 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 507 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 508 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 558 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 559 KPKESHAFFTREFTGLEPIVSDVI 582 >gi|10048261|gb|AAG12328.1|AF249739_1 sporozoite surface protein 2 [Plasmodium falciparum] Length = 559 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 39/138 (28%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+++ Sbjct: 72 AMKLIQQLNL---NESAIHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T ++DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLSDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|74224199|dbj|BAE33710.1| unnamed protein product [Mus musculus] Length = 552 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 43/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ A G TA DA+ + + Sbjct: 414 TYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVF--------GPIRD 465 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 466 SPNKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLRDMAS 516 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 517 KPKESHAFFTREFTGLEPIVSDVI 540 >gi|148258228|ref|YP_001242813.1| hypothetical protein BBta_7023 [Bradyrhizobium sp. BTAi1] gi|146410401|gb|ABQ38907.1| hypothetical protein BBta_7023 [Bradyrhizobium sp. BTAi1] Length = 184 Score = 40.0 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 15/176 (8%), Positives = 40/176 (22%), Gaps = 8/176 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + G A ++P+ML + V+ A + Sbjct: 1 MGDRLLRRLGAFGADKRGIAATEFAFIVPLMLVMFFGTVEFCSGI--------AVDRKVT 52 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + L + +S + + + + T+ V V Sbjct: 53 LMARTLSDLTSQSTSVGDSDMSNFFAASTGIMYPYSTTPVNATITELYVDPKTMQATVMW 112 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + + L +L + + T + + + Sbjct: 113 SKGSAPRSSGTTVGVPADLLVSGTYLIFSEVNYQYVPTIGYVMAKTGIKLSDVAYT 168 >gi|281417929|ref|ZP_06248949.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] gi|281409331|gb|EFB39589.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] Length = 1050 Score = 40.0 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 42/341 (12%), Positives = 96/341 (28%), Gaps = 37/341 (10%) Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 L++N F+ ++ I E V+ G ++ + + Sbjct: 589 GAGTLVINSSMYFMGNVLISLTNTNNVGEGFIVADGNIIIQGQNLYPNGPNDKLYVYSIG 648 Query: 195 DSEG--QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + A + +GK+ +K + L + ++ + Sbjct: 649 GNIEFQTSNSTINGIAYAPGNPANPNSGKIFFSGDKNTINGSIAANELDFFAGGLVVNHT 708 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + V+ + LV+DA + + M ++D + Sbjct: 709 EGQFDTVEEKYIDKSTY-LKLVKDAAKNFVDKFAGSKTK-----MAVIQYSDSANDNDFK 762 Query: 313 SWGVHKLIR-TIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNE------------DE 357 + + + +K G + + D M+ AY + Sbjct: 763 KYDLSLPDKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNDPPPKGQISKYIVVITGS 822 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CNKAKS-------QGIRIMTIAFSVN 408 V ++ KK +G + + + AK +GI ++ I FS Sbjct: 823 VPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEE 882 Query: 409 KTQQEKARYFLSNCA-----SPNSFFEANSTHELNKIFRDR 444 + A + +++AN+ EL I + Sbjct: 883 DIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNM 923 >gi|156743748|ref|YP_001433877.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235076|gb|ABU59859.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 936 Score = 40.0 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 43/375 (11%), Positives = 95/375 (25%), Gaps = 33/375 (8%) Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 +++ + + + R + + + + + + Sbjct: 222 ESNTSVAGRLMVTGPDRTTVATIPVEIGTDRQTVEILLPEAPAGFNRYTAYLEVPNDART 281 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 N + F G L+ +A + +S I+ + S+ + Sbjct: 282 QNNAIETFSYVRGTPRVLLVAQAPDDAISLERALR-AARIEVTVVSPASIPATFGELIRY 340 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 + + Q +RD MV +S + G + Sbjct: 341 DAIALINVPRALFSNETVQRIAAYVRDFGGGLLMVGGPQSFGPGGWRGTPVEAALPVTMD 400 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN--------------DTVRMGATFFNDR 305 R +V D S+ + I ++ D + F+DR Sbjct: 401 IPERQRQPPVSIVVVIDISGSMAATEDGIPKLSLALEGARRIAALLRDEDELTVIPFDDR 460 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNN 364 G R + G I+DA++ A +S Sbjct: 461 PGVIVGPLPGS---RRDVAIEQLNQVRLGGSGINIHDALRVAARYTRASER--------- 508 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 ++I+ +TDG +T E + I +G+ + ++A Sbjct: 509 --PVRHIITITDGNDTTQQEGALDIVRSLHDEGVTLTSVAIGQGDHVPFIRDM---AAVG 563 Query: 425 PNSFFEANSTHELNK 439 F ++ Sbjct: 564 GGRTFLTERAADVPD 578 >gi|229550752|ref|ZP_04439477.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] gi|229315887|gb|EEN81860.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] Length = 898 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 44/396 (11%), Positives = 96/396 (24%), Gaps = 43/396 (10%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + V++ A T+ Y +++ ++N ++ D V Sbjct: 24 MLTGLVTSGSSVVTAAANIRPTYQTDANGTYPTNSWQVTGQQNVINQRGGDQVSGWDNNT 83 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH-GVSIQW 181 A S N +R ++ V + + V I Sbjct: 84 TWDGDATNTTNSYLKFGDPNKPDYQIRKYAKETNTPGLYDVYLNVKGNTQQNVKPVDIVL 143 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 V+D S SM + G T + V + S KS Y Sbjct: 144 VVDMSGSMESNSSGTNRAGAVRTGVKNFLTSIQNAGLGNYVNVGLIGFSSPGYIGGKSGY 203 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 + G + ++ ++ + +R G+ Sbjct: 204 ISVKLGKAGNASQQQAIN-------------------GALSPRFQGGTYTQIGLRQGSAM 244 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG---STAINDAMQTAYDTIISSNEDEV 358 N + + + T E G T + +T Sbjct: 245 LNADASGNKKMMILLTDGVPTFSNEVINSEWINGTLYGTNFGSSRDEPGNTAQLGWPYID 304 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + + KAK G + + + ++ + Sbjct: 305 SSGNRIYDTW---------------PATLGEAKKAKDSGNEVHALGIQLADDRKYMTKEK 349 Query: 419 LSNC-----ASPNSFFEANSTHELNKIFRDRIGNEI 449 + SP+ + +A+S + ++ + I Sbjct: 350 IRQNMQLITNSPDLYEDADSADAVEAYLNNQAKDII 385 >gi|308472959|ref|XP_003098706.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] gi|308268306|gb|EFP12259.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] Length = 399 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 49/143 (34%), Gaps = 10/143 (6%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGSTAINDAMQTAYD 348 + T R+G +N + V + + + + ++ AYD Sbjct: 84 TEHRTTRVGLITYNAEATQIADLNVLQSFFNLTNHVNSSLAEVSNSTWSFDKVGLKAAYD 143 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + + K I+ +D + + + + + K G++I+T+ + Sbjct: 144 LLQNQ---SFPPNSRSHYQKVVILFASDSQAQNSEELDPYPMDYQLKDAGVKIVTVGYG- 199 Query: 408 NKTQQEKARYFLSNCASPNSFFE 430 E LSN +SP F+ Sbjct: 200 ----NETLLERLSNISSPEYAFD 218 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 55/178 (30%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + R G F D F+ + + A Sbjct: 122 SRLTAAKKVLRDFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQDVWL--NLLDEA 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + DA+ S + +++LTDG +T + Sbjct: 173 ETGMAGQSTNLGDAIGLGIKVFEQSPS---------TSQDQIMLVLTDGNDTGSFVSPVD 223 Query: 389 ICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 A ++GIRI IA + +Q +S +S F A +LN+ ++ Sbjct: 224 AAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVSRVSSLTQARSFVAIDQSQLNEAYQ 281 >gi|220924566|ref|YP_002499868.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219949173|gb|ACL59565.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 128 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 38/130 (29%), Gaps = 4/130 (3%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 ++K G I A LMP+++ + +V+ + +A Q S + + Sbjct: 1 MLKDQDGAVAIELAFLMPILIFILMAVVEFGLIFFT----YEAEQRVTWDVSRQVAANRI 56 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 S + + + L V + + +VL Sbjct: 57 SPSDAPAKVRDGLPGWVGDRSVTVIGPTLANGQYTMTVSLPASAASPTSILARFYGGLVL 116 Query: 134 SSRYDLLLNP 143 S+ + P Sbjct: 117 STTVVMQKEP 126 >gi|149773087|emb|CAO01893.1| collagen type VI alpha 5 [Mus musculus] Length = 620 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 41/112 (36%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G I A+Q A + + + K I+ Sbjct: 46 TYDNQSIMKNYIYTSLQQL--NGDATIGLALQWAMEGLFLGTPNPR--------KHKVII 95 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG + I+ R + AS Sbjct: 96 VISAGENHEEKEFVKTVALRAKCQGYVVFVISLG------STQRDEMEELAS 141 >gi|149773085|emb|CAO01892.1| collagen type VI alpha 5 [Mus musculus] Length = 1171 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 41/112 (36%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G I A+Q A + + + K I+ Sbjct: 1020 TYDNQSIMKNYIYTSLQQL--NGDATIGLALQWAMEGLFLGTPNPR--------KHKVII 1069 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG + I+ R + AS Sbjct: 1070 VISAGENHEEKEFVKTVALRAKCQGYVVFVISLG------STQRDEMEELAS 1115 >gi|148689166|gb|EDL21113.1| mCG4444 [Mus musculus] Length = 860 Score = 40.0 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 41/112 (36%), Gaps = 16/112 (14%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 ++ +++ + T G I A+Q A + + + K I+ Sbjct: 284 TYDNQSIMKNYIYTSLQQL--NGDATIGLALQWAMEGLFLGTPNPR--------KHKVII 333 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ GEN ++ E + +AK QG + I+ R + AS Sbjct: 334 VISAGENHEEKEFVKTVALRAKCQGYVVFVISLG------STQRDEMEELAS 379 >gi|332846919|ref|XP_003315346.1| PREDICTED: integrin alpha-E [Pan troglodytes] Length = 1241 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D+I +S+ + K +V+LTDG + Sbjct: 326 ASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------SKVMVVLTDGGIFE 379 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + N K QG+ I + + + L+ AS F+ + Sbjct: 380 DPLNLTTVINSPKMQGVERFAIGVG-EEFKSARTARELNLIASDPDETHAFKVTN 433 >gi|332019999|gb|EGI60451.1| Voltage-dependent calcium channel subunit alpha-2/delta-3 [Acromyrmex echinatior] Length = 688 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 61/223 (27%), Gaps = 33/223 (14%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV-IRSIKK 287 + S +++ +S R+ + L ++ Sbjct: 119 TVRDVYDFRMSNWFVGAANSPKDLAILIDSATYTSDRNRRLTIATTKVVLDTLGPDDYIN 178 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 I + F D ++ + K+ + +K T I+ A+ TA+ Sbjct: 179 IYRYGENGEEIVQCFKDSLVQASPENIHEMKVALSSLKHEDTV------TNISAALSTAF 232 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-----IRIMT 402 + + N N +++ +D + K +RI T Sbjct: 233 EILHKYNRSSQGSQCNQA----IMLITSDTDGPP--------AEVIKRYNWPHMPVRIFT 280 Query: 403 --IAFSVNKTQQEKARYFLSNCASPNSFFEANSTHEL-NKIFR 442 I + Q C + + + E+ NK+F Sbjct: 281 YLIGGDKSLDLQNT------ACTTKGFYARITNADEIHNKVFE 317 >gi|308472943|ref|XP_003098698.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] gi|308268298|gb|EFP12251.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] Length = 416 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 48/149 (32%), Gaps = 10/149 (6%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI--N 340 I T R+G +N D L F + + +T Sbjct: 71 TRIGSNSTEPRTTRVGLVTYNSAAKLDADL-NKFQDLDGLYNGVFKDLSDVVDTTDSFLA 129 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + A + + S + + KK I++ + + + N+ K G+ I Sbjct: 130 TGLNAAEELLQSQSLNT-----TRDHYKKVIIVYASEYKGSGELDPVPVANRLKGSGVVI 184 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFF 429 +T+A+ + L+N ASP + Sbjct: 185 VTVAYDQGGDEGLLRD--LANIASPGFAY 211 >gi|167836682|ref|ZP_02463565.1| hypothetical protein Bpse38_09361 [Burkholderia thailandensis MSMB43] Length = 137 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 16/120 (13%), Positives = 30/120 (25%), Gaps = 11/120 (9%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G + L+ P + V +VDV + + A++ A Sbjct: 2 RRFG--QRGVVALEFVLVFPFFMLVLFGIVDVSLLLCDKAIITNASREAARAGV------ 53 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 V+ + + Q Y N V T+ Y Sbjct: 54 ---VACVPPMTASAISQVALNYAQNNLVTGGTATTPVVAVDQSAGTTSGSPLKVTVTYTY 110 >gi|148728188|ref|NP_002199.3| integrin alpha-E precursor [Homo sapiens] gi|226694184|sp|P38570|ITAE_HUMAN RecName: Full=Integrin alpha-E; AltName: Full=HML-1 antigen; AltName: Full=Integrin alpha-IEL; AltName: Full=Mucosal lymphocyte 1 antigen; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor Length = 1179 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D+I +S+ + K +V+LTDG + Sbjct: 264 ASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------SKVMVVLTDGGIFE 317 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + N K QG+ I + + + L+ AS F+ + Sbjct: 318 DPLNLTTVINSPKMQGVERFAIGVG-EEFKSARTARELNLIASDPDETHAFKVTN 371 >gi|119610886|gb|EAW90480.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] Length = 1196 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D+I +S+ + K +V+LTDG + Sbjct: 281 ASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------SKVMVVLTDGGIFE 334 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + N K QG+ I + + + L+ AS F+ + Sbjct: 335 DPLNLTTVINSPKMQGVERFAIGVG-EEFKSARTARELNLIASDPDETHAFKVTN 388 >gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 797 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 46/171 (26%), Gaps = 15/171 (8%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I +N R F+D + R + G T + A+Q A Sbjct: 384 AISKLNPDDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLTADGGTEMLPALQAA 443 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 444 ---LRNQGPVATGAL-------RQVVFLTDGAIGNERQLFQEI--TANRSDARVFTVGIG 491 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 + +F ST ++ + + IT Sbjct: 492 SAPNTYFMTKAAE---IGRGTFTAIGSTDQVASRMGELFAKLQNPAMTDIT 539 >gi|4406708|gb|AAB59359.2| integrin alpha E precursor [Homo sapiens] gi|109659254|gb|AAI17208.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|109730475|gb|AAI13437.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|313883908|gb|ADR83440.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1 [synthetic construct] Length = 1179 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D+I +S+ + K +V+LTDG + Sbjct: 264 ASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------SKVMVVLTDGGIFE 317 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + N K QG+ I + + + L+ AS F+ + Sbjct: 318 DPLNLTTVINSPKMQGVERFAIGVG-EEFKSARTARELNLIASDPDETHAFKVTN 371 >gi|7239181|gb|AAF43107.1| HUMINAE [Homo sapiens] Length = 1127 Score = 40.0 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 39/115 (33%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D+I +S+ + K +V+LTDG + Sbjct: 212 ASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------SKVMVVLTDGGIFE 265 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + N K QG+ I + + + L+ AS F+ + Sbjct: 266 DPLNLTTVINSPKMQGVERFAIGVG-EEFKSARTARELNLIASDPDETHAFKVTN 319 >gi|322514175|ref|ZP_08067240.1| Flp pilus assembly protein TadG [Actinobacillus ureae ATCC 25976] gi|322119961|gb|EFX91963.1| Flp pilus assembly protein TadG [Actinobacillus ureae ATCC 25976] Length = 47 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 7/36 (19%), Positives = 14/36 (38%) Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + K+ +C +++ A L F+ IG Sbjct: 2 DKSKSADAWKHCVGDRNYYSAKDKESLLNSFKQAIG 37 >gi|226349296|ref|YP_002776410.1| hypothetical membrane protein [Rhodococcus opacus B4] gi|226245211|dbj|BAH55558.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 153 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 14/89 (15%), Positives = 27/89 (30%), Gaps = 1/89 (1%) Query: 16 KSCTGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G ++T + PV++ G++VD + A+ A A +I S+ Sbjct: 17 HRDKGAVTLMTVVFAPVLIFFVWGLIVDGGGMLTADQRADNVAEDAARAAGQQIIGSVSA 76 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLK 103 + YL + Sbjct: 77 RGIGTVVDPVRATAAAKRYLFEAGVDGDV 105 >gi|52840865|ref|YP_094664.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52627976|gb|AAU26717.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 1172 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 40/402 (9%), Positives = 88/402 (21%), Gaps = 25/402 (6%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + A + + + ++ Y + +N T V + Sbjct: 144 AGVKAIIENYMPTTDFALGTYSTSNISTYNTWVYYMSPPGSNFVFTNTPVAGNRYVINPC 203 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 S +S L L + + I + S VS Sbjct: 204 YNYGSASSTVSSNCASIGSLYGTSLVSSSQYLQIGGASDDPTINDVLYAGSGFPGVFVSF 263 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + S N + S G S ++ + Sbjct: 264 NGPSPATP-YPPNFSLSNYNNGNVLLSYSSTRPSIGSFATGPTNAGFVPFSQQVMYVQRG 322 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y ++ ++ V +A+ + + ++ ++ Sbjct: 323 FGYYSNQSFNTGNMRVNMTTAGTNPTTT-----SVTNAINAFLPHLR---PETNSTSTTE 374 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++T+ T + I+D + T Sbjct: 375 IKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIILISDGLPT-----QDLQARYWP 429 Query: 360 RMKNNLEAKKYIV--LLTDGENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQQE 413 + + + DG N + ++ N K+ G+ I Sbjct: 430 PLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEINALKNDGVLTFVIGMGAGVDPAV 489 Query: 414 KARYF--LSNCA---SPNSFFEANSTHELNKIFRDRIGNEIF 450 L A +++ A S L + N Sbjct: 490 NPEAAATLRAMAVAGGTENYYPATSPQALVNSLNSILSNIQN 531 >gi|226365428|ref|YP_002783211.1| hypothetical protein ROP_60190 [Rhodococcus opacus B4] gi|226243918|dbj|BAH54266.1| hypothetical protein [Rhodococcus opacus B4] Length = 548 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 13/114 (11%), Positives = 41/114 (35%), Gaps = 11/114 (9%) Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGI----AICNKAK 394 + + + + ++++ + + +++LTDG N + I + Sbjct: 433 EGLIGGGTGLYDTTLAAFRTVQDSYDPRAVNSVIILTDGANEDPDSITKEQLLGILEREM 492 Query: 395 S--QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 + + I+TI + + A + + S + A ++ +F + + Sbjct: 493 DPARPVIIVTIGITDDADAATLAEI---SRVTGGSSYVAKDPADIANVFVNALA 543 >gi|150377239|ref|YP_001313834.1| TadE family protein [Sinorhizobium medicae WSM419] gi|150031786|gb|ABR63901.1| TadE family protein [Sinorhizobium medicae WSM419] Length = 141 Score = 40.0 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 15/108 (13%), Positives = 35/108 (32%), Gaps = 4/108 (3%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 + L ++ +G + AL+ +L + +++ R Y + L AA A + ++ Sbjct: 8 IRSLRRNQSGATAVEFALVCLPLLLLVIGVIEFGRAFYVRNELSHAADVA----ARRVLI 63 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 ++ + + E L + V I Sbjct: 64 GQIAHNASDSEAQAKLDSAVRESFRSGDPTLLTIAVSKETVDGIAFRV 111 >gi|239917284|ref|YP_002956842.1| hypothetical protein Mlut_07560 [Micrococcus luteus NCTC 2665] gi|281414238|ref|ZP_06245980.1| hypothetical protein MlutN2_03378 [Micrococcus luteus NCTC 2665] gi|239838491|gb|ACS30288.1| hypothetical protein Mlut_07560 [Micrococcus luteus NCTC 2665] Length = 152 Score = 40.0 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 35/138 (25%), Gaps = 4/138 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + +L G ++T L V++ + +++ V L+ A +A I + Sbjct: 3 RRLTARLGHD-DGQTTVLTVGLCAVLVALMLVMLAVTTVQLQHRRLQSLADSAAIAGAEE 61 Query: 68 LI---QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 L V + + + L ++ Sbjct: 62 LGFRLGEDPGVVLSDDDVAASASAHLAAVGAQEAVPGLGGMSARVAEDGTTVVVTLDARA 121 Query: 125 RKSAYQVVLSSRYDLLLN 142 A + L + Sbjct: 122 DLFAMTGPFAGTLPLSVP 139 >gi|304411390|ref|ZP_07393004.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|307305288|ref|ZP_07585036.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] gi|304350245|gb|EFM14649.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|306911591|gb|EFN42016.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] Length = 771 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 51/180 (28%), Gaps = 20/180 (11%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A +++ ++K + FN + + + + F Sbjct: 406 DSIVQAKNALLYALKGLKPE---DSFNIIEFNSSLSQFSATPLPATSSNLSRARQFVSRL 462 Query: 332 NEMGSTAINDAMQTAY-DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T + A+ A ++ S + D V + ++ +TDG + I Sbjct: 463 QADGGTEMALALDAALPKSLGSVSPDAVQ-------PLRQVIFMTDGSVGNEQALFDLIR 515 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + R+ T+ F+ A +F E++ + Sbjct: 516 YQIGES--RLFTVGIGSAPNSH-----FMQRAAELGRGTFTYIGKVDEVDAKISALLSKI 568 >gi|311744035|ref|ZP_07717841.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311313165|gb|EFQ83076.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 139 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 7/42 (16%), Positives = 17/42 (40%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 ++ G I LL+P+++ ++V R+ + Sbjct: 6 RRRRGDERGSMAIEVVLLVPILVMFLLLVVAAGRYVTVRSDI 47 >gi|242061830|ref|XP_002452204.1| hypothetical protein SORBIDRAFT_04g021720 [Sorghum bicolor] gi|241932035|gb|EES05180.1| hypothetical protein SORBIDRAFT_04g021720 [Sorghum bicolor] Length = 369 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 12/87 (13%), Positives = 26/87 (29%), Gaps = 7/87 (8%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 ++L++DG+ K + T F + ++ +F Sbjct: 20 VMLMSDGQQNHGGN-----AADVKIGNAPVYTFGFGADYDPTVLNAVARNSM--GGTFSV 72 Query: 431 ANSTHELNKIFRDRIGNEIFERVIRIT 457 N +L F + + V +T Sbjct: 73 VNDVDKLTMAFSQCLAGLLTVLVQDLT 99 >gi|54296651|ref|YP_123020.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] gi|53750436|emb|CAH11830.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] Length = 1169 Score = 40.0 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 40/402 (9%), Positives = 88/402 (21%), Gaps = 25/402 (6%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + A + + + ++ Y + +N T V + Sbjct: 141 AGVKAIIENYMPTTDFALGTYSTSNISTYNTWVYYMSPPGSNFVFTNTPVAGNRYVINPC 200 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 S +S L L + + I + S VS Sbjct: 201 YNYGSASSTVSSNCASIGSLYGTSLVSSSQYLQIGGASDDPTINDVLYAGSGFPGVFVSF 260 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + S N + S G S ++ + Sbjct: 261 NGPSPATP-YPPNFSLSNYNNGNVLLSYSSTRPSIGSFATGPTNAGFVPFSQQVMYVQRG 319 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 Y ++ ++ V +A+ + + ++ ++ Sbjct: 320 FGYYSNQSFNTGNMRVNMTTAGTNPTTT-----SVTNAINAFLPHLR---PETNSTSTTE 371 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++T+ T + I+D + T Sbjct: 372 IKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIILISDGLPT-----QDLQARYWP 426 Query: 360 RMKNNLEAKKYIV--LLTDGENTQDNEEGIAIC----NKAKSQGIRIMTIAFSVNKTQQE 413 + + + DG N + ++ N K+ G+ I Sbjct: 427 PLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEINALKNDGVLTFVIGMGAGVDPAV 486 Query: 414 KARYF--LSNCA---SPNSFFEANSTHELNKIFRDRIGNEIF 450 L A +++ A S L + N Sbjct: 487 NPEAAATLRAMAVAGGTENYYPATSPQALVNSLNSILSNIQN 528 >gi|198420433|ref|XP_002131388.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 438 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 25/232 (10%), Positives = 64/232 (27%), Gaps = 27/232 (11%) Query: 207 PADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + ++ + + + + N S P + SS++ Sbjct: 194 NQPGSHCTFECTDHEYDVYPAGKNRNHCLSNGS----WSESPPCCATDIVTLYHSSNIHA 249 Query: 267 V-----IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 I + A+A + + FN+ + Sbjct: 250 YEHMVRIAFLSAEQTAIAGDAVKYAGLYYSGEVNMESIVHFNEYNDDKLTMQGLF----- 304 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 ++ ++ AM+ A + + + E + ++++TD EN+ Sbjct: 305 --MQRVIMNNTADRKVNTGAAMEFARTHMFNQSNGVRG------EVPRILMVITD-ENST 355 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 D+ K +++G+ I + + + R L A + Sbjct: 356 DDVL--EPAEKLRAEGVLTYAIGLA--EDGKILDRDQLMQIAGEDDHLFNPD 403 >gi|170571356|ref|XP_001891697.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158603658|gb|EDP39502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 319 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 46/113 (40%), Gaps = 6/113 (5%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 R+ F+ + F+ + + IV E+ G+TA+ + ++ Sbjct: 204 GRRFSRIALITFSSVGKTRTQFNLDRYFNGKDIVTAIRRLESSGGTTAVGEGIRLGI--- 260 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 +D+ H + N AKK +++ TDG + + + + AK G + T+ Sbjct: 261 --EQKDKQHGGRPNEIAKKAMLVFTDGWSNKG-PDVEEMSRNAKGAGFTLYTV 310 >gi|57163755|ref|NP_001009219.1| integrin alpha-E [Felis catus] gi|20530606|gb|AAM27173.1|AF420018_1 alpha E integrin [Felis catus] Length = 1160 Score = 40.0 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 47/172 (27%), Gaps = 14/172 (8%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + A + +K + + + ++ L + Sbjct: 203 DPPDFQKAKDFISNMMKNFYAKCFECNFALVQYGEVIQTEFDLRDSQDALAS--LARVQN 260 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 T A+Q D I + + A K IV++TDG+ D + Sbjct: 261 ITQVKNVTKTASAIQHVLDNIFT------PSHGSRKNAPKVIVVITDGDIFGDPLNLTTV 314 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANSTHELN 438 + K QG+ I + K L AS F + L+ Sbjct: 315 ISSPKMQGVERFAIRVG---NESTKTLKELKLIASDPFERHAFTVTNYSALD 363 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 62/195 (31%), Gaps = 28/195 (14%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + S+S + ++++ + +++D ++R +K I NV+ + + + Sbjct: 200 SFVFDKSGSMSWDLNGNNTNYWGPKSRMSILQDKATIMMRDLKDIGNVSVNLVSFSILGS 259 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 G T ++ G T D ++ ++ N Sbjct: 260 YVQKDFSELDKGT-----TTIEASINALQTGGVTNPGDGLRYGMMSLQ-----------N 303 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM------TIA----FSVNKTQQE 413 + KY+VLLTDG + +++ + T+ F+ Sbjct: 304 HSAQLKYVVLLTDGIPNAYTVDTND--TSWRNRNVHPYYNRWRETVGELVTFNNGPYDVT 361 Query: 414 KARYFLSNCASPNSF 428 N S +S+ Sbjct: 362 TNLTTDQNRVSYDSY 376 >gi|271963054|ref|YP_003337250.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] gi|270506229|gb|ACZ84507.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] Length = 594 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 34/80 (42%), Gaps = 9/80 (11%) Query: 371 IVLLTDGENT-QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNS 427 +V LTDG+N + + K + +R+ TI + + L A+ + Sbjct: 520 VVFLTDGKNEKTGGSDLDNLLGKL-NPDVRLFTIGYGEGAD-----QGVLKRIAEATDGA 573 Query: 428 FFEANSTHELNKIFRDRIGN 447 ++++ ++++F I N Sbjct: 574 AYDSSRADTIDQVFTSVISN 593 >gi|239825644|ref|YP_002948268.1| hypothetical protein GWCH70_0060 [Geobacillus sp. WCH70] gi|239805937|gb|ACS23002.1| conserved hypothetical protein [Geobacillus sp. WCH70] Length = 244 Score = 40.0 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 31/72 (43%), Gaps = 2/72 (2%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 + I+L+TDG + + IA+ + AK QGI + I T E + A S Sbjct: 7 RQILLITDGCSNHGE-DPIAMASLAKEQGITVNVIGVLDQDTIDESGLREIEGIALSGGG 65 Query: 428 FFEANSTHELNK 439 + +L++ Sbjct: 66 ISQVVYAKQLSQ 77 >gi|256420212|ref|YP_003120865.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035120|gb|ACU58664.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 212 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 51/172 (29%), Gaps = 17/172 (9%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +I ++++ +T + F+ + + Sbjct: 21 IEAVKNGVQVLISTLRQDPYALETAFLSLITFDSDARQLVPLTDLSS--------FQMPE 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G T++ A++ D+I +K + + + L+TDG T + G+ Sbjct: 73 LKASGGTSLGSALELVADSINREVAKSTPDVKGDWKP--LVFLMTDGIPTDTWQNGLNAF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFR 442 K GI + L + + + F+ Sbjct: 131 QNTK-IGITVAC------AAGNGADVNLLKQITNTVVSLDTADAATIKAFFK 175 >gi|114684811|ref|XP_531503.2| PREDICTED: collagen, type VI, alpha 1 isoform 3 [Pan troglodytes] Length = 997 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 56/150 (37%), Gaps = 23/150 (15%) Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + ++ R +V T + A++ + ++ ++L+ KY Sbjct: 72 NLRDRYYRCDRNLVVDAVKYF--GKGTYTDCAIKKGLEQLL--------VGGSHLKENKY 121 Query: 371 IVLLTDGENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 ++++TDG + +E N+AK G+++ ++A + LS A+ + Sbjct: 122 LIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAI-----TPDHLEPRLSIIATDH 176 Query: 427 SF---FEANSTHELNKIFRDRIGNEIFERV 453 ++ F A + + I I V Sbjct: 177 TYRRNFTAADWGQSRDA-EEAISQTIDTIV 205 >gi|126173305|ref|YP_001049454.1| type IV pilin biogenesis protein [Shewanella baltica OS155] gi|125996510|gb|ABN60585.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS155] Length = 1168 Score = 40.0 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 42/400 (10%), Positives = 88/400 (22%), Gaps = 17/400 (4%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + +L E+ F E N +DI + Sbjct: 106 ASLNGCSQSKAALAELGRFTGYIREFTASGTTGIWKELPETNGTAISVVDCWQDISKT-- 163 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 N + + + A A + + V++ Sbjct: 164 DPNNSSSYSNGFAANGLTTGSGKKKKAYPYFTSSSPGTSWADALAAANNTDFGVGQPVTL 223 Query: 180 QWV-------IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + + + T + + Sbjct: 224 YTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDFGLAVFNYNYPNEGNRD 283 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + M + ++ + ++ R ++ D Sbjct: 284 GGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRYFAGKGVKY-GHGDTDY 342 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ V S+ + D N+ + + Sbjct: 343 GSYVGNNPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDKNANNDVISLTA--SG 400 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 S E NL+ Y+ L D + N + Q +R TI FS Sbjct: 401 SKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQVQNVRTYTIGFSKGAEDA 460 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 L+ A +F A ++ EL D + N + Sbjct: 461 AP---LLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|300788143|ref|YP_003768434.1| hypothetical protein AMED_6298 [Amycolatopsis mediterranei U32] gi|299797657|gb|ADJ48032.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 534 Score = 40.0 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 15/115 (13%), Positives = 33/115 (28%), Gaps = 7/115 (6%) Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDN----EEGIAICNKAKS 395 A + S D + E + ++++TDG N + + + Sbjct: 420 ATPDGGTGLYDSVLDTYRTARREWEPGRLNLVIVMTDGRNEDPRGISRADLLTELAGLQD 479 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIF 450 I I + + L+ A+ F A ++ +F + Sbjct: 480 ARRPIPLIGVGIGPDADKAELDQLTA-ATGGQAFLAPDPAKITDVFFGALSRLAG 533 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 33/82 (40%), Gaps = 7/82 (8%) Query: 373 LLTDGENTQD---NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSF 428 TDG + N + + +K ++ T+ F T + +L AS P + Sbjct: 260 FKTDGTDAAGKSWNGDPKDPADYSKQL-VQTFTVGFGQGITP--TGKRYLQLAASRPEYY 316 Query: 429 FEANSTHELNKIFRDRIGNEIF 450 +EA+ L+K+F D + Sbjct: 317 YEADKPESLSKVFNDIVEQIKS 338 >gi|119595693|gb|EAW75287.1| collagen, type XX, alpha 1, isoform CRA_c [Homo sapiens] Length = 1329 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 207 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 264 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 265 TGLALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 315 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + + L ASP Sbjct: 316 NVFAVGV------KNADEAELRLLASPPR 338 >gi|66347353|emb|CAI95066.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595691|gb|EAW75285.1| collagen, type XX, alpha 1, isoform CRA_a [Homo sapiens] Length = 1297 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 207 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 264 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 265 TGLALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 315 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + + L ASP Sbjct: 316 NVFAVGV------KNADEAELRLLASPPR 338 >gi|112734845|ref|NP_065933.2| collagen alpha-1(XX) chain [Homo sapiens] gi|292495087|sp|Q9P218|COKA1_HUMAN RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|66347352|emb|CAI95065.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595695|gb|EAW75289.1| collagen, type XX, alpha 1, isoform CRA_e [Homo sapiens] Length = 1284 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 200 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 257 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 258 TGLALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 308 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + + L ASP Sbjct: 309 NVFAVGV------KNADEAELRLLASPPR 331 >gi|45946128|gb|AAH43183.1| Collagen, type XX, alpha 1 [Homo sapiens] Length = 1284 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 200 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 257 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 258 TGLALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 308 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + + L ASP Sbjct: 309 NVFAVGV------KNADEAELRLLASPPR 331 >gi|119595694|gb|EAW75288.1| collagen, type XX, alpha 1, isoform CRA_d [Homo sapiens] Length = 637 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 200 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 257 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 258 TGLALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 308 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + + L ASP Sbjct: 309 NVFAVGV------KNADEAELRLLASPPR 331 >gi|7959281|dbj|BAA96034.1| KIAA1510 protein [Homo sapiens] Length = 1140 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 48/149 (32%), Gaps = 17/149 (11%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 50 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 107 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 108 TGLALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 158 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + + L ASP Sbjct: 159 NVFAVGV------KNADEAELRLLASPPR 181 >gi|134299959|ref|YP_001113455.1| hypothetical protein Dred_2113 [Desulfotomaculum reducens MI-1] gi|134052659|gb|ABO50630.1| hypothetical protein Dred_2113 [Desulfotomaculum reducens MI-1] Length = 314 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 42/129 (32%), Gaps = 3/129 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 G ++ +L M V++G ++ DV + ++ L AA A + + L + Sbjct: 7 SFFSDDRGLAVVLVSLAMTVLIGAAALVTDVGLVTLVKNRLSNAADAASLAGAQELPDNP 66 Query: 73 EEVS--SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + A ++ + + + + + V + ++ Sbjct: 67 NTALSIAHAYALSNGLSEEQIDIHLVTDPTDTYYDGVKVDTHQTVDY-IMAKALGFTSAN 125 Query: 131 VVLSSRYDL 139 V ++ + Sbjct: 126 VQATATAKV 134 >gi|301766292|ref|XP_002918563.1| PREDICTED: cochlin-like [Ailuropoda melanoleuca] Length = 550 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 412 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDSP------ 465 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 466 --NKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|170743967|ref|YP_001772622.1| TadE family protein [Methylobacterium sp. 4-46] gi|168198241|gb|ACA20188.1| TadE family protein [Methylobacterium sp. 4-46] Length = 134 Score = 40.0 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 12/134 (8%), Positives = 36/134 (26%), Gaps = 1/134 (0%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ G A++ P+++ + ++ Y + A A + I + Sbjct: 2 RRFGSDERGATAAEFAVVAPILIALVMGSIEFGSIMYTLGTTEFATNDAARQLATNRITA 61 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + A ++ + + ++ + K N + Sbjct: 62 SQVAGIIALRLPSWAQASAAVTISQSSTDPNK-NQYTVTTNVPLSSATPTQFFSLIYGTK 120 Query: 132 VLSSRYDLLLNPLS 145 + + P S Sbjct: 121 SFNCTAVMQQEPTS 134 >gi|302801818|ref|XP_002982665.1| hypothetical protein SELMODRAFT_421953 [Selaginella moellendorffii] gi|300149764|gb|EFJ16418.1| hypothetical protein SELMODRAFT_421953 [Selaginella moellendorffii] Length = 2190 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 45/124 (36%), Gaps = 8/124 (6%) Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + + G T + + + ++ S D K+ +V L+DG+N + Sbjct: 2064 DKLLTFEDAGGTVYSSGIARVEEILVRSVSDPAVAGKSPA-----VVFLSDGDN-YGGLD 2117 Query: 386 GIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 + N+ K Q + TI F+ + T K A F + +L++ F D Sbjct: 2118 PVHCVNQLKKLEQSLIFHTIMFATDPTNSAKKLLTDMAAAGDGMFQVSIDEIQLSRSFED 2177 Query: 444 RIGN 447 + Sbjct: 2178 LAKS 2181 >gi|196228665|ref|ZP_03127531.1| hypothetical protein CfE428DRAFT_0695 [Chthoniobacter flavus Ellin428] gi|196226946|gb|EDY21450.1| hypothetical protein CfE428DRAFT_0695 [Chthoniobacter flavus Ellin428] Length = 921 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 23/45 (51%), Gaps = 1/45 (2%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 +VL +DG++ + + + ++G+ I T+ F + ++ A Sbjct: 340 VVLFSDGQHNEGES-PLEVAKILAARGLPIYTVGFGSDIRPRDLA 383 >gi|163738633|ref|ZP_02146047.1| TadE-like protein [Phaeobacter gallaeciensis BS107] gi|163742981|ref|ZP_02150364.1| hypothetical protein RG210_01907 [Phaeobacter gallaeciensis 2.10] gi|161383664|gb|EDQ08050.1| hypothetical protein RG210_01907 [Phaeobacter gallaeciensis 2.10] gi|161387961|gb|EDQ12316.1| TadE-like protein [Phaeobacter gallaeciensis BS107] Length = 186 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 7/44 (15%), Positives = 20/44 (45%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 G+ + A+++P L + V++ + + L++A + Sbjct: 24 EDGNATVEFAIVIPAFLFLLMNTVELGMITIQQSMLERALDQTV 67 >gi|145219383|ref|YP_001130092.1| TadE family protein [Prosthecochloris vibrioformis DSM 265] gi|145205547|gb|ABP36590.1| TadE family protein [Chlorobium phaeovibrioides DSM 265] Length = 144 Score = 40.0 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 16/140 (11%), Positives = 43/140 (30%), Gaps = 2/140 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + +++ +S G + A ++PV+L + +V Y + L A + Sbjct: 6 MNRGALRAPAREHTRSQKGSVLVEFAFILPVLLMLLFGVVYFSVALYNKTVLTMATREGA 65 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + + + + Q + +I + + + Sbjct: 66 RAGVLYVAGQTDYSGNISNAVAA--AQPLCNSVISFGSDATPVAQASVSGDILTVSASGN 123 Query: 122 MNPRKSAYQVVLSSRYDLLL 141 +VLS++ + L Sbjct: 124 YTGLYIFSGLVLSAQTSMRL 143 >gi|313212817|emb|CBY36735.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 40.0 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 47/148 (31%), Gaps = 23/148 (15%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I + + D R+ ++ D S R I+ G T Sbjct: 205 LIDYAQDSNIAADATRIAIRTYSTNSDLDFSL---NDFKTRNIISEINNLVYASGGTNTA 261 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS--QGI 398 DA+ + + E+ K +V +TDG+++ D A + K+ + I Sbjct: 262 DAITKGLNDF----------GNDRSESVKIMVTITDGQSSYD--HVKAAADLLKADPRNI 309 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPN 426 + I A L A+ + Sbjct: 310 QSFAIGI----DGANMAE--LQAIATTD 331 >gi|221213132|ref|ZP_03586108.1| membrane protein [Burkholderia multivorans CGD1] gi|221167345|gb|EED99815.1| membrane protein [Burkholderia multivorans CGD1] Length = 609 Score = 40.0 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 24/266 (9%), Positives = 70/266 (26%), Gaps = 4/266 (1%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS--LEEVSSRA 79 ++ A+ + V L V G +DV + L++ A + A + + + A Sbjct: 1 MAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQSVNDLCPQTDTTVTA 59 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 S + F+ N L++ + Sbjct: 60 SGSNAVVTAAYRGAALNGFDAQASGNSMSIACGRWDVSDYGAAAGYFGTATNQLNAVRVV 119 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + LF ++ + A+A + D + + + + + G Sbjct: 120 AAKTVPLFF-IGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSVSADQRNTGL 178 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 G + ++ + K+ + + + + Sbjct: 179 VNALLGALLNTSLSLNIGSYQALACTRVKVGDLVKAQVGAGTVDQLLATKLTLNQLVSVM 238 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSI 285 ++ + + A ++ + ++ Sbjct: 239 ATALKNTSVANLGIYSSASSAALDTL 264 >gi|260787567|ref|XP_002588824.1| hypothetical protein BRAFLDRAFT_89745 [Branchiostoma floridae] gi|229273994|gb|EEN44835.1| hypothetical protein BRAFLDRAFT_89745 [Branchiostoma floridae] Length = 1344 Score = 40.0 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 58/227 (25%), Gaps = 22/227 (9%) Query: 199 QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF 258 P N +R S +V + P + + Sbjct: 1004 NPQNGMVTCTNRLENGR-CTFTCNDGYHLLGSSQVVCGADGRWIGTLPSCAACDSAADII 1062 Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-NDRVISDPSFSWGVH 317 V V + V+ S+ R+G F R + S + + Sbjct: 1063 FSIDVSGSVSGHFDTVSTFINGVVNSLTI---GRSHARVGLIKFAGSRAETYISLTDFDN 1119 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 K + D G +I + ++ +K ++LTDG Sbjct: 1120 KYNLVSAISSVFDHYSSGEFSI-AGLSVMGHEFSTNGRSAA---------RKIGIVLTDG 1169 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 +++ ++ G I + + R L N AS Sbjct: 1170 QDSGSGYVIPD-ATALRNDGTTIFCVGV------ADVRRETLDNMAS 1209 >gi|295401970|ref|ZP_06811932.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|312109213|ref|YP_003987529.1| von Willebrand factor type A [Geobacillus sp. Y4.1MC1] gi|294975972|gb|EFG51588.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|311214314|gb|ADP72918.1| von Willebrand factor type A [Geobacillus sp. Y4.1MC1] Length = 244 Score = 40.0 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 + I+L+TDG + + IA+ AK QGI + I T E + A S Sbjct: 7 RQILLITDGCSNHGE-DPIAMAALAKEQGITVNVIGVLDQDTIDESGLREIEGIALSGGG 65 Query: 428 FFEANSTHELNK 439 + +L++ Sbjct: 66 ISQVVYAKQLSQ 77 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 40.0 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 8/143 (5%) Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + ++R+ ++I + + +G ++D Sbjct: 206 SGSMANNMSGGRVTSSNPSRISILRNRAKALIDQFSGL----GNIYVGIIPYSDDAYISG 261 Query: 311 SFSWGVHK-LIRTIVKTFAIDENEMGSTAINDAMQ---TAYDTIISSNEDEVHRMKNNLE 366 + S+ + +K G T DAM+ A S + + + + Sbjct: 262 TKSFVLANGTNVNTIKNKIDSLTAQGMTNTGDAMRVSYYATKQFKDSPNSIDNTLPTDTK 321 Query: 367 AKKYIVLLTDGENTQDNEEGIAI 389 Y++LL+DG+ T + Sbjct: 322 VIPYMILLSDGDPTVFSATTREW 344 >gi|268530438|ref|XP_002630345.1| C. briggsae CBR-CLEC-63 protein [Caenorhabditis briggsae] Length = 414 Score = 40.0 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 50/150 (33%), Gaps = 8/150 (5%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH-KLIRTIVKTFAIDENEMGSTAINDAM 343 I + R+G +N + + + + V + + + + + Sbjct: 96 IGTQYTDPRSTRLGLVTYNGQSTVVADLNAIQSIDQLYSTVFSALSSVSTSDDSYLARGI 155 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + K+ +V+ + + I + ++ K+ G+ + T+ Sbjct: 156 GAAEMVLQQG-----RNNGIRSNYKRLVVVYASAYKGEGELDPIPVADRLKASGVVVSTV 210 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANS 433 AF + + A L+N AS N + + Sbjct: 211 AFDQDGDEALLA--GLTNIASTNYAYTSKD 238 >gi|241667106|ref|YP_002985190.1| hypothetical protein Rleg_7224 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240862563|gb|ACS60228.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 546 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 7/54 (12%), Positives = 19/54 (35%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 K+L + G I + + + +++ + Y + + A A + Sbjct: 4 TLIKRLHRDERGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAAL 57 >gi|114683021|ref|XP_001148451.1| PREDICTED: collagen alpha-1(XX) chain [Pan troglodytes] Length = 1284 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 56/178 (31%), Gaps = 23/178 (12%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I + D V++G T ++ ++ ++ + G+T Sbjct: 200 DFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLSTKEQVLAAVRRLRYKGGNTF 257 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A+ ++ EA K ++L+TDG+ +QD+ K G+ Sbjct: 258 TGLALTHVL------GQNLQPVAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGV 308 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + + L ASP +S + F + + R+ Sbjct: 309 NVFAVGV------KNADEAELRLLASPPRDITVHSVLD----FLQL--GTLAGLLSRL 354 >gi|60551291|gb|AAH91051.1| Clca1 protein [Xenopus (Silurana) tropicalis] Length = 937 Score = 40.0 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 54/186 (29%), Gaps = 27/186 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + A + I ++ + +G F S V Sbjct: 313 DVSGSMGGGNRIGRLYQAAEVFVMQIVEMGSY-----VGIVQFESTASVRSSLLQIVDDT 367 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R +K+ G T I ++ ++ + +VLLTDGE+ Sbjct: 368 QRNRLKSLLPK-TATGGTNICAGIREGIKV---------NKKYDGSSYSTELVLLTDGED 417 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STH 435 ++C + G I IA N ++ + + + F A Sbjct: 418 NY----ATSLCFPDVTNSGSIIHVIALGPNAAKELETIVDM----TGGLRFLATDKVDAQ 469 Query: 436 ELNKIF 441 L F Sbjct: 470 GLIDAF 475 >gi|294460157|gb|ADE75661.1| unknown [Picea sitchensis] Length = 350 Score = 40.0 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 15/57 (26%), Gaps = 3/57 (5%) Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 I + T F + S +F + + F IG + + Sbjct: 3 IPVHTFGFGTDHDSASMHSISE---TSGGTFSFIETESIIQDAFAQCIGGLLSVVIQ 56 >gi|327270788|ref|XP_003220170.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 930 Score = 40.0 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 50/150 (33%), Gaps = 23/150 (15%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G FN + R + + G + I + + TA+ Sbjct: 348 WVGIVTFNSKANIQAGLQKVFSDKERESLTSHLPT-TASGDSDICEGVTTAFQVFS---- 402 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEK 414 K IVLLT+GE ++ C +K +SQ I I TIAF + + + Sbjct: 403 -----RKLTSTEGCEIVLLTNGE-----GLDLSPCLSKIQSQEIIIHTIAFGSKASNELE 452 Query: 415 ARYFLSNCASPNSFFEANSTHE---LNKIF 441 + + F A + + L F Sbjct: 453 KLADM----TGGKTFYATDSLDSNGLIDAF 478 >gi|126733210|ref|ZP_01748957.1| hypothetical protein RCCS2_03624 [Roseobacter sp. CCS2] gi|126716076|gb|EBA12940.1| hypothetical protein RCCS2_03624 [Roseobacter sp. CCS2] Length = 188 Score = 40.0 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 55/185 (29%), Gaps = 5/185 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + ++ +K +G I L+ PV G M + S + L++ + V Sbjct: 2 KHFLQRFLKDQSGTSSIEIVLVFPVFFGFFLMTYEAGILSSRQVMLERGLDITVRKVRVG 61 Query: 68 LIQSLEEVSSRAKNSFTF--PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 I ++ R + + + + + ++++ V+ Sbjct: 62 AIDKADQDELRTELRKSICDNAGILPDCETQLEVEMIQRDPRAVWNPISTDIQCVDRGDI 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGI---KSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + L++ + L L +R + A +T + + + V Sbjct: 122 NAPSTSSLANTANNELLFLRACIRIDPFLASSTLGKTLVAGNDTAAGDSYALIATAAFVV 181 Query: 183 IDFSR 187 F Sbjct: 182 EPFRA 186 >gi|271962487|ref|YP_003336683.1| cellulose 1,4-beta-cellobiosidase [Streptosporangium roseum DSM 43021] gi|270505662|gb|ACZ83940.1| cellulose 1,4-beta-cellobiosidase [Streptosporangium roseum DSM 43021] Length = 979 Score = 40.0 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 28/340 (8%), Positives = 68/340 (20%), Gaps = 12/340 (3%) Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+VP + + + N + V + Sbjct: 211 AAVPAGDYSLTAKATDDKGSATTSAPVGISVQANSAPAVLLTPATLAVPEGGGADLSVKL 270 Query: 124 PRKSAYQVVLSSRYDLLLNPLSL-FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 + + V +++ L++ S+ T + + S ++ Sbjct: 271 SKAPSANVTVTTARTSGDADLTVASGGSLTFTPANWNTAQTVRVAAAEDTDQTSGSAEFT 330 Query: 183 IDFSRSMLDYQRDS---EGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + P T ++ G + PY Sbjct: 331 STATGHTPAKATATEVDNDTPGGDNEYVKRFTTMYNKLKDPANGYFSPQGVPYHSVETFM 390 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK----KHLVRDALASVIRSIKKIDNVNDTV 295 + + + ++ V +I + + Sbjct: 391 VEAPDHGHETTSEAYSYYLWLEAAYGKVTGDWSRFNDAWASMEKYIIPATADQPTNSFYN 450 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 + + + + A + T M + N Sbjct: 451 PSKPATYAGEWDDIKQYPSKLDGGVSVGSDPIANELKTAYGTNDVYGMHW---LLDVDNT 507 Query: 356 DEVHRMKNNLEAKKYIVLLTDG-ENTQDNEEGIAICNKAK 394 R + YI G E + C+ K Sbjct: 508 YGFGRCGDGTTKPAYINTYQRGPEESVFETIPQPSCDTFK 547 >gi|218129580|ref|ZP_03458384.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] gi|217988310|gb|EEC54633.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] Length = 212 Score = 40.0 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 38/112 (33%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++I +++ +T + FN + Sbjct: 21 IEAVKNGVQTLISTLRGDPYALETAYISIITFNSVAQQVTPLTELSAFQQPQ-------- 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G TA+ +A+ + + +K + + + L+TDGE T D Sbjct: 73 IEASGCTALGEALTLLAQKVDTEIVKTTQEVKGDWKP--LVFLMTDGEPTDD 122 >gi|126306104|ref|XP_001362407.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 870 Score = 40.0 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 63/205 (30%), Gaps = 34/205 (16%) Query: 259 VDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHK 318 +D+S V + + +R A + I I+ + T F+ Sbjct: 311 IDTSRSMKVGNRLNRLRQASQFFLLQI--IEKGSWTGI---VTFDSSATIQSELIQIESD 365 Query: 319 LIRTIVKTFA-IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + R + + G I ++TA+ + K L +VLLTDG Sbjct: 366 VQRKTLISRLPNVTVAGGGAHICSGLRTAFMVV----------KKKFLTDGSEMVLLTDG 415 Query: 378 ENTQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 E+ N C + K G I TI + + + + + A + Sbjct: 416 EDNTTNT-----CFEEVKQSGAIIHTIVLGPSTEKGLEKLSEM----TGGMKTTATDNVQ 466 Query: 437 ---LNKIFRDRIGNEIFERVIRITK 458 L F + + IT+ Sbjct: 467 NNGLIDAF-----SALSSGNAAITQ 486 >gi|157412050|ref|YP_001481390.1| hypothetical protein APECO1_O1R37 [Escherichia coli APEC O1] gi|99867075|gb|ABF67720.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 670 Score = 40.0 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 30/92 (32%), Gaps = 19/92 (20%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A+ A D ++ S D +K I L+TDG + + KAK Sbjct: 577 YTPTGSALMAAVDLLLDSQFD-----------RKIIFLITDGYPNKSEFTIGEVMEKAKC 625 Query: 396 QGIRIM--------TIAFSVNKTQQEKARYFL 419 GI I+ I F + L Sbjct: 626 NGIEIVGVGIKTDEIIGFETDTFVTVDDTSLL 657 >gi|309364362|emb|CAP25016.2| CBR-CLEC-63 protein [Caenorhabditis briggsae AF16] Length = 415 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 50/150 (33%), Gaps = 8/150 (5%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH-KLIRTIVKTFAIDENEMGSTAINDAM 343 I + R+G +N + + + + V + + + + + Sbjct: 96 IGTQYTDPRSTRLGLVTYNGQSTVVADLNAIQSIDQLYSTVFSALSSVSTSDDSYLARGI 155 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 A + K+ +V+ + + I + ++ K+ G+ + T+ Sbjct: 156 GAAEMVLQQG-----RNNGIRSNYKRLVVVYASAYKGEGELDPIPVADRLKASGVVVSTV 210 Query: 404 AFSVNKTQQEKARYFLSNCASPNSFFEANS 433 AF + + A L+N AS N + + Sbjct: 211 AFDQDGDEALLA--GLTNIASTNYAYTSKD 238 >gi|209551753|ref|YP_002283670.1| hypothetical protein Rleg2_4182 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537509|gb|ACI57444.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 193 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 40/145 (27%), Gaps = 5/145 (3%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 T+ + ++L + G I A+L PV++ + ++ +AA T Sbjct: 8 TRLVLTVRRLAQDRKGAGAIEFAILFPVLIMLYIGAFEITIGLSVSKRATRAAGTVADVV 67 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + ++ + + L T + + + Sbjct: 68 TQQQSVTKSALAQMPSVANSIFVPYNTTSLTLKI-----TGITIDAGANAKVLWSWAQDG 122 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLR 149 + S + + FL Sbjct: 123 TVPYAKNTAVSNVPSDMKTANSFLV 147 >gi|149066378|gb|EDM16251.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1127 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + G+T A+ ++ + K +L+TDG Sbjct: 217 NTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILITDG 270 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 271 K-SQDDIIPPS--RNLREAGVELFAIGV------KNADLSELQEIASEPDSTHVYNVAE 320 >gi|170743328|ref|YP_001771983.1| TadE family protein [Methylobacterium sp. 4-46] gi|168197602|gb|ACA19549.1| TadE family protein [Methylobacterium sp. 4-46] Length = 240 Score = 40.0 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 24/229 (10%), Positives = 61/229 (26%), Gaps = 14/229 (6%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L G I A +PV+L V + + V + + ++ + + Q + Sbjct: 14 RLWGDAAGVAAIEFAAALPVLLVVMAVGLQVALYVNAK--------RSVERLARTISQMI 65 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + A + + D + I + + +N A+ V Sbjct: 66 SQAVPPAGAATATVNAADIRFGFDAAIVLFPYVLADAARQGIPWQSNIAINAAGIAFTKV 125 Query: 133 LSSRYDLLLNPLSLFLRSM----GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 S D + G + A+ + + ++ + S Sbjct: 126 ASGCSDPTDQSACYVANVVWTSSGTGGASYRPCLVAQQPAGNAAPPSPTTLPRSVFGPAS 185 Query: 189 MLDYQRDSEGQPL--NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 ++ +P + A Y + + + ++ Sbjct: 186 LVVVDVVFTFRPTFGATYVPSARIAHSVYVQPRYAALVSYDPTNTDGIA 234 >gi|326920703|ref|XP_003206608.1| PREDICTED: cochlin-like, partial [Meleagris gallopavo] Length = 760 Score = 40.0 bits (91), Expect = 0.93, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 42/123 (34%), Gaps = 17/123 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + +D Sbjct: 622 TYDQRTEFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFGPVKDGA------ 675 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V+LTDG+ + D+ G A+ A+ GI + ++ L + AS Sbjct: 676 --NKNFLVILTDGQ-SYDDVRGPAV--AAQKAGITVFSVGV------AWAPLDDLKDMAS 724 Query: 425 PNS 427 Sbjct: 725 EPR 727 >gi|209516840|ref|ZP_03265690.1| von Willebrand factor type A [Burkholderia sp. H160] gi|209502656|gb|EEA02662.1| von Willebrand factor type A [Burkholderia sp. H160] Length = 328 Score = 40.0 bits (91), Expect = 0.93, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 43/175 (24%), Gaps = 27/175 (15%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F + F++ + + T T ++ M A Sbjct: 130 NDRLALMMFGTNPVLAMPFTYNHRVIEAAVAATAIGRGMPD--TELDRGMLAAIAQF--- 184 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ IVL++DG D I + I + I + + Sbjct: 185 -------DGRLSSGRRAIVLVSDGGALLDEPMQHRIEAGLRRDQIALYFIYLRSSIYSPD 237 Query: 414 KARYFLSNCASPN--------------SFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ AS F+A + + I + Sbjct: 238 LNATLPASEASAEAQLHRFFLTLRTPYRLFQAEDPKAMMAAIAE-INRQQNASTT 291 >gi|115687249|ref|XP_792282.2| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 2422 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 24/74 (32%), Gaps = 12/74 (16%) Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T A + A + + + + LLTDG + N +A+ Sbjct: 282 YVGGGTYTKGAFELAKKVLQ----------GARANSTQAVFLLTDGLSNGPNPVPVAV-- 329 Query: 392 KAKSQGIRIMTIAF 405 K G+ + + Sbjct: 330 SLKDDGVEVFSFGI 343 >gi|92113789|ref|YP_573717.1| TadE-like protein [Chromohalobacter salexigens DSM 3043] gi|91796879|gb|ABE59018.1| TadE-like protein [Chromohalobacter salexigens DSM 3043] Length = 153 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 34/114 (29%), Gaps = 1/114 (0%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV- 66 + K +G I A++ PV V L+ + L+ A + + Sbjct: 5 THLKRARRKRQSGVAAIEFAIVFPVFFLVLYALIGYAFVFLIQSGLQNLAGETVRQVATI 64 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ R + +E+ + F+ +L + + V Sbjct: 65 STTPVEDDNDQREALLEERVTRLLEDSWLARFDTSLLSSERCPGAPTPTPERIV 118 >gi|17231852|ref|NP_488400.1| hypothetical protein alr4360 [Nostoc sp. PCC 7120] gi|17133496|dbj|BAB76059.1| alr4360 [Nostoc sp. PCC 7120] Length = 427 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 45/166 (27%), Gaps = 35/166 (21%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 +V +A+ ++ ++ D R+ F + + +I Sbjct: 61 MVVEAVEKLLDRLQPGD------RISVVAFAGSATVIIP--NQIVENPESIKTQIRKKLQ 112 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ----------- 381 G T I + +Q ++ V + LLTDG Sbjct: 113 ASGGTVIAEGLQQGITELMKGTRGAVSQA----------FLLTDGHGEDSLKIWKWEIGP 162 Query: 382 -DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 D+ + KA + I T+ F + L A Sbjct: 163 DDSRRCLEFAKKAAKINLTINTLGFG-----NNWNQDLLETIADAG 203 >gi|302381516|ref|YP_003817339.1| hypothetical protein Bresu_0401 [Brevundimonas subvibrioides ATCC 15264] gi|302192144|gb|ADK99715.1| Protein of unknown function DUF2134, membrane [Brevundimonas subvibrioides ATCC 15264] Length = 543 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 21/289 (7%), Positives = 69/289 (23%), Gaps = 19/289 (6%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK--------------QA 56 + G ++ A ++ + + +DV + L+ Sbjct: 4 PSGFRQDTRGGIAVMAAAAGSLICALAAISIDVASVALSGRTLQGVADLAAMSAAANLDV 63 Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A A ++ + R N V Sbjct: 64 ADQAAQATVSA----NLRAATVKTELGAYVADPDIAPSGRFSVQADSANAARVTVTATAP 119 Query: 117 DTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 ++ K V S+ + P + S+G + + + Sbjct: 120 LFFGKIVIGKDEVVQVRSAVAAARIEPPR-AMFSIGSRLAAVDGGVANAVIGGLTGSSVS 178 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 +++ + + ++ + D ++ + ++ + Sbjct: 179 LTVMDYDALASARINLLGFFDALATETGVTVGDYETLVNTTVDAGRIVKVLESLSTGQDA 238 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + +++ + R + + AL V+ ++ Sbjct: 239 AALRKLRTPAANVRVRIGDVIGIEADAGRGLAEGLDASVTALDLVMATL 287 >gi|269795767|ref|YP_003315222.1| hypothetical protein Sked_24760 [Sanguibacter keddieii DSM 10542] gi|269097952|gb|ACZ22388.1| conserved repeat protein [Sanguibacter keddieii DSM 10542] Length = 1516 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 38/369 (10%), Positives = 90/369 (24%), Gaps = 24/369 (6%) Query: 100 NNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQ 159 + + D + P + Q V++ + + ++ G+ L Sbjct: 155 TSPAARAAAPAEETLADDVVPAVVPDPTTTQSVVTVKVGGIRTSVTQVSPLAGVTLQLFD 214 Query: 160 TKAEAETV-----SRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKS 214 A T + + +V+ + + + Sbjct: 215 GGAGGATTPRTEPWATCVSDASGDCSFVVPETGDGGANRDARFWVRQAGAAPGGYFASPT 274 Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 + + + + + S Sbjct: 275 LAIGGDTTSQTYQFRTGDRLRGGLTYTSTADFMVSSGRGFTASGGIWQSSLANPGLPPQC 334 Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 +A V+ + +R A+ F D + PS T A Sbjct: 335 GLRVALVVDLSGSVGRYIAAMRSAASGFVDSLTGTPSSVALFTFADNAPAATGANLGLTP 394 Query: 335 GSTAINDA-----MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE----- 384 ST + T ++ + ++++ + V+LTDG T Sbjct: 395 VSTTAGADTVKNRIATYTAGFDTNWDRGLYQVAASATPFDVAVVLTDGNPTVYAAHEGPG 454 Query: 385 ---------EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 GI N K++G R++ + +S + + +++ + Sbjct: 455 DLTRFREVENGIFSANAVKAEGTRVIAVGVGDGIGGAPDNLRAISGPSGGSDYYQTDDYA 514 Query: 436 ELNKIFRDR 444 E + R Sbjct: 515 EAGEALRAL 523 >gi|120553803|ref|YP_958154.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Marinobacter aquaeolei VT8] gi|120323652|gb|ABM17967.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter aquaeolei VT8] Length = 1056 Score = 40.0 bits (91), Expect = 0.94, Method: Composition-based stats. Identities = 38/351 (10%), Positives = 88/351 (25%), Gaps = 56/351 (15%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 D + + + + + T + + ++ + D + Sbjct: 50 DNSGSMATAMDVVVVKEPYDPSTSYSGRADKDRVYYKDDGDWYYINKSTIQCSDLIDRID 109 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + A + +S+ S N + + + + + + Sbjct: 110 AVGELKSYRMAYKRGRSWRSFNENNNASRDVDCEADNDVSGVNWSNITAHTYVSG-NYRN 168 Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 ++ SS+ + +++D + ++ V +G FN + Sbjct: 169 WLSSSTETVRKTRMEIMQDVAKRLADTVTG-------VNIGLMAFNQSQNGEGGRVLNNV 221 Query: 318 KL---IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL----EAKKY 370 + K G T +++ + A + K Sbjct: 222 SNVKDNASAFKAKVDGLYPSGQTPLSETLFGAMRYFQGGKPFLDRNPVSGTVDGSNNYKS 281 Query: 371 ----------IVLLTDGENTQDNEE----------------GIAICNKAK---------- 394 ++LLTDG T D I Sbjct: 282 PIELECQANNVILLTDGAPTSDTNHNSFIGSAIGKTCSGNCLDEIAGYMATNDMSAAFSG 341 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 Q I+ T+ FS++ A ++ AN+ +L F D + Sbjct: 342 DQTIKTYTVGFSIDDPLLGAAATA-----GGGEYYVANNAQQLADAFDDIL 387 >gi|327270790|ref|XP_003220171.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 866 Score = 39.6 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 47/149 (31%), Gaps = 21/149 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G FN + + R + + G I + + A+ Sbjct: 351 WVGIVTFNSKGNIQAGLQKIFSDIERQSLTSHLPT-TAAGDCNICEGVNAAFQVFSQ--- 406 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K IVLLT+GE + + K +SQ I I TIAF + + + Sbjct: 407 ------KLTSTEGCEIVLLTNGEGSDLSPCLS----KIQSQEIIIHTIAFGSKASNELEK 456 Query: 416 RYFLSNCASPNSFFEANSTHE---LNKIF 441 + + F A + + L F Sbjct: 457 LADM----TGGKTFYATDSLDSNGLIDAF 481 >gi|162451937|ref|YP_001614304.1| hypothetical protein sce3664 [Sorangium cellulosum 'So ce 56'] gi|161162519|emb|CAN93824.1| hypothetical protein sce3664 [Sorangium cellulosum 'So ce 56'] Length = 641 Score = 39.6 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 42/144 (29%), Gaps = 27/144 (18%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK--YIVLLTDGENT 380 +++ F G T + A+ A + ++ +V LTDG + Sbjct: 310 LMQAFINFPTPNGETPMYPALDGAT---------TWANNYKDAHPEEEVAVVFLTDGVPS 360 Query: 381 QDNEEGIAICNKAKSQ----GIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST 434 N AI N A++ G+R+ + F ++ A + ++ Sbjct: 361 SCNTVPNAIGNLARNAFVNHGVRVHAVGFG------NSNAELINLIADQGGGRAYNLSAG 414 Query: 435 HEL----NKIFRDRIGNEIFERVI 454 L G V Sbjct: 415 STLEGSVLDALVSIRGEARSCDVA 438 >gi|221066732|ref|ZP_03542837.1| TadE family protein [Comamonas testosteroni KF-1] gi|220711755|gb|EED67123.1| TadE family protein [Comamonas testosteroni KF-1] Length = 156 Score = 39.6 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 10/59 (16%), Positives = 23/59 (38%), Gaps = 1/59 (1%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY-EHALKQAAQTA 60 +F + L ++ TG + ALL+ ++L + + + ++ A A Sbjct: 1 MYHRFSLFYPSLRRTQTGVAAVEFALLVSILLTIFFGIFVYWHALQAQQSVVRAAGDGA 59 >gi|152999662|ref|YP_001365343.1| type IV pilin biogenesis protein [Shewanella baltica OS185] gi|151364280|gb|ABS07280.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS185] Length = 1169 Score = 39.6 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 42/400 (10%), Positives = 88/400 (22%), Gaps = 17/400 (4%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + +L E+ F E N +DI + Sbjct: 106 ASLNGCSQSKTALAELGRFTGYIREFTASGTTGIWKELPETNGTAISVVDCWQDISKT-- 163 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 N + + + A A + + V++ Sbjct: 164 DPNNSSSYSNGFPANGLTTGSGKKKKAYPYFTSSSPGTSWADALAAANNTDFGVGQPVTL 223 Query: 180 QWV-------IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + + + T + + Sbjct: 224 YTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDFGLAVFNYNYPNEGNRD 283 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + M + ++ + ++ R ++ D Sbjct: 284 GGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRYFAGKGVKY-GHGDTDY 342 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ V S+ + D N+ + + Sbjct: 343 GSYVGNKPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDKNANNDVISLTA--SG 400 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 S E NL+ Y+ L D + N + Q +R TI FS Sbjct: 401 SKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQVQNVRTYTIGFSKGAEDA 460 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 L+ A +F A ++ EL D + N + Sbjct: 461 AP---LLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|312133570|ref|YP_004000909.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311772822|gb|ADQ02310.1| Hypothetical protein BBMN68_1309 [Bifidobacterium longum subsp. longum BBMN68] Length = 362 Score = 39.6 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 52/373 (13%), Positives = 105/373 (28%), Gaps = 23/373 (6%) Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + T N + + + +V ++ + Sbjct: 3 ALTGCTAFNNSDDGTADGNGTSATTQTFQPSGGKPTATLSIASGSENKEVAVAIQKAADQ 62 Query: 142 NPLSLFLRSMGIKSWLIQTKAEA---ETVSRSYHKEHGVSIQWVIDF---SRSMLDYQRD 195 + +++ + MG + KA + V + + I S S Sbjct: 63 SNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMWISMGDTKHIVKDAASTSTTPIVFG 122 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK--SLYYMLYPGPLDPSL 253 G D S + + + ++ +Y L Sbjct: 123 IAKSKAVKLGWADDTGATKPVSTADILAAVSDGKLTFSMTSATVIDSALNVYQTALRKPS 182 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 VD S K +V+ A++ + G ++ Sbjct: 183 WTIWVVDYSGSMSGEGKNGVVKGLNAALDP--DQAKKSYIEPASGGVNILIPFETEAHRP 240 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + A + G T I + + +A D + S +E + IVL Sbjct: 241 VKATGTSTSDLLHEADATDASGGTDIYEVLLSALDELPSESEASQYTT--------AIVL 292 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 +TDG + D+++ K++ + + I +I F Q K+ L S F+ S Sbjct: 293 MTDGRSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATL----SNAKVFDGRS 348 Query: 434 THELNKIFRDRIG 446 +L +FR G Sbjct: 349 G-DLAAVFRQAKG 360 >gi|260800505|ref|XP_002595170.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] gi|229280413|gb|EEN51181.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] Length = 183 Score = 39.6 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 40/138 (28%), Gaps = 13/138 (9%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F F+ + I I + + G TA A+ Sbjct: 49 FQYSNFVVQEFALDTYGSIGEINQAVDAVMYQGGGTATGLALYEMRQ------YGFSFAN 102 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 ++ +LLTDG ++ A GI + + N E L+ Sbjct: 103 GGRPGTRRVAILLTDGMSSD---TVDKHAMAAWQAGISLYVVGIGSNVDMNE----LLAI 155 Query: 422 CASPNSFFEANSTHELNK 439 +P++ F + +L Sbjct: 156 GGTPDNVFSLGNFGQLQD 173 >gi|3273263|dbj|BAA31175.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 39.6 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 278 LASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR---TIVKTFAIDENEM 334 +I+ + + + F++ K I+++ Sbjct: 72 AMKLIQQLNL---NESAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPY 128 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + DA+ + N A + +V+LTDG + K Sbjct: 129 GRTNLTDALLQVRKHLNDRI--------NRENANQLVVILTDGIPDSIQDSLKES-RKLN 179 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +G++I Sbjct: 180 DRGVKIAVFGIGQGINVA 197 >gi|313831092|gb|EFS68806.1| TadE-like protein [Propionibacterium acnes HL007PA1] gi|314932293|gb|EFS96124.1| TadE-like protein [Propionibacterium acnes HL067PA1] Length = 133 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 15/125 (12%), Positives = 35/125 (28%), Gaps = 2/125 (1%) Query: 14 LIKSCTG--HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G + L++PV+L + G++ D R A + AA A + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISATQAAQDAAVAASRAGTDAAATP 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 A + +Q + + T + + + Sbjct: 61 QLAGHDGAAVAVQAARQALSAAGVDGSVQEDGSTITVTTSQSRPTVFLSAIGISQVTGHG 120 Query: 132 VLSSR 136 ++ Sbjct: 121 QAHAQ 125 >gi|296214738|ref|XP_002807270.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Callithrix jacchus] Length = 594 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 39/132 (29%), Gaps = 20/132 (15%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + ++ G TA DA+ + K ++V++TD Sbjct: 468 YNTKENVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRESPNKNFLVIVTD 519 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS---PNSFFEANS 433 G++ D + A GI I ++ L + AS + F Sbjct: 520 GQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMASKPKESHAFFTRE 570 Query: 434 THELNKIFRDRI 445 L I D I Sbjct: 571 FTGLEPIVSDVI 582 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 35/103 (33%), Gaps = 6/103 (5%) Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFS--VNKT 410 + N K L +N N + K + + IAFS V+ + Sbjct: 529 HNNWSAATGNTYVSKVGKLYNTPDNYLKNSSPQSR-KAIKDMVTKKRVYIIAFSRSVSTS 587 Query: 411 QQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 S F ANS +L+KIF++ IG I + Sbjct: 588 GLNAIADEFSIKGDSKRVFRANSGEDLSKIFQE-IGESINNDL 629 >gi|238062108|ref|ZP_04606817.1| hypothetical protein MCAG_03074 [Micromonospora sp. ATCC 39149] gi|237883919|gb|EEP72747.1| hypothetical protein MCAG_03074 [Micromonospora sp. ATCC 39149] Length = 150 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 14/114 (12%), Positives = 29/114 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + + G + A M +L + G+ D A A T + ++ Sbjct: 11 RRWRGDEGRVSLFLAAAMIGVLAIIGLAFDGAGQLRSRQLADNLAAEAARTGGQSIDRAK 70 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + + YL + F ++R R Sbjct: 71 AIEGGSKEIDEAAARTAVTSYLAAAGATDHDVEFPVVGGEKLIRVRVTVTYDRY 124 >gi|290543406|ref|NP_001166514.1| cochlin [Cavia porcellus] gi|195970365|gb|ACG60666.1| coagulation factor C [Cavia porcellus] Length = 553 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 415 TYDQRTEFSFTDYSTKENVLAVIRSIRYMSGGTATGDAISFTVRNVFGPVRDSP------ 468 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 469 --NKNFLVIITDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 517 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 518 KPKESHAFFTREFTGLEPIVSDII 541 >gi|121534454|ref|ZP_01666277.1| hypothetical protein TcarDRAFT_1555 [Thermosinus carboxydivorans Nor1] gi|121306947|gb|EAX47866.1| hypothetical protein TcarDRAFT_1555 [Thermosinus carboxydivorans Nor1] Length = 366 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 18/265 (6%), Positives = 59/265 (22%), Gaps = 5/265 (1%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ +K G I+ ++M ++ +G V + A I A Sbjct: 2 RTQLRRYLKGQKGSAAILAIVVMLILAVLGAAYVGLGMTETSTA----ANFRDGIAAQYL 57 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 ++ N+ + + + + ++N + Sbjct: 58 AEAGVKRALIELYNNSAWDPGAGGLTEKQGNGSFTVIVRPGETSSQKKVVSTGQVNRARR 117 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + ++ + + + G + + + E Sbjct: 118 QVVLDVNLPSNSGNGSVYDYSVFAGSNMVVHNKARVIGAIHSNNGMELKNGSYISGKVEV 177 Query: 188 SMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG 247 + + ++++ + K G + + + N Y Sbjct: 178 HGAFTDHGATLGTPPAITNAPEIPFPAFNAADYKQGAQVLPWGRTLSAGNYRGKYYYMGS 237 Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKH 272 L + + Sbjct: 238 QLTIDSGW-GSIQGDATIFATGDID 261 >gi|117620923|ref|YP_857205.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562330|gb|ABK39278.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 4260 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 31/361 (8%), Positives = 81/361 (22%), Gaps = 21/361 (5%) Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 I + + ++ + + + + + Sbjct: 3020 AVVDGGITLTTQDAQTIGSASDTATGSFAAAFLAAAVPSYGADGPGTTTVSGYSLSVTDS 3079 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWV 182 N ++ + ++ S + S + H V Sbjct: 3080 NSGLTSNGLAITLTKVGSDIVGSTTAGEVFRISVASNGTVTLTQSAELDHLPEDVDNSND 3139 Query: 183 ---IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 I + + + + + + + K + +++ Sbjct: 3140 NNLISLANGKVLLSATVTVVDGDNDTATGTVSADLGGNIRFEDDVPTAKDNSVVITEAGL 3199 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + L +S + L +DAL +I S V +G Sbjct: 3200 PPFNLVMVIDTSGSMLWQI--GTSTNGSPNRLELAKDALNHMIDS---------YVALGV 3248 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-MGSTAINDAMQTAYDTIISSNEDEV 358 + + K G T N + A + + + + Sbjct: 3249 PLVFTVIDFASGAVLIPQTSDPDVAKASISGLPTDGGGTNYNAPLVLAQNQLTADLANPA 3308 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + L+DG + N + S + + + +V+ A+ Sbjct: 3309 LAGYETK-----VYFLSDGAPNEGN-VPAGWTSFVNSNNVEVYAVGLNVSGNATAIAQLG 3362 Query: 419 L 419 L Sbjct: 3363 L 3363 >gi|187479012|ref|YP_787036.1| hypothetical protein BAV2525A [Bordetella avium 197N] gi|115423598|emb|CAJ50136.1| putative exported protein (partial) [Bordetella avium 197N] Length = 116 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 1/77 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + AL++ + + + G + Y + L++ A +TA + S + Sbjct: 2 RQRGSILVPAALIILIGMVLLGGA-QFGYFYYVKRELQKTADLVALTAVQVIDGSAASCA 60 Query: 77 SRAKNSFTFPKQKIEEY 93 + KQ + Y Sbjct: 61 FADLTAKANVKQNLSAY 77 >gi|34980918|gb|AAH57200.1| Itgax protein [Mus musculus] Length = 304 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 14/167 (8%) Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +D S + D + +V+ +++ R Sbjct: 138 NFPTAQQECPKQDQDIVFLIDGSGSISS-TDFEKMLDFVKAVMSQLQRPST-----RFSL 191 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 F+D +F+ + + G T A++ + ++ Sbjct: 192 MQFSDYFRVHFTFNNFISTSSPLSLLDSVRQLR--GYTYTASAIKHVITELFTTQSGARQ 249 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 +A K ++++TDG DN ++ A++ I I Sbjct: 250 ------DATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIRYAIGVG 290 >gi|32477847|ref|NP_870841.1| hypothetical protein RB13063 [Rhodopirellula baltica SH 1] gi|32448404|emb|CAD77919.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 362 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 28/81 (34%), Gaps = 12/81 (14%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + ++ + +G + L+ V+ + G + W Y ++QAA Sbjct: 1 MMRSR--------RHNRSGQALVEFGLVALVLYMLVGGAITFGIWIYAAGQIQQAANV-- 50 Query: 62 ITASVPLIQSLEEVSSRAKNS 82 + L Q+ +N+ Sbjct: 51 --GARELSQTPLPFDETFENA 69 >gi|116621434|ref|YP_823590.1| hypothetical protein Acid_2316 [Candidatus Solibacter usitatus Ellin6076] gi|116224596|gb|ABJ83305.1| conserved hypothetical protein [Candidatus Solibacter usitatus Ellin6076] Length = 321 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 44/121 (36%), Gaps = 19/121 (15%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 TA DA+ ++ + + + +V+ +DGE+ + A++ Sbjct: 171 GTAFYDAIYYGSTQMMQNVDRGR----------RALVVFSDGEDNSSAHHMMETVEAAQA 220 Query: 396 QGIRIMTIAFSVNKTQQEKARY-----FLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 + + TI ++ + AR + A + F+ L + FR IG + Sbjct: 221 NDVLLFTIRYTEINNGRLNARNKYGMGVMERIARETGGLDFDGQ-GKGLAEGFRQ-IGEQ 278 Query: 449 I 449 + Sbjct: 279 L 279 >gi|290997816|ref|XP_002681477.1| predicted protein [Naegleria gruberi] gi|284095101|gb|EFC48733.1| predicted protein [Naegleria gruberi] Length = 452 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 51/174 (29%), Gaps = 23/174 (13%) Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + G S S +D SS + A SVI + + D V FN Sbjct: 69 WFMGTASTSKSIIIILDVSSSMGAYHRLENAIYATRSVINYLTEKDYVG------IVLFN 122 Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAI---DENEMGSTAINDAMQTAYDTIISSNEDEVHR 360 + + + K KT + GST A ++ + Sbjct: 123 AGAFTCKKQTEFLLKATAQNKKTLIDCIENMMPFGSTNFEAAFNETFNL------FDRSE 176 Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI----RIMTIAFSVNKT 410 + + ++ LTDG T+ I + + + I +I + Sbjct: 177 EIASSTCDRVVLFLTDGTITKG-ANPIPL---IRKRNIEYQAKIFGFSLGSVAD 226 >gi|304320376|ref|YP_003854019.1| hypothetical protein PB2503_04012 [Parvularcula bermudensis HTCC2503] gi|303299278|gb|ADM08877.1| hypothetical protein PB2503_04012 [Parvularcula bermudensis HTCC2503] Length = 143 Score = 39.6 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 10/114 (8%), Positives = 28/114 (24%), Gaps = 3/114 (2%) Query: 6 KFIFYSKK---LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + +SK+ + G + AL+ PV + + + + A+ A Sbjct: 2 RARRHSKRSFSFFGNQRGSAAVEFALVCPVFILLMTGVFSGGMVFHVRETNYLYAREAAR 61 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 ++ E + + + + + Sbjct: 62 GLALGYFNETEAKDYAEDQAEDALGIDVTVSVDPATVGDPTDQNVVVSISVTKN 115 >gi|323702227|ref|ZP_08113893.1| hypothetical protein DesniDRAFT_1105 [Desulfotomaculum nigrificans DSM 574] gi|323532717|gb|EGB22590.1| hypothetical protein DesniDRAFT_1105 [Desulfotomaculum nigrificans DSM 574] Length = 207 Score = 39.6 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 52/164 (31%), Gaps = 15/164 (9%) Query: 13 KLIKSCTGHFFIITAL--LMPVMLGVGGM-LVDVV---RWSYYEHALKQAAQTAIITASV 66 +LIK G I+ + ++ M+ V + + + + + +A + A+ Sbjct: 37 RLIKDQRGFISILVLMCGMLIFMVIVPYITGIYYGVKHKGMQTKAWMNEALE---FAAAA 93 Query: 67 PLIQSLEEVSSRA-KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + ++S + + + + L + N ++ T E+ Sbjct: 94 SNMNGDLDISQLDPDMAKQYFSIALADILDGSLSNETITPLNTDYYPGPIKITDFEIVSP 153 Query: 126 KSAYQVVLSSR-----YDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 +++R + + L L +G K+ A Sbjct: 154 GETLPNGITARQPGFLATIEVPLLDADLPWVGHKTISFPITNFA 197 >gi|170726477|ref|YP_001760503.1| cell wall anchor domain-containing protein [Shewanella woodyi ATCC 51908] gi|169811824|gb|ACA86408.1| LPXTG-motif cell wall anchor domain protein [Shewanella woodyi ATCC 51908] Length = 739 Score = 39.6 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 38/341 (11%), Positives = 82/341 (24%), Gaps = 26/341 (7%) Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLF---LRSMGIKSWLIQTKAEAETVSRSYHKEH 175 A N + + + + + L + + + + R Sbjct: 226 ATIENGTAFNDHIGIDAAAKVSIEVELDAGVELGLISSPYHKVSHTQLSGSHYRVSLTGV 285 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 +V+++ + + + + + N K + Sbjct: 286 KADRDFVLNWRPQLDTKPVAAVFSQQGKTHSLSSKAQVEPTDSNASTKADSNKAVEDDYA 345 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 L + S +D+S + AL + +K D Sbjct: 346 LLMLLPPSDQKQDVSISRELILVIDTSGSMSGAS-IAQAKRALNYALAGLKAKDT----- 399 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 FN V S +S + + G T + A+ A + E Sbjct: 400 -FNVIEFNSNVGSLSPYSLPATAKNIGLANQYVRSLKANGGTEMQLALNAA---LDKGTE 455 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 E E + ++ +TDG + I K R+ T+ Sbjct: 456 TEA----LGSERLRQVLFMTDGSVGDEQSLFHLIKQKIGES--RLFTLGIGSAPNSH--- 506 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 F+ A +F E+ + ++ Sbjct: 507 --FMRRAAEFGRGTFTYIGKLDEVQSKIESLLYQIERPQLT 545 >gi|317123684|ref|YP_004097796.1| hypothetical protein Intca_0524 [Intrasporangium calvum DSM 43043] gi|315587772|gb|ADU47069.1| hypothetical protein Intca_0524 [Intrasporangium calvum DSM 43043] Length = 152 Score = 39.6 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 12/94 (12%), Positives = 27/94 (28%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G ++ ++ +L V + + H + AA A++ + L + + Sbjct: 39 ERGAATVLVIGVLVALLVVTTGGLLIASAVVASHRARLAADLAVLAGATALRDASTVERA 98 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREV 111 A E + L+ T Sbjct: 99 CAAARRVATLNGAELGACQARGTELELTVTVAAP 132 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 39.6 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 40/128 (31%), Gaps = 13/128 (10%) Query: 316 VHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 R+ D T A+ A ++ + ++ I++ Sbjct: 110 FRYANRSQAIEKLNDLPYMACKTNTQLALNLAEMIFFNNTLGPLRPG------RRRILIF 163 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF--FEAN 432 TDG++ + + + K +G+ I +A + A+ +S + Sbjct: 164 TDGQSNVKEQMTLYRAFRLKKRGVEIYVVAVGKYLYGMHEIIGL----ATSSSHHLYRVR 219 Query: 433 STHELNKI 440 S + KI Sbjct: 220 SMKDFVKI 227 >gi|260439389|ref|ZP_05793205.1| conserved hypothetical protein [Butyrivibrio crossotus DSM 2876] gi|292808185|gb|EFF67390.1| conserved hypothetical protein [Butyrivibrio crossotus DSM 2876] Length = 292 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 28/242 (11%), Positives = 57/242 (23%), Gaps = 14/242 (5%) Query: 26 TALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 AL++PV L +V ++ + L+Q+ + S + ++ Sbjct: 44 AALVLPVFLAALVAVVFFLQAIQVQSRLQQSLYNQVKKVSGYAYYMNI--ADMSEQVEQI 101 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 + + +Y + N + ++ +K L D+ N Sbjct: 102 MQAEYVKYAVINEIGRDYLENSVITGGSSGIHINFLVDAKKGILDAELDYSMDIPFN--- 158 Query: 146 LFLRSMGIKSWLIQTKAEAET----VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 +G S ++ T S + V + Sbjct: 159 ----LLGFPSIRFSSRLRCHTWIGNTSGDEVQSSDVVYVTANGEVYHLYSDCSYLVSSIK 214 Query: 202 NCFGQPADRTVKSYSSQNGKVG-IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 NC G S + + P S VD Sbjct: 215 NCKGTEIADKRNSSGEKYRPCQLCCKDNEEPVTAFYTDYGNRYHSQSLCSNLHSNIFSVD 274 Query: 261 SS 262 + Sbjct: 275 KN 276 >gi|308238185|ref|NP_001184129.1| chloride channel accessory 1 [Xenopus (Silurana) tropicalis] Length = 933 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 54/186 (29%), Gaps = 27/186 (14%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + A + I ++ + +G F S V Sbjct: 309 DVSGSMGGGNRIGRLYQAAEVFVMQIVEMGSY-----VGIVQFESTASVRSSLLQIVDDT 363 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R +K+ G T I ++ ++ + +VLLTDGE+ Sbjct: 364 QRNRLKSLLPK-TATGGTNICAGIREGIKV---------NKKYDGSSYSTELVLLTDGED 413 Query: 380 TQDNEEGIAIC-NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN---STH 435 ++C + G I IA N ++ + + + F A Sbjct: 414 NY----ATSLCFPDVTNSGSIIHVIALGPNAAKELETIVDM----TGGLRFLATDKVDAQ 465 Query: 436 ELNKIF 441 L F Sbjct: 466 GLIDAF 471 >gi|284053489|ref|ZP_06383699.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 396 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 39/98 (39%), Gaps = 5/98 (5%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--DEVHRMKNNLEAKKYIVLL 374 + + +T + + +T I + A + + + V LE + ++LL Sbjct: 147 NDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSGRLEPRLSVILL 206 Query: 375 TDGENTQDNEEGI--AICNKA-KSQGIRIMTIAFSVNK 409 +DG + Q NE+ + ++ I + T+ + + Sbjct: 207 SDGFHNQPNEQQDFDNLITLLERNNNIIVHTLGYGLTP 244 >gi|256374530|ref|YP_003098190.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918833|gb|ACU34344.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 550 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 63/194 (32%), Gaps = 24/194 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + ++ ++ +V+ + R+G F D + + G + Sbjct: 67 LNNIKTSINAVVGDVVSAS--GGDYRLGLVTFKDSINVVTGLAAGNAGT---VTGYVTNV 121 Query: 331 ENEMGST----AINDAMQTAYDTIISSNEDEVHRMKNNLE---AKKYIVLLTDGENT--- 380 G A ++A++TA ++ + + A+K++VL+TD Sbjct: 122 LAASGGGGEPEASDEALRTAV-SLRPAAGIPQNADFTGPWRSNARKFVVLVTDARPGGFD 180 Query: 381 -----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEAN- 432 D A+ N A + G+++ + + + + N A + + +A Sbjct: 181 DAFTAADQASATAVANSALAAGVKLSAVYVPTSPSMTPTIAPIMQNYATTTSGVYVQAQA 240 Query: 433 STHELNKIFRDRIG 446 R + Sbjct: 241 DGSGTADAIRKALS 254 >gi|308473944|ref|XP_003099195.1| CRE-CLEC-63 protein [Caenorhabditis remanei] gi|308267668|gb|EFP11621.1| CRE-CLEC-63 protein [Caenorhabditis remanei] Length = 414 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 48/152 (31%), Gaps = 8/152 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGSTAIND 341 I + R+G +N + ++V + + + + Sbjct: 93 TRIGNQYTDPRSTRVGLVTYNKVATEVADLNHIQSIDDLYSVVFSTLTSVSSEDDSYLAT 152 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A + V L DG+N I + ++ KS G+ I Sbjct: 153 GIGAAEKVFQNGRNGNVRSNYKRLVLVYASAYKGDGQND-----PIPVSDRLKSSGVVIS 207 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEANS 433 TIAF + + A L+ ASPN F + Sbjct: 208 TIAFDQDGDEALLA--GLAQIASPNYAFTSED 237 >gi|149202861|ref|ZP_01879832.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] gi|149143407|gb|EDM31443.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] Length = 212 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 39/128 (30%), Gaps = 24/128 (18%) Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT-QDNEEGIAI 389 G T I ++ A + + + +VL+TDG T + Sbjct: 80 LMPGGLTPIAASVAAAAEVLEYRTQPG------------IVVLVTDGNETCGGTPCALGA 127 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLS--------NC---ASPNSFFEANSTHELN 438 A+++ + + I F V C A+ F + EL Sbjct: 128 ALTAEARDLTVHVIGFRVVHDPFSWNSPEAQGYDGQTVAKCLADATGGVFVSTETVDELV 187 Query: 439 KIFRDRIG 446 + R+ +G Sbjct: 188 EALRETLG 195 >gi|260177188|gb|ACX33912.1| hypothetical CBS domain-containing protein [uncultured prokaryote AT3] Length = 433 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 34/116 (29%), Gaps = 8/116 (6%) Query: 27 ALLMPVMLGVGGMLV-DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTF 85 +L ++ G + ++ S L+Q A + ++A+ L ++ S Sbjct: 5 GILAFLIFTCGFFALAEMALASSRRAKLQQLADSGDLSAARALQ-------IKSTPSRFI 57 Query: 86 PKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + +N V + + +N KS + + Sbjct: 58 AATQTGLTAASLLAGIFGENALAIHVEHFIEQSLPVLNAAKSEIAITATVTVVTAF 113 >gi|149005573|ref|ZP_01829312.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|168482703|ref|ZP_02707655.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|307126680|ref|YP_003878711.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|147762513|gb|EDK69473.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|154432930|gb|ABS82113.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432938|gb|ABS82120.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432946|gb|ABS82127.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432954|gb|ABS82134.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432970|gb|ABS82148.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|172043667|gb|EDT51713.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|306483742|gb|ADM90611.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|332076908|gb|EGI87370.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17545] gi|332203636|gb|EGJ17703.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA47368] Length = 883 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 29/91 (31%), Gaps = 8/91 (8%) Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---QQEKARYFLSNC-ASPNSFFE 430 TD + G + T+ VN + A F+ + +SP+++ Sbjct: 499 TDWITNHGDPTTWYYNGNMAQDGYDVFTVGVGVNGDPGTDEATATRFMQSISSSPDNYTN 558 Query: 431 ANSTH----ELNKIFRDRIGNEIFERVIRIT 457 ELN+ F + + IT Sbjct: 559 VADPSQILQELNRYFYTIVNEKKSIENGTIT 589 >gi|229578924|ref|YP_002837322.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|284997528|ref|YP_003419295.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|228009638|gb|ACP45400.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|284445423|gb|ADB86925.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 356 Score = 39.6 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 32/104 (30%), Gaps = 23/104 (22%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRI 400 + + ++ K + K I++LTDG+ T K +I Sbjct: 107 TRLHEAVSFTINLAKQSQVPTK-IIMLTDGKPTDKR--------NVKDYEKLDIPPNTQI 157 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +TI N L A S F+ EL IF Sbjct: 158 ITIGIGNNYN-----ERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|319791199|ref|YP_004152839.1| tade family protein [Variovorax paradoxus EPS] gi|315593662|gb|ADU34728.1| TadE family protein [Variovorax paradoxus EPS] Length = 393 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 17/50 (34%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + G T + +PV++ +V L+ AA A S Sbjct: 9 RRQRGVSMTETMIALPVLILFVMCVVQFGLIYRARLTLEYAAHEAARAGS 58 >gi|218506872|ref|ZP_03504750.1| hypothetical protein RetlB5_04304 [Rhizobium etli Brasil 5] Length = 41 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 9/20 (45%), Positives = 13/20 (65%) Query: 14 LIKSCTGHFFIITALLMPVM 33 I +G+F I+TALLM + Sbjct: 8 FISDRSGNFGIMTALLMVPL 27 >gi|212703143|ref|ZP_03311271.1| hypothetical protein DESPIG_01182 [Desulfovibrio piger ATCC 29098] gi|212673409|gb|EEB33892.1| hypothetical protein DESPIG_01182 [Desulfovibrio piger ATCC 29098] Length = 512 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 13/83 (15%), Positives = 29/83 (34%), Gaps = 12/83 (14%) Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 T + G T + +A+ ++ E +K +++LTDG D + Sbjct: 403 TDRMQMQAHGGTPLAEALWWVMRQML-----------ILRETRKVVLILTDGVPD-DVTQ 450 Query: 386 GIAICNKAKSQGIRIMTIAFSVN 408 + + G+ + I + Sbjct: 451 CLQALEALRKTGVEVYGIGMKFD 473 >gi|327538643|gb|EGF25298.1| TadE family protein [Rhodopirellula baltica WH47] Length = 134 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 1/52 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITAS 65 I++ G + + +PV+L + ++ R Y +LK AA + A + Sbjct: 7 IRARRGAALMEFVMCLPVLLVITLGTLETCRMIYLRQSLKLAAYECARLAIV 58 >gi|170746810|ref|YP_001753070.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170653332|gb|ACB22387.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 207 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 11/139 (7%), Positives = 35/139 (25%), Gaps = 4/139 (2%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + ++ G + AL+ + G ++ + + L+QA A Sbjct: 14 IRELASRYAQARDGATAVEFALVALPFFALVGACLENGIVFWEQEILQQAVSDAS----R 69 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + ++ + + + + + + Sbjct: 70 QIYTGAFQTTNAGTTDTATLMSRFRTAICTQPNGTPRVTIFTCANVRVSVTKVADYDSAN 129 Query: 127 SAYQVVLSSRYDLLLNPLS 145 V ++ NP Sbjct: 130 PVSPVATNASGASDWNPNF 148 >gi|257897710|ref|ZP_05677363.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835622|gb|EEV60696.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 1104 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 43/416 (10%), Positives = 97/416 (23%), Gaps = 38/416 (9%) Query: 55 QAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 AA + + + F + +KK + Sbjct: 317 YAADNGSTAG-EEDVNVPAQTVQLWGDERNFENSYL-----DYNGAYIKKWVEPVLSNNP 370 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 D E + + ++P+ + S T + E Sbjct: 371 TSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIE 430 Query: 175 HGVSIQWVIDFSRSM-----------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 I + +M +++ Q + + + S+N + Sbjct: 431 SVNEISENLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALN 490 Query: 224 IRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 +P + G + + + V R + Sbjct: 491 RTPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIQTRSRTSIWG 550 Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAIND 341 + + A D + +FS T T+A D + T + Sbjct: 551 AW----SSWSVMGDKIAIDRGDLFKNFETFSGNTSNAGFTYPVTYASDFDRPVNGTNVEY 606 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL-LTDGENTQDNEEGIAICNKAKSQGIRI 400 + + +N + NT + NK + I Sbjct: 607 RYGEVKEGDDKATHWVGTGSASNDTNGSPTSQEKSSAINTVAYHHWLK--NKYQENPPSI 664 Query: 401 MTIAFSVNKTQQ------EKARYFLSNCA------SPNSFFEANSTHELNKIFRDR 444 +I ++ + R L N A + +++AN+ +++ D Sbjct: 665 FSIGLGIDGSIAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDI 720 >gi|298345270|ref|YP_003717957.1| hypothetical protein HMPREF0573_10144 [Mobiluncus curtisii ATCC 43063] gi|298235331|gb|ADI66463.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 43063] Length = 171 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 41/139 (29%), Gaps = 5/139 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 +F ++L+ G ++ ++ + M + V + L++ A Sbjct: 12 MPHRFAARFERLV-DEDGRIMLLGLAACVLLCVIILMSMAVSGVYLEQRRLQRLADQ--- 67 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + ++ + + +IE Y + V++ Sbjct: 68 -TASLAAANMADAAYYQNGIVDGVPLQIEPYHASERAAEYLSGAAISANSGLEGIDLVDV 126 Query: 123 NPRKSAYQVVLSSRYDLLL 141 + + QV L + + L Sbjct: 127 DVASTRVQVTLRATGKIPL 145 >gi|242009952|ref|XP_002425745.1| calcium channel, putative [Pediculus humanus corporis] gi|212509649|gb|EEB13007.1| calcium channel, putative [Pediculus humanus corporis] Length = 652 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 66/226 (29%), Gaps = 30/226 (13%) Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 Y S VDSSS KK + + + ++ ++ D + Sbjct: 175 DFRTSSWYIDAATSSKDIVILVDSSSSMGGKKK-GIAKAIVNIILDTLGNND------FV 227 Query: 298 GATFFNDRVISDPSFSWGV----HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 F++ V ++ GS A+ T ++ + Sbjct: 228 NIYRFSESATEIVPCFKDVLVQATAENIRELRIAFDFVKYEGSANFTSALVTGFEILH-- 285 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG-----IRIMTIAFSVN 408 +R + + I+L+TDG ++ K +R+ T + V Sbjct: 286 ---RYNRTGQGCQCNQAIMLITDGPSSSYK-------EIFKQYNWPHMPVRMFT--YLVG 333 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 K + CA+ F + ++ + + I V+ Sbjct: 334 KDGSNQEDMNWMACANKGYFAKVQNSEDAQEKVLQYIAVLARPMVL 379 >gi|109083237|ref|XP_001114797.1| PREDICTED: cochlin-like isoform 4 [Macaca mulatta] Length = 550 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 412 TYDQRTEFSFTDYSTKEDVLAVIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 463 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI I ++ L + AS Sbjct: 464 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 514 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 515 KPKESHAFFTREFTGLEPIVSDVI 538 >gi|291569126|dbj|BAI91398.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 396 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 39/98 (39%), Gaps = 5/98 (5%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE--DEVHRMKNNLEAKKYIVLL 374 + + +T + + +T I + A + + + V LE + ++LL Sbjct: 147 NDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSGRLEPRLSVILL 206 Query: 375 TDGENTQDNEEGI--AICNKA-KSQGIRIMTIAFSVNK 409 +DG + Q NE+ + ++ I + T+ + + Sbjct: 207 SDGFHNQPNEQQDFDNLITLLERNNNIIVHTLGYGLTP 244 >gi|59714345|ref|YP_207120.1| RTX repeat-containing calcium-binding cytotoxin RtxA2 [Vibrio fischeri ES114] gi|59482593|gb|AAW88232.1| RTX (repeats in toxin) calcium-binding cytotoxin RtxA2 [Vibrio fischeri ES114] Length = 3933 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 43/426 (10%), Positives = 101/426 (23%), Gaps = 26/426 (6%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 + VD+ + T ++T +V + + + Sbjct: 2612 VAVDLAGNTVSTTQ-DVTVDTKVLTETVGDGDLGITSDITDNTNTGSNSDTVTNNTTPDI 2670 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + + + + LN + L I Sbjct: 2671 TGVTEAGAKVTITYTDKDGVIHTTDEVTANQNGEYTISLPYQLNEGANALTVTAIDIAGN 2730 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 +T+ + + + IQ V D S S+ + + S Sbjct: 2731 KTEVIQDVTINADPIANDFDIQLVDDISTQFSFDAYVSDLEDDENDTDNKFVDIMITVSP 2790 Query: 219 NGKVGIRDEKLSPYMVSCNKSLY------YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKH 272 + + ++ + L Y+L D + + + Sbjct: 2791 EFGTLYVVDGDTRTEITSSTILTESDQIEYVLDGSINDDLSFNAQEDFAPNYSNGSVNSF 2850 Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + + N + T + D V + G ++ ID + Sbjct: 2851 TLVSGVIISGGQFTGDSPDNTSTLTPETLYYDNVAQETGLGVGNSEIDVKTKDYVEIDFS 2910 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL------------LTDGENT 380 +GS + + T + S + + ++ D N Sbjct: 2911 NVGSPNTTEVVITEVNMDFGSVWGNYSENSSADAKVQVLLFKDGVLVGESPYTFDDESNG 2970 Query: 381 QDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQE----KARYFLSNCASPNSFFEANS 433 + G N G IR+ TI + + + + S + ++A Sbjct: 2971 IYDGSGEFTANIQLDSGFDQIRVYTIHGEGSTSPNSNLTLQGVEVVDAMVSEDIHYQATD 3030 Query: 434 THELNK 439 + + Sbjct: 3031 SDDAID 3036 >gi|86739467|ref|YP_479867.1| twin-arginine translocation pathway signal [Frankia sp. CcI3] gi|86566329|gb|ABD10138.1| Twin-arginine translocation pathway signal [Frankia sp. CcI3] Length = 880 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 22/351 (6%), Positives = 80/351 (22%), Gaps = 6/351 (1%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + + A + A + S ++ + + +++ + + + + + Sbjct: 337 QVTLEQAALLAGLVQSPSRYNPAAHPEAALARRNTVLDKMAAEGYVSTTQAEAAKQLPLN 396 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 +V+ + +++ L ++ ++ I + Sbjct: 397 VVKAPPPAADSCETSSAPFFCDYVRSQLRASPALGSTLEERNRRIYEGGLVIRTTLDPLV 456 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 + S S + QP S+ + + Sbjct: 457 QQAAQDAVNSTISPSNRVSATEVVIQPGTGNILAMAVNRVYGSNTAANQTVVPLPTNATF 516 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + ++L + F + + + Sbjct: 517 QPGSTFKTFVLAAALEQGYGTSTAFYSPACYESKKFPLDRGEGDCPRGFSNSDPAEAGIY 576 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM------GSTAINDAMQTAY 347 + G + + G+ ++ + G+TAI Sbjct: 577 DIPKGTWDSVNTFYVQLAEKTGIPAVLEMAKRLGVTPPKADKIGSTDGATAIGGGQYMYV 636 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 + ++ ++ D + + C + ++G+ Sbjct: 637 SPLQMADAYATIAGGGVRCTPRFATGAVDSSHDPIDIAQPPQCEQVLAKGV 687 >gi|253996765|ref|YP_003048829.1| TadE family protein [Methylotenera mobilis JLW8] gi|253983444|gb|ACT48302.1| TadE family protein [Methylotenera mobilis JLW8] Length = 173 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 19/175 (10%), Positives = 50/175 (28%), Gaps = 10/175 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + YSK+ + G + A++ ++ + +++ R Y + +++ + A Sbjct: 1 MMLKMMKSYSKR--QRQQGAAAVEFAIIALLLFTILFGILEFGRLFYVYNTVQEVTRHAA 58 Query: 62 I-TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFE------NNLKKNFTDREVRDI 114 + S + E N + + + D Sbjct: 59 REAVVRWVDNSNTSPAKILALFGGASVPAGAEITAANIDIQYLTASGAVPSPFPLSASDN 118 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS-MGIKSWLIQTKAEAETVS 168 + + QV + + L FL + + + ++ T S Sbjct: 119 ISACLTGPAGCIALVQVSIIGANYAPMVGLFPFLNLPIPASTVTMPAESLGYTGS 173 >gi|156383825|ref|XP_001633033.1| predicted protein [Nematostella vectensis] gi|156220097|gb|EDO40970.1| predicted protein [Nematostella vectensis] Length = 204 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 55/189 (29%), Gaps = 24/189 (12%) Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 + +D+S + + ++ + S + A +FN + Sbjct: 23 NLVFLIDNSGSIND-TEFDNFKEFAKKLAESFTISAT---YTHVAAVYFNTLANFGFNLK 78 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + ++ + I N G T I A+ D + K +V+ Sbjct: 79 YDINVIKTAID----NLPNIGGGTHIGKALTYTLDNVFKVAP--------RQNVKNVLVV 126 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP---NSFFE 430 LTDG++ A G+ + + + + L+ AS + F Sbjct: 127 LTDGKSHDSVTLPAAAVRNY-GPGVEVFAVGVGAGDSFVAQ----LNVIASDPDEDHVFH 181 Query: 431 ANSTHELNK 439 ++ Sbjct: 182 VEHFSQIES 190 >gi|116694138|ref|YP_728349.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113528637|emb|CAJ94984.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 158 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 43/153 (28%), Gaps = 13/153 (8%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ-- 58 M+ + + + G I A++ PV+L + +V + L AA+ Sbjct: 1 MLLHARVNRGINR-RRHGAGSAAIEFAIVAPVLLAIVIGIVYYGVMLALQQVLTLAAEEG 59 Query: 59 -TAII------TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK---NFTD 108 A + + S V + A + I L TD Sbjct: 60 ARAALRYPAGVSGSGLAATQGARVDAAAAMARGTLPTSIRNLLSTGNVAQAVPCASGSTD 119 Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 V+ + + P V + ++ Sbjct: 120 VCVQVTLNLPTTNILPAVPMVPVPATLSGSAMV 152 >gi|134299958|ref|YP_001113454.1| TadE family protein [Desulfotomaculum reducens MI-1] gi|134052658|gb|ABO50629.1| TadE family protein [Desulfotomaculum reducens MI-1] Length = 137 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 45/142 (31%), Gaps = 20/142 (14%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 I +KL +S G + AL++P+++ + ++ R + +T + Sbjct: 2 IKLLRKLRQSQRGQTLVELALILPILVVMLMGTIEFGRIFFT-----------YLTVTHA 50 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + T+ +QK+E+ T + V + Sbjct: 51 SREVARATVIHTNKDDTYIRQKVEDAASW---------LTTTGLTVEVTPSLPTNRTSGV 101 Query: 128 AYQVVLSSRYDLLLNPLSLFLR 149 V +S +L LS + Sbjct: 102 PLTVTVSYPVELYTPVLSDVMN 123 >gi|154496734|ref|ZP_02035430.1| hypothetical protein BACCAP_01027 [Bacteroides capillosus ATCC 29799] gi|150273986|gb|EDN01086.1| hypothetical protein BACCAP_01027 [Bacteroides capillosus ATCC 29799] Length = 786 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 46/165 (27%), Gaps = 23/165 (13%) Query: 273 LVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 + S++ R+G F+ ++ + + V Sbjct: 152 VNSLYNESLMDQATGA--GGPRSRVGVISFSADAQTETIPVELTSEAEVSFVAGEIDAIT 209 Query: 333 EM----GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------- 381 G+T + A+ + + + + + K I+L TDG Sbjct: 210 YDKVNTGATDLGRAVLSGTEMLRGAQDGVR---------KDMIILFTDGYTDALTPEGME 260 Query: 382 -DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 + A+ G I + + ++ R + N A+ Sbjct: 261 RSSAMMAEGLEAARQLGCEIYVVGLNYQGRINDQGRGEIWNIANS 305 >gi|314980651|gb|EFT24745.1| TadE-like protein [Propionibacterium acnes HL072PA2] Length = 133 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 15/116 (12%), Positives = 32/116 (27%), Gaps = 2/116 (1%) Query: 14 LIKSCTG--HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G + L++PV+L + G++ D R A + AA A + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISATQAAQDAAVAASRAGTDAAATP 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 A + +Q + T + + + Sbjct: 61 QLAGHDGAAVAVQAARQALSAAGGDGSVQEDGSTITVTTSQSRPTVFLSAIGISQV 116 >gi|308171956|ref|YP_003918661.1| hypothetical protein BAMF_0065 [Bacillus amyloliquefaciens DSM 7] gi|307604820|emb|CBI41191.1| conserved hypothetical protein [Bacillus amyloliquefaciens DSM 7] gi|328551766|gb|AEB22258.1| hypothetical protein BAMTA208_00330 [Bacillus amyloliquefaciens TA208] gi|328910026|gb|AEB61622.1| hypothetical protein LL3_00067 [Bacillus amyloliquefaciens LL3] Length = 245 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 28/74 (37%), Gaps = 10/74 (13%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-----NCASP 425 ++L+TDG + + IA+ AK QGI + I + + + A Sbjct: 9 VLLITDGCSNHGE-DPIAMAAFAKEQGITVNVIGIM---EENAIDQEAMKEVEGIAMAGG 64 Query: 426 NSFFEANSTHELNK 439 + +L++ Sbjct: 65 GVH-QVVYASQLSQ 77 >gi|260461955|ref|ZP_05810200.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259032202|gb|EEW33468.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 207 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 13/142 (9%), Positives = 37/142 (26%), Gaps = 1/142 (0%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + G + AL++P++L + M ++ + + + + + Sbjct: 13 IRGKAVGFWSNRRGVAAVEFALIVPILLVMYFMTMEASQAIETSKKVSR-IGSMVADLVT 71 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + T Q ++ ++V + Sbjct: 72 QQPTIVAADLDAIMKIGTSTIQPYNRSTPSIIITAIQVTTDTPPKVNVVWSRKLVNGVSS 131 Query: 127 SAYQVVLSSRYDLLLNPLSLFL 148 A + ++ L FL Sbjct: 132 IATTLPATTTVPTTLRVAGTFL 153 >gi|254881903|ref|ZP_05254613.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|319641094|ref|ZP_07995798.1| von Willebrand factor [Bacteroides sp. 3_1_40A] gi|254834696|gb|EET15005.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|317387338|gb|EFV68213.1| von Willebrand factor [Bacteroides sp. 3_1_40A] Length = 212 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 40/118 (33%), Gaps = 10/118 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++I +++ +T + FN + I Sbjct: 21 IEAVKNGVQTLISTLRSDPYALETAYISIITFNSSAQQVTPLTELAAFQQPNID------ 74 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G TA+ A++ I S +K + I ++TDG T D G+A Sbjct: 75 --ASGCTALGGALELLSQKIDSEITKTTAEVKGDWRP--LIFIMTDGVPTDDISRGLA 128 >gi|169832403|ref|YP_001693966.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] gi|154432882|gb|ABS82071.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432906|gb|ABS82092.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|168994905|gb|ACA35517.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] Length = 883 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 29/91 (31%), Gaps = 8/91 (8%) Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT---QQEKARYFLSNC-ASPNSFFE 430 TD + G + T+ VN + A F+ + +SP+++ Sbjct: 499 TDWITNHGDPTTWYYNGNMAQDGYDVFTVGVGVNGDPGTDEATATRFMQSISSSPDNYTN 558 Query: 431 ANSTH----ELNKIFRDRIGNEIFERVIRIT 457 ELN+ F + + IT Sbjct: 559 VADPSQILQELNRYFYTIVNEKKSIENGTIT 589 >gi|154684584|ref|YP_001419745.1| YabS [Bacillus amyloliquefaciens FZB42] gi|63146677|emb|CAG28929.1| YabS protein [Bacillus amyloliquefaciens FZB42] gi|154350435|gb|ABS72514.1| YabS [Bacillus amyloliquefaciens FZB42] Length = 245 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 28/74 (37%), Gaps = 10/74 (13%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS-----NCASP 425 ++L+TDG + + IA+ AK QGI + I + + + A Sbjct: 9 VLLITDGCSNHGE-DPIAMAAFAKEQGITVNVIGIM---EENAIDQEAMKEVEGIAMAGG 64 Query: 426 NSFFEANSTHELNK 439 + +L++ Sbjct: 65 GVH-QVVYASQLSQ 77 >gi|222147188|ref|YP_002548145.1| hypothetical protein Avi_0226 [Agrobacterium vitis S4] gi|221734178|gb|ACM35141.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 207 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 45/187 (24%), Gaps = 7/187 (3%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI--IT 63 ++ +++L +S G I A+L + +++ E + A T + Sbjct: 17 RWRLVARRLRRSRDGSAAIEFAILAIPYFLIIFAILETFVAFIAEQTVNAAVDTLGRQLR 76 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR-----DT 118 A +I L + N +VR T Sbjct: 77 TGNITYNQARSTDKTATEFRQLFCNEISFLLTCDAAEVATPNRLWLDVRTYTAFSAMPTT 136 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 + + +N L F + A S + ++ + Sbjct: 137 IATTSGSLDTSSFAFTPGGASSINMLRAFYYWPVTTDLVRPYIATIHRPGVSSNSDYLIV 196 Query: 179 IQWVIDF 185 Sbjct: 197 STLAFQN 203 >gi|328474273|gb|EGF45078.1| hypothetical protein VP10329_16240 [Vibrio parahaemolyticus 10329] Length = 154 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 40/134 (29%), Gaps = 12/134 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 KF ++ + G I L +PV+L + +++DV R + +A Sbjct: 5 KFKTQMRR-GRHQEGLAIIEFILALPVLLMLTVLVIDVCRAFIQYTEVNKAL-------- 55 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + F E + T + I D V P Sbjct: 56 ---QNGARYAVVDTYGTLDFEGIADETKIKNVVVYGSPTASTTPVIDYIGVDDIVITQPT 112 Query: 126 KSAYQVVLSSRYDL 139 + V LS+ Y+ Sbjct: 113 GTNKVVTLSATYNY 126 >gi|330819181|ref|YP_004348043.1| TadE family protein [Burkholderia gladioli BSR3] gi|327371176|gb|AEA62531.1| TadE family protein [Burkholderia gladioli BSR3] Length = 257 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 30/105 (28%), Gaps = 5/105 (4%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + ++ PV+L L+ +A + A+ + E+ Sbjct: 11 RQGGQSLVEFVIVAPVLLFFCFGLLQYALLFQARA----TLDSATLEAAREGAVNHAELD 66 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + + Y ++ ++ VRD R + Sbjct: 67 AMQRGLARGLSPLYA-YEANAAGASVALARANQAVRDRARIAIIN 110 >gi|311694769|gb|ADP97642.1| von Willebrand factor, type A [marine bacterium HP15] Length = 581 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 17/127 (13%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + A++TA D + + + +LLTDG+ ++ + + Sbjct: 119 TNLGAAIETASDDYFTDGDLSRT----------HFILLTDGKVDISDDPAKNTAEETRIL 168 Query: 397 GIRIMTI-----AFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 + + F + FL A S F A++ LN F + + Sbjct: 169 DTIVADLIERGATFHPVALSEAADTDFLKALATDSGGRFQVADTADALNLAFLQALNTAV 228 Query: 450 FERVIRI 456 + I I Sbjct: 229 PQEQIPI 235 >gi|327389735|gb|EGE88080.1| hypothetical protein SPAR5_1425 [Streptococcus pneumoniae GA04375] gi|332200111|gb|EGJ14184.1| hypothetical protein SPAR93_1586 [Streptococcus pneumoniae GA47368] Length = 1064 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 36/345 (10%), Positives = 89/345 (25%), Gaps = 11/345 (3%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + R E L A A + S+ QSL + ++ + + I+ +I + Sbjct: 134 LSRAKKLESKLYTEASAANLQTSIQAGQSLLGKADASEAELSAAESSIQSSIIGLELRSN 193 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 T E + VE + S D + P S + + + Sbjct: 194 SNKGTVSETPVAKKANIVEAKEETNPAATTDRSAVDSAVLPTSTAAKVETTSAPASINEI 253 Query: 163 EAETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +S S +++ + +D S ++ + + G Sbjct: 254 LKPGLSLSDARQNPAIRKEDLDKGYSGFRAAPAPTNKLVTNLGNNTVFTDISRGSHTFRG 313 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 R + V+ + + + + + + + Sbjct: 314 NGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFLLDKGNGFGEPSRATISTNR 373 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K I + V + V ++ + + I ++ + Sbjct: 374 LRDQTKDIRKGANFVSHSSYSLTSAVETNSNQTIRFSLPIANPNGDLSVRLRPVT----- 428 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 S+ ++ + N +K ++L D Sbjct: 429 --FNVGQGESGSATSNDPYSNSNYYFREKPLLL--DANPNGGTNN 469 >gi|225621507|ref|YP_002722766.1| hypothetical protein BHWA1_02609 [Brachyspira hyodysenteriae WA1] gi|225216328|gb|ACN85062.1| putative membrane protein containing von Willebrand factor (vWA) type A domain [Brachyspira hyodysenteriae WA1] Length = 324 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 51/185 (27%), Gaps = 31/185 (16%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 D + + F F+ + + G T I DA+ TA +T Sbjct: 101 KKTDNLSVALVGFAGTSFIASPFTQDMETFTYILDNLTTKSVTLQG-TRIADALVTAKNT 159 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 KK I+L+TDGE+ I + + + T+ Sbjct: 160 FNVDAVS-----------KKSIILITDGEDHGGY--FDDILKQLNEMNVSVYTVGVGTEV 206 Query: 410 TQ--------------QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 ++ L A + + A + L IF D + Sbjct: 207 GATISTDLGVREKSVVSKRDDNTLKLIADSTHGKSYIAENVS-LESIFDDMKQSMDSVSA 265 Query: 454 IRITK 458 +R + Sbjct: 266 VRNNR 270 >gi|17533685|ref|NP_496744.1| C-type LECtin family member (clec-64) [Caenorhabditis elegans] gi|3876682|emb|CAB03056.1| C. elegans protein F35C5.7, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 385 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 64/205 (31%), Gaps = 11/205 (5%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV--IRSIKKIDNV 291 + S + L VD+S + + + I + Sbjct: 13 ATTQASTDRNCGSNWNNLWLDIVIVVDNSKGMTNEGITEVAANIATTFASGPRIGTDYSD 72 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + R+ +N + + +F + ++ + + + A + Sbjct: 73 PRSTRLSILTYNSEATVVADLNQFQSADDVYQTLFSFLNEVSDSDDSFLAKGLGMAESVL 132 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + V N L DGE+ + I + +S G+ I T+AF + Sbjct: 133 YNGRMNGVRENYNRLVVVYASAYREDGED------PLNIAKRLRSSGVAIATVAFDPDGD 186 Query: 411 QQEKARYFLSNCASPNSFFEANSTH 435 ++ LS ASPN F++ + Sbjct: 187 GLLLSK--LSKIASPNMSFKSRDPN 209 >gi|38347866|ref|NP_941115.1| hypothetical protein SMR0040 [Serratia marcescens] gi|190410209|ref|YP_001965710.1| hypothetical protein pK29_p040 [Klebsiella pneumoniae] gi|226807603|ref|YP_002791297.1| hypothetical protein pEC-IMP_036 [Enterobacter cloacae] gi|226809913|ref|YP_002791607.1| hypothetical protein pEC-IMPQ_035 [Enterobacter cloacae] gi|38259343|emb|CAE51568.1| hypothetical protein SMR0040 [Serratia marcescens] gi|146151002|gb|ABQ02768.1| conserved hypothetical protein [Klebsiella pneumoniae] gi|226425828|gb|ACO53921.1| hypothetical protein [Enterobacter cloacae] gi|226426139|gb|ACO54231.1| hypothetical protein [Enterobacter cloacae] Length = 668 Score = 39.6 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 31/92 (33%), Gaps = 19/92 (20%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A+ A D ++ S D +K I L+TDG + + + KAK Sbjct: 575 YTPTGSALMAAVDLLLDSQFD-----------RKIIFLITDGYPNKSEFTIVEVMEKAKC 623 Query: 396 QGIRIM--------TIAFSVNKTQQEKARYFL 419 GI I+ I F + L Sbjct: 624 NGIEIVGVGIKTDEIIGFETDTFVTVDDTSLL 655 >gi|284991843|ref|YP_003410397.1| hypothetical protein Gobs_3425 [Geodermatophilus obscurus DSM 43160] gi|284065088|gb|ADB76026.1| conserved hypothetical protein [Geodermatophilus obscurus DSM 43160] Length = 298 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 21/277 (7%), Positives = 44/277 (15%), Gaps = 37/277 (13%) Query: 31 PVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +LG + VDV L+ ++ + + Sbjct: 8 VPILGFTAIAVDVGALYAERARLQ-----------------TAADAAALAVARDCARGAC 50 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 + + +T + F Sbjct: 51 GDMRAT-------------AQDLVDANTGNATAAPPVLASDPVRVTVTGSTPTEHWFAPV 97 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 +G +S + A + + + FS Q Sbjct: 98 IGHESTQVTATATVAWGAPD-----AGTAMLPLIFSWCEFTAQTGGGLPSTTTQRTIELP 152 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 ++ + SE S Sbjct: 153 KKSDTGCTGPSNMFVPGGFG--WLTTDGGNTCRATSRVGGRFTSEPGNNPSQGCDTEDVA 210 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + R L + + G F Sbjct: 211 ALVGRTVLLPIFDEAGGTGSSAWYHVHGYAAFTITAY 247 >gi|168217441|ref|ZP_02643066.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens NCTC 8239] gi|182380509|gb|EDT77988.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens NCTC 8239] Length = 928 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 39/390 (10%), Positives = 97/390 (24%), Gaps = 30/390 (7%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E S + + ++ NN + E + ++ Sbjct: 35 AEEKSDSMSVEKVLDSEMYNKFNNEILNNESSIKYTTDNNKGTWPVNWEYGNVSNKNKIN 94 Query: 133 LSSRYDLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S L +L + + ++ + +++D S SML Sbjct: 95 KSVYGKNQPLEYNEGYLTKKAYTTDEDNVFNINLKIQGKKNEALKKDVVFLLDNSNSMLT 154 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 R + + + + + + ++S+ Y Sbjct: 155 NNRALKIKEEIKNVMDKLKAKNTRYALVTYAST--ILDGRHYHLIDRSIGNNQYTVYDLY 212 Query: 252 SLSEEHFVDSSSLRHVIKKKHL------VRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ H +S+++ + K + N Sbjct: 213 TSNQCHLNFTSNIQEIYNKIPSTVPNQRNNPYAGGTFTQEGLLK-----AIELLKNSNAD 267 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + G+ + + + + + + + + Sbjct: 268 EKIIIHLTDGLPTFSFLLKEFGGNKKAIFDYNTNYNGIGVRGLGTSYFFDTKTQKPYIYS 327 Query: 366 EAKKYIVLL---TDGENTQDNEEGIAI-CNKAKSQG--IRIMTIAFSVNKTQQEKARYFL 419 + Y L E+ +N + K++ I I TI + K + Y Sbjct: 328 REEVYSALNRSKNKYESIWNNGFPTTLEAENIKNENPDINIYTIGVELKKEVYKWDDY-- 385 Query: 420 SNCASPNSFFEANSTHELNKI--FRDRIGN 447 ++ A T EL +I F + I + Sbjct: 386 ------RQYYNAEGTVELPEIRKFLESISS 409 >gi|313235273|emb|CBY10837.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 58/201 (28%), Gaps = 25/201 (12%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 ++ V K L + S+ + +I + Sbjct: 153 ITHNTVDGFKDKVDELDQNTQNECTSKALDIVFVVDESGSVGPDNFDLVKQFLIDYAQDS 212 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE-NEMGSTAINDAMQTAY 347 + + R+ ++ + ++ + + + +E G T DA+ Sbjct: 213 NIAANATRIAIRTYSTY----SDLDFSLNDFKTSNIIFEINNLVHESGGTNTADAITNGL 268 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS--QGIRIMTIAF 405 + + E+ K +V +TDG++ D A + K+ + I+ I Sbjct: 269 NDF----------GNDRSESVKIMVTITDGQSNYD--RVKAAADLLKADPRNIQSFAIGI 316 Query: 406 SVNKTQQEKARYFLSNCASPN 426 A L A+ + Sbjct: 317 ----DGANMAE--LQAIATTD 331 >gi|254293208|ref|YP_003059231.1| TadE family protein [Hirschia baltica ATCC 49814] gi|254041739|gb|ACT58534.1| TadE family protein [Hirschia baltica ATCC 49814] Length = 187 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 13/130 (10%), Positives = 37/130 (28%), Gaps = 8/130 (6%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 I + K+ K+ G + AL +P++ + +++ A + + + Sbjct: 2 LINFIKQFWKNDEGVAALEFALCLPLLTVLFFGTIEISLLVE--------ADRRVTSTAS 53 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + + S + + +K + VE + + Sbjct: 54 TIADLVARTSEVNYCEVEDIFYASSRIIRPKNASTVKMRLSSVVEDSKSGKAVVEWSQGR 113 Query: 127 SAYQVVLSSR 136 + S + Sbjct: 114 NGMAAYASGK 123 >gi|197337244|ref|YP_002158801.1| iron-regulated protein FrpC [Vibrio fischeri MJ11] gi|197314496|gb|ACH63945.1| iron-regulated protein FrpC [Vibrio fischeri MJ11] Length = 3927 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 42/423 (9%), Positives = 100/423 (23%), Gaps = 26/423 (6%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNF 98 + VD + T ++T +V + + + Sbjct: 2606 VAVDSAGNTVSTTQ-DVTVDTKVLTETVGDGDLGITSDITDNTNTGSNSDTVTNNTTPDI 2664 Query: 99 ENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + + + + LN + L I Sbjct: 2665 TGVTEAGAKVTITYTDKDGVIHTTDEVTANQNGEYTISLPYQLNEGTNALTVTAIDIAGN 2724 Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 +T+ + + + IQ V D S S+ + + S Sbjct: 2725 KTEVIQDVTINADPIANDFDIQLVDDISTQFSFDAYVSDLEDDENDTDNKFVDITITDSP 2784 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS----SLRHVIKKKHLV 274 + ++ + L + + ++ + + + Sbjct: 2785 EFGTLYVVDGDIRTEITSSTILTESDQIEYVLDGSINDDLSFNAQEDFAPNYSNGSVNSF 2844 Query: 275 RDALASVIR--SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN 332 A +I N + T + D V + G ++ ID + Sbjct: 2845 TLASGVIISGGQFTGDSPDNTSTLTPETLYYDNVAQETGLGVGNSEIDVKTKDYVEIDFS 2904 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL------------LTDGENT 380 +GS + + T + S + + ++ D N Sbjct: 2905 NVGSPNTTEVVITEVNMDFGSVWGNYSENSSADAKVQVLLFKDGVLVGESPYTFDDESNG 2964 Query: 381 QDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQE----KARYFLSNCASPNSFFEANS 433 + G N G IR+ TI + + + + S + ++A Sbjct: 2965 IYDGSGEFTANIQLDSGFDQIRVYTIHGEGSTSSNSNLTLQGVEVVDAMVSEDIHYQATD 3024 Query: 434 THE 436 + + Sbjct: 3025 SDD 3027 >gi|148697326|gb|EDL29273.1| procollagen, type XIV, alpha 1, isoform CRA_b [Mus musculus] Length = 1093 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + G+T A+ ++ + K +L+TDG Sbjct: 217 NTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILITDG 270 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 271 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADLSELQEIASEPDSTHVYNVAE 320 >gi|138893737|ref|YP_001124190.1| hypothetical protein GTNG_0058 [Geobacillus thermodenitrificans NG80-2] gi|196250779|ref|ZP_03149466.1| conserved hypothetical protein [Geobacillus sp. G11MC16] gi|134265250|gb|ABO65445.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] gi|196209729|gb|EDY04501.1| conserved hypothetical protein [Geobacillus sp. G11MC16] Length = 246 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 25/58 (43%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + I+L+TDG + + A+ A+ QGI + I T E + + A+ Sbjct: 7 RQILLITDGCSNHGE-DPAAMAALAREQGITVNVIGVLDQDTIDENGQREIEAIAAAG 63 >gi|327538082|gb|EGF24772.1| TadE family protein [Rhodopirellula baltica WH47] Length = 346 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 12/71 (16%), Positives = 26/71 (36%), Gaps = 4/71 (5%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + +G + L+ V+ + G + W Y ++QAA + L Q+ Sbjct: 2 RSRRHNRSGQALVEFGLVALVLYMLVGGAITFGIWIYAAGQIQQAANV----GARELSQT 57 Query: 72 LEEVSSRAKNS 82 +N+ Sbjct: 58 PLPFDETFENA 68 >gi|153836830|ref|ZP_01989497.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|260361572|ref|ZP_05774599.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|260876729|ref|ZP_05889084.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|260900400|ref|ZP_05908795.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308095540|ref|ZP_07663324.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|149749976|gb|EDM60721.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|308089064|gb|EFO38759.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091406|gb|EFO41101.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109136|gb|EFO46676.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308113987|gb|EFO51527.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 154 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 40/134 (29%), Gaps = 12/134 (8%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 KF ++ + G I L +PV+L + +++DV R + +A Sbjct: 5 KFKTQMRR-GRHQEGLAIIEFILALPVLLMLTVLVIDVCRAFIQYTEVNKAL-------- 55 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + F E + T + I D V P Sbjct: 56 ---QNGARYAVVDTYGTLDFEGIADETKIKNVVVYGSPTASTTPVIDYIGVDDIVITQPT 112 Query: 126 KSAYQVVLSSRYDL 139 + V LS+ Y+ Sbjct: 113 GTNKVVTLSATYNY 126 >gi|310825519|ref|YP_003957877.1| hypothetical protein STAUR_8296 [Stigmatella aurantiaca DW4/3-1] gi|309398591|gb|ADO76050.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 1416 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 11/60 (18%), Positives = 19/60 (31%), Gaps = 9/60 (15%) Query: 399 RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 R+ TI F ++ L N A + A+ L + I + R + Sbjct: 429 RVHTIGFGIDSN-------LLKNTADVGKGLYATADDAASLTNAILNAIAAQDIRRTTAL 481 >gi|281337632|gb|EFB13216.1| hypothetical protein PANDA_007041 [Ailuropoda melanoleuca] Length = 524 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + D Sbjct: 386 TYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDSP------ 439 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D A GI I ++ L + AS Sbjct: 440 --NKNFLVIVTDGQSYDDVRGP---AAAAHDAGITIFSVGV------AWAPLDDLKDMAS 488 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 489 KPKESHAFFTREFTGLEPIVSDVI 512 >gi|254473781|ref|ZP_05087176.1| TadE-like protein [Pseudovibrio sp. JE062] gi|211957167|gb|EEA92372.1| TadE-like protein [Pseudovibrio sp. JE062] Length = 191 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 21/54 (38%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 K K+ G + A++ P +LG+ ++++ E L A A Sbjct: 19 KILIGKFAKNARGVTAVEFAIIAPFLLGMVMAILELGLSFLVEVVLDNAVAEAS 72 >gi|297694868|ref|XP_002824689.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Pongo abelii] Length = 482 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + Sbjct: 344 TYDQRTEFSFTDYSTKENVLAAIRNIRYMSGGTATGDAISFTVRNVF--------GPIRE 395 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V++TDG++ D + A GI + ++ L + AS Sbjct: 396 SPNKNFLVIVTDGQSYDDVQGP---AAAAHDAGITVFSVGV------AWAPLDDLKDMAS 446 Query: 425 ---PNSFFEANSTHELNKIFRDRI 445 + F L I D I Sbjct: 447 KPKESHAFFTREFTGLEPIVSDVI 470 >gi|42526760|ref|NP_971858.1| batB protein, putative [Treponema denticola ATCC 35405] gi|41817075|gb|AAS11769.1| batB protein, putative [Treponema denticola ATCC 35405] Length = 322 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 17/94 (18%) Query: 318 KLIRTIVKTFAIDENE----MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 + + + + T + + A D+ N K IVL Sbjct: 143 SFEKNALSSAINALSPLILSSTGTNLEAGVLRALDSF-----------GENRGNSKIIVL 191 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 TDG T + + K K ++ + F Sbjct: 192 CTDGGETSGS--LLHAAEKIKKTDAILIIVGFGT 223 >gi|303233418|ref|ZP_07320087.1| conserved hypothetical protein [Atopobium vaginae PB189-T1-4] gi|302480547|gb|EFL43638.1| conserved hypothetical protein [Atopobium vaginae PB189-T1-4] Length = 183 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 45/134 (33%), Gaps = 2/134 (1%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ-AAQT- 59 V ++ + + + G + A LMPV++ V + ++V + + A Sbjct: 20 VAASRQRAFLRTFFHNAQGQICVEFAALMPVVIVVMLIAYNIVCYVQLCSKFDRCARDAI 79 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A++ S P QS + +S K + + + ++ + + T Sbjct: 80 AVLGVSPPGKQSADNAASEIKTAIQTAMGEAACDVDVQAQSACDETGAKFKFFISPLLTT 139 Query: 120 VEMNPRKSAYQVVL 133 + + + Sbjct: 140 YTCKMSYKPWPLSV 153 >gi|193734230|gb|ACF19883.1| PilC1 [Kingella kingae] Length = 1362 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 55/221 (24%), Gaps = 12/221 (5%) Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVND 293 + N S Y + S S S+ + + + S +I+ +N Sbjct: 250 IDRNSSKYRGMCSTAYQKSNSNVKNWCSALPQQFVYYPYSGISDPDDTTTSAYRINYMNT 309 Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + A + + + L + G + Sbjct: 310 WSGIFAKKY--TEYPLRDLTNDLDFLDSGSAYYTNTNLPSSGFGRYKKFFFNNQPILAPQ 367 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + + E I L G + + +Q I TI F ++ Sbjct: 368 TQYAGAMLSFFSEPINKIDLRIGGNDATGKSWDDS---AFPTQNITTFTIGFGNGLSESG 424 Query: 414 KA-------RYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 A L + +++ A+ L K F + Sbjct: 425 LALLRGGASNIKLKDGTVQKAYYSASDQTALQKAFDAILAQ 465 >gi|193786519|dbj|BAG51302.1| unnamed protein product [Homo sapiens] Length = 164 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 31/91 (34%), Gaps = 12/91 (13%) Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T A++ + S + R N + ++ TDG +QD+ +AK Sbjct: 4 GTMTGLALRHMVEHSFSEAQGARPRALNVP---RVGLVFTDGR-SQDDISV--WAARAKE 57 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +GI + + + L AS Sbjct: 58 EGIVMYAVGVG------KAVEAELREIASEP 82 >gi|88858354|ref|ZP_01132996.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] gi|88819971|gb|EAR29784.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] Length = 684 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 47/165 (28%), Gaps = 21/165 (12%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 FN +V + + + + + F G T I A + D Sbjct: 364 QDSFNIIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFEQVLD----- 418 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + + + IV LTDG + + E I RI TI Sbjct: 419 -------NSEHADYLRQIVFLTDGSISNETEVFAQIKGSLGDS--RIFTIGIGS-----A 464 Query: 414 KARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 YF++ A +F ++ + ++ + + Sbjct: 465 PNSYFMTRAATLGRGTFTFIGDVTDVQRTMKNLFVQLANAALKEL 509 >gi|283782261|ref|YP_003373016.1| TadE family protein [Pirellula staleyi DSM 6068] gi|283440714|gb|ADB19156.1| TadE family protein [Pirellula staleyi DSM 6068] Length = 145 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 14/100 (14%), Positives = 36/100 (36%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 SK+ ++ + + AL +P++ + +++ R + E++++ AA A VP Sbjct: 9 RSSSKRTTRARSAAVTVEFALTIPLLFLLFFGMIEFARIAMIENSVENAAYEGARAAIVP 68 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT 107 S ++ + + + T Sbjct: 69 GGTSTSAQTAAQSALSAAMISGATVSVTPSNITIATTSVT 108 >gi|160897506|ref|YP_001563088.1| membrane protein [Delftia acidovorans SPH-1] gi|160363090|gb|ABX34703.1| membrane protein-like protein [Delftia acidovorans SPH-1] Length = 762 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 43/152 (28%), Gaps = 5/152 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G + ALL V+ + G+ VD+ Y + L++ A A A+ + Sbjct: 27 RRQGGSLLVQFALLGSVLFLLLGV-VDLGYMYYAKRDLQRIADLAATQAAQAINAERSNR 85 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKK----NFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ L RN + N D + + +A +V Sbjct: 86 AACVAAGEDSITANWPVPLTRNSTQTVVSCGNWNPADPARQSSGGRFFDASSGPLNAARV 145 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + + + + A Sbjct: 146 TVGGTSPTIFPGPWSRAVVAEAIAMRNDSVAS 177 >gi|109129708|ref|XP_001116037.1| PREDICTED: integrin alpha-D-like, partial [Macaca mulatta] Length = 101 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 4/85 (4%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 H+ AKK ++++TDG+ +D E + +A+ GI I Q+ AR Sbjct: 6 HKNGARKSAKKILIVITDGQKYKDPLEYRDVIPQAEKAGIIRYAIGVG-RAFQEPTARQE 64 Query: 419 LSNCASP---NSFFEANSTHELNKI 440 L+ S + F+ ++ L+ I Sbjct: 65 LNTIGSAPPQDHVFKVDNFAALSSI 89 >gi|229582324|ref|YP_002840723.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|228013040|gb|ACP48801.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 356 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 32/104 (30%), Gaps = 23/104 (22%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-------QGIRI 400 + + ++ K + K I++LTDG+ T K +I Sbjct: 107 TRLHEAVSFTINLAKQSQVPTK-IIMLTDGKPTDKR--------NVKDYEKLDIPPNTQI 157 Query: 401 MTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +TI N L A S F+ EL IF Sbjct: 158 ITIGIGNNYN-----ERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|111022920|ref|YP_705892.1| hypothetical protein RHA1_ro05957 [Rhodococcus jostii RHA1] gi|110822450|gb|ABG97734.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 548 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 13/114 (11%), Positives = 41/114 (35%), Gaps = 11/114 (9%) Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGI-AICNKAK--- 394 D + + + ++ + + +++LTDG N + + + + Sbjct: 433 DGLIGGGTGLYDTTLAAFRTVQETYDPRAVNSVIILTDGANEDPDSITKEQLLSILQRET 492 Query: 395 --SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 ++ + I+TI + + A + + S + A ++ +F + + Sbjct: 493 DPARPVIIVTIGITGDADAATLAEI---SRVTGGSSYVAKDPADIANVFVNALA 543 >gi|119774146|ref|YP_926886.1| type IV pilin biogenesis protein [Shewanella amazonensis SB2B] gi|119766646|gb|ABL99216.1| type IV pilin biogenesis protein, putative [Shewanella amazonensis SB2B] Length = 1148 Score = 39.6 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 27/312 (8%), Positives = 75/312 (24%), Gaps = 28/312 (8%) Query: 152 GIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRT 211 G S + + ++ ++ + + ++ + Sbjct: 167 GSPSPFFRVSSGSKEATKELAMSKAKNTGFGTGRVVTLYTKTYLTWYHSKKKQVNRTRID 226 Query: 212 VKSYSSQNGKVGIRDE-----KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRH 266 + + N + + + ++ + +++ +++ L Sbjct: 227 IAKEAVTNVLLTTPGVDFGLAIFNSNVYEGYDDGGRIIAGIKPATASNKKDLIEAVDLLK 286 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN-DRVISDPSFSWGVHKLIRTIVK 325 L + D + + D + ++ + Sbjct: 287 GTTWTPLCETLYEAYRYFSGGEVWFGDDDPTLKPYRDKDAIDANSRYKSPFKTCQNRAYI 346 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NE 384 + D + N + + + Y+ L+ NTQD N Sbjct: 347 VYVTDGEPTRDSNANQLVYDLTGGVDAYT----------SSPASYLSSLSSWMNTQDVNP 396 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 Q + TI FS L + A +++A + +L K Sbjct: 397 NMTG------KQSVSTYTIGFSQGAASAAG---LLRHTAEKGGGKYYDATNVDDLQKSLM 447 Query: 443 DRIGNEIFERVI 454 N + + Sbjct: 448 QVFKNILEKNAS 459 >gi|289596620|ref|YP_003483316.1| von Willebrand factor type A [Aciduliprofundum boonei T469] gi|289534407|gb|ADD08754.1| von Willebrand factor type A [Aciduliprofundum boonei T469] Length = 2166 Score = 39.2 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 72/243 (29%), Gaps = 22/243 (9%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 E + E + D S +++ + + G Y++ + Sbjct: 1166 EATVLDINGESPLDFINRRDVSTNVVGDWDWTLSGDTDSDGVYTQDGNTFYTNDTKVLNF 1225 Query: 225 RDEKLSPYMVS-------------CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + ++ + G ++ + D + + Sbjct: 1226 TGGAYHSASSAFWMPEAPRLNKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRI 1285 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A ++ + D V G N F++ + TI+ D Sbjct: 1286 DVAIQAAIDAVKELGPQDRVAVFTFDG----NSHPEEYMGFTYVTADNLPTIISD-LKDI 1340 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T + D + A + + + D + +A + I++LTDG + DN + N Sbjct: 1341 QADGGTPLYDTLSWAVYYMDTQSADNP----DREDATRGILVLTDGLSNSDNYGPNNVRN 1396 Query: 392 KAK 394 A+ Sbjct: 1397 GAR 1399 >gi|269126096|ref|YP_003299466.1| hypothetical protein Tcur_1855 [Thermomonospora curvata DSM 43183] gi|268311054|gb|ACY97428.1| hypothetical protein Tcur_1855 [Thermomonospora curvata DSM 43183] Length = 382 Score = 39.2 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 21/45 (46%), Gaps = 3/45 (6%) Query: 16 KSCTGHFFIITALLM--PVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 + G + ALL+ +++G+ ++VD+ R L+ A Sbjct: 17 RDE-GAVTTLFALLLSSLIVVGMLALVVDIGRLYIEREELQNGAD 60 >gi|217970127|ref|YP_002355361.1| von Willebrand factor type A [Thauera sp. MZ1T] gi|217507454|gb|ACK54465.1| von Willebrand factor type A [Thauera sp. MZ1T] Length = 840 Score = 39.2 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 44/161 (27%), Gaps = 15/161 (9%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISSN 354 R + F V W R + +A+ G T + +A+ + + Sbjct: 303 RFSLSRFGSTVEHRSRALWRTSAATRQAGQRWAMQLQADLGGTEMENALASTLALAGDAE 362 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 ++L+TDG+ + +A++ G RI + Sbjct: 363 PSPGTEEGAAAV---DLLLITDGQIHAIDRTV----KRARALGNRIFVVGIGS-----AP 410 Query: 415 ARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A L A + + + +R+ Sbjct: 411 AEGVLRRLADETGGACDFVAPGEAVEPAVLRMFARLRSQRM 451 >gi|221204654|ref|ZP_03577671.1| membrane protein [Burkholderia multivorans CGD2] gi|221175511|gb|EEE07941.1| membrane protein [Burkholderia multivorans CGD2] Length = 609 Score = 39.2 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 24/266 (9%), Positives = 70/266 (26%), Gaps = 4/266 (1%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS--LEEVSSRA 79 ++ A+ + V L V G +DV + L++ A + A + + + A Sbjct: 1 MAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQSVNDLCPQTDTTVTA 59 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 S + F+ N L++ + Sbjct: 60 SGSNAVVTAAYRGAALNGFDAQASGNSMSIACGRWDVSDYGAAAGYFGTATNQLNAVRVV 119 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + LF ++ + A+A + D + + + + + G Sbjct: 120 ATKTVPLFF-IGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSVSADQRNTGL 178 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 G + ++ + K+ + + + + Sbjct: 179 VNALLGALLNTSLSLNIGSYQALACTRVKVGDLVKAQVGAGTVDQLLATKLTLNQLVSVM 238 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSI 285 ++ + + A ++ + ++ Sbjct: 239 ATALKNTSVANLGIYSSASSAALDTL 264 >gi|262201266|ref|YP_003272474.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262084613|gb|ACY20581.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 594 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 44/122 (36%), Gaps = 18/122 (14%) Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 G T + D A+ + S+ + ++++TDG+N + Sbjct: 480 GGGTGLYDTTLAAFKKVQSTYDPNYSNS---------VIIMTDGQNEDPGSITLTELLAE 530 Query: 394 K------SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGN 447 ++ + I+TI S + + A+ + + A + ++ ++F + I Sbjct: 531 LKELEDPARPVLILTIGISEDADTNALRQIAQ---ATGGTTYVAKTAADIKQVFTNAIAA 587 Query: 448 EI 449 + Sbjct: 588 RV 589 >gi|254167891|ref|ZP_04874740.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623182|gb|EDY35748.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 2164 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 72/243 (29%), Gaps = 22/243 (9%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 E + E + D S +++ + + G Y++ + Sbjct: 1164 EATVLDINGESPLDFINRRDVSTNVVGDWDWTLSGDTDSDGVYTQDGNTFYTNDTKVLNF 1223 Query: 225 RDEKLSPYMVS-------------CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + ++ + G ++ + D + + Sbjct: 1224 TGGAYHSASSAFWMPEAPRLNKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRI 1283 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A ++ + D V G N F++ + TI+ D Sbjct: 1284 DVAIQAAIDAVKELGPQDRVAVFTFDG----NSHPEEYMGFTYVTADNLPTIISD-LKDI 1338 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN 391 G T + D + A + + + D + +A + I++LTDG + DN + N Sbjct: 1339 QADGGTPLYDTLSWAVYYMDTQSADNP----DREDATRGILVLTDGLSNSDNYGPNNVRN 1394 Query: 392 KAK 394 A+ Sbjct: 1395 GAR 1397 >gi|170701750|ref|ZP_02892686.1| TadE family protein [Burkholderia ambifaria IOP40-10] gi|170133333|gb|EDT01725.1| TadE family protein [Burkholderia ambifaria IOP40-10] Length = 156 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 47/164 (28%), Gaps = 12/164 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 Y G + AL+ P+ + +V + +L + A+ Sbjct: 3 RQYRPAACSRERGATAVEFALVFPLFFLILYAVVTFGLIFAVQQSL-------TLAAT-- 53 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + Q + + + V+ I + + + Sbjct: 54 --EGARSALNYVYEANGSGSQALTDRASAAKTTAVGLTSWLAHVQ-IPTPVSGTCSYDPT 110 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 Y V ++ Y +PL L +G+ + T ++ + Sbjct: 111 MYCVTVTVTYPYQAHPLVPSLPLLGLVTPAQLTGTATVQINPAT 154 >gi|195350113|ref|XP_002041586.1| GM16666 [Drosophila sechellia] gi|194123359|gb|EDW45402.1| GM16666 [Drosophila sechellia] Length = 1407 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 25/349 (7%), Positives = 61/349 (17%), Gaps = 31/349 (8%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 V S + ++ TA + ++ S + +S Sbjct: 8 VAGASRA-NKMQAVMDTAAMAIAMGSSGSGSVSGLSSGSSSGPGSSSSGSSSSGYGSATT 66 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 + Y L+S Sbjct: 67 TPTSGGHFDNSPTSMPMATIATVAPFYSEALTSLDAQQRQQQQQQHNPSHYYPQNYGYGN 126 Query: 163 EAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 + + + + + G A + Sbjct: 127 SLPLDTAYNYSGAPYNSYSSGAVGQQVSGNSFPVSGIRYEKLTPAAGAKYGANPIGATGT 186 Query: 223 GIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI 282 GI+ + + + +Y +++ K + A S++ Sbjct: 187 GIKPQATA--------TPFYAQPLSGGAAGSGGYISQSNATYDPYKYKIPSAQAATHSMM 238 Query: 283 RSIKKIDNVNDTVRMGATFFN---------DRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 S V T + + + Sbjct: 239 PSNMAGSAVPGTSYGLPLNPGAVPQKQTQVSQQTQLTQLEPNYAAIKPSRSYQGMSTAAV 298 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G T N ++ + KY + ++ Sbjct: 299 YGGTGGNS-------------SGVMNNPTSAQYYDKYGMTMSMYSPNGG 334 >gi|149003228|ref|ZP_01828124.1| G5 domain family protein [Streptococcus pneumoniae SP14-BS69] gi|147758688|gb|EDK65685.1| G5 domain family protein [Streptococcus pneumoniae SP14-BS69] Length = 850 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 35/345 (10%), Positives = 89/345 (25%), Gaps = 11/345 (3%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + R E L A A + S+ Q+L + ++ + + I+ +I + Sbjct: 134 LSRAKKLESKLYTEASAANLQTSIQAGQNLLGKADASEAELSAAESSIQSSIIGLELRSN 193 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 T E + VE + S D + P S + + + Sbjct: 194 SNKGTVSETPVAKKANIVEAKEETNPAATTDRSAVDSAVLPTSTAAKVETTSAPASINEI 253 Query: 163 EAETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +S S +++ + +D S ++ + + G Sbjct: 254 LKPGLSLSDARQNPAIRKEDLDKGYSGFRAAPAPTNKLVTNLGNNTVFTDISRGSHTFRG 313 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 R + V+ + + + + + + + Sbjct: 314 NGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFLLDKGNGFGEPSRATISTNR 373 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K I + V + V ++ + + I ++ + Sbjct: 374 LRDQTKDIRKGANFVSHSSYSLTSAVETNSNQTIRFSLPIANPNGDLSVRLRPVT----- 428 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 S+ ++ + N +K ++L D Sbjct: 429 --FNVGQGESGSATSNDPYSNSNYYFREKPLLL--DANPNGGTNN 469 >gi|283787707|ref|YP_003367572.1| tight adherence protein TadE [Citrobacter rodentium ICC168] gi|282951161|emb|CBG90852.1| putative tight adherence protein TadE [Citrobacter rodentium ICC168] Length = 170 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 28/70 (40%), Gaps = 3/70 (4%) Query: 7 FIFYSKKLIK---SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + +KL G + AL+ ++ + +L ++ R +Y AL A A Sbjct: 8 ITVFRRKLFAGAGDERGAVAVELALVFFPLMFMIFLLFELCRVTYISSALNLATAEASRY 67 Query: 64 ASVPLIQSLE 73 A+V I + Sbjct: 68 AAVAKISDKD 77 >gi|114327015|ref|YP_744172.1| hypothetical protein GbCGDNIH1_0351 [Granulibacter bethesdensis CGDNIH1] gi|114315189|gb|ABI61249.1| hypothetical protein GbCGDNIH1_0351 [Granulibacter bethesdensis CGDNIH1] Length = 329 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 43/152 (28%), Gaps = 20/152 (13%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 + + K + R R G F+ I + + Sbjct: 101 NQTPNAAKESKTDAAIRLVKDFFRQ-------RPHDRFGVVAFSTSPILAMPLTEHRAAV 153 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 ++ A+ + T I + A+ + +S+ D + ++ ++DG Sbjct: 154 EASLD---AMRRPAIARTNIGRGLSLAFAQLQNSSPDAA----------RVVLFVSDGAG 200 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 D E + +A G+ I + Sbjct: 201 VIDGELEPRLHAEAVKLGVHIYYLYLRTEGDP 232 >gi|268532320|ref|XP_002631288.1| Hypothetical protein CBG03101 [Caenorhabditis briggsae] Length = 394 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 53/166 (31%), Gaps = 12/166 (7%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF-----FNDRVISDPSFSWGVHKLIR 321 + VR+ L V+ SI KI V +T ++D + K Sbjct: 45 TNAQLWQVRNTLTQVLGSISKIGPVKYPADPRSTCVGIVTYDDNATTQSQL--DASKSFS 102 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + + +T + + A + +D +R K I D + Sbjct: 103 DLYNVIQSSLISVDNTNTSY-LSLALLAAEKALKDGRNRTYRFNYKKVIIAFAADYQGHG 161 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + + I N+ K + I+T+A T + + ASP Sbjct: 162 TALDAMPIANRLKDNAVTIITVA----CTSNSDKQTAIQGIASPGF 203 >gi|116625432|ref|YP_827588.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228594|gb|ABJ87303.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 50/150 (33%), Gaps = 22/150 (14%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 V S R + + R ++ +R+ D F+D F+ Sbjct: 93 STGVVFDSSRSMKNRLQDARQSVEQFLRTGSTGDEY------FLIRFSDEAKMLAPFTAD 146 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 ++ R G TA+ND++ A + + + +K +++++ Sbjct: 147 TEEIARQ-----LGSIEAKGWTALNDSIVLAAN-----------QSRKARNHRKALLVIS 190 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 DG + + + + +R+ ++ Sbjct: 191 DGGDNNSRYTVGEMISILREADLRVYAVSI 220 >gi|329902233|ref|ZP_08273073.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] gi|327548825|gb|EGF33456.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] Length = 338 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 49/170 (28%), Gaps = 27/170 (15%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F++ F+ + I + +G+T + AM A T + Sbjct: 144 LMMFSENQFQVMPFNMRPDMIQAAIQAGGV--GSGLGNTDVGSAMLAALRTFDDRPDSG- 200 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + I+L++DG + I + K I + I Sbjct: 201 ---------SRIIMLVSDGGAQIAPAVRLQIADGLKRNRIALYWIYLRSYNQPALADSDS 251 Query: 419 --------------LSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + ++P FEA++ + + + +G + + Sbjct: 252 AEFDGVVEVAMHRYFRSLSTPYRAFEADNQASMQRAI-EAVGRQQMLPIT 300 >gi|160931316|ref|ZP_02078716.1| hypothetical protein CLOLEP_00153 [Clostridium leptum DSM 753] gi|156869700|gb|EDO63072.1| hypothetical protein CLOLEP_00153 [Clostridium leptum DSM 753] gi|314974315|gb|EFT18411.1| TadE-like protein [Propionibacterium acnes HL053PA1] gi|314977802|gb|EFT21896.1| TadE-like protein [Propionibacterium acnes HL045PA1] gi|315081091|gb|EFT53067.1| TadE-like protein [Propionibacterium acnes HL078PA1] gi|315090034|gb|EFT62010.1| TadE-like protein [Propionibacterium acnes HL072PA1] gi|315096617|gb|EFT68593.1| TadE-like protein [Propionibacterium acnes HL038PA1] gi|327325745|gb|EGE67540.1| putative membrane protein [Propionibacterium acnes HL096PA2] gi|327330167|gb|EGE71917.1| putative membrane protein [Propionibacterium acnes HL096PA3] gi|327333253|gb|EGE74974.1| putative membrane protein [Propionibacterium acnes HL097PA1] gi|327443402|gb|EGE90056.1| TadE-like protein [Propionibacterium acnes HL043PA1] gi|327447494|gb|EGE94148.1| TadE-like protein [Propionibacterium acnes HL043PA2] gi|327456785|gb|EGF03440.1| TadE-like protein [Propionibacterium acnes HL087PA3] gi|328761106|gb|EGF74653.1| putative membrane protein [Propionibacterium acnes HL099PA1] Length = 133 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 15/116 (12%), Positives = 33/116 (28%), Gaps = 2/116 (1%) Query: 14 LIKSCTG--HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G + L++PV+L + G++ D R A + AA A + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISATQAAQDAAVAASRAGTDAAATP 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 A + +Q + + T + + + Sbjct: 61 QLAGHDGAAVAVQAARQALSAAGVDGSVQEDGSTITVTTSQSRPTVFLSAIGISQV 116 >gi|115374116|ref|ZP_01461404.1| hypothetical protein STIAU_3108 [Stigmatella aurantiaca DW4/3-1] gi|115368892|gb|EAU67839.1| hypothetical protein STIAU_3108 [Stigmatella aurantiaca DW4/3-1] Length = 1408 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 11/60 (18%), Positives = 19/60 (31%), Gaps = 9/60 (15%) Query: 399 RIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 R+ TI F ++ L N A + A+ L + I + R + Sbjct: 421 RVHTIGFGIDSN-------LLKNTADVGKGLYATADDAASLTNAILNAIAAQDIRRTTAL 473 >gi|73539238|ref|YP_299605.1| transmembrane protein [Ralstonia eutropha JMP134] gi|72122575|gb|AAZ64761.1| probable transmembrane protein [Ralstonia eutropha JMP134] Length = 563 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 9/83 (10%), Positives = 22/83 (26%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + +L + V + +DV + L+ + ++ + L + Sbjct: 11 RERGAVSTVAVILTATVAFVALLSIDVGHVFMRQRQLQNMVDLSALSGAQQLKLATSAAL 70 Query: 77 SRAKNSFTFPKQKIEEYLIRNFE 99 A T + Sbjct: 71 QSAGVLGTVQTIGSQNGYPSGIA 93 >gi|227114883|ref|ZP_03828539.1| hypothetical protein PcarbP_18060 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 182 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 38/145 (26%), Gaps = 4/145 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +S G A L+PV+L ML ++ R A A A+ + Sbjct: 21 RHWRSTRGVIATEVAFLVPVVLVGVMMLFELARIGLVIAAGSAALDKAVQAF-RLDNLAS 79 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLK---KNFTDREVRDIVRDTAVEMNPRKSAY 129 + + YL + N + N ++ Sbjct: 80 DSAEQMGTRLKARMVEAGYGYLKEDDLTVSVLHFDNLSQLGGLTNGNGNQDNPNGEETTT 139 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIK 154 V S + ++ + + Sbjct: 140 LPVWSVTVQIKKAFITPLPEVLTLG 164 >gi|158319480|ref|YP_001511987.1| hypothetical protein Clos_0429 [Alkaliphilus oremlandii OhILAs] gi|158139679|gb|ABW17991.1| conserved hypothetical protein [Alkaliphilus oremlandii OhILAs] Length = 244 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 22/62 (35%), Gaps = 3/62 (4%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF--SVNKTQQEKARYFLSNCASPN 426 K ++L+TDG++ I KA GI + TI + A Sbjct: 8 KQMILVTDGQSNVGGS-PIIAAEKAYRNGIIVNTIGIVDGKESNEDALNEIVEIAKAGGG 66 Query: 427 SF 428 ++ Sbjct: 67 TY 68 >gi|156386852|ref|XP_001634125.1| predicted protein [Nematostella vectensis] gi|156221204|gb|EDO42062.1| predicted protein [Nematostella vectensis] Length = 398 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 47/162 (29%), Gaps = 14/162 (8%) Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 +VD S V + + D R+ ++ + Sbjct: 206 CSKKADIAFYVDVSGSIGQNN-LERVIAYILKFLDRFDFAQ---DKTRVAVVEYDVVPHT 261 Query: 309 DPSF-SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + G+ + + ST + A++ A + V Sbjct: 262 KLTLLGSGLASISDIREIVASQIRFSGKSTRTDLALKNAQNLFQREQRSGVP-------- 313 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 K +V++TDG + D+ K+ G++ + +A + Sbjct: 314 -KTLVIITDGRSWYDDTLLNGAIAGIKALGVKTLAVAVGDDS 354 >gi|119476218|ref|ZP_01616569.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] gi|119450082|gb|EAW31317.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] Length = 750 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 45/168 (26%), Gaps = 23/168 (13%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 F V + + + T I A++ A Sbjct: 80 VWTFGQWVNNLVPSNTVTTAWRANASAQAEKISSVALRTNIPAALEKAM----------A 129 Query: 359 HRMKNNLEAKKYIVLLTDG---------ENTQDNEE-GIAICNKAKSQGIRIMTIAFSVN 408 + +++LLTDG +N + I K G+ I T+A S N Sbjct: 130 DVKSKGTDYSIHLILLTDGMVDVSLSSIDNETARQRIVAEILPALKGAGVTIHTVALSQN 189 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + + A + +L+++F + + Sbjct: 190 ADLELMELLAAE---TGGLSAVAETAEDLSRVFVQAFDAAAPAEQLPL 234 >gi|323138935|ref|ZP_08073996.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322395781|gb|EFX98321.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 204 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 9/71 (12%), Positives = 21/71 (29%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 LI G + L+ L + +++ ++ + L Q A +Q+ Sbjct: 22 NLIHDSRGFAAVEFGLIALPFLLLIVAILEYSYGNFAQSRLDAVVQQASRQIMTGYVQNQ 81 Query: 73 EEVSSRAKNSF 83 + Sbjct: 82 SVGGKALDANQ 92 >gi|198430887|ref|XP_002124814.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 400 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 49/370 (13%), Positives = 91/370 (24%), Gaps = 24/370 (6%) Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL- 141 + K+ L+ F F R I TA N V++ + + Sbjct: 2 KKYAKKIYTVMLLTVFALGACNCFNLRNFNLIESSTAGFQNGTYFGQTVIVKASANGTAK 61 Query: 142 -----------NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 N S GIK + + + V+ + + Sbjct: 62 LLIGAPNSRRNNFGSSQGVLYGIKFAVASGIYDIVAKDPEKNAPIVVTNLRKDADAIGIS 121 Query: 191 DYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLD 250 R+ S + K K LD Sbjct: 122 VSNSKEAVTVCAPMRYRHCRSHNIVS-----GSCFNSKDFGESWQSAKPTVPFCSDQALD 176 Query: 251 PSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDP 310 + + K +V A + I I + + ++ + Sbjct: 177 LLFVLDGSFSVGETNFGLVKNWVVALAKSFDIEKQDNIGIIQYSHWYPGVPYSQQPYMKT 236 Query: 311 SFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 G++K K + +T A+ ++S + K Sbjct: 237 EVPLGLYKNFTLFEKIAQNISLQGFTTYTAHALNKTVLDFMASERFM------HENVTKV 290 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 ++L+TDG D + + +SQGI I + + + FE Sbjct: 291 MILITDGRAD-DAIDLYSSAEYVRSQGIITFAIGIGNSVLRDQLQIVANGKLGEDTRVFE 349 Query: 431 ANSTHELNKI 440 + LN I Sbjct: 350 VTTFTSLNSI 359 >gi|163757619|ref|ZP_02164708.1| putative transmembrane protein [Hoeflea phototrophica DFL-43] gi|162285121|gb|EDQ35403.1| putative transmembrane protein [Hoeflea phototrophica DFL-43] Length = 190 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 38/133 (28%), Gaps = 4/133 (3%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 +++L + G + AL+ PV++ + +++ L A T + + Q Sbjct: 12 ARRLRGNKDGVGAVEFALIAPVLIILYMGSLEISVAMSVNKKLA----RASSTVADLITQ 67 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 + + + + + + + + R + Sbjct: 68 EESVDKVYLTSMVNVVESVMTPFRSEGVRVKVTGIAINGAGNATASWSWQDNGSRPYSAG 127 Query: 131 VVLSSRYDLLLNP 143 + DL + Sbjct: 128 STQTLPADLAIPN 140 >gi|156382085|ref|XP_001632385.1| predicted protein [Nematostella vectensis] gi|156219440|gb|EDO40322.1| predicted protein [Nematostella vectensis] Length = 1221 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 49/171 (28%), Gaps = 16/171 (9%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 + + +++ +N + + ++T Sbjct: 85 DLAKTKSLVTTLVNQFSDSENSIRFGIT------TYGQEARTLANFKQNFDEAKLRTAIK 138 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 + G A + + N+ ++++ + G NT ++ Sbjct: 139 GIQKTGVQARRHDLAA----MAVKNDLFSLEGGMRQGHPRFVIFFSAGANTGTADDLKKA 194 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHELNK 439 G+ ++ I + N Q A AS N F F ANS EL+ Sbjct: 195 SKPLTDLGVNMIAIGVNSNADQASLAEL-----ASENRFIFSANSPAELDA 240 >gi|294496937|ref|YP_003560637.1| hypothetical protein BMQ_0087 [Bacillus megaterium QM B1551] gi|295702304|ref|YP_003595379.1| hypothetical protein BMD_0085 [Bacillus megaterium DSM 319] gi|39640|emb|CAA45619.1| unnamed protein product [Bacillus megaterium] gi|294346874|gb|ADE67203.1| conserved hypothetical protein [Bacillus megaterium QM B1551] gi|294799963|gb|ADF37029.1| conserved hypothetical protein [Bacillus megaterium DSM 319] Length = 246 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K ++++TDG + + +A+ AK QGI + I T E++ + A Sbjct: 7 KQMLVITDGGSNTGE-DPVAMAALAKEQGISVNVIGVMEEDTIDEQSTNEIEGIAMSG 63 >gi|163847608|ref|YP_001635652.1| hypothetical protein Caur_2051 [Chloroflexus aurantiacus J-10-fl] gi|222525464|ref|YP_002569935.1| hypothetical protein Chy400_2212 [Chloroflexus sp. Y-400-fl] gi|163668897|gb|ABY35263.1| conserved hypothetical protein [Chloroflexus aurantiacus J-10-fl] gi|222449343|gb|ACM53609.1| conserved hypothetical protein [Chloroflexus sp. Y-400-fl] Length = 811 Score = 39.2 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 30/311 (9%), Positives = 71/311 (22%), Gaps = 30/311 (9%) Query: 29 LMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQ 88 L+ V + + G+ + ++ L+ A + + A+ + ++ Sbjct: 403 LLLVTVLIVGVAIVYGINLSQQNNLQLATE--YLVAAEERVAAVRTAPDETTAREALLIA 460 Query: 89 KIEEYLIRNFENNLKKN---FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS 145 + + R N + + + + R S + L + + L + Sbjct: 461 QQAIDVARASPEITTTNPGLWLRFSELEREYERLLAAVDRVSFLEPTLLASHPLPNGIFT 520 Query: 146 LFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFG 205 + + S A + + + + I + + Sbjct: 521 TIVVPPAMSSVTDPANLTALRYLYALDSDPEAARLYRIPRDGGTPEPYLEPGQ------T 574 Query: 206 QPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLR 265 A ++ + +P L S Sbjct: 575 VGAAIVGSLRAALWRIDQVVAIDQAPNGFGYYFRQNDTWNYSKLGSSEIWI--------- 625 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 ++ + A I F+ D W + R Sbjct: 626 -DPERLDVEEYAGNLYI---------WGAQPGEVLKFSSGRYGDTPEFWLNPGVTRDAAV 675 Query: 326 TFAIDENEMGS 336 AID GS Sbjct: 676 LSAIDMAVDGS 686 >gi|194474004|ref|NP_001124020.1| collagen alpha-1(XIV) chain [Rattus norvegicus] gi|149066377|gb|EDM16250.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_a [Rattus norvegicus] Length = 1794 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 34/122 (27%), Gaps = 18/122 (14%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 ++ + G+T A+ ++ + K +L+ Sbjct: 214 NAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILI 267 Query: 375 TDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEA 431 TDG+ +QD+ + + G+ + I + L AS + Sbjct: 268 TDGK-SQDDIIPPS--RNLREAGVELFAIGV------KNADLSELQEIASEPDSTHVYNV 318 Query: 432 NS 433 Sbjct: 319 AE 320 >gi|254255256|ref|ZP_04948572.1| Flp pilus assembly protein TadG [Burkholderia dolosa AUO158] gi|124900993|gb|EAY71743.1| Flp pilus assembly protein TadG [Burkholderia dolosa AUO158] Length = 177 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 49/172 (28%), Gaps = 5/172 (2%) Query: 14 LIKSC-TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +S G + AL++ M+ + + + R Y L +A + A SV L Sbjct: 6 FRRSRTRGVVAVEFALVLMPMIVLVTGVAEFGRAIYQYETLTKATRDAARYLSVYLPSDP 65 Query: 73 EEVSSRAKNSFTFPK---QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ A+ + L+ ++ D + Y Sbjct: 66 AYPTAAAQCLVVYGSTTCGASGTELVPGLTTSMVVICDAAHTTDCSDASDPPQYANLPTY 125 Query: 130 QVVLSS-RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 ++ + + ++ G + I +T++ + Sbjct: 126 DASNNAPSGTPAGSINLVAVKIKGYQYQPIPAYPGLKTITFGNIVTVMRQVS 177 >gi|172060493|ref|YP_001808145.1| TadE family protein [Burkholderia ambifaria MC40-6] gi|171993010|gb|ACB63929.1| TadE family protein [Burkholderia ambifaria MC40-6] Length = 168 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/156 (10%), Positives = 43/156 (27%), Gaps = 5/156 (3%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G + A++ P+ + +V + +L AA A S Sbjct: 13 WRRQRGVAAVEFAVVFPLFFLIFYAIVTFGMVFVIQQSLTFAASEGARAA--LNYTSAPC 70 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT---AVEMNPRKSAYQV 131 + +N + + + + T + S++ V Sbjct: 71 DRLLVGARNAVTQALAGSPWSQNVKFAAQVSTSVPVPTSTPGVTCSATSFASTSTSSFNV 130 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 ++++ Y NPL ++ + + Sbjct: 131 MVTTTYSYAANPLIPWIFVFNAPQLQSSATVQVQPS 166 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 55/176 (31%), Gaps = 16/176 (9%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K + DA +I+ + + + F + V ++ V Sbjct: 32 KIENLYDATNEMIKVFSDAVSKEKIIDIAIITFGENVELHTPYTSVVD-----FKSRGLN 86 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-NEEGIA 388 G T + A++ A D I + + +VL++DG T + Sbjct: 87 PFLASGMTPLGTALRMAKDMIEDKETTPSNIYRPA------VVLVSDGVPTDEWRGPLDN 140 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 N +S + +A + Q + C +FF A + ++ F+ Sbjct: 141 FKNNGRSSKCQRFAVAIGNDADNQML--KSFAEC--NENFFIAENVSDIVDKFKQI 192 >gi|297587450|ref|ZP_06946095.1| conserved hypothetical protein [Finegoldia magna ATCC 53516] gi|297575431|gb|EFH94150.1| conserved hypothetical protein [Finegoldia magna ATCC 53516] Length = 249 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 52/149 (34%), Gaps = 17/149 (11%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDT--VRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 K V A+ ++ + I N N V+M FN + + V + Sbjct: 30 TKIGEVNSAIEEILPELSDISNSNPDAEVKMAILSFNSEIQWITPKTGPVDPGVYLWR-- 87 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 D N G+T M A++ + S + I L++DG T+ E+ Sbjct: 88 ---DLNANGTTR----MGAAFEELESKLHGDKFMKSATSSYAPVIFLMSDGMPTETEEQF 140 Query: 387 IAICNKAKSQ-----GIRIMTIAFSVNKT 410 + NK K+ GI++ + + Sbjct: 141 QSGLNKLKANKWFKSGIKV-ALGIGQDAD 168 >gi|15602709|ref|NP_245781.1| TadF [Pasteurella multocida subsp. multocida str. Pm70] gi|12721153|gb|AAK02928.1| TadF [Pasteurella multocida subsp. multocida str. Pm70] Length = 187 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 15/46 (32%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 K K+ G I + ++ + L D+V L A Sbjct: 9 FKHFWKNKKGAVTIEFLFMSMFLIVLFAFLFDLVMLRSTLGKLDNA 54 >gi|260797293|ref|XP_002593638.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] gi|229278864|gb|EEN49649.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] Length = 388 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 35/305 (11%), Positives = 79/305 (25%), Gaps = 29/305 (9%) Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 V + +Y V + + ++++ + E ++ + + V Sbjct: 113 VIVVTDGVSYDAVAAPALEAQQKGITVYA-----IGVSGYDADQLEQIASNNNTLAFVDN 167 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 ++D C + + S N + ++ N Sbjct: 168 FNLLDNL--RNTLLTGVCDAKNICSTRNHYCCLSICCSHNEANICHRGTDREFGIN-NCG 224 Query: 240 LYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGA 299 + +P L +D S V+ I + Sbjct: 225 RKFSCFPTLCRNPLDIIFLLDGSGSVG-ASNFEKVKQFTKKAISGFDISPSGTQVG---V 280 Query: 300 TFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH 359 ++ R + S + V K + G T A++ ++ Sbjct: 281 IQYSTRTRQEFSMNSFVTKETLSSAIDEVQYMR--GGTLTGKAIRYVTKYGFGKSDGARP 338 Query: 360 RMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFL 419 + K ++++TDG + +A+ +GI + I S L Sbjct: 339 GV------PKVVIVVTDGVSYDAVAAPAL---EAQQKGITVYAIGVSG------YDADQL 383 Query: 420 SNCAS 424 AS Sbjct: 384 EQIAS 388 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 45/143 (31%), Gaps = 17/143 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I D ++G ++ R + S + V K + G T A++ Sbjct: 35 ISGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSSAIDEVQYMR--GGTLTGKAIR 92 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K ++++TDG + +A+ +GI + I Sbjct: 93 YVTKYGFGKSDGARPGV------PKVVIVVTDGVSYDAVAAPAL---EAQQKGITVYAIG 143 Query: 405 FSVNKTQQEKARYFLSNCASPNS 427 S L AS N+ Sbjct: 144 VSG------YDADQLEQIASNNN 160 >gi|198426249|ref|XP_002120426.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1937 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 116/375 (30%), Gaps = 30/375 (8%) Query: 88 QKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF 147 Q +++++R N D + + + P + + Y L + P + Sbjct: 1569 QYAQDHVLRYRNGNRLV--RDVIITITDGRSYDSVMPVSDQLRSQGALTYALGITPGNGL 1626 Query: 148 LRSMGI-----KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 S + S + + S + ++ S ++ Sbjct: 1627 GPSYSLLLEIAGSMDRVVFSTPALLGISDIFKQKLTNDLCGHPCASQCPRLLPPTNGLIS 1686 Query: 203 CFGQPADRTVKSYSSQNG-------KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 C + +V S++ + G + D S L + Sbjct: 1687 CTNSNVEDSVCSFTCRPGFVLSGEPQTRCVDGPNSSNDGVWTSPAPMCLAQCRNYGVIDL 1746 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 +DSS + + + + S+I S + +R+ A +N RV +D Sbjct: 1747 VVVLDSSRSIGLKSWTDM-KAFVRSLIGSFVI---GPNAMRVSAFRYNRRVDTDTQILLN 1802 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + AI N T A+Q D I++ E +N ++ +T Sbjct: 1803 SYSDNSFFAAYDAIPYN-GAGTRTALALQHVRDVILTRANGEWPAARN------VVLTVT 1855 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 DG + + I+ + ++ G+ + L+ SP+ F AN+ + Sbjct: 1856 DGR-SVSSIVPIS--EQLRANGVLTYALGVPPIIGNGMNTVNLLALAGSPSRRFLANNGN 1912 Query: 436 --ELNKIFRDRIGNE 448 L + F + + E Sbjct: 1913 YGGLYEQFVNSLVQE 1927 >gi|254454548|ref|ZP_05067985.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198268954|gb|EDY93224.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 237 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 31/96 (32%), Gaps = 18/96 (18%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED------- 356 + + H + + D + G+TAI+ ++ + S + Sbjct: 19 TWCFTGDQNAIIPHTTSESDLHNAINDLHAYGNTAIDMGVKWGVALLDPSTKSLIASLAG 78 Query: 357 -----------EVHRMKNNLEAKKYIVLLTDGENTQ 381 + K +VL+TDGENTQ Sbjct: 79 ASGSGVPSVAAGRPEQFTQSDVLKVLVLMTDGENTQ 114 >gi|294146457|ref|YP_003559123.1| hypothetical protein SJA_C2-00280 [Sphingobium japonicum UT26S] gi|292676874|dbj|BAI98391.1| putative membrane protein [Sphingobium japonicum UT26S] Length = 598 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 8/46 (17%), Positives = 18/46 (39%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 + + L+ + + A L V++G G +D + + L Sbjct: 11 RAWLRPLLGDRSAAVAPMFAALGAVLIGAAGFALDGGLYYVGQRNL 56 >gi|51245321|ref|YP_065205.1| fimbrial biogenesis protein PilY1 [Desulfotalea psychrophila LSv54] gi|50876358|emb|CAG36198.1| related to fimbrial biogenesis protein PilY1 [Desulfotalea psychrophila LSv54] Length = 1640 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 36/396 (9%), Positives = 95/396 (23%), Gaps = 31/396 (7%) Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + S+ + TF N E + + E + Sbjct: 514 ASITFATSKATSAFLATALTFNGPDTVAVAFSNLEQDSVQISLQEEGSNDTEVRHTTEAV 573 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG---VSIQW 181 A + LL + + L ++ + +++ Sbjct: 574 GYLALSPRTNYNIALLTDQVPKGLLHDIKDDVRQGLSYYKYNLTTNNIYNSWWHGGTMRL 633 Query: 182 VIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 I + + + + V + G + Y V Sbjct: 634 RIPTNPFVKKSSDTNFRTIDTPIDSDIEILVDAVEHYPLIWGTTPLAENFYEVIRYFQQK 693 Query: 242 YMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATF 301 Y ++ + F+ + K + + ++ + T Sbjct: 694 GPYYNSSVNSAGDVASFLTEKAWDPYYFK-------------ELDAMIRCANSSVIIFTD 740 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 S + + + + T D + I + N D ++ Sbjct: 741 GGSYTDSYVPPFNDEEFNESSASQAASAYSDGSNYTNDYDGNIVTGEGINTRNTDNSYKN 800 Query: 362 KNNLEAKKYIV-----LLTDGENTQDNEEGIAICNKAKSQGI-RIMTIAFSVNKTQQEKA 415 + A + LT+ + + + K + T+ F ++ Sbjct: 801 NLDDLALWANLNRKGQALTEALSNSRDLR-----DDLKGEQYLTTYTVGFGISANSA--E 853 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEI 449 L + A ++ A +L + + I + + Sbjct: 854 ERLLQDTAAHGQGTYSLAEDGQQLKDVLKGTINSIL 889 >gi|16127180|ref|NP_421744.1| hypothetical protein CC_2950 [Caulobacter crescentus CB15] gi|221235981|ref|YP_002518418.1| TadG-like pilus assembly protein [Caulobacter crescentus NA1000] gi|13424578|gb|AAK24912.1| hypothetical protein CC_2950 [Caulobacter crescentus CB15] gi|220965154|gb|ACL96510.1| TadG-related pilus assembly protein [Caulobacter crescentus NA1000] Length = 183 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 49/170 (28%), Gaps = 12/170 (7%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + ++ G + AL+ L + ++++ + A+I A Sbjct: 13 RLARSAGRFARADEGATAVEFALVAIPFLMLLFAIIELGLVFLV----SITLENAVIDAG 68 Query: 66 VPLIQSLEEVSSRAKNSF--------TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + + + NSF ++ K L + + Sbjct: 69 RTIRTGEVQTTGGNANSFKTAVCNRMSWLGSKCSSALRLDVRTFTDYATGQASATNTTVP 128 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETV 167 T + NP S VV+ + Y L L I A + T Sbjct: 129 TTMNWNPGASGSIVVVRAYYTWPLVTPMLNTGLQSSNGNRIIYAATSFTN 178 >gi|226423922|ref|NP_851794.3| collagen alpha-1(XIV) chain [Mus musculus] gi|148697325|gb|EDL29272.1| procollagen, type XIV, alpha 1, isoform CRA_a [Mus musculus] gi|219519338|gb|AAI45245.1| Collagen, type XIV, alpha 1 [Mus musculus] gi|223459902|gb|AAI38346.1| Collagen, type XIV, alpha 1 [Mus musculus] Length = 1794 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + G+T A+ ++ + K +L+TDG Sbjct: 217 NTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILITDG 270 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 271 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADLSELQEIASEPDSTHVYNVAE 320 >gi|30420885|gb|AAO64442.1| collagen type XIV [Mus musculus] Length = 1797 Score = 39.2 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + G+T A+ ++ + K +L+TDG Sbjct: 217 NTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILITDG 270 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 271 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADLSELQEIASEPDSTHVYNVAE 320 >gi|319941895|ref|ZP_08016216.1| hypothetical protein HMPREF9464_01435 [Sutterella wadsworthensis 3_1_45B] gi|319804548|gb|EFW01418.1| hypothetical protein HMPREF9464_01435 [Sutterella wadsworthensis 3_1_45B] Length = 168 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 23/52 (44%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 F ++S G + ALL+P+ V + ++ R S +++A + Sbjct: 7 FRSAFSAFLRSRLGAAGVEFALLLPMAAFVLAVALESARLSIAYALIERAVE 58 >gi|209515958|ref|ZP_03264819.1| TadE family protein [Burkholderia sp. H160] gi|209503616|gb|EEA03611.1| TadE family protein [Burkholderia sp. H160] Length = 157 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 29/115 (25%), Gaps = 9/115 (7%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G + L+ P ++ V ++DV + + A++ A + Sbjct: 26 ERGVVALEFVLVFPFLMMVLFGIIDVSLLLCDKAVITNASREAARAGV---------IVR 76 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + + T Y +N V + K Y Sbjct: 77 IPQLTATQITNVALNYTQKNLVTGGTATTPTVTVDQSSGTSTGSPLTVKVTYTYE 131 >gi|148697327|gb|EDL29274.1| procollagen, type XIV, alpha 1, isoform CRA_c [Mus musculus] Length = 1802 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + G+T A+ ++ + K +L+TDG Sbjct: 222 NTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILITDG 275 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 276 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADLSELQEIASEPDSTHVYNVAE 325 >gi|126724455|ref|ZP_01740298.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] gi|126705619|gb|EBA04709.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] Length = 354 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 17/130 (13%), Positives = 38/130 (29%), Gaps = 17/130 (13%) Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + G+T I ++ AY + S + +K ++LLTDG D Sbjct: 230 AFDNIQPGGTTNIYGPLREAYGWLSES---------ERTDHQKAVILLTDGRANDDAASE 280 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIG 446 K + + + + + + + +L + F Sbjct: 281 SQTL-AMKDDAYT--FVYYMGDSDDRWLRSLADNYFSGGGHV-----SAQLERYFNVVSD 332 Query: 447 NEIFERVIRI 456 + V+ + Sbjct: 333 AYSAQTVLEL 342 >gi|325525566|gb|EGD03356.1| TadE family protein [Burkholderia sp. TJI49] Length = 136 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 32/91 (35%), Gaps = 1/91 (1%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 K+ G + A ++PV+L + +V+ + Y + + A++ V L Sbjct: 3 RKNQKGVAVVEFAFVLPVLLLIMFGIVEFGLFLYDKAVITNASREGARAGIV-LSNPPPT 61 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 SS + + + ++N Sbjct: 62 SSSVQTVVQNYTSNHLLTFGMQNTPTVASSG 92 >gi|289706415|ref|ZP_06502773.1| conserved hypothetical protein [Micrococcus luteus SK58] gi|289556910|gb|EFD50243.1| conserved hypothetical protein [Micrococcus luteus SK58] Length = 152 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 36/138 (26%), Gaps = 4/138 (2%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 + +L + G ++T L V++ + +++ V L+ A +A + + Sbjct: 3 RPLTARLGRD-DGQTTVLTVGLCAVLVALMLVMLAVTTVQLQHRRLQSLADSAAVAGAEE 61 Query: 68 LI---QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 L V + + R L ++ Sbjct: 62 LGFRLGEDPGVVLSDGDVAASASAHLAAVGAREAVPGLGGMSARVAEDGTTVVVTLDARA 121 Query: 125 RKSAYQVVLSSRYDLLLN 142 A + L + Sbjct: 122 DLFAMTGPFAGTLPLSVP 139 >gi|156343886|ref|XP_001621150.1| hypothetical protein NEMVEDRAFT_v1g222316 [Nematostella vectensis] gi|156206821|gb|EDO29050.1| predicted protein [Nematostella vectensis] Length = 225 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 54/196 (27%), Gaps = 16/196 (8%) Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 +VD S V + + + + N R+ ++ Sbjct: 21 ANSVCSNKADIAFYVDVSGNLGQ-SNLERVIEYILKFLDRSDVAQDKN---RVAVVGYDV 76 Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDEN-EMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + I K ST + A+++A V Sbjct: 77 VPHIKLTLQGSAFLSNTDIRKIIQTQLKFSDKSTRTDLALESALSLFQREQRSGVP---- 132 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 K +V++TDG + D+ A K+ G++ + +A + L C Sbjct: 133 -----KTLVIITDGRSWYDDSVMDAAIAGLKALGVKTLAVAVGSDSVLDLDKLQLL--CG 185 Query: 424 SPNSFFEANSTHELNK 439 + ++ + L Sbjct: 186 TLDNVIKVADYESLLA 201 >gi|225012027|ref|ZP_03702464.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003582|gb|EEG41555.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 346 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 49/148 (33%), Gaps = 21/148 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + +++++ + R+G + + + + + ++ Sbjct: 110 NRLEKSKRLVSAILNQLAS-------DRVGIIAYAAQAVPQLPITTDYSAA-KMFLQALN 161 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + TA++ A+ + ++ + I L++DGE+ ++ A Sbjct: 162 TEMLSSQGTALDSAIDLSGTFFDDEDQ-----------TNRVIFLISDGEDHSEDASNAA 210 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 +A + GI+I T + Sbjct: 211 S--RAAAMGIKIFTFGVGTEAGAPIPIK 236 >gi|146345398|sp|Q80X19|COEA1_MOUSE RecName: Full=Collagen alpha-1(XIV) chain; Flags: Precursor Length = 1797 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 34/119 (28%), Gaps = 18/119 (15%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 ++ + G+T A+ ++ + K +L+TDG Sbjct: 217 NTKDEVIDAVRSLPYKGGNTLTGLALNFIFE------NSFKPEAGSRSGVSKIGILITDG 270 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 + +QD+ + + G+ + I + L AS + Sbjct: 271 K-SQDDIIPPS--RNLRESGVELFAIGV------KNADLSELQEIASEPDSTHVYNVAE 320 >gi|109899476|ref|YP_662731.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Pseudoalteromonas atlantica T6c] gi|109701757|gb|ABG41677.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudoalteromonas atlantica T6c] Length = 1364 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 34/125 (27%), Gaps = 30/125 (24%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------NEEGIAICN---------- 391 I + +IVLL+DGE + A C Sbjct: 565 QQIPQGARYISPISDRQCQVNNHIVLLSDGEANNNHSVDEIESLLSASCTGSGGEKCGLS 624 Query: 392 ----------KAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCASPNSFFEANSTHELNKI 440 I TI F+ N + L F++A+++ EL Sbjct: 625 LVRNVADSEGSVIDSRIITHTIGFAANTEANSFLNQIALQ---GGGGFYQADNSQELLGA 681 Query: 441 FRDRI 445 F+ + Sbjct: 682 FQSIL 686 >gi|50727102|gb|AAT81212.1| endo-chitinase [Microbulbifer hydrolyticus] Length = 957 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 33/330 (10%), Positives = 71/330 (21%), Gaps = 8/330 (2%) Query: 45 RWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKK 104 R Q+ + S + + + + + Sbjct: 569 RTPTARAGGDQSVDSGASVTLDGSTSSDLDNDPLTYSWIQVSGPSVTLQNANSVSASFTA 628 Query: 105 NFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEA 164 + + T + + + QV ++ D S + I Sbjct: 629 PTVSADEELVFELTVTDDSDASGSDQVSVTVLADQPNQAPSANAGADQIVVTPATVSLNG 688 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S + V S S+ D S + + S Sbjct: 689 TGSSDPDGDVLTYAWTQVSGASVSLSDATAASPSFSAAEVTAEQELVFELTVSDGTLSDD 748 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 +++S + L P + S+ + +S DAL Sbjct: 749 TPDQVSIF------LLPPDANTAPEVSAPSQVTIQEGASDSITATGTDADGDALTYTWSG 802 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG--STAINDA 342 + T+ + + S G+ T+ T N Sbjct: 803 MANGSGDTITITAPQVEADTEFSLTVTVSDGIDSASATVTVVVTNAGGGGSCVLTDPNAG 862 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 A+ + + + + KY Sbjct: 863 NFPAWQSGSTYLGGDQVSFDQLVWEAKYWT 892 >gi|219848640|ref|YP_002463073.1| hypothetical protein Cagg_1736 [Chloroflexus aggregans DSM 9485] gi|219542899|gb|ACL24637.1| conserved hypothetical protein [Chloroflexus aggregans DSM 9485] Length = 170 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 9/49 (18%), Positives = 22/49 (44%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 G + AL++P++ +++++ L+ A Q A +A + Sbjct: 7 GQALPLVALMLPILTLFVLVVIELANLWLAVAKLEDALQQATRSAVQQI 55 >gi|319937919|ref|ZP_08012319.1| hypothetical protein HMPREF9488_03155 [Coprobacillus sp. 29_1] gi|319806825|gb|EFW03464.1| hypothetical protein HMPREF9488_03155 [Coprobacillus sp. 29_1] Length = 170 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 11/70 (15%), Positives = 23/70 (32%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K ++ G+ I +LL P ++ +V + Y + K A + + + Sbjct: 3 KAFYRNQDGNVTIAFSLLTPFIIFYFLWVVSSWQAMYIQLQTKAVIDFATLGGATTGVAE 62 Query: 72 LEEVSSRAKN 81 S Sbjct: 63 RTSASDIQAA 72 >gi|311271483|ref|XP_003133150.1| PREDICTED: anthrax toxin receptor-like [Sus scrofa] Length = 728 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 12/144 (8%) Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 K K + + + + + ++ + E N I+ LT Sbjct: 179 REKKTTASSKAGIQNLKQESFGEAFEGLPDPLYRVEANEQIEQANSGENKVPS-MIIALT 237 Query: 376 DGENTQDNEE-GIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANS 433 DG + + ++A+ G + I ++ LS A S + F + Sbjct: 238 DGTLESISLQETKQQADRARKLGANVYCIGV------KDYETEQLSEIADSSDHVFGVDQ 291 Query: 434 THELNKIFRDRIGNEIFERVIRIT 457 K ++ I + + I IT Sbjct: 292 G---FKALKNIIDPLMTKTCIEIT 312 >gi|212637907|ref|YP_002314427.1| hypothetical protein Aflv_0058 [Anoxybacillus flavithermus WK1] gi|212559387|gb|ACJ32442.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Anoxybacillus flavithermus WK1] Length = 247 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 + I+L+TDG + + IA+ AK QGI + I E+ + A S Sbjct: 7 RQILLITDGCSNHGE-DPIAMAALAKEQGITVNVIGVLDQDIIDEQGLREIEGIAQSGGG 65 Query: 428 FFEANSTHELNK 439 + +L++ Sbjct: 66 ISQVVYAKQLSQ 77 >gi|13476804|ref|NP_108373.1| hypothetical protein mll8236 [Mesorhizobium loti MAFF303099] gi|14027565|dbj|BAB53834.1| mll8236 [Mesorhizobium loti MAFF303099] Length = 379 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 14/160 (8%), Positives = 40/160 (25%), Gaps = 8/160 (5%) Query: 36 VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI 95 + +D+ S L+ + A + + S+ S A TF + Sbjct: 1 MVAGAIDLTGTSDDAAQLQNSLDAAGLAIGTKYLPSMA-ASDVAALGLTFFAANLSLADQ 59 Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY-------QVVLSSRYDLLLNPLSLFL 148 + +++ ++ + + ++ S+ + + L Sbjct: 60 QENADSVSAFSATASGDPSAYYISLSSSINRPSFINGAAPWPAHRSATVKMNPGAQACVL 119 Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 S + + + S + S Sbjct: 120 ALDPHASAAVSLQGSTDVAMSSCVIAANSDASDAVRRGGS 159 >gi|145297762|ref|YP_001140603.1| type IV pilin biogenesis protein [Aeromonas salmonicida subsp. salmonicida A449] gi|88866552|gb|ABD57328.1| TapY1 [Aeromonas salmonicida subsp. salmonicida A449] gi|142850534|gb|ABO88855.1| type IV pilin biogenesis protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 1203 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 32/364 (8%), Positives = 96/364 (26%), Gaps = 11/364 (3%) Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 + + ++ Y+ ++ N + + + D ++ ++ ++ + L Sbjct: 125 YFQSQVWTYVYSSYRNGVPRQGDWETISGENEDDITIVDCKQDIQNRTPANPFSGTLEVG 184 Query: 145 SLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCF 204 S+ K+ + + + S+ + Sbjct: 185 SVSDGYPVDKTSNTRGEWYYTGNANDATSGSNTSVTLYTANYIRWYYGPTGYAVESRLRI 244 Query: 205 GQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV-DSSS 263 + A ++V S + + + + + +L + +++ E + ++ Sbjct: 245 AKDAVKSVISSTPGVDFGLAIFNTNNSTSTNGGRVVRRILSNDDMVGTITAEQNLLNTVE 304 Query: 264 LRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTI 323 L + V + G T + +++ K Sbjct: 305 GLSATTNTPLCETLYEAY--RFYGGQPVL-YGKQGGTRDKLAEDPEGTYTSPYDKCSNNG 361 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVH--RMKNNLEAKKYIVLLTDGENTQ 381 + D T + A++ D++ + + AK Y+ + + Sbjct: 362 YVIYVTDGAPTSDTDADSAVKALMDSLSTDERAAYGSTVGYGSGSAKSYLAAMAGYMKNK 421 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 D Q + T+ F + ++ A S L + Sbjct: 422 DVNTSSPG-----KQTVTTFTVGFGDEAISGAGNLLAETARRGGGQYYPAVSAEALTQAL 476 Query: 442 RDRI 445 R + Sbjct: 477 RSSL 480 >gi|109112823|ref|XP_001117651.1| PREDICTED: integrin alpha-E-like [Macaca mulatta] Length = 956 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 38/115 (33%), Gaps = 10/115 (8%) Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 + T AMQ D I +S+ + K +V+LTDG + Sbjct: 428 ASLAKVQNITQVGSVTKTASAMQHVLDNIFTSSHGSRRKA------SKVMVVLTDGGIFE 481 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN---SFFEANS 433 D + + N K G+ I + + + L+ AS F+ + Sbjct: 482 DPLDLTTVINSPKMHGVERFAIGVG-EEFKSARTERELNLIASDPDETHAFKVTN 535 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 39.2 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 59/190 (31%), Gaps = 27/190 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + +L + ++ D R F+D + S ++ + F Sbjct: 367 TSIEQAKASLDYALSQLQPGD------RFNVIRFDDTLTKFFEDSVDANQENIASARRFV 420 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T + A+ A + SN+ + IV LTDGE + + + A Sbjct: 421 TSLEAQGGTEMLPALHAA---LDDSNQG---------NGLRQIVFLTDGEISNEQQLLDA 468 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIG 446 + A+ RI + Y ++ A +F S E+++ R Sbjct: 469 V--AARRGRSRIFMVGIGSAPN-----SYLMNRAAELGRGTFTHIGSAAEVDERMRALFD 521 Query: 447 NEIFERVIRI 456 V + Sbjct: 522 KLENPAVTDL 531 >gi|304412583|ref|ZP_07394188.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS183] gi|307303599|ref|ZP_07583352.1| type IV pilin biogenesis protein, putative [Shewanella baltica BA175] gi|304349059|gb|EFM13472.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS183] gi|306912497|gb|EFN42920.1| type IV pilin biogenesis protein, putative [Shewanella baltica BA175] Length = 1168 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 42/400 (10%), Positives = 88/400 (22%), Gaps = 17/400 (4%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A + +L E+ F E N +DI + Sbjct: 106 ASLNGCSQSKAALAELGRFTGYIREFTASGTTGIWKELPETNGTAISVVDCWQDISKT-- 163 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 N + + + A A + + V++ Sbjct: 164 DPNNSSSYSNGFPANGLTTGSGKKKKAYPYFTSSSPGTSWADALAAANNTDFGVGQPVTL 223 Query: 180 QWV-------IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + + + T + + Sbjct: 224 YTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDFGLAVFNYNYPNEGNRD 283 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 + M + ++ + ++ R ++ D Sbjct: 284 GGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRYFAGKGVKY-GHGDTDY 342 Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ V S+ + D N+ + + Sbjct: 343 GSYVGNKPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDKNANNDVISLTA--SG 400 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 S E NL+ Y+ L D + N + Q +R TI FS Sbjct: 401 SKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQMQNVRTYTIGFSKGAEDA 460 Query: 413 EKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 L+ A +F A ++ EL D + N + Sbjct: 461 AP---LLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|167522505|ref|XP_001745590.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775939|gb|EDQ89561.1| predicted protein [Monosiga brevicollis MX1] Length = 1927 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 68/214 (31%), Gaps = 21/214 (9%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + P T ++ + + +P+ + + ++ P + Sbjct: 1075 NATSTTPNVNATTTTPLVDTTTPNANVTTTTPMNNGTTTPFDTTMASTSTTLILPCAVTA 1134 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + + + +++R + + +G FF+ + Sbjct: 1135 DIVIVIDASGAMSINDFT---AAKTTALAILRRLALAQP---DISVGLIFFSQQAQVALP 1188 Query: 312 FSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY 370 + ++ G +T + A+ +A D + +SN Sbjct: 1189 LLDINDETEFQLLLLVLQAGQYQGQATNLGSALSSAADLLETSNNGARQ----------- 1237 Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 +L +DG +++G + ++ GI+I+T+A Sbjct: 1238 FILFSDG---SSDDQGTLVAQNIRATGIQILTVA 1268 >gi|154301924|ref|XP_001551373.1| hypothetical protein BC1G_10199 [Botryotinia fuckeliana B05.10] gi|150855591|gb|EDN30783.1| hypothetical protein BC1G_10199 [Botryotinia fuckeliana B05.10] Length = 864 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 29/73 (39%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T DA+ + + + + + IVL+TDG ++EE I + + Sbjct: 103 TDDGDAISAIVVAMEMIKKHTTLKSGKPGKYARKIVLITDGNGFMNSEELDDIAKEINNN 162 Query: 397 GIRIMTIAFSVNK 409 IR++ I + Sbjct: 163 DIRLVVIGVDFDD 175 >gi|326666380|ref|XP_003198256.1| PREDICTED: hypothetical protein LOC100537740 [Danio rerio] Length = 2720 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 28/290 (9%), Positives = 65/290 (22%), Gaps = 6/290 (2%) Query: 49 YEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTD 108 + L A T + + Q+ +S +N+ P I + N + Sbjct: 1099 VKSKLNSANDTTTPSQASTNEQATPPPASTKENTTPPPTITINKTTPLPALTNETSTASQ 1158 Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVS 168 ++ V N + + V + + P+S ++ + + Sbjct: 1159 VSTKEKTTPPPVPTNEKTTPPPVPTNEKTTP--PPVSTNEKTTPPPVPTNEKTTPPPVST 1216 Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK 228 VS + + + N P + ++ Sbjct: 1217 NEKTTPPPVSTNEKTAPPPASTNEKNTPPSVSANEKTTPPPVSTNEKTTSPSVSNNEKTT 1276 Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKI 288 P + + +L P + + +K + I Sbjct: 1277 PPPASTNERTTSPPILTNEKTTPPSVSINQKTTPPPVSTNQKTTPP----PVLTNEITIP 1332 Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + + + + P K V T T Sbjct: 1333 PPASTNEKTAPPPVSTNEKTTPPSVSTNEKTTPPTVSTNENTTPPPVLTN 1382 >gi|298370192|ref|ZP_06981508.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281652|gb|EFI23141.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1092 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 30/76 (39%), Gaps = 10/76 (13%) Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS-FFEANST 434 DG+ + K+Q ++ T+ F + +L N AS ++ +F A Sbjct: 283 DGDYSDPRG------TNFKNQLVQTYTVGFGSGVS--ATGEAYLRNGASGSNNYFNARDE 334 Query: 435 HELNKIFRDRIGNEIF 450 + L F D I + I Sbjct: 335 NGLFAAF-DAITDSIA 349 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 42/324 (12%), Positives = 79/324 (24%), Gaps = 18/324 (5%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + S + + + + S + + +V+D S SM + S L Sbjct: 46 SSQSEYAKPPNGDAQGRLDVTLVPQGAVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSAL 105 Query: 202 NCFGQPADRTVKSY---SSQNGKVGIRDEKLSPYMVSCNKSLYYM--LYPGPLDPSLSEE 256 + G+R+ + P+ N + L G + Sbjct: 106 QAAVNYFKSNYNQNDRFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGT 165 Query: 257 HFVDSSSLRHVIKKKHLVRDALASVIR--SIKKIDNVNDTVRMGATFFNDRVISDPSFSW 314 ++ + SL + + + T R F + Sbjct: 166 NYSAALSLAKSYFTDPTRKKYIIFLTDGMPTVLNTVDTITYREVKPKFRGGYQYTGNRVT 225 Query: 315 GVHKLIRTIVKTFAIDENEMGSTA-INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G + + G TA I Y S+ + Y Sbjct: 226 GSLSVT--------YELYSDGWTAGIRFTDNKGYSRQFYSDGQDYVNGWWVSWDNGYSFT 277 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF-EAN 432 + E + + A+ I + +I F N LS A + Sbjct: 278 YSSIE-RKIRADATAVAKTLGMNNITLYSIGFGDNDEVDMDYLRSLSATAGGEARQGTTQ 336 Query: 433 STHELNKIFRDRIGNEIFERVIRI 456 + L + F IRI Sbjct: 337 NLTALFQQFSQLATTPAITGTIRI 360 >gi|113475854|ref|YP_721915.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166902|gb|ABG51442.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 460 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 36/277 (12%), Positives = 89/277 (32%), Gaps = 23/277 (8%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 G + ++ + + +++ + I +I ++ + + + N + Sbjct: 23 FGCAAIPENIISQDDCSNTKENRQKCLEITRIITAEKNTVKMRIKVYNKAGNFNPNLQES 82 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 + + LD S S + S R + K Sbjct: 83 DFSVETISEFGSKKTIKPTVILPTDTRAKTTPADIIIMLDMSGSMKFRDSSPGRRRIKFK 142 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI----SDPSFSWGVHKLIRTIVKT 326 + +A+ I + D N TVR+G F S+ S + +K Sbjct: 143 GAI--NAIYKFIDAAN--DKPNLTVRIGLAPFGKGGNQFKVSNKSLDANFYPSNSEKLKE 198 Query: 327 FAIDENE---MGSTAINDAMQTAYDTIISS---NEDEVHRMKNNLEAKKYIVLLTDGENT 380 + ST + ++TA +I+S D ++ ++ + +++L+DG + Sbjct: 199 KIEELANQELSASTNLYQPLETAVKYLINSVNSTSDSNNKTDDSQSKQLVVIVLSDGFHN 258 Query: 381 QDNEEGIAICNKAKS---------QGIRIMTIAFSVN 408 D + + K+ +++ T+ + + Sbjct: 259 HDRDTEEIQFERLKNILQPQDSQMPKVKVHTLGYGQS 295 >gi|217974384|ref|YP_002359135.1| type IV pilin biogenesis protein [Shewanella baltica OS223] gi|217499519|gb|ACK47712.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS223] Length = 1168 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 34/341 (9%), Positives = 75/341 (21%), Gaps = 15/341 (4%) Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 N + + + A A + + V+ Sbjct: 163 TDPNNSSSYSNGFSANGLTTGSGKKKKAYPYFTSSSPGTSWADALAAANNTDFGVGQPVT 222 Query: 179 IQWV-------IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + + + + T + + Sbjct: 223 LYTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDFGLAVFNYNYPNEGNR 282 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + M + ++ + ++ R ++ D Sbjct: 283 DGGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRYFAGKGVKY-GHGDTD 341 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + ++ V S+ + D N+ + + Sbjct: 342 YGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDKNANNDVISLTA--S 399 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 S E NL+ Y+ L D + N + Q +R TI FS Sbjct: 400 GSKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQMQNVRTYTIGFSKGAED 459 Query: 412 QEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 L+ A +F A ++ EL D + N + Sbjct: 460 AAP---LLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|118351474|ref|XP_001009012.1| hypothetical protein TTHERM_00259660 [Tetrahymena thermophila] gi|89290779|gb|EAR88767.1| hypothetical protein TTHERM_00259660 [Tetrahymena thermophila SB210] Length = 2382 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 13/109 (11%), Positives = 34/109 (31%), Gaps = 12/109 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 ++ I + FND + ++ + + + G+T + Sbjct: 2259 LLDFIDNRSRYTAQDFITLIGFNDSIKLYTKLEKLNEQIKQKVPEKNMN-----GNTNFS 2313 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 +Q + N ++ I L+DGE + + + + + Sbjct: 2314 APLQQVLKILSQDNCKTFNKNN-------VIFFLSDGEANKPDTDILQL 2355 >gi|70943972|ref|XP_741969.1| hypothetical protein [Plasmodium chabaudi chabaudi] gi|56520682|emb|CAH74986.1| hypothetical protein PC000452.00.0 [Plasmodium chabaudi chabaudi] Length = 831 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 33/331 (9%), Positives = 84/331 (25%), Gaps = 21/331 (6%) Query: 93 YLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + + +++ D+ +V + +Y L + Sbjct: 473 HYALDLVPFSDAARYNKDSLINRIDSLKTNYGNGHESFIVKTLKYALYNYTKGSDRTNAP 532 Query: 153 IKSWLI-QTKAEAETVSRSYHKEHGVSIQWV--IDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + L +E+ Y+ + V + SM + + Sbjct: 533 KITMLFTDGNDSSESDIEMYNIGSLYRTERVKLLVIGVSMASESKLKQLVGCAQNVPCPF 592 Query: 210 RTVKSYSSQNGKVGIRDE----------KLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFV 259 + + + + + S + + + Sbjct: 593 VIKTEWGTLDPLSKVFVDKICDTGAILPPESNKPEPEAPAPPCIGNDCFCHDIYDLTVIL 652 Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + V + +++ + M +N + + Sbjct: 653 DESGSIGANNWERQVYPFTEKFLNNLEISEKNVHVGIMLFAQYNRDFVKFSDKESYDKEH 712 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + T VKT G T I A+ H + K +L TDG N Sbjct: 713 LMTHVKTLKKSYKSGGYTYIISALNYGL------VNYTRHADSRSDVP-KVTMLFTDGNN 765 Query: 380 TQDNEEGIAICNKA-KSQGIRIMTIAFSVNK 409 T ++ ++ + K + ++++ + + Sbjct: 766 TDPGDKLLSDASLLYKQENVKLLVVGVGAST 796 >gi|13476309|ref|NP_107879.1| hypothetical protein mll7595 [Mesorhizobium loti MAFF303099] gi|14027070|dbj|BAB54024.1| mll7595 [Mesorhizobium loti MAFF303099] Length = 589 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 46/206 (22%), Gaps = 2/206 (0%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 ++ + +F ++TAL PV L + + +D + A ITA+ Sbjct: 9 ISRLARSMAGDRKANFAVMTALSAPVALALAAVAIDEASIYSERREAQAMVDLAAITAAS 68 Query: 67 PLIQSLEEVSS--RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + V + Q + + + R T Sbjct: 69 NINNVNTAVVTTLTDNGMPGVVVQASGQTIPPAVGKTVVTVTQGRYASTTTNVTQRFQAG 128 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 V ++ S + + I + + T S Sbjct: 129 VTPYNAVRVTLAKVPARYFASSLIPTPVIGTQATASMTPQATFSVGSRLLSVNGGILNAL 188 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADR 210 S + S Sbjct: 189 LSGLLGGNISLSVMDYNGLISADVSV 214 >gi|323700352|ref|ZP_08112264.1| TadE family protein [Desulfovibrio sp. ND132] gi|323460284|gb|EGB16149.1| TadE family protein [Desulfovibrio desulfuricans ND132] Length = 155 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 14/137 (10%), Positives = 37/137 (27%), Gaps = 6/137 (4%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + G + TAL++P++ + +++ Y +++ A + Sbjct: 5 RKRSRRAGLAAVETALILPILFMLVMAVIEGGNAVYAWVTVQK----AAQMGARFAATGR 60 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + N N + +V+ + +V Sbjct: 61 --GADEGTRLDDIIAATEAGLTTLNQANIEISVRSWPDVQASGDGIDNDPGAPCQLAEVA 118 Query: 133 LSSRYDLLLNPLSLFLR 149 + Y+ +S L Sbjct: 119 VVYNYEPFTPLVSPLLP 135 >gi|313238993|emb|CBY13979.1| unnamed protein product [Oikopleura dioica] Length = 676 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 33/113 (29%), Gaps = 12/113 (10%) Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 ++ WG+ + MG T + AYD ++ A Sbjct: 259 TNSELIWGLDDGLDGFTNAVTNMAWTMGGTYTAKGLDLAYDHMLLRGR---------RSA 309 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIR--IMTIAFSVNKTQQEKARYF 418 K IVL+TDG T + A K K + + I F Sbjct: 310 TKTIVLMTDG-YTSNQSAYDAEIKKIKGSSVSFNLQAIGFGYTMNANVNMEDL 361 >gi|297480083|ref|XP_002691213.1| PREDICTED: von Willebrand factor A domain containing 3B [Bos taurus] gi|296482857|gb|DAA24972.1| von Willebrand factor A domain containing 3B [Bos taurus] Length = 1347 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 64/183 (34%), Gaps = 28/183 (15%) Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 G S+ ++ + + K LV++ + +I+ + + + F Sbjct: 494 DGSQGLFGKVPSDCVYILIDTSHSMTGKLDLVKNKVIQLIQ-----EQLKHKKKFNFVQF 548 Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + + +++ + + D ST A+Q A+ Sbjct: 549 DAQASAWQEKLVEINEDNLKGAQAWIRDIKIGSSTNTLSALQIAF--------------- 593 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIRIMTIAFSVNKTQQEKARYFLSN 421 E + I LLTDG + ++ + ++ K Q I I TI+F N FL + Sbjct: 594 -ADEETQVIYLLTDG---RPDQSPEMVMDQVKVFQKIPIYTISFCYNDEIANG---FLKD 646 Query: 422 CAS 424 AS Sbjct: 647 LAS 649 >gi|226226934|ref|YP_002761040.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090125|dbj|BAH38570.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 565 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 49/122 (40%), Gaps = 19/122 (15%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM----GSTAINDAMQTAYDTI 350 R+ F R + ++ F + + ++++ A++ + + Sbjct: 138 DRVALIAFAGRSYILTPLT-----GDDGALELFLDNLDPGVVGQAGSSLSRAIRQGSELL 192 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 ++ ++ A + +VLL+DGE+ E+ + ++A S+GI ++T+ F Sbjct: 193 LA----------SDGSADRALVLLSDGESFDSAEDIESAASEAGSKGISVVTVGFGTRDG 242 Query: 411 QQ 412 Sbjct: 243 AT 244 >gi|149919601|ref|ZP_01908080.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] gi|149819544|gb|EDM78972.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] Length = 349 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 60/189 (31%), Gaps = 28/189 (14%) Query: 275 RDALASVI--RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA-IDE 331 A+ + + + + +R G + +++ P L T Sbjct: 99 WQAVGDTLLDPDVGVVAPLQGDIRFGLSLYDNPGDMCPRVESTPLALNALADMTALYQSA 158 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ--------DN 383 G T A+ + D + ++ +K IVL TDGE Sbjct: 159 APEGDTPTGSALTSVADVV----------AQDPDPGEKVIVLATDGEPDTCAQPNPDEGQ 208 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA------SPNSFFEANSTHEL 437 E +A A +QG+R + ++ + + + + A S +++A L Sbjct: 209 PEAVAAAQAAYAQGVRTVIVSVGSGIS-ADHLQDMANAGAGVQPGGSDAVYYQALDQASL 267 Query: 438 NKIFRDRIG 446 F + I Sbjct: 268 IDAFSEIIA 276 >gi|297538282|ref|YP_003674051.1| von Willebrand factor type A [Methylotenera sp. 301] gi|297257629|gb|ADI29474.1| von Willebrand factor type A [Methylotenera sp. 301] Length = 328 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 45/171 (26%), Gaps = 29/171 (16%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 N +G F++ + + + I T T I + +A Sbjct: 122 NSRQNDMIGVITFSNSAMYVLPLTESREAIQAAIKATAGNSLF---QTNIGGGLTSAVSL 178 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI------ 403 + + + I+LL+DG + + + I + I Sbjct: 179 FENVPDSG----------SRAIILLSDGGGRLGGDVQQKLREWLQRYNITLYWIVLRQPG 228 Query: 404 AFSVNKTQQEKARYFLSNCA----------SPNSFFEANSTHELNKIFRDR 444 S+ +E L SP S +EA L D Sbjct: 229 GISIFNEYKEIDGEPLPQEVELYQYFKTLRSPFSAYEAEDPKSLANAIADI 279 >gi|242034177|ref|XP_002464483.1| hypothetical protein SORBIDRAFT_01g019250 [Sorghum bicolor] gi|241918337|gb|EER91481.1| hypothetical protein SORBIDRAFT_01g019250 [Sorghum bicolor] Length = 222 Score = 39.2 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 12/83 (14%), Positives = 23/83 (27%), Gaps = 7/83 (8%) Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 L++DG+ K + T F + ++ +F N Sbjct: 2 LMSDGQQNHGGN-----AADVKISNAPVYTFGFGADYDPTVLTAVVGNSM--GGTFSVVN 54 Query: 433 STHELNKIFRDRIGNEIFERVIR 455 +L F + + V R Sbjct: 55 DVDKLTMAFSQCLTGLLTVLVRR 77 >gi|254414923|ref|ZP_05028687.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196178412|gb|EDX73412.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 928 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 44/163 (26%), Gaps = 12/163 (7%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F+D R + N G T + +Q + Sbjct: 458 NPHDTFTIIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLN--- 514 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + + IVLLTDG +N+ + K G R+ + + + Sbjct: 515 ------FPEVDPGR--LRSIVLLTDGYIGNENQILAEVQRHLK-LGNRLHSFGAGSSVNR 565 Query: 412 QEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 R T E+ + F +I N + + Sbjct: 566 FLLNRIAEIGRGISRIVRYDEPTEEVAEQFFGQINNPVLTNIQ 608 >gi|85707636|ref|ZP_01038702.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] gi|85689170|gb|EAQ29173.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] Length = 740 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 40/398 (10%), Positives = 93/398 (23%), Gaps = 30/398 (7%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 + ++ ++ + I + + + Sbjct: 150 EAKANGQKAGLVEQHRPNVFRNSIANVGPGETVLVQIEFQAPIAQVAGDYSLRMPLVVGP 209 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 ++ L L + + A A + + + Sbjct: 210 RYIAGSANANGTLSRASLLAGSADLIAPTADPNMVARAGGGLNPVSITVNLDPGFA-PEA 268 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 S + G AD V + + + M+ K + L Sbjct: 269 ISSPYHAVSVRGSGSTRTVTLADGAVPANRDFELRWSASGDAP---MLGLFKQRHGELEY 325 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS-----IKKIDNVNDTVRMGATF 301 + V + R +I + + + ++ + R Sbjct: 326 VMATITPPALERVGEAPPREMIFVIDNSGSMAGESMPAARRSLLYALETLRPQDRFNVIR 385 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 F+D + + + KTF + G T + A++ A + DE Sbjct: 386 FDDTMTELFASAVQASDSNIAAAKTFTHNLMANGGTEMLPALRAA---LRDRAPDER--- 439 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 + ++ LTDG + + + I K R+ + Y + Sbjct: 440 ------VRQVIFLTDGALSNEADMMEEINRNRKDS--RVFMVGIGS-----APNTYLMRR 486 Query: 422 C--ASPNSFFEANSTHELNKIFRDRIGNEIFERVIRIT 457 A +F E + + +T Sbjct: 487 MAEAGRGTFTHVGMGEEAEDQMQRLLDRLSLPVATGLT 524 >gi|296158792|ref|ZP_06841621.1| TadE family protein [Burkholderia sp. Ch1-1] gi|295890997|gb|EFG70786.1| TadE family protein [Burkholderia sp. Ch1-1] Length = 165 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 44/148 (29%), Gaps = 15/148 (10%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + ++++ + G + AL+ P+ + ++ + L AA+ A Sbjct: 13 RQRIHARR-AGAQRGATAVEFALVFPLFFMILYAIITFSLILVAQQNLTMAAEEGARAAL 71 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + K+ ++ P Sbjct: 72 NWQSNTSLQTALTNRGNAACAAAKL--------------VTAKLVQSAQCTPSSTTCGPG 117 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 + V + Y+ NP+ L +GI Sbjct: 118 GTMQCVNVLLIYNYQANPIVPILPLLGI 145 >gi|227833165|ref|YP_002834872.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] gi|227454181|gb|ACP32934.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 521 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 43/308 (13%), Positives = 91/308 (29%), Gaps = 29/308 (9%) Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 +S L Q K E + + +S + + S D +++ Q L + Sbjct: 226 IFNYESVLYQLKDEGADLEVVIPSDGVISADYPLSSLASSSDKDTEAKVQALAEWLAERP 285 Query: 210 RTVKSYSSQNGKVGIRDEKLS-PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + S+ + + PY + + V +S Sbjct: 286 DKLTSFHLRVDNSDLPGTVFELPYPANEQTVDALEAAFAHELRNPGNTALVLDTSGSMEG 345 Query: 269 KKKHLVRDALASVIRSIKKIDNV-------NDTVRMGATFFNDRVISDPSFSWGVHK-LI 320 ++ L++ +L +I + ++ ++ K Sbjct: 346 ERMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYSSEPQQPTRARVDKDKPAT 405 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + G TA +A+ A+D + +S D +VL+TDGE T Sbjct: 406 TKELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIGT-----------VVLMTDGEVT 454 Query: 381 QDNEEGI---AICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + A + + I + I + Q+ + L + F+A + + Sbjct: 455 RGRTFAQFKDAYAQLPEDKKEIPVFVILYGEANIQEMEELAQL----TGGKTFDALNG-D 509 Query: 437 LNKIFRDR 444 L F + Sbjct: 510 LAAAFEEI 517 >gi|332262298|ref|XP_003280198.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Nomascus leucogenys] Length = 1284 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 48/147 (32%), Gaps = 17/147 (11%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + I + D V++G T ++ ++ ++ + G+T Sbjct: 202 LASVIAPFEIGPDKVQVGLTQYSGDAQTEWDL--NSLGTKEQVLAAVHRLRYKGGNTFTG 259 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ ++ EA K ++L+TDG+ +QD+ K G+ + Sbjct: 260 LALTHVL------GQNLQPAAGLRPEAAKVVILVTDGK-SQDDVHT--AARVLKDLGVNV 310 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNS 427 + + L ASP Sbjct: 311 FAVGV------KNADEAELRLLASPPR 331 >gi|145502106|ref|XP_001437032.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404179|emb|CAK69635.1| unnamed protein product [Paramecium tetraurelia] Length = 556 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 38/107 (35%), Gaps = 12/107 (11%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + +F + F+ + I+ D + G T DA+ A + + Sbjct: 165 NHIVKLVWFESFITDKCDFTNDFNNFIK-----LVDDASPRGGTKCYDAIAYAIEQL-KE 218 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + + + I+ LTDG++ Q E ++ N+ I I Sbjct: 219 IKKKYPNIILR------IIALTDGDDNQSKENPQSLVNRIFENQIII 259 >gi|126433420|ref|YP_001069111.1| von Willebrand factor, type A [Mycobacterium sp. JLS] gi|126233220|gb|ABN96620.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 233 Score = 38.8 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 34/93 (36%), Gaps = 4/93 (4%) Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD--NEEGI 387 G+T + + A D + ++ + +I+LLTDG+ D +E Sbjct: 87 AFTASGTTNMAAGIHLALDILEDRKH--AYKAAGLQYYRPWILLLTDGKPNLDGFDEAVA 144 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 + ++G+ + + Q+ R L Sbjct: 145 RLNAVESARGVTVFAVGAGPRVDYQQLGRLSLQ 177 >gi|315655690|ref|ZP_07908588.1| hypothetical protein HMPREF0388_1484 [Mobiluncus curtisii ATCC 51333] gi|315489754|gb|EFU79381.1| hypothetical protein HMPREF0388_1484 [Mobiluncus curtisii ATCC 51333] Length = 171 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 41/139 (29%), Gaps = 5/139 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 +F ++L+ G ++ ++ + M + V + L++ A Sbjct: 12 MPHRFAARFERLV-DEDGRIMLLGLAACVLLCVIILMSMAVSGVYLEQRRLQRLADQ--- 67 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + ++ + + +IE Y + V++ Sbjct: 68 -TASLAAANMADAAYYQNGIVDGVPLEIEPYHASERAAEYLSGAAISANSGLEGIDLVDV 126 Query: 123 NPRKSAYQVVLSSRYDLLL 141 + + QV L + + L Sbjct: 127 DVASTRVQVTLRATGKIPL 145 >gi|317403484|gb|EFV83991.1| hypothetical protein HMPREF0005_03145 [Achromobacter xylosoxidans C54] Length = 683 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 46/153 (30%), Gaps = 2/153 (1%) Query: 24 IITALLMPVMLGVGGMLV-DVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV-SSRAKN 81 + A++ V++G+ + V + Y + +++A+ A ++A L + + Sbjct: 1 MTVAIVFAVIVGLVLLGVAQLGYAYYMKREIQKASDLAALSAVQVLGLGAPADCARAQQA 60 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 T + L +++ + +P L + L Sbjct: 61 GRTAVLANVPAILDTFSADDVTVECKVWDSTRADASGMYVFDPAAGQTPNALRITVNKTL 120 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 + L + G ++ V+ Sbjct: 121 SSLLPDIVVDGAPVRVVSVAVSTRPVAVFTVGS 153 >gi|254482796|ref|ZP_05096033.1| hypothetical protein GPB2148_1280 [marine gamma proteobacterium HTCC2148] gi|214036877|gb|EEB77547.1| hypothetical protein GPB2148_1280 [marine gamma proteobacterium HTCC2148] Length = 1225 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 43/428 (10%), Positives = 98/428 (22%), Gaps = 47/428 (10%) Query: 37 GGMLVDVVRWSYYEHALKQA-------AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 GG D Y H L++ A++T + S E+ ++ + Sbjct: 101 GGGGSDNRNGLYVRHHLQEYSGTAKTRLDAALLTIYANIGDSSEKKNAPDEY------GY 154 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + + + + +Q L D+ +N +++ Sbjct: 155 VLNDVYNYLSGEDQTKLGAATPTTLADSAGYNPPGNWQTFQSPLVGE-DVCVNTYVIYIG 213 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + + A + + Y + + S+ + G Sbjct: 214 NNANGNISTDQDANSTNLRAFYTSVGETAPDQIQGGDDSLAPLPIQA-FSYEYEQGPDVI 272 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVS-CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + Y + + + P V + + Sbjct: 273 VEIPDYVEGTEETCYTPDPIGPTTVDEGTELGVTEGKYKDSQVPDCTINENSGGLCDGTP 332 Query: 269 KKKHLVRDALASVIRSIKKIDNV-NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 D + +V +T + F+ + G Sbjct: 333 AGVCECTDRVTNVNGQFYTRKVEITETYVIPLQEFDPVCEG---PTLGYQPDPIYGPGEV 389 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + + D L + G +G Sbjct: 390 IEIATPLDDPDLTSGDSWNMD-------------------DWAKALFSYGAPVPVEVDGE 430 Query: 388 AICNKAKSQGIRIMTI-AFSVN--KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFR 442 +A + TI AFS + + + + + A +F+A S E+ F Sbjct: 431 TTIERA---NVVTYTIDAFSADRLQLRNAQLSAVWQSTADVGGGRYFQATSEQEVRAAFY 487 Query: 443 DRIGNEIF 450 +G + Sbjct: 488 AILGEILA 495 >gi|256004503|ref|ZP_05429482.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|255991508|gb|EEU01611.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|316940035|gb|ADU74069.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 1313] Length = 1050 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 42/341 (12%), Positives = 96/341 (28%), Gaps = 37/341 (10%) Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 L++N F+ ++ I E V+ G ++ + + Sbjct: 589 GAGTLVINSSMYFMGNVLISLTNTNNVGEGFIVADGNIIIQGQNLYPNGPNDKLYVYSIG 648 Query: 195 DSEG--QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + A + +GK+ +K + L + ++ + Sbjct: 649 GNIEFQTSNSTINGIAYAPGNPANPNSGKIFFSGDKNTINGSIAANELDFFAGGLVVNHT 708 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + V+ + LV+DA + + M ++D + Sbjct: 709 EGQFDTVEEKYIDKSTY-LKLVKDAAKNFVDKFAGSKTK-----MAVIQYSDSANDNDFK 762 Query: 313 SWGVHKLIR-TIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNE------------DE 357 + + + +K G + + D M+ AY + Sbjct: 763 KYDLSLPDKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGS 822 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CNKAKS-------QGIRIMTIAFSVN 408 V ++ KK +G + + + AK +GI ++ I FS Sbjct: 823 VPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEE 882 Query: 409 KTQQEKARYFLSNCA-----SPNSFFEANSTHELNKIFRDR 444 + A + +++AN+ EL I + Sbjct: 883 DIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNM 923 >gi|298709908|emb|CBJ31633.1| conserved unknown protein [Ectocarpus siliculosus] Length = 304 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 23/228 (10%), Positives = 54/228 (23%), Gaps = 43/228 (18%) Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 V ++ Y +D S + L ++A Sbjct: 80 WTGVFRYFPGIAQESCGTYDPRIRPWYVAASSGPKDVVIVLDVSGSMSQYGRLDLAKEAA 139 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDEN------ 332 +VI ++ + F++ + S + + + Sbjct: 140 ETVINTLGA------DSFVNVVTFSETARVLLTNSTTLVRATEDNLGELVSLVQNLEFDL 193 Query: 333 EMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 T A +T +D + E + + + IV LTDG Sbjct: 194 ANVGTNFGAAFETTFDIL----EASRTSEETSSNCQTAIVFLTDGNTN------------ 237 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 ++ + + C + + +L++ Sbjct: 238 ------------VGLSTDEVTSKQI---ACDTGGIYEHVEDGGDLSQA 270 >gi|296158519|ref|ZP_06841349.1| TadE family protein [Burkholderia sp. Ch1-1] gi|295891087|gb|EFG70875.1| TadE family protein [Burkholderia sp. Ch1-1] Length = 278 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 13/153 (8%), Positives = 31/153 (20%), Gaps = 9/153 (5%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 T+ ++K + +G ++ PV+L V ++ V + A++ Sbjct: 15 RTRLTGNARKSM--QSGQSMTEFIIVAPVLLFVCFGILQFVLLYQAKS----TLDAAVLE 68 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ + + S + Sbjct: 69 AAREGAVNHGSMQSMRSGLARGLAPIYAHQANAEGVAAALASGQTDAANFSSITVLNPTP 128 Query: 124 PRKSAYQ---VVLSSRYDLLLNPLSLFLRSMGI 153 Y P + Sbjct: 129 AAIQDYSRPRYYADQAATYSEIPNDSLMYRDPS 161 >gi|254454002|ref|ZP_05067439.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198268408|gb|EDY92678.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 186 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 4/55 (7%), Positives = 18/55 (32%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + ++ G + AL+ + + ++ +++ A+ Sbjct: 8 LARFLRLFRRNEDGSPTVEFALIFLPFIILPVSGFELGLLMTRHVMMERGIDMAV 62 >gi|254450087|ref|ZP_05063524.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264493|gb|EDY88763.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 139 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 4/55 (7%), Positives = 18/55 (32%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + ++ G + AL+ + + ++ +++ A+ Sbjct: 8 LARFLRLFRRNEDGSPTVEFALIFLPFIILPVSGFELGLLMTRHVMMERGIDMAV 62 >gi|85713499|ref|ZP_01044489.1| Flp pilus assembly protein TadG [Nitrobacter sp. Nb-311A] gi|85699403|gb|EAQ37270.1| Flp pilus assembly protein TadG [Nitrobacter sp. Nb-311A] Length = 193 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 12/129 (9%), Positives = 39/129 (30%), Gaps = 11/129 (8%) Query: 6 KFIFYSK---KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAI 61 +F ++ + G + AL+ P+ + +++ + +L+ A +A Sbjct: 12 RFKATARMAARFQGHKGGSAAVEFALVAPMFFALLFAIIETGLVFFASQSLETALQDSA- 70 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ +++ +F + +Y F+ K + + Sbjct: 71 ----RTIMTGQAQIADLT--KQSFKTNIVCKYAGILFDCENKIYVDVQSYPSGFGSVEIS 124 Query: 122 MNPRKSAYQ 130 + Sbjct: 125 NPISGEDFT 133 >gi|149410435|ref|XP_001512838.1| PREDICTED: similar to Coch-5B2 gene product [Ornithorhynchus anatinus] Length = 692 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 20/144 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + + Sbjct: 554 TYDQRTEFSFTDYTTKENVLAVIRQIRYMSGGTATGDAVAFTVRNVF--------GPLRD 605 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V+LTDG++ D A GI + +I L + AS Sbjct: 606 SPNKNFLVVLTDGQSYDDVRGP---AAAAHKAGITVFSIGM------AWAPLDDLKDMAS 656 Query: 425 PN---SFFEANSTHELNKIFRDRI 445 F L +I D I Sbjct: 657 EPKESHTFFTREFAGLEQIVTDVI 680 >gi|118589698|ref|ZP_01547103.1| hypothetical protein SIAM614_04640 [Stappia aggregata IAM 12614] gi|118437784|gb|EAV44420.1| hypothetical protein SIAM614_04640 [Stappia aggregata IAM 12614] Length = 184 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 10/72 (13%), Positives = 27/72 (37%), Gaps = 4/72 (5%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + ++ ++ +G + AL+ V +++V + + + + A+ Sbjct: 7 LLTSRQNRGRAAFGRNDSGATAVEFALIAIPFFTVVFGIIEVGLYHFV-NRM---FDNAV 62 Query: 62 ITASVPLIQSLE 73 ITAS + Sbjct: 63 ITASREIRTGQA 74 >gi|85709840|ref|ZP_01040905.1| cytochrome c oxidase assembly protein [Erythrobacter sp. NAP1] gi|85688550|gb|EAQ28554.1| cytochrome c oxidase assembly protein [Erythrobacter sp. NAP1] Length = 363 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 12/78 (15%), Positives = 30/78 (38%), Gaps = 1/78 (1%) Query: 7 FIFYSKKLIK-SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 I+ ++++ K + +A + ++LG+ ++ +V W H L A A + Sbjct: 286 LIYLARRVRKSDRRASIAVHSAFGVMIILGIATVMTEVSLWVAAAHQLTGALLVASTAWA 345 Query: 66 VPLIQSLEEVSSRAKNSF 83 + + + S Sbjct: 346 MHSDGVRIQGQRASAKSA 363 >gi|327193256|gb|EGE60162.1| hypothetical protein RHECNPAF_1700075 [Rhizobium etli CNPAF512] Length = 251 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 39/132 (29%), Gaps = 7/132 (5%) Query: 5 TKFIFYSKKLI---KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + F F +L+ + + AL++P+++ + VD+ + Sbjct: 65 SLFRFARSRLLDLARDRLAASGVEFALVLPILVMLLFGTVDLGHALTVSRKI----DEIA 120 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 T + Q SS + ++ Y + L N + V +A Sbjct: 121 STTGDLIGQQSSWTSSDVTKLLSGASFILQPYDTSGLKITLTVNDISKNGNATVNWSAAF 180 Query: 122 MNPRKSAYQVVL 133 ++ Sbjct: 181 NTSALNSGAASA 192 >gi|302687344|ref|XP_003033352.1| expressed protein [Schizophyllum commune H4-8] gi|300107046|gb|EFI98449.1| expressed protein [Schizophyllum commune H4-8] Length = 184 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 28/92 (30%), Gaps = 12/92 (13%) Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 FN V + + + + + + G T A+ + E Sbjct: 25 FNSTVSTVIANDF---ARNPDQLLQGIVGQTAGGGTNYTAALSATETLLRQHWSTERTP- 80 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 ++ L+DGE + + A+C A Sbjct: 81 --------VVIFLSDGECSVGDHSVHALCQAA 104 >gi|256374467|ref|YP_003098127.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918770|gb|ACU34281.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 564 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 38/112 (33%), Gaps = 5/112 (4%) Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNEEGI-AICNKAKSQG 397 A+ + + RM++ + +K +VL+TDG N + +S+ Sbjct: 446 AALVGGGTALNDTTLAAFRRMQSTYDPEKINSVVLITDGRNDDYASITTAQLLQALESES 505 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 I + QE L A+ ++A ++ + D I Sbjct: 506 DPARPIPLIMVGLGQEADMEALQEISSATGGKAYQALEAADIRSVLLDAISQ 557 >gi|222100886|ref|YP_002535454.1| hypothetical protein CTN_1912 [Thermotoga neapolitana DSM 4359] gi|221573276|gb|ACM24088.1| Putative uncharacterized protein [Thermotoga neapolitana DSM 4359] Length = 759 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 30/348 (8%), Positives = 95/348 (27%), Gaps = 18/348 (5%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 I + + ++ ++ + + N ++ + L+ + + F ++ Sbjct: 154 TNRPIGDLVISAGMTMPAKKTSNVRFEGNLNSNVGPGSFTITLTDDSGVDHDVRVWFEKT 213 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQP---LNCFGQP 207 G + + S+ Y V+ D S + + + + Sbjct: 214 AG------DLGTDPFSSSQRYTMRIDVNNDGTADVSGYAVFNEFGRVEEAGIYADSQTIT 267 Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 A+ ++ +S + + V N ++ + + L Sbjct: 268 ANGSITGSTSLSDGTYQAVVLDNSGNVVYNGTVDVSNGSYTISDPNIVGGNDYTVMLNSD 327 Query: 268 IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTF 327 + D+ S ++ + + + ++ + T + Sbjct: 328 YQTVTANTDSSLSGTTTLSDGNYNATVLDSSGNVIYSGTVDVSGGNFTISDSDITSGNDY 387 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + T + + T+ N + R + E +++G T + + Sbjct: 388 TVILYSESQTTTANGIIEGTTTLSDGNYQVIVRDSSGNEVYNGTTTVSNGSFTISDSDIT 447 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 + + T+ F + + + F+EA++ Sbjct: 448 SGND---------YTVTFLSSSLTIAPGAELVIPTSGEPRFYEADNPT 486 >gi|125973772|ref|YP_001037682.1| carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] gi|125713997|gb|ABN52489.1| Carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] Length = 1050 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 41/341 (12%), Positives = 95/341 (27%), Gaps = 37/341 (10%) Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 L++N F+ ++ I E V+ G ++ + + Sbjct: 589 GAGTLVINSSMYFMGNVLISLTNTNNVGEGFIVADGNIIIQGQNLYPNGPNDKLYVYSIG 648 Query: 195 DSEG--QPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + + + +GK+ +K + L + ++ + Sbjct: 649 GNIEFQTSNSTINGIVYAPGNPANPNSGKIFFSGDKNTINGSIAANELDFFAGGLVVNHT 708 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + V+ + LV+DA + + M ++D + Sbjct: 709 EGQFDTVEEKYIDKSTY-LKLVKDAAKNFVDKFAGSKTK-----MAVIQYSDSANDNDFK 762 Query: 313 SWGVHKLIR-TIVKTFAIDENEM--GSTAINDAMQTAYDTIISSNE------------DE 357 + + + +K G + + D M+ AY + Sbjct: 763 KYDLSLPDKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGS 822 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI--CNKAKS-------QGIRIMTIAFSVN 408 V ++ KK +G + + + AK +GI ++ I FS Sbjct: 823 VPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEE 882 Query: 409 KTQQEKARYFLSNCA-----SPNSFFEANSTHELNKIFRDR 444 + A + +++AN+ EL I + Sbjct: 883 DIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNM 923 >gi|226325178|ref|ZP_03800696.1| hypothetical protein COPCOM_02970 [Coprococcus comes ATCC 27758] gi|225206526|gb|EEG88880.1| hypothetical protein COPCOM_02970 [Coprococcus comes ATCC 27758] Length = 162 Score = 38.8 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 52/141 (36%), Gaps = 4/141 (2%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA--SVPLIQSLE 73 S G + A+++P+++ + L+ V +Y A+ A ++A + + S + Sbjct: 5 HSERGSEVLEAAIVLPMLIVLAFCLIQVCLIAYEGTAMSAAIESAAMKSDFSAAIASGNV 64 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + + + + P + + N + + + V D ++ + +L Sbjct: 65 NQAVKDELASNTPGIVKDNITVTNSKVSYSDRSDEATVTDTGSTGITTISKSSTT--ALL 122 Query: 134 SSRYDLLLNPLSLFLRSMGIK 154 S D + L GIK Sbjct: 123 ESTVDYKIPTLISVGSFDGIK 143 >gi|306829469|ref|ZP_07462659.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] gi|304428555|gb|EFM31645.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] Length = 861 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 41/321 (12%), Positives = 82/321 (25%), Gaps = 58/321 (18%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + + + V D S SM S T+K + G + Sbjct: 66 TSKLGTETQSEPLDVVLVADLSGSMNKRDVPSSTGRTITRLDALKNTLKGTRDRQGLIDT 125 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 S +S + S + +++ D ++ + Sbjct: 126 -ILSNSNNRLSMVGFGGKIDNKFAEQSWNSYYRKWEWGYRYWPYEERTAFYDGVSPWDDA 184 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 ++ ND V K+ ++ + T I+ ++ Sbjct: 185 DTILNWNND---------------ASGSKTAVSKMRIAGGQSIGTESGIGTGTNISAGIR 229 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---------NEEGIAICNKA-- 393 A I S+ AKK +++L+DG N A Sbjct: 230 IANQLIDSARP----------NAKKVVIVLSDGFANMYYNDSGYTVYNYNNQDGSETAPE 279 Query: 394 ---KSQGIRI--------------MTIAFSVNKTQQEKARYFL----SNCASPNSFFEAN 432 + + I +I F + N + PN AN Sbjct: 280 WFWNNLDVSINNLAYSLAPKLDGFYSIKFRYSNNVDSITSLQYYIRYHNSSIPNEILSAN 339 Query: 433 STHELNKIFRDRIGNEIFERV 453 + +L F+D + + Sbjct: 340 NEDQLRDSFKDITDKILPLGI 360 >gi|324503178|gb|ADY41386.1| C-type lectin domain-containing protein 160 [Ascaris suum] Length = 358 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 56/168 (33%), Gaps = 16/168 (9%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A + ++ +R+ F D + H + F ID + + Sbjct: 33 ATLFLPMNVSSESAQGQFIRVALVAFADNAVIVGDL-NKYHNYASLVEGLFTIDYHGGKT 91 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 I ++ A + SS I+L + + + AI N+ K Sbjct: 92 LNIEAGLKAASTVLESSRHYAKT----------VILLYSSAYSAGGFADPNAIANQIKES 141 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH---ELNKIF 441 G +I+TIAF + L + +SPN F + T ++ F Sbjct: 142 GTKIITIAFRQQPEGTLVKK--LGHLSSPNFAFGSMDTSIIAKITNAF 187 >gi|124504959|ref|XP_001351221.1| CSP and TRAP-related protein (CTRP) [Plasmodium falciparum 3D7] gi|4493942|emb|CAB38978.1| CSP and TRAP-related protein (CTRP) [Plasmodium falciparum 3D7] Length = 2114 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 33/269 (12%), Positives = 76/269 (28%), Gaps = 17/269 (6%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 N +F++ + ++ + +E+ S + S S S SE P Sbjct: 746 NVSEVFVKKICDNGVVLPPGSPSESTPGSPSESTPGSPSESTPGSPSESTPGNPSESTP- 804 Query: 202 NCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS 261 + + N S ++S + +D Sbjct: 805 --------GSPSESTPGNPSESTPGSPSESTPGSPSESTPCSGTECLCHNTYDLTLIIDE 856 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S+ + V + +++ + V MG F+D++ + + Sbjct: 857 SASIGYSNWEKEVVPFTIGLASNLEISEKK---VNMGILLFSDKIREFIKYGQ-KESYDK 912 Query: 322 TIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT- 380 + D + + + + + L K +LLTDG NT Sbjct: 913 NNLVRRIHDLKKY---YKSGGFSYIVEALKYGLYSYAKSTSSRLNVPKVNILLTDGNNTD 969 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + + + K + ++++ I Sbjct: 970 TSDFILTEVSSLYKKENVKLLLIGIGGPT 998 >gi|77359583|ref|YP_339158.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas haloplanktis TAC125] gi|76874494|emb|CAI85715.1| conserved protein of unknown function ; putative Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 664 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 47/181 (25%), Gaps = 20/181 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A ++ ++ +D+ F++ V + F G Sbjct: 341 QAKNALFYALSLLDSN---DSFNIIGFDNVVTLMSDKPLVASGFNLRRAERFIYGLQADG 397 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T I A+ D + ++ LTDG + ++ +I K Sbjct: 398 GTEIQGALDAVLDG------------SQFDGFVRQVIFLTDGSVSNEDALFKSIQAKLGD 445 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 R+ T+ R SF ST E+ + + Sbjct: 446 S--RLFTVGIGSAPNSFFMRRA---ADVGKGSFTFIGSTSEVQPKMQQLFDKLAHPAITN 500 Query: 456 I 456 + Sbjct: 501 L 501 >gi|134295602|ref|YP_001119337.1| hypothetical protein Bcep1808_1494 [Burkholderia vietnamiensis G4] gi|134138759|gb|ABO54502.1| membrane protein-like protein [Burkholderia vietnamiensis G4] Length = 706 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 14/162 (8%), Positives = 40/162 (24%), Gaps = 6/162 (3%) Query: 16 KSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQ---TAIITASVPLIQS 71 + G I+ A+ L ++ + G +D+ R+ ++ AA + + + Sbjct: 20 RRQRGSIAIMAAIWLSVAVIVLSG--IDIGRFYSERRHMQAAADLAALSSVMKASADSTC 77 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + + + + + Q Sbjct: 78 TAAKTAAQQVANPTANAVPANATLSVTCGVWTAPQSSSGSALFTPTAGDPTAGQCGGLQA 137 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 ++ +++ G A A S Sbjct: 138 GTANGQSANAVCVTVSEPVTGFFRSGTTISASAIAKSTPTDT 179 >gi|77458974|ref|YP_348480.1| transporter, putative [Pseudomonas fluorescens Pf0-1] gi|77382977|gb|ABA74490.1| Putative transport protein [Pseudomonas fluorescens Pf0-1] Length = 804 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 11/33 (33%), Positives = 16/33 (48%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 +T +PVM+ G+ VD + Y L QA Sbjct: 684 LTIATLPVMVLAVGIGVDYAFYIYNRLQLHQAL 716 >gi|301617755|ref|XP_002938301.1| PREDICTED: integrin alpha-E-like [Xenopus (Silurana) tropicalis] Length = 1148 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 23/114 (20%), Positives = 43/114 (37%), Gaps = 8/114 (7%) Query: 314 WGVHKLIRTIVKTFAIDENEMGS-TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + + D ++G+ T A+ +++ + ++ A K I+ Sbjct: 240 DLLESRKGRFILQKVQDIKQVGNVTKTASALDHVLESVFTEEH------GSSETATKIIL 293 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 +LTDG+ D + + N +K + I I Q+EKA L AS Sbjct: 294 VLTDGDIFMDPMDINDVMNNSKMKKIERFVIGVG-EAFQKEKALKTLKTIASQG 346 >gi|148675425|gb|EDL07372.1| mCG142757 [Mus musculus] Length = 1320 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + ++ + G+T A+ + ++ EA K ++ Sbjct: 230 DLNSFQTKEQVLAAVHHLRYKGGNTFTGLALTHVLE------QNLKPAAGVRPEAAKVLI 283 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 L+TDG+ +QD+ K Q I + + Sbjct: 284 LVTDGK-SQDDVRT--AARILKDQDIDVFVVGV 313 >gi|123231311|emb|CAM23231.1| collagen, type XX, alpha 1 [Mus musculus] gi|220938197|emb|CAX15853.1| collagen, type XX, alpha 1 [Mus musculus] Length = 1280 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + ++ + G+T A+ + ++ EA K ++ Sbjct: 222 DLNSFQTKEQVLAAVHHLRYKGGNTFTGLALTHVLE------QNLKPAAGVRPEAAKVLI 275 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 L+TDG+ +QD+ K Q I + + Sbjct: 276 LVTDGK-SQDDVRT--AARILKDQDIDVFVVGV 305 >gi|258679414|ref|NP_082794.1| collagen alpha-1(XX) chain [Mus musculus] gi|292495082|sp|Q923P0|COKA1_MOUSE RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|123231310|emb|CAM23230.1| collagen, type XX, alpha 1 [Mus musculus] gi|220938196|emb|CAX15852.1| collagen, type XX, alpha 1 [Mus musculus] Length = 1320 Score = 38.8 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + ++ + G+T A+ + ++ EA K ++ Sbjct: 230 DLNSFQTKEQVLAAVHHLRYKGGNTFTGLALTHVLE------QNLKPAAGVRPEAAKVLI 283 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 L+TDG+ +QD+ K Q I + + Sbjct: 284 LVTDGK-SQDDVRT--AARILKDQDIDVFVVGV 313 >gi|308473011|ref|XP_003098732.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] gi|308268332|gb|EFP12285.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] Length = 396 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 41/151 (27%), Gaps = 8/151 (5%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI-ND 341 I T R+G +N + A+ + Sbjct: 69 TRIGSNSVEPRTTRLGLVTYNSVASQKADLNQYQSIADAYTGVFDALSTTVDTIQSYLAT 128 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 + A ++ K+ +++ N + + + + K I I+ Sbjct: 129 GLALAERMLVDQT-----VNSTRAHYKRVMIVYASEYNGNGESDPLPLAERLKLSNINII 183 Query: 402 TIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 T+A+ + L+ ASP F + Sbjct: 184 TVAYEQPGSV--GLLQGLTQIASPGFSFSSE 212 >gi|149200659|ref|ZP_01877657.1| probable transcriptional regulatory dna-binding repressor transcription regulator protein [Lentisphaera araneosa HTCC2155] gi|149136249|gb|EDM24704.1| probable transcriptional regulatory dna-binding repressor transcription regulator protein [Lentisphaera araneosa HTCC2155] Length = 368 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 23/58 (39%) Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 KK V+ D E + +E + + AK+ I + +I + Q + AS Sbjct: 217 KKLAVIAGDKEQSTTDERLDGVWSAAKAHNIPLESIRVEYSDYQINGGYEAMERLASD 274 >gi|218528585|ref|YP_002419401.1| TadE family protein [Methylobacterium chloromethanicum CM4] gi|218520888|gb|ACK81473.1| TadE family protein [Methylobacterium chloromethanicum CM4] Length = 193 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 8/75 (10%), Positives = 19/75 (25%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + G + AL+ L + + + + L A A Sbjct: 15 RMLARFRSDAEGVVVVEFALVAMPFLMLVAAIFECCLVCLGQLTLDTAMDRATRAVFTGT 74 Query: 69 IQSLEEVSSRAKNSF 83 Q + + ++ Sbjct: 75 FQEASDGTDPSERMQ 89 >gi|73992734|ref|XP_543096.2| PREDICTED: similar to Protein KIAA1510 precursor [Canis familiaris] Length = 1405 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 57/184 (30%), Gaps = 19/184 (10%) Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 + P + + + + VD S + + Sbjct: 176 SSPSQDPPTLAGSDWGKAGPQFRCTPPSPVDMIFLVDGSWSIGH----SHFQQVKDFLAS 231 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TIVKTFAIDENEMGSTAIND 341 I+ + V++G T ++ ++ W ++ L ++ + G+T Sbjct: 232 VIEPFEIGPSKVQVGLTQYSGDPQTE----WDLNALRTKEDVLAAVRRLRYKGGNTFTGL 287 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM 401 A+ + + EA K ++L+TDG+ +QD+ K G+ I Sbjct: 288 ALTHVLE------HNLRPAAGPRPEATKVLILVTDGK-SQDDARAAG--RILKDLGVAIF 338 Query: 402 TIAF 405 + Sbjct: 339 AVGV 342 >gi|114564047|ref|YP_751561.1| type IV pilin biogenesis protein, putative [Shewanella frigidimarina NCIMB 400] gi|114335340|gb|ABI72722.1| type IV pilin biogenesis protein, putative [Shewanella frigidimarina NCIMB 400] Length = 1204 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 25/284 (8%), Positives = 61/284 (21%), Gaps = 10/284 (3%) Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRD 226 + +D + +Y + + + + N K + Sbjct: 213 TVAKRVLNEVIVNTPGVDLGLMLFNYNASGSNRSSENKDGGRVVSKINTGNINNKKALAS 272 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI-KKKHLVRDALASVIRSI 285 + + N + + + V + + I Sbjct: 273 IVKNTIAIKSNNTPLCETLYEAALYLHGKPLEYGHDTYDSVAATPLNKTWKWVKKWNGWI 332 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 ++ VN + + K + D T +++ Sbjct: 333 EESTPVNPAQYNPGYDATAESVDGYISPFIGKKCGSNASIIYITDGAPTSDTDAQGKVKS 392 Query: 346 AYDTIISSNEDEVHRMKN-NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + +V ++ + + L T D Q + TI Sbjct: 393 LLGLASTTTDADVPGSESLGNDNYSLLPPLAHWLATNDVNPNSEG-----DQVVNTYTIG 447 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 FS + ++ A EL I Sbjct: 448 FSADAPVDILISTAAL---GGGKYYSALDPAELQASLISAIAAI 488 >gi|188534085|ref|YP_001907882.1| hemolysin activator protein [Erwinia tasmaniensis Et1/99] gi|188029127|emb|CAO96997.1| hemolysin activator protein [Erwinia tasmaniensis Et1/99] Length = 576 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 87/337 (25%), Gaps = 18/337 (5%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN--NLKKNFTD 108 + A A L ++ + K + ++ ++ T Sbjct: 104 ARFQWALDAAADAKGRCLGGQGIVLAINKVQNALLAKGYVTTRVMAQEQDLTQNVLTLTM 163 Query: 109 REVRDIVRDTAVEMNPRKSAYQVVLSSRYDL----LLNPLSLFLRSMGIKSWLIQTKAEA 164 + R ++ R + V+ +S D+ + R + + I+ A Sbjct: 164 QPGRIGEIRFDEPVSWRGRLWNVIPASSGDILNLRDIEQGLENFRRVPSANADIKIVPGA 223 Query: 165 ETVSRSYHKE----HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 + + V + +D S S + D Sbjct: 224 QEATSDLQVNWHEGRPVRLTLGLDDSGSKSTGRYLGS------ATLALDAPFAQNDLFYA 277 Query: 221 KVGIRDEKLSPYMVSCNKSLYYML-YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALA 279 +G + P+ + Y+ + ++ + + + + D + Sbjct: 278 NIGKDLFQHGPFGNRSHALNYFFPLGYWAFSANYNDYTYHQNIPNANEVMSYSGKSDNVQ 337 Query: 280 SVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 + + D + T ++ + S K + + G+T + Sbjct: 338 LTLSRLLYRDQSHKTTLNLRSYRKHSTNAVDSVDMVSQKRRTAGWELGLNQRSYFGTTTL 397 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + + N + + + +LL D Sbjct: 398 DANISW-RRGTGAFNALPAPEESRHEGSARTGMLLGD 433 >gi|220929575|ref|YP_002506484.1| lipolytic protein G-D-S-L family [Clostridium cellulolyticum H10] gi|219999903|gb|ACL76504.1| lipolytic protein G-D-S-L family [Clostridium cellulolyticum H10] Length = 436 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 37/367 (10%), Positives = 84/367 (22%), Gaps = 35/367 (9%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 IK +G F II +LL + L G ++V A + + Sbjct: 6 IKRKSGAFLIIFSLLFVLFLSSVG-SINVS-----------ALEPPRLYG---DCNGDNN 50 Query: 75 VSSRAKNS-FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 V + + + ++ Y + N+ + + ++ Sbjct: 51 VDALDFVAFKQYIMEQNHAYDLYKDLNSDNTVDAVDFAILKQFLLGIVKSLPINSAPTQP 110 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQ 193 + G + I+ + +S + + VID Sbjct: 111 GVLLVGRFDTGDPAGPKFGWSTSTIKANFKGTGISANLNSMGDNWFNVVIDGVVKSPVNV 170 Query: 194 RDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL---- 249 P+ + + + I + + + V+ L Sbjct: 171 PSGSSGPVTLATDLTNGNHTIELVRRTEAWIGEVQFLGFNVTDGSLLAPPAPSNRRIEFI 230 Query: 250 -DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI- 307 D +S + K A + + D + N Sbjct: 231 GDSITCGYGNEGTSQYQSFTNKNENAYLAYGATTARLLNADPITVCWSGKGLVQNYGGDL 290 Query: 308 -------------SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 + + +W K I +V + + AY ++ Sbjct: 291 NDLMPAVYSRILPYNSTLTWDTSKWIPQVVVINLGTNDFSTGSIDKTTFAKAYTNFVTKI 350 Query: 355 EDEVHRM 361 + Sbjct: 351 RGQYPDA 357 >gi|239820677|ref|YP_002947862.1| transcriptional regulator, IclR family [Variovorax paradoxus S110] gi|239805530|gb|ACS22596.1| transcriptional regulator, IclR family [Variovorax paradoxus S110] Length = 546 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 27/291 (9%), Positives = 64/291 (21%), Gaps = 13/291 (4%) Query: 56 AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A A + A + + + + L + + + Sbjct: 185 APDRAAVAAPIFDYEGRPIGAVAISTDASRLDAARAHTLSSALMGAARSITHSAAGKAMS 244 Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 T +P ++ V + ++ + S + Sbjct: 245 IATQ---SPPETTSSVEVRCVSEVRSLLGEGPMWSPRDNALYWVDILTPSVHRYDAATGV 301 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 S + T + + + +VG + + + Sbjct: 302 DTETPLGTMASLVVPKASGGLLVATPGGIMALDLSTRRLTAFCHPEVGRTNVRYNDGKCD 361 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSS-SLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 L+ + V+++ S + V + K V DT Sbjct: 362 RRGRLWVGSMDMAAAANRGHLFCVEANGSWKQVDTGFTVPN---GIGWSPDNKTMYVTDT 418 Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 R ++ + G R ++ A D G T + Sbjct: 419 YRQTIYRYD------FDLACGTVGNRRPLITIAASDGKPDGLTVDDQGCLW 463 >gi|168214300|ref|ZP_02639925.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens CPE str. F4969] gi|170714242|gb|EDT26424.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens CPE str. F4969] Length = 928 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 36/390 (9%), Positives = 92/390 (23%), Gaps = 30/390 (7%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E S + + ++ NN + E R S ++ Sbjct: 35 AEEKSDSMSVEKVLDSEMYNKFNNEILNNKSSIKYTTDNNKGTWPVNWEYGNRSSKNKIN 94 Query: 133 LSSRYDLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 S L +L + + ++ + +++D S SML Sbjct: 95 NSVYGKNQPLEYDEGYLTKKAYTTDEDNVFNINLKIQGKKNEALKKDVVFLLDNSNSMLT 154 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 R + + + + + + + S+ Y Sbjct: 155 NNRALKIKEEIKNVMDKLKAKNTRYALVTYAST--ILDGRHYHLMDGSIGNNQYTVYDLY 212 Query: 252 SLSEEHFVDSSSLRHVIKKKHL------VRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 + ++ + +S+++ + K + N Sbjct: 213 TSNQCYLNFTSNIQDIYNKIPSTVPNQRNNSYAGGTFTQQGLLK-----AIELLKNSNAD 267 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + G+ + + ++ + + + + Sbjct: 268 EKIIIHLTDGLPTFSFLLKEFGGNEKAIFDYNTQYNGIGVRGFGTSYFFNTKTQKPYIYS 327 Query: 366 EAKKYIVLL---TDGENTQDNEEGIAI-CNKAKSQG--IRIMTIAFSVNKTQQEKARYFL 419 + Y L E +N + K + I I TI + Sbjct: 328 REEVYSALNRSKNKYEAIWNNGFPTTLEAENIKKENPDINIYTIGIE-------LKKRVY 380 Query: 420 SNCASPNSFFEANSTHELNKI--FRDRIGN 447 +++A +L +I F + I + Sbjct: 381 KW-TDNRQYYDAEGIVKLPEIRKFLESISS 409 >gi|262184150|ref|ZP_06043571.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 500 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 43/308 (13%), Positives = 91/308 (29%), Gaps = 29/308 (9%) Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 +S L Q K E + + +S + + S D +++ Q L + Sbjct: 205 IFNYESVLYQLKDEGADLEVVIPSDGVISADYPLSSLASSSDKDTEAKVQALAEWLAERP 264 Query: 210 RTVKSYSSQNGKVGIRDEKLS-PYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + S+ + + PY + + V +S Sbjct: 265 DKLTSFHLRVDNSDLPGTVFELPYPANEQTVDALEAAFAHELRNPGNTALVLDTSGSMEG 324 Query: 269 KKKHLVRDALASVIRSIKKIDNV-------NDTVRMGATFFNDRVISDPSFSWGVHK-LI 320 ++ L++ +L +I + ++ ++ K Sbjct: 325 ERMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYSSEPQQPTRARVDKDKPAT 384 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 + G TA +A+ A+D + +S D +VL+TDGE T Sbjct: 385 TKELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIGT-----------VVLMTDGEVT 433 Query: 381 QDNEEGI---AICNKAKSQG-IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 + A + + I + I + Q+ + L + F+A + + Sbjct: 434 RGRTFAQFKDAYAQLPEDKKEIPVFVILYGEANIQEMEELAQL----TGGKTFDALNG-D 488 Query: 437 LNKIFRDR 444 L F + Sbjct: 489 LAAAFEEI 496 >gi|205371983|ref|ZP_03224801.1| hypothetical protein Bcoam_00436 [Bacillus coahuilensis m4-4] Length = 245 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 K I+LLTDG + + +A+ A QG+ I I N T EK + A+ Sbjct: 7 KQILLLTDGCSNSG-GDPVAMAALAYEQGLTINVIGVMDNDTIDEKGMSEIEGIATSG 63 >gi|85374480|ref|YP_458542.1| hypothetical protein ELI_08265 [Erythrobacter litoralis HTCC2594] gi|84787563|gb|ABC63745.1| hypothetical protein ELI_08265 [Erythrobacter litoralis HTCC2594] Length = 233 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 9/52 (17%), Positives = 25/52 (48%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA 57 + I ++++ G + A + P++L +G + +++ + + QAA Sbjct: 10 RRIHALCRIVRDTRGVALVEFAFISPIILLMGVVGIEMANQAVVNMRISQAA 61 >gi|320537260|ref|ZP_08037220.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145888|gb|EFW37544.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 319 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 33/90 (36%), Gaps = 9/90 (10%) Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ + TA +Q A + ++ N K I+L TDGE ++ Sbjct: 144 VLLKAIESLSPSSYTASGTNLQKA--LLKAAAVFP-----KNRATAKTIILCTDGEQSEG 196 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + + GI+++ + F + Sbjct: 197 NIL--EAAKEIQRHGIQLIIVGFGTIEGGD 224 >gi|222147189|ref|YP_002548146.1| hypothetical protein Avi_0228 [Agrobacterium vitis S4] gi|221734179|gb|ACM35142.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 194 Score = 38.8 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 16/178 (8%), Positives = 44/178 (24%), Gaps = 10/178 (5%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + L +G + AL++P++L + ++ ++ Sbjct: 11 LCRPFRLLALDRSGVGAVEFALIVPLLLVLYLGAFELTMALSVSQR------ATTSAGAI 64 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 I + ++ + ++ T +V V+ T + Sbjct: 65 ADIVARKQKTVDKTFLAN-MPDVLKAMFAPTATTGYTLKITGIKVDSNVKATIAWSWAQD 123 Query: 127 SAYQVVLSSRYDLLLNP---LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQW 181 + + L + F+ + + T T S + Sbjct: 124 GSKPYATGATVTLPSGMAAANAFFVHAELTIPHELVTYLPGFTGSSVSTITIARDYYF 181 >gi|83859351|ref|ZP_00952872.1| hypothetical protein OA2633_13140 [Oceanicaulis alexandrii HTCC2633] gi|83852798|gb|EAP90651.1| hypothetical protein OA2633_13140 [Oceanicaulis alexandrii HTCC2633] Length = 178 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 9/77 (11%), Positives = 22/77 (28%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 + G + ALL P M+ + V + + + Q A + + + Sbjct: 11 FHRDERGVSAVEFALLAPFMIALYLGSVQLTLGLTADRKVSQVANSVADLVTQDDFVTDA 70 Query: 74 EVSSRAKNSFTFPKQKI 90 ++ + Sbjct: 71 DLLDIYAAADAILNPFA 87 >gi|332305539|ref|YP_004433390.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332172868|gb|AEE22122.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 1359 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 35/125 (28%), Gaps = 30/125 (24%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD------NEEGIAIC----------- 390 I+ + +IVLL+DGE + A C Sbjct: 560 QQIVQGARYLSPITDRQCQVNNHIVLLSDGEANNNHSVDEIETLLSASCTGSGGEKCGLS 619 Query: 391 ---------NKAKSQGIRIMTIAFSVNKTQQE-KARYFLSNCASPNSFFEANSTHELNKI 440 I TI F+ N + L F++A+++ EL Sbjct: 620 LVRNIADTEESVIDSRIITHTIGFAANTQANSFLNQIALQ---GGGGFYQADNSQELLGA 676 Query: 441 FRDRI 445 F+ + Sbjct: 677 FQSIL 681 >gi|194686862|dbj|BAG66069.1| complement factor B-2 [Lethenteron japonicum] Length = 749 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 43/139 (30%), Gaps = 4/139 (2%) Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS----WGVHKLIRTIVKTF 327 + + ++ I K+ N + F + + + + +I + + Sbjct: 263 DNFQLGVGFLMNLILKMSNYAIQAKYAIVVFASQAKTKLALTEKANQDPKAVINILKELS 322 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 + E T +A+ I S + + H + K ++TDG + I Sbjct: 323 YSEFEESPGTNTGEALLQVVALINSYKQQQQHLPIPWNQNKHISFIVTDGRSNMGPPPKI 382 Query: 388 AICNKAKSQGIRIMTIAFS 406 + K + I T Sbjct: 383 VMNEKFADIPMDIYTFGIG 401 >gi|75907531|ref|YP_321827.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701256|gb|ABA20932.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 427 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 46/192 (23%), Gaps = 37/192 (19%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A+ ++ ++ D R+ F+ + + +I G Sbjct: 65 AVERLLDRLQPGD------RISVVAFSGSATVIIP--NQIVEDPESIKTQIRKKLQASGG 116 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ------------DNE 384 T I + +Q ++ V + LLTDG D+ Sbjct: 117 TVIAEGLQQGITELMKGTRGAVSQA----------FLLTDGHGEDSLKIWKWEIGPDDSR 166 Query: 385 EGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELNKIFR 442 KA + I T+ F + L A + + F Sbjct: 167 RCQEFAKKAAKINLTINTLGFG-----NNWNQDLLETIADAGGGTLAHIERPEQAVHHFN 221 Query: 443 DRIGNEIFERVI 454 + Sbjct: 222 RLFARVQSVGLT 233 >gi|168204511|ref|ZP_02630516.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens E str. JGS1987] gi|170663839|gb|EDT16522.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens E str. JGS1987] Length = 928 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 34/324 (10%), Positives = 84/324 (25%), Gaps = 29/324 (8%) Query: 138 DLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE 197 D L +L + + ++ + +++D S SML R + Sbjct: 101 DQPLEYNEGYLTKKAYTTDEDNVFNINLKIQGKKNEALKKDVVFLLDNSNSMLTNNRALK 160 Query: 198 GQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEH 257 + + + + + ++S+ Y + ++ H Sbjct: 161 IKEEIKNVMDKLKAKNTRYALVTYAST--ILDGRHYHLIDRSIGNNQYTVYDLYTSNQCH 218 Query: 258 FVDSSSLRHVIKKKHL------VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 +S+++ + K + N Sbjct: 219 LNFTSNIQEIYNKIPSTVPNQRNNPYAGGTFTQEGLLK-----AIELLKNSNADEKIIIH 273 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 + G+ + + + + + + + + + Y Sbjct: 274 LTDGLPTFSFLLKEFGGNKKAIFDYNTNYNGIGVRGLGTSYFFDTKTQKPYIYSREEVYS 333 Query: 372 VLL---TDGENTQDNEEGIAI-CNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASP 425 L E+ +N + K++ I I TI + K + Y Sbjct: 334 ALNRSKNKYESIWNNGFPTTLEAENIKNENPNINIYTIGVELKKEVYKWDDY-------- 385 Query: 426 NSFFEANSTHELNKI--FRDRIGN 447 ++ A T EL +I F + I + Sbjct: 386 RQYYNAEGTVELPEIRKFLESISS 409 >gi|149177852|ref|ZP_01856451.1| hypothetical protein PM8797T_31975 [Planctomyces maris DSM 8797] gi|148843342|gb|EDL57706.1| hypothetical protein PM8797T_31975 [Planctomyces maris DSM 8797] Length = 759 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 3/67 (4%) Query: 371 IVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 I+LL+DG + + + GI I TI F + Q + + Sbjct: 166 ILLLSDGAERTVPPYDVDAKTQASYFGDTGIPIYTIGFGSSSLQDTTLDLVMESLVIDPL 225 Query: 428 FFEANST 434 FE +T Sbjct: 226 VFEKKNT 232 >gi|1510110|dbj|BAA09092.1| ubiquitin-conjugating enzyme [Paramecium tetraurelia] Length = 425 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 38/107 (35%), Gaps = 12/107 (11%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 + +F + F+ + I+ D + G T DA+ A + + Sbjct: 34 NHIVKLVWFESFITDKCDFTNDFNNFIK-----LVDDASPRGGTKCYDAIAYAIEQL-KE 87 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + + + I+ LTDG++ Q E ++ N+ I I Sbjct: 88 IKKKYPNIILR------IIALTDGDDNQSKENPQSLVNRIFENQIII 128 >gi|317126805|ref|YP_004093087.1| hypothetical protein Bcell_0064 [Bacillus cellulosilyticus DSM 2522] gi|315471753|gb|ADU28356.1| hypothetical protein Bcell_0064 [Bacillus cellulosilyticus DSM 2522] Length = 245 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 38/97 (39%), Gaps = 10/97 (10%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNS 427 + I+LLTDG + + + +AI AK +G+ + I E+ + + A + Sbjct: 7 RQILLLTDGCSNSGD-DPVAIAALAKEEGMTVNVIGIVGEGEMSERGIDEIESIAEAGGG 65 Query: 428 FFEANSTHELN--------KIFRDRIGNEIFERVIRI 456 + +L+ K I + + + +I Sbjct: 66 ISQIVYAKQLSQTVQMVTRKAMTQTIHGVVNKEITQI 102 >gi|206559901|ref|YP_002230665.1| hypothetical protein BCAL1535 [Burkholderia cenocepacia J2315] gi|198035942|emb|CAR51834.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 582 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 14/144 (9%), Positives = 40/144 (27%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFT 84 + A++ ++ + V Y +++ A A + A + + + ++ A ++ Sbjct: 1 MMAIVATLIAITTLGAIGVGNLFYQRRDVQRIADMAALAAVQRMDDACSQPTATATSNAQ 60 Query: 85 FPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPL 144 N E + A + V + + P Sbjct: 61 SNGLNASNGDTINIECGRWDTSVNPAPSYYAAAAAGTTQLNAAKVVVTRQVPFFFVGPPQ 120 Query: 145 SLFLRSMGIKSWLIQTKAEAETVS 168 ++ S + + A + Sbjct: 121 TVSAVSTARSTNIDTFSVGATLAA 144 >gi|91205150|ref|YP_537505.1| hypothetical protein RBE_0335 [Rickettsia bellii RML369-C] gi|91068694|gb|ABE04416.1| unknown [Rickettsia bellii RML369-C] Length = 516 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 24/215 (11%), Positives = 65/215 (30%), Gaps = 34/215 (15%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS-----LRHVIKKKHLVRDALASVIR 283 + ++ + M P E ++ + S + K + ++ + ++ Sbjct: 243 FNGFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISLLIDISGSMEKDFSVYKNNILKILD 302 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + +I N + FND + + + +K + + G T + + Sbjct: 303 KLAEIPNWQINI----VVFNDESTARS---FSNQENNIEDIKVYINNLKANGYTKLYGTI 355 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA------ICNKAKSQG 397 + A ++ ++ +++ TDG++ N K+ Sbjct: 356 KEALESFKGKIDESST-----------LIVFTDGKDEGTNSNVTEKDVVDVTSEVIKNPQ 404 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + T+ F + F A+ F + Sbjct: 405 FNMYTVGFGQYYN-----QEFFEQVATRGGFTHVS 434 >gi|290958909|ref|YP_003490091.1| hypothetical protein SCAB_44831 [Streptomyces scabiei 87.22] gi|260648435|emb|CBG71546.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 708 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 45/153 (29%), Gaps = 17/153 (11%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D TV FF+ + + + G T+ + A++ + Sbjct: 560 DPEKTTVP--VVFFSTELDGTGELTLDAFDNKVDDLHAGLGRM---GRTSYHVAVE---E 611 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ-GIRIMTIAFSV 407 + + +V TDG A+ + A++ G+ +AF Sbjct: 612 VLAQHRKAAAGAPA-------LVVFQTDGAPDAKTPANQALADAAENHPGVHFAFVAFGD 664 Query: 408 NKTQQEKARYFLSNCASPNSFFEANSTHELNKI 440 + + L A+ + F + EL Sbjct: 665 PENKAFDYLRKLKT-ATTSHFLAGETPRELTDA 696 >gi|163745747|ref|ZP_02153107.1| TadE-like protein [Oceanibulbus indolifex HEL-45] gi|161382565|gb|EDQ06974.1| TadE-like protein [Oceanibulbus indolifex HEL-45] Length = 176 Score = 38.8 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 12/49 (24%), Positives = 24/49 (48%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + +S G I +L+P++L + ++ WS E L++A A+ Sbjct: 8 RFRRSEAGGVTIEFVILLPLVLYFFFLALETGLWSAREITLRRATNLAV 56 >gi|268561228|ref|XP_002646395.1| Hypothetical protein CBG15364 [Caenorhabditis briggsae] gi|187027191|emb|CAP33691.1| hypothetical protein CBG_15364 [Caenorhabditis briggsae AF16] Length = 315 Score = 38.8 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 3/76 (3%) Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 +K +V+ ++ + + N+ K+ G+RI+T+A+ + LS A Sbjct: 65 GSHFQKVVVIYASSYRGTGEQDPLPVANRLKTSGVRIITVAYDQGGDGTLLRQ--LSQVA 122 Query: 424 SPN-SFFEANSTHELN 438 SP +F A + + Sbjct: 123 SPGFNFSSAENEGNII 138 >gi|332884780|gb|EGK05036.1| hypothetical protein HMPREF9456_03189 [Dysgonomonas mossii DSM 22836] Length = 342 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 53/185 (28%), Gaps = 37/185 (20%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 ++ F + + ++T + TAI A+ + Sbjct: 128 RKNDKVAIVVFAGEAFIQLPLTPDNQSA-KLFLETINPSLVPVQGTAIGSAIDMSMSCFS 186 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 N+ + K IVL+TDGE + N E A +A S+G+ + + + Sbjct: 187 -----------NDADIDKAIVLITDGEGHEGNAEEAAA--RAASKGVHVNVVGIGTAEGA 233 Query: 412 QEKARYFLSNC-----------------------ASPNSFFEANSTHELNKIFRDRIGNE 448 + A + A++++ K + + Sbjct: 234 MIPEAENSRDIKRDTQGQPVVTKLNEEMCRQIAKAGEGLYAHADNSNSALKSLQAELDKL 293 Query: 449 IFERV 453 + + Sbjct: 294 QKKEI 298 >gi|156974653|ref|YP_001445560.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] gi|156526247|gb|ABU71333.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] Length = 2127 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 31/346 (8%), Positives = 85/346 (24%), Gaps = 22/346 (6%) Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA--VEMNPRKSAY 129 + + F ++ N+ + + D + Sbjct: 1502 TITDNDGDSKAGNFAVNPYAPPVVEGVTLNVSEEGLGNAIADDFALNGFMDTTDSTLDMD 1561 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 Q+ L D + + + ++ + + S E + I + S Sbjct: 1562 QLTLGKTVDSVALNVPTANLTSNGEAITWALSNNDQLLVGSGDGEEVIKISVNDTGAIST 1621 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + V ++ + S + Sbjct: 1622 ELLGPIDHPDTSGEDSLNIEVPVVVSNALGLTNSAVANVVIEDD-SPYSTSIVHDVVAET 1680 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 S + + +D S + +++D+ ++ + I V++ + + Sbjct: 1681 KESANVQLIMDVSGSMRTGNRLQIMKDSATQLLNQYESI--GQTRVQIIKYSGSATTYAI 1738 Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 +W K + G+T N A+ + +D A Sbjct: 1739 AGATWLTVD----EAKAYIDTLTAGGATNYNRAI--------NEAKDSWDDAGKLPSASN 1786 Query: 370 YIVLLTDGENTQ-----DNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 L+DG+ ++ + + +I +A+ + Sbjct: 1787 VSYFLSDGQPNPASSFINDARENSWIDHLTDSDNQITALAYGMGVN 1832 >gi|146340336|ref|YP_001205384.1| hypothetical protein BRADO3363 [Bradyrhizobium sp. ORS278] gi|146193142|emb|CAL77154.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 183 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 31/122 (25%), Gaps = 2/122 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 + + + G + L+ L + L+ + + L+ AQ + Sbjct: 13 RFCRDFLGDRRGATAVEFGLVAAPFLALVIALIQTFLVFFAQQLLESVAQQSARAVMT-- 70 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 Q + QKI + N + + + T + + Sbjct: 71 GQVRASSMTADAFKQNVVCQKIRIFFDCNGMMIDMQVVSSWSAASVSPPTLTFNSSGGVS 130 Query: 129 YQ 130 Sbjct: 131 NT 132 >gi|72162665|ref|YP_290322.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916397|gb|AAZ56299.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 596 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 47/151 (31%), Gaps = 25/151 (16%) Query: 311 SFSWGVHKLIRTIVKTFAIDE---NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 V+ + T A+ + G T + D + AY + + + Sbjct: 451 PLDTDVNGATQHETLTAALAQLAPKPDGDTGLYDTILAAYQELTETYAPDRVNT------ 504 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAK------SQGIRIMTIAFSVNKTQQEKARYFLSN 421 +++LTDG N + + K + + I+ I+F + L Sbjct: 505 ---LLVLTDGNNDDYDSITLEELLKELGEAYQPDRPVSIIAISFG-----PDVDPEPLER 556 Query: 422 CA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 A + + + E+ +IF I Sbjct: 557 IAEVTRGAAYTTEDPTEIGEIFLSAFALRIS 587 >gi|296448098|ref|ZP_06890000.1| TadE family protein [Methylosinus trichosporium OB3b] gi|296254412|gb|EFH01537.1| TadE family protein [Methylosinus trichosporium OB3b] Length = 186 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 32/82 (39%), Gaps = 4/82 (4%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYE----HALKQA 56 M+ ++ + G + AL+ PV++ + ++D Y L A Sbjct: 1 MMRRAAPTDMWRRFARCVAGASALEFALIAPVVVLILTGMIDYGLAVYTRFSMNERLSAA 60 Query: 57 AQTAIITASVPLIQSLEEVSSR 78 A A+++A+ S +++ Sbjct: 61 ANYAMLSAASVGSSSGATLAAT 82 >gi|114568965|ref|YP_755645.1| TadE family protein [Maricaulis maris MCS10] gi|114339427|gb|ABI64707.1| TadE family protein [Maricaulis maris MCS10] Length = 185 Score = 38.4 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 4/69 (5%), Positives = 27/69 (39%), Gaps = 4/69 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +++ +G + A++ + ++++ + L+ A++ ++ + Sbjct: 19 RFVRARSGATAVEFAMIGAPFFLLLFAMIEIAAVFFTGTVLEN----AVLESARKIRTGE 74 Query: 73 EEVSSRAKN 81 + ++ Sbjct: 75 AQTGGMSQA 83 >gi|240145313|ref|ZP_04743914.1| conserved hypothetical protein [Roseburia intestinalis L1-82] gi|257202610|gb|EEV00895.1| conserved hypothetical protein [Roseburia intestinalis L1-82] Length = 182 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 9/72 (12%), Positives = 24/72 (33%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+L+ G + ML + + + +R +K A ++A+++ Sbjct: 8 FKRLLTGQEGSMAPVAVAATIAMLFIILGVSEYMRLVITAAGIKDAMESAVVSVVNDNYN 67 Query: 71 SLEEVSSRAKNS 82 + + Sbjct: 68 EVYHGVREGYAA 79 >gi|224024928|ref|ZP_03643294.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] gi|224018164|gb|EEF76162.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] Length = 339 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 48/183 (26%), Gaps = 36/183 (19%) Query: 293 DTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIIS 352 + ++ F + + + +++ + T I A+ A + Sbjct: 127 NNDKVAMIVFAGEAFAQLPITSDYISA-KMFLESISPSLIATQGTDIRGAIDLAMKSFTP 185 Query: 353 SNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 N + IVL+TDGEN + A A +G+R+ + Sbjct: 186 -----------NEGVGRAIVLITDGENHEGGAVEAAK--AAAEKGVRVFVLGVGSPDGSP 232 Query: 413 EK-----------------ARYFLSNC-----ASPNSFFEANSTHELNKIFRDRIGNEIF 450 R C A + ++T+ K I Sbjct: 233 IPVEGTNEFRRDKDGNVVVTRLNEEMCREIAQAGNGMYVRVDNTNNAEKALNAEINKLAK 292 Query: 451 ERV 453 V Sbjct: 293 ADV 295 >gi|312072922|ref|XP_003139287.1| von Willebrand factor domain-containing protein [Loa loa] gi|307765555|gb|EFO24789.1| von Willebrand factor domain-containing protein [Loa loa] Length = 2142 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 45/125 (36%), Gaps = 11/125 (8%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N+ +R+ A F+ + + I+ + G+T+ + A I+ Sbjct: 937 NNRIRVAAISFSSEAQINFQ--FNEFNNRTEILNALLSLTHSGGNTSSVSGINLAIKEIL 994 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQ 411 + ++ IVL++DG + E+ + ++ + + IA + + Sbjct: 995 ERGR---------EDVRRMIVLMSDGNSQDCWEDLLDASDRLHATNTIVYAIAANPDYYF 1045 Query: 412 QEKAR 416 +E Sbjct: 1046 RELEA 1050 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 54/210 (25%), Gaps = 13/210 (6%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD--SSSLR 265 DR + S + D L M + G + + + S Sbjct: 73 TDRCMHVSESLSRNSKTNDAPLQAAMGTYKTEEQRDEKFGLERTCKAGKIDIMIALDSSG 132 Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 V R +I S+ + + +++ F F Sbjct: 133 SVFNVFEDERKLAHDLIDSLVPVTLKDGRIQVSVMRFASSAEVVIPFKISRTPNEIMEKL 192 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 GST I A+ A + +A + +L++DG + Sbjct: 193 DKIK--FTGGSTRIAKAVDLALTDLS---------RWRRNDAIQIFILISDGNGHELWHV 241 Query: 386 GIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K ++ I + + S + E Sbjct: 242 AQTAGRKLQNANIEVFAVPVSQDHNLNELI 271 >gi|220912876|ref|YP_002488185.1| von Willebrand factor A [Arthrobacter chlorophenolicus A6] gi|219859754|gb|ACL40096.1| von Willebrand factor type A [Arthrobacter chlorophenolicus A6] Length = 333 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 60/178 (33%), Gaps = 18/178 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV-HKLIRTIVKTFAIDENE 333 A A+V+ ++ D R+G T F+ + + + ++ A D Sbjct: 103 SSADAAVVDVFAELAREFDGERIGLTIFDSTGSQVFPLTDDYGYARDQLLLARDAFDGKP 162 Query: 334 M------------GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ 381 GS+ I D + + + + + + ++ + +VL TD + Sbjct: 163 GSSGFLDGTWGGRGSSLIGDGLASCLNGFPRTQDSDAAADNPAVKRSRSVVLATDNYVSG 222 Query: 382 DNEEGIAICNK-AKSQGIRIMTIAFSVNKTQQEKARYFLS----NCASPNSFFEANST 434 + + + AK + +R+ + + + ++ S++ ++ Sbjct: 223 NPILTLPQASALAKDRTVRVYALNPGDLDYGSDAGQPGAQLRVAAESTGGSYYALDNP 280 >gi|114798948|ref|YP_761697.1| Flp/Fap pilin component [Hyphomonas neptunium ATCC 15444] gi|114739122|gb|ABI77247.1| Flp/Fap pilin component [Hyphomonas neptunium ATCC 15444] Length = 205 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 10/103 (9%), Positives = 30/103 (29%), Gaps = 8/103 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 K L+++ G + A++ P+M+ + +++ A + + Sbjct: 19 KRGVKSLLRNEDGVSAVEFAVIAPLMVLIFFGCIELSFLMR--------ADRRVTATASS 70 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE 110 L +++ + + + T E Sbjct: 71 LGDLTSRLATVTDADMRELYNAATVMMQPYPASETRMRITSIE 113 >gi|109896512|ref|YP_659767.1| type II secretory pathway component [Pseudoalteromonas atlantica T6c] gi|109698793|gb|ABG38713.1| type II secretory pathway component [Pseudoalteromonas atlantica T6c] Length = 146 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 14/143 (9%), Positives = 42/143 (29%), Gaps = 6/143 (4%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL-----KQA 56 + + + ++ G +I AL + ++L + G+ + + + + + ++A Sbjct: 1 MSHKRLCAEQPRGLRKQRGSMLVI-ALFIIIVLAILGLAMLRMLAASADSVVSEVYGQRA 59 Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR 116 A + + V + L + + + D + Sbjct: 60 LNAARAGVERSIAAAFPVVGVSNCMASDNFTFTSTPGLANCGYQSSCAVVSVVDDGDSYQ 119 Query: 117 DTAVEMNPRKSAYQVVLSSRYDL 139 + A V+S + Sbjct: 120 YFRFTAQGQCIAGDTVVSRNVSV 142 >gi|78062899|ref|YP_372807.1| TadE-like protein [Burkholderia sp. 383] gi|77970784|gb|ABB12163.1| TadE-like protein [Burkholderia sp. 383] Length = 142 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 34/131 (25%), Gaps = 1/131 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G + L++P +L V ++D + + A++ A + + L Sbjct: 8 ARHARGVVSLEFVLMLPFLLMVLIGIIDTSLILCDKAVITNASREAARAGVMLRVPMLTT 67 Query: 75 VSSRAKNSFTFPKQKIE-EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + I + N T V T + L Sbjct: 68 TQIASIAQNAMQNSLITGGTATTPAVTVTQANGTTAGTALTVTVTYTYSGLVLGSAFSAL 127 Query: 134 SSRYDLLLNPL 144 + + + Sbjct: 128 TGPVTVSATSV 138 >gi|5670335|gb|AAD38177.2|AF152598_6 TadF [Aggregatibacter actinomycetemcomitans] gi|26000720|gb|AAN75216.1| TadF [Aggregatibacter actinomycetemcomitans] gi|307548795|dbj|BAJ19117.1| TadF [Aggregatibacter actinomycetemcomitans] gi|307548810|dbj|BAJ19131.1| TadF [Aggregatibacter actinomycetemcomitans] Length = 200 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 8/54 (14%), Positives = 20/54 (37%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + +KK + G + ++ +++ + L D+V + L A Sbjct: 9 MKKNILTNTKKFFHNKRGAVSLEFLFMLILLVFIFAFLTDLVIVRTTQGKLDNA 62 >gi|311070712|ref|YP_003975635.1| hypothetical protein BATR1942_18935 [Bacillus atrophaeus 1942] gi|310871229|gb|ADP34704.1| hypothetical protein BATR1942_18935 [Bacillus atrophaeus 1942] Length = 245 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 2/70 (2%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFF 429 I+L+TDG + + +A+ AK QGI + I T +A + A Sbjct: 9 ILLITDGCSNHGE-DPLAMAAFAKEQGITVNVIGIMEENTVDHEAMKEVEGIALAGGGVH 67 Query: 430 EANSTHELNK 439 + +L++ Sbjct: 68 QVVYASQLSQ 77 >gi|298255195|ref|ZP_06978781.1| G5 domain family protein [Streptococcus pneumoniae str. Canada MDR_19A] Length = 904 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 36/345 (10%), Positives = 89/345 (25%), Gaps = 11/345 (3%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + R E L A A + S+ QSL + ++ + + I+ +I + Sbjct: 134 LSRAKKLESKLYTEASAANLQTSIQAGQSLLGKADASEAELSAAESSIQSSIIGLELRSN 193 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 T E + VE + S D + P S + + + Sbjct: 194 SNKGTVSETPVAKKANIVEAKEETNPAATTDRSAVDSAVLPTSTAAKVETTSAPASINEI 253 Query: 163 EAETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +S S +++ + +D S ++ + + G Sbjct: 254 LKPGLSLSDARQNPAIRKEDLDKGYSGFRAAPAPTNKLVTNLGNNTVFTDISRGSHTFRG 313 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 R + V+ + + + + + + + Sbjct: 314 NGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFLLDKGNGFGEPSRATISTNR 373 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K I + V + V ++ + + I ++ + Sbjct: 374 LRDQTKDIRKGANFVSHSSYSLTSAVETNSNQTIRFSLPIANPNGDLSVRLRPVT----- 428 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 S+ ++ + N +K ++L D Sbjct: 429 --FNVGQGESGSATSNDPYSNSNYYFREKPLLL--DANPNGGTNN 469 >gi|24374985|ref|NP_719028.1| hypothetical protein SO_3479 [Shewanella oneidensis MR-1] gi|24349714|gb|AAN56472.1|AE015784_5 hypothetical protein SO_3479 [Shewanella oneidensis MR-1] Length = 596 Score = 38.4 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 40/120 (33%), Gaps = 12/120 (10%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T + +A D II K+ K Y+ L+ G ++ N A Q Sbjct: 424 TDGEPTLDSAADGIIE--RLTNGVGKHTSNPKSYLSALS-GWMRTEDVNP----NLAGKQ 476 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +R TI F A + L A +F+A +L + + N + + Sbjct: 477 TVRTFTIGF---SEGAASAEHLLKQTAENGGGKYFDATDASQLRSSLQTALNNILEKNAS 533 >gi|302336645|ref|YP_003801851.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301633830|gb|ADK79257.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 474 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 36/107 (33%), Gaps = 18/107 (16%) Query: 363 NNLEAKKYIVLLTDGENTQDNEE--------------GIAICNKAKSQGIRIMTIAFSVN 408 ++ +++L+DGE+ + + G+ + I F + Sbjct: 196 GERSGRRAVIVLSDGEDYSYATHSGNPHPIYGNQQLSPDEVVEEYIRNGVTLYAIHFGLE 255 Query: 409 KTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIR 455 K Q L + + ++A EL I+ D E +IR Sbjct: 256 KD-QYLGEMALK---TGGAVYDAKDQEELTGIYHDIRQKIDGEYLIR 298 >gi|113866862|ref|YP_725351.1| RND superfamily exporter [Ralstonia eutropha H16] gi|113525638|emb|CAJ91983.1| predicted exporter of the RND superfamily [Ralstonia eutropha H16] Length = 792 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 17/45 (37%), Gaps = 5/45 (11%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQA-----AQTAIITA 64 +T +PVM+ G+ VD + Y ++ A + Sbjct: 672 LTVATLPVMVLAVGIGVDYAFYIYNRLQVQLANGLAFTDALAVAL 716 >gi|78186670|ref|YP_374713.1| hypothetical protein Plut_0798 [Chlorobium luteolum DSM 273] gi|78166572|gb|ABB23670.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 156 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 17/114 (14%), Positives = 30/114 (26%), Gaps = 6/114 (5%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS------V 66 + I+S G + AL++PV L + +V Y + L A + Sbjct: 26 RCIRSRKGSVLVEFALILPVFLALLFGVVSFSAALYNKTVLTMATREGARAGVLFVPDRT 85 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 I + + N+ V V T + Sbjct: 86 DAIIRSSATLAADRVCQNNLISFGAPVSAAVSTNSTPTLDNILTVSATVDFTGI 139 >gi|320034646|gb|EFW16589.1| Ku family DNA helicase [Coccidioides posadasii str. Silveira] Length = 705 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 58/140 (41%), Gaps = 12/140 (8%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF----SWGVHKLIRTIVKTFAI--- 329 + + + ++D M ++ + + + GV L + ++ Sbjct: 12 DVGHSMGKCRGGRTISDLDWMMLYVWDRITTTVSTGRKTATVGVVGLRQMMLMPDIRKLR 71 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 D + +T DA+ + I D + R L+ K+ IVL+TDG +T D++ +I Sbjct: 72 DLVKPSNTNQGDAISSIILAI-----DMIVRYCKRLKYKRKIVLVTDGRSTTDSDGIDSI 126 Query: 390 CNKAKSQGIRIMTIAFSVNK 409 +K K +GI ++ + + Sbjct: 127 VSKIKEEGIELVILGVDFDD 146 >gi|254440736|ref|ZP_05054229.1| hypothetical protein OA307_151 [Octadecabacter antarcticus 307] gi|198250814|gb|EDY75129.1| hypothetical protein OA307_151 [Octadecabacter antarcticus 307] Length = 186 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 3/55 (5%), Positives = 18/55 (32%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + ++ G+ + L+ + + ++ +++ A+ Sbjct: 8 LARFLRLFRRNEDGNPTVEFTLVFLPFIILPVSGFELGLLMTRHVMMERGIDMAV 62 >gi|323529154|ref|YP_004231306.1| TadE family protein [Burkholderia sp. CCGE1001] gi|323386156|gb|ADX58246.1| TadE family protein [Burkholderia sp. CCGE1001] Length = 272 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 15/174 (8%), Positives = 39/174 (22%), Gaps = 8/174 (4%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 +G + ++ P++L V + +V + A++ A+ + + Sbjct: 19 HKESGQSMVEFIIIAPLLLFVCFGTLQLVLLYQAKS----TLDVAVLEAAREGAVNHGSM 74 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY---QVV 132 + N V + Y + Sbjct: 75 QAMRSGLARGLAPIYARQASAEGAAAALANAQADASNFSVITVLNPTPAAINDYAQPRYY 134 Query: 133 LSSRYDLL-LNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 + SL R + S + H + ++ + Sbjct: 135 ADEAATYTEIPNDSLMYRDASVPSTASSGMNIQDANILKIHVHYCYNMYVPLVN 188 >gi|86559780|gb|ABD04185.1| collagen alpha-1 chain, type XII-like protein [Anthopleura elegantissima] Length = 101 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 32/93 (34%), Gaps = 7/93 (7%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEG--IAICNKAKSQGIRIMTIAFSVNKTQQEKAR 416 K ++TDG T+ + + K++G+R+ + N E Sbjct: 9 PAAGMRRNVPKMAFVITDGVQTRRDPITPLATASGRLKAKGVRVYALGIGKNIKDNE--- 65 Query: 417 YFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 L+ AS F S+ +L + I I Sbjct: 66 --LNAIASNRKFVYRTSSFKLLTPLINGIVKSI 96 >gi|153214333|ref|ZP_01949334.1| hypothetical protein A55_A0690 [Vibrio cholerae 1587] gi|124115390|gb|EAY34210.1| hypothetical protein A55_A0690 [Vibrio cholerae 1587] Length = 4127 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 39/374 (10%), Positives = 98/374 (26%), Gaps = 18/374 (4%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ A +F + N + + T M + + Sbjct: 1378 NGQFQLVQGADTVASFALDSSVNPVQGLTSNGVAVTLSAPVDDGNGNLTYTAMAGSVTVF 1437 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + L+S + ++ S + K A + ++ D + Sbjct: 1438 TLTLNSDGTYSFTLSAPVDHALNSDSLTLNFKVIATDFDGDSD-SIVLPVKINDDKPSFI 1496 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYS-------SQNGKVGIRDEKLSPYMVSCNKSLYY 242 L + ++ + + D ++P + + Sbjct: 1497 HRQPLSVNEDDLPQGSDGSKESLSASGQFTVVQGADQVVSYQLDSSINPVQGLTSNGVAV 1556 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASV---IRSIKKIDNVNDTVRMGA 299 +L D + + + + + ++ I D+ ++ A Sbjct: 1557 VLSAPVDDGNGNLTYTATAGGNPIFTLVLNTDGSYTFTLSGPIDHAAGSDSKQLNFKVLA 1616 Query: 300 TFFN---DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNED 356 F+ + + + + D+ GS + +++ + D Sbjct: 1617 VDFDGDTASMNLPVTIIDDKPTITAVQALSVDEDDLTQGSDSSKESLTATGHFTTTQGAD 1676 Query: 357 EVHRMKNNLEAKKYIVLLTDGE----NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 V NL A + + G+ + + G K G I T+ + N + Q Sbjct: 1677 HVVSYTLNLTANPLAGVTSGGQVLTLSDSVDTHGNHTYTATKPDGTLIFTLQLNANGSYQ 1736 Query: 413 EKARYFLSNCASPN 426 L + AS + Sbjct: 1737 FTLSGPLDHAASSD 1750 >gi|116623633|ref|YP_825789.1| TadE family protein [Candidatus Solibacter usitatus Ellin6076] gi|116226795|gb|ABJ85504.1| TadE family protein [Candidatus Solibacter usitatus Ellin6076] Length = 185 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 17/168 (10%), Positives = 43/168 (25%), Gaps = 4/168 (2%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G I L+M +LG +++D+ Y L+ A+ + + + + Sbjct: 14 GVELIEFTLVMMPLLGFIFLVIDLGWTIYKRATLQF----AVREGCRYAVINQLQPLKDS 69 Query: 80 KNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDL 139 + ++ + + L +D I + +A + + Sbjct: 70 NGNNYGMIDSVKYVVQQRAMGFLGSTPSDPGYATIQVRFYDPNSSLTTAVAMPPDCTSTI 129 Query: 140 LLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + N + A + + S Sbjct: 130 VPNKGGYLVEVSVEHYKSQILGAIYKGSTPLDFVARSSDRMEADPSSG 177 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 65/190 (34%), Gaps = 23/190 (12%) Query: 269 KKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR--TIV 324 + ++ AL + S D D + F RV + + + + Sbjct: 367 DRLAQLKRALGQLAGSDVSPTGDRFRDREEVTLMPFGSRVKGVRTHTVPGERPAPVLAAI 426 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + A G TAI ++Q AYD + R + IVL+TDGENT Sbjct: 427 RADAEALTADGDTAIFSSLQAAYDHL------AQRRSALGDDRFTSIVLMTDGENTTGAT 480 Query: 385 --EGIAICNKAK--SQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 + A + + + + I F + R L + A + F+A L+ Sbjct: 481 ASDFDAYYRRLRGPERTAPVFPIVFG------DSDRSQLQSIATLTGGRLFDATKGS-LD 533 Query: 439 KIFRDRIGNE 448 F + G + Sbjct: 534 GAFEEIRGYQ 543 >gi|290243156|ref|YP_003494826.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] gi|288945661|gb|ADC73359.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] Length = 615 Score = 38.4 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 26/67 (38%), Gaps = 1/67 (1%) Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 T + ++ + M E++K ++++TDG AI A+ + + I Sbjct: 516 STGATPMSNAILGVLPSMFARSESRKVMLVITDGAPNDSESAMEAI-RMARDVNVEMYAI 574 Query: 404 AFSVNKT 410 + + Sbjct: 575 GIETDPS 581 >gi|302381762|ref|YP_003817585.1| TadE family protein [Brevundimonas subvibrioides ATCC 15264] gi|302192390|gb|ADK99961.1| TadE family protein [Brevundimonas subvibrioides ATCC 15264] Length = 179 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 9/139 (6%), Positives = 30/139 (21%), Gaps = 4/139 (2%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 +G I A++ + ++ + + L+ A + A+ + + Sbjct: 19 RSGVAAIEFAMVALPFFFMIFAVMQLGLLFVVDSVLEN----ATLQAARLIRTGEASTRN 74 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 F +N + + Sbjct: 75 LTPAQFKTELCSRMSVFSGECAGRATVEVRTITQFRNQTLPDPVVNGQLPTSPPYTNGAA 134 Query: 138 DLLLNPLSLFLRSMGIKSW 156 L+ + + + + Sbjct: 135 SSLILIRVWYKQPLIAPTM 153 >gi|259505645|ref|ZP_05748547.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] gi|259166776|gb|EEW51330.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] Length = 530 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 43/131 (32%), Gaps = 25/131 (19%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T I DA+ AY+ + IVL+TDGE Sbjct: 415 QRQELVDGVTALQAEGGTGIYDALLRAYEQVEPGASIPS------------IVLMTDGEQ 462 Query: 380 TQD--NEEGIAICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANS 433 T + ++ + + I + I + E + N A F+A + Sbjct: 463 TSGLSFGHFQRLYSELPTEKKRIPVFVILYG------EANITEMENLAGLTGGKTFDAMN 516 Query: 434 THELNKIFRDR 444 L + F++ Sbjct: 517 G-GLEEAFKEI 526 >gi|25028093|ref|NP_738147.1| hypothetical protein CE1537 [Corynebacterium efficiens YS-314] gi|23493377|dbj|BAC18347.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] Length = 531 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 43/131 (32%), Gaps = 25/131 (19%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 R + G T I DA+ AY+ + IVL+TDGE Sbjct: 416 QRQELVDGVTALQAEGGTGIYDALLRAYEQVEPGASIPS------------IVLMTDGEQ 463 Query: 380 TQD--NEEGIAICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANS 433 T + ++ + + I + I + E + N A F+A + Sbjct: 464 TSGLSFGHFQRLYSELPTEKKRIPVFVILYG------EANITEMENLAGLTGGKTFDAMN 517 Query: 434 THELNKIFRDR 444 L + F++ Sbjct: 518 G-GLEEAFKEI 527 >gi|283457760|ref|YP_003362348.1| hypothetical protein RMDY18_06960 [Rothia mucilaginosa DY-18] gi|283133763|dbj|BAI64528.1| hypothetical protein RMDY18_06960 [Rothia mucilaginosa DY-18] Length = 197 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 11/119 (9%), Positives = 32/119 (26%), Gaps = 9/119 (7%) Query: 3 FDTKFIFYSKKLIKS------CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + S+ L G + ++ ++L +G + V + L+ Sbjct: 42 LQARARAQSRALASDPLASDPEEGSIAPLIVGMLALLLLIGSVTVAITGAYLQTQHLQDV 101 Query: 57 AQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A + + + + ++ + + E L+ + V Sbjct: 102 ADAQANSITRTI---RTPEEANGSAAWEYASAYLAEIQPGQDFQALRLENVSVDSDQSV 157 >gi|198426173|ref|XP_002130506.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 384 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 64/201 (31%), Gaps = 20/201 (9%) Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDS------SSLRHVIKKKHLVRDAL 278 + +C +S + +D S + ++ Sbjct: 163 YVSPNEADVRTCTESGTWEGIKSCCARPCPPNGNIDVVFVLHASDPGNWKTGLDVIAT-- 220 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI-RTIVKTFAIDENEMGST 337 I+S+ K+ V + + + + +++ A + G++ Sbjct: 221 ---IQSLFKMGKDFARVGFVGDIGHPFESTAMHLNDFSERASLSLAIRSQAFNNTANGAS 277 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 +I + ++ ++ S+ R N A+ IVL+TD N+ + + + G Sbjct: 278 SIAEVLRYVKRSMFST------RNGNRKNAENIIVLVTDQNNSAG--GMTSAAYELRRSG 329 Query: 398 IRIMTIAFSVNKTQQEKARYF 418 I+ + F ++ ++ Sbjct: 330 IKTFVVNFGETESNKDLRNLL 350 >gi|543095|pir||B53274 complement factor B subunit Bb - pig (fragment) Length = 170 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 37/153 (24%), Gaps = 18/153 (11%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + S + + K D T A+ Y+ + + Sbjct: 23 IRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWGVNNFPDNWNRTR 82 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKS-QGIR------------IMTIAFSVNKTQQ 412 IVLLTDG + + + + ++ + I I Q+ Sbjct: 83 H---VIVLLTDGLHNMG-GDSVTVIDEIRDLLNIGKDRKNPREDYLDIYVFGVGPLVNQE 138 Query: 413 EKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 F+ L +F + Sbjct: 139 NINALASKKD-KEQHVFKLKDVDNLEDVFFQML 170 >gi|110632965|ref|YP_673173.1| TadE-like [Mesorhizobium sp. BNC1] gi|110283949|gb|ABG62008.1| TadE-like protein [Chelativorans sp. BNC1] Length = 580 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 39/127 (30%), Gaps = 2/127 (1%) Query: 14 LIKSCTGHFFIITALLMPVM-LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + KS +G + AL+ PV+ + M D+ + + A ++ Sbjct: 434 MAKSESGVSAVEFALIAPVLAFSLVVMA-DIGLALNERMTIDHMLRAGAQAAMSDPGEAQ 492 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ S + L++ + + T P Y++ Sbjct: 493 VQKVLQSMVSQPGAPADVTFDLVKRYCACPENADVPPATAPACSTTCSGSAPPFIFYRLQ 552 Query: 133 LSSRYDL 139 S YD Sbjct: 553 ASKLYDA 559 >gi|145596959|ref|YP_001161256.1| hypothetical protein Strop_4450 [Salinispora tropica CNB-440] gi|145306296|gb|ABP56878.1| hypothetical protein Strop_4450 [Salinispora tropica CNB-440] Length = 161 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 15/125 (12%), Positives = 33/125 (26%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ++ +G + A+ +L V + D A A + Sbjct: 25 RTDSGRVSVFLAIAALGILTVVALAFDGAGQLRALQRADNLASEAARAGGQSIDLVSAIE 84 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + ++ + EYL + +F + +VR R V Sbjct: 85 GGPKRIDEPQARRAVTEYLAAAGASTHTVSFPVVDGETVVRVRVTVTYERSLLGLVGFDD 144 Query: 136 RYDLL 140 ++ Sbjct: 145 TVNVT 149 >gi|307943458|ref|ZP_07658802.1| putative TadE family protein [Roseibium sp. TrichSKD4] gi|307773088|gb|EFO32305.1| putative TadE family protein [Roseibium sp. TrichSKD4] Length = 181 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 14/125 (11%), Positives = 35/125 (28%), Gaps = 6/125 (4%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGV---GGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + LI G + A++ P+ML ++ D R + L A+ S Sbjct: 3 RAFRNLILREGGATAVEFAIMFPLMLVFFINVIVMFDGFRANRA---LSVASHAGSDLLS 59 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + +N D + + + + + Sbjct: 60 RFQENLSSKDIQNVLATTSAIMGQYADKTDPVIVMASIRNPFDNKPDLQLVCSQSNKSGK 119 Query: 126 KSAYQ 130 + Sbjct: 120 ELTKD 124 >gi|253687075|ref|YP_003016265.1| hypothetical protein PC1_0674 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753653|gb|ACT11729.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 186 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 39/145 (26%), Gaps = 4/145 (2%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + +S G A L+PV+L ML ++ R A A A+ + Sbjct: 25 RHWRSTHGVIATEVAFLVPVVLVGVMMLFELARIGLVIAAGSAALDKAVQAF-RLDNLAS 83 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLK---KNFTDREVRDIVRDTAVEMNPRKSAY 129 + + YL + N + + N ++ Sbjct: 84 DSAEQMGTRLKARMVEAGYGYLKEDDLTVSVLHFDNLSQLGGLTNGNSSQDNPNGEETTT 143 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIK 154 V S + ++ + + Sbjct: 144 LPVWSVTVQIKKAFITPLPEVLTLG 168 >gi|119493558|ref|ZP_01624223.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106] gi|119452612|gb|EAW33794.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106] Length = 757 Score = 38.4 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 16/119 (13%), Positives = 31/119 (26%), Gaps = 13/119 (10%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 N F + R++ + G T + + ++ Sbjct: 316 NPNDTFNIMDFAHTTCTLSETPLANSPENRSLAIHYINQLRANGGTELLNGIREVLK--- 372 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + IVLLTDG +N + + K G R+ + + Sbjct: 373 ---------FPELTGRLRSIVLLTDGYIGNENAILSEVQDNLK-PGNRLHSFGVGSSVN 421 >gi|307591436|ref|YP_003900235.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986290|gb|ADN18169.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 441 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 42/138 (30%), Gaps = 14/138 (10%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + + + + + + F+D+ + + D Sbjct: 64 KSIVIESLYQLVRSGRLTQDDHIAIIEFHDQASTLIGLTPATQVFQLENAIARLNDF--S 121 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T + M A + + + +++ TDGE T D E+ I + Sbjct: 122 GGTCMGKGMNEALVLLTNQ-----------SMTSRRVLIFTDGE-TFDEEDCEIIAQQFS 169 Query: 395 SQGIRIMTIAFSVNKTQQ 412 +QGI I + + Sbjct: 170 NQGISITAMGVGDEFNED 187 >gi|73749041|ref|YP_308280.1| hypothetical protein cbdb_A1307 [Dehalococcoides sp. CBDB1] gi|147669807|ref|YP_001214625.1| hypothetical protein DehaBAV1_1168 [Dehalococcoides sp. BAV1] gi|73660757|emb|CAI83364.1| hypothetical protein cbdbA1307 [Dehalococcoides sp. CBDB1] gi|146270755|gb|ABQ17747.1| hypothetical protein DehaBAV1_1168 [Dehalococcoides sp. BAV1] Length = 535 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 29/314 (9%), Positives = 77/314 (24%), Gaps = 16/314 (5%) Query: 11 SKKLIKSCTGH---FFIIT----ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 KKLIK G +I A+++P +L + G + ++ + Sbjct: 4 IKKLIKGEKGASLAIALIFLAIGAIMIPPLLMLIGSGLKQGTTFEDRTGAIYSSDAGVEW 63 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 L E + A + P ++ + + N D+ V Sbjct: 64 VINILKTGG-EGVTDAYGNIGLPNEQDSIRIYNLSDLNGSTVEVTLTYHDVGSYYNVIST 122 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + + + L S+F + I S K + S + ++ + Sbjct: 123 ASLNGKSITTQATLKYQLGGGSVFDNA--ITSLDGDVKLTGSSSVISDPRNPPGAMVYSG 180 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM 243 + + + + + R +S + + Sbjct: 181 GELILTGSSFIEGNVYADDGIDLGWSTPITGDAVTPADIN-RPGNISGTVTTGAAPQLPN 239 Query: 244 LYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFN 303 + ++ + +++ D + + Sbjct: 240 VLTDAEIAAIVTDIQNETAFAAFSPGAITRSSDWNIKYYPV-----PASIYYTAERIQRD 294 Query: 304 DRVISDPSFSWGVH 317 + +D ++ Sbjct: 295 LNINTDTPLTFKSS 308 >gi|320108734|ref|YP_004184324.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319927255|gb|ADV84330.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 318 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 44/147 (29%), Gaps = 9/147 (6%) Query: 291 VNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTI 350 T F+ F+ KLI + + M TA+ D + A Sbjct: 107 NLRTDAAFLQAFDTEATMLQPFTADSTKLISALRHVDERAYSRMTGTALFDTLYRA---- 162 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + + + +I+L TDG + + + + + + FS++ Sbjct: 163 -CRDRFTPQEVHASAPTNNFILLFTDGADNASHAYMRDVIDICQQTHTAVYI--FSLDSG 219 Query: 411 QQEKARYFLSNCA--SPNSFFEANSTH 435 + + L A S F + Sbjct: 220 AHGQPQRVLRELAEQSGGRIFYSKEED 246 >gi|262165253|ref|ZP_06032990.1| protein BatA [Vibrio mimicus VM223] gi|262024969|gb|EEY43637.1| protein BatA [Vibrio mimicus VM223] Length = 335 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 55/178 (30%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + R G F D F+ + + A Sbjct: 122 SRLTAAKKVLRDFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQDVWL--NLLDEA 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + DA+ S + +++LTDG +T + Sbjct: 173 ETGMAGQSTNLGDAIGLGIKVFEQSPS---------TSQDQIMLVLTDGNDTGSFVSPVD 223 Query: 389 ICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 A ++GIRI IA + +Q ++ +S F A +LN+ ++ Sbjct: 224 AAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQ 281 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 63/182 (34%), Gaps = 22/182 (12%) Query: 270 KKHLVRDALASVIRSIKKID----NVNDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIV 324 + L++D ++ +I D ++ D ++ F+ S + G RT + Sbjct: 355 RITLLKDTMSDLISGGATTDLANVSLRDREKVSIIPFSFGPHEVISETLGAVGSPSRTDL 414 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + G T I DA+ AY + IVL+TDGE T Sbjct: 415 QQRVEALQADGGTGIYDAVLAAY------------AESAGGDYIPSIVLMTDGELTAGRT 462 Query: 385 EGIAICN-KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA-SPNSFFEANSTHELNKIFR 442 + A IR +I V + A + F+A + +L++ F+ Sbjct: 463 YDQFLTEWNALPSNIR--SIPVFVILYGEANVADMEQLAATTGGETFDAING-DLDEAFK 519 Query: 443 DR 444 + Sbjct: 520 EI 521 >gi|327270794|ref|XP_003220173.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 917 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 47/149 (31%), Gaps = 21/149 (14%) Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 +G FN + + R + + G I + + A+ Sbjct: 351 WVGIVAFNSKGNIQAGLQKVFSDIERESLTSHLPT-TAAGDCNICEGVNAAFQVFSQ--- 406 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 K IVLLT+GE + + K +SQ I I TIAF + + + Sbjct: 407 ------KLTSTEGCEIVLLTNGEGSDLSPCLS----KIQSQEIIIHTIAFGSKASNELEK 456 Query: 416 RYFLSNCASPNSFFEANSTHE---LNKIF 441 + + F A + + L F Sbjct: 457 LADM----TGGKTFYATDSLDSNGLIDAF 481 >gi|237728581|ref|ZP_04559062.1| TerY1 [Citrobacter sp. 30_2] gi|226910059|gb|EEH95977.1| TerY1 [Citrobacter sp. 30_2] Length = 212 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 49/152 (32%), Gaps = 17/152 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 21 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLTDLLS--------FQMPA 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 G+T++ +A+ +I + K + + L+TDG D +G+ Sbjct: 73 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDDWRKGLNDF 130 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 A++ G+ + + L Sbjct: 131 KAART-GVVVAC------AAGHDADTSVLKEI 155 >gi|167767549|ref|ZP_02439602.1| hypothetical protein CLOSS21_02068 [Clostridium sp. SS2/1] gi|167710841|gb|EDS21420.1| hypothetical protein CLOSS21_02068 [Clostridium sp. SS2/1] gi|291558491|emb|CBL37291.1| hypothetical protein CL2_01850 [butyrate-producing bacterium SSC/2] Length = 480 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 7/37 (18%), Positives = 14/37 (37%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYE 50 +I+ G A+L V L + + V+ + Sbjct: 1 MIRDEQGSMSAFLAMLFLVFLLLISVCVEGIYMYTAR 37 >gi|149278144|ref|ZP_01884282.1| hypothetical protein PBAL39_11322 [Pedobacter sp. BAL39] gi|149230910|gb|EDM36291.1| hypothetical protein PBAL39_11322 [Pedobacter sp. BAL39] Length = 458 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 68/252 (26%), Gaps = 3/252 (1%) Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 AI++AS + E + + + + DI D Sbjct: 138 DAAILSASAHHRTTSSESIVGLPEKAAGIEGAKNQLAANVNQPLAVSAKFPVQKTDIELD 197 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 +EM + + + + D + FL++ IK + K + + V Sbjct: 198 LPLEMKGKNTTSPASATQKKDPVKQSFEEFLKAESIKPQPMMAKTTKQQD--KWEVGVMV 255 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + +M + + + + +G +P M S Sbjct: 256 APSFGNAKKINMGYGLSMAYALS-DKVSISSGLAYNEMGASKENIGNGGASDAPVMTSLA 314 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 S + + + + + L A+ + + I+N +TV Sbjct: 315 SSTRSLQSVDARLTGIDIPLEIRYRFNKKLYANAGLSAFAVLNQRQQNNFIENNVETVPS 374 Query: 298 GATFFNDRVISD 309 T N Sbjct: 375 DFTGGNTATFRT 386 >gi|116622792|ref|YP_824948.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225954|gb|ABJ84663.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 360 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 16/120 (13%), Positives = 37/120 (30%), Gaps = 23/120 (19%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 + + + + +K +V+L+DG +T + A+ + ++ F Sbjct: 197 TVLYDAVLLGSDELMRKQKGRKALVILSDGVDTGSKVSLSSAVESAQRADTLVYSVLFED 256 Query: 408 ---------------------NKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDR 444 T + L + + FFE + L KI+++ Sbjct: 257 REAYGVPGFGGMGRGRGGRRPMPTAPANGKKVLERISTETGGRFFEVSKKEPLEKIYQEI 316 >gi|15897956|ref|NP_342561.1| hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|284173931|ref|ZP_06387900.1| hypothetical protein Ssol98_04630 [Sulfolobus solfataricus 98/2] gi|13814281|gb|AAK41351.1| Hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|261602668|gb|ACX92271.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 380 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 56/189 (29%), Gaps = 35/189 (18%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K L + + + I + V+ F+ V F + + + Sbjct: 54 KIELAKQGAIELFKRIPNGNKVS------FITFSSNVNVIKEF------VDPLDLTNEIL 101 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 G TA+ A+ TA Y++LLTDG T + G Sbjct: 102 QITAGGQTALYTAILTANSLAKKYQMPT------------YLLLLTDGNPTDETNIG--- 146 Query: 390 CNKAKS---QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF-RDRI 445 N K + I++ + + +Q + + + +E+ + + + Sbjct: 147 -NYLKLPYYEKIQVYSFGIGDDYNEQLLQSVSDK---TGGVMYHISDANEIPQKLPQKAV 202 Query: 446 GNEIFERVI 454 + V Sbjct: 203 TQIAAKNVT 211 >gi|313837215|gb|EFS74929.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314971986|gb|EFT16084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 322 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 48/137 (35%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ +A+ + + + + IV+L+DG Sbjct: 152 STDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMA------PGGSKDRVPAAIVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 NTQ +A A + + + TIAF + LS+ A + Sbjct: 206 GNTQGGSPLVAA-THAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDVADRTDA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLQEVYKQ 281 >gi|296270634|ref|YP_003653266.1| extracellular solute-binding protein family 1 [Thermobispora bispora DSM 43833] gi|296093421|gb|ADG89373.1| extracellular solute-binding protein family 1 [Thermobispora bispora DSM 43833] Length = 599 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 11/118 (9%) Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQDNE---EGIAICNKAKS- 395 A+ T + + +M K I+L TDG D I K + Sbjct: 472 AIPTGNTGLNDTLWAAYQKMLKEYTPDKVSTILLFTDGVGNDDPNGGISNEEILRKLRQA 531 Query: 396 ----QGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 + + I+ I+ + K + ++A+ A+ + + S E+ IF + Sbjct: 532 YDPKRPVSILIISVNTTKDE-DRAQMTAIAKATGGAAYFPQSVLEIRNIFLKGFARRL 588 >gi|152983007|ref|YP_001354999.1| hypothetical protein mma_3309 [Janthinobacterium sp. Marseille] gi|151283084|gb|ABR91494.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 690 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 9/151 (5%), Positives = 37/151 (24%), Gaps = 23/151 (15%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ---------------- 58 G ++ A+ + ++ +D+ + + L++ A Sbjct: 21 AAHQRGSIVVMAAIAISTVVITLA-TIDIGYLFFQKRDLQKVADLAALAGAQQLARSSAD 79 Query: 59 ----TAIITASVPLIQSLEEV--SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 ++ + + S + ++ ++ + V Sbjct: 80 TTPCSSALAVAKLNANQQVTQGFSDSEFVATCGSWDPVKNTTAPHYVAYAAGDAAPTAVS 139 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 +V + + V ++ + Sbjct: 140 VLVNRSFPSFFGAWANRNVSATAMATIDAPT 170 >gi|221067366|ref|ZP_03543471.1| TadE family protein [Comamonas testosteroni KF-1] gi|220712389|gb|EED67757.1| TadE family protein [Comamonas testosteroni KF-1] Length = 149 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 9/37 (24%), Positives = 15/37 (40%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + +S G + AL + V L +VD R + Sbjct: 5 RHSRSERGATIVEFALALLVFLMFLFGIVDFSRMLFT 41 >gi|45384390|ref|NP_990268.1| cochlin precursor [Gallus gallus] gi|7387581|sp|O42163|COCH_CHICK RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2293562|gb|AAC62253.1| Coch-5B2 [Gallus gallus] Length = 547 Score = 38.4 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 42/123 (34%), Gaps = 17/123 (13%) Query: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNN 364 FS+ + ++ G TA DA+ + +D Sbjct: 409 TYDQRTEFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFGPVKDGA------ 462 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 K ++V+LTDG+ + D+ G A+ A+ GI + ++ L + AS Sbjct: 463 --NKNFLVILTDGQ-SYDDVRGPAV--AAQKAGITVFSVGV------AWAPLDDLKDMAS 511 Query: 425 PNS 427 Sbjct: 512 EPR 514 >gi|260574617|ref|ZP_05842620.1| Flp pilus assembly protein CpaB [Rhodobacter sp. SW2] gi|259023034|gb|EEW26327.1| Flp pilus assembly protein CpaB [Rhodobacter sp. SW2] Length = 284 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 56/206 (27%), Gaps = 29/206 (14%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI---------QSLEE 74 ++ L++ V L + G V + + + + A + A + PL+ + Sbjct: 3 MVFGLVLVVGLALAGFAVYMAQGFISQTQAELAQERAARAKAGPLVEVYVVNKPLNYGDP 62 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDRE--------------------VRDI 114 ++ +P+ + E + + N + +D Sbjct: 63 LTKEDVQKIYWPQNALPETIFTDEALLFPANAVGQRFVLRQMEKFEPLLAVKVTEPGQDA 122 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 + R A +V +S L+P + E + S K Sbjct: 123 GLTARLAKGMRAFAIKVDATSGVSGFLHPGDNVDVYWTGTANDAGVGGEMTRLIESTIKI 182 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQP 200 V D S S + + Sbjct: 183 IAVDQMANDDRSGSAVVAATVTVEAS 208 >gi|221136524|ref|XP_002171216.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 126 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 15/74 (20%), Positives = 29/74 (39%), Gaps = 6/74 (8%) Query: 368 KKYIVLLTDGENTQDN--EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP 425 K I+LLTDG T E+ I ++ ++ G+ I+ + + E + Sbjct: 24 SKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT----GGK 79 Query: 426 NSFFEANSTHELNK 439 + + A + L Sbjct: 80 KNAYTAATFDSLTD 93 >gi|221135718|ref|XP_002171014.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 132 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 12/105 (11%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDN--EEGIAICNKAK 394 T I+ A++ A + +S + K I+LLTDG T E+ I ++ + Sbjct: 5 TRIDRALRLAQKDMFTSANGGRVGV------SKLIILLTDGSQTPGGDAEDPERIADELR 58 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNK 439 + G+ I+ + + E + + + A + L Sbjct: 59 NDGVVILGVGIGSAVNETELSHIT----GGKKNAYTAATFDSLTD 99 >gi|196233185|ref|ZP_03132032.1| hypothetical protein CfE428DRAFT_5199 [Chthoniobacter flavus Ellin428] gi|196222829|gb|EDY17352.1| hypothetical protein CfE428DRAFT_5199 [Chthoniobacter flavus Ellin428] Length = 252 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 41/158 (25%), Gaps = 15/158 (9%) Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 ++ K + + + + + + ATF + + Sbjct: 81 GNNKDTNEKAREKLWKTIRDTMSKVSAVAETAPYQVGVATFSAGNIDLMAPMGSKKSIVD 140 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 I K + A++ +KYIVL++DG + Sbjct: 141 SAIEKVTFNGVSPELYLGAKRAIEW---------------FSGTPADRKYIVLISDGISN 185 Query: 381 QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 + A + I TI F + + + Sbjct: 186 DKVVSQQDVVQAALKAKVHICTIGFPKSTEARADVQKL 223 >gi|21222916|ref|NP_628695.1| hypothetical protein SCO4532 [Streptomyces coelicolor A3(2)] gi|256785967|ref|ZP_05524398.1| hypothetical protein SlivT_15867 [Streptomyces lividans TK24] gi|289769859|ref|ZP_06529237.1| conserved hypothetical protein [Streptomyces lividans TK24] gi|10178323|emb|CAC08348.1| putative membrane protein [Streptomyces coelicolor A3(2)] gi|289700058|gb|EFD67487.1| conserved hypothetical protein [Streptomyces lividans TK24] Length = 153 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 32/134 (23%), Gaps = 10/134 (7%) Query: 3 FDTKFIFYS--KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 T+ + ++ G + V L + ++D A+ A Sbjct: 1 MRTQVRGWLADRRTRLDDRGSGAGAVIIFALVFLSLSAFVIDGGMSISKRERAADIAEQA 60 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 A+ + R Y N D DI V Sbjct: 61 ARYAAQDI--------DREALYDDVGGPAPINYENCNARVKAFAAEMDMTGADIAATHCV 112 Query: 121 EMNPRKSAYQVVLS 134 + + +V L+ Sbjct: 113 TADAAQVQVEVQLT 126 >gi|197106454|ref|YP_002131831.1| hypothetical protein PHZ_c2993 [Phenylobacterium zucineum HLK1] gi|196479874|gb|ACG79402.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 158 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 36/110 (32%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPL 68 ++L + G + A +P+++ + LV+ R ++ +L+ A Q A ++ L Sbjct: 25 RQLRRLARCRRGAAAVEAAFALPLLVLMLTGLVEGGRLAWTRTSLEFAVQEAARQGALDL 84 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 + + + + +D VR Sbjct: 85 SRCTSGGADPCGALANARNVAAAKASAAGVPASAFTVASDPACGVRVRAA 134 >gi|167527420|ref|XP_001748042.1| hypothetical protein [Monosiga brevicollis MX1] gi|163773460|gb|EDQ87099.1| predicted protein [Monosiga brevicollis MX1] Length = 2575 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 21/291 (7%), Positives = 60/291 (20%), Gaps = 8/291 (2%) Query: 46 WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A++ A + + + ++ L Sbjct: 994 AVRSNATLAYMPAAAVLAAVGEDESAWSGALGTGSLEVQSARDASADKAPQSVSTGLVVG 1053 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 V + V ++ + + + + G+ + ++ + Sbjct: 1054 ICVTMVCVVAAMLVVMRRRQRQSVGHQFNGKASEPSTQGMAMIDVDGMYTTALEPSSGRT 1113 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 ++ + S +M + D E + P + + + Sbjct: 1114 SMGSGRAGMVAALASYNAGESSAMHETTLDGE-----TYDNPRSHRPAQQQTISSMYAVP 1168 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 D+ + + + + S + +V + A V Sbjct: 1169 DDPSMTSTFAFETYDTPNAMEDAISNPVLYDQPNASDAGVYVPDTYDQPNASDAGVYVPD 1228 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 + A D + GV+ Sbjct: 1229 TYDQPNASNAGVYAP---DTYDQPNASGAGVYVPDTYDQPNATSSLTPETY 1276 >gi|240255600|ref|NP_680085.3| von Willebrand factor A domain-containing protein 1 precursor [Mus musculus] gi|81901574|sp|Q8R2Z5|VWA1_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 1; AltName: Full=von Willebrand factor A domain-related protein; Flags: Precursor gi|20071274|gb|AAH26919.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|26346104|dbj|BAC36703.1| unnamed protein product [Mus musculus] gi|54611403|gb|AAH36166.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|123295149|emb|CAM18384.1| von Willebrand factor A domain containing 1 [Mus musculus] gi|148683083|gb|EDL15030.1| von Willebrand factor A domain containing 1 [Mus musculus] Length = 415 Score = 38.4 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 34/100 (34%), Gaps = 9/100 (9%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F++ + + I + MG T A+ A + + + + K Sbjct: 84 TEFTFDQYSSGQAIQDAIRVAPQRMGDTNTGLALAYAKEQLFAEEAGARPGV------PK 137 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 +V +TDG ++ + + K G+ I ++ Sbjct: 138 VLVWVTDGGSSDPVGPPM---QELKDLGVTIFIVSTGRGN 174 >gi|271962857|ref|YP_003337053.1| hypothetical protein Sros_1312 [Streptosporangium roseum DSM 43021] gi|270506032|gb|ACZ84310.1| hypothetical protein Sros_1312 [Streptosporangium roseum DSM 43021] Length = 156 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 11/146 (7%), Positives = 33/146 (22%), Gaps = 15/146 (10%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 ++ + G + T + + + G+LVD Sbjct: 9 TSRVR---RGSAARERGSMSVFTVIFSVAVFLLAGLLVDGGAAINSRLR----------- 54 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 + + + ++ + + + D V+ Sbjct: 55 -AADIAEQAARAAADQIDVEHLRSTGQARLMEEGTICAKAEEVVSAHATDDVQLAECTAG 113 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLR 149 ++ V +S ++ F Sbjct: 114 GGQAEVTVRVSVSWEAFFLAAFGFPG 139 >gi|262200403|ref|YP_003271611.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262083750|gb|ACY19718.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 423 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 43/157 (27%), Gaps = 24/157 (15%) Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + ALA VI + D G F+D + + Sbjct: 59 AQRALAGVIGQLDPRD------VFGVVTFDDDAQVVLPAAPL---ADKARAVDAVGSIVP 109 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICN-- 391 G T ++ + + R +++++DG + + + Sbjct: 110 GGCTDLSSGYLRGLQELRRATASAGIRGGT-------VLVISDGHVNRGIRDLDEFASIT 162 Query: 392 -KAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 KA + GI T+ + LS A + Sbjct: 163 AKAAADGIITSTLGYGRGYD-----ETLLSAIARSGN 194 >gi|237650229|ref|ZP_04524481.1| G5 domain family protein [Streptococcus pneumoniae CCRI 1974] Length = 1900 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 36/345 (10%), Positives = 89/345 (25%), Gaps = 11/345 (3%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + R E L A A + S+ QSL + ++ + + I+ +I + Sbjct: 134 LSRAKKLESKLYTEASAANLQTSIQAGQSLLGKADASEAELSAAESSIQSSIIGLELRSN 193 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 T E + VE + S D + P S + + + Sbjct: 194 SNKGTVSETPVAKKANIVEAKEETNPAATTDRSAVDSAVLPTSTAAKVETTSAPASINEI 253 Query: 163 EAETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +S S +++ + +D S ++ + + G Sbjct: 254 LKPGLSLSDARQNPAIRKEDLDKGYSGFRAAPAPTNKLVTNLGNNTVFTDISRGSHTFRG 313 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 R + V+ + + + + + + + Sbjct: 314 NGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFLLDKGNGFGEPSRATISTNR 373 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K I + V + V ++ + + I ++ + Sbjct: 374 LRDQTKDIRKGANFVSHSSYSLTSAVETNSNQTIRFSLPIANPNGDLSVRLRPVT----- 428 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 S+ ++ + N +K ++L D Sbjct: 429 --FNVGQGESGSATSNDPYSNSNYYFREKPLLL--DANPNGGTNN 469 >gi|197118197|ref|YP_002138624.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087557|gb|ACH38828.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 318 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 48/152 (31%), Gaps = 21/152 (13%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + ++A+ + +K R+G F + Sbjct: 88 DTSKSMLTRDLGQSRLAAAKEAVRQAMAGLKGE-------RVGLVVFAGSAFLVCPLTTD 140 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + + + + T++ A++ A + ++ K +VLL+ Sbjct: 141 YALFDQVLKEAGEETL-PLPGTSLAAALKEARRALQGEGDEP-----------KVVVLLS 188 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSV 407 DGE+ + A + G+++ +A Sbjct: 189 DGEDHEGEYVAAAR--ALNAAGVKLYAVAAGT 218 >gi|157827515|ref|YP_001496579.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] gi|157802819|gb|ABV79542.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] Length = 446 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 24/215 (11%), Positives = 65/215 (30%), Gaps = 34/215 (15%) Query: 229 LSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSS-----LRHVIKKKHLVRDALASVIR 283 + ++ + M P E ++ + S + K + ++ + ++ Sbjct: 243 FNGFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISLLIDISGSMEKDFSVYKNNILKILD 302 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + +I N + FND + + + +K + + G T + + Sbjct: 303 KLAEIPNWQINI----VVFNDESTARS---FSNQENNIEDIKVYINNLKANGYTKLYGTI 355 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA------ICNKAKSQG 397 + A ++ ++ +++ TDG++ N K+ Sbjct: 356 KEALESFKGKIDESST-----------LIVFTDGKDEGTNSNVTEKDVVDVTSEVIKNPQ 404 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN 432 + T+ F + F A+ F + Sbjct: 405 FNMYTVGFGQYYN-----QEFFEQVATRGGFTHVS 434 >gi|227823965|ref|YP_002827938.1| pilus assembly protein contains TadE domain [Sinorhizobium fredii NGR234] gi|227342967|gb|ACP27185.1| pilus assembly protein contains TadE domain [Sinorhizobium fredii NGR234] Length = 188 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 12/133 (9%), Positives = 28/133 (21%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 +L G + A++ P+++ ++ + +A+ S Sbjct: 13 RLSTDRRGVGGVEFAIVAPILIMAYIGAFELSVGLNVVRKVARASSAVADLVSQEASVDT 72 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S + + T V P S V Sbjct: 73 AFLDSMNNVAESILAPYAGTDYTLKITGIQVTGTTTGTVLWSRDQDGGTPYPANSTTTVP 132 Query: 133 LSSRYDLLLNPLS 145 + Sbjct: 133 SDLEAVNAFVVRT 145 >gi|123473503|ref|XP_001319939.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121902734|gb|EAY07716.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 161 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 38/147 (25%), Gaps = 21/147 (14%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R F + + + + MG T I +Q D S Sbjct: 24 CRFSILEFGSHFETVLPPCDYTDENVANAMNLLDNISANMGGTNILAPLQHVSDLQASEG 83 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGI-RIMTIAFSVNKTQQ 412 K I LTDGE + IC A K++ RI +I Sbjct: 84 F------------VKQIFFLTDGEVDNSDI----ICATAQKNRSTNRIFSIGLGSGADPG 127 Query: 413 EKARYFLSNCASPNSFFEANSTHELNK 439 S ++ +N+ Sbjct: 128 LIKGMARK---SGGNYAIIGDNDNMNE 151 >gi|229825155|ref|ZP_04451224.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] gi|229790527|gb|EEP26641.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] Length = 1109 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 25/70 (35%), Gaps = 12/70 (17%) Query: 383 NEEGIAICNKAK-SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIF 441 IA NK K G+ I ++ + E L AS N ++ ++ I Sbjct: 286 KTNTIAEANKIKTDSGVEIFSVGIKADGDAAE----VLKKIASDNRYY-------VDNIL 334 Query: 442 RDRIGNEIFE 451 +D I I Sbjct: 335 KDVIDKVIES 344 >gi|156404021|ref|XP_001640206.1| predicted protein [Nematostella vectensis] gi|156227339|gb|EDO48143.1| predicted protein [Nematostella vectensis] Length = 3546 Score = 38.4 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 48/154 (31%), Gaps = 18/154 (11%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK--------TF 327 + ++ I V D ++ V + + V I Sbjct: 2218 SSNSTFKTMTDSIKKVGDKFGTDRFHYSIVVYGNS--TKNVSSFDTIIPYEEALEEEVNK 2275 Query: 328 AIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGI 387 AI ++ G++ + +A+ A + + N ++K +V++TD + + Sbjct: 2276 AIRQSLSGNSNLEEALTNAKKILNTPN--------TRKSSRKAVVIITDNTSGKTPTVLK 2327 Query: 388 AICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 QGI +++ + E L Sbjct: 2328 DTAKPLHEQGIIVVSSGIGDDPKYDELRSMTLRK 2361 >gi|324504675|gb|ADY42017.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 898 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 15/143 (10%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + + D VR+ A + F+ G + I + + + + G T A++ Sbjct: 28 VDALPVDRDGVRIAAIQYA--GFPLTEFALGTYLNADDIRQHLSQIKFQSGVTRTGYALR 85 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + +A K IVL TDG + D + K ++I ++ Sbjct: 86 KADSELF------RQERGARSDAIKIIVLFTDGLSIDDPLKPAHELRDIKR--VKIYVVS 137 Query: 405 FSVNKTQQEKARYFLSNCASPNS 427 + + E ++ A Sbjct: 138 VGSDGFEPE-----MNRIAGDKR 155 >gi|302870519|ref|YP_003839156.1| hypothetical protein Micau_6084 [Micromonospora aurantiaca ATCC 27029] gi|302573378|gb|ADL49580.1| hypothetical protein Micau_6084 [Micromonospora aurantiaca ATCC 27029] Length = 148 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 6/48 (12%), Positives = 18/48 (37%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 + ++ G T + ++L + G+++D+ + L Sbjct: 3 RLRTRLRQARHDDRGQITPWTIFGVVLVLILAGLVLDLGLGMSDKVRL 50 >gi|118463538|ref|YP_880101.1| 17 kDa surface antigen family protein [Mycobacterium avium 104] gi|118164825|gb|ABK65722.1| 17 kDa surface antigen family protein [Mycobacterium avium 104] Length = 554 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 22/249 (8%), Positives = 55/249 (22%), Gaps = 1/249 (0%) Query: 36 VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLI 95 G VD +S A+ A +A+ +A + S + + + Sbjct: 115 AVGSAVDSAVYSAVGSAVDSAVGSAVGSAVGSAVGSAVDSAVYSAVGSAVDSAVRSAVRS 174 Query: 96 RNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKS 155 + + + + SA + + S + + S Sbjct: 175 AVYSAVYSAVGSAVGSAVGSAVDSAVGSAVDSAVRSAVDSAVGSAVGSAVDSAVDSAVDS 234 Query: 156 WLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSY 215 + A + + V+ ++ + + A + Sbjct: 235 AVGSAVYSAVYSAVGSAVDSAVASAVGSAVDSAVDSAVGSAVYSAVRSAVDSAVDSAVDS 294 Query: 216 SSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVR 275 + ++ + + S S Y R Sbjct: 295 AVRSAVDSAVYSAVGSAVYSAVYSAVYSALGSAPIKPF-WHQLFGGRQWCWWPAFIAYFR 353 Query: 276 DALASVIRS 284 D + + S Sbjct: 354 DVVELQLDS 362 >gi|183597689|ref|ZP_02959182.1| hypothetical protein PROSTU_00982 [Providencia stuartii ATCC 25827] gi|188022961|gb|EDU61001.1| hypothetical protein PROSTU_00982 [Providencia stuartii ATCC 25827] Length = 473 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 42/113 (37%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 ++ ++ T N+++ +D S+ + + K + + TA N T T Sbjct: 111 PLHHYGKILWTGMNEKIGADTSYRIQGRVEVPSSEKDLELTREWIKKTAENPGFDTPLRT 170 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + E+ R+ N K D + + N AKS GIRI T Sbjct: 171 RMIEGEELAKRLPNAQTPWKIGGFEEDAGSLDPEVVTPTMANYAKSIGIRIYT 223 >gi|332974518|gb|EGK11438.1| PilC protein [Kingella kingae ATCC 23330] Length = 1328 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 33/155 (21%), Gaps = 16/155 (10%) Query: 305 RVISDPSFSWGVHKLI--RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 ++ L + + + + + + + Sbjct: 276 YWYTESPLRNLTSDLDFLDSDSAFYTPSKRTAYGLYGMYPIFNYQPILAPQTQYAGEMLS 335 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 E I L T G + +Q I TI F L Sbjct: 336 FFSEPINKIDLRTVGNDDTGKSWNDP---AFPTQNITTFTIGFGSGL--TSSGLALLKGG 390 Query: 423 ASP---------NSFFEANSTHELNKIFRDRIGNE 448 AS +++ AN L K F Sbjct: 391 ASDIKLKDGTVQKAYYPANDQAGLQKAFDSIFDQI 425 >gi|296131396|ref|YP_003638646.1| Monophenol monooxygenase [Cellulomonas flavigena DSM 20109] gi|296023211|gb|ADG76447.1| Monophenol monooxygenase [Cellulomonas flavigena DSM 20109] Length = 971 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 38/150 (25%), Gaps = 21/150 (14%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ V ++ D +G F + L + T Sbjct: 476 TKSTLLKRAVGVVHSLMQPTD------EIGIARFGTTADVVLPMTAASAGLGTVLTGT-- 527 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + G+TA+ +Q I + K ++++TDG Sbjct: 528 -ALDPAGATALGRGLQEGSGLI---------NGPGATKPNKAVIVMTDGNENIPPFVDDL 577 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 I + + Sbjct: 578 PAGTVSQ---TTFAIGLGLPGQVSDPVLDA 604 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + + R G F D F+ + + + A Sbjct: 122 SRLTAAKKVLRNFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQNVWL--NLLDEA 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + DA+ S + +++LTDG +T + Sbjct: 173 ETGMAGQSTNLGDAIGLGIKVFEQSPS---------TSQDQIMLVLTDGNDTGSFVSPVD 223 Query: 389 ICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 A ++GIRI IA + +Q ++ +S F A +LN+ ++ Sbjct: 224 AAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQ 281 >gi|239817564|ref|YP_002946474.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] gi|239804141|gb|ACS21208.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] Length = 1867 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 38/125 (30%), Gaps = 18/125 (14%) Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + ++A+ ++I VR+ F+ S + K Sbjct: 1438 TPTRLSRAKEAIQNLIDGYDL----YGDVRVQLVTFSTTGASQQ------AWMTAAEAKA 1487 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + GST + A+ A + ++ A+ LTDGE T + Sbjct: 1488 LVQNLQAAGSTNYDAALAAAMNGFSAT--------GKLDGAQNVSYFLTDGEPTLGDGNT 1539 Query: 387 IAICN 391 + N Sbjct: 1540 AQLAN 1544 >gi|163738632|ref|ZP_02146046.1| hypothetical protein RGBS107_11427 [Phaeobacter gallaeciensis BS107] gi|161387960|gb|EDQ12315.1| hypothetical protein RGBS107_11427 [Phaeobacter gallaeciensis BS107] Length = 199 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 14/37 (37%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGM 39 T+ + + G + A MP++LGV Sbjct: 1 MLTRIRTLFYRYRRETDGSVSVEFAFYMPLLLGVFAA 37 >gi|160897507|ref|YP_001563089.1| TadE family protein [Delftia acidovorans SPH-1] gi|160363091|gb|ABX34704.1| TadE family protein [Delftia acidovorans SPH-1] Length = 180 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 30/106 (28%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G + AL+ VM+ + + R + +L +A + Sbjct: 32 LRRRQRGVAAVEFALIAGVMVMLLMGMFVYWRVLQAQQSLARATGDGARMVQHLAFGGMA 91 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 + Q + + ++ + T I T+ Sbjct: 92 GYNPAKPAERANILQATSDVVNQSLAASGIPAGTAPTQIQIDWGTS 137 >gi|87310695|ref|ZP_01092823.1| BatB [Blastopirellula marina DSM 3645] gi|87286676|gb|EAQ78582.1| BatB [Blastopirellula marina DSM 3645] Length = 364 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 40/125 (32%), Gaps = 15/125 (12%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 + IK + V R+G F + +T+ GS + DA Sbjct: 118 QQIKDMLKVMAGDRVGLIAFAGETRQSVPLTSHYEDFKQTLDSVGPHTVRSGGS-RLGDA 176 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-KSQGIRIM 401 + A I K IV+ TDGE+ + + + +GIR+ Sbjct: 177 IAAASKGFIDK-----------TYDHKAIVVFTDGED--QESKPVEVAKSLYAERGIRVF 223 Query: 402 TIAFS 406 T+ Sbjct: 224 TVGLG 228 >gi|167589629|ref|ZP_02382017.1| TadE family protein [Burkholderia ubonensis Bu] Length = 142 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 35/130 (26%), Gaps = 1/130 (0%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + G + L++P +L V ++DV + + A++ A V + L Sbjct: 9 RHARGVISLEFVLMLPFLLMVLLGIIDVSLILCDKAVITNASREAARAGVVVRVPMLTTT 68 Query: 76 SSRAKNSFTFPKQKIEEYLIRN-FENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 I N + N T V T + L+ Sbjct: 69 QVANVALNYTQNSLITGGAATTPTVNVTQANGTTSGSALTVTVTYTYTGLVLGSAFSALT 128 Query: 135 SRYDLLLNPL 144 + + Sbjct: 129 GPVTVSATSV 138 >gi|159044811|ref|YP_001533605.1| hypothetical protein Dshi_2268 [Dinoroseobacter shibae DFL 12] gi|157912571|gb|ABV94004.1| hypothetical protein Dshi_2268 [Dinoroseobacter shibae DFL 12] Length = 174 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 6/52 (11%), Positives = 19/52 (36%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 + K L + G + ++ P+++ V + + +++A Sbjct: 2 IRGFRKFLCRDERGTATVEFVIVFPLIIAVFMSTFEAAMLTAKYTMMERALD 53 >gi|56418593|ref|YP_145911.1| hypothetical protein GK0058 [Geobacillus kaustophilus HTA426] gi|261417558|ref|YP_003251240.1| hypothetical protein GYMC61_0058 [Geobacillus sp. Y412MC61] gi|297528433|ref|YP_003669708.1| hypothetical protein GC56T3_0057 [Geobacillus sp. C56-T3] gi|319765215|ref|YP_004130716.1| hypothetical protein GYMC52_0058 [Geobacillus sp. Y412MC52] gi|56378435|dbj|BAD74343.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426] gi|261374015|gb|ACX76758.1| conserved hypothetical protein [Geobacillus sp. Y412MC61] gi|297251685|gb|ADI25131.1| hypothetical protein GC56T3_0057 [Geobacillus sp. C56-T3] gi|317110081|gb|ADU92573.1| hypothetical protein GYMC52_0058 [Geobacillus sp. Y412MC52] Length = 246 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 1/58 (1%) Query: 369 KYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 + I+L+TDG + + A+ A+ QGI + I E R + A+ Sbjct: 7 RQILLITDGCSNHGE-DPAAMAALAREQGITVNVIGILDQGAMDENGRREIEAIAAAG 63 >gi|198424466|ref|XP_002124191.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 25/108 (23%), Positives = 48/108 (44%), Gaps = 3/108 (2%) Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 T +N A I + E + + KN +AKK +VL+TDG+ ++ ++ +S Sbjct: 279 THLNAANTFTSLAIRRAVEFDFAQSKNKDKAKKILVLITDGQANFQSQLIVSY-RLTQSH 337 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDR 444 I + I ++ + + + S +AN+ EL+K R+ Sbjct: 338 NITVYAIGVALKSDAELRISA--NGGVSKERVLDANNYSELSKALRNL 383 >gi|309774616|ref|ZP_07669641.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] gi|308917647|gb|EFP63362.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] Length = 613 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 25/255 (9%), Positives = 69/255 (27%), Gaps = 35/255 (13%) Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + L + +D+S ++ ++ ++ + Sbjct: 24 NTYDITLEVVTEQNLSKLTKKSATILVIDTSGSMGDYQRLINAKNTAKEFVKKYAGDEPD 83 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + F +++ + G T ++ ++ A Sbjct: 84 SGRYLAI-VNFATNTNIILNWTDVSSADGKASADNAINQLYADGGTNLHAGIKQASQLFD 142 Query: 352 SSN----EDEVHRMKNNLEAKKYI---------VLLTD----------GENTQDNEEGIA 388 + E + + + Y+ + T G + E I Sbjct: 143 DTAIQDIESKNTIVLTDGAPTYYLKNCTSDINCIFYTHVTISNTRYHVGGDGDKGSETIN 202 Query: 389 ICNKAKSQ----GIRIMTIAFSVN-----KTQQEKARYFLSNCASP-NSFFEANSTHELN 438 A+++ + TI + + + + Y +N AS + ++A+ + +L Sbjct: 203 DATAAEAKTLKGKSTVYTICYGASREMTYQGGPTVSAYLKNNIASEMKNAYDADDSDDLV 262 Query: 439 KIFRDRIGNEIFERV 453 F+ I I + Sbjct: 263 DAFK-AITETITSGL 276 >gi|86358602|ref|YP_470494.1| hypothetical protein RHE_CH03000 [Rhizobium etli CFN 42] gi|86282704|gb|ABC91767.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 780 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 45/168 (26%), Gaps = 15/168 (8%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I +N R F+D + R + G T + A++ A Sbjct: 367 AISRLNPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAVAYVRSLTADGGTEMLPALEDA 426 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 427 ---LRNQGPVASGAL-------RQVVFLTDGAIGNEQQLFQEI--TANRGDARVFTVGIG 474 Query: 407 VNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + +F + ST ++ + + Sbjct: 475 SAPNTYFMTKAAE---VGRGTFTQIGSTDQVASRMSELFAKLQNPTMT 519 >gi|295698899|ref|YP_003606792.1| von Willebrand factor A [Burkholderia sp. CCGE1002] gi|295438112|gb|ADG17281.1| von Willebrand factor type A [Burkholderia sp. CCGE1002] Length = 328 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 43/175 (24%), Gaps = 27/175 (15%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F + F++ + + T T ++ M A Sbjct: 130 NDRLAFMMFGTNPVLAMPFTYNHRVIEAAVAATAIGRGMPD--TELDRGMLAAIAQF--- 184 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ IVL++DG D I + I + I + + Sbjct: 185 -------DGRLSSGRRAIVLVSDGGALLDERMQRRIEEGLRRDRIALYFIYLRSSVFSPD 237 Query: 414 KARYFLSNCASPN--------------SFFEANSTHELNKIFRDRIGNEIFERVI 454 ++ +S F+A + + I + Sbjct: 238 LNATQPASGSSAEAQLHRFFLTLKTPYRLFQAEDPKAMMAAIAE-INRQQNATTT 291 >gi|225861379|ref|YP_002742888.1| G5 domain family [Streptococcus pneumoniae Taiwan19F-14] gi|298230070|ref|ZP_06963751.1| G5 domain family protein [Streptococcus pneumoniae str. Canada MDR_19F] gi|298503284|ref|YP_003725224.1| G5 domain family protein [Streptococcus pneumoniae TCH8431/19A] gi|225728324|gb|ACO24175.1| G5 domain family [Streptococcus pneumoniae Taiwan19F-14] gi|298238879|gb|ADI70010.1| G5 domain family protein [Streptococcus pneumoniae TCH8431/19A] Length = 1873 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 36/345 (10%), Positives = 89/345 (25%), Gaps = 11/345 (3%) Query: 43 VVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNL 102 + R E L A A + S+ QSL + ++ + + I+ +I + Sbjct: 134 LSRAKKLESKLYTEASAANLQTSIQAGQSLLGKADASEAELSAAESSIQSSIIGLELRSN 193 Query: 103 KKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKA 162 T E + VE + S D + P S + + + Sbjct: 194 SNKGTVSETPVAKKANIVEAKEETNPAATTDRSAVDSAVLPTSTAAKVETTSAPASINEI 253 Query: 163 EAETVSRSYHKEHGVSIQWVIDF--SRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNG 220 +S S +++ + +D S ++ + + G Sbjct: 254 LKPGLSLSDARQNPAIRKEDLDKGYSGFRAAPAPTNKLVTNLGNNTVFTDISRGSHTFRG 313 Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 R + V+ + + + + + + + Sbjct: 314 NGNSRGGNPIHFDVTTTRVGNRVEFSIAYSGPGEFVNNNFLLDKGNGFGEPSRATISTNR 373 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K I + V + V ++ + + I ++ + Sbjct: 374 LRDQTKDIRKGANFVSHSSYSLTSAVETNSNQTIRFSLPIANPNGDLSVRLRPVT----- 428 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 S+ ++ + N +K ++L D Sbjct: 429 --FNVGQGESGSATSNDPYSNSNYYFREKPLLL--DANPNGGTNN 469 >gi|188026532|ref|ZP_02962486.2| hypothetical protein PROSTU_04608 [Providencia stuartii ATCC 25827] gi|188019325|gb|EDU57365.1| hypothetical protein PROSTU_04608 [Providencia stuartii ATCC 25827] Length = 197 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 41/112 (36%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++++ +T + FN + ++ D Sbjct: 6 IEAVKNGVQTLLSTLRQDPYALETAHVSIITFNSTAQQIVPLTD--------LINFSLPD 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+TA+ DA+ I + + K + + ++TDG T D Sbjct: 58 LQASGTTALGDALSVVAHCIENEVQRTTVESKGDWRP--LVFIMTDGAPTDD 107 >gi|126738777|ref|ZP_01754473.1| hypothetical protein RSK20926_02634 [Roseobacter sp. SK209-2-6] gi|126719958|gb|EBA16665.1| hypothetical protein RSK20926_02634 [Roseobacter sp. SK209-2-6] Length = 179 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 10/60 (16%), Positives = 25/60 (41%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + I K + G+ I +L P++L + V++ + + L++A + Sbjct: 1 MIRSGLIKLLCKFRRREDGNATIEFVILAPLLLLMLVSTVELGMINLRQSQLERALDITV 60 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 16/135 (11%), Positives = 40/135 (29%), Gaps = 11/135 (8%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + + +D MG + D ++ + + + + + G + + Sbjct: 386 LKDVAQSLDVGVAGPLMGIIQYGDDPSTEFNLK--TYANSKDLKNAIEKILQKGGLSNVG 443 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A+ + N +++ DG T EE + A+ GI I Sbjct: 444 KALSFVNKNFFEDSNGNRGAAPN------VAIVMVDGWPTDKVEEASRL---ARESGINI 494 Query: 401 MTIAFSVNKTQQEKA 415 + +++ Sbjct: 495 FFVTIEGPDENEKQN 509 >gi|168699404|ref|ZP_02731681.1| von Willebrand factor, type A [Gemmata obscuriglobus UQM 2246] Length = 367 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 18/148 (12%), Positives = 47/148 (31%), Gaps = 6/148 (4%) Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + R ++ +++ R+G F R + R Sbjct: 129 ADPGAKSRWEAARSGALDLLAAME----RRGGHRVGVVLFAARPKLVCPLTTDYKHA-RA 183 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN-TQ 381 +++ +A T+ ++ V + IVL++DG++ + Sbjct: 184 VLRAVNGRFPPPECRPGPEADATSGTRFGAALVAAVAAHDPRFVGAQDIVLISDGDDPEE 243 Query: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + E + N A++ + + T+ Sbjct: 244 SDREWVRGANAARTANVPVHTVGVGNPG 271 >gi|220918111|ref|YP_002493415.1| hypothetical protein A2cp1_3014 [Anaeromyxobacter dehalogenans 2CP-1] gi|219955965|gb|ACL66349.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 408 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 6/98 (6%), Positives = 29/98 (29%) Query: 32 VMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIE 91 ++ + ++V L+ A+ + ++ L + + + ++ + Sbjct: 1 MLCAFLALALNVGHLFSVRGELQNASDAGALAGAIELDGRTAKFGAARAVAASYAAEHFT 60 Query: 92 EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 + +++ D + +P + Sbjct: 61 DSGTGVVAESVELGQWATPSEAPSCDQRGDASPDGHRF 98 >gi|332298718|ref|YP_004440640.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181821|gb|AEE17509.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 566 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 53/165 (32%), Gaps = 21/165 (12%) Query: 248 PLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVI 307 + + +++ + + V+ A+++ ++ + + Sbjct: 95 SRSMTAKDVSSGAGAAVSKPVSRLDAVKLYAAALLDRMEGTEVSAVIAKGSGV------- 147 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 + ++ + + GS +I ++ A ++ A Sbjct: 148 VAVPLTSDLNAVRSLLPSLSPALLTSTGS-SIGSGIEAAVNSFPP-----------LSAA 195 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 + IV+ TDGE T A+ + K GI ++ + F + + Sbjct: 196 ARTIVVFTDGEETDGR-MSGAVADALK-FGIPVVFVGFGSDTESE 238 >gi|254167839|ref|ZP_04874688.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623130|gb|EDY35696.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 1953 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 66/237 (27%), Gaps = 22/237 (9%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 E + E + D S +++ + + Y++ + Sbjct: 1164 EATVLDINGESPLDFINRRDVSTNVVGDWDWTLSGDTDSDNVYTQDGNTFYTNDTKVLNF 1223 Query: 225 RDEKLSPYMVS-------------CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKK 271 + ++ + G ++ + D + + Sbjct: 1224 TGGAYHSASSAFWMPEAPRLNKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRI 1283 Query: 272 HLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDE 331 + A ++ + D V G N F++ ++TI+ D Sbjct: 1284 DVAIQAAIDAVKELGPQDRVAVFTFNG----NSHPEEYMGFTYVTADNLQTIISD-LKDI 1338 Query: 332 NEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 G T + D + A + + D + +A + I++LTDG + D Sbjct: 1339 QAGGGTPLYDTLSWAVYYMDKYSTDNP----DREDATRGILVLTDGLSNYDTYGTSN 1391 >gi|134291854|ref|YP_001115623.1| TadE family protein [Burkholderia vietnamiensis G4] gi|134135043|gb|ABO59368.1| TadE family protein [Burkholderia vietnamiensis G4] Length = 177 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 18/181 (9%), Positives = 46/181 (25%), Gaps = 12/181 (6%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + ++ G + AL++ M+ + + + R Y L +A + A S Sbjct: 3 RLPMHRSRM----RGVAAVEFALVLIPMIVLATGVAEFGRAIYQYETLTKATRNAARYLS 58 Query: 66 VPLIQSLE---EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 V L + L+ ++ D + Sbjct: 59 VYLPSDAAYPLAAAQCLVVYGNTTCGAAGTELVPGLTTSMVIVCDAAHTPDCADASDPPQ 118 Query: 123 NPRKSAYQVVLSS---RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 Y ++ +N + + + G I +++ + Sbjct: 119 FANLPTYDSSNNAPSGTATGSINVVEV--KVKGYTYQPIPAYPGLSSITFGNIVTVMRQV 176 Query: 180 Q 180 Sbjct: 177 S 177 >gi|260813584|ref|XP_002601497.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] gi|229286794|gb|EEN57509.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] Length = 260 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 53/167 (31%), Gaps = 17/167 (10%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++++ D ++G ++ R + S + ++ G T Sbjct: 24 QFTLKTVIGFDIGPTATQVGIIQYSTRPRQEFSM--NSFQTKESLSTAIENVNYMAGGTL 81 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ K ++L+TDG ++ D E+ +A+ +GI Sbjct: 82 TGRAIRYVTKYGFGESDGARPG------IPKIVILVTDGVSSDDIEQPAL---EAQQKGI 132 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRI 445 + I S L AS N L R+ + Sbjct: 133 SLYAIGVSG------YDMDQLERIASNNRTLAVAENFNLLDSLRNTL 173 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 34/345 (9%), Positives = 83/345 (24%), Gaps = 32/345 (9%) Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 + P L + D ++ + +++ + + Sbjct: 66 SVKPSYYGNSFSLIRQGDNAISVVVGAPKAIDQNQISLHRTKATAETKPQGFLQRCPINF 125 Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGK----VGIRDEKLSPYMVSC 236 + E + + ++ Y + Sbjct: 126 SGLPGLPCKYVNPPSIEPGDNIGLTASVQPEGDIVNCSPTRELKCSSMKYNPGFCYKSTD 185 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK-KHLVRDALASVIRSIKKIDNVNDTV 295 + + S + + K V+D + ++ + + Sbjct: 186 YGGNWRKEGSRNTECPSSGVDVLFVLDGSGSVGKNFDKVKDWVKNITAKLDIGKEIVRVG 245 Query: 296 RMGATFF------NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + + + N + S G KL+ + + +T A+Q Sbjct: 246 VVQYSHYVEGKSINKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTGRALQKVIRD 305 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + K+ ++LLTDG+ + N+ +++GI + Sbjct: 306 FDDAYIGN----------KQVLLLLTDGQAKDNKLILPN-ANRLRNKGIATFAVGVG--- 351 Query: 410 TQQEKARYFLSNCASP----NSFFEANSTHELNKIFRDRIGNEIF 450 E L AS + F EL+ I + Sbjct: 352 ---EYDISELKLIASGTDSTDRVFTVTDFGELDSIVKSLQTEIQS 393 >gi|269958192|ref|YP_003327980.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] gi|269306873|gb|ACZ32422.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] Length = 133 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 12/115 (10%), Positives = 33/115 (28%), Gaps = 2/115 (1%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M ++ + +L G + A++MP +L + + + AAQ Sbjct: 1 MRSRRQYKSLTARLR--ERGSVSLEAAVVMPALLALLWLGMQGALMYQGRTTALAAAQEG 58 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A+ + +++ + + + + +V Sbjct: 59 ARVAAGENGTASAGIAAAEDLADAATLGLRGTQVDGSRSATEASVTVRLHIASVV 113 >gi|159038794|ref|YP_001538047.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157917629|gb|ABV99056.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 316 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 56/169 (33%), Gaps = 23/169 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 +G F + R V T STA +A+ T + I S Sbjct: 127 YNLGLVSFAKAANVLVPPT-----KDRQAVVTAVDGLVLAESTATGEAVFTCLEAIRSV- 180 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 IVLL+DG T A A++ + + TIAF + Q + Sbjct: 181 ----PADGAAGIPPARIVLLSDGYRTSGRSVEEAA-AAAQAANVAVSTIAFGTDGGQVDI 235 Query: 415 ARYFLS-----------NCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 + F+EA S +EL ++++D G+ I R Sbjct: 236 GGQLQRVPVDRFALAELAATTEGHFYEAASVNELKQVYQDM-GSSIGFR 283 >gi|238609519|ref|XP_002397504.1| hypothetical protein MPER_02057 [Moniliophthora perniciosa FA553] gi|215472087|gb|EEB98434.1| hypothetical protein MPER_02057 [Moniliophthora perniciosa FA553] Length = 183 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 9/84 (10%), Positives = 32/84 (38%), Gaps = 3/84 (3%) Query: 371 IVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 +VL++D ++ + + +A++ + I + + + L + + ++ Sbjct: 16 MVLVSDASDSTRRAQMDLVLARAEAANVPIHSFGYGRSHDPASLW---LMSNHTSGTYTF 72 Query: 431 ANSTHELNKIFRDRIGNEIFERVI 454 ++L IG + ++ Sbjct: 73 VKDWYDLRDSIAGCIGGMMSIGLL 96 >gi|115374131|ref|ZP_01461419.1| hypothetical protein STIAU_3107 [Stigmatella aurantiaca DW4/3-1] gi|115368907|gb|EAU67854.1| hypothetical protein STIAU_3107 [Stigmatella aurantiaca DW4/3-1] Length = 1551 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 25/238 (10%), Positives = 61/238 (25%), Gaps = 37/238 (15%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 D + ++ + N + P + +F ++ + + Sbjct: 403 NADAFDSSRTGFIATINSLTFNTATPLARALLNTGYYFTSDQTVYKDVFGFGATNPTVGY 462 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K + ++ R + D+ + + G T+ ++ Sbjct: 463 AYPTDFKNEPLSSENRS--VCWGDQASAVIILTDG-EPNTDTLGSAVVQKIRSRNGGPVS 519 Query: 341 --------DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 D A + + L T G+ Sbjct: 520 CPPSAPCADTPNDANAMLDDVAKLLFTNDLQYSTPPIVGALNTSGQ-------------- 565 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 Q +R+ T+ F+++ L N A + A+ L + +D + E Sbjct: 566 ---QSLRVYTVGFAIDSN-------LLKNAAAVGGGRSYTAHDAAGLRQALQDALLRE 613 >gi|15072454|gb|AAK38350.1| von Willebrand factor A-related protein [Mus musculus] Length = 415 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 34/100 (34%), Gaps = 9/100 (9%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F++ + + I + MG T A+ A + + + + K Sbjct: 84 TEFTFDQYSSGQAIRDAIRVAPQRMGDTNTGLALAYAKEQLFAEEAGARPGV------PK 137 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 +V +TDG ++ + + K G+ I ++ Sbjct: 138 VLVWVTDGGSSDPVGPPM---QELKDLGVTIFIVSTGRGN 174 >gi|264678232|ref|YP_003278139.1| TadE-like protein [Comamonas testosteroni CNB-2] gi|262208745|gb|ACY32843.1| TadE-like protein [Comamonas testosteroni CNB-2] Length = 147 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 41/132 (31%), Gaps = 6/132 (4%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G I AL + + L +VD R Y AA A + + + + Sbjct: 6 QRGSTLIEFALGLLIFLMFLLGVVDFSRLLYTWT----AANEATRAGAR--YAVVCDDLN 59 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + + K ++ + + + T V + + A V +++ Sbjct: 60 QKTSVLQYMKNRLPQITAVDVKWKPDSCTTATCQSVTVSIPQDGLKFQWIAPIVGSAAQT 119 Query: 138 DLLLNPLSLFLR 149 + + S L Sbjct: 120 VIGMPGFSTTLP 131 >gi|114665186|ref|XP_001171182.1| PREDICTED: similar to leukointegrin alpha d chain, partial [Pan troglodytes] Length = 129 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 4/85 (4%) Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYF 418 H+ AKK ++++TDG+ +D E + +A+ GI I AR Sbjct: 6 HKNGARKSAKKILIVITDGQKYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGP-TARQE 64 Query: 419 LSNCASP---NSFFEANSTHELNKI 440 L+ +S + F+ ++ L I Sbjct: 65 LNTISSAPPQDHVFKVDNFAALGSI 89 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 56/178 (31%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + + R G F D F+ + + A Sbjct: 158 SRLTAAKRVLRNFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQDVWL--NLLDEA 208 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + DA+ S + +++LTDG +T + Sbjct: 209 ETGMAGQSTNLGDAIGLGIKVFEQSPS---------TSQDQIMLVLTDGNDTGSFVSPVD 259 Query: 389 ICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 A ++GIRI IA + +Q ++ +S F A +LN+ ++ Sbjct: 260 AAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQ 317 >gi|307133504|dbj|BAJ19016.1| TadF [Aggregatibacter actinomycetemcomitans] Length = 192 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 8/54 (14%), Positives = 20/54 (37%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQA 56 + +KK + G + ++ +++ + L D+V + L A Sbjct: 1 MKKNILTNTKKFFHNKRGAVTLEFLFMLILLVFIFAFLTDLVIVRTTQGKLDNA 54 >gi|226309730|ref|YP_002769624.1| hypothetical protein BBR47_01430 [Brevibacillus brevis NBRC 100599] gi|226092678|dbj|BAH41120.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 136 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 20/60 (33%), Gaps = 1/60 (1%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVD-VVRWSYYEHALKQAAQTAIITASVPL 68 + +K G I L+ P++ + D + AL A + I A P Sbjct: 1 MIRARVKDERGSQMIEFILVFPLVWILIVFSFDQFSMMYNKQKALAAAYEAGRIAAVQPN 60 >gi|74001590|ref|XP_848825.1| PREDICTED: similar to Collagen alpha 1(VI) chain precursor [Canis familiaris] Length = 206 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 47/127 (37%), Gaps = 20/127 (15%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG-STAINDAMQTAYDTIISSNEDEVHRM 361 D V + R +K G T + A++ + ++ Sbjct: 95 GDEVEIISPLTPMPA--DRDALKAKVDAVKYFGKGTYTDCAIKKGLEELL--------VG 144 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEG----IAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++L+ KY++++TDG + +E N+AK G+++ ++A + Sbjct: 145 GSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAI-----TPDHLEP 199 Query: 418 FLSNCAS 424 LS A+ Sbjct: 200 RLSIIAT 206 >gi|158319036|ref|YP_001511544.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114441|gb|ABW16638.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 608 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 42/137 (30%), Gaps = 18/137 (13%) Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 V G T + D A+ + +VLLTD Sbjct: 481 AGTRSQAVMDAVNRIEPRGDTGLYDTALAAFRYMNQHYVPGRPNQ---------VVLLTD 531 Query: 377 GEN-TQDNEEGIAICNKAK-----SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFE 430 G+N + + + + ++++TI + + +R + A+ + Sbjct: 532 GKNSDPGSIALDELVRILRREYSPQRPVQVITIGYGADTDLAALSRI---SAATGAETYP 588 Query: 431 ANSTHELNKIFRDRIGN 447 A + + ++ D + Sbjct: 589 ALDPNTIFEVLVDALTE 605 >gi|326382230|ref|ZP_08203922.1| von Willebrand factor type A [Gordonia neofelifaecis NRRL B-59395] gi|326198960|gb|EGD56142.1| von Willebrand factor type A [Gordonia neofelifaecis NRRL B-59395] Length = 552 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 43/142 (30%), Gaps = 22/142 (15%) Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + G N G T + D A+ T+ S + ++ Sbjct: 419 TDGRTNRDLLADAIRQTLGNVHGGTGLYDTALAAFKTVYDSYDPAYSNS---------VI 469 Query: 373 LLTDGENTQDNEEGIAI----CNKAKS--QGIRIMTIAFSVNKTQQEKARYFLSNC--AS 424 ++TDG N + + N K+ + I I+ + ++ L A+ Sbjct: 470 IMTDGRNEDRDSITLQHLVSELNIMKNPARRIPIIAVGI-----SEDADADALKQIADAT 524 Query: 425 PNSFFEANSTHELNKIFRDRIG 446 S F A ++ I + Sbjct: 525 GGSSFIARDPKDIPAILLQAVS 546 >gi|303310907|ref|XP_003065465.1| Ku70/Ku80 beta-barrel domain containing protein [Coccidioides posadasii C735 delta SOWgp] gi|240105127|gb|EER23320.1| Ku70/Ku80 beta-barrel domain containing protein [Coccidioides posadasii C735 delta SOWgp] Length = 739 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 33/59 (55%) Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 + +D + R L+ K+ IVL+TDG +T D++ +I +K K +GI ++ + + Sbjct: 122 TNQGDDMIVRYCKRLKYKRKIVLVTDGRSTTDSDGIDSIVSKIKEEGIELVILGVDFDD 180 >gi|72168566|ref|XP_796840.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115961659|ref|XP_001187264.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 958 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 54/188 (28%), Gaps = 18/188 (9%) Query: 260 DSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL 319 D S + + A +R D +G F+D + + Sbjct: 325 DVSGSMGGNDRLTKLNQAATQYLRYTID-----DGSFVGIAHFSDYSRIIENLTEITD-- 377 Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G T I + + E YI+L++DG+ Sbjct: 378 NSREDLVMGLPSIANGPTCIGCGVLDGIKILKGETGMEDPAGG-------YILLISDGQQ 430 Query: 380 TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEAN-STHELN 438 + + ++ + G+ + TIAFS +N F+ ++ LN Sbjct: 431 NRQPYI-DEVFDEVEEAGVIVDTIAFSDAADPNLLELSVRTN--GLGFFYPDTATSTALN 487 Query: 439 KIFRDRIG 446 F + Sbjct: 488 DAFTATVS 495 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 55/178 (30%), Gaps = 22/178 (12%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 + + L + R G F D F+ + + A Sbjct: 122 SRLTAAKKVLRDFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQEVWL--NLLDEA 172 Query: 329 IDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 ST + DA+ S + +++LTDG +T + Sbjct: 173 ETGMAGQSTNLGDAIGLGIKVFEQSP---------ATSQDQIMLVLTDGNDTGSFVSPVD 223 Query: 389 ICNKAKSQGIRIMTIAFSVNKT--QQEKARYFLSNCAS--PNSFFEANSTHELNKIFR 442 A ++GIRI IA + +Q ++ +S F A +LN+ ++ Sbjct: 224 AAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQ 281 >gi|190894970|ref|YP_001985263.1| hypothetical protein RHECIAT_PC0000636 [Rhizobium etli CIAT 652] gi|218513508|ref|ZP_03510348.1| hypothetical protein Retl8_07211 [Rhizobium etli 8C-3] gi|190700631|gb|ACE94713.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 194 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 39/132 (29%), Gaps = 7/132 (5%) Query: 5 TKFIFYSKKLI---KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + F F +L+ + + AL++P+++ + VD+ + Sbjct: 8 SLFRFARSRLLDLARDRLAASGVEFALVLPILVMLLFGTVDLGHALTVSRKI----DEIA 63 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 T + Q SS + ++ Y + L N + V +A Sbjct: 64 STTGDLIGQQSSWTSSDVTKLLSGASFILQPYDTSGLKITLTVNDISKNGNATVNWSAAF 123 Query: 122 MNPRKSAYQVVL 133 ++ Sbjct: 124 NTSALNSGAASA 135 >gi|38347892|ref|NP_941141.1| putative tellurium resistance protein [Serratia marcescens] gi|226807627|ref|YP_002791321.1| TerY1 [Enterobacter cloacae] gi|226809937|ref|YP_002791631.1| TerY1 [Enterobacter cloacae] gi|1354148|gb|AAC44737.1| terY [Plasmid R478] gi|38259369|emb|CAE51594.1| putative tellurium resistance protein [Serratia marcescens] gi|226425852|gb|ACO53945.1| TerY1 [Enterobacter cloacae] gi|226426163|gb|ACO54255.1| TerY1 [Enterobacter cloacae] Length = 197 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 14/112 (12%), Positives = 38/112 (33%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLTDLLS--------FQMPA 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+T++ +A+ +I + K + + L+TDG D Sbjct: 58 LTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDD 107 >gi|116622830|ref|YP_824986.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225992|gb|ABJ84701.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 329 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 10/79 (12%), Positives = 30/79 (37%), Gaps = 3/79 (3%) Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SP 425 +K +++ ++G + + A+ +GI I I+ + + + A + Sbjct: 201 RKVVIVFSNGPDNASMVAPDDVRAVAEDEGIPIYVIS-TSEVNKDPISSGVFKRIATRTG 259 Query: 426 NSFFEANSTHELNKIFRDR 444 + A + + + F + Sbjct: 260 GKAYWAKTWQKQVEAFENI 278 >gi|99081993|ref|YP_614147.1| hypothetical protein TM1040_2153 [Ruegeria sp. TM1040] gi|99038273|gb|ABF64885.1| hypothetical protein TM1040_2153 [Ruegeria sp. TM1040] Length = 201 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 17/55 (30%), Gaps = 3/55 (5%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGML---VDVVRWSYYEHALKQAA 57 K + + G+ + AL +P++L V D+ R Sbjct: 4 KIAHKFRAFRRDEEGNIAVEAALYLPLLLFVFAATYTLFDLFRQETVNTKAAYTV 58 >gi|301091448|ref|XP_002895909.1| ATP-binding Cassette (ABC) Superfamily [Phytophthora infestans T30-4] gi|262096097|gb|EEY54149.1| ATP-binding Cassette (ABC) Superfamily [Phytophthora infestans T30-4] Length = 1945 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 67/241 (27%), Gaps = 11/241 (4%) Query: 2 VFDTKFIFYS-KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 VF T K+ + II + L+PV+L G+ + A+ Sbjct: 1046 VFSTHLRALLLKRFRYARRDKKTIIYSALLPVLLIGAGLGILKGSAL--------ASDNP 1097 Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + S E + + + E + F + + Sbjct: 1098 SMALSTDEYSGSETPTPYFCQAGSGTGDWCSEVMGSTFFSGAAAQTLAVSQPAFDSSSPT 1157 Query: 121 EMNPRKSAYQVVLSSRYDLLLNP-LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + + + S + L+ R G + L++ + A V + Sbjct: 1158 VFDVSYTNPTINASGYTGYSVAMGQELYNRGYGHDADLVEGQYGAYLVYGDGDQNLLGYN 1217 Query: 180 QWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS 239 + + + Q + F A+ + S SS KV ++ + + S Sbjct: 1218 VFTNTSAPHSSAIFKAMMDQAVYRFFA-ANTSSDSDSSVGLKVNNYPLPVTAATKAFSGS 1276 Query: 240 L 240 Sbjct: 1277 G 1277 >gi|115525744|ref|YP_782655.1| TadE family protein [Rhodopseudomonas palustris BisA53] gi|115519691|gb|ABJ07675.1| TadE family protein [Rhodopseudomonas palustris BisA53] Length = 170 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 9/67 (13%), Positives = 24/67 (35%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ ++ G I A + P+ + +++ + L+ Q A + +QS Sbjct: 2 RRFCRNKQGSAAIEFAFVAPMFFALLFAIIETSAVFFAGQYLETGTQDAARALLIDDVQS 61 Query: 72 LEEVSSR 78 + Sbjct: 62 KGTNQAD 68 >gi|327263661|ref|XP_003216636.1| PREDICTED: cochlin-like [Anolis carolinensis] Length = 527 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 38/109 (34%), Gaps = 17/109 (15%) Query: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 ++ G TA +A+ + N R N ++V+LTDG+ Sbjct: 403 TKENVLAALRGIRYMSGGTATGEAISH-----TTRNVFGPVRDGGNKN---FLVILTDGQ 454 Query: 379 NTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + D+ G A+ A+ GI I +I L + AS Sbjct: 455 -SYDDVRGPAV--AAQQAGITIYSIGI------AWAPLDDLKDMASAPK 494 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 34/291 (11%), Positives = 73/291 (25%), Gaps = 35/291 (12%) Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 + + S ++ G + A + + Sbjct: 153 SSSKSPAAKAGETTKAYQSPSVPGTTAQPVTLMQVPGTTAVEATHTALPKPSPSAGFTTS 212 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + +Q V SR + + + N + ++ K + + L P Sbjct: 213 SLRLQPVGQRSRELDLWCTTTYTNSQN--------SPQANPGFVPKEELSTQSLEPVS-- 262 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 G + +D SS ++ + + + + +D Sbjct: 263 ----------QGDPSCKVDLSFLIDGSSSIG-KRRFRIQKQ---FLADVAQALDIGPAGP 308 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 MG + D + + ++ G + + A+ S + Sbjct: 309 LMGVVQYGDNPATQFNLKTHMNSRDLKTAIEKIT--QRGGLSNVGRAISFVTKNFFSKSN 366 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N V++ DG T EE + A+ GI I I Sbjct: 367 GNRGGAPN------VAVVMVDGWPTDKVEEASRL---ARESGINIFFITIE 408 >gi|218660803|ref|ZP_03516733.1| hypothetical protein RetlI_15107 [Rhizobium etli IE4771] Length = 194 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 35/126 (27%), Gaps = 4/126 (3%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 +L + T + AL++P+++ + VD+ + T Sbjct: 14 RSRLHRLAQERTAASGVEFALVLPILVMLLFGTVDLGHALTVSRKI----DEIASTTGDL 69 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + Q SS + ++ Y + N + V +A + Sbjct: 70 IGQQSSWTSSDVTKLLSGASFILQPYDTSGLTITVAVNDISKSGNATVNWSAAYNTSALN 129 Query: 128 AYQVVL 133 + Sbjct: 130 SGTASA 135 >gi|198418440|ref|XP_002122148.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1270 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 60/167 (35%), Gaps = 20/167 (11%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 + + ++++S D V +G F++ + + S + + ++ Sbjct: 200 KSWIKNLLQSF-SSDIDKHNVVVGLYSFSNIIKREIPLSARTYSTLSGMIDAVR---YPY 255 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAK 394 G T I+ A+ A + V K ++++TDGE T + + N A+ Sbjct: 256 GQTFIHTAINEAVQEYQRAGRASVP---------KLLIVITDGEATVPSAVAPSA-NAAR 305 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKI 440 + GI + + + + E L+ A F + L I Sbjct: 306 AAGIILTAVGIGSSVNENE-----LTTIAGAAERVFRVSDFSSLGSI 347 >gi|163742982|ref|ZP_02150365.1| hypothetical protein RG210_01912 [Phaeobacter gallaeciensis 2.10] gi|161383665|gb|EDQ08051.1| hypothetical protein RG210_01912 [Phaeobacter gallaeciensis 2.10] Length = 199 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 14/37 (37%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGM 39 T+ + + G + A MP++LGV Sbjct: 1 MLTRIRTLFYRYRRETDGSVSVEFAFYMPLLLGVFAA 37 >gi|90406743|ref|ZP_01214936.1| hypothetical protein PCNPT3_01885 [Psychromonas sp. CNPT3] gi|90312196|gb|EAS40288.1| hypothetical protein PCNPT3_01885 [Psychromonas sp. CNPT3] Length = 156 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 46/147 (31%), Gaps = 1/147 (0%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA- 64 + + KKL +G + + + +++++ R+ L A A Sbjct: 2 RLLQRPKKLRSQQSGAATLEFVFTFIPLFVMCLLMLEICRYMITSSVLDVALSNATRQVI 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 P+ + + ++ +P + I K ++ + Sbjct: 62 VTPVNSDVSAQLKQLIDAKKWPLINAKNITINAQYFTDFKALVKQQSDSVHHGQIYAKYI 121 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSM 151 +Y +L S + LS F RS+ Sbjct: 122 LSYSYTPLLLSFSGFSFSKLSSFKRSI 148 >gi|260813586|ref|XP_002601498.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] gi|229286795|gb|EEN57510.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] Length = 384 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 53/206 (25%), Gaps = 22/206 (10%) Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 N + ++ N + + L +D S V+ Sbjct: 201 NETNICHRGTDREFGIN-NCGRKFSCFSTLCRNPLDIIFLLDGSGSVG-ASNFEKVKQFT 258 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 I + ++ R + S + + K + G T Sbjct: 259 KKTISGFDISPSGTQVG---VIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMR--GGTL 313 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + K ++++TDG + +A+ +GI Sbjct: 314 TGKAIRYVTKYGFGKSDGARPGV------PKVVIVVTDGVSYDAVAAPAL---EAQQKGI 364 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS 424 + I S L AS Sbjct: 365 TVYAIGVSG------YDADQLEQIAS 384 Score = 37.3 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 46/144 (31%), Gaps = 17/144 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 +I D ++G ++ R + S + + K + G T A+ Sbjct: 34 TISGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMR--GGTLTGKAI 91 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + ++ + K ++++TDG + +A+ +GI + I Sbjct: 92 RYVTKYGFGKSDGARPGV------PKVVIVVTDGVSYDAVAAPAL---EAQQKGITVYAI 142 Query: 404 AFSVNKTQQEKARYFLSNCASPNS 427 S L AS N+ Sbjct: 143 GVSG------YDADQLEQIASNNN 160 >gi|195487665|ref|XP_002091997.1| GE11914 [Drosophila yakuba] gi|194178098|gb|EDW91709.1| GE11914 [Drosophila yakuba] Length = 896 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 12/183 (6%), Positives = 31/183 (16%), Gaps = 8/183 (4%) Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 + ++ Q D +S G + P N S + Sbjct: 352 GPWWGNNSTQGPWWGNNSTDGPWWGNNSTQGPWWGNNSTQGP--WWGNNSTEGPWWGNNS 409 Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLV---RDALASVIRSIKKIDNVNDTVRMGATFFNDRV 306 E+ + + ++ + Sbjct: 410 TEGPWWENNSTQGPWWGNNSTEGPWWGNNSTQGPWWGNNSTEGPWWGNNSTEGPWWGNNS 469 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 WG + + G+ + ++ E + Sbjct: 470 TE--GPWWGNNSTQGPWWGNNSTQGPWWGN-NSTEGPWWGNNSTXXXTEGPWGGNNSTEG 526 Query: 367 AKK 369 Sbjct: 527 PWW 529 >gi|159040448|ref|YP_001539701.1| hypothetical protein Sare_4964 [Salinispora arenicola CNS-205] gi|157919283|gb|ABW00711.1| conserved hypothetical protein [Salinispora arenicola CNS-205] Length = 144 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 32/128 (25%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 L + +G + A+ +L V + D A A + Sbjct: 5 SLGRGDSGRVSVFLAIAALGILTVVALAFDGAGQLRALQRADNLASEAARAGGQSIDLVS 64 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + ++ + YL + +F + +VR R V Sbjct: 65 AIEGGPKRIDEPQARRAVAGYLAAAGASTHTVSFPVVDGETVVRVRVTVTYERSLLGLVG 124 Query: 133 LSSRYDLL 140 ++ Sbjct: 125 FDDTVNVT 132 >gi|295789104|ref|NP_001171440.1| vitrin isoform 2 [Homo sapiens] gi|74739159|sp|Q6UXI7|VITRN_HUMAN RecName: Full=Vitrin; Flags: Precursor gi|37181801|gb|AAQ88704.1| VIT [Homo sapiens] gi|119620824|gb|EAX00419.1| vitrin, isoform CRA_d [Homo sapiens] Length = 678 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 41/303 (13%), Positives = 82/303 (27%), Gaps = 19/303 (6%) Query: 104 KNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAE 163 + A E + ++ + L L ++ + L + Sbjct: 147 PSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVATPTTLPRPSPS 206 Query: 164 AETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVG 223 A + + + +D S Y + Q D + ++ G Sbjct: 207 AASTTSIPRPQSVGHRSQEMDL-WSTATYTSSQNRPRADPGIQRQDPSGAAFQKPVGAD- 264 Query: 224 IRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIR 283 L P +SL + G + + +D S+ ++ + + LA V Sbjct: 265 -VSLGLVPKEELSTQSLEPVS-LGDPNCKIDLSFLIDGSTSIG-KRRFRIQKQLLADV-- 319 Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 + +D MG + D + + + G + + A+ Sbjct: 320 -AQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKIT--QRGGLSNVGRAI 376 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 S N +V++ DG T EE + A+ GI I I Sbjct: 377 SFVTKNFFSKANGNRSGAPN------VVVVMVDGWPTDKVEEASRL---ARESGINIFFI 427 Query: 404 AFS 406 Sbjct: 428 TIE 430 >gi|238027564|ref|YP_002911795.1| TadE-like protein [Burkholderia glumae BGR1] gi|237876758|gb|ACR29091.1| TadE-like protein [Burkholderia glumae BGR1] Length = 156 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 38/140 (27%), Gaps = 14/140 (10%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L +S G + AL++P V ++ + + A A L + Sbjct: 9 LRRSQRGSMSVEFALILPAFFMVLYAVITYGLIFAAQ---QNLTLAATEGARAALNYQIS 65 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 A + +L + + S Y + + Sbjct: 66 PTPQAALTARAAAACTAATHLTGWLTGVSCASTPAGTC-----------SYDSSMYCIKV 114 Query: 134 SSRYDLLLNPLSLFLRSMGI 153 + Y +PL L +G+ Sbjct: 115 TLTYPYSSSPLIAPLPLVGM 134 >gi|167520272|ref|XP_001744475.1| hypothetical protein [Monosiga brevicollis MX1] gi|163776806|gb|EDQ90424.1| predicted protein [Monosiga brevicollis MX1] Length = 1147 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 17/207 (8%), Positives = 41/207 (19%), Gaps = 1/207 (0%) Query: 46 WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN 105 L A + A+ Q+ + + T I L + Sbjct: 113 AYASTARLNNTVDAA-LQAAQSDTQAWVDNRLLSYAQTTAVAATINASLQDFWAATEPSV 171 Query: 106 FTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 E+N + +A ++ + + + A Sbjct: 172 QASIASSLAPYTPTTELNDKLAAIHDDVAPLINATQAENIAAAAIQAYDNEPKPVRLLAS 231 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + + +E + I S + + A + S + +R Sbjct: 232 SGACDASREGELGISGNGLVSVCLSETWEPVSMPADEYPSCRAIKDALSTAVDGYYTVVR 291 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + Sbjct: 292 GGSARQVYCDMTTEGGGWEWVATKVTP 318 >gi|111026118|ref|YP_708401.1| hypothetical protein RHA1_ro10050 [Rhodococcus jostii RHA1] gi|110824961|gb|ABH00243.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 150 Score = 38.0 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 13/144 (9%), Positives = 38/144 (26%), Gaps = 7/144 (4%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + + + G I L +++ + G + ++ AA AS Sbjct: 2 RLTRWLSRARRDDRGDVTIELCLATVILILLLGWMYAYGVNRQAHQKVEHAATEGARAAS 61 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV----- 120 + + + + + + + N + T V V T Sbjct: 62 LARTIATATPLAYQAAAGS-MDGQGLKCATMNVNADTSGFRTRPGVPATVEVTVTCQVSF 120 Query: 121 -EMNPRKSAYQVVLSSRYDLLLNP 143 + +++ ++ Sbjct: 121 DALGWPGVNGARTVTATAISPIDT 144 >gi|300313226|ref|YP_003777318.1| protein exporter of the Resistance-Nodulation-Cell Division (RND) superfamily protein [Herbaspirillum seropedicae SmR1] gi|300076011|gb|ADJ65410.1| protein exporter of the Resistance-Nodulation-Cell Division (RND) superfamily protein [Herbaspirillum seropedicae SmR1] Length = 755 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 1/47 (2%) Query: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAIITASVPLIQ 70 +T +PVM+ G+ VD + Y + AA + + + + Sbjct: 635 LTVATLPVMVLAVGIGVDYAFYIYSRLQMHLAAGEAIAVALEKSIQE 681 >gi|167589171|ref|ZP_02381559.1| membrane protein [Burkholderia ubonensis Bu] Length = 636 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 42/148 (28%), Gaps = 11/148 (7%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G ++ + + V L G +D+ + L++ A + + A L S Sbjct: 1 MRGQRGAVSVLATVWVLVALVTLG-AIDIGNLFFQRRDLQRIADLSALAAVQSLDPSDTS 59 Query: 75 ----------VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + S T + + + + A Sbjct: 60 CGAANTAALNNARENETSNTVTTGAPARGQDQVAATCGRWDPQVYAGQPAYFAPAASGMT 119 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMG 152 + +A QV ++ + L + Sbjct: 120 QLNAAQVTVTRTMRYSFLGVVSMLGAGP 147 >gi|32471259|ref|NP_864252.1| signal peptide [Rhodopirellula baltica SH 1] gi|32396961|emb|CAD71931.1| hypothetical protein-signal peptide prediction [Rhodopirellula baltica SH 1] Length = 154 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 34/105 (32%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G A+++P+ L + D R ++ + AA+ ++ + E Sbjct: 22 KRAGATATEFAIVLPMFLLLVFACCDFARVIHFRQLVANAARVGATHGALNRFTAATESD 81 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 R ++ ++ + +F D V +T V Sbjct: 82 WRNDVVNVMREELAHLTSTDPNDSVIDVHFRDLSSGVRVVETEVT 126 >gi|76798101|ref|ZP_00780356.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|76586556|gb|EAO63059.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] Length = 420 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 43/124 (34%), Gaps = 18/124 (14%) Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV---LLTDGENTQDNEEGIAICNKAK 394 A+ + Y + D + + KK + T GE T G + Sbjct: 3 NEGYAINSGYIYL--YWRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-----NIR 55 Query: 395 SQGIRIMTIAFSVNKTQ---QEKARYFLSNCAS-PNSFFEANST----HELNKIFRDRIG 446 +G I T+ VN +A F+ + +S ++ + T ELNK F+ + Sbjct: 56 PKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVE 115 Query: 447 NEIF 450 + Sbjct: 116 EKHS 119 >gi|283777925|ref|YP_003368680.1| hypothetical protein Psta_0124 [Pirellula staleyi DSM 6068] gi|283436378|gb|ADB14820.1| hypothetical protein Psta_0124 [Pirellula staleyi DSM 6068] Length = 505 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 2/51 (3%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHA 52 + +KF +++ K G ++ A+L+P + G+ G+++D Sbjct: 9 ISSSKFARKTRE--KQRRGKVLVLLAILLPSLCGMIGLVMDTGMLLSEHRQ 57 >gi|148706513|gb|EDL38460.1| vitrin, isoform CRA_b [Mus musculus] Length = 643 Score = 38.0 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 48/161 (29%), Gaps = 15/161 (9%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G + + +D S+ ++ + + LA V++++ D +G + D Sbjct: 274 QGDPNCKIDLSFLIDGSTSIG-KRRFRIQKQFLADVVQAL---DIGPAGPLVGVVQYGDN 329 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + ++ G + + A+ T S N Sbjct: 330 PATQFNLKTHMNSQDLKTAIEKIT--QRGGLSNVGRAISFVTKTFFSKANGNRGGAPN-- 385 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 V++ DG T + A+ GI + I Sbjct: 386 ----VAVVMVDGWPTDK---VEEVSRVARESGINVFFITVE 419 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 37.7 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 36/291 (12%), Positives = 75/291 (25%), Gaps = 19/291 (6%) Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCF---GQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 +I ID S SM + + + + Sbjct: 58 NIMLFIDDSGSMKWIPGEKRPPRYYGEKSRLAITKSVLNKVLDKYQDQFNWSLQTLHNNG 117 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI---KKKHLVRDALASVIRSIKKIDNV 291 + + + + D E + + R + + + + Sbjct: 118 RTDTADFKTPWKSVKDRVDGIEAANGTPTTRRYYEIVSGIVMPSVEYRCQKAFVVLMSDG 177 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIR-TIVKTFAIDENEMGSTAINDAMQTAYDTI 350 + + ++ + G A + + GS T Sbjct: 178 DANLSQACGRYSSSRNRSFVYPAGYAYDDYFGRQHAGACRDIDGGSYDTFWDRNTGLSFF 237 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKT 410 + +A K + + + N + Q ++ TI F + + Sbjct: 238 SRKLSSIDFKTSGLDKAGKSW------QGDPKDPKTAKNPNGFRDQLVQTFTIGFGRDIS 291 Query: 411 QQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDR---IGNEIFERVIRIT 457 R +L+N AS + +F A S +L + F I +E VI T Sbjct: 292 SA--GRNYLTNGASRDDYYFSAESEDDLYRAFDTITDSIKDESQNVVIETT 340 >gi|269966895|ref|ZP_06180968.1| hypothetical protein VMC_23980 [Vibrio alginolyticus 40B] gi|269828562|gb|EEZ82823.1| hypothetical protein VMC_23980 [Vibrio alginolyticus 40B] Length = 142 Score = 37.7 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 35/113 (30%), Gaps = 5/113 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS-----VPLI 69 + + G I L +PV+L + +++DV R + +A Q A Sbjct: 1 MANQKGLAIIEFILALPVLLMLTVLVIDVSRAFIQYTEVNKALQNGARYAVVDTYGTLDF 60 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 + + E S + +I E + +V +A Sbjct: 61 EGIAEESDIKNVVIYGTPSGVGTPVIDYIEAADIIVTQPTDTNKVVTISATYN 113 >gi|167842451|ref|ZP_02469135.1| hypothetical protein Bpse38_37655 [Burkholderia thailandensis MSMB43] Length = 168 Score = 37.7 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 37/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDATRYLSTYLSTDPAYPVAQ 61 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + ++ ++ Sbjct: 62 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDASHTTNCSDSSDPAQFASVPTYDTNNGS 121 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 122 PDPASLAGSMNLV 134 >gi|323525744|ref|YP_004227897.1| TadE family protein [Burkholderia sp. CCGE1001] gi|323382746|gb|ADX54837.1| TadE family protein [Burkholderia sp. CCGE1001] Length = 171 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 21/54 (38%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 ++ +S G I AL+ P+ V +V + +L A++ A Sbjct: 23 FRRARRSQRGATAIEFALVFPLFFCVFYAIVTFSLIFVAQQSLTLASEEGARAA 76 >gi|218779355|ref|YP_002430673.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218760739|gb|ACL03205.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 504 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 45/138 (32%), Gaps = 21/138 (15%) Query: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 D + R + + G + A++ A++ Sbjct: 83 CDNAEVLVPLAP----YDRRALISAIKKVQPQGKAPLAAALRKAWEQ------------G 126 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLS 420 L I L+TDG + + +++ K++G I + I + N+ K Sbjct: 127 AGLSQGCVITLITDGWDDCW-GDPVSMVEDLKARGAGIIVNIIGVAPNREDAAKLMRLAR 185 Query: 421 NCASPNSFFEANSTHELN 438 AS ++ A++ +L Sbjct: 186 --ASGGAYRAADTRADLI 201 >gi|327188854|gb|EGE56047.1| putative vault protein inter-alpha-trypsin domain-containing protein [Rhizobium etli CNPAF512] Length = 794 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 48/163 (29%), Gaps = 20/163 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I ++ R F+D + + R T+ G T + A++ A Sbjct: 381 AISRLSKDDRFNVIRFDDTMTDYFNGLVAASPDNREKAITYVRGLTADGGTEMLPALEDA 440 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 441 ---LRNQGPVASGAL-------RQVVFLTDGAIGNEQQLFQEI--SANRGDARVFTVGIG 488 Query: 407 VNKT------QQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 E R + S + A+ EL ++ Sbjct: 489 SAPNTYFMTKAAEIGRGTFTAIGSTDQV--ASRMGELFAKLQN 529 >gi|326561512|gb|EGE11856.1| penicillin-binding protein 1B [Moraxella catarrhalis 46P47B1] Length = 796 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 6/42 (14%), Positives = 14/42 (33%), Gaps = 2/42 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + + +G I+T +L+ V+ + V Sbjct: 3 KSVPRSFWRHQSG--LIVTGILLIVLFVMMAAFVLYAMHLNT 42 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 34/291 (11%), Positives = 74/291 (25%), Gaps = 35/291 (12%) Query: 116 RDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 + + S ++ G + A ++ + Sbjct: 92 SSSKSPAAKAGETTKAYQSPSVPGTTAQPVTLMQVPGTTAVEATHTALSKPSPSAGFTTS 151 Query: 176 GVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVS 235 + +Q V SR + + + N + ++ K + + L P Sbjct: 152 SLRLQPVGQRSRELDLWCTTTYTNSQN--------SPQANPGFVPKEELSTQSLEPVS-- 201 Query: 236 CNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTV 295 G + +D SS ++ + + + + +D Sbjct: 202 ----------QGDPSCKVDLSFLIDGSSSIG-KRRFRIQKQ---FLADVAQALDIGPAGP 247 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNE 355 MG + D + + ++ G + + A+ S + Sbjct: 248 LMGVVQYGDNPATQFNLKTHMNSRDLKTAIEKIT--QRGGLSNVGRAISFVTKNFFSKSN 305 Query: 356 DEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 N V++ DG T EE + A+ GI I I Sbjct: 306 GNRGGAPN------VAVVMVDGWPTDKVEEASRL---ARESGINIFFITIE 347 >gi|257064791|ref|YP_003144463.1| RHS repeat protein [Slackia heliotrinireducens DSM 20476] gi|256792444|gb|ACV23114.1| RHS repeat protein [Slackia heliotrinireducens DSM 20476] Length = 516 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 27/251 (10%), Positives = 63/251 (25%), Gaps = 8/251 (3%) Query: 57 AQTAII-TASVPLIQSLEEVSSRAKNSFTFPKQK-IEEYLIRNFENNLKKNFTDREVRDI 114 A++ A+ + E ++ ++ + + + + + + + EV D Sbjct: 151 LDAAVLQAAAYLDVTGCENLAELNLSANSALETLRAMDTALTEVDVSGAAGLSAIEVDDD 210 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 V +E P + V ++ S + S+ + + T TV E Sbjct: 211 VAVVGLEATPIHEQWVPVHVTKVVSSSGGTSTYEYSLSLDEAGVATGYAESTVGVDDTSE 270 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 V ++ D S +++ F V + + + Sbjct: 271 QAV-YEFNYDESGNIIGIWDVEAESMDVEFVYDDAGNVIRRTGVPTEYSYTYDDQGRLES 329 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDT 294 + L S ++ + DA V D + Sbjct: 330 FVSTGTVLTLAYDEAGRLASYTSSFGGRAVEYTY-----AYDAEGRVTGVSDSSDRPMEY 384 Query: 295 VRMGATFFNDR 305 Sbjct: 385 TVDYDDNGACS 395 >gi|221052180|ref|XP_002257666.1| von willebrand factor a-domain-related protein [Plasmodium knowlesi strain H] gi|193807496|emb|CAQ38002.1| von willebrand factor a-domain-related protein [Plasmodium knowlesi strain H] Length = 294 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 24/224 (10%), Positives = 59/224 (26%), Gaps = 14/224 (6%) Query: 208 ADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHV 267 V Y + S + S + + + Sbjct: 47 VITCVTKYVIKGDLEIDDGGFCSSNIGSESPPGSCNDTGANPGNYCDNYYDITLLVEESS 106 Query: 268 IKKKHLVRDALASVI-RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 +K + + ++ D M F F+ + + +++ Sbjct: 107 FIQKDYWKKGTIPFLESMVRNARVSKDKAHMSIVLFARDSRVIVPFTDELSQDKDKLIEK 166 Query: 327 --FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG-ENTQDN 383 D T A++ A++ +I +A K +VL G + + Sbjct: 167 VRAINDVATSPDTLYAYALEHAFEHVI-------FGEGTRKDAPKVVVLFYYGFDYGANK 219 Query: 384 EEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 + K + I+++ + ++ L CA ++ Sbjct: 220 SLIPDVVEDYKKKNIKLVIVGIALGNRDNAY---LLGGCAIGDN 260 >gi|149188855|ref|ZP_01867145.1| hypothetical protein VSAK1_05880 [Vibrio shilonii AK1] gi|148837275|gb|EDL54222.1| hypothetical protein VSAK1_05880 [Vibrio shilonii AK1] Length = 186 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 9/113 (7%), Positives = 24/113 (21%), Gaps = 4/113 (3%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 G F I A +M + + D+ L + + + + Sbjct: 12 HRQKGSFAIELAFVMMALCAIYLFSTDLSHQLLVRAKL----DRSSFALANVIKERTRYF 67 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 + + + + + I T ++ Sbjct: 68 DADVAAGKNLAVTSSDLVNLTQVASRMLNTAPGNVALKIESLTNKTTVVGFTS 120 >gi|307726640|ref|YP_003909853.1| TadE family protein [Burkholderia sp. CCGE1003] gi|307587165|gb|ADN60562.1| TadE family protein [Burkholderia sp. CCGE1003] Length = 277 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 14/174 (8%), Positives = 36/174 (20%), Gaps = 8/174 (4%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 + +G + ++ P++L V + V + A++ A+ + + Sbjct: 24 RRQSGQSMVEFIVIAPLLLFVCFGTLQFVLLYQAKS----TLDVAVLEAAREGAVNHGSM 79 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY---QVV 132 + V S Y + Sbjct: 80 QAMRSGLARGLAPIYARQASAEGAAAALAKAQADASNFSVITVLNPTPAAISDYARPRYY 139 Query: 133 LSSRYDLLLNPLSLFLRSMG-IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDF 185 P + I S + H + ++ + Sbjct: 140 PDEGVTYTEIPNDSLMYRDPSIPSTSASGMNVQDANILKIHVHYCYNMYVPLVN 193 >gi|229819442|ref|YP_002880968.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229565355|gb|ACQ79206.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 647 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 49/399 (12%), Positives = 112/399 (28%), Gaps = 31/399 (7%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 +A ++ + + ++ + + S + E +L E+ ++ + D IV +T Sbjct: 269 SANLSVADIITPAVAAEVAAHELSVSHYMATPEHFLWHARESEVQGSSADFLSAVIVDET 328 Query: 119 AV-EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 +V N ++ V D ++PL + G + Sbjct: 329 SVWSYNRGITSRDGVTRVAGDPPVDPLVPIYPTDGF-YVADNPAVVLNGSWVDDVERAAA 387 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + R+S + LN A + S S ++ V+ Sbjct: 388 EDFLRFSTTAQGQQVVRESGYRDLNGSLDDAVAELTSAESAPSGALAFPDRNVVAAVAEA 447 Query: 238 KSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRM 297 LS D+++ + + + A+ + + D V Sbjct: 448 FPEVRKDAQVLFLVDLSASMGYDNTAGQ---TRLEGAQHAITAALDHFTAGDRV-GLAGF 503 Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 T + R + + T + A+ + + + Sbjct: 504 TTTDGTITPGLVAPVADIAD--NRGALVEGVNSLEPVAHTPLYQAVADFAQQQAAMWDPD 561 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA-------KSQGIRIMTIAFSVNKT 410 IVLL+DG N + E I + + + T+ + Sbjct: 562 RINA---------IVLLSDGVNATGDIETIGQDDMIHVLHGLHAETPVLVFTLGY----- 607 Query: 411 QQEKARYFLSNC--ASPNSFFEANSTHELNKIFRDRIGN 447 + L A+ +++A E+ + D + + Sbjct: 608 SPDADVETLQAISSATGAHYYDATDPTEVEAVLGDLVTS 646 >gi|218510669|ref|ZP_03508547.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli Brasil 5] Length = 784 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 49/163 (30%), Gaps = 20/163 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I ++ R F+D + + R T+ + G T + A++ A Sbjct: 381 AISRLSKDDRFNVIRFDDTMTDYFNGLVAASPDNREKAITYVRGLSADGGTEMLPALEDA 440 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 441 ---LRNQGPVASGAL-------RQVVFLTDGAIGNEQQLFQEI--SANRGDARVFTVGIG 488 Query: 407 VNKT------QQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 E R + S + A+ EL ++ Sbjct: 489 SAPNTYFMTKAAEIGRGTFTAIGSTDQV--ASRMGELFAKLQN 529 >gi|152981071|ref|YP_001354276.1| outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid [Janthinobacterium sp. Marseille] gi|151281148|gb|ABR89558.1| outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid [Janthinobacterium sp. Marseille] Length = 824 Score = 37.7 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 43/383 (11%), Positives = 105/383 (27%), Gaps = 26/383 (6%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRW----SYYEHALKQAAQTAIITASVPLIQSLEEV 75 G + I L + G ++ VR + +L+ + Q A++ Q+ E Sbjct: 94 GSYTITAGLQAL----LAGSGLEAVRQANGGYTLQASLQSSQQDAVLAPVTVRAQADETT 149 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV-VLS 134 + L + +++ D T ++ + V + Sbjct: 150 EGSGSYTTRATGAGTRMSLSLRETPQSVSVISRQQMDDQNLVTLTDVLRQTPGIVVDRVD 209 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQ---TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 R L GI + ++ + + E +++ S Sbjct: 210 ERVLFTSRGFDLSPMINGIPTLAFNSVAGESGLLSTAIYDRVEVVRGAAGLLNGVGSPGG 269 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 +P + F A + S++ N ++ + ++ + V Y +D Sbjct: 270 SINLVRKRPTSDFSGRATVGLGSWNQYNTEIDLSNKLNAAGTVRGRVVGSYTDGDNFIDY 329 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 V ++ + D +ASV +K FF+D ++ Sbjct: 330 K-QRRENVFYGAVEA-----DIAPDTIASVGYEFQKNSIKGGNFGQAPLFFSDGSRTNLP 383 Query: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 ++ + + + + E + K + Sbjct: 384 RTFNSSTPWSSWDVQQDRFFL-----NLEHKLNNGWSI---KAEAAQTKNKRDRVGGDIW 435 Query: 372 VLLTDGENTQDNEEGIAICNKAK 394 + +D + N ++A+ Sbjct: 436 LYPSDVDAATGNASIDQGADQAR 458 >gi|220922774|ref|YP_002498076.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219947381|gb|ACL57773.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 193 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 10/122 (8%), Positives = 34/122 (27%), Gaps = 1/122 (0%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA-S 65 + ++ G + +L+ G+ +++ + L A A + Sbjct: 9 LTRSLRAFGRAQQGATAVEFSLVAIPFFGLLAAIIETAIAFFAGQLLDAAVSNASRQIYT 68 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 ++ ++ + T + + + F + K + ++P Sbjct: 69 GAFQTQTGVSATTSEQALTAFRNNLCANRVTIFNCSAVKVDIRTLDDNASFAAISPVDPS 128 Query: 126 KS 127 Sbjct: 129 TK 130 >gi|220925364|ref|YP_002500666.1| LPXTG-motif cell wall anchor domain-containing protein [Methylobacterium nodulans ORS 2060] gi|219949971|gb|ACL60363.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium nodulans ORS 2060] Length = 725 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 54/203 (26%), Gaps = 24/203 (11%) Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKI-----DNVNDTVRMGATFFNDRVISDPSFSWGV 316 R V + +R K D + R F++ + + Sbjct: 333 RRPRDVTFVIDNSGSMAGASMRQAKASLLMALDRLAPADRFNVIRFDNTMDQLFPEAVPA 392 Query: 317 HKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 + + ++F G T + + + + IV LTD Sbjct: 393 DERHLAVARSFVAALEARGGT----------EMLAPLTAALADPTPERTDRVRQIVFLTD 442 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST 434 G + + AI A R+ I ++ A S+ + Sbjct: 443 GAIGNEEQIFSAI--AAGRGRSRLFMIGIGSAPNAH-----LMTYAAELGRGSYTAIGTI 495 Query: 435 HELNKIFRDRIGNEIFERVIRIT 457 ++ + R+ + V +T Sbjct: 496 DQVAERMRELLTKLESPVVTDLT 518 >gi|325106973|ref|YP_004268041.1| TadE family protein [Planctomyces brasiliensis DSM 5305] gi|324967241|gb|ADY58019.1| TadE family protein [Planctomyces brasiliensis DSM 5305] Length = 140 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 41/134 (30%), Gaps = 6/134 (4%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + K G + AL +PV++ + ++V + + + + A Sbjct: 6 RISNQPKDF---RRGTATVELALCLPVIVLLLFGAIEVAHFIHLK---QDLTVCAYEAGK 59 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 V + + A+ + +E+ + + E+ + A + Sbjct: 60 VANRRGTTQADVVARFQEIAGAKGLEKATVTITPLLTALTPSGTEISLTAQAPANDNYEL 119 Query: 126 KSAYQVVLSSRYDL 139 S Y + + + Sbjct: 120 PSTYFRNRTLQATV 133 >gi|283779589|ref|YP_003370344.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438042|gb|ADB16484.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1740 Score = 37.7 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 40/93 (43%), Gaps = 8/93 (8%) Query: 361 MKNNLEAKKYIVLLTDGEN---TQDNEEGIAICNKAKSQG---IRIMTIAFSVNKTQQEK 414 + + E++K IV++TDG+N ++ + ++G I++ + F++ ++ E Sbjct: 1315 AQEDGESEKSIVVITDGKNYQFNAPSQLARTKDDVLAARGSRDIKVHIVGFNIEPSESEV 1374 Query: 415 ARYFLSNC--ASPNSFFEANSTHELNKIFRDRI 445 A A+ F A S L + + Sbjct: 1375 ASREFREIAEATGGEFLPATSAGSLVRSLESVL 1407 >gi|149202126|ref|ZP_01879099.1| hypothetical protein RTM1035_12403 [Roseovarius sp. TM1035] gi|149144224|gb|EDM32255.1| hypothetical protein RTM1035_12403 [Roseovarius sp. TM1035] Length = 195 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 31/117 (26%), Gaps = 11/117 (9%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGML---VDVVRWSYYEHALKQAA 57 M ++ K TG + T ++ P + D+ R+ Sbjct: 1 MTLRPALK-LLRRFWKQETGTAAMETVVMFPFLFMGLTFSYEYFDMFRYQSVREK----- 54 Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 A T + L + EV+ ++ + N + +I Sbjct: 55 --ATYTVADMLSRETSEVNEAYIDNVKVLFDIMTNDDGNNQVRVTVVRYHVDATNNI 109 >gi|328953620|ref|YP_004370954.1| TadE family protein [Desulfobacca acetoxidans DSM 11109] gi|328453944|gb|AEB09773.1| TadE family protein [Desulfobacca acetoxidans DSM 11109] Length = 135 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 48/141 (34%), Gaps = 6/141 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 TK + ++ K+ G + A+++P+ L + ++D Y + + Sbjct: 1 MSTKLCRVA-RMRKAQEGAAAVEFAVILPIFLLLMLGIIDFGNLFY---QMHAVNE--AA 54 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 A+ + + ++ + + F + + + + K V V + + Sbjct: 55 RAAARYATTHQNLAGISTDVANFVQTNYGDQFQVSVSPPVSKGNITVTVTTTVTISTPII 114 Query: 123 NPRKSAYQVVLSSRYDLLLNP 143 + + ++ R + L Sbjct: 115 SSFFPSNPYTVTGRCVMGLEN 135 >gi|326571409|gb|EGE21424.1| penicillin-binding protein 1B [Moraxella catarrhalis BC7] Length = 814 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 6/42 (14%), Positives = 14/42 (33%), Gaps = 2/42 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + + +G I+T +L+ V+ + V Sbjct: 21 KSVPRSFWRHQSG--LIVTGILLIVLFVMMAAFVLYAMHLNT 60 >gi|193213114|ref|YP_001999067.1| TadE family protein [Chlorobaculum parvum NCIB 8327] gi|193086591|gb|ACF11867.1| TadE family protein [Chlorobaculum parvum NCIB 8327] Length = 160 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 32/108 (29%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 +S G+ I A ++PV L + +V Y + L A + +V + + Sbjct: 20 QSQKGNALIEFAFVLPVFLVLLFGMVTFSLGIYDKTVLTMATREGARAGAVYDAGNYDSD 79 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 S + ++ N + + I D Sbjct: 80 GDLDVTSVQTKARNATLAVLANKLISFGSKTPTVDNPSITGDILTVTA 127 >gi|327381841|gb|AEA53317.1| PblB, putative [Lactobacillus casei LC2W] gi|327385003|gb|AEA56477.1| PblB, putative [Lactobacillus casei BD-II] Length = 1468 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 44/433 (10%), Positives = 98/433 (22%), Gaps = 24/433 (5%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 VD S ++ A +A+ + S A ++ T + Sbjct: 463 VDKANSSTQLASMSNQTAQAAKSAADSANALATQAVSAASDAKTALATANSALETATDQK 522 Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQ 159 + + + +++ L L++ I Sbjct: 523 TTVATLVTKTDDLAGTIATLATKTDINKLSGEVTAAQTLAQQTADGLQLKADQSVVNTIN 582 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + D + + Y S Q S S Sbjct: 583 GSVNQLRADLKVANDRLSLTMTKNDVTGLLTPYATQSWTQGQITATASQFNAQFSSIS-- 640 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK------KKHL 273 + K + N S L S V S+ K Sbjct: 641 --GKVDALKFGNRNLLLNTSNMINLAHWTRTSWGSATTIVKLSTHPFYHNGRDNLIKLTT 698 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + + D F N V S + G D ++ Sbjct: 699 TDSWDGHLRSDVVAVKPDTDYNFQFLGFHNYNVKSITVYFLGRTDATSPGDAGDNYDRDK 758 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 I+ M + + N E Y+ +G + + + Sbjct: 759 -----IHLLMNAITLSTSGTKRATATFHTNPGETLGYVRFDMNGGYNATDPADMYMVEAE 813 Query: 394 KSQG--IRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFF-EANSTHELNKIFRDRI 445 ++G + + + A + + L ++ + + + + F+ Sbjct: 814 LAEGPTVPVYSPAPEDSADYADAKVASLKVTVDGIQSTVANYQGQTTTALQTLQGFQTTA 873 Query: 446 GNEIFERVIRITK 458 + I ++ T+ Sbjct: 874 TDRINGLKLQQTQ 886 >gi|302539262|ref|ZP_07291604.1| membrane protein [Streptomyces sp. C] gi|302448157|gb|EFL19973.1| membrane protein [Streptomyces sp. C] Length = 141 Score = 37.7 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 30/122 (24%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G AL ++ + G+LVD + AQ A A L + Sbjct: 10 DRGSLSPFYALSAIGIIMIMGLLVDGGGALNATNKATGLAQEAARAAGQQLNVPAAVQGT 69 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 ++YL + + ++ + + +S Sbjct: 70 EITVDPDAAVAAAQDYLATQNVSGSVTVTDGGQRLEVTVHDTYNTLFAQFVGRSTISVSG 129 Query: 138 DL 139 Sbjct: 130 TA 131 >gi|326577376|gb|EGE27260.1| penicillin-binding protein 1B [Moraxella catarrhalis O35E] Length = 814 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 6/42 (14%), Positives = 14/42 (33%), Gaps = 2/42 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + + +G I+T +L+ V+ + V Sbjct: 21 KSVPRSFWRHQSG--LIVTGILLIVLFVMMAAFVLYAMHLNT 60 >gi|326575309|gb|EGE25237.1| penicillin-binding protein 1B [Moraxella catarrhalis CO72] Length = 814 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 6/42 (14%), Positives = 14/42 (33%), Gaps = 2/42 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + + +G I+T +L+ V+ + V Sbjct: 21 KSVPRSFWRHQSG--LIVTGILLIVLFVMMAAFVLYAMHLNT 60 >gi|326561085|gb|EGE11450.1| penicillin-binding protein 1B [Moraxella catarrhalis 7169] Length = 814 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 6/42 (14%), Positives = 14/42 (33%), Gaps = 2/42 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + + +G I+T +L+ V+ + V Sbjct: 21 KSVPRSFWRHQSG--LIVTGILLIVLFVMMAAFVLYAMHLNT 60 >gi|146284359|ref|YP_001174512.1| aerotaxis receptor Aer [Pseudomonas stutzeri A1501] gi|145572564|gb|ABP81670.1| aerotaxis receptor Aer [Pseudomonas stutzeri A1501] Length = 615 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 48/161 (29%), Gaps = 13/161 (8%) Query: 15 IKSCTGHFFIITALLMP-----VMLGVGGMLVDVVRWSYYEHA-LKQAAQTAIITASVPL 68 + G A+ M M+G G D R L A ++ +A+ Sbjct: 311 RRDEYGQIA--FAMKMLETEAGAMVGRIG---DASRQLSQHAQELLGAMDSSTESAARQQ 365 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 ++ + ++ + + + Q + R E + + ++V T + Sbjct: 366 SETDQVATAINQMAVSV--QDVANNAQRTAEAASRADNEAATGTEVVNRTGAAIGQLAQD 423 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSR 169 Q + L + + I+ QT A + Sbjct: 424 IQHAGDVIHQLEAHSNDITKVLDVIRGIAEQTNLLALNAAI 464 >gi|62078605|ref|NP_001013960.1| von Willebrand factor A domain-containing protein 1 precursor [Rattus norvegicus] gi|81884089|sp|Q642A6|VWA1_RAT RecName: Full=von Willebrand factor A domain-containing protein 1; Flags: Precursor gi|51980318|gb|AAH81983.1| Von Willebrand factor A domain containing 1 [Rattus norvegicus] gi|149024826|gb|EDL81323.1| von Willebrand factor A domain containing 1 [Rattus norvegicus] Length = 415 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 34/100 (34%), Gaps = 9/100 (9%) Query: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 F++ + + I + MG T A+ A + + + L K Sbjct: 84 TEFTFDQYSSGQAIQDAVRVAPQRMGDTNTGLALAYAKEQLFAE------EAGARLGVPK 137 Query: 370 YIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNK 409 +V +TDG ++ + + K G+ I ++ Sbjct: 138 VLVWVTDGASSDSVGPPM---QELKDLGVTIFIVSTGRGN 174 >gi|268575744|ref|XP_002642852.1| Hypothetical protein CBG15118 [Caenorhabditis briggsae] Length = 1268 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 41/321 (12%), Positives = 96/321 (29%), Gaps = 30/321 (9%) Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 TF + + + + T S + + + ++ + Sbjct: 516 STFSPPLVTATTTLQTTDTQEPFTFSLDTSTPSASTVTLETSTVSLRRYTVEEKIEVHPH 575 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 + S AE V+ + S++ + D + N Sbjct: 576 TIYTEEPFFSTSSIEWNPTTSAEDVATTTTTS-----------PDSVIIMREDEDYDYTN 624 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 F P T + S+ I+ + P LD + ++ F+ S Sbjct: 625 EFAIPLVTTTTTISTTTNSTEIQPNFVIP------SGQCPTPNQPNLDRNRTDILFLLDS 678 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 S ++ H + + +D V++ +ND + S H + Sbjct: 679 SDNFNEQRFHRAIKLIGETVSKFNNF--GSDGVQVSLVQYNDEPYLEFSLR--KHNCKKH 734 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 ++ A E G + + A++ + + +N ++++TDG++ Sbjct: 735 LLDDIADTEFMTGGSQLTKALEKVSQFAFTKKRGDRPDAEN------VLIIVTDGQSNGR 788 Query: 383 NEEGIAICNKAKSQGIRIMTI 403 +E + AK + ++ I Sbjct: 789 IQEPTRL---AKENNVTVLVI 806 >gi|314927767|gb|EFS91598.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] Length = 322 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 48/137 (35%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ +A+ + + + + IV+L+DG Sbjct: 152 STDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMA------PGGSKDRVPAAIVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 NTQ +A A + + + TIAF + LS+ A + Sbjct: 206 GNTQGGSPLVAA-THAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDVADRTDA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLQEVYKQ 281 >gi|227819317|ref|YP_002823288.1| hypothetical protein NGR_b10820 [Sinorhizobium fredii NGR234] gi|227338316|gb|ACP22535.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 212 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 12/93 (12%), Positives = 27/93 (29%), Gaps = 1/93 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ-TAIITASVP 67 + +L + +G T + P++L V V+ + +A Q A + A Sbjct: 5 RFLTRLRRDESGIALSETLITFPIVLLVFASFVEFGYAMSQWNQTVKALQYGARLAAVSD 64 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 + S + + + Sbjct: 65 PLTSDFDTVLPTEAANPLNNGDATPNDATITST 97 >gi|222080975|ref|YP_002540338.1| hypothetical protein Arad_7188 [Agrobacterium radiobacter K84] gi|221725654|gb|ACM28743.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 145 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 23/56 (41%), Gaps = 4/56 (7%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 +KL +S +G + AL+MP L + +V+ R + A I + Sbjct: 5 LQKLRQSRSGASAVEFALVMPAFLLMLFGIVEFGRLFWT----SHALHETAIATAR 56 >gi|218131125|ref|ZP_03459929.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|317476997|ref|ZP_07936239.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986645|gb|EEC52979.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|316906790|gb|EFV28502.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 342 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 58/220 (26%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + +A ++ + + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLQKAKRLVAQLVDKM-------ENDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ TAI A+ A + + I+++T Sbjct: 150 YISA-KMFLESIDPSLISKQGTAIGAAINLASRSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A A +GI++ + + + Sbjct: 198 DGENHEGGAVEAAK--AAAEKGIQVNVLGVGMPEGAPIPAEGTNDYRRDRDGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + + ++T+ K I V Sbjct: 256 EQMCQEIAKSGNGIYVRVDNTNGAQKAISQEINKMAKADV 295 >gi|156402363|ref|XP_001639560.1| predicted protein [Nematostella vectensis] gi|156226689|gb|EDO47497.1| predicted protein [Nematostella vectensis] Length = 260 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 44/144 (30%), Gaps = 13/144 (9%) Query: 298 GATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 T F + + ++++ + G T A++ + Sbjct: 121 NDTKFALILYAAEPVMVTNFTTQASMLQLLTNVQRLYGKTNTFSALELCRKLFFVKSGSL 180 Query: 358 VHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++L+TDG + + ++ + + K I I +A ++ Sbjct: 181 NR-----------VLLVTDGLSNMNQDKMLYEAFQLKMSNIEIFVVAVGRYNYGADEISS 229 Query: 418 FLSNCASPNSFFEANSTHELNKIF 441 LS+ + + S L +I Sbjct: 230 -LSSIPTT-HIYRVESMRGLVRII 251 >gi|83951471|ref|ZP_00960203.1| hypothetical protein ISM_12950 [Roseovarius nubinhibens ISM] gi|83836477|gb|EAP75774.1| hypothetical protein ISM_12950 [Roseovarius nubinhibens ISM] Length = 188 Score = 37.7 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 52/173 (30%), Gaps = 4/173 (2%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 F T ++ + G F + + + +P++ V+ + H A A Sbjct: 4 FGTYLGRGLRRFWRRDDGSFVVESVIALPLLFLAAM----VIYEFFEVHRFNSARDKASY 59 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM 122 T + L + + V++ ++ I + + ++ R V + V Sbjct: 60 TVADMLSREMGTVNTTYIDNTKSLFDSIVDDNAGSQLRVTAISYDVDTDRYAVYWSEVRG 119 Query: 123 NPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 S + L +S + I+S + A S + Sbjct: 120 TGPMSVLTTSDIATSHATLPLMSDGEHILLIESVSDYQRMFAAGFSEDMEIKT 172 >gi|319791189|ref|YP_004152829.1| hypothetical protein Varpa_0496 [Variovorax paradoxus EPS] gi|315593652|gb|ADU34718.1| hypothetical protein Varpa_0496 [Variovorax paradoxus EPS] Length = 637 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 19/227 (8%), Positives = 60/227 (26%), Gaps = 1/227 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + +G + ++P++ ++ + + + L+ A A + + L Sbjct: 19 RRRQSGQAIVYLVAMLPIIFLSVLVVYNTATAAREKMKLQNTADAATYS-ANVLTARTLN 77 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + +V A++ ++ + Sbjct: 78 YLAYTNRAMAANEASVATLASVQTSMAMLITSAANIQEGLVVAEAMKGIRNLERARLPVV 137 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 L L R+ ++ + E ++ + + S S Sbjct: 138 GPIFALAWLLGNVPRANRLQRTADRIAGFVEPSAKPLQIGADILRAFNQGISASQAVMLL 197 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLY 241 + Q + S + I K + + K+ + Sbjct: 198 GTTAQLPQLVKDVVNSNDPDASVPPTETAIFVIKFVADVGTYLKTYH 244 >gi|325108015|ref|YP_004269083.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324968283|gb|ADY59061.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 710 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 58/193 (30%), Gaps = 16/193 (8%) Query: 24 IITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ------------- 70 ++ A + M+G + VD+ S + L++ A A + A+ +I Sbjct: 2 LLAAAGLVGMIGFVALSVDLGVISLTKTNLQKTADAAALAAAQEIIASVESASSEASSPG 61 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 ++++S + ++ K+ E + + F R + + ++ Sbjct: 62 GPDDINSISISASKAMAAKVAEMNGVYIDPDRDVAFGRRVFNEDTQTFDIDWGDPPYTSG 121 Query: 131 VVLSSRYDLLLNPLSLFLRSMG---IKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR 187 + + G + + A + + I ++DFS Sbjct: 122 ASFNVVQVTCRRDNPDSSKPDGVLELFFAPVNGTKTAAISASAIAFVEARDIVSILDFSG 181 Query: 188 SMLDYQRDSEGQP 200 SM Sbjct: 182 SMNYDSALRSVDT 194 >gi|157375505|ref|YP_001474105.1| hypothetical protein Ssed_2368 [Shewanella sediminis HAW-EB3] gi|157317879|gb|ABV36977.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 152 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 46/141 (32%), Gaps = 8/141 (5%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G + + A++ V L + +++V R Y + L +A++ A A V + + Sbjct: 4 QRGVYAVEFAIVAGVFLMLLFAIIEVGRLMYTYNVLHEASRRAARIAVVCQVTDTD---- 59 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + NL ++ D E + V + + YQ Sbjct: 60 ----IKNMGLFNGIDLIPNLTPANLTISYLDNEGDEATGMAIVLVRAEIANYQHEFLVPG 115 Query: 138 DLLLNPLSLFLRSMGIKSWLI 158 + F + +S + Sbjct: 116 LYRVINSPTFSTYLPRESLGV 136 >gi|114797939|ref|YP_761696.1| TadE-like protein [Hyphomonas neptunium ATCC 15444] gi|114738113|gb|ABI76238.1| TadE-like protein [Hyphomonas neptunium ATCC 15444] Length = 188 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 8/78 (10%), Positives = 20/78 (25%), Gaps = 4/78 (5%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + G + AL+ + L++V + + Sbjct: 8 LAARLRERFSSYASENRGVAAVEFALIAAPFFFLIFGLLEVCMIFI----MAAILDHGVA 63 Query: 63 TASVPLIQSLEEVSSRAK 80 A+ PL + + + Sbjct: 64 NAARPLRTGAAQSAGMSA 81 >gi|326576606|gb|EGE26513.1| penicillin-binding protein 1B [Moraxella catarrhalis 101P30B1] Length = 814 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 6/42 (14%), Positives = 14/42 (33%), Gaps = 2/42 (4%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYY 49 + + +G I+T +L+ V+ + V Sbjct: 21 KSVPRSFWRHQSG--LIVTGILLIVLFVMMAAFVLYAMHLNT 60 >gi|78060311|ref|YP_366886.1| TadE-like protein [Burkholderia sp. 383] gi|77964861|gb|ABB06242.1| TadE-like protein [Burkholderia sp. 383] Length = 177 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 45/168 (26%), Gaps = 8/168 (4%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + AL++ M+ + + + R Y L +A + A SV L S+ Sbjct: 12 RGAVAVEFALVLMPMIMLATGVAEFGRAIYQFETLTKATRDAARYLSVWLPTDSAYPVSQ 71 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFT------DREVRDIVRDTAVEMNPRKSAYQVV 132 A+ + T D + P A Sbjct: 72 AQCLVVYGSTTCGASGTELVPGLTTSMVTICDAQRTTGCSDGSDPSQFANLPTYDANNNA 131 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 S +N + + + G K I +++ + + Sbjct: 132 ASGTATGAINVVEV--KIKGYKYQPIPAFPGLPSITFNNIVTVMRQVS 177 >gi|191637774|ref|YP_001986940.1| hypothetical protein LCABL_09960 [Lactobacillus casei BL23] gi|190712076|emb|CAQ66082.1| Putative uncharacterized protein [Lactobacillus casei BL23] Length = 1468 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 44/433 (10%), Positives = 98/433 (22%), Gaps = 24/433 (5%) Query: 41 VDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFEN 100 VD S ++ A +A+ + S A ++ T + Sbjct: 463 VDKANSSTQLASMSNQTAQAAKSAADSANALATQAVSAASDAKTALATANSALETATDQK 522 Query: 101 NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLF-LRSMGIKSWLIQ 159 + + + +++ L L++ I Sbjct: 523 TTVATLVTKTDDLAGTIATLATKTDINKLSGEVTAAQTLAQQTADGLQLKADQSVVNTIN 582 Query: 160 TKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQN 219 + D + + Y S Q S S Sbjct: 583 GSVNQLRADLKVANDRLSLTMTKNDVTGLLTPYATQSWTQGQITATASQFNAQFSSIS-- 640 Query: 220 GKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK------KKHL 273 + K + N S L S V S+ K Sbjct: 641 --GKVDALKFGNRNLLLNTSNMINLAHWTRTSWGSATTIVKLSTHPFYHNGRDNLIKLTT 698 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + + + D F N V S + G D ++ Sbjct: 699 TDSWDGHLRSDVVAVKPDTDYNFQFLGFHNYNVKSITVYFLGRTDATSPGDAGDNYDRDK 758 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 I+ M + + N E Y+ +G + + + Sbjct: 759 -----IHLLMNAITLSTSGTKRATATFHTNPGETLGYVRFDMNGGYNATDPADMYMVEAE 813 Query: 394 KSQG--IRIMTIAFSVNKTQQEKARYFLSNCASP-----NSFF-EANSTHELNKIFRDRI 445 ++G + + + A + + L ++ + + + + F+ Sbjct: 814 LAEGPTVPVYSPAPEDSADYADAKVASLKVTVDGIQSTVANYQGQTTTALQTLQGFQTTA 873 Query: 446 GNEIFERVIRITK 458 + I ++ T+ Sbjct: 874 TDRINGLKLQQTQ 886 >gi|116249089|ref|YP_764930.1| hypothetical protein pRL120423 [Rhizobium leguminosarum bv. viciae 3841] gi|115253739|emb|CAK12132.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 194 Score = 37.7 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 11/121 (9%), Positives = 35/121 (28%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 ++ L+ + + AL++P+++ + VD+ + + A + S Sbjct: 14 RSRARHLLCDRSAASGVEFALVLPILVMLLFGTVDLGHALTVSRKIDEIASSTGDMISQQ 73 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + +V+ + + L + V + + Sbjct: 74 GSWTKSDVAKLLSGASFILQPYDTTGLTITLAVDDIAKSGSATVNWSAALNTSALTSGSA 133 Query: 128 A 128 + Sbjct: 134 S 134 >gi|190892740|ref|YP_001979282.1| vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] gi|190698019|gb|ACE92104.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] Length = 794 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 49/163 (30%), Gaps = 20/163 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I ++ R F+D + + R T+ + G T + A++ A Sbjct: 381 AISRLSKDDRFNVIRFDDTMTDYFNGLVAASPDNREKAITYVRGLSADGGTEMLPALEDA 440 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 441 ---LRNQGPVASGAL-------RQVVFLTDGAIGNEQQLFQEI--SANRGDARVFTVGIG 488 Query: 407 VNKT------QQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 E R + S + A+ EL ++ Sbjct: 489 SAPNTYFMTKAAEIGRGTFTAIGSTDQV--ASRMGELFAKLQN 529 >gi|328907234|gb|EGG27000.1| aerotolerance protein BatA [Propionibacterium sp. P08] Length = 307 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 48/137 (35%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ +A+ + + + + IV+L+DG Sbjct: 137 STDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMA------PGGSKDRVPAAIVMLSDG 190 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 NTQ +A A + + + TIAF + LS+ A + Sbjct: 191 GNTQGGSPLVAA-THAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDVADRTDA 249 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 250 KSWTADSADKLQEVYKQ 266 >gi|310830511|ref|YP_003965612.1| hypothetical protein PPSC2_p0151 [Paenibacillus polymyxa SC2] gi|309249978|gb|ADO59544.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 187 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 60/153 (39%), Gaps = 2/153 (1%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 K+L+K+ G F I+ +L+ + + V VD++ S+ ++ T+ ++A + Sbjct: 4 LKQLLKNNKGSFSILFVILILIGVMVVTAFVDILIESWSMQEVQSVMDTSGVSALRASVD 63 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA--VEMNPRKSA 128 + + + ++ + ++ K +I T +K+ Sbjct: 64 ETKLRIGEFDVEKSVAVSEYKKLVSMRLKDTKKIKSVHFNPIEIESFTEKWGTGETKKAR 123 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 QV+L S ++++ ++F G ++ Sbjct: 124 PQVLLDSSIVIVVSGSTIFDLVPGTAEVFYNSR 156 >gi|309791117|ref|ZP_07685650.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226815|gb|EFO80510.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 454 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 17/135 (12%), Positives = 43/135 (31%), Gaps = 16/135 (11%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENE---MGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + + R + + +T ++ M+ + + Sbjct: 126 AEQALTLVPSMSGERRAELVAGITRLHSLRLGEATDLSSGMRLGLAELAHAA-------- 177 Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + ++LLTDG T++ + +A+ +A G+ I T+ Q+ L++ Sbjct: 178 -APGTVRRLILLTDG-FTENADACMALARQAAQAGVSISTLGLG--GEFQDDLLTGLAD- 232 Query: 423 ASPNSFFEANSTHEL 437 S ++ Sbjct: 233 VSGGRASFMRRADQI 247 >gi|296161494|ref|ZP_06844300.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295888309|gb|EFG68121.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 328 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 33/125 (26%), Gaps = 12/125 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F + F++ + + I T T ++ + A Sbjct: 130 NDRLAFMMFGTSPVLAMPFTYDLRAIEAAIAGTAVGRGMPD--TQLDRGLLAAIGEFN-- 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ IVL++DG D + I + I + I + Sbjct: 186 --------GRASSGRRAIVLVSDGGARLDTQVRRLIQDGLMRNQIALYFIYLRSGTYSPD 237 Query: 414 KARYF 418 Sbjct: 238 LNAAV 242 >gi|295698036|ref|YP_003602693.1| putative tellurium resistance protein TerY [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060148|gb|ADF64885.1| putative tellurium resistance protein TerY [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 197 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 14/112 (12%), Positives = 38/112 (33%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLTDLLS--------FQMPA 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+T++ +A+ +I + K + + L+TDG D Sbjct: 58 LTASGTTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDD 107 >gi|189465624|ref|ZP_03014409.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224539998|ref|ZP_03680537.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] gi|189437898|gb|EDV06883.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224518388|gb|EEF87493.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] Length = 348 Score = 37.7 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 59/220 (26%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + +A ++ + + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLVAQLVDKM-------ENDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ TAI A+ A + + ++++T Sbjct: 150 YISA-KMFLESINPSLISKQGTAIGAAINLATRSFTPQ-----------EGVGRAVIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK-----------------ARYF 418 DGEN + A A +GI++ + + R Sbjct: 198 DGENHEGGAVEAAK--AAAEKGIQVSVLGVGMPDGAPIPVEGTNDFRRDRDGNVVVTRLN 255 Query: 419 LSNC-----ASPNSFFEANSTHELNKIFRDRIGNEIFERV 453 C A + ++++ K+ I V Sbjct: 256 EQMCQEIAQAGDGIYVRVDNSNAAQKVIAQEINKMAKADV 295 >gi|259416592|ref|ZP_05740512.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348031|gb|EEW59808.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 203 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 29/110 (26%), Gaps = 11/110 (10%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGML---VDVVRWSYYEHALKQAAQTAII 62 K ++ + G+ + AL +P++LGV D+ R Sbjct: 4 KITHKLQEFRRKEDGNIALEAALYLPLLLGVFAATYTLFDLFRQETVNSKAAYTV----- 58 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 LI + + K+ + + D + Sbjct: 59 ---SDLISRETAALNDDYIDSIYTLGKLMARAGSDMSMRVSVIRWDADDD 105 >gi|260834079|ref|XP_002612039.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] gi|229297412|gb|EEN68048.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] Length = 794 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 27/225 (12%), Positives = 64/225 (28%), Gaps = 25/225 (11%) Query: 215 YSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLV 274 ++ G + Y SC ++ + V + V Sbjct: 154 STNNGGCSETCTNIVGGYYCSCRPGYRLTGSGACVEFTTQAVDLVFLIARPRSSVSNADV 213 Query: 275 RDALASVIRSIKKIDNVNDT-VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 + L SV+ ++ V + + + V + Sbjct: 214 KTFLKSVVAALNVSQTAARVAVIEYTSVMHSHFDLPTHLTN-------AQVTSAINSIPA 266 Query: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKA 393 GST+ +A + K +V++TD + A Sbjct: 267 WGSTSYRKT-GSAIKYLTDYLSWRDGIP-------KVLVVITDATSNDYVSGP---AQSA 315 Query: 394 KSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSF-FEANSTHEL 437 K+ G+ + ++ + + E L+ A+ +S+ + +S ++ Sbjct: 316 KNAGLILSSVGVGTSISSTE-----LNTIATNSSYRYTVSSYTDI 355 >gi|226308012|ref|YP_002767972.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] gi|226187129|dbj|BAH35233.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] Length = 551 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 48/144 (33%), Gaps = 22/144 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 G + + + + G T + D++ AY ++ S + ++LL Sbjct: 422 GTDHRSKLLSRIDNLSSIVGGGTGLYDSVLAAYRSMQQSYDPASINS---------VILL 472 Query: 375 TDGENTQDNE-EGIAICNKAKSQG-----IRIMTIAFSVNKTQQEKARYFLSNCAS--PN 426 TDG N + + + + + I+TI + + L ++ Sbjct: 473 TDGANDDPSGIALQELLDTLTREQDPMRPVPIITIGVTDDAD-----TDVLEQISTLTGG 527 Query: 427 SFFEANSTHELNKIFRDRIGNEIF 450 + A + ++ K+F I Sbjct: 528 NSHFAPTPADIPKVFVGAISGRAG 551 >gi|295837154|ref|ZP_06824087.1| membrane protein [Streptomyces sp. SPB74] gi|197699506|gb|EDY46439.1| membrane protein [Streptomyces sp. SPB74] Length = 153 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 11/109 (10%), Positives = 22/109 (20%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ + G L V L + +VD A+ A A+ + + Sbjct: 13 RRSLADDRGSGAGAVILFALVFLSLAAFVVDGGLSISQRERAADLAEQAARYAAQDIDRE 72 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + + V V Sbjct: 73 ALYGNEGEAPINEGNCPARVAAFAAESGMSGADAAASGCVEADADHVEV 121 >gi|163751748|ref|ZP_02158966.1| hypothetical protein KT99_12264 [Shewanella benthica KT99] gi|161328400|gb|EDP99559.1| hypothetical protein KT99_12264 [Shewanella benthica KT99] Length = 152 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 11/114 (9%), Positives = 36/114 (31%), Gaps = 4/114 (3%) Query: 18 CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSS 77 G + + A++ + + +++V R Y + L +A++ A + + + + Sbjct: 4 QRGVYAVEFAIVAGIFFMLMFAIIEVGRLMYTYNVLHEASRRA----ARIAVVCRIDDTD 59 Query: 78 RAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + I N + + + ++ + + V Sbjct: 60 IKTMALFNGANLIPNLTTANLSISYLDELGNAATGIDIDLVRADIANYQHQFLV 113 >gi|159184182|ref|NP_353183.2| hypothetical protein Atu0148 [Agrobacterium tumefaciens str. C58] gi|159139514|gb|AAK85968.2| Atu0148-1 mutant of a conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 198 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 36/124 (29%), Gaps = 4/124 (3%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + +K S G I A+L V +++ E + A +T S Sbjct: 5 RLKPLLEKFGFSRDGTAAIEFAILAIPYFLVVFAIIETFIALMAEQVVANATET----MS 60 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 L S ++ ++ + + + K+ ++R + Sbjct: 61 RRLRTGQISSSITKEDFRRSFCNEVSVMIACSEDEIKKEEKLYVDLRSFTAFKDIPTTIP 120 Query: 126 KSAY 129 AY Sbjct: 121 LKAY 124 >gi|94968893|ref|YP_590941.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94550943|gb|ABF40867.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 628 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 15/84 (17%), Positives = 31/84 (36%), Gaps = 8/84 (9%) Query: 320 IRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGEN 379 + G TA+ DA+ A + + S E K +++++DGE+ Sbjct: 454 DSKQIAHSIQAFAPSGGTALWDAVNFAAEKLASHPE--------RQPVAKILIVISDGED 505 Query: 380 TQDNEEGIAICNKAKSQGIRIMTI 403 +A+S+ + + I Sbjct: 506 NSSATTAKQAIQRAQSEEVAVYAI 529 >gi|118361111|ref|XP_001013786.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295553|gb|EAR93541.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 357 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 37/133 (27%), Gaps = 21/133 (15%) Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + F+++ T+ G+T I+ + A+ Sbjct: 219 KKYQKFNIITFSNQATYWKPDVIDATPENILAAITYINKLGTSGATNISGGLDLAF---- 274 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG------IRIMTIAF 405 + E I LL+DG I + ++I TI+F Sbjct: 275 -----------RSKEVLNTIYLLSDGVPNSGVMTIEGIKKYLTDKNQNRQEKVKINTISF 323 Query: 406 SVNKTQQEKARYF 418 + + ++ R Sbjct: 324 ILGGPENQQERTL 336 >gi|313901108|ref|ZP_07834596.1| hypothetical protein HMPREF9406_1449 [Clostridium sp. HGF2] gi|312954066|gb|EFR35746.1| hypothetical protein HMPREF9406_1449 [Clostridium sp. HGF2] Length = 347 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 21/247 (8%), Positives = 71/247 (28%), Gaps = 6/247 (2%) Query: 14 LIKSCTGHFFI--ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + ++ G + IT +++ ++ + + ++ + + + A +TA + Sbjct: 1 MRRNQKGSALLWAITVIMVLMITVAAALGISYSYYNRSVNN--NSKRQAYLTAKGVIQNI 58 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 +E++ + ++ + N + N I R + + + Sbjct: 59 VEKIELDNSDYIAMIPEEENQSTPLNIDIPEASNIGKVTEAKISRVAVDKDKDIRGKITI 118 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 ++ Y ++ ++ G L + + + + ++I+ + Sbjct: 119 SVTVDYAEQTETVNADMQL-GRTGDLKKWQLLKYYKGQGAEVQENINIKNAKIMMSHLTP 177 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + + K + N + LY G + Sbjct: 178 LYEAACTGNKVMQEYVKSDAEIYERMIAEYGETWE-KYANNGYYSNDRMREYLYTGVYNS 236 Query: 252 SLSEEHF 258 S+ + Sbjct: 237 SIPKFDN 243 >gi|168204310|ref|ZP_02630315.1| von Willebrand factor type A domain protein [Clostridium perfringens E str. JGS1987] gi|170664017|gb|EDT16700.1| von Willebrand factor type A domain protein [Clostridium perfringens E str. JGS1987] Length = 620 Score = 37.7 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 64/207 (30%), Gaps = 15/207 (7%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S +L+ + F + + +G +++ P N + Sbjct: 14 ISLFISVPVLNASGIENKGFDDQFKDIGNESKGRICDYDGTEILKEVSQEPDEY-GNYEI 72 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P + + + + + +++++ S++ + + R+ Sbjct: 73 TLTVKGKPKKVTKPVDILLIMDASNSMYYNMDELKESMNSLVDKVIDNIPNS---RIAVI 129 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFA----IDENEMGSTAINDAMQTAYDTIISSNED 356 F V SF+ + K + MG+T I + A + + Sbjct: 130 AFGTEVEEVFSFNDKNKFTSKEEYKKAIKKSYNNIEGMGNTNIEGTWRRADEIFKNE--- 186 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDN 383 + NN +KK ++ +DG Sbjct: 187 ----LNNNSNSKKDVIFFSDGYPNVSR 209 >gi|308472817|ref|XP_003098635.1| hypothetical protein CRE_04180 [Caenorhabditis remanei] gi|308268235|gb|EFP12188.1| hypothetical protein CRE_04180 [Caenorhabditis remanei] Length = 779 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 48/153 (31%), Gaps = 6/153 (3%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 I + T R+G +N + L FA ST+ Sbjct: 63 TRIGTQASEPQTTRVGLVTYNVNAQQNADL-NKFQSLDDLYNNVFAD--LSSVSTSAQSF 119 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMT 402 + T S E E N KK +++ + + + N+ K+ G+ I+T Sbjct: 120 LSTGLAAAESLFEYENFG-TNRSHYKKVVIVYASSYAAGGEMDPLPVANRLKTSGVNIIT 178 Query: 403 IAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 +A+ + L+ ASP F Sbjct: 179 VAYDQTGDGLLLKQ--LAEIASPRFNFSNTDNE 209 >gi|156409361|ref|XP_001642138.1| predicted protein [Nematostella vectensis] gi|156229279|gb|EDO50075.1| predicted protein [Nematostella vectensis] Length = 989 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 45/386 (11%), Positives = 92/386 (23%), Gaps = 39/386 (10%) Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 A E+ S K + PK + + K ++D T Sbjct: 126 AAQQGHAQHPAVEEKPSLDTKPAEDKPKIEAAKQEATADNKAEPKKEQASGIKDAPEATP 185 Query: 120 VEMNPRKSAYQVVLSSR---YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG 176 + + + +V S + P+ + E + ++ Sbjct: 186 KKGADKATPKEVEASGDEVASEAEAKPVEAAKTESNPAKEDKPAEKAKEDKAGPAKEDKP 245 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + PL+ G D + + G + P Sbjct: 246 SKTSPKVGKPHPKGEILGGESSDPLDIAGDTVDTDAPAEEETTDEEGSGQDYWGP----E 301 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVR 296 S Y P + + A ++ + + ID R Sbjct: 302 EDSAYDYPPRSSCVPCPP-------GPELPISNTAVPAKTAASANVDLVILIDGSRSVER 354 Query: 297 MGATFFNDRVISDPSFSWGV------------------HKLIRTIVKTFAIDENEMGSTA 338 G F + + +K+ + Sbjct: 355 SGVGNFRRAIDFARDLTSSFVVSPRHTRVGLMVYGKRAYKVFGFNDYRDNNRLFTGFNKP 414 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 I + A + +N A+K +VL+TDG+ + + K +G+ Sbjct: 415 IRYPRERAQTATALRSAYRTFFGRNKRSAQKVLVLVTDGKIRDAKAKRQS--QSIKRRGV 472 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCAS 424 +I + + L AS Sbjct: 473 KIYVVG-----AGKYFNIKQLEAMAS 493 >gi|326795600|ref|YP_004313420.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546364|gb|ADZ91584.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 635 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 14/90 (15%), Positives = 28/90 (31%), Gaps = 13/90 (14%) Query: 368 KKYIVLLTDGENTQD----------NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARY 417 ++ +L+TDG + + +GI + T++ + + + Sbjct: 130 DRHWILVTDGMVDVSLDKDVNEASRRRIMTTLTEALEKRGIHLHTVSMTGYTDEDMLKQL 189 Query: 418 FLSNCASPNSFFEANSTHELNKIFRDRIGN 447 + AS E S EL F Sbjct: 190 AVKTNAS---HTEVASPEELLDTFDRIFAQ 216 >gi|307324381|ref|ZP_07603589.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890112|gb|EFN21090.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 249 Score = 37.7 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 53/165 (32%), Gaps = 23/165 (13%) Query: 283 RSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 +++ N++D + FF+ + S ++ + MG T + A Sbjct: 76 QTLALAANLDDDGIVPVVFFSTEIEGTAEISLDAYQDRINPLH---DSMGHMGRTNYHVA 132 Query: 343 MQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIM- 401 MQ D H + + ++V TDG T +C AK + I Sbjct: 133 MQAVID----------HYQSSGADDPAFVVFQTDGSPTSKAAAEHVLCTAAK---LPIFW 179 Query: 402 -TIAFSVNKTQQEKARYFL----SNCASPNSFFEA-NSTHELNKI 440 I F ++ + L FF A + +L+ Sbjct: 180 QFIGFGKDEFRFLHRLDDLPVPNRRIVDNAGFFAAGPTPKDLSDA 224 >gi|297202051|ref|ZP_06919448.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197714313|gb|EDY58347.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 518 Score = 37.7 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 66/190 (34%), Gaps = 36/190 (18%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS----FSWGVHKLIRTIV 324 + ++ ALA + + + F +V S + S L + Sbjct: 355 DRLDRLKTALADLTGDF------REREEVTLMPFGSQVKSVRTHVVKPSDPRAGLD--AI 406 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + + G TAI +++ AYD + +A IVL+TDGENT + Sbjct: 407 RDDTSALSADGDTAIYTSLEKAYDHL-----------GAGRDAFTSIVLMTDGENTAGAK 455 Query: 385 EGIAICNKA----KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELN 438 A K++ + I F + R L++ A + F+A L+ Sbjct: 456 ARDFDAFYARLGRKARDTPVFPILFG------DSDRSELAHIADLTGGRLFDARQGS-LD 508 Query: 439 KIFRDRIGNE 448 F + G + Sbjct: 509 GAFEEIRGYQ 518 >gi|281341943|gb|EFB17527.1| hypothetical protein PANDA_002811 [Ailuropoda melanoleuca] Length = 652 Score = 37.7 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 39/273 (14%), Positives = 75/273 (27%), Gaps = 29/273 (10%) Query: 148 LRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSE----GQPLNC 203 L+ G + + + + + + Q V S + + S P Sbjct: 147 LQLTGATATEVTLTSLPKPSPSTGSTTSSLRPQPVGHRSWELDLWSTGSYASGHNSPHAN 206 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY----------MLYPGPLDPSL 253 P +V + + +G + P V + + G + + Sbjct: 207 PVPPTGPSVTGATPISYTIGREMDLWQPGSVLLDAGFVPKEELSTQSLEPVSQGDPNCKI 266 Query: 254 SEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS 313 +D SS ++ + + LA V +++ D MG + D + S Sbjct: 267 DLSFLIDGSSSIG-KRRFRIQKQFLADVAQTL---DIGPGGPLMGVVQYGDNPAAQFSLR 322 Query: 314 WGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 ++ G + A+ S N V+ Sbjct: 323 THMNSRDLKTAIDKIT--QRGGLSNAGRAISYVTKNFFSKANGNRGGAPN------VAVV 374 Query: 374 LTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + DG T EE + A+ GI I I Sbjct: 375 IVDGWPTDKVEEASRL---ARESGINIFFITVE 404 >gi|308050055|ref|YP_003913621.1| TadE family protein [Ferrimonas balearica DSM 9799] gi|307632245|gb|ADN76547.1| TadE family protein [Ferrimonas balearica DSM 9799] Length = 167 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 56/163 (34%), Gaps = 9/163 (5%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI-ITASVPLIQSL 72 + + G + + A++ V+ + +++ R Y +AL + ++ A + A Sbjct: 1 MKRHQQGAYVVEFAIVATVVFVMLFACLEIARLMYSYNALTEVSRRAARLAAVCAPSPGT 60 Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 E + K F +++ L + T I T V V Sbjct: 61 MEPTDAMKALALFDGRQMVANLSADNLVVDYLTNTGALAASIPDTTQVRAT--------V 112 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEH 175 + +++L++ L + L S ++ + + S E Sbjct: 113 ANYQHELIIPGLFIQLNSPAFRTLMPSESLGQTRYADSNTAET 155 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 43/349 (12%), Positives = 93/349 (26%), Gaps = 54/349 (15%) Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPL 201 + S +G S SY ++ + S D+ L Sbjct: 107 STKSWDESVLGTNSLYDMQLDLEFKADESYQDNRLINYNLTGQYKNSKNKLSIDTAISAL 166 Query: 202 N-CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVD 260 N K + + + +++ + + M Y + + + Sbjct: 167 NTKQVISKVAKGKKGVALAYRNDPIEGQVNTAVSFVFDTSGSMAYGLRNEGKRNSQGKWG 226 Query: 261 SSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + + ++++ ++ +K+I V + F+ + Sbjct: 227 PLDADNPRARMNILKKKANLLVDDLKEI----GNVSVNLVRFSGSASYIQE-DFVELDKD 281 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 +K T D ++ ++ N KY+VLLTDG Sbjct: 282 TGKIKEKIKSLPTSWITNPGDGLRYGLVSLQ-----------RNPAQLKYVVLLTDGIPN 330 Query: 381 QDNEEGIAIC------------NKAKS-------------------QGIR-IMTIAFSVN 408 I + K+ G++ I I FS N Sbjct: 331 AYTGSPDGIGKYDLTANFPTDNKQIKADQPVSLTTEYVGQVAKTFGSGVKRISVIGFSGN 390 Query: 409 KTQQEKARYFLSNCASPNS----FFEANSTHELNKIFRDRIGNEIFERV 453 + + + + + F A + L + F D I +I + + Sbjct: 391 VGEIKDGQKIADQIKTVGNVESTFVIATNEAALEQTFAD-IKKQIQQDL 438 >gi|319425465|gb|ADV53539.1| type IV pili-associated adhesin, PilY [Shewanella putrefaciens 200] Length = 1165 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 50/184 (27%), Gaps = 18/184 (9%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE----MGST 337 + K N+ + +++ S++ K T + D T Sbjct: 346 FGNKDKNGTGNNKIDGYTPNNPPSILTSGSYTTPFKKCPDTAYIIYITDGAPTLDKSADT 405 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 IN + A + + N E L N Q Sbjct: 406 LINGLVANAKNQAANYAAFSFTNAGNKTETSYMPALAAYMYNNDIVIGNKDSLGIDNKQN 465 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA-------SPNS----FFEANSTHELNKIFRDRIG 446 IR+ TI F ++A L A N+ ++ A + +L D + Sbjct: 466 IRVFTIGF---SDGADEAAALLEETAFRGGNPRGSNNISKGYYVAKNGLDLVSALEDALK 522 Query: 447 NEIF 450 + + Sbjct: 523 SILT 526 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 +I I ++D R+G ++ +V + S I T Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTG 102 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 A++ A + S+ E + + V++TDG E A + GI I Sbjct: 103 LAIRYAMNVAFSAEEGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEI 153 Query: 401 MTIAFSVNKTQQEKARYFLSNCASP---NSFFEANSTHEL 437 + L ASP + F S + Sbjct: 154 YAVGV------ARADMTSLRAMASPPFEDHVFLVESFDLI 187 >gi|270308521|ref|YP_003330579.1| hypothetical protein DhcVS_1137 [Dehalococcoides sp. VS] gi|270154413|gb|ACZ62251.1| hypothetical protein DhcVS_1137 [Dehalococcoides sp. VS] Length = 375 Score = 37.7 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 27/276 (9%), Positives = 72/276 (26%), Gaps = 21/276 (7%) Query: 7 FIFYSKKLIKSCTG---HFFIIT----ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 I S++ +S G +I A+++P +L + G + AA Sbjct: 4 LITISRRF-RSQQGQALVVALIFLAVGAIMIPPLLMLMGSALVQGTTFENRTTGLYAADA 62 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTA 119 I A L + + T + + I ++ ++ + I+ + Sbjct: 63 GIEQAIWYLDPENSPLIPGGLPTNTGETRTLPGISIDGRTVSVTLSYLTEDTYQIMSTST 122 Query: 120 VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI 179 + +++ N + L + + +V+ + ++ Sbjct: 123 SPTET------ISVTAVISETFNNYTDILNHVITSQGDYIIQGGQASVTPTTGDNTPIAY 176 Query: 180 Q-------WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 + + S L + ++ + L Sbjct: 177 YEGPWPNINELSTFYYVNTTPYTSATLNLADYPAGVPEILRLGTLDIVATSNAVYTLQGD 236 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + +L M +V S++ + Sbjct: 237 VYITGDTLIGMTGSNFTLDLNGHSIYVKSNTAGNQY 272 >gi|296086006|emb|CBI31447.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 37.3 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 25/259 (9%), Positives = 60/259 (23%), Gaps = 17/259 (6%) Query: 131 VVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS--IQWVIDFSRS 188 V ++ + + LS + +S+S + + I Sbjct: 20 VPATAEEEDSADRLSDAALLSPAMPMTPTPSSGGLRLSKSASRSSKKTCAICLTSMKRGH 79 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + ++ S + ++ S + Sbjct: 80 GQAIFTAECSHSFHFHCITSNVKHGSQICPVCRAKWKEIPFEERNSSNGNAAENNPVRTV 139 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDAL---ASVIRSIKKIDNVNDTVRMGATFFNDR 305 + E S ++ A+ I+ +N Sbjct: 140 EIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNMSNSPLNSHNPRAPVDLVTV 199 Query: 306 VISDPSFSWGVHKLIRTIVK---TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMK 362 + S + L++ + G T I + ++ + E Sbjct: 200 LDISGSMAGTKLALLKRAMGFALQAVNSLVANGGTNIAEGLRKGAKVMEDRKE------- 252 Query: 363 NNLEAKKYIVLLTDGENTQ 381 I+LL+DG++T Sbjct: 253 --RNPVSSIILLSDGQDTY 269 >gi|302521148|ref|ZP_07273490.1| conserved hypothetical protein [Streptomyces sp. SPB78] gi|333025165|ref|ZP_08453229.1| hypothetical protein STTU_2669 [Streptomyces sp. Tu6071] gi|302430043|gb|EFL01859.1| conserved hypothetical protein [Streptomyces sp. SPB78] gi|332745017|gb|EGJ75458.1| hypothetical protein STTU_2669 [Streptomyces sp. Tu6071] Length = 154 Score = 37.3 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 11/118 (9%), Positives = 22/118 (18%), Gaps = 3/118 (2%) Query: 6 KFIFYSK-KL--IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 + + + G L V L + +VD A+ A Sbjct: 5 RLKSLVRERFPARTDDRGSGAGAVILFALVFLSLAAFVVDGGLSISQRERAADLAEQAAR 64 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 A+ + + + + V V Sbjct: 65 YAAQDIDREALYGNEGEAPINAGNCPARVAAFAAESGMSGADAAASGCVEADADHVEV 122 >gi|117921567|ref|YP_870759.1| type IV pilin biogenesis protein [Shewanella sp. ANA-3] gi|117613899|gb|ABK49353.1| type IV pilin biogenesis protein, putative [Shewanella sp. ANA-3] Length = 1168 Score = 37.3 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 50/164 (30%), Gaps = 7/164 (4%) Query: 289 DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYD 348 D + ++ V ++S + D + + N+ + + Sbjct: 339 DTDYGSYVGNRPPYDTSVEKGGAYSSPFKVCTDIAYVIYVTDGSPTVDKSANNDVISLTS 398 Query: 349 TIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVN 408 S + + NL+ Y+ L D + N + Q +R TI FS Sbjct: 399 --TGSKDGDYSSFSKNLDTPSYLPALASYMFNNDLINKLDSSNTEQMQNVRTYTIGFSKG 456 Query: 409 KTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 L+ A +F A ++ EL D + N + Sbjct: 457 AEDAAP---LLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|327539547|gb|EGF26157.1| TadE-like protein [Rhodopirellula baltica WH47] Length = 154 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 41/119 (34%), Gaps = 5/119 (4%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 +K + K++ G A+++P+ L + D R ++ + AA+ Sbjct: 13 IRSKHALHLKRV-----GATATEFAIVLPMFLLLVFACCDFARVIHFRQLVANAARVGAT 67 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 ++ + E R+ ++ ++ + +F D V +T V Sbjct: 68 HGALNRFTAATESDWRSDVVNVMREELAHLTSTDPSDSVIDVHFRDLSSGVRVVETEVT 126 >gi|294674674|ref|YP_003575290.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294471870|gb|ADE81259.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 212 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 16/112 (14%), Positives = 39/112 (34%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ +++ +T + FN + I Sbjct: 21 IEAVKNGVQTLVSTLRSDPYALETAYISIITFNSSAQQIAPLTELASFQPPVID------ 74 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G TA+ +A+ + + +K + + + ++TDGE T D Sbjct: 75 --ASGCTALGEALNLLAQKVDTEIVKTTAEVKGDWKP--IVFIMTDGEPTDD 122 >gi|260800517|ref|XP_002595175.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] gi|229280419|gb|EEN51187.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] Length = 286 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 37/143 (25%), Gaps = 16/143 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 +D + VR+ + + ++ G T A+ Sbjct: 28 FSYLDIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDDIEYM--GGYTLTGKAID 85 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A + R + K V++TDG + D + + GI + + Sbjct: 86 FATNLHFD------LRKGARADVTKIAVVITDGRSYDDVNRP---ARRMRQAGIVTIAVG 136 Query: 405 FSVNKTQQEKARYFLSNCASPNS 427 R L+ A Sbjct: 137 VG-----NNLDRDQLTAIAGDPK 154 >gi|154493525|ref|ZP_02032845.1| hypothetical protein PARMER_02864 [Parabacteroides merdae ATCC 43184] gi|154086735|gb|EDN85780.1| hypothetical protein PARMER_02864 [Parabacteroides merdae ATCC 43184] Length = 1299 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 28/303 (9%), Positives = 58/303 (19%), Gaps = 14/303 (4%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAAQTAIITA 64 + Y ++ +K +G I +L+ +L G VDV + AL A Sbjct: 11 RNKKYKERRMKRNSGLIGWI--VLIVGLLCSVRGGAVDVGSAAALRDALGNAT------- 61 Query: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 + + + + + ++ + E + I Sbjct: 62 ----VSGNVVMLTGDVSLSSTLNITGGTMILDLNGMQISITKNKAEAKCISVTGGTLEIT 117 Query: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 + G T + +Y + Sbjct: 118 GGGFISAQTTGTEWFSDRAAIALSYDGGTVRIYRATFNAIASDGTAYTLDPNNDYTVDNM 177 Query: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 G S N G +P Sbjct: 178 IPAGAYMTNSSDYGSTGLVSSSITVALTNYNVSYNTSGGTTTNPGTPSYTIETPDFTLPT 237 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 + + + + V + + I G T D Sbjct: 238 VTKNGYTFADWTYNGNPVNPTALPTTADRVTSKDMAFGATWTLISYKVVYDVAGGTAIQD 297 Query: 305 RVI 307 + Sbjct: 298 GLY 300 >gi|75676720|ref|YP_319141.1| TadE-like protein [Nitrobacter winogradskyi Nb-255] gi|74421590|gb|ABA05789.1| TadE-like protein [Nitrobacter winogradskyi Nb-255] Length = 146 Score = 37.3 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 13/135 (9%), Positives = 38/135 (28%), Gaps = 1/135 (0%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPV-MLGVGGMLVDVVRWSYYEHALKQAAQTAIITASV 66 + + + A+L+P+ ++ V G++V + + AA+ A + + Sbjct: 9 ASSLPRFARCARAASAVEFAMLLPLFLVLVAGIVVFGAYLTMVHGVQQLAAEAARSSVAG 68 Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 ++ + ++ + V D+ + P Sbjct: 69 LSETERTSLAENYVTTNAGSYPLLQPGHLTMSAATSGGGVFVVTVNYDASDSIIFTLPFV 128 Query: 127 SAYQVVLSSRYDLLL 141 A + + Sbjct: 129 PAPPSTIVRSAAIPF 143 >gi|307265045|ref|ZP_07546606.1| conserved hypothetical protein [Thermoanaerobacter wiegelii Rt8.B1] gi|306920030|gb|EFN50243.1| conserved hypothetical protein [Thermoanaerobacter wiegelii Rt8.B1] Length = 136 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 12/132 (9%), Positives = 30/132 (22%), Gaps = 2/132 (1%) Query: 9 FYSKKLIKSCTGHFFIITALLMPVMLGVGGMLV-DVVRWSYYEHALKQAAQTAIITASVP 67 K +K G ++ L + + + V + R L + + +V Sbjct: 2 RALKDFLKDKRGDAVLLFMLFLMIFSILFMYAVYSISRGVGAREELVKVCDEIALNIAVS 61 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + S + T + V +V Sbjct: 62 AVNMQYAQSGDLAVDTNKAYSLAVNTFKDLGIPVKDVSVTVKNRYIYV-TASVSGEMYGI 120 Query: 128 AYQVVLSSRYDL 139 + + ++ Sbjct: 121 SGDITVTGMAKA 132 >gi|294139943|ref|YP_003555921.1| type IV pilin biogenesis protein [Shewanella violacea DSS12] gi|293326412|dbj|BAJ01143.1| type IV pilin biogenesis protein, putative [Shewanella violacea DSS12] Length = 1227 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 34/332 (10%), Positives = 78/332 (23%), Gaps = 46/332 (13%) Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 +G S LI + + + + + ++ + + + + Sbjct: 228 LGTPSSLINYTVASASSNSKTKRNATDKAALTGFGTGRVITVYTEDYLRWEHGVKNKVTK 287 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 T + K I + +S +S + S ++ Sbjct: 288 TRLELAKDAIKNVILTTPGVDFGLSIFNMNGPGDNQRDGGRIISGIKNMSSKGKIDLLTS 347 Query: 271 KHLVRDALAS------------------VIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + A + + D + S Sbjct: 348 VQSITYAQNTPLCETLYEAYRYFSGQSVLFGDDDSDYISYDRWGRYRGIYKSNRNSPLDP 407 Query: 313 SWG-----VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 L + K + + + T ++ A A ++ K+ Sbjct: 408 DILDAGVYSSPLKKCQDKAYIVYITDGEPT-VDSAANHAIQSLTGG------VDKHTAYP 460 Query: 368 KKYIVLLTDGENTQDNEEGIAICNKAKSQGIR---IMTIAFSVNKTQQEKARYFLSNCA- 423 Y+ ++ N D K G++ TI F A L A Sbjct: 461 VSYLSAMSSWMNNHDVNP--------KEPGVQHVSTFTIGF---SEGAASAASLLKKTAA 509 Query: 424 -SPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 S+F+A + L + + + + Sbjct: 510 VGGGSYFDATNAKLLQGSLQQAVNKILEKNAS 541 >gi|119960552|ref|YP_948617.1| TadE-like family protein [Arthrobacter aurescens TC1] gi|119947411|gb|ABM06322.1| putative TadE-like family protein [Arthrobacter aurescens TC1] Length = 149 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 9/40 (22%), Positives = 19/40 (47%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALK 54 KS +G + AL++P+ L + + + R + +L Sbjct: 19 AKSQSGAVAVEFALVLPIFLVLVLGIFEFGRAFNIQISLS 58 >gi|57233894|ref|YP_182066.1| hypothetical protein DET1355 [Dehalococcoides ethenogenes 195] gi|57224342|gb|AAW39399.1| hypothetical protein DET1355 [Dehalococcoides ethenogenes 195] Length = 375 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 54/204 (26%), Gaps = 13/204 (6%) Query: 11 SKKLIKSCTGHFFIIT-------ALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + +++ G +I AL +P ++ + G + A AA I Sbjct: 7 ISRKLRNQQGQALVIALVFLGIGALTLPPLMTLMGSALVQGTTFENRTAGLYAADAGIEQ 66 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A L + + T + + I ++ + + IV +A Sbjct: 67 AIWYLDPENSPLIPGGLPANTGESRTLPGISIDGRTVSVTLAYLTEDTYQIVSTSASPTE 126 Query: 124 PRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + +++ N + + + I KA V+ + Sbjct: 127 T------ISVTAVVSDTYNNYTDIMNHVMTSKGEITVKAGDYEVNYPSEDNAPIEYYTGA 180 Query: 184 DFSRSMLDYQRDSEGQPLNCFGQP 207 S + L P Sbjct: 181 WPSANELSSFYYVNTTPYASDTLN 204 >gi|83716874|ref|YP_438808.1| hypothetical protein BTH_II0608 [Burkholderia thailandensis E264] gi|167615325|ref|ZP_02383960.1| hypothetical protein BthaB_03492 [Burkholderia thailandensis Bt4] gi|257141890|ref|ZP_05590152.1| hypothetical protein BthaA_22154 [Burkholderia thailandensis E264] gi|83650699|gb|ABC34763.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 598 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 26/266 (9%), Positives = 64/266 (24%), Gaps = 5/266 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G +I A+ + V++ G VD+ Y AL+ A A + A+ + + ++ A Sbjct: 34 GSLAVIAAIAIGVVIAALG-AVDLGNLFYQRRALQSVADLAALAAAQTMDDGCTQPAATA 92 Query: 80 KNSFTFPKQKIEEYLIRNFEN----NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 +++ ++K N TA + + +A QV L+ Sbjct: 93 QSAALGNGFDSAASGQSMTVVCGRWDVKDNAGPSFFAGSASGTAAGSDAQLNAVQVTLTR 152 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + T ++ + + L Sbjct: 153 VVPYYFLGAQRTVSATSTAQATNVGAYSIGTTLAQLQGGVVNALLNGLLGANLNLSVLSY 212 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + + + S + + Sbjct: 213 QGLANARIRIKDLMAAANVGTVNALLNTQTTVPQLANWMLTALSQTSVANADLQTSIGAL 272 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASV 281 + V ++ ++ Sbjct: 273 QTIVSANVPGGQTFTIGSTANSTGIF 298 >gi|331266438|ref|YP_004326068.1| hypothetical protein SOR_1070 [Streptococcus oralis Uo5] gi|326683110|emb|CBZ00728.1| conserved hypothetical protein [Streptococcus oralis Uo5] Length = 863 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 36/325 (11%), Positives = 74/325 (22%), Gaps = 67/325 (20%) Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 + + + + + V D S SM + S T++ + + G + Sbjct: 66 TSKLGTETQTDPLDVVLVADLSGSMQNQDVQSFDGRTISRIDALKNTLRGTNGRKGLIDT 125 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHF---VDSSSLRHVIKKKHLVRDALASV 281 + + ++ D ++ Sbjct: 126 ILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFRPYWPYERMTKYYDGVSPWDDANTI 185 Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 + V + + + ++ T I Sbjct: 186 LGWSNNARAAKTAVYNMSIAGGNSIGTESGI---------------------GTGTNIGA 224 Query: 342 AMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICN-KAKSQGIR 399 + A + S+ AKK ++LL+DG + G I N + I Sbjct: 225 GLTLANQLMGSA----------RSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIE 274 Query: 400 I---------------------------MTIAFSVNKTQQEKARYFL---SNCAS-PNSF 428 +I F + + AS PN Sbjct: 275 TAPQWFWDRLNNNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYIRQHNASIPNEI 334 Query: 429 FEANSTHELNKIFRDRIGNEIFERV 453 AN +L F++ + V Sbjct: 335 LSANDEDQLRDSFKNITDKILPLGV 359 >gi|307327616|ref|ZP_07606801.1| TadE family protein [Streptomyces violaceusniger Tu 4113] gi|306886728|gb|EFN17729.1| TadE family protein [Streptomyces violaceusniger Tu 4113] Length = 151 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 47/166 (28%), Gaps = 21/166 (12%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + ++ L + G + + TA+L PV++G+ G+++ R A +A Sbjct: 1 MTPSRARRLGDALRRDR-GSYAVETAVLAPVLIGILGLMIAFGRI----ADADGAVDSAA 55 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 A+ + +++A+ + + Sbjct: 56 RAAARAASLERDAGTAQAEAQSAATHSLDGD---------------GVTCQTSSVTVDTS 100 Query: 122 MNPRKSAYQVVLSSRYDLLLN-PLSLFLRSMGIKSWLIQTKAEAET 166 + +++ N G K+ + +T Sbjct: 101 GYALDVGEEASVTATISCTANLSDIGLPGLPGSKTLTASWTSPIDT 146 >gi|239934064|ref|ZP_04691017.1| hypothetical protein SghaA1_38052 [Streptomyces ghanaensis ATCC 14672] gi|291442503|ref|ZP_06581893.1| TadE family protein [Streptomyces ghanaensis ATCC 14672] gi|291345398|gb|EFE72354.1| TadE family protein [Streptomyces ghanaensis ATCC 14672] Length = 150 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 6/49 (12%), Positives = 20/49 (40%), Gaps = 1/49 (2%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHAL 53 ++ ++ +L + G I A+++P ++ + + R + Sbjct: 2 SRRARWAGRLRRDE-GSVAIEAAIIVPPLIMFVCLAIAGGRIVTSGAKI 49 >gi|85705209|ref|ZP_01036308.1| hypothetical protein ROS217_17112 [Roseovarius sp. 217] gi|85670082|gb|EAQ24944.1| hypothetical protein ROS217_17112 [Roseovarius sp. 217] Length = 195 Score = 37.3 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 9/90 (10%), Positives = 26/90 (28%), Gaps = 4/90 (4%) Query: 10 YSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI 69 + ++ + G + T ++ P + + Y + + A T + L Sbjct: 9 FLRRFWQRECGTVAMETVVMFPFLFMGLTFSYEYYDMFRY----QSVREKATYTVADMLS 64 Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFE 99 + V+ ++ + N Sbjct: 65 RETAVVNVTYMDNVKVLFDLMTNDTGANQV 94 >gi|260824533|ref|XP_002607222.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] gi|229292568|gb|EEN63232.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] Length = 1897 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 11/58 (18%), Positives = 24/58 (41%), Gaps = 3/58 (5%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 ++ + E+ +K I+L+TDG+ T + K++G + T+ Sbjct: 1667 QSLKAVYEEFTKANGARDGVEKIIILITDGKATDQVRQP---AQYVKNKGAHVFTVGV 1721 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 64/190 (33%), Gaps = 23/190 (12%) Query: 269 KKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT--IV 324 + ++ AL + + + D + F V + + + + + Sbjct: 352 DRLDRLKAALTQLAGADGAATGERFRDREEVTLMPFGSEVKAVRTHTVPEDDPGKALAAI 411 Query: 325 KTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + A G TAI +++ AY + + IVL+TDGENT + Sbjct: 412 RADAKALTADGETAIFSSLRAAYRHL------AERASALGDDRFTSIVLMTDGENTAGDS 465 Query: 385 --EGIAICNKA--KSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--PNSFFEANSTHELN 438 + + + + + I F + + L N AS F+A L+ Sbjct: 466 ADDFESFYRRLPGAQRTTPVFPILFGDSDRGE------LENIASLTGGRLFDATKGS-LD 518 Query: 439 KIFRDRIGNE 448 + F + G + Sbjct: 519 QAFEEIRGYQ 528 >gi|311111709|ref|YP_003982931.1| hypothetical protein HMPREF0733_10039 [Rothia dentocariosa ATCC 17931] gi|310943203|gb|ADP39497.1| conserved hypothetical protein [Rothia dentocariosa ATCC 17931] Length = 156 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 38/117 (32%), Gaps = 1/117 (0%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + + + G + + ++L + + + V H L+ A Sbjct: 1 MVVSYIHAVRSRDGDNDEGSISPLIVGMCALILLIASVTLAVTGAHVQTHRLEDLADAQA 60 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 I+ + + E S ++ +Q +YL + ++ N V+ T Sbjct: 61 ISIARTIEHGFSEYSDKSA-DNARARQLAAKYLEESGASHEFVNLQVDSVQISDDRT 116 >gi|159039503|ref|YP_001538756.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157918338|gb|ABV99765.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 583 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 47/139 (33%), Gaps = 19/139 (13%) Query: 315 GVHKLIRTIVKTFAIDENEM-GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVL 373 G R+ ++ G T + D + AY+ + + IVL Sbjct: 451 GPLSNQRSRLEQALTQIQPTRGDTGLFDTVLAAYEAVQEEWDPGQVNS---------IVL 501 Query: 374 LTDGENTQDN----EEGIAICNKAKSQG--IRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 TDG+N DN ++ +A + K ++++ I + ++ E + Sbjct: 502 FTDGKNDDDNGISQQQLLAELERIKDAERPVQVVLIGIGADVSKAELESIT---KVTGGG 558 Query: 428 FFEANSTHELNKIFRDRIG 446 F ++ IF I Sbjct: 559 SFVTEDPTKIGDIFLKAIA 577 >gi|156933960|ref|YP_001437876.1| hypothetical protein ESA_01786 [Cronobacter sakazakii ATCC BAA-894] gi|156532214|gb|ABU77040.1| hypothetical protein ESA_01786 [Cronobacter sakazakii ATCC BAA-894] Length = 197 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 14/112 (12%), Positives = 38/112 (33%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + +++ ++K+ +T + F+ + + Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLTDLLS--------FQMPA 57 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+T++ +A+ +I + K + + L+TDG D Sbjct: 58 LTASGTTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDGSPNDD 107 >gi|221070081|ref|ZP_03546186.1| membrane protein-like protein [Comamonas testosteroni KF-1] gi|220715104|gb|EED70472.1| membrane protein-like protein [Comamonas testosteroni KF-1] Length = 520 Score = 37.3 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 23/57 (40%), Gaps = 3/57 (5%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITAL-LMPVMLGVGGMLVDVVRWSYYEHALKQAAQ 58 + + ++ +G ++ AL LM ++ + +D+ + L++ A Sbjct: 1 MQSFATGLGRARMRRQSGTVAMLGALWLMIALICLA--TIDIGNVFWQRRELQKIAD 55 >gi|156546338|ref|XP_001606550.1| PREDICTED: similar to ENSANGP00000021218 [Nasonia vitripennis] Length = 1230 Score = 37.3 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 36/243 (14%), Positives = 75/243 (30%), Gaps = 28/243 (11%) Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + P + C +Y+ L VD+S +K + R + +++ ++ Sbjct: 209 SDPHDPDLFDCRTRSWYIEAATSPKDVLI---LVDTSGSMTG-MRKEIARHVVNNILDTL 264 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 D VN T D + + +K + N + + Sbjct: 265 GNNDYVNIVKFSNVTELAVPCFGDTLVQANLANIRE--LKNGISEMNTERIANFSMILTY 322 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE-----------NTQDNEEGIAICNKAK 394 A++ + E N I+L+TDG N N + + Sbjct: 323 AFELLEEFREMRRGACCNQA-----IMLVTDGVPDNYKEIFQRYNWASNPDNPDQAD--- 374 Query: 395 SQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 +RI T + + + + CA+ F ++ E+ + + I V+ Sbjct: 375 -MPVRIFT--YLIGREVADVRDSRWMACANRGYFVHLSTLAEVREQVLNYIAVMARPLVL 431 Query: 455 RIT 457 +T Sbjct: 432 NLT 434 >gi|225619207|ref|YP_002720433.1| hypothetical protein BHWA1_00260 [Brachyspira hyodysenteriae WA1] gi|225214026|gb|ACN82760.1| hypothetical protein BHWA1_00260 [Brachyspira hyodysenteriae WA1] Length = 638 Score = 37.3 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 38/352 (10%), Positives = 95/352 (26%), Gaps = 16/352 (4%) Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + K + + + N N ++ + + K A +++LS Sbjct: 84 DNDIKKYAIDYISNTYHNHYSDNLSENHLQDQKLIFSYASEYNYVEQNLALKKARELLLS 143 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQR 194 + L+LFL M S + + + I + ++ Sbjct: 144 IKNKDADIYLALFLSHMATDSLYYFNDMQNYLKEWKKLGSTNIDMMIGIVAEGNNVNNIY 203 Query: 195 DSEGQPLNCFGQPADR--TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 ++ + +N + K + + D + + Y L + + Sbjct: 204 SNKMKLINYIIEAPSDLGVEKLIADAYENGFLTDISKNTGYLDIYNENNYNLSSVIYNNT 263 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 + +F S + + +R + K V + +N + Sbjct: 264 SNISYFTVSKDIVNTNIMDKYIRTYTGKTLSRDKMTSPV--YYKNFYYIYNPSLDE---- 317 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV 372 + +T I + N + +K K I Sbjct: 318 --------KYEDFADIYFLEFNKNTFIYSIKGNKLQGVYYLNPMFIGNIKYADYKIKEIF 369 Query: 373 LLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 G++++ + I++ + + I + K + K +L + S Sbjct: 370 NFHLGDDSKLDSAKISVIKTTELKNISTNSQFILDGKFDKNKYAEYLDDILS 421 >gi|306922629|gb|ADN07507.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 37/130 (28%), Gaps = 18/130 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ + G+T A++ D + Sbjct: 78 VRFATVQYSDDPQTEFGLDALGSGGDTVRAIRELS--YKGGNTRTGAALRHVSDHVF--- 132 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + K K QG+++ + + Sbjct: 133 ----LPHLTRPGIPKVCILITDG---KSQDLVDPAAQKLKGQGVKLFAVGI------KNA 179 Query: 415 ARYFLSNCAS 424 L AS Sbjct: 180 DPEELKRVAS 189 >gi|306922621|gb|ADN07500.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 37/130 (28%), Gaps = 18/130 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 VR ++D ++ + G+T A++ D + Sbjct: 78 VRFATVQYSDDPQTEFGLDALGSGGDTVRAIRELS--YKGGNTRTGAALRHVSDHVF--- 132 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 K +L+TDG + + K K QG+++ + + Sbjct: 133 ----LPHLTRPGIPKVCILITDG---KSQDLVDPAAQKLKGQGVKLFAVGI------KNA 179 Query: 415 ARYFLSNCAS 424 L AS Sbjct: 180 DPEELKRVAS 189 >gi|226325177|ref|ZP_03800695.1| hypothetical protein COPCOM_02969 [Coprococcus comes ATCC 27758] gi|225206525|gb|EEG88879.1| hypothetical protein COPCOM_02969 [Coprococcus comes ATCC 27758] Length = 171 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 42/154 (27%), Gaps = 1/154 (0%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 S +G I AL +P +L ++ + S A+ AA A +V Sbjct: 10 SESGAAIIEFALGVPFLLIFAMAAMEFGQISAATTAVDNAAHAAARELAVNPSGDASSAK 69 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 A N+ + + + + + + T E N +S Sbjct: 70 EAAVNAASSFFAENMKIETDVSDAEREAYTHRIPDSNGSSYTDRESNVSTRKCTATVSVT 129 Query: 137 YDLLLNPLSLFLRSMGI-KSWLIQTKAEAETVSR 169 + G I++ A + Sbjct: 130 IQPQTVLGDAIYAAGGFGGGMTIESNAVELKDAT 163 >gi|53723210|ref|YP_112195.1| hypothetical protein BPSS2193 [Burkholderia pseudomallei K96243] gi|52213624|emb|CAH39678.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 180 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 14 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 73 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 74 AQCLAVYGSTTCGSTGSELAPGLATSMVVVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 133 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 134 PDPASLAGSMNLV 146 >gi|226510867|gb|ACO59960.1| matrilin-1 [Salmo salar] Length = 108 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 32/107 (29%), Gaps = 11/107 (10%) Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 ++ V + G + + + T A++ D+ + Sbjct: 13 LVQYSSSVKQEFPL--GRYNNKKDLKDAVKKMAYMERGTMTGQALRYLTDSSFAPAGGAR 70 Query: 359 HRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAF 405 + K ++ TDG + + KAK QG ++ + Sbjct: 71 PGVA------KVGIVFTDGRSQDYIGD---AAKKAKEQGFKMFAVGV 108 >gi|324516166|gb|ADY46443.1| C-type lectin domain-containing protein 160 [Ascaris suum] Length = 397 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 56/168 (33%), Gaps = 16/168 (9%) Query: 277 ALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGS 336 A + ++ +R+ F D + H + F ID + + Sbjct: 72 ATLFLPMNVSSESAQGQFIRVALVAFADNAVIVGDL-NKYHNYASLVEGLFTIDYHGGKT 130 Query: 337 TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQ 396 I ++ A + SS I+L + + + AI N+ K Sbjct: 131 LNIEAGLKAASTVLESSRHYAKT----------VILLYSSAYSAGGFADPNAIANQIKES 180 Query: 397 GIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH---ELNKIF 441 G +I+TIAF + L + +SPN F + T ++ F Sbjct: 181 GTKIITIAFRQQPEGTLVKK--LGHLSSPNFAFGSMDTSIIAKITNAF 226 >gi|297583270|ref|YP_003699050.1| binding-protein-dependent transporter inner membrane component [Bacillus selenitireducens MLS10] gi|297141727|gb|ADH98484.1| binding-protein-dependent transport systems inner membrane component [Bacillus selenitireducens MLS10] Length = 518 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 21/236 (8%), Positives = 58/236 (24%), Gaps = 24/236 (10%) Query: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGG------------------MLVDVVRW 46 + F + +K +K F ++ +M+G+ G M +D+ Sbjct: 9 SPFRIFLRKFLKQRLAVFA-FFVVMFIMMVGLFGSHFSPYDPHRPVTAQYPEMGIDIQNI 67 Query: 47 SYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNF 106 + TA ++ ++ + + + + I+ N Sbjct: 68 TSER-----LLFTASLSDGTSQSENELQELTGETEERRVAAVRRTQSGIQVTANANGSTL 122 Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 E + V + S L + + + + +E Sbjct: 123 VFVESGPVRSAVQVLVEGEFPVDDTPALSSLHLTADTENPSPGQTATIDVIARFTDGSEL 182 Query: 167 VSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKV 222 +++ + + F + ++G+V Sbjct: 183 TGLDAVRDYVNETYGEEEEEDDSGFQTGSVDADDDQPFRFVSQTEETLTVDEDGQV 238 >gi|170734865|ref|YP_001773979.1| TadE family protein [Burkholderia cenocepacia MC0-3] gi|169820903|gb|ACA95484.1| TadE family protein [Burkholderia cenocepacia MC0-3] Length = 177 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 44/172 (25%), Gaps = 8/172 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G + AL++ M+ + + + R Y L +A + A SV L Sbjct: 8 RSRTRGVVAVEFALVLMPMIMLATGVAEFGRAIYQYETLTKATRDAARYLSVWLPTDSAY 67 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT------DREVRDIVRDTAVEMNPRKSA 128 S+A+ + T D + P + Sbjct: 68 PVSQAQCLVVYGSTTCGSSGTELVPGLTTSMVTICDAQRTTGCSDASDPSQFANLPTYDS 127 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQ 180 S +N + + + G K I +++ + Sbjct: 128 NNNAASGTATGAINVVEV--KITGYKYQPIPAYPWLPSITFGNIVTVMRQVS 177 >gi|91777212|ref|YP_552420.1| hypothetical protein Bxe_B2925 [Burkholderia xenovorans LB400] gi|91689872|gb|ABE33070.1| hypothetical protein Bxe_B2925 [Burkholderia xenovorans LB400] Length = 278 Score = 37.3 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 14/148 (9%), Positives = 31/148 (20%), Gaps = 9/148 (6%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 T+ +++ S +G ++ PV+L V ++ V + A + Sbjct: 15 RTRLTGNARR--TSQSGQSMTEFIIIAPVLLFVCFGILQFVLLYQAKS----TLDVAALE 68 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN 123 A+ + + S + R Sbjct: 69 AAREGAVNHGSMQSMRNGLARGLAPLYAHQANAEGVAAALASAQTDAARFSSITVLNPTP 128 Query: 124 PRKSAYQ---VVLSSRYDLLLNPLSLFL 148 Y P + Sbjct: 129 AAIQDYSRPRYYADQAATYSEIPNDSLM 156 >gi|293604651|ref|ZP_06687053.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816982|gb|EFF76061.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 3744 Score = 37.3 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 34/336 (10%), Positives = 70/336 (20%), Gaps = 21/336 (6%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 + L S + + I + V Sbjct: 2886 SLAVVQNLPAGATLSESFNVVTGDGTTVQPISVSIVGTNDAPVAADNSANVEIGSSHVFT 2945 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHG---- 176 S S + ++ + ++ + T A ++ Sbjct: 2946 IAEFNFSDGAEGNSLQSVIISRLPTDGTLTLNGNPVTVNTAVSAADIAAGKLVFTPSANG 3005 Query: 177 VSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSC 236 + + F D V + +G Sbjct: 3006 LDTSIGFQVRDNGGTDHGGQNTSGTYNFVLNTDNIVTGENVGSGTGNTPVLNGGSGNDII 3065 Query: 237 NKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVI----------KKKHLVRDALASVIRSIK 286 + + + VD+S + LV+DAL ++ + Sbjct: 3066 LGDKGGTVVTVEPGKNYNIALVVDTSGSMAYKLDGSTNGSGQSRIALVKDALTNLANQLV 3125 Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 D V + F + + + + T + + G T A +A Sbjct: 3126 GHD---GIVNVTLIGFATTAGTPVTL-QNLTSANVQTLLTAITNLSATGGTNYEAAFNSA 3181 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 S + LTDG+ T Sbjct: 3182 VSWFNSQTAAGKSVAAG---YENVTFFLTDGDPTYY 3214 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 37.3 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 47/134 (35%), Gaps = 10/134 (7%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 ++ + D + + G F+ R I Sbjct: 40 KNLKEFLKTIASFFDIKINGSQAGVITFSHRSEHSIKL-NDFSDGDSFEKAVDKIPLM-G 97 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNK 392 +T I+ A++ + + + ++ LEA K ++LLTDG T E+ I ++ Sbjct: 98 STTRIDKALRHSKNVMFTNQN------GGRLEATKLLILLTDGSQTFSAKQEDPSIIADE 151 Query: 393 AKSQGIRIMTIAFS 406 ++ G+ I+ I Sbjct: 152 IRNDGVLIIAIGIG 165 >gi|153867800|ref|ZP_01998040.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] gi|152144880|gb|EDN71960.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] Length = 276 Score = 37.3 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 53/171 (30%), Gaps = 25/171 (14%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 A +V + N+ +R T F+ S + ++ Sbjct: 40 VQAARTVANEL-ATSNMPGQIRFALTVFDSHNNIKTSLNDDINTF-----YGGLNSLTID 93 Query: 335 GSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE--NTQDNEEGIAICNK 392 G + I+ A + + A K +V TDG ++ + + Sbjct: 94 GGSDISMAFAPINQLLSQARP----------HAAKIVVFYTDGYVFHSGKMDAIVNDAEA 143 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFR 442 ++QG++I + + E +S PN N+ ++ FR Sbjct: 144 LRNQGVQIFAV------SPPEDDASAMSLITGYPNRVLRPNNLPDIVNRFR 188 >gi|126442655|ref|YP_001064079.1| TadE family protein [Burkholderia pseudomallei 668] gi|242313326|ref|ZP_04812343.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254262367|ref|ZP_04953232.1| TadE family protein [Burkholderia pseudomallei 1710a] gi|126222146|gb|ABN85651.1| TadE family protein [Burkholderia pseudomallei 668] gi|242136565|gb|EES22968.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254213369|gb|EET02754.1| TadE family protein [Burkholderia pseudomallei 1710a] Length = 180 Score = 37.3 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 14 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 73 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 74 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 133 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 134 PDPASLAGSMNLV 146 >gi|218259405|ref|ZP_03475149.1| hypothetical protein PRABACTJOHN_00806 [Parabacteroides johnsonii DSM 18315] gi|218225134|gb|EEC97784.1| hypothetical protein PRABACTJOHN_00806 [Parabacteroides johnsonii DSM 18315] Length = 1307 Score = 37.3 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 30/305 (9%), Positives = 74/305 (24%), Gaps = 19/305 (6%) Query: 21 HFFIITALLMPVMLGVG-GMLVDVVRWSYYEHALKQAA---QTAIITASVPLIQSLEEVS 76 F I +L+ +L G VDV +AL A +T V L +L Sbjct: 5 SFLIGWIVLIVGLLCSVRGGAVDVNSAEALRNALGNATVDGNVVTLTGIVNLSSTLNITG 64 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMN-------PRKSAY 129 + + + N + +++ +T + Y Sbjct: 65 GTMILDLNGKTIIYDRTGTQENATAINVNGGNIVIKNGTIETYAANGQNNGWNTKGGNGY 124 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWL-------IQTKAEAETVSRSYHKEHGVSIQWV 182 ++ LS+ L+++ K + + + Sbjct: 125 NATAVVFNSGKVSLLSMSLKAIPGKGGSGQNFWGQSNPGNDGSPWVVDAEGNNTLGDVVP 184 Query: 183 IDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYY 242 S + +E + + N G + +P + + + Sbjct: 185 YGAYFSNPSDVKYTEKGEGINQQDAIVSLTQYNIAYNTNGGTTTVQGTP-SYTIETTDFI 243 Query: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 + ++ + + + + V + + + + G Sbjct: 244 LPTVTKNGYTVDWTYNGNPVNPTALPTTADRVASKDMTFVATWTPVSYKVVYDVDGGVAL 303 Query: 303 NDRVI 307 D Sbjct: 304 PDSSY 308 >gi|146305993|ref|YP_001186458.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudomonas mendocina ymp] gi|145574194|gb|ABP83726.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudomonas mendocina ymp] Length = 1227 Score = 37.3 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 25/323 (7%), Positives = 73/323 (22%), Gaps = 26/323 (8%) Query: 73 EEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVV 132 + S ++ + + +L+ N+ D+ A + ++ Sbjct: 97 TTAGLEQQLSTSYTNALLNGFKSTYGGIDLRSNYRFTWSYDLRSTQATAYGYSNTDNRLG 156 Query: 133 LSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + D + + + + I + + + + Sbjct: 157 ENPSADFSASVSRSTNGTSTVTNGNISFAITRSGSTSCSATATISGVTVTATCTAPSSNN 216 Query: 193 QRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPS 252 + Q + + N + +Y Y + Sbjct: 217 YTATLAQRGVPAYYYTYDASLASNCTTANDSCYRLNFVSAAQQQNFANWYSFYRNRALAT 276 Query: 253 LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSF 312 S V + + + D + ++ Sbjct: 277 TSAAALAFYDLSSSVRLSWQGLGNCT-----TFSSTDANSQCKINAFKEYSPTH------ 325 Query: 313 SWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK--- 369 + + + D N T + A++ A + ++ + + + + Sbjct: 326 --------KGQLYAWLQDINFNQGTPLPAALKRAGEFFTTATPWQKNPNGSGNTTQNTYA 377 Query: 370 ----YIVLLTDGENTQDNEEGIA 388 Y +L+TDG T Sbjct: 378 CRASYHILMTDGMWTSTTSAPSN 400 >gi|314923048|gb|EFS86879.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966820|gb|EFT10919.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093260|gb|EFT65236.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103482|gb|EFT75458.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327646|gb|EGE69422.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL103PA1] Length = 322 Score = 37.3 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ A+ + + + + + IV+L+DG Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 +NTQ +A N+A + + + TIAF + LS A + Sbjct: 206 DNTQGGSPLVAA-NRAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSTVADRTHA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 QSWTADSADKLQEVYQQ 281 >gi|308472845|ref|XP_003098649.1| hypothetical protein CRE_04226 [Caenorhabditis remanei] gi|308268249|gb|EFP12202.1| hypothetical protein CRE_04226 [Caenorhabditis remanei] Length = 330 Score = 37.3 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 51/155 (32%), Gaps = 10/155 (6%) Query: 286 KKIDNVNDTVRMGATFFNDRV----ISDPSFSWGVHKLIRTIVKTFAIDENEMGST-AIN 340 D + D F+ ++ + + + V T D N+ S N Sbjct: 1 MNTDRLKDVAANILIVFSSGTRIGSDANEPRTTRLGLVTYNSVATQKADLNQYQSYFDAN 60 Query: 341 DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 + A ++ + E KK +++ + + + I ++ K+ + I Sbjct: 61 SGVFDALSIVVDTTES---VNSTRGHYKKVVIVYASEYKGDEELDPLPIADRLKASDVNI 117 Query: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTH 435 +T+A+ + L+ AS F + + Sbjct: 118 ITVAYQQPGSVGLLNN--LTQIASSGFSFSDDDSE 150 >gi|291223807|ref|XP_002731899.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 859 Score = 37.3 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 37/324 (11%), Positives = 83/324 (25%), Gaps = 14/324 (4%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 + ++ E ++ + + S + + + + + Sbjct: 119 NPEWWHNPYTKQTLPCGKPGEYTHLTPKWITDVDFSTYFWGESA--KVIVHEWGHLRWGV 176 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKS-LYYMLYPGPLDPSLSEEHFVDS 261 P Y +NG+ S VS + S + P L E + Sbjct: 177 FDEYPTSTDETFYFDENGRTEPTRCSESVTGVSLDSSKGWTKCNTDPDSGVLPEPTCLFY 236 Query: 262 SSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIR 321 S + K + + D D N++ + Sbjct: 237 PSEENNPAKASYLYAQYLEGVVDFCHDDPDKDPNARHNQIANNQQNKQCGHRSSWEVMRE 296 Query: 322 TIVKTFAIDENEMGSTA-----INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTD 376 +++ + I +++ IS + V + + L++D Sbjct: 297 HADFAENTPLDDVDTLPLFKVVIARDIRSVLVMDISGSMFVVLDDSSEGTEGGRLFLISD 356 Query: 377 GENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANS--T 434 G + GI I T+A + + LS S++ + S T Sbjct: 357 GMEN-YAPLIKDVIGNISDSGIIIDTLALG---DEADPQLATLSAKTGGRSYYYSESSKT 412 Query: 435 HELNKIFRDRIGNEIFERVIRITK 458 L+ F + + + Sbjct: 413 TALHDSFTSSFTHREATNTATLVQ 436 >gi|126727882|ref|ZP_01743710.1| hypothetical protein RB2150_00477 [Rhodobacterales bacterium HTCC2150] gi|126702823|gb|EBA01928.1| hypothetical protein RB2150_00477 [Rhodobacterales bacterium HTCC2150] Length = 183 Score = 37.3 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 17/159 (10%), Positives = 42/159 (26%), Gaps = 9/159 (5%) Query: 1 MVFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTA 60 M F + + ++ +G + +++P ML M + A++ A Sbjct: 1 MFFKNRLKRF---FVEDQSGVILVELIVMLPAMLFAFYMGFAFFDAYQAKV----ASERA 53 Query: 61 IITASVPLI--QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 T + + + F + F + E I Sbjct: 54 AYTLGDLISRETGTVDSAYIDGMGEIFTYLTDADQDDYWFRVTSLTWSDEDEGHTIDWSD 113 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWL 157 A N + ++ L+ + +++ Sbjct: 114 ATTNNSAMTQSELNSILESIPLMADGDTIMVVETNETYT 152 >gi|328945098|gb|EGG39253.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1087] Length = 471 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 72/240 (30%), Gaps = 19/240 (7%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 +R +G KS + Y VS + ++ S + LN Sbjct: 113 NNDWIIRELGTKSLYDLKMELDFKKNDDYKDNRLVSYELKGKYAGSQNQLSIHTAMSALN 172 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + + + + RD+ + ++ +E+ S+ Sbjct: 173 TKQVF---SKVAKGKRGIALAYRDDPIEGQANVAISFVFDASGSMEFSLDGTEKVNPYSN 229 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + ++R+ ++ ++ I V + FN + K + + Sbjct: 230 NPLKNRSRIDILREKTKKMMADLQPI----GNVSVNLVQFNSHASFVQQNFIELDKGLTS 285 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 I + N +T D ++ ++ +N KY+VLLTDG Sbjct: 286 INSA-IDNLNPEHATNPGDGLRYGMVSLQ-----------SNAAQLKYVVLLTDGVPNSY 333 >gi|315091028|gb|EFT63004.1| TadE-like protein [Propionibacterium acnes HL110PA4] Length = 74 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 25/74 (33%), Gaps = 2/74 (2%) Query: 14 LIKSCTG--HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G + L++PV+L + G++ D R A + AA A + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISATQAAQDAAVAASRAGTDAAATP 60 Query: 72 LEEVSSRAKNSFTF 85 A + Sbjct: 61 QLAGHDGAAVAVQA 74 >gi|254239033|ref|ZP_04932356.1| hypothetical protein PACG_05210 [Pseudomonas aeruginosa C3719] gi|126170964|gb|EAZ56475.1| hypothetical protein PACG_05210 [Pseudomonas aeruginosa C3719] Length = 556 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 42/152 (27%), Gaps = 2/152 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII--TASVPLIQSLEE 74 G I+ A + + L ++VD R + L++ A A + + L Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSS 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + +A A V ++ Sbjct: 67 AQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVHVEVT 126 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 L L G + ++ +A A Sbjct: 127 KSVPGSLVANLGGLFGGGNANVDLRAEAVARR 158 >gi|120599894|ref|YP_964468.1| type IV pilin biogenesis protein [Shewanella sp. W3-18-1] gi|120559987|gb|ABM25914.1| type IV pilin biogenesis protein, putative [Shewanella sp. W3-18-1] Length = 1165 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 50/184 (27%), Gaps = 18/184 (9%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE----MGST 337 + K N+ + +++ S++ K T + D T Sbjct: 346 FGNKDKNGTGNNKIDGYTPNNPPSILTSGSYTTPFKKCPDTAYIIYITDGAPTLDKSADT 405 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 IN + A + + N E L N Q Sbjct: 406 LINGLVANAKNQAANYAAFSFTNAGNKTETSYMPALAAYMYNNDIVIGNKDSLGVDNKQN 465 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA-------SPNS----FFEANSTHELNKIFRDRIG 446 IR+ TI F ++A L A N+ ++ A + +L D + Sbjct: 466 IRVFTIGF---SDGADEAAALLEETAFRGGNPRGSNNISKGYYVAKNGLDLVSALEDALK 522 Query: 447 NEIF 450 + + Sbjct: 523 SILT 526 >gi|313773399|gb|EFS39365.1| TadE-like protein [Propionibacterium acnes HL074PA1] gi|313819148|gb|EFS56862.1| TadE-like protein [Propionibacterium acnes HL046PA2] gi|314984750|gb|EFT28842.1| TadE-like protein [Propionibacterium acnes HL005PA1] Length = 75 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 25/75 (33%), Gaps = 2/75 (2%) Query: 14 LIKSCTG--HFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + + G + L++PV+L + G++ D R A + AA A + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISATQAAQDAAVAASRAGTDAAATP 60 Query: 72 LEEVSSRAKNSFTFP 86 A + Sbjct: 61 QLAGHDGAAVAVQAA 75 >gi|313106815|ref|ZP_07793028.1| putative membrane protein [Pseudomonas aeruginosa 39016] gi|310879530|gb|EFQ38124.1| putative membrane protein [Pseudomonas aeruginosa 39016] Length = 556 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 42/152 (27%), Gaps = 2/152 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII--TASVPLIQSLEE 74 G I+ A + + L ++VD R + L++ A A + + L Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSS 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + +A A V ++ Sbjct: 67 AQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVHVEVT 126 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 L L G + ++ +A A Sbjct: 127 KSVPGSLVANLGGLFGGGNANVDLRAEAVARR 158 >gi|237510209|ref|ZP_04522924.1| TadE family protein [Burkholderia pseudomallei MSHR346] gi|235002414|gb|EEP51838.1| TadE family protein [Burkholderia pseudomallei MSHR346] Length = 189 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 23 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 82 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 83 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 142 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 143 PDPASLAGSMNLV 155 >gi|254428069|ref|ZP_05041776.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196194238|gb|EDX89197.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 657 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 46/164 (28%), Gaps = 18/164 (10%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R T F+ + V R + F G T + A+ ++ Sbjct: 334 DRFNITDFDSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSA--TLSQPAS 391 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 + + ++ +TDG ++ A+ + R+ T+ Sbjct: 392 DG----------YLRQVIFITDGAVGNESGIFRALHQQLGEA--RLFTVGIGSAPNSHFM 439 Query: 415 ARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 R SF N +++ + + ++R + Sbjct: 440 TRAAQ---FGRGSFTYINDQNQVQQGMDTLF-RRLESPLMRNLQ 479 >gi|195439238|ref|XP_002067538.1| GK16139 [Drosophila willistoni] gi|194163623|gb|EDW78524.1| GK16139 [Drosophila willistoni] Length = 1504 Score = 37.3 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 18/292 (6%), Positives = 50/292 (17%), Gaps = 9/292 (3%) Query: 57 AQTAII--TASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDI 114 A +A + S + + + T + + + Sbjct: 104 ADSAALVEAVSFSSTDGVAPAETSSAAIETTTVSAADGPTTTTLSEDTTSDPNGPVDTTT 163 Query: 115 VRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE 174 P + + ++ + + + A+T + Sbjct: 164 PAVVDETTGPAEPSNPQETTAEPGVTDSNTPNEPAGTTYAPGELSPNGPADTTEDPVNPN 223 Query: 175 HGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMV 234 + D + ++ + T + + +P Sbjct: 224 SPQETTPISDDPAGTTNDPWSTDAPGTSVDPDVPAGTTDAPGTPADPEIPAGSTAAP--G 281 Query: 235 SCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV--- 291 + +S++ + + + Sbjct: 282 TPADPNAPAGSTAAPGTPADPNAPAESTAAPGTPADPNAPAGSTDAPGTPADPNAPGETT 341 Query: 292 --NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND 341 T T A GSTA Sbjct: 342 AAPGTPADPNAPVETTAAPGTPVDPNAPAGSTAAPGTPADPNAPAGSTAAPG 393 >gi|255327041|ref|ZP_05368117.1| conserved hypothetical protein [Rothia mucilaginosa ATCC 25296] gi|255296258|gb|EET75599.1| conserved hypothetical protein [Rothia mucilaginosa ATCC 25296] Length = 197 Score = 37.3 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 11/116 (9%), Positives = 31/116 (26%), Gaps = 9/116 (7%) Query: 6 KFIFYSKKLIKS------CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT 59 + S+ L G + ++ ++L +G + V + L+ A Sbjct: 45 RARVQSRALAADPVASNPEEGSIAPLIVGMLALLLLIGSVTVAITGAYLQTQHLQDVADA 104 Query: 60 AIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 + + + ++ + + E L+ + V Sbjct: 105 QANSITR---TMRTPDEASGSAAWEYASAYLAEVQPGQDFQALRLENVSVDSDHSV 157 >gi|312211730|emb|CBX91814.1| hypothetical protein [Leptosphaeria maculans] Length = 713 Score = 37.3 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 60/212 (28%), Gaps = 13/212 (6%) Query: 31 PVMLGVGGMLVDVVR----WSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFP 86 P M+G + R + ++ + + T S + + FP Sbjct: 381 PAMIGSAAATREQTRQADAAYAMKSQMQNNEEAEVKTISPKDALLDYRENDEDSKAPLFP 440 Query: 87 KQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + +Y + + +N T + N + Y S+ L Sbjct: 441 ESGANKYDGQYNGGDQYRNATLSNFDTTSSQSFQRENWQTPQYSQNYSTASAPSLQQSPG 500 Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSR-------SMLDYQRDSEGQ 199 F + S + + + S ++ ++ +D + SM ++E Sbjct: 501 F--NFITPSSVQANLHGLPSAATSQYRSTSSNMSSAVDQTPEFPAHLTSMESSASEAEPP 558 Query: 200 PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + +DR + +S + G Sbjct: 559 SGSQNSALSDRHLTKPASTGAESGTYSCTYHG 590 >gi|288576300|ref|ZP_05978557.2| pilus-associated protein [Neisseria mucosa ATCC 25996] gi|288565840|gb|EFC87400.1| pilus-associated protein [Neisseria mucosa ATCC 25996] Length = 1081 Score = 37.3 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 3/50 (6%) Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEANSTHELNKIFRDRIG 446 ++ T+ F + + + FL+ AS + +F A+ +L F I Sbjct: 301 VQTFTVGFGQGIS--DAGKAFLTRGASQDDWYFNADKPEDLENAFNKIIS 348 >gi|229522391|ref|ZP_04411807.1| Ca2+-binding protein [Vibrio cholerae TM 11079-80] gi|229340376|gb|EEO05382.1| Ca2+-binding protein [Vibrio cholerae TM 11079-80] Length = 3240 Score = 37.3 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 35/382 (9%), Positives = 80/382 (20%), Gaps = 34/382 (8%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ A +F + N + + T M + + Sbjct: 491 NGQFQLVQGADTVASFALDSSVNPVQGLTSNGVAVTLSAPVDDGNGNLTYTAMAGAVTVF 550 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE--------------- 174 + L+S + ++ S + K A Sbjct: 551 TLTLNSDGTYSFTLSAPVDHALNSDSLTLNFKVIATDFDGDSDSIVLPVKINDDKPSFIH 610 Query: 175 ----HGVSIQWVIDFSRSM------LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S + + + + + NG + Sbjct: 611 HQPLSVNEDDLPQGSDGSKESLSASGQFTVVQGADQVVSYQLDSSVNPVQGLTSNGVAVV 670 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 + + + P ++ + + S + V+ Sbjct: 671 LSAPVDDGNGNLTYTATAGGNPIFTLVLNTDGSYTFTLSGPIDHASGSDSKQLNFKVLAV 730 Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 D + + + + + + S L + T T A Sbjct: 731 DFDGDKASVNLPVTIIDDKPTITAVQALSVDEDDLTQGSDTTKESLTATGHFTTTQGADH 790 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 T+ + + T G +T K G I T+ Sbjct: 791 VVSYTLNLTTNPLAGVTSGGQVLTLSDSVDTHGNHTY---------TATKPDGTLIFTLQ 841 Query: 405 FSVNKTQQEKARYFLSNCASPN 426 + N + Q L + AS + Sbjct: 842 LNANGSYQFTLSGPLDHAASSD 863 >gi|15599493|ref|NP_252987.1| TadG [Pseudomonas aeruginosa PAO1] gi|9950519|gb|AAG07685.1|AE004846_2 TadG [Pseudomonas aeruginosa PAO1] Length = 556 Score = 37.3 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 42/152 (27%), Gaps = 2/152 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII--TASVPLIQSLEE 74 G I+ A + + L ++VD R + L++ A A + + L Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSS 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + +A A V ++ Sbjct: 67 AQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAVANEAVHVEVT 126 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 L L G + ++ +A A Sbjct: 127 KSVPGSLVANLGGLFGGGNANVDLRAEAVARR 158 >gi|163731885|ref|ZP_02139332.1| hypothetical protein RLO149_21314 [Roseobacter litoralis Och 149] gi|161395339|gb|EDQ19661.1| hypothetical protein RLO149_21314 [Roseobacter litoralis Och 149] Length = 200 Score = 37.3 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 6/50 (12%), Positives = 15/50 (30%), Gaps = 3/50 (6%) Query: 11 SKKLIKSCTGHFFIITALLMPVML---GVGGMLVDVVRWSYYEHALKQAA 57 ++ G+ I +++P+M + D R + Sbjct: 5 LRRFCDDQQGNVAIEAVIILPIMFWAYLAMFTIFDTYRQYTSQQKAAYTI 54 >gi|270487809|ref|ZP_06204883.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|270336313|gb|EFA47090.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] Length = 207 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 40/112 (35%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++ ++++ +T + F+ + ++ + Sbjct: 21 IEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSARQAVPLTD--------LLNFKLPE 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+TA+ DA+ I + + K + + ++TDG T D Sbjct: 73 LVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRP--LVFIMTDGSPTDD 122 >gi|15599750|ref|NP_253244.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PAO1] gi|9950799|gb|AAG07942.1|AE004869_6 type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PAO1] Length = 1161 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 26/336 (7%), Positives = 78/336 (23%), Gaps = 23/336 (6%) Query: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 + Q + L + + + M + A ++++ + Sbjct: 2 KSVLHQIGKTSLAAALSGAVLLSAQTTHAAALSVSQQPLMLIQGVAPNMLVTLDDSGSMA 61 Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKE--------HGVSIQWVIDFSRSMLDYQR 194 G ++ + + + + D+ Sbjct: 62 FAYAPDSISGYGNYTFFASNSFNPMYFDPNTQYKLPKKLTLVNGQVQIQDYPAPNFSSAW 121 Query: 195 DSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL---DP 251 + V + + + Y S Sbjct: 122 RNGFTRSGSINLSNSYKVTIEYGRGYDKESTIKADAAYYYDFTGSSSCNRTNQACYTRRY 181 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF-----NDRV 306 +E+ ++ + + A S+ + V+ + + + Sbjct: 182 VSTEQRQNFANWYSFYRTRALATQTAANLAFYSLPENARVSWQLLNDSNCNQMGSGSSSG 241 Query: 307 ISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 ++ R + + G T + AM A + + + + + + + Sbjct: 242 NCFSNYLRDFTGQHRVNFFNWLEKLSVNGGTPLRQAMTRAGEFLKKTGVNGPYAYRPGTQ 301 Query: 367 A-------KKYIVLLTDGENTQDNEEGIAICNKAKS 395 Y +L+TDG D+ + A++ Sbjct: 302 TAPEYSCRGSYHILMTDGLWNNDSANVGNADSTARN 337 >gi|332534874|ref|ZP_08410696.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] gi|332035673|gb|EGI72162.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] Length = 676 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 39/359 (10%), Positives = 87/359 (24%), Gaps = 29/359 (8%) Query: 107 TDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLS--LFLRSMGIKSWLIQTKAEA 164 I T + + +V ++++ NPL I++ + Sbjct: 166 YSSGTFAIRFPTTITPRYHAISGEVEVNTQNQAHTNPLPTGWLSPVYSIQNITQNSTQTD 225 Query: 165 ETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGI 224 S + + + + S + D + + + T + Sbjct: 226 NVPSSQFSLNIDIDVGLELVDINS-KYHNVDVQNTAFGQYKIALNETNTVNRDFLLEFKP 284 Query: 225 RDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRS 284 ++ + + Y + + R ++ + Sbjct: 285 LQKEQAQAAFFTQQFENGDRYGLAMLMPPGDHFTQTQRLPREMVFVVDTSGSMHGQSMEQ 344 Query: 285 IKKI-----DNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAI 339 KK ++ F++ V + F G T I Sbjct: 345 AKKALFYALSLLDSDDSFNIIGFDNIVTPMSDKPLIASDFNLRRAERFIYSLEADGGTEI 404 Query: 340 NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIR 399 A+ D + +V LTDG + ++ I +K R Sbjct: 405 QGALNAVLDG------------SEFDGFVRQVVFLTDGSVSNEDALFKNIQSKLGDS--R 450 Query: 400 IMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 + T+ +F+ A SF ST E+ + + + Sbjct: 451 LFTVGIGS-----APNSFFMRRAADIGKGSFTFIGSTSEVQPKMQQLFDKLAHPAITNL 504 >gi|304315800|ref|YP_003850945.1| hypothetical protein Tthe_0286 [Thermoanaerobacterium thermosaccharolyticum DSM 571] gi|302777302|gb|ADL67861.1| conserved hypothetical protein [Thermoanaerobacterium thermosaccharolyticum DSM 571] Length = 185 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 13/136 (9%), Positives = 45/136 (33%), Gaps = 5/136 (3%) Query: 16 KSCTGHFFI-ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA---SVPLIQS 71 + G I + L + + + L++ +++ A ++ + A + P+ + Sbjct: 19 RDRKGISSIFLVMLFLIPFITLSVFLIESRYLYTIKNSADDAVVSSALAALKSADPVDAA 78 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 E + + +++ + + +N + + V + V Y Sbjct: 79 YGEYVLDKDAAKNTFYEYLKKNMKLD-DNLNPMSGSVAASSVHVDEFIVYDPGDYPTYCP 137 Query: 132 VLSSRYDLLLNPLSLF 147 + + ++ + F Sbjct: 138 KGTLIRNTSIHVVISF 153 >gi|225376650|ref|ZP_03753871.1| hypothetical protein ROSEINA2194_02292 [Roseburia inulinivorans DSM 16841] gi|225211533|gb|EEG93887.1| hypothetical protein ROSEINA2194_02292 [Roseburia inulinivorans DSM 16841] Length = 320 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 26/266 (9%), Positives = 79/266 (29%), Gaps = 11/266 (4%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQ---TAIITASVPL 68 ++L K+ G ++ L++ + G +VD+ R Y + A ++T L Sbjct: 2 QRLFKTVKGSITVLVTLILVPTIFFTGFMVDLARIKLYGNQAVMTADNYGETVLTQYDNL 61 Query: 69 IQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSA 128 ++ L + + ++ + + + + + N + ++ + Sbjct: 62 LKELYGLFAVTQDDKALNNLDKLQGYVSSSFDPAQNNISWEHFEEVQDYLGMNTLDGFMP 121 Query: 129 YQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRS 188 YQ + P S + + + + + + + S Sbjct: 122 YQ-DADITLEKEFIPESHLSNHAVLDTQIGDFMKFRIA-------QGLLDDGTELIDNIS 173 Query: 189 MLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGP 248 + + + + +Q + ++D K P + Y + G Sbjct: 174 QIQNKENDGKALDKKLELDNEVEELLKIAQKYYLELKDIKAYPDYIDGINDAYMLCDAGF 233 Query: 249 LDPSLSEEHFVDSSSLRHVIKKKHLV 274 + S+ + + + ++ Sbjct: 234 EEIINSDAYMHYRDYVLADENEINVA 259 >gi|53723508|ref|YP_102950.1| hypothetical protein BMA1283 [Burkholderia mallei ATCC 23344] gi|52426931|gb|AAU47524.1| conserved hypothetical protein [Burkholderia mallei ATCC 23344] Length = 568 Score = 37.3 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 14/153 (9%), Positives = 41/153 (26%), Gaps = 5/153 (3%) Query: 32 VMLGVGGM-LVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKI 90 +++ + + VD+ + L++ A A + + + ++ + + + Sbjct: 1 MLVAIAALGAVDIGNVFFVRRDLQRVADMAALAGAQRM---DDQCAQPNAAAAANARSNG 57 Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 + L D + P +A QV + S + + Sbjct: 58 FDPAAGGNTLALACGRWDTQSNAGPSYFNAAATP-LNAVQVTATQSVPYFFLGPSRTVSA 116 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + T S ++ + Sbjct: 117 TSTAKATNVDQFTIGTTLASVQGGLVNNVLNAL 149 >gi|168208905|ref|ZP_02634530.1| von Willebrand factor type A domain protein [Clostridium perfringens B str. ATCC 3626] gi|170712845|gb|EDT25027.1| von Willebrand factor type A domain protein [Clostridium perfringens B str. ATCC 3626] Length = 620 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 55/196 (28%), Gaps = 14/196 (7%) Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 E + + + K I E N + + P Sbjct: 24 NASGIENKGFDDQFKDIGNESKGRICDYDGTEILKEVSQEPDKDGNYEITLTVKGKPKKV 83 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 + + + + + ++ ++ S++ + + R+ F V S Sbjct: 84 TKPVDILLIMDASNSMYYNMDELKASMNSLVDKVIDNIPNS---RIAVVAFGTEVEEVFS 140 Query: 312 FSWGVHKLIRTIVKTFAID----ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 FS + + K D G+T I + A + + + NN + Sbjct: 141 FSNKNNFTSKEEYKNAIKDSYYYITGRGNTNIEGTWRVADEIFKNE-------LNNNSNS 193 Query: 368 KKYIVLLTDGENTQDN 383 KK ++ +DG Sbjct: 194 KKDVIFFSDGYPNVSR 209 >gi|163815330|ref|ZP_02206705.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] gi|158449304|gb|EDP26299.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] Length = 348 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 21/190 (11%), Positives = 52/190 (27%), Gaps = 21/190 (11%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + D + + ID N SD + + + Sbjct: 144 LSPLTDDYEFIKEQLDMIDKS-------LKVRNSEDDSDLYSTDLSSMYDWLYYEGYITS 196 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 +G D + ++ D + K K ++ TD + + + Sbjct: 197 GTLVG--NEQRGSSLIGDGLAAAACDFSDKDKTRT---KIMIFSTDNDIQGNPVATLDDA 251 Query: 391 NKA-KSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANST---HELNKIFRDR 444 K+ G+ + + K ++ R + + FF + ++ Sbjct: 252 ASICKNNGVTVYGVG---TKEMTDENRESMKKAVESTGGQFFMEEESGTFDQIVTAIEKS 308 Query: 445 IGNEIFERVI 454 N++ ++I Sbjct: 309 SKNQVDAKMI 318 >gi|296141040|ref|YP_003648283.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296029174|gb|ADG79944.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 527 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 38/405 (9%), Positives = 105/405 (25%), Gaps = 28/405 (6%) Query: 61 IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAV 120 ++ + + + +A P + + Sbjct: 134 VLAGASIPEPTSWLDALQAATIRAVPADSPYATAPVTAGVSEAQQPGANRQALTAALAQY 193 Query: 121 EMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH---KEHGV 177 ++ V S++ + + + K + ++ + Sbjct: 194 AQAAKRVDDDPVRSAKAQGGVVVVPEYAYLAAKKDEPGVSAVVPKSGAPRDDLLLTVTAG 253 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + + + + + G+ R + +G P Sbjct: 254 GDRATAAKTGADTLAAAFASEKGIVALGEAGLRGKDLSPAPPDGIGKVAGLPEPNTDELT 313 Query: 238 KSLYYMLYPGPLDPS-LSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN-DTV 295 K+ + + + + + ++ V+ I + V T Sbjct: 314 KAEQAYATLAVPLKALVVVDTSGSMNESAGDTTRIGMLASGFTKVVTQIPDANAVGLWTF 373 Query: 296 RMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE-----MGSTAINDAMQTAYDTI 350 +G+ D P+ + ++ + N G+T + D AY Sbjct: 374 SIGSATRPDWTEVVPTARLDARRGDKSQRQALLDGVNALPRKVGGATGLYDTTLAAYRRA 433 Query: 351 ISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ-DNEEGIAICNKAK-----SQGIRIMTIA 404 + + + ++LLTDG + + + + + ++ + I T+ Sbjct: 434 VENFDPAYSNS---------LILLTDGSDEKPGGMSLDDLVAQLRTLVDPARPVNIHTVG 484 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEI 449 S + R A+ + EA+S ++ F I Sbjct: 485 ISKDADLPALKRI---ADATGGTAQEADSEQQMLTDFVTAIAKRA 526 >gi|300779171|ref|ZP_07089029.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300504681|gb|EFK35821.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 396 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 18/132 (13%) Query: 304 DRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKN 363 + + + + + + +E I DA + Sbjct: 112 NYIRQVTPLTQDLDLVSEKLFALRTNGGSEYCGAVIRDA-------------AANLNWDS 158 Query: 364 NLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA 423 N ++ K I + + Q + +KAK++ I I TI +++E + F N A Sbjct: 159 NDKSMKLIYIAGNEAFDQGKINYREVVSKAKNKNIYINTI---FCGSREEGIQTFWQNGA 215 Query: 424 S--PNSFFEANS 433 S +F +S Sbjct: 216 SLGGGKYFNIDS 227 >gi|146292169|ref|YP_001182593.1| type IV pilin biogenesis protein [Shewanella putrefaciens CN-32] gi|145563859|gb|ABP74794.1| type IV pilin biogenesis protein, putative [Shewanella putrefaciens CN-32] Length = 1165 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 50/184 (27%), Gaps = 18/184 (9%) Query: 282 IRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE----MGST 337 + K N+ + +++ S++ K T + D T Sbjct: 346 FGNKDKNGTGNNKIDGYTPNNPPSILTSGSYTTPFKKCPDTAYIIYITDGAPTLDKSADT 405 Query: 338 AINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG 397 IN + A + + N E L N Q Sbjct: 406 LINGLVANAKNQAANYAAFSFTNAGNKTETSYMPALAAYMYNNDIVIGNKDSLGIDNKQN 465 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCA-------SPNS----FFEANSTHELNKIFRDRIG 446 IR+ TI F ++A L A N+ ++ A + +L D + Sbjct: 466 IRVFTIGF---SDGADEAAALLEETAFRGGNPRGSNNISKGYYVAKNGLDLVSALEDALK 522 Query: 447 NEIF 450 + + Sbjct: 523 SSLT 526 >gi|116052331|ref|YP_792642.1| hypothetical protein PA14_55820 [Pseudomonas aeruginosa UCBPP-PA14] gi|115587552|gb|ABJ13567.1| putative membrane protein [Pseudomonas aeruginosa UCBPP-PA14] Length = 556 Score = 37.3 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 42/152 (27%), Gaps = 2/152 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII--TASVPLIQSLEE 74 G I+ A + + L ++VD R + L++ A A + + L Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSS 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + +A A V ++ Sbjct: 67 AQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVHVEVT 126 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 L L G + ++ +A A Sbjct: 127 KSVPGSLVANLGGLFGGGNANVDLRAEAVARR 158 >gi|320114928|ref|YP_004185087.1| hypothetical protein Thebr_0072 [Thermoanaerobacter brockii subsp. finnii Ako-1] gi|320116988|ref|YP_004187147.1| hypothetical protein Thebr_2261 [Thermoanaerobacter brockii subsp. finnii Ako-1] gi|319928019|gb|ADV78704.1| hypothetical protein Thebr_0072 [Thermoanaerobacter brockii subsp. finnii Ako-1] gi|319930079|gb|ADV80764.1| hypothetical protein Thebr_2261 [Thermoanaerobacter brockii subsp. finnii Ako-1] Length = 139 Score = 36.9 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 10/87 (11%), Positives = 24/87 (27%), Gaps = 3/87 (3%) Query: 3 FDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII 62 D + I ++ +K G +I+ + +L + ++ A Sbjct: 6 SDKQVIKMIRRFLKDNRGASQLISTAALLPILLFIMASI---VNISVVSNMQTLVNEAAF 62 Query: 63 TASVPLIQSLEEVSSRAKNSFTFPKQK 89 + I+S + F Sbjct: 63 EGARIGIKSDTPEQTAQMAVLNFGDGI 89 >gi|255602053|ref|XP_002537805.1| conserved hypothetical protein [Ricinus communis] gi|223515012|gb|EEF24580.1| conserved hypothetical protein [Ricinus communis] Length = 162 Score = 36.9 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 45/146 (30%), Gaps = 6/146 (4%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAA-QTAII--TASVPLIQSLEEV 75 G I L++ +++ + G +V+ R +Y AL +A +A + A I S Sbjct: 15 RGGALIEFGLVVILLVMIVGAIVEFGRVFWYYDALTKATRDSARLISLAPKETIASTASG 74 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 ++ + L R N + + + +QV + Sbjct: 75 DAQDLAILAATNAGVTPRLTRANVVVDCLNAAFQIENCVDGTAPANVRVSIVDFQVTIGG 134 Query: 136 RYDLLLNPL---SLFLRSMGIKSWLI 158 LLL+ + + Sbjct: 135 WIPLLLSNGASTNWQPTLDPHTTMRY 160 >gi|296390996|ref|ZP_06880471.1| hypothetical protein PaerPAb_22709 [Pseudomonas aeruginosa PAb1] Length = 556 Score = 36.9 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 42/152 (27%), Gaps = 2/152 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII--TASVPLIQSLEE 74 G I+ A + + L ++VD R + L++ A A + + L Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSS 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + +A A V ++ Sbjct: 67 AQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVHVEVT 126 Query: 135 SRYDLLLNPLSLFLRSMGIKSWLIQTKAEAET 166 L L G + ++ +A A Sbjct: 127 KSVPGSLVANLGGLFGGGNANVDLRAEAVARR 158 >gi|269468735|gb|EEZ80354.1| hypothetical protein Sup05_1046 [uncultured SUP05 cluster bacterium] Length = 682 Score = 36.9 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 55/177 (31%), Gaps = 22/177 (12%) Query: 276 DALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 A ++I++I ++ T R F+ + + + FA G Sbjct: 351 QATNALIQAINRLKP---TDRFNIIDFDSDFEVLFDTAIPAIDMNKRHGIRFAKHLVASG 407 Query: 336 STAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKS 395 T +A++ A + + +++ + + ++ LTDG+ + E A+ Sbjct: 408 GTEPLEAIKFALLS----------KDEDSDKYLRQVIFLTDGQVGNEKELFRAVQQNIDD 457 Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIF 450 R TI Y ++ A +F E+ + Sbjct: 458 D--RFFTIGIGSAPND-----YLMTKMAEYGKGAFTYIGDIDEVEVKMGELFSKLES 507 >gi|325689280|gb|EGD31286.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK115] Length = 471 Score = 36.9 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 72/240 (30%), Gaps = 19/240 (7%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 +R +G KS + Y VS + ++ S + LN Sbjct: 113 NNDWIIRELGTKSLYDLKMELDFKKNDDYKDNRLVSYELKGKYAGSQNQLSIHTAMSALN 172 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 + + + + RD+ + ++ +E+ S+ Sbjct: 173 TKQVF---SKVAKGKRGIALAYRDDPIEGQANVAISFVFDASGSMEFSLDGTEKVNPYSN 229 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + + ++R+ ++ ++ I V + FN + K + + Sbjct: 230 NPLKNRSRIDILREKTKKMMADLQPI----GNVSVNLVQFNSHASFVQQNFIELDKGLTS 285 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 I + N +T D ++ ++ +N KY+VLLTDG Sbjct: 286 INSA-IDNLNPEHATNPGDGLRYGMVSLQ-----------SNAAQLKYVVLLTDGVPNSY 333 >gi|115351454|ref|YP_773293.1| TadE family protein [Burkholderia ambifaria AMMD] gi|115281442|gb|ABI86959.1| TadE family protein [Burkholderia ambifaria AMMD] Length = 156 Score = 36.9 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 46/164 (28%), Gaps = 12/164 (7%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 Y G + AL+ P+ + +V + +L + A+ Sbjct: 3 RQYRPAACSRERGATAVEFALVFPLFFLILYAIVTFGLIFAVQQSL-------TLAAT-- 53 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + Q + + V+ I + + + Sbjct: 54 --EGARSALNYVYEANGSGSQALTDRASAAKTTAAGLTSWLAHVQ-IPTPVSGACSYDPA 110 Query: 128 AYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 Y V ++ Y +PL L +G+ + T ++ + Sbjct: 111 MYCVTVTVTYPYQAHPLVPSLPLLGLVTPAQLTGTATVQINPAT 154 >gi|219850785|ref|YP_002465217.1| hypothetical protein Mpal_0101 [Methanosphaerula palustris E1-9c] gi|219545044|gb|ACL15494.1| hypothetical protein Mpal_0101 [Methanosphaerula palustris E1-9c] Length = 1314 Score = 36.9 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 27/293 (9%), Positives = 62/293 (21%), Gaps = 6/293 (2%) Query: 56 AAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIV 115 A + + + T + + D + Sbjct: 549 ANDQTVAAGTSGTVNVGYNGEVNFTYLPTTGHHIDTVTYTAEDGSQKQLPIVDNTTSTLP 608 Query: 116 RDTA---VEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 T+ V + + ++ + P S + T + Sbjct: 609 GITSNCTVVATFTVDRFAITVTPPVHGSITPNSTQMVDYNDTPTFTFTPEAGYHLGTVTV 668 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPY 232 V+ + + ++ F A + NG V + + Sbjct: 669 DGTSVTPAGNTYTFPPVTGPHTLAATFVVDTFTINASAGDHGTINPNGAVPANYGESKDF 728 Query: 233 MVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVN 292 ++ N + P I+ H + + A +I + Sbjct: 729 TITANTGYHINSVTVDSTPQTIPAGNTSYLYTFTNIQASHTINASFAINTYTINATAGDH 788 Query: 293 DTVRMGATF---FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDA 342 T+ T + D + + G T+ T I Sbjct: 789 GTITPNGTQTANYGDSRTFTITPATGYTIADVTVDGVSQGAVTTYALTNIQGD 841 >gi|90021154|ref|YP_526981.1| hypothetical protein Sde_1507 [Saccharophagus degradans 2-40] gi|89950754|gb|ABD80769.1| putative polysaccharide lyase [Saccharophagus degradans 2-40] Length = 873 Score = 36.9 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 26/309 (8%), Positives = 67/309 (21%), Gaps = 6/309 (1%) Query: 34 LGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 +G + ++ + + + S + Y Sbjct: 74 MGAVEIAFYKGDQRSATFEVQSSLD-----GASWDVNLNTFSSGSTLAMEPYTLNVQARY 128 Query: 94 LIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGI 153 N +T + V Q+ + S + G Sbjct: 129 FRIVGYGNTSNTWTSITEVNFVEGDGPIPPEPPVEGQLDVHSVVASSDDGNVASNTLDGS 188 Query: 154 KSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVK 213 S +T++ + V + + D + P Sbjct: 189 LSTRWSANGNGQTITYNLGSMQLVKSVSMAFYKGDQRSATFDIQVSPDGSNFSTVFSGES 248 Query: 214 SYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHL 273 S ++ + + ++ + + + + + Sbjct: 249 SGNTLALEEFDIPNMPAQFVRIVGYGNSANSWNSITEVEIYGGGDISNVPPVASFDIDDE 308 Query: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKL-IRTIVKTFAIDEN 332 V +A V D ++ + F D + + + IV D+ Sbjct: 309 VSIDVALVFDGSNSFDPDGGSLASYSWDFGDGSDATGAVVNHTYSAGGSYIVSLTVTDDE 368 Query: 333 EMGSTAIND 341 +TA + Sbjct: 369 GDSATATHA 377 >gi|167562908|ref|ZP_02355824.1| hypothetical protein BoklE_10135 [Burkholderia oklahomensis EO147] Length = 602 Score = 36.9 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 48/172 (27%), Gaps = 5/172 (2%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I G F ++ A+ M V + G VD+ + L++ A A + Sbjct: 17 RRTIARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFARRDLQRVADM---AALAAAQKM 72 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + A + + + L D + + +A QV Sbjct: 73 DDQCTQPAAAANANARSNGFDPAKSGNTLTLSCGRWDTQSNAGPSYFNTA-STPLNAVQV 131 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + S + + + T S ++ + Sbjct: 132 TATQSVPYFFLGPSRTVSATSTAKATNIDQFTVGTTLASLQGGLVNNVLNAL 183 >gi|298488978|ref|ZP_07007001.1| Probable transmembrane protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298156476|gb|EFH97573.1| Probable transmembrane protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 677 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 51/173 (29%), Gaps = 5/173 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ AL + + L ++VD R + +L++ A A + A+ Sbjct: 13 RQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAA-----GRRGTC 67 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 S A + F Q + ++ V + A + + + Sbjct: 68 SGAATAPGFATQSATRNGFTPNTGGRTLTILCGTLTVGLQSQRVFVADSTQALAIQVVAA 127 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + + + A V+ S ++I+ S Sbjct: 128 HPVPRSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDST 180 >gi|221113031|ref|XP_002168311.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 928 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 21/218 (9%), Positives = 55/218 (25%), Gaps = 19/218 (8%) Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL-YYMLYPGPLDPSLSEEHFVDSSSLRHVI 268 + + L +M D S Sbjct: 141 VVYSDTAELKFNFDLLPFDLIAFMAETRGFNSTLPSNLTICRVPADVAFLFDGSLGNENK 200 Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS--DPSFSWGVHKLIRTIVKT 326 +K + + + GA ++D + ++ V ++ Sbjct: 201 EKY--FNRYIEFAKQVVDYHPPSMQGYHYGAVVYSDTAELKFNFDLAYDVSGVLEGFEDI 258 Query: 327 FAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEG 386 + + ++ +Q + S D + +++LTD ++T+ Sbjct: 259 KRTFTSFKEGSRVDLGLQAIKELFDKSGRD---------WVSQVLIVLTDSKSTEG---V 306 Query: 387 IAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 I + + I I + + + + + AS Sbjct: 307 EVIAQSLRDSTVSIFVIGLANKDSHSDIGQ--MEKIAS 342 >gi|167006295|ref|YP_001661518.1| membrane protein [Streptomyces sp. HK1] gi|166162377|gb|ABY83498.1| membrane protein [Streptomyces sp. HK1] Length = 148 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 41/126 (32%), Gaps = 2/126 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G + A+ +L V G+LVD + AQ A TA L + + Sbjct: 19 GSMSLFFAVTTVAILMVMGLLVDGGGALNAANRATSIAQEAARTAGQQLDPAQAIEGTAI 78 Query: 80 KNSFTFPKQKIEEYLI-RNFENNLKKNFTDREVRDIVRDTAVEMNPRKS-AYQVVLSSRY 137 ++YL + N++ + + V DT + + + ++ Sbjct: 79 TIDPDAATGAAQDYLATADVTGNVEITDGGQSITVTVHDTYNTLFAQFVGKGTIHVTGTA 138 Query: 138 DLLLNP 143 L+ Sbjct: 139 TAQLHT 144 >gi|163748341|ref|ZP_02155615.1| hypothetical protein OIHEL45_20501 [Oceanibulbus indolifex HEL-45] gi|161378387|gb|EDQ02882.1| hypothetical protein OIHEL45_20501 [Oceanibulbus indolifex HEL-45] Length = 73 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 9/65 (13%), Positives = 21/65 (32%), Gaps = 4/65 (6%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQT-AIITASVPLIQS 71 K +G I+ +L+P+++ ++ A A+I S+ Sbjct: 10 SFFKDESGSQTIVFVILLPLLVWSILAMLAFTDMFRVRA---IATDATAVIADSLSRQTM 66 Query: 72 LEEVS 76 + Sbjct: 67 PIDAD 71 >gi|302559122|ref|ZP_07311464.1| membrane protein [Streptomyces griseoflavus Tu4000] gi|302476740|gb|EFL39833.1| membrane protein [Streptomyces griseoflavus Tu4000] Length = 154 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 8/65 (12%), Positives = 18/65 (27%) Query: 4 DTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 T+ ++ G + V L + ++D A+ A Sbjct: 5 RTRTWLSERRGRLDDRGSGAGAVIIFALVFLSLAAFVIDGGMSISKRERAADIAEQAARY 64 Query: 64 ASVPL 68 A+ + Sbjct: 65 AAQDI 69 >gi|170087774|ref|XP_001875110.1| predicted protein [Laccaria bicolor S238N-H82] gi|164650310|gb|EDR14551.1| predicted protein [Laccaria bicolor S238N-H82] Length = 1136 Score = 36.9 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 33/302 (10%), Positives = 84/302 (27%), Gaps = 12/302 (3%) Query: 159 QTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQ 218 KA + V ++ P T + Sbjct: 536 SAKAMRMLSGSTQTTVSTVDSLLNGSSRSTLSSSTSHGSMMPSRTQIYNNKLTTLGEDDE 595 Query: 219 NGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDAL 278 +SPY S + L P+D L S+ K +++ L Sbjct: 596 LSSYESSTNLVSPYTSSGPSNSLTPLPHPPVDLILIISLPPPSAVPSTAQLKIRVIKTTL 655 Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 ++ S+ K D ++ N + F R + F + Sbjct: 656 DFLVASLAKKDRLSLVTFEVG---NAGRVRKTPFLNLGKAQSRQRLDKFIDELGLRLEEN 712 Query: 339 INDAMQTA-----YDTIISSNED-EVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 ++ + D + + N +V + ++L++D ++ + + + Sbjct: 713 QDEFLVRGSKDEKTDVVTAVNHGLDVVLQRKARNPISGMILVSDASDSTRRAQMDLVLAR 772 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFER 452 A++ + I + + + L + + ++ ++L +G + Sbjct: 773 AEAANVPIHSFGYGRSHDPASLW---LMSNHTSGTYTFVKDWYDLRDCVAGCVGGMMSIG 829 Query: 453 VI 454 ++ Sbjct: 830 LL 831 >gi|110833328|ref|YP_692187.1| pilin biogenesis-like protein [Alcanivorax borkumensis SK2] gi|110646439|emb|CAL15915.1| pilin biogenesis related protein [Alcanivorax borkumensis SK2] Length = 1009 Score = 36.9 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 34/110 (30%), Gaps = 17/110 (15%) Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI----CNKA------KSQG 397 I+ + R N +IVL TDGE N C +A + Sbjct: 269 KKIVDNGNYVSPRNTANECESNHIVLFTDGEANDVNLPCGGGSSYDCQRAISDYLDRDFE 328 Query: 398 IRIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRI 445 I+ + + + + + +S + + A+ L F D Sbjct: 329 IKTYNVGLHMEDNRAD-----METVSSDGADGTYTASDAESLASAFLDIF 373 >gi|170690099|ref|ZP_02881266.1| von Willebrand factor type A [Burkholderia graminis C4D1M] gi|170144534|gb|EDT12695.1| von Willebrand factor type A [Burkholderia graminis C4D1M] Length = 329 Score = 36.9 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 33/125 (26%), Gaps = 12/125 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F + F++ + I T T ++ + TA Sbjct: 131 NDRLAFMMFGTSPLLAMPFTYDHRAIDAAIAGTSVGRGMPD--TQLDLGLLTAIGEFNQQ 188 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 + + IVL++DG + I + I + I + + Sbjct: 189 HSSSR----------RAIVLVSDGGAKLNARVRQLIEDGLLRNDIALYFIYLRSSIYSPD 238 Query: 414 KARYF 418 Sbjct: 239 LNAKV 243 >gi|327330197|gb|EGE71946.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL097PA1] Length = 322 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ A+ + + + + + IV+L+DG Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 +NTQ +A N+A + + + TIAF + LS A + Sbjct: 206 DNTQGGSPLVAA-NRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTVADRTHA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLQEVYQQ 281 >gi|323693762|ref|ZP_08107958.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] gi|323502183|gb|EGB18049.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] Length = 1560 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 39/376 (10%), Positives = 80/376 (21%), Gaps = 42/376 (11%) Query: 112 RDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSY 171 V + + N +V L R ++ S + Sbjct: 498 NGDVEVYSSDTNGVDIPSEVYPVVLVKNKKRALLRMARISAKQADASAGMMITTYGSLNA 557 Query: 172 HKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + + D L G T+ ++ V + ++ Sbjct: 558 VSTVDGELNPQVAKRAWKDYTADDEYDVTLEVTGDSIQTTIGGGTADIVFVIDKSSSMNQ 617 Query: 232 YMVSCNKSLYYML--YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKID 289 + + + +L L S + Sbjct: 618 WDYDLGGNRWTILKETVDRFINKLKITSPNSKISFIEYQSSDLTNYTRTGTYDEIRNVAM 677 Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + ++ + + ++ K + T A Sbjct: 678 ANTENADKNLEGYSLKYFRTLQRWTAISEVGS---KPYGSAPGGNQGTHSAGGYLGAERA 734 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD-------NEEGIAICNKAKSQGIR--- 399 + + +N KY++ L DG + G A+ I Sbjct: 735 LDRLKKYNPEEYNSN---VKYVIYLADGTAGFYVDSYGYRDGAGSGGNTNARRAAITQSG 791 Query: 400 ----------IMTIAFSVNKTQQEKARYFLSNCASPNS-----------FFEANSTHELN 438 I T+AF + + + A + F+ A +T EL Sbjct: 792 ELKKKHPDATIYTVAFGSDSSANMNW---MKPGAYNGNITNPYNPNVTAFYSATNTEELE 848 Query: 439 KIFRDRIGNEIFERVI 454 K F V Sbjct: 849 KTFDSLAEQVGSSAVT 864 >gi|261822923|ref|YP_003261029.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606936|gb|ACX89422.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 212 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 16/112 (14%), Positives = 40/112 (35%), Gaps = 10/112 (8%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 V++ + ++I ++++ +T + F+ + ++ D Sbjct: 21 MEAVKNGVQTLITTLRQDPYALETAYVSVITFDSSARQVVPLTDLINFKSP--------D 72 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 G+TA+ +A+ I + K + + ++TDG T D Sbjct: 73 LVANGTTALGEALSLVAQAIEREVQKTTAETKGDWRP--LVFIMTDGAPTDD 122 >gi|50842461|ref|YP_055688.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427106|ref|ZP_06428822.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130538|ref|YP_003581201.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840063|gb|AAT82730.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159575|gb|EFD07763.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375881|gb|ADD99735.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|313764513|gb|EFS35877.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772104|gb|EFS38070.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313792200|gb|EFS40301.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|313801849|gb|EFS43083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807458|gb|EFS45945.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809968|gb|EFS47689.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313813000|gb|EFS50714.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816054|gb|EFS53768.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818503|gb|EFS56217.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820269|gb|EFS57983.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822922|gb|EFS60636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825146|gb|EFS62860.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827717|gb|EFS65431.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830297|gb|EFS68011.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833671|gb|EFS71385.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|313838673|gb|EFS76387.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] gi|314915507|gb|EFS79338.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314918208|gb|EFS82039.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] gi|314920023|gb|EFS83854.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925156|gb|EFS88987.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932037|gb|EFS95868.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955907|gb|EFT00307.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958219|gb|EFT02322.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960060|gb|EFT04162.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962859|gb|EFT06959.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967773|gb|EFT11872.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973302|gb|EFT17398.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975980|gb|EFT20075.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978483|gb|EFT22577.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314984001|gb|EFT28093.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] gi|314989988|gb|EFT34079.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078074|gb|EFT50125.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315080702|gb|EFT52678.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] gi|315084374|gb|EFT56350.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085715|gb|EFT57691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088865|gb|EFT60841.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096217|gb|EFT68193.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098475|gb|EFT70451.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101165|gb|EFT73141.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315108386|gb|EFT80362.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326129|gb|EGE67919.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA2] gi|327331996|gb|EGE73733.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA3] gi|327443198|gb|EGE89852.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445983|gb|EGE92637.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448037|gb|EGE94691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450841|gb|EGE97495.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453082|gb|EGE99736.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453813|gb|EGF00468.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753529|gb|EGF67145.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754260|gb|EGF67876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754489|gb|EGF68105.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760649|gb|EGF74216.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL099PA1] gi|332675378|gb|AEE72194.1| hypothetical protein PAZ_c10190 [Propionibacterium acnes 266] Length = 322 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ A+ + + + + + IV+L+DG Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 +NTQ +A N+A + + + TIAF + LS A + Sbjct: 206 DNTQGGSPLVAA-NRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTVADRTHA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLQEVYQQ 281 >gi|117919536|ref|YP_868728.1| undecaprenol kinase., rRNA (guanine-N(1)-)-methyltransferase [Shewanella sp. ANA-3] gi|117611868|gb|ABK47322.1| Undecaprenol kinase., rRNA (guanine-N(1)-)-methyltransferase [Shewanella sp. ANA-3] Length = 1223 Score = 36.9 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 38/134 (28%), Gaps = 36/134 (26%) Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---------------- 397 + + + + YI+ +TDGE T DN + K G Sbjct: 403 GTNNYNSPFKRCQNEAYIIYITDGEPTLDNAADGLVEQLTKGVGKHTSSPVSYLSALSGW 462 Query: 398 ---------------IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 +R TI F A + L A +F+A +L Sbjct: 463 MRNEDVNPNSAGKQTVRTFTIGF---SEGAASAEHLLKQTAENGGGKYFDATDASKLRSS 519 Query: 441 FRDRIGNEIFERVI 454 + + N + + Sbjct: 520 LQTALNNILEKNAS 533 >gi|289425109|ref|ZP_06426886.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154087|gb|EFD02775.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] Length = 322 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ A+ + + + + + IV+L+DG Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 +NTQ +A N+A + + + TIAF + LS A + Sbjct: 206 DNTQGGSPLVAA-NRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTVADRTHA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLQEVYQQ 281 >gi|134281853|ref|ZP_01768560.1| TadE family protein [Burkholderia pseudomallei 305] gi|134246915|gb|EBA47002.1| TadE family protein [Burkholderia pseudomallei 305] Length = 168 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 61 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 62 AQCLAVYGSTTCGSTGSELAPGLATSMVVVCDAAHTPDCSDSSDPAQFANVPTYDTNNGS 121 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 122 PDPASLAGSMNLV 134 >gi|29346316|ref|NP_809819.1| hypothetical protein BT_0906 [Bacteroides thetaiotaomicron VPI-5482] gi|253568263|ref|ZP_04845674.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385670|ref|ZP_06995228.1| BatB protein [Bacteroides sp. 1_1_14] gi|29338211|gb|AAO76013.1| BatB, conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] gi|251842336|gb|EES70416.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261811|gb|EFI04677.1| BatB protein [Bacteroides sp. 1_1_14] Length = 342 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 21/220 (9%), Positives = 58/220 (26%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ + TAI +A+ A + + I+++T Sbjct: 150 YISA-KMFLESISPSLISKQGTAIGEAINLATRSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A A +GI++ + + + Sbjct: 198 DGENHEGGAVEAAK--AAAEKGIQVSVLGVGMPEGAPIPVEGTNDYRRDREGNVIVTRLN 255 Query: 416 RYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 A + ++++ K I V Sbjct: 256 EGMCQEIAKDGKGIYVRVDNSNSAQKAISQEISKMAKSDV 295 >gi|53719502|ref|YP_108488.1| hypothetical protein BPSL1888 [Burkholderia pseudomallei K96243] gi|52209916|emb|CAH35888.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 568 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 10/123 (8%), Positives = 30/123 (24%), Gaps = 14/123 (11%) Query: 32 VMLGVGGM-LVDVVRWSYYEHALKQAAQTAIITASV--PLIQSLEEVSSRAKNSFTFPKQ 88 +++ + + VD+ + L++ A A + + + ++ A Sbjct: 1 MLVAIAALGAVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARSNGFDP 60 Query: 89 KIE-----------EYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRY 137 + + T + +V + V +S Sbjct: 61 AAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFLGPSRTVSATSTA 120 Query: 138 DLL 140 Sbjct: 121 KAT 123 >gi|326506938|dbj|BAJ91510.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 378 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 17/85 (20%), Positives = 29/85 (34%), Gaps = 7/85 (8%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFA 328 K L++ A+ VI+ + D R+ F+ V H + ++ Sbjct: 282 TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTVRRLFHLRRMSHSGRQQALQA-V 334 Query: 329 IDENEMGSTAINDAMQTAYDTIISS 353 G T I DA++ A I Sbjct: 335 NSLGAGGGTNIADALKKAAKVIEDR 359 >gi|167577156|ref|ZP_02370030.1| hypothetical protein BthaT_03412 [Burkholderia thailandensis TXDOH] Length = 621 Score = 36.9 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 26/266 (9%), Positives = 64/266 (24%), Gaps = 5/266 (1%) Query: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRA 79 G +I A+ + V++ G VD+ Y AL+ A A + A+ + + ++ A Sbjct: 57 GSLAVIAAIAIGVVIAALG-AVDLGNLFYQRRALQSVADLAALAAAQTMDDGCTQPAATA 115 Query: 80 KNSFTFPKQKIEEYLIRNFEN----NLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 +++ ++K N TA + + +A QV L+ Sbjct: 116 QSAALGDGFDSAASGQSMTVVCGRWDVKDNAGPSFFAGSASGTAAGSDAQLNAVQVTLTR 175 Query: 136 RYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRD 195 + + T ++ + + L Sbjct: 176 VVPYYFLGAQRTVSATSTAQATNVGAYSIGTTLAQLQGGVVNALLNGLLGANLNLSVLSY 235 Query: 196 SEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSE 255 + + + S + + Sbjct: 236 QGLANARIRIKDLMAAANVGTVNALLNTQTTVPQLANWMLTALSQTSVANADLQTSIGAL 295 Query: 256 EHFVDSSSLRHVIKKKHLVRDALASV 281 + V ++ ++ Sbjct: 296 QTIVSANVPGGQTFTIGSTANSTGIF 321 >gi|187921511|ref|YP_001890543.1| TadE family protein [Burkholderia phytofirmans PsJN] gi|187719949|gb|ACD21172.1| TadE family protein [Burkholderia phytofirmans PsJN] Length = 278 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 11/150 (7%), Positives = 28/150 (18%), Gaps = 7/150 (4%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + + + +G ++ P +L V +V V + A+ Sbjct: 11 IRKHRARQFGHARKTRQSGQSMAEFIIVAPALLFVCFGIVQFVLLYQAKS----TLDVAV 66 Query: 62 ITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE 121 + A+ + + S + Sbjct: 67 LEAAREGAVNNGSMQSMRSGLARGLAPLYARQADAGGVAAALASANADAANFSSITVLNP 126 Query: 122 MNPRKSAY---QVVLSSRYDLLLNPLSLFL 148 Y + P + Sbjct: 127 TPAAILDYARPRYYADLAATYTEIPNDSLM 156 >gi|86748913|ref|YP_485409.1| hypothetical protein RPB_1790 [Rhodopseudomonas palustris HaA2] gi|86571941|gb|ABD06498.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 206 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 41/137 (29%), Gaps = 9/137 (6%) Query: 8 IFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVP 67 S +L K +G A ++P+ML + V++ A + S Sbjct: 12 RRCSMRLAKDRSGLAATEFAFIVPLMLLMFFATVELSAGI--------AVDRKVTLVSRT 63 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 L + + ++ + L + ++ V D V+ + + Sbjct: 64 LSDLVSQATTVTDSDLKNVFAASYGVLAPYPTSTADATISEIYVND-AGVAKVQWSKAAT 122 Query: 128 AYQVVLSSRYDLLLNPL 144 Q ++ L + Sbjct: 123 VAQSGSTATATLATSSR 139 >gi|218893389|ref|YP_002442258.1| TadG [Pseudomonas aeruginosa LESB58] gi|254244892|ref|ZP_04938214.1| hypothetical protein PA2G_05765 [Pseudomonas aeruginosa 2192] gi|126198270|gb|EAZ62333.1| hypothetical protein PA2G_05765 [Pseudomonas aeruginosa 2192] gi|218773617|emb|CAW29431.1| TadG [Pseudomonas aeruginosa LESB58] Length = 557 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 34/127 (26%), Gaps = 2/127 (1%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII--TASVPLIQSLEE 74 G I+ A + + L ++VD R + L++ A A + + L Sbjct: 7 RQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSS 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + + + +A A V ++ Sbjct: 67 AQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVHVKVT 126 Query: 135 SRYDLLL 141 L Sbjct: 127 KSVPGSL 133 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 35/302 (11%), Positives = 82/302 (27%), Gaps = 28/302 (9%) Query: 169 RSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR--TVKSYSSQNGKVGIRD 226 ++E I ++D S SM + + + K + ++ Sbjct: 498 EDINEEVKKDIVLILDTSGSMNFNFYNDSIPYNEKDKRIYSLKQSAKQFINKFNNKDNIR 557 Query: 227 EKLSPYMVSCNKSLYYMLYPGPLDPSL-SEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + PY + D + S E+++D+ + + +R+A ++ + Sbjct: 558 IGIIPYSYYSGYANNIKQLTEINDNNKKSYENYIDNIKVEGATNQGDGIREAGKMLLNTD 617 Query: 286 KKIDNV----------NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMG 335 T+ ND + + + Sbjct: 618 GNSKKYVILITDGEATAITIEKPNLIINDSAYYWANNVLSFRNGDLVSRQYWYGARYTDD 677 Query: 336 S-------TAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIA 388 + S +N + I TD + Sbjct: 678 YYNDIVYFNNGSSFYVNPIYDFNSVKNTRYVDFGDNKLNEIKIKKATDYATNIGRRLKQS 737 Query: 389 ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNE 448 I + K+ TI ++N+ + A+ N + +F AN ++ + IF + + Sbjct: 738 I-PELKAM-----TIGCALNQDTVDLAKEV--NNSMDGDYFSANDSNSMESIFDELADSI 789 Query: 449 IF 450 + Sbjct: 790 LS 791 >gi|167570097|ref|ZP_02362971.1| hypothetical protein BoklC_09651 [Burkholderia oklahomensis C6786] Length = 602 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 48/172 (27%), Gaps = 5/172 (2%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 ++ I G F ++ A+ M V + G VD+ + L++ A A + Sbjct: 17 RRTIARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFARRDLQRVADM---AALAAAQKM 72 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 ++ + A + + + L D + + +A QV Sbjct: 73 DDQCTQPAAAANANARSNGFDPAKSGNTLTLSCGRWDTQSNAGPSYFNTA-STPLNAVQV 131 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVI 183 + S + + + T S ++ + Sbjct: 132 TATQSVPYFFLGPSRTVSATSTAKATNIDQFTVGTTLASLQGGLVNNVLNAL 183 >gi|114765752|ref|ZP_01444847.1| hypothetical protein 1100011001350_R2601_23575 [Pelagibaca bermudensis HTCC2601] gi|114541859|gb|EAU44895.1| hypothetical protein R2601_23575 [Roseovarius sp. HTCC2601] Length = 150 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 8/40 (20%), Positives = 17/40 (42%), Gaps = 1/40 (2%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLG-VGGML 40 + +K + + + G + A+L PV + + GM Sbjct: 1 MSQSKLRDRAARAWRDEAGSATVEFAILFPVFMAILTGMA 40 >gi|34495937|ref|NP_900152.1| hypothetical protein CV_0482 [Chromobacterium violaceum ATCC 12472] gi|34330276|gb|AAQ58159.2| conserved hypothetical protein [Chromobacterium violaceum ATCC 12472] Length = 1040 Score = 36.9 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 38/317 (11%), Positives = 82/317 (25%), Gaps = 27/317 (8%) Query: 149 RSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPA 208 S+G+ + + S SI V+ S + P N + + Sbjct: 61 SSLGLSGTTVNGSQYMTVAATSDD----ASINDVLYSSGLNPVFMTYGTVSPSNPYSYYS 116 Query: 209 DRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPG--PLDPSLSEEHFVDSSSLRH 266 + K+ S + S + S + Y P S+ Sbjct: 117 LNSYKTGSVTVSYSLMAPGGFSGWGTSPTNAGYVPFTPQVLYAQRGWGYGGSQSYSAAAT 176 Query: 267 VIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKT 326 + +L + + + + ++ + I + ++ L++T + Sbjct: 177 NVAMTNLGSPSASQLTSAYNSFSP---FLQPETNSTSTSEIKAAAGQSPIYALLKTAQTS 233 Query: 327 FAIDENEMGSTAI-NDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIV--LLTDGENTQDN 383 F + G T + T + + + DG Sbjct: 234 FTSTSSSSGCTPQKYVVLITDGLPTQDKSGYYWPPAGSAAATGYGVTASFNADGSLNIAG 293 Query: 384 EEGIAICNKA------KSQGIRIMTIAFSVNKTQQE--KARYFLSNCASPNS---FFEAN 432 A+ + S GI+ I A L+ A + ++ A Sbjct: 294 TNDQALLDAISQIQTLASAGIKTYVIGLGAGVAPATNPTAAQTLTAMAVAGNTGSYYPAV 353 Query: 433 STHELNKIFRDRIGNEI 449 S F +G+ + Sbjct: 354 SP----AAFSSALGSIL 366 >gi|295840162|ref|ZP_06827095.1| von Willebrand factor, type A [Streptomyces sp. SPB74] gi|197698094|gb|EDY45027.1| von Willebrand factor, type A [Streptomyces sp. SPB74] Length = 588 Score = 36.9 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 19/187 (10%), Positives = 49/187 (26%), Gaps = 19/187 (10%) Query: 269 KKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFS-----WGVHKLIRTI 323 + + + +L + D++ + + G R Sbjct: 402 SRMDVTKASLLQALAQFTPADDIGLWEFATRLDGDRDYRELVPTARLGARKGSSGTQREA 461 Query: 324 VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKY--IVLLTDGENTQ 381 + + + + A K+ +VLLTDG N + Sbjct: 462 LTAAFGALQPQPG---------GATGLYDTTLAAYREASEDYAADKFNAVVLLTDGTNEE 512 Query: 382 DNEEGIA-ICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC--ASPNSFFEANSTHELN 438 + + + + E A + A+ S F+ + +++ Sbjct: 513 PGSMTRGELLTRLRDLADPAHPLPLVAIAVGPEAAEDDMEAIGGATGGSGFKVDDPAQIH 572 Query: 439 KIFRDRI 445 ++ + I Sbjct: 573 EVINNAI 579 >gi|156358483|ref|XP_001624548.1| predicted protein [Nematostella vectensis] gi|156211335|gb|EDO32448.1| predicted protein [Nematostella vectensis] Length = 186 Score = 36.9 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 51/157 (32%), Gaps = 19/157 (12%) Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 + + V R ++ D +GA ++ + +F + +K I F Sbjct: 17 TGTFKKCINFVKRVLRAFDVSEKGTHVGAIIYST--DTKLAFDFNTYKERGDIEAAFDRV 74 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAIC 390 + +T ++ A + + + +V++TDG + D + + Sbjct: 75 KFLGETTFTGKGLKMALSELYKTAR---------KDVSNLLVVITDGRSHDDVVKPSEM- 124 Query: 391 NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNS 427 ++ G+RI+T+ L A Sbjct: 125 --LRNAGVRIVTVGLG-----NSFDINQLKAMAGKPY 154 >gi|260800507|ref|XP_002595171.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] gi|229280414|gb|EEN51182.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] Length = 352 Score = 36.9 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 14/137 (10%), Positives = 38/137 (27%), Gaps = 11/137 (8%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + +++ + D ++G ++D + + G+T Sbjct: 25 QFLKKTVGDFNIGYDATQIGLVQYSDEAETIFALDSFSSPTSLRDAIETIQ--YTGGATN 82 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 +A+ + +S K ++LT G ++ D A + + + Sbjct: 83 TGNALDYMVQYMFASRNGARQDST------KIAIVLTGGASSDD---IKAAAQRMRKSSV 133 Query: 399 RIMTIAFSVNKTQQEKA 415 I + Sbjct: 134 ITYAIGIGSELDYDQLD 150 >gi|317056370|ref|YP_004104837.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315448639|gb|ADU22203.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 554 Score = 36.9 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 38/272 (13%), Positives = 76/272 (27%), Gaps = 25/272 (9%) Query: 170 SYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKL 229 S + + + D F + + + Sbjct: 98 STFSTDVDTASFTNIRRMIENNQNIDPNAVRTEEFINYFKYNYEYPDGDDKIAITTELSD 157 Query: 230 SPYMVSCNKSLYYMLYPGP--LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK 287 P+ + D + +D S K LV A A + Sbjct: 158 CPWNADAKLMQIGLQAKDIDVQDIDSNIVFLIDVSGSMADENKLPLVVQAFAML------ 211 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 +N+ + R+ + R + G + + GSTA + TAY Sbjct: 212 AENLGENDRISIVTYAGRDTIELE---GESGANYETIASTLAGLTAGGSTAGAAGINTAY 268 Query: 348 DTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD---NEEGIAICNKAKSQGIRIMTIA 404 + E + NN ++L TDG+ EE A+ + + +G+ + + Sbjct: 269 EL----AEKYFIKGGNNR-----VILATDGDLNVGLSSEEELKALIEEKRDKGVFLSVLG 319 Query: 405 FSVNKTQQEKARYFLSNCASPNSFFEANSTHE 436 F + + + N ++ +S E Sbjct: 320 FGMGNYKDNRLEALADN--GNGNYAYIDSVDE 349 >gi|260818477|ref|XP_002604409.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] gi|229289736|gb|EEN60420.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] Length = 192 Score = 36.9 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 50/161 (31%), Gaps = 21/161 (13%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 + D + R+G ++D + + + + + G T A++ Sbjct: 45 VNAFDISASSTRVGVVQYSDSNTLEFNLGDHAD--KPSTLAAIDSIVYQGGGTRTGSALE 102 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 A V+ K ++++TDG + + + N SQG+ + I Sbjct: 103 FA----------RVNAAWRGESVPKVMIVVTDG---KSADSVTSSANNLASQGVDVYAIG 149 Query: 405 FSVNKTQQEKARYFLSNCASP-NSFFEANSTHELNKIFRDR 444 ++ L A N+ E + L+ Sbjct: 150 VG-----NYRSTQLLEIAAGNQNNVIELTDFNALSAEIEQI 185 >gi|91777715|ref|YP_552923.1| putative MxaC-like protein [Burkholderia xenovorans LB400] gi|91690375|gb|ABE33573.1| Putative MxaC-like protein [Burkholderia xenovorans LB400] Length = 328 Score = 36.9 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 32/125 (25%), Gaps = 12/125 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F + F++ + I T T ++ + A Sbjct: 130 NDRLAFMMFGTSPVLAMPFTYDHRAIAAAIAGTAVGRGMPD--TQLDRGLLAAIGEFN-- 185 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ IVL++DG D + I + I + I + Sbjct: 186 --------GRASSGRRAIVLVSDGGARLDTQVRRLIQDGLTRNQIALYFIYLRSGTYSPD 237 Query: 414 KARYF 418 Sbjct: 238 LNAAV 242 >gi|290976446|ref|XP_002670951.1| predicted protein [Naegleria gruberi] gi|284084515|gb|EFC38207.1| predicted protein [Naegleria gruberi] Length = 1082 Score = 36.9 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 34/111 (30%), Gaps = 10/111 (9%) Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + +++I++ G T A Q A I + +N Sbjct: 172 AYESSAKEIDMKGTLQSIIRN-IETAFTGGGTDFASAFQLACTIIT---RESGQDRENLP 227 Query: 366 EAKKYIVLLTDGENTQDNEEGIAI------CNKAKSQGIRIMTIAFSVNKT 410 I LTDGE+ + + N+ I I T+ F + Sbjct: 228 FGNVVITFLTDGEDFSKVGKPGGLQYLSEEINRVYRGDITIHTVGFGSHHN 278 >gi|255693879|ref|ZP_05417554.1| BatB protein [Bacteroides finegoldii DSM 17565] gi|260620308|gb|EEX43179.1| BatB protein [Bacteroides finegoldii DSM 17565] Length = 342 Score = 36.9 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 20/220 (9%), Positives = 56/220 (25%), Gaps = 43/220 (19%) Query: 256 EHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWG 315 + + + + ++ ++ + ++G F + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 316 VHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLT 375 + +++ TAI +A+ A + + I+++T Sbjct: 150 YISA-KMFLESINPSLISKQGTAIGEAINLAARSFTPQ-----------EGVGRAIIVIT 197 Query: 376 DGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK--------------------A 415 DGEN + A A +GI++ + + Sbjct: 198 DGENHEGGAVEAAK--AAAEKGIQVNVLGVGMPDGAPIPAEGTNDYRRDREGNVIVTRLN 255 Query: 416 RYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 A + ++++ K + V Sbjct: 256 ETMCQEIAKEGKGIYVRVDNSNSAQKAINQEVNKMAKSDV 295 >gi|197295156|ref|YP_002153697.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] gi|195944635|emb|CAR57239.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] Length = 178 Score = 36.9 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 14/112 (12%), Positives = 32/112 (28%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + AL++ M+ + + + R Y L +A + A SV L ++ Sbjct: 12 RGAVAVEFALVLMPMIMLATGVAEFGRAIYQYETLTKATRDAARYLSVWLPTDSAYPVAQ 71 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQ 130 A+ + T + + + + + Sbjct: 72 AQCLVVYGSTTCGSAGSELVPGLKTSMVTICDAQHTTGCSDASDPSQFANLP 123 >gi|241205700|ref|YP_002976796.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859590|gb|ACS57257.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 791 Score = 36.9 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 46/163 (28%), Gaps = 20/163 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I +N R F+D + R + G T + A+Q A Sbjct: 381 AISKLNPDDRFNVIRFDDTMTDYFKGLVTATPDNREKAIGYVRGLTADGGTEMLPALQAA 440 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 441 ---LRNQGPVASGAL-------RQVVFLTDGAIGNEQQLFQEI--TANRGDARVFTVGIG 488 Query: 407 VNKT------QQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 E R + S + A+ EL ++ Sbjct: 489 SAPNTYFMTKAAEMGRGTFTAIGSTDQV--ASRMGELFAKLQN 529 >gi|289627369|ref|ZP_06460323.1| hypothetical protein PsyrpaN_19926 [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|330867165|gb|EGH01874.1| hypothetical protein PSYAE_07887 [Pseudomonas syringae pv. aesculi str. 0893_23] gi|330871053|gb|EGH05762.1| hypothetical protein PSYAE_28128 [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 687 Score = 36.9 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 51/173 (29%), Gaps = 5/173 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ AL + + L ++VD R + +L++ A A + A+ Sbjct: 23 RQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAA-----GRRGTC 77 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 S A + F Q + ++ V + A + + + Sbjct: 78 SGAATAPGFATQSATRNGFTPNTGGRTLTILCGTLTVGLQRQRVFVADSTQALAIQVIAA 137 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + + + A V+ S ++I+ S Sbjct: 138 HPVPRSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDST 190 >gi|17229188|ref|NP_485736.1| hypothetical protein all1696 [Nostoc sp. PCC 7120] gi|17135516|dbj|BAB78062.1| all1696 [Nostoc sp. PCC 7120] Length = 1999 Score = 36.9 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 40/382 (10%), Positives = 91/382 (23%), Gaps = 10/382 (2%) Query: 54 KQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRD 113 + A +I + + + + L+ + + N T V Sbjct: 1130 QNAGDLSITKTDGLTNVTPGQQITYTIVARNNGLITATNALVSDIIPSNLTNVTWTSVAS 1189 Query: 114 IVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHK 173 V L+ + + + + + + Sbjct: 1190 GGATDNQASGTGNINDHVTLTGGSSITYTVTGTVVSNAASTGSSTRFDFDGNSPLDGTDG 1249 Query: 174 EHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYM 233 Q I + + G + + S G G + Sbjct: 1250 NTRTFTQNGITVTARAFSRVDGTNGAWSAAYLGSYTGGLGVTDSSEGNGGNNTHVVDNVG 1309 Query: 234 VSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIK------K 287 N L+ LD + + DS + + + + S + S Sbjct: 1310 GRDNYVLFQFSEAVVLDKAYLQYVLKDSDISVWIGNFNNPLTNLSDSTLNSFGFYEANVT 1369 Query: 288 IDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAY 347 + + + A + I + + + ++ +N A TA Sbjct: 1370 TSSSDRWADVNAGNYVGNTIVIAALDTDTSPED-NFKLRYLDVLKAIPASLVNTATITAP 1428 Query: 348 DTIISSNEDEVHRMKNNL---EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 + +N + N + + TDG + I A++ G+ T A Sbjct: 1429 NGFTDTNPNNNSATDTNTIVAPSGDLSITKTDGLTNVTPGQQITYTIVARNNGLITATNA 1488 Query: 405 FSVNKTQQEKARYFLSNCASPN 426 + ++ AS Sbjct: 1489 LVSDIIPSNLTNVTWTSVASGG 1510 >gi|229489135|ref|ZP_04383001.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229324639|gb|EEN90394.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 551 Score = 36.9 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 48/144 (33%), Gaps = 22/144 (15%) Query: 315 GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLL 374 + + + ++ G T + D++ AY ++ + + ++LL Sbjct: 422 DTDHRSKLLSRIDSLSSIVGGGTGLYDSVLAAYRSMQQTYDPASINS---------VILL 472 Query: 375 TDGENTQDNE-EGIAICNKAKSQG-----IRIMTIAFSVNKTQQEKARYFLSNCA--SPN 426 TDG N + + + + + I+TI + + L + + Sbjct: 473 TDGANDDPSSISLQELLDTLTREQDPTRPVPIITIGVTDDAD-----TDVLEQISALTGG 527 Query: 427 SFFEANSTHELNKIFRDRIGNEIF 450 + A + ++ K+F I Sbjct: 528 NSHFAPTPADIPKVFVGAISGRAG 551 >gi|255021311|ref|ZP_05293359.1| hypothetical protein ACA_0286 [Acidithiobacillus caldus ATCC 51756] gi|254969174|gb|EET26688.1| hypothetical protein ACA_0286 [Acidithiobacillus caldus ATCC 51756] Length = 678 Score = 36.9 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 43/433 (9%), Positives = 94/433 (21%), Gaps = 62/433 (14%) Query: 84 TFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNP 143 T Q + + + + + + + + + Sbjct: 121 TAAAQSACATIGNGGYVDNSPSMLNAAESAVQGIVSNPTYAGNIQFGLATYQTSGSIQQF 180 Query: 144 LSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNC 203 + + A T + S+ S + + L+ Sbjct: 181 NTWVYYMSANNGFSFGNSETAPTGLIAVPNPCYGSVSGNYFTRNSCANIYFNRNLPNLSY 240 Query: 204 FGQPADRTVKSYSSQNGKVGIRDEKLSP-YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 Q + I D + + S P L + Sbjct: 241 NAQLYQSPYLYIQDTSDNPIINDVLYANGFGSGSENSAAAYPSPYNLGNLQAYMQQNTGV 300 Query: 263 SLRHVIKKKHL--VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLI 320 + + V +S + + T N I + Sbjct: 301 NYGAYSTSLSAGMTPTSAGYVPQSFEVWEAQRGFGFNAPTVPNAGRIVVSITRSNSNNAA 360 Query: 321 RTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN----------EDEVHRMKNNLEAKKY 370 T + G + ++A +A + + KKY Sbjct: 361 AIRNATVPEVFLKNGGSLPSNAPISAAAGYSPVAGAFTTALQYLSGQTNNPPPTTCGKKY 420 Query: 371 IVLLTDGENTQD----------------------------------NEEGIAICNKAKSQ 396 ++ +TDG+ TQ E I +SQ Sbjct: 421 VIFITDGQPTQGLGGNVYPPLGSAAAGIFGVTSITASTWSSTNNRAVTEAIQAVQNLQSQ 480 Query: 397 GIRIMTIAFSVNKTQQ------------EKARYFLSNCA---SPNSFFEANSTHELNKIF 441 GI + ++ Y L A +++ A + ++ Sbjct: 481 GINTYVLGVGSAVNPNIPGTTAAQQAEAQQGSYVLQAMAQAGGTQNYYSATTQADVQSAL 540 Query: 442 RDRIGNEIFERVI 454 + I N + + V+ Sbjct: 541 NNIIANILGKSVV 553 >gi|88704022|ref|ZP_01101737.1| toxin secretion ATP-binding protein [Congregibacter litoralis KT71] gi|88701849|gb|EAQ98953.1| toxin secretion ATP-binding protein [Congregibacter litoralis KT71] Length = 707 Score = 36.9 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 13/141 (9%), Positives = 42/141 (29%), Gaps = 13/141 (9%) Query: 26 TALLMPVMLG----------VGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 ALL V++G + G+++ V + L++ ++A ++ +E + Sbjct: 280 FALLFFVVIGWIAWPVLIPVLAGIVLLVGFALFMRPRLQRLTESAYQAGAMRNATLIESL 339 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 + + ++ ++ + + V + Sbjct: 340 TGLETLKAMGAESVMQRKW---EDSTRFLAGVGVSSKLASSSVTQATTWCQQIITVSVII 396 Query: 136 RYDLLLNPLSLFLRSMGIKSW 156 L++ L + + + Sbjct: 397 TGVYLISVGELTMGGLIACTM 417 >gi|294140778|ref|YP_003556756.1| hypothetical protein SVI_2007 [Shewanella violacea DSS12] gi|293327247|dbj|BAJ01978.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 152 Score = 36.9 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 15/135 (11%), Positives = 43/135 (31%), Gaps = 4/135 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 ++ G + + A++ + L + +++V R Y + L +A++ A + + + Sbjct: 1 MRYQRGVYAVEFAIVGSIFLLLLFAIIEVGRLMYTYNVLHEASRRA----ARIAVVCQID 56 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + I N + + + ++ + + V Sbjct: 57 DADVKTMALFNGANLIPNLTTANLSISYIDELGNDATGINIVLVRADIANYQHQFLVPGL 116 Query: 135 SRYDLLLNPLSLFLR 149 SR + R Sbjct: 117 SRVINSPTFSTYLPR 131 >gi|163731886|ref|ZP_02139333.1| hypothetical protein RLO149_21319 [Roseobacter litoralis Och 149] gi|161395340|gb|EDQ19662.1| hypothetical protein RLO149_21319 [Roseobacter litoralis Och 149] Length = 181 Score = 36.9 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 5/46 (10%), Positives = 20/46 (43%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAI 61 + G + +++P++ + +++ +S + L + A+ Sbjct: 14 RDEGGQIAVEFVIMVPLVFTIFMTAIELGIYSMRQMWLDRGLDIAV 59 >gi|323137419|ref|ZP_08072497.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322397406|gb|EFX99929.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 228 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 49/178 (27%), Gaps = 4/178 (2%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 L + G + AL++P+ML + LV++ R L T + Q L Sbjct: 10 LAQDDRGIAAVEFALVLPLMLMIYLGLVELSRGMRAAQKL----DLVAHTLADLTAQQLT 65 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 S+ + T + L + ++ + + V Sbjct: 66 GGSNTGQAGLTEADITAVFSAATTIMSPLPTANLKMTISEVAITSPSAGVWQAKTTWTVT 125 Query: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 + + + + + T T + + V ++S + Sbjct: 126 RNSATARPCQILTAQDATPVSFTSMPTSYTTVTNGVNPTVGPVIVADVVYNYSPGVHF 183 >gi|323139837|ref|ZP_08074869.1| efflux transporter, RND family, MFP subunit [Methylocystis sp. ATCC 49242] gi|322394917|gb|EFX97486.1| efflux transporter, RND family, MFP subunit [Methylocystis sp. ATCC 49242] Length = 423 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 14/188 (7%), Positives = 42/188 (22%), Gaps = 8/188 (4%) Query: 39 MLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIR-- 96 ++VD + S + A A + + + +++ S Y Sbjct: 127 IVVDQNKASLAVARAQLAKDKAGLAYARINYERSRRLAATKAVSPDVVDNAKSVYDQAQA 186 Query: 97 ------NFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 + E+ ++ + V +LFL + Sbjct: 187 QVGLDEAMVAQREAELRAAEINLGYTGIISPVDGTVVSRSVETGQTVAASFQTPTLFLIA 246 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + + T + + + + + D Sbjct: 247 EDLTAMQVDTNVSESDIGEVKLGDKASFTVESFPKRVFHGEVTQVRQSPQTIQNVVTYDA 306 Query: 211 TVKSYSSQ 218 + + + Sbjct: 307 VISAPNRD 314 >gi|289647856|ref|ZP_06479199.1| hypothetical protein Psyrpa2_08907 [Pseudomonas syringae pv. aesculi str. 2250] Length = 687 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 51/173 (29%), Gaps = 5/173 (2%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G ++ AL + + L ++VD R + +L++ A A + A+ Sbjct: 23 RQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAA-----GRRGTC 77 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 S A + F Q + ++ V + A + + + Sbjct: 78 SGAATAPGFATQSATRNGFTPNTGGRTLTILCGTLTVGLQRQRVFVADSTQALAIQVIAA 137 Query: 137 YDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + + + + + A V+ S ++I+ S Sbjct: 138 HPVPRSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDST 190 >gi|152984932|ref|YP_001350526.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PA7] gi|150960090|gb|ABR82115.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PA7] Length = 1155 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 28/334 (8%), Positives = 85/334 (25%), Gaps = 27/334 (8%) Query: 59 TAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDT 118 A ++ ++ L + ++ + + + + ++ I Sbjct: 14 AAALSGAILLSAQTTQAAALSVSQQPLMLVQGVAPNLLVTLDDSGSMAFAYAPDSISSYR 73 Query: 119 AVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVS 178 S + + L L + + + A + T + Sbjct: 74 DSTFFASNSFNPMYFDPNTEYKLPKK---LTLVNGQVQIQDYPAPSFTYAWRNGFTRAGG 130 Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 I ++ ++ + + S + + N Sbjct: 131 IDLSRNYKVTVEYGRGYDRESTIRPAAAFYYEFTGSSNCSRTNQNCYTRRSVSTEQRQNF 190 Query: 239 SLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMG 298 + +Y Y + + + S ++ ++ ++ + + + Sbjct: 191 ANWYSFYRTRALATQTAANLAFYSLPENIRVSWQMLNNSS---CNQMGSGSSNGNCFNNY 247 Query: 299 ATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV 358 FN + + + + + G T + AM A D + + + Sbjct: 248 LRDFNGQHRVNFF--------------NWLENLSVEGGTPLRRAMTRAGDFLRRTGVNGP 293 Query: 359 HRMKNNLEA-------KKYIVLLTDGENTQDNEE 385 + + + Y +L+TDG D+ Sbjct: 294 YAYRPGEQTAPEYSCRGSYHILMTDGLWNSDSAN 327 >gi|310825520|ref|YP_003957878.1| hypothetical protein STAUR_8297 [Stigmatella aurantiaca DW4/3-1] gi|309398592|gb|ADO76051.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 1414 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 25/238 (10%), Positives = 61/238 (25%), Gaps = 37/238 (15%) Query: 221 KVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALAS 280 D + ++ + N + P + +F ++ + + Sbjct: 266 NADAFDSSRTGFIATINSLTFNTATPLARALLNTGYYFTSDQTVYKDVFGFGATNPTVGY 325 Query: 281 VIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIN 340 + K + ++ R + D+ + + G T+ ++ Sbjct: 326 AYPTDFKNEPLSSENRS--VCWGDQASAVIILTDG-EPNTDTLGSAVVQKIRSRNGGPVS 382 Query: 341 --------DAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNK 392 D A + + L T G+ Sbjct: 383 CPPSAPCADTPNDANAMLDDVAKLLFTNDLQYSTPPIVGALNTSGQ-------------- 428 Query: 393 AKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNE 448 Q +R+ T+ F+++ L N A + A+ L + +D + E Sbjct: 429 ---QSLRVYTVGFAIDSN-------LLKNAAAVGGGRSYTAHDAAGLRQALQDALLRE 476 >gi|107028245|ref|YP_625340.1| TadE-like [Burkholderia cenocepacia AU 1054] gi|116687156|ref|YP_840403.1| TadE family protein [Burkholderia cenocepacia HI2424] gi|105897409|gb|ABF80367.1| TadE-like protein [Burkholderia cenocepacia AU 1054] gi|116652871|gb|ABK13510.1| TadE family protein [Burkholderia cenocepacia HI2424] Length = 177 Score = 36.9 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 36/138 (26%), Gaps = 6/138 (4%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 G + AL++ M+ + + + R Y L +A + A SV L Sbjct: 8 RSRTRGVVAVEFALVLMPMIMLATGVAEFGRAIYQYETLTKATRDAARYLSVWLPTDSAY 67 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT------DREVRDIVRDTAVEMNPRKSA 128 S+A+ + T D + P + Sbjct: 68 PVSQAQCLVVYGSTTCGSSGTELVPGLTTSMVTICDAQRTTGCSDASDPSQFANLPTYDS 127 Query: 129 YQVVLSSRYDLLLNPLSL 146 S +N + + Sbjct: 128 NNNAASGTATGAINVVEV 145 >gi|213965586|ref|ZP_03393780.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213951745|gb|EEB63133.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 330 Score = 36.9 bits (83), Expect = 8.0, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 58/203 (28%), Gaps = 29/203 (14%) Query: 258 FVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVH 317 + S+ + ++A + ++ D + +G F+ R + S + Sbjct: 100 SLSMSATDVAPDRITAAKEAGQEFVENL------PDDLNIGLVTFSGRARTAVSPTTNHD 153 Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 + R + G A + + + IVLL+DG Sbjct: 154 TVNRALQAAELDQATATGD-------AIAAALDAINQFSDSVQGGGEGAPPATIVLLSDG 206 Query: 378 ENTQDNEEGI-----AICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA 423 + T E +A G+ + TI+F + L A Sbjct: 207 KQTVPQELDDPRGAYTAAAEAAKAGVPVNTISFGTAQGAITVQGELIPVPNDDDSLREIA 266 Query: 424 --SPNSFFEANSTHELNKIFRDR 444 + FF A S +L + Sbjct: 267 RRTKGEFFSAGSLEQLRDAYGSL 289 >gi|116670917|ref|YP_831850.1| von Willebrand factor, type A [Arthrobacter sp. FB24] gi|116611026|gb|ABK03750.1| von Willebrand factor, type A [Arthrobacter sp. FB24] Length = 340 Score = 36.9 bits (83), Expect = 8.0, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 58/189 (30%), Gaps = 21/189 (11%) Query: 275 RDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEM 334 A A+V+ ++ D R+G T F+ + + + + + Sbjct: 107 SSADAAVVEVFARLAAGFDGERLGLTVFDSSAVQVFPLTDDYDVVQGQLEAARKAFDGAP 166 Query: 335 GSTA-------------INDAMQTAYDTIISSNEDE---VHRMKNNLEAKKYIVLLTDGE 378 GS A I D + + S+ D E + +VL TD Sbjct: 167 GSAAFLDGTWNGAGSSLIGDGLASCVQGFPSNGGDTGTGEQAGSGREERSRSVVLATDNF 226 Query: 379 NTQDNEEG-IAICNKAKSQGIRIMTIAFSVNKTQQEKARY--FLSNCA--SPNSFFEANS 433 + + AK Q +R+ + + + L + +++ +S Sbjct: 227 ISGEPIFTLQEAAALAKKQDVRVYALNPGDFDYGTDPDQPGVQLRTAVEGTGGAYYPLDS 286 Query: 434 THELNKIFR 442 + +I R Sbjct: 287 PEAVGEIIR 295 >gi|319442698|ref|ZP_07991854.1| hypothetical protein CvarD4_13132 [Corynebacterium variabile DSM 44702] Length = 916 Score = 36.5 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 56/217 (25%), Gaps = 27/217 (12%) Query: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKK--IDNVNDTVRMGATFFNDRVI 307 D S S + F + RD + + + D G + N Sbjct: 78 DQSASLKGFDGKPPTDPDNIRVDATRDLVKQLGTHAEDLGADINVKLAGFGEGYHNSTGD 137 Query: 308 SDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEA 367 S G + N T DA+ A Sbjct: 138 YGDWVSVGANSGDLDKAVDGFGRRNTDMYTNYEDALAGASREF-----AGAPTTDGKSSD 192 Query: 368 KKYIVLLTDGENTQDNEEGIAICNK----------AKSQGIRIMTIAFSVNKTQQEKARY 417 + ++ +DG+ T + ++ G+R T+ + + + R Sbjct: 193 CQAVLFFSDGKVTHPEKSNEQAAADVCRPNSPLVSLRNSGVRFFTVGLI-PEDETDSPRE 251 Query: 418 FLSN-----CASP---NSFFEA-NSTHELNKIFRDRI 445 L+ C ++F+A L FR I Sbjct: 252 LLTEMAEGPCGGGEANGAYFDAGTDPAGLLSAFRGFI 288 >gi|240169445|ref|ZP_04748104.1| hypothetical protein MkanA1_09039 [Mycobacterium kansasii ATCC 12478] Length = 889 Score = 36.5 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 53/201 (26%), Gaps = 25/201 (12%) Query: 267 VIKKKHL-VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + + A+A S + A D + + S + G+ + Sbjct: 70 YADRIQAKLDVAIAGFAESYVSEQDWTPLTGATAQHVGDALSTLASKNTGIDTDYWLALD 129 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 GS A S D R A+ + +G Sbjct: 130 GARQALASRGSGVGGADRCQAIAWFSDSKIDFTARPLTKPYAEGVPLNSANGVAETIRLA 189 Query: 386 GIAIC------NKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS--------------- 424 +IC ++ +S+GI + + + + +S ++ Sbjct: 190 TESICRPGGLADQLRSRGIVM--LGVGLGDAARASQFDVMSAISTGRGLNGMPCGNITEP 247 Query: 425 -PNSFFEANSTHELNKIFRDR 444 P F+ ++ ++ F Sbjct: 248 APGDFYRVSNIDDMLFAFDSL 268 >gi|260797295|ref|XP_002593639.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] gi|229278865|gb|EEN49650.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] Length = 373 Score = 36.5 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 46/144 (31%), Gaps = 17/144 (11%) Query: 284 SIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAM 343 +I D ++G ++ R + S + V K + G T A+ Sbjct: 34 TISGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSAAIDEVQYMR--GGTLTGKAI 91 Query: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTI 403 + ++ + K ++++TDG + +A+ +GI + I Sbjct: 92 RYVTKYGFGKSDGARPGV------PKVVIVVTDGVSYDAVAAPAL---EAQQKGITVYAI 142 Query: 404 AFSVNKTQQEKARYFLSNCASPNS 427 S L AS N+ Sbjct: 143 GVSG------YDADQLEQIASNNN 160 >gi|319785614|ref|YP_004145090.1| hypothetical protein Mesci_6033 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171502|gb|ADV15040.1| hypothetical protein Mesci_6033 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 205 Score = 36.5 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 36/135 (26%), Gaps = 2/135 (1%) Query: 14 LIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE 73 G + AL+MP++L + + ++ + + + +++ V S+ Sbjct: 20 FWSDRRGIAAVEFALIMPILLIMYFLTMEASQAIETSKKVSRI--GSMVADLVTQQTSVL 77 Query: 74 EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVL 133 + A + + D +V + Sbjct: 78 KADVDAIMQIGSVTLQPYNRSNPTITVTAIQVSADATRALVVWSRKLVAGVASPGAAATT 137 Query: 134 SSRYDLLLNPLSLFL 148 + L + FL Sbjct: 138 ETTIPASLRVANTFL 152 >gi|15963887|ref|NP_384240.1| hypothetical protein SMc04118 [Sinorhizobium meliloti 1021] gi|307315735|ref|ZP_07595254.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] gi|307320420|ref|ZP_07599837.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15073062|emb|CAC41521.1| Conserved hypothetical transmembrane protein [Sinorhizobium meliloti 1021] gi|306893986|gb|EFN24755.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306898626|gb|EFN29294.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] Length = 194 Score = 36.5 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 10/133 (7%), Positives = 35/133 (26%), Gaps = 5/133 (3%) Query: 11 SKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQ 70 L++ G + A++ P+++ ++ A S + Q Sbjct: 15 FHGLLRDRRGAGAVEFAIVAPLLIAAYVGAFELSLGFTVARK----VGRASSAVSDIVTQ 70 Query: 71 SLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVR-DTAVEMNPRKSAY 129 + + + + Y +++ + + V + Sbjct: 71 EQQVSKAFLDGMRNVARNMLVPYDGSDYDLKITGIQVNGTTEGKVAWSRGWSDASDGATV 130 Query: 130 QVVLSSRYDLLLN 142 ++S + + Sbjct: 131 PYAVNSVVSVPAD 143 >gi|308048514|ref|YP_003912080.1| type IV pilin biogenesis protein [Ferrimonas balearica DSM 9799] gi|307630704|gb|ADN75006.1| type IV pilin biogenesis protein, putative [Ferrimonas balearica DSM 9799] Length = 1183 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 9/61 (14%), Positives = 17/61 (27%), Gaps = 5/61 (8%) Query: 396 QGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKIFRDRIGNEIFERV 453 Q + T+ FS + L A ++ AN L + + + Sbjct: 437 QKVVTYTVGFSQGAVDAAE---LLEQTAHRGGGLYYPANDAAALQSSLQQIVSEILSVNT 493 Query: 454 I 454 Sbjct: 494 S 494 >gi|302686254|ref|XP_003032807.1| hypothetical protein SCHCODRAFT_82034 [Schizophyllum commune H4-8] gi|300106501|gb|EFI97904.1| hypothetical protein SCHCODRAFT_82034 [Schizophyllum commune H4-8] Length = 1039 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 7/92 (7%), Positives = 33/92 (35%), Gaps = 3/92 (3%) Query: 363 NNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC 422 + ++L++D ++ + + +A++ + I + + + L + Sbjct: 643 KSKNPVSGMLLISDAADSTRRAQMDLVLARAEAANVPIHSFGYGRSHDPASLW---LMSN 699 Query: 423 ASPNSFFEANSTHELNKIFRDRIGNEIFERVI 454 + ++ ++L +G + ++ Sbjct: 700 HTSGTYTFVKDWYDLRDCIAGCVGGMMSIGLL 731 >gi|297291177|ref|XP_001109727.2| PREDICTED: collagen alpha-1(XII) chain-like [Macaca mulatta] Length = 3095 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 30/87 (34%), Gaps = 12/87 (13%) Query: 361 MKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLS 420 A+K VL+TDG++ D E K K +G+ + I + L Sbjct: 1271 AGMRPRARKIGVLITDGKSQDDVEAPS---KKLKDEGVELFAIGI------KNADEVELK 1321 Query: 421 NCASPN---SFFEANSTHELNKIFRDR 444 A+ + L++I D Sbjct: 1322 MIATDPDDTHAYNVADFESLSRIVDDL 1348 >gi|262191574|ref|ZP_06049755.1| RTX toxin related protein [Vibrio cholerae CT 5369-93] gi|262032539|gb|EEY51096.1| RTX toxin related protein [Vibrio cholerae CT 5369-93] Length = 3572 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 38/374 (10%), Positives = 92/374 (24%), Gaps = 18/374 (4%) Query: 70 QSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAY 129 ++ A +F + N + + T M + + Sbjct: 823 NGQFQLVQGADTVASFALDSSVNPVQGLTSNGVAVTLSAPVDDGNGNLTYTAMAGAVTVF 882 Query: 130 QVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSM 189 + L+S + ++ + K A + ++ D + Sbjct: 883 TLTLNSDGTYSFTLAAPVDHALNSNDLTLNFKVIATDFDGDSD-SIVLPVKINDDKPSFI 941 Query: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN-----KSLYYML 244 L + ++ + G S N S + + Sbjct: 942 HRQPLSVNEDDLPQGSDSSKESLSASGQFTVVQGADQVVSYQLDGSVNPVQGLTSNGFAV 1001 Query: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVI-----RSIKKIDNVNDTVRMGA 299 + +++ + I L D + D+ ++ A Sbjct: 1002 VLSAPVDDGNGNLTYTATAGGNPIFTLVLNTDGSYTFTLSGPIDHASGSDSKQLNFKVLA 1061 Query: 300 TFFN-DRVISDPSFSW-GVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDE 357 F+ D+ + + I + +++ + TA ++ + Sbjct: 1062 VDFDGDKASVNLPVTIIDDKPTITAVQALSVDEDDLTQGSDTTKESLTATGHFTTTQGAD 1121 Query: 358 VHRMKNNLEAKKYIVLLTDG-----ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQ 412 +V +T G + + G K G I T+ + N + Q Sbjct: 1122 HVVSYTLNLTTNPLVGVTSGGQVLTLSDSVDASGNHTYTATKPDGTLIFTLQLNANGSYQ 1181 Query: 413 EKARYFLSNCASPN 426 L + AS + Sbjct: 1182 FTLSGPLDHAASSD 1195 >gi|221113899|ref|XP_002160633.1| PREDICTED: similar to Collagen alpha-5(VI) chain [Hydra magnipapillata] Length = 9981 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 28/285 (9%), Positives = 63/285 (22%), Gaps = 27/285 (9%) Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPAD 209 + + ++ ID + S P F Sbjct: 2774 ILAAGKITKDKFLSLVKTEDDIIILNDINSHTQIDNAVSKDWA-----KDPCTSFICETK 2828 Query: 210 RTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIK 269 + + Y C S + S Sbjct: 2829 YSYCQAKVDGSVACLCPVCDLSYKPVCGSDGLSYPSLYNSKESHDVGFVIGYSDKNTQYD 2888 Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 ++D + + +++G + + + L + + Sbjct: 2889 LNQPIKDQFVKITNEVVFNLKKKFLLKVGIISYGKDAQVLQPLTSAENFLKSIHLISLKS 2948 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 I+ A+ A S + + K I+ LT ++ + Sbjct: 2949 GPLP-----ISSALILA------SGFFSELQGGRDHSKKSLILFLT----NDNDFNAVKE 2993 Query: 390 CNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASP-NSFFEANS 433 N K GI ++ I + +L+ A ++ F Sbjct: 2994 SNLLKKLGISVLVIGIG------NVTQDYLTEVAGNKDNVFHVKD 3032 >gi|260828859|ref|XP_002609380.1| hypothetical protein BRAFLDRAFT_124610 [Branchiostoma floridae] gi|229294736|gb|EEN65390.1| hypothetical protein BRAFLDRAFT_124610 [Branchiostoma floridae] Length = 321 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 46/129 (35%), Gaps = 12/129 (9%) Query: 266 HVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVK 325 + K ++D + + + D R+ A F+D + F + ++ Sbjct: 185 NFNDAKENIKDLIENFPAPVGPSDT-----RVAAVSFSDVDKTRVEFDFNASGDRDSVKT 239 Query: 326 TFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEE 385 + E G TA A+ A D + S K + L+TDG++ + Sbjct: 240 SLGNIAYEGGWTATATALSLARDDVFQSVAGSRPNSA------KILFLITDGKSNRGGA- 292 Query: 386 GIAICNKAK 394 I + ++ K Sbjct: 293 PIPVADQLK 301 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 42/146 (28%), Gaps = 20/146 (13%) Query: 295 VRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSN 354 R+G ++ +V + S I T A++ A + S+ Sbjct: 59 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPL--AQGTMTGLAIRYAMNVAFSAE 116 Query: 355 EDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEK 414 E + + V++TDG E A + GI I + Sbjct: 117 EGARPNVPH------VAVIVTDGRPQDRVAEVAAAARE---SGIEIYAVGV------ARA 161 Query: 415 ARYFLSNCASP---NSFFEANSTHEL 437 L ASP + F S + Sbjct: 162 DMTSLRAMASPPFEDHVFLVESFDLI 187 >gi|254244661|ref|ZP_04937983.1| pilin biogenesis protein [Pseudomonas aeruginosa 2192] gi|126198039|gb|EAZ62102.1| pilin biogenesis protein [Pseudomonas aeruginosa 2192] Length = 405 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 54/161 (33%), Gaps = 5/161 (3%) Query: 30 MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 + V+L +GG + + A ++ P++ ++ S+ A+ + T Sbjct: 226 LLVLLMMGGAGFLLNHAYKRSEKFRDATDRTVLKL--PIVGAILYKSAVARYARTLSTTF 283 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + +++ + RD V +++ Q+ S R + +++ + Sbjct: 284 AAGVPLVEALDSVSGATGNVVFRDAVGKIKQDVSTG---MQLNFSMRTTNIFPSMAIQMT 340 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 ++G +S + ++ V + M Sbjct: 341 AIGEESGALDDMLAKVAGFYEQEVDNAVDNLTALMEPMIMA 381 >gi|209550318|ref|YP_002282235.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536074|gb|ACI56009.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 794 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 45/163 (27%), Gaps = 20/163 (12%) Query: 287 KIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTA 346 I + R F+D + R + G T + A++ A Sbjct: 381 AISRLTPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLPADGGTEMLPALEDA 440 Query: 347 YDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 + + + + +V LTDG + + I A R+ T+ Sbjct: 441 ---LRNQGPVATGAL-------RQVVFLTDGAIGNEQQLFQEI--TANRGDARVFTVGIG 488 Query: 407 VNKT------QQEKARYFLSNCASPNSFFEANSTHELNKIFRD 443 E R + S + A+ EL ++ Sbjct: 489 SAPNTYFMTKAAEIGRGTFTQIGSTDQV--ASRMGELFAKLQN 529 >gi|148706512|gb|EDL38459.1| vitrin, isoform CRA_a [Mus musculus] Length = 650 Score = 36.5 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 48/161 (29%), Gaps = 15/161 (9%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G + + +D S+ ++ + + LA V++++ D +G + D Sbjct: 257 QGDPNCKIDLSFLIDGSTSIG-KRRFRIQKQFLADVVQAL---DIGPAGPLVGVVQYGDN 312 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + ++ G + + A+ T S N Sbjct: 313 PATQFNLKTHMNSQDLKTAIEKIT--QRGGLSNVGRAISFVTKTFFSKANGNRGGAPN-- 368 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 V++ DG T + A+ GI + I Sbjct: 369 ----VAVVMVDGWPTDK---VEEVSRVARESGINVFFITVE 402 >gi|313107171|ref|ZP_07793371.1| type 4 fimbrial biogenesis protein PilC [Pseudomonas aeruginosa 39016] gi|310879873|gb|EFQ38467.1| type 4 fimbrial biogenesis protein PilC [Pseudomonas aeruginosa 39016] Length = 406 Score = 36.5 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 54/161 (33%), Gaps = 5/161 (3%) Query: 30 MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 + V+L +GG + + A ++ P++ ++ S+ A+ + T Sbjct: 226 LLVLLMMGGAGFLLNHAYKRSEKFRDATDRTVLKL--PIVGAILYKSAVARYARTLSTTF 283 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + +++ + RD V +++ Q+ S R + +++ + Sbjct: 284 AAGVPLVEALDSVSGATGNVVFRDAVGKIKQDVSTG---MQLNFSMRTTNIFPSMAIQMT 340 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 ++G +S + ++ V + M Sbjct: 341 AIGEESGALDDMLAKVAGFYEQEVDNAVDNLTALMEPMIMA 381 >gi|303248313|ref|ZP_07334575.1| TadE family protein [Desulfovibrio fructosovorans JJ] gi|302490338|gb|EFL50250.1| TadE family protein [Desulfovibrio fructosovorans JJ] Length = 147 Score = 36.5 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 41/159 (25%), Gaps = 13/159 (8%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 K+L + G + A+ M +++ + +LV+ R L+ AA + + Sbjct: 2 KRLHRDQRGLAAVEMAIAMVLLVPLLLILVEASRALTEYSQLQNAAMEGARMLARQNGE- 60 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 T + L E + + D V+ Y Sbjct: 61 --TGGVEDYIKNTVLTDASGKSLFDGAEPTVTISPRDANNNVTVQVDHDYNPSFMPQYDA 118 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRS 170 + G + I K + + Sbjct: 119 SGNPT----------PFNLPGSDTLTISAKTTMALPAAN 147 >gi|218893622|ref|YP_002442491.1| still frameshift type 4 fimbrial biogenesis protein PilC [Pseudomonas aeruginosa LESB58] gi|218773850|emb|CAW29664.1| still frameshift type 4 fimbrial biogenesis protein PilC [Pseudomonas aeruginosa LESB58] Length = 405 Score = 36.5 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 54/161 (33%), Gaps = 5/161 (3%) Query: 30 MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQK 89 + V+L +GG + + A ++ P++ ++ S+ A+ + T Sbjct: 226 LLVLLMMGGAGFLLNHAYKRSEKFRDATDRTVLKL--PIVGAILYKSAVARYARTLSTTF 283 Query: 90 IEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLR 149 + +++ + RD V +++ Q+ S R + +++ + Sbjct: 284 AAGVPLVEALDSVSGATGNVVFRDAVGKIKQDVSTG---MQLNFSMRTTNIFPSMAIQMT 340 Query: 150 SMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSML 190 ++G +S + ++ V + M Sbjct: 341 AIGEESGALDDMLAKVAGFYEQEVDNAVDNLTALMEPMIMA 381 >gi|167744178|ref|ZP_02416952.1| hypothetical protein Bpse14_39263 [Burkholderia pseudomallei 14] gi|167821377|ref|ZP_02453057.1| hypothetical protein Bpse9_40023 [Burkholderia pseudomallei 91] gi|167829719|ref|ZP_02461190.1| hypothetical protein Bpseu9_38930 [Burkholderia pseudomallei 9] gi|167851186|ref|ZP_02476694.1| hypothetical protein BpseB_38461 [Burkholderia pseudomallei B7210] gi|226194150|ref|ZP_03789750.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|225933843|gb|EEH29830.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] Length = 168 Score = 36.5 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 61 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 62 AQCLAVYGSTTCGSTGSELAPGLATSMVVVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 121 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 122 PDPASLAGSMNLV 134 >gi|12850399|dbj|BAB28702.1| unnamed protein product [Mus musculus] Length = 650 Score = 36.5 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 48/161 (29%), Gaps = 15/161 (9%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G + + +D S+ ++ + + LA V++++ D +G + D Sbjct: 257 QGDPNCKIDLSFLIDGSTSIG-KRRFRIQKQFLADVVQAL---DIGPAGPLVGVVQYGDN 312 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + ++ G + + A+ T S N Sbjct: 313 PATQFNLKTHMNSQDLKTAIEKIT--QRGGLSNVGRAISFVTKTFFSKANGNRGGAPN-- 368 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 V++ DG T + A+ GI + I Sbjct: 369 ----VAVVMVDGWPTDK---VEEVSRVARESGINVFFITVE 402 >gi|114046659|ref|YP_737209.1| type IV pilin biogenesis protein [Shewanella sp. MR-7] gi|113888101|gb|ABI42152.1| type IV pilin biogenesis protein, putative [Shewanella sp. MR-7] Length = 1223 Score = 36.5 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 38/134 (28%), Gaps = 36/134 (26%) Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---------------- 397 + + + + YI+ +TDGE T DN + K G Sbjct: 403 GTNNYNSPFKRCQNEAYIIYITDGEPTLDNAADGLVEQLTKGVGKHTSRPVSYLSALSGW 462 Query: 398 ---------------IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 +R TI F A + L A +F+A +L Sbjct: 463 MRNEDVNPNSVGKQTVRTFTIGF---SEGAANAEHLLKQTAENGGGKYFDATDASKLRSS 519 Query: 441 FRDRIGNEIFERVI 454 + + N + + Sbjct: 520 LQTALNNILEKNAS 533 >gi|254444132|ref|ZP_05057608.1| hypothetical protein VDG1235_2371 [Verrucomicrobiae bacterium DG1235] gi|198258440|gb|EDY82748.1| hypothetical protein VDG1235_2371 [Verrucomicrobiae bacterium DG1235] Length = 1015 Score = 36.5 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 24/218 (11%), Positives = 50/218 (22%), Gaps = 22/218 (10%) Query: 1 MVFDTKFIFYSKKLIKS--CTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEH------A 52 M K I + G F I+ + ++ + G + + Sbjct: 1 MNTQNKIISQLSRFRDDANKKG-FAIVLVATLAGLVFLVGTALVATMQLRTKSIDYSKYK 59 Query: 53 LKQAAQTAIITASVPLIQSL-----------EEVSSRAKNSFTFPKQKIEEYLIRNFENN 101 L A A + + + R + F EE + Sbjct: 60 L-LAEDNARLALRSAIGDLQEYLGKDTSISYTADALRTNTTTDFKDVAAEEVREAFWTGA 118 Query: 102 LKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTK 161 + T V TA P+ +A + + + L + + + Sbjct: 119 RNGSSTTWLVTRGFSSTAPAPVPKGTAPSSEVELLGEGTIGSNPLLTVKVPKEPITVSGV 178 Query: 162 AEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ 199 + G + S Y + + Sbjct: 179 DGYGNSAVLTIGNFGY-WVGDLGVKGSYAWYDKTNSID 215 >gi|314957231|gb|EFT01335.1| TadE-like protein [Propionibacterium acnes HL027PA1] Length = 58 Score = 36.5 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 17/37 (45%), Gaps = 2/37 (5%) Query: 14 LIKSCTG--HFFIITALLMPVMLGVGGMLVDVVRWSY 48 + + G + L++PV+L + G++ D R Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQIS 37 >gi|330446188|ref|ZP_08309840.1| subtilase family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328490379|dbj|GAA04337.1| subtilase family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 1031 Score = 36.5 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 33/378 (8%), Positives = 87/378 (23%), Gaps = 18/378 (4%) Query: 67 PLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRK 126 P S + ++ + F I + + N ++ + M Sbjct: 572 PDKTSEDNKAASDYPTSCFDTSIITDVKRSFYSNYGERVDISASGTIVGFKNDYTMGTSY 631 Query: 127 SAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFS 186 + V + +NP L IK L + E + E + D S Sbjct: 632 ATPIVTATVALMQSINPN---LSPAEIKDILRSSALPIENKVVTKSSETVFTRSLTKDES 688 Query: 187 RSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYP 246 + + + + + V I ++K + P Sbjct: 689 ETHSGKGARLNVEGAIQAALDSLENTTLVKAPSVAVKIPEDKTE------VTKTIEVTIP 742 Query: 247 GPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV-----NDTVRMGATF 301 G + +D +I + + + Sbjct: 743 GDKAVFDKVDVMFVVDVSGSYYDDLDTFKDQATELIEAFSSSGTNVNIGISSFSDFPLSP 802 Query: 302 FNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRM 361 + + + G+ ++ Y T+ + + Sbjct: 803 YGSSDDYAFKLDQPLTNNYDLVKSALDNLLILSGNDYPESQLEAIYQTVQPATGWRSGSL 862 Query: 362 KNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSN 421 A +D +++ ++ K+ R+ + + + Sbjct: 863 PTLFLATDARFHNSDMDDSYPGSGYDKTISELKTLKARVF--GLQSGGVIDDVKKIAEQS 920 Query: 422 CASPNSFFEANSTHELNK 439 +F + + E+ + Sbjct: 921 --GGAAFKLSKDSAEIVE 936 >gi|182625113|ref|ZP_02952890.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] gi|177909733|gb|EDT72159.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] Length = 620 Score = 36.5 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 63/207 (30%), Gaps = 15/207 (7%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S +L+ + F + + +G +++ P N + Sbjct: 14 ISLFISVPVLNASGIENKGFDDQFKDIGNESEGRICDYDGTEILKEVSQKPD-EDGNYEI 72 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P + + + + + ++ ++ S++ + + R+ Sbjct: 73 TLTVKGKPKKVTKPVDILLIMDASNSMYYNMDELKASMNSLVDKVIDNIPNS---RIAVV 129 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFAID----ENEMGSTAINDAMQTAYDTIISSNED 356 F V SF+ + + K D G+T I + A + + Sbjct: 130 AFGTEVEEVFSFNDKNNFTSKEEYKNAIKDSYYYITGRGNTNIEGTWRRANEIFKNE--- 186 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDN 383 + NN +KK ++ +DG Sbjct: 187 ----LNNNSNSKKDVIFFSDGYPNVSR 209 >gi|73749040|ref|YP_308279.1| hypothetical protein cbdb_A1306 [Dehalococcoides sp. CBDB1] gi|73660756|emb|CAI83363.1| hypothetical protein cbdbA1306 [Dehalococcoides sp. CBDB1] Length = 375 Score = 36.5 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 23/249 (9%), Positives = 63/249 (25%), Gaps = 15/249 (6%) Query: 6 KFIFYSKKLIKSCTG---HFFIIT----ALLMPVMLGVGGML-VDVVRWSYYEHALKQAA 57 + + K ++S G +I AL +P ++ + G V + +L AA Sbjct: 2 RNLITISKRLRSQQGQALVIALIFLAIGALTLPPLMMLMGSALVQGTTFENRTASL-YAA 60 Query: 58 QTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRD 117 + A L + + S T + + I ++ + + I+ Sbjct: 61 DAGVEQAIWYLDPANSPLISGGLPMHTGESRTLPGMSIDGRTVSVTLAYLAEDTYQIMST 120 Query: 118 TAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGV 177 + + +++ N + + + + +V+ + Sbjct: 121 SVSSTET------ISITTVVGETFNNYTDIMTHVITSQGDYTIQGGQASVTPDTGDNAPI 174 Query: 178 SIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCN 237 + + L P + + + + Sbjct: 175 ANYDGPWPGANELSAFYYVNTTPYASATLNVTNYPTGVPEILRLGTLDIVATANAVYTLE 234 Query: 238 KSLYYMLYP 246 +Y Sbjct: 235 GDIYITGDT 243 >gi|126457913|ref|YP_001076992.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|167725251|ref|ZP_02408487.1| hypothetical protein BpseD_39896 [Burkholderia pseudomallei DM98] gi|167899816|ref|ZP_02487217.1| hypothetical protein Bpse7_39195 [Burkholderia pseudomallei 7894] gi|167908133|ref|ZP_02495338.1| hypothetical protein BpseN_38276 [Burkholderia pseudomallei NCTC 13177] gi|167916480|ref|ZP_02503571.1| hypothetical protein Bpse112_38767 [Burkholderia pseudomallei 112] gi|167924337|ref|ZP_02511428.1| hypothetical protein BpseBC_37623 [Burkholderia pseudomallei BCC215] gi|254182580|ref|ZP_04889174.1| TadE family protein [Burkholderia pseudomallei 1655] gi|254187132|ref|ZP_04893647.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|254192458|ref|ZP_04898897.1| TadE family protein [Burkholderia pseudomallei S13] gi|254296486|ref|ZP_04963942.1| TadE family protein [Burkholderia pseudomallei 406e] gi|126231681|gb|ABN95094.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|157806328|gb|EDO83498.1| TadE family protein [Burkholderia pseudomallei 406e] gi|157934815|gb|EDO90485.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|169649216|gb|EDS81909.1| TadE family protein [Burkholderia pseudomallei S13] gi|184213115|gb|EDU10158.1| TadE family protein [Burkholderia pseudomallei 1655] Length = 168 Score = 36.5 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 36/133 (27%) Query: 19 TGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSR 78 G + A++M ++ + + + R Y AL +A + A S L ++ Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 61 Query: 79 AKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYD 138 A+ + + + + + ++ Sbjct: 62 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDAAHAPDCSDSSDPAQFANVPTYDTNNGS 121 Query: 139 LLLNPLSLFLRSM 151 L+ + + Sbjct: 122 PDPASLAGSMNLV 134 >gi|260813588|ref|XP_002601499.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] gi|229286796|gb|EEN57511.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] Length = 375 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 45/143 (31%), Gaps = 17/143 (11%) Query: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQ 344 I D ++G ++ R + S + + K + G T A++ Sbjct: 35 ISGFDISPSGTQVGVIQYSTRTRQEFSLNSFLTKETLSSAIDEVQYMR--GGTLTGKAIR 92 Query: 345 TAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIA 404 ++ + K ++++TDG + +A+ +GI + I Sbjct: 93 YVTKYGFGKSDGARPGV------PKVVIVVTDGVSYDAVAAPAL---EAQQKGITVYAIG 143 Query: 405 FSVNKTQQEKARYFLSNCASPNS 427 S L AS N+ Sbjct: 144 VSG------YDADQLEQIASNNN 160 >gi|188583115|ref|YP_001926560.1| TadE family protein [Methylobacterium populi BJ001] gi|179346613|gb|ACB82025.1| TadE family protein [Methylobacterium populi BJ001] Length = 177 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 6/67 (8%), Positives = 24/67 (35%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSL 72 + ++ +G + +L++ + + ++++ +H L A + Q Sbjct: 2 RFVRDVSGIAAVELSLVLLPLAVLMLVIIEASFLVLTQHQLDLAVERTARLVRTGAFQQE 61 Query: 73 EEVSSRA 79 + + Sbjct: 62 ANGADLS 68 >gi|254238698|ref|ZP_04932021.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa C3719] gi|126170629|gb|EAZ56140.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa C3719] Length = 1170 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 27/265 (10%), Positives = 66/265 (24%), Gaps = 18/265 (6%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ--P 200 ++ GI + ++ + Y+ ++ + + S + + Sbjct: 71 SMAWAYVPDGISGNSGRAGRSSDYNALYYNPDYAYQVPKKLTLSGDQIIVSDYPVPRFTA 130 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM---LYPGPLDPSLSEEH 257 G T ++ + G + ++ YY P P S Sbjct: 131 AWQDGYAQGSTTNLSNNYRPQWGTGWLGCIDSSCNTGRAYYYTYKVSASCPAQPVSSSNS 190 Query: 258 FVDSSSLRHV-IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 ++L + + I + K N+ ++ S Sbjct: 191 CYTYNALPTSQESNFAIWYSYYRNRILATKTAANLAFYSLPENVRLTWGALNTCSIGANS 250 Query: 317 HKLIRTIVK-----------TFAIDENEMGSTAINDAMQTAYDTI-ISSNEDEVHRMKNN 364 + + + G T ++ A+ A + + K Sbjct: 251 RSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDRAGRFLQTNGTAYTTEDGKTY 310 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAI 389 Y +++TDG N + Sbjct: 311 SCRASYHIMMTDGIWNGRNVTPGNL 335 >gi|83719116|ref|YP_443061.1| hypothetical protein BTH_I2544 [Burkholderia thailandensis E264] gi|167582065|ref|ZP_02374939.1| hypothetical protein BthaT_28237 [Burkholderia thailandensis TXDOH] gi|167620226|ref|ZP_02388857.1| hypothetical protein BthaB_28225 [Burkholderia thailandensis Bt4] gi|257139291|ref|ZP_05587553.1| hypothetical protein BthaA_08814 [Burkholderia thailandensis E264] gi|83652941|gb|ABC37004.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 155 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 41/153 (26%), Gaps = 2/153 (1%) Query: 8 IFYSKKLI--KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITAS 65 + + G I A+L P+ + ++ + +L AA A Sbjct: 2 THAPGRFRSPRRQRGATAIEFAILFPMFFLILYGIITYGMIFAAQQSLTLAATEGARAAL 61 Query: 66 VPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPR 125 + + + + + + + N+T + P Sbjct: 62 NYQVAQTQSAALGLRAAAACTAANNLTGWLSGATCSTSSNYTCSYDSTMYCIQVTVTYPY 121 Query: 126 KSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLI 158 + V + +D +L ++ I I Sbjct: 122 AANPLVPAVALFDAVLPNTLTSQATVQINPTNI 154 >gi|166366825|ref|YP_001659098.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089198|dbj|BAG03906.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 460 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 270 KKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAI 329 K +V ++L ++I S + R+ F+D+ + + R + A Sbjct: 84 KIDMVIESLRTLINSGRF----TPEDRIALIQFDDQASTLIGLTPVTQ--TRQLEDAIAK 137 Query: 330 DENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAI 389 N G T + + A + + ++ ++ TDG+ T D ++ + Sbjct: 138 LRNFSGGTCMGRGINQALALLANQ-----------SMTSRHTMIFTDGD-TFDEDDCQNL 185 Query: 390 CNKAKSQGIRIMTIAFS 406 + SQGI I + Sbjct: 186 AQQFASQGISITALGVG 202 >gi|146351309|ref|YP_001210536.1| hypothetical protein pAL1.089 [Arthrobacter nitroguajacolicus] gi|146218873|emb|CAL09944.1| putative membrane protein [Arthrobacter nitroguajacolicus] Length = 133 Score = 36.5 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 32/126 (25%), Gaps = 1/126 (0%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 + G +L +L + G++VD + AQ+A A V + + Sbjct: 7 AQQERGSATPFILILAVGLLIMVGLVVDGGGQVNAKDQANSVAQSAARAAVVNISDATGG 66 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 S Q + + N V T + + Sbjct: 67 TPVIDTASAQAAAQTYIAAAGKTGTVTINGNNVTVTV-TSNYQTVFLSLAGINTLTGSAT 125 Query: 135 SRYDLL 140 S L Sbjct: 126 STAQLA 131 >gi|217978823|ref|YP_002362970.1| hypothetical protein Msil_2686 [Methylocella silvestris BL2] gi|217504199|gb|ACK51608.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 174 Score = 36.5 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 10/122 (8%), Positives = 29/122 (23%), Gaps = 3/122 (2%) Query: 15 IKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEE 74 +S G + AL P++L + + ++ Y + A + ++ A+ Sbjct: 33 RRSEAGTAALEFALATPLLLILVAGVTEIGFAIYQGMQVSAAVEAGMMYAAK---NGWSS 89 Query: 75 VSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLS 134 + + + + L+ Sbjct: 90 SGIASAVVSASGATGLTATPAPSQFCGCPAATGITATTCTSTCANGYAPGQYIQINAALA 149 Query: 135 SR 136 + Sbjct: 150 HQ 151 >gi|326330151|ref|ZP_08196462.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] gi|325951964|gb|EGD43993.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] Length = 165 Score = 36.5 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 45/136 (33%), Gaps = 5/136 (3%) Query: 13 KLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQ----AAQTAIIT-ASVP 67 + ++ G I A+ +P + G+++ R + A++ AA+TA I S Sbjct: 25 RRDQNERGSAAIEAAIGVPAFMLFVGLIIFGGRTATTHEAVQSAAADAARTASIARTSTE 84 Query: 68 LIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKS 127 + + + + + + + EV R +++ Sbjct: 85 AQSEGARAARESLANQQIQCRSVTIEVSTDGFATPVGEPAAVEVTVACRLDLSDLSVPGV 144 Query: 128 AYQVVLSSRYDLLLNP 143 V+ ++ ++ Sbjct: 145 PGSRVIRAQMSSPIDT 160 >gi|314988185|gb|EFT32276.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] Length = 322 Score = 36.5 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ A+ + + + + + IV+L+DG Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 +NTQ +A N+A + + + TIAF + LS A + Sbjct: 206 DNTQGGSPLVAA-NRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTVADRTHA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLREVYQQ 281 >gi|288917844|ref|ZP_06412205.1| lipopolysaccharide biosynthesis protein [Frankia sp. EUN1f] gi|288350772|gb|EFC84988.1| lipopolysaccharide biosynthesis protein [Frankia sp. EUN1f] Length = 591 Score = 36.5 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 14/118 (11%), Positives = 34/118 (28%), Gaps = 3/118 (2%) Query: 30 MPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIIT-ASVPLIQSLEEVSSRAKNSFTFPKQ 88 + V+LG D+ R++ ++ ++ A+ L + + T Sbjct: 63 LTVILGGSASGADLGRYASARARFAESTD--VLATAARSLKDGTSLATLQTVVEVTATAD 120 Query: 89 KIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSL 146 + + V R+ ++ Q+ +R L+ L Sbjct: 121 ADVLIVTATGSGAEQAERRADAVVAAFRELTLQATNDTVNSQLAAVARAQAQLDQLFT 178 >gi|195470276|ref|XP_002087434.1| GE16095 [Drosophila yakuba] gi|194173535|gb|EDW87146.1| GE16095 [Drosophila yakuba] Length = 1411 Score = 36.5 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 24/337 (7%), Positives = 63/337 (18%), Gaps = 17/337 (5%) Query: 51 HALKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFT--- 107 + ++ TA + ++ S S + + + + T Sbjct: 15 NKMQAVMDTAAMAIAMGSSGSGSGSVSGLSSGSSSGPGSSSSGSSSSGYGSATTTPTSGG 74 Query: 108 --DREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAE 165 D + T + P S L ++ Sbjct: 75 HYDNSPTSMPMATIATVAPFYSEALTSLDAQQRQQQQQQQQQQNPSHYYPQNYGYGNSLP 134 Query: 166 TVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIR 225 + + + G A + GI+ Sbjct: 135 LDPAYNYSGAPYNSYSATAVGPQLSGNSYPVSGTRYEKLTPGAGAKYGASPIGATGTGIK 194 Query: 226 DEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSI 285 + + + +Y +++ K + A +++ S Sbjct: 195 PQATA--------TPFYAQPLSGGAAGSGGYMSQSNATYDPYKYKIPSAQAASHAMMASN 246 Query: 286 KKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQT 345 V T S + Sbjct: 247 MTGSAVPGTSYGLPLNPGAVPQKQTQMSQQTQLTQLEPNYAAIKPSRSYQGMSTAA---- 302 Query: 346 AYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 Y ++ ++ + KY + ++ Sbjct: 303 VYGATGGNSSGVMNNPSSAQYYDKYGMAMSMYSPNGG 339 >gi|297161950|gb|ADI11662.1| hypothetical protein SBI_08544 [Streptomyces bingchenggensis BCW-1] Length = 149 Score = 36.5 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 11/132 (8%), Positives = 33/132 (25%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G +M + G+++D + AQ A T + L + Sbjct: 18 DDRGQVTSFVMGVMLALWLFAGIVIDGGLALAGKARALDVAQEAARTGAQQLDIGRLRST 77 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + + + Y+ + D + ++ + +++ Sbjct: 78 DDVRLLRSKAARAATAYVRATGDTGSATVQGDEVTVHVTHHQPAQILQLVGVRTLTVTAH 137 Query: 137 YDLLLNPLSLFL 148 + + Sbjct: 138 ATVRAERANANP 149 >gi|56696617|ref|YP_166978.1| hypothetical protein SPO1740 [Ruegeria pomeroyi DSS-3] gi|56678354|gb|AAV95020.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 191 Score = 36.5 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 4/39 (10%), Positives = 16/39 (41%) Query: 2 VFDTKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGML 40 +F ++ + + ++ G + L +P++ + Sbjct: 9 MFKSRILNSLRLFRRNEDGSIAVEALLTVPMLFWTIMIG 47 >gi|160874283|ref|YP_001553599.1| type IV pilin biogenesis protein [Shewanella baltica OS195] gi|160859805|gb|ABX48339.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS195] gi|315266516|gb|ADT93369.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS678] Length = 1168 Score = 36.5 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 34/310 (10%), Positives = 75/310 (24%), Gaps = 12/310 (3%) Query: 147 FLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI----QWVIDFSRSMLDYQRDSEGQPLN 202 F+ S +W A T + + + + + + Sbjct: 194 FISSSPSSTWADALAAAKNTDFGVGQPVTFYADNYLRWYWLSKAGKLPTVKVSRLEIAKK 253 Query: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 T + + + M + ++ + + Sbjct: 254 AISNIISSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKT 313 Query: 263 SLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRT 322 + R ++ D + ++ V S+ Sbjct: 314 NTPLCETMYEAYRYFAGKGVKY-GHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDI 372 Query: 323 IVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQD 382 + D + N+ + + S + + NL+ Y+ L D Sbjct: 373 AYVIYVTDGAPTVDKSANNDVISLTS--TGSKDGDYSSFSKNLDTASYLPALASYMFNND 430 Query: 383 NEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 + N + Q +R TI FS L+ A +F A ++ EL Sbjct: 431 LINKLDSSNTEQMQNVRTYTIGFSKGAEDAAP---LLAETAKRGGGLYFAAQNSIELQNA 487 Query: 441 FRDRIGNEIF 450 D + N + Sbjct: 488 LNDALSNILN 497 >gi|307727464|ref|YP_003910677.1| von Willebrand factor type A [Burkholderia sp. CCGE1003] gi|307587989|gb|ADN61386.1| von Willebrand factor type A [Burkholderia sp. CCGE1003] Length = 326 Score = 36.5 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 34/125 (27%), Gaps = 12/125 (9%) Query: 294 TVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISS 353 R+ F + F++ + I T T ++ + TA Sbjct: 128 NDRLAFMMFGTSPLLAMPFTYDHRAIDAAIAGTAVGRGMPD--TQLDLGLLTAIGEFN-- 183 Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 ++ +++ IVL++DG I I + I + + Sbjct: 184 --------GSHSSSRRAIVLVSDGGAKLSARVRQLIEEGLLRNQIALYFIYLRSSVYSPD 235 Query: 414 KARYF 418 Sbjct: 236 LNAAL 240 >gi|113969431|ref|YP_733224.1| type IV pilin biogenesis protein [Shewanella sp. MR-4] gi|113884115|gb|ABI38167.1| type IV pilin biogenesis protein, putative [Shewanella sp. MR-4] Length = 1223 Score = 36.5 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 38/134 (28%), Gaps = 36/134 (26%) Query: 354 NEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQG---------------- 397 + + + + YI+ +TDGE T DN + K G Sbjct: 403 GTNNYNSPFKRCQNEAYIIYITDGEPTLDNAADGLVEQLTKGVGKHTSRPVSYLSALSGW 462 Query: 398 ---------------IRIMTIAFSVNKTQQEKARYFLSNCA--SPNSFFEANSTHELNKI 440 +R TI F A + L A +F+A +L Sbjct: 463 MRNEDVNPNSAGKQTVRTFTIGF---SEGAASAEHLLKQTAENGGGKYFDATDASKLRSS 519 Query: 441 FRDRIGNEIFERVI 454 + + N + + Sbjct: 520 LQTALNNILEKNAS 533 >gi|168699474|ref|ZP_02731751.1| hypothetical protein GobsU_08132 [Gemmata obscuriglobus UQM 2246] Length = 417 Score = 36.5 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 31/125 (24%), Gaps = 9/125 (7%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 G + A+ +PV++GV + +D R A+ Sbjct: 9 RRAGAVLVKVAICLPVLIGVLALNLDGGRLYDERRR---------AQATADAAALAAGAD 59 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 A I + + + V V T+ + +V + S Sbjct: 60 LYANYWTNQGADPIGTARAAAEQVAVANGYPASAVTVTVPPTSGAYAGQPGHAEVRIDSA 119 Query: 137 YDLLL 141 Sbjct: 120 VTNSF 124 >gi|118581115|ref|YP_902365.1| TadE family protein [Pelobacter propionicus DSM 2379] gi|118503825|gb|ABL00308.1| TadE family protein [Pelobacter propionicus DSM 2379] Length = 154 Score = 36.5 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 40/123 (32%), Gaps = 5/123 (4%) Query: 16 KSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEV 75 K G + A ++ +++ + + R Y ++ L A + + + Sbjct: 7 KHRKGQALVEAAFVLFIVVLFTFAITEFGRAMYIKNMLNN----AARAGARQAVVTGSLP 62 Query: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSS 135 S+ +F+ + + L + ++ + A V++++ Sbjct: 63 VSQPLFTFS-NTRPADPVLAKIYDGLMYVKKDTVTASVSCVGCAGSYAAAGDTISVIVTA 121 Query: 136 RYD 138 +D Sbjct: 122 PFD 124 >gi|262393420|ref|YP_003285274.1| hypothetical protein VEA_002647 [Vibrio sp. Ex25] gi|262337014|gb|ACY50809.1| hypothetical protein VEA_002647 [Vibrio sp. Ex25] Length = 154 Score = 36.5 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 36/123 (29%), Gaps = 11/123 (8%) Query: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVS 76 S G I L +PV+L + +++DV R + +A Sbjct: 15 SQQGLAIIEFILALPVLLMLTVLVIDVSRAFIQYTEVNKAL-----------QNGARYAV 63 Query: 77 SRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSR 136 + F E + T + I D V P + V LS+ Sbjct: 64 VDTYGTLNFESIADETSIKNVVVYGSPIASTTPVIDHISVDDIVITQPTSTNKVVTLSAT 123 Query: 137 YDL 139 Y+ Sbjct: 124 YNY 126 >gi|163747461|ref|ZP_02154813.1| hypothetical protein OIHEL45_00425 [Oceanibulbus indolifex HEL-45] gi|161379314|gb|EDQ03731.1| hypothetical protein OIHEL45_00425 [Oceanibulbus indolifex HEL-45] Length = 182 Score = 36.5 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 25/90 (27%), Gaps = 6/90 (6%) Query: 7 FIFYSKKLIKSCTGHFFIITALLMPVML---GVGGMLVDVVRWSYYEHALKQAAQTAIIT 63 + +S G I T +++P+M D R + AA T Sbjct: 4 LSSLLTRFKRSDDGSIAIETVIMLPLMFWAYLAMYSTFDTFRMYNLN---QTAAYTIGDA 60 Query: 64 ASVPLIQSLEEVSSRAKNSFTFPKQKIEEY 93 S + + F + + + Sbjct: 61 ISRETQAIDPDYLQGMQELFEYLTRGTGQT 90 >gi|114586163|ref|XP_526141.2| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Pan troglodytes] Length = 891 Score = 36.5 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 55/177 (31%), Gaps = 17/177 (9%) Query: 279 ASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTA 338 + I +D +D VR+G +ND + H L I++ G T Sbjct: 222 NFLYSVILGLDISSDRVRVGLAQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTN 279 Query: 339 INDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI 398 A++ ++ + + ++L+TD E N+E + ++ K G+ Sbjct: 280 TGSALEFIRTNYLTEESGSRAK----DRVPQIVILVTDRE---SNDEVQEVADRLKEDGV 332 Query: 399 RIMTIAFSVNKTQQEKARYFLSNCASPNS--FFEANSTHELNKIFRDRIGNEIFERV 453 + L AS F + + F I + V Sbjct: 333 V------VYVVGVNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAV 383 >gi|253997634|ref|YP_003049698.1| Pyrrolo-quinoline quinone [Methylotenera mobilis JLW8] gi|253984313|gb|ACT49171.1| Pyrrolo-quinoline quinone [Methylotenera mobilis JLW8] Length = 1461 Score = 36.5 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 25/294 (8%), Positives = 57/294 (19%), Gaps = 8/294 (2%) Query: 91 EEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRS 150 +F + V + ++ M+ + LN L Sbjct: 320 GATATGSFSVTNVSSSGTTTVSSVKVNSVEIMSTGTGCAAATYTPTGGPNLNTRRKALAD 379 Query: 151 MGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLNCFGQPADR 210 + + ++I + + S Q A Sbjct: 380 AIVAKINACISSPDYIAVSDGANTPTITITAIAAGTNSNGTLQITLNNSATYGNVANAAG 439 Query: 211 TVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKK 270 V + D S + + ++ + + Sbjct: 440 GVSGSAMPPYTFSRTDIVASTNSYPKSAARTDCGNTCSYVQEMTNFANW----YTYYRTR 495 Query: 271 KHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAID 330 ++ A + + I V + + T Sbjct: 496 IQAMKTAASLAFKPID----NRYRVGFVTISSQSSNYLPINKFDNGTGSQKEQWYTKLFG 551 Query: 331 ENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNE 384 + G T + A+ + + +L TDG D Sbjct: 552 IDPSGGTPLRSALSVVGRIYAGKKPIGSSDPVQYSCQQNFALLTTDGYWNTDTA 605 >gi|192359982|ref|YP_001981471.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686147|gb|ACE83825.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 2103 Score = 36.5 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 57/171 (33%), Gaps = 16/171 (9%) Query: 290 NVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDT 349 + TVR+ A F + + + ++ I G T+I+ + +A Sbjct: 908 PRDGTVRVSALQFASAATVELNPTIIEEDNVQAIADKIRAIRKTGGGTSIHACIDSATTL 967 Query: 350 IISSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGI-RIMTIAFSVN 408 I ++ + I + TDG++TQ + N+A++ GI + I Sbjct: 968 IANALPASSM---------QIIDVSTDGQSTQSQAVAAS--NRARAAGIDVLNAIGVGTG 1016 Query: 409 KTQQEKARYFLSNCASPNSFF--EANSTHELNKIFRDRIGNEIFERVIRIT 457 + + F + E ++I E ++ +T Sbjct: 1017 ISTALLNAIVFPQPVGGDRGFVLTVRNYQEYMDGITNKIQRE--TKIADLT 1065 >gi|168216419|ref|ZP_02642044.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381279|gb|EDT78758.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 620 Score = 36.5 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 63/207 (30%), Gaps = 15/207 (7%) Query: 181 WVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSL 240 + S +L+ + F + + +G +++ P N + Sbjct: 14 ISLFISVPVLNASGIENKGFDDQFKDIGNESKGRICDYDGTEILKEVSEKPD-EDGNYEI 72 Query: 241 YYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGAT 300 + P + + + + + +++++ S++ + + R+ Sbjct: 73 TLTVKGKPKKVTKPVDILLIMDASNSMWNDMDALKESMNSLVDKVIDNIPNS---RIAVI 129 Query: 301 FFNDRVISDPSFSWGVHKLIRTIVKTFA----IDENEMGSTAINDAMQTAYDTIISSNED 356 F V SF+ + K + G+T I + A + + Sbjct: 130 AFGTEVEEVFSFNNKNKFTSKEEYKKAIKKSYNNIERRGNTNIEGTWRRADEIFKNE--- 186 Query: 357 EVHRMKNNLEAKKYIVLLTDGENTQDN 383 + NN +KK ++ +DG Sbjct: 187 ----LNNNSNSKKDVIFFSDGYPNVSR 209 >gi|13471069|ref|NP_102638.1| hypothetical protein mll0946 [Mesorhizobium loti MAFF303099] gi|14021813|dbj|BAB48424.1| mll0946 [Mesorhizobium loti MAFF303099] Length = 159 Score = 36.5 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 11/135 (8%), Positives = 31/135 (22%), Gaps = 16/135 (11%) Query: 6 KFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGM------------LVDVVRWSYYEH-A 52 ++ + + +G + AL+ P ++ + V + Sbjct: 25 RYCAGASAFRRDRSGGAGLEFALIAPFLIILLFGIFALGWSMHSVSSVRYTLETSSRSLQ 84 Query: 53 LKQAAQTA---IITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDR 109 L+ A I L++V+ ++ + Sbjct: 85 LQNTLTQADIQSIATQKLQALGLKDVNVTIAVDPASGGFRMAHLTATYAFVVDFPYLSSF 144 Query: 110 EVRDIVRDTAVEMNP 124 + T + Sbjct: 145 PINYATTVTVPLVGG 159 >gi|330983299|gb|EGH81402.1| hypothetical protein PSYAP_33030 [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 182 Score = 36.5 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 45/171 (26%), Gaps = 8/171 (4%) Query: 22 FFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLEEVSSRAKN 81 + L + L V +D R + +L++ A A + A+ + + Sbjct: 1 AALTMGLALLCTLTV----IDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTAA----S 52 Query: 82 SFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLL 141 + F Q + + V V + A + + + + + Sbjct: 53 APDFANQSATRNGFVPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVAAHPVPR 112 Query: 142 NPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 + + A V+ S ++I+ S Sbjct: 113 SIAAGIGALFEKTPSPPNVTISATAVAASAAPLAALTIRSAGVTVDSTGAA 163 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 36.5 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 52/209 (24%), Gaps = 19/209 (9%) Query: 179 IQWVIDFSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNK 238 + + S + + K + Sbjct: 1 MMRKYSRAISAILMLQLLILMLSYTGKDILAAEEAGVDFSAKASQSVIVKPQNSNAEGSI 60 Query: 239 SLYYML---YPGPLDPSLSEEHFVDSSSLRHVIKK-KHLVRDALASVIRSIKKIDNVND- 293 + + D S + K + A ++ ++ Sbjct: 61 DFHLTPKGKATNANRDPIDVVFVFDKSGSMNDSGKNPQKFQSAKDAMTAAVNFFKENAGP 120 Query: 294 TVRMGATFFNDRVIS--DPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 R G F+D V + +F+ + ++ + + + +G T ++ A Sbjct: 121 NDRFGFVPFDDDVETGKVVNFAPENNMASLNLINSNSNSLSALGGTNYTQSLDAALGMF- 179 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENT 380 N KY++ +TDGE T Sbjct: 180 -----------GNSTNNKYVLFMTDGEPT 197 >gi|319918954|gb|ADV78137.1| PilY1 [Pseudomonas aeruginosa] Length = 1158 Score = 36.5 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 27/265 (10%), Positives = 66/265 (24%), Gaps = 18/265 (6%) Query: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQ--P 200 ++ GI + ++ + Y+ ++ + + S + + Sbjct: 59 SMAWAYVPDGISGNSGRAGRSSDYNALYYNPDYAYQVPKKLTLSGDQIIVSDYPVPRFTA 118 Query: 201 LNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYM---LYPGPLDPSLSEEH 257 G T ++ + G + ++ YY P P S Sbjct: 119 AWQDGYAQGSTTNLSNNYRPQWGTGWLGCIDSSCNTGRAYYYTYKVSASCPAQPVSSSNS 178 Query: 258 FVDSSSLRHV-IKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGV 316 ++L + + I + K N+ ++ S Sbjct: 179 CYTYNALPTSQESNFAIWYSYYRNRILATKTAANLAFYSLPENVRLTWGALNTCSIGANS 238 Query: 317 HKLIRTIVK-----------TFAIDENEMGSTAINDAMQTAYDTI-ISSNEDEVHRMKNN 364 + + + G T ++ A+ A + + K Sbjct: 239 RSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDRAGRFLQTNGTAYTTEDGKTY 298 Query: 365 LEAKKYIVLLTDGENTQDNEEGIAI 389 Y +++TDG N + Sbjct: 299 SCRASYHIMMTDGIWNGRNVTPGNL 323 >gi|282854078|ref|ZP_06263415.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583531|gb|EFB88911.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|314981158|gb|EFT25252.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091981|gb|EFT63957.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 322 Score = 36.5 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 18/137 (13%) Query: 318 KLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDG 377 R V TA+ A+ + + + + + IV+L+DG Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVMLSDG 205 Query: 378 ENTQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE---------KARYFLSNCA--SPN 426 NTQ +A N+A + + + TIAF + LS A + Sbjct: 206 NNTQGGSPLVAA-NRAAAAKVSVYTIAFGTETGYVDLDGQRERVAPDTKLLSTVADRTHA 264 Query: 427 SFFEANSTHELNKIFRD 443 + A+S +L ++++ Sbjct: 265 KSWTADSADKLQEVYQQ 281 >gi|238855688|ref|ZP_04645986.1| putative surface protein [Lactobacillus jensenii 269-3] gi|282934613|ref|ZP_06339857.1| putative extracellular matrix-binding protein EbhB [Lactobacillus jensenii 208-1] gi|313471866|ref|ZP_07812358.1| putative gram-positive signal peptide, YSIRK family [Lactobacillus jensenii 1153] gi|238831694|gb|EEQ24033.1| putative surface protein [Lactobacillus jensenii 269-3] gi|281301318|gb|EFA93618.1| putative extracellular matrix-binding protein EbhB [Lactobacillus jensenii 208-1] gi|313449019|gb|EFR61300.1| putative gram-positive signal peptide, YSIRK family [Lactobacillus jensenii 1153] Length = 716 Score = 36.5 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 40/349 (11%), Positives = 95/349 (27%), Gaps = 4/349 (1%) Query: 53 LKQAAQTAIITASVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVR 112 ++ A+ TA T S E ++ K + + + L N + + K + Sbjct: 48 VQAASDTATNTESQTATSQSIENNNEEKVDKSQLQSAYDNAL--NVQKSDKYTNETDASK 105 Query: 113 DIVRDTAVEMNPRKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYH 172 D+AV + S ++ L + S + ET + + Sbjct: 106 KSAFDSAVANAKKVLDNASATSEDVKNTISNLESAASQLTGTSQTNEAVNTTETKTSTND 165 Query: 173 KEHGVSIQWVIDFSRSMLDYQRDSEGQ-PLNCFGQPADRTVKSYSSQNGKVGIRDEKLSP 231 + ++ S+ S + +++TVK+ + Sbjct: 166 TDTSAAMTTTTLNVSSLSKNSILSNNALTTSLVADTSNQTVKATNPGYSGSYDSGNIHIE 225 Query: 232 YMVSCNKSLYYMLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNV 291 + + S Y G + +++ + + + + + + I D Sbjct: 226 DLPNFKISFTQDNYNGNTGWQVIYNQDGTINAVVYRYEIGKVTSLNSSDLAKQIVLYDPK 285 Query: 292 NDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTII 351 + N V F +Q D + Sbjct: 286 TQYQYTIDSKLNAHQDGRDGAQRCVQNSNSRQYTNFMDVLLYKPDRNEYWGLQ-PVDIVT 344 Query: 352 SSNEDEVHRMKNNLEAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRI 400 ++ + +K + K +D + +G I N + ++ I Sbjct: 345 KNSNNTYGSLKLDDYTKYAESATSDTKCEPITLDGKNIVNNSYAENYII 393 >gi|58037355|ref|NP_083089.1| vitrin isoform 1 precursor [Mus musculus] gi|114154829|sp|Q8VHI5|VITRN_MOUSE RecName: Full=Vitrin; Flags: Precursor gi|21707639|gb|AAH34120.1| Vitrin [Mus musculus] Length = 650 Score = 36.5 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 48/161 (29%), Gaps = 15/161 (9%) Query: 246 PGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDR 305 G + + +D S+ ++ + + LA V++++ D +G + D Sbjct: 257 QGDPNCKIDLSFLIDGSTSIG-KRRFRIQKQFLADVVQAL---DIGPAGPLVGVVQYGDN 312 Query: 306 VISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNL 365 + + ++ G + + A+ T S N Sbjct: 313 PATQFNLKTHMNSQDLKTAIEKIT--QRGGLSNVGRAISFVTKTFFSKANGNRGGAPN-- 368 Query: 366 EAKKYIVLLTDGENTQDNEEGIAICNKAKSQGIRIMTIAFS 406 V++ DG T + A+ GI + I Sbjct: 369 ----VAVVMVDGWPTDK---VEEVSRVARESGINVFFITVE 402 >gi|116626629|ref|YP_828785.1| hypothetical protein Acid_7594 [Candidatus Solibacter usitatus Ellin6076] gi|116229791|gb|ABJ88500.1| hypothetical protein Acid_7594 [Candidatus Solibacter usitatus Ellin6076] Length = 469 Score = 36.5 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 30/265 (11%), Positives = 73/265 (27%), Gaps = 2/265 (0%) Query: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 + L G ++ + + G+ G++ D+ Y + + ++AA A++ A + Sbjct: 4 RLLSTKRRGQVMLLVTFALVPLAGMVGLVTDIGYMRYVQRSAQKAADAAVLAAIADYNAT 63 Query: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 + T P + L E + S+ V Sbjct: 64 NTGSVFSCGSGSTAPAWACNNPTPYSCPGALTGASNPVEDACLYAQQNGFDPIAGSSKSV 123 Query: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 +SS + + + +W I + + S + ++ + Sbjct: 124 TISSGANPSPSQPIPTAPGVNNGTWWITARVAQK--VPSLFSAVLGNRSGLVAARATAAI 181 Query: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 + L+ Q A S + D S + N + Sbjct: 182 QPGLACIYALDPSAQGAYSQNGSTTFNANCGIYVDSNNSSAAMLGNGGATINASSINVVG 241 Query: 252 SLSEEHFVDSSSLRHVIKKKHLVRD 276 +S + ++ + V ++ Sbjct: 242 GVSWQGTINPTPFTGVSPFADPLKS 266 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.310 0.108 0.244 Lambda K H 0.267 0.0335 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,149,407,073 Number of Sequences: 14124377 Number of extensions: 100265308 Number of successful extensions: 1006167 Number of sequences better than 10.0: 10000 Number of HSP's better than 10.0 without gapping: 3107 Number of HSP's successfully gapped in prelim test: 8538 Number of HSP's that attempted gapping in prelim test: 974453 Number of HSP's gapped (non-prelim): 27649 length of query: 458 length of database: 4,842,793,630 effective HSP length: 143 effective length of query: 315 effective length of database: 2,823,007,719 effective search space: 889247431485 effective search space used: 889247431485 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 82 (36.5 bits)